BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004533
(746 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 1028 bits (2658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/776 (63%), Positives = 578/776 (74%), Gaps = 75/776 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW SLI+KAK GG+DVIQTYVFWNLHEPQ+GQ+ F+GR D++RF+KEIQ+QGLY CLRIG
Sbjct: 32 MWSSLISKAKAGGIDVIQTYVFWNLHEPQQGQFYFNGRADLVRFVKEIQAQGLYACLRIG 91
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEWTYGGLP WLHD+ G+V+RSDN+P+K
Sbjct: 92 PFIESEWTYGGLPFWLHDIPGMVYRSDNQPFKYHMKRFVSRIVSMMKSEKLYASQGGPII 151
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY+ +E AFHEKGP YV WAA MAV+ TGVPWVMCKQDDAP PVIN+CNGMRC
Sbjct: 152 LSQVENEYKNVEAAFHEKGPSYVRWAALMAVNLQTGVPWVMCKQDDAPDPVINSCNGMRC 211
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKPSIWTEDWTSFYQV+G + Y+RSAQDIAFHVALFIAK GSYVNYYMYH
Sbjct: 212 GETFAGPNSPNKPSIWTEDWTSFYQVYGEETYMRSAQDIAFHVALFIAKTGSYVNYYMYH 271
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRTA+AF IT YYDQAPLDEYGL+R+PKWGHLKELHAAIK CS+ LL G S
Sbjct: 272 GGTNFGRTASAFTITSYYDQAPLDEYGLIRQPKWGHLKELHAAIKSCSKLLLHGAHKTFS 331
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG LQ+A+VF+ SG CAAFLVNND ++ V VLF++ SY+LP+KSISILPDCKT+ FNT
Sbjct: 332 LGPLQQAYVFQGNSGQCAAFLVNNDGKQEVEVLFQSNSYKLPQKSISILPDCKTMTFNTA 391
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ QY RS N KF+S KWEEY E I FD T LRA LL+ +S KD SDY WYT
Sbjct: 392 KVNAQYTTRSMKPNQKFNSVGKWEEYNEPIPEFDKTSLRANRLLEHMSTTKDTSDYLWYT 451
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
FRF N NAQ+ + QSHGH+LHA+VNG + G HGSH N SF+L+ TV L+ GTN A
Sbjct: 452 FRFQQNLPNAQSVFNAQSHGHVLHAYVNGVHAGFGHGSHQNTSFSLQTTVRLKNGTNSVA 511
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVL 509
LLS TVGLPDSGA+LER+VAG+ RVR+Q+K FT +WGYQVGL+GE+LQIY+ G NKV
Sbjct: 512 LLSATVGLPDSGAYLERRVAGLRRVRIQNKDFTTYTWGYQVGLLGERLQIYTENGSNKVK 571
Query: 510 WSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNP 569
W+ + + R L WYKT F APAGNDP+ALNL SMGKGEAWVNGQSIGRYWVSF TS+G+P
Sbjct: 572 WNKLGT-NRPLMWYKTLFDAPAGNDPVALNLGSMGKGEAWVNGQSIGRYWVSFHTSQGSP 630
Query: 570 SQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIR 629
SQT Y++PRAFLKPTGNLLVLLEEE G P GITVDT+++
Sbjct: 631 SQTW--------------------YNIPRAFLKPTGNLLVLLEEEKGYPPGITVDTVSVT 670
Query: 630 KVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDG 689
KVCG+ + SHL VQ SCPL + IS I+FASFG P G
Sbjct: 671 KVCGYASESHL-----------------------SAVQLSCPLKRNISSIIFASFGTPSG 707
Query: 690 DCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+CE YA+G+CHSS S+ VE+ACIGK CSIP + +FGGDPCPGI K LLV+A+C
Sbjct: 708 NCESYAIGNCHSSSSKANVEKACIGKRSCSIPQSNHFFGGDPCPGIPKVLLVEAKC 763
>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
Length = 798
Score = 1022 bits (2643), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 496/780 (63%), Positives = 581/780 (74%), Gaps = 55/780 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI+KA+ GGLD I TYVFWNLHEPQ+GQYDFSGR D++RFIKE+ +QGLYVCLRIG
Sbjct: 38 MWPYLISKARAGGLDAIDTYVFWNLHEPQQGQYDFSGRKDLVRFIKEVHAQGLYVCLRIG 97
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEWTYGGLP WLHDV GIVFRSDNKP+K
Sbjct: 98 PFIESEWTYGGLPFWLHDVPGIVFRSDNKPFKYHMERYAKMIVKMLKAEKLYASQGGPII 157
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AFHEKGPPYV WAAKMAV HTGVPWVMCKQDDAP PVINACNG+RC
Sbjct: 158 LSQIENEYGNVEAAFHEKGPPYVKWAAKMAVGLHTGVPWVMCKQDDAPDPVINACNGLRC 217
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSP KP+IWTE+WTS YQ +G + RSA+DIAFH ALFIAK GS+VNYYMYH
Sbjct: 218 GETFSGPNSPRKPAIWTENWTSVYQTYGKETRSRSAEDIAFHAALFIAKGGSFVNYYMYH 277
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRTAA ++ T YYDQAPLDEYGL+R+PK GHLKELHAAIKLC +PLL+ S
Sbjct: 278 GGTNFGRTAAEYVPTSYYDQAPLDEYGLLRQPKHGHLKELHAAIKLCRKPLLSRKWINFS 337
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LGQLQEAF FE S CAAFLVN+D R TV F+ SY+LP KSISILP CKTVAFNT
Sbjct: 338 LGQLQEAFAFERNSDECAAFLVNHDGRSNATVHFKGSSYKLPPKSISILPHCKTVAFNTA 397
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+VSTQY R T KFDS E+W+EY+E I +FD + LRA LL+ ++ KD+SDY WYT
Sbjct: 398 QVSTQYGTRLATRRHKFDSIEQWKEYKEYIPSFDKSSLRANTLLEHMNTTKDSSDYLWYT 457
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
FRFH NSSNA + L V S GH LHAFVNGE+ GSAHGSHDN SFTL+ ++ L++GTN +
Sbjct: 458 FRFHQNSSNAHSVLTVNSLGHNLHAFVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNYVS 517
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDK----SFTNCSWGYQVGLIGEKLQIYSNLGL 505
LLSV GLPD+GA+LER+VAG+ RV +Q + FT WGY+VGL GE +Q++ N
Sbjct: 518 LLSVMTGLPDAGAYLERRVAGLRRVTIQRQHELHDFTTYLWGYKVGLSGENIQLHRNNAS 577
Query: 506 NKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTS 565
K WS S +R LTWYK+ F APAGNDP+ALNL SMGKGEAWVNG+SIGRYWVSF S
Sbjct: 578 VKAYWSRYASSSRPLTWYKSIFDAPAGNDPVALNLASMGKGEAWVNGRSIGRYWVSFLDS 637
Query: 566 KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDT 625
GNP QT H+PR+FLKP+GNLLV+LEEE GNPLGI++ T
Sbjct: 638 DGNPYQTW--------------------NHIPRSFLKPSGNLLVILEEERGNPLGISLGT 677
Query: 626 IAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFG 685
++I KVCGHV+ SH PP+ SW Q T +K+G++P VQ CP G+KIS ++F+SFG
Sbjct: 678 MSITKVCGHVSISHPPPVISWQGENQINGTRKRKYGRRPKVQLRCPRGRKISSVLFSSFG 737
Query: 686 NPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
P GDCE YA+GSCH+S+S+ VE+AC+GK RCSIP+ S+ F GDPCPGI K+LLVDA+C
Sbjct: 738 TPSGDCETYAIGSCHASNSRATVEKACLGKERCSIPVSSKNFKGDPCPGIAKSLLVDAKC 797
>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
Length = 771
Score = 991 bits (2563), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/754 (62%), Positives = 564/754 (74%), Gaps = 75/754 (9%)
Query: 24 NLHEPQKG-QYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGI 82
++H P+ +YDF GR D+++F+ E+Q+QGLY LRIGPFIE EWTYGGLP WLHDV+GI
Sbjct: 60 SIHYPRSTPEYDFDGRKDLVKFLLEVQAQGLYAALRIGPFIEGEWTYGGLPFWLHDVSGI 119
Query: 83 VFRSDNKPYK-------------------------------IENEYQTIEPAFHEKGPPY 111
VFRSDN+P+K IENEYQ +E AFHEKG Y
Sbjct: 120 VFRSDNEPFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQIENEYQNVETAFHEKGSRY 179
Query: 112 VLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTS 171
V WAA MAV +TGVPWVMCKQ DAP PVIN CNGMRCGETF GPNSPNKPS+WTE+WTS
Sbjct: 180 VHWAANMAVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFAGPNSPNKPSMWTENWTS 239
Query: 172 FYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAP 231
FYQV+GG+PYIR+A+DIAFHVALFIA+NGSYVNYYMYHGGTNFGRT +AF+ T YYDQAP
Sbjct: 240 FYQVFGGEPYIRTAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRTGSAFVTTSYYDQAP 299
Query: 232 LDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLV 291
LDEYGL+R+PKWGHLK+LHA IK CS+ L+ GT LG+LQEA+VF E SG C AFLV
Sbjct: 300 LDEYGLIRQPKWGHLKDLHAKIKSCSKTLIRGTHQTFPLGRLQEAYVFREKSGDCVAFLV 359
Query: 292 NNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEK 351
NND R+ VTV F+N SYELP KSISILPDCK++ FNT +V+TQY RS T + +F S K
Sbjct: 360 NNDGRRDVTVRFQNRSYELPHKSISILPDCKSITFNTAKVNTQYATRSATLSQEFSSVGK 419
Query: 352 WEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHI 411
WEEY+E + FD+T LRA+ LLD +S KD SDY WYTFRF + S Q+ L S GH+
Sbjct: 420 WEEYKETVATFDSTSLRAKTLLDHLSTTKDTSDYLWYTFRFQNHFSRPQSTLRAYSRGHV 479
Query: 412 LHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV 471
LHA+VNG Y GSAHGSH++ SFTL N+V L+ GTN+ ALLSVTVGLPDSGA+LER+VAG+
Sbjct: 480 LHAYVNGVYAGSAHGSHESTSFTLENSVRLKNGTNNVALLSVTVGLPDSGAYLERRVAGL 539
Query: 472 HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPA 531
HRVR+Q+K FT SWGYQVGL+GEKLQIY++ GLNKV W+ R T+ LTWYKT F APA
Sbjct: 540 HRVRIQNKDFTTYSWGYQVGLLGEKLQIYTDNGLNKVSWNEFRGTTQPLTWYKTQFDAPA 599
Query: 532 GNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKAT 591
G+DPIALNL SMGKGEAWVNGQSIGRYWVSF TSKGNPSQT+
Sbjct: 600 GSDPIALNLHSMGKGEAWVNGQSIGRYWVSFSTSKGNPSQTR------------------ 641
Query: 592 NTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQ 651
YH+P++F+KPTGNLLVLLEEE G P GITVD+I+I KVCGHV+ SH
Sbjct: 642 --YHIPQSFVKPTGNLLVLLEEEKGYPPGITVDSISISKVCGHVSESH------------ 687
Query: 652 RGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERA 711
K VQ SCP + IS+I+F+SFG P+G+C +YA+G CHSS+S+ +VE+A
Sbjct: 688 -----------KSVVQLSCPPNRNISRILFSSFGTPEGNCNQYAIGKCHSSNSRAIVEKA 736
Query: 712 CIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
CIGK++C I +R+FGGDPCPGI K LLVDA+C
Sbjct: 737 CIGKTKCIILRSNRFFGGDPCPGIRKGLLVDAKC 770
>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
Length = 828
Score = 984 bits (2543), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 480/785 (61%), Positives = 586/785 (74%), Gaps = 49/785 (6%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW SLIAKAKEGGLDVI TYVFWNLHEPQ GQYDFSGR DI+RFIKE+Q+QGLYVCLRIG
Sbjct: 54 MWQSLIAKAKEGGLDVIDTYVFWNLHEPQPGQYDFSGRRDIVRFIKEVQAQGLYVCLRIG 113
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+ EW+YGGLP WLHD+ GIVFRSDN+P+K
Sbjct: 114 PFIQGEWSYGGLPFWLHDIPGIVFRSDNEPFKVQMQGFTTKIVTMMQSEKLYVSQGGPII 173
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY T+E A+HEKGP YV WAA+MAV +TGVPWVMCKQ+DAP PVINACNG+RC
Sbjct: 174 LSQIENEYGTVEEAYHEKGPAYVKWAAQMAVGLNTGVPWVMCKQNDAPDPVINACNGLRC 233
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI-AKNGSYVNYYMY 208
ETF GPNSPNKP+IWTE+WT+ Y + G IRS +DIAF V FI AK GS+VNYYMY
Sbjct: 234 AETFVGPNSPNKPAIWTENWTTRYVITGENIRIRSVEDIAFQVTQFIVAKKGSFVNYYMY 293
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGRTA+AF+ T YYDQAP+DEYGL+R+PKWGHLKE+HAAIKLC PLL+G Q I
Sbjct: 294 HGGTNFGRTASAFVPTSYYDQAPIDEYGLIRQPKWGHLKEMHAAIKLCLTPLLSGGQVTI 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ Q+AFVF SG CAAFL+NND +V FRN SY+LP SISILPDCKTVAFNT
Sbjct: 354 SLGQQQQAFVFTGLSGECAAFLLNNDTANTASVQFRNASYDLPPNSISILPDCKTVAFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+VSTQY RS T + D ++KW +Y+EAI+NFD T +++E +L+Q+S KDASDY WY
Sbjct: 414 AKVSTQYTTRSMTRSKLLDGEDKWVQYQEAIVNFDETSVKSEAILEQMSTTKDASDYLWY 473
Query: 389 TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDG 448
TFRF SS+ QA L+V+S GH+LHAFVNG+ G A GSH N FTL++TV L +G N+
Sbjct: 474 TFRFQQESSDTQAVLNVRSLGHVLHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVNNV 533
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRVQD----KSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LLSV VG+PDSGA++ER+ AG+ +V++Q+ K FTN SWGYQVGL+GEKLQI+++ G
Sbjct: 534 SLLSVMVGMPDSGAYMERRAAGLRKVKIQEKEGNKEFTNYSWGYQVGLLGEKLQIFTDQG 593
Query: 505 LNKVLWSSI-RSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
++V W++ ++ LTWYKT F AP + P+ALNL SMGKGEAWVNGQSIGRYW S++
Sbjct: 594 SSQVQWANFSKNALNPLTWYKTLFDAPLEDAPVALNLGSMGKGEAWVNGQSIGRYWPSYR 653
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
S G+ SQ YA + AI +A Y+VPR+FLKP GNLLV+LEE GNPL I+V
Sbjct: 654 ASDGS-SQIWYAY-----FNTGAIFRAVR-YNVPRSFLKPKGNLLVVLEESGGNPLQISV 706
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTD-IKKFGKKPTVQPSCPLGKKISKIVFA 682
DT +I K+C HVT SHLP +SSW +R +TD +P V+ CP KIS I+FA
Sbjct: 707 DTASISKICSHVTASHLPLVSSW---SKRTNTDNNNSLQARPRVKLDCPSNTKISNILFA 763
Query: 683 SFGNPDGDC-ERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLV 741
S+G P+G C + YAVG CHSS S+ +V++AC+G+ RCSIP+ S+YFGGDPC K+LLV
Sbjct: 764 SYGTPEGTCGDAYAVGMCHSSSSEAIVQKACLGQMRCSIPVSSKYFGGDPCSANEKSLLV 823
Query: 742 DAQCR 746
A+C+
Sbjct: 824 VAECK 828
>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 801
Score = 983 bits (2542), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/779 (61%), Positives = 571/779 (73%), Gaps = 58/779 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAKEGG+DVIQTYVFWNLHEPQ+G Y+FSGR DI+RF+KEIQ+QGLY CLRIG
Sbjct: 46 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 105
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW+YGGLP WLHDV GIV+RSDN+P+K
Sbjct: 106 PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 165
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AF EKGPPYV WAAKMAV TGVPW MCKQ+DAP PVIN CNGMRC
Sbjct: 166 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 225
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI-AKNGSYVNYYMY 208
GETF GPNSPNKPSIWTE+WTSFYQ +G +PYIRSA++IAFHVALFI AKNG+YVNYYMY
Sbjct: 226 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 285
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR+A+AFMITGYYDQ+PLDEYGL REPKWGHLKELHAA+KLCS PLLTGT++
Sbjct: 286 HGGTNFGRSASAFMITGYYDQSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNF 345
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAV--TVLFRNISYELPRKSISILPDCKTVAF 326
SLGQ EA VF+ S CAAFLVN R A+ VLF+N++YELP SISILPDCK VAF
Sbjct: 346 SLGQSVEAIVFKTESNECAAFLVN---RGAIDSNVLFQNVTYELPLGSISILPDCKNVAF 402
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
NT RVS Q+N RS + KFD E WEE++E I N D+T LRA LL+ + KD SDY
Sbjct: 403 NTRRVSVQHNTRSMMAVQKFDLLE-WEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYL 461
Query: 387 WYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
WYTFR +S ++Q L+V S H LHAFVNG+Y GSAHG + F+L + LR G N
Sbjct: 462 WYTFRVQQDSPDSQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGIN 521
Query: 447 DGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLN 506
+ +LLSV VGLPDSGAFLE +VAG+ RV +Q + F+ WGY+VGL GE+ QI+ + G +
Sbjct: 522 NISLLSVMVGLPDSGAFLETRVAGLRRVGIQGEDFSEQHWGYKVGLSGEQSQIFLDTGSS 581
Query: 507 KVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSK 566
V WS + + ++ LTWYKT F AP G+DPIALNL SMGKG WVNG+ IGRYWVSF T K
Sbjct: 582 NVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPK 641
Query: 567 GNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTI 626
G PSQ Y+VPR+FLKPT N LV+LEEE GNP+ I++D++
Sbjct: 642 GEPSQ--------------------KWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSV 681
Query: 627 AIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGN 686
I K CG V+ SH P ++SW+ +++ +K ++P VQ SCP KKIS I+FASFG
Sbjct: 682 LITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSKKKISNILFASFGT 741
Query: 687 PDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
P GDC+ YA+G CHS +S+ +VE AC+G+++CSIP+ + F GDPCP + K LLVDAQC
Sbjct: 742 PSGDCQSYAIGLCHSPNSRAIVEHACLGRAKCSIPISNLNFRGDPCPHVTKTLLVDAQC 800
>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 756
Score = 981 bits (2536), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/779 (61%), Positives = 571/779 (73%), Gaps = 58/779 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAKEGG+DVIQTYVFWNLHEPQ+G Y+FSGR DI+RF+KEIQ+QGLY CLRIG
Sbjct: 1 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW+YGGLP WLHDV GIV+RSDN+P+K
Sbjct: 61 PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 120
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AF EKGPPYV WAAKMAV TGVPW MCKQ+DAP PVIN CNGMRC
Sbjct: 121 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 180
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI-AKNGSYVNYYMY 208
GETF GPNSPNKPSIWTE+WTSFYQ +G +PYIRSA++IAFHVALFI AKNG+YVNYYMY
Sbjct: 181 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 240
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR+A+AFMITGYYDQ+PLDEYGL REPKWGHLKELHAA+KLCS PLLTGT++
Sbjct: 241 HGGTNFGRSASAFMITGYYDQSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNF 300
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAV--TVLFRNISYELPRKSISILPDCKTVAF 326
SLGQ EA VF+ S CAAFLVN R A+ VLF+N++YELP SISILPDCK VAF
Sbjct: 301 SLGQSVEAIVFKTESNECAAFLVN---RGAIDSNVLFQNVTYELPLGSISILPDCKNVAF 357
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
NT RVS Q+N RS + KFD E WEE++E I N D+T LRA LL+ + KD SDY
Sbjct: 358 NTRRVSVQHNTRSMMAVQKFDLLE-WEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYL 416
Query: 387 WYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
WYTFR +S ++Q L+V S H LHAFVNG+Y GSAHG + F+L + LR G N
Sbjct: 417 WYTFRVQQDSPDSQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGIN 476
Query: 447 DGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLN 506
+ +LLSV VGLPDSGAFLE +VAG+ RV +Q + F+ WGY+VGL GE+ QI+ + G +
Sbjct: 477 NISLLSVMVGLPDSGAFLETRVAGLRRVGIQGEDFSEQHWGYKVGLSGEQSQIFLDTGSS 536
Query: 507 KVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSK 566
V WS + + ++ LTWYKT F AP G+DPIALNL SMGKG WVNG+ IGRYWVSF T K
Sbjct: 537 NVQWSRLGNSSQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPK 596
Query: 567 GNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTI 626
G PSQ Y+VPR+FLKPT N LV+LEEE GNP+ I++D++
Sbjct: 597 GEPSQ--------------------KWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSV 636
Query: 627 AIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGN 686
I K CG V+ SH P ++SW+ +++ +K ++P VQ SCP KKIS I+FASFG
Sbjct: 637 LITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSKKKISNILFASFGT 696
Query: 687 PDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
P GDC+ YA+G CHS +S+ +VE AC+G+++CSIP+ + F GDPCP + K LLVDAQC
Sbjct: 697 PSGDCQSYAIGLCHSPNSRAIVEHACLGRAKCSIPISNLNFRGDPCPHVTKTLLVDAQC 755
>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
Length = 817
Score = 974 bits (2519), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/783 (60%), Positives = 575/783 (73%), Gaps = 62/783 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI++AK+GG+DVI+TYVFWN HEP+ GQYDFSGR DI+RFI+E+Q+QGLY CLRIG
Sbjct: 58 MWPSLISQAKQGGIDVIETYVFWNQHEPKPGQYDFSGRRDIVRFIREVQAQGLYACLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW YGG P WLHDV GIV+R+DN+P+K
Sbjct: 118 PFIQAEWNYGGFPFWLHDVPGIVYRTDNEPFKFYMRNFTTKIVEIMKSENLYASQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+T+E F E G YVLWAA MAV TGVPWVMCKQDDAP PVIN+CNG C
Sbjct: 178 LQQIENEYKTVEANFGEAGKRYVLWAANMAVGLETGVPWVMCKQDDAPDPVINSCNGRLC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK-NGSYVNYYMY 208
GETF GPNSPNKP+IWTE+WTS Y ++G R +DIAFHVALF+AK NGS++NYYMY
Sbjct: 238 GETFAGPNSPNKPAIWTENWTSSYPLFGEDARPRPVEDIAFHVALFVAKMNGSFINYYMY 297
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGRTA+A++ T YYD+APLDEYGL+++P WGHLKELHAA+KLCS LL G Q+ +
Sbjct: 298 HGGTNFGRTASAYVQTAYYDEAPLDEYGLIQQPTWGHLKELHAAVKLCSETLLQGAQSNL 357
Query: 269 SLG-QLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
SLG +LQEA+VF SG CAAFLVNND R VTV+F+N SYELPRKSISILPDCK AFN
Sbjct: 358 SLGTKLQEAYVFRGQSGKCAAFLVNNDSRTDVTVVFQNTSYELPRKSISILPDCKNEAFN 417
Query: 328 TERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
T + S + S + KF+S E+WEEY+E+ILNFD+T RA LL+ ++ KDASDY W
Sbjct: 418 TAKASFRPGLISIQTVTKFNSTEQWEEYKESILNFDDTSSRANTLLEHMNTTKDASDYLW 477
Query: 388 YTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTND 447
YTFR++ + SN Q+ L S H LHAF+NG +TGS HGS N+SF+L NTV R G N+
Sbjct: 478 YTFRYNNDPSNGQSVLSTNSRAHALHAFINGRHTGSQHGSSSNLSFSLDNTVSFRAGINN 537
Query: 448 GALLSVTVGLPDSGAFLERKVAGVHRVRVQD----KSFTNCSWGYQVGLIGEKLQIYSNL 503
+LLSV VGLPDSGA+LER+VAG+ RVR+Q K FTN WGYQVGL+GEKLQIY+++
Sbjct: 538 VSLLSVMVGLPDSGAYLERRVAGLRRVRIQSNGSLKDFTNNPWGYQVGLLGEKLQIYTDV 597
Query: 504 GLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSF 562
G KV WS S T LTWYKT F APAGN+P+ALNL SM KGE WVNGQSIGRYWVSF
Sbjct: 598 GSQKVQWSKFGSSTSGLLTWYKTVFDAPAGNEPVALNLVSMRKGEVWVNGQSIGRYWVSF 657
Query: 563 KTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGIT 622
T G PSQ YH+PR+FLKPTGNLLVLLEEE G+P+GI+
Sbjct: 658 LTPSGKPSQIW--------------------YHIPRSFLKPTGNLLVLLEEETGHPVGIS 697
Query: 623 VDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFA 682
+ ++I K+CGHV+ SHLPP+ S + +++ + G++P VQ CP + IS+I+FA
Sbjct: 698 IGKVSIPKICGHVSESHLPPVISRVIYKKHEN----HHGRRPKVQLRCPSNRNISRILFA 753
Query: 683 SFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVD 742
SFG P GDC+ YAVGSCHSS+S+ VE+AC+GK CS+PL + FGGDPCPG KALLVD
Sbjct: 754 SFGTPSGDCQSYAVGSCHSSNSRSNVEKACLGKGMCSVPLSYKRFGGDPCPGTPKALLVD 813
Query: 743 AQC 745
QC
Sbjct: 814 VQC 816
>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
Length = 813
Score = 973 bits (2515), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/783 (61%), Positives = 563/783 (71%), Gaps = 60/783 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI+KAKEGG+DVI+TY FWN HEP++GQYDFSGR DI++F KE+Q+QGLY CLRIG
Sbjct: 54 MWPSLISKAKEGGIDVIETYAFWNQHEPKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIG 113
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW YGGLP WLHDV GI++RSDN+P+K
Sbjct: 114 PFIESEWNYGGLPFWLHDVPGIIYRSDNEPFKFYMQNFTTKIVNLMKSENLYASQGGPII 173
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+ +E AFHEKGPPYV WAAKMAVD TGVPWVMCKQDDAP PVINACNGM+C
Sbjct: 174 LSQIENEYKNVEAAFHEKGPPYVRWAAKMAVDLQTGVPWVMCKQDDAPDPVINACNGMKC 233
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPN PNKP+IWTE+WTS Y+V+G R+A+D+AF VALFIA KNGS++NYYMY
Sbjct: 234 GETFAGPNKPNKPAIWTENWTSVYEVYGEDKRGRAAEDLAFQVALFIAKKNGSFINYYMY 293
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGRT++++++T YYDQAPLDEYGL+R+PKWGHLKELHA IKLCS LL G Q
Sbjct: 294 HGGTNFGRTSSSYVLTAYYDQAPLDEYGLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNY 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQLQEA++F+ SG CAAFLVNND+R+ VTVLF+N +YEL SISILPDCK +AFNT
Sbjct: 354 SLGQLQEAYLFKRPSGQCAAFLVNNDKRRNVTVLFQNTNYELAANSISILPDCKKIAFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+VSTQ+N RS + F S ++W EYRE I +F T L+A LL+ + KDASDY WY
Sbjct: 414 AKVSTQFNTRSVQTRATFGSTKQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWY 473
Query: 389 TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDG 448
T RF NSSNAQ L V S H+LHAFVNG+Y SAHGSH N SF+L N V L G N
Sbjct: 474 TLRFIQNSSNAQPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRI 533
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRVQD----KSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LLSV VGLPD+G +LE KVAG+ RV +QD K F+ WGYQVGL+GEK QIY++ G
Sbjct: 534 SLLSVMVGLPDAGPYLEHKVAGIRRVEIQDGGDSKDFSKHPWGYQVGLMGEKSQIYTSPG 593
Query: 505 LNKVLWSSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
KV W + S R LTWYKT F AP GNDP+ L SMGKGEAWVNGQSIGRYWVS+
Sbjct: 594 SQKVQWHGLGSHGRGPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYL 653
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
T G PSQT Y+VPRAFL P GNLLV+ EEE+G+PL I++
Sbjct: 654 TPSGEPSQTW--------------------YNVPRAFLNPKGNLLVVQEEESGDPLKISI 693
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFAS 683
T+++ VCGHVT+SH PP+ SW D + GK P VQ CP ISKI FAS
Sbjct: 694 GTVSVTNVCGHVTDSHPPPIISW---TTSDDGNESHHGKIPKVQLRCPPSSNISKITFAS 750
Query: 684 FGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDA 743
FG P G CE YA+GSCHS +S V E+AC+GK+ CSIP + FG DPCPG KALLV A
Sbjct: 751 FGTPVGGCESYAIGSCHSPNSLAVAEKACLGKNMCSIPHSLKSFGDDPCPGTPKALLVAA 810
Query: 744 QCR 746
QC+
Sbjct: 811 QCK 813
>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
Length = 821
Score = 972 bits (2513), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 479/783 (61%), Positives = 563/783 (71%), Gaps = 60/783 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI+KAKEGG+DVI+TY FWN HEP++GQYDFSGR DI++F KE+Q+QGLY CLRIG
Sbjct: 62 MWPSLISKAKEGGIDVIETYAFWNQHEPKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW YGGLP WLHDV GI++RSDN+P+K
Sbjct: 122 PFIESEWNYGGLPFWLHDVPGIIYRSDNEPFKFYMQNFTTKIVNLMKSENLYASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+ +E AFHEKGPPYV WAAKMAVD TGVPWVMCKQDDAP PVINACNGM+C
Sbjct: 182 LSQIENEYKNVEAAFHEKGPPYVRWAAKMAVDLQTGVPWVMCKQDDAPDPVINACNGMKC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPN PNKP+IWTE+WTS Y+V+G R+A+D+AF VALFIA KNGS++NYYMY
Sbjct: 242 GETFAGPNKPNKPAIWTENWTSVYEVYGEDKRGRAAEDLAFQVALFIAKKNGSFINYYMY 301
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGRT++++++T YYDQAPLDEYGL+R+PKWGHLKELHA IKLCS LL G Q
Sbjct: 302 HGGTNFGRTSSSYVLTAYYDQAPLDEYGLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNY 361
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQLQEA++F+ SG CAAFLVNND+R+ VTVLF+N +YEL SISILPDCK +AFNT
Sbjct: 362 SLGQLQEAYLFKRPSGQCAAFLVNNDKRRNVTVLFQNTNYELAANSISILPDCKKIAFNT 421
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+VSTQ+N RS + F S ++W EYRE I +F T L+A LL+ + KDASDY WY
Sbjct: 422 AKVSTQFNTRSVQTRATFGSTKQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWY 481
Query: 389 TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDG 448
T RF NSSNAQ L V S H+LHAFVNG+Y SAHGSH N SF+L N V L G N
Sbjct: 482 TLRFIQNSSNAQPVLRVDSLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRI 541
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRVQD----KSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LLSV VGLPD+G +LE KVAG+ RV +QD K F+ WGYQVGL+GEK QIY++ G
Sbjct: 542 SLLSVMVGLPDAGPYLEHKVAGIRRVEIQDGGDSKDFSKHPWGYQVGLMGEKSQIYTSPG 601
Query: 505 LNKVLWSSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
KV W + S R LTWYKT F AP GNDP+ L SMGKGEAWVNGQSIGRYWVS+
Sbjct: 602 SQKVQWHGLGSHGRGPLTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYL 661
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
T G PSQT Y+VPRAFL P GNLLV+ EEE+G+PL I++
Sbjct: 662 TPSGEPSQTW--------------------YNVPRAFLNPKGNLLVVQEEESGDPLKISI 701
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFAS 683
T+++ VCGHVT+SH PP+ SW D + GK P VQ CP ISKI FAS
Sbjct: 702 GTVSVTNVCGHVTDSHPPPIISW---TTSDDGNESHHGKIPKVQLRCPPSSNISKITFAS 758
Query: 684 FGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDA 743
FG P G CE YA+GSCHS +S V E+AC+GK+ CSIP + FG DPCPG KALLV A
Sbjct: 759 FGTPVGGCESYAIGSCHSPNSLAVAEKACLGKNMCSIPHSLKSFGDDPCPGTPKALLVAA 818
Query: 744 QCR 746
QC+
Sbjct: 819 QCK 821
>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
Length = 818
Score = 928 bits (2398), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/785 (57%), Positives = 556/785 (70%), Gaps = 62/785 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAK GG+DVI TYVFWN+HEPQ+GQ+DFSGR DI++FIKE+++ GLYVCLRIG
Sbjct: 55 MWPSLIAKAKSGGIDVIDTYVFWNIHEPQQGQFDFSGRRDIVKFIKEVKAHGLYVCLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+ EW+YGGLP WLH+V GIVFR+DN+P+K
Sbjct: 115 PFIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAQMIVKLMKSENLYASQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + AF + G YV WAAK+AV+ TGVPWVMCKQDDAP P++NACNG +C
Sbjct: 175 LSQIENEYGMVARAFRQDGKSYVKWAAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETFKGPNSPNKP+IWTE+WTSFYQ +G +P IRSA+DIAFHVALFIAKNGS+VNYYMYH
Sbjct: 235 GETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR A+ F+IT YYDQAPLDEYGL+R+PKWGHLKELHAA+KLC PLL+G Q IS
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+LQ AFVF + + +CAA LVN D + TV FRN SY L KSIS+LPDCK VAFNT
Sbjct: 355 LGKLQTAFVFGKKANLCAALLVNQD-KCDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTA 413
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ QYN R++ S WE++ E + +F T +R+E LL+ ++ +D SDY W T
Sbjct: 414 KVNAQYNTRTRKPRQNLSSPHMWEKFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
RF S A + L V GH+LHAFVN + GS HG+ SF L + L GTN+ A
Sbjct: 474 TRFE-QSEGAPSVLKVNHLGHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTNNMA 532
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKS----FTNCSWGYQVGLIGEKLQIYSNLGL 505
LLSV VGLP+SGA LER+V G V + + S F N SWGYQVGL GEK +Y+ G
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVNIWNGSYQLFFNNYSWGYQVGLKGEKYHVYTEDGA 592
Query: 506 NKVLWSSIR-SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT 564
KV W R S ++ LTWYK +F P G DP+ALNL SMGKGEAWVNGQSIGRYWVSF T
Sbjct: 593 KKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFYT 652
Query: 565 SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE-NGNPLGITV 623
SKGNPSQ YH+PR+FLKP NLLV+LEEE G PLGIT+
Sbjct: 653 SKGNPSQIW--------------------YHIPRSFLKPNSNLLVILEEEREGYPLGITI 692
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLR--HRQRGDTDIK-KFGKKPTVQPSCPLGKKISKIV 680
DT+++ +VCGHV+N+H P+ S + H + +K ++ +KP VQ CP G+KISK++
Sbjct: 693 DTVSVTEVCGHVSNTHPHPVISPRKKGHNRNEQRHLKYRYDRKPKVQLQCPTGRKISKVL 752
Query: 681 FASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALL 740
FA+FGNP+G C Y+VGSCHS +S VV++AC+ KSRCS+P+ S+ FGGD CP K+LL
Sbjct: 753 FATFGNPNGSCGSYSVGSCHSPNSLAVVQKACLRKSRCSVPVWSKTFGGDLCPQTVKSLL 812
Query: 741 VDAQC 745
V AQC
Sbjct: 813 VRAQC 817
>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
Precursor
gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
Length = 815
Score = 924 bits (2387), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/782 (57%), Positives = 553/782 (70%), Gaps = 59/782 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAK GG+DV+ TYVFWN+HEPQ+GQ+DFSG DI++FIKE+++ GLYVCLRIG
Sbjct: 55 MWPSLIAKAKSGGIDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+ EW+YGGLP WLH+V GIVFR+DN+P+K
Sbjct: 115 PFIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + AF ++G YV W AK+AV+ TGVPWVMCKQDDAP P++NACNG +C
Sbjct: 175 LSQIENEYGMVGRAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETFKGPNSPNKP+IWTE+WTSFYQ +G +P IRSA+DIAFHVALFIAKNGS+VNYYMYH
Sbjct: 235 GETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR A+ F+IT YYDQAPLDEYGL+R+PKWGHLKELHAA+KLC PLL+G Q IS
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+LQ AFVF + + +CAA LVN D+ ++ TV FRN SY L KS+S+LPDCK VAFNT
Sbjct: 355 LGKLQTAFVFGKKANLCAAILVNQDKCES-TVQFRNSSYRLSPKSVSVLPDCKNVAFNTA 413
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ QYN R++ + S + WEE+ E + +F T +R+E LL+ ++ +D SDY W T
Sbjct: 414 KVNAQYNTRTRKARQNLSSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
RF S A + L V GH LHAFVNG + GS HG+ F L + L GTN+ A
Sbjct: 474 TRFQ-QSEGAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLA 532
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKS----FTNCSWGYQVGLIGEKLQIYSNLGL 505
LLSV VGLP+SGA LER+V G V++ + F N SWGYQVGL GEK +Y+ G
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGS 592
Query: 506 NKVLWSSIR-SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT 564
KV W R S ++ LTWYK +F P G DP+ALNL SMGKGEAWVNGQSIGRYWVSF T
Sbjct: 593 AKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFHT 652
Query: 565 SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE-NGNPLGITV 623
KGNPSQ YH+PR+FLKP NLLV+LEEE GNPLGIT+
Sbjct: 653 YKGNPSQIW--------------------YHIPRSFLKPNSNLLVILEEEREGNPLGITI 692
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFAS 683
DT+++ +VCGHV+N++ P+ S + ++ +KP VQ CP G+KISKI+FAS
Sbjct: 693 DTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFAS 752
Query: 684 FGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDA 743
FG P+G C Y++GSCHS +S VV++AC+ KSRCS+P+ S+ FGGD CP K+LLV A
Sbjct: 753 FGTPNGSCGSYSIGSCHSPNSLAVVQKACLKKSRCSVPVWSKTFGGDSCPHTVKSLLVRA 812
Query: 744 QC 745
QC
Sbjct: 813 QC 814
>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 911 bits (2354), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/783 (57%), Positives = 540/783 (68%), Gaps = 87/783 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAKEGGLD I+TYVFWN+HEPQ G YDFSG +DI+RFIKE+Q+QGLY CLRIG
Sbjct: 56 MWPSLIAKAKEGGLDAIETYVFWNVHEPQPGHYDFSGGHDIVRFIKEVQAQGLYACLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+SEW+YGGLP WLHD+ GIVFRSDN+P+K
Sbjct: 116 PFIQSEWSYGGLPFWLHDIPGIVFRSDNEPFKVYMQNFTAKVVSMMQSENLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY T++ A+ ++G YV WAA+MA TGVPWVMCKQ++APG VIN+CNGM+C
Sbjct: 176 LSQIENEYGTVQKAYGQEGLAYVQWAAQMAEGLQTGVPWVMCKQNNAPGHVINSCNGMKC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI-AKNGSYVNYYMY 208
G+TF GPNSPNKPSIWTE+WT+ +SA+DIAFHV LFI AK GS+VNYYMY
Sbjct: 236 GQTFVGPNSPNKPSIWTENWTT-----------QSAEDIAFHVTLFIAAKKGSFVNYYMY 284
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGRTA+AF+ T YYDQAPLDEYGL +PKWGHLKELHAAIKLCS PLL+G Q +
Sbjct: 285 HGGTNFGRTASAFVTTSYYDQAPLDEYGLTTQPKWGHLKELHAAIKLCSTPLLSGVQVNL 344
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG Q+A++F SG CAAFL+NND A +V FRN SY+LP SISILPDCK
Sbjct: 345 YLGPQQQAYIFNAVSGECAAFLINNDSSNAASVPFRNASYDLPPMSISILPDCK------ 398
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
VSTQY R+ D+ + W+E+ EAI NFD+T R+E LL+Q++ KD+SDY WY
Sbjct: 399 -NVSTQYTTRTMGRGEVLDAADVWQEFTEAIPNFDSTSTRSETLLEQMNTTKDSSDYLWY 457
Query: 389 TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDG 448
TFRF + SS+ QA LDV S GH LHAFVNG+ GS GS N F +V L +G N+
Sbjct: 458 TFRFQHESSDTQAILDVSSLGHALHAFVNGQAVGSVQGSRKNPRFKFETSVSLSKGINNV 517
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS----FTNCSWGYQVGLIGEKLQIYSNLG 504
+LLSV VG+PDSGAFLE + AG+ V ++DK FTN SWGYQ+GL GE LQIY+ G
Sbjct: 518 SLLSVMVGMPDSGAFLENRAAGLRTVMIRDKQDNNDFTNYSWGYQIGLQGETLQIYTEQG 577
Query: 505 LNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT 564
++V W + LTWYKT AP G+ P+ LNL SMGKGEAWVNGQSIGRYW S
Sbjct: 578 SSQVQWKKFSNAGNPLTWYKTQVDAPPGDVPVGLNLASMGKGEAWVNGQSIGRYWPS--- 634
Query: 565 SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVD 624
YHVPR+FLKPTGNLLVL EEE GNPL +++D
Sbjct: 635 -----------------------------YHVPRSFLKPTGNLLVLQEEEGGNPLQVSLD 665
Query: 625 TIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASF 684
T+ I +VCGHVT SHL P+SSW+ H QR K G++P V +CP KIS+I FAS+
Sbjct: 666 TVTISQVCGHVTASHLAPVSSWIEHNQRYKNPAKVSGRRPKVLLACPSKSKISRISFASY 725
Query: 685 GNPDGDCER-YAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDA 743
G P G+C AVG+CHS +S+ VVE AC+GK +CSIP+ R FGGDPCP K+L+V A
Sbjct: 726 GTPLGNCRNSMAVGTCHSQNSKAVVEEACLGKMKCSIPVSVRQFGGDPCPAKAKSLMVVA 785
Query: 744 QCR 746
+CR
Sbjct: 786 ECR 788
>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 900 bits (2327), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 437/765 (57%), Positives = 541/765 (70%), Gaps = 59/765 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAK GG+DV+ TYVFWN+HEPQ+GQ+DFSG DI++FIKE+++ GLYVCLRIG
Sbjct: 55 MWPSLIAKAKSGGIDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+ EW+YGGLP WLH+V GIVFR+DN+P+K
Sbjct: 115 PFIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + AF ++G YV W AK+AV+ TGVPWVMCKQDDAP P++NACNG +C
Sbjct: 175 LSQIENEYGMVGRAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETFKGPNSPNKP+IWTE+WTSFYQ +G +P IRSA+DIAFHVALFIAKNGS+VNYYMYH
Sbjct: 235 GETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR A+ F+IT YYDQAPLDEYGL+R+PKWGHLKELHAA+KLC PLL+G Q IS
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+LQ AFVF + + +CAA LVN D+ ++ TV FRN SY L KS+S+LPDCK VAFNT
Sbjct: 355 LGKLQTAFVFGKKANLCAAILVNQDKCES-TVQFRNSSYRLSPKSVSVLPDCKNVAFNTA 413
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ QYN R++ + S + WEE+ E + +F T +R+E LL+ ++ +D SDY W T
Sbjct: 414 KVNAQYNTRTRKARQNLSSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
RF S A + L V GH LHAFVNG + GS HG+ F L + L GTN+ A
Sbjct: 474 TRFQ-QSEGAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLA 532
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKS----FTNCSWGYQVGLIGEKLQIYSNLGL 505
LLSV VGLP+SGA LER+V G V++ + F N SWGYQVGL GEK +Y+ G
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGS 592
Query: 506 NKVLWSSIR-SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT 564
KV W R S ++ LTWYK +F P G DP+ALNL SMGKGEAWVNGQSIGRYWVSF T
Sbjct: 593 AKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFHT 652
Query: 565 SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE-NGNPLGITV 623
KGNPSQ YH+PR+FLKP NLLV+LEEE GNPLGIT+
Sbjct: 653 YKGNPSQIW--------------------YHIPRSFLKPNSNLLVILEEEREGNPLGITI 692
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFAS 683
DT+++ +VCGHV+N++ P+ S + ++ +KP VQ CP G+KISKI+FAS
Sbjct: 693 DTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFAS 752
Query: 684 FGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
FG P+G C Y++GSCHS +S VV++AC+ KSRCS+P+ S+ FG
Sbjct: 753 FGTPNGSCGSYSIGSCHSPNSLAVVQKACLKKSRCSVPVWSKTFG 797
>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
Length = 780
Score = 866 bits (2238), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/782 (54%), Positives = 532/782 (68%), Gaps = 81/782 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAK GG+DV+ TYVFWN+HEPQ+GQ+DFSG DI++FIKE+++ GLYVCLRIG
Sbjct: 42 MWPSLIAKAKSGGIDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIG 101
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+ EW+YGGLP WLH+V GIVFR+DN+P+K
Sbjct: 102 PFIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPII 161
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + AF ++G YV W AK+AV+ TGVPWVMCKQDDAP P++NACNG +C
Sbjct: 162 LSQIENEYGMVGRAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQC 221
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETFKGPNSPNKP+IWTE+WTS SA+DIAFHVALFIAKNGS+VNYYMYH
Sbjct: 222 GETFKGPNSPNKPAIWTENWTSL-----------SAEDIAFHVALFIAKNGSFVNYYMYH 270
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR A+ F+IT YYDQAPLDEYGL+R+PKWGHLKELHAA+KLC PLL+G Q IS
Sbjct: 271 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 330
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+LQ AFVF + + +CAA LVN D+ ++ TV FRN SY L KS+S+LPDCK VAFNT
Sbjct: 331 LGKLQTAFVFGKKANLCAAILVNQDKCES-TVQFRNSSYRLSPKSVSVLPDCKNVAFNTA 389
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ QYN R++ + S + WEE+ E + +F T +R+E LL+ ++ +D SDY W T
Sbjct: 390 KVNAQYNTRTRKARQNLSSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 449
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
RF S A + L V GH LHAFVNG + GS HG+ F L + L GTN+ A
Sbjct: 450 TRFQ-QSEGAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLA 508
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKS----FTNCSWGYQVGLIGEKLQIYSNLGL 505
LLSV VGLP+SGA LER+V G V++ + F N SWGYQVGL GEK +Y+ G
Sbjct: 509 LLSVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGS 568
Query: 506 NKVLWSSIR-SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT 564
KV W R S ++ LTWYK +F P G DP+ALNL SMGKGEAWVNGQSI +
Sbjct: 569 AKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIAMF------ 622
Query: 565 SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE-NGNPLGITV 623
+ YH+PR+FLKP NLLV+LEEE GNPLGIT+
Sbjct: 623 -------------------------SYFRYHIPRSFLKPNSNLLVILEEEREGNPLGITI 657
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFAS 683
DT+++ +VCGHV+N++ P+ S + ++ +KP VQ CP G+KISKI+FAS
Sbjct: 658 DTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFAS 717
Query: 684 FGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDA 743
FG P+G C Y++GSCHS +S VV++AC+ KSRCS+P+ S+ FGGD CP K+LLV A
Sbjct: 718 FGTPNGSCGSYSIGSCHSPNSLAVVQKACLKKSRCSVPVWSKTFGGDSCPHTVKSLLVRA 777
Query: 744 QC 745
QC
Sbjct: 778 QC 779
>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 696
Score = 864 bits (2232), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/659 (64%), Positives = 492/659 (74%), Gaps = 53/659 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LIAKAKEGGLDVIQTYVFWNLHEPQ+GQYDF G +I+RFIKEIQ+QGLYV LRIG
Sbjct: 57 MWPNLIAKAKEGGLDVIQTYVFWNLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+IESE TYGGLP+WLHD+ GIVFRSDN+ +K
Sbjct: 117 PYIESECTYGGLPLWLHDIPGIVFRSDNEQFKFHMQRFTAKIVNLMKSANLFASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AFHEKG Y+ WAA+MAV TGVPWVMCKQD+AP PVIN CNGM+C
Sbjct: 177 LSQIENEYGNVEGAFHEKGLSYIRWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TFKGPNSPNKPS+WTE+WTSFYQV+G PYIRSA+DIA++VALFIAK GSYVNYYMYH
Sbjct: 237 GKTFKGPNSPNKPSLWTENWTSFYQVFGEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF R A+AF++T YYD+APLDEYGLVREPKWGHLKELH AIK CS LL GTQ S
Sbjct: 297 GGTNFDRIASAFVVTAYYDEAPLDEYGLVREPKWGHLKELHEAIKSCSNSLLYGTQTSFS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG Q A+VF +S CAAFL N ++R +VT+ F+NI Y+LP SISILPDCK VAFNT
Sbjct: 357 LGTQQNAYVFRRSSIECAAFLENTEDR-SVTIQFQNIPYQLPPNSISILPDCKNVAFNTA 415
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V Q N R+ S L+F+S EKW+ YREAI +F +T LRA LLDQIS AKD SDY WYT
Sbjct: 416 KVRAQ-NARAMKSQLQFNSAEKWKVYREAIPSFADTSLRANTLLDQISTAKDTSDYLWYT 474
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
FR + NS+NAQ+ L SHGH+LHAFVNG GS HGSH NVSF + N ++L G N+ +
Sbjct: 475 FRLYDNSANAQSILSAYSHGHVLHAFVNGNLVGSKHGSHKNVSFVMENKLNLISGMNNIS 534
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVL 509
LS TVGLP+SGA+LE +VAG+ ++VQ + FTN +WGYQVGL+GEKLQIY+ G +KV
Sbjct: 535 FLSATVGLPNSGAYLEGRVAGLRSLKVQGRDFTNQAWGYQVGLLGEKLQIYTASGSSKVK 594
Query: 510 WSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNP 569
W S S T+ LTWYKTTF AP GNDP+ LNL SMGKG WVNGQ IGRYWVSF T +G P
Sbjct: 595 WESFLSSTKPLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWVNGQGIGRYWVSFHTPQGTP 654
Query: 570 SQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAI 628
SQ YH+PR+ LK TGNLLVLLEEE GNPLGIT+DT+ I
Sbjct: 655 SQKW--------------------YHIPRSLLKSTGNLLVLLEEETGNPLGITLDTVYI 693
>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 697
Score = 862 bits (2228), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 421/659 (63%), Positives = 493/659 (74%), Gaps = 53/659 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LIAKAKEGGLDVIQTYVFWNLHEPQ+GQYDF G +I+RFIKEIQ+QGLYV LRIG
Sbjct: 58 MWPNLIAKAKEGGLDVIQTYVFWNLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+IESE TYGGLP+WLHD+ GIVFRSDN+ +K
Sbjct: 118 PYIESECTYGGLPLWLHDIPGIVFRSDNEQFKFHMQKFSAKIVNLMKSANLFASQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AFHEKG Y+ WAA+MAV TGVPWVMCKQD+AP PVIN CNGM+C
Sbjct: 178 LSQIENEYGNVEGAFHEKGLSYIRWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TFKGPNSPNKPS+WTE+WTSFYQV+G PYIRSA+DIA++VALFIAK GSYVNYYMYH
Sbjct: 238 GKTFKGPNSPNKPSLWTENWTSFYQVFGEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF R A+AF+IT YYD+APLDEYGLVREPKWGHLKELHAAIK CS +L GTQ S
Sbjct: 298 GGTNFDRIASAFVITAYYDEAPLDEYGLVREPKWGHLKELHAAIKSCSNSILHGTQTSFS 357
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG Q A+VF+ +S CAAFL N E ++VT+ F+NI Y+LP SISILPDCK VAFNT
Sbjct: 358 LGTQQNAYVFKRSSIECAAFL-ENTEDQSVTIQFQNIPYQLPPNSISILPDCKNVAFNTA 416
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+VS Q N R+ S L+F+S E W+ Y+EAI +F +T LRA LLDQIS KD SDY WYT
Sbjct: 417 KVSIQ-NARAMKSQLEFNSAETWKVYKEAIPSFGDTSLRANTLLDQISTTKDTSDYLWYT 475
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
FR + NS NAQ+ L SHGH+LHAFVNG GS HGSH N+SF + N ++L G N+ +
Sbjct: 476 FRLYDNSPNAQSILSAYSHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNNIS 535
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVL 509
LS TVGLP+SGA+LER+VAG+ ++VQ + FTN +WGYQ+GL+GEKLQIY+ G +KV
Sbjct: 536 FLSATVGLPNSGAYLERRVAGLRSLKVQGRDFTNQAWGYQIGLLGEKLQIYTASGSSKVQ 595
Query: 510 WSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNP 569
W S +S T+ LTWYKTTF AP GNDP+ LNL SMGKG W+NGQ IGRYWVSF T +G P
Sbjct: 596 WESFQSSTKPLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYWVSFHTPQGTP 655
Query: 570 SQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAI 628
SQ YH+PR+ LK TGNLLVLLEEE GNPLGIT+DT+ I
Sbjct: 656 SQKW--------------------YHIPRSLLKSTGNLLVLLEEETGNPLGITLDTVYI 694
>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
Length = 758
Score = 853 bits (2203), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/678 (61%), Positives = 493/678 (72%), Gaps = 57/678 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW SLIAKAKEGG+DVIQTYVFWN HEPQ GQYDF+GR D+ +FIKEIQ+QGLY CLRIG
Sbjct: 92 MWASLIAKAKEGGVDVIQTYVFWNRHEPQPGQYDFNGRYDLAKFIKEIQAQGLYACLRIG 151
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW+YGGLP WLHDV GIV+R+DN+P+K
Sbjct: 152 PFIESEWSYGGLPFWLHDVHGIVYRTDNEPFKFYMQNFTTKIVNLMKSEGLYASQGGPII 211
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ IE AF+EKGP YV WAAKMAV+ TGVPWVMCKQ DAP PVIN CNGMRC
Sbjct: 212 LSQIENEYQNIEAAFNEKGPSYVRWAAKMAVELQTGVPWVMCKQSDAPDPVINTCNGMRC 271
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPNSPNKPS+WTE+WTSFY+V+GG+ Y+RSA+DIAFHVALFIA+NGSYVNYYMYH
Sbjct: 272 GQTFTGPNSPNKPSMWTENWTSFYEVFGGETYLRSAEDIAFHVALFIARNGSYVNYYMYH 331
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR ++A++ T YYDQAPLDEYGL+R+PKWGHLKELHAAI LCS PLL G Q+ IS
Sbjct: 332 GGTNFGRASSAYIKTSYYDQAPLDEYGLIRQPKWGHLKELHAAITLCSTPLLNGVQSNIS 391
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LGQLQEA+VF+E G C AFLVNNDE TVLF+N+S EL KSISILPDCK V FNT
Sbjct: 392 LGQLQEAYVFQEEMGGCVAFLVNNDEGNNSTVLFQNVSIELLPKSISILPDCKNVIFNTA 451
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+++T YN+R TS+ FD+ ++WEEY++AI NF +T L++ +L+ ++ KD SDY WYT
Sbjct: 452 KINTGYNERIATSSQSFDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT 511
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
FRF NSS + L ++S H +HAFVN Y G+ HGSHD FT ++ + L N+ +
Sbjct: 512 FRFQPNSSCTEPLLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNIS 571
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLG 504
+LSV VG PDSGA+LE + AG+ RV +Q F N +WGYQVGL GEKL IY
Sbjct: 572 ILSVMVGFPDSGAYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEEN 631
Query: 505 LNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
L+ V W T Q LTWYK F P+G+DP+ALNL +MGKGEAWVNGQSIGRYWVSF
Sbjct: 632 LSNVEWRKTEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFH 691
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
SKG+PSQT YHVPRAFLK + NLLVLLEE NG+PL I++
Sbjct: 692 NSKGDPSQT--------------------LYHVPRAFLKTSENLLVLLEEANGDPLHISL 731
Query: 624 DTIAIRKVCGHVTNSHLP 641
+TI+ + HV HLP
Sbjct: 732 ETISRTDLPDHVLYHHLP 749
>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
Length = 766
Score = 852 bits (2202), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 438/783 (55%), Positives = 521/783 (66%), Gaps = 107/783 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI+KAKEGG+DVI+TY FWN HEP++GQYDFSGR DI++F KE+Q+QGLY CLRIG
Sbjct: 54 MWPSLISKAKEGGIDVIETYAFWNQHEPKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIG 113
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW YGGLP WLHDV GI++RSDN+P+K
Sbjct: 114 PFIESEWNYGGLPFWLHDVPGIIYRSDNEPFKFYMQNFTTKIVNLMKSENLYASQGGPII 173
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+ +E AFHEKGPPYV WAAKMAVD T + +
Sbjct: 174 LSQIENEYKNVEAAFHEKGPPYVRWAAKMAVDLQTAMRYY-------------------- 213
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK-NGSYVNYYMY 208
GE +G R+A+D+AF VALFIAK NGS++NYYMY
Sbjct: 214 GEDKRG---------------------------RAAEDLAFQVALFIAKKNGSFINYYMY 246
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGRT++++++T YYDQAPLDEYGL+R+PKWGHLKELHA IKLCS LL G Q
Sbjct: 247 HGGTNFGRTSSSYVLTAYYDQAPLDEYGLIRQPKWGHLKELHAVIKLCSDTLLXGVQYNY 306
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQLQEA++F+ SG CAAFLVNND+R+ VTVLF+N +YEL SISILPDCK +AFNT
Sbjct: 307 SLGQLQEAYLFKRPSGQCAAFLVNNDKRRNVTVLFQNTNYELAANSISILPDCKKIAFNT 366
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+VSTQ+N RS + F S ++W EYRE I +F T L+A LL+ + KDASDY WY
Sbjct: 367 AKVSTQFNTRSVQTRATFGSTKQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWY 426
Query: 389 TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDG 448
T RF +NSSNAQ L V S H+L AFVNG+Y SAHGSH N SF+L N V L G N
Sbjct: 427 TLRFIHNSSNAQPVLRVDSLAHVLLAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRI 486
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRVQD----KSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LLSV VGLPD+G +LE KVAG+ RV +QD K F+ WGYQVGL+GEKLQIY++ G
Sbjct: 487 SLLSVMVGLPDAGPYLEHKVAGIRRVEIQDGGXSKDFSKHPWGYQVGLMGEKLQIYTSPG 546
Query: 505 LNKVLWSSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
KV W + S R LTWYKT F AP GNDP+ L SMGKGEAWVNGQSIGRYWVS+
Sbjct: 547 SQKVQWYGLGSHGRGPLTWYKTLFDAPRGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYL 606
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
T G PSQT Y+VPRAFL P GNLLV+ EEE+G+PL I++
Sbjct: 607 TPSGEPSQTW--------------------YNVPRAFLNPKGNLLVVQEEESGDPLKISI 646
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFAS 683
T+++ VCGHVT+SH PP+ SW D + GK P VQ CP ISKI FAS
Sbjct: 647 GTVSVTNVCGHVTDSHPPPIISW---TTSDDGNESHHGKIPKVQLRCPPSSNISKITFAS 703
Query: 684 FGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDA 743
FG P G CE YA+GSCHS +S V E+AC+GK+ CSIP + FG DPCPG KALLV A
Sbjct: 704 FGTPVGGCESYAIGSCHSPNSLAVAEKACLGKNXCSIPHSLKSFGDDPCPGTPKALLVAA 763
Query: 744 QCR 746
QC+
Sbjct: 764 QCK 766
>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
Length = 729
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/685 (60%), Positives = 490/685 (71%), Gaps = 64/685 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW SLIAKAKEGG+DVIQTYVFWN HEPQ GQYDF+GR D+ +FIKEIQ+QGLY CLRIG
Sbjct: 56 MWASLIAKAKEGGVDVIQTYVFWNRHEPQPGQYDFNGRYDLAKFIKEIQAQGLYACLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW+YGGLP WLHDV GIV+R+DN+P+K
Sbjct: 116 PFIESEWSYGGLPFWLHDVHGIVYRTDNEPFKFYMQNFTTKIVNLMKSEGLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ IE AF+EKGP YV WAAKMAV+ TGVPWVMCKQ DAP PVIN CNGMRC
Sbjct: 176 LSQIENEYQNIEAAFNEKGPSYVRWAAKMAVELQTGVPWVMCKQSDAPDPVINTCNGMRC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPNSPNKPS+WTE+WTSFY+V+GG+ Y+RSA+DIAFHVALFIA+NGSYVNYYMYH
Sbjct: 236 GQTFTGPNSPNKPSMWTENWTSFYEVFGGETYLRSAEDIAFHVALFIARNGSYVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR ++A++ T YYDQAPLDEYGL+R+PKWGHLKELHAAI LCS PLL G Q+ IS
Sbjct: 296 GGTNFGRASSAYIKTSYYDQAPLDEYGLIRQPKWGHLKELHAAITLCSTPLLNGVQSNIS 355
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LGQLQEA+VF+E G C AFLVNNDE TVLF+N+S EL KSISILPDCK V FNT
Sbjct: 356 LGQLQEAYVFQEEMGGCVAFLVNNDEGNNSTVLFQNVSIELLPKSISILPDCKNVIFNTA 415
Query: 330 RVST-------QYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
+V + + + S++ FD+ ++WEEY++AI NF +T L++ +L+ ++ KD
Sbjct: 416 KVCSSSRQSAYKIQELSRSCIQSFDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDE 475
Query: 383 SDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
SDY WYTFRF NSS + L ++S H +HAFVN Y G+ HGSHD FT ++ + L
Sbjct: 476 SDYLWYTFRFQPNSSCTEPLLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLN 535
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
N+ ++LSV VG PDSGA+LE + AG+ RV +Q F N +WGYQVGL GEKL
Sbjct: 536 NEMNNISILSVMVGFPDSGAYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKL 595
Query: 498 QIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
IY L+ V W T Q LTWYK F P+G+DP+ALNL +MGKGEAWVNGQSIG
Sbjct: 596 HIYKEENLSNVEWRKTEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIG 655
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYWVSF SKG+PSQT YHVPRAFLK + NLLVLLEE NG
Sbjct: 656 RYWVSFHNSKGDPSQT--------------------LYHVPRAFLKTSENLLVLLEEANG 695
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLP 641
+PL I+++TI+ + HV HLP
Sbjct: 696 DPLHISLETISRTDLPDHVLYHHLP 720
>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
Length = 694
Score = 836 bits (2160), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/660 (62%), Positives = 488/660 (73%), Gaps = 53/660 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI+KAKEGGLDVIQTYVFWNLHEPQ+GQY+F+GR D++ FIKEIQ+QGLYV LRIG
Sbjct: 56 MWPDLISKAKEGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+IESE TYGGLP+WLHDV GIVFR+DN +K
Sbjct: 116 PYIESECTYGGLPLWLHDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +I+ F G PY+ WAA+MAV TGVPW+MCKQDDAP PVINACNGM+C
Sbjct: 176 LSQIENEYGSIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G FKGPNSPNKPS+WTE+WTSF Q +GG PY+RSA DIA++VALFIAK GSYVNYYMYH
Sbjct: 236 GRNFKGPNSPNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF R A+AF+IT YYD+APLDEYGLVR+PKWGHLKELHA+IK CS+PLL GTQ S
Sbjct: 296 GGTNFDRLASAFIITAYYDEAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFS 355
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG Q+A+VF +S CAAFL N+ R VT+ F+NISYELP KSISILP CK V FNT
Sbjct: 356 LGSEQQAYVF-RSSTECAAFLENSGPRD-VTIQFQNISYELPGKSISILPGCKNVVFNTG 413
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+VS Q N R+ L+F+S E W+ Y EAI NF +T RA+ LLDQIS AKD SDY WYT
Sbjct: 414 KVSIQNNVRAMKPRLQFNSAENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYT 473
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
FRF+ S NA++ L + S G +LH+F+NG TGSAHGS +N T++ V+L G N+ +
Sbjct: 474 FRFNNKSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNNIS 533
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVL 509
+LS TVGLP+SGAFLE +VAG+ +V VQ + F++ SWGYQVGL+GEKLQI++ G +KV
Sbjct: 534 ILSATVGLPNSGAFLESRVAGLRKVEVQGRDFSSYSWGYQVGLLGEKLQIFTVSGSSKVQ 593
Query: 510 WSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNP 569
W S +S T+ LTWY+TTF APAGNDP+ +NL SMGKG AWVNGQ IGRYWVSF G P
Sbjct: 594 WKSFQSSTKPLTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRYWVSFHKPDGTP 653
Query: 570 SQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIR 629
SQ YH+PR+FLK TGNLLV+LEEE GNPLGIT+DT+ I+
Sbjct: 654 SQ--------------------QWYHIPRSFLKSTGNLLVILEEETGNPLGITLDTVYIK 693
>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 832 bits (2148), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/678 (59%), Positives = 481/678 (70%), Gaps = 57/678 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK+GGLDVIQTYVFWNLHEPQ G YDFSGR D++ FIKEIQ+QGLYVCLRIG
Sbjct: 57 MWPDLIAKAKQGGLDVIQTYVFWNLHEPQPGMYDFSGRYDLVGFIKEIQAQGLYVCLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEWTYGG P WLHDV GIV+R+DN+P+K
Sbjct: 117 PFIESEWTYGGFPFWLHDVPGIVYRTDNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ I+ AF G YV WAAKMAV TGVPW+MCKQ DAP PVIN CNGMRC
Sbjct: 177 LSQIENEYQNIQKAFGTAGSQYVQWAAKMAVGLDTGVPWIMCKQTDAPDPVINTCNGMRC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP++WTE+WTSFYQV+GG PYIRSA+DIAFHV LFIA+NGSYVNYYMYH
Sbjct: 237 GETFTGPNSPNKPALWTENWTSFYQVYGGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT +A++ITGYYDQAPLDEYGL+R+PKWGHLK+LH IK CS LL G Q +
Sbjct: 297 GGTNFGRTGSAYVITGYYDQAPLDEYGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFT 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LGQL E +VFEE G C AFL+NND TV FRN SYEL KSISILPDC+ V F+T
Sbjct: 357 LGQLLEVYVFEEEKGECVAFLINNDRDNKATVQFRNSSYELLPKSISILPDCQNVTFSTA 416
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
V+T N+R + F S + W+++++ I NFDNT L+++ LL+Q++ KD SDY WYT
Sbjct: 417 NVNTTSNRRIISPKQNFSSVDDWQQFQDVISNFDNTSLKSDSLLEQMNTTKDKSDYLWYT 476
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
RF YN S ++ L VQS H+ HAFVN Y G HG+HD SFTL V + QGTN+ +
Sbjct: 477 LRFEYNLSCSKPTLSVQSAAHVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLS 536
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LSV VGLPDSGAFLER+ AG+ V +Q + TN +WGYQVGL+GE+LQ+Y
Sbjct: 537 ILSVMVGLPDSGAFLERRFAGLISVELQCSEQESLNLTNSTWGYQVGLMGEQLQVYKEQN 596
Query: 505 LNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
+ WS + + Q L WYKTTF P G+DP+ L+L SMGKGEAWVNG+SIGRYW+ F
Sbjct: 597 NSDTGWSQLGNVMEQTLFWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYWILFH 656
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
SKGNPSQ+ YHVPR+FLK +GN+LVLLEE GNPLGI++
Sbjct: 657 DSKGNPSQS--------------------LYHVPRSFLKDSGNVLVLLEEGGGNPLGISL 696
Query: 624 DTIAIRKVCGHVTNSHLP 641
DT+++ + + + LP
Sbjct: 697 DTVSVTDLQQNFSKLSLP 714
>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
Length = 715
Score = 829 bits (2142), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/680 (59%), Positives = 490/680 (72%), Gaps = 59/680 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSL+AKA+EGG+DVIQTYVFWNLHEP+ G+YDFSGRND++RFIKEIQ+QGLYVCLRIG
Sbjct: 55 MWPSLVAKAREGGVDVIQTYVFWNLHEPRPGEYDFSGRNDLVRFIKEIQAQGLYVCLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEWTYGG P WLHDV IV+RSDN+P+K
Sbjct: 115 PFIESEWTYGGFPFWLHDVPDIVYRSDNEPFKFYMQNFTTKIVNMMKSEGLYASQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF +KGPPYV+WAAKMAV+ TGVPWVMCKQ DAP PVIN CNGMRC
Sbjct: 175 LSQIENEYQNVEAAFRDKGPPYVIWAAKMAVELQTGVPWVMCKQTDAPDPVINTCNGMRC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSP KPS+WTE+WTSFYQV+GG+PYIRSA+DIAFHV LFIAKNGSY+NYYM+H
Sbjct: 235 GETFGGPNSPTKPSLWTENWTSFYQVYGGEPYIRSAEDIAFHVTLFIAKNGSYINYYMFH 294
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRTA+A++IT YYDQAPLDEYGL+R+PKWGHLKELHAAIK CS +L G Q+ S
Sbjct: 295 GGTNFGRTASAYVITSYYDQAPLDEYGLIRQPKWGHLKELHAAIKSCSSTILEGVQSNFS 354
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LGQLQ+A++FEE CAAFLVNND++ TV FRNI++EL KSIS+LPDC+ + FNT
Sbjct: 355 LGQLQQAYIFEEEGAGCAAFLVNNDQKNNATVEFRNITFELLPKSISVLPDCENIIFNTA 414
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ + N+ ++TS+ FD ++WE Y + I NF +T L+++ LL+ ++ KD SDY WYT
Sbjct: 415 KVNAKGNEITRTSSQLFDDADRWEAYTDVIPNFADTNLKSDTLLEHMNTTKDKSDYLWYT 474
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVS-FTLRNTVHLRQGTNDG 448
F F NSS + L V+S H+ AFVN +Y GSAHGS D FT+ + L N
Sbjct: 475 FSFLPNSSCTEPILHVESLAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQMNTI 534
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFT-NCSWGYQVGLIGEKLQIYSN 502
++LS VGL DSGAFLER+ AG+ RV + + +FT N WGYQ GL GE L IY
Sbjct: 535 SILSTMVGLQDSGAFLERRYAGLTRVEIRCAQQEIYNFTNNYEWGYQAGLSGESLNIYMR 594
Query: 503 LGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVS 561
L+ + WS + S T Q L+W+K F AP GNDP+ LNL +MGKGEAWVNGQSIGRYW+S
Sbjct: 595 EHLDNIEWSEVVSATDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVNGQSIGRYWLS 654
Query: 562 FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGI 621
F TSKG PSQT YH+PRAFL +GNLLVLLEE G+PL I
Sbjct: 655 FLTSKGQPSQT--------------------LYHIPRAFLNSSGNLLVLLEESGGDPLHI 694
Query: 622 TVDTIAIRKVCGHVTNSHLP 641
++DT++ + H + H P
Sbjct: 695 SLDTVSRTGLQEHASRYHPP 714
>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 824 bits (2129), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/681 (59%), Positives = 480/681 (70%), Gaps = 57/681 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK+GGLDVIQTYVFWNLHEPQ G YDF GR D++ FIKEIQ+QGLYVCLRIG
Sbjct: 57 MWPDLIAKAKQGGLDVIQTYVFWNLHEPQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+SEW YGG P WLHDV GIV+R+DN+ +K
Sbjct: 117 PFIQSEWKYGGFPFWLHDVPGIVYRTDNESFKFYMQNFTTKIVNMMKEEGLYASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ I+ AF G YV WAAKMAV +TGVPWVMCKQ DAP PVIN CNGMRC
Sbjct: 177 LSQIENEYQNIQKAFGTAGSQYVQWAAKMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP++WTE+WTSFYQV+GG PYIRSA+DIAFHV LFIA+NGSYVNYYMYH
Sbjct: 237 GETFTGPNSPNKPALWTENWTSFYQVYGGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRTA+A++ITGYYDQAPLDEYGL+R+PKWGHLK+LH IK CS LL G Q S
Sbjct: 297 GGTNFGRTASAYVITGYYDQAPLDEYGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LGQLQE +VFEE G C AFL NND VTV FRN SYEL +SISILPDC+ VAFNT
Sbjct: 357 LGQLQEGYVFEEEKGECVAFLKNNDRDNKVTVQFRNRSYELLPRSISILPDCQNVAFNTA 416
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
V+T N+R + F S + W+++++ I FDNT LR++ LL+Q++ KD SDY WYT
Sbjct: 417 NVNTTSNRRIISPKQNFSSLDDWKQFQDVIPYFDNTSLRSDSLLEQMNTTKDKSDYLWYT 476
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
RF YN S + L VQS H+ HAF+N Y G HG+HD SFTL V + QGTN+ +
Sbjct: 477 LRFEYNLSCRKPTLSVQSAAHVAHAFINNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLS 536
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LS VGLPDSGAFLER+ AG+ V +Q + TN +WGYQVGL+GE+LQ+Y
Sbjct: 537 ILSAMVGLPDSGAFLERRFAGLISVELQCSEQESLNLTNSTWGYQVGLLGEQLQVYKKQN 596
Query: 505 LNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
+ + WS + + Q L WYKTTF P G+DP+ L+L SMGKGEAWVN QSIGRYW+ F
Sbjct: 597 NSDIGWSQLGNIMEQLLIWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNEQSIGRYWILFH 656
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
SKGNPSQ+ YHVPR+FLK TGN+LVL+EE GNPLGI++
Sbjct: 657 DSKGNPSQS--------------------LYHVPRSFLKDTGNVLVLVEEGGGNPLGISL 696
Query: 624 DTIAIRKVCGHVTNSHLPPLS 644
DT+++ + + + LP S
Sbjct: 697 DTVSVIDLQQNFSKLTLPSSS 717
>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 673
Score = 819 bits (2116), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/664 (60%), Positives = 482/664 (72%), Gaps = 66/664 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LI+KAKEGGLDVIQTYVFWNLHEPQ GQYDFSGR D++RFIKEIQ QGLYVCLRIG
Sbjct: 34 MWPALISKAKEGGLDVIQTYVFWNLHEPQFGQYDFSGRYDLVRFIKEIQVQGLYVCLRIG 93
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+IESEWTYGG P WLHDV IV+R+DN+P+K
Sbjct: 94 PYIESEWTYGGFPFWLHDVPAIVYRTDNQPFKLYMQNFTTKIVSMMQSEGLYASQGGPII 153
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF E G YV WAA+MAV TGVPW+MCKQ DAP P+IN CNGMRC
Sbjct: 154 LSQIENEYQNVEKAFGEDGSRYVQWAAEMAVGLKTGVPWLMCKQTDAPDPLINTCNGMRC 213
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPNSPNKP+ WTE+WTSFYQV+GG+PYIRSA+DIAFHV LFIA KNGSYVNYYMY
Sbjct: 214 GETFTGPNSPNKPAFWTENWTSFYQVYGGEPYIRSAEDIAFHVTLFIARKNGSYVNYYMY 273
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTN GRT+++++IT YYDQAPLDEYGL+R+PKWGHLKELHAAIK CS LL G Q+
Sbjct: 274 HGGTNLGRTSSSYVITSYYDQAPLDEYGLLRQPKWGHLKELHAAIKSCSTTLLEGKQSNF 333
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQLQE +VFEE G C AFLVNND K TV FRN SYELP KSISILPDC+ V FNT
Sbjct: 334 SLGQLQEGYVFEE-EGKCVAFLVNNDHVKMFTVQFRNRSYELPSKSISILPDCQNVTFNT 392
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V+T+ N+R ++ F S +KWE++++ I NFD T L + LL+Q++ KD SDY WY
Sbjct: 393 ATVNTKSNRRMTSTIQTFSSADKWEQFQDVIPNFDQTTLISNSLLEQMNVTKDKSDYLWY 452
Query: 389 TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDG 448
T +++ L QS H+ HAF +G Y G AHGSHD SFT + + L +GTN+
Sbjct: 453 TL--------SESKLTAQSAAHVTHAFADGTYLGGAHGSHDVKSFTTQVPLKLNEGTNNI 504
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRVQ--DKSF--TNCSWGYQVGLIGEKLQIYSNLG 504
++LSV VGLPD+GAFLER+ AG+ V +Q ++S+ TN +WGYQVGL+GE+L+IY
Sbjct: 505 SILSVMVGLPDAGAFLERRFAGLTAVEIQCSEESYDLTNSTWGYQVGLLGEQLEIYEEKS 564
Query: 505 LNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
+ + WS + + Q LTWYKT F +P G++P+ALNL+SMGKG+AWVNG+SIGRYW+SF
Sbjct: 565 NSSIQWSPLGNTCNQTLTWYKTAFDSPKGDEPVALNLESMGKGQAWVNGESIGRYWISFH 624
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
SKG PSQT YHVPR+FLK GN LVL EEE GNPL I++
Sbjct: 625 DSKGQPSQT--------------------LYHVPRSFLKDIGNSLVLFEEEGGNPLHISL 664
Query: 624 DTIA 627
DTI+
Sbjct: 665 DTIS 668
>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
Length = 706
Score = 813 bits (2100), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/672 (59%), Positives = 483/672 (71%), Gaps = 65/672 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI+KAKEGGLDVIQTYVFWNLHEPQ+GQY+F+GR D++ FIKEIQ+QGLYV LRIG
Sbjct: 56 MWPDLISKAKEGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+IESE TYGGLP+WLHDV GIVFR+DN +K
Sbjct: 116 PYIESECTYGGLPLWLHDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +I+ F G PY+ WAA+MAV TGVPW+MCKQDDAP PVINACNGM+C
Sbjct: 176 LSQIENEYGSIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G FKGPNSPNKPS+WTE+WTSF Q +GG PY+RSA DIA++VALFIAK GSYVNYYMYH
Sbjct: 236 GRNFKGPNSPNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAKKGSYVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF R A+AF+IT YYD+APLDEYGLVR+PKWGHLKELHA+IK CS+PLL GTQ S
Sbjct: 296 GGTNFDRLASAFIITAYYDEAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFS 355
Query: 270 LGQLQEA-----------FVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISI 317
LG Q+ +F E V ++ ++ + VT+ F+NISYELP KSISI
Sbjct: 356 LGSEQQVIKNESSWTYFPLMFSEVPQNVLLSWKISGP--RDVTIQFQNISYELPGKSISI 413
Query: 318 LPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQIS 377
LP CK V FNT +VS Q N R+ L+F+S E W+ Y EAI NF +T RA+ LLDQIS
Sbjct: 414 LPGCKNVVFNTGKVSIQNNVRAMKPRLQFNSAENWKVYTEAIPNFAHTSKRADTLLDQIS 473
Query: 378 AAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
AKD SDY WYTFRF+ S NA++ L + S G +LH+F+NG TGSAHGS +N T++
Sbjct: 474 TAKDTSDYMWYTFRFNNKSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKK 533
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKL 497
V+L G N+ ++LS TVGLP+SGAFLE +VAG+ +V VQ + F++ SWGYQVGL+GEKL
Sbjct: 534 NVNLINGMNNISILSATVGLPNSGAFLESRVAGLRKVEVQGRDFSSYSWGYQVGLLGEKL 593
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
QI++ G +KV W S +S T+ LTWY+TTF APAGNDP+ +NL SMGKG AWVNGQ IGR
Sbjct: 594 QIFTVSGSSKVQWKSFQSSTKPLTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGR 653
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YWVSF G PSQ YH+PR+FLK TGNLLV+LEEE GN
Sbjct: 654 YWVSFHKPDGTPSQ--------------------QWYHIPRSFLKSTGNLLVILEEETGN 693
Query: 618 PLGITVDTIAIR 629
PLGIT+DT+ I+
Sbjct: 694 PLGITLDTVYIK 705
>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
Length = 719
Score = 806 bits (2083), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/680 (58%), Positives = 480/680 (70%), Gaps = 59/680 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK+GGLDVIQTYVFWNLHEPQ G+YDFSGRND++ FIKEI +QGLYV LRIG
Sbjct: 57 MWPGLIAKAKQGGLDVIQTYVFWNLHEPQPGKYDFSGRNDLVGFIKEIHAQGLYVSLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW YGG P WLHDV GIV+R+DN+P+K
Sbjct: 117 PFIESEWNYGGFPFWLHDVPGIVYRTDNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ AF G YV WAAKMAV +TGVPWVMCKQ DAP PVIN CNGMRC
Sbjct: 177 LSQIENEYGNIQKAFGTAGSQYVEWAAKMAVGLNTGVPWVMCKQPDAPDPVINTCNGMRC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP++WTE+WTSFYQV+GG PYIRSA+DIAFHV LF+A+NGS+VNYYMYH
Sbjct: 237 GETFTGPNSPNKPAMWTENWTSFYQVYGGVPYIRSAEDIAFHVTLFVARNGSFVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT++A+MITGYYDQAPLDEYGL R+PKWGHLKELHAAIK CS LL G Q S
Sbjct: 297 GGTNFGRTSSAYMITGYYDQAPLDEYGLFRQPKWGHLKELHAAIKSCSTTLLQGVQRNFS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+LQE +VFEE +G CAAFL+NND+ VTV F N SY+L KSISILPDC+ VAFNT
Sbjct: 357 LGELQEGYVFEEENGKCAAFLINNDKGNTVTVQFNNSSYKLLPKSISILPDCQNVAFNTA 416
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
++T N+R TS F S + W+++++ I NFD+T LR++ LL+Q++ KD SDY WYT
Sbjct: 417 HLNTTSNRRIITSRQNFSSVDDWKQFQDVIPNFDDTSLRSDSLLEQMNTTKDKSDYLWYT 476
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
R N S L VQS H+ +AFVN Y G HG+HD SFTL + L + TN+ +
Sbjct: 477 LRLENNLSCNDPILHVQSSAHVAYAFVNNTYIGGEHGNHDVKSFTLELPITLNERTNNIS 536
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LS VGLPDSGAFLE++ AG++ V +Q + N +WGYQVGL+GE+L++Y+
Sbjct: 537 ILSGMVGLPDSGAFLEKRFAGLNNVELQCSEQESLNLNNSTWGYQVGLLGEQLKVYTEQN 596
Query: 505 LNKVLWSSIRSPTRQ---LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVS 561
+ W+ + + T LTWYKTTF P G+DPIAL+L SM KGEAWVNGQSIGRYW+
Sbjct: 597 STDIKWTQLGNITIDEVTLTWYKTTFDTPKGDDPIALDLSSMAKGEAWVNGQSIGRYWIL 656
Query: 562 FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGI 621
F SKGNPSQ+ YHVPR+FLK + N LVLL+E GNPL I
Sbjct: 657 FLDSKGNPSQS--------------------LYHVPRSFLKDSENSLVLLDEGGGNPLDI 696
Query: 622 TVDTIAIRKVCGHVTNSHLP 641
+++T+++ + + + P
Sbjct: 697 SLNTVSVTDLQDNFSKLPFP 716
>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
Length = 811
Score = 805 bits (2080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/788 (52%), Positives = 505/788 (64%), Gaps = 77/788 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK GGLDVIQTYVFWN+HEP +GQY+F GR D+++FI+EIQ+QGLYV LRIG
Sbjct: 59 MWPKLIAKAKNGGLDVIQTYVFWNVHEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+E+EW YGG P WLHDV I FRSDN+P+K
Sbjct: 119 PFVEAEWKYGGFPFWLHDVPSITFRSDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ IEPAF GP YV WAA MAV TGVPW+MCKQ+DAP PVIN CNG+ C
Sbjct: 179 ISQIENEYQMIEPAFGASGPRYVRWAAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLIC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPNSPNKP++WTE+WTS Y ++G +R +DIAF VAL+IA K GS+V+YYMY
Sbjct: 239 GETFVGPNSPNKPALWTENWTSRYPIYGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMY 298
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR AA+++ T YYD APLDEYGL+ +P WGHL+ELH A+K S PLL G+ +
Sbjct: 299 HGGTNFGRFAASYVTTSYYDGAPLDEYGLIWQPTWGHLRELHCAVKQSSEPLLFGSYSNF 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ QEA VF ET C AFLVN D+ V FRNIS EL KSIS+L DC+ V F T
Sbjct: 359 SLGQQQEAHVF-ETDFKCVAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFET 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+ Q+ R+ + + W+ + E + + + L +Q+ KD +DY W
Sbjct: 418 AKVNAQHGSRTANAVQSLNDINNWKAFIEPVPQDLSKSTYTGNQLFEQLPTTKDETDYLW 477
Query: 388 YTFRFHYNSS--NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT-VHLRQG 444
Y + +S N A L V+S HILHAFVN EY GS HGSHD + NT + L++G
Sbjct: 478 YIVSYKNRASDGNQIARLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEG 537
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQI 499
N +LLSV VG PDSGA++ER+ G+ V +Q N WGYQVGL GEK I
Sbjct: 538 DNTISLLSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSI 597
Query: 500 YSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
Y+ G N V W I + LTWYKTTF P GND + LNL SMGKGE WVNG+SIGRY
Sbjct: 598 YTQEGPNSVRWMDINNLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRY 657
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
WVSFK G PSQ+ YH+PR FL P NLLVL+EE G+P
Sbjct: 658 WVSFKAPSGQPSQS--------------------LYHIPRGFLTPKDNLLVLVEEMGGDP 697
Query: 619 LGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISK 678
L ITV+T+++ VCG+V +PPL S GK P V+ C GK+IS
Sbjct: 698 LQITVNTMSVTTVCGNVDEFSVPPLQS--------------RGKVPKVRIWCQGGKRISS 743
Query: 679 IVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKA 738
I FAS+GNP GDC + +GSCH+ S+ VV+++CIG+ CSIP+++ FGGDPCPGI K+
Sbjct: 744 IEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFGGDPCPGIQKS 803
Query: 739 LLVDAQCR 746
LLV A CR
Sbjct: 804 LLVVADCR 811
>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
Length = 821
Score = 805 bits (2080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/788 (51%), Positives = 512/788 (64%), Gaps = 77/788 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +IAKA++GG+DVIQTYVFWN+HEP +G+Y+F GR +I++FI+EIQ+QGLYV LRIG
Sbjct: 69 MWPKIIAKARKGGIDVIQTYVFWNVHEPVQGKYNFEGRYNIVKFIREIQAQGLYVSLRIG 128
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW YGG P WLH+V I FR+DN+P+K
Sbjct: 129 PFIEAEWKYGGFPFWLHEVPNITFRTDNEPFKQHMQGFVTHMVNMMKNEGLYYPQGGPII 188
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +EPAF GP YV WAA +AV TGVPW+MCKQ+DAP P+IN CNG+ C
Sbjct: 189 ISQIENEYQMVEPAFGPGGPRYVQWAASLAVGLQTGVPWMMCKQNDAPDPIINTCNGLIC 248
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPNSPNKP++WTE+WT+ Y ++G +RS DI F VALFIA K GS+V+YYMY
Sbjct: 249 GETFVGPNSPNKPALWTENWTTRYPIYGNDTKLRSTGDITFAVALFIARKGGSFVSYYMY 308
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR A++++ T YYD APLDEYGL+ +P WGHLKELHAA+KL S PLL GT +
Sbjct: 309 HGGTNFGRFASSYVTTSYYDGAPLDEYGLIWQPTWGHLKELHAAVKLSSEPLLYGTYSNF 368
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG+ QEA VF ET C AFLVN D+ + TV+FRNIS +L KSISIL DC+TV F T
Sbjct: 369 SLGEDQEAHVF-ETKLKCVAFLVNFDKHQRPTVIFRNISLQLAPKSISILSDCRTVVFET 427
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+ Q+ R+ + W+ ++E+I + + L + +S KD +DY W
Sbjct: 428 GKVNAQHGSRTAEVVQSLNDTHTWKAFKESIPQDISKAAYTGKQLFEHLSTTKDETDYLW 487
Query: 388 YTFRFHYNSSNAQ--APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN-TVHLRQG 444
Y + Y S+ L+V+S HILHAFVNGE+ GS HGSH + + N T+ L++G
Sbjct: 488 YIASYEYRPSDDSHLVLLNVESQAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKEG 547
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQI 499
N +LL+V VG PDSGA +ER+ G+H+V +Q N WGYQVGL GE +I
Sbjct: 548 QNTISLLNVMVGSPDSGAHMERRSFGIHKVSIQQGQHALHLLNNELWGYQVGLFGEGNRI 607
Query: 500 YSNLGLNKVLWSSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
Y+ G + V W+ + + T LTWY+TTF P GND + LNL SMGKGE W+NG+SIGRY
Sbjct: 608 YTQEGSHSVEWTDVNNLTYLPLTWYQTTFATPMGNDAVTLNLTSMGKGEVWINGESIGRY 667
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
WVSFKT G PSQ+ YH+P+ FLK T NLLVL+EE GNP
Sbjct: 668 WVSFKTPSGQPSQS--------------------LYHIPQHFLKNTDNLLVLVEEMGGNP 707
Query: 619 LGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISK 678
L ITV+T++I VC V PP+ S GK P V+ C GK IS
Sbjct: 708 LQITVNTVSITTVCSSVNELSAPPVQSQ--------------GKDPEVRLRCQKGKHISA 753
Query: 679 IVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKA 738
+ FAS+GNP GDC + +GSCH+ S+ VV++ACIGK CSIP+ FGGDPCPGI K+
Sbjct: 754 VEFASYGNPAGDCRTFTIGSCHAESSESVVKQACIGKRSCSIPVGPGSFGGDPCPGIQKS 813
Query: 739 LLVDAQCR 746
LLV A CR
Sbjct: 814 LLVVAHCR 821
>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
Length = 710
Score = 781 bits (2016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/671 (57%), Positives = 463/671 (69%), Gaps = 84/671 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW SLIAKAKEGG+DVIQTYVFWN HEPQ GQYDF+GR D+ +FIKEIQ+QGLY CLRIG
Sbjct: 56 MWASLIAKAKEGGVDVIQTYVFWNRHEPQPGQYDFNGRYDLXKFIKEIQAQGLYACLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW+YGGLP WLHDV GIV+R+DN+P+K
Sbjct: 116 PFIESEWSYGGLPFWLHDVHGIVYRTDNEPFKFYMQNFTTKIVNLMKSEGLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ IE AF+EKGP YV WAAKMAV+ TGVPWVMCKQ DAP PVIN CNGMRC
Sbjct: 176 LSQIENEYQNIEAAFNEKGPSYVRWAAKMAVELQTGVPWVMCKQSDAPDPVINTCNGMRC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPNSPNKPS+WTE+WTSFY+V+GG+ Y+RSA+DIAFHVALFIA+NGSYVNYYM
Sbjct: 236 GQTFTGPNSPNKPSMWTENWTSFYEVFGGETYLRSAEDIAFHVALFIARNGSYVNYYMV- 294
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
L+R+PKWGHLKELHAAI LCS PLL G Q+ IS
Sbjct: 295 --------------------------SLIRQPKWGHLKELHAAITLCSTPLLNGVQSNIS 328
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LGQLQEA+VF+E G C AFLVNNDE TVLF+N+S EL KSISILPDCK V FNT
Sbjct: 329 LGQLQEAYVFQEEMGGCVAFLVNNDEGNNSTVLFQNVSIELLPKSISILPDCKNVIFNTA 388
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+++T YN+R TS+ FD+ ++WEEY++AI NF +T L++ +L+ ++ KD SDY WYT
Sbjct: 389 KINTGYNERITTSSQSFDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT 448
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
FRF NSS + L ++S H +HAFVN Y G+ HGSHD FT ++ + L N+ +
Sbjct: 449 FRFQPNSSCTEPLLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNIS 508
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLG 504
+LSV VG PDSGA+LE + AG+ RV +Q F N +WGYQVGL GEKL IY
Sbjct: 509 ILSVMVGFPDSGAYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEEN 568
Query: 505 LNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFK 563
L+ V W T Q LTWYK F P+G+DP+ALNL +MGKGEAWVNGQSIGRYWVSF
Sbjct: 569 LSNVEWRKTEISTNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFH 628
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
SKG+PSQT YHVPRAFLK + NLLVLLEE NG+PL I++
Sbjct: 629 NSKGDPSQT--------------------LYHVPRAFLKTSENLLVLLEEANGDPLHISL 668
Query: 624 DTIAIRKVCGH 634
+TI+ + H
Sbjct: 669 ETISRTDLPDH 679
>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
Length = 716
Score = 757 bits (1955), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 367/667 (55%), Positives = 454/667 (68%), Gaps = 60/667 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI K KEGG+DVIQTYVFWNLHEP+ GQYDFSGRND+++FIKEI+SQGLYVCLRIG
Sbjct: 60 MWPSLIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW YGGLP WL DV G+V+R+DN+P+K
Sbjct: 120 PFIEAEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTTKIVNLMKSEGLYASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AFHEKG Y+ WA +MAV TGVPW+MCK DAP PVIN CNGMRC
Sbjct: 180 LSQIENEYANVEAAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMRC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP +WTEDWTSF+QV+G +PYIRSA+DIAFH LFIAKNGSY+NYYMYH
Sbjct: 240 GETFPGPNSPNKPKMWTEDWTSFFQVYGTEPYIRSAEDIAFHAVLFIAKNGSYINYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT++++ ITGYYDQAPLDEYGL+R+PK+GHLKELHAAIK + PLL G Q ++S
Sbjct: 300 GGTNFGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILS 359
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG +Q+A+VFE+ S C AFLVNND K + FR SY L KSI IL +CK + + T
Sbjct: 360 LGPMQQAYVFEDASSGCVAFLVNNDA-KVSQIQFRKSSYSLSPKSIGILQNCKNLIYETA 418
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ + NKR T F+ EKWE +RE I F T L+A LL+ + KD +DY WYT
Sbjct: 419 KVNVEKNKRVTTPVQVFNVPEKWEGFRETIPAFSGTSLKANALLEHTNLTKDKTDYLWYT 478
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
F +S + ++S GH++H FVN GS HGS D L+ L G N +
Sbjct: 479 SSFKPDSPCTNPSIYIESSGHVVHVFVNNALAGSGHGSRDIKVVKLQVPASLTNGQNSIS 538
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LS VGLPDSGA++ERK G+ +V++ + + WGY VGL+GEK+++
Sbjct: 539 ILSGMVGLPDSGAYMERKSYGLTKVQISCGGTKPIDLSGSQWGYSVGLLGEKVRLQQWRN 598
Query: 505 LNKVLWSSIRS---PTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVS 561
LN+V WS + R L WYKT F P G+ P+ LN+ SMGKGE WVNG+SIGRYWVS
Sbjct: 599 LNRVKWSMNNAGLIKNRPLIWYKTIFDGPNGDGPVGLNMSSMGKGEIWVNGESIGRYWVS 658
Query: 562 FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGI 621
F T G+PSQ+ YH+PR FLKP+GNLLV+ EEE G+PLGI
Sbjct: 659 FLTPSGHPSQS--------------------IYHIPREFLKPSGNLLVVFEEEGGDPLGI 698
Query: 622 TVDTIAI 628
+++TI++
Sbjct: 699 SLNTISV 705
>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 718
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/667 (54%), Positives = 458/667 (68%), Gaps = 60/667 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI KAKEGG+DVIQTYVFWNLHEP+ GQYDFSGRND+++FIKEI+SQGLYVCLRIG
Sbjct: 62 MWPSLIKKAKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW YGGLP WL DV G+V+R+DN+P+K
Sbjct: 122 PFIEAEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AFHEKG Y+ WA +MAV TGVPW+MCK DAP PVIN CNGM+C
Sbjct: 182 LSQIENEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP +WTEDWTSF+QV+G +PYIRSA+DIAFH ALF+AKNGSY+NYYMYH
Sbjct: 242 GETFPGPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYH 301
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT++++ ITGYYDQAPLDEYGL+R+PK+GHLKELHAAIK + PLL G Q ++S
Sbjct: 302 GGTNFGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILS 361
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG +Q+A+VFE+ + C AFLVNND KA + FRN +Y L KSI IL +CK + + T
Sbjct: 362 LGPMQQAYVFEDANNGCVAFLVNNDA-KASQIQFRNNAYSLSPKSIGILQNCKNLIYETA 420
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ + N R T F+ + W +RE I F T L+ LL+ + KD +DY WYT
Sbjct: 421 KVNVKMNTRVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYT 480
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
F +S + +S GH++H FVN GS HGS D L+ V L G N+ +
Sbjct: 481 SSFKLDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNIS 540
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LS VGLPDSGA++ER+ G+ +V++ + + WGY VGL+GEK+++Y
Sbjct: 541 ILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKN 600
Query: 505 LNKVLWSSIRS---PTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVS 561
LN+V WS ++ R L WYKTTF P G+ P+ L++ SMGKGE WVNG+SIGRYWVS
Sbjct: 601 LNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVS 660
Query: 562 FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGI 621
F T G PSQ+ YH+PRAFLKP+GNLLV+ EEE G+PLGI
Sbjct: 661 FLTPAGQPSQS--------------------IYHIPRAFLKPSGNLLVVFEEEGGDPLGI 700
Query: 622 TVDTIAI 628
+++TI++
Sbjct: 701 SLNTISV 707
>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
Length = 718
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/667 (54%), Positives = 457/667 (68%), Gaps = 60/667 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI K KEGG+DVIQTYVFWNLHEP+ GQYDFSGRND+++FIKEI+SQGLYVCLRIG
Sbjct: 62 MWPSLIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW YGGLP WL DV G+V+R+DN+P+K
Sbjct: 122 PFIEAEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AFHEKG Y+ WA +MAV TGVPW+MCK DAP PVIN CNGM+C
Sbjct: 182 LSQIENEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP +WTEDWTSF+QV+G +PYIRSA+DIAFH ALF+AKNGSY+NYYMYH
Sbjct: 242 GETFPGPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYH 301
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT++++ ITGYYDQAPLDEYGL+R+PK+GHLKELHAAIK + PLL G Q ++S
Sbjct: 302 GGTNFGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILS 361
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG +Q+A+VFE+ + C AFLVNND KA + FRN +Y L KSI IL +CK + + T
Sbjct: 362 LGPMQQAYVFEDANNGCVAFLVNNDA-KASQIQFRNNAYSLSPKSIGILQNCKNLIYETA 420
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ + N R T F+ + W +RE I F T L+ LL+ + KD +DY WYT
Sbjct: 421 KVNVKMNTRVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYT 480
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
F +S + +S GH++H FVN GS HGS D L+ V L G N+ +
Sbjct: 481 SSFKLDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNIS 540
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LS VGLPDSGA++ER+ G+ +V++ + + WGY VGL+GEK+++Y
Sbjct: 541 ILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKN 600
Query: 505 LNKVLWSSIRS---PTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVS 561
LN+V WS ++ R L WYKTTF P G+ P+ L++ SMGKGE WVNG+SIGRYWVS
Sbjct: 601 LNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVS 660
Query: 562 FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGI 621
F T G PSQ+ YH+PRAFLKP+GNLLV+ EEE G+PLGI
Sbjct: 661 FLTPAGQPSQS--------------------IYHIPRAFLKPSGNLLVVFEEEGGDPLGI 700
Query: 622 TVDTIAI 628
+++TI++
Sbjct: 701 SLNTISV 707
>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/667 (54%), Positives = 456/667 (68%), Gaps = 60/667 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI K KEGG+DVIQTYVFWNLHEP+ GQYDFSGRND+++FIKEI+SQGLYVCLRIG
Sbjct: 62 MWPSLIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW YGGLP WL DV G+V+R+DN+P+K
Sbjct: 122 PFIEAEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E AFHEKG Y+ WA +MAV TGVPW+MCK DAP PVIN CNGM+C
Sbjct: 182 LSQIENEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP +WTEDWTSF+QV+G +PYIRSA+DIAFH ALF+AKNGSY+NYYMYH
Sbjct: 242 GETFPGPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAKNGSYINYYMYH 301
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT++++ ITGYYDQAPLDEYGL+R+PK+GHLKELHAAIK + PLL G Q ++S
Sbjct: 302 GGTNFGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILS 361
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG +Q+A+VFE+ + C AFLVNND KA + FRN +Y L KSI IL +CK + + T
Sbjct: 362 LGPMQQAYVFEDANNGCVAFLVNNDA-KASQIQFRNNAYSLSPKSIGILQNCKNLIYETA 420
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ + N R T F+ + W +RE I LL+ LL+ + KD +DY WYT
Sbjct: 421 KVNVKMNTRVTTPVQVFNVPDNWNLFRETIPASQAHLLKTNALLEHTNLTKDKTDYLWYT 480
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
F +S + +S GH++H FVN GS HGS D L+ V L G N+ +
Sbjct: 481 SSFKLDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNIS 540
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFTNCSWGYQVGLIGEKLQIYSNLG 504
+LS VGLPDSGA++ER+ G+ +V++ + + WGY VGL+GEK+++Y
Sbjct: 541 ILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKN 600
Query: 505 LNKVLWSSIRS---PTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVS 561
LN+V WS ++ R L WYKTTF P G+ P+ L++ SMGKGE WVNG+SIGRYWVS
Sbjct: 601 LNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVS 660
Query: 562 FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGI 621
F T G PSQ+ YH+PRAFLKP+GNLLV+ EEE G+PLGI
Sbjct: 661 FLTPAGQPSQS--------------------IYHIPRAFLKPSGNLLVVFEEEGGDPLGI 700
Query: 622 TVDTIAI 628
+++TI++
Sbjct: 701 SLNTISV 707
>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
Length = 765
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/788 (48%), Positives = 472/788 (59%), Gaps = 123/788 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK GGLDVIQTYVFWN+HEP +GQY+F GR D+++FI+EIQ+QGLYV LRIG
Sbjct: 59 MWPKLIAKAKNGGLDVIQTYVFWNVHEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+E+EW YGG P WLHDV I FRSDN+P+K
Sbjct: 119 PFVEAEWKYGGFPFWLHDVPSITFRSDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ IEPAF GP YV WAA MAV TGVPW+MCKQ+DAP PVIN CNG+ C
Sbjct: 179 ISQIENEYQMIEPAFGASGPRYVRWAAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLIC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPNSPNKP++WTE+WTS Y ++G +R+ +DIAF VALFIA K GS+V+YYMY
Sbjct: 239 GETFVGPNSPNKPALWTENWTSRYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMY 298
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR AA+++ T YYD APLDEY
Sbjct: 299 HGGTNFGRFAASYVTTSYYDGAPLDEYDFK------------------------------ 328
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
C AFLVN D+ V FRNIS EL KSIS+L DC+ V F T
Sbjct: 329 -----------------CVAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFET 371
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+ Q+ R+ + + W+ + E + + + L +Q++ KD +DY W
Sbjct: 372 AKVNAQHGSRTANAVQSLNDINNWKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLW 431
Query: 388 YTFRFHYNSS--NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT-VHLRQG 444
Y + +S N A L V+S HILHAFVN EY GS HGSHD + NT + L++G
Sbjct: 432 YIVSYKNRASDGNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEG 491
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQI 499
N +LLSV VG PDSGA++ER+ G+ V +Q N WGYQVGL GEK I
Sbjct: 492 DNTISLLSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSI 551
Query: 500 YSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
Y+ G N V W I + LTWYKTTF P GND + LNL SMGKGE WVNG+SIGRY
Sbjct: 552 YTQEGTNSVRWMDINNLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRY 611
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
WVSFK G PSQ+ YH+PR FL P NLLVL+EE G+P
Sbjct: 612 WVSFKAPSGQPSQS--------------------LYHIPRGFLTPKDNLLVLVEEMGGDP 651
Query: 619 LGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISK 678
L ITV+T+++ VCG+V +PPL S GK P V+ C G +IS
Sbjct: 652 LQITVNTMSVTTVCGNVDEFSVPPLQS--------------RGKVPKVRIWCQGGNRISS 697
Query: 679 IVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKA 738
I FAS+GNP GDC + +GSCH+ S+ VV+++CIG+ CSIP+++ FGGDPCPGI K+
Sbjct: 698 IEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFGGDPCPGIQKS 757
Query: 739 LLVDAQCR 746
LLV A CR
Sbjct: 758 LLVVADCR 765
>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
Length = 761
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/788 (48%), Positives = 472/788 (59%), Gaps = 123/788 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK GGLDVIQTYVFWN+HEP +GQY+F GR D+++FI+EIQ+QGLYV LRIG
Sbjct: 55 MWPKLIAKAKNGGLDVIQTYVFWNVHEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+E+EW YGG P WLHDV I FRSDN+P+K
Sbjct: 115 PFVEAEWKYGGFPFWLHDVPSITFRSDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ IEPAF GP YV WAA MAV TGVPW+MCKQ+DAP PVIN CNG+ C
Sbjct: 175 ISQIENEYQMIEPAFGASGPRYVRWAAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLIC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPNSPNKP++WTE+WTS Y ++G +R +DIAF VAL+IA K GS+V+YYMY
Sbjct: 235 GETFVGPNSPNKPALWTENWTSRYPIYGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMY 294
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR AA+++ T YYD APLDEY
Sbjct: 295 HGGTNFGRFAASYVTTSYYDGAPLDEYDFK------------------------------ 324
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
C AFLVN D+ V FRNIS EL KSIS+L DC+ V F T
Sbjct: 325 -----------------CVAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFET 367
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+ Q+ R+ + + W+ + E + + + L +Q++ KD +DY W
Sbjct: 368 AKVNAQHGSRTANAVQSLNDINNWKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLW 427
Query: 388 YTFRFHYNSS--NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT-VHLRQG 444
Y + +S N A L V+S HILHAFVN EY GS HGSHD + NT + L++G
Sbjct: 428 YIVSYKNRASDGNQIARLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEG 487
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQI 499
N +LLSV VG PDSGA++ER+ G+ V +Q N WGYQVGL GEK I
Sbjct: 488 DNTISLLSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSI 547
Query: 500 YSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
Y+ G N V W I + LTWYKTTF P GND + LNL SMGKGE WVNG+SIGRY
Sbjct: 548 YTQEGPNSVRWMDINNLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRY 607
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
WVSFK G PSQ+ YH+PR FL P NLLVL+EE G+P
Sbjct: 608 WVSFKAPSGQPSQS--------------------LYHIPRGFLTPKDNLLVLVEEMGGDP 647
Query: 619 LGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISK 678
L ITV+T+++ VCG+V +PPL S GK P V+ C GK+IS
Sbjct: 648 LQITVNTMSVTTVCGNVDEFSVPPLQS--------------RGKVPKVRIWCQGGKRISS 693
Query: 679 IVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKA 738
I FAS+GNP GDC + +GSCH+ S+ VV+++CIG+ CSIP+++ FGGDPCPGI K+
Sbjct: 694 IEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFGGDPCPGIQKS 753
Query: 739 LLVDAQCR 746
LLV A CR
Sbjct: 754 LLVVADCR 761
>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
Length = 775
Score = 727 bits (1877), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/798 (48%), Positives = 472/798 (59%), Gaps = 133/798 (16%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK GGLDVIQTYVFWN+HEP +GQY+F GR D+++FI+EIQ+QGLYV LRIG
Sbjct: 59 MWPKLIAKAKNGGLDVIQTYVFWNVHEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+E+EW YGG P WLHDV I FRSDN+P+K
Sbjct: 119 PFVEAEWKYGGFPFWLHDVPSITFRSDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ IEPAF GP YV WAA MAV TGVPW+MCKQ+DAP PVIN CNG+ C
Sbjct: 179 ISQIENEYQMIEPAFGASGPRYVRWAAAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLIC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTS----------FYQVWGGKPYIRSAQDIAFHVALFIA-K 198
GETF GPNSPNKP++WTE+WTS Y ++G +R+ +DIAF VALFIA K
Sbjct: 239 GETFVGPNSPNKPALWTENWTSRSNGQNNSAFSYPIYGNDTKLRAPEDIAFAVALFIARK 298
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
GS+V+YYMYHGGTNFGR AA+++ T YYD APLDEY
Sbjct: 299 KGSFVSYYMYHGGTNFGRFAASYVTTSYYDGAPLDEYDFK-------------------- 338
Query: 259 PLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISIL 318
C AFLVN D+ V FRNIS EL KSIS+L
Sbjct: 339 ---------------------------CVAFLVNFDQHNTPKVEFRNISLELAPKSISVL 371
Query: 319 PDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQIS 377
DC+ V F T +V+ Q+ R+ + + W+ + E + + + L +Q++
Sbjct: 372 SDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWKAFIEPVPQDLSKSTYTGNQLFEQLT 431
Query: 378 AAKDASDYFWYTFRFHYNSS--NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
KD +DY WY + +S N A L V+S HILHAFVN EY GS HGSHD +
Sbjct: 432 TTKDETDYLWYIVSYKNRASDGNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIV 491
Query: 436 RNT-VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQ 489
NT + L++G N +LLSV VG PDSGA++ER+ G+ V +Q N WGYQ
Sbjct: 492 LNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQ 551
Query: 490 VGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
VGL GEK IY+ G N V W I + LTWYKTTF P GND + LNL SMGKGE
Sbjct: 552 VGLFGEKDSIYTQEGTNSVRWMDINNLIYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEV 611
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
WVNG+SIGRYWVSFK G PSQ+ YH+PR FL P NLL
Sbjct: 612 WVNGESIGRYWVSFKAPSGQPSQS--------------------LYHIPRGFLTPKDNLL 651
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
VL+EE G+PL ITV+T+++ VCG+V +PPL S GK P V+
Sbjct: 652 VLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS--------------RGKVPKVRI 697
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
C G +IS I FAS+GNP GDC + +GSCH+ S+ VV+++CIG+ CSIP+++ FG
Sbjct: 698 WCQGGNRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFG 757
Query: 729 GDPCPGIHKALLVDAQCR 746
GDPCPGI K+LLV A CR
Sbjct: 758 GDPCPGIQKSLLVVADCR 775
>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
thaliana]
Length = 636
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/584 (58%), Positives = 419/584 (71%), Gaps = 38/584 (6%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAK GG+DV+ TYVFWN+HEPQ+GQ+DFSG DI++FIKE+++ GLYVCLRIG
Sbjct: 55 MWPSLIAKAKSGGIDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+ EW+YGGLP WLH+V GIVFR+DN+P+K
Sbjct: 115 PFIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + AF ++G YV W AK+AV+ TGVPWVMCKQDDAP P++NACNG +C
Sbjct: 175 LSQIENEYGMVGRAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETFKGPNSPNKP+IWTE+WTSFYQ +G +P IRSA+DIAFHVALFIAKNGS+VNYYMYH
Sbjct: 235 GETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR A+ F+IT YYDQAPLDEYGL+R+PKWGHLKELHAA+KLC PLL+G Q IS
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+LQ AFVF + + +CAA LVN D+ ++ TV FRN SY L KS+S+LPDCK VAFNT
Sbjct: 355 LGKLQTAFVFGKKANLCAAILVNQDKCES-TVQFRNSSYRLSPKSVSVLPDCKNVAFNTA 413
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V+ QYN R++ + S + WEE+ E + +F T +R+E LL+ ++ +D SDY W T
Sbjct: 414 KVNAQYNTRTRKARQNLSSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
RF S A + L V GH LHAFVNG + GS HG+ F L + L GTN+ A
Sbjct: 474 TRFQ-QSEGAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLA 532
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKS----FTNCSWGYQVGLIGEKLQIYSNLGL 505
LLSV VGLP+SGA LER+V G V++ + F N SWGYQVGL GEK +Y+ G
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGS 592
Query: 506 NKVLWSSIR-SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
KV W R S ++ LTWYK +F P G DP+ALNL SMGKGEA
Sbjct: 593 AKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEA 636
>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
Length = 835
Score = 712 bits (1837), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/793 (46%), Positives = 484/793 (61%), Gaps = 76/793 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK GGL+VIQTYVFWN+HEP++G+++F G D+++FIK I G++ LR+G
Sbjct: 61 MWPELILKAKRGGLNVIQTYVFWNIHEPEQGKFNFEGPYDLVKFIKTIGENGMFATLRLG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FRSDN P+K
Sbjct: 121 PFIQAEWNHGGLPYWLREIPDIIFRSDNAPFKHHMEKFVTKIIDMMKEEKLFASQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY T++ A+ G Y+ WA MA+ +TGVPWVMCKQ DAPGPVIN CNG C
Sbjct: 181 LSQIENEYNTVQLAYKNLGVSYIQWAGNMALGLNTGVPWVMCKQKDAPGPVINTCNGRHC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN PNKPS+WTE+WT+ ++V+G P RSA+D AF VA + +KNGS VNYYMYH
Sbjct: 241 GDTFTGPNKPNKPSLWTENWTAQFRVFGDPPSQRSAEDTAFSVARWFSKNGSLVNYYMYH 300
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTAA+F+ T YYD+APLDEYGL REPKWGHLK+LH A+ LC + LL G NV
Sbjct: 301 GGTNFDRTAASFVTTRYYDEAPLDEYGLQREPKWGHLKDLHRALNLCKKALLWGNPNVQK 360
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
L EA +E+ + VCAAFL +N+ ++A TV FR Y LP +SISILPDCKTV +NT
Sbjct: 361 LSADVEARFYEQPGTKVCAAFLASNNSKEAETVKFRGQEYYLPARSISILPDCKTVVYNT 420
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI---LNFDNTLLRAEGLLDQISAAKDASDY 385
V +Q+N R+ + K + E W Y E I L D++L + + + KD +DY
Sbjct: 421 MTVVSQHNSRNFVKSRKTNKLE-WNMYSETIPAQLQVDSSLPK-----ELYNLTKDKTDY 474
Query: 386 FWYTF-----RFHYNSSNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
W+T R N P L V S GH + AFVNGE+ GSAHGS SF L+++V
Sbjct: 475 VWFTTTINVDRRDMNERKRINPVLRVASLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHSV 534
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIG 494
L+ G N LL VGLPDSGA++E + AG V + + T+ WG+QVGL G
Sbjct: 535 DLKPGINFVTLLGTLVGLPDSGAYMEHRYAGPRGVSILGLNTGTLDLTSNGWGHQVGLSG 594
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
E ++++ G KV W+ ++ +TWYKT F AP G P+A+ + M KG W+NG+S
Sbjct: 595 ETAKLFTKEGGGKVTWTKVQKAGPPVTWYKTHFDAPEGKSPVAVRMTGMNKGMIWINGKS 654
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
IGRYW+++ + G P+Q++ YH+PR++LKPT NL+V+ EEE
Sbjct: 655 IGRYWMTYVSPLGEPTQSE--------------------YHIPRSYLKPTDNLMVIFEEE 694
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGK 674
NP I + T+ +C +VT H P + SW R + + KP CP K
Sbjct: 695 EANPEKIEILTVNRDTICSYVTEYHPPSVKSWERKNNKFTPVVDN--AKPAAHLKCPNQK 752
Query: 675 KISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG--DPC 732
KI + FASFG+P G C YAVG+CHS S+ VVE C+GK+ C IP+ F G D C
Sbjct: 753 KIIAVQFASFGDPLGTCGDYAVGTCHSLVSKQVVEEHCLGKTSCDIPIDKGLFAGKKDDC 812
Query: 733 PGIHKALLVDAQC 745
PGI K L V +C
Sbjct: 813 PGISKTLAVQVKC 825
>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/792 (45%), Positives = 487/792 (61%), Gaps = 71/792 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP ++ KAK GGL++IQTYVFWN+HEP +GQ++F G D+++FIK I GLY LRIG
Sbjct: 62 MWPDILQKAKHGGLNLIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGDYGLYATLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW +GG P WL +V I+FRS N+P+K
Sbjct: 122 PFIEAEWNHGGFPYWLREVPDIIFRSYNEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +I+ A+ E G YV WA KMAV GVPW+MCKQ DAP PVIN CNG C
Sbjct: 182 LAQIENEYNSIQLAYRELGVQYVQWAGKMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN PNKPS+WTE+WT+ Y+V+G P R+A+D+AF VA FI+KNG+ NYYMYH
Sbjct: 242 GDTFTGPNRPNKPSLWTENWTAQYRVFGDPPSQRAAEDLAFSVARFISKNGTLANYYMYH 301
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT ++F+ T YYD+APLDEYGL REPKWGHLK+LH+A++LC + L TG+ V
Sbjct: 302 GGTNFGRTGSSFVTTRYYDEAPLDEYGLQREPKWGHLKDLHSALRLCKKALFTGSPGVEK 361
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+ +E +E+ + +CAAFL NN R+A T+ FR Y LP SISILPDCKTV +NT
Sbjct: 362 LGKDKEVRFYEKPGTHICAAFLTNNHSREAATLTFRGEEYFLPPHSISILPDCKTVVYNT 421
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+N R+ + + + KWE +E I + + + ++ + KD SDY W+
Sbjct: 422 QRVVAQHNARNFVKSKIANKNLKWEMSQEPIPVMTDMKILTKSPMELYNFLKDRSDYAWF 481
Query: 389 TFRFHYNSSNAQAP--------LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
SN P L + + GH + AFVNG + GSAHGS+ +F R V
Sbjct: 482 VTSIEL--SNYDLPMKKDIIPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVK 539
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGE 495
+ GTN ALL +TVGLP+SGA++E + AG+H V++ + TN WG QVG+ GE
Sbjct: 540 FKAGTNYIALLCMTVGLPNSGAYMEHRYAGIHSVQILGLNTGTLDITNNGWGQQVGVNGE 599
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
++ Y+ G ++V W++ + +TWYKT F P GNDP+ L + SM KG AWVNG++I
Sbjct: 600 HVKAYTQGGSHRVQWTAAKGKGPAMTWYKTYFDMPEGNDPVILRMTSMAKGMAWVNGKNI 659
Query: 556 GRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
GRYW+S+ + PSQ++ YHVPRA+LKP+ NLLV+ EE
Sbjct: 660 GRYWLSYLSPLEKPSQSE--------------------YHVPRAWLKPSDNLLVIFEETG 699
Query: 616 GNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKK 675
GNP I V+ + +C VT H P + SW RH + + + KP CP K
Sbjct: 700 GNPEEIEVELVNRDTICSIVTEYHPPHVKSWQRHDSKIRAVVDEV--KPKGHLKCPNYKV 757
Query: 676 ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD--PCP 733
I K+ FASFGNP G C + +G+C + +S+ VVE+ C+GK+ C IP+ + F G+ C
Sbjct: 758 IVKVDFASFGNPLGACGDFEMGNCTAPNSKKVVEQHCMGKTTCEIPMEAGIFDGNSGACS 817
Query: 734 GIHKALLVDAQC 745
I K L V +C
Sbjct: 818 DITKTLAVQVRC 829
>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/795 (45%), Positives = 483/795 (60%), Gaps = 79/795 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK GGL+VIQTYVFWN+HEP++G+++F G D+++FIK I G+ +R+G
Sbjct: 61 MWPELIQKAKRGGLNVIQTYVFWNIHEPEQGKFNFEGSYDLVKFIKTIGENGMSATIRLG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FRSDN P+K
Sbjct: 121 PFIQAEWNHGGLPYWLREIPDIIFRSDNAPFKLHMERFVTMIINKLKEEKLFASQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY T++ A+ G YV WA MA+ TGVPWVMCKQ DAPGPVIN CNG C
Sbjct: 181 LAQIENEYNTVQLAYRNLGVSYVQWAGNMALGLKTGVPWVMCKQKDAPGPVINTCNGRHC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPNSP+KPS+WTE+WT+ ++V+G P RSA+D AF VA + +KNGS VNYYMYH
Sbjct: 241 GDTFTGPNSPDKPSLWTENWTAQFRVFGDPPSQRSAEDTAFSVARWFSKNGSLVNYYMYH 300
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTAA+F+ T YYD+APLDEYGL REPKWGHLK+LH A+ LC + LL GT NV
Sbjct: 301 GGTNFDRTAASFVTTRYYDEAPLDEYGLQREPKWGHLKDLHRALNLCKKALLWGTPNVQR 360
Query: 270 LGQLQEAFVFEE-TSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
L EA FE+ + CAAFL NN+ + TV FR Y LP KSISILPDCKTV +NT
Sbjct: 361 LSADVEARFFEQPRTNDCAAFLANNNTKDPETVTFRGKKYYLPAKSISILPDCKTVVYNT 420
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V +Q+N R+ + K D +W+ + E I + N L+ + + + KD +DY W+
Sbjct: 421 MTVVSQHNSRNFVKSRKTDGKLEWKMFSETIPS--NLLVDSRIPRELYNLTKDKTDYAWF 478
Query: 389 TFRFHYNSSNAQAPLD------VQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + + ++ A D V S GH + AF+NGE+ GSAHGS SF L+++V L+
Sbjct: 479 TTTINVDRNDLSARKDINPVLRVASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLK 538
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
G N LL VGLPDSGA++E + AG V + + ++ WG+QV L GE
Sbjct: 539 PGINFVTLLGSLVGLPDSGAYMEHRYAGPRGVSILGLNTGTLDLSSNGWGHQVALSGETA 598
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
++++ G KV W+ + +TWYKT F AP G P+A+ + M KG W+NG+SIGR
Sbjct: 599 KVFTKEGGRKVTWTKVNKDGPPVTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGR 658
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW+++ + G P+Q++ YH+PR++LKPT NL+V+LEEE +
Sbjct: 659 YWMNYISPLGEPTQSE--------------------YHIPRSYLKPTNNLMVILEEEGAS 698
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF-----GKKPTVQPSCPL 672
P I + T+ +C +VT H P + SW R KKF KP + CP
Sbjct: 699 PEKIEILTVNRDTICSYVTEYHPPNVRSWERKN-------KKFTPVADDAKPAARLKCPN 751
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG--D 730
KKI + FASFG+P G C +AVG+C S S+ VVE+ C+GK+ C IP+ F G D
Sbjct: 752 KKKIVAVQFASFGDPSGTCGNFAVGTCDSPISKQVVEQHCLGKTSCDIPMDKGLFNGKKD 811
Query: 731 PCPGIHKALLVDAQC 745
CP + K L V +C
Sbjct: 812 NCPNLTKNLAVQVKC 826
>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
Length = 806
Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/790 (43%), Positives = 472/790 (59%), Gaps = 68/790 (8%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ KA++GG++V+QTYVFWN+HE +KG+Y + D I+FIK IQ +G+YV LR+GP
Sbjct: 40 WAGILDKARQGGINVVQTYVFWNIHETEKGKYSIEPQYDYIKFIKLIQKKGMYVTLRVGP 99
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK----------------------------- 92
FI++EW +GGLP WL +V I+FRS+N+P+K
Sbjct: 100 FIQAEWNHGGLPYWLREVPEIIFRSNNEPFKKHMKKYVSTVIKTVKDANLFAPQGGPIIL 159
Query: 93 --IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
IENEY I+ AF E+G YV WAAKMAV GVPW+MCKQ DAP PVINACNG CG
Sbjct: 160 AQIENEYNHIQRAFREEGDNYVQWAAKMAVSLDIGVPWIMCKQTDAPDPVINACNGRHCG 219
Query: 151 ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
+TF GPN P KP+IWTE+WT+ Y+V+G P RSA+DIAF VA F +KNGS VNYYMYHG
Sbjct: 220 DTFSGPNKPYKPAIWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSKNGSLVNYYMYHG 279
Query: 211 GTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
GTNFGRT++AF T YYD+APLDEYG+ REPKW HL+++H A+ LC R L G V +
Sbjct: 280 GTNFGRTSSAFTTTRYYDEAPLDEYGMQREPKWSHLRDVHRALSLCKRALFNGASTVTKM 339
Query: 271 GQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
Q E VFE+ S +CAAF+ NN + T+ FR Y +P +SISILPDCKTV FNT+
Sbjct: 340 SQHHEVIVFEKPGSNLCAAFITNNHTKVPTTISFRGTDYYMPPRSISILPDCKTVVFNTQ 399
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+++Q++ R+ ++ + D KWE Y E I + ++ S KD SDY WYT
Sbjct: 400 CIASQHSSRNFKRSMAAN-DHKWEVYSETIPTTKQIPTHEKNPIELYSLLKDTSDYAWYT 458
Query: 390 FRFHY------NSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
++ L + S GH L AFVNGE+ GS HGSH+ F + V L+
Sbjct: 459 TSVELRPEDLPKKNDIPTILRIMSLGHSLLAFVNGEFIGSNHGSHEEKGFEFQKPVTLKV 518
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQ 498
G N A+L+ TVGLPDSGA++E + AG + + T+ WG++VG+ GEKL
Sbjct: 519 GVNQIAILASTVGLPDSGAYMEHRFAGPKSIFILGLNSGKMDLTSNGWGHEVGIKGEKLG 578
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
I++ G KV W + P ++WYKT F P G DP+A+ + MGKG W+NG+SIGR+
Sbjct: 579 IFTEEGSKKVQWKEAKGPGPAVSWYKTNFATPEGTDPVAIRMTGMGKGMVWINGKSIGRH 638
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
W+S+ + G P+Q++ YH+PR + P NLLV+ EEE NP
Sbjct: 639 WMSYLSPLGQPTQSE--------------------YHIPRTYFNPKDNLLVVFEEEIANP 678
Query: 619 LGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISK 678
+ + T+ +C VT +H P + SW ++ + P+ CP + I
Sbjct: 679 EKVEILTVNRDTICSFVTENHPPNVKSWAIKSEKFQAVVNDL--VPSASLKCPHQRTIKA 736
Query: 679 IVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF--GGDPCPGIH 736
+ FASFG+P G C +A+G C++ + +VE+ C+GK+ C +P+ F G D CP +
Sbjct: 737 VEFASFGDPAGACGAFALGKCNAPAIKQIVEKQCLGKASCLVPIDKDAFTKGQDACPNVT 796
Query: 737 KALLVDAQCR 746
KAL + +C
Sbjct: 797 KALAIQVRCE 806
>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
Length = 784
Score = 697 bits (1800), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/787 (48%), Positives = 464/787 (58%), Gaps = 115/787 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGLD+IQTYVFWN+HEP +GQY+F GR D++RFIKEIQ+QGLYV LRIG
Sbjct: 72 MWPKLIAKAKEGGLDMIQTYVFWNVHEPVQGQYNFEGRYDLVRFIKEIQAQGLYVSLRIG 131
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW YGG P WLHDV I FRSDN+P+K
Sbjct: 132 PFIESEWKYGGFPFWLHDVPNITFRSDNEPFKQHMQRFVTDIVNMMKHEGLYYPQGGPII 191
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF G YV WAA MAVD TGVPW MCKQ+DAP PV+ G+
Sbjct: 192 TSQIENEYQMVEHAFGSSGQRYVSWAAAMAVDRQTGVPWTMCKQNDAPDPVV----GIHS 247
Query: 150 GET-FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYM 207
PN+ Y ++G +RS +DIAF V FIA KNGSYV+YYM
Sbjct: 248 HTIPLDFPNASRN-----------YLIYGNDTKLRSPEDIAFAVVYFIARKNGSYVSYYM 296
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
YHGGTNFGR A++++ T YYD APLDEYGL+ +P WGHL+ELHAA+K S PLL GT +
Sbjct: 297 YHGGTNFGRFASSYVTTSYYDAAPLDEYGLIWQPTWGHLRELHAAVKQSSEPLLFGTYSY 356
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
+SLGQ QEA +F ET C AFLVN D V+FRNIS EL KSISIL DCK V F
Sbjct: 357 LSLGQEQEAHIF-ETESQCVAFLVNFDRHHISEVVFRNISLELAPKSISILSDCKRVVFE 415
Query: 328 TERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYF 386
T +V+ Q+ R+ F W ++E I + + L + +S KD +DY
Sbjct: 416 TAKVTAQHGSRTAEEVQSFSDINTWTAFKEPIPQDVSKAMYSGNRLFEHLSTTKDDTDYL 475
Query: 387 WYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT-VHLRQGT 445
WY +N IL G HGSH + + NT + L++G
Sbjct: 476 WYIVGLFHN---------------IL---------GRIHGSHGGPANIILNTNISLKEGP 511
Query: 446 NDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQIY 500
N +LLS VG PDSGA +ER+V G+ +V +Q + N WGYQVGL GE+ IY
Sbjct: 512 NTISLLSAMVGSPDSGAHMERRVFGLQKVSIQQGQEPENLLNNELWGYQVGLFGERNSIY 571
Query: 501 SNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+ G V W++I + LTWYKTTF PAGND + LNL MGKGE WVNG+SIGRYW
Sbjct: 572 TQEGSKSVEWTTIYNLAYSPLTWYKTTFSTPAGNDAVTLNLTGMGKGEVWVNGESIGRYW 631
Query: 560 VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPL 619
VSFK GNPSQ+ YH+PR FL P N+LVL EE GNP
Sbjct: 632 VSFKAPSGNPSQS--------------------LYHIPRQFLNPQDNILVLFEEMGGNPQ 671
Query: 620 GITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKI 679
ITV+T+++ +VC +V P L + K+P V C GK+IS I
Sbjct: 672 QITVNTVSVTRVCVNVNELSAPSL--------------QYKNKEPAVDLRCQEGKQISAI 717
Query: 680 VFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKAL 739
FAS+GNP GDC++ GSCH+ S+ VV++AC+GKS CSIP+ FGGDPCPGI K+L
Sbjct: 718 EFASYGNPIGDCKKIRFGSCHAGSSESVVKQACLGKSGCSIPITPIKFGGDPCPGIKKSL 777
Query: 740 LVDAQCR 746
LV A CR
Sbjct: 778 LVVANCR 784
>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
Length = 759
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/786 (48%), Positives = 466/786 (59%), Gaps = 114/786 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGLDVIQTYVFWN+HEP +GQY+F GR D++RFIKEIQ+QGLYV LRIG
Sbjct: 48 MWPKLIAKAKEGGLDVIQTYVFWNVHEPIQGQYNFEGRYDLVRFIKEIQAQGLYVSLRIG 107
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIESEW YGG P WLHDV I FRSDN+P+K
Sbjct: 108 PFIESEWKYGGFPFWLHDVPNITFRSDNEPFKQHMQRFVTDIVNMMKHEGLYYPQGGPII 167
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +EPAF G YV WAA MAVD TGVPW MCKQ+DAP PV+
Sbjct: 168 TSQIENEYQMVEPAFGSSGQRYVSWAAAMAVDLQTGVPWTMCKQNDAPDPVV-------- 219
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
G +S P + D + Y ++G +RS QDI F VALFIA KNGSYV+YYMY
Sbjct: 220 -----GIHSYTIPVNFQND-SRNYLIYGNDTKLRSPQDITFAVALFIARKNGSYVSYYMY 273
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR A++++ T YYD APLDEYGL+ +P WGHL+ELHAA+K S PLL GT + +
Sbjct: 274 HGGTNFGRFASSYVTTSYYDGAPLDEYGLIWQPTWGHLRELHAAVKQSSEPLLFGTYSNL 333
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+GQ QEA +F ET C AFLVN D+ V+FRNIS EL KSISIL DCK V F T
Sbjct: 334 SIGQEQEAHIF-ETETQCVAFLVNFDQHHISEVVFRNISLELAPKSISILLDCKQVVFET 392
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+ Q+ R+ F W+ ++E I + + L + +S KDA+DY W
Sbjct: 393 AKVNAQHGSRTAEEVQSFSDISTWKAFKEPIPQDVSKSAYSGNRLFEHLSTTKDATDYLW 452
Query: 388 YTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDN-VSFTLRNTVHLRQGTN 446
Y I+ F+N G HGSH + + L++G N
Sbjct: 453 Y----------------------IVGLFLN--ILGRIHGSHGGPANIIFSTNISLQEGPN 488
Query: 447 DGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYS 501
+LLS VG PDSGA +ER+V G+ +V +Q + N WGYQVGL GE+ IY+
Sbjct: 489 TISLLSAMVGSPDSGAHMERRVFGIRKVSIQQGQEPENLLNNELWGYQVGLFGERNNIYT 548
Query: 502 NLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWV 560
W++I + T LTWYKTTF P GND + LNL MGKGE WVNG+SIGRYWV
Sbjct: 549 Q-DSKITEWTTIDNLTYSPLTWYKTTFSTPVGNDAVTLNLTGMGKGEVWVNGESIGRYWV 607
Query: 561 SFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLG 620
SFK GNPSQ+ YH+PR FL P N LVL EE GNP
Sbjct: 608 SFKAPSGNPSQS--------------------LYHIPREFLNPQDNTLVLFEEMGGNPQL 647
Query: 621 ITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIV 680
ITV+T+++ +VCG+V P L Q D K+P V CP GK IS I
Sbjct: 648 ITVNTMSVSRVCGNVNELSAPSL-------QYKD-------KEPAVDLWCPEGKHISAIE 693
Query: 681 FASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALL 740
FAS+G P GDC+++ G CH+ S+ VV++AC+GKS CS+P+ FGGDPCPGI K+LL
Sbjct: 694 FASYGGPTGDCKKFGFGRCHAGSSESVVKQACLGKSGCSVPVTPIKFGGDPCPGIQKSLL 753
Query: 741 VDAQCR 746
V A R
Sbjct: 754 VVANYR 759
>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
Length = 830
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/794 (44%), Positives = 477/794 (60%), Gaps = 74/794 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAKEGGL+VIQTYVFWN+HEP +GQ++F G D+++FIK I QGLYV LRIG
Sbjct: 58 MWPEIIRKAKEGGLNVIQTYVFWNIHEPVQGQFNFEGNYDLVKFIKAIGEQGLYVTLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+IE+EW GG P WL +V I FRS N+P+
Sbjct: 118 PYIEAEWNQGGFPYWLREVPNITFRSYNEPFIHHMKKYSEMVIDLVKKEKLFAPQGGPII 177
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY ++ A+ + G Y+ WAA MA + GVPW+MCKQ DAP VIN CNG C
Sbjct: 178 MAQIENEYNNVQLAYRDNGKKYIEWAANMATSLYNGVPWIMCKQKDAPPQVINTCNGRHC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TF GPN PNKPS+WTE+WT+ Y+ +G P R+A+DIAF VA F AKNG+ NYYMY+
Sbjct: 238 ADTFTGPNGPNKPSLWTENWTAQYRTFGDPPSQRAAEDIAFSVARFFAKNGTLTNYYMYY 297
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTN+GRT+++F+ T YYD+APLDE+GL REPKW HL++LH A++L R LL GT V
Sbjct: 298 GGTNYGRTSSSFVTTRYYDEAPLDEFGLYREPKWSHLRDLHRALRLSRRALLWGTPTVQK 357
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+ Q E VFE+ S CAAFL NN + T+ FR Y LP KS+SILPDCKTV +NT
Sbjct: 358 INQDLEITVFEKPGSTDCAAFLTNNHTTQPSTIKFRGKDYYLPEKSVSILPDCKTVVYNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+ + +Q+N R+ ++ K + KWE Y+E + + L+ L+ S KD SDY WY
Sbjct: 418 QTIVSQHNSRNFITSEK-SKNLKWEMYQEKVPTIADLPLKNREPLELYSLTKDTSDYAWY 476
Query: 389 TFRFHYNSSNAQAP------LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ + L + S GH L AFVNGEY G HG++ SF + + L+
Sbjct: 477 STSITLERHDLPMRPDILPVLQIASMGHALAAFVNGEYVGFGHGNNIEKSFVFQKPIILK 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKL 497
GTN +L+ TVG P+SGA++E++ AG V +Q T +WG++VG+ GEK
Sbjct: 537 PGTNTITILAETVGFPNSGAYMEKRFAGPRGVTIQGLMAGTLDITQNNWGHEVGVFGEKQ 596
Query: 498 QIYSNLGLNKVLWSSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
++++ G KV W+ + P + +TWYKT F AP GN+P+AL + M KG WVNG+S+G
Sbjct: 597 ELFTEEGAKKVQWTPVTGPPKGAVTWYKTYFDAPEGNNPVALKMDKMEKGMMWVNGKSLG 656
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW SF + G P+Q + YH+PRA+LKPT NLLV+ EE G
Sbjct: 657 RYWTSFLSPLGQPTQAE--------------------YHIPRAYLKPTNNLLVIFEETGG 696
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK--KPTVQPSCPLGK 674
+P I V T+ +C +T H P + SW +R TD + K +CP K
Sbjct: 697 HPTNIEVQTVNRDTICSIITEYHPPHVKSW----ERSGTDFVAVVEDLKSGAHLTCPDNK 752
Query: 675 KISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF---GGDP 731
I K+ FAS+GNPDG C G+C+S++S VVE+ C+GK+ C+IP+ + DP
Sbjct: 753 IIEKVEFASYGNPDGACGNLFNGNCNSANSLKVVEQHCLGKNTCTIPIEREIYDEPSKDP 812
Query: 732 CPGIHKALLVDAQC 745
CP I K L V +C
Sbjct: 813 CPNIFKTLAVQVKC 826
>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 846
Score = 689 bits (1777), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/790 (43%), Positives = 481/790 (60%), Gaps = 68/790 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGL+ I+TY+FWN+HEP+KGQ+DF GR DI+RF K IQ +Y +R+G
Sbjct: 71 MWPELIAKAKEGGLNTIETYIFWNIHEPEKGQFDFEGRYDIVRFFKLIQEHNMYAMVRLG 130
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ IVFR++N+PYK
Sbjct: 131 PFIQAEWNHGGLPYWLREIPDIVFRTNNEPYKMHMETFVKIIIKRLKDANLFASQGGPII 190
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF G Y+ WAA MA+ + G+PW+MCKQ AP VI CNG C
Sbjct: 191 LAQIENEYQHLEAAFKNDGTKYIKWAANMAISTNVGIPWIMCKQTKAPSDVIPTCNGRNC 250
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ GP + + P +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYMYH
Sbjct: 251 GDTWPGPMNKSMPLLWTENWTAQYRVFGDPPSQRSAEDIAFAVARFFSVGGTMTNYYMYH 310
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+AAF++ YYD+APLDE+GL +EPKWGHL++LH A+KLC + LL G +
Sbjct: 311 GGTNFGRTSAAFVMPKYYDEAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLWGKTSTEK 370
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+ EA VFE VC AFL N++ + VT+ FR SY +PR SISIL DCKTV F T
Sbjct: 371 LGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADCKTVVFGT 430
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+ V+ Q+N+R+ + + W+ + E + + + +R D + KD +DY W
Sbjct: 431 QHVNAQHNQRTFHFADQTTQNNVWQMFDEEKVPKYKQSKIRLRKAGDLYNLTKDKTDYVW 490
Query: 388 YTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT F + + + L+V SHGH AFVN ++ G HG+ N +FTL + L
Sbjct: 491 YTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDL 550
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEK 496
++G N A+L+ T+G+ DSGA+LE ++AGV RV+++ + TN WG+ VGL+GE+
Sbjct: 551 KKGVNHVAVLASTMGMMDSGAYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQ 610
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
QIY++ G+ V W + R LTWYK F P+G DPI L++ +MGKG +VNGQ IG
Sbjct: 611 KQIYTDKGMGSVTWKPAVN-DRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIG 669
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW+S+K + G PSQ YH+PR+FL+ N+LVL EEE G
Sbjct: 670 RYWISYKHALGRPSQ--------------------QLYHIPRSFLRQKDNVLVLFEEEFG 709
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKI 676
P I + T+ +C ++ + + SW R+ + KP +C K I
Sbjct: 710 RPDAIMILTVKRDNICTFISERNPAHIKSW--ERKDSQITVTAADLKPRATLTCSPKKLI 767
Query: 677 SKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGI 735
++VFAS+GNP G C Y +GSCH+ ++ +VE+AC+GK C++P+ + +GGD CPG
Sbjct: 768 QQVVFASYGNPMGICGNYTIGSCHTPRAKELVEKACLGKRICTLPVSADVYGGDVNCPGT 827
Query: 736 HKALLVDAQC 745
L V A+C
Sbjct: 828 TATLAVQAKC 837
>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
Length = 825
Score = 688 bits (1776), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/796 (43%), Positives = 478/796 (60%), Gaps = 80/796 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP ++ KA+ GGL++IQTYVFWN HEP+K + +F GR D+++F+K +Q +G+YV LRIG
Sbjct: 58 MWPDILDKARRGGLNLIQTYVFWNGHEPEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V I+FRS+N+P+K
Sbjct: 118 PFIQAEWNHGGLPYWLREVPDIIFRSNNEPFKKYMKEYVSIVINRMKEEKLFAPQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G YV WAAKMAV + GVPWVMCKQ DAP PVINACNG C
Sbjct: 178 LAQIENEYNHIQLAYEADGDNYVQWAAKMAVSLYNGVPWVMCKQKDAPDPVINACNGRHC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN P KP IWTE+WT+ Y+V+G P RSA+DIAF VA F +K+GS VNYYMYH
Sbjct: 238 GDTFTGPNKPYKPFIWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSKHGSLVNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT +AF T YYD+APLDE+GL REPKW HL++ H A+ LC + LL G
Sbjct: 298 GGTNFGRTTSAFTTTRYYDEAPLDEFGLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQK 357
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+ Q E V+E+ S +CAAF+ NN + A T+ FR Y LP +SISILPDCKTV FNT
Sbjct: 358 ISQYHEVIVYEKKESNLCAAFITNNHTQTAKTLSFRGSDYFLPPRSISILPDCKTVVFNT 417
Query: 329 ERVSTQYNKR----SKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASD 384
+ +++Q++ R SKT N D KWE + E I + + + + S KD +D
Sbjct: 418 QNIASQHSSRHFEKSKTGN-----DFKWEVFSEPIPSAKELPSKQKLPAELYSLLKDKTD 472
Query: 385 YFWYTFRFHY------NSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
Y WYT S+ L + S GH L AFVNGEY GS HGSH+ F +
Sbjct: 473 YGWYTTSVELGPEDIPKKSDVAPVLRILSLGHSLQAFVNGEYIGSKHGSHEEKGFEFQKP 532
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLI 493
V+ + G N A+L+ VGLPDSGA++E + AG + + T+ WG+QVGL
Sbjct: 533 VNFKVGVNQIAILANLVGLPDSGAYMEHRYAGPKTITILGLMSGTIDLTSNGWGHQVGLQ 592
Query: 494 GEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
GE I++ G KV W + ++WYKT F P G +P+A+ ++ M KG WVNG+
Sbjct: 593 GENDSIFTEKGSKKVEWKDGKGKGSTISWYKTNFDTPEGTNPVAIGMEGMAKGMIWVNGE 652
Query: 554 SIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE 613
SIGR+W+S+ + G P+Q++ YH+PR+FLKP NLLV+ EE
Sbjct: 653 SIGRHWMSYLSPLGKPTQSE--------------------YHIPRSFLKPKDNLLVIFEE 692
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK--PTVQPSCP 671
E +P I + T+ +C +T +H P + S+ Q+ +++ G+ P +CP
Sbjct: 693 EAISPDKIAILTVNRDTICSFITENHPPNIRSFASKNQK----LERVGENLTPEAFITCP 748
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF--GG 729
KKI+ + FASFG+P G C + +G C++ S+ +VE+ C+GK CS+P++ F G
Sbjct: 749 DQKKITAVEFASFGDPSGFCGSFIMGKCNAPSSKKIVEQLCLGKPTCSVPMVKATFTGGN 808
Query: 730 DPCPGIHKALLVDAQC 745
D CP + K L + +C
Sbjct: 809 DGCPDVVKTLAIQVKC 824
>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
Length = 844
Score = 688 bits (1775), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/790 (43%), Positives = 483/790 (61%), Gaps = 68/790 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGL+ I+TYVFWN+HEP+KGQ++F GR D+++F K IQ ++ +R+G
Sbjct: 68 MWPELIAKAKEGGLNTIETYVFWNIHEPEKGQFNFEGRYDMVKFFKLIQEHDMFAMVRLG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ IVFR++N+PYK
Sbjct: 128 PFIQAEWNHGGLPYWLREIPDIVFRTNNEPYKMHMETFVKIVIKRLKDANLFASQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF E+G Y+ WAA+MA+ + G+PW+MCKQ APG VI CNG C
Sbjct: 188 LAQIENEYQHLEAAFKEEGTKYIHWAAQMAIGTNIGIPWIMCKQTKAPGDVIPTCNGRNC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ GP + P +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYMYH
Sbjct: 248 GDTWPGPMNKTMPLLWTENWTAQYRVFGDPPSQRSAEDIAFAVARFFSVGGTMTNYYMYH 307
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRTAAAF++ YYD+APLDE+GL +EPKWGHL++LH A+KLC + LL G +
Sbjct: 308 GGTNFGRTAAAFVMPKYYDEAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLWGKPSTEK 367
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+ EA VFE VC AFL N++ + VT+ FR Y +PR SISIL DCKTV F T
Sbjct: 368 LGKQLEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQPYFVPRHSISILADCKTVVFGT 427
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+ V+ Q+N+R+ + + + W+ + E + + +R D + KD +DY W
Sbjct: 428 QHVNAQHNQRTFHFADQTNQNNVWQMFDEEKVPKYKQAKIRTRKAADLYNLTKDKTDYVW 487
Query: 388 YTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT F + + ++V SHGH AFVN ++ G HG+ N +FTL + L
Sbjct: 488 YTSSFKLEPDDMPIRRDIKTVVEVNSHGHASVAFVNNKFAGCGHGTKMNKAFTLEKPMEL 547
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEK 496
++G N A+L+ ++G+ DSGA+LE ++AGV RV++ + TN WG+ VGL+GE+
Sbjct: 548 KKGVNHVAVLASSMGMMDSGAYLEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGEQ 607
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
+IY+ G+ V W + + LTWYK F P+G DPI L++ +MGKG +VNGQ IG
Sbjct: 608 KEIYTEKGMASVTWKPAVN-DKPLTWYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIG 666
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW+S+K + G PSQ YH+PR+FL+P N+LVL EEE G
Sbjct: 667 RYWMSYKHALGRPSQ--------------------QLYHIPRSFLRPKDNVLVLFEEEFG 706
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKI 676
P I + T+ +C +++ + + SW R + + T+ +CP K I
Sbjct: 707 RPDAIMILTVKRDNICTYISERNPAHIKSWERKDSQITATADDLKARATL--TCPPKKLI 764
Query: 677 SKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGI 735
++VFAS+GNP G C Y +GSCH+ ++ VVE++C+GK C++P+ + +GGD CPG
Sbjct: 765 QQVVFASYGNPVGICGNYTIGSCHTPRAKEVVEKSCLGKRTCTLPVSADVYGGDVNCPGT 824
Query: 736 HKALLVDAQC 745
L V A+C
Sbjct: 825 TATLAVQAKC 834
>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
Length = 847
Score = 686 bits (1770), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/805 (43%), Positives = 476/805 (59%), Gaps = 86/805 (10%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP ++ KA+ GGL+VIQTYVFWN HEP++G+++F G ND+++FI+ +QS+G+YV LR+GP
Sbjct: 66 WPDILDKARHGGLNVIQTYVFWNAHEPEQGKFNFEGNNDLVKFIRLVQSKGMYVTLRVGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK----------------------------- 92
FI++EW +GGLP WL +V GI+FRSDN+PYK
Sbjct: 126 FIQAEWNHGGLPYWLREVPGIIFRSDNEPYKKYMKAYVSKIIQMMKDEKLFAPQGGPIIL 185
Query: 93 --IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
IENEY I+ A+ EKG YV WAA MAV GVPW+MCKQ DAP PVINACNG CG
Sbjct: 186 AQIENEYNHIQLAYEEKGDSYVQWAANMAVALDIGVPWIMCKQKDAPDPVINACNGRHCG 245
Query: 151 ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
+TF GPN P KPS+WTE+WT+ Y+V+G RSA+DIAF VA F +KNG+ VNYYMYHG
Sbjct: 246 DTFSGPNKPYKPSLWTENWTAQYRVFGDPVSQRSAEDIAFSVARFFSKNGNLVNYYMYHG 305
Query: 211 GTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
GTNFGRT +AF T YYD+APLDEYG+ R+PKW HL++ H A+ LC + +L G V L
Sbjct: 306 GTNFGRTTSAFTTTRYYDEAPLDEYGMERQPKWSHLRDAHKALLLCRKAILGGVPTVQKL 365
Query: 271 GQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
E +FE+ + C+AF+ NN +A T+ FR +Y LP SIS+LPDCKTV +NT+
Sbjct: 366 NDYHEVRIFEKPGTSTCSAFITNNHTNQAATISFRGSNYFLPAHSISVLPDCKTVVYNTQ 425
Query: 330 RVS-------------------TQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAE 370
V +Q+NKR+ + ++ KWE + EAI + +
Sbjct: 426 NVMNQLVYYKLISSHLIIKLIVSQHNKRNFVKS-AVANNLKWELFLEAIPSSKKLESNQK 484
Query: 371 GLLDQISAAKDASDYFWYTFRFHYNSSN---AQAPLDVQSHGHILHAFVNGEYTGSAHGS 427
L+ + KD +DY WYT F + A L + S GH L AFVNG+Y G+ HG+
Sbjct: 485 IPLELYTLLKDTTDYGWYTTSFELGPEDLPKKSAILRIMSLGHTLSAFVNGQYIGTDHGT 544
Query: 428 HDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFT 482
H+ SF + + GTN ++L+ TVGLPDSGA++E + AG + + T
Sbjct: 545 HEEKSFEFEQPANFKVGTNYISILATTVGLPDSGAYMEHRYAGPKSISILGLNKGKLELT 604
Query: 483 NCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQS 542
WG++VGL GE+L++++ G KV W + TR L+W KT F P G P+A+ +
Sbjct: 605 KNGWGHRVGLRGEQLKVFTEEGSKKVQWDPVTGETRALSWLKTRFATPEGRGPVAIRMTG 664
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLK 602
MGKG WVNG+SIGR+W+SF + G PSQ + YH+PR +L
Sbjct: 665 MGKGMIWVNGKSIGRHWMSFLSPLGQPSQEE--------------------YHIPRDYLN 704
Query: 603 PTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK 662
NLLV+LEEE G+P I + + +C ++T + ++SW + + + GK
Sbjct: 705 AKDNLLVVLEEEKGSPEKIEIMIVDRDTICSYITENSPANVNSW----GSKNGEFRSVGK 760
Query: 663 K--PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSI 720
P CP GKKI + FASFGNP G C +A+G+C+ ++GVVE+AC+GK C +
Sbjct: 761 NSGPQASLKCPSGKKIVAVEFASFGNPSGYCGDFALGNCNGGAAKGVVEKACLGKEECLV 820
Query: 721 PLLSRYFGGDPCPGIHKALLVDAQC 745
+ F G C G L + A+C
Sbjct: 821 EVNRANFNGQGCAGSVNTLAIQAKC 845
>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
Length = 850
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/790 (43%), Positives = 481/790 (60%), Gaps = 66/790 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGL+ I+TYVFWN+HEP+KG+++F G+ND++RF + IQ +Y +R+G
Sbjct: 73 MWPELIAKAKEGGLNTIETYVFWNIHEPEKGEFNFEGQNDVVRFFQLIQEHDMYAMVRLG 132
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ IVFR++N+PYK
Sbjct: 133 PFIQAEWNHGGLPYWLREIPDIVFRTNNEPYKMHMETFVKIIIKRLKDANLFASQGGPII 192
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF ++G Y+ WAAKMA+ + G+PW+MCKQ AP VI CNG C
Sbjct: 193 LAQIENEYQHMEAAFKDEGTKYINWAAKMAISTNIGIPWIMCKQTKAPSDVIPTCNGRNC 252
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ GP + + P +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYMYH
Sbjct: 253 GDTWPGPTNKSMPLLWTENWTAQYRVFGDPPSQRSAEDIAFAVARFFSVGGTLANYYMYH 312
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+AAF++ YYD+APLDE+GL +EPKWGHL++LH A+KLC + LL GT +
Sbjct: 313 GGTNFGRTSAAFVMPKYYDEAPLDEFGLYKEPKWGHLRDLHQALKLCKKALLWGTPSTEK 372
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+ EA VFE VC AFL N++ + T+ FR Y +PR SIS+L DC+TV F T
Sbjct: 373 LGKQLEARVFEMPEQKVCVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGT 432
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYR-EAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+ V+ Q+N+R+ + + WE + E + + +R D + KD +DY W
Sbjct: 433 QHVNAQHNQRTFHFADQTAQNNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVW 492
Query: 388 YTFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT F + S+ + L+V SHGH AFVN ++ G HG+ N +FTL + L
Sbjct: 493 YTSSFKLEADDMPIRSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDL 552
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEK 496
++G N A+L+ ++G+ DSGA++E ++AGV RV++ + TN WG+ VGL+GE+
Sbjct: 553 KKGVNHVAVLASSMGMTDSGAYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGER 612
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
QIY++ G+ V W + R LTWYK F P+G DP+ L++ +MGKG +VNGQ IG
Sbjct: 613 KQIYTDKGMGSVTWKPAMN-DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIG 671
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW+S+K + G PSQ YHVPR+FL+ N+LVL EEE G
Sbjct: 672 RYWISYKHALGRPSQ--------------------QLYHVPRSFLRQKDNMLVLFEEEFG 711
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKI 676
P I + T+ +C ++ + + SW R + + +CP K I
Sbjct: 712 RPDAIMILTVKRDNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLI 771
Query: 677 SKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP-CPGI 735
++VFAS+GNP G C Y VGSCH+ ++ VVE+AC+GK C++P+ + +GGD C G
Sbjct: 772 QQVVFASYGNPAGICGNYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDANCSGT 831
Query: 736 HKALLVDAQC 745
L V A+C
Sbjct: 832 TATLAVQAKC 841
>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
Length = 844
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/791 (43%), Positives = 472/791 (59%), Gaps = 69/791 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP ++ KA+ GGL+VIQTYVFWN HEP+ G+++F G D+++FI+ +Q++G++V LR+G
Sbjct: 76 MWPDILDKARRGGLNVIQTYVFWNAHEPEPGKFNFQGNYDLVKFIRLVQAKGMFVTLRVG 135
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V GI+FRSDN+PYK
Sbjct: 136 PFIQAEWNHGGLPYWLREVPGIIFRSDNEPYKFHMKAFVSKIIQMMKDEKLFAPQGGPII 195
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ EKG YV WAA MAV GVPW+MCKQ DAP PVINACNG C
Sbjct: 196 LAQIENEYNHIQLAYEEKGDSYVQWAANMAVATDIGVPWLMCKQRDAPDPVINACNGRHC 255
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN P KP+IWTE+WT+ Y+V G P RSA+DIAF VA F +KNG+ VNYYMYH
Sbjct: 256 GDTFAGPNKPYKPAIWTENWTAQYRVHGDPPSQRSAEDIAFSVARFFSKNGNLVNYYMYH 315
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT++ F T YYD+APLDEYGL REPKW HL+++H A+ LC R +L G +V
Sbjct: 316 GGTNFGRTSSVFSTTRYYDEAPLDEYGLPREPKWSHLRDVHKALLLCRRAILGGVPSVQK 375
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
L E FE + +CAAF+ NN + T+ FR +Y LP SISILPDCKTV FNT
Sbjct: 376 LNHFHEVRTFERVGTNMCAAFITNNHTMEPATINFRGTNYFLPPHSISILPDCKTVVFNT 435
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+++ +Q+N R+ + + + WE + EAI + + S KD +DY WY
Sbjct: 436 QQIVSQHNSRNYERSPAAN-NFHWEMFNEAIPTAKKMPINLPVPAELYSLLKDTTDYAWY 494
Query: 389 TFRFHYNSSNAQAP------LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F + + L V S GH + AFVNG+ G+AHG+H+ SF + V LR
Sbjct: 495 TTSFELSQEDMSMKPGVLPVLRVMSLGHSMVAFVNGDIVGTAHGTHEEKSFEFQTPVLLR 554
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
GTN +LLS TVGLPDSGA++E + AG + + + T WG++VGL GE
Sbjct: 555 VGTNYISLLSSTVGLPDSGAYMEHRYAGPKSINILGLNRGTLDLTRNGWGHRVGLKGEGK 614
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+++S G V W + + R L+WY+T F P G P+A+ + M KG WVNG +IGR
Sbjct: 615 KVFSEEGSTSVKWKPLGAVPRALSWYRTRFGTPEGTGPVAIRMSGMAKGMVWVNGNNIGR 674
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW+S+ + G P+Q++ YH+PR+FL P NLLV+ EEE
Sbjct: 675 YWMSYLSPLGKPTQSE--------------------YHIPRSFLNPQDNLLVIFEEEARV 714
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKIS 677
P + + + +C V ++SW+ R +K G ++ +C GK+I
Sbjct: 715 PAQVEILNVNRDTICSVVGERDPANVNSWVSRRGNFHPVVKSVGAAASM--ACATGKRIV 772
Query: 678 KIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF---GGDPCPG 734
+ FASFGNP G C +A+GSC+++ S+ +VER C+G+ C++ L F G D CP
Sbjct: 773 AVEFASFGNPSGYCGDFAMGSCNAAASKQIVERECLGQEACTLALDRAVFNNNGVDACPD 832
Query: 735 IHKALLVDAQC 745
+ K L V +C
Sbjct: 833 LVKQLAVQVRC 843
>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
Length = 766
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/791 (44%), Positives = 482/791 (60%), Gaps = 73/791 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW ++ KA+ GGL+VIQTYVFWN+HEP +GQ++F G D+++FIK I + +YV LR+G
Sbjct: 1 MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL + I+FRS N +K
Sbjct: 61 PFIQAEWNHGGLPYWLREKPNIIFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPIV 120
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G YV WAA MAV GVPW+MCKQ DAP PVIN CNG C
Sbjct: 121 LAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRHC 180
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN P KP++WTE+WT+ Y+V+G P R+A+DIAF VA F +KNGS VNYYMYH
Sbjct: 181 GDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSKNGSLVNYYMYH 240
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A F T YYD+APLDE+GL REPKWGHL+++H A+ LC +PLL GT +
Sbjct: 241 GGTNFGRTSAVFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGIQV 300
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+G+ EA +E+ + +CAAFL NND + A T+ FR + LP +SISILPDCKTV FNT
Sbjct: 301 IGKGLEARFYEKPGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCKTVVFNT 360
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
E + +Q+N R+ + K + KW+ E+I + + + L+ S KD +DY WY
Sbjct: 361 ETIVSQHNARNFIPS-KNANKLKWKMSPESIPTVEQVPVNNKIPLELYSLLKDTTDYGWY 419
Query: 389 TFRFHYNSSN-AQAP-----LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + + ++ P L + S GH + FVNGEY G+AHGSH+ +F + +V +
Sbjct: 420 TTSIELDKEDVSKRPDILPVLRIASLGHAMLVFVNGEYIGTAHGSHEEKNFVFQGSVPFK 479
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
G N+ ALL + VGLPDSGA++E + AG + + + + WG+QV L GEK+
Sbjct: 480 AGVNNIALLGILVGLPDSGAYMEHRFAGPRSITILGLNTGTLDISKNGWGHQVALQGEKV 539
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
++++ G ++V WS I+ LTWYKT F AP GNDP+A+ + MGKG+ WVNG+SIGR
Sbjct: 540 KVFTQGGSHRVDWSEIKEEKSALTWYKTYFDAPEGNDPVAIRMNGMGKGQIWVNGKSIGR 599
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW+S+ + +Q++ YH+PR+F+KP+ NLLV+LEEEN
Sbjct: 600 YWMSYLSPLKLSTQSE--------------------YHIPRSFIKPSENLLVILEEENVT 639
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQ--RGDTDIKKFGKKPTVQPSCPLGKK 675
P + + + +C +T H P + SW R + R D K G CP KK
Sbjct: 640 PEKVEILLVNRDTICSFITQYHPPNVKSWERKDKQFRAVVDDVKTG----AHLRCPHDKK 695
Query: 676 ISKIVFASFGNPDGDCERYAVGSCH-SSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPG 734
I+ I FASFG+P G C + G CH SS ++ +VE+ C+GK CS+P+ + + C
Sbjct: 696 ITNIEFASFGDPSGVCGNFEHGKCHSSSDTKKLVEQHCLGKENCSVPMDAFDNFKNECDS 755
Query: 735 IHKALLVDAQC 745
K L + A+C
Sbjct: 756 --KTLAIQAKC 764
>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
Length = 854
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/794 (45%), Positives = 475/794 (59%), Gaps = 60/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TY+FWN+HEP G Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 59 MWEDLIRKAKDGGLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR++N+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKFVPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G Y+ WAAKMAV TGVPWVMCK+DDAP PVINACNG C
Sbjct: 179 LSQIENEYGPESRELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI GS+VNYYMYH
Sbjct: 239 -DAFS-PNKPYKPRIWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+R+PK+GHLKELH AIKLC +++ VI
Sbjct: 297 GGTNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVI 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A VF G CAAFL N + + + V+F N+ Y+LP SISILPDC+TV FNT
Sbjct: 357 SLGSYQQAHVFSSGRGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNT 416
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYF 386
RV Q + R +N K S WE Y E I + ++ + A GLL+QI+ +D++DY
Sbjct: 417 ARVGVQTSHMRMFPTNSKLHS---WETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYL 473
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + +SS + Q P L VQS GH +H F+NG+Y+GSA+G+ +N FT +
Sbjct: 474 WYMTSVNIDSSESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAAN 533
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLS+ VGLP+ G E G+ H + + + W YQVGL G
Sbjct: 534 LHAGTNRIALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKG 593
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+ + +Q L WYK F AP G++P+AL+++SMGKG+ W+N
Sbjct: 594 EAMNLVSPNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWIN 653
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
GQSIGRYW+++ N H C YHVPR++LKPT NLL++
Sbjct: 654 GQSIGRYWMAYAKGDCNVCSYSGTYRPPKCQHGCG-HPTQRWYHVPRSWLKPTQNLLIIF 712
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+ I + A++ VC N H P L +W ++ + +V C
Sbjct: 713 EELGGDASKIALMKRAMKSVCADA-NEHHPTLENWHTESPSESEEL----HQASVHLQCA 767
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I+FASFG P G C + G+CH+ +SQ ++E+ CIG+ +CS+P+ + YFG DP
Sbjct: 768 PGQSISTIMFASFGTPSGTCGSFQKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFGADP 827
Query: 732 CPGIHKALLVDAQC 745
CP + K L V+A C
Sbjct: 828 CPNVLKRLSVEAAC 841
>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
Length = 854
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/794 (45%), Positives = 475/794 (59%), Gaps = 60/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TY+FWN+HEP G Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 59 MWEDLIRKAKDGGLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR++N+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKFVPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G Y+ WAAKMAV TGVPWVMCK+DDAP PVINACNG C
Sbjct: 179 LSQIENEYGPESRELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI GS+VNYYMYH
Sbjct: 239 -DAFS-PNKPYKPRIWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+R+PK+GHLKELH AIKLC +++ VI
Sbjct: 297 GGTNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVI 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A VF G CAAFL N + + + V+F N+ Y+LP SISILPDC+TV FNT
Sbjct: 357 SLGSYQQAHVFSSGRGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNT 416
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYF 386
RV Q + R +N K S WE Y E I + ++ + A GLL+QI+ +D++DY
Sbjct: 417 ARVGVQTSHMRMFPTNSKLHS---WETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYL 473
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + +SS + Q P L VQS GH +H F+NG+Y+GSA+G+ +N FT +
Sbjct: 474 WYMTSVNIDSSESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAAN 533
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLS+ VGLP+ G E G+ H + + + W YQVGL G
Sbjct: 534 LHAGTNRIALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKG 593
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+ + +Q L WYK F AP G++P+AL+++SMGKG+ W+N
Sbjct: 594 EAMNLVSPNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWIN 653
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
GQSIGRYW+++ N H C YHVPR++LKPT NLL++
Sbjct: 654 GQSIGRYWMAYAKGDCNVCSYSGTYRPPKCQHGCG-HPTQRWYHVPRSWLKPTQNLLIIF 712
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+ I + A++ VC N H P L +W ++ + +V C
Sbjct: 713 EELGGDASKIALMKRAMKSVCADA-NEHHPTLENWHTESPSESEEL----HZASVHLQCA 767
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I+FASFG P G C + G+CH+ +SQ ++E+ CIG+ +CS+P+ + YFG DP
Sbjct: 768 PGQSISTIMFASFGTPSGTCGSFQKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFGADP 827
Query: 732 CPGIHKALLVDAQC 745
CP + K L V+A C
Sbjct: 828 CPNVLKRLSVEAAC 841
>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
Length = 854
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/794 (45%), Positives = 475/794 (59%), Gaps = 60/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TY+FWN+HEP G Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 59 MWEDLIRKAKDGGLDVIDTYIFWNVHEPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR++N+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKFVPGISFRTNNEPFKMAMQGFTQKIVHMMKSENLFASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G Y+ WAAKMAV TGVPWVMCK+DDAP PVINACNG C
Sbjct: 179 LSQIENEYGPESRELGAAGHAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI GS+VNYYMYH
Sbjct: 239 -DAFS-PNKPYKPRIWTEAWSGWFTEFGGTIHRRPVQDLAFGVARFIQNGGSFVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+R+PK+GHLKELH AIKLC +++ VI
Sbjct: 297 GGTNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHAVVSADPTVI 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A VF G CAAFL N + + + V+F N+ Y+LP SISILPDC+TV FNT
Sbjct: 357 SLGSYQQAHVFSSGRGNCAAFLSNYNPKSSARVIFNNVHYDLPAWSISILPDCRTVVFNT 416
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYF 386
RV Q + R +N K S WE Y E I + ++ + A GLL+QI+ +D++DY
Sbjct: 417 ARVGVQTSHMRMFPTNSKLHS---WETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYL 473
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + +SS + Q P L VQS GH +H F+NG+Y+GSA+G+ +N FT +
Sbjct: 474 WYMTSVNIDSSESFLRRGQTPTLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAAN 533
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLS+ VGLP+ G E G+ H + + + W YQVGL G
Sbjct: 534 LHAGTNRIALLSIAVGLPNVGLHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKG 593
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+ + +Q L WYK F AP G++P+AL+++SMGKG+ W+N
Sbjct: 594 EAMNLVSPNGVSAVEWVRGSLAAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWIN 653
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
GQSIGRYW+++ N H C YHVPR++LKPT NLL++
Sbjct: 654 GQSIGRYWMAYAKGDCNVCSYSGTYRPPKCQHGCG-HPTQRWYHVPRSWLKPTQNLLIIF 712
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+ I + A++ VC N H P L +W ++ + +V C
Sbjct: 713 EELGGDASKIALMKRAMKSVCADA-NEHHPTLENWHTESPSESEEL----HEASVHLQCA 767
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I+FASFG P G C + G+CH+ +SQ ++E+ CIG+ +CS+P+ + YFG DP
Sbjct: 768 PGQSISTIMFASFGTPSGTCGSFQKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFGADP 827
Query: 732 CPGIHKALLVDAQC 745
CP + K L V+A C
Sbjct: 828 CPNVLKRLSVEAAC 841
>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 675 bits (1742), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/792 (43%), Positives = 478/792 (60%), Gaps = 74/792 (9%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP LIA+AKEGGL+VI++YVFWN+HEP+ G Y+F GR D+I+F K IQ ++ +RIGP
Sbjct: 67 WPDLIARAKEGGLNVIESYVFWNIHEPEMGVYNFEGRYDMIKFFKLIQEHEMFAMVRIGP 126
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK----------------------------- 92
F+++EW +GGLP WL +V IVFR+DN+PYK
Sbjct: 127 FVQAEWNHGGLPYWLREVPDIVFRTDNEPYKKLMQKFVTLVVNKLKDAKLFASQGGPIIL 186
Query: 93 --IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
IENEYQ +E AF E G Y+ WAAKMA+ TGVPW+MCKQ AP VI CNG CG
Sbjct: 187 AQIENEYQHMEAAFKENGTRYIDWAAKMAISTSTGVPWIMCKQTKAPAEVIPTCNGRHCG 246
Query: 151 ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
+T+ GP NKP +WTE+WT+ Y+V+G P RSA+DIAF VA F + GS VNYYMYHG
Sbjct: 247 DTWPGPTDKNKPLLWTENWTAQYRVFGDPPSQRSAEDIAFAVARFFSVGGSMVNYYMYHG 306
Query: 211 GTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
GTNFGRT A+F++ YYD+APLDE+G+ +EPKWGHL++LH A++LC + LL G + L
Sbjct: 307 GTNFGRTGASFVMPRYYDEAPLDEFGMYKEPKWGHLRDLHHALRLCKKALLRGNPSTQPL 366
Query: 271 GQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
G+L EA +FE VC AFL N++ ++ TV FR Y +PR+S+SIL DCKTV F+T+
Sbjct: 367 GKLYEARLFEIPEQKVCVAFLSNHNTKEDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQ 426
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASDYFW 387
V+ Q+N+R+ + + WE Y E + + T R+E L+ + KD +DY W
Sbjct: 427 HVNAQHNQRTFHLTDQTLQNNVWEMYTEGDKVPTYKFTTDRSEKPLEAYNMTKDKTDYLW 486
Query: 388 YTFRFHYNSS------NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT F + + + L+ SHGH + AFVNG+ G+AHG+ N +F+L + +
Sbjct: 487 YTTSFKLEAEDLPFRQDIKPVLEASSHGHAMVAFVNGKLVGAAHGTKMNKAFSLEKPIEV 546
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEK 496
R G N ++LS T+GL DSGA+LE + AGVH V +Q + ++ WG+ VGL GE+
Sbjct: 547 RAGINHVSILSSTLGLQDSGAYLEHRQAGVHSVTIQGLNTGTLDLSSNGWGHIVGLDGER 606
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
Q + + G +V W LTWY+ F P+G DP+ ++L MGKG +VNG+ +G
Sbjct: 607 KQAHMDKG-GEVQWKPAVFDL-PLTWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLG 664
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW S+K + G PSQ YHVPR FLKPTGN+L + EEE G
Sbjct: 665 RYWSSYKHALGRPSQY--------------------LYHVPRCFLKPTGNVLTIFEEEGG 704
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK--KPTVQPSCPLGK 674
P I + T+ +C ++ + + SW +R D+ + KP +CP K
Sbjct: 705 RPDAIMILTVKRDNICSFISEKNPGHVRSW----ERKDSQLTVVADDLKPRAVLTCPEKK 760
Query: 675 KISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCP 733
I ++VFAS+GNP G C Y VG+CH+ ++ VVE+AC+GK C + + +GGD CP
Sbjct: 761 TIQQVVFASYGNPLGICGNYTVGNCHTPKAKEVVEKACVGKKSCVLAVSHEVYGGDLNCP 820
Query: 734 GIHKALLVDAQC 745
G L V A+C
Sbjct: 821 GTTATLAVQAKC 832
>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
Flags: Precursor
gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
sativa Japonica Group]
gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
Length = 848
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/797 (43%), Positives = 479/797 (60%), Gaps = 77/797 (9%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP LI+KAKEGGL+VI++YVFWN HEP++G Y+F GR D+I+F K IQ + +Y +RIGP
Sbjct: 64 WPDLISKAKEGGLNVIESYVFWNGHEPEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK----------------------------- 92
F+++EW +GGLP WL ++ I+FR++N+P+K
Sbjct: 124 FVQAEWNHGGLPYWLREIPDIIFRTNNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIIL 183
Query: 93 --IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
IENEYQ +E AF E G Y+ WAAKMA+ +TGVPW+MCKQ APG VI CNG CG
Sbjct: 184 AQIENEYQHLEVAFKEAGTKYINWAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCG 243
Query: 151 ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
+T+ GP KP +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYMYHG
Sbjct: 244 DTWPGPADKKKPLLWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMYHG 303
Query: 211 GTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
GTNFGR AAF++ YYD+APLDE+GL +EPKWGHL++LH A++ C + LL G +V L
Sbjct: 304 GTNFGRNGAAFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPL 363
Query: 271 GQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
G+L EA VFE + VC AFL N++ ++ TV FR Y + R+SISIL DCKTV F+T+
Sbjct: 364 GKLYEARVFEMKEKNVCVAFLSNHNTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQ 423
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V++Q+N+R+ + D WE Y E I + T +R + L+Q + KD +DY WY
Sbjct: 424 HVNSQHNQRTFHFADQTVQDNVWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWY 483
Query: 389 TFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F + + + L+V SHGH + AFVN + G HG+ N +FT+ + L+
Sbjct: 484 TTSFRLETDDLPYRKEVKPVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLK 543
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
G N A+LS T+GL DSG++LE ++AGV+ V ++ + T WG+ VGL GE+
Sbjct: 544 VGVNHVAILSSTLGLMDSGSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERR 603
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+++S G+ V W + + LTWY+ F P+G DP+ ++L MGKG +VNG+ +GR
Sbjct: 604 RVHSEQGMGAVAWKPGKD-NQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGR 662
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YWVS+ + G PSQ YHVPR+ L+P GN L+ EEE G
Sbjct: 663 YWVSYHHALGKPSQY--------------------LYHVPRSLLRPKGNTLMFFEEEGGK 702
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK--------KPTVQPS 669
P I + T+ +C +T + P W + D+ K KPT S
Sbjct: 703 PDAIMILTVKRDNICTFMTEKN-PAHVRW--SWESKDSQPKAVAGAGAGAGGLKPTAVLS 759
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
CP K I +VFAS+GNP G C Y VGSCH+ ++ VVE+ACIG+ CS+ + S +GG
Sbjct: 760 CPTKKTIQSVVFASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGG 819
Query: 730 DP-CPGIHKALLVDAQC 745
D CPG L V A+C
Sbjct: 820 DVHCPGTTGTLAVQAKC 836
>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
Length = 843
Score = 669 bits (1726), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/795 (44%), Positives = 473/795 (59%), Gaps = 81/795 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GG++ I+TYVFWN HEP +GQY+F G D+++FIK I LY +R+G
Sbjct: 79 MWPDLIKKAKQGGINAIETYVFWNGHEPVEGQYNFEGEFDLVKFIKLIHEHKLYAVVRVG 138
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V GI+FRSDN+P+K
Sbjct: 139 PFIQAEWNHGGLPYWLREVPGIIFRSDNEPFKKHMKRFVTLIVDKLKQEKLFAPQGGPII 198
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY TI+ AF EKG YV WA K+A+ + VPW+MCKQ DAP P+IN CNG C
Sbjct: 199 LAQIENEYNTIQRAFREKGDSYVQWAGKLALSLNANVPWIMCKQRDAPDPIINTCNGRHC 258
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN NKP++WTE+WT+ Y+V+G P RSA+D+A+ VA F +KNGS VNYYM++
Sbjct: 259 GDTFYGPNKRNKPALWTENWTAQYRVFGDPPSQRSAEDLAYSVARFFSKNGSMVNYYMHY 318
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A+F T YYD+ PLDE+GL REPKWGHLK++H A+ LC R L G +
Sbjct: 319 GGTNFGRTSASFTTTRYYDEGPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLK 378
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG Q+A V+++ + CAAFL NN+ R A V FR LP +SIS+LPDCKTV FNT
Sbjct: 379 LGPDQQAIVWQQPGTSACAAFLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNT 438
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI---LNFDNTLLRAEGLLDQISAAKDASDY 385
+ V+TQ+N R+ + + + WE RE L F + R + KD +DY
Sbjct: 439 QLVTTQHNSRNFVRSEIANKNFNWEMCREVPPVGLGFKFDVPR-----ELFHLTKDTTDY 493
Query: 386 FWYTFRFHYN------SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WYT N + L V S GH +HA+VNGEY GSAHGS SF L+ V
Sbjct: 494 AWYTTSLLLGRRDLPMKKNVRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAV 553
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIG 494
L++G N ALL VGLPDSGA++E++ AG + + + + WG+QVG+ G
Sbjct: 554 SLKEGENHIALLGYLVGLPDSGAYMEKRFAGPRSITILGLNTGTLDISQNGWGHQVGIDG 613
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTR--QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
EK ++++ G V W+ P + LTWYK F AP G++P+A+ + MGKG WVNG
Sbjct: 614 EKKKLFTEEGSKSVQWT---KPDQGGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNG 670
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+SIGRYW ++ + P+Q++ YH+PRA+LKP NL+VLLE
Sbjct: 671 RSIGRYWNNYLSPLKKPTQSE--------------------YHIPRAYLKPK-NLIVLLE 709
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
EE GNP + + T+ +C V+ H P S L + G K KP + CP
Sbjct: 710 EEGGNPKDVHIVTVNRDTICSAVSEIH--PPSPRLFETKNGSLQAKVNDLKPRAELKCPG 767
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG--GD 730
K+I + FAS+G+P G C Y +G+C + S+ VVE+ C+GK C IPL S F D
Sbjct: 768 KKQIVAVEFASYGDPFGACGAYFIGNCTAPESKQVVEKYCLGKPSCQIPLDSIPFSNQND 827
Query: 731 PCPGIHKALLVDAQC 745
C + K L V +C
Sbjct: 828 ACTHLRKTLAVQLKC 842
>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
Length = 848
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/797 (43%), Positives = 478/797 (59%), Gaps = 77/797 (9%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP LI+KAKEGGL+VI++YVFWN HEP++G Y+F GR D+I+F K IQ + +Y +RIGP
Sbjct: 64 WPDLISKAKEGGLNVIESYVFWNGHEPEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK----------------------------- 92
F+++EW +GGLP WL ++ I+FR++N+P+K
Sbjct: 124 FVQAEWNHGGLPYWLREIPDIIFRTNNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIIL 183
Query: 93 --IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
IENEYQ +E AF E G Y+ WAAKMA+ +TGVPW+MCKQ APG VI CNG CG
Sbjct: 184 AQIENEYQHLEVAFKEAGTKYINWAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCG 243
Query: 151 ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
+T+ GP KP +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYMYHG
Sbjct: 244 DTWPGPADKKKPLLWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMYHG 303
Query: 211 GTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
GTNFGR AAF++ YYD+AP DE+GL +EPKWGHL++LH A++ C + LL G +V L
Sbjct: 304 GTNFGRNGAAFVMPRYYDEAPFDEFGLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPL 363
Query: 271 GQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
G+L EA VFE + VC AFL N++ ++ TV FR Y + R+SISIL DCKTV F+T+
Sbjct: 364 GKLYEARVFEMKEKNVCVAFLSNHNTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQ 423
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V++Q+N+R+ + D WE Y E I + T +R + L+Q + KD +DY WY
Sbjct: 424 HVNSQHNQRTFHFADQTVQDNVWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWY 483
Query: 389 TFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F + + + L+V SHGH + AFVN + G HG+ N +FT+ + L+
Sbjct: 484 TTSFRLETDDLPYRKEVKPVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLK 543
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
G N A+LS T+GL DSG++LE ++AGV+ V ++ + T WG+ VGL GE+
Sbjct: 544 VGVNHVAILSSTLGLMDSGSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERR 603
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+++S G+ V W + + LTWY+ F P+G DP+ ++L MGKG +VNG+ +GR
Sbjct: 604 RVHSEQGMGAVAWKPGKD-NQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGR 662
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YWVS+ + G PSQ YHVPR+ L+P GN L+ EEE G
Sbjct: 663 YWVSYHHALGKPSQY--------------------LYHVPRSLLRPKGNTLMFFEEEGGK 702
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK--------KPTVQPS 669
P I + T+ +C +T + P W + D+ K KPT S
Sbjct: 703 PDAIMILTVKRDNICTFMTEKN-PAHVRW--SWESKDSQPKAVAGAGAGAGGFKPTAVLS 759
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
CP K I +VFAS+GNP G C Y VGSCH+ ++ VVE+ACIG+ CS+ + S +GG
Sbjct: 760 CPTKKTIQSVVFASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGG 819
Query: 730 DP-CPGIHKALLVDAQC 745
D CPG L V A+C
Sbjct: 820 DVHCPGTTGTLAVQAKC 836
>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/800 (45%), Positives = 469/800 (58%), Gaps = 73/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F GR D+++FIK ++ GLYV LRIG
Sbjct: 69 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGEYYFEGRYDLVKFIKLVKEAGLYVHLRIG 128
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL + GI FR+DN+P+K
Sbjct: 129 PYACAEWNFGGFPVWLKYIPGISFRTDNEPFKTAMAGFTKKIVDMMKEEELFETQGGPII 188
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MAV TGVPWVMCKQDDAP P+IN CN C
Sbjct: 189 LSQIENEYGPVEWEIGAPGQAYTKWAANMAVGLGTGVPWVMCKQDDAPDPIINTCNDHYC 248
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP++WTE WTS++ +GG R A+D+AF +A FI + GS++NYYMYH
Sbjct: 249 --DWFSPNKNYKPTMWTEAWTSWFTAFGGPVPYRPAEDMAFAIAKFIQRGGSFINYYMYH 306
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+R+PKWGHLK+LH AIK+C L++G V
Sbjct: 307 GGTNFGRTAGGPFVATSYDYDAPIDEYGLIRQPKWGHLKDLHKAIKMCEAALVSGDPIVT 366
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QE+ VF+ SG CAAFL N DE+ V F+ + Y LP SISILPDC FNT
Sbjct: 367 SLGSSQESHVFKSESGDCAAFLANYDEKSFAKVAFQGMHYNLPPWSISILPDCVNTVFNT 426
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
RV Q + + TS + D WE Y E ++D+ + EGLL+QI+ +D +DY W
Sbjct: 427 ARVGAQTSSMTMTS---VNPDGFSWETYNEETASYDDASITMEGLLEQINVTRDVTDYLW 483
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT + + N + P L V S GH LH F+NGE +G+ +GS DN T +V L
Sbjct: 484 YTTDITIDPNEGFLKNGEYPVLTVMSAGHALHIFINGELSGTVYGSVDNPKLTYTGSVKL 543
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV--------QDKSFTNCSWGYQVGLI 493
G N ++LS+ VGLP+ GA E GV V +D S+ N W Y++GL
Sbjct: 544 LAGNNKISVLSIAVGLPNIGAHFETWNTGVLGPVVLNGLNEGRRDLSWQN--WSYKIGLK 601
Query: 494 GEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
GE LQ++S G + V WSS+ + + LTWYKTTF AP GN P AL++ MGKG+ W+NGQ
Sbjct: 602 GEALQLHSLTGSSSVEWSSLIAQKQPLTWYKTTFNAPEGNGPFALDMSMMGKGQIWINGQ 661
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGRYW ++K + GN + Y N + C + YHVP ++L PT NLLV+
Sbjct: 662 SIGRYWPAYK-AYGNCGECSYTGRYNEKKCLANCG-EASQRWYHVPSSWLYPTANLLVVF 719
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG-----KKPTV 666
EE G+P GI++ C ++ H P L W IK +G ++P
Sbjct: 720 EEWGGDPTGISLVRRTTGSACAFISEWH-PTLRKW---------HIKDYGRAERPRRPKA 769
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
SC G+KIS I FASFG P G C + GSCH+ S + E+ C+G+ CS+ +
Sbjct: 770 HLSCADGQKISSIKFASFGTPQGVCGNFTEGSCHAHKSYDIFEKNCVGQQWCSVTISPDV 829
Query: 727 FGGDPCPGIHKALLVDAQCR 746
FGGDPCP + K L V+A C+
Sbjct: 830 FGGDPCPNVMKNLAVEAICQ 849
>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
Length = 823
Score = 667 bits (1720), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/795 (42%), Positives = 480/795 (60%), Gaps = 75/795 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIA+AKEGGL+VI++YVFWN HEP+ G Y+F GR D+I+F K +Q ++ +RIG
Sbjct: 45 MWPDLIARAKEGGLNVIESYVFWNGHEPEMGVYNFEGRYDMIKFFKLVQEHEMFAMVRIG 104
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+++EW +GGLP WL +V I+FR++N+P+K
Sbjct: 105 PFVQAEWNHGGLPYWLREVPDIIFRTNNEPFKKHMQKFVTMIVNKLKDAKLFASQGGPII 164
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF E G Y+ WAAKMA D + GVPW+MCKQ APG VI CNG C
Sbjct: 165 LAQIENEYQHLEAAFKENGTTYIHWAAKMASDLNIGVPWIMCKQTKAPGEVIPTCNGRHC 224
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ GP NKP +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ VNYYMYH
Sbjct: 225 GDTWPGPTDKNKPLLWTENWTAQYRVFGDPPSQRSAEDIAFAVARFYSVGGTMVNYYMYH 284
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A+F++ YYD+APLDE+GL +EPKWGHL++LH A++LC + +L G +
Sbjct: 285 GGTNFGRTGASFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALRLCKKAILWGNPSNQP 344
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+L EA +FE +C AFL N++ ++ TV FR Y +PR+S+SIL DCKTV F+T
Sbjct: 345 LGKLYEARLFEIPEQKICVAFLSNHNTKEDGTVTFRGQQYFVPRRSVSILADCKTVVFST 404
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASDYF 386
+ V++Q+N+R+ + + WE Y E+ + + T +R + L+ + KD +DY
Sbjct: 405 QHVNSQHNQRTFHFSDQTVQGNVWEMYTESDKVPTYKFTNIRTQKPLEAYNLTKDKTDYV 464
Query: 387 WYTFRFHYNSSNAQ------APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WYT F + + L+V SHGH + AFVNG+Y G+ HG+ N +FT+ +
Sbjct: 465 WYTTSFKLEAEDLPFRKDIWPVLEVSSHGHAMVAFVNGKYVGAGHGTKINKAFTMEKPIE 524
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGE 495
+R G N ++LS T+G+ DSG +LE + AG+ V +Q + T+ WG+ VGL GE
Sbjct: 525 VRTGINHVSILSTTLGMQDSGVYLEHRQAGIDGVTIQGLNTGTLDLTSNGWGHLVGLEGE 584
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
+ ++ G + V W R LTWY+ F P G+DP+ +++ MGKG +VNG+ +
Sbjct: 585 RRNAHTEKGGDGVQWVPAVF-DRPLTWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGL 643
Query: 556 GRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE- 614
GRYW S+K + G PSQ YHVPR FLKPTGN++ + EEE
Sbjct: 644 GRYWSSYKHALGRPSQY--------------------LYHVPRCFLKPTGNVMTIFEEEG 683
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK---KPTVQPSCP 671
G P GI + T+ +C ++ + + SW +R D+ +K KP SCP
Sbjct: 684 GGQPDGIMILTVKRDNICSFISEKNPAHVKSW----ERKDSHLKSVADADLKPQAVLSCP 739
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD- 730
K I ++VFAS+GNP G C Y VG+CH+ ++ +VE+AC+GK C + + +G D
Sbjct: 740 EKKLIQQVVFASYGNPLGICGNYTVGNCHAPKAKEIVEKACVGKKSCVLQVSHEVYGADL 799
Query: 731 PCPGIHKALLVDAQC 745
CPG L V A+C
Sbjct: 800 NCPGSTGTLAVQAKC 814
>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 854
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/801 (45%), Positives = 478/801 (59%), Gaps = 74/801 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDV++TYVFWN+HEP G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 58 MWEDLIQKAKDGGLDVVETYVFWNVHEPTPGNYNFEGRYDLVRFLKTIQKAGLYAHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTQKIVGLMKSESLFESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAA+MAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 178 LSQIENEYGAQSKLFGAAGHNYITWAAEMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP+IWTE W+ ++ +GG + R QD+A+ VA FI K GS+VNYYMYH
Sbjct: 238 -DSFS-PNRPYKPTIWTETWSGWFTEFGGPIHQRPVQDLAYAVATFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL+R+PK+GHLKELH AIK+C R L++ +
Sbjct: 296 GGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCERALVSADPIIT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A+V+ SG C+AFL N+D + A V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 356 SLGNFQQAYVYTSESGDCSAFLSNHDSKSAARVMFNNMHYNLPPWSISILPDCRNVVFNT 415
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q ++ +N+ S WE Y E + + D+ + + A GLL+QI+ +D++DY
Sbjct: 416 AKVGVQTSQMQMLPTNIPMLS---WESYDEDLTSMDDSSTMTAPGLLEQINVTRDSTDYL 472
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY +SS + + P L VQS GH +H F+NG+ TGSA G+ ++ FT V+
Sbjct: 473 WYITSVDIDSSESFLHGGELPTLIVQSTGHAVHIFINGQLTGSAFGTRESRRFTYTGKVN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR GTN ALLSV VGLP+ G E G+ H + + W YQVGL G
Sbjct: 533 LRAGTNKIALLSVAVGLPNVGGHFEAWNTGILGPVALHGLNQGKWDLSWQKWTYQVGLKG 592
Query: 495 EKLQIYSNLGLNKVLWSS----IRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
E + + S + V W S + + LTW+KT F P G++P+AL+++ MGKG+ W+
Sbjct: 593 EAMNLVSQNAFSSVEWISGSLIAQKKQQPLTWHKTIFNEPEGSEPLALDMEGMGKGQIWI 652
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
NGQSIGRYW +F + GN + YA + K T YHVPR++LKPT NLLV
Sbjct: 653 NGQSIGRYWTAF--ANGNCNGCSYAGGFRPTKCQSGCGKPTQRYYHVPRSWLKPTQNLLV 710
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KP 664
L EE G+P I++ A+ VC V H P + +W I+ +GK P
Sbjct: 711 LFEELGGDPSRISLVKRAVSSVCSEVAEYH-PTIKNW---------HIESYGKVEDFHSP 760
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
V C G+ IS I FASFG P G C Y G+CH++ S VV++ CIGK RC++ + +
Sbjct: 761 KVHLRCNPGQAISSIKFASFGTPLGTCGSYQEGTCHATTSYSVVQKKCIGKQRCAVTISN 820
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
F GDPCP + K L V+A C
Sbjct: 821 SNF-GDPCPKVLKRLSVEAVC 840
>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/803 (44%), Positives = 472/803 (58%), Gaps = 78/803 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ KAK+GGLDVIQTYVFWN+HEP G Y+F GR D++RF+K +Q GLY+ LRIG
Sbjct: 60 MWEGLMQKAKDGGLDVIQTYVFWNVHEPSPGNYNFEGRYDLVRFVKTVQKAGLYMHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKMAMQGFTEKIVQMMKSESLFESQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + A G Y+ WAAKMAV TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 180 LSQIENEYGSESKALGAPGHAYMTWAAKMAVGLRTGVPWVMCKEDDAPDPVINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG + R +D+AF VA FI K GS++NYYMYH
Sbjct: 240 -DAFT-PNKPYKPTMWTEAWSGWFTEFGGTVHERPVEDLAFAVARFIQKGGSFINYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH AIKLC L++ V
Sbjct: 298 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKLCEPALISADPIVT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q++ VF +G CAAFL N + V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 358 SLGPYQQSHVFSSGTGGCAAFLSNYNPNSVARVMFNNMHYSLPPWSISILPDCRNVVFNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDE----KWEEYREAILNF-DNTLLRAEGLLDQISAAKDAS 383
+V Q TS + + E WE Y E I + DN+++ A GLL+Q++ +D S
Sbjct: 418 AKVGVQ------TSQMHMSAGETKLLSWEMYDEDIASLGDNSMITAVGLLEQLNVTRDTS 471
Query: 384 DYFWYTFRFHYNSSNAQ------APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WY + S + L VQS GH LH ++NG+ +GSAHGS +N FT
Sbjct: 472 DYLWYMTSVDISPSESSLRGGRPPVLTVQSAGHALHVYINGQLSGSAHGSRENRRFTFTG 531
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
V++R G N ALLS+ V LP+ G E GV H + + T W YQVG
Sbjct: 532 DVNMRAGINRIALLSIAVELPNVGLHYESTNTGVLGPVVLHGLDQGKRDLTWQKWSYQVG 591
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPTRQ---LTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE + + + G++ V W T++ LTWYK F AP G++P+AL+L SMGKG+
Sbjct: 592 LKGEAMNLVAPSGISYVEWMQASFATQKLQPLTWYKAYFNAPGGDEPLALDLGSMGKGQV 651
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNL 607
W+NG+SIGRYW + + G+ + YA + T YHVPR++L+PT NL
Sbjct: 652 WINGESIGRYWTA--AANGDCNHCSYAGTYRAPKCQTGCGQPTQRWYHVPRSWLQPTKNL 709
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK----- 662
LV+ EE G+ GI++ ++ VC V+ H P + +W I+ +G+
Sbjct: 710 LVIFEEIGGDASGISLVKRSVSSVCADVSEWH-PTIKNW---------HIESYGRSEELH 759
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
+P V C +G+ IS I FASFG P G C + G CHS +S ++E+ CIG+ RC++ +
Sbjct: 760 RPKVHLRCAMGQSISAIKFASFGTPLGTCGSFQQGPCHSPNSHAILEKKCIGQQRCAVTI 819
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
FGGDPCP + K + V+A C
Sbjct: 820 SMNNFGGDPCPNVMKRVAVEAIC 842
>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
Length = 842
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/794 (45%), Positives = 460/794 (57%), Gaps = 62/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGG+DVIQTYVFWN HEP++G+Y F R D+++FIK + GLYV LR+G
Sbjct: 61 MWPDLIQKAKEGGVDVIQTYVFWNGHEPEQGKYYFEERYDLVKFIKLVHQAGLYVNLRVG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 121 PYACAEWNFGGFPVWLKYVPGISFRTDNEPFKAAMQKFTTKIVNMMKAERLYESQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E F E+G Y WAAKMA+D TGVPW+MCKQDDAP PVIN CNG C
Sbjct: 181 LSQIENEYGPLEVRFGEQGKSYAEWAAKMALDLGTGVPWLMCKQDDAPDPVINTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP IWTE WT+++ +G R +D+AF VA FI GS++NYYMYH
Sbjct: 241 DYFY--PNKAYKPKIWTEAWTAWFTEFGSPVPYRPVEDLAFGVANFIQTGGSFINYYMYH 298
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDE+GL+R+PKWGHLK+LH AIKLC L++G V
Sbjct: 299 GGTNFGRTAGGPFVATSYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG Q+A VF TSG CAAFL NND TV F N Y LP SISILPDCK +NT
Sbjct: 359 ALGNYQKAHVFRSTSGACAAFLANNDPNSFATVAFGNKHYNLPPWSISILPDCKHTVYNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q T + W+ Y + +D+ GLL+Q++ +D SDY WY
Sbjct: 419 ARVGAQSALMKMTPA---NEGYSWQSYNDQTAFYDDNAFTVVGLLEQLNTTRDVSDYLWY 475
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ S + P L V S G LH FVNG+ G+ +GS T V+LR
Sbjct: 476 MTDVKIDPSEGFLRSGNWPWLTVSSAGDALHVFVNGQLAGTVYGSLKKQKITFSKAVNLR 535
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS+ VGLP+ G E GV + + T W Y+VGL GE
Sbjct: 536 AGVNKISLLSIAVGLPNIGPHFETWNTGVLGPVSLSGLDEGKRDLTWQKWSYKVGLKGEA 595
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ + + LTWYKTTF APAGN+P+AL++ SMGKG+ W+NGQS
Sbjct: 596 LNLHSLSGSSSVEWVEGSLVAQRQPLTWYKTTFNAPAGNEPLALDMNSMGKGQVWINGQS 655
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
IGRYW +K S G YA N + C + YHVPR++L PTGNLLV+ E
Sbjct: 656 IGRYWPGYKAS-GTCDACNYAGPFNEKKCLSNCG-DASQRWYHVPRSWLHPTGNLLVVFE 713
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSW-LRHRQRGDTDIKKFGKKPTVQPSCP 671
E G+P GI++ + VC + N P L +W L+ + D + +P SC
Sbjct: 714 EWGGDPNGISLVKRELASVCADI-NEWQPQLVNWQLQASGKVDKPL-----RPKAHLSCT 767
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+KI+ I FASFG P G C ++ GSCH+ HS E+ CIG+ C++P+ FGGDP
Sbjct: 768 SGQKITSIKFASFGTPQGVCGSFSEGSCHAHHSYDAFEKYCIGQESCTVPVTPEIFGGDP 827
Query: 732 CPGIHKALLVDAQC 745
CP + K L V+A C
Sbjct: 828 CPSVMKKLSVEAVC 841
>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
Length = 838
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/793 (46%), Positives = 463/793 (58%), Gaps = 60/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAKEGG+DVIQTYVFWN HEPQ+G+Y F GR D+++FIK + GLYV LR+G
Sbjct: 57 MWPGIIQKAKEGGVDVIQTYVFWNGHEPQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 117 PYACAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQKFTAKIVNMMKAERLYETQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP P+INACNG C
Sbjct: 177 LSQIENEYGPMEWELGAPGKSYAQWAAKMAVGLDTGVPWVMCKQDDAPDPIINACNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP IWTE WT+++ +G R A+D+AF VA FI K GS++NYYMYH
Sbjct: 237 --DYFSPNKAYKPKIWTEAWTAWFTGFGNPVPYRPAEDLAFSVAKFIQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G V
Sbjct: 295 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPAVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG QEA VF +G CAAFL N D+ TV F N Y LP SISILPDCK FNT
Sbjct: 355 ALGHQQEAHVFRSKAGSCAAFLANYDQHSFATVSFANRHYNLPPWSISILPDCKNTVFNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
R+ Q + T W+ + E +++++ GLL+QI+ +D SDY WY
Sbjct: 415 ARIGAQSAQMKMT---PVSRGLPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWY 471
Query: 389 TFRFHYNSSN-----AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ +S + P L + S GH LH FVNG+ G+A+GS + T V+LR
Sbjct: 472 STDVKIDSREKFLRGGKWPWLTIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLR 531
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS+ VGLP+ G E AGV + + T W Y+VGL GE
Sbjct: 532 AGVNKISLLSIAVGLPNIGPHFETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEA 591
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ + + LTWYK+TF APAGNDP+AL+L +MGKG+ W+NGQS
Sbjct: 592 LSLHSLSGSSSVEWVEGSLVAQRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQS 651
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW +K S GN YA N + C + YHVPR++L PTGNLLVL E
Sbjct: 652 LGRYWPGYKAS-GNCGACNYAGWFNEKKCLSNCG-EASQRWYHVPRSWLYPTGNLLVLFE 709
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G P GI++ + VC + N P L +W + + G D +P SC
Sbjct: 710 EWGGEPHGISLVKREVASVCADI-NEWQPQLVNW-QMQASGKVDKP---LRPKAHLSCAS 764
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KI+ I FASFG P G C + GSCH+ HS ER CIG++ CS+P+ FGGDPC
Sbjct: 765 GQKITSIKFASFGTPQGVCGSFREGSCHAFHSYDAFERYCIGQNSCSVPVTPEIFGGDPC 824
Query: 733 PGIHKALLVDAQC 745
P + K L V+ C
Sbjct: 825 PHVMKKLSVEVIC 837
>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
Length = 838
Score = 664 bits (1712), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/793 (46%), Positives = 463/793 (58%), Gaps = 60/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAKEGG+DVIQTYVFWN HEPQ+G+Y F GR D+++FIK + GLYV LR+G
Sbjct: 57 MWPGIIQKAKEGGVDVIQTYVFWNGHEPQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 117 PYACAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQKFTAKIVNMMKAERLYETQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP P+INACNG C
Sbjct: 177 LSQIENEYGPMEWELGAPGKSYAQWAAKMAVGLDTGVPWVMCKQDDAPDPIINACNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP IWTE WT+++ +G R A+D+AF VA FI K GS++NYYMYH
Sbjct: 237 --DYFSPNKAYKPKIWTEAWTAWFTGFGNPVPYRPAEDLAFSVAKFIQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G V
Sbjct: 295 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPAVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG QEA VF +G CAAFL N D+ TV F N Y LP SISILPDCK FNT
Sbjct: 355 ALGHQQEAHVFRSKAGSCAAFLANYDQHSFATVSFANRHYNLPPWSISILPDCKNTVFNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
R+ Q + T W+ + E +++++ GLL+QI+ +D SDY WY
Sbjct: 415 ARIGAQSAQMKMTP---VSRGLPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWY 471
Query: 389 TFRFHYNSSN-----AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ +S + P L + S GH LH FVNG+ G+A+GS + T V+LR
Sbjct: 472 STDVKIDSREKFLRGGKWPWLTIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLR 531
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS+ VGLP+ G E AGV + + T W Y+VGL GE
Sbjct: 532 AGVNKISLLSIAVGLPNIGPHFETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEA 591
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ + + LTWYK+TF APAGNDP+AL+L +MGKG+ W+NGQS
Sbjct: 592 LSLHSLSGSSSVEWVEGSLVAQRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQS 651
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW +K S GN YA N + C + YHVPR++L PTGNLLVL E
Sbjct: 652 LGRYWPGYKAS-GNCGACNYAGWFNEKKCLSNCG-EASQRWYHVPRSWLYPTGNLLVLFE 709
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G P GI++ + VC + N P L +W + + G D +P SC
Sbjct: 710 EWGGEPHGISLVKREVASVCADI-NEWQPQLVNW-QMQASGKVDKP---LRPKAHLSCAP 764
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KI+ I FASFG P G C + GSCH+ HS ER CIG++ CS+P+ FGGDPC
Sbjct: 765 GQKITSIKFASFGTPQGVCGSFREGSCHAFHSYDAFERYCIGQNSCSVPVTPEIFGGDPC 824
Query: 733 PGIHKALLVDAQC 745
P + K L V+ C
Sbjct: 825 PHVMKKLSVEVIC 837
>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 849
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/802 (46%), Positives = 475/802 (59%), Gaps = 74/802 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDVI+TYVFWN+HEP +G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 62 MWEDLIYKAKEGGLDVIETYVFWNVHEPSRGNYNFEGRYDLVRFVKTIQKAGLYANLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 122 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G YV WAAKMAV+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 182 LSQIENEYGAQSKLLGSAGQNYVNWAAKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KPSIWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 242 --DYFTPNKPYKPSIWTEAWSGWFSEFGGPNHERPVQDLAFGVARFIQKGGSFVNYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL+R+PK+GHLKELH AIK+C R L++ V
Sbjct: 300 GGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCERALVSTDPAVT 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A V+ SG CAAFL N D + +V V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 360 SLGNFQQAHVYSAKSGDCAAFLSNFDTKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNT 419
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDN---TLLRAEGLLDQISAAKDASD 384
+V Q ++ +N + S WE + E I + D+ GLL+QI+ +D SD
Sbjct: 420 AKVGVQTSQMQMLPTNTRMFS---WESFDEDISSLDDGSSITTTTSGLLEQINVTRDTSD 476
Query: 385 YFWYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
Y WY SS + + P L VQS GH +H F+NG+ +GSA+G+ ++ FT T
Sbjct: 477 YLWYITSVDIGSSESFLRGGKLPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFTYTGT 536
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLE---RKVAGVHRVRVQDKSFTNCS---WGYQVGL 492
V+LR GTN ALLSV VGLP+ G E + G +R D+ + S W YQVGL
Sbjct: 537 VNLRAGTNRIALLSVAVGLPNVGGHFETWNTGILGPVVLRGFDQGKLDLSWQKWTYQVGL 596
Query: 493 IGEKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE + + S G++ V W S++ S Q LTW+KT F AP G++P+AL+++ MGKG+ W
Sbjct: 597 KGEAMNLASPNGISSVEWMQSALVSDKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIW 656
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLL 608
+NG SIGRYW + + GN + YA + T YHVPR++LKP NLL
Sbjct: 657 INGLSIGRYWTAL--AAGNCNGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPDHNLL 714
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK----- 663
V+ EE G+P I++ ++ VC V+ H P + +W I +GK
Sbjct: 715 VVFEELGGDPSKISLVKRSVSSVCADVSEYH-PNIRNW---------HIDSYGKSEEFHP 764
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLL 723
P V C G+ IS I FASFG P G C Y G CHSS S +E+ CIGK RC++ +
Sbjct: 765 PKVHLHCSPGQTISSIKFASFGTPLGTCGNYEKGVCHSSTSHATLEKKCIGKPRCTVTVS 824
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
+ FG DPCP + K L V+A C
Sbjct: 825 NSNFGQDPCPNVLKRLSVEAVC 846
>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 845
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/799 (44%), Positives = 470/799 (58%), Gaps = 70/799 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDV++TYVFWN+HEP G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 58 MWEDLINKAKEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G Y WAA MAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 178 LSQIENEYGPQAKVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F PN P KP+IWTE W+ ++ +GG + R QD+AF VA FI + GS+VNYYMYH
Sbjct: 238 DNFF--PNKPYKPAIWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH A+K+C + +++ +
Sbjct: 296 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAIT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG LQ+A+V+ +G CAAFL NND + A V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 356 SLGNLQQAYVYSSETGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q SK L +S+ WE Y E I D+ + +R+ GLL+QI+ +D SDY
Sbjct: 416 AKVGVQ---TSKMEMLPTNSEMLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYL 472
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S+ + + P L V++ GH +H F+NG+ +GSA G+ N F + V+
Sbjct: 473 WYITSVDIGSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV-HRVRVQ--DKSFTNCSWG---YQVGLIG 494
LR G+N ALLSV VGLP+ G E GV V +Q D + SW YQVGL G
Sbjct: 533 LRAGSNRIALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKG 592
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S I + LTW+K F P G++P+AL++ SMGKG+ W+N
Sbjct: 593 EAMNLVSTNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
GQSIGRYW ++ T N Q C YHVPR++LKPT NLLVL
Sbjct: 653 GQSIGRYWTAYATGDCNGCQYSGVFRPPKCQLGCG-EPTQKWYHVPRSWLKPTQNLLVLF 711
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK-----PTV 666
EE G+P I++ ++ VC +V H P + +W I+ +GK P V
Sbjct: 712 EELGGDPTRISLVKRSVTNVCSNVAEYH-PNIKNW---------QIENYGKTEEFHLPKV 761
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
+ C G+ IS I FASFG P G C + G+CH+ S VVE+ C+G+ C++ + +
Sbjct: 762 RIHCAPGQSISSIKFASFGTPLGTCGSFKQGTCHAPDSHAVVEKKCLGRQTCAVTISNSN 821
Query: 727 FGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 822 FGEDPCPNVLKRLSVEAHC 840
>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 661 bits (1706), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 364/800 (45%), Positives = 477/800 (59%), Gaps = 72/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDV++TYVFWN+HEP G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 57 MWEDLILKAKEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSERLFESQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + G YV WAAKMAV+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 177 LSQIENEYGAQSKLQGDAGQNYVNWAAKMAVEMGTGVPWVMCKEDDAPDPVINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI + GS+VNYYMYH
Sbjct: 237 -DKFT-PNRPYKPMIWTEAWSGWFTEFGGPIHKRPVQDLAFAVARFIIRGGSFVNYYMYH 294
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PK+GHLKELH AIK+C R L++ +
Sbjct: 295 GGTNFGRTAGGPFIATSYDYDAPLDEYGLIRQPKYGHLKELHRAIKMCERALVSTDPIIT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG+ Q+A V+ SG CAAFL N D + + V+F N+ Y LP S+SILPDC+ V FNT
Sbjct: 355 SLGESQQAHVYTTESGDCAAFLSNYDSKSSARVMFNNMHYNLPPWSVSILPDCRNVVFNT 414
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q ++ +N + S WE + E + + D+ + + A GLL+QI+ KDASDY
Sbjct: 415 AKVGVQTSQMQMLPTNTQLFS---WESFDEDVYSVDDSSAIMAPGLLEQINVTKDASDYL 471
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY SS + + P L VQS GH +H F+NG+ +GSA+G+ + F V+
Sbjct: 472 WYITSVDIGSSESFLRGGELPTLIVQSRGHAVHVFINGQLSGSAYGTREYRRFMYTGKVN 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR G N ALLSV +GLP+ G E G+ H + + W YQVGL G
Sbjct: 532 LRAGINRIALLSVAIGLPNVGEHFESWSTGILGPVALHGLDQGKWDLSGQKWTYQVGLKG 591
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+I Q LTW+KT F AP G++P+AL+++ MGKG+ W+N
Sbjct: 592 EAMDLASPNGISSVAWMQSAIVVQRNQPLTWHKTHFDAPEGDEPLALDMEGMGKGQIWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW +F T GN + YA + + T YHVPR++LKPT NLLV+
Sbjct: 652 GQSIGRYWTTFAT--GNCNDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVI 709
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK-----PT 665
EE GNP I++ ++ VC V+ H P + +W I+ +GK P
Sbjct: 710 FEELGGNPSKISLVKRSVSSVCADVSEYH-PNIKNW---------HIESYGKSEEFHPPK 759
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
V C G+ IS I FASFG P G C Y G+CHS S ++E+ CIGK RC++ + +
Sbjct: 760 VHLHCSPGQTISSIKFASFGTPLGTCGNYEQGACHSPASYAILEKRCIGKPRCTVTVSNS 819
Query: 726 YFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 820 NFGQDPCPKVLKRLSVEAVC 839
>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
Length = 845
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/799 (44%), Positives = 469/799 (58%), Gaps = 70/799 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDV++TYVFWN+HEP G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 58 MWEDLINKAKEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRADNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G Y WAA MAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 178 LSQIENEYGPQAKVLGAPGHQYSTWAANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F PN P KP+ WTE W+ ++ +GG + R QD+AF VA FI + GS+VNYYMYH
Sbjct: 238 DNFF--PNKPYKPATWTEAWSGWFSEFGGPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH A+K+C + +++ +
Sbjct: 296 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKSIVSADPAIT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG LQ+A+V+ +G CAAFL NND + A V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 356 SLGNLQQAYVYSSETGGCAAFLSNNDWKSAARVMFNNMHYNLPPWSISILPDCRNVVFNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q SK L +S+ WE Y E I D+ + +R+ GLL+QI+ +D SDY
Sbjct: 416 AKVGVQ---TSKMEMLPTNSEMLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYL 472
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S+ + + P L V++ GH +H F+NG+ +GSA G+ N F + V+
Sbjct: 473 WYITSVDIGSTESFLHGGELPTLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV-HRVRVQ--DKSFTNCSWG---YQVGLIG 494
LR G+N ALLSV VGLP+ G E GV V +Q D + SW YQVGL G
Sbjct: 533 LRAGSNRIALLSVAVGLPNIGGHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKG 592
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S I + LTW+K F P G++P+AL++ SMGKG+ W+N
Sbjct: 593 EAMNLVSTNGISAVDWMQGSLIAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
GQSIGRYW ++ T N Q C YHVPR++LKPT NLLVL
Sbjct: 653 GQSIGRYWTAYATGDCNGCQYSGVFRPPKCQLGCG-EPTQKWYHVPRSWLKPTQNLLVLF 711
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK-----PTV 666
EE G+P I++ ++ VC +V H P + +W I+ +GK P V
Sbjct: 712 EELGGDPTRISLVKRSVTNVCSNVAEYH-PNIKNW---------QIENYGKTEEFHLPKV 761
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
+ C G+ IS I FASFG P G C + G+CH+ S VVE+ C+G+ C++ + +
Sbjct: 762 RIHCAPGQSISSIKFASFGTPLGTCGSFKQGTCHAPDSHAVVEKKCLGRQTCAVTISNSN 821
Query: 727 FGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 822 FGEDPCPNVLKRLSVEAHC 840
>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
Length = 853
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/801 (44%), Positives = 473/801 (59%), Gaps = 74/801 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GG+DVI+TYVFWNLHEP G+YDF GRND++RF+K I GLY LRIG
Sbjct: 60 MWEGLIQKAKDGGIDVIETYVFWNLHEPTPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +G Y+ WAAKMA+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 180 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 240 -DSF-APNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA +T YD AP+DEYGL+REPK+GHLKELH AIK+C + L++ V
Sbjct: 298 GGTNFGRTAGGPFVTTSYDYDAPIDEYGLIREPKYGHLKELHRAIKMCEKALVSADPVVT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G Q+A V+ SG C+AFL N D A VLF N+ Y LP SISILPDC+ FNT
Sbjct: 358 SIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q S+ L D+ +W+ Y E + + D+ + +GLL+QI+ +D SDY
Sbjct: 418 AKVGVQ---TSQMEMLPTDTKNFQWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYL 474
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + + + P L +QS GH +H FVNG+ +GSA G+ N FT + ++
Sbjct: 475 WYMTSVDIGDTESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKIN 534
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + + + W YQVGL G
Sbjct: 535 LHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKG 594
Query: 495 EKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
E + + + W +++ P + LTW+KT F AP GN+P+AL+++ MGKG+ WV
Sbjct: 595 EAMNLAFPTNTRSIGWMDASLTVQKP-QPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWV 653
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
NG+SIGRYW +F T G+ SQ Y + + T YHVPR++LKP+ NLLV
Sbjct: 654 NGESIGRYWTAFAT--GDCSQCSYTGTYKPNKCQTGCGQPTQRYYHVPRSWLKPSQNLLV 711
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KP 664
+ EE GNP +++ ++ VC V+ H P + +W I+ +GK +P
Sbjct: 712 IFEELGGNPSSVSLVKRSVSGVCAEVSEYH-PNIKNW---------QIESYGKGQTFHRP 761
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
V C G+ I+ I FASFG P G C Y G CH++ S ++ER C+GK+RC++ + +
Sbjct: 762 KVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISN 821
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 822 TNFGKDPCPNVLKRLTVEAVC 842
>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 828
Score = 658 bits (1698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/791 (45%), Positives = 457/791 (57%), Gaps = 56/791 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP +G+Y F GR D++RFIK ++ GLYV LRIG
Sbjct: 47 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIG 106
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR++N+P+K
Sbjct: 107 PYVCAEWNFGGFPVWLKYVQGINFRTNNEPFKWHMQRFTKKIVDMMKSEGLFESQGGPII 166
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP P+IN CNG C
Sbjct: 167 LSQIENEYGPMEYEIGAPGRAYTEWAAKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYC 226
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 227 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPHRPAEDLAFSVARFIQKGGSFINYYMYH 284
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDE+GL+R+PKWGHLK+LH AIKLC L++G V
Sbjct: 285 GGTNFGRTAGGPFIATSYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALISGDPTVT 344
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG +EA VF SG CAAFL N + R V FRN+ Y LP SISILPDCK +NT
Sbjct: 345 SLGNYEEAHVFHSKSGACAAFLANYNPRSYAKVSFRNMHYNLPPWSISILPDCKNTVYNT 404
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
R+ Q T W+ Y E ++D++ A GLL+QI+ +D SDY WY
Sbjct: 405 ARLGAQSATMKMT---PVSGRFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWY 461
Query: 389 T--FRFHYNS----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ + YN S L V S GH LH F+NG +G+A+GS +N T V LR
Sbjct: 462 STDVKIGYNEGFLKSGRYPVLTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLR 521
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLS+ VGLP+ G E AGV + + + + W Y+VGL GE
Sbjct: 522 AGVNTIALLSIAVGLPNVGPHFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEA 581
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ + + LTWYKTTF AP GN P+AL++ SMGKG+ W+NGQ+
Sbjct: 582 LSLHSLSGSSSVEWVEGSLMARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQN 641
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
+GRYW ++K + G + + YHVP ++L PTGNLLV+ EE
Sbjct: 642 VGRYWPAYKATGGCGDCNYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEES 701
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGK 674
GNP GI++ I VC + P L + + + + K +P C G+
Sbjct: 702 GGNPAGISLVEREIESVCADIYEWQ-PTL---MNYEMQASGKVNK-PLRPKAHLWCAPGQ 756
Query: 675 KISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPG 734
KIS I FASFG P+G C Y GSCH+ S ER+CIG + CS+ + FGGDPCP
Sbjct: 757 KISSIKFASFGTPEGVCGSYREGSCHAHKSYDAFERSCIGMNSCSVTVAPEIFGGDPCPS 816
Query: 735 IHKALLVDAQC 745
+ K L V+A C
Sbjct: 817 VMKKLSVEAIC 827
>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/800 (45%), Positives = 467/800 (58%), Gaps = 72/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDV++TYVFWN+HEP G Y+F GR D++RFIK IQ GLY LRIG
Sbjct: 59 MWEGLIQKAKEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLVRFIKTIQKAGLYANLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 179 LSQIENEYGVQSKLFGAAGQNYMTWAAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG + R QD+AF VALFI K GS++NYYMYH
Sbjct: 239 -DAFS-PNRPYKPTMWTEAWSGWFNEFGGPIHQRPVQDLAFAVALFIQKGGSFINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH A+K+C + L++ V
Sbjct: 297 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKALVSADPIVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A+V+ SG CAAFL N D A V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 357 SLGSSQQAYVYTSESGNCAAFLSNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYF 386
+V Q S+ L +S WE Y E + D+T + A GLL+QI+ KD SDY
Sbjct: 417 AKVGVQ---TSQLEMLPTNSPMLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYL 473
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S+ + + P L VQS GH +H F+NG +GSA GS +N FT V+
Sbjct: 474 WYITSVDIGSTESFLHGGELPTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVN 533
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
R G N ALLSV VGLP+ G E G+ H + + W Y+VGL G
Sbjct: 534 FRAGRNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKG 593
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S + LTW+K+ F AP G++P+A++++ MGKG+ W+N
Sbjct: 594 EAMNLVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWIN 653
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
G SIGRYW ++ T GN + YA + T YHVPRA+LKP NLLV+
Sbjct: 654 GVSIGRYWTAYAT--GNCDKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVV 711
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPT 665
EE GNP I++ ++ VC V+ H P L +W I+ +GK +P
Sbjct: 712 FEELGGNPTSISLVKRSVTGVCADVSEYH-PTLKNW---------HIESYGKSEDLHRPK 761
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
V C G I+ I FASFG P G C Y G+CH+ S ++E+ CIGK RC++ + +
Sbjct: 762 VHLKCSAGYSITSIKFASFGTPLGTCGSYQQGTCHAPMSYDILEKRCIGKQRCAVTISNT 821
Query: 726 YFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+ C
Sbjct: 822 NFGQDPCPNVLKRLSVEVVC 841
>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 658 bits (1697), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/791 (45%), Positives = 457/791 (57%), Gaps = 56/791 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP +G+Y F GR D++RFIK ++ GLYV LRIG
Sbjct: 60 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR++N+P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYVQGINFRTNNEPFKWHMQRFTKKIVDMMKSEGLFESQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP P+IN CNG C
Sbjct: 180 LSQIENEYGPMEYEIGAPGRAYTEWAAKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 240 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPHRPAEDLAFSVARFIQKGGSFINYYMYH 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDE+GL+R+PKWGHLK+LH AIKLC L++G V
Sbjct: 298 GGTNFGRTAGGPFIATSYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALISGDPTVT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG +EA VF SG CAAFL N + R V FRN+ Y LP SISILPDCK +NT
Sbjct: 358 SLGNYEEAHVFHSKSGACAAFLANYNPRSYAKVSFRNMHYNLPPWSISILPDCKNTVYNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
R+ Q T W+ Y E ++D++ A GLL+QI+ +D SDY WY
Sbjct: 418 ARLGAQSATMKMT---PVSGRFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWY 474
Query: 389 T--FRFHYNS----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ + YN S L V S GH LH F+NG +G+A+GS +N T V LR
Sbjct: 475 STDVKIGYNEGFLKSGRYPVLTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLR 534
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLS+ VGLP+ G E AGV + + + + W Y+VGL GE
Sbjct: 535 AGVNTIALLSIAVGLPNVGPHFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEA 594
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ + + LTWYKTTF AP GN P+AL++ SMGKG+ W+NGQ+
Sbjct: 595 LSLHSLSGSSSVEWVEGSLMARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQN 654
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
+GRYW ++K + G + + YHVP ++L PTGNLLV+ EE
Sbjct: 655 VGRYWPAYKATGGCGDCNYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEES 714
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGK 674
GNP GI++ I VC + P L + + + + K +P C G+
Sbjct: 715 GGNPAGISLVEREIESVCADIYEWQ-PTL---MNYEMQASGKVNK-PLRPKAHLWCAPGQ 769
Query: 675 KISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPG 734
KIS I FASFG P+G C Y GSCH+ S ER+CIG + CS+ + FGGDPCP
Sbjct: 770 KISSIKFASFGTPEGVCGSYREGSCHAHKSYDAFERSCIGMNSCSVTVAPEIFGGDPCPS 829
Query: 735 IHKALLVDAQC 745
+ K L V+A C
Sbjct: 830 VMKKLSVEAIC 840
>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/801 (44%), Positives = 470/801 (58%), Gaps = 74/801 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GG+DVI+TYVFWNLHEP G+YDF GRND++RF+K I GLY LRIG
Sbjct: 60 MWEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +G Y+ WAAKMA+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 180 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 240 -DSF-APNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA +T YD AP+DEYGL+R+PK+GHLKELH AIK+C + L++ V
Sbjct: 298 GGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G Q+A V+ SG C+AFL N D A VLF N+ Y LP SISILPDC+ FNT
Sbjct: 358 SIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q S+ L D+ +WE Y E + + D+ + GLL+QI+ +D SDY
Sbjct: 418 AKVGVQ---TSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYL 474
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S + + P L +QS GH +H FVNG+ +GSA G+ N FT + ++
Sbjct: 475 WYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKIN 534
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + + W YQVGL G
Sbjct: 535 LHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKG 594
Query: 495 EKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
E + + + W +++ P + LTW+KT F AP GN+P+AL+++ MGKG+ WV
Sbjct: 595 EAMNLAFPTNTPSIGWMDASLTVQKP-QPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWV 653
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
NG+SIGRYW +F T G+ S Y + + T YHVPRA+LKP+ NLLV
Sbjct: 654 NGESIGRYWTAFAT--GDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLV 711
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KP 664
+ EE GNP +++ ++ VC V+ H P + +W I+ +GK +P
Sbjct: 712 IFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNW---------QIESYGKGQTFHRP 761
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
V C G+ I+ I FASFG P G C Y G CH++ S ++ER C+GK+RC++ + +
Sbjct: 762 KVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISN 821
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 822 SNFGKDPCPNVLKRLTVEAVC 842
>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 856
Score = 656 bits (1692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/801 (44%), Positives = 470/801 (58%), Gaps = 74/801 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GG+DVI+TYVFWNLHEP G+YDF GRND++RF+K I GLY LRIG
Sbjct: 63 MWEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 123 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +G Y+ WAAKMA+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 183 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 243 -DSF-APNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYH 300
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA +T YD AP+DEYGL+R+PK+GHLKELH AIK+C + L++ V
Sbjct: 301 GGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVT 360
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G Q+A V+ SG C+AFL N D A VLF N+ Y LP SISILPDC+ FNT
Sbjct: 361 SIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNT 420
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q S+ L D+ +WE Y E + + D+ + GLL+QI+ +D SDY
Sbjct: 421 AKVGVQ---TSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYL 477
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S + + P L +QS GH +H FVNG+ +GSA G+ N FT + ++
Sbjct: 478 WYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKIN 537
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + + W YQVGL G
Sbjct: 538 LHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKG 597
Query: 495 EKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
E + + + W +++ P + LTW+KT F AP GN+P+AL+++ MGKG+ WV
Sbjct: 598 EAMNLAFPTNTPSIGWMDASLTVQKP-QPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWV 656
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
NG+SIGRYW +F T G+ S Y + + T YHVPRA+LKP+ NLLV
Sbjct: 657 NGESIGRYWTAFAT--GDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLV 714
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KP 664
+ EE GNP +++ ++ VC V+ H P + +W I+ +GK +P
Sbjct: 715 IFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNW---------QIESYGKGQTFHRP 764
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
V C G+ I+ I FASFG P G C Y G CH++ S ++ER C+GK+RC++ + +
Sbjct: 765 KVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISN 824
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 825 SNFGKDPCPNVLKRLTVEAVC 845
>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
Length = 840
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/798 (45%), Positives = 461/798 (57%), Gaps = 71/798 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G Y F R D+++FIK +Q+ GLYV LRIG
Sbjct: 60 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKVVQAAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN P+K
Sbjct: 120 PYICAEWNFGGFPVWLKYVPGIEFRTDNGPFKAAMQKFTEKIVSMMKSEKLFESQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA MAV TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 180 LSQIENEFGPVEWEIGAPGKAYTKWAADMAVKLGTGVPWVMCKQDDAPDPVINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE+WT +Y +GG R A+D+AF VA FI GS++NYYMYH
Sbjct: 240 -ENFK-PNKDYKPKLWTENWTGWYTEFGGAVPYRPAEDLAFSVARFIQNGGSFMNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+A I YD APLDEYGL R+PKWGHL++LH AIKLC L++ V
Sbjct: 298 GGTNFGRTSAGLFIATSYDYDAPLDEYGLTRDPKWGHLRDLHKAIKLCEPALVSVDPTVK 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA VF+ S CAAFL N D + +V V F N Y+LP SISILPDCKT FNT
Sbjct: 358 SLGSNQEAHVFQSKSS-CAAFLANYDTKYSVKVTFGNGQYDLPPWSISILPDCKTAVFNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
R+ Q ++ T W+ Y EA + + EGL +QI+ +DASDY W
Sbjct: 417 ARLGAQSSQMKMT---PVGGALSWQSYIEEAATGYTDDTTTLEGLWEQINVTRDASDYLW 473
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + +S N +P L + S GH LH F+NG+ G+ +GS +N T V L
Sbjct: 474 YMTNVNIDSDEGFLKNGDSPVLTIFSAGHSLHVFINGQLAGTVYGSLENPKLTFSQNVKL 533
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
G N +LLSV VGLP+ G E+ AG+ + + + W Y++GL GE
Sbjct: 534 TAGINKISLLSVAVGLPNVGVHFEKWNAGILGPVTLKGLNEGTRDLSGWKWSYKIGLKGE 593
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S+ + + LTWYK TF AP GNDP+AL++ SMGKG+ WVNGQ
Sbjct: 594 ALSLHTVTGSSSVEWVEGSLSAKKQPLTWYKATFDAPEGNDPVALDMSSMGKGQIWVNGQ 653
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGR+W ++ T++G+ S YA + C + YHVPR++L P+GNLLV+
Sbjct: 654 SIGRHWPAY-TARGSCSACNYAGTYDDKKCRSNCG-EPSQRWYHVPRSWLNPSGNLLVVF 711
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS-- 669
EE G P GI++ VC + P L +W + G+ +QP
Sbjct: 712 EEWGGEPSGISLVKRTTGSVCADIFEGQ-PALKNW---------QMIALGRLDHLQPKAH 761
Query: 670 --CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
CP G+KISKI FAS+G+P G C + GSCH+ S E+ CIGK CS+ + + F
Sbjct: 762 LWCPHGQKISKIKFASYGSPQGTCGSFKAGSCHAHKSYDAFEKKCIGKQSCSVTVAAEVF 821
Query: 728 GGDPCPGIHKALLVDAQC 745
GGDPCP K L V+A C
Sbjct: 822 GGDPCPDSSKKLSVEAVC 839
>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
Length = 836
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/794 (43%), Positives = 460/794 (57%), Gaps = 62/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L KAK+GGLDVIQTYVFWN+HEP G Y+F GR D+++F+K Q GLYV LRIG
Sbjct: 55 MWPDLFRKAKDGGLDVIQTYVFWNMHEPSPGNYNFEGRFDLVKFVKLAQEAGLYVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 115 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKNAMEGFTKKVVDLMKSEGLFESQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY+ E + G Y+ WAA+MAV TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 175 LAQVENEYKPEEMEYGLAGAQYMNWAAQMAVGMDTGVPWVMCKQDDAPDPVINTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
PN P KP++WTE W+ +Y +GG R +D+AF VA F K GS+VNYYMYH
Sbjct: 235 DNFV--PNKPYKPTMWTEAWSGWYTEFGGASPHRPVEDLAFAVARFFVKGGSFVNYYMYH 292
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+R+PKWGHLKELH AIKLC L++G V
Sbjct: 293 GGTNFGRTAGGPFIATSYDYDAPIDEYGLIRQPKWGHLKELHKAIKLCEPALVSGDPVVT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A+V+ +G CAAF+VN D V+F Y++ S+SILPDC+ V FNT
Sbjct: 353 SLGHFQQAYVYSAGAGNCAAFIVNYDSNSVGRVIFNGQRYKIAPWSVSILPDCRNVVFNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+V Q ++ T F WE E I +F++ + A GLL+QI+ +D +DY WY
Sbjct: 413 AKVDVQTSQMKMTPVGGFG----WESIDENIASFEDNSISAVGLLEQINITRDNTDYLWY 468
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ N P L VQS G LH F+N + GS +G +N + V L
Sbjct: 469 ITSVEVDEDEPFIKNGGLPVLTVQSAGDALHVFINDDLAGSQYGRKENPKVRFSSGVRLN 528
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
GTN +LLS+TVGL + G E AGV + + ++ W YQ+GL GE
Sbjct: 529 VGTNKISLLSMTVGLQNIGPHFEMANAGVLGPITLSGFKDGTRDLSSQRWSYQIGLKGET 588
Query: 497 LQIYSNLGLNKVLW-SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
+ ++++ G N V W + P Q L WYK F APAG DP+ L+L SMGKG+AWVNGQS
Sbjct: 589 MNLHTS-GDNTVEWMKGVAVPQSQPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQS 647
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTGNLLVLL 611
IGRYW S+ Y H C ++ YHVPR++L+P+GN LVL
Sbjct: 648 IGRYWPSYLAEGVCSDGCSY--EGTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLF 705
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE GNP G+++ T ++ VC HV+ SH ++ W R ++K P V C
Sbjct: 706 EEIGGNPSGVSLVTRSVDSVCAHVSESHSQSINFW---RLESTDQVQKL-HIPKVHLQCS 761
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G++IS I FASFG P G C + G CHS +S +++ C+G +CS+ + + FGGDP
Sbjct: 762 KGQRISAIKFASFGTPQGLCGSFQQGDCHSPNSVATIQKKCMGLRKCSLSVSEKIFGGDP 821
Query: 732 CPGIHKALLVDAQC 745
CPG+ K + ++A C
Sbjct: 822 CPGVRKGVAIEAVC 835
>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
Length = 845
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/794 (45%), Positives = 464/794 (58%), Gaps = 60/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++FIK ++ GLYV LRIG
Sbjct: 62 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVKQAGLYVHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 122 PYVCAEWNFGGFPVWLKYVPGINFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 182 LSQIENEYGPMEYELGAPGQAYSKWAAKMAVGLGTGVPWVMCKQDDAPDPVINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP +WTE WT ++ +GG R A+D+AF VA FI K G+++NYYMYH
Sbjct: 242 --DYFSPNKPYKPKMWTEAWTGWFTEFGGAVPYRPAEDLAFSVARFIQKGGAFINYYMYH 299
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G +V+
Sbjct: 300 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGAPSVM 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ SG CAAFL N ++R V F N+ Y LP SISILPDCK +NT
Sbjct: 360 PLGNYQEAHVFKSKSGACAAFLANYNQRSFAKVSFGNMHYNLPPWSISILPDCKNTVYNT 419
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
R+ Q + R K S + W+ Y EA DNT + GLL+QI+ +D SDY W
Sbjct: 420 ARIGAQ-SARMKMSPIPMRGGFSWQAYSEEASTEGDNTFMMV-GLLEQINTTRDVSDYLW 477
Query: 388 YTFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y+ +S S L V S GH LH FVNG+ +G+A+GS ++ T V +
Sbjct: 478 YSTDVRIDSNEGFLRSGKYPVLTVLSAGHALHVFVNGQLSGTAYGSLESPKLTFSQGVKM 537
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N LLS+ VGLP+ G E AGV + + + + W Y++GL GE
Sbjct: 538 RAGINRIYLLSIAVGLPNVGPHFETWNAGVLGPVTLNGLNEGRRDLSWQKWTYKIGLHGE 597
Query: 496 KLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L ++S G + V W+ S S + L WYKTTF APAGN P+AL++ SMGKG+ W+NGQ
Sbjct: 598 ALSLHSLSGSSSVEWAQGSFVSRKQPLMWYKTTFNAPAGNSPLALDMGSMGKGQVWINGQ 657
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
S+GRYW ++K S GN YA N + C + YHVPR++L GNLLV+
Sbjct: 658 SVGRYWPAYKAS-GNCGVCNYAGTFNEKKCLTNCG-EASQRWYHVPRSWLNTAGNLLVVF 715
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P GI++ + VC + ++ ++ + + + +P V C
Sbjct: 716 EEWGGDPNGISLVRREVDSVCADIYEWQPTLMNYMMQSSGKVNKPL-----RPKVHLQCG 770
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+KIS I FASFG P+G C Y GSCH+ HS R C+G++ CS+ + FGGDP
Sbjct: 771 AGQKISLIKFASFGTPEGVCGSYRQGSCHAFHSYDAFNRLCVGQNWCSVTVAPEMFGGDP 830
Query: 732 CPGIHKALLVDAQC 745
CP + K L V+A C
Sbjct: 831 CPNVMKKLAVEAVC 844
>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
Length = 853
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/800 (44%), Positives = 478/800 (59%), Gaps = 73/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDV++TYVFWN+HEP G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 58 MWEDLIQKAKDGGLDVVETYVFWNVHEPSPGNYNFKGRYDLVRFLKTIQKAGLYAHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSEKLFESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAA MAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 178 LSQIENEYGAQSKLFGAAGHNYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP+IWTE W+ ++ +GG + R QD+A+ VA FI K GS+VNYYMYH
Sbjct: 238 -DSF-APNKPYKPTIWTEAWSGWFSEFGGPIHQRPVQDLAYAVARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL+R+PK+GHLKELH AIK+C R L++ +
Sbjct: 296 GGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQPKYGHLKELHRAIKMCERALVSADPIIT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A+V+ SG C+AFL N+D + A V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 356 SLGNFQQAYVYTSESGDCSAFLSNHDSKSAARVMFNNMHYNLPPWSISILPDCRNVVFNT 415
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q ++ +N++ S WE Y E I + D+ + + A GLL+QI+ +D++DY
Sbjct: 416 AKVGVQTSQMGMLPTNIQMLS---WESYDEDITSLDDSSTITAPGLLEQINVTRDSTDYL 472
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY SS + + P L VQS GH +H F+NG+ +GS+ G+ ++ FT V+
Sbjct: 473 WYKTSVDIGSSESFLRGGELPTLIVQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + + W YQVGL G
Sbjct: 533 LHAGTNRIALLSVAVGLPNVGGHFEAWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKG 592
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S ++ V W S+ + +Q LTW+KT F AP G++P+AL+++ MGKG+ W+N
Sbjct: 593 EAMNLVSPNSISSVDWMRGSLAAQKQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATN-TYHVPRAFLKPTGNLLVL 610
GQSIGRYW +F + GN + YA + T YHVPR++LKP NLLV+
Sbjct: 653 GQSIGRYWTAF--ANGNCNGCSYAGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVI 710
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPT 665
EE G+P I++ ++ VC V H P + +W I+ +GK P
Sbjct: 711 FEEFGGDPSRISLVKRSVSSVCAEVAEYH-PTIKNW---------HIESYGKAEDFHSPK 760
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
V C G+ IS I FASFG P G C Y G+CH++ S V+++ CIGK RC++ + +
Sbjct: 761 VHLRCNPGQAISSIKFASFGTPLGTCGSYQEGTCHAATSYSVLQKKCIGKQRCAVTISNS 820
Query: 726 YFGGDPCPGIHKALLVDAQC 745
F GDPCP + K L V+A C
Sbjct: 821 NF-GDPCPKVLKRLSVEAVC 839
>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/800 (44%), Positives = 465/800 (58%), Gaps = 72/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDV++TYVFWN+HEP G Y+F GR D+ RFIK IQ GLY LRIG
Sbjct: 59 MWEGLIQKAKEGGLDVVETYVFWNVHEPSPGNYNFEGRYDLARFIKTIQKAGLYANLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSENLFESQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 179 LSQIENEYGVQSKLFGAAGQNYMTWAAKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG + R QD+AF VA FI K GS++NYYMYH
Sbjct: 239 -DAFS-PNRPYKPTMWTEAWSGWFNEFGGPIHQRPVQDLAFAVARFIQKGGSFINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH A+K+C + L++ V
Sbjct: 297 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAVKMCEKALVSADPIVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A+V+ SG CAAFL N D A V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 357 SLGSSQQAYVYTSESGNCAAFLSNYDTDSAARVMFNNMHYNLPPWSISILPDCRNVVFNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYF 386
+V Q S+ L +S WE Y E + D+T + A GLL+QI+ KD SDY
Sbjct: 417 AKVGVQ---TSQLEMLPTNSPMLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYL 473
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S+ + + P L VQS GH +H F+NG +GSA GS +N FT V+
Sbjct: 474 WYITSVDIGSTESFLHGGELPTLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVN 533
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
R G N ALLSV VGLP+ G E G+ H + + W Y+VGL G
Sbjct: 534 FRAGRNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKG 593
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S + LTW+K+ F AP G++P+A++++ MGKG+ W+N
Sbjct: 594 EAMNLVSPNGISSVEWMEGSLAAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWIN 653
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
G SIGRYW ++ T GN + YA + T YHVPRA+LKP NLLV+
Sbjct: 654 GVSIGRYWTAYAT--GNCDKCNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVV 711
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPT 665
EE GNP I++ ++ VC V+ H P L +W I+ +GK +P
Sbjct: 712 FEELGGNPTSISLVKRSVTGVCADVSEYH-PTLKNW---------HIESYGKSEDLHRPK 761
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
V C G I+ I FASFG P G C Y G+CH+ S ++E+ CIGK RC++ + +
Sbjct: 762 VHLKCSAGYSITSIKFASFGTPLGTCGSYQQGTCHAPMSYDILEKRCIGKQRCAVTISNT 821
Query: 726 YFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+ C
Sbjct: 822 NFGQDPCPNVLKRLSVEVVC 841
>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/800 (45%), Positives = 473/800 (59%), Gaps = 72/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGG+DV++TYVFWN+HEP G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 57 MWEDLILKAKEGGIDVVETYVFWNVHEPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGMMKSERLFESQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G YV WAAKMAV+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 177 LSQIENEYGAQSKLQGAAGQNYVNWAAKMAVEMGTGVPWVMCKEDDAPDPVINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP IWTE W+ ++ +GG + R QD+AF A FI + GS+VNYYMYH
Sbjct: 237 -DKFT-PNRPYKPMIWTEAWSGWFTEFGGPIHKRPVQDLAFAAARFIIRGGSFVNYYMYH 294
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PK+GHLKELH AIK+C R L++ V
Sbjct: 295 GGTNFGRTAGGPFIATSYDYDAPLDEYGLIRQPKYGHLKELHRAIKMCERALVSTDPIVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG+ Q+A V+ SG CAAFL N D + + V+F N+ Y LP S+SILPDC+ V FNT
Sbjct: 355 SLGEFQQAHVYTTESGDCAAFLSNYDSKSSARVMFNNMHYSLPPWSVSILPDCRNVVFNT 414
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFD-NTLLRAEGLLDQISAAKDASDYF 386
+V Q ++ +N + S WE + E I + D ++ + A GLL+QI+ KDASDY
Sbjct: 415 AKVGVQTSQMQMLPTNTQLFS---WESFDEDIYSVDESSAITAPGLLEQINVTKDASDYL 471
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY SS + + P L VQS GH +H F+NG+ +GSA G+ + FT V+
Sbjct: 472 WYITSVDIGSSESFLRGGELPTLIVQSTGHAVHVFINGQLSGSAFGTREYRRFTYTGKVN 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L G N ALLSV +GLP+ G E G+ H + + W YQVGL G
Sbjct: 532 LLAGINRIALLSVAIGLPNVGEHFESWSTGILGPVALHGLDKGKWDLSGQKWTYQVGLKG 591
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+I Q LTW+KT F AP G++P+AL+++ MGKG+ W+N
Sbjct: 592 EAMDLASPNGISSVAWMQSAIVVQRNQPLTWHKTYFDAPEGDEPLALDMEGMGKGQIWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW +F T GN + YA + + T YHVPR++LK T NLLV+
Sbjct: 652 GQSIGRYWTAFAT--GNCNDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSWLKTTQNLLVI 709
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK-----PT 665
EE GNP I++ ++ VC V+ H P + +W I+ +GK P
Sbjct: 710 FEELGGNPSKISLVKRSVSSVCADVSEYH-PNIKNW---------HIESYGKSEEFRPPK 759
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
V C G+ IS I FASFG P G C Y G+CHS S ++E+ CIGK RC++ + +
Sbjct: 760 VHLHCSPGQTISSIKFASFGTPLGTCGNYEQGACHSPASYVILEKRCIGKPRCTVTVSNS 819
Query: 726 YFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 820 NFGQDPCPKVLKRLSVEAVC 839
>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/799 (44%), Positives = 468/799 (58%), Gaps = 71/799 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GG+DVI+TYVFWN+HEP G Y F GR DI+RF+K IQ GLY LRIG
Sbjct: 59 MWEDLIQKAKDGGIDVIETYVFWNVHEPTPGNYHFEGRYDIVRFMKTIQRAGLYAHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKAENLFESQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAA MA+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 179 LSQIENEYGVQSKLFGAAGYNYMTWAANMAIQTGTGVPWVMCKEDDAPDPVINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP+IWTE W+ ++ +GG + R QD+AF VA FI K GS++NYYM+H
Sbjct: 239 -DSF-APNKPYKPTIWTEAWSGWFSEFGGTIHQRPVQDLAFAVAKFIQKGGSFINYYMFH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+R+PK+GHLKELH +IK+C R L++ V
Sbjct: 297 GGTNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRSIKMCERALVSVDPIVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG Q+ V+ SG CAAFL N D + A VLF N+ Y LP SISILPDC+ V FNT
Sbjct: 357 QLGTYQQVHVYSTESGDCAAFLANYDTKSAARVLFNNMHYNLPPWSISILPDCRNVVFNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYFW 387
+V Q S+ L + WE Y E I + D+ + GLL+QI+ +DASDY W
Sbjct: 417 AKVGVQ---TSQMEMLPTNGIFSWESYDEDISSLDDSSTFTTAGLLEQINVTRDASDYLW 473
Query: 388 YTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y SS + + P L +QS GH +H F+NG+ +GSA G+ +N FT V+L
Sbjct: 474 YMTSVDIGSSESFLHGGELPTLIIQSTGHAVHIFINGQLSGSAFGTRENRRFTYTGKVNL 533
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R GTN ALLSV VGLP+ G E G+ H + + W YQVGL GE
Sbjct: 534 RPGTNRIALLSVAVGLPNVGGHYESWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGE 593
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
+ + S + V W SS+ + Q LTW+K F AP G++P+AL+++ MGKG+ W+NG
Sbjct: 594 AMNLLSPDSVTSVEWMQSSLAAQRPQPLTWHKAYFNAPEGDEPLALDMEGMGKGQIWING 653
Query: 553 QSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
QSIGRYW ++ + GN + YA T YHVPR++LKPT NLLV+
Sbjct: 654 QSIGRYWTAY--ASGNCNGCSYAGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTNNLLVVF 711
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPTV 666
EE G+P I++ ++ VC V+ H P + +W I+ +G+ P V
Sbjct: 712 EELGGDPSRISLVKRSLASVCAEVSEFH-PTIKNW---------QIESYGRAEEFHSPKV 761
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
C G+ I+ I FASFG P G C Y G+CH+S S ++E+ CIGK RC++ + +
Sbjct: 762 HLRCSGGQSITSIKFASFGTPLGTCGSYQQGACHASTSYAILEKKCIGKQRCAVTISNSN 821
Query: 727 FGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 822 FGQDPCPNVMKKLSVEAVC 840
>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
Length = 856
Score = 653 bits (1685), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/803 (44%), Positives = 473/803 (58%), Gaps = 78/803 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GG+DVI+TYVFWNLHEP G+YDF GRND++RF+K I GLY LRIG
Sbjct: 63 MWEGLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKAIHKAGLYAHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 123 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +G Y+ WAAKMA+ TGVPWVMCK+DDAP PVI+ CNG C
Sbjct: 183 LSQIENEYGRQGQILGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVISTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP+IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 243 -DSF-APNKPYKPTIWTEAWSGWFTEFGGPMHHRPVQDLAFAVARFIQKGGSFVNYYMYH 300
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA +T YD AP+DEYGL+R+PK+GHLKELH AIK+C + L++ V
Sbjct: 301 GGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSTDPVVT 360
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A V+ SG C+AFL N D A VLF N+ Y LP SISILPDC+ FNT
Sbjct: 361 SLGNKQQAHVYSSESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNT 420
Query: 329 ERVSTQYNKRSK--TSNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDY 385
+V Q ++ TS F +W+ Y E + + D+ + +GLL+QI+ +D SDY
Sbjct: 421 AKVGVQTSQMEMLPTSTGSF----QWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDY 476
Query: 386 FWYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WY + + + P L +QS GH +H FVNG+ +GSA G+ N FT + +
Sbjct: 477 LWYMTSVDIGETESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYKGKI 536
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLI 493
+L GTN ALLSV VGLP+ G E G+ H + + + W YQVGL
Sbjct: 537 NLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLK 596
Query: 494 GEKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE + + W +++ P + LTW+KT F AP GN+P+AL+++ MGKG+ W
Sbjct: 597 GEAMNLAYPTNTPSFGWMDASLTVQKP-QPLTWHKTYFDAPEGNEPLALDMEGMGKGQIW 655
Query: 550 VNGQSIGRYWVSFKTSK-GNPSQT-QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNL 607
VNG+SIGRYW +F T G+ S T Y N S C YHVPR++LKP+ NL
Sbjct: 656 VNGESIGRYWTAFATGDCGHCSYTGTYKPNKCNS--GCG-QPTQKWYHVPRSWLKPSQNL 712
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK----- 662
LV+ EE GNP +++ ++ VC V+ H P + +W I+ +GK
Sbjct: 713 LVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNW---------QIESYGKGQTFR 762
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
+P V C G+ IS I FASFG P G C Y G CH++ S ++ER C+GK+RC++ +
Sbjct: 763 RPKVHLKCSPGQAISAIKFASFGTPLGTCGSYQQGDCHAATSYAILERKCVGKARCAVTI 822
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
+ FG DPCP + K L V+A C
Sbjct: 823 SNSNFGKDPCPNVLKRLTVEAVC 845
>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
Length = 848
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/800 (42%), Positives = 463/800 (57%), Gaps = 86/800 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP++I +AK+GGL+ IQTYVFWN+HEP++G+++FSGR D+++FIK I+ G+YV LR+G
Sbjct: 74 MWPNIIKRAKQGGLNTIQTYVFWNVHEPEQGKFNFSGRADLVKFIKLIEKNGMYVTLRLG 133
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EWT+GGLP WL +V GI FR+DN P+K
Sbjct: 134 PFIQAEWTHGGLPYWLREVPGIFFRTDNTPFKEHTERYVKVILDKMKEEKLFASQGGPII 193
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G Y+ WA+K+ G+PWVMCKQ+DAP P+INACNG C
Sbjct: 194 LGQIENEYSAVQRAYKEDGLNYIKWASKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHC 253
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN NKPS+WTE+WT+ ++V+G P RS +DIA+ VA F +KNG++VNYYMYH
Sbjct: 254 GDTFPGPNKENKPSLWTENWTTQFRVYGDPPAQRSVEDIAYSVARFFSKNGTHVNYYMYH 313
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A ++ T YYD APLDEYGL REPK+GHLK LH A+ LC + LL G V
Sbjct: 314 GGTNFGRTSAHYVTTRYYDDAPLDEYGLEREPKYGHLKHLHNALNLCKKALLWGQPRVEK 373
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
E +E+ + VCAAFL NN+ A + F+ Y +P +SISILPDCKTV +NT
Sbjct: 374 PSNETEIRYYEQPGTKVCAAFLANNNTESAEKIKFKGKEYIIPHRSISILPDCKTVVYNT 433
Query: 329 ERVSTQYNKR----SKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASD 384
+ + + R SK +N FD E I + GL KD +D
Sbjct: 434 GEIISHHTSRNFMKSKKANKNFDFKVFTETVPSKIKGDSYIPVELYGL------TKDETD 487
Query: 385 YFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
Y WYT F + ++ ++ L + S GH LH ++NGEY G+ HGSH+ SF +
Sbjct: 488 YGWYTTSFKIDDNDLSKKKGSKPTLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKP 547
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-------QDKSFTNCSWGYQVG 491
+ L++G N +L V G PDSG+++E + G V + D + N WG +VG
Sbjct: 548 ISLKEGENHLTMLGVLTGFPDSGSYMEHRYTGPRSVSILGLGSGTLDLTEEN-KWGNKVG 606
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
+ GEKL I++ GL KV W LTWY+T F AP A+ + MGKG WVN
Sbjct: 607 MEGEKLGIHAEEGLKKVKWQKFSGKEPGLTWYQTYFDAPESQSAAAIRMNGMGKGLIWVN 666
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
G+ +GRYW+SF + G P+Q + YH+PR+FLKP NLLV+
Sbjct: 667 GEGVGRYWMSFLSPLGQPTQIE--------------------YHIPRSFLKPKKNLLVIF 706
Query: 612 EEE-NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQ--RGDTDIKKFGKKPTVQP 668
EEE N P I I VC H+ ++ P + W R + TD T
Sbjct: 707 EEEPNVKPELIDFVIINRDTVCSHIGENYTPSVRHWTRKNDQVQAITDDVHL----TASL 762
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF- 727
C KKIS++ FASFGNP+G C + +G+C++ S+ VVE+ C+GK+ C IP+ F
Sbjct: 763 KCSGTKKISEVEFASFGNPNGTCGNFTLGTCNAPVSKKVVEKYCLGKAECVIPVNKSTFQ 822
Query: 728 --GGDPCPGIHKALLVDAQC 745
D CP + K L V +C
Sbjct: 823 QDKKDSCPKVEKKLAVQVKC 842
>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 855
Score = 652 bits (1681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/801 (44%), Positives = 470/801 (58%), Gaps = 75/801 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GG+DVI+TYVFWNLHEP G+YDF GRND++RF+K I GLY LRIG
Sbjct: 63 MWEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 123 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +G Y+ WAAKMA+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 183 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 243 -DSF-APNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYH 300
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA +T YD AP+DEYGL+R+PK+GHLKELH AIK+C + L++ V
Sbjct: 301 GGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVT 360
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G Q+A V+ SG C+AFL N D A VLF N+ Y LP SISILPDC+ FNT
Sbjct: 361 SIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNT 420
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q S+ L D+ +WE Y E + + D+ + GLL+QI+ +D SDY
Sbjct: 421 AKVGVQ---TSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYL 477
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S + + P L +QS GH +H FVNG+ +GSA G+ N FT + ++
Sbjct: 478 WYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKIN 537
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + + W YQVGL G
Sbjct: 538 LHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKG 597
Query: 495 EKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
E + + + W +++ P + LTW+KT F AP GN+P+AL+++ MGKG+ WV
Sbjct: 598 EAMNLAFPTNTPSIGWMDASLTVQKP-QPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWV 656
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
NG+SIGRYW +F T G+ S Y + + T YHVPRA+LKP+ NLLV
Sbjct: 657 NGESIGRYWTAFAT--GDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLV 714
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KP 664
+ EE GNP +++ ++ VC V+ H P + +W I+ +GK +P
Sbjct: 715 IFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNW---------QIESYGKGQTFHRP 764
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
V C G+ I+ I FASFG P G C Y G CH++ S ++ER C+GK+RC++ + +
Sbjct: 765 KVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILER-CVGKARCAVTISN 823
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
FG DPCP + K L V+A C
Sbjct: 824 SNFGKDPCPNVLKRLTVEAVC 844
>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/794 (44%), Positives = 457/794 (57%), Gaps = 67/794 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ KAK+GGLDV+ TYVFWN+HEP G YDF GR D++RFIK Q GLYV LRIG
Sbjct: 59 MWDDLMQKAKDGGLDVVDTYVFWNVHEPSPGNYDFEGRYDLVRFIKTAQRVGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKMAMQGFTQKIVQMMKSEKLFASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY A G Y+ WAAKMAV +TGVPWVMCK+DDAP PVIN+CNG C
Sbjct: 179 LSQIENEYGPQSKALGAAGHAYMNWAAKMAVGLNTGVPWVMCKEDDAPDPVINSCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE W+ ++ +GG Y R QD+AF VA F+ K GS NYYMYH
Sbjct: 239 --DYFSPNKPYKPTLWTEAWSGWFTEFGGPVYGRPVQDLAFAVARFVQKGGSLFNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG++R+PK+GHLK LH AIKLC L++ V
Sbjct: 297 GGTNFGRTAGGPFITTSYDYDAPLDEYGMLRQPKYGHLKNLHRAIKLCEHALVSSDPTVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A VF G CAAFL N A TV+F N+ Y LP SISILPDCK V FNT
Sbjct: 357 SLGAYEQAHVFSSGPGRCAAFLANYHTNSAATVVFNNMRYALPAWSISILPDCKRVVFNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYFW 387
+V ++T L S WE Y E + ++ + GLL+QI+ +D SDY W
Sbjct: 417 AQVGVHI---AQTQMLPTISKLSWETYNEDTYSLGGSSRMTVAGLLEQINVTRDTSDYLW 473
Query: 388 YTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y +SS A Q P L V+S GH +H F+NG+++GSA+GS ++ +FT ++L
Sbjct: 474 YMTSVGISSSEAFLRGGQKPTLSVRSAGHAVHVFINGQFSGSAYGSREHPAFTYTGPINL 533
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAG------VHRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS+ VGLP+ G E+ G + + K T W YQVGL GE
Sbjct: 534 RAGMNKIALLSIAVGLPNVGLHFEKWQTGILGPISISGLNGGKKDLTWQKWSYQVGLKGE 593
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
+ + S V W S+ R LTWYK +F AP GN+P+AL+L+SMGKG+AW+NGQ
Sbjct: 594 AMNLVSPTEATSVDWIKGSLLQGQRPLTWYKASFNAPRGNEPLALDLRSMGKGQAWINGQ 653
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGRYW+++ +KG S+ YA T + C YHVPR++LKPT N+LVL
Sbjct: 654 SIGRYWMAY--AKGGCSRCTYAGTYRPPTCENGCG-QPTQRWYHVPRSWLKPTNNVLVLF 710
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+ I++ ++ +CG H S + + D ++ C
Sbjct: 711 EELGGDASKISLMRRSVTGLCGEAVEYHAKNDSYIIESNEELD----------SLHLQCN 760
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I FASFG P G C Y G+CH+ S ++E+ CIG CS+ FG DP
Sbjct: 761 PGQVISAIKFASFGTPSGTCGSYQKGTCHAPDSHAIIEKKCIGLKSCSVSTTRDNFGVDP 820
Query: 732 CPGIHKALLVDAQC 745
CP K LLV+ C
Sbjct: 821 CPNELKQLLVEVDC 834
>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
Length = 832
Score = 650 bits (1676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/797 (44%), Positives = 461/797 (57%), Gaps = 72/797 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F GR D++RF+K ++ GLY LRIG
Sbjct: 57 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFGGRYDLVRFLKLVKQAGLYAHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 117 PYVCAEWNFGGFPVWLKYVPGIHFRTDNGPFKAAMGKFTEKIVSMMKAEGLYETQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV +TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 177 LSQIENEYGPVEYYDGAAGKSYTNWAAKMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN NKP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 237 --DYFSPNKDNKPKMWTEAWTGWFTGFGGAVPQRPAEDMAFAVARFIQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA I+ YD AP+DEYGL+R+PKWGHL++LH AIKLC L++G +
Sbjct: 295 GGTNFGRTAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEPALVSGEPTIT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ QE++V+ S CAAFL N + R TV F + Y LP S+SILPDCKT FNT
Sbjct: 355 SLGQNQESYVYRSKSS-CAAFLANFNSRYYATVTFNGMHYNLPPWSVSILPDCKTTVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q + T +++ W+ Y E ++ +GL++Q+S D SDY WY
Sbjct: 414 ARVGAQ----TTTMKMQYLGGFSWKAYTEDTDALNDNTFTKDGLVEQLSTTWDRSDYLWY 469
Query: 389 TFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + + L V S GH +H F+NG+ +G+A+GS DN T + L
Sbjct: 470 TTYVDIAKNEEFLKTGKYPYLTVMSAGHAVHVFINGQLSGTAYGSLDNPKLTYSGSAKLW 529
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G+N ++LSV+VGLP+ G E GV + + + W YQ+GL GE
Sbjct: 530 AGSNKISILSVSVGLPNVGNHFETWNTGVLGPVTLTGLNEGKRDLSLQKWTYQIGLHGET 589
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L ++S G + V W S + LTWYKT F AP GN+P+AL++ +MGKG+ W+NGQSIG
Sbjct: 590 LSLHSLTGSSNVEWGEA-SQKQPLTWYKTFFNAPPGNEPLALDMNTMGKGQIWINGQSIG 648
Query: 557 RYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
RYW ++K S G+ Y N + C + YHVPR++L PTGN LV+LEE
Sbjct: 649 RYWPAYKAS-GSCGSCDYRGTYNEKKCLSNCG-EASQRWYHVPRSWLIPTGNFLVVLEEW 706
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGK 674
G+P GI++ ++ VC V P + +W K +G +P V SC G+
Sbjct: 707 GGDPTGISMVKRSVASVCAEVEELQ-PTMDNW---------RTKAYG-RPKVHLSCDPGQ 755
Query: 675 KISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERA-----CIGKSRCSIPLLSRYFGG 729
K+SKI FASFG P G C ++ GSCH+ S E+ C+G+ CS+ + FGG
Sbjct: 756 KMSKIKFASFGTPQGTCGSFSEGSCHAHKSYDAFEQEGLMQNCVGQEFCSVNVAPEVFGG 815
Query: 730 DPCPGIHKALLVDAQCR 746
DPCPG K L V+A C
Sbjct: 816 DPCPGTMKKLAVEAICE 832
>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
Length = 843
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/793 (44%), Positives = 458/793 (57%), Gaps = 58/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI +AK+GGLDVIQTYVFWN HEP G+Y F D+++FIK +Q GLYV LRIG
Sbjct: 60 MWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYVPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQDDAP PVINACNG C
Sbjct: 180 LSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA F+ K G+++NYYMYH
Sbjct: 240 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLAFSVAKFLQKGGAFINYYMYH 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++ V
Sbjct: 298 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSSDPTVT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ SG CAAFL N + + V F N+ Y LP SISILPDCK +NT
Sbjct: 358 PLGTYQEAHVFKSNSGACAAFLANYNRKSFAKVAFGNMHYNLPPWSISILPDCKNTVYNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
R+ Q R K + W+ Y + + +T GLL+QI+ +DA+DY WY
Sbjct: 418 ARIGAQ-TARMKMPRVPIHGGFSWQAYNDETATYSDTSFTTAGLLEQINITRDATDYLWY 476
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ S + P L V S GH L F+NG+ G+A+GS + T + V+LR
Sbjct: 477 MTDVKIDPSEDFLRSGNYPVLTVLSAGHALRVFINGQLAGTAYGSLETPKLTFKQGVNLR 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLS+ VGLP+ G E AG+ + + + + W Y++GL GE
Sbjct: 537 AGINQIALLSIAVGLPNVGPHFETWNAGILGPVILNGLNEGRRDLSWQKWSYKIGLKGEA 596
Query: 497 LQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W+ S + + LTWYKTTF PAGN P+AL++ SMGKG+ W+N +S
Sbjct: 597 LSLHSLTGSSSVEWTEGSFVAQRQPLTWYKTTFNRPAGNSPLALDMGSMGKGQVWINDRS 656
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEE 613
IGRYW ++K S G + YA +A+ YHVPR++L PTGNLLV+LEE
Sbjct: 657 IGRYWPAYKAS-GTCGECNYAGTFSEKKCLSNCGEASQRWYHVPRSWLNPTGNLLVVLEE 715
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSW-LRHRQRGDTDIKKFGKKPTVQPSCPL 672
G+P GI + + VC + P L SW ++ R + + +P SC
Sbjct: 716 WGGDPNGIFLVRREVDSVCADIYEWQ-PNLMSWQMQVSGRVNKPL-----RPKAHLSCGP 769
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KIS I FASFG P+G C + G CH+ S ER+CIG++ CS+ + FGGDPC
Sbjct: 770 GQKISSIKFASFGTPEGVCGSFREGGCHAHKSYNAFERSCIGQNSCSVTVSPENFGGDPC 829
Query: 733 PGIHKALLVDAQC 745
P + K L V+A C
Sbjct: 830 PNVMKKLSVEAIC 842
>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 649 bits (1674), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/795 (45%), Positives = 465/795 (58%), Gaps = 63/795 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLD I TYVFWNLHEP G+Y+F GR D++RFIK IQ GLYV LRIG
Sbjct: 57 MWEGLIQKAKDGGLDAIDTYVFWNLHEPSPGKYNFEGRYDLVRFIKLIQKAGLYVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 117 PYICAEWNFGGFPVWLKFVPGVSFRTDNEPFKMAMQRFTQKIVQMMKNEKLFESQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY AF G Y+ WAAKMAV TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 177 ISQIENEYGHESRAFGAPGYAYLTWAAKMAVAMDTGVPWVMCKEDDAPDPVINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN PNKP++WTE W+ ++ + G R +D++F V FI K GS+VNYYMYH
Sbjct: 237 --DYFSPNKPNKPTLWTEAWSGWFTEFAGPIQQRPVEDLSFAVTRFIQKGGSFVNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH AIKLC R LL+
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCERALLSADPAET 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG +A VF SG CAAFL N + A V F ++ Y L SISILPDCK V FNT
Sbjct: 355 SLGTYAKAQVFYSESGGCAAFLSNYNPTSAARVTFNSMHYNLAPWSISILPDCKNVVFNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSD-EKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
V Q S+ L +S+ WE + E I + D++ + GLL+Q++ +D SDY
Sbjct: 415 ATVGVQ---TSQMQMLPTNSELLSWETFNEDISSADDDSTITVVGLLEQLNVTRDTSDYL 471
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY+ R +SS + Q P L VQS GH +H F+NG +GSA G+ ++ FT V+
Sbjct: 472 WYSTRIDISSSESFLHGGQHPTLIVQSTGHAMHVFINGHLSGSAFGTREDRRFTFTGDVN 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L+ G+N ++LS+ VGLP++G E GV H + K + W YQVGL G
Sbjct: 532 LQTGSNIISVLSIAVGLPNNGPHFETWSTGVLGPVVLHGLDEGKKDLSWQKWSYQVGLKG 591
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S ++ + W S + LTWYK F AP G++P+AL++ SMGKG+ W+N
Sbjct: 592 EAMNLVSPNVISNIDWMKGSLFAQKQQPLTWYKAYFDAPDGDEPLALDMGSMGKGQVWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVL 610
GQSIGRYW ++ +KGN S Y+ T F YHVPR++LKPT NLLVL
Sbjct: 652 GQSIGRYWTAY--AKGNCSGCSYSGTFRTTKCQFGCGQPTQRWYHVPRSWLKPTQNLLVL 709
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE G+ I+ ++ VC V+ H P + +W Q ++ KP V C
Sbjct: 710 FEELGGDASKISFMKRSVTTVCAEVSEHH-PNIKNWHIESQERPEEM----SKPKVHLHC 764
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
G+ IS I FASFG P G C + G+CH+ SQ V+E+ CIG+ +CS+ + S F +
Sbjct: 765 ASGQSISAIKFASFGTPSGTCGNFQKGTCHAPTSQAVLEKKCIGQQKCSVAVSSSNF-AN 823
Query: 731 PCPGIHKALLVDAQC 745
PCP + K L V+A C
Sbjct: 824 PCPNMFKKLSVEAVC 838
>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 832
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/800 (42%), Positives = 462/800 (57%), Gaps = 86/800 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP++I +AK+GGL+ IQTYVFWN+HEP++G+++FSGR D+++FIK I+ GLYV LR+G
Sbjct: 58 MWPNIIKRAKQGGLNTIQTYVFWNVHEPEQGKFNFSGRADLVKFIKLIEKNGLYVTLRLG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EWT+GGLP WL +V GI FR+DN+P+K
Sbjct: 118 PFIQAEWTHGGLPYWLREVPGIFFRTDNEPFKEHTERYVKVVLDMMKEEKLFASQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G Y+ WA+K+ G+PWVMCKQ+DAP P+INACNG C
Sbjct: 178 LGQIENEYSAVQRAYKEDGLNYIKWASKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN NKPS+WTE+WT+ ++V+G P RS +DIA+ VA F +KNG++VNYYMYH
Sbjct: 238 GDTFPGPNKDNKPSLWTENWTTQFRVFGDPPAQRSVEDIAYSVARFFSKNGTHVNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A ++ T YYD APLDE+GL REPK+GHLK LH A+ LC + LL G V
Sbjct: 298 GGTNFGRTSAHYVTTRYYDDAPLDEFGLEREPKYGHLKHLHNALNLCKKALLWGQPRVEK 357
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
E +E+ + VCAAFL NN+ A + FR Y +P +SISILPDCKTV +NT
Sbjct: 358 PSNETEIRYYEQPGTKVCAAFLANNNTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNT 417
Query: 329 ERVSTQYNKR----SKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASD 384
+ + + R SK +N FD E I + GL KD SD
Sbjct: 418 GEIISHHTSRNFMKSKKANKNFDFKVFTESVPSKIKGDSFIPVELYGL------TKDESD 471
Query: 385 YFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
Y WYT F + ++ + L + S GH LH ++NGEY G+ HGSH+ SF +
Sbjct: 472 YGWYTTSFKIDDNDLSKKKGGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKP 531
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-------QDKSFTNCSWGYQVG 491
V L++G N +L V G PDSG+++E + G V + D + N WG +VG
Sbjct: 532 VTLKEGENHLTMLGVLTGFPDSGSYMEHRYTGPRSVSILGLGSGTLDLTEEN-KWGNKVG 590
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
+ GE+L I++ GL KV W +TWY+T F AP A+ + MGKG WVN
Sbjct: 591 MEGERLGIHAEEGLKKVKWEKASGKEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVN 650
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
G+ +GRYW+SF + G P+Q + YH+PR+FLKP NLLV+
Sbjct: 651 GEGVGRYWMSFLSPLGQPTQIE--------------------YHIPRSFLKPKKNLLVIF 690
Query: 612 EEE-NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQ--RGDTDIKKFGKKPTVQP 668
EEE N P I + VC ++ ++ P + W R + TD T
Sbjct: 691 EEEPNVKPELIDFVIVNRDTVCSYIGENYTPSVRHWTRKNDQVQAITDDVHL----TANL 746
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF- 727
C KKIS + FASFGNP+G C + +GSC++ S+ VVE+ C+GK+ C IP+ F
Sbjct: 747 KCSGTKKISAVEFASFGNPNGTCGNFTLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFE 806
Query: 728 --GGDPCPGIHKALLVDAQC 745
D CP + K L V +C
Sbjct: 807 QDKKDSCPKVEKKLAVQVKC 826
>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 853
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/811 (44%), Positives = 469/811 (57%), Gaps = 92/811 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDVI+TY+FWN+HEP +G Y+F GR D++RF+K IQ GLY LRIG
Sbjct: 62 MWEDLIYKAKEGGLDVIETYIFWNVHEPSRGNYNFEGRYDLVRFVKTIQKAGLYAHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 122 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G YV WAAKMAV+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 182 LSQIENEYGAQSKLLGPAGQNYVNWAAKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KPSIWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 242 --DYFTPNKPYKPSIWTEAWSGWFSEFGGPNHERPVQDLAFGVARFIQKGGSFVNYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL+R+PK+GHLKELH AIK+C R L++ V
Sbjct: 300 GGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCERALVSADPAVT 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G Q+A V+ SG CAAFL N D + +V V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 360 SMGNFQQAHVYTTKSGDCAAFLSNFDTKSSVRVMFNNMHYNLPPWSISILPDCRNVVFNT 419
Query: 329 ERVSTQYNKRSK--TSNLKFDSDEKWEEYREAILNFDN---TLLRAEGLLDQISAAKDAS 383
+V Q ++ T+ F WE + E I + D+ + GLL+QI+ +D S
Sbjct: 420 AKVGVQTSQMQMLPTNTHMFS----WESFDEDISSLDDGSAITITTSGLLEQINVTRDTS 475
Query: 384 DYFWYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WY SS + + P L VQS GH +H F+NG+ +GSA+G+ ++ F
Sbjct: 476 DYLWYITSVDIGSSESFLRGGKLPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFRYTG 535
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLE---RKVAGVHRVRVQDKSFTNCS---WGYQVG 491
TV+LR GTN ALLSV VGLP+ G E + G +R ++ + S W YQVG
Sbjct: 536 TVNLRAGTNRIALLSVAVGLPNVGGHFETWNTGILGPVVLRGLNQGKLDLSWQKWTYQVG 595
Query: 492 LIGEKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE + + S G++ V W S++ S Q LTW+KT F AP G++P+AL+++ MGKG+
Sbjct: 596 LKGEAMNLASPNGISSVEWMQSALVSEKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQI 655
Query: 549 WVNGQSIGRYWVSFKTSKGN---------PSQTQYAVNTVTSIHFCAIIKATNTYHVPRA 599
W+NG SIGRYW + N P + Q T YHVPR+
Sbjct: 656 WINGLSIGRYWTAPAAGICNGCSYAGTFRPPKCQVGCGQPTQ----------RWYHVPRS 705
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
+LKP NLLV+ EE G+P I++ ++ +C V+ H P + +W I
Sbjct: 706 WLKPNHNLLVVFEELGGDPSKISLVKRSVSSICADVSEYH-PNIRNW---------HIDS 755
Query: 660 FGKK-----PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIG 714
+GK P V C + IS I FASFG P G C Y G CHS S +E+ CIG
Sbjct: 756 YGKSEEFHPPKVHLHCSPSQAISSIKFASFGTPLGTCGNYEKGVCHSPTSYATLEKKCIG 815
Query: 715 KSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
K RC++ + + FG DPCP + K L V+A C
Sbjct: 816 KPRCTVTVSNSNFGQDPCPNVLKRLSVEAVC 846
>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
Length = 848
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/800 (42%), Positives = 462/800 (57%), Gaps = 86/800 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP++I +AK+GGL+ IQTYVFWN+HEP++G+++FSGR D+++FIK I+ GLYV LR+G
Sbjct: 74 MWPNIIKRAKQGGLNTIQTYVFWNVHEPEQGKFNFSGRADLVKFIKLIEKNGLYVTLRLG 133
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EWT+GGLP WL +V GI FR+DN+P+K
Sbjct: 134 PFIQAEWTHGGLPYWLREVPGIFFRTDNEPFKEHTERYVKVVLDMMKEEKLFASQGGPII 193
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G Y+ WA+K+ G+PWVMCKQ+DAP P+INACNG C
Sbjct: 194 LGQIENEYSAVQRAYKEDGLNYIKWASKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHC 253
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN NKPS+WTE+WT+ ++V+G P RS +DIA+ VA F +KNG++VNYYMYH
Sbjct: 254 GDTFPGPNKDNKPSLWTENWTTQFRVFGDPPAQRSVEDIAYSVARFFSKNGTHVNYYMYH 313
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A ++ T YYD APLDE+GL REPK+GHLK LH A+ LC + LL G V
Sbjct: 314 GGTNFGRTSAHYVTTRYYDDAPLDEFGLEREPKYGHLKHLHNALNLCKKALLWGQPRVEK 373
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
E +E+ + VCAAFL NN+ A + FR Y +P +SISILPDCKTV +NT
Sbjct: 374 PSNETEIRYYEQPGTKVCAAFLANNNTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNT 433
Query: 329 ERVSTQYNKR----SKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASD 384
+ + + R SK +N FD E I + GL KD SD
Sbjct: 434 GEIISHHTSRNFMKSKKANKNFDFKVFTESVPSKIKGDSFIPVELYGL------TKDESD 487
Query: 385 YFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
Y WYT F + ++ + L + S GH LH ++NGEY G+ HGSH+ SF +
Sbjct: 488 YGWYTTSFKIDDNDLSKKKGGKPNLRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKP 547
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-------QDKSFTNCSWGYQVG 491
V L++G N +L V G PDSG+++E + G V + D + N WG +VG
Sbjct: 548 VTLKEGENHLTMLGVLTGFPDSGSYMEHRYTGPRSVSILGLGSGTLDLTEEN-KWGNKVG 606
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
+ GE+L I++ GL KV W +TWY+T F AP A+ + MGKG WVN
Sbjct: 607 MEGERLGIHAEEGLKKVKWEKASGKEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVN 666
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
G+ +GRYW+SF + G P+Q + YH+PR+FLKP NLLV+
Sbjct: 667 GEGVGRYWMSFLSPLGQPTQIE--------------------YHIPRSFLKPKKNLLVIF 706
Query: 612 EEE-NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQ--RGDTDIKKFGKKPTVQP 668
EEE N P I + VC ++ ++ P + W R + TD T
Sbjct: 707 EEEPNVKPELIDFVIVNRDTVCSYIGENYTPSVRHWTRKNDQVQAITDDVHL----TANL 762
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF- 727
C KKIS + FASFGNP+G C + +GSC++ S+ VVE+ C+GK+ C IP+ F
Sbjct: 763 KCSGTKKISAVEFASFGNPNGTCGNFTLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFE 822
Query: 728 --GGDPCPGIHKALLVDAQC 745
D CP + K L V +C
Sbjct: 823 QDKKDSCPKVEKKLAVQVKC 842
>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
Length = 844
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/797 (42%), Positives = 464/797 (58%), Gaps = 80/797 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I +AK+GGL+ IQTYVFWN+HEPQ+G+++FSGR D+++FIK I+ G+YV LR+G
Sbjct: 70 MWPSIIKRAKQGGLNTIQTYVFWNVHEPQQGKFNFSGRADLVKFIKLIEKNGMYVTLRLG 129
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EWT+GGLP WL +V GI FR+DNKP+K
Sbjct: 130 PFIQAEWTHGGLPYWLREVPGIFFRTDNKPFKEHTERYVRMILDKMKEERLFASQGGPII 189
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ + G Y+ WA+K+ G+PWVMCKQ+DAP P+INACNG C
Sbjct: 190 LGQIENEYSAVQRAYKQDGLNYIKWASKLVDSMKLGIPWVMCKQNDAPDPMINACNGRHC 249
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN NKPS+WTE+WT+ ++V+G P RS +DIA+ VA F +KNGS+VNYYMYH
Sbjct: 250 GDTFPGPNKENKPSLWTENWTTQFRVFGDPPTQRSVEDIAYSVARFFSKNGSHVNYYMYH 309
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A ++ T YYD APLDEYGL REPK+GHLK LH+A+ LC +PLL G
Sbjct: 310 GGTNFGRTSAHYVTTRYYDDAPLDEYGLEREPKYGHLKHLHSALNLCKKPLLWGQPKTEK 369
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G+ E +E+ + CAAFL NN+ A T+ F+ Y + +SISILPDCKTV +NT
Sbjct: 370 PGKDTEIRYYEQPGTKTCAAFLANNNTEAAETIKFKGREYVIAPRSISILPDCKTVVYNT 429
Query: 329 ERVSTQYNKR----SKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASD 384
++ +Q+ R SK +N KFD E + + GL KD +D
Sbjct: 430 AQIVSQHTSRNFMKSKKANKKFDFKVFTETLPSKLEGNSYIPVELYGL------TKDKTD 483
Query: 385 YFWYT--FRFHYN----SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
Y WYT F+ H N + + + S GH LH ++NGEY GS HGSH+ SF +
Sbjct: 484 YGWYTTSFKVHKNHLPTKKGVKTFVRIASLGHALHIWLNGEYLGSGHGSHEEKSFVFQKQ 543
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCS-WGYQVGL 492
V L+ G N +L V G PDSG+++E + G V + + T S WG ++G+
Sbjct: 544 VTLKAGENHLIMLGVLTGFPDSGSYMEHRYTGPRGVSILGLTSGTLDLTESSKWGNKIGM 603
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
GEKL I++ GL KV W LTWY+ F AP + A+ + MGKG WVNG
Sbjct: 604 EGEKLGIHTEEGLKKVEWKKFTGKAPGLTWYQAYFDAPESLNAAAIRMNGMGKGLIWVNG 663
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+ +GRYW SF + G P+Q + YH+PR+FLKP NLLV+ E
Sbjct: 664 EGVGRYWQSFLSPLGQPTQIE--------------------YHIPRSFLKPKKNLLVIFE 703
Query: 613 EE-NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE N P + + VC +V ++ P + W R + + T++ C
Sbjct: 704 EEPNVKPELMDFVIVNRDTVCSYVGENYTPSVRHWTRKQDQVQAITDNVSLTATLK--CS 761
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF---G 728
KKI+ + FASFGNP G C + +G+C++ S+ V+E+ C+GK+ C IP+ F
Sbjct: 762 GTKKIAAVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDK 821
Query: 729 GDPCPGIHKALLVDAQC 745
D C + K L V +C
Sbjct: 822 KDSCKNVAKTLAVQVKC 838
>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
Length = 841
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/793 (44%), Positives = 462/793 (58%), Gaps = 58/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVI+TYVFWN HEP+ G+Y F G D++RF+K + GLYV LRIG
Sbjct: 58 MWPDLIQKAKEGGLDVIETYVFWNGHEPEPGKYYFEGNYDLVRFVKLVHQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYIPGISFRTDNAPFKFQMERFTRKIVNMMKAERLYESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MA+ TGVPWVMCKQDDAP P+IN CNG C
Sbjct: 178 LSQIENEYGPMEYELGAPGKAYSKWAAQMALGLGTGVPWVMCKQDDAPDPIINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K G+ +NYYMYH
Sbjct: 238 --DYFSPNKAYKPKMWTEAWTGWFTQFGGAVPHRPAEDMAFAVARFIQKGGALINYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+R+PKWGHLK+L+ AIKLC L++G V
Sbjct: 296 GGTNFGRTAGGPFIATSYDYDAPIDEYGLLRQPKWGHLKDLNRAIKLCEPALVSGDPIVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ SG CAAFL N + R TV F N+ Y +P SISILPDCK FNT
Sbjct: 356 RLGNYQEAHVFKSKSGACAAFLSNYNPRSYATVAFGNMHYNIPPWSISILPDCKNTVFNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q K S + W+ Y E +++ GLL+QI+ +DA+DY WY
Sbjct: 416 ARVGAQ-TAIMKMSPVPMHESFSWQAYNEEPASYNEKAFTTVGLLEQINTTRDATDYLWY 474
Query: 389 TFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T H ++ S L V S GH +H FVNG+ G+A+GS D T V+LR
Sbjct: 475 TTDVHIDANEGFLRSGKYPVLTVLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLR 534
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLS+ VGLP+ G E AG+ + + + T W Y++GL GE
Sbjct: 535 AGNNKIALLSIAVGLPNVGPHFEMWNAGILGPVNLNGLDEGRRDLTWQKWTYKIGLDGEA 594
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
+ ++S G + V W S+ + + LTW+KTTF APAGN P+AL++ SMGKG+ W+NGQS
Sbjct: 595 MSLHSLSGSSSVEWIQGSLVAQKQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQS 654
Query: 555 IGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW ++K S G+ Y N C + YHVPR++L PTGNLLV+ E
Sbjct: 655 LGRYWPAYK-STGSCGSCDYTGTYNEKKCSSNCG-EASQRWYHVPRSWLNPTGNLLVVFE 712
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P GI + + VC ++ N P L +W + + + K +P SC
Sbjct: 713 EWGGDPNGIHLVRRDVDSVCVNI-NEWQPTLMNW---QMQSSGKVNK-PLRPKAHLSCGP 767
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KIS + FASFG P+G+C + GSCH+ HS +R C+G++ C++ + FGGDPC
Sbjct: 768 GQKISSVKFASFGTPEGECGSFREGSCHAHHSYDAFQRTCVGQNFCTVTVAPEMFGGDPC 827
Query: 733 PGIHKALLVDAQC 745
P + K L V+ C
Sbjct: 828 PNVMKKLSVEVIC 840
>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
Length = 841
Score = 647 bits (1669), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/792 (45%), Positives = 459/792 (57%), Gaps = 56/792 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F D+++FIK IQ GLYV LRIG
Sbjct: 58 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLIQQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKAQMQRFTTKIVNMMKAERLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MA+ TGVPWVMCKQDDAP P+INACNG C
Sbjct: 178 LSQIENEYGPMEYELGAPGKVYTDWAAHMALGLGTGVPWVMCKQDDAPDPIINACNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT +Y +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 238 --DYFSPNKAYKPKMWTEAWTGWYTEFGGAVPSRPAEDLAFSVARFIQKGGSFINYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++ V
Sbjct: 296 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSADPTVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ SG CAAFL N + R V F N+ Y LP SISILPDCK +NT
Sbjct: 356 PLGTYQEAHVFKSKSGACAAFLANYNPRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q + + K + W+ Y + + +T GLL+QI+ +D+SDY WY
Sbjct: 416 ARVGAQ-SAQMKMPRVPLHGAFSWQAYNDETATYADTSFTTAGLLEQINTTRDSSDYLWY 474
Query: 389 TFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ S L + S GH L F+NG+ G+++GS + T V+LR
Sbjct: 475 LTDVKIDPNEEFLRSGKYPVLTILSAGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLR 534
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLS+ VGLP+ G E AGV + + + + W Y+VGL GE
Sbjct: 535 AGINQIALLSIAVGLPNVGPHFETWNAGVLGPVILNGLNEGRRDLSWQKWSYKVGLKGEA 594
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ + + LTWYKTTF APAGN P+AL++ SMGKG+ W+NG+S
Sbjct: 595 LSLHSLSGSSSVEWIQGSLVTRRQPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRS 654
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEE 613
IGRYW ++K S G+ YA + +A+ YHVPR +L PTGNLLV+LEE
Sbjct: 655 IGRYWPAYKAS-GSCGACNYAGSYHEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEE 713
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P GI + I +C + P L SW + + +KK +P SC G
Sbjct: 714 WGGDPNGIFLVRREIDSICADIYEWQ-PNLMSW---QMQASGKVKK-PVRPKAHLSCGPG 768
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
+KIS I FASFG P+G C + GSCH+ +S +R+CIG++ CS+ + FGGDPCP
Sbjct: 769 QKISSIKFASFGTPEGGCGSFREGSCHAHNSYDAFQRSCIGQNSCSVTVAPENFGGDPCP 828
Query: 734 GIHKALLVDAQC 745
+ K L V+A C
Sbjct: 829 NVMKKLSVEAIC 840
>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 846
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/793 (44%), Positives = 458/793 (57%), Gaps = 58/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++F+K + GLYV LRIG
Sbjct: 63 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL + GI FR+DN P+K
Sbjct: 123 PYICAEWNFGGFPVWLKYIPGINFRTDNGPFKAQMQKFTTKIVNMMKAERLFETQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQDDAP P+IN CNG C
Sbjct: 183 LSQIENEYGPMEYEIGSPGKAYTKWAAEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 243 --DYFSPNKAYKPKMWTEAWTGWFTQFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYH 300
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G VI
Sbjct: 301 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVI 360
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF +G CAAFL N +R V FRN+ Y LP SISILPDCK +NT
Sbjct: 361 PLGNYQEAHVFNYKAGGCAAFLANYHQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNT 420
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q + R K + + W+ Y E ++ GLL+QI+ +D SDY WY
Sbjct: 421 ARVGAQ-SARMKMTPVPMHGGFSWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWY 479
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
H + S + + P L V S GH LH F+NG+ +G+A+GS D T V LR
Sbjct: 480 MTDVHIDPSEGFLRSGKYPVLGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLR 539
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS+ VGLP+ G E AG+ + + + + W Y++GL GE
Sbjct: 540 AGVNKISLLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEA 599
Query: 497 LQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W+ S+ + + L+WYKTTF APAGN P+AL++ SMGKG+ W+NGQ
Sbjct: 600 LGLHSISGSSSVEWAEGSLVAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQH 659
Query: 555 IGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GR+W ++K S G Y N C + YHVP+++LKPTGNLLV+ E
Sbjct: 660 VGRHWPAYKAS-GTCGDCSYIGTYNEKKCSTNCG-EASQRWYHVPQSWLKPTGNLLVVFE 717
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P GI++ + VC + P L + ++ + + K +P SC
Sbjct: 718 EWGGDPNGISLVRRDVDSVCADIYEWQ-PTL---MNYQMQASGKVNK-PLRPKAHLSCGP 772
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KI I FASFG P+G C Y GSCH+ HS C+G++ CS+ + FGGDPC
Sbjct: 773 GQKIRSIKFASFGTPEGVCGSYRQGSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFGGDPC 832
Query: 733 PGIHKALLVDAQC 745
+ K L V+A C
Sbjct: 833 LNVMKKLAVEAIC 845
>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
Length = 839
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/793 (44%), Positives = 458/793 (57%), Gaps = 58/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++F+K + GLYV LRIG
Sbjct: 56 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL + GI FR+DN P+K
Sbjct: 116 PYICAEWNFGGFPVWLKYIPGINFRTDNGPFKAQMQKFTTKVVNMMKAERLFETQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQDDAP P+IN CNG C
Sbjct: 176 LSQIENEYGPMEYEIGSPGKAYTKWAAEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 236 --DYFSPNKAYKPKMWTEAWTGWFTQFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G VI
Sbjct: 294 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVI 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF +G CAAFL N +R V FRN+ Y LP SISILPDCK +NT
Sbjct: 354 PLGNYQEAHVFNYKAGGCAAFLANYHQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q + R K + + W+ Y E ++ GLL+QI+ +D SDY WY
Sbjct: 414 ARVGAQ-SARMKMTPVPMHGGFSWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWY 472
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
H + S + + P L V S GH LH F+NG+ +G+A+GS D T V LR
Sbjct: 473 MTDVHIDPSEGFLRSGKYPVLGVLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLR 532
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS+ VGLP+ G E AG+ + + + + W Y++GL GE
Sbjct: 533 AGVNKISLLSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEA 592
Query: 497 LQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W+ S+ + + L+WYKTTF APAGN P+AL++ SMGKG+ W+NGQ
Sbjct: 593 LGLHSISGSSSVEWAEGSLVAQRQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQH 652
Query: 555 IGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GR+W ++K S G Y N C + YHVP+++LKPTGNLLV+ E
Sbjct: 653 VGRHWPAYKAS-GTCGDCSYIGTYNEKKCSTNCG-EASQRWYHVPQSWLKPTGNLLVVFE 710
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P GI++ + VC + P L + ++ + + K +P SC
Sbjct: 711 EWGGDPNGISLVRRDVDSVCADIYEWQ-PTL---MNYQMQASGKVNK-PLRPKAHLSCGP 765
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KI I FASFG P+G C Y GSCH+ HS C+G++ CS+ + FGGDPC
Sbjct: 766 GQKIRSIKFASFGTPEGVCGSYRQGSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFGGDPC 825
Query: 733 PGIHKALLVDAQC 745
+ K L V+A C
Sbjct: 826 LNVMKKLAVEAIC 838
>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 841
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/808 (43%), Positives = 456/808 (56%), Gaps = 90/808 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y+F GR D++RFIK +Q G++V LRIG
Sbjct: 57 MWDGLIEKAKDGGLDVIQTYVFWNGHEPTPGNYNFEGRYDLVRFIKTVQKAGMFVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVGMMKSENLFASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK+DDAP PVINACNG C
Sbjct: 177 LSQIENEYGPEGKEFGAAGKAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TF PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K GS++NYYMYH
Sbjct: 237 -DTFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL REPK+GHLKELH A+KLC +PL++ V
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF +SG CAAFL N + V+F N +Y LP SISILPDCK V FNT
Sbjct: 355 TLGSMQEAHVFRSSSG-CAAFLANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
V Q N+ ++ S WE+Y E + + LL + GLL+Q++ +D SDY W
Sbjct: 414 ATVGVQTNQMQMWADGA--SSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLW 471
Query: 388 YTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S L VQS GH LH F+NG+ GSA+G+ ++ + +L
Sbjct: 472 YITSVEVDPSEKFLQGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANL 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R GTN ALLSV GLP+ G E GV H + + T +W YQVGL GE
Sbjct: 532 RAGTNKVALLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGE 591
Query: 496 KLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
++ + S G V W S + + L WY+ F P+G++P+AL++ SMGKG+ W+NG
Sbjct: 592 QMNLNSLEGSGSVEWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWING 651
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-----------YHVPRAFL 601
QSIGRYW T YA H+ +A YHVPR++L
Sbjct: 652 QSIGRYW------------TAYAEGDCKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWL 699
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
+PT NLLV+ EE G+ I + + VC V+ H P + +W I+ +G
Sbjct: 700 QPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKNW---------QIESYG 749
Query: 662 K----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSR 717
+ V C G+ IS I FASFG P G C + G CHS +S V+E+ CIG R
Sbjct: 750 EPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINSNSVLEKKCIGLQR 809
Query: 718 CSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C + + FGGDPCP + K + V+A C
Sbjct: 810 CVVAISPSNFGGDPCPEVMKRVAVEAVC 837
>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
Precursor
gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
Length = 845
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/797 (42%), Positives = 461/797 (57%), Gaps = 80/797 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I +AK+GGL+ IQTYVFWN+HEPQ+G+++FSGR D+++FIK IQ G+YV LR+G
Sbjct: 71 MWPSIIKRAKQGGLNTIQTYVFWNVHEPQQGKFNFSGRADLVKFIKLIQKNGMYVTLRLG 130
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EWT+GGLP WL +V GI FR+DNK +K
Sbjct: 131 PFIQAEWTHGGLPYWLREVPGIFFRTDNKQFKEHTERYVRMILDKMKEERLFASQGGPII 190
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ + G Y+ WA+ + G+PWVMCKQ+DAP P+INACNG C
Sbjct: 191 LGQIENEYSAVQRAYKQDGLNYIKWASNLVDSMKLGIPWVMCKQNDAPDPMINACNGRHC 250
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN NKPS+WTE+WT+ ++V+G P RS +DIA+ VA F +KNG++VNYYMYH
Sbjct: 251 GDTFPGPNRENKPSLWTENWTTQFRVFGDPPTQRSVEDIAYSVARFFSKNGTHVNYYMYH 310
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A ++ T YYD APLDEYGL +EPK+GHLK LH A+ LC +PLL G
Sbjct: 311 GGTNFGRTSAHYVTTRYYDDAPLDEYGLEKEPKYGHLKHLHNALNLCKKPLLWGQPKTEK 370
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G+ E +E+ + CAAFL NN+ A T+ F+ Y + +SISILPDCKTV +NT
Sbjct: 371 PGKDTEIRYYEQPGTKTCAAFLANNNTEAAETIKFKGREYVIAPRSISILPDCKTVVYNT 430
Query: 329 ERVSTQYNKR----SKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASD 384
++ +Q+ R SK +N KFD E + + GL KD +D
Sbjct: 431 AQIVSQHTSRNFMKSKKANKKFDFKVFTETLPSKLEGNSYIPVELYGL------TKDKTD 484
Query: 385 YFWYT--FRFHYN----SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
Y WYT F+ H N + + + S GH LHA++NGEY GS HGSH+ SF +
Sbjct: 485 YGWYTTSFKVHKNHLPTKKGVKTFVRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQ 544
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCS-WGYQVGL 492
V L+ G N +L V G PDSG+++E + G + + + T S WG ++G+
Sbjct: 545 VTLKAGENHLVMLGVLTGFPDSGSYMEHRYTGPRGISILGLTSGTLDLTESSKWGNKIGM 604
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
GEKL I++ GL KV W LTWY+T F AP + + MGKG WVNG
Sbjct: 605 EGEKLGIHTEEGLKKVEWKKFTGKAPGLTWYQTYFDAPESVSAATIRMHGMGKGLIWVNG 664
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+ +GRYW SF + G P+Q + YH+PR+FLKP NLLV+ E
Sbjct: 665 EGVGRYWQSFLSPLGQPTQIE--------------------YHIPRSFLKPKKNLLVIFE 704
Query: 613 EE-NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE N P + + VC +V ++ P + W R + + T++ C
Sbjct: 705 EEPNVKPELMDFAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLK--CS 762
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF---G 728
KKI+ + FASFGNP G C + +G+C++ S+ V+E+ C+GK+ C IP+ F
Sbjct: 763 GTKKIAAVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDK 822
Query: 729 GDPCPGIHKALLVDAQC 745
D C + K L V +C
Sbjct: 823 KDSCKNVVKMLAVQVKC 839
>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 848
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/794 (43%), Positives = 463/794 (58%), Gaps = 60/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLD I TYVFWNLHEP G Y+F GRND++RFIK + GLYV LRIG
Sbjct: 61 MWEGLIQKAKDGGLDAIDTYVFWNLHEPSPGNYNFEGRNDLVRFIKTVHKAGLYVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I SEW +GG P+WL V GI FR+DN+P+K
Sbjct: 121 PYICSEWNFGGFPVWLKFVPGISFRTDNEPFKSAMQKFTQKVVQLMKNEKLFESQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+ AF G Y+ WAAKMAV TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 181 LSQIENEYEPESKAFGASGYAYMTWAAKMAVGMGTGVPWVMCKEDDAPDPVINTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE W+ ++ +GG Y R +D+ F VA FI K GS++NYYMYH
Sbjct: 241 --DYFSPNKPYKPTMWTEAWSGWFTEFGGPIYQRPVEDLTFAVARFIQKGGSFINYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R PK+GHLKELH A+KLC LL V
Sbjct: 299 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRRPKYGHLKELHKAVKLCELALLNADPTVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG ++A VF SG A FL N + + A V F N+++ LP SISILPDCK VAFNT
Sbjct: 359 TLGSYEQAHVFSSKSGSGAVFLSNFNTKSATKVTFNNMNFHLPPWSISILPDCKNVAFNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSD-EKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
RV Q S+T L+ +S+ W + E + + +T + GLLDQ++ +D+SDY
Sbjct: 419 ARVGVQ---TSQTQLLRTNSELHSWGIFNEDVSSVAGDTTITVTGLLDQLNITRDSSDYL 475
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WYT + S + Q P L VQS G +H F+N + +GSA G+ ++ FT V+
Sbjct: 476 WYTTSVDIDPSESFLGGGQHPSLTVQSAGDAMHVFINDQLSGSASGTREHRRFTFTGNVN 535
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L G N +LLS+ VGL ++G E + GV H + + + W YQVGL G
Sbjct: 536 LHAGLNKISLLSIAVGLANNGPHFETRNTGVLGPVALHGLDHGTRDLSWQKWSYQVGLKG 595
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + S ++ V W S + + LTWYK F P G++P+AL++ SMGKG+ W+N
Sbjct: 596 EATNLDSPNSISAVDWMTGSLVAQKQQPLTWYKAYFDEPNGDEPLALDMGSMGKGQVWIN 655
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
GQSIGRYW + S + + T F YHVPR++LKP+ NLLV+
Sbjct: 656 GQSIGRYWTIYADSDCS-ACTYSGTFRPKKCQFGCQHPTQQWYHVPRSWLKPSKNLLVVF 714
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+ + + ++ VC V+ +H P +++W G T+++ +KP + C
Sbjct: 715 EEIGGDVSKVALVKKSVTSVCAEVSENH-PRITNW-HTESHGQTEVQ---QKPEISLHCT 769
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G IS I F+SFG P G C ++ G+CH+ +S V+++ C+GK +CS+ + + FG DP
Sbjct: 770 DGHSISAIKFSSFGTPSGSCGKFQHGTCHAPNSNAVLQKECLGKQKCSVTISNTNFGADP 829
Query: 732 CPGIHKALLVDAQC 745
CP K L V+A C
Sbjct: 830 CPSKLKKLSVEAVC 843
>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 672
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 320/604 (52%), Positives = 399/604 (66%), Gaps = 43/604 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIA AK+GGLDVIQTYVFWN+HEP +GQY+F GR D+++FI+EIQ+QGLYV LRIG
Sbjct: 70 MWPKLIANAKKGGLDVIQTYVFWNVHEPVQGQYNFQGRYDLVKFIREIQTQGLYVSLRIG 129
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EW YGG P WLHDV I FR+DN+P+K
Sbjct: 130 PFIEAEWKYGGFPFWLHDVPNITFRTDNEPFKQHMQRFVTQIVNMMKHEGLYYPQGGPII 189
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +EPAF GP YV WAA+MAV TGVPW+MCKQ+DAP P+IN CNG+ C
Sbjct: 190 ISQIENEYQMVEPAFGSGGPRYVRWAAEMAVGLQTGVPWMMCKQNDAPDPIINTCNGLIC 249
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA-KNGSYVNYYMY 208
GETF GPNSP KP++WTE+WT+ Y ++G +RS +DIAF VALFIA K GS+V+YYMY
Sbjct: 250 GETFVGPNSPTKPALWTENWTTRYPIYGNDTKLRSTEDIAFAVALFIARKKGSFVSYYMY 309
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNFGR A++++ T YYD APLDEYGL+ P WGHL+ELHAA+KL S LL G +
Sbjct: 310 HGGTNFGRFASSYVTTSYYDGAPLDEYGLIWRPTWGHLRELHAAVKLSSEALLFGRYSNF 369
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA +F ET C AFLVN D+ + TV+FRNI ++L KSIS+L +C+TV F T
Sbjct: 370 SLGPEQEAHIF-ETELKCVAFLVNFDKHQTPTVVFRNIYFQLAPKSISVLSECRTVVFET 428
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFW 387
RV+ QY R+ + W+ ++E I + + L + +S KD +DY W
Sbjct: 429 ARVNAQYGSRTAEVVESLNDIHTWKAFKEPIPEDISKAVYTGNQLFEHLSMTKDETDYLW 488
Query: 388 YTFRFHYNSSN--AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT-VHLRQG 444
Y + Y S+ L+V+S H+LHAFVN EY GS HGSHD + NT + L +G
Sbjct: 489 YIVSYEYIPSDDGQLVLLNVESRAHVLHAFVNTEYAGSVHGSHDGPGNIILNTNISLNEG 548
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQI 499
N +LLSV VG PDSGA +ER+ G+H+V +Q N W YQVGL GE +I
Sbjct: 549 QNTISLLSVMVGSPDSGAHMERRSFGIHKVSIQQGQQPLHLLNNELWAYQVGLYGEANRI 608
Query: 500 YSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
Y+ + W+ I + T TWYKTTF P GND +ALNL SMGKGE WVNG+S+GRY
Sbjct: 609 YTQEESSSAEWTEINNLTYHPFTWYKTTFATPVGNDVVALNLTSMGKGEVWVNGESLGRY 668
Query: 559 WVSF 562
WVSF
Sbjct: 669 WVSF 672
>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
Length = 843
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/798 (44%), Positives = 461/798 (57%), Gaps = 69/798 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLDVI+TYVFWN+HEP G Y+F GRND++RFI+ + GLY LRIG
Sbjct: 56 MWEDLIYKAKEGGLDVIETYVFWNVHEPSPGNYNFEGRNDLVRFIQTVHKAGLYAHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR DN+P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGISFRQDNEPFKKAMQGFTEKIVGMMKSERLYESQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G Y+ WAAKMAV+ TGVPW+MCK+DDAP PVIN CNG C
Sbjct: 176 LSQIENEYGAQSKMLGPVGYNYMSWAAKMAVEMGTGVPWIMCKEDDAPDPVINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 236 -DKFT-PNKPYKPTMWTEAWSGWFSEFGGPIHKRPVQDLAFAVARFIQKGGSFVNYYMYH 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL+R+PK+GHLKELH AIK+C + L++ V
Sbjct: 294 GGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQPKYGHLKELHKAIKMCEKALISTDPVVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A+V+ SG C+AFL N D + + V+F N+ Y LP S+SILPDC+ FNT
Sbjct: 354 SLGNFQQAYVYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWSVSILPDCRNAVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V Q S+ L +S+ WE + E + T + A GLL+QI+ +D SDY W
Sbjct: 414 AKVGVQ---TSQMQMLPTNSERFSWESFEEDTSSSSATTITASGLLEQINVTRDTSDYLW 470
Query: 388 YTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y SS + + P L VQS GH +H F+NG +GSA+G+ ++ F V+L
Sbjct: 471 YITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFINGRLSGSAYGTREDRRFRYTGDVNL 530
Query: 442 RQGTNDGALLSVTVGLPDSGAFLE---RKVAGVHRVRVQDKSFTNCS---WGYQVGLIGE 495
R GTN ALLSV VGLP+ G E + G + DK + S W YQVGL GE
Sbjct: 531 RAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHGLDKGKLDLSWQKWTYQVGLKGE 590
Query: 496 KLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
+ + S G++ V W + + + LTW+KT F AP G +P+AL++ MGKG+ W+NG
Sbjct: 591 AMNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPEGEEPLALDMDGMGKGQIWING 650
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
SIGRYW + T N + C YHVPR++LK NLLV+ E
Sbjct: 651 ISIGRYWTAIATGSCNDCNYAGSFRPPKCQLGCG-QPTQRWYHVPRSWLKQNHNLLVVFE 709
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK-----PTVQ 667
E G+P I++ ++ VC V+ H P L +W I +GK P V
Sbjct: 710 ELGGDPSKISLAKRSVSSVCADVSEYH-PNLKNW---------HIDSYGKSENFRPPKVH 759
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
C G+ IS I FASFG P G C Y G+CHSS S ++E+ CIGK RC + + + F
Sbjct: 760 LHCNPGQAISSIKFASFGTPLGTCGSYEQGACHSSSSYDILEQKCIGKPRCIVTVSNSNF 819
Query: 728 GGDPCPGIHKALLVDAQC 745
G DPCP + K L V+A C
Sbjct: 820 GRDPCPNVLKRLSVEAVC 837
>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 845
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/798 (45%), Positives = 459/798 (57%), Gaps = 68/798 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D++RFIK +Q GLYV LRIG
Sbjct: 62 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 122 PYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MAV TGVPW+MCKQ+DAP P+IN CNG C
Sbjct: 182 LSQIENEYGPMEYEIGAPGRAYTQWAAHMAVGLGTGVPWIMCKQEDAPDPIINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF +A FI K GS+VNYYMYH
Sbjct: 242 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPHRPAEDLAFSIARFIQKGGSFVNYYMYH 299
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL R+PKWGHLK+LH AIKLC L++G V
Sbjct: 300 GGTNFGRTAGGPFIATSYDYDAPLDEYGLPRQPKWGHLKDLHRAIKLCEPALVSGDPTVQ 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +EA VF SG CAAFL N + + TV F N Y LP SISILP+CK +NT
Sbjct: 360 QLGNYEEAHVFRSKSGACAAFLANYNPQSYATVAFGNQRYNLPPWSISILPNCKHTVYNT 419
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q + K + + W+ + E D++ GLL+QI+A +D SDY WY
Sbjct: 420 ARVGSQ-STTMKMTRVPIHGGLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWY 478
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ NS+ N + P L V S GH LH F+N + +G+A+GS + T +V LR
Sbjct: 479 STDVVINSNEGFLRNGKNPVLTVLSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLR 538
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLSV VGLP+ G ER AGV + + T W Y+VGL GE
Sbjct: 539 AGVNKISLLSVAVGLPNVGPHFERWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEA 598
Query: 497 LQIYSNLGLNKVLWSS--IRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W + S + LTWYKTTF APAG P+AL++ SMGKG+ W+NGQS
Sbjct: 599 LNLHSLSGSSSVEWLQGFLVSRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQS 658
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW ++K S G+ YA N C + YHVP ++LKPTGNLLV+ E
Sbjct: 659 LGRYWPAYKAS-GSCGYCNYAGTYNEKKCGSNCG-QASQRWYHVPHSWLKPTGNLLVVFE 716
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPTVQ 667
E G+P GI + I VC + P L S+ D++ GK +P
Sbjct: 717 ELGGDPNGIFLVRRDIDSVCADIYEWQ-PNLVSY---------DMQASGKVRSPVRPKAH 766
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
SC G+KIS I FASFG P G C Y GSCH+ S ++ C+G+S C++ + F
Sbjct: 767 LSCGPGQKISSIKFASFGTPVGSCGNYREGSCHAHKSYDAFQKNCVGQSWCTVTVSPEIF 826
Query: 728 GGDPCPGIHKALLVDAQC 745
GGDPCP + K L V+A C
Sbjct: 827 GGDPCPSVMKKLSVEAIC 844
>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
Length = 818
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/807 (43%), Positives = 464/807 (57%), Gaps = 74/807 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K GGLD+I+TYVFW+LHEP +GQYDF GR D++RFIK + GLYV LRIG
Sbjct: 23 MWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGRKDLVRFIKTVGEAGLYVHLRIG 82
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW YGG P+WLH + GI FR+DNKP+K
Sbjct: 83 PYACAEWNYGGFPLWLHFIPGIKFRTDNKPFKDEMQRFTTKIVDLMKQENLYASQGGPII 142
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ WAA MA TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 143 LSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGVPWVMCQQTDAPDPIINTCNGFYC 202
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP IWTE+W+ ++ +GG R +D+AF VA F + G++ NYYMY
Sbjct: 203 DQF--SPNSNNKPKIWTENWSGWFLSFGGPVPQRPVEDLAFAVARFFQRGGTFQNYYMYT 260
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
G NFG T+ F+ T Y AP+DEYG+ R+PKWGHLKELH AIKLC L+ + +
Sbjct: 261 WGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGHLKELHKAIKLCEPALVATDHHTL 320
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA V++ SGVCAAFL N + TV F SY LP S+SILPDC+TV FNT
Sbjct: 321 RLGPNLEAHVYKTASGVCAAFLANIGTQSDATVTFNGKSYSLPAWSVSILPDCRTVVFNT 380
Query: 329 ERVSTQ--------YNKRSKTSNLKFDSDE----KWEEYREAILNFDNTLLRAEGLLDQI 376
++++Q N S TS+ + S E W E + + +R GLL+QI
Sbjct: 381 AQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSDWSFVIEPVGISKSNAIRKTGLLEQI 440
Query: 377 SAAKDASDYFWYTFRFHYN------SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDN 430
+ D SDY WY+ + S+ Q+ L +S GH+LHAFVNG+ GS G+ N
Sbjct: 441 NTTADVSDYLWYSISIAIDGDEPFLSNGTQSNLHAESLGHVLHAFVNGKLAGSGIGNSGN 500
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH---RVRVQDKS--FTNCS 485
+ L G N LLS TVGL + GAF + AG+ +++ Q+ + ++ +
Sbjct: 501 AKIIFEKLIMLTPGNNSIDLLSATVGLQNYGAFFDLMGAGITGPVKLKGQNGTLDLSSNA 560
Query: 486 WGYQVGLIGEKLQIYSNLG-LNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMG 544
W YQ+GL GE L ++ N G +++ + S + L WYKTTF AP GNDP+A++ MG
Sbjct: 561 WTYQIGLKGEDLSLHENSGDVSQWISESTLPKNQPLIWYKTTFNAPDGNDPVAIDFTGMG 620
Query: 545 KGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-----YHVPRA 599
KGEAWVNGQSIGRYW ++ + + S A N IK YHVPR+
Sbjct: 621 KGEAWVNGQSIGRYWPTYSSPQNGCST---ACNYRGPYSASKCIKNCGKPSQILYHVPRS 677
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
F++ N LVL EE G+P I++ T + +C HV+ SH P+ +WL +Q+G KK
Sbjct: 678 FIQSESNTLVLFEEMGGDPTQISLATKQMTSLCAHVSESHPAPVDTWLSLQQKG----KK 733
Query: 660 FGKKPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
G PT+Q CP + IS I FASFG P G C + C S+ VV++AC+G RC
Sbjct: 734 SG--PTIQLECPYPNQVISSIKFASFGTPSGMCGSFNHSQCSSASVLAVVQKACVGSKRC 791
Query: 719 SIPLLSRYFGGDPCPGIHKALLVDAQC 745
S+ + S+ GDPC G+ K+L V+A C
Sbjct: 792 SVGISSKTL-GDPCRGVIKSLAVEAAC 817
>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
Length = 835
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/795 (44%), Positives = 461/795 (57%), Gaps = 64/795 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGG+DVIQTYVFWN HEP++G+Y F R D+++FIK +Q GLYV LRIG
Sbjct: 54 MWPDLIQKAKEGGVDVIQTYVFWNGHEPEEGKYYFEERYDLVKFIKVVQEAGLYVHLRIG 113
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL V GI FR++N+P+K
Sbjct: 114 PYACAEWNFGGFPVWLKYVPGISFRTNNEPFKAAMQKFTTKIVDMMKAEKLYETQGGPII 173
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E E G Y WAAKMAVD TGVPW+MCKQDD P P+IN CNG C
Sbjct: 174 LSQIENEYGPMEWELGEPGKVYSEWAAKMAVDLGTGVPWIMCKQDDVPDPIINTCNGFYC 233
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN NKP +WTE WT+++ +GG R A+D+AF VA FI GS++NYYMYH
Sbjct: 234 --DYFTPNKANKPKMWTEAWTAWFTEFGGPVPYRPAEDMAFAVARFIQTGGSFINYYMYH 291
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ F+ T Y APLDE+G +R+PKWGHLK+LH AIKLC L++ V
Sbjct: 292 GGTNFGRTSGGPFIATSYDYDAPLDEFGSLRQPKWGHLKDLHRAIKLCEPALVSVDPTVT 351
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA VF+ SG CAAFL N ++ V F N+ Y LP SISILPDCK +NT
Sbjct: 352 SLGNYQEARVFKSESGACAAFLANYNQHSFAKVAFGNMHYNLPPWSISILPDCKNTVYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q + T + S WE + E + ++ GLL+QI+ +D SDY WY
Sbjct: 412 ARVGAQSAQMKMTPVSRGFS---WESFNEDAASHEDDTFTVVGLLEQINITRDVSDYLWY 468
Query: 389 TFRFHYN------SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ +S L V S GH LH FVNG+ G+ +GS +N T N ++LR
Sbjct: 469 MTDIEIDPTEGFLNSGNWPWLTVFSAGHALHVFVNGQLAGTVYGSLENPKLTFSNGINLR 528
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS+ VGLP+ G E AGV + + + T W Y+VGL GE
Sbjct: 529 AGVNKISLLSIAVGLPNVGPHFETWNAGVLGPVSLNGLNEGTRDLTWQKWFYKVGLKGEA 588
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G V W S+ + + L+WYKTTF AP GN+P+AL++ +MGKG+ W+NGQS
Sbjct: 589 LSLHSLSGSPSVEWVEGSLVAQKQPLSWYKTTFNAPDGNEPLALDMNTMGKGQVWINGQS 648
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GR+W ++K+S G+ S Y + + C + YHVPR++L PTGNLLV+ E
Sbjct: 649 LGRHWPAYKSS-GSCSVCNYTGWFDEKKCLTNCG-EGSQRWYHVPRSWLYPTGNLLVVFE 706
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK--KPTVQPSC 670
E G+P GIT+ I VC + P L +W R KF + +P C
Sbjct: 707 EWGGDPYGITLVKREIGSVCADIYEWQ-PQLLNWQRLVS------GKFDRPLRPKAHLKC 759
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
G+KIS I FASFG P+G C + GSCH+ S ++ C+GK CS+ + FGGD
Sbjct: 760 APGQKISSIKFASFGTPEGVCGNFQQGSCHAPRSYDAFKKNCVGKESCSVQVTPENFGGD 819
Query: 731 PCPGIHKALLVDAQC 745
PC + K L V+A C
Sbjct: 820 PCRNVLKKLSVEAIC 834
>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
Length = 853
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/794 (44%), Positives = 464/794 (58%), Gaps = 60/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ KAK+GGLDVI TYVFWN+HEP G Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 58 MWEDLVQKAKDGGLDVIDTYVFWNVHEPSPGNYNFEGRFDLVRFIKTVQKGGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKDERLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY AF G Y+ WAA+MAV TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 178 FSQIENEYGPESRAFGAAGHSYINWAAQMAVGLKTGVPWVMCKEDDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 238 -DAFS-PNKPYKPTMWTEAWSGWFTEFGGAFHHRPVQDLAFAVARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+REPK+GHLKELH AIKLC L++ +
Sbjct: 296 GGTNFGRSAGGPFITTSYDYDAPIDEYGLIREPKYGHLKELHRAIKLCEHELVSSDPTIT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG Q+A VF C+AFL N + A V+F N+ Y LP SISILPDC+ V FNT
Sbjct: 356 LLGTYQQAHVFSSGKRSCSAFLANYHTQSAARVMFNNMHYVLPPWSISILPDCRNVVFNT 415
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNFD-NTLLRAEGLLDQISAAKDASDYF 386
+V Q + + + +F S WE Y E I + ++ + A GL++QI+ +D +DY
Sbjct: 416 AKVGVQTSHVQMLPTGSRFFS---WESYDEDISSLGASSRMTALGLMEQINVTRDTTDYL 472
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + N S + Q P L V+S GH LH F+NG+++GSA G+ +N FT V+
Sbjct: 473 WYITSVNINPSESFLRGGQWPTLTVESAGHALHVFINGQFSGSAFGTRENREFTFTGPVN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR GTN ALLS+ VGLP+ G E G+ H + +K T W YQVGL G
Sbjct: 533 LRAGTNRIALLSIAVGLPNVGVHYETWKTGILGPVMLHGLNQGNKDLTWQQWSYQVGLKG 592
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E + + S + V W TRQ L WYK F AP GN+P+AL+++SMGKG+ W+NG
Sbjct: 593 EAMNLVSPNRASSVDWIQGSLATRQQPLKWYKAYFDAPGGNEPLALDMRSMGKGQVWING 652
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLL 611
QSIGRYW+S+ +KG+ S Y+ + T YHVPR++LKP NLLV+
Sbjct: 653 QSIGRYWLSY--AKGDCSSCGYSGTFRPPKCQLGCGQPTQRWYHVPRSWLKPKQNLLVIF 710
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+ I++ + VC H P + + + + + ++ + V C
Sbjct: 711 EELGGDASKISLVKRSTTSVCADAFEHH-PTIEN---YNTESNGESERNLHQAKVHLRCA 766
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I FASFG P G C + G+CH+ +S VVE+ CIG+ C + + + FG DP
Sbjct: 767 PGQSISAINFASFGTPTGTCGSFQEGTCHAPNSHSVVEKKCIGRESCMVAISNSNFGADP 826
Query: 732 CPGIHKALLVDAQC 745
CP K L V+A C
Sbjct: 827 CPSKLKKLSVEAVC 840
>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
Length = 843
Score = 641 bits (1654), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/810 (43%), Positives = 457/810 (56%), Gaps = 92/810 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y+F GR D++RFIK +Q G++V LRIG
Sbjct: 57 MWDGLIEKAKDGGLDVIQTYVFWNGHEPTPGNYNFEGRYDLVRFIKTVQKAGMFVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVGMMKSENLFASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK+DDAP PVINACNG C
Sbjct: 177 LSQIENEYGPEGKEFGAAGKAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TF PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K GS++NYYMYH
Sbjct: 237 -DTFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL REPK+GHLKELH A+KLC +PL++ V
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF +SG CAAFL N + V+F N +Y LP SISILPDCK V FNT
Sbjct: 355 TLGSMQEAHVFRSSSG-CAAFLANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
V Q N+ ++ S WE+Y E + + LL + GLL+Q++ +D SDY W
Sbjct: 414 ATVGVQTNQMQMWADGA--SSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLW 471
Query: 388 YTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y R + S L VQS GH LH F+NG+ GSA+G+ ++ + +L
Sbjct: 472 YITRVEVDPSEKFLQGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANL 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGY--QVGLI 493
R GTN ALLSV GLP+ G E GV H + + T +W Y QVGL
Sbjct: 532 RAGTNKVALLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQFQVGLK 591
Query: 494 GEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE++ + S G V W S + + L WY+ F P+G++P+AL++ SMGKG+ W+
Sbjct: 592 GEQMNLNSLEGSGSVEWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWI 651
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-----------YHVPRA 599
NGQSIGRYW T YA H+ +A YHVPR+
Sbjct: 652 NGQSIGRYW------------TAYAEGDCKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRS 699
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
+L+PT NLLV+ EE G+ I + + VC V+ H P + +W I+
Sbjct: 700 WLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKNW---------QIES 749
Query: 660 FGK----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGK 715
+G+ V C G+ IS I FASFG P G C + G CHS +S V+E+ CIG
Sbjct: 750 YGEPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINSNSVLEKKCIGL 809
Query: 716 SRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
RC + + FGGDPCP + K + V+A C
Sbjct: 810 QRCVVAISPSNFGGDPCPEVMKRVAVEAVC 839
>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
Length = 898
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/800 (44%), Positives = 468/800 (58%), Gaps = 72/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW +I KAK+GGLDV++TYVFWN+HEP G Y+F GR D++RFI+ +Q GLY LRIG
Sbjct: 111 MWEDIIQKAKDGGLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIG 170
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 171 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPII 230
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + G Y+ WAA MAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 231 LSQIENEYGVQSKLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC 290
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP+IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 291 -DAFS-PNKPYKPTIWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYH 348
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGLVR+PK+GHLKELH +IKLC R L++ V
Sbjct: 349 GGTNFGRTAGGPFITTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVS 408
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A V+ +G CAAFL N D + + V+F N+ Y LP SISILPDC+ FNT
Sbjct: 409 SLGSFQQAHVYSSDAGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNT 468
Query: 329 ERVSTQY-NKRSKTSNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q + +N + S WE Y E I + D+ + GLL+QI+ +DASDY
Sbjct: 469 AKVGVQTAHMEMLPTNAEMLS---WESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYL 525
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY R SS + + P L +Q+ GH +H F+NG+ TGSA G+ + FT V+
Sbjct: 526 WYITRIDIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVN 585
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + + W Y+VGL G
Sbjct: 586 LHAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKG 645
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+ + +Q LTW+K F AP G++P+AL+++ MGKG+ W+N
Sbjct: 646 EAMNLVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWIN 705
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW ++ + GN Y+ + T YHVPR++LKPT NLLV+
Sbjct: 706 GQSIGRYWTAY--ANGNCQGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVV 763
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPT 665
EE G+P I++ ++ VC V H P + +W I+ +GK KP
Sbjct: 764 FEELGGDPSRISLVRRSMTSVCADVFEYH-PNIKNW---------HIESYGKTEELHKPK 813
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
V C G+ IS I FAS+G P G C + G CH+ S +VE+ CIG+ RC++ + +
Sbjct: 814 VHLRCGPGQSISSIKFASYGTPLGTCGSFEQGPCHAPDSYAIVEKRCIGRQRCAVTISNT 873
Query: 726 YFGGDPCPGIHKALLVDAQC 745
F DPCP + K L V+A C
Sbjct: 874 NFAQDPCPNVLKRLSVEAVC 893
>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 846
Score = 640 bits (1652), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/797 (45%), Positives = 464/797 (58%), Gaps = 72/797 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK GGLDV++TYVFWN+HEP G Y+F GR D++RFIK IQ GLY LRIG
Sbjct: 57 MWEDLILKAKNGGLDVVETYVFWNVHEPYPGIYNFEGRFDLVRFIKTIQKAGLYANLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+ +K
Sbjct: 117 PYVCAEWNFGGFPVWLKYVPGISFRTDNEAFKNAMQGFTEKIVALMKSENLFESQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY T F E G Y+ WAA MAV TGVPWVMCK+ DAP PVIN CNG C
Sbjct: 177 LAQIENEYGTESKLFGEAGYNYMTWAANMAVGLQTGVPWVMCKEADAPDPVINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TF PN P KP++WTE WT ++ +GG + R QD+AF VA FI + GS VNYYMYH
Sbjct: 237 -DTFS-PNKPYKPTMWTEAWTGWFSEFGGPLHQRPVQDLAFAVARFIQRGGSLVNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH AIK+C L++ V
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPIDEYGLLRQPKYGHLKELHRAIKMCEPALVSADPIVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A V+ SG CAAFL N D + VLF N Y LP SISILPDCK FNT
Sbjct: 355 SLGDYQQAHVYSSESGGCAAFLSNYDTKSFARVLFNNRHYNLPPWSISILPDCKNAVFNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q ++ L +S WE Y E I D+ +++ + GLL+QI+ +D SDY
Sbjct: 415 AKVGVQ---TAQMGMLPAESTTLSWESYFEDISALDDRSMMTSPGLLEQINVTRDTSDYL 471
Query: 387 WYTFRFHYNSSN-----AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY +SS + P L VQS GH +H F+NG+ +GS GS + FT V+
Sbjct: 472 WYITSVDISSSEPFLHGGELPTLLVQSTGHAVHVFINGQLSGSVSGSRKSRRFTYSGKVN 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN LLSV VGLP+ G E G+ + +R ++ W Y+VGL G
Sbjct: 532 LHAGTNKIGLLSVAVGLPNVGGHFETWNTGILGPVVLYGLRQGKWDLSSQKWTYKVGLKG 591
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G + V W +S+ + T Q LTW+K F AP G +P+AL+++ MGKG+ W+N
Sbjct: 592 EAMNLISPSGFSPVEWMQASLAAQTPQPLTWHKAYFDAPEGEEPLALDMEGMGKGQIWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTGNLL 608
GQSIGRYW ++ ++GN S+ YA T C + T YHVPR++L+P NLL
Sbjct: 652 GQSIGRYWTAY--ARGNCSRCNYA--TAFRPPKCQLGCGQPTQRWYHVPRSWLRPEQNLL 707
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
V+ EE GNP I++ + VC V+ H P +W I P V
Sbjct: 708 VVFEEVGGNPSRISIVKRLVTSVCADVSEFH-PTFKNW---------HITAKFITPKVHL 757
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
SC G+ IS I FASFG P G C Y G+CH+ S G++E+ C+GK RC++ + + F
Sbjct: 758 SCDPGQYISSIKFASFGTPLGTCGSYQQGTCHAPSSSGILEKKCVGKQRCAVTVSNSNF- 816
Query: 729 GDPCPGIHKALLVDAQC 745
DPCP + K L V+A C
Sbjct: 817 EDPCPNMMKRLSVEAVC 833
>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
Length = 845
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/800 (44%), Positives = 468/800 (58%), Gaps = 72/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW +I KAK+GGLDV++TYVFWN+HEP G Y+F GR D++RFI+ +Q GLY LRIG
Sbjct: 58 MWEDIIQKAKDGGLDVVETYVFWNVHEPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSERLFESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + G Y+ WAA MAV TGVPWVMCK++DAP PVIN CNG C
Sbjct: 178 LSQIENEYGVQSKLLGDAGHDYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP+IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 238 -DAFS-PNKPYKPTIWTEAWSGWFNEFGGPLHQRPVQDLAFAVARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGLVR+PK+GHLKELH +IKLC R L++ V
Sbjct: 296 GGTNFGRTAGGPFITTSYDYDAPIDEYGLVRQPKYGHLKELHRSIKLCERALVSADPIVS 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A V+ +G CAAFL N D + + V+F N+ Y LP SISILPDC+ FNT
Sbjct: 356 SLGSFQQAHVYSSDAGDCAAFLSNYDTKSSARVMFNNMHYNLPPWSISILPDCRNAVFNT 415
Query: 329 ERVSTQY-NKRSKTSNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYF 386
+V Q + +N + S WE Y E I + D+ + GLL+QI+ +DASDY
Sbjct: 416 AKVGVQTAHMEMLPTNAEMLS---WESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYL 472
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY R SS + + P L +Q+ GH +H F+NG+ TGSA G+ + FT V+
Sbjct: 473 WYITRIDIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + + W Y+VGL G
Sbjct: 533 LHAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKG 592
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+ + +Q LTW+K F AP G++P+AL+++ MGKG+ W+N
Sbjct: 593 EAMNLVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW ++ + GN Y+ + T YHVPR++LKPT NLLV+
Sbjct: 653 GQSIGRYWTAY--ANGNCQGCSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVV 710
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPT 665
EE G+P I++ ++ VC V H P + +W I+ +GK KP
Sbjct: 711 FEELGGDPSRISLVRRSMTSVCADVFEYH-PNIKNW---------HIESYGKTEELHKPK 760
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
V C G+ IS I FAS+G P G C + G CH+ S +VE+ CIG+ RC++ + +
Sbjct: 761 VHLRCGPGQSISSIKFASYGTPLGTCGSFEQGPCHAPDSYAIVEKRCIGRQRCAVTISNT 820
Query: 726 YFGGDPCPGIHKALLVDAQC 745
F DPCP + K L V+A C
Sbjct: 821 NFAQDPCPNVLKRLSVEAVC 840
>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
Length = 851
Score = 640 bits (1651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/818 (43%), Positives = 456/818 (55%), Gaps = 100/818 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y+F GR D++RFIK +Q G++V LRIG
Sbjct: 57 MWDGLIEKAKDGGLDVIQTYVFWNGHEPTPGNYNFEGRYDLVRFIKTVQKAGMFVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVGMMKSENLFASQGGPII 176
Query: 93 -------------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGP 139
IENEY F G Y+ WAAKMAV TGVPWVMCK+DDAP P
Sbjct: 177 LSQASAKLCFPCHIENEYGPEGKEFGAAGKAYINWAAKMAVGLDTGVPWVMCKEDDAPDP 236
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
VINACNG C +TF PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K
Sbjct: 237 VINACNGFYC-DTFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFGVARFVQKG 294
Query: 200 GSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
GS++NYYMYHGGTNFGRTA IT YD APLDEYGL REPK+GHLKELH A+KLC +
Sbjct: 295 GSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKFGHLKELHRAVKLCEQ 354
Query: 259 PLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISIL 318
PL++ V +LG +QEA VF +SG CAAFL N + V+F N +Y LP SISIL
Sbjct: 355 PLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAKVIFNNENYSLPPWSISIL 413
Query: 319 PDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQIS 377
PDCK V FNT V Q N+ ++ S WE+Y E + + LL + GLL+Q++
Sbjct: 414 PDCKNVVFNTATVGVQTNQMQMWADGA--SSMMWEKYDEEVDSLAAAPLLTSTGLLEQLN 471
Query: 378 AAKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNV 431
+D SDY WY + S L VQS GH LH F+NG+ GSA+G+ ++
Sbjct: 472 VTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDR 531
Query: 432 SFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCS 485
+ +LR GTN ALLSV GLP+ G E GV H + + T +
Sbjct: 532 KISYSGNANLRAGTNKVALLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQT 591
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQS 542
W YQVGL GE++ + S G V W S + + L WY+ F P+G++P+AL++ S
Sbjct: 592 WSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGS 651
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT--------- 593
MGKG+ W+NGQSIGRYW T YA H+ +A
Sbjct: 652 MGKGQIWINGQSIGRYW------------TAYAEGDCKGCHYTGSYRAPKCQAGCGQPTQ 699
Query: 594 --YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQ 651
YHVPR++L+PT NLLV+ EE G+ I + + VC V+ H P + +W
Sbjct: 700 RWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKNW----- 753
Query: 652 RGDTDIKKFGK----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGV 707
I+ +G+ V C G+ IS I FASFG P G C + G CHS +S V
Sbjct: 754 ----QIESYGEPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINSNSV 809
Query: 708 VERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ER CIG RC + + FGGDPCP + K + V+A C
Sbjct: 810 LERKCIGLERCVVAISPSNFGGDPCPEVMKRVAVEAVC 847
>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 640 bits (1650), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/801 (43%), Positives = 459/801 (57%), Gaps = 86/801 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP GQY F GR D++ FIK ++ GLYV LRIG
Sbjct: 56 MWPDLIEKAKDGGLDVVQTYVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MAV +T VPW+MCK+DDAP P+IN CNG C
Sbjct: 176 LSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WT++Y +G R +D+A+ VA FI K GS+VNYYMYH
Sbjct: 236 --DWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+REPKWGHLK+LH AIKLC L+ G V
Sbjct: 294 GGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q++ VF ++G CAAFL N D+ V F + Y+LP SISILPDCKT FNT
Sbjct: 354 SLGNAQKSSVFRSSTGACAAFLENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q ++ +++ W+ Y E I +F L GLL+QI+ +D +DY WY
Sbjct: 414 ARVGSQISQM----KMEWAGGFAWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWY 469
Query: 389 TF--------RFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
T +F N N + L V S GH LH F+NG+ G+ +GS D+ T V
Sbjct: 470 TTYVDVAQDEQFLSNGENLK--LTVMSAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVK 527
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIG 494
L G+N + LS+ VGLP+ G E AG+ D + T W YQVGL G
Sbjct: 528 LWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKG 587
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E + ++S G + V W P ++ LTWYK F AP G++P+AL++ SMGKG+ W+NG
Sbjct: 588 ESMSLHSLSGSSTVEWG---EPVQKQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWING 644
Query: 553 QSIGRYWVSFKTS--------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPT 604
Q IGRYW +K S +G +T+ N S + YHVPR++L PT
Sbjct: 645 QGIGRYWPGYKASGNCGTCDYRGEYDETKCQTNCGDS--------SQRWYHVPRSWLSPT 696
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
GNLLV+ EE G+P GI++ +I VC V+ P + +W K +K
Sbjct: 697 GNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWH----------TKDYEKA 745
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
V C G+KI++I FASFG P G C Y G CH+ S + + C+G+ RC + ++
Sbjct: 746 KVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIFWKNCVGQERCGVSVVP 805
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
FGGDPCPG K +V+A C
Sbjct: 806 EIFGGDPCPGTMKRAVVEAIC 826
>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
Length = 836
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 354/796 (44%), Positives = 457/796 (57%), Gaps = 67/796 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +G+Y F GR D++RFIK +Q+ GLYV LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN P+K
Sbjct: 116 PYICAEWNFGGFPVWLKYVPGIAFRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP PVI+ACNG C
Sbjct: 176 MSQIENEYGPVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F PN KP ++TE WT +Y +GG R A+D+A+ VA FI GS++NYYMYH
Sbjct: 236 ENFF--PNKDYKPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYH 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA I+ YD AP+DEYGL EPKWGHL++LH AIKLC L++ V
Sbjct: 294 GGTNFGRTAGGPFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA V++ SG CAAFL N D + + V F N Y+LP S+SILPDCK V FNT
Sbjct: 354 YLGTNLEAHVYKAKSGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILN-FDNTLLRAEGLLDQISAAKDASDY 385
R+ Q +S +K + S W+ Y E + + +GLL+QI+ +D +DY
Sbjct: 414 ARIGAQ------SSQMKMNPVSTFSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDY 467
Query: 386 FWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WY H Q P L V S GH LH F+NG+ +G+ +G N T + V
Sbjct: 468 LWYMTEVHIKPDEGFLKTGQYPVLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNV 527
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLI 493
L GTN +LLSV +GLP+ G E AGV + ++ W Y++GL
Sbjct: 528 KLTVGTNKISLLSVAMGLPNVGLHFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLK 587
Query: 494 GEKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE L + + G + W S+ + + LTWYKTTF AP GNDP+AL++ SMGKG+ W+N
Sbjct: 588 GEALNLQAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWIN 647
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLV 609
G+SIGR+W ++ T+ GN + YA N C + YHVPR++LKP+GN L+
Sbjct: 648 GESIGRHWPAY-TAHGNCNGCNYAGIFNDKKCQTGCG-GPSQRWYHVPRSWLKPSGNQLI 705
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS 669
+ EE GNP GIT+ + +VC + P L + + G + + K +
Sbjct: 706 VFEELGGNPAGITLVKRTMDRVCADIFEGQ-PSLKN---SQIIGSSKVNSLQSKAHLW-- 759
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
C G KISKI FASFG P G C + GSCH+ S ++R CIGK CS+ + FGG
Sbjct: 760 CAPGLKISKIQFASFGVPQGTCGSFREGSCHAHKSYDALQRNCIGKQSCSVSVAPEVFGG 819
Query: 730 DPCPGIHKALLVDAQC 745
DPCPG K L V+A C
Sbjct: 820 DPCPGSMKKLSVEALC 835
>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 354/796 (44%), Positives = 457/796 (57%), Gaps = 67/796 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +G+Y F GR D++RFIK +Q+ GLYV LRIG
Sbjct: 53 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN P+K
Sbjct: 113 PYICAEWNFGGFPVWLKYVPGIAFRTDNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPII 172
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP PVI+ACNG C
Sbjct: 173 MSQIENEYGPVEYEIGAPGKAYTKWAAEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F PN KP ++TE WT +Y +GG R A+D+A+ VA FI GS++NYYMYH
Sbjct: 233 ENFF--PNKDYKPKMFTEAWTGWYTEFGGAIPNRPAEDLAYSVARFIQNRGSFINYYMYH 290
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA I+ YD AP+DEYGL EPKWGHL++LH AIKLC L++ V
Sbjct: 291 GGTNFGRTAGGPFISTSYDYDAPIDEYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVT 350
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA V++ SG CAAFL N D + + V F N Y+LP S+SILPDCK V FNT
Sbjct: 351 YLGTNLEAHVYKAKSGACAAFLANYDPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNT 410
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILN-FDNTLLRAEGLLDQISAAKDASDY 385
R+ Q +S +K + S W+ Y E + + +GLL+QI+ +D +DY
Sbjct: 411 ARIGAQ------SSQMKMNPVSTFSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDY 464
Query: 386 FWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WY H Q P L V S GH LH F+NG+ +G+ +G N T + V
Sbjct: 465 LWYMTEVHIKPDEGFLKTGQYPVLTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNV 524
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLI 493
L GTN +LLSV +GLP+ G E AGV + ++ W Y++GL
Sbjct: 525 KLTVGTNKISLLSVAMGLPNVGLHFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLK 584
Query: 494 GEKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE L + + G + W S+ + + LTWYKTTF AP GNDP+AL++ SMGKG+ W+N
Sbjct: 585 GEALNLQAITGSSSDEWVEGSLLAQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWIN 644
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLV 609
G+SIGR+W ++ T+ GN + YA N C + YHVPR++LKP+GN L+
Sbjct: 645 GESIGRHWPAY-TAHGNCNGCNYAGIFNDKKCQTGCG-GPSQRWYHVPRSWLKPSGNQLI 702
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS 669
+ EE GNP GIT+ + +VC + P L + + G + + K +
Sbjct: 703 VFEELGGNPAGITLVKRTMDRVCADIFEGQ-PSLKN---SQIIGSSKVNSLQSKAHLW-- 756
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
C G KISKI FASFG P G C + GSCH+ S ++R CIGK CS+ + FGG
Sbjct: 757 CAPGLKISKIQFASFGVPQGTCGSFREGSCHAHKSYDALQRNCIGKQSCSVSVAPEVFGG 816
Query: 730 DPCPGIHKALLVDAQC 745
DPCPG K L V+A C
Sbjct: 817 DPCPGSMKKLSVEALC 832
>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
Length = 839
Score = 640 bits (1650), Expect = e-180, Method: Compositional matrix adjust.
Identities = 349/801 (43%), Positives = 460/801 (57%), Gaps = 86/801 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP GQY F GR D++ FIK ++ GLYV LRIG
Sbjct: 68 MWPDLIEKAKDGGLDVVQTYVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 128 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MAV +T VPW+MCK+DDAP P+IN CNG C
Sbjct: 188 LSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WT++Y +G R +D+A+ VA FI K GS+VNYYMYH
Sbjct: 248 --DWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYH 305
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+REPKWGHLK+LH AIKLC L+ G V
Sbjct: 306 GGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVT 365
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q++ VF ++G CAAFL N D+ V F + Y+LP SISILPDCKT FNT
Sbjct: 366 SLGNAQKSSVFRSSTGACAAFLENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNT 425
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q ++ +++ W+ Y E I +F L GLL+QI+ +D +DY WY
Sbjct: 426 ARVGSQISQM----KMEWAGGFAWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWY 481
Query: 389 TF--------RFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
T +F N N + L V S GH LH F+NG+ G+ +GS D+ T V
Sbjct: 482 TTYVDVAQDEQFLSNGENLK--LTVMSAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVK 539
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIG 494
L G+N + LS+ VGLP+ G E AG+ D + T W YQVGL G
Sbjct: 540 LWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKG 599
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E + ++S G + V W P ++ LTWYK F AP G++P+AL++ SMGKG+ W+NG
Sbjct: 600 ESMSLHSLSGSSTVEWG---EPVQKQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWING 656
Query: 553 QSIGRYWVSFKTS--------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPT 604
Q IGRYW +K S +G +T+ N S + YHVPR++L PT
Sbjct: 657 QGIGRYWPGYKASGNCGTCDYRGEYDETKCQTNCGDS--------SQRWYHVPRSWLSPT 708
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
GNLLV+ EE G+P GI++ +I VC V+ P + +W K + +K
Sbjct: 709 GNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNW---------HTKDY-EKA 757
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
V C G+KI++I FASFG P G C Y G CH+ S + + C+G+ RC + ++
Sbjct: 758 KVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIFWKNCVGQERCGVSVVP 817
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
FGGDPCPG K +V+A C
Sbjct: 818 EIFGGDPCPGTMKRAVVEAIC 838
>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
Length = 851
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 355/818 (43%), Positives = 456/818 (55%), Gaps = 100/818 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y+F GR D++RFIK +Q G++V LRIG
Sbjct: 57 MWDGLIEKAKDGGLDVIQTYVFWNGHEPTPGNYNFEGRYDLVRFIKTVQKAGMFVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVGMMKSENLFASQGGPII 176
Query: 93 -------------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGP 139
IENEY F G Y+ WAAKMAV TGVPWVMCK+DDAP P
Sbjct: 177 LSQASAKLCFPCHIENEYGPEGKEFGAAGKAYINWAAKMAVGLDTGVPWVMCKEDDAPDP 236
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
VINACNG C +TF PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K
Sbjct: 237 VINACNGFYC-DTFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFGVARFVQKG 294
Query: 200 GSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
GS++NYYMYHGGTNFGRTA IT YD APLDEYGL REPK+GHLKELH A+KLC +
Sbjct: 295 GSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKFGHLKELHRAVKLCEQ 354
Query: 259 PLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISIL 318
PL++ V +LG +QEA VF +SG CAAFL N + V+F N +Y LP SISIL
Sbjct: 355 PLVSADPTVTTLGSMQEAHVFRSSSG-CAAFLANYNSNSYAKVIFNNENYSLPPWSISIL 413
Query: 319 PDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQIS 377
PDCK V FNT V Q N+ ++ S WE+Y E + + LL + GLL+Q++
Sbjct: 414 PDCKNVVFNTATVGVQTNQMQMWADGA--SSMMWEKYDEEVDSLAAAPLLTSTGLLEQLN 471
Query: 378 AAKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNV 431
+D SDY WY + S L VQS GH LH F+NG+ GSA+G+ ++
Sbjct: 472 VTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDR 531
Query: 432 SFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCS 485
+ +LR GTN ALLSV GLP+ G E GV H + + T +
Sbjct: 532 KISYSGNANLRAGTNKVALLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQT 591
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQS 542
W YQVGL GE++ + S G V W S + + L WY+ F P+G++P+AL++ S
Sbjct: 592 WSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGS 651
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT--------- 593
MGKG+ W+NGQSIGRYW T YA H+ +A
Sbjct: 652 MGKGQIWINGQSIGRYW------------TAYAEGDCKGCHYTGSYRAPKCQAGCGQPTQ 699
Query: 594 --YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQ 651
YHVPR++L+PT NLLV+ EE G+ I + + VC V+ H P + +W
Sbjct: 700 RWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKNW----- 753
Query: 652 RGDTDIKKFGK----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGV 707
I+ +G+ V C G+ IS I FASFG P G C + G CHS +S V
Sbjct: 754 ----QIESYGEPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINSNSV 809
Query: 708 VERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+E+ CIG RC + + FGGDPCP + K + V+A C
Sbjct: 810 LEKKCIGLQRCVVAISPSNFGGDPCPEVMKRVAVEAVC 847
>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 843
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 361/798 (45%), Positives = 458/798 (57%), Gaps = 68/798 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D++RFIK +Q GLYV LRIG
Sbjct: 60 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MAV TGVPW+MCKQDDAP P+IN CNG C
Sbjct: 180 LSQIENEYGPMEYEIGAPGRSYTQWAAHMAVGLGTGVPWIMCKQDDAPDPIINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF +A FI K GS+VNYYMYH
Sbjct: 240 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPHRPAEDLAFSIARFIQKGGSFVNYYMYH 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL R+PKWGHLK+LH AIKLC L++G V
Sbjct: 298 GGTNFGRTAGGPFIATSYDYDAPLDEYGLARQPKWGHLKDLHRAIKLCEPALVSGDSTVQ 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +EA VF SG CAAFL N + + TV F N Y LP SISILP+CK +NT
Sbjct: 358 RLGNYEEAHVFRSKSGACAAFLANYNPQSYATVAFGNQHYNLPPWSISILPNCKHTVYNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q + K + + W+ + E D++ GLL+QI+A +D SDY WY
Sbjct: 418 ARVGSQ-STTMKMTRVPIHGGLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWY 476
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ NS+ N + P L V S GH LH F+N + +G+A+GS + T +V LR
Sbjct: 477 STDVVINSNEGFLRNGKNPVLTVLSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLR 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLSV VGLP+ G ER AGV + + T W Y+VGL GE
Sbjct: 537 AGVNKISLLSVAVGLPNVGPHFERWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEA 596
Query: 497 LQIYSNLGLNKVLWSS--IRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W + S + LTWYKTTF APAG P+AL++ SMGKG+ W+NGQS
Sbjct: 597 LNLHSLSGSSSVEWLQGFLVSRRQPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQS 656
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW ++K S G+ YA N C + YHVP ++LKP+GNLLV+ E
Sbjct: 657 LGRYWPAYKAS-GSCGYCNYAGTYNEKKCGSNCG-EASQRWYHVPHSWLKPSGNLLVVFE 714
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-----KPTVQ 667
E G+P GI + I VC + P L S+ +++ GK +P
Sbjct: 715 ELGGDPNGIFLVRRDIDSVCADIYEWQ-PNLVSY---------EMQASGKVRSPVRPKAH 764
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
SC G+KIS I FASFG P G C Y GSCH+ S + C+G+S C++ + F
Sbjct: 765 LSCGPGQKISSIKFASFGTPVGSCGSYREGSCHAHKSYDAFLKNCVGQSWCTVTVSPEIF 824
Query: 728 GGDPCPGIHKALLVDAQC 745
GGDPCP + K L V+A C
Sbjct: 825 GGDPCPRVMKKLSVEAIC 842
>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 351/811 (43%), Positives = 467/811 (57%), Gaps = 97/811 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GG+DVI+TYVFWNLHEP G+YDF GRND++RF+K I GLY LRIG
Sbjct: 63 MWEDLIQKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 123 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +G Y+ WAAKMA+ TGVPWVMCK+DDAP PVIN CNG C
Sbjct: 183 LSQIENEYGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN P KP IWTE W+ ++ +GG + R QD+AF VA FI K GS+VNYYMYH
Sbjct: 243 -DSF-APNKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQKGGSFVNYYMYH 300
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA +T YD AP+DEYGL+R+PK+GHLKELH AIK+C + L++ V
Sbjct: 301 GGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVT 360
Query: 269 SLGQLQEAFVFEE--------TSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPD 320
S+G Q+ +++ E SG C+AFL N D A VLF N+ Y LP SISILPD
Sbjct: 361 SIGNKQQVWIYYERFAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPD 420
Query: 321 CKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAA 379
C+ FNT +V S+ +WE Y E + + D+ + GLL+QI+
Sbjct: 421 CRNAVFNTAKV----------------SNFQWESYLEDLSSLDDSSTFTTHGLLEQINVT 464
Query: 380 KDASDYFWYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
+D SDY WY S + + P L +QS GH +H FVNG+ +GSA G+ N F
Sbjct: 465 RDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRF 524
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWG 487
T + ++L GTN ALLSV VGLP+ G E G+ H + + W
Sbjct: 525 TYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWT 584
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSM 543
YQVGL GE + + + W +++ P + LTW+KT F AP GN+P+AL+++ M
Sbjct: 585 YQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKP-QPLTWHKTYFDAPEGNEPLALDMEGM 643
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLK 602
GKG+ WVNG+SIGRYW +F T G+ S Y + + T YHVPRA+LK
Sbjct: 644 GKGQIWVNGESIGRYWTAFAT--GDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLK 701
Query: 603 PTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK 662
P+ NLLV+ EE GNP +++ ++ VC V+ H P + +W I+ +GK
Sbjct: 702 PSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNW---------QIESYGK 751
Query: 663 -----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVER---ACIG 714
+P V C G+ I+ I FASFG P G C Y G CH++ S ++ER C+G
Sbjct: 752 GQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERYMQKCVG 811
Query: 715 KSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
K+RC++ + + FG DPCP + K L V+A C
Sbjct: 812 KARCAVTISNSNFGKDPCPNVLKRLTVEAVC 842
>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 887
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 335/795 (42%), Positives = 471/795 (59%), Gaps = 78/795 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I KA+ GGL+ IQTYVFWN+HEP++G+YDF GR D+++FIK I +GLYV LR+G
Sbjct: 71 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 130
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V + FR++N+P+K
Sbjct: 131 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 190
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G Y+ WAA + + G+PWVMCKQ+DAPG +INACNG C
Sbjct: 191 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 250
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN +KPS+WTE+WT+ ++V+G P R+A+DIAF VA + +KNGS+VNYYMYH
Sbjct: 251 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTAEDIAFSVARYFSKNGSHVNYYMYH 310
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A F+ T YYD APLDE+GL + PK+GHLK +H A++LC + L G +
Sbjct: 311 GGTNFGRTSAHFVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQT 370
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG E +E+ + VCAAFL NN+ R T+ F+ Y LP +SISILPDCKTV +NT
Sbjct: 371 LGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNT 430
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLL--DQISAAKDASDYF 386
++ Q++ R + K K+E + E I +LL + L+ + KD +DY
Sbjct: 431 AQIVAQHSWRDFVKSEKTSKGLKFEMFSENI----PSLLDGDSLIPGELYYLTKDKTDYA 486
Query: 387 WYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WYT + + + L V S GH L +VNGEY G AHG H+ SF V+
Sbjct: 487 WYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVN 546
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-KSFT-----NCSWGYQVGLIG 494
+ G N ++L V GLPDSG+++E + AG + + KS T N WG+ GL G
Sbjct: 547 FKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEG 606
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
EK ++Y+ G KV W + LTWYKT F P G + +A+ ++ MGKG WVNG
Sbjct: 607 EKKEVYTEEGSKKVKWEK-DGERKPLTWYKTYFETPEGVNAVAIRMKGMGKGLIWVNGIG 665
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLK--PTGNLLVLLE 612
+GRYW+SF + G P+QT+ YH+PR+F+K N+LV+LE
Sbjct: 666 VGRYWMSFLSPLGEPTQTE--------------------YHIPRSFMKGEKKKNMLVILE 705
Query: 613 EENGNPLGITVDTIAIRK--VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE G L ++D + + + +C +V + + SW R + + K K ++ C
Sbjct: 706 EEPGVKLE-SIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMR--C 762
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
P K++ ++ FASFG+P G C + +G C +S S+ VVE+ C+G++ CSI + FG
Sbjct: 763 PPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGDK 822
Query: 731 PCPGIHKALLVDAQC 745
CP I K L V +C
Sbjct: 823 GCPEIVKTLAVQVKC 837
>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
Length = 861
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 348/806 (43%), Positives = 460/806 (57%), Gaps = 67/806 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAK+GGLDVI++YVFWN+HEP++ +Y F R D+++F+K +Q GL V LRIG
Sbjct: 61 MWPDIIQKAKDGGLDVIESYVFWNMHEPKQNEYYFEDRFDLVKFVKIVQQAGLLVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 121 PYACAEWNYGGFPVWLHLIPGIHFRTDNEPFKNEMQRFTAKIVDMMKQEKLFASQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + G YV WAA MAV +TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 181 LAQIENEYGNIDGPYGAAGKSYVKWAASMAVGLNTGVPWVMCQQADAPDPIINTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PNSPNKP +WTE+W+ ++ +GG+ R +D+AF VA F + G++ NYYMYH
Sbjct: 241 -DAFT-PNSPNKPKMWTENWSGWFLSFGGRLPFRPTEDLAFSVARFFQRGGTFQNYYMYH 298
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT F+ T Y AP+DEYG+VR+PKWGHLKELH AIKLC L+ N
Sbjct: 299 GGTNFGRTTGGPFIATSYDYDAPIDEYGIVRQPKWGHLKELHKAIKLCEAALVNAESNYT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ SG CAAFL N++ + TV F SY LP S+SILPDCK V FNT
Sbjct: 359 SLGSGLEAHVYSPGSGTCAAFLANSNTQSDATVKFNGNSYHLPAWSVSILPDCKNVVFNT 418
Query: 329 ERVSTQY--------NKRSKTSNLKFDSDE----KWEEYREAILNFDNTLLRAEGLLDQI 376
++ +Q N SN +D W E I + GLL+QI
Sbjct: 419 AKIGSQTTSVQMNPANLILAGSNSMKGTDSANAASWSWLHEQIGIGGSNTFSKPGLLEQI 478
Query: 377 SAAKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDN 430
+ D+SDY WYT + + Q L VQS GH LH F+NGE+ G GS +
Sbjct: 479 NTTVDSSDYLWYTTSIQVDDNEPFLHNGTQPVLHVQSLGHALHVFINGEFAGRGAGSSSS 538
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNC 484
L+ + L+ G N+ LLS+TVGL + G+F + AG+ + + +
Sbjct: 539 SKIALQTPITLKSGKNNIDLLSITVGLQNYGSFFDTWGAGITGPVILQGFKDGEHDLSTQ 598
Query: 485 SWGYQVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQS 542
W YQ+GL GE+L IYS W + PT+Q + WYKT F AP+GNDP+ALNL
Sbjct: 599 QWTYQIGLTGEQLGIYSGDTKASAQWVAGSDLPTKQPMIWYKTNFDAPSGNDPVALNLLG 658
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSK-GNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAF 600
MGKG AWVNGQSIGRYW S+ S+ G Y + T + YHVPR++
Sbjct: 659 MGKGVAWVNGQSIGRYWPSYIASQSGCTDSCDYRGAYSSTKCQTNCGQPSQKLYHVPRSW 718
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
++PTGN+LVL EE G+P I+ T ++ +C V+ +HLPP+ SW G ++ K
Sbjct: 719 IQPTGNVLVLFEELGGDPTQISFMTRSVGSLCAQVSETHLPPVDSWKSSATSG-LEVNK- 776
Query: 661 GKKPTVQPSCPLGKKISK-IVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
K +Q CP + + K I FASFG G C + G C+++ + +VE ACIG+ CS
Sbjct: 777 -PKAELQLHCPSSRHLIKSIKFASFGTSKGSCGSFTYGHCNTNSTMSIVEEACIGRESCS 835
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
+ + F GDPC G K L V+A C
Sbjct: 836 VEVSIEKF-GDPCKGTVKNLAVEASC 860
>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
Length = 887
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 334/795 (42%), Positives = 471/795 (59%), Gaps = 78/795 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I KA+ GGL+ IQTYVFWN+HEP++G+YDF GR D+++FIK I +GLYV LR+G
Sbjct: 71 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 130
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V + FR++N+P+K
Sbjct: 131 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 190
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G Y+ WAA + + G+PWVMCKQ+DAPG +INACNG C
Sbjct: 191 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 250
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN +KPS+WTE+WT+ ++V+G P R+ +DIAF VA + +KNGS+VNYYMYH
Sbjct: 251 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 310
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A F+ T YYD APLDE+GL + PK+GHLK +H A++LC + L G +
Sbjct: 311 GGTNFGRTSAHFVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQT 370
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG E +E+ + VCAAFL NN+ R T+ F+ Y LP +SISILPDCKTV +NT
Sbjct: 371 LGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNT 430
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLL--DQISAAKDASDYF 386
++ Q++ R + K K+E + E I +LL + L+ + KD +DY
Sbjct: 431 AQIVAQHSWRDFVKSEKTSKGLKFEMFSENI----PSLLDGDSLIPGELYYLTKDKTDYA 486
Query: 387 WYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WYT + + + L V S GH L +VNGEY G AHG H+ SF V+
Sbjct: 487 WYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVN 546
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-KSFT-----NCSWGYQVGLIG 494
+ G N ++L V GLPDSG+++E + AG + + KS T N WG+ GL G
Sbjct: 547 FKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEG 606
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
EK ++Y+ G KV W + LTWYKT F P G + +A+ +++MGKG WVNG
Sbjct: 607 EKKEVYTEEGSKKVKWEK-DGKRKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIG 665
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLK--PTGNLLVLLE 612
+GRYW+SF + G P+QT+ YH+PR+F+K N+LV+LE
Sbjct: 666 VGRYWMSFLSPLGEPTQTE--------------------YHIPRSFMKGEKKKNMLVILE 705
Query: 613 EENGNPLGITVDTIAIRK--VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE G L ++D + + + +C +V + + SW R + + K K ++ C
Sbjct: 706 EEPGVKLE-SIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMR--C 762
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
P K++ ++ FASFG+P G C + +G C +S S+ VVE+ C+G++ CSI + FG
Sbjct: 763 PPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGDK 822
Query: 731 PCPGIHKALLVDAQC 745
CP I K L V +C
Sbjct: 823 GCPEIVKTLAVQVKC 837
>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 353/793 (44%), Positives = 456/793 (57%), Gaps = 59/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y+F GR D+++FIK Q GL+V LRIG
Sbjct: 62 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGSYNFEGRYDLVKFIKTAQKAGLFVHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 122 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKAAMQGFTEKIVGMMKSEELFASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY E F G Y WAAKMAV TGVPWVMCKQ+DAP PVINACNG C
Sbjct: 182 LSQIENEYGPEEKEFGAAGKSYSDWAAKMAVGLDTGVPWVMCKQEDAPDPVINACNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN+P+KP++WTE WT ++ +GG R +D++F VA F+ K GS++NYYMYH
Sbjct: 242 -DAFT-PNTPSKPTMWTEAWTGWFTEFGGTIRKRPVEDLSFAVARFVQKGGSFINYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL REPK+GHLKELH AIKLC + L++ V
Sbjct: 300 GGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKYGHLKELHKAIKLCEQALVSVDPTVT 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG +QEA V+ SG CAAFL N + ++F N Y LP SISILPDCKTV +NT
Sbjct: 360 SLGSMQEAHVYRSPSG-CAAFLANYNSNSHAKIVFDNEHYSLPPWSISILPDCKTVVYNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
V Q ++ S+ S WE Y E + + LL GLL+Q++A +D SDY W
Sbjct: 419 ATVGVQTSQMQMWSDGA--SSMMWERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLW 476
Query: 388 YTFRFHYNSSNAQ------APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S L VQS GH LH FVNG+ GSA G+ ++ + + V L
Sbjct: 477 YMTSVDVSPSEKSLQGGKPLSLTVQSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKL 536
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R GTN +LLSV GLP+ G E GV H + + T +W YQVGL GE
Sbjct: 537 RAGTNKISLLSVACGLPNIGVHYETWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGE 596
Query: 496 KLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
++ + S G + V W S I L WY+ F P+G++P+AL++ SMGKG+ W+NG
Sbjct: 597 QMNLNSLEGASSVEWMQGSLIAQNQMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWING 656
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
QSIGRY +++ T + + C YHVP+++L+PT NLLV+ E
Sbjct: 657 QSIGRYSLAYATGDCKDCSYTGSFRAIKCQAGCG-QPTQRWYHVPKSWLQPTRNLLVVFE 715
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+ I++ ++ VC V+ H P + +W + + K ++ V C
Sbjct: 716 ELGGDTSKISLVKRSVSNVCADVSEFH-PSIKNW---QTENSGEAKPELRRSKVHLRCAP 771
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+ IS I FASFG P G C + G CHS+ SQ V+E CIGK RC++ + FGGDPC
Sbjct: 772 GQSISAIKFASFGTPLGTCGSFEQGQCHSTKSQTVLEN-CIGKQRCAVTISPDNFGGDPC 830
Query: 733 PGIHKALLVDAQC 745
P + K + V+A C
Sbjct: 831 PNVMKRVAVEAVC 843
>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 353/793 (44%), Positives = 455/793 (57%), Gaps = 59/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y+F GR D+++FIK Q GL+V LRIG
Sbjct: 62 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGSYNFEGRYDLVKFIKTAQKAGLFVHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 122 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKAAMQGFTEKIVGMMKSEELFASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY E F G Y WAAKMAV TGVPWVMCKQ+DAP PVINACNG C
Sbjct: 182 LSQIENEYGPEEKEFGAAGKSYSDWAAKMAVGLDTGVPWVMCKQEDAPDPVINACNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN+P+KP++WTE WT ++ +GG R +D++F VA F+ K GS++NYYMYH
Sbjct: 242 -DAFT-PNTPSKPTMWTEAWTGWFTEFGGTIRKRPVEDLSFAVARFVQKGGSFINYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL REPK+GHLKELH AIKLC + L++ V
Sbjct: 300 GGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKYGHLKELHKAIKLCEQALVSVDPTVT 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG +QEA V+ SG CAAFL N + ++F N Y LP SISILPDCKTV +NT
Sbjct: 360 SLGSMQEAHVYRSPSG-CAAFLANYNSNSHAKIVFDNEHYSLPPWSISILPDCKTVVYNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
V Q ++ S+ S WE Y E + + LL GLL+Q++A +D SDY W
Sbjct: 419 ATVGVQTSQMQMWSDGA--SSMMWERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLW 476
Query: 388 YTFRFHYNSSNAQ------APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S L VQS GH LH FVNG+ GSA G+ ++ + + V L
Sbjct: 477 YMTSVDVSPSEKSLQGGKPLSLTVQSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKL 536
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R GTN +LLSV GLP+ G E GV H + + T +W YQVGL GE
Sbjct: 537 RAGTNKISLLSVACGLPNIGVHYETWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGE 596
Query: 496 KLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
++ + S G + V W S I L WY+ F P+G++P+AL++ SMGKG+ W+NG
Sbjct: 597 QMNLNSLEGASSVEWMQGSLIAQNQMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWING 656
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
QSIGRY +++ T + + C YHVP+ +L+PT NLLV+ E
Sbjct: 657 QSIGRYSLAYATGDCKDCSYTGSFRAIKCQAGCG-QPTQRWYHVPKPWLQPTRNLLVVFE 715
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+ I++ ++ VC V+ H P + +W + + K ++ V C
Sbjct: 716 ELGGDTSKISLVKRSVSNVCADVSEFH-PSIKNW---QTENSGEAKPELRRSKVHLRCAP 771
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+ IS I FASFG P G C + G CHS+ SQ V+E CIGK RC++ + FGGDPC
Sbjct: 772 GQSISAIKFASFGTPLGTCGSFEQGQCHSTKSQTVLEN-CIGKQRCAVTISPDNFGGDPC 830
Query: 733 PGIHKALLVDAQC 745
P + K + V+A C
Sbjct: 831 PNVMKRVAVEAVC 843
>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 352/794 (44%), Positives = 450/794 (56%), Gaps = 63/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GG+DVIQTYVFWN HEP G Y F R D+++FIK +Q GLY+ LRIG
Sbjct: 58 MWPDLIQKAKDGGVDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVQQAGLYLHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN P+K
Sbjct: 118 PYICAEWNFGGFPVWLKYVPGIEFRTDNGPFKAAMQKFTEKIVGMMKSEKLFENQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MAV TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 178 LSQIENEYGPVEWEIGAPGKAYTKWAADMAVKLGTGVPWIMCKQEDAPDPMIDTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP IWTE WT +Y +GG R A+D+AF VA FI GSY+NYYMYH
Sbjct: 238 -ENFK-PNKDYKPKIWTEAWTGWYTEFGGAVPHRPAEDMAFSVARFIQNGGSYINYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDE+GL REPKWGHL++LH AIKLC L++ V
Sbjct: 296 GGTNFGRTAGGPFIATSYDYDAPLDEFGLPREPKWGHLRDLHKAIKLCEPALVSVDPTVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA VF+ S VCAAFL N D + +V V F N YELP S+SILPDCKT +NT
Sbjct: 356 SLGSNQEAHVFKSKS-VCAAFLANYDTKYSVKVTFGNGQYELPPWSVSILPDCKTAVYNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYFW 387
R+ +Q S+ + S W+ Y E + D+ GL +QI+ +DA+DY W
Sbjct: 415 ARLGSQ---SSQMKMVPASSSFSWQSYNEETASADDDDTTTMNGLWEQINVTRDATDYLW 471
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y ++ + Q P L + S GH LH F+NG+ G+A+G N T + L
Sbjct: 472 YLTDVKIDADEGFLKSGQNPLLTIFSAGHALHVFINGQLAGTAYGGLSNPKLTFSQNIKL 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
+G N +LLSV VGLP+ G E AGV + + + W Y++GL GE
Sbjct: 532 TEGINKISLLSVAVGLPNVGLHFETWNAGVLGPITLKGLNEGTRDLSGQKWSYKIGLKGE 591
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G V W S+ + + LTWYKT F AP GNDP+AL++ SMGKG+ W+NGQ
Sbjct: 592 SLSLHTASGSESVEWVEGSLLAQKQALTWYKTAFDAPQGNDPLALDMSSMGKGQMWINGQ 651
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+IGR+W + + G+ YA + C + YHVPR++LKP+GNLL +
Sbjct: 652 NIGRHWPGY-IAHGSCGDCNYAGTFDDKKCRTNCG-EPSQRWYHVPRSWLKPSGNLLAVF 709
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P GI+ VC + P L +W + K +P CP
Sbjct: 710 EEWGGDPTGISFVKRTTASVCADIFEGQ-PALKNW-----QAIASGKVISPQPKAHLWCP 763
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+KIS+I FASFG P G C + GSCH+ S ER C+GK CS+ + FGGDP
Sbjct: 764 TGQKISQIKFASFGMPQGTCGSFREGSCHAHKSYDAFERNCVGKQSCSVTVAPEVFGGDP 823
Query: 732 CPGIHKALLVDAQC 745
CP K L V+A C
Sbjct: 824 CPDSAKKLSVEAVC 837
>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
Length = 846
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 352/795 (44%), Positives = 463/795 (58%), Gaps = 63/795 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFW++HE G Y+F GR D++RFIK +Q GLY LRIG
Sbjct: 58 MWEDLIQKAKDGGLDVIDTYVFWDVHETSPGNYNFDGRYDLVRFIKTVQKVGLYAHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAAMQGFTQKIVQMMKNENLFASQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY A G Y+ WAAKMAV TGVPWVMCK+DDAP P+IN CNG C
Sbjct: 178 LSQIENEYGPESRALGAAGRSYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG + R +D+AF VA FI K GSY NYYMYH
Sbjct: 238 -DAF-APNKPYKPTLWTEAWSGWFTEFGGPIHQRPVEDLAFAVARFIQKGGSYFNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+REPK+GHLK LH AIKLC L++ ++
Sbjct: 296 GGTNFGRSAGGPFITTSYDYDAPIDEYGLIREPKYGHLKALHKAIKLCEHALVSSDPSIT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A VF CAAFL N + + A V+F N+ Y+LP SISILPDC+ V FNT
Sbjct: 356 SLGTYQQAHVFSSGRS-CAAFLANYNAKSAARVMFNNMHYDLPPWSISILPDCRNVVFNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
RV Q + L S+ WE Y E I + D++ + A GLL+QI+ +D SDY
Sbjct: 415 ARVGAQ---TLRMQMLPTGSELFSWETYDEEISSLTDSSRITALGLLEQINVTRDTSDYL 471
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + S N Q P L VQS GH LH F+NG+++GSA G+ +N T V+
Sbjct: 472 WYLTSVDISPSEAFLRNGQKPSLTVQSAGHGLHVFINGQFSGSAFGTRENRQLTFTGPVN 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR GTN ALLS+ VGLP+ G E GV + + K T W YQVGL G
Sbjct: 532 LRAGTNRIALLSIAVGLPNVGLHYETWKTGVQGPVLLNGLNQGKKDLTWQKWSYQVGLKG 591
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S S + L W+K F AP GN+P+AL+++SMGKG+ W+N
Sbjct: 592 EAMNLVSPNGVSSVDWIEGSLASSQGQALKWHKAYFDAPRGNEPLALDMRSMGKGQVWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW+++ +KG+ + Y S + T YHVPR++LKPT NLLV+
Sbjct: 652 GQSIGRYWMAY--AKGDCNSCSYIWTFRPSKCQLGCGEPTQRWYHVPRSWLKPTKNLLVV 709
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE G+ I++ +I VC H P + ++ G D + + C
Sbjct: 710 FEELGGDASKISLVKRSIEGVCADAYEHH--PAT---KNYNTGGNDESSKLHQAKIHLRC 764
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
G+ I+ I FASFG P G C + G+CH+ ++ V+E+ CIG+ C + + + FG D
Sbjct: 765 APGQFIAAIKFASFGTPSGTCGSFQQGTCHAPNTHSVIEKKCIGQESCMVTISNSNFGAD 824
Query: 731 PCPGIHKALLVDAQC 745
PCP + K L V+A C
Sbjct: 825 PCPNVLKKLSVEAVC 839
>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
Length = 847
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 347/792 (43%), Positives = 460/792 (58%), Gaps = 56/792 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++F+K +Q GLY+ LRIG
Sbjct: 64 MWPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIG 123
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 124 PYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPII 183
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP P+INACNG C
Sbjct: 184 LSQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYC 243
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 244 --DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYH 301
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL R+PKWGHLK+LH AIKLC L++G +
Sbjct: 302 GGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRM 361
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA V++ SG C+AFL N + + V F N Y LP SISILPDCK +NT
Sbjct: 362 PLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNT 421
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q R K + W+ Y E + + GL++QI+ +D SDY WY
Sbjct: 422 ARVGAQ-TSRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWY 480
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+++ N P L V S GH +H F+NG+ +GSA+GS D+ T R V+LR
Sbjct: 481 MTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLR 540
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N A+LS+ VGLP+ G E AGV + + + + W Y+VGL GE
Sbjct: 541 AGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGES 600
Query: 497 LQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W+ + + + LTWYKTTF APAG+ P+A+++ SMGKG+ W+NGQS
Sbjct: 601 LSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQS 660
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEE 613
+GR+W ++K + G+ S+ Y +A+ YHVPR++LKP+GNLLV+ EE
Sbjct: 661 LGRHWPAYK-AVGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEE 719
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P GIT+ + VC + S+ + ++ + K P C G
Sbjct: 720 WGGDPNGITLVRREVDSVCADIYEWQ----STLVNYQLHASGKVNK-PLHPKAHLQCGPG 774
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
+KI+ + FASFG P+G C Y GSCH+ HS + C+G++ CS+ + FGGDPCP
Sbjct: 775 QKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCP 834
Query: 734 GIHKALLVDAQC 745
+ K L V+A C
Sbjct: 835 NVMKKLAVEAVC 846
>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 988
Score = 636 bits (1641), Expect = e-179, Method: Compositional matrix adjust.
Identities = 334/795 (42%), Positives = 471/795 (59%), Gaps = 78/795 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I KA+ GGL+ IQTYVFWN+HEP++G+YDF GR D+++FIK I +GLYV LR+G
Sbjct: 1 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V + FR++N+P+K
Sbjct: 61 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 120
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G Y+ WAA + + G+PWVMCKQ+DAPG +INACNG C
Sbjct: 121 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 180
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN +KPS+WTE+WT+ ++V+G P R+ +DIAF VA + +KNGS+VNYYMYH
Sbjct: 181 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 240
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A F+ T YYD APLDE+GL + PK+GHLK +H A++LC + L G +
Sbjct: 241 GGTNFGRTSAHFVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQT 300
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG E +E+ + VCAAFL NN+ R T+ F+ Y LP +SISILPDCKTV +NT
Sbjct: 301 LGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNT 360
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLL--DQISAAKDASDYF 386
++ Q++ R + K K+E + E I +LL + L+ + KD +DY
Sbjct: 361 AQIVAQHSWRDFVKSEKTSKGLKFEMFSENI----PSLLDGDSLIPGELYYLTKDKTDYA 416
Query: 387 WYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WYT + + + L V S GH L +VNGEY G AHG H+ SF V+
Sbjct: 417 WYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVN 476
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-KSFT-----NCSWGYQVGLIG 494
+ G N ++L V GLPDSG+++E + AG + + KS T N WG+ GL G
Sbjct: 477 FKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEG 536
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
EK ++Y+ G KV W + LTWYKT F P G + +A+ +++MGKG WVNG
Sbjct: 537 EKKEVYTEEGSKKVKWEK-DGKRKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIG 595
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLK--PTGNLLVLLE 612
+GRYW+SF + G P+QT+ YH+PR+F+K N+LV+LE
Sbjct: 596 VGRYWMSFLSPLGEPTQTE--------------------YHIPRSFMKGEKKKNMLVILE 635
Query: 613 EENGNPLGITVDTIAIRK--VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE G L ++D + + + +C +V + + SW R + + K K ++ C
Sbjct: 636 EEPGVKLE-SIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMR--C 692
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
P K++ ++ FASFG+P G C + +G C +S S+ VVE+ C+G++ CSI + FG
Sbjct: 693 PPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGDK 752
Query: 731 PCPGIHKALLVDAQC 745
CP I K L V +C
Sbjct: 753 GCPEIVKTLAVQVKC 767
>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
Length = 847
Score = 635 bits (1639), Expect = e-179, Method: Compositional matrix adjust.
Identities = 347/792 (43%), Positives = 460/792 (58%), Gaps = 56/792 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++F+K +Q GLY+ LRIG
Sbjct: 64 MWPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIG 123
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 124 PYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPII 183
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP P+INACNG C
Sbjct: 184 LSQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYC 243
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 244 --DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYH 301
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL R+PKWGHLK+LH AIKLC L++G +
Sbjct: 302 GGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRM 361
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA V++ SG C+AFL N + + V F N Y LP SISILPDCK +NT
Sbjct: 362 PLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNT 421
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q R K + W+ Y E + + GL++QI+ +D SDY WY
Sbjct: 422 ARVGAQ-TSRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWY 480
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+++ N P L V S GH +H F+NG+ +GSA+GS D+ T R V+LR
Sbjct: 481 MTDVKVDANEGFLRNGDLPTLTVLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNLR 540
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N A+LS+ VGLP+ G E AGV + + + + W Y+VGL GE
Sbjct: 541 AGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGES 600
Query: 497 LQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W+ + + + LTWYKTTF APAG+ P+A+++ SMGKG+ W+NGQS
Sbjct: 601 LSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQS 660
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEE 613
+GR+W ++K + G+ S+ Y +A+ YHVPR++LKP+GNLLV+ EE
Sbjct: 661 LGRHWPAYK-AVGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEE 719
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P GIT+ + VC + S+ + ++ + K P C G
Sbjct: 720 WGGDPNGITLVRREVDSVCADIYEWQ----STLVNYQLHASGKVNK-PLHPKAHLQCGPG 774
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
+KI+ + FASFG P+G C Y GSCH+ HS + C+G++ CS+ + FGGDPCP
Sbjct: 775 QKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCP 834
Query: 734 GIHKALLVDAQC 745
+ K L V+A C
Sbjct: 835 NVMKKLAVEAVC 846
>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
Length = 847
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 347/792 (43%), Positives = 460/792 (58%), Gaps = 56/792 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D++RF+K +Q GLY+ LRIG
Sbjct: 64 MWPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVRFVKLVQQSGLYLHLRIG 123
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 124 PYVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPII 183
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAAKMAV TGVPWVMCKQDDAP P+INACNG C
Sbjct: 184 LSQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYC 243
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 244 --DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYH 301
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL R+PKWGHLK+LH AIKLC L++G +
Sbjct: 302 GGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRM 361
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA V++ SG C+AFL N + + V F + Y LP SISILPDCK +NT
Sbjct: 362 PLGNYQEAHVYKAKSGACSAFLANYNPKSYAKVSFGSNHYNLPPWSISILPDCKNTVYNT 421
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q R K + W+ Y E + + GL++QI+ +D SDY WY
Sbjct: 422 ARVGAQ-TSRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWY 480
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+++ N P L V S GH +H F+NG+ +GSA+GS D+ T R V+LR
Sbjct: 481 MTDVKIDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLR 540
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N A+LS+ VGLP+ G E AGV + + + + W Y+VGL GE
Sbjct: 541 AGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLSGGRRDLSWQKWTYKVGLKGES 600
Query: 497 LQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W+ + + + LTWYKTTF APAG+ P+A+++ SMGKG+ W+NGQS
Sbjct: 601 LSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQS 660
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEE 613
+GR+W ++K + G+ S+ Y +A+ YHVPR++LKP+GNLLV+ EE
Sbjct: 661 LGRHWPAYK-AVGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEE 719
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P GI++ + VC + S+ + ++ + K P V C G
Sbjct: 720 WGGDPNGISLVRREVDSVCADIYEWQ----STLVNYQLHASGKVNK-PLHPKVHLQCGPG 774
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
+KI+ + FASFG P+G C Y GSCH HS + C+G++ CS+ + FGGDPCP
Sbjct: 775 QKITTVKFASFGTPEGTCGSYRQGSCHDHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCP 834
Query: 734 GIHKALLVDAQC 745
+ K L V+A C
Sbjct: 835 NVMKKLAVEAVC 846
>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 848
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 343/795 (43%), Positives = 460/795 (57%), Gaps = 59/795 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW SLI KAK GGLDV+ TYVFWNLHEP G YDF GRND+++FIK ++ GLYV LRIG
Sbjct: 60 MWESLIEKAKMGGLDVVDTYVFWNLHEPSPGIYDFEGRNDLVKFIKLVEKAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P WL V GI FR+DN+P+K
Sbjct: 120 PYICGEWNFGGFPAWLKFVPGISFRTDNEPFKLAMAKFTKKIVQMMKDERLFQSQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+T + F E G Y+ WAAKMAV TGVPWVMCKQDDAP P+IN CNG C
Sbjct: 180 LSQIENEYETEDKVFGEAGFAYMNWAAKMAVQMDTGVPWVMCKQDDAPDPMINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP+ WTE WT+++ +GG + R +D+AF VA FI K GS VNYYMYH
Sbjct: 240 --DYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPVEDLAFGVARFIQKGGSLVNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLK LH A+KLC + LLTG +
Sbjct: 298 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFGHLKRLHDAVKLCEKALLTGEPHDY 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+L Q+A VF +SG CAAFL N V F Y LP SISILPDCK+V +NT
Sbjct: 358 TLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTFNGRHYTLPPWSISILPDCKSVIYNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYFW 387
+V Q N+ S K +S WE Y E I + +++ + +GLL+Q++ KD SDY W
Sbjct: 418 AQVQVQTNQLSFLPT-KVES-FSWETYNENISSIEEDSSMSYDGLLEQLTITKDNSDYLW 475
Query: 388 YTFRFHYNSSNAQ------APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT + + + + L S GH +H F+NG+ GS+ G+HDN FT ++L
Sbjct: 476 YTTSVNVDPNESYLRGGKFPTLTATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINL 535
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
+ G N +LLS+ GLP++G E + GV H + + W Y+VGL GE
Sbjct: 536 QAGVNKVSLLSIAGGLPNNGPHYEEREMGVLGPVAIHGLDKGKMDLSRQKWSYKVGLKGE 595
Query: 496 KLQIYSNLGLNKVLWS--SIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
+ + S + V W+ S++ Q LTWYK F AP G++P+AL++ SM KG+ W+NG
Sbjct: 596 NMNLGSPSSVQAVDWAKDSLKQENAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWING 655
Query: 553 QSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
Q++GRYW T+ GN + Y+ F YHVPR++L PT NL+V+
Sbjct: 656 QNVGRYWTI--TANGNCTDCSYSGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVF 713
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE GNP I++ ++ +C + + P + + H+ G+ + + K + C
Sbjct: 714 EEVGGNPSRISLVKRSVTSICTEAS-QYRPVIKNVHMHQNNGELNEQNVLK---INLHCA 769
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I FASFG P G C + G+CHS S V+++ C+G+ RC + + FG DP
Sbjct: 770 AGQFISAIKFASFGTPSGACGSHKQGTCHSPKSDYVLQKLCVGRQRCLATIPTSIFGEDP 829
Query: 732 CPGIHKALLVDAQCR 746
CP + K L + C+
Sbjct: 830 CPNLRKKLSAEVVCQ 844
>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 840
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 355/793 (44%), Positives = 461/793 (58%), Gaps = 60/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F G D+++FIK +Q GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYIPGISFRTDNEPFKHQMQKFTTKIVDLMKAERLYESQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MA+ TGVPWVMCKQDD P P+IN CNG C
Sbjct: 179 MSQIENEYGPMEYEIGAAGKAYTKWAAEMAMGLGTGVPWVMCKQDDTPDPLINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 239 --DYFSPNKAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYH 296
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G V
Sbjct: 297 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+G QEA VF+ SG CAAFL N + + TV F N+ Y LP SISILPDCK +NT
Sbjct: 357 KIGNYQEAHVFKSKSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPDCKNTVYNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q + + K + + W + E D++ GLL+Q++ +D SDY WY
Sbjct: 417 ARVGSQ-SAQMKMTRVPIHGGFSWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWY 475
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ + + N + P L V S GH LH F+NG+ +G+A+GS + T V LR
Sbjct: 476 STDVVLDPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLR 535
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLSV VGLP+ G E AGV + + + W Y+VGL GE
Sbjct: 536 AGVNKISLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGEI 595
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ S + LTWYKTTF APAG P+AL++ SMGKG+ W+NGQ+
Sbjct: 596 LSLHSLSGSSSVEWIQGSLVSQRQPLTWYKTTFDAPAGTAPLALDMDSMGKGQVWLNGQN 655
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW ++K S G YA N C + YHVP+++LKPTGNLLV+ E
Sbjct: 656 LGRYWPAYKAS-GTCDYCDYAGTYNENKCRSNCG-EASQRWYHVPQSWLKPTGNLLVVFE 713
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P GI + I VC + P L S+ + + G + +P V SC
Sbjct: 714 ELGGDPNGIFLVRRDIDSVCADIYEWQ-PNLISY-QMQTSGKAPV-----RPKVHLSCSP 766
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KIS I FASFG P G C + GSCH+ S ER C+G++ C++ + FGGDPC
Sbjct: 767 GQKISSIKFASFGTPAGSCGNFHEGSCHAHKSYDAFERNCVGQNWCTVTVSPENFGGDPC 826
Query: 733 PGIHKALLVDAQC 745
P + K L V+A C
Sbjct: 827 PNVLKKLSVEAIC 839
>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
Length = 870
Score = 629 bits (1623), Expect = e-177, Method: Compositional matrix adjust.
Identities = 343/802 (42%), Positives = 455/802 (56%), Gaps = 65/802 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGG+DVI+TYVFWN HEP G Y F GR D+++F K IQ G+Y+ LRIG
Sbjct: 76 MWPGLVRLAKEGGVDVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIG 135
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GGLP+WLH V G FR+D++P+K
Sbjct: 136 PFVAAEWNFGGLPVWLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPII 195
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E A+ E G Y LWAAKMA+ +TGVPW+MC+Q DAP PVI+ CN C
Sbjct: 196 LSQVENEYGYYENAYGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYC 255
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK P SPNKP IWTE+W +++ +G + R A+D+A+ VA F K GS NYYMYH
Sbjct: 256 -DQFK-PISPNKPKIWTENWPGWFKTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYH 313
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL R PKWGHLKELH IK C LL ++
Sbjct: 314 GGTNFGRTAGGPFITTSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLL 373
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG LQEA V+E+ SG CAAFL N D++ V FR++SY LP S+SILPDCK VAFNT
Sbjct: 374 SLGPLQEADVYEDASGACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNT 433
Query: 329 ERVSTQ--------YNKRSKTSNLKFDSDE-KWEEYREAILNFDNTLLRAEGLLDQISAA 379
+V Q + S+ K D +WE ++E + G +D I+
Sbjct: 434 AKVGCQTSIVNMAPIDLHPTASSPKRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTT 493
Query: 380 KDASDYFWYTFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
KDA+DY WYT ++ + A L V+S GH +H F+N + SA G+ F
Sbjct: 494 KDATDYLWYTTSIFVHAEEDFLRNRGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQF 553
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGY 488
+ L+ G N+ ALLS+TVGL +GAF E AG V+V T +W Y
Sbjct: 554 KFGTPIALKAGKNEIALLSMTVGLQTAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTY 613
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKG 546
++GL GE L+I + L +W+ P +Q LTWYK AP GN+P+AL++ MGKG
Sbjct: 614 KIGLQGEHLRIQKSYNLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKG 673
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKP 603
AW+NGQ IGRYW +TSK TQ + C T YHVPR++ KP
Sbjct: 674 MAWLNGQEIGRYWPR-RTSKYENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKP 732
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
+GN+L++ EE G+P I + CGH++ H S+ +G ++I+ +
Sbjct: 733 SGNVLIIFEEIGGDPSQIRFSMRKVSGACGHLSVDH----PSFDVENLQG-SEIESDKNR 787
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLL 723
PT+ CP IS + FASFGNP+G C Y +G CH +S +VE+ C+ ++ C++ +
Sbjct: 788 PTLSLKCPTNTNISSVKFASFGNPNGTCGSYMLGDCHDQNSAALVEKVCLNQNECALEMS 847
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
S F CP K L V+ C
Sbjct: 848 SANFNMQLCPSTVKKLAVEVNC 869
>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
Length = 870
Score = 628 bits (1620), Expect = e-177, Method: Compositional matrix adjust.
Identities = 342/802 (42%), Positives = 455/802 (56%), Gaps = 65/802 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGG+DVI+TYVFWN HEP G Y F GR D+++F K IQ G+Y+ LRIG
Sbjct: 76 MWPGLVRLAKEGGVDVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIG 135
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GGLP+WLH V G FR+D++P+K
Sbjct: 136 PFVAAEWNFGGLPVWLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPII 195
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E A+ E G Y LWAAKMA+ +TGVPW+MC+Q DAP PVI+ CN C
Sbjct: 196 LSQVENEYGYYENAYGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYC 255
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK P SPNKP IWTE+W +++ +G + R A+D+A+ VA F K GS NYYMYH
Sbjct: 256 -DQFK-PISPNKPKIWTENWPGWFKTFGARDPHRPAEDVAYSVARFFQKGGSVQNYYMYH 313
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL R PKWGHLKELH IK C LL ++
Sbjct: 314 GGTNFGRTAGGPFITTSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLL 373
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG LQEA V+E+ SG CAAFL N D++ V FR++SY LP S+SILPDCK VAFNT
Sbjct: 374 SLGPLQEADVYEDASGACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNT 433
Query: 329 ERVSTQ--------YNKRSKTSNLKFDSDE-KWEEYREAILNFDNTLLRAEGLLDQISAA 379
+V Q + S+ K D +WE ++E + G +D I+
Sbjct: 434 AKVGCQTSIVNMAPIDLHPTASSPKRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTT 493
Query: 380 KDASDYFWYTFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
KDA+DY WYT ++ + A L V+S GH +H F+N + SA G+ F
Sbjct: 494 KDATDYLWYTTSIFVHAEEDFLRNRGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQF 553
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGY 488
+ L+ G N+ +LLS+TVGL +GAF E AG V+V T +W Y
Sbjct: 554 KFGTPIALKAGKNEISLLSMTVGLQTAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTY 613
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKG 546
++GL GE L+I + L +W+ P +Q LTWYK AP GN+P+AL++ MGKG
Sbjct: 614 KIGLQGEHLRIQKSYNLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKG 673
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKP 603
AW+NGQ IGRYW +TSK TQ + C T YHVPR++ KP
Sbjct: 674 MAWLNGQEIGRYWPR-RTSKYENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKP 732
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
+GN+L++ EE G+P I + CGH++ H S+ +G ++I+ +
Sbjct: 733 SGNVLIIFEEIGGDPSQIRFSMRKVSGACGHLSVDH----PSFDVENLQG-SEIENDKNR 787
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLL 723
PT+ CP IS + FASFGNP+G C Y +G CH +S +VE+ C+ ++ C++ +
Sbjct: 788 PTLSLKCPTNTNISSVKFASFGNPNGTCGSYMLGDCHDQNSAALVEKVCLNQNECALEMS 847
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
S F CP K L V+ C
Sbjct: 848 SANFNMQLCPSTVKKLAVEVNC 869
>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
Length = 839
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 354/793 (44%), Positives = 454/793 (57%), Gaps = 58/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++FI+ +Q GLYV LRIG
Sbjct: 56 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIRLVQQAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 116 PYACAEWNFGGFPVWLKYIPGISFRTDNGPFKFQMQKFTTKIVNIMKAERLYESQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MA+ TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 176 LSQIENEYGPMEYELGAPGKAYAQWAAHMAIGLGTGVPWVMCKQDDAPDPVINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 236 --DYFSPNKAYKPKMWTEAWTGWFTGFGGTVPHRPAEDLAFSVARFIQKGGSFINYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++ V
Sbjct: 294 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSADPTVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ SG CAAFL N + TV F N Y LP SISILP+CK +NT
Sbjct: 354 RLGNYQEAHVFKSKSGACAAFLANYNPHSYSTVAFGNQHYNLPPWSISILPNCKHTVYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
R+ +Q + + K + + W+ + E D++ GLL+QI+A +D SDY WY
Sbjct: 414 ARLGSQ-SAQMKMTRVPIHGGLSWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWY 472
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ N N + P L V S GH LH F+NG+ +G+ +GS D T +V+LR
Sbjct: 473 STDVVINPDEGYFRNGKNPVLTVLSAGHALHVFINGQLSGTVYGSLDFPKLTFSESVNLR 532
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLSV VGLP+ G E AGV + + + T W Y+VGL GE
Sbjct: 533 AGVNKISLLSVAVGLPNVGPHFETWNAGVLGPITLNGLNEGRRDLTWQKWSYKVGLKGED 592
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W + S + LTWYKTTF APAG P+AL++ SMGKG+ W+NGQS
Sbjct: 593 LSLHSLSGSSSVDWLQGYLVSRRQPLTWYKTTFDAPAGVAPLALDMNSMGKGQVWLNGQS 652
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW ++K + G+ YA N C + YHVP ++LKPTGNLLV+ E
Sbjct: 653 LGRYWPAYKAT-GSCDYCNYAGTYNEKKCGTNCG-EASQRWYHVPHSWLKPTGNLLVMFE 710
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P G+ + I VC + +S ++ + + P SC
Sbjct: 711 ELGGDPNGVFLVRRDIDSVCADIYEWQPNLVSYQMQASGKVSRPV-----SPKAHLSCGP 765
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KIS I FASFG P G C Y GSCH+ S +R C+G+S C++ + FGGDPC
Sbjct: 766 GQKISSIKFASFGTPVGSCGNYREGSCHAHKSYDAFQRNCVGQSSCTVTVSPEIFGGDPC 825
Query: 733 PGIHKALLVDAQC 745
P + K L V+A C
Sbjct: 826 PNVMKKLSVEAIC 838
>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 841
Score = 627 bits (1616), Expect = e-177, Method: Compositional matrix adjust.
Identities = 351/793 (44%), Positives = 461/793 (58%), Gaps = 60/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F G D+++FIK +Q GLYV LRIG
Sbjct: 60 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN+P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYIPGISFRTDNEPFKVQMQKFTTKIVDLMKAERLYESQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MA++ TGVPW+MCKQDD P P+IN CNG C
Sbjct: 180 MSQIENEYGPMEYEIGAAGKAYTKWAAEMAMELGTGVPWIMCKQDDTPDPLINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 240 --DYFSPNKAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQKGGSFINYYMYH 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G V
Sbjct: 298 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+G QEA VF+ SG CAAFL N + + TV F N+ Y LP SISILP+CK +NT
Sbjct: 358 KIGNYQEAHVFKSMSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPNCKNTVYNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q + + K + + W + E D++ GLL+Q++ +D SDY WY
Sbjct: 418 ARVGSQ-SAQMKMTRVPIHGGLSWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWY 476
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ + + N + P L V S GH LH F+NG+ +G+A+GS + T V LR
Sbjct: 477 STDVVLDPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLR 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLSV VGLP+ G E AGV + + + W Y+VGL GE
Sbjct: 537 TGVNKISLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGET 596
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ S + LTWYKTTF AP G P+AL++ SMGKG+ W+NGQ+
Sbjct: 597 LSLHSLGGSSSVEWIQGSLVSQRQPLTWYKTTFDAPDGTAPLALDMNSMGKGQVWLNGQN 656
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW ++K S G YA N C + YHVP+++LKPTGNLLV+ E
Sbjct: 657 LGRYWPAYKAS-GTCDYCDYAGTYNENKCRSNCG-EASQRWYHVPQSWLKPTGNLLVVFE 714
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+ GI++ I VC + P L S+ + + G + +P V SC
Sbjct: 715 ELGGDLNGISLVRRDIDSVCADIYEWQ-PNLISY-QMQTSGKAPV-----RPKVHLSCSP 767
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KIS I FASFG P G C + GSCH+ S ER C+G++ C++ + FGGDPC
Sbjct: 768 GQKISSIKFASFGTPVGSCGNFHEGSCHAHMSYDAFERNCVGQNLCTVAVSPENFGGDPC 827
Query: 733 PGIHKALLVDAQC 745
P + K L V+A C
Sbjct: 828 PNVLKKLSVEAIC 840
>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
Length = 785
Score = 627 bits (1616), Expect = e-177, Method: Compositional matrix adjust.
Identities = 346/797 (43%), Positives = 453/797 (56%), Gaps = 78/797 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP +GQY F GR D++ FIK ++ GLYV LRIG
Sbjct: 14 MWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKLVKQAGLYVHLRIG 73
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 74 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVDMMKSEGLFEWQGGPII 133
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MAV +T VPWVMCK+DDAP P+IN CNG C
Sbjct: 134 LSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDAPDPIINTCNGFYC 193
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WTS+Y +G R +D+A+ VA FI K GS+VNYYMYH
Sbjct: 194 --DWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYH 251
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+REPKWGHLKELH AIKLC L+ G V
Sbjct: 252 GGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLCEPALVAGDPIVT 311
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A VF ++ C AFL N D+ V F + Y LP SISILPDCKT +NT
Sbjct: 312 SLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYNLPPWSISILPDCKTTVYNT 371
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q ++ +++ W+ Y E I + + GLL+QI+ +D +DY WY
Sbjct: 372 ARVGSQISQM----KMEWAGGFTWQSYNEDINSLGDESFVTVGLLEQINVTRDNTDYLWY 427
Query: 389 TFRFHYNS-----SNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T SN + P L V S GH LH FVNG+ TG+ +GS D+ T R V L
Sbjct: 428 TTYVDVAQDEQFLSNGKNPVLTVMSAGHALHIFVNGQLTGTVYGSVDDPKLTYRGNVKLW 487
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIGEK 496
G+N + LS+ VGLP+ G E AG+ D + T W Y+VGL GE
Sbjct: 488 PGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGED 547
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L ++S G + V W + LTWYK F AP G++P+AL++ SMGKG+ W+NGQ IG
Sbjct: 548 LSLHSLSGSSSVEWGEPMQ-KQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIG 606
Query: 557 RYWVSFKTS--------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
RYW +K S +G + + N S + YHVPR++L PTGNLL
Sbjct: 607 RYWPGYKASGTCGICDYRGEYDEKKCQTNCGDS--------SQRWYHVPRSWLNPTGNLL 658
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
V+ EE G+P GI++ +C V+ P +++W K +K +
Sbjct: 659 VIFEEWGGDPTGISMVKRTTGSICADVSEWQ-PSMTNWR----------TKDYEKAKIHL 707
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
C G+K++ I FASFG P G C Y+ G CH+ S + + CIG+ RC + ++ FG
Sbjct: 708 QCDHGRKMTDIKFASFGTPQGSCGSYSEGGCHAHKSYDIFWKNCIGQERCGVSVVPNVFG 767
Query: 729 GDPCPGIHKALLVDAQC 745
GDPCPG K +V+A C
Sbjct: 768 GDPCPGTMKRAVVEAIC 784
>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 852
Score = 626 bits (1615), Expect = e-176, Method: Compositional matrix adjust.
Identities = 353/793 (44%), Positives = 463/793 (58%), Gaps = 61/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWN HEP G Y F GR D++RFIK +Q GL++ LRIG
Sbjct: 60 MWEGLIQKAKDGGLDVIDTYVFWNGHEPSPGNYYFEGRYDLVRFIKTVQKAGLFLHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 120 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKVAMQGFTQKIVQMMKNEKLFASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY A G Y+ WAAKMAV TGVPWVMCK+DDAP P+INACNG C
Sbjct: 180 LSQIENEYGPERKALGAPGQNYINWAAKMAVGLDTGVPWVMCKEDDAPDPMINACNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG + R QD+AF VA FI + GSYVNYYMYH
Sbjct: 240 -DGFT-PNKPYKPTMWTEAWSGWFLEFGGTIHHRPVQDLAFAVARFIQRGGSYVNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLKELH AIKLC LL+ V
Sbjct: 298 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEHSLLSSEPTVT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG +A+VF CAAFL N +A V F N Y+LP S+SILPDC+ +NT
Sbjct: 358 SLGTYHQAYVFNSGPRRCAAFLSNFHSVEA-RVTFNNKHYDLPPWSVSILPDCRNEVYNT 416
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
+V Q + + +N + S W+ Y E I + + + + A GLL+QI+ +D SDY
Sbjct: 417 AKVGVQTSHVQMIPTNSRLFS---WQTYDEDISSVHERSSIPAIGLLEQINVTRDTSDYL 473
Query: 387 WYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
WY +SS+ + L VQS GH LH FVNG+++GSA G+ + FT + V+L
Sbjct: 474 WYMTNVDISSSDLSGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFADPVNLH 533
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIGEK 496
G N ALLS+ VGLP+ G E G+ D K T W +VGL GE
Sbjct: 534 AGINRIALLSIAVGLPNVGLHYESWKTGIQGPVFLDGLGNGKKDLTLHKWFNKVGLKGEA 593
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
+ + S G + V W S+ + T+Q L WYK F AP GN+P+AL+++ MGKG+ W+NGQ
Sbjct: 594 MNLVSPNGASSVGWIRRSLATQTKQTLKWYKAYFNAPGGNEPLALDMRRMGKGQVWINGQ 653
Query: 554 SIGRYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
SIGRYW+++ +KG+ S Y T YHVPR++LKPT NL+V+ E
Sbjct: 654 SIGRYWMAY--AKGDCSSCSYIGTFRPTKCQLHCGRPTQRWYHVPRSWLKPTQNLVVVFE 711
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P IT+ ++ VCG + +H P ++ G+ D K + V C
Sbjct: 712 ELGGDPSKITLVRRSVAGVCGDLHENH-PNAENF---DVDGNEDSKTL-HQAQVHLHCAP 766
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+ IS I FASFG P G C + G+CH+++S VVE+ CIG+ CS+ + + F DPC
Sbjct: 767 GQSISSIKFASFGTPSGTCGSFQQGTCHATNSHAVVEKNCIGRESCSVAVSNSTFETDPC 826
Query: 733 PGIHKALLVDAQC 745
P + K L V+A C
Sbjct: 827 PNVLKRLSVEAVC 839
>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 1052
Score = 626 bits (1614), Expect = e-176, Method: Compositional matrix adjust.
Identities = 331/791 (41%), Positives = 467/791 (59%), Gaps = 74/791 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I KA+ GGL+ IQTYVFWN+HEP++G+YDF GR D+++FIK I +GLYV LR+G
Sbjct: 69 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 128
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V + FR++N+P+K
Sbjct: 129 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 188
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ E G Y+ WAA + + G+PWVMCKQ+DAPG +INACNG C
Sbjct: 189 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 248
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN +KPS+WTE+WT+ ++V+G P R+ +DIAF VA + +KNGS+VNYYMYH
Sbjct: 249 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 308
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A F+ T YYD APLDE+GL + PK+GHLK +H A++LC + L G +
Sbjct: 309 GGTNFGRTSAHFVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQT 368
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG E +E+ + VCAAFL NN+ R T+ F+ Y LP +SISILPDCKTV +NT
Sbjct: 369 LGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNT 428
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLL--DQISAAKDASDYF 386
++ Q++ R + K K+E + E I +LL + L+ + KD +DY
Sbjct: 429 AQIVAQHSWRDFVKSEKTSKGLKFEMFSENI----PSLLDGDSLIPGELYYLTKDKTDYA 484
Query: 387 WYTFRFHY--NSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
+ + L V S GH L +VNGEY G AHG H+ SF V+ + G
Sbjct: 485 CVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTG 544
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-KSFT-----NCSWGYQVGLIGEKLQ 498
N ++L V GLPDSG+++E + AG + + KS T N WG+ GL GEK +
Sbjct: 545 DNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKE 604
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
+Y+ G KV W + LTWYKT F P G + +A+ +++MGKG WVNG +GRY
Sbjct: 605 VYTEEGSKKVKWEK-DGKRKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRY 663
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLK--PTGNLLVLLEEENG 616
W+SF + G P+QT+ YH+PR+F+K N+LV+LEEE G
Sbjct: 664 WMSFLSPLGEPTQTE--------------------YHIPRSFMKGEKKKNMLVILEEEPG 703
Query: 617 NPLGITVDTIAIRK--VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGK 674
L ++D + + + +C +V + + SW R + + K K ++ CP K
Sbjct: 704 VKLE-SIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMR--CPPEK 760
Query: 675 KISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPG 734
++ ++ FASFG+P G C + +G C +S S+ VVE+ C+G++ CSI + FG CP
Sbjct: 761 QMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGDKGCPE 820
Query: 735 IHKALLVDAQC 745
I K L V +C
Sbjct: 821 IVKTLAVQVKC 831
>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
Length = 839
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 348/795 (43%), Positives = 456/795 (57%), Gaps = 66/795 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L KAK+GGLDVIQTYVFWN HEP G Y+F GR D+++FIK Q GL+V LRIG
Sbjct: 57 MWEGLFQKAKDGGLDVIQTYVFWNGHEPTPGNYNFEGRYDLVKFIKTAQKAGLFVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSEELFASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +F G Y WAAKMAV TGVPWVMCKQDDAP PVINACNG C
Sbjct: 177 LSQIENEYGPEGKSFGAAGKSYSNWAAKMAVGLDTGVPWVMCKQDDAPDPVINACNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE WT ++ +GG R +D++F VA F+ K GS++NYYMYH
Sbjct: 237 -DAFS-PNKPYKPTMWTEAWTGWFTEFGGTIRKRPVEDLSFAVARFVQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL REPK+GHLKELH A+KLC L++ V
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKYGHLKELHRAVKLCEPALVSVDPAVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF S CAAFL N + V+F N Y LP SISILPDCKTV FNT
Sbjct: 355 TLGSMQEAHVFRSPSS-CAAFLANYNSNSHANVVFNNEHYSLPPWSISILPDCKTVVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
V Q ++ ++ +S WE Y E + + LL GLL+Q++ +D+SDY W
Sbjct: 414 ATVGVQTSQMQMWAD--GESSMMWERYDEEVGSLAAAPLLTTTGLLEQLNVTRDSSDYLW 471
Query: 388 YTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S L VQS GH LH F+NG+ GSA G+ + F+ + +L
Sbjct: 472 YITSVDVSPSEKFLQGGEPLSLTVQSAGHALHIFINGQLQGSASGTREAKKFSYKGNANL 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R GTN ALLS+ GLP+ G E G+ H + V + T +W YQVGL GE
Sbjct: 532 RAGTNKIALLSIACGLPNVGVHYETWNTGIVGPVVLHGLDVGSRDLTWQTWSYQVGLKGE 591
Query: 496 KLQIYSNLGLNKVLWSS----IRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
++ + S G + V W ++P L+WY+ F P G++P+AL++ SMGKG+ W+N
Sbjct: 592 QMNLNSLEGASSVEWMQGSLLAQAP---LSWYRAYFDTPTGDEPLALDMGSMGKGQIWIN 648
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRY S+ + G+ YA + + T YHVP+++L+P+ NLLV+
Sbjct: 649 GQSIGRYSTSY--ASGDCKACSYAGSYRAPKCQAGCGQPTQRWYHVPKSWLQPSRNLLVV 706
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE G+ I++ ++ VC V+ H + +W + G+ + +P V C
Sbjct: 707 FEELGGDSSKISLVKRSVSSVCADVSEYHT-NIKNW-QIENAGEVEF----HRPKVHLRC 760
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
G+ IS I FASFG P G C + G CHS+ S V+E+ CIG+ RC++ + FGGD
Sbjct: 761 APGQTISAIKFASFGTPLGTCGNFQQGDCHSTKSHAVLEKNCIGQQRCAVTISPDNFGGD 820
Query: 731 PCPGIHKALLVDAQC 745
PCP K + V+A C
Sbjct: 821 PCPKEMKKVAVEAVC 835
>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
Length = 851
Score = 625 bits (1613), Expect = e-176, Method: Compositional matrix adjust.
Identities = 335/792 (42%), Positives = 450/792 (56%), Gaps = 56/792 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AK+GG D ++TYVFWN HEP +GQY F R D++RF K ++ GLY+ LRIG
Sbjct: 68 MWPKLVAEAKDGGADCVETYVFWNGHEPAQGQYYFEERFDLVRFAKIVKDAGLYMILRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EWT+GG+P+WLH G VFR++N+P+K
Sbjct: 128 PFVAAEWTFGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E A+ PY +WAA MA+ +TGVPW+MC+Q DAP PVIN CN C
Sbjct: 188 LAQVENEYGDMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNSP KP WTE+W ++Q +G R +D+AF VA F K GS NYY+YH
Sbjct: 248 -DQFK-PNSPTKPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYVYH 305
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT IT YD AP+DEYGL R PKW HL++LH +IKL LL G + +
Sbjct: 306 GGTNFGRTTGGPFITTSYDYDAPIDEYGLRRLPKWAHLRDLHKSIKLGEHTLLYGNSSFV 365
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ + SG C AFL N D K V F++ SY+LP S+SILPDCK VAFNT
Sbjct: 366 SLGPQQEADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNT 425
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q +NL+ + W +RE + N L G +D I+ KD++DY W
Sbjct: 426 AKVRSQTLMMDMVPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLW 485
Query: 388 YTFRFHYNSSN---AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
YT F + S+ L ++S GH + AF+N E GSA+G+ +F++ V+LR G
Sbjct: 486 YTTSFDVDGSHLAGGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAG 545
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDK-----SFTNCSWGYQVGLIGEKLQI 499
N +LLS+TVGL + G E AG+ V++ ++ W Y++GL GE +
Sbjct: 546 KNKLSLLSMTVGLQNGGPMYEWAGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSL 605
Query: 500 YSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+ + W P + +TWYK P G+DP+ L++QSMGKG AW+NG +IGR
Sbjct: 606 FKADKGKDIRWMPQSEPPKNQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGR 665
Query: 558 YW--VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
YW +S + + S + YHVPR++ P+GN LV+ EE+
Sbjct: 666 YWPRISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKG 725
Query: 616 GNPLGITVDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P IT + VC V+ H P L SW R+ Q D K VQ SCP G
Sbjct: 726 GDPTKITFSRRTVASVCSFVSE-HYPSIDLESWDRNTQNDGRDAAK------VQLSCPKG 778
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
K IS + FASFGNP G C Y GSCH +S VVE+AC+ + C++ L FG D CP
Sbjct: 779 KSISSVKFASFGNPSGTCRSYQQGSCHHPNSISVVEKACLNMNGCTLSLSDEGFGEDLCP 838
Query: 734 GIHKALLVDAQC 745
G+ K L ++A C
Sbjct: 839 GVTKTLAIEADC 850
>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 625 bits (1611), Expect = e-176, Method: Compositional matrix adjust.
Identities = 351/805 (43%), Positives = 450/805 (55%), Gaps = 75/805 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP + QYDF GRND+++F+K + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRRQYDFKGRNDLVKFVKTVAEAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN P+K
Sbjct: 116 PYVCAEWNYGGFPLWLHFIPGIQFRTDNGPFKEEMQIFTAKIVDMMKKENLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ WAA MA TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 176 LSQIENEYGNIDSAYGSAAKSYIQWAASMATSLDTGVPWVMCQQADAPDPMINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+WT ++ +GG R +DIAF VA F G++ NYYMYH
Sbjct: 236 DQF--TPNSVKKPKMWTENWTGWFLSFGGAVPYRPVEDIAFAVARFFQLGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT F+ T Y AP+DEYGL+R+PKWGHLK+LH AIKLC L+ +
Sbjct: 294 GGTNFGRTTGGPFIATSYDYDAPIDEYGLLRQPKWGHLKDLHKAIKLCEAALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ +G CAAFL N TV F SY LP S+SILPDCK VA NT
Sbjct: 354 SLGTNLEASVYKTGTGSCAAFLANVRTNSDATVNFSGNSYHLPAWSVSILPDCKNVALNT 413
Query: 329 ERV-STQYNKRSKTSNLKFDSDEK------WEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++ S R +LK D D W E + N GLL+QI+ D
Sbjct: 414 AQINSMAVMPRFMQQSLKNDIDSSDGFQSGWSWVDEPVGISKNNAFTKLGLLEQINITAD 473
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ + +Q L V+S GH LHAF+NG+ GS G+ N T+
Sbjct: 474 KSDYLWYSLSTEIQGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGNSGNAKVTV 533
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------WG 487
V L G N LLS+TVGL + GAF +++ AG+ ++ K N + W
Sbjct: 534 DIPVTLIHGKNTIDLLSLTVGLQNYGAFYDKQGAGITG-PIKLKGLANGTTVDLSSQQWT 592
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKG 546
YQVGL GE+L + S V S++ P +Q L WYKTTF APAGNDP+AL+ MGKG
Sbjct: 593 YQVGLQGEELGLPSGSSSKWVAGSTL--PKKQPLIWYKTTFDAPAGNDPVALDFMGMGKG 650
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQT-----QYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
EAWVNGQSIGRYW ++ +S G + + Y+ N + C + YHVPR++L
Sbjct: 651 EAWVNGQSIGRYWPAYVSSNGGCTSSCNYRGPYSSNKC--LKNCG-KPSQQLYHVPRSWL 707
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
+P+GN LVL EE G+P I+ T + +C V+ H P+ W G
Sbjct: 708 QPSGNTLVLFEEIGGDPTQISFATKQVESLCSRVSEYHPLPVDMWGSDLTTGRK------ 761
Query: 662 KKPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSI 720
P + CP + IS I FASFG P G C ++ C S + +V+ ACIG CSI
Sbjct: 762 SSPMLSLECPFPNQVISSIKFASFGTPRGTCGSFSHSKCSSRTALSIVQEACIGSKSCSI 821
Query: 721 PLLSRYFGGDPCPGIHKALLVDAQC 745
+ F GDPC GI K+L V+A C
Sbjct: 822 GVSIDTF-GDPCSGIAKSLAVEASC 845
>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
Length = 851
Score = 624 bits (1610), Expect = e-176, Method: Compositional matrix adjust.
Identities = 334/792 (42%), Positives = 449/792 (56%), Gaps = 56/792 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AK+GG D ++TYVFWN HEP +GQY F R D++RF K ++ GLY+ LRIG
Sbjct: 68 MWPKLVAEAKDGGADCVETYVFWNGHEPAQGQYYFEERFDLVRFAKIVKDAGLYMILRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EWT+GG+P+WLH G VFR++N+P+K
Sbjct: 128 PFVAAEWTFGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E A+ PY +WAA MA+ +TGVPW+MC+Q DAP PVIN CN C
Sbjct: 188 LAQVENEYGDMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNSP KP WTE+W ++Q +G R +D+AF VA F K GS NYY+YH
Sbjct: 248 -DQFK-PNSPTKPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYVYH 305
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT IT YD AP+DEYGL R PKW HL++LH +IKL LL G + +
Sbjct: 306 GGTNFGRTTGGPFITTSYDYDAPIDEYGLRRLPKWAHLRDLHKSIKLGEHTLLYGNSSFV 365
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ + SG C AFL N D K V F++ SY+LP S+SILPDCK VAFNT
Sbjct: 366 SLGPQQEADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNT 425
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q +NL+ + W +RE + N L G +D I+ KD++DY W
Sbjct: 426 AKVRSQTLMMDMVPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLW 485
Query: 388 YTFRFHYNSSN---AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
YT F + S+ L ++S GH + AF+N E GSA+G+ +F++ V+LR G
Sbjct: 486 YTTSFDVDGSHLAGGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAG 545
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDK-----SFTNCSWGYQVGLIGEKLQI 499
N +LLS+TVGL + G E AG+ V++ ++ W Y++GL GE +
Sbjct: 546 KNKLSLLSMTVGLQNGGPMYEWAGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSL 605
Query: 500 YSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+ + W P + +TWYK P G+DP+ L++QSMGKG AW+NG +IGR
Sbjct: 606 FKADKGKDIRWMPQSEPPKNQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGR 665
Query: 558 YW--VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
YW +S + + S + YHVPR++ P+GN LV+ EE+
Sbjct: 666 YWPRISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKG 725
Query: 616 GNPLGITVDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P IT + VC V+ H P L SW R+ Q D K VQ SCP G
Sbjct: 726 GDPTKITFSRRTVASVCSFVS-EHYPSIDLESWDRNTQNDGRDAAK------VQLSCPKG 778
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
K IS + F SFGNP G C Y GSCH +S VVE+AC+ + C++ L FG D CP
Sbjct: 779 KSISSVKFVSFGNPSGTCRSYQQGSCHHPNSISVVEKACLNMNGCTVSLSDEGFGEDLCP 838
Query: 734 GIHKALLVDAQC 745
G+ K L ++A C
Sbjct: 839 GVTKTLAIEADC 850
>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
Length = 894
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 348/829 (41%), Positives = 470/829 (56%), Gaps = 93/829 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG+DVIQTY FW+ HEP +GQY+F GR DI++F + + GLY+ LRIG
Sbjct: 66 MWPDLIAKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR++N +K
Sbjct: 126 PYVCAEWNFGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE F +KG Y+ WAA+MA+ GVPWVMCKQ DAPG +I+ACNG C
Sbjct: 186 MLQIENEYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ +K PNS NKP++WTEDW +Y WGG+ R +D+AF VA F + GS+ NYYMY
Sbjct: 246 -DGYK-PNSYNKPTMWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYF 303
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRT+ F IT Y AP+DEYGL+ EPKWGHLK+LHAAIKLC L+ + N
Sbjct: 304 GGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNY 363
Query: 268 ISLGQLQEAFVFE---ETSGV----------CAAFLVNNDERKAVTVLFRNISYELPRKS 314
I LG QEA V+ T G+ C+AFL N DE KA +V F Y LP S
Sbjct: 364 IKLGPKQEAHVYRMNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWS 423
Query: 315 ISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDS-----------------DEKWEEYRE 357
+SILPDC+ V +NT +V Q + ++ +L S + W +E
Sbjct: 424 VSILPDCRNVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKE 483
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHG 409
+ + +G+L+ ++ KD SDY W+ R + +N A + + S
Sbjct: 484 PVGVWSENNFTVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMR 543
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
+L FVNG+ TGS G V V +G ND LL+ TVGL + GAFLE+ A
Sbjct: 544 DVLRVFVNGQLTGSVIGHWVKV----EQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGA 599
Query: 470 GVH-RVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ---L 520
G ++++ D F+ W YQVGL GE L+IY+ K W+ + SP
Sbjct: 600 GFRGQIKLTGFKNGDIDFSKLLWTYQVGLKGEFLKIYTIEENEKASWAEL-SPDDDPSTF 658
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNT 578
WYKT F +PAG DP+AL+L SMGKG+AWVNG IGRYW G P Y A ++
Sbjct: 659 IWYKTYFDSPAGTDPVALDLGSMGKGQAWVNGHHIGRYWTLVAPEDGCPEICDYRGAYDS 718
Query: 579 VTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
C K T T YHVPR++L+ + NLLV+LEE GNP I++ + +C V+
Sbjct: 719 DKCSFNCG--KPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSAGVLCAQVSE 776
Query: 638 SHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVG 697
SH PP+ W + D I P + C G IS I FAS+G P G C+++++G
Sbjct: 777 SHYPPVQKWF-NPDSVDEKITVNDLTPEMHLQCQDGFTISSIEFASYGTPQGSCQKFSMG 835
Query: 698 SCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQCR 746
+CH+++S +V ++C+GK+ CS+ + + FGGDPC G+ K L V+A+CR
Sbjct: 836 NCHATNSSSIVSKSCLGKNSCSVEISNISFGGDPCRGVVKTLAVEARCR 884
>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 919
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 334/792 (42%), Positives = 449/792 (56%), Gaps = 56/792 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AK+GG D ++TYVFWN HEP +GQY F R D++RF K ++ GLY+ LRIG
Sbjct: 136 MWPKLVAEAKDGGADCVETYVFWNGHEPAQGQYYFEERFDLVRFAKIVKDAGLYMILRIG 195
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EWT+GG+P+WLH G VFR++N+P+K
Sbjct: 196 PFVAAEWTFGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHII 255
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E A+ PY +WAA MA+ +TGVPW+MC+Q DAP PVIN CN C
Sbjct: 256 LAQVENEYGDMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYC 315
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNSP KP WTE+W ++Q +G R +D+AF VA F K GS NYY+YH
Sbjct: 316 -DQFK-PNSPTKPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYVYH 373
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT IT YD AP+DEYGL R PKW HL++LH +IKL LL G + +
Sbjct: 374 GGTNFGRTTGGPFITTSYDYDAPIDEYGLRRLPKWAHLRDLHKSIKLGEHTLLYGNSSFV 433
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ + SG C AFL N D K V F++ SY+LP S+SILPDCK VAFNT
Sbjct: 434 SLGPQQEADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNT 493
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q +NL+ + W +RE + N L G +D I+ KD++DY W
Sbjct: 494 AKVRSQTLMMDMVPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLW 553
Query: 388 YTFRFHYNSSN---AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
YT F + S+ L ++S GH + AF+N E GSA+G+ +F++ V+LR G
Sbjct: 554 YTTSFDVDGSHLAGGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAG 613
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDK-----SFTNCSWGYQVGLIGEKLQI 499
N +LLS+TVGL + G E AG+ V++ ++ W Y++GL GE +
Sbjct: 614 KNKLSLLSMTVGLQNGGPMYEWAGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSL 673
Query: 500 YSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+ + W P + +TWYK P G+DP+ L++QSMGKG AW+NG +IGR
Sbjct: 674 FKADKGKDIRWMPQSEPPKNQPMTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGR 733
Query: 558 YW--VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
YW +S + + S + YHVPR++ P+GN LV+ EE+
Sbjct: 734 YWPRISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKG 793
Query: 616 GNPLGITVDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P IT + VC V+ H P L SW R+ Q D K VQ SCP G
Sbjct: 794 GDPTKITFSRRTVASVCSFVSE-HYPSIDLESWDRNTQNDGRDAAK------VQLSCPKG 846
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
K IS + F SFGNP G C Y GSCH +S VVE+AC+ + C++ L FG D CP
Sbjct: 847 KSISSVKFVSFGNPSGTCRSYQQGSCHHPNSISVVEKACLNMNGCTVSLSDEGFGEDLCP 906
Query: 734 GIHKALLVDAQC 745
G+ K L ++A C
Sbjct: 907 GVTKTLAIEADC 918
>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 623 bits (1607), Expect = e-175, Method: Compositional matrix adjust.
Identities = 339/807 (42%), Positives = 455/807 (56%), Gaps = 78/807 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI AKEGG+DVI+TYVFWN HE Y F GR D+++FI + + GLY+ LRIG
Sbjct: 52 MWPSLIQNAKEGGVDVIETYVFWNGHELSPDNYHFDGRFDLVKFINIVHNAGLYLILRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+P+WLH + VFR+DN +K
Sbjct: 112 PFVAAEWNFGGVPVWLHYIPNTVFRTDNASFKFYMQKFTTYIVSLMKKEKLFASQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY IE + E G PY +WAA+MAV + GVPW+MC+Q DAP PVIN CN C
Sbjct: 172 LSQVENEYGDIERVYGEGGKPYAMWAAQMAVSQNIGVPWIMCQQYDAPDPVINTCNSFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNSPNKP +WTE+W +++ +G + R +DIAF VA F K GS NYYMYH
Sbjct: 232 DQF--TPNSPNKPKMWTENWPGWFKTFGARDPHRPPEDIAFSVARFFQKGGSLQNYYMYH 289
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL R PKWGHLKELH AIKL R LL +
Sbjct: 290 GGTNFGRTAGGPFITTSYDYDAPIDEYGLPRLPKWGHLKELHRAIKLTERVLLNSEPTYV 349
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ ++SG CAAF+ N DE+ TV FRNISY LP S+SILPDCK V FNT
Sbjct: 350 SLGPSLEADVYTDSSGACAAFIANIDEKDDKTVQFRNISYHLPAWSVSILPDCKNVVFNT 409
Query: 329 ERVSTQYNKRSKTSNLKFDSDE---------------KWEEYREAILNFDNTLLRAEGLL 373
+ RS+T+ ++ +E KWE + E + L+
Sbjct: 410 AMI------RSQTAMVEMVPEELQPSADATNKDLKALKWEVFVEQPGIWGKADFVKNVLV 463
Query: 374 DQISAAKDASDYFWYTFRFHYNSSN-----AQAPLDVQSHGHILHAFVNGEYTGSAHGSH 428
D ++ KD +DY WYT N + +Q L V+S GH LHAF+N + SA G+
Sbjct: 464 DHLNTTKDTTDYLWYTTSIFVNENEKFLKGSQPVLVVESKGHALHAFINKKLQVSATGNG 523
Query: 429 DNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-----KSFTN 483
+++F + + L+ G N+ ALLS+TVGL ++G F E AG+ +V ++ ++
Sbjct: 524 SDITFKFKQAISLKAGKNEIALLSMTVGLQNAGPFYEWVGAGLSKVVIEGFNNGPVDLSS 583
Query: 484 CSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQ 541
+W Y++GL GE L IY G+ V W S R P +Q LTWYK P+GN+P+ L++
Sbjct: 584 YAWSYKIGLQGEHLGIYKPDGIKNVKWLSSREPPKQQPLTWYKVILDPPSGNEPVGLDMV 643
Query: 542 SMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPR 598
MGKG AW+NG+ IGRYW + K+S + + C T YHVPR
Sbjct: 644 HMGKGLAWLNGEEIGRYWPT-KSSIHDVCVQKCDYRGKFRPDKCLTGCGEPTQRWYHVPR 702
Query: 599 AFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIK 658
++ KP+GN+LV+ EE+ G+P I + + +C H+ H P + SW + +
Sbjct: 703 SWFKPSGNILVIFEEKGGDPTQIRLSKRKVLGICAHLGEGH-PSIESW------SEAENV 755
Query: 659 KFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
+ K TV CP +I+KI FASFG P G C Y++G CH +S +VE+ C+ ++ C
Sbjct: 756 ERKSKATVDLKCPDNGRIAKIKFASFGTPQGSCGSYSIGDCHDPNSISLVEKVCLNRNEC 815
Query: 719 SIPLLSRYFGGDPCPGIHKALLVDAQC 745
I L F CP K L V+A C
Sbjct: 816 RIELGEEGFNKGLCPTASKKLAVEAMC 842
>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
Length = 897
Score = 623 bits (1606), Expect = e-175, Method: Compositional matrix adjust.
Identities = 348/826 (42%), Positives = 457/826 (55%), Gaps = 90/826 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG+DVIQTYVFWN HEP KGQY F G+ D+++F+K + GLY+ LRIG
Sbjct: 70 MWPDLIAKSKEGGVDVIQTYVFWNGHEPVKGQYIFEGQYDLVKFVKLVGVSGLYLHLRIG 129
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW +GG P+WL D+ GIVFR+DN P+
Sbjct: 130 PYVCAEWNFGGFPVWLRDIPGIVFRTDNSPFMEEMQQFVKKIVDLMREEMLFSWQGGPII 189
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY IE +F G YV WAA+MA+ GVPWVMC+Q DAPG +I+ACN C
Sbjct: 190 MLQIENEYGNIEHSFGPGGKEYVKWAARMALGLGAGVPWVMCRQTDAPGSIIDACNEYYC 249
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ +K PNS KP +WTEDW +Y WGG R +D+AF VA F + GS+ NYYMY
Sbjct: 250 -DGYK-PNSNKKPILWTEDWDGWYTTWGGSLPHRPVEDLAFAVARFFQRGGSFQNYYMYF 307
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNF RTA F IT Y AP+DEYGL+ EPKWGHLK+LHAAIKLC L+ +
Sbjct: 308 GGTNFARTAGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY 367
Query: 268 ISLGQLQEAFVFEE-------------TSGVCAAFLVNNDERKAVTVLFRNISYELPRKS 314
I LG QEA V+ + C+AFL N DE KAVTV F SY LP S
Sbjct: 368 IKLGSKQEAHVYRANVHAEGQNLTQHGSQSKCSAFLANIDEHKAVTVRFLGQSYTLPPWS 427
Query: 315 ISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDS-----------------DEKWEEYRE 357
+S+LPDC+ FNT +V+ Q + +S L S W +E
Sbjct: 428 VSVLPDCRNAVFNTAKVAAQTSIKSMELALPQFSGISAPKQLMAQNEGSYMSSSWMTVKE 487
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHG 409
I + EG+L+ ++ KD SDY WY R + + +N + + S
Sbjct: 488 PISVWSGNNFTVEGILEHLNVTKDHSDYLWYFTRIYVSDDDIAFWEENNVHPAIKIDSMR 547
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
+L F+NG+ TGS G V V ++G N+ LLS TVGL + GAFLER A
Sbjct: 548 DVLRVFINGQLTGSVIGRWIKVV----QPVQFQKGYNELVLLSQTVGLQNYGAFLERDGA 603
Query: 470 G------VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQLT 521
G + R D +N W YQVGL GE +IY+ K W+ ++ T
Sbjct: 604 GFRGHTKLTGFRDGDIDLSNLEWTYQVGLQGENQKIYTTENNEKAEWTDLTLDDIPSTFT 663
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKG-NPSQTQYAVNTVT 580
WYKT F AP+G DP+AL+L SMGKG+AWVN IGRYW +G + A N+
Sbjct: 664 WYKTYFDAPSGADPVALDLGSMGKGQAWVNDHHIGRYWTLVAPEEGCQKCDYRGAYNSEK 723
Query: 581 SIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSH 639
C K T YH+PR++L+P+ NLLV+ EE GNP I++ + VC V+ +H
Sbjct: 724 CRTNCG--KPTQIWYHIPRSWLQPSNNLLVIFEETGGNPFEISIKLRSASVVCAQVSETH 781
Query: 640 LPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSC 699
PPL W+ H ++ P +Q C G IS I FAS+G P G C++++ G+C
Sbjct: 782 YPPLQRWI-HTDFIYGNVSGKDMTPEIQLRCQDGYVISSIEFASYGTPQGSCQKFSRGNC 840
Query: 700 HSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
H+ +S VV +AC G+ C+I + + FGGDPC GI K L V+A+C
Sbjct: 841 HAPNSLSVVSKACQGRDTCNIAISNAVFGGDPCRGIVKTLAVEAKC 886
>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
gi|223950023|gb|ACN29095.1| unknown [Zea mays]
Length = 815
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 350/804 (43%), Positives = 453/804 (56%), Gaps = 80/804 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y F R D++RF+K +Q GL+V LRIG
Sbjct: 29 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 88
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 89 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 148
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK++DAP PVINACNG C
Sbjct: 149 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 208
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K GS++NYYMYH
Sbjct: 209 -DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYH 266
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+REPK HLKELH A+KLC + L++ +
Sbjct: 267 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTIT 326
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF SG CAAFL N + V+F N Y LP SISILPDCK V FN+
Sbjct: 327 TLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNS 385
Query: 329 ERVSTQYNKRSKTSNLKFDSDEK----WEEYREAILNFDNT-LLRAEGLLDQISAAKDAS 383
V Q TS ++ D WE Y E + + LL GLL+Q++ +D+S
Sbjct: 386 ATVGVQ------TSQMQMWGDGATSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSS 439
Query: 384 DYFWYTFRFHYNSSN------AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
DY WY + S + P L VQS GH LH FVNG+ GS++G+ ++
Sbjct: 440 DYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYN 499
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQV 490
V+LR GTN ALLSV GLP+ G E GV H + + T +W YQV
Sbjct: 500 GNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQV 559
Query: 491 GLIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
GL GE++ + S G V W S I + L WYK F P+G++P+AL++ SMGKG+
Sbjct: 560 GLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMGSMGKGQ 619
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGN 606
W+NGQSIGRYW ++ + G+ Y + T YHVPR++L+P+ N
Sbjct: 620 VWINGQSIGRYWTAY--ADGDCKGCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRN 677
Query: 607 LLVLLEE-ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG---- 661
LLV+LEE G+ I + ++ VC V+ H P + W I+ +G
Sbjct: 678 LLVVLEELGGGDSSKIALAKRSVSSVCADVSEDH-PNIKKW---------QIESYGEREH 727
Query: 662 KKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIP 721
++ V C G+ IS I FASFG P G C + G CHS+ S V+E+ CIG RC +
Sbjct: 728 RRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKRCIGLQRCVVA 787
Query: 722 LLSRYFGGDPCPGIHKALLVDAQC 745
+ FGGDPCP + K + V+A C
Sbjct: 788 ISPDNFGGDPCPSVTKRVAVEAVC 811
>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
Length = 889
Score = 622 bits (1605), Expect = e-175, Method: Compositional matrix adjust.
Identities = 352/828 (42%), Positives = 462/828 (55%), Gaps = 93/828 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG D+IQTY FWN HEP +GQY+F GR DI++FIK S GLY LRIG
Sbjct: 61 MWPDLIAKSKEGGADLIQTYAFWNGHEPIRGQYNFEGRYDIVKFIKLAGSAGLYFHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR+DN PYK
Sbjct: 121 PYVCAEWNFGGFPVWLRDIPGIEFRTDNAPYKDEMQRFVKKIVDLMRQEMLFSWQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE + ++G YV WAA MA+ GVPWVMC+Q DAP +I+ACN C
Sbjct: 181 LLQIENEYGNIERLYGQRGKDYVKWAADMAIGLGAGVPWVMCRQTDAPENIIDACNAFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS KP++WTEDW +Y WGG+ R +D AF VA F + GSY NYYM+
Sbjct: 241 -DGFK-PNSYRKPALWTEDWNGWYTSWGGRVPHRPVEDNAFAVARFFQRGGSYHNYYMFF 298
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV- 267
GGTNFGRT+ F +T Y AP+DEYGL+ +PKWGHLK+LH+AIKLC P L +
Sbjct: 299 GGTNFGRTSGGPFYVTSYDYDAPIDEYGLLSQPKWGHLKDLHSAIKLC-EPALVAVDDAP 357
Query: 268 --ISLGQLQEAFVFEETSGV-------------CAAFLVNNDERKAVTVLFRNISYELPR 312
I LG +QEA V+ +S V C+AFL N DE + V F Y LP
Sbjct: 358 QYIRLGPMQEAHVYRHSSYVEDQSSSTLGNGTLCSAFLANIDEHNSANVKFLGQVYSLPP 417
Query: 313 KSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSD-----------------EKWEEY 355
S+SILPDCK VAFNT +V++Q + ++ + F + W
Sbjct: 418 WSVSILPDCKNVAFNTAKVASQISVKTVEFSSPFIENTTEPGYLLLHDGVHHISTNWMIL 477
Query: 356 REAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQS 407
+E I + AEG+L+ ++ KD SDY WY R H + +S L + S
Sbjct: 478 KEPIGEWGGNNFTAEGILEHLNVTKDTSDYLWYIMRLHISDEDISFWEASEVSPKLIIDS 537
Query: 408 HGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERK 467
++ FVNG+ GS G V V L QG N+ A+LS TVGL + GAFLE+
Sbjct: 538 MRDVVRIFVNGQLAGSHVGRWVRV----EQPVDLVQGYNELAILSETVGLQNYGAFLEKD 593
Query: 468 VAG------VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSI--RSPTRQ 519
AG + ++ + TN W YQVGL GE ++I+S W + S
Sbjct: 594 GAGFKGQIKLTGLKSGEYDLTNSLWVYQVGLRGEFMKIFSLEEHESADWVDLPNDSVPSA 653
Query: 520 LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPS-QTQYAVNT 578
TWYKT F AP G DP++L L SMGKG+AWVNG SIGRYW G S + A +
Sbjct: 654 FTWYKTFFDAPQGKDPVSLYLGSMGKGQAWVNGHSIGRYWSLVAPVDGCQSCDYRGAYHE 713
Query: 579 VTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
C K T + YH+PR++L+P+ NLLV+ EE GNPL I+V + +C V+
Sbjct: 714 SKCATNCG--KPTQSWYHIPRSWLQPSKNLLVIFEETGGNPLEISVKLHSTSSICTKVSE 771
Query: 638 SHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVG 697
SH PPL W H+ + + P + C G++IS I+FASFG P G C+R++ G
Sbjct: 772 SHYPPLHLW-SHKDIVNGKVSISNAVPEIHLQCDNGQRISSIMFASFGTPQGSCQRFSQG 830
Query: 698 SCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
CH+ +S VV AC G++ CSI + ++ FGGDPC G+ K L V+A+C
Sbjct: 831 DCHAPNSFSVVSEACQGRNNCSIGVSNKVFGGDPCRGVVKTLAVEAKC 878
>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 843
Score = 622 bits (1604), Expect = e-175, Method: Compositional matrix adjust.
Identities = 340/808 (42%), Positives = 454/808 (56%), Gaps = 80/808 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGG+DVI+TYVFWN HE G Y F GR D+++F K +Q G+Y+ LRIG
Sbjct: 52 MWPGLVQTAKEGGVDVIETYVFWNGHELSPGNYYFGGRFDLVKFAKTVQQAGMYLILRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
PF+ +EW +GG+P+WLH V G VFR+ N+P+
Sbjct: 112 PFVAAEWNFGGVPVWLHYVPGTVFRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPII 171
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY E + E G Y LWAAKMAV +TGVPW+MC+Q DAP PVI+ CN C
Sbjct: 172 LSQIENEYGYYENFYKEDGKKYALWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P SPN+P IWTE+W +++ +GG+ R A+D+AF VA F K GS NYYMYH
Sbjct: 232 DQF--TPTSPNRPKIWTENWPGWFKTFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYH 289
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL R PKWGHLKELH AIKLC LL G I
Sbjct: 290 GGTNFGRTAGGPFITTSYDYDAPVDEYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNI 349
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ ++SG CAAF+ N D++ TV FRN SY LP S+SILPDCK V FNT
Sbjct: 350 SLGPSVEADVYTDSSGACAAFISNVDDKNDKTVEFRNASYHLPAWSVSILPDCKNVVFNT 409
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-----KWEEYREAILNFDNTLLRAEGLLDQISAAKDAS 383
+V++Q N + SD+ KW+ +E + G +D I+ KD +
Sbjct: 410 AKVTSQTNVVAMIPESLQQSDKGVNSLKWDIVKEKPGIWGKADFVKSGFVDLINTTKDTT 469
Query: 384 DYFWYTFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY W+T + ++ L ++S GH LHAFVN EY G+ G+ + F+ +N
Sbjct: 470 DYLWHTTSIFVSENEEFLKKGSKPVLLIESTGHALHAFVNQEYQGTGTGNGTHSPFSFKN 529
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDK-----SFTNCSWGYQVGL 492
+ LR G N+ ALL +TVGL +G F + AG+ V+++ ++ +W Y++G+
Sbjct: 530 PISLRAGKNEIALLCLTVGLQTAGPFYDFIGAGLTSVKIKGLKNGTIDLSSYAWTYKIGV 589
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE L++Y GLNKV W+S P + LTWYK AP G++P+ L++ MGKG AW+
Sbjct: 590 QGEYLRLYQGNGLNKVNWTSTSEPQKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWL 649
Query: 551 NGQSIGRYW---VSFKTS----------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVP 597
NG+ IGRYW FK+ K NP + T YHVP
Sbjct: 650 NGEEIGRYWPRKSEFKSEDCVKECDYRGKFNPDKCDTGCGEPTQ----------RWYHVP 699
Query: 598 RAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
R++ KP+GN+LVL EE+ G+P I + C V + P L +G+ I
Sbjct: 700 RSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSGACALVAEDY--PSVGLL---SQGEDKI 754
Query: 658 KKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSR 717
+ P +CP +IS + FASFG P G C Y G CH +S +VE+AC+ K+
Sbjct: 755 QNNKNVPFAHLTCPSNTRISAVKFASFGTPSGSCGSYLKGDCHDPNSSTIVEKACLNKND 814
Query: 718 CSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C I L F + CPG+ + L V+A C
Sbjct: 815 CVIKLTEENFKTNLCPGLSRKLAVEAVC 842
>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
Length = 842
Score = 622 bits (1604), Expect = e-175, Method: Compositional matrix adjust.
Identities = 346/799 (43%), Positives = 451/799 (56%), Gaps = 71/799 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y F R D++RFIK +Q GL+V LRIG
Sbjct: 57 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFIKTVQKAGLFVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSEKLFASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY G Y+ WAAKMA+ TGVPWVMCK++DAP PVINACNG C
Sbjct: 177 LSQIENEYGPEGKELGAAGQAYINWAAKMAIGLGTGVPWVMCKEEDAPDPVINACNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K GS++NYYMYH
Sbjct: 237 -DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGLVREPK HLKELH A+KLC + L++ +
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVSVDPAIT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF SG CAAFL N + V+F N Y LP SISILPDCK V FN+
Sbjct: 355 TLGTMQEAHVFRSPSG-CAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKNVVFNS 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
V Q ++ + S WE Y E + + LL GLL+Q++ +D+SDY W
Sbjct: 414 ATVGVQTSQMQMWGDGA--SSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLW 471
Query: 388 YTFRFHYNSSN-------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
Y + S L V S GH LH FVNGE GSA+G+ ++ +
Sbjct: 472 YITSVDISPSENFLQGGGKPLSLSVLSAGHALHVFVNGELQGSAYGTREDRRIKYNGNAN 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR GTN ALLSV GLP+ G E GV H + + T +W YQVGL G
Sbjct: 532 LRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVGLHGLNEGSRDLTWQTWSYQVGLKG 591
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E++ + S G V W S I + L+WY+ F P+G++P+AL++ SMGKG+ W+N
Sbjct: 592 EQMNLNSLEGSTSVEWMQGSLIAQNQQPLSWYRAYFETPSGDEPLALDMGSMGKGQIWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW ++ + G+ + Y + T YHVPR++L+PT NLLV+
Sbjct: 652 GQSIGRYWTAY--ADGDCKECSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVV 709
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK----KPTV 666
EE G+ I + ++ VC V+ H P + +W I+ +G+ + V
Sbjct: 710 FEELGGDSSKIALVKRSVSSVCADVSEDH-PNIKNW---------QIESYGEREYHRAKV 759
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
C G+ IS I FASFG P G C + G CHS++S V+E+ CIG RC++ +
Sbjct: 760 HLRCSPGQSISAIKFASFGTPMGTCGNFQQGDCHSANSHTVLEKKCIGLQRCAVAISPES 819
Query: 727 FGGDPCPGIHKALLVDAQC 745
FGGDPCP + K + V+A C
Sbjct: 820 FGGDPCPRVTKRVAVEAVC 838
>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
Length = 845
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 350/804 (43%), Positives = 453/804 (56%), Gaps = 80/804 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y F R D++RF+K +Q GL+V LRIG
Sbjct: 59 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 119 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK++DAP PVINACNG C
Sbjct: 179 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K GS++NYYMYH
Sbjct: 239 -DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+REPK HLKELH A+KLC + L++ +
Sbjct: 297 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTIT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF SG CAAFL N + V+F N Y LP SISILPDCK V FN+
Sbjct: 357 TLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNS 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDEK----WEEYREAILNFDNT-LLRAEGLLDQISAAKDAS 383
V Q TS ++ D WE Y E + + LL GLL+Q++ +D+S
Sbjct: 416 ATVGVQ------TSQMQMWGDGATSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSS 469
Query: 384 DYFWYTFRFHYNSSN------AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
DY WY + S + P L VQS GH LH FVNG+ GS++G+ ++
Sbjct: 470 DYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYN 529
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQV 490
V+LR GTN ALLSV GLP+ G E GV H + + T +W YQV
Sbjct: 530 GNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQV 589
Query: 491 GLIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
GL GE++ + S G V W S I + L WYK F P+G++P+AL++ SMGKG+
Sbjct: 590 GLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMGSMGKGQ 649
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGN 606
W+NGQSIGRYW ++ + G+ Y + T YHVPR++L+P+ N
Sbjct: 650 VWINGQSIGRYWTAY--ADGDCKGCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRN 707
Query: 607 LLVLLEE-ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG---- 661
LLV+LEE G+ I + ++ VC V+ H P + W I+ +G
Sbjct: 708 LLVVLEELGGGDSSKIALAKRSVSSVCADVSEDH-PNIKKW---------QIESYGEREH 757
Query: 662 KKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIP 721
++ V C G+ IS I FASFG P G C + G CHS+ S V+E+ CIG RC +
Sbjct: 758 RRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKRCIGLQRCVVA 817
Query: 722 LLSRYFGGDPCPGIHKALLVDAQC 745
+ FGGDPCP + K + V+A C
Sbjct: 818 ISPDNFGGDPCPSVTKRVAVEAVC 841
>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 345/823 (41%), Positives = 457/823 (55%), Gaps = 86/823 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG DV+QTYVFW HEP KGQY F GR D+++F+K + GLY+ LRIG
Sbjct: 66 MWPDLIAKSKEGGADVVQTYVFWGGHEPVKGQYYFEGRYDLVKFVKLVGESGLYLHLRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL DV G+VFR+DN P+K
Sbjct: 126 PYVCAEWNFGGFPVWLRDVPGVVFRTDNAPFKEEMQKFVTKIVDLMREEMLLSWQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE +F + G Y+ WAA MA+ GVPWVMCKQ DAP +I+ACNG C
Sbjct: 186 MFQIENEYGNIEHSFGQGGKEYMKWAAGMALALDAGVPWVMCKQTDAPENIIDACNGYYC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNSP KP WTEDW +Y WGG+ R +D+AF VA F + GS+ NYYMY
Sbjct: 246 -DGFK-PNSPKKPIFWTEDWDGWYTTWGGRLPHRPVEDLAFAVARFFQRGGSFQNYYMYF 303
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRT+ F IT Y AP+DEYGL+ EPKWGHLK+LHAAIKLC L+ +
Sbjct: 304 GGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSAQY 363
Query: 268 ISLGQLQEAFVFEETSGV-------------CAAFLVNNDERKAVTVLFRNISYELPRKS 314
I LG QEA V+ + + C+AFL N DER+A TV F S+ LP S
Sbjct: 364 IKLGPKQEAHVYGGSLSIQGMNFSQYGSQSKCSAFLANIDERQAATVRFLGQSFTLPPWS 423
Query: 315 ISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDE----------------KWEEYREA 358
+SILPDC+ FNT +V+ Q + ++ L + W +E
Sbjct: 424 VSILPDCRNTVFNTAKVAAQTHIKTVEFVLPLSNSSLLPQFIVQNEDSPQSTSWLIAKEP 483
Query: 359 ILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGH 410
I + +G+L+ ++ KD SDY WY R + + + + + S
Sbjct: 484 ITLWSEENFTVKGILEHLNVTKDESDYLWYFTRIYVSDDDIAFWEKNKVSPAVSIDSMRD 543
Query: 411 ILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG 470
+L F+NG+ TGS G V ++G N+ LLS TVGL + GAFLER AG
Sbjct: 544 VLRVFINGQLTGSVVGHWVKAV----QPVQFQKGYNELVLLSQTVGLQNYGAFLERDGAG 599
Query: 471 VH-RVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQLTW 522
++++ D +N SW YQVGL GE L++YS K WS ++ + TW
Sbjct: 600 FKGQIKLTGFKNGDIDLSNLSWTYQVGLKGEFLKVYSTGDNEKFEWSELAVDATPSTFTW 659
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSI 582
YKT F AP+G DP+AL+L SMGKG+AWVNG IGRYW G S +
Sbjct: 660 YKTFFDAPSGVDPVALDLGSMGKGQAWVNGHHIGRYWTVVSPKDGCGSCDYRGAYSSGKC 719
Query: 583 HFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPP 642
YHVPRA+L+ + NLLV+ EE GNP I+V + + +C V+ SH PP
Sbjct: 720 RTNCGNPTQTWYHVPRAWLEASNNLLVVFEETGGNPFEISVKLRSAKVICAQVSESHYPP 779
Query: 643 LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS 702
L W R G +I + P + C G +S I FAS+G P+G C++++ G+CH+S
Sbjct: 780 LRKWSRADLTGG-NISRNDMTPEMHLKCQDGHIMSSIEFASYGTPNGSCQKFSRGNCHAS 838
Query: 703 HSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+S VV AC GK++C I + + F GDPC G+ K L V+A+C
Sbjct: 839 NSSSVVTEACQGKNKCDIAISNAVF-GDPCRGVIKTLAVEARC 880
>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
Length = 897
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 350/804 (43%), Positives = 453/804 (56%), Gaps = 80/804 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y F R D++RF+K +Q GL+V LRIG
Sbjct: 111 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 170
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 171 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 230
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK++DAP PVINACNG C
Sbjct: 231 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 290
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K GS++NYYMYH
Sbjct: 291 -DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYH 348
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+REPK HLKELH A+KLC + L++ +
Sbjct: 349 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTIT 408
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF SG CAAFL N + V+F N Y LP SISILPDCK V FN+
Sbjct: 409 TLGTMQEAHVFRSPSG-CAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNS 467
Query: 329 ERVSTQYNKRSKTSNLKFDSDEK----WEEYREAILNFDNT-LLRAEGLLDQISAAKDAS 383
V Q TS ++ D WE Y E + + LL GLL+Q++ +D+S
Sbjct: 468 ATVGVQ------TSQMQMWGDGATSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSS 521
Query: 384 DYFWYTFRFHYNSSN------AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
DY WY + S + P L VQS GH LH FVNG+ GS++G+ ++
Sbjct: 522 DYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYN 581
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQV 490
V+LR GTN ALLSV GLP+ G E GV H + + T +W YQV
Sbjct: 582 GNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQV 641
Query: 491 GLIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
GL GE++ + S G V W S I + L WYK F P+G++P+AL++ SMGKG+
Sbjct: 642 GLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMGSMGKGQ 701
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGN 606
W+NGQSIGRYW ++ + G+ Y + T YHVPR++L+P+ N
Sbjct: 702 VWINGQSIGRYWTAY--ADGDCKGCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRN 759
Query: 607 LLVLLEE-ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG---- 661
LLV+LEE G+ I + ++ VC V+ H P + W I+ +G
Sbjct: 760 LLVVLEELGGGDSSKIALAKRSVSSVCADVSEDH-PNIKKW---------QIESYGEREH 809
Query: 662 KKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIP 721
++ V C G+ IS I FASFG P G C + G CHS+ S V+E+ CIG RC +
Sbjct: 810 RRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKRCIGLQRCVVA 869
Query: 722 LLSRYFGGDPCPGIHKALLVDAQC 745
+ FGGDPCP + K + V+A C
Sbjct: 870 ISPDNFGGDPCPSVTKRVAVEAVC 893
>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 826
Score = 621 bits (1602), Expect = e-175, Method: Compositional matrix adjust.
Identities = 344/790 (43%), Positives = 456/790 (57%), Gaps = 63/790 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F G D++RFIK +Q GLY+ LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVRFIKLVQQGGLYLHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQT--------IEPAFHEKGPPYV 112
P++ +EW +GG P+WL V GI FR+DN+P+K E E T E FH +G P +
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGIHFRTDNEPFKAEMEKFTSHIVNMMKAEKLFHWQGGPII 175
Query: 113 L-----------------------WAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
L WAAKMAVD TGVPWVMCK+DDAP PVIN NG
Sbjct: 176 LSQIENEFGPLEYDQGAPAKAYAAWAAKMAVDLETGVPWVMCKEDDAPDPVINTWNGFYA 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE+WT ++ +G R +D+AF VA F+ K GSYVNYYMYH
Sbjct: 236 DGFY--PNKRYKPMMWTENWTGWFTGYGVPVPHRPVEDLAFSVAKFVQKGGSYVNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYG++R+PK+GHL +LH AIKLC L++G V
Sbjct: 294 GGTNFGRTAGGPFIATSYDYDAPLDEYGMLRQPKYGHLTDLHKAIKLCEPALVSGYPVVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QE+ VF SG CAAFL N D + TV F + Y LP SISILPDCKT FNT
Sbjct: 354 SLGNNQESNVFRSNSGACAAFLANYDTKYYATVTFNGMRYNLPPWSISILPDCKTTVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q + T+ F W Y E + D+ GL++QIS +D++DY WY
Sbjct: 414 ARVGAQTTQMQMTTVGGF----SWVSYNEDPNSIDDGSFTKLGLVEQISMTRDSTDYLWY 469
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + + + N Q P L QS GH LH F+NG+ G+A+GS ++ T V L
Sbjct: 470 TTYVNIDQNEQFLKNGQYPVLTAQSAGHSLHVFINGQLIGTAYGSVEDPRLTYTGNVKLF 529
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAG------VHRVRVQDKSFTNCSWGYQVGLIGEK 496
G+N + LS+ VGLP+ G E G ++ + + T W Y++GL GE
Sbjct: 530 AGSNKISFLSIAVGLPNVGEHFETWNTGLLGPVTLNGLNEGKRDLTWQKWTYKIGLKGEA 589
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L +++ G + V W S + L WYK F AP G++P+AL++ +MGKG+ W+NGQSIG
Sbjct: 590 LSLHTLSGSSNVEWGDA-SRKQPLAWYKGFFNAPGGSEPLALDMSTMGKGQVWINGQSIG 648
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW ++K P T + YHVPR++L PTGNL+V+ EE G
Sbjct: 649 RYWPAYKARGSCPKCDYEGTYEETKCQSNCGDSSQRWYHVPRSWLNPTGNLIVVFEEWGG 708
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKI 676
P GI++ ++R C +V+ P +++W H + ++ V SC G K+
Sbjct: 709 EPTGISLVKRSMRSACAYVSQGQ-PSMNNW--HTKYAESK---------VHLSCDPGLKM 756
Query: 677 SKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIH 736
++I FAS+G P G CE Y+ G CH+ S + ++ CIG+ CS+ ++ FGGDPCPGI
Sbjct: 757 TQIKFASYGTPQGACESYSEGRCHAHKSYDIFQKNCIGQQVCSVTVVPEVFGGDPCPGIM 816
Query: 737 KALLVDAQCR 746
K++ V A C
Sbjct: 817 KSVAVQASCE 826
>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
Length = 827
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 349/794 (43%), Positives = 444/794 (55%), Gaps = 71/794 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGG++VIQTYVFWN HEP GQY F R D+++FIK +Q GLYV LRIG
Sbjct: 55 MWPGLIQKAKEGGIEVIQTYVFWNGHEPSPGQYYFQDRYDLVKFIKLVQQAGLYVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 115 PYVCAEWNFGGFPMWLKYVPGIEFRTDNGPFKAAMQKFVTLIVNMMKEQKLFQTQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MA +TGVPW+MCKQ+DAP P I+ CNG C
Sbjct: 175 LSQIENEYGPVEWTIGAPGKAYTKWAAAMATGLNTGVPWIMCKQEDAPDPTIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E +K PN+ NKP +WTE+WT +Y WG R +D AF VA FIA +GS+VNYYMYH
Sbjct: 235 -EGYK-PNNYNKPKVWTENWTGWYTEWGASVPYRPPEDTAFSVARFIAASGSFVNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA FM T Y APLDEYGL +PKWGHL++LH AIK R L++ VIS
Sbjct: 293 GGTNFDRTAGLFMATSYDYDAPLDEYGLTHDPKWGHLRDLHRAIKQSERALVSADPTVIS 352
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+ QEA VF+ G CAAFL N D + + V F N Y LPR SIS+LPDCKTV +NT
Sbjct: 353 LGKNQEAHVFQSKMG-CAAFLANYDTQYSARVNFWNKPYSLPRWSISVLPDCKTVVYNTA 411
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFWY 388
++S Q ++ + S W+ + + + + + GL +Q D +DY WY
Sbjct: 412 KISAQSTQKWM---MPVASGFSWQSHIDEVPVGYSAGTFTKVGLWEQKYLTGDKTDYLWY 468
Query: 389 TFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
NS S L V S GH+LH F+NG GSA+GS +N T V L
Sbjct: 469 MTDVTINSNEGFLRSGKNPFLTVASAGHVLHVFINGHLAGSAYGSLENPKLTFSQNVKLV 528
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLS TVGL + G + GV + T W Y++GL GE
Sbjct: 529 GGVNKIALLSATVGLANVGVHYDTWNVGVLGPVTLQGLNQGTLDMTKWKWSYKIGLKGED 588
Query: 497 LQIYS---NLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L+++S N+G + + ++P LTWYKT AP GNDP+AL + SMGKG+ ++NG+
Sbjct: 589 LKLFSGGANVGWAQGAQLAKKTP---LTWYKTFINAPPGNDPVALYMGSMGKGQMYINGR 645
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGR+W ++ T+KGN YA + C YHVPR++LKPTGNLLV+
Sbjct: 646 SIGRHWPAY-TAKGNCKDCDYAGYYDDQKCRSGCG-QPPQQWYHVPRSWLKPTGNLLVVF 703
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P GI++ + VC + + P + SW + P CP
Sbjct: 704 EEMGGDPTGISLVKRVVGSVCADIDDDQ-PEMKSW----------TENIPVTPKAHLWCP 752
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+K SKIVFAS+G P G C Y G CH+ S ++ CIGK C I + FGGDP
Sbjct: 753 PGQKFSKIVFASYGWPQGRCGAYRQGKCHALKSWDPFQKYCIGKGACDIDVAPATFGGDP 812
Query: 732 CPGIHKALLVDAQC 745
CPG K L V QC
Sbjct: 813 CPGSAKRLSVQLQC 826
>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 839
Score = 620 bits (1598), Expect = e-174, Method: Compositional matrix adjust.
Identities = 346/797 (43%), Positives = 458/797 (57%), Gaps = 71/797 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F R D+++FIK +Q GLYV LRIG
Sbjct: 61 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLYVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 121 PYICAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSIMKEEKLFQTQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W ++MAV TGVPW+MCKQ D P P+I+ CNG C
Sbjct: 181 MSQIENEYGPVEWEIGAPGKAYTKWFSQMAVGLDTGVPWIMCKQQDTPDPLIDTCNGYYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE+WT +Y +GG R A+D+AF VA F+ GS+VNYYMYH
Sbjct: 241 -ENFT-PNKKYKPKMWTENWTGWYTEFGGAVPRRPAEDMAFSVARFVQNGGSFVNYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT++ I YD P+DEYGL+ EPKWGHL++LH AIKLC L++ V
Sbjct: 299 GGTNFDRTSSGLFIATSYDYDGPIDEYGLLNEPKWGHLRDLHKAIKLCEPALVSVDPTVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G E VF +TSG CAAFL N D + + +V F N Y+LP SISILPDCKT FNT
Sbjct: 359 WPGNNLEVHVF-KTSGACAAFLANYDTKSSASVKFGNGQYDLPPWSISILPDCKTAVFNT 417
Query: 329 ERVSTQYNKRSKTS-NLKFDSDEKWEEYRE--AILNFDNTLLRAEGLLDQISAAKDASDY 385
R+ Q + T+ N FD W+ Y E A N D++ L A L +QI+ +D++DY
Sbjct: 418 ARLGAQSSLMKMTAVNSAFD----WQSYNEEPASSNEDDS-LTAYALWEQINVTRDSTDY 472
Query: 386 FWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WY + +++ N Q+P L V S GH+LH +N + +G+ +G D+ T ++V
Sbjct: 473 LWYMTDVNIDANEGFIKNGQSPVLTVMSAGHVLHVLINDQLSGTVYGGLDSHKLTFSDSV 532
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLI 493
LR G N +LLS+ VGLP+ G E AGV + + + W Y++GL
Sbjct: 533 KLRVGNNKISLLSIAVGLPNVGPHFETWNAGVLGPVTLKGLNEGTRDLSKQKWSYKIGLK 592
Query: 494 GEKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE L + + G + V W S+ + + L WYKTTF PAGNDP+AL++ SMGKG+AW+N
Sbjct: 593 GEALNLNTVSGSSSVEWVQGSLLAKQQPLAWYKTTFSTPAGNDPLALDMISMGKGQAWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVL 610
G+SIGR+W + ++GN YA T + YH+PR++L P+GN LV+
Sbjct: 653 GRSIGRHWPGY-IARGNCGDCYYAGTYTDKKCRTNCGEPSQRWYHIPRSWLNPSGNYLVV 711
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK--KPTVQP 668
EE G+P GIT+ VC + L++RQ D+ GK +P
Sbjct: 712 FEEWGGDPTGITLVKRTTASVCADIYQGQPT-----LKNRQMLDS-----GKVVRPKAHL 761
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
CP GK IS+I FAS+G P G C + GSCH+ S ++ CIGK C + + FG
Sbjct: 762 WCPPGKNISQIKFASYGLPQGTCGNFREGSCHAHKSYDAPQKNCIGKQSCLVTVAPEVFG 821
Query: 729 GDPCPGIHKALLVDAQC 745
GDPCPGI K L ++A C
Sbjct: 822 GDPCPGIAKKLSLEALC 838
>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
Length = 912
Score = 620 bits (1598), Expect = e-174, Method: Compositional matrix adjust.
Identities = 354/835 (42%), Positives = 462/835 (55%), Gaps = 102/835 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGG+DVI+TYVFWN H+P KGQY+F GR D+++F K + S GLY LRIG
Sbjct: 80 MWPDLIAKAKEGGVDVIETYVFWNGHQPVKGQYNFEGRYDLVKFAKLVASNGLYFFLRIG 139
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL D+ GI FR++N P+K
Sbjct: 140 PYACAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMKRFVSKVVNLMREEMLFSWQGGPII 199
Query: 93 ---------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINA 143
IENEY +E ++ +G YV WAA MA+ GVPWVMCKQ DAP +I+
Sbjct: 200 LLQVRREYGIENEYGNLESSYGNEGKEYVKWAASMALSLGAGVPWVMCKQPDAPYDIIDT 259
Query: 144 CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
CN C + FK PNS NKP WTE+W +Y WG + R +D+AF VA F + GS
Sbjct: 260 CNAYYC-DGFK-PNSRNKPIFWTENWDGWYTQWGERLPHRPVEDLAFAVARFFQRGGSLQ 317
Query: 204 NYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT 262
NYYMY GGTNFGRTA IT Y AP+DEYGL+ EPKWGHLK+LHAA+KLC L+
Sbjct: 318 NYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYGLLNEPKWGHLKDLHAALKLCEPALVA 377
Query: 263 G-TQNVISLGQLQEAFVFEET-------------SGVCAAFLVNNDERKAVTVLFRNISY 308
+ I LG QEA V++E S C+AFL N DERKA TV FR +Y
Sbjct: 378 ADSPTYIKLGSKQEAHVYQENVHREGLNLSISQISNKCSAFLANIDERKAATVTFRGQTY 437
Query: 309 ELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSD-----------------EK 351
LP S+SILPDC++ FNT +V Q + + SNL S+ +
Sbjct: 438 TLPPWSVSILPDCRSAIFNTAKVGAQTSVKLVGSNLPLTSNLLLSQQSIDHNGISHISKS 497
Query: 352 WEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPL 403
W +E I + N+ AEG+ + ++ KD SDY WY+ R + + + A L
Sbjct: 498 WMTTKEPINIWINSSFTAEGIWEHLNVTKDQSDYLWYSTRIYVSDGDILFWKENAAHPKL 557
Query: 404 DVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAF 463
+ S IL FVNG+ G+ G TL+ + G ND LL+ TVGL + GAF
Sbjct: 558 AIDSVRDILRVFVNGQLIGNVVGHWVKAVQTLQ----FQPGYNDLTLLTQTVGLQNYGAF 613
Query: 464 LERKVAGVHRVRVQDKSFTNCS-------WGYQVGLIGEKLQIYS----NLGLNKVLWSS 512
+E+ AG+ R ++ F N W YQVGL GE L+ Y+ N G ++ +
Sbjct: 614 IEKDGAGI-RGTIKITGFENGHIDLSKPLWTYQVGLQGEFLKFYNEESENAGWVELTPDA 672
Query: 513 IRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKG-NPSQ 571
I S TWYKT F P GNDP+AL+L+SMGKG+AWVNG IGRYW G
Sbjct: 673 IPS---TFTWYKTYFDVPGGNDPVALDLESMGKGQAWVNGHHIGRYWTRVSPKTGCQVCD 729
Query: 572 TQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRK 630
+ A ++ C K T T YHVPR++LK + N LV+LEE GNPLGI+V +
Sbjct: 730 YRGAYDSDKCTTNCG--KPTQTLYHVPRSWLKASNNFLVILEETGGNPLGISVKLHSASI 787
Query: 631 VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGD 690
VC V+ S+ PP+ L G ++ P + C G IS I FASFG P G
Sbjct: 788 VCAQVSQSYYPPMQKLLNASLLGQQEVSSNDMIPEMNLRCRDGNIISSITFASFGTPGGS 847
Query: 691 CERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C+ ++ G+CH+ S+ +V +AC+GK CSI + S FGGDPC + K L V+A+C
Sbjct: 848 CQSFSRGNCHAPSSKSIVSKACLGKRSCSIKISSDVFGGDPCQDVVKTLSVEARC 902
>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 903
Score = 619 bits (1597), Expect = e-174, Method: Compositional matrix adjust.
Identities = 348/830 (41%), Positives = 467/830 (56%), Gaps = 94/830 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG+DVIQTY FW+ HEP +GQY+F GR DI++F + + GLY+ LRIG
Sbjct: 66 MWPDLIAKSKEGGVDVIQTYAFWSGHEPVRGQYNFEGRYDIVKFANLVGASGLYLHLRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR++N +K
Sbjct: 126 PYVCAEWNFGGFPVWLRDIPGIEFRTNNALFKEEMQRFVKKMVDLMQEEELLSWQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE F +KG Y+ WAA+MA+ GVPWVMCKQ DAPG +I+ACNG C
Sbjct: 186 MMQIENEYGNIEGQFGQKGKEYIKWAAEMALGLGAGVPWVMCKQVDAPGSIIDACNGYYC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ +K PNS NKP++WTEDW +Y WGG+ R +D+AF VA F + GS+ NYYMY
Sbjct: 246 -DGYK-PNSYNKPTLWTEDWDGWYASWGGRLPHRPVEDLAFAVARFYQRGGSFQNYYMYF 303
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRT+ F IT Y AP+DEYGL+ EPKWGHLK+LHAAIKLC L+ + N
Sbjct: 304 GGTNFGRTSGGPFYITSYDYDAPIDEYGLLSEPKWGHLKDLHAAIKLCEPALVAADSPNY 363
Query: 268 ISLGQLQEAFVFE---ETSGV----------CAAFLVNNDERKAVTVLFRNISYELPRKS 314
I LG QEA V+ T G+ C+AFL N DE KA +V F Y LP S
Sbjct: 364 IKLGPKQEAHVYRVNSHTEGLNITSYGSQISCSAFLANIDEHKAASVTFLGQKYNLPPWS 423
Query: 315 ISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDS-----------------DEKWEEYRE 357
+SILPDC+ V +NT +V Q + ++ +L S + W +E
Sbjct: 424 VSILPDCRNVVYNTAKVGAQTSIKTVEFDLPLYSGISSQQQFITKNDDLFITKSWMTVKE 483
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHG 409
+ + +G+L+ ++ KD SDY W+ R + +N A + + S
Sbjct: 484 PVGVWSENNFTVQGILEHLNVTKDQSDYLWHITRIFVSEDDISFWEKNNISAAVSIDSMR 543
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
+L FVNG+ T GS + V +G ND LL+ TVGL + GAFLE+ A
Sbjct: 544 DVLRVFVNGQLT---EGSVIGHWVKVEQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGA 600
Query: 470 GVHRVRVQDKSFTNCS-------WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--- 519
G R +++ F N W YQVGL GE +IY+ K W+ + SP
Sbjct: 601 GF-RGQIKLTGFKNGDIDLSKLLWTYQVGLKGEFFKIYTIEENEKAGWAEL-SPDDDPST 658
Query: 520 LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVN 577
WYKT F +PAG DP+AL+L SMGKG+AWVNG IGRYW G P Y A N
Sbjct: 659 FIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNGHHIGRYWTLVAPEDGCPEICDYRGAYN 718
Query: 578 TVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVT 636
+ C K T T YHVPR++L+ + NLLV+LEE GNP I++ + +C V+
Sbjct: 719 SDKCSFNCG--KPTQTLYHVPRSWLQSSSNLLVILEETGGNPFDISIKLRSAGVLCAQVS 776
Query: 637 NSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAV 696
SH PP+ W + D I P + C G IS I FAS+G P G C+++++
Sbjct: 777 ESHYPPVQKWF-NPDSVDEKITVNDLTPEMHLQCQDGFTISSIEFASYGTPQGSCQKFSM 835
Query: 697 GSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQCR 746
G+CH+++S +V ++C+GK+ CS+ + + FGGDPC GI K L V+A+CR
Sbjct: 836 GNCHATNSSSIVSKSCLGKNSCSVEISNNSFGGDPCRGIVKTLAVEARCR 885
>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
Length = 836
Score = 619 bits (1596), Expect = e-174, Method: Compositional matrix adjust.
Identities = 345/795 (43%), Positives = 458/795 (57%), Gaps = 60/795 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D++RFIK ++ GLYV LRIG
Sbjct: 51 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIG 110
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR++N P+K
Sbjct: 111 PYVCAEWNFGGFPVWLKYIPGIAFRTNNGPFKAYMQRFTKKIVDMMKAEGLFESQGGPII 170
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQDDAP P+IN+CNG C
Sbjct: 171 LSQIENEYGPMEYELGAAGRAYSQWAAQMAVGLGTGVPWVMCKQDDAPDPIINSCNGFYC 230
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R +D+AF VA FI K GS++NYYMYH
Sbjct: 231 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPVEDLAFSVARFIQKGGSFINYYMYH 288
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGLVR+PKWGHLK+LH AIKLC L++G +V+
Sbjct: 289 GGTNFGRTAGGPFIATSYDYDAPLDEYGLVRQPKWGHLKDLHRAIKLCEPALVSGDPSVM 348
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+ QEA VF+ G CAAFL N + R V F N+ Y LP SISILPDCK +NT
Sbjct: 349 PLGRFQEAHVFKSKYGHCAAFLANYNPRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNT 408
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
RV Q + R K + W+ Y EA + GL++QI+ +D SDY W
Sbjct: 409 ARVGAQ-SARMKMVPVPIHGAFSWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLW 467
Query: 388 YTFRFHYN------SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y+ + + L V S GH LH FVN + +G+A+GS + T V+L
Sbjct: 468 YSTDVKIDPDEGFLKTGKYPTLTVLSAGHALHVFVNDQLSGTAYGSLEFPKITFSKGVNL 527
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ++LS+ VGLP+ G E AGV + + + + W Y+VG+ GE
Sbjct: 528 RAGINKISILSIAVGLPNVGPHFETWNAGVLGPVTLNGLNEGRRDLSWQKWSYKVGVEGE 587
Query: 496 KLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
+ ++S G + V W+ S + + LTW+KTTF APAGN P+AL++ SMGKG+ W+NG+
Sbjct: 588 AMSLHSLSGSSSVEWTAGSFVARRQPLTWFKTTFNAPAGNSPLALDMNSMGKGQIWINGK 647
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGR+W ++K S G+ YA N + C + YHVPR++ PTGNLLV+
Sbjct: 648 SIGRHWPAYKAS-GSCGWCDYAGTFNEKKCLSNCG-EASQRWYHVPRSWPNPTGNLLVVF 705
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P GI++ + VC + P L + ++ + + K +P C
Sbjct: 706 EEWGGDPNGISLVRREVDSVCADIYEWQ-PTL---MNYQMQASGKVNK-PLRPKAHLQCG 760
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD- 730
G+KIS + FASFG P+G C Y GSCH+ HS ER C+G++ CS+ ++ R G+
Sbjct: 761 PGQKISSVKFASFGTPEGACGSYREGSCHAHHSYDAFERLCVGQNWCSVTVVPRNVSGEI 820
Query: 731 PCPGIHKALLVDAQC 745
P P + K L V+ C
Sbjct: 821 PAPSVMKKLAVEVVC 835
>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 830
Score = 619 bits (1596), Expect = e-174, Method: Compositional matrix adjust.
Identities = 344/799 (43%), Positives = 456/799 (57%), Gaps = 82/799 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP + QY F GR D++ FIK ++ GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVVQTYVFWNGHEPSRRQYYFEGRYDLVHFIKLVKQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQNFTTKIVDMMKSEGLFEWQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MAV +T VPWVMCK+DDAP P+IN CNG C
Sbjct: 179 LSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDAPDPIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WTS+Y +G R +D+A+ VA FI K GS+VNYYMYH
Sbjct: 239 --DWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYH 296
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+REPKWGHLKELH AIKLC L+ G V
Sbjct: 297 GGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLCEPALVAGDPIVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A VF ++ C AFL N D+ V F + Y+LP SISILPDCKT +NT
Sbjct: 357 SLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYDLPPWSISILPDCKTTVYNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V +Q ++ +++ W+ Y E I + + GLL+QI+ +D +DY WY
Sbjct: 417 ASVGSQISQM----KMEWAGGFTWQSYNEDINSLGDESFATVGLLEQINVTRDNTDYLWY 472
Query: 389 TFRFHYNS-----SNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T SN + P L V S GH LH FVNG+ TG+ +GS ++ T V L
Sbjct: 473 TTYVDIAQDEQFLSNGKNPMLTVMSAGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLW 532
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIGEK 496
G+N + LS+ VGLP+ G E AG+ D + T W Y+VGL GE
Sbjct: 533 SGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEA 592
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W P ++ L+WYK F AP G++P+AL++ SMGKG+ W+NGQ
Sbjct: 593 LSLHSLSGSSSVEWG---EPVQKQPLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQG 649
Query: 555 IGRYWVSFKTS--------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
IGRYW +K S +G + + N S + YHVPR++L PTGN
Sbjct: 650 IGRYWPGYKASGTCGICDYRGEYDEKKCQTNCGDS--------SQRWYHVPRSWLNPTGN 701
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
LLV+ EE G+P GI++ +C V+ P +++W R +G +K V
Sbjct: 702 LLVIFEEWGGDPTGISMVKRIAGSICADVSEWQ-PSMANW---RTKGY-------EKAKV 750
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
C G+K++ I FASFG P G C Y+ G CH+ S + ++CIG+ RC + ++
Sbjct: 751 HLQCDHGRKMTHIKFASFGTPQGSCGSYSEGGCHAHKSYDIFWKSCIGQERCGVSVVPDA 810
Query: 727 FGGDPCPGIHKALLVDAQC 745
FGGDPCPG K +V+A C
Sbjct: 811 FGGDPCPGTMKRAVVEAIC 829
>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
Length = 843
Score = 619 bits (1595), Expect = e-174, Method: Compositional matrix adjust.
Identities = 332/794 (41%), Positives = 450/794 (56%), Gaps = 59/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AK+GG D I+TYVFWN HE GQY F R D++RF+K ++ GL + LRIG
Sbjct: 59 MWPKLVAEAKDGGADCIETYVFWNGHEIAPGQYYFEDRFDLVRFVKVVKDAGLLLILRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+P+WLH V G VFR+DN+P+K
Sbjct: 119 PFVAAEWNFGGVPVWLHYVPGTVFRTDNEPFKSHMKSFTTYIVNMMKKEQLFASQGGNII 178
Query: 93 ---IENEY-QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
IENEY E A+ G PY +WAA MAV +TGVPW+MC++ DAP PVIN+CNG
Sbjct: 179 LAQIENEYGDYYEQAYAPGGKPYAMWAASMAVAQNTGVPWIMCQESDAPDPVINSCNGFY 238
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + F+ PNSP KP +WTE+W ++Q +G R +D+AF VA F K GS NYY+Y
Sbjct: 239 C-DGFQ-PNSPTKPKLWTENWPGWFQTFGESNPHRPPEDVAFAVARFFEKGGSVQNYYVY 296
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNFGRT IT YD AP+DEYGL R PKW HL++LH +I+LC LL G
Sbjct: 297 HGGTNFGRTTGGPFITTSYDYDAPIDEYGLRRFPKWAHLRDLHKSIRLCEHTLLYGNTTF 356
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
+SLG QEA ++ + SG C AFL N D V FRN Y+LP S+SILPDC+ V FN
Sbjct: 357 LSLGPKQEADIYSDQSGGCVAFLANIDSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFN 416
Query: 328 TERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
T +V +Q + + +L+ E+W +RE + G +D I+ KD++DY
Sbjct: 417 TAKVQSQTSMVAMVPESLQASKPERWNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYL 476
Query: 387 WYTFRFHYNSSNAQAP---LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
WYT F + S ++ L++ S GH +HAF+N E+ GSA+G+ SF+++ ++LR
Sbjct: 477 WYTTSFSVDESYSKGSHVVLNIDSKGHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRT 536
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAG-----VHRVRVQDKSFTNCSWGYQVGLIGEKLQ 498
G N+ ALLS+TVGL ++G E AG + VR + ++ +W Y++GL GE
Sbjct: 537 GKNELALLSMTVGLQNAGFSYEWIGAGFTNVNISGVRNGTINLSSNNWAYKIGLEGEYYS 596
Query: 499 IYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
++ N W P + LTWYK P G+DP+ +++QSMGKG W+NG +IG
Sbjct: 597 LFKPDQRNNQRWIPQSEPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIG 656
Query: 557 RYW--VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
RYW S + PS YH+PR++ P+GN+LV+ EE+
Sbjct: 657 RYWPRTSSIDDRCTPSCDYRGEFNPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEK 716
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPT-VQPSCP 671
G+P IT A+ VC V+ H P L SW D G P Q SCP
Sbjct: 717 GGDPTKITFSRRAVTSVCSFVS-EHFPSIDLESW-------DGSATNEGTSPAKAQLSCP 768
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
+GK IS + FAS G P G C Y GSCH +S VVE+AC+ + C++ L FG D
Sbjct: 769 IGKNISSLKFASLGTPSGTCRSYQKGSCHHPNSLSVVEKACLNTNSCTVSLSDESFGKDL 828
Query: 732 CPGIHKALLVDAQC 745
CPG+ K L ++A C
Sbjct: 829 CPGVTKTLAIEADC 842
>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 836
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 347/796 (43%), Positives = 456/796 (57%), Gaps = 67/796 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP +GQY+F GR D+++F+K + + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+ +EW YGG P+WLH + GI FR+DNKP+
Sbjct: 116 PYACAEWNYGGFPLWLHFIPGIQFRTDNKPFEAEMKQFTAKIVDLMKQENLYASQGGPII 175
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY IE + Y+ WAA MA TGVPWVMC+Q +AP P+INACNG C
Sbjct: 176 LSQIENEYGNIEADYGPAAKSYIKWAASMATSLGTGVPWVMCQQQNAPDPIINACNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS KP IWTE +T ++ +G R +D+AF VA F + G++ NYYMYH
Sbjct: 236 -DQFK-PNSNTKPKIWTEGYTGWFLAFGDAVPHRPVEDLAFAVARFYQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR + + YD AP+DEYG +R+PKWGHLK++H AIKLC L+ +
Sbjct: 294 GGTNFGRASGGPFVASSYDYDAPIDEYGFIRQPKWGHLKDVHKAIKLCEEALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ +T VCAAFL N A TV F SY LP S+SILPDCK V NT
Sbjct: 354 SLGPNIEAAVY-KTGVVCAAFLANIATSDA-TVTFNGNSYHLPAWSVSILPDCKNVVLNT 411
Query: 329 ERVSTQYNKRS-KTSNLK-----FDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
++++ S T +LK DS +W E I GLL+QI+ D
Sbjct: 412 AKITSASMISSFTTESLKDVGSLDDSGSRWSWISEPIGISKADSFSTFGLLEQINTTADR 471
Query: 383 SDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
SDY WY+ + + AQ L ++S GH LHAF+NG+ GS G+H+ + + + L
Sbjct: 472 SDYLWYSLSIDLD-AGAQTFLHIKSLGHALHAFINGKLAGSGTGNHEKANVEVDIPITLV 530
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-------QDKSFTNCSWGYQVGLIGE 495
G N LLS+TVGL + GAF + AG+ + + ++ W YQVGL E
Sbjct: 531 SGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVILKCLKNGSNVDLSSKQWTYQVGLKNE 590
Query: 496 KLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L + S W+S + PT Q LTWYKT F AP+GN+P+A++ MGKGEAWVNGQ
Sbjct: 591 DLGLSSGCSGQ---WNSQSTLPTNQPLTWYKTNFVAPSGNNPVAIDFTGMGKGEAWVNGQ 647
Query: 554 SIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVL 610
SIGRYW ++ + KG + + + A + + C T YHVPR++L+P N LVL
Sbjct: 648 SIGRYWPTYASPKGGCTDSCNYRGAYDASKCLKNCGKPSQT-LYHVPRSWLRPDRNTLVL 706
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE GNP I+ T I VC HV+ SH PP+ SW + + G + P V C
Sbjct: 707 FEESGGNPKQISFATKQIGSVCSHVSESHPPPVDSWNSNTESGRKVV------PVVSLEC 760
Query: 671 PLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
P + +S I FASFG P G C + G C S+ + +V++ACIG S C I L F G
Sbjct: 761 PYPNQVVSSIKFASFGTPLGTCGNFKHGLCSSNKALSIVQKACIGSSSCRIELSVNTF-G 819
Query: 730 DPCPGIHKALLVDAQC 745
DPC G+ K+L V+A C
Sbjct: 820 DPCKGVAKSLAVEASC 835
>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
Length = 923
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 357/823 (43%), Positives = 468/823 (56%), Gaps = 89/823 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAKAKEGG+DVI+TY+FWN HEP KGQY F GR DI+RF K + ++GL++ LRIG
Sbjct: 99 MWPSLIAKAKEGGVDVIETYIFWNGHEPAKGQYYFEGRFDIVRFAKLVAAEGLFLFLRIG 158
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL D+ GI FR+DN+PYK
Sbjct: 159 PYACAEWNFGGFPVWLRDIPGIEFRTDNEPYKAEMQNFVTKIVDIMKEEKLYSWQGGPII 218
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + + G Y+ WAA+MA+ TGVPWVMC+Q DAP +++ CN C
Sbjct: 219 LQQIENEYGNIQGKYGQAGKRYMQWAAQMALALDTGVPWVMCRQTDAPEQILDTCNAFYC 278
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS NKP+IWTEDW +Y WG R AQD AF VA F + GS+ NYYMY
Sbjct: 279 -DGFK-PNSYNKPTIWTEDWDGWYADWGEALPHRPAQDSAFAVARFYQRGGSFQNYYMYF 336
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT---GTQ 265
GGTNF RTA IT Y AP+DEYG++R+PKWGHLK+LHAAIKLC P LT G+
Sbjct: 337 GGTNFERTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLC-EPALTAVDGSP 395
Query: 266 NVISLGQLQEAFVFEE----TSG-------VCAAFLVNNDERKAVTVLFRNISYELPRKS 314
I LG +QEA V+ T+G C+AFL N DE K +V SY LP S
Sbjct: 396 RYIKLGPMQEAHVYSSENVHTNGSISGNAQFCSAFLANIDEHKYASVWIFGKSYSLPPWS 455
Query: 315 ISILPDCKTVAFNTERVSTQ------------YNKRSKTSNLKFDS---DEKWEEYREAI 359
+SILPDC+TVAFNT RV TQ Y+ R K L W +E +
Sbjct: 456 VSILPDCETVAFNTARVGTQTSFFNVESGSPSYSSRHKPRILSLGGPYLSSTWWASKEPV 515
Query: 360 LNFDNTLLRAEGLLDQISAAKDASDYFWYTFR--------FHYNSSNAQAPLDVQSHGHI 411
+ + A+G+L+ ++ KD SDY YT R ++NS L + +
Sbjct: 516 GIWSEDIFAAQGILEHLNVTKDISDYLSYTTRVNISDEDVLYWNSEGLLPSLTIDQIRDV 575
Query: 412 LHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV 471
+ FVNG+ GS G +L + L QG N+ LLS VGL + GAFLE+ AG
Sbjct: 576 VRIFVNGKLAGSQVGHW----VSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGF 631
Query: 472 H-RVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRS--PTRQLTWY 523
+V++ D TN W YQ+GL GE +IYS WSS+++ TW+
Sbjct: 632 RGQVKLTGLSNGDIDLTNSLWTYQIGLKGEFSRIYSPEKQGSAGWSSMQNDDTLSPFTWF 691
Query: 524 KTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIH 583
KTTF AP GN P+A++L SMGKG+AWVNG IGRYW G PS YA N S
Sbjct: 692 KTTFDAPEGNGPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCPSSCNYAGNYGDSKC 751
Query: 584 FCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPP 642
AT + YH+PR +L+ + NLLVL EE G+P I+++ + +C ++ ++ PP
Sbjct: 752 RSNCGIATQSWYHIPREWLQESDNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPP 811
Query: 643 LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS 702
LS+W R G + P ++ C G ISKI FAS+G P GDC+ ++VG+CH+S
Sbjct: 812 LSAWSR-AANGRPSVNTVA--PELRLQCDEGHVISKITFASYGTPTGDCQNFSVGNCHAS 868
Query: 703 HSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ +V AC GK+RC+I + + F GDPC + K L V A+C
Sbjct: 869 TTLDLVAEACEGKNRCAISVTNDVF-GDPCRKVVKDLAVVAEC 910
>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 898
Score = 618 bits (1593), Expect = e-174, Method: Compositional matrix adjust.
Identities = 336/808 (41%), Positives = 457/808 (56%), Gaps = 80/808 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGG+DVI+TYVFWN HE G Y F GR D+++F + +Q G+Y+ LRIG
Sbjct: 107 MWPGLVQTAKEGGVDVIETYVFWNGHELSPGNYYFGGRFDLVKFAQTVQQAGMYLILRIG 166
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
PF+ +EW +GG+P+WLH V G VFR+ N+P+
Sbjct: 167 PFVAAEWNFGGVPVWLHYVPGTVFRTYNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPII 226
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY E + E G Y LWAAKMAV +TGVPW+MC+Q DAP PVI+ CN C
Sbjct: 227 LAQIENEYGYYENFYKEDGKKYALWAAKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYC 286
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P SPN+P IWTE+W +++ +GG+ R A+D+AF VA F K GS NYYMYH
Sbjct: 287 DQF--TPTSPNRPKIWTENWPGWFKTFGGRDPHRPAEDVAFSVARFFQKGGSVHNYYMYH 344
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL R PKWGHLKELH AIKLC LL G I
Sbjct: 345 GGTNFGRTAGGPFITTSYDYDAPVDEYGLPRLPKWGHLKELHRAIKLCEHVLLNGKSVNI 404
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ ++SG CAAF+ N D++ TV FRN S+ LP S+SILPDCK V FNT
Sbjct: 405 SLGPSVEADVYTDSSGACAAFISNVDDKNDKTVEFRNASFHLPAWSVSILPDCKNVVFNT 464
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-----KWEEYREAILNFDNTLLRAEGLLDQISAAKDAS 383
+V++Q + + SD+ KW+ +E + G +D I+ KD +
Sbjct: 465 AKVTSQTSVVAMVPESLQQSDKVVNSFKWDIVKEKPGIWGKADFVKNGFVDLINTTKDTT 524
Query: 384 DYFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY W+T + + + L ++S GH LHAFVN EY G+ G+ + FT +N
Sbjct: 525 DYLWHTTSIFVSENEEFLKKGNKPVLLIESTGHALHAFVNQEYEGTGSGNGTHAPFTFKN 584
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGL 492
+ LR G N+ ALL +TVGL +G F + AG+ V+++ + ++ +W Y++G+
Sbjct: 585 PISLRAGKNEIALLCLTVGLQTAGPFYDFVGAGLTSVKIKGLNNGTIDLSSYAWTYKIGV 644
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE L++Y GLN V W+S P + LTWYK AP G++P+ L++ MGKG AW+
Sbjct: 645 QGEYLRLYQGNGLNNVNWTSTSEPPKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWL 704
Query: 551 NGQSIGRYW---VSFKTS----------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVP 597
NG+ IGRYW FK+ K NP + T YHVP
Sbjct: 705 NGEEIGRYWPRKSEFKSEDCVKECDYRGKFNPDKCDTGCGEPTQ----------RWYHVP 754
Query: 598 RAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
R++ KP+GN+LVL EE+ G+P I + C V + P ++ +G+ I
Sbjct: 755 RSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSGACALVAEDY-PSVA----LVSQGEDKI 809
Query: 658 KKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSR 717
+ P + +CP +IS + FASFG+P G C Y G CH +S +VE+AC+ K+
Sbjct: 810 QSNKNIPFARLACPGNTRISAVKFASFGSPSGTCGSYLKGDCHDPNSSTIVEKACLNKND 869
Query: 718 CSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C I L F + CPG+ + L V+A C
Sbjct: 870 CVIKLTEENFKSNLCPGLSRKLAVEAVC 897
>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
Length = 844
Score = 617 bits (1592), Expect = e-174, Method: Compositional matrix adjust.
Identities = 333/795 (41%), Positives = 449/795 (56%), Gaps = 60/795 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AK+GG D I+TYVFWN HE GQY F R D++RF+K ++ GL + LRIG
Sbjct: 59 MWPKLVAEAKDGGADCIETYVFWNGHEIAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG+P+WLH V G VFR++N+P+K
Sbjct: 119 PYVAAEWNYGGVPVWLHYVPGTVFRTNNEPFKNHMKSFTTYIVDMMKKEQLFASQGGNII 178
Query: 93 ---IENEY-QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
IENEY E A+ G PY +WAA MA+ +TGVPW+MC++ DAP PVIN+CNG
Sbjct: 179 LAQIENEYGDYYEQAYGAGGKPYAMWAASMALAQNTGVPWIMCQESDAPDPVINSCNGFY 238
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + F+ PNSP KP IWTE+W ++Q +G R +D+AF VA F K GS NYY+Y
Sbjct: 239 C-DGFQ-PNSPTKPKIWTENWPGWFQTFGESNPHRPPEDVAFAVARFFEKGGSVQNYYVY 296
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNFGRT IT YD AP+DEYGL R PKW HL+ELH +I+LC LL G
Sbjct: 297 HGGTNFGRTTGGPFITTSYDYDAPIDEYGLRRFPKWAHLRELHKSIRLCEHTLLYGNTTF 356
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
+SLG QEA ++ + SG C AFL N D V FRN Y+LP S+SILPDC+ V FN
Sbjct: 357 LSLGPKQEADIYSDQSGGCVAFLANIDSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFN 416
Query: 328 TERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
T +V +Q + + +L+ E+W +RE + G +D I+ KD++DY
Sbjct: 417 TAKVQSQTSMVTMVPESLQASKPERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYL 476
Query: 387 WYTFRF----HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
WYT F Y+S + A L++ S+GH +HAF+N GSA+G+ F+++ T++LR
Sbjct: 477 WYTTSFSVDGSYSSKGSHAVLNIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLR 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAG-----VHRVRVQDKSFTNCSWGYQVGLIGEKL 497
G N+ ALLS+TVGL ++G E AG + VR ++ +W Y++GL GE
Sbjct: 537 TGKNELALLSMTVGLQNAGFAYEWIGAGFTNVNISGVRTGIIDLSSNNWAYKIGLEGEYY 596
Query: 498 QIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
++ N W P + LTWYK P G+DP+ +++QSMGKG AW+NG +I
Sbjct: 597 NLFKPDQTNNQRWIPQSEPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAI 656
Query: 556 GRYW--VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE 613
GRYW S + PS YH+PR++ P+GN+LV+ EE
Sbjct: 657 GRYWPRTSSINDRCTPSCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEE 716
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPT-VQPSC 670
+ G+P IT A+ VC V+ H P L SW D G P Q SC
Sbjct: 717 KGGDPTKITFSRRAVTSVCSFVS-EHFPSIDLESW-------DESAMNEGTPPAKAQLSC 768
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
P GK IS + FAS GNP G C Y +G CH +S VVE+AC+ + C++ L FG D
Sbjct: 769 PEGKSISSVKFASLGNPSGTCRSYQMGRCHHPNSLSVVEKACLNTNSCTVSLTDESFGKD 828
Query: 731 PCPGIHKALLVDAQC 745
C G+ K L ++A C
Sbjct: 829 LCHGVTKTLAIEADC 843
>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
Length = 831
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 342/798 (42%), Positives = 450/798 (56%), Gaps = 79/798 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP GQY F GR D++ FIK ++ GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVVQTYVFWNGHEPSPGQYHFEGRYDLVHFIKLVKQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG PIWL V GI FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPIWLKYVPGISFRTDNEPFKAEMQKFTTKIVQMMKSERLFEWQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MA+ +TGVPW+MCK+DDAP P+IN CNG C
Sbjct: 179 LSQIENEFGPLEWDQGEPAKDYASWAANMAMALNTGVPWIMCKEDDAPDPIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WT++Y +G R +D+A+ VA FI K GS+VNYYMYH
Sbjct: 239 --DWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYH 296
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA F+ T Y APLDEYGL+REPKWGHLKELH AIKLC L+ +
Sbjct: 297 GGTNFERTAGGPFIATSYDYDAPLDEYGLLREPKWGHLKELHRAIKLCEPALVAADPILS 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q+A VF ++G CAAFL N + V F + Y+LP SISILPDCKT FNT
Sbjct: 357 SLGNAQKASVFRSSTGACAAFLENKHKLSYARVSFNGMHYDLPPWSISILPDCKTTVFNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYFW 387
RV +Q ++ +++ W+ Y E I +F GLL+QI+ +D +DY W
Sbjct: 417 ARVGSQISQM----KMEWAGGLTWQSYNEEINSFSELESFTTVGLLEQINMTRDNTDYLW 472
Query: 388 YTFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT + +S L V S GH LH F+NG+ +G+ +GS +N T V L
Sbjct: 473 YTTYVDVAKDEQFLTSGKNPKLTVMSAGHALHVFINGQLSGTVYGSVENPKLTYTGKVKL 532
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIGE 495
G+N + LS+ VGLP+ G E AG+ D + T W YQVGL GE
Sbjct: 533 WSGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGKRDLTWQKWTYQVGLKGE 592
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
+ ++S G + V W + LTWYK F AP G++P+AL++ SMGKG+ W+NGQ I
Sbjct: 593 AMSLHSLSGSSSVEWGEPVQ-KQPLTWYKAFFNAPDGDEPLALDMNSMGKGQIWINGQGI 651
Query: 556 GRYWVSFKTS--------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNL 607
GRYW +K S +G ++T+ N C + YHVPR +L PTGNL
Sbjct: 652 GRYWPGYKASGTCGHCDYRGEYNETKCQTN-------CG-DPSQRWYHVPRPWLNPTGNL 703
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQ 667
LV+ EE G+P GI++ VC V+ P + +W K +K V
Sbjct: 704 LVIFEEWGGDPTGISMVKRTTGSVCADVSEWQ-PSIKNWR----------TKDYEKAEVH 752
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
C G+KI++I FASFG P G C Y+ G CH+ S + ++ CI + C + ++ F
Sbjct: 753 LQCDHGRKITEIKFASFGTPQGSCGNYSEGGCHAHRSYDIFKKNCINQEWCGVSVVPEAF 812
Query: 728 GGDPCPGIHKALLVDAQC 745
GGDPCPG K +V+ C
Sbjct: 813 GGDPCPGTMKRAVVEVTC 830
>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
Length = 836
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 345/794 (43%), Positives = 454/794 (57%), Gaps = 65/794 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVIQTYVFWN HEP G+Y F R D+++FIK + GLYV LRIG
Sbjct: 58 MWPDLIQKSKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GIVFR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGIVFRTDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV +TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 178 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE WT +Y +GG R A+D+AF +A FI K GS+VNYYMYH
Sbjct: 238 -ENFT-PNKNYKPKMWTEVWTGWYTEFGGAVPTRPAEDLAFSIARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL REPKWGHL++LH AIK L++ +V
Sbjct: 296 GGTNFGRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA VF+ SG CAAFL N D + + V F N YELP ISILPDCKT +NT
Sbjct: 356 SLGNGQEAHVFKSKSG-CAAFLANYDTKSSAKVSFGNGQYELPPWPISILPDCKTAVYNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
R+ +Q ++ T S W+ + E + D + +GL +QI+ +D +DY W
Sbjct: 415 ARLGSQSSQMKMT---PVKSALPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLW 471
Query: 388 YTFRFHYNSSN-----AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + ++P L + S GH LH F+NG+ +G+ +G+ +N T V
Sbjct: 472 YMTDITISPDEGFIKRGESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKP 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E AGV + + W Y++GL GE
Sbjct: 532 RSGINKLALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKIGLKGE 591
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W+ S ++ LTWYK TF AP GN P+AL++ SMGKG+ W+NGQ
Sbjct: 592 ALGLHTVSGSSSVEWAEGPSMAQKQPLTWYKATFNAPPGNGPLALDMSSMGKGQIWINGQ 651
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGR+W ++ T++GN YA + C + YHVPR++L P+GNLLV+
Sbjct: 652 SIGRHWPAY-TARGNCGNCYYAGTYDDKKCRTHCG-EPSQRWYHVPRSWLTPSGNLLVVF 709
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P I++ VC + P L++ + G + +P CP
Sbjct: 710 EEWGGDPTKISLVERRTSSVCADIFEGQ-PTLTN-SQKLASGKLN------RPKAHLWCP 761
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I FAS+G P G C + GSCH+ S +R CIGK CS+ + FGGDP
Sbjct: 762 PGQVISDIKFASYGLPQGTCGSFQEGSCHAHKSYDAPKRNCIGKQSCSVAVAPEVFGGDP 821
Query: 732 CPGIHKALLVDAQC 745
CPG K L V+A C
Sbjct: 822 CPGSTKKLSVEAVC 835
>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
Length = 831
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 325/776 (41%), Positives = 450/776 (57%), Gaps = 65/776 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I +AK+GGL+ IQTYVFWN+HEPQ+G+++FSGR D+++FIK IQ G+YV LR+G
Sbjct: 84 MWPSIIKRAKQGGLNTIQTYVFWNVHEPQQGKFNFSGRADLVKFIKLIQKNGMYVTLRLG 143
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKMAV 120
PFI++EWT+G + + H +R KIENEY ++ A+ + G Y+ WA+ +
Sbjct: 144 PFIQAEWTHGYITRYDHKNIAGAYR------KIENEYSAVQRAYKQDGLNYIKWASNLVD 197
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
G+PWVMCKQ+DAP P+INACNG CG+TF GPN NKPS+WTE+WT+ ++V+G P
Sbjct: 198 SMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDPP 257
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVRE 240
RS +DIA+ VA F +KNG++VNYYMYHGGTNFGRT+A ++ T YYD APLDEYGL +E
Sbjct: 258 TQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYDDAPLDEYGLEKE 317
Query: 241 PKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET-SGVCAAFLVNNDERKAV 299
PK+GHLK LH A+ LC +PLL G G+ E +E+ + CAAFL NN+ A
Sbjct: 318 PKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEAAE 377
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKR----SKTSNLKFDSDEKWEEY 355
T+ F+ Y + +SISILPDCKTV +NT ++ +Q+ R SK +N KFD E
Sbjct: 378 TIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKKFDFKVFTETL 437
Query: 356 REAILNFDNTLLRAEGLLDQISAAKDASDYFWYT--FRFHYN----SSNAQAPLDVQSHG 409
+ + GL KD +DY WYT F+ H N + + + S G
Sbjct: 438 PSKLEGNSYIPVELYGL------TKDKTDYGWYTTSFKVHKNHLPTKKGVKTFVRIASLG 491
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
H LHA++NGEY GS HGSH+ SF + V L+ G N +L V G PDSG+++E +
Sbjct: 492 HALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGSYMEHRYT 551
Query: 470 GVHRVRVQDKS-----FTNCS-WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWY 523
G + + + T S WG ++G+ GEKL I++ GL KV W LTWY
Sbjct: 552 GPRGISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGKAPGLTWY 611
Query: 524 ----------KTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQ 573
+T F AP + + MGKG WVNG+ +GRYW SF + G P+Q +
Sbjct: 612 QKFSKECETLQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSPLGQPTQIE 671
Query: 574 YAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE-NGNPLGITVDTIAIRKVC 632
YH+PR+FLKP NLLV+ EEE N P + + VC
Sbjct: 672 --------------------YHIPRSFLKPKKNLLVIFEEEPNVKPELMDFAIVNRDTVC 711
Query: 633 GHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCE 692
+V ++ P + W R + + T++ C KKI+ + FASFGNP G C
Sbjct: 712 SYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLK--CSGTKKIAAVEFASFGNPIGVCG 769
Query: 693 RYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF---GGDPCPGIHKALLVDAQC 745
+ +G+C++ S+ V+E+ C+GK+ C IP+ F D C + K L V +C
Sbjct: 770 NFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVVKMLAVQVKC 825
>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
Length = 840
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 337/796 (42%), Positives = 454/796 (57%), Gaps = 65/796 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP +GQY+F GRND++ F+K + GLYV LRIG
Sbjct: 60 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRGQYNFEGRNDLVGFVKAVAEAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI R+DN+PYK
Sbjct: 120 PYVCAEWNYGGFPLWLHFIPGIKLRTDNEPYKAEMHRFTAKIVEMMKNEKLYASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ WAA MAV TGVPWVMC+Q DAP VIN CNG C
Sbjct: 180 LSQIENEYGNIDKAYGPAAKTYINWAANMAVSLDTGVPWVMCQQADAPSSVINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS + P IWTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 240 DQF--SPNSNSTPKIWTENWSGWFLSFGGAVPQRPVEDLAFAVARFYQRGGTFQNYYMYH 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR++ F+ T Y APLDEYGL+R+PKWGHLK++H AIKLC ++ +
Sbjct: 298 GGTNFGRSSGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDVHKAIKLCEPAMVATDPTIS 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ EA V+ +T VC+AFL N D + TV F SY+LP S+SILPDCK V NT
Sbjct: 358 SLGQNIEAAVY-KTGSVCSAFLANVDTKSDATVTFNGNSYQLPAWSVSILPDCKNVVINT 416
Query: 329 ERVST-----QYNKRSKTSNLKFDS--DEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
+++T + ++S +++++ W E + GLL+QI+ D
Sbjct: 417 AKINTATMVPSFTRQSISADVEPTEAVGSGWSWINEPVGISKGDAFTRVGLLEQINTTAD 476
Query: 382 ASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
SDY WY+ +A L VQS GH LHAFVNG+ GS G+ N ++ V
Sbjct: 477 KSDYLWYSTSIDVK-GGYKADLHVQSLGHALHAFVNGKLAGSGTGNSGNAKVSVEIPVEF 535
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------WGYQVGLI 493
G N LLS+TVGL + GAF + AG+ VQ K N + W YQ+GL
Sbjct: 536 ASGKNTIDLLSLTVGLQNYGAFFDLVGAGITG-PVQLKGSANGTTIDLSSQQWTYQIGLK 594
Query: 494 GEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
GE + S G ++ + + LTWYKT F AP G++P+AL+ MGKGEAWVNGQ
Sbjct: 595 GEDEDLPS--GSSQWISQPTLPKNQPLTWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQ 652
Query: 554 SIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGRYW + K + Y A + C + + YHVPR+++K +GN LVL
Sbjct: 653 SIGRYWPTNVAPKTGCTDCNYRGAYSADKCRKNCG-MPSQKLYHVPRSWMKSSGNTLVLF 711
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P ++ T + +C HV+ SH P+ W + G +P + CP
Sbjct: 712 EEVGGDPTQLSFATRQVESLCSHVSESHPSPVDMWSSDSKAGSK------SRPRLSLECP 765
Query: 672 LGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
+ IS I FAS+G P G C ++ GSC SS + +V++AC+G CSI + + F GD
Sbjct: 766 FPNQVISSIKFASYGRPSGTCGSFSHGSCRSSRALSIVQKACVGSKSCSIEVSTHTF-GD 824
Query: 731 PCPGIHKALLVDAQCR 746
PC G+ K+L V+A C+
Sbjct: 825 PCKGLAKSLAVEASCK 840
>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
Length = 844
Score = 616 bits (1589), Expect = e-173, Method: Compositional matrix adjust.
Identities = 331/795 (41%), Positives = 448/795 (56%), Gaps = 60/795 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AK+GG D I+TYVFWN HE GQY F R D++RF+K ++ GL + LRIG
Sbjct: 59 MWPKLVAEAKDGGADCIETYVFWNGHEIAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG+P+WLH V G VFR++N+P+K
Sbjct: 119 PYVAAEWNYGGVPVWLHYVPGTVFRTNNEPFKNHVKSFTTYIVDMMKKEQLFASQGGNII 178
Query: 93 ---IENEY-QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
IENEY E A+ G PY +WAA MA+ +TGVPW+MC++ DAP PVIN+CNG
Sbjct: 179 LAQIENEYGDYYEQAYGAGGKPYAMWAASMALAQNTGVPWIMCQESDAPDPVINSCNGFY 238
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + F+ PNSP KP IWTE+W ++Q +G R +D+AF VA F K GS NYY+Y
Sbjct: 239 C-DGFQ-PNSPTKPKIWTENWPGWFQTFGESNPHRPPEDVAFAVARFFEKGGSVQNYYVY 296
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNFGRT IT YD AP+DEYGL R PKW HL++LH +I+LC LL G
Sbjct: 297 HGGTNFGRTTGGPFITTSYDYDAPIDEYGLRRFPKWAHLRDLHKSIRLCEHTLLYGNTTF 356
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
+SLG QEA ++ + SG C AFL N D V FRN Y+LP S+SILPDC+ V FN
Sbjct: 357 LSLGPKQEADIYSDQSGGCVAFLANIDSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFN 416
Query: 328 TERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
T +V +Q + + +L+ E+W +RE + G +D I+ KD++DY
Sbjct: 417 TAKVQSQTSMVTMVPESLQASKPERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYL 476
Query: 387 WYTFRF----HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
WYT F Y+S + A L++ S+GH +HAF+N GSA+G+ F+++ ++LR
Sbjct: 477 WYTTSFSVDGSYSSKGSHAVLNIDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLR 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAG-----VHRVRVQDKSFTNCSWGYQVGLIGEKL 497
G N+ ALLS+TVGL ++G E AG + VR ++ +W Y++GL GE
Sbjct: 537 TGKNELALLSMTVGLQNAGFAYEWIGAGFTNVNISGVRTGTIDLSSNNWAYKIGLEGEYY 596
Query: 498 QIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
++ N W P + LTWYK P G+DP+ +++QSMGKG AW+NG +I
Sbjct: 597 NLFKPDQTNNQRWIPQSEPPKNQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAI 656
Query: 556 GRYW--VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE 613
GRYW S + PS YH+PR++ P+GN+LV+ EE
Sbjct: 657 GRYWPRTSSINDRCTPSCNYRGTFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEE 716
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPT-VQPSC 670
+ G+P IT A+ VC V+ H P L SW D G P Q C
Sbjct: 717 KGGDPTKITFSRRAVTSVCSFVS-EHFPSIDLESW-------DESAMTEGTPPAKAQLFC 768
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
P GK IS + FAS GNP G C Y +G CH +S VVE+AC+ + C++ L FG D
Sbjct: 769 PEGKSISSVKFASLGNPSGTCRSYQMGRCHHPNSLSVVEKACLNTNSCTVSLTDESFGKD 828
Query: 731 PCPGIHKALLVDAQC 745
CPG+ K L ++A C
Sbjct: 829 LCPGVTKTLAIEADC 843
>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 836
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 340/796 (42%), Positives = 466/796 (58%), Gaps = 69/796 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGLDVIQTYVFWN HEP +G Y+++GR ++ +FI+ + G+YV LRIG
Sbjct: 58 MWPGLIAKAKEGGLDVIQTYVFWNGHEPTRGVYNYAGRYNLPKFIRLVYEAGMYVNLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW GG P WL + GI FR+DN+P+K
Sbjct: 118 PYVCAEWNSGGFPAWLRFIPGIEFRTDNEPFKNETQRFVNHLVRKLKREKLFAWQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ ++ E G Y+ W A MAV +T VPW+MC+Q +AP VIN CNG C
Sbjct: 178 MAQIENEYGNIDASYGEAGQRYLNWIANMAVATNTSVPWIMCQQPEAPQLVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ ++ PNS +KP+ WTE+WT ++Q WGG R QDIAF VA F K GS++NYYMYH
Sbjct: 238 -DGWR-PNSEDKPAFWTENWTGWFQSWGGGAPTRPVQDIAFSVARFFEKGGSFMNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV-- 267
GGTNF RT + T Y AP+DEY VR+PKWGHLK+LHAA+KLC P L V
Sbjct: 296 GGTNFERTGVESVTTSYDYDAPIDEYD-VRQPKWGHLKDLHAALKLC-EPALVEVDTVPT 353
Query: 268 -ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAF 326
ISLG QEA V++ +SG CAAFL + D ++ V F+ Y+LP S+SILPDCK+V F
Sbjct: 354 GISLGPNQEAHVYQSSSGTCAAFLASWDTNDSL-VTFQGQPYDLPAWSVSILPDCKSVVF 412
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
NT +V Q + + + W Y E + + ++ GLL+QI+ KD +DY
Sbjct: 413 NTAKVGAQSVIMTMQGAVPVTN---WVSYHEPLGPW-GSVFSTNGLLEQIATTKDTTDYL 468
Query: 387 WYTFRFHYNSSN-----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
WY S+ AQA L + S H FVNG YTG++H + R + L
Sbjct: 469 WYMTNVQVAESDVRNISAQATLVMSSLRDAAHTFVNGFYTGTSHQQFMHA----RQPISL 524
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV-HRVRVQDK-----SFTNCSWGYQVGLIGE 495
R G+N+ +LS+T+GL G FLE + AG+ + VR++D +W YQVGL GE
Sbjct: 525 RPGSNNITVLSMTMGLQGYGPFLENEKAGIQYGVRIEDLPSGTIELGGSTWTYQVGLQGE 584
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
Q++ G W++I + Q L W KT F PAGN IAL+L SMGKG WVNG
Sbjct: 585 SKQLFEVNGSLTAEWNTISEVSDQNFLFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGV 644
Query: 554 SIGRYWVSFKTSK-GNPSQTQYAVNTVTSIHFCAIIKAT-NTYHVPRAFLKPTGNLLVLL 611
++GRYW SF + G + Y + S + + N YH+PR +L P N +VL
Sbjct: 645 NLGRYWSSFTAQRDGCDASCDYRGSYTQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLF 704
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPTVQPS 669
EE+ GNP I++ T +++C H++ SH P L+SW + T ++ +
Sbjct: 705 EEKGGNPKDISIATRMPQQICSHISQSHPFPFSLTSWTKRDNLTSTLLRA-----PLTLE 759
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
C G++IS+I FAS+G P GDCE + + SCH++ S V+ +AC+G+ +CS+P++S FG
Sbjct: 760 CAEGQQISRICFASYGTPSGDCEGFVLSSCHANTSYDVLTKACVGRQKCSVPIVSSIFGD 819
Query: 730 DPCPGIHKALLVDAQC 745
DPCPG+ K+L A+C
Sbjct: 820 DPCPGLSKSLAATAEC 835
>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 887
Score = 615 bits (1586), Expect = e-173, Method: Compositional matrix adjust.
Identities = 355/820 (43%), Positives = 453/820 (55%), Gaps = 86/820 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LIAK+KEGG DV+QTYVFWN HEP KGQY+F GR D+++F+K I S GLY+ LRIG
Sbjct: 68 MWSDLIAKSKEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR+DN+P+K
Sbjct: 128 PYVCAEWNFGGFPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E ++ +KG YV WAA MA+ GVPWVMCKQ DAP +I+ACNG C
Sbjct: 188 MLQIENEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS KP +WTEDW +Y WGG R A+D+AF VA F + GS+ NYYMY
Sbjct: 248 -DGFK-PNSRTKPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYF 305
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRT+ F IT Y APLDEYGL EPKWGHLK+LHAAIKLC L+
Sbjct: 306 GGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQY 365
Query: 268 ISLGQLQEAFVFE---ETSG-VCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKT 323
LG QEA ++ ET G VCAAFL N DE K+ V F SY LP S+SILPDC+
Sbjct: 366 RKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRH 425
Query: 324 VAFNTERVSTQYNKRS------------------KTSNLKFDSDEKWEEYREAILNFDNT 365
VAFNT +V Q + ++ + N+ + S + W +E I +
Sbjct: 426 VAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYIS-KSWMALKEPIGIWGEN 484
Query: 366 LLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGHILHAFVN 417
+GLL+ ++ KD SDY W+ R + + + + + S +L FVN
Sbjct: 485 NFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVN 544
Query: 418 GEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG------V 471
+ GS G V QG ND LL+ TVGL + GAFLE+ AG +
Sbjct: 545 KQLAGSIVGHW----VKAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKL 600
Query: 472 HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRA 529
+ D + SW YQVGL GE +IY+ K WS++ + WYKT F
Sbjct: 601 TGFKNGDLDLSKSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDP 660
Query: 530 PAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAI 587
PAG DP+ LNL+SMG+G+AWVNGQ IGRYW G Y A N+ C
Sbjct: 661 PAGTDPVVLNLESMGRGQAWVNGQHIGRYWNIISQKDGCDRTCDYRGAYNSDKCTTNCG- 719
Query: 588 IKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSW 646
K T T YHVPR++LKP+ NLLVL EE GNP I+V T+ +CG V+ SH PPL W
Sbjct: 720 -KPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLRKW 778
Query: 647 -LRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ 705
G I P V C G IS I FAS+G P G C+ +++G CH+S+S
Sbjct: 779 STPDYINGTMSINSVA--PEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASNSL 836
Query: 706 GVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+V AC G++ C I + + F DPC G K L V ++C
Sbjct: 837 SIVSEACKGRNSCFIEVSNTAFISDPCSGTLKTLAVMSRC 876
>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
like [Medicago truncatula]
gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
Length = 841
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 344/793 (43%), Positives = 450/793 (56%), Gaps = 58/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++FIK +Q GLYV LRIG
Sbjct: 58 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYIPGISFRTDNEPFKFQMQKFTEKIVDMMKADRLFESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MAV TGVPW+MCKQDDAP PVIN CNG C
Sbjct: 178 MSQIENEYGPMEYEIGAPGKSYTKWAADMAVGLGTGVPWIMCKQDDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYH
Sbjct: 238 --DYFSPNKDYKPKMWTEAWTGWFTEFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+++PKWGHLK+LH AIKL L++G V
Sbjct: 296 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLQQPKWGHLKDLHRAIKLSEPALISGDPTVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+G QEA VF+ SG CAAFL N + + TV F N+ Y LP SISILPDCK +NT
Sbjct: 356 RIGNYQEAHVFKSKSGACAAFLGNYNPKAFATVAFGNMHYNLPPWSISILPDCKNTVYNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q + + K + + W+ + E + D++ GLL+Q++ +D +DY WY
Sbjct: 416 ARVGSQ-SAQMKMTRVPIHGGLSWQVFTEQTASTDDSSFTMTGLLEQLNTTRDLTDYLWY 474
Query: 389 TFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ + S L V S GH LH F+N + +G+ +GS + T V L
Sbjct: 475 STDVVIDPNEGFLRSGKDPVLTVLSAGHALHVFINSQLSGTIYGSLEFPKLTFSQNVKLI 534
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLSV VGLP+ G E AGV + + + + W Y+VGL GE
Sbjct: 535 PGVNKISLLSVAVGLPNVGPHFETWNAGVLGPITLNGLDEGRRDLSWQKWSYKVGLHGEA 594
Query: 497 LQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L ++S G + V W S+ S + LTWYKTTF AP G P AL++ SMGKG+ W+NGQ+
Sbjct: 595 LSLHSLGGSSSVEWVQGSLVSRMQPLTWYKTTFDAPDGIAPFALDMGSMGKGQVWLNGQN 654
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
+GRYW ++K S G YA N C + YHVP ++L PTGNLLV+ E
Sbjct: 655 LGRYWPAYKAS-GTCDNCDYAGTYNENKCRSNCG-EASQRWYHVPHSWLIPTGNLLVVFE 712
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P GI + I VC + +S ++ + + + +P SC
Sbjct: 713 ELGGDPNGIFLVRRDIDSVCADIYEWQPNLISYQMQTSGKTNKPV-----RPKAHLSCGP 767
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G+KIS I FASFG P G C + GSCH+ S E+ C+G++ C + + FGGDPC
Sbjct: 768 GQKISSIKFASFGTPVGSCGNFHEGSCHAHKSYNTFEKNCVGQNSCKVTVSPENFGGDPC 827
Query: 733 PGIHKALLVDAQC 745
P + K L V+A C
Sbjct: 828 PNVLKKLSVEAIC 840
>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
Length = 833
Score = 615 bits (1585), Expect = e-173, Method: Compositional matrix adjust.
Identities = 344/798 (43%), Positives = 455/798 (57%), Gaps = 70/798 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP KGQYDF GR D+++F+K + GLYV LRIG
Sbjct: 52 MWPDLIQKSKDGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 112 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + G Y+ WAAKMA TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 172 LSQIENEYGNIDSHYGSAGKSYINWAAKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 232 DQF--TPNSNTKPKMWTENWSGWFLSFGGAVPHRPVEDLAFAVARFFQRGGTFQNYYMYH 289
Query: 210 GGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF R T F+ T Y AP+DEYG++R+ KWGHLK++H AIKLC L+ +
Sbjct: 290 GGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVHKAIKLCEEALIATDPKIS 349
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ EA V+ +T VCAAFL N D + TV F SY LP S+SILPDCK V NT
Sbjct: 350 SLGQNLEAAVY-KTGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSILPDCKNVVLNT 408
Query: 329 ERVSTQYNKRSKTSNLKFD-------SDEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++ N S SN + S KW E + + +L GLL+QI+ D
Sbjct: 409 AKI----NSASAISNFVTEDISSLETSSSKWSWINEPVGISKDDILSKTGLLEQINTTAD 464
Query: 382 ASDYFWYTFRFHY-NSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
SDY WY+ + +Q L ++S GH LHAF+NG+ G+ G+ D + +
Sbjct: 465 RSDYLWYSLSLDLADDPGSQTVLHIESLGHALHAFINGKLAGNQAGNSDKSKLNVDIPIA 524
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV--------QDKSFTNCSWGYQVGL 492
L G N LLS+TVGL + GAF + AG+ + ++ W YQ+GL
Sbjct: 525 LVSGKNKIDLLSLTVGLQNYGAFFDTVGAGITGPVILKGLKNGNNTLDLSSRKWTYQIGL 584
Query: 493 IGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE L + S + W+S + P Q L WYKT F AP+G++P+A++ MGKGEAWV
Sbjct: 585 KGEDLGLSS---GSSGGWNSQSTYPKNQPLVWYKTNFDAPSGSNPVAIDFTGMGKGEAWV 641
Query: 551 NGQSIGRYWVSFKTSK-GNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLL 608
NGQSIGRYW ++ S G Y +S K + T YHVPR+FLKP GN L
Sbjct: 642 NGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSFLKPNGNTL 701
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
VL EE G+P I+ T + VC HV++SH P + W + + G K G P +
Sbjct: 702 VLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGG----KVG--PALLL 755
Query: 669 SCP-LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
SCP + IS I FAS+G P G C + G C S+ + +V++ACIG CS+ + + F
Sbjct: 756 SCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSVGVSTDTF 815
Query: 728 GGDPCPGIHKALLVDAQC 745
GDPC G+ K+L V+A C
Sbjct: 816 -GDPCRGVPKSLAVEATC 832
>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
Length = 890
Score = 614 bits (1584), Expect = e-173, Method: Compositional matrix adjust.
Identities = 345/828 (41%), Positives = 460/828 (55%), Gaps = 93/828 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I K+KEGG DVIQ+YVFWN HEP KGQY+F GR D+++FI+ + S GLY+ LRIG
Sbjct: 63 MWPDIIEKSKEGGADVIQSYVFWNGHEPTKGQYNFDGRYDLVKFIRLVGSSGLYLHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL DV GI FR+DN P+K
Sbjct: 123 PYVCAEWNFGGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVI 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY IE ++ ++G Y+ W MA+ VPWVMC+Q DAP +IN+CNG C
Sbjct: 183 MLQVENEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK NSP+KP WTE+W ++ WG + R +D+AF VA F + GS+ NYYMY
Sbjct: 243 -DGFKA-NSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYF 300
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRTA F IT Y +P+DEYGL+REPKWGHLK+LH A+KLC L++ +
Sbjct: 301 GGTNFGRTAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQY 360
Query: 268 ISLGQLQEAFVFEETSGV-------------CAAFLVNNDERKAVTVLFRNISYELPRKS 314
I LG QEA V+ S C+AFL N DERKAV V F +Y LP S
Sbjct: 361 IKLGPKQEAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWS 420
Query: 315 ISILPDCKTVAFNTERVSTQ--------YNKRSKTSNLKFDSDEK---------WEEYRE 357
+SILPDC+ V FNT +V+ Q Y S +LK + ++ W +E
Sbjct: 421 VSILPDCQNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKE 480
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHG 409
I + + +G+L+ ++ KD SDY WY R H + N + + S
Sbjct: 481 PIGIWSDQNFTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVR 540
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
+ FVNG+ TGSA G V F V +G ND LLS +GL +SGAF+E+ A
Sbjct: 541 DVFRVFVNGKLTGSAIGQW--VKFV--QPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGA 596
Query: 470 GVHRVRVQDKSFTNCS-------WGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQL 520
G+ R R++ F N W YQVGL GE L YS K W+ S+ +
Sbjct: 597 GI-RGRIKLTGFKNGDIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTF 655
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNT 578
TWYK F +P G DP+A+NL SMGKG+AWVNG IGRYW G P + Y A N+
Sbjct: 656 TWYKAYFSSPDGTDPVAINLGSMGKGQAWVNGHHIGRYWSVVSPKDGCPRKCDYRGAYNS 715
Query: 579 VTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
C + T + YH+PR++LK + NLLVL EE GNPL I V + +CG V+
Sbjct: 716 GKCATNCG--RPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTGVICGQVSE 773
Query: 638 SHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVG 697
SH P L L + D + P + C G IS + FAS+G P G C +++ G
Sbjct: 774 SHYPSLRK-LSNDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRG 832
Query: 698 SCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
CH+++S VV +AC+GK+ C++ + + FGGDPC I K L V+A+C
Sbjct: 833 PCHATNSLSVVSQACLGKNSCTVEISNSAFGGDPCHSIVKTLAVEARC 880
>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
Length = 858
Score = 614 bits (1583), Expect = e-173, Method: Compositional matrix adjust.
Identities = 345/806 (42%), Positives = 466/806 (57%), Gaps = 72/806 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW++HE +GQYDF GR D++RF+K + GLYV LRIG
Sbjct: 63 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+ +K
Sbjct: 123 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 183 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP +WTE+W+ ++ +GG R A+D+AF VA F + G++ NYYMYH
Sbjct: 243 DQFT--PNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYH 300
Query: 210 GGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR T F+ T Y AP+DEYG+VR+PKWGHL+++H AIKLC L+ +
Sbjct: 301 GGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYS 360
Query: 269 SLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
SLGQ EA V++ + +CAAFL N D + TV F +Y+LP S+SILPDCK V N
Sbjct: 361 SLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLN 420
Query: 328 TERVSTQYNK---RSKTSNLKFDSDEK----------WEEYREAILNFDNTLLRAEGLLD 374
T ++++Q RS S+++ D+D+ W E + L GL++
Sbjct: 421 TAQINSQVTTSEMRSLGSSIQ-DTDDSLITPELATAGWSYAIEPVGITKENALTKPGLME 479
Query: 375 QISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHD 429
QI+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS
Sbjct: 480 QINTTADASDFLWYSTSIVVKGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSAS 539
Query: 430 NVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SFTNC 484
+ +L+ V L G N LLS TVGL + GAF + AGV V++ + ++
Sbjct: 540 SSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSST 599
Query: 485 SWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSM 543
W YQ+GL GE L +Y+ + S PT Q L WYKT F APAG+DP+A++ M
Sbjct: 600 DWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGM 659
Query: 544 GKGEAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAF 600
GKGEAWVNGQSIGRYW + G + Y A ++ + C T YHVPR+F
Sbjct: 660 GKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQT-LYHVPRSF 718
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
L+P N LVL E+ G+P I+ T +C HV+ H + SW+ +Q T
Sbjct: 719 LQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQT----- 773
Query: 661 GKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
+ P ++ CP G+ IS I FASFG P G C Y G C SS + VV+ AC+G + CS
Sbjct: 774 -QGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCS 832
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
+P+ S F GDPC G+ K+L+V+A C
Sbjct: 833 VPVSSNNF-GDPCSGVTKSLVVEAAC 857
>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 614 bits (1583), Expect = e-173, Method: Compositional matrix adjust.
Identities = 351/798 (43%), Positives = 464/798 (58%), Gaps = 67/798 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNL+EP +GQYDF GR D+++F+K + + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 116 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVI 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MA TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 176 LSQIENEYGNIDSAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 236 DQF--TPNSNTKPKMWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ F+ T Y AP+DEYG++R+PKWGHLKE+H AIKLC L+ +
Sbjct: 294 GGTNFDRTSGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ +T VCAAFL N D + VTV F SY LP S+SILPDCK V NT
Sbjct: 354 SLGPNLEAAVY-KTGSVCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNT 412
Query: 329 ERVSTQYNKRS-KTSNLKFD------SDEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++++ S T +LK D S W E + GLL+QI+ D
Sbjct: 413 AKINSASAISSFTTESLKEDIGSSEASSTGWSWISEPVGISKADSFPQTGLLEQINTTAD 472
Query: 382 ASDYFWYTFRFHYN-SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
SDY WY+ Y + +Q L ++S GH LHAF+NG+ GS G+ FT+ V
Sbjct: 473 KSDYLWYSLSIDYKGDAGSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVT 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------WGYQVGL 492
L G N LLS+TVGL + GAF + AG+ + K N + W YQVGL
Sbjct: 533 LVAGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVIL-KGLANGNTLDLSYQKWTYQVGL 591
Query: 493 IGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE L + S + W+S + P Q L WYKTTF AP+G+DP+A++ MGKGEAWV
Sbjct: 592 KGEDLGLSSG---SSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWV 648
Query: 551 NGQSIGRYWVSFKTSK-GNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLL 608
NGQSIGRYW ++ S G Y S K + T YHVPR++LKP+GN+L
Sbjct: 649 NGQSIGRYWPTYVASDAGCTDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNIL 708
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
VL EE+ G+P I+ T +C HV++SH PP+ W + G +K G P +
Sbjct: 709 VLFEEKGGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSDTESG----RKVG--PVLSL 762
Query: 669 SCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
+CP + IS I FAS+G P G C + G C S+ + +V++ACIG S CS+ + S F
Sbjct: 763 TCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSETF 822
Query: 728 GGDPCPGIHKALLVDAQC 745
G+PC G+ K+L V+A C
Sbjct: 823 -GNPCRGVAKSLAVEATC 839
>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
Length = 715
Score = 614 bits (1583), Expect = e-173, Method: Compositional matrix adjust.
Identities = 307/669 (45%), Positives = 411/669 (61%), Gaps = 64/669 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAKEGGL++IQTYVFWN+HEP +GQ++F G D+++FIK I QGLYV LRIG
Sbjct: 58 MWPDIIRKAKEGGLNLIQTYVFWNIHEPVQGQFNFEGNYDVVKFIKTIGEQGLYVTLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+IE+EW GG P WL +V I FRS N+P+
Sbjct: 118 PYIEAEWNQGGFPYWLREVPNITFRSYNEPFIHHMKKYSEMVIDLMKKEKLFAPQGGPII 177
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY ++ A+ + G YV WAA MA + GVPW+MCKQ DAP VIN CNG C
Sbjct: 178 MAQIENEYNNVQLAYRDNGKKYVEWAANMATGLYNGVPWIMCKQKDAPAQVINTCNGRHC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TF GPN PNKPS+WTE+WT+ Y+ +G P R+A+DIAF VA F AKNG+ NYYMY+
Sbjct: 238 ADTFTGPNGPNKPSLWTENWTAQYRTFGDPPSQRAAEDIAFSVARFFAKNGTLTNYYMYY 297
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTN+GRT ++F+ T YYD+APLDE+GL REPKW HL++LH A++L R LL GT +V
Sbjct: 298 GGTNYGRTGSSFVTTRYYDEAPLDEFGLYREPKWSHLRDLHRALRLSRRALLWGTPSVQK 357
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
+ Q E V+E+ CAAFL NN T+ FR Y LP KS+SILPDCK ++ NT+
Sbjct: 358 INQHLEITVYEKPGTDCAAFLTNNHTTLPATIKFRGREYYLPEKSVSILPDCKLLSTNTQ 417
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+ +Q+N R+ + K + KWE Y+E + + L+ L+ S KD SDY WY+
Sbjct: 418 TIVSQHNSRNFLPSEK-AKNLKWEMYQEKVPTISDLSLKNREPLELYSLTKDTSDYAWYS 476
Query: 390 FRFHYNSSNAQAP------LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
+++ + L + S GH L AFVNGE+ G HG++ SF + V L+
Sbjct: 477 TSINFDRHDLPMRPDILPVLQIASMGHALSAFVNGEFVGFGHGNNIEKSFVFQKPVILKP 536
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQ 498
GTN ++L+ TVG P+SGA++E++ AG + VQ T +WG++VG+ GEK Q
Sbjct: 537 GTNTISILAETVGFPNSGAYMEKRFAGPRGITVQGLMAGTLDITQNNWGHEVGVFGEKEQ 596
Query: 499 IYSNLGLNKVLWSSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+++ G KV W+ + PT+ +TWYKT F AP GN+P+AL + M KG WVNG S+GR
Sbjct: 597 LFTEEGAKKVKWTPVNGPTKGAVTWYKTYFDAPEGNNPVALKMDKMQKGMMWVNGNSLGR 656
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW SF + G P+Q + YH+PRAFLKPT NLLV+ EE G+
Sbjct: 657 YWSSFLSPLGQPTQFE--------------------YHIPRAFLKPTNNLLVIFEETGGH 696
Query: 618 PLGITVDTI 626
P I V +
Sbjct: 697 PETIEVQIV 705
>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
Length = 887
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 353/818 (43%), Positives = 448/818 (54%), Gaps = 82/818 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI K+KEGG DVIQTYVFW+ HEP KGQY+F GR D+++F+K I S GLY+ LRIG
Sbjct: 68 MWSDLIEKSKEGGADVIQTYVFWSGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR+DN+P+K
Sbjct: 128 PYVCAEWNFGGFPVWLRDIPGIQFRTDNEPFKKEMQKFVTKIVDLMRDAKLFCWQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E ++ +KG YV WAA MA+ GVPWVMCKQ DAP +I+ACNG C
Sbjct: 188 MLQIENEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS KP +WTEDW +Y WGG R A+D+AF VA F + GS+ NYYMY
Sbjct: 248 -DGFK-PNSQMKPILWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYF 305
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRT+ F IT Y APLDEYGL EPKWGHLK+LHAAIKLC L+
Sbjct: 306 GGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQY 365
Query: 268 ISLGQLQEAFVFE---ETSG-VCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKT 323
LG QEA ++ ET G VCAAFL N DE K+ V F SY LP S+SILPDC+
Sbjct: 366 RKLGSNQEAHIYRGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRH 425
Query: 324 VAFNTERVSTQYNKRS------------------KTSNLKFDSDEKWEEYREAILNFDNT 365
VAFNT +V Q + ++ + N+ + S + W +E I +
Sbjct: 426 VAFNTAKVGAQTSVKTVESARPSLGSKSILQKVVRQDNVSYIS-KSWMALKEPIGIWGEN 484
Query: 366 LLRAEGLLDQISAAKDASDYFWYTFRF--------HYNSSNAQAPLDVQSHGHILHAFVN 417
+GLL+ ++ KD SDY W+ R + + A + + S +L FVN
Sbjct: 485 NFTFQGLLEHLNVTKDRSDYLWHKTRITVSEDDISFWKKNGANPTVSIDSMRDVLRVFVN 544
Query: 418 GEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG------V 471
+ +GS G V QG ND LL+ TVGL + GAFLE+ AG +
Sbjct: 545 KQLSGSVVGHW----VKAVQPVRFMQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKL 600
Query: 472 HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRA 529
+ D SW YQVGL GE +IY+ K WS++ + WYKT F
Sbjct: 601 TGFKNGDMDLAKSSWTYQVGLKGEAEKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDT 660
Query: 530 PAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIK 589
PAG DP+ L+L+SMGKG+AWVNG IGRYW G Y + K
Sbjct: 661 PAGTDPVVLDLESMGKGQAWVNGHHIGRYWNIISQKDGCERTCDYRGAYYSDKCTTNCGK 720
Query: 590 ATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSW-L 647
T T YHVPR++LKP+ NLLVL EE GNP I+V T+ +CG V SH PPL W
Sbjct: 721 PTQTRYHVPRSWLKPSSNLLVLFEETGGNPFNISVKTVTAGILCGQVLESHYPPLRKWST 780
Query: 648 RHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGV 707
G I P V C G IS I FAS+G P G C+R+++G CH+S+S +
Sbjct: 781 PDYINGTMSINSVA--PEVYLHCEDGHVISSIEFASYGTPRGSCDRFSIGKCHASNSLSI 838
Query: 708 VERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
V AC G++ C I + + F DPC G K L V A+C
Sbjct: 839 VSEACKGRTSCFIEVSNTAFRSDPCSGTLKTLAVMARC 876
>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
Length = 830
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 341/783 (43%), Positives = 448/783 (57%), Gaps = 46/783 (5%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D+++F+K ++ GLYV LRIG
Sbjct: 55 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVKEAGLYVNLRIG 114
Query: 61 PFIESEWTYG-----GLPIWLHDVAGI---------------VFRSDNKPY---KIENEY 97
P+I +EW +G G + + A + +F S P +IENEY
Sbjct: 115 PYICAEWNFGHQFQNGQWPFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQIENEY 174
Query: 98 QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN 157
+E G Y WAA+MAV TGVPWVMCKQDDAP P+IN CNG C + PN
Sbjct: 175 GPMEYELGSPGQAYTKWAAQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYC--DYFSPN 232
Query: 158 SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRT 217
KP +WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYHGGTNFGRT
Sbjct: 233 KAYKPKMWTEAWTGWFTQFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRT 292
Query: 218 AAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEA 276
A F+ T Y APLDEYGL+R+PKWGHLK+LH AIKLC L++G VI LG QEA
Sbjct: 293 AGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEA 352
Query: 277 FVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYN 336
VF +G CAAFL N +R V FRN+ Y LP SISILPDCK +NT RV Q +
Sbjct: 353 HVFNYKAGGCAAFLANYHQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQ-S 411
Query: 337 KRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS 396
K + + W+ Y E + + GLL+QI+ +D SDY WY H +
Sbjct: 412 ATIKMTPVPMHGGLSWQTYNEEPSSSGDNTFTMVGLLEQINTTRDVSDYLWYMTDVHIDP 471
Query: 397 S-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGAL 450
S + + P L V S GH LH F+NG+ +G+A+GS D T V LR G N +L
Sbjct: 472 SEGFLKSGKYPVLTVLSAGHALHVFINGQLSGTAYGSLDFPKLTFSQGVSLRAGVNKISL 531
Query: 451 LSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLG 504
LS+ VGLP+ G E AG+ + + + W Y++GL GE L ++S G
Sbjct: 532 LSIAVGLPNVGPHFETWNAGILGPVTLNGLNEGRMDLSWQKWSYKIGLHGEALSLHSISG 591
Query: 505 LNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSF 562
+ V W+ S+ + + L+WYKTTF APAGN P+AL++ SMGKG+ W+NGQ +GR+W ++
Sbjct: 592 SSSVEWAEGSLVAQKQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAY 651
Query: 563 KTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGIT 622
K S T + YHVP+++LKPTGNLLV+ EE G+P G++
Sbjct: 652 KASGTCGECTYIGTYNENKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGVS 711
Query: 623 VDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFA 682
+ + VC + P L + ++ + + K +P SC G+KI I FA
Sbjct: 712 LVRREVDSVCADIYEWQ-PTL---MNYQMQASGKVNK-PLRPKAHLSCGPGQKIRSIKFA 766
Query: 683 SFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVD 742
SFG P+G C Y GSCH+ HS C+G++ CS+ + FGGDPCP + K L +
Sbjct: 767 SFGTPEGVCGSYNQGSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCPSVMKKLAAE 826
Query: 743 AQC 745
A C
Sbjct: 827 AIC 829
>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 956
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 345/806 (42%), Positives = 466/806 (57%), Gaps = 72/806 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW++HE +GQYDF GR D++RF+K + GLYV LRIG
Sbjct: 161 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 220
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+ +K
Sbjct: 221 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 280
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 281 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 340
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP +WTE+W+ ++ +GG R A+D+AF VA F + G++ NYYMYH
Sbjct: 341 DQFT--PNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYH 398
Query: 210 GGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR T F+ T Y AP+DEYG+VR+PKWGHL+++H AIKLC L+ +
Sbjct: 399 GGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYS 458
Query: 269 SLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
SLGQ EA V++ + +CAAFL N D + TV F +Y+LP S+SILPDCK V N
Sbjct: 459 SLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLN 518
Query: 328 TERVSTQYNK---RSKTSNLKFDSDEK----------WEEYREAILNFDNTLLRAEGLLD 374
T ++++Q RS S+++ D+D+ W E + L GL++
Sbjct: 519 TAQINSQVTTSEMRSLGSSIQ-DTDDSLITPELATAGWSYAIEPVGITKENALTKPGLME 577
Query: 375 QISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHD 429
QI+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS
Sbjct: 578 QINTTADASDFLWYSTSIVVKGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSAS 637
Query: 430 NVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SFTNC 484
+ +L+ V L G N LLS TVGL + GAF + AGV V++ + ++
Sbjct: 638 SSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSST 697
Query: 485 SWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSM 543
W YQ+GL GE L +Y+ + S PT Q L WYKT F APAG+DP+A++ M
Sbjct: 698 DWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGM 757
Query: 544 GKGEAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAF 600
GKGEAWVNGQSIGRYW + G + Y A ++ + C T YHVPR+F
Sbjct: 758 GKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQT-LYHVPRSF 816
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
L+P N LVL E+ G+P I+ T +C HV+ H + SW+ +Q T
Sbjct: 817 LQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQT----- 871
Query: 661 GKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
+ P ++ CP G+ IS I FASFG P G C Y G C SS + VV+ AC+G + CS
Sbjct: 872 -QGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCS 930
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
+P+ S F GDPC G+ K+L+V+A C
Sbjct: 931 VPVSSNNF-GDPCSGVTKSLVVEAAC 955
>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 796
Score = 613 bits (1582), Expect = e-173, Method: Compositional matrix adjust.
Identities = 345/806 (42%), Positives = 466/806 (57%), Gaps = 72/806 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW++HE +GQYDF GR D++RF+K + GLYV LRIG
Sbjct: 1 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+ +K
Sbjct: 61 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 120
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 121 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 180
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP +WTE+W+ ++ +GG R A+D+AF VA F + G++ NYYMYH
Sbjct: 181 DQFT--PNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYH 238
Query: 210 GGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR T F+ T Y AP+DEYG+VR+PKWGHL+++H AIKLC L+ +
Sbjct: 239 GGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYS 298
Query: 269 SLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
SLGQ EA V++ + +CAAFL N D + TV F +Y+LP S+SILPDCK V N
Sbjct: 299 SLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLN 358
Query: 328 TERVSTQYNK---RSKTSNLKFDSDEK----------WEEYREAILNFDNTLLRAEGLLD 374
T ++++Q RS S+++ D+D+ W E + L GL++
Sbjct: 359 TAQINSQVTTSEMRSLGSSIQ-DTDDSLITPELATAGWSYAIEPVGITKENALTKPGLME 417
Query: 375 QISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHD 429
QI+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS
Sbjct: 418 QINTTADASDFLWYSTSIVVKGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSAS 477
Query: 430 NVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SFTNC 484
+ +L+ V L G N LLS TVGL + GAF + AGV V++ + ++
Sbjct: 478 SSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSST 537
Query: 485 SWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSM 543
W YQ+GL GE L +Y+ + S PT Q L WYKT F APAG+DP+A++ M
Sbjct: 538 DWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGM 597
Query: 544 GKGEAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAF 600
GKGEAWVNGQSIGRYW + G + Y A ++ + C T YHVPR+F
Sbjct: 598 GKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQT-LYHVPRSF 656
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
L+P N LVL E+ G+P I+ T +C HV+ H + SW+ +Q T
Sbjct: 657 LQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQT----- 711
Query: 661 GKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
+ P ++ CP G+ IS I FASFG P G C Y G C SS + VV+ AC+G + CS
Sbjct: 712 -QGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCS 770
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
+P+ S F GDPC G+ K+L+V+A C
Sbjct: 771 VPVSSNNF-GDPCSGVTKSLVVEAAC 795
>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 918
Score = 613 bits (1581), Expect = e-172, Method: Compositional matrix adjust.
Identities = 352/823 (42%), Positives = 462/823 (56%), Gaps = 88/823 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAK KEGG+D I+TYVFWN HEP KGQY F GR DI+RF K + ++GL++ LRIG
Sbjct: 93 MWPSLIAKCKEGGVDAIETYVFWNGHEPAKGQYYFEGRFDIVRFAKLVAAEGLFLFLRIG 152
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL DV GI FR+DN+PYK
Sbjct: 153 PYACAEWNFGGFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPII 212
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + + G Y+LWAA+MA+ TGVPWVMC+Q DAP ++N CN C
Sbjct: 213 LQQIENEYGNIQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYC 272
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS NKP+IWTEDW +Y WG R AQD AF VA F + GS NYYMY
Sbjct: 273 -DGFK-PNSYNKPTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYF 330
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPL--LTGTQN 266
GGTNF RTA IT Y AP+DEYG++R+PKWGHLK+LHAAIKLC L + G+ +
Sbjct: 331 GGTNFERTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPH 390
Query: 267 VISLGQLQEAFVFEE-----------TSGVCAAFLVNNDERKAVTVLFRNISYELPRKSI 315
+ LG +QEA V+ S C+AFL N DE K +V SY LP S+
Sbjct: 391 YVKLGPMQEAHVYSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSV 450
Query: 316 SILPDCKTVAFNTERVSTQ------------YNKRSKTSNLKFDS----DEKWEEYREAI 359
SILPDC+TVAFNT RV TQ Y+ R K L W ++E +
Sbjct: 451 SILPDCETVAFNTARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPV 510
Query: 360 LNFDNTLLRAEGLLDQISAAKDASDYFWYTFR--------FHYNSSNAQAPLDVQSHGHI 411
+ + A+G+L+ ++ KD SDY YT R ++NS L + +
Sbjct: 511 GIWGEGIFTAQGILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDV 570
Query: 412 LHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV 471
FVNG+ GS G +L + L QG N+ LLS VGL + GAFLE+ AG
Sbjct: 571 ARVFVNGKLAGSKVGHW----VSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGF 626
Query: 472 H-RVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRS--PTRQLTWY 523
+V++ D TN W YQ+GL GE +IYS WSS+++ TW+
Sbjct: 627 RGQVKLTGLSNGDIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWF 686
Query: 524 KTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIH 583
KT F AP GN P+ ++L SMGKG+AWVNG IGRYW G PS YA S
Sbjct: 687 KTMFDAPEGNGPVTIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCPSSCNYAGTYSDSKC 746
Query: 584 FCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPP 642
AT + YH+PR +L+ +GNLLVL EE G+P I+++ + +C ++ ++ PP
Sbjct: 747 RSNCGIATQSWYHIPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPP 806
Query: 643 LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS 702
LS+W R G + P ++ C G ISKI FAS+G P G C+ ++VG+CH+S
Sbjct: 807 LSAWSR-AANGRPSVNTVA--PELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHAS 863
Query: 703 HSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ +V AC GK+RC+I + + F GDPC + K L V+A+C
Sbjct: 864 TTLDLVVEACEGKNRCAISVTNEVF-GDPCRKVVKDLAVEAEC 905
>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
Length = 836
Score = 613 bits (1580), Expect = e-172, Method: Compositional matrix adjust.
Identities = 344/794 (43%), Positives = 454/794 (57%), Gaps = 65/794 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVIQTYVFWN HEP G+Y F R D+++FIK + GLYV LRIG
Sbjct: 58 MWPDLIQKSKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GIVFR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGIVFRTDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV +TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 178 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE WT +Y +GG R A+D+AF +A FI K GS+VNYYMYH
Sbjct: 238 -ENFT-PNKNYKPKMWTEVWTGWYTEFGGAVPTRPAEDLAFSIARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL REPKWGHL++LH AIK L++ +V
Sbjct: 296 GGTNFGRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA VF+ SG CAAFL N D + + V F N YELP SISILPDC+T +NT
Sbjct: 356 SLGNSQEAHVFKSKSG-CAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCRTAVYNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
R+ +Q ++ T S W+ + E + D + +GL +QI+ +D +DY W
Sbjct: 415 ARLGSQSSQMKMT---PVKSALPWQSFIEESASSDESDTTTLDGLWEQINVTRDTTDYSW 471
Query: 388 YTFRFHYNSSN-----AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + ++P L + S GH LH F+NG+ +G+ +G+ +N T V L
Sbjct: 472 YMTDITISPDEGFIKRGESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKL 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E AGV + + W Y+VGL GE
Sbjct: 532 RSGINKLALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKVGLKGE 591
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W+ S ++ LTWY+ TF AP GN P+AL++ SMGKG+ W+NGQ
Sbjct: 592 ALGLHTVSGSSSVEWAEGPSMAQKQPLTWYRATFNAPPGNGPLALDMSSMGKGQIWINGQ 651
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
SIGR+W ++ T++GN YA + C + YHVPR++L +GNLLV+
Sbjct: 652 SIGRHWPAY-TARGNCGNCYYAGTYDDKKCRTHCG-EPSQRWYHVPRSWLTTSGNLLVVF 709
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P I++ VC + P L++ + G + +P CP
Sbjct: 710 EEWGGDPTKISLVERRTSSVCADIFEGQ-PTLTN-SQKLASGKLN------RPKAHLWCP 761
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I FAS+G G C + GSCH+ S +R CIGK CS+ + FGGDP
Sbjct: 762 PGQVISDIKFASYGLSQGTCGSFQEGSCHAHKSYDAPKRNCIGKQSCSVTVAPEVFGGDP 821
Query: 732 CPGIHKALLVDAQC 745
CPG K L V+A C
Sbjct: 822 CPGSTKKLSVEAVC 835
>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
Length = 882
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 349/826 (42%), Positives = 459/826 (55%), Gaps = 94/826 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG DVIQTYVFWN HEP + QY+F GR DI++F+K + S GLY+ LRIG
Sbjct: 59 MWPDLIAKSKEGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR+DN P+K
Sbjct: 119 PYVCAEWNFGGFPVWLRDIPGIEFRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E +F ++G YV WAA+MA++ GVPWVMC+Q DAP +INACNG C
Sbjct: 179 MLQIENEYGNVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTEDW ++ WGG+ R +DIAF VA F + GS+ NYYMY
Sbjct: 239 DAFW--PNSANKPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYF 296
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL-TGTQNV 267
GGTNFGR++ F +T Y AP+DEYGL+ +PKWGHLKELHAAIKLC L+ +
Sbjct: 297 GGTNFGRSSGGPFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALVAVDSPQY 356
Query: 268 ISLGQLQEAFVFEETSGV----------CAAFLVNNDERKAVTVLFRNISYELPRKSISI 317
I LG +QEA V+ + C+AFL N DE K +V F Y+LP S+SI
Sbjct: 357 IKLGPMQEAHVYRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSI 416
Query: 318 LPDCKTVAFNTERVSTQYNKRSKTSNLKFDSD-----------------EKWEEYREAIL 360
LPDC+T FNT +V Q + ++ +L + + W +E I
Sbjct: 417 LPDCRTTVFNTAKVGAQTSIKTVEFDLPLVRNISVTQPLMVQNKISYVPKTWMTLKEPIS 476
Query: 361 NFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS-------SNAQAP-LDVQSHGHIL 412
+ +G+L+ ++ KD SDY W R + ++ N +P L + S IL
Sbjct: 477 VWSENNFTIQGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDIL 536
Query: 413 HAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH 472
H FVNG+ GS G V + L QG ND LLS TVGL + GAFLE+ AG
Sbjct: 537 HIFVNGQLIGSVIGHWVKVV----QPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGF- 591
Query: 473 RVRVQDKSFTN-------CSWGYQVGLIGEKLQIYSNLGLNKVLWSSIR---SPTRQLTW 522
+ +V+ F N SW YQVGL GE +IY K W+ + SP+ TW
Sbjct: 592 KGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLTPDASPS-TFTW 650
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSI 582
YKT F AP G +P+AL+L SMGKG+AWVNG IGRYW G + Y + TS
Sbjct: 651 YKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWTRVAPKDGC-GKCDYRGHYHTSK 709
Query: 583 HFCAIIKATNT---YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSH 639
CA T YH+PR++L+ + NLLVL EE G P I+V + + + +C V+ SH
Sbjct: 710 --CATNCGNPTQIWYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTICAEVSESH 767
Query: 640 LPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSC 699
P L +W K P + C G IS I FAS+G P G C+ ++ G C
Sbjct: 768 YPSLQNWSPSDFIDQNSKNKM--TPEMHLQCDDGHTISSIEFASYGTPQGSCQMFSQGQC 825
Query: 700 HSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
H+ +S +V +AC GK C I +L+ FGGDPC GI K L V+A+C
Sbjct: 826 HAPNSLALVSKACQGKGSCVIRILNSAFGGDPCRGIVKTLAVEAKC 871
>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
Length = 822
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 337/796 (42%), Positives = 449/796 (56%), Gaps = 78/796 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP GQY F GR D++ FIK ++ GLYV LRIG
Sbjct: 53 MWPDLIEKAKDGGLDVVQTYVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 113 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPII 172
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MAV +TGVPW+MCK+DDAP P+IN CNG C
Sbjct: 173 LSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTGVPWIMCKEDDAPDPIINTCNGFYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WT++Y +G R +D+A+ VA FI K GS+VNYYM+H
Sbjct: 233 --DWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMFH 290
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+REPKWGHLK+LH AIKLC L+ G V
Sbjct: 291 GGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVT 350
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG Q++ VF ++G CAAFL N D+ V F + Y+LP SISILPDCKT FNT
Sbjct: 351 SLGNAQKSSVFRSSTGACAAFLDNKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNT 410
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV +Q ++ +++ W+ Y E I +F GLL+QI+ +D +DY WY
Sbjct: 411 ARVGSQISQM----KMEWAGGFAWQSYNEEINSFGEDPFTTVGLLEQINVTRDNTDYLWY 466
Query: 389 TFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
T SN + P + ++ + G+ +GS D+ T V L
Sbjct: 467 TTYVDVAQDDQFLSNGENP-KLTVMCFLILNILFNLLAGTVYGSVDDPKLTYTGNVKLWA 525
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIGEKL 497
G+N + LS+ VGLP+ G E AG+ D + T W YQVGL GE +
Sbjct: 526 GSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESM 585
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
++S G + V W + LTWYK F AP G++P+AL++ SMGKG+ W+NGQ IGR
Sbjct: 586 SLHSLSGSSTVEWGEPVQ-KQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGR 644
Query: 558 YWVSFKTS--------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLV 609
YW +K S +G +T+ N S + YHVPR++L PTGNLLV
Sbjct: 645 YWPGYKASGNCGTCDYRGEYDETKCQTNCGDS--------SQRWYHVPRSWLSPTGNLLV 696
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS 669
+ EE G+P GI++ +I VC V+ P + +W K +K V
Sbjct: 697 IFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWH----------TKDYEKAKVHLQ 745
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
C G+KI++I FASFG P G C Y+ G CH+ S + + C+G+ RC + ++ FGG
Sbjct: 746 CDNGQKITEIKFASFGTPQGSCGSYSEGGCHAHKSYDIFWKNCVGQERCGVSVVPEIFGG 805
Query: 730 DPCPGIHKALLVDAQC 745
DPCPG K +V+A C
Sbjct: 806 DPCPGTMKRAVVEAIC 821
>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 612 bits (1578), Expect = e-172, Method: Compositional matrix adjust.
Identities = 335/803 (41%), Positives = 450/803 (56%), Gaps = 67/803 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGG+DVI+TYVFWN HE Y F GR D+++F+K +Q +Y+ LR+G
Sbjct: 53 MWPGLVKTAKEGGIDVIETYVFWNGHELSPDNYYFGGRYDLLKFVKIVQQARMYLILRVG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+P+WLH V G VFR++++P+K
Sbjct: 113 PFVAAEWNFGGVPVWLHYVPGTVFRTNSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPII 172
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E + + G PY +WAA MA+ + GVPW+MC+Q DAP PVIN CN C
Sbjct: 173 LAQVENEYGDTERIYGDGGKPYAMWAANMALSQNIGVPWIMCQQYDAPDPVINTCNSFYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNSPNKP +WTE+W +++ +G R +DIAF VA F K GS NYYMYH
Sbjct: 233 DQF--TPNSPNKPKMWTENWPGWFKTFGAPDPHRPHEDIAFSVARFFQKGGSLQNYYMYH 290
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD AP+DEYGL R PKWGHLKELH AIK C LL G +
Sbjct: 291 GGTNFGRTSGGPFITTSYDYNAPIDEYGLARLPKWGHLKELHRAIKSCEHVLLYGEPINL 350
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QE V+ ++SG CAAF+ N DE++ ++F+N+SY +P S+SILPDCK V FNT
Sbjct: 351 SLGPSQEVDVYTDSSGGCAAFISNVDEKEDKIIVFQNVSYHVPAWSVSILPDCKNVVFNT 410
Query: 329 ERVSTQYNK------RSKTSNLKFDSDEK---WEEYREAILNFDNTLLRAEGLLDQISAA 379
+V +Q ++ + S + + D K WE + E + G +D I+
Sbjct: 411 AKVGSQTSQVEMVPEELQPSLVPSNKDLKGLQWETFVEKAGIWGEADFVKNGFVDHINTT 470
Query: 380 KDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
KD +DY WYT S +Q L V+S GH LHAFVN + GSA G+ + F
Sbjct: 471 KDTTDYLWYTVSLTVGESENFLKEISQPVLLVESKGHALHAFVNQKLQGSASGNGSHSPF 530
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-----KSFTNCSWGY 488
+ L+ G ND ALLS+TVGL ++G F E AG+ V+++ + +W Y
Sbjct: 531 KFECPISLKAGKNDIALLSMTVGLQNAGPFYEWVGAGLTSVKIKGLNNGIMDLSTYTWTY 590
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKG 546
++GL GE L IY GLN V W S P +Q LTWYK P+GN+PI L++ MGKG
Sbjct: 591 KIGLQGEHLLIYKPEGLNSVKWLSTPEPPKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKG 650
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKP 603
AW+NG+ IGRYW K+S + + + C+ T YHVPR++ KP
Sbjct: 651 LAWLNGEEIGRYWPR-KSSIHDKCVQECDYRGKFMPNKCSTGCGEPTQRWYHVPRSWFKP 709
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP-PLSSWLRHRQRGDTDIKKFGK 662
+GN+LV+ EE+ G+P I VC V+ H L SW + + +
Sbjct: 710 SGNILVIFEEKGGDPTKIRFSRRKTTGVCALVSEDHPTYELESWHKDANENNKN------ 763
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
K T+ CP IS + FAS+G P G C Y+ G CH +S VVE+ CI K+ C+I L
Sbjct: 764 KATIHLKCPENTHISSVKFASYGTPTGKCGSYSQGDCHDPNSASVVEKLCIRKNDCAIEL 823
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
+ F D CP K L V+A C
Sbjct: 824 AEKNFSKDLCPSTTKKLAVEAVC 846
>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
[Brachypodium distachyon]
Length = 852
Score = 612 bits (1577), Expect = e-172, Method: Compositional matrix adjust.
Identities = 338/803 (42%), Positives = 459/803 (57%), Gaps = 68/803 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV++TYVFW++HE QYDF GR D++RF+K GLYV LRIG
Sbjct: 59 MWPGLMQKAKDGGLDVVETYVFWDIHETATXQYDFEGRKDLVRFVKAAADTGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 119 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 179 LSQIENEYGNIDSAYGAAGKSYIRWAAGMAVALDTGVPWVMCQQADAPDPLINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP +WTE+W+ ++ +GG R +D+AF VA F + G+ NYYMYH
Sbjct: 239 DQFT--PNSNSKPKLWTENWSGWFLSFGGAVPYRPTEDLAFAVARFYQRGGTLQNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR++ I+ YD AP+DEYGLVR+PKWGHLK++H AIK C L+ + +
Sbjct: 297 GGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQPKWGHLKDVHKAIKQCEPALIATDPSYM 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+GQ EA V++ S VCAAFL N D + TV F +Y+LP S+SILPDCK V NT
Sbjct: 357 SMGQNAEAHVYKAGS-VCAAFLANMDTQSDKTVTFNGNAYKLPAWSVSILPDCKNVVLNT 415
Query: 329 ERVSTQYNK---RSKTSNLKFDSDEK---------WEEYREAILNFDNTLLRAEGLLDQI 376
++++Q RS S+ K W E + L GL++QI
Sbjct: 416 AQINSQTTTSEMRSLGSSTKASDGSSIETELALSGWSYAIEPVGITTENALTKPGLMEQI 475
Query: 377 SAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNV 431
+ DASD+ WY+ + +Q+ L V S GH+L A++NG++ GSA GS +
Sbjct: 476 NTTADASDFLWYSTSVVVKGGEPYLNGSQSNLLVNSLGHVLQAYINGKFAGSAKGSATSS 535
Query: 432 SFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SFTNCSW 486
+L+ + L G N LLS TVGL + GAF + AG+ V++ ++ W
Sbjct: 536 LISLQTPITLVPGKNKIDLLSGTVGLSNYGAFFDLVGAGITGPVKLSGPKGVLDLSSTDW 595
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGK 545
YQVGL GE L +Y+ + S PT Q L WYK+ F PAG+DP+A++ MGK
Sbjct: 596 TYQVGLRGEGLHLYNPSEASPEWVSDKAYPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGK 655
Query: 546 GEAWVNGQSIGRYW-VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKP 603
GEAWVNGQSIGRYW + G + Y +S + + T YHVPR+FL+P
Sbjct: 656 GEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQP 715
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
N +VL E+ G+P I+ T VC HV+ H + SW+ +Q+ +++ G
Sbjct: 716 GSNDIVLFEQFGGDPSKISFTTKQTASVCAHVSEDHPDQIDSWISPQQK----VQRSG-- 769
Query: 664 PTVQPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
P ++ CP G+ IS I FASFG P G C Y G C S + V + ACIG S CS+P+
Sbjct: 770 PALRLECPKAGQVISSIKFASFGTPSGTCGNYNHGECSSPQALAVAQEACIGVSSCSVPV 829
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
++ F GDPC G+ K+L+V+A C
Sbjct: 830 STKNF-GDPCTGVTKSLVVEAAC 851
>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
Length = 827
Score = 612 bits (1577), Expect = e-172, Method: Compositional matrix adjust.
Identities = 339/796 (42%), Positives = 451/796 (56%), Gaps = 71/796 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQK-GQYDFSGRNDIIRFIKEIQSQGLYVCLRI 59
MWP L+ AKEGG+DVI+TYVFWN+H+P +Y F GR D+++FI +Q G+Y+ LRI
Sbjct: 51 MWPELVKTAKEGGVDVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRI 110
Query: 60 GPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY---------------------------- 91
GPF+ +EW +GG+P+WLH V G VFR+DN +
Sbjct: 111 GPFVAAEWNFGGIPVWLHYVNGTVFRTDNYNFKYYMEEFTTYIVKLMKKEKLFASQGGPI 170
Query: 92 -----KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG 146
K+ENEY E A+ E G Y WAA+MAV +TGVPW+MC+Q DAP VIN CN
Sbjct: 171 ILSQAKVENEYGYYEGAYGEGGKRYAAWAAQMAVSQNTGVPWIMCQQFDAPPSVINTCNS 230
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + FK P P+KP IWTE+W ++Q +G R A+D+AF VA F K GS NYY
Sbjct: 231 FYC-DQFK-PIFPDKPKIWTENWPGWFQTFGAPNPHRPAEDVAFSVARFFQKGGSVQNYY 288
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRTA IT YD +AP+DEYGL R PKWGHLKELH AIKLC LL
Sbjct: 289 MYHGGTNFGRTAGGPFITTSYDYEAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLNSKP 348
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
+SLG QEA V+ + SG C AFL N D++ TV F+N+SY+LP S+SILPDCK V
Sbjct: 349 VNLSLGPSQEADVYADASGGCVAFLANIDDKNDKTVDFQNVSYKLPAWSVSILPDCKNVV 408
Query: 326 FNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDY 385
+NT + ++ + L KWE + E + G +D I+ KD +DY
Sbjct: 409 YNTAK------QKDGSKAL------KWEVFVEKAGIWGEPDFMKNGFVDHINTTKDTTDY 456
Query: 386 FWYTFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WYT + L ++S GH LHAFVN E GSA G+ + F +N +
Sbjct: 457 LWYTTSIVVGENEEFLKEGRHPVLLIESMGHALHAFVNQELQGSASGNGSHSPFKFKNPI 516
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIG 494
L+ G N+ ALLS+TVGLP++G+F E AG+ VR++ ++ +W Y++GL G
Sbjct: 517 SLKAGNNEIALLSMTVGLPNAGSFYEWVGAGLTSVRIEGFNNGTVDLSHFNWIYKIGLQG 576
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
EKL IY G+N V W + P ++ LTWYK PAGN+P+ L++ MGKG AW+NG
Sbjct: 577 EKLGIYKPEGVNSVSWVATSEPPKKQPLTWYKVVLDPPAGNEPVGLDMLHMGKGLAWLNG 636
Query: 553 QSIGRYWVSFKTSKGNPSQTQ--YAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
+ IGRYW K+S T+ Y + F + T YHVPR++ KP+GNLLV
Sbjct: 637 EEIGRYWPR-KSSVHEKCVTECDYRGKFMPDKCFTGCGQPTQRWYHVPRSWFKPSGNLLV 695
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS 669
+ EE+ G+P IT + +C + + P + ++ G K K +V
Sbjct: 696 IFEEKGGDPEKITFSRRKMSSICALIAEDY--PSADRKSLQEAGS---KNSNSKASVHLG 750
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
CP IS + FASFG P G C Y+ G CH +S VVE+AC+ K+ C+I L F
Sbjct: 751 CPQNAVISAVKFASFGTPTGKCGSYSEGECHDPNSISVVEKACLNKTECTIELTEENFNK 810
Query: 730 DPCPGIHKALLVDAQC 745
CP + L V+A C
Sbjct: 811 GLCPDFTRRLAVEAVC 826
>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
Length = 852
Score = 611 bits (1576), Expect = e-172, Method: Compositional matrix adjust.
Identities = 348/805 (43%), Positives = 465/805 (57%), Gaps = 76/805 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP + QYDF GR D+I F+K ++ GL+V +RIG
Sbjct: 63 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVEKAGLFVHIRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 123 PYVCAEWNYGGFPLWLHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVI 182
Query: 93 ---IENEYQT--IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGM 147
IENEY IE + + PYV WAA MA +TGVPWVMC+Q DAP VIN CNG
Sbjct: 183 LSQIENEYGNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGF 242
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
C + FK NS P +WTE+WT ++ +GG R +DIAF VA F + G++ NYYM
Sbjct: 243 YC-DQFK-QNSDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYM 300
Query: 208 YHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQN 266
YHGGTNFGRT+ F+ T Y APLDEYGL+ +PKWGHLK+LH AIKLC ++ N
Sbjct: 301 YHGGTNFGRTSGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPN 360
Query: 267 VISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAF 326
+ SLG E V+ +T CAAFL N + V F SY LP S+SILPDCK VAF
Sbjct: 361 ITSLGSNIEVSVY-KTDSQCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAF 419
Query: 327 NTERVS-----TQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAK 380
+T +++ + + RS ++ S W E + ++ +N R GLL+QI+
Sbjct: 420 STAKINSASTISTFVTRSSEADASGGSLSGWTSVNEPVGISNENAFTRM-GLLEQINTTA 478
Query: 381 DASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFT 434
D SDY WY+ + + + L V++ GH+LHA++NG+ +GS G+ + +FT
Sbjct: 479 DKSDYLWYSLSVNIKNDEPFLQDGSATVLHVKTLGHVLHAYINGKLSGSGKGNSRHSNFT 538
Query: 435 LRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------W 486
+ V L G N LLS TVGL + GAF + K AG+ VQ K F N S W
Sbjct: 539 IEVPVTLVPGENKIDLLSATVGLQNYGAFFDLKGAGITG-PVQLKGFKNGSTTDLSSKQW 597
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMG 544
YQVGL GE L + SN G LW S + PT Q L WYK +F APAG+ P++++ MG
Sbjct: 598 TYQVGLKGEDLGL-SNGG--STLWKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMG 654
Query: 545 KGEAWVNGQSIGRYWVSFKTSKG---NPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
KGEAWVNGQSIGR+W ++ +P + N + C + YHVPR++L
Sbjct: 655 KGEAWVNGQSIGRFWPAYIAPNDGCTDPCNYRGGYNAEKCLKNCG-KPSQLLYHVPRSWL 713
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
K +GN+LVL EE G+P ++ T I+ VC ++++H P+ W D KK G
Sbjct: 714 KSSGNVLVLFEEMGGDPTKLSFATREIQSVCSRISDAHPLPIDMWASE----DDARKKSG 769
Query: 662 KKPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSI 720
PT+ CP + IS I FASFG P G C + G C SS++ +V++ACIG CS+
Sbjct: 770 --PTLSLECPHPNQVISSIKFASFGTPQGTCGSFIHGRCSSSNALSIVKKACIGSKSCSL 827
Query: 721 PLLSRYFGGDPCPGIHKALLVDAQC 745
+ F GDPC G+ K+L V+A C
Sbjct: 828 GVSINAF-GDPCKGVAKSLAVEASC 851
>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 837
Score = 611 bits (1576), Expect = e-172, Method: Compositional matrix adjust.
Identities = 315/789 (39%), Positives = 448/789 (56%), Gaps = 68/789 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP LI +AKEGGL+ I+TY+FWN HEP+ G+Y+F GR D+I+++K IQ +Y +RIG
Sbjct: 66 VWPKLIERAKEGGLNTIETYIFWNAHEPEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N PYK
Sbjct: 126 PFIQAEWNHGGLPYWLREIDHIIFRANNDPYKKEMEKFVRFIVQKLKDAELFASQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ G Y+ WAA+MA+ TGVPW+MCKQ APG VI CNG C
Sbjct: 186 LTQIENEYGNIKKDHATDGDKYLEWAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ NKP +WTE+WT ++ +G + +RSA+DIA+ V F AK GS VNYYMYH
Sbjct: 246 GDTWT-LRDKNKPMLWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYH 304
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+AP+DEYG+ +EPK+GHL++LH I+ + L G +
Sbjct: 305 GGTNFGRTGASYVLTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEI 364
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +FE +C +FL NN+ + TV+FR + +P +S+SIL CK V +NT
Sbjct: 365 LGHGYEAHIFELPEENLCLSFLSNNNTGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNT 424
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+N+RS ++ + +WE Y E I + +T +R + L+Q + KDASDY WY
Sbjct: 425 KRVFVQHNERSYHTSEVTSKNNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWY 484
Query: 389 TFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F S ++ + L V+S H + F N + G A GS F V L+
Sbjct: 485 TTSFRLESDDLPFRNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLK 544
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-----WGYQVGLIGEKL 497
G N LLS T+G+ DSG L +G+ +Q + WG++ L GE
Sbjct: 545 VGVNHVVLLSSTMGMKDSGGELAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDK 604
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+IYS G+ KV W + R TWYK F P G+DP+ L++ SM KG +VNG+ +GR
Sbjct: 605 EIYSEKGVGKVQWKPAEN-GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGR 663
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YWVS++T G PSQ YH+PR FLK NLLV+ EEE G
Sbjct: 664 YWVSYRTLAGTPSQA--------------------LYHIPRPFLKSKDNLLVVFEEEMGK 703
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKIS 677
P GI V T+ +C ++ + + +W + + ++ T+ CP K I
Sbjct: 704 PDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLM--CPPEKTIQ 761
Query: 678 KIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGIH 736
++VFASFGNP+G C + VG+CH+ +++ +VE+ C+GK C +P+ +G D C
Sbjct: 762 EVVFASFGNPEGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTT 821
Query: 737 KALLVDAQC 745
L V +C
Sbjct: 822 ATLGVQVRC 830
>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
Length = 807
Score = 611 bits (1575), Expect = e-172, Method: Compositional matrix adjust.
Identities = 311/761 (40%), Positives = 449/761 (59%), Gaps = 41/761 (5%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AK GGL+ I+TYVFWN HEP+ G+Y F GR D+IRF+ I+ +Y +RIG
Sbjct: 66 MWDKLVKTAKMGGLNTIETYVFWNGHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKMAV 120
PFI++EW +GGLP WL ++ I+FR++N+P+KIENEY I+ +G Y+ WAA+MA+
Sbjct: 126 PFIQAEWNHGGLPYWLREIGHIIFRANNEPFKIENEYGNIKKDRKVEGDKYLEWAAEMAI 185
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
GVPWVMCKQ APG VI CNG CG+T+ + NKP +WTE+WT+ ++ +G +
Sbjct: 186 STGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDK-NKPRLWTENWTAQFRTFGDQL 244
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVRE 240
RSA+DIA+ V F AK G+ VNYYMYHGGTNFGRT A++++TGYYD+AP+DEYG+ +E
Sbjct: 245 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEYGMCKE 304
Query: 241 PKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFE-ETSGVCAAFLVNNDERKAV 299
PK+GHL++LH IK + L G Q+ LG EA +E +C +FL NN+ +
Sbjct: 305 PKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGEDG 364
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAI 359
TV+FR + +P +S+SIL DCKTV +NT+RV Q+++RS + + + WE Y EAI
Sbjct: 365 TVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNVWEMYSEAI 424
Query: 360 LNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS------NAQAPLDVQSHGHILH 413
F T +R + L+Q + KD SDY WYT F S + + + ++S H +
Sbjct: 425 PKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIKSTAHAMI 484
Query: 414 AFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHR 473
F N + G+ GS SF + LR G N A+LS ++G+ DSG L G+
Sbjct: 485 GFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVEVKGGIQD 544
Query: 474 VRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFR 528
VQ + G++ L GE +IY+ G+ + W + +TWYK F
Sbjct: 545 CVVQGLNTGTLDLQGNGRGHKARLEGEDKEIYTEKGMAQFQWKPAENDL-PITWYKRYFD 603
Query: 529 APAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAII 588
P G+DPI +++ SM KG +VNG+ IGRYW SF T G+PSQ+
Sbjct: 604 EPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLAGHPSQS---------------- 647
Query: 589 KATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLR 648
YH+PRAFLKP GNLL++ EEE G P GI + T+ +C ++ + + +W
Sbjct: 648 ----VYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFISEHNPAQIKTW-- 701
Query: 649 HRQRGDTDIKKFGKKPTVQPS--CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQG 706
+ IK + + + + CP + I ++VFASFGNP+G C + G+CH+ ++
Sbjct: 702 --ESDGGQIKLIAEDTSTRGTLNCPPQRTIQEVVFASFGNPEGACGNFTAGTCHTPDAKA 759
Query: 707 VVERACIGKSRCSIPLLSRYFGGD-PCPGIHKALLVDAQCR 746
VVE+ C+GK C +P+++ +G D CP L V +C+
Sbjct: 760 VVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRCK 800
>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 852
Score = 610 bits (1573), Expect = e-172, Method: Compositional matrix adjust.
Identities = 349/805 (43%), Positives = 463/805 (57%), Gaps = 76/805 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP + QYDF GR D+I F+K ++ GL+V +RIG
Sbjct: 63 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLINFVKLVERAGLFVHIRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 123 PYVCAEWNYGGFPLWLHFIPGIEFRTDNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVI 182
Query: 93 ---IENEYQT--IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGM 147
IENEY IE + + PYV WAA MA +TGVPWVMC+Q DAP VIN CNG
Sbjct: 183 LSQIENEYGNGDIESRYGPRAKPYVNWAASMATSLNTGVPWVMCQQPDAPPSVINTCNGF 242
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
C + FK NS P +WTE+WT ++ +GG R +DIAF VA F + G++ NYYM
Sbjct: 243 YC-DQFK-QNSDKTPKMWTENWTGWFLSFGGPVPYRPVEDIAFAVARFFQRGGTFQNYYM 300
Query: 208 YHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQN 266
YHGGTNFGRT+ F+ T Y APLDEYGL+ +PKWGHLK+LH AIKLC ++ N
Sbjct: 301 YHGGTNFGRTSGGPFIATSYDYDAPLDEYGLINQPKWGHLKDLHKAIKLCEAAMVATEPN 360
Query: 267 VISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAF 326
V SLG E V+ +T CAAFL N + V F SY LP S+SILPDCK VAF
Sbjct: 361 VTSLGSNIEVSVY-KTDSQCAAFLANTATQSDAAVSFNGNSYHLPPWSVSILPDCKNVAF 419
Query: 327 NTERVS-----TQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAK 380
+T +++ + + RS ++ S W E + ++ +N R GLL+QI+
Sbjct: 420 STAKINSASTISTFVTRSSEADASGGSLSGWTSVNEPVGISNENAFTRM-GLLEQINTTA 478
Query: 381 DASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFT 434
D SDY WY+ + + + L V++ GH+LHA++NG +GS G+ + +FT
Sbjct: 479 DKSDYLWYSLSVNIKNDEPFLQDGSATVLHVKTLGHVLHAYINGRLSGSGKGNSRHSNFT 538
Query: 435 LRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------W 486
+ V L G N LLS TVGL + GAF + K AG+ VQ K F N S W
Sbjct: 539 IEVPVTLVPGENKIDLLSATVGLQNYGAFFDLKGAGITG-PVQLKGFKNGSTTDLSSKQW 597
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMG 544
YQVGL GE L + SN G LW S + PT Q L WYK +F APAG+ P++++ MG
Sbjct: 598 TYQVGLKGEDLGL-SNGG--STLWKSQTALPTNQPLIWYKASFDAPAGDTPLSMDFTGMG 654
Query: 545 KGEAWVNGQSIGRYWVSFKTSKG---NPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
KGEAWVNGQSIGR+W ++ +P + N + C + YHVPR++L
Sbjct: 655 KGEAWVNGQSIGRFWPAYIAPNDGCTDPCNYRGGYNAEKCLKNCG-KPSQLLYHVPRSWL 713
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
K +GN+LVL EE G+P ++ T I+ VC +++H P+ W D KK G
Sbjct: 714 KSSGNVLVLFEEMGGDPTKLSFATREIQSVCSRTSDAHPLPIDMWASE----DDARKKSG 769
Query: 662 KKPTVQPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSI 720
PT+ CP + IS I FASFG P G C + G C SS++ +V++ACIG CS+
Sbjct: 770 --PTLSLECPHPNQVISSIKFASFGTPQGTCGSFIHGRCSSSNALSIVKKACIGSKSCSL 827
Query: 721 PLLSRYFGGDPCPGIHKALLVDAQC 745
+ F GDPC G+ K+L V+A C
Sbjct: 828 GVSINAF-GDPCKGVAKSLAVEASC 851
>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 853
Score = 610 bits (1573), Expect = e-172, Method: Compositional matrix adjust.
Identities = 340/814 (41%), Positives = 464/814 (57%), Gaps = 90/814 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV++TYVFW++HEP +GQYDF GRND++RF+K GLYV LRIG
Sbjct: 60 MWPGLMQKAKDGGLDVVETYVFWDVHEPVRGQYDFEGRNDLVRFVKAAADAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI R+DN+P+K
Sbjct: 120 PYVCAEWNYGGFPLWLHFIPGIKLRTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I ++ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 180 LSQIENEYGNIAASYGAAGKSYIRWAAGMAVALDTGVPWVMCQQTDAPEPLINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P+ P++P +WTE+W+ ++ +GG R +D+AF VA F + G+ NYYMYH
Sbjct: 240 DQFT--PSLPSRPKLWTENWSGWFLSFGGAVPYRPTEDLAFAVARFYQRGGTLQNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR++ I+ YD AP+DEYGLVR+PKWGHL+++H AIK+C L+ + +
Sbjct: 298 GGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKMCEPALIATDPSYM 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ EA V++ S +CAAFL N D++ TV F +Y+LP S+SILPDCK V NT
Sbjct: 358 SLGQNAEAHVYKSGS-LCAAFLANIDDQSDKTVTFNGKAYKLPAWSVSILPDCKNVVLNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSD-------------EKWEEYREAILNFDNTLLRAEGLLDQ 375
++++Q ++ NL F + W E + L GL++Q
Sbjct: 417 AQINSQV-ASTQMRNLGFSTQASDGSSVEAELAASSWSYAVEPVGITKENALTKPGLMEQ 475
Query: 376 ISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDN 430
I+ DASD+ WY+ + +Q+ L V S GH+L F+NG+ GS+ GS +
Sbjct: 476 INTTADASDFLWYSTSIVVAGGEPYLNGSQSNLLVNSLGHVLQVFINGKLAGSSKGSASS 535
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SFTNCS 485
+L V L G N LLS TVGL + GAF + AG+ V++ ++
Sbjct: 536 SLISLTTPVTLVTGKNKIDLLSATVGLTNYGAFFDLVGAGITGPVKLTGPKGTLDLSSAE 595
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSM 543
W YQ+GL GE L +Y N W S S PT LTWYK+ F APAG+DP+A++ M
Sbjct: 596 WTYQIGLRGEDLHLY-NPSEASPEWVSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGM 654
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------- 593
GKGEAWVNGQSIGRYW P+ V S ++ AT
Sbjct: 655 GKGEAWVNGQSIGRYW---------PTNIAPQSGCVNSCNYRGSYSATKCLKKCGQPSQI 705
Query: 594 -YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQR 652
YHVPR+FL+P N +VL E+ GNP I+ T VC HV+ H + SW+ +Q+
Sbjct: 706 LYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQTESVCAHVSEDHPDQIDSWVSSQQK 765
Query: 653 GDTDIKKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERA 711
+++ G P ++ CP G+ IS I FASFG P G C Y+ G C SS + V + A
Sbjct: 766 ----LQRSG--PALRLECPKEGQVISSIKFASFGTPSGTCGSYSHGECSSSQALAVAQEA 819
Query: 712 CIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C+G S CS+P+ ++ F GDPC G+ K+L+V+A C
Sbjct: 820 CVGVSSCSVPVSAKNF-GDPCRGVTKSLVVEAAC 852
>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
Length = 861
Score = 609 bits (1571), Expect = e-171, Method: Compositional matrix adjust.
Identities = 345/809 (42%), Positives = 465/809 (57%), Gaps = 75/809 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQ---YDFSGRNDIIRFIKEIQSQGLYVCL 57
MWP LI K+K+GGLDVI+TYVFW++HEP +GQ YDF GR D++RF+K + GLYV L
Sbjct: 63 MWPGLIQKSKDGGLDVIETYVFWDIHEPVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHL 122
Query: 58 RIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------------------------- 92
RIGP++ +EW YGG P+WLH V GI FR+DN+ +K
Sbjct: 123 RIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGG 182
Query: 93 ------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG 146
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG
Sbjct: 183 PIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNG 242
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + PNS +KP +WTE+W+ ++ +GG R A+D+AF VA F + G++ NYY
Sbjct: 243 FYCDQFT--PNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYY 300
Query: 207 MYHGGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGR T F+ T Y AP+DEYG+VR+PKWGHL+++H AIKLC L+
Sbjct: 301 MYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEP 360
Query: 266 NVISLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ SLGQ EA V++ + +CAAFL N D + V F +Y+LP S+SILPDCK V
Sbjct: 361 SYSSLGQNTEATVYQTADNSICAAFLANVDAQSDKAVKFNGNTYKLPAWSVSILPDCKNV 420
Query: 325 AFNTERVSTQYNK---RSKTSNLKFDSDEK----------WEEYREAILNFDNTLLRAEG 371
NT ++++Q RS S+++ D+D+ W E + L G
Sbjct: 421 VLNTAQINSQVTTSEMRSLGSSIQ-DTDDSLITPELATAGWSYAIEPVGITKENALTKPG 479
Query: 372 LLDQISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHG 426
L++QI+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA G
Sbjct: 480 LMEQINTTADASDFLWYSTSIVVKGDEPYLNGSQSNLLVNSLGHVLQVYINGKLAGSAKG 539
Query: 427 SHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SF 481
S + +L+ V L G N LLS TVGL + GAF + AGV V++ +
Sbjct: 540 SASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLIGAGVTGPVKLSGPNGALNL 599
Query: 482 TNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNL 540
++ W YQ+GL GE L +Y+ + S PT Q L WYKT F APAG+DP+A++
Sbjct: 600 SSTDWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDF 659
Query: 541 QSMGKGEAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVP 597
MGKGEAWVNGQSIGRYW + G + Y A ++ + C T YHVP
Sbjct: 660 TGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQT-LYHVP 718
Query: 598 RAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
R+FL+P N LVL E+ G+P I+ T +C HV+ H + SW+ +Q T
Sbjct: 719 RSFLQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQT-- 776
Query: 658 KKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKS 716
P ++ CP G+ IS I FASFG P G C Y G C SS + VV+ AC+G +
Sbjct: 777 ----PGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMT 832
Query: 717 RCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
CS+P+ S F GDPC G+ K+L+V+A C
Sbjct: 833 NCSVPVSSNNF-GDPCSGVTKSLVVEAAC 860
>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
Length = 861
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 345/809 (42%), Positives = 466/809 (57%), Gaps = 75/809 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQ---YDFSGRNDIIRFIKEIQSQGLYVCL 57
MWP LI K+K+GGLDVI+TYVFW++HE +GQ YDF GR D++RF+K + GLYV L
Sbjct: 63 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHL 122
Query: 58 RIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------------------------- 92
RIGP++ +EW YGG P+WLH V GI FR+DN+ +K
Sbjct: 123 RIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGG 182
Query: 93 ------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG 146
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG
Sbjct: 183 PIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNG 242
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + PNS +KP +WTE+W+ ++ +GG R A+D+AF VA F + G++ NYY
Sbjct: 243 FYCDQFT--PNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYY 300
Query: 207 MYHGGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGR T F+ T Y AP+DEYG+VR+PKWGHL+++H AIKLC L+
Sbjct: 301 MYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEP 360
Query: 266 NVISLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ SLGQ EA V++ + +CAAFL N D + TV F +Y+LP S+SILPDCK V
Sbjct: 361 SYSSLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNV 420
Query: 325 AFNTERVSTQYNK---RSKTSNLKFDSDEK----------WEEYREAILNFDNTLLRAEG 371
NT ++++Q RS S+++ D+D+ W E + L G
Sbjct: 421 VLNTAQINSQVTTSEMRSLGSSIQ-DTDDSLITPELATAGWSYAIEPVGITKENALTKPG 479
Query: 372 LLDQISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHG 426
L++QI+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA G
Sbjct: 480 LMEQINTTADASDFLWYSTSIVVKGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKG 539
Query: 427 SHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SF 481
S + +L+ V L G N LLS TVGL + GAF + AGV V++ +
Sbjct: 540 SASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNL 599
Query: 482 TNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNL 540
++ W YQ+GL GE L +Y+ + S PT Q L WYKT F APAG+DP+A++
Sbjct: 600 SSTDWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDF 659
Query: 541 QSMGKGEAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVP 597
MGKGEAWVNGQSIGRYW + G + Y A ++ + C T YHVP
Sbjct: 660 TGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQT-LYHVP 718
Query: 598 RAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
R+FL+P N LVL E+ G+P I+ T +C HV+ H + SW+ +Q T
Sbjct: 719 RSFLQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQT-- 776
Query: 658 KKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKS 716
+ P ++ CP G+ IS I FASFG P G C Y G C SS + VV+ AC+G +
Sbjct: 777 ----QGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMT 832
Query: 717 RCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
CS+P+ S F GDPC G+ K+L+V+A C
Sbjct: 833 NCSVPVSSNNF-GDPCSGVTKSLVVEAAC 860
>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 916
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 355/837 (42%), Positives = 467/837 (55%), Gaps = 118/837 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I AK+GG DV+QTYVFWN HEP++GQY+F GR D+++FIK ++ GLY LRIG
Sbjct: 62 MWPSIIQHAKDGGADVVQTYVFWNGHEPEQGQYNFEGRYDLVKFIKLVKQAGLYFHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P WL ++ GIVFR+DN+P+K
Sbjct: 122 PYVCAEWNFGGFPYWLKEIPGIVFRTDNEPFKVAMQGFTSKIVNLMKENELFSWQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE F + G YV WAA MA+ T VPW+MCKQ+DAP +IN CNG C
Sbjct: 182 MAQIENEYGDIESQFGDGGKRYVQWAADMALSLDTRVPWIMCKQEDAPANIINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ +K PN+ KP +WTEDW ++Q WG R +D AF VA F + GS+ NYYMY
Sbjct: 242 -DGWK-PNTALKPILWTEDWNGWFQNWGQAAPHRPVEDNAFAVARFFQRGGSFQNYYMYF 299
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA FM T Y AP+DEYGL+R+PKWGHLK+LHAAIKLC P LT V
Sbjct: 300 GGTNFARTAGGPFMTTTYDYDAPIDEYGLIRQPKWGHLKDLHAAIKLC-EPALTAVDTVP 358
Query: 269 S---LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
+G QEA + +G CAAFL N D +VTV F+ SY LP S+SILPDCK VA
Sbjct: 359 QSTWIGSNQEAHEY-SANGHCAAFLANIDSENSVTVQFQGESYVLPAWSVSILPDCKNVA 417
Query: 326 FNTERVSTQYN---KRSKTSNLKFD-------------------SDEKWEEYREAILNFD 363
FNT ++ Q R SN + D ++ KW+ E
Sbjct: 418 FNTAQIGAQTTVTRMRIAPSNSRGDIFLPSNTLVHDHISDGGVFANLKWQASAEPFGIRG 477
Query: 364 NTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS-------SNAQAPLDVQSHGHILHAFV 416
+ + LL+Q++ KD SDY WY+ S S +A L + + +H FV
Sbjct: 478 SGTTVSNSLLEQLNITKDTSDYLWYSTSITITSEGVTSDVSGTEANLVLGTMRDAVHIFV 537
Query: 417 NGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVR 475
NG+ GSA G + V + L+ G N LLS+T+GL + GA+LE AG+ V
Sbjct: 538 NGKLAGSAMGWNIQVV----QPITLKDGKNSIDLLSMTLGLQNYGAYLETWGAGIRGSVS 593
Query: 476 VQDKSFTNCS-----WGYQVGLIGEKLQIYSNLGLNKVLW-SSIRSPTRQLTWYKTTFRA 529
V + N S W YQVGL GE+L+++ N + W SS + LTWYKTTF A
Sbjct: 594 VTGLPYGNLSLSTAEWSYQVGLRGEELKLFHNGTADGFSWDSSSFTNASYLTWYKTTFDA 653
Query: 530 PAGNDPIALNLQSMGKGEAWVNGQSIGRYWV---------------SFKTSK-----GNP 569
P G DP+AL+L SMGKG+AW+NG +GRY++ ++ T+K G P
Sbjct: 654 PGGTDPVALDLGSMGKGQAWINGHHLGRYFLMVAPQSGCETCDYRGAYNTNKCRTNCGEP 713
Query: 570 SQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIR 629
SQ ++ V IHF YH+PRA+L+ TGNLLVL EE G+ ++V T +
Sbjct: 714 SQ-RWQV-----IHF-------QMYHIPRAWLQATGNLLVLFEEIGGDISKVSVVTRSAH 760
Query: 630 KVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDG 689
VC H+ S PP+ +W HR I F + C G+ I+KI FASFGNP G
Sbjct: 761 AVCAHINESQPPPIRTWRPHR-----SIDAFNNPAEMLLECAAGQHITKIKFASFGNPRG 815
Query: 690 DCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG-DPCPGIHKALLVDAQC 745
C + G+CH++ S V + CIGK +C IP+ ++FG DPCPG+ K+L V C
Sbjct: 816 SCGHFQHGTCHANKSMEAVRKVCIGKQQCYIPVQRKFFGSIDPCPGVSKSLAVQVHC 872
>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 609 bits (1570), Expect = e-171, Method: Compositional matrix adjust.
Identities = 350/797 (43%), Positives = 463/797 (58%), Gaps = 65/797 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP +GQYDF GR D+++F+K + + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 116 PYVCAEWNYGGFPVWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVI 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MA TGVPWVMC Q DAP P+IN NG
Sbjct: 176 LSQIENEYGNIDTAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFY- 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+ F PNS KP +WTE+W+ ++ V+GG R +D+AF VA F + G++ NYYMYH
Sbjct: 235 GDEFT-PNSNTKPKMWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF R + F+ T Y AP+DEYG++R+PKWGHLKE+H AIKLC L+ +
Sbjct: 294 GGTNFDRASGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ +T VCAAFL N + VTV F SY LP S+SILPDCK+V NT
Sbjct: 354 SLGPNLEAAVY-KTGSVCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVLNT 412
Query: 329 ERVSTQYNKRS-KTSNLKFD------SDEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++++ S T + K D S W E + GLL+QI+ D
Sbjct: 413 AKINSASAISSFTTESSKEDIGSSEASSTGWSWISEPVGISKTDSFSQTGLLEQINTTAD 472
Query: 382 ASDYFWYTFRFHYNS-SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
SDY WY+ Y + +++Q L ++S GH LHAF+NG+ GS G+ FT+ V
Sbjct: 473 KSDYLWYSLSIDYKADASSQTVLHIESLGHALHAFINGKLAGSQPGNSGKYKFTVDIPVT 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------WGYQVGL 492
L G N LLS+TVGL + GAF + G+ + K F N + W YQVGL
Sbjct: 533 LVAGKNTIDLLSLTVGLQNYGAFFDTWGVGITGPVIL-KGFANGNTLDLSSQKWTYQVGL 591
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE L + S L S+ P Q LTWYKTTF AP+G+DP+A++ MGKGEAWVN
Sbjct: 592 QGEDLGLSSGSSGQWNLQSTF--PKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVN 649
Query: 552 GQSIGRYWVSFKTSKGNPSQT-QYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
GQ IGRYW ++ S + + + Y S K + T YHVPR++LKP+GN+LV
Sbjct: 650 GQRIGRYWPTYVASDASCTDSCNYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILV 709
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS 669
L EE G+P I+ T +C HV++SH PP+ W + G +K G P + +
Sbjct: 710 LFEERGGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSETESG----RKVG--PVLSLT 763
Query: 670 CPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
CP + IS I FAS+G P G C + G C S+ + +V++ACIG S CS+ + S F
Sbjct: 764 CPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSDTF- 822
Query: 729 GDPCPGIHKALLVDAQC 745
GDPC G+ K+L V+A C
Sbjct: 823 GDPCRGMAKSLAVEATC 839
>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
Length = 842
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 345/802 (43%), Positives = 457/802 (56%), Gaps = 72/802 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWN HEP + QY+F GR D+++F+K + GLYV +RIG
Sbjct: 55 MWPGLIQKSKDGGLDVIETYVFWNGHEPVRNQYNFEGRYDLVKFVKLVAEAGLYVHIRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 115 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ AF Y+ WAA MA+ TGVPWVMC+Q DAP PVIN CNG C
Sbjct: 175 LSQIENEYGNIDSAFGPAAKTYINWAAGMAISLDTGVPWVMCQQADAPDPVINTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++Q +GG R +D+AF VA F +G++ NYYMYH
Sbjct: 235 DQFT--PNSKNKPKMWTENWSGWFQSFGGAVPYRPVEDLAFAVARFYQLSGTFQNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT I+ YD APLDEYGL+R+PKWGHLK++H AIKLC L+
Sbjct: 293 GGTNFGRTTGGPFISTSYDYDAPLDEYGLLRQPKWGHLKDVHKAIKLCEEALIATDPTTT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ +T +CAAFL N TV F SY LP S+SILPDCK VA NT
Sbjct: 353 SLGSNLEATVY-KTGSLCAAFLANIATTDK-TVTFNGNSYNLPAWSVSILPDCKNVALNT 410
Query: 329 ERVST-----QYNKRSKTSNLKFDSDEK----WEEYREAILNFDNTLLRAEGLLDQISAA 379
++++ + ++S ++ DS + W E + N GLL+QI+
Sbjct: 411 AKINSVTIVPSFARQSLVGDV--DSSKAIGSGWSWINEPVGISKNDAFVKSGLLEQINTT 468
Query: 380 KDASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
D SDY WY+ + + +Q L V+S GH LHAF+NG+ GS G N
Sbjct: 469 ADKSDYLWYSLSTNIKGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGKSSNAKV 528
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH---RVRVQDKSFTNCS---WG 487
T+ + L G N LLS+TVGL + GAF E AG+ +++ Q+ + + S W
Sbjct: 529 TVDIPITLTPGKNTIDLLSLTVGLQNYGAFYELTGAGITGPVKLKAQNGNTVDLSSQQWT 588
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKG 546
YQ+GL GE I S V S P Q L WYKT+F APAGNDP+A++ MGKG
Sbjct: 589 YQIGLKGEDSGISSGSSSEWV--SQPTLPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKG 646
Query: 547 EAWVNGQSIGRYW-VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPT 604
EAWVNGQSIGRYW + S G Y ++ K + T YH+PR+++K +
Sbjct: 647 EAWVNGQSIGRYWPTNVSPSSGCADSCNYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSS 706
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
GN+LVLLEE G+P I T + +C HV+ SH P+ W + G K+ G P
Sbjct: 707 GNILVLLEEIGGDPTQIAFATRQVGSLCSHVSESHPQPVDMWNTDSEGG----KRSG--P 760
Query: 665 TVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLL 723
+ CP K IS I FASFG P G C Y+ G C S+ + +V++AC+G C++ +
Sbjct: 761 VLSLQCPHPDKVISSIKFASFGTPHGSCGSYSHGKCSSTSALSIVQKACVGSKSCNVGVS 820
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
F GDPC G+ K+L V+A C
Sbjct: 821 INTF-GDPCRGVKKSLAVEASC 841
>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 830
Score = 608 bits (1569), Expect = e-171, Method: Compositional matrix adjust.
Identities = 347/791 (43%), Positives = 458/791 (57%), Gaps = 63/791 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNL+EP +GQYDF GR D+++F+K + + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGLDVIETYVFWNLNEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 116 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKEENLYASQGGPVI 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MA TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 176 LSQIENEYGNIDSAYGAAGKSYIKWAATMATSLDTGVPWVMCQQADAPDPIINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 236 DQF--TPNSNTKPKMWTENWSGWFLPFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ F+ T Y AP+DEYG++R+PKWGHLKE+H AIKLC L+ +
Sbjct: 294 GGTNFDRTSGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ +T VCAAFL N D + VTV F SY LP S+SILPDCK V NT
Sbjct: 354 SLGPNLEAAVY-KTGSVCAAFLANVDTKSDVTVNFSGNSYHLPAWSVSILPDCKNVVLNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+V + + S W E + GLL+QI+ D SDY WY
Sbjct: 413 AKVCL---TNFISMFMWLPSSTGWSWISEPVGISKADSFPQTGLLEQINTTADKSDYLWY 469
Query: 389 TFRFHYN-SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTND 447
+ Y + +Q L ++S GH LHAF+NG+ GS G+ FT+ V L G N
Sbjct: 470 SLSIDYKGDAGSQTVLHIESLGHALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNT 529
Query: 448 GALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------WGYQVGLIGEKLQI 499
LLS+TVGL + GAF + AG+ + K N + W YQVGL GE L +
Sbjct: 530 IDLLSLTVGLQNYGAFFDTWGAGITGPVIL-KGLANGNTLDLSYQKWTYQVGLKGEDLGL 588
Query: 500 YSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
S + W+S + P Q L WYKTTF AP+G+DP+A++ MGKGEAWVNGQSIGR
Sbjct: 589 SSG---SSGQWNSQSTFPKNQPLIWYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGR 645
Query: 558 YWVSFKTSK-GNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEEN 615
YW ++ S G Y S K + T YHVPR++LKP+GN+LVL EE+
Sbjct: 646 YWPTYVASDAGCTDSCNYRGPYSASKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKG 705
Query: 616 GNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKK 675
G+P I+ T +C HV++SH PP+ W + G +K G P + +CP +
Sbjct: 706 GDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSDTESG----RKVG--PVLSLTCPHDNQ 759
Query: 676 -ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPG 734
IS I FAS+G P G C + G C S+ + +V++ACIG S CS+ + S F G+PC G
Sbjct: 760 VISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSETF-GNPCRG 818
Query: 735 IHKALLVDAQC 745
+ K+L V+A C
Sbjct: 819 VAKSLAVEATC 829
>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
Length = 866
Score = 607 bits (1565), Expect = e-171, Method: Compositional matrix adjust.
Identities = 347/822 (42%), Positives = 457/822 (55%), Gaps = 91/822 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP KGQYDF GR D+++F+K + GLYV LRIG
Sbjct: 52 MWPDLIQKSKDGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 112 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKVEAEMKRFTAKIVDLMKQEKLYASQGGP 171
Query: 93 -----IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGM 147
IENEY I+ A+ G Y+ WAAKMA TGVPWVMC+Q+DAP +IN CNG
Sbjct: 172 IILSQIENEYGDIDSAYGSAGKSYINWAAKMATSLDTGVPWVMCQQEDAPDSIINTCNGF 231
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
C + PNS KP +WTE+W+++Y ++GG R +D+AF VA F + G++ NYYM
Sbjct: 232 YCDQF--TPNSNTKPKMWTENWSAWYLLFGGGFPHRPVEDLAFAVARFFQRGGTFQNYYM 289
Query: 208 ---------------------YHGGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGH 245
YHGGTNF R T F+ T Y AP+DEYG++R+PKWGH
Sbjct: 290 VLQPEMFFTSSIYYMVLFLRPYHGGTNFDRSTGGPFIATSYDFDAPIDEYGIIRQPKWGH 349
Query: 246 LKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRN 305
LK+LH A+KLC L+ + SLG EA V+ +T VCAAFL N D + TV F
Sbjct: 350 LKDLHKAVKLCEEALIATEPKITSLGPNLEAAVY-KTGSVCAAFLANVDTKSDKTVNFSG 408
Query: 306 ISYELPRKSISILPDCKTVAFNTERV---STQYNKRSKTSNLKFDSDE----KWEEYREA 358
SY LP S+SILPDCK V NT ++ S N +K+S S E KW E
Sbjct: 409 NSYHLPAWSVSILPDCKNVVLNTAKINSASAISNFVTKSSKEDISSLETSSSKWSWINEP 468
Query: 359 ILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS-NAQAPLDVQSHGHILHAFVN 417
+ + + GLL+QI+ D SDY WY+ +Q L ++S GH LHAFVN
Sbjct: 469 VGISKDDIFSKTGLLEQINITADRSDYLWYSLSVDLKDDLGSQTVLHIESLGHALHAFVN 528
Query: 418 GEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ 477
G+ GS G+ D + + + G N LLS+TVGL + GAF +R AG+ V
Sbjct: 529 GKLAGSHTGNKDKPKLNVDIPIKVIYGNNQIDLLSLTVGLQNYGAFFDRWGAGITG-PVT 587
Query: 478 DKSFTNCS---------WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTF 527
K N + W YQVGL GE L + S G ++ S P Q L WYKT F
Sbjct: 588 LKGLKNGNNTLDLSSQKWTYQVGLKGEDLGLSS--GSSEGWNSQSTFPKNQPLIWYKTNF 645
Query: 528 RAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQT--QYAVNTVTSIHFC 585
AP+G++P+A++ MGKGEAWVNGQSIGRYW ++ S + + + T T H
Sbjct: 646 DAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASNADCTDSCNYRGPFTQTKCHMN 705
Query: 586 AIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSS 645
+ YHVPR+FLKP GN LVL EE G+P I T + +C HV++SH P +
Sbjct: 706 CGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDPTQIAFATKQLESLCAHVSDSHPPQIDL 765
Query: 646 WLRHRQRGDTDIKKFGK-KPTVQPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHSSH 703
W + D +GK P + +CP + I I FAS+G P G C + G C S+
Sbjct: 766 W-------NQDTTSWGKVGPALLLNCPNHNQVIFSIKFASYGTPLGTCGNFYRGRCSSNK 818
Query: 704 SQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ +V++ACIG CSI + + F GDPC G+ K+L V+A C
Sbjct: 819 ALSIVKKACIGSRSCSIGVSTDTF-GDPCRGVPKSLAVEATC 859
>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 819
Score = 607 bits (1565), Expect = e-171, Method: Compositional matrix adjust.
Identities = 339/773 (43%), Positives = 435/773 (56%), Gaps = 90/773 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G Y+F GR D++RFIK +Q G++V LRIG
Sbjct: 57 MWDGLIEKAKDGGLDVIQTYVFWNGHEPTPGNYNFEGRYDLVRFIKTVQKAGMFVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN+P+K
Sbjct: 117 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVGMMKSENLFASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY F G Y+ WAAKMAV TGVPWVMCK+DDAP PVINACNG C
Sbjct: 177 LSQIENEYGPEGKEFGAAGKAYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TF PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K GS++NYYMYH
Sbjct: 237 -DTFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYGL REPK+GHLKELH A+KLC +PL++ V
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG +QEA VF +SG CAAFL N + V+F N +Y LP SISILPDCK V FNT
Sbjct: 355 TLGSMQEAHVFRSSSG-CAAFLANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
V Q N+ ++ S WE+Y E + + LL + GLL+Q++ +D SDY W
Sbjct: 414 ATVGVQTNQMQMWADGA--SSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLW 471
Query: 388 YTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S L VQS GH LH F+NG+ GSA+G+ ++ + +L
Sbjct: 472 YITSVEVDPSEKFLQGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANL 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R GTN ALLSV GLP+ G E GV H + + T +W YQVGL GE
Sbjct: 532 RAGTNKVALLSVACGLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGE 591
Query: 496 KLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
++ + S G V W S + + L WY+ F P+G++P+AL++ SMGKG+ W+NG
Sbjct: 592 QMNLNSLEGSGSVEWMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWING 651
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-----------YHVPRAFL 601
QSIGRYW T YA H+ +A YHVPR++L
Sbjct: 652 QSIGRYW------------TAYAEGDCKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWL 699
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
+PT NLLV+ EE G+ I + + VC V+ H P + +W I+ +G
Sbjct: 700 QPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKNW---------QIESYG 749
Query: 662 K----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVER 710
+ V C G+ IS I FASFG P G C + G CHS +S V+E+
Sbjct: 750 EPEFHTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINSNSVLEK 802
>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
Length = 909
Score = 607 bits (1564), Expect = e-170, Method: Compositional matrix adjust.
Identities = 345/828 (41%), Positives = 455/828 (54%), Gaps = 94/828 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG DVI+TYVFWN HEP +GQY+F GR D+++F++ S GLY LRIG
Sbjct: 77 MWPDLIAKSKEGGADVIETYVFWNGHEPVRGQYNFEGRYDLVKFVRLAASHGLYFFLRIG 136
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL D+ GI FR++N P+K
Sbjct: 137 PYACAEWNFGGFPVWLRDIPGIEFRTNNAPFKEEMKRFVSKVVNLMREERLFSWQGGPII 196
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE ++ + G Y+ WAAKMA+ GVPWVMC+Q DAP +I+ CN C
Sbjct: 197 LLQIENEYGNIENSYGKGGKEYMKWAAKMALSLGAGVPWVMCRQQDAPYDIIDTCNAYYC 256
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS NKP++WTE+W +Y WG + R +D+AF VA F + GS+ NYYMY
Sbjct: 257 -DGFK-PNSHNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVARFFQRGGSFQNYYMYF 314
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL-TGTQNV 267
GGTNFGRTA IT Y AP+DEYGL+REPKWGHLK+LHAA+KLC L+ T +
Sbjct: 315 GGTNFGRTAGGPLQITSYDYDAPIDEYGLLREPKWGHLKDLHAALKLCEPALVATDSPTY 374
Query: 268 ISLGQLQEAFVFE-------------ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKS 314
I LG QEA V++ E+S +C+AFL N DE K TV FR Y +P S
Sbjct: 375 IKLGPKQEAHVYQANVHLEGLNLSMFESSSICSAFLANIDEWKEATVTFRGQRYTIPPWS 434
Query: 315 ISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSD-----------------EKWEEYRE 357
+S+LPDC+ FNT +V Q + + S L S+ + W +E
Sbjct: 435 VSVLPDCRNTVFNTAKVRAQTSVKLVESYLPTVSNIFPAQQLRHQNDFYYISKSWMTTKE 494
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS--------NAQAPLDVQSHG 409
+ + + EG+ + ++ KD SDY WY+ R + + S + L +
Sbjct: 495 PLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSDSDILFWEENDVHPKLTIDGVR 554
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
IL F+NG+ G+ G V TL+ G ND LL+ TVGL + GAFLE+ A
Sbjct: 555 DILRVFINGQLIGNVVGHWIKVVQTLQ----FLPGYNDLTLLTQTVGLQNYGAFLEKDGA 610
Query: 470 GVHRVRVQDKSFTNCS-------WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT--RQL 520
G+ R +++ F N W YQVGL GE L+ YS N W +
Sbjct: 611 GI-RGKIKITGFENGDIDLSKSLWTYQVGLQGEFLKFYSEENENSE-WVELTPDAIPSTF 668
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNT 578
TWYKT F P G DP+AL+ +SMGKG+AWVNGQ IGRYW G Y A N+
Sbjct: 669 TWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYWTRVSPKSGCQQVCDYRGAYNS 728
Query: 579 VTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
C K T T YHVPR++LK T NLLV+LEE GNP I+V + R +C V+
Sbjct: 729 DKCSTNCG--KPTQTLYHVPRSWLKATNNLLVILEETGGNPFEISVKLHSSRIICAQVSE 786
Query: 638 SHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVG 697
S+ PPL + G+ ++ P + C G IS + FASFG P G C+ ++ G
Sbjct: 787 SNYPPLQKLVNADLIGE-EVSANNMIPELHLHCQQGHTISSVAFASFGTPGGSCQNFSRG 845
Query: 698 SCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+CH+ S +V AC GK CSI + FG DPCPG+ K L V+A+C
Sbjct: 846 NCHAPSSMSIVSEACQGKRSCSIKISDSAFGVDPCPGVVKTLSVEARC 893
>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
Length = 916
Score = 605 bits (1561), Expect = e-170, Method: Compositional matrix adjust.
Identities = 327/792 (41%), Positives = 438/792 (55%), Gaps = 55/792 (6%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AK+GG D I+TYVFWN HE G+Y F R D++RF K ++ GLY+ LRIG
Sbjct: 132 MWPKLVAEAKDGGADCIETYVFWNGHETAPGEYYFEDRFDLVRFAKVVKDAGLYLMLRIG 191
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+P+WLH + G VFR++N+P+K
Sbjct: 192 PFVAAEWNFGGVPVWLHYIPGAVFRTNNEPFKSHMKSFTTKIVDMMKRERFFASQGGHII 251
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY E A+ G Y +WAA MA+ +TGVPW+MC+Q DAP VIN CN C
Sbjct: 252 LAQIENEYGDTEQAYGADGKAYAMWAASMALAQNTGVPWIMCQQYDAPEHVINTCNSFYC 311
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK NSP KP IWTE+W ++Q +G R +D+AF VA F K GS NYY+YH
Sbjct: 312 -DQFKT-NSPTKPKIWTENWPGWFQTFGESNPHRPPEDVAFSVARFFQKGGSVQNYYVYH 369
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT IT YD AP+DEYGL R PKW HL++LH +IKLC LL G +
Sbjct: 370 GGTNFGRTTGGPFITTSYDYDAPIDEYGLTRLPKWAHLRDLHKSIKLCEHSLLYGNLTSL 429
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ + SG C AFL N D V FR+ Y+LP S+SILPDCK FNT
Sbjct: 430 SLGTKQEADVYTDHSGGCVAFLANIDPENDTVVTFRSRQYDLPAWSVSILPDCKNAVFNT 489
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q L+ ++W +RE +D G +D I+ KD++DY W
Sbjct: 490 AKVQSQTLMVDMVPETLQSTKPDRWSIFREKTGIWDKNDFIRNGFVDHINTTKDSTDYLW 549
Query: 388 YTFRFH----YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
+T F+ Y ++ + L + S GH +HAF+N E GSA+G+ SF + + L+
Sbjct: 550 HTTSFNVDRSYPTNGNRELLSIDSKGHAVHAFLNNELIGSAYGNGSKSSFNVHMPIKLKP 609
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDK-----SFTNCSWGYQVGLIGEKLQ 498
G N+ ALLS+TVGL ++G E AG+ V + ++ +W Y++GL GE
Sbjct: 610 GKNEIALLSMTVGLQNAGPHYEWVGAGLTSVNISGMKNGSIDLSSNNWAYKIGLEGEHYG 669
Query: 499 IYSNLGLNKVLWSSIRSPTR--QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
++ N WS P + LTWYK P G+DP+ +++QSMGKG AW+NG +IG
Sbjct: 670 LFKPDQGNNQRWSPQSEPPKGQPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIG 729
Query: 557 RYW--VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
RYW S + PS + YHVPR++ P+GN LV+ EE+
Sbjct: 730 RYWPRTSSSDDRCTPSCNYRGPFNPSKCRTGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQ 789
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLP-PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G+P IT KVC V+ ++ L SW + D K VQ SCP G
Sbjct: 790 GGDPTKITFSRRVATKVCSFVSENYPSIDLESWDKSISDDGKDTAK------VQLSCPKG 843
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCP 733
K IS + FASFG+P G C Y G CH S VVE+AC+ + C++ L FG D CP
Sbjct: 844 KNISSVKFASFGDPSGTCRSYQQGRCHHPSSLSVVEKACLNINSCTVSLSDEGFGKDLCP 903
Query: 734 GIHKALLVDAQC 745
G+ K L ++A C
Sbjct: 904 GVAKTLAIEADC 915
>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 839
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 336/802 (41%), Positives = 461/802 (57%), Gaps = 76/802 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW+ HEP+K +Y+F GR D+++F+K GLYV LRIG
Sbjct: 56 MWPELIQKSKDGGLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ W+A MA+ TGVPW MC+Q DAP P+IN CNG C
Sbjct: 176 LSQIENEYGNIDSAYGAAAKSYIKWSASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +G R +D+AF VA F + G++ NYYMYH
Sbjct: 236 DQF--TPNSNNKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ +I+ YD AP+DEYGL+R+PKWGHL++LH AIKLC L+ +
Sbjct: 294 GGTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ SG CAAFL N D + TV F SY LP S+SILPDCK VAFNT
Sbjct: 354 SLGSNLEAAVYKTESGSCAAFLANVDTKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSD--EKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V ++N SKT + ++ +W +E I GLL+QI+ D SDY
Sbjct: 414 AKV--KFNSISKTPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYL 471
Query: 387 WYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY+ R + ++A L ++S G +++AF+NG+ GS HG +L ++
Sbjct: 472 WYSLRTDIKGDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGKQ---KISLDIPIN 528
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-------FTNCSWGYQVGLI 493
L GTN LLSVTVGL + GAF + AG+ + + W YQVGL
Sbjct: 529 LVTGTNTIDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLK 588
Query: 494 GEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE + + ++ W S PT+Q L WYKTTF AP+G++P+A++ GKG AWVN
Sbjct: 589 GEDTGLAT---VDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVN 645
Query: 552 GQSIGRYWVSFKTSKGNPSQT-----QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
GQSIGRYW + G +++ Y N + C T YHVPR++LKP+GN
Sbjct: 646 GQSIGRYWPTSIAGNGGCTESCDYRGSYRANKC--LKNCGKPSQT-LYHVPRSWLKPSGN 702
Query: 607 LLVLLEEENGNPLGITVDTIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-KP 664
+LVL EE G+P I+ T +C V+ SH PP+ +W D+ I + +P
Sbjct: 703 ILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTS-----DSKISNRNRTRP 757
Query: 665 TVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLL 723
+ CP+ + I I FASFG P G C + G C+SS S +V++ACIG C++ +
Sbjct: 758 VLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSCNVEVS 817
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
+R F G+PC G+ K+L V+A C
Sbjct: 818 TRVF-GEPCRGVVKSLAVEASC 838
>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 845
Score = 603 bits (1555), Expect = e-169, Method: Compositional matrix adjust.
Identities = 335/796 (42%), Positives = 447/796 (56%), Gaps = 63/796 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+A+AKEGG D I+TYVFWN HE G+Y F R D+++F + ++ GL++ LRIG
Sbjct: 61 MWPKLVAEAKEGGADCIETYVFWNGHETAPGKYYFEDRFDLVQFARVVKDAGLFLMLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+P WLH + G VFR++N+P+K
Sbjct: 121 PFVAAEWNFGGVPAWLHYIPGTVFRTNNEPFKSHMKSFTTKIVDMMKEQRFFASQGGHII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + A+ G Y +WA MA +TGVPW+MC+Q D P VIN CN C
Sbjct: 181 LAQIENEYGYYQQAYGAGGKAYAMWAGSMAQAQNTGVPWIMCQQYDVPDRVINTCNSFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNSP +P IWTE+W ++Q +G R +D+AF VA F K GS NYY+YH
Sbjct: 241 -DQFK-PNSPTQPKIWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSVQNYYVYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA IT YD AP+DEYGL R PKW HLKELH +IKLC LL G ++
Sbjct: 299 GGTNFDRTAGGPFITTSYDYDAPIDEYGLRRLPKWAHLKELHQSIKLCEHSLLFGNSTLL 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ + SG C AFL N D K V FRN Y+LP S+SILPDCK V FNT
Sbjct: 359 SLGPQQEADVYTDHSGGCVAFLANIDSEKDRVVTFRNRQYDLPAWSVSILPDCKNVVFNT 418
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFD-NTLLRAEGLLDQISAAKDASDYF 386
+V +Q L+ ++W + E I +D N +R E +D I+ KD++DY
Sbjct: 419 AKVRSQTLMVDMVPGTLQASKPDQWSIFTERIGVWDKNDFVRNE-FVDHINTTKDSTDYL 477
Query: 387 WYTFRF----HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
W+T F +Y SS L++ S GH +HAF+N GSA+G+ SF+ ++L+
Sbjct: 478 WHTTSFDVDRNYPSSGNHPVLNIDSKGHAVHAFLNNMLIGSAYGNGSESSFSAHMPINLK 537
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEKL 497
G N+ A+LS+TVGL +G + E AG+ V + ++ +W Y+VGL GE
Sbjct: 538 AGKNEIAILSMTVGLKSAGPYYEWVGAGLTSVNISGMKNGTTDLSSNNWAYKVGLEGEHY 597
Query: 498 QIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
++ + N W P + LTWYK P G+DP+ L++QSMGKG W+NG +I
Sbjct: 598 GLFKHDQGNNQRWRPQSQPPKHQPLTWYKVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAI 657
Query: 556 GRYWVSFKTSKGNP-SQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTGNLLVLL 611
GRYW +TS N T S + C + T YHVPR++ P+GN LV+
Sbjct: 658 GRYWP--RTSPTNDRCTTSCDYRGKFSPNKCRVGCGKPTQRWYHVPRSWFHPSGNTLVVF 715
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLP-PLSSWLRHRQRGDTDIKKFGK-KPTVQPS 669
EE+ G+P IT VC V+ ++ L SW D I G+ VQ S
Sbjct: 716 EEQGGDPTKITFSRRVATSVCSFVSENYPSIDLESW-------DKSISDDGRVAAKVQLS 768
Query: 670 CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
CP GK IS + FASFG+P G C Y GSCH S VVE+AC+ + C++ L FG
Sbjct: 769 CPKGKNISSVKFASFGDPSGTCRSYQQGSCHHPDSVSVVEKACMNMNSCTVSLSDEGFGE 828
Query: 730 DPCPGIHKALLVDAQC 745
DPCPG+ K L ++A C
Sbjct: 829 DPCPGVTKTLAIEADC 844
>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 602 bits (1551), Expect = e-169, Method: Compositional matrix adjust.
Identities = 343/800 (42%), Positives = 448/800 (56%), Gaps = 68/800 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI K+K+GGLDVI+TYVFWN HEP + QY+F GR D+++FIK + GLY LRIG
Sbjct: 62 MWADLIQKSKDGGLDVIETYVFWNAHEPVQNQYNFEGRYDLVKFIKLVGEAGLYAHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+P+K
Sbjct: 122 PYVCAEWNYGGFPLWLHFVPGIKFRTDNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ ++ Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 182 LSQIENEYGNIDSSYGPAAKSYINWAASMAVSLDTGVPWVMCQQADAPDPIINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +GG R +D+AF VA F G++ NYYMYH
Sbjct: 242 DQF--TPNSKNKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQLGGTFQNYYMYH 299
Query: 210 GGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR T F+ T Y APLDEYGL R+PKWGHLK+LH +IKLC L+
Sbjct: 300 GGTNFGRSTGGPFISTSYDYDAPLDEYGLTRQPKWGHLKDLHKSIKLCEEALVATDPVTS 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ EA V++ +G+C+AFL N TV F SY LP S+SILPDCK VA NT
Sbjct: 360 SLGQNLEATVYKTGTGLCSAFLANFGTSDK-TVNFNGNSYNLPGWSVSILPDCKNVALNT 418
Query: 329 ERV-STQYNKRSKTSNLKFDSD------EKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++ S +L D+D W E + N GLL+QI+ D
Sbjct: 419 AKINSMTVIPNFVHQSLIGDADSADTLGSSWSWIYEPVGISKNDAFVKPGLLEQINTTAD 478
Query: 382 ASDYFWYTFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ + +Q L V+S GH LHAFVNG+ GS G+ N +
Sbjct: 479 KSDYLWYSLSTVIKDNEPFLEDGSQTVLHVESLGHALHAFVNGKLAGSGTGNAGNAKVAV 538
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQ------DKSFTNCSWGY 488
V L G N LLS+T GL + GAF E + AG+ V+++ ++ W Y
Sbjct: 539 EIPVTLLPGKNTIDLLSLTAGLQNYGAFFELEGAGITGPVKLEGLKNGTTVDLSSLQWTY 598
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGE 547
Q+GL GE+L + S N + PT+Q L WYKT+F APAGNDPIA++ MGKGE
Sbjct: 599 QIGLKGEELGLSSG---NSQWVTQPALPTKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGE 655
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGN 606
AWVNGQSIGRYW + + S Y + +S K + T YHVPR++++ +GN
Sbjct: 656 AWVNGQSIGRYWPTKVSPTSGCSNCNYRGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGN 715
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
LVL EE G+P I T +C HV+ SH P+ W + + +K G P +
Sbjct: 716 TLVLFEEIGGDPTQIAFATKQSASLCSHVSESHPLPVDMWSSNSEAE----RKAG--PVL 769
Query: 667 QPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
CP + IS I FASFG P G C ++ G C S+ + +V++ACIG CSI +
Sbjct: 770 SLECPFPNQVISSIKFASFGTPRGTCGSFSHGQCKSTRALSIVQKACIGSKSCSIGASAS 829
Query: 726 YFGGDPCPGIHKALLVDAQC 745
F GDPC G+ K+L V+A C
Sbjct: 830 TF-GDPCRGVAKSLAVEASC 848
>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 852
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 335/807 (41%), Positives = 463/807 (57%), Gaps = 79/807 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW+ HEP+K +Y+F GR D+++F+K GLYV LRIG
Sbjct: 62 MWPELIQKSKDGGLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+P+K
Sbjct: 122 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ W+A MA+ TGVPW MC+Q DAP P+IN CNG C
Sbjct: 182 LSQIENEYGNIDSAYGAAAKSYIKWSASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +G R +D+AF VA F + G++ NYYMYH
Sbjct: 242 DQFT--PNSNNKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ +I+ YD AP+DEYGL+R+PKWGHL++LH AIKLC L+ +
Sbjct: 300 GGTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTIT 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ SG CAAFL N D + TV F SY LP S+SILPDCK VAFNT
Sbjct: 360 SLGSNLEAAVYKTESGSCAAFLANVDTKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNT 419
Query: 329 ERV-----STQYNKRSKTSNLKFDSD--EKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++ ST + ++S + ++ +W +E I GLL+QI+ D
Sbjct: 420 AKINSATESTAFARQSLKPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTAD 479
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ R + ++A L ++S G +++AF+NG+ GS HG +L
Sbjct: 480 KSDYLWYSLRTDIKGDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGKQ---KISL 536
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK------SFTNCSWGY 488
++L GTN LLSVTVGL + GAF + AG+ V ++ + W Y
Sbjct: 537 DIPINLVTGTNTIDLLSVTVGLANYGAFFDLMGAGITGPVTLKSAKGGSSIDLASQQWTY 596
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKG 546
QVGL GE + + ++ W S PT+Q L WYKTTF AP+G++P+A++ GKG
Sbjct: 597 QVGLKGEDTGLAT---VDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKG 653
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQT-----QYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
AWVNGQSIGRYW + G +++ Y N + C T YHVPR++L
Sbjct: 654 IAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKC--LKNCGKPSQT-LYHVPRSWL 710
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
KP+GN+LVL EE G+P I+ T +C V+ SH PP+ +W D+ I
Sbjct: 711 KPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTS-----DSKISNR 765
Query: 661 GK-KPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
+ +P + CP+ + I I FASFG P G C + G C+SS S +V++ACIG C
Sbjct: 766 NRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSC 825
Query: 719 SIPLLSRYFGGDPCPGIHKALLVDAQC 745
++ + +R F G+PC G+ K+L V+A C
Sbjct: 826 NVEVSTRVF-GEPCRGVVKSLAVEASC 851
>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 848
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 349/805 (43%), Positives = 462/805 (57%), Gaps = 73/805 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP +GQYDF GR D+++F+K + + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGLDVIETYVFWNLHEPVRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 116 PYVCAEWNYGGFPVWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDMIKQEKLYASQGGPVI 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MA TGVPWVMC Q DAP P+IN NG
Sbjct: 176 LSQIENEYGNIDTAYGAAGKSYIKWAATMATSLDTGVPWVMCLQADAPDPIINTWNGFY- 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+ F PNS KP +WTE+W+ ++ V+GG R +D+AF VA F + G++ NYYMYH
Sbjct: 235 GDEFT-PNSNTKPKMWTENWSGWFLVFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF R + F+ T Y AP+DEYG++R+PKWGHLKE+H AIKLC L+ +
Sbjct: 294 GGTNFDRASGGPFIATSYDYDAPIDEYGIIRQPKWGHLKEVHKAIKLCEEALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ +T VCAAFL N + VTV F SY LP S+SILPDCK+V NT
Sbjct: 354 SLGPNLEAAVY-KTGSVCAAFLANVGTKSDVTVNFSGNSYHLPAWSVSILPDCKSVVLNT 412
Query: 329 ERVSTQYNKRS-KTSNLKFD------SDEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++++ S T + K D S W E + GLL+QI+ D
Sbjct: 413 AKINSASAISSFTTESSKEDIGSSEASSTGWSWISEPVGISKTDSFSQTGLLEQINTTAD 472
Query: 382 ASDYFWYTFRFHYNS-SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNV--------S 432
SDY WY+ Y + +++Q L ++S GH LHAF+NG+ G H +
Sbjct: 473 KSDYLWYSLSIDYKADASSQTVLHIESLGHALHAFINGKLAGKYKLKHSQLIICNSGKYK 532
Query: 433 FTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS------- 485
FT+ V L G N LLS+TVGL + GAF + G+ + K F N +
Sbjct: 533 FTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGVGITGPVIL-KGFANGNTLDLSSQ 591
Query: 486 -WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSM 543
W YQVGL GE L + S L S+ P Q LTWYKTTF AP+G+DP+A++ M
Sbjct: 592 KWTYQVGLQGEDLGLSSGSSGQWNLQSTF--PKNQPLTWYKTTFSAPSGSDPVAIDFTGM 649
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPSQT-QYAVNTVTSIHFCAIIKATNT-YHVPRAFL 601
GKGEAWVNGQ IGRYW ++ S + + + Y S K + T YHVPR++L
Sbjct: 650 GKGEAWVNGQRIGRYWPTYVASDASCTDSCNYRGPYSASKCRKNCEKPSQTLYHVPRSWL 709
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
KP+GN+LVL EE G+P I+ T +C HV++SH PP+ W + G +K G
Sbjct: 710 KPSGNILVLFEERGGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSETESG----RKVG 765
Query: 662 KKPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSI 720
P + +CP + IS I FAS+G P G C + G C S+ + +V++ACIG S CS+
Sbjct: 766 --PVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSV 823
Query: 721 PLLSRYFGGDPCPGIHKALLVDAQC 745
+ S F GDPC G+ K+L V+A C
Sbjct: 824 GVSSDTF-GDPCRGMAKSLAVEATC 847
>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
Length = 844
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 343/802 (42%), Positives = 457/802 (56%), Gaps = 70/802 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I K+K+GGLDVI+TYVFWNLHEP + QYDF GR D+++FIK + + GLYV +RIG
Sbjct: 57 MWPGIIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V G+ FR+DN+P+K
Sbjct: 117 PYVCAEWNYGGFPVWLHFVPGVQFRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ +F YV WAA MA +TGVPWVMC Q DAP P+IN CNG C
Sbjct: 177 LSQIENEYGNVQSSFGSAAKSYVQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +GG R +D+AF VA F GS NYYMYH
Sbjct: 237 DQF--TPNSNNKPKMWTENWSGWFLSFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYH 294
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ F+ T Y AP+DEYGLVR+PKWGHL+++H AIK+C L++ V
Sbjct: 295 GGTNFGRTSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ S C+AFL N D + TV F SY LP S+SILPDCK V NT
Sbjct: 355 SLGPNLEATVYKSGSQ-CSAFLANVDTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNT 413
Query: 329 ERVSTQYNKRSKTSN-LKFDS------DEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++++ + S ++ LK D D W E I N GL +QI+ D
Sbjct: 414 AKINSVTTRPSFSNQPLKVDVSASEAFDSGWSWIDEPIGISKNNSFANLGLSEQINTTAD 473
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ Y ++ + L V S GH+LH F+N + GS GS + +L
Sbjct: 474 KSDYLWYSLSTDIKGDEPYLANGSNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSL 533
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK------SFTNCSWGY 488
+ L G N LLS+TVGL + GAF E + AGV V+++++ ++ W Y
Sbjct: 534 DIPITLVPGKNTIDLLSLTVGLQNYGAFFELRGAGVTGPVKLENQKNNITVDLSSGQWTY 593
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSP-TRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
Q+GL GE L + S G S P + LTWYKTTF APAG+DP+AL+ GKGE
Sbjct: 594 QIGLEGEDLGLPS--GSTSQWLSQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGE 651
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTG 605
AW+NG SIGRYW S+ S S Y A + + C T YHVP+++LKPTG
Sbjct: 652 AWINGHSIGRYWPSYIASGQCTSYCDYKGAYSANKCLRNCGKPSQT-LYHVPQSWLKPTG 710
Query: 606 NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPT 665
N LVL EE +P +T + + +C HV+ SH PP+ W +D K+ P
Sbjct: 711 NTLVLFEEIGSDPTRLTFASKQLGSLCSHVSESHPPPVEMW-------SSDSKQQKTGPV 763
Query: 666 VQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
+ CP + IS I FASFG P G C ++ G C + ++ +V++ACIG CSI +
Sbjct: 764 LSLECPSPSQVISSIKFASFGTPRGTCGSFSHGQCSTRNALSIVQKACIGSKSCSIDVSI 823
Query: 725 RYFGGDPCPGIHKALLVDAQCR 746
+ F GDPC G K+L V+A C+
Sbjct: 824 KAF-GDPCRGKTKSLAVEAYCQ 844
>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
Length = 839
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 317/791 (40%), Positives = 448/791 (56%), Gaps = 72/791 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGGL+ I+TYVFWN HEP+ G+++F GRND+I+F+K IQS G+Y +RIG
Sbjct: 68 MWPKLLKTAKEGGLNTIETYVFWNAHEPEPGKFNFEGRNDMIKFLKLIQSFGMYAIVRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI+ EW +G LP WL ++ I+FR++N+PYK
Sbjct: 128 PFIQGEWNHGALPYWLREIPHIIFRANNEPYKREMEKFVRFIVQMLKDENLFASQGGNVI 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ +G Y+ WAA+MA+ + GVPW+MCKQ APG VI CNG C
Sbjct: 188 LAQIENEYGNIKKDHITEGDKYLEWAAEMAISTNIGVPWIMCKQSTAPGVVIPTCNGRHC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ + NKP +WTE+WT+ ++ +G RSA+DIA+ V F AK G+ VNYYMY+
Sbjct: 248 GDTWIMKDE-NKPHLWTENWTAQFRAFGNDLAQRSAEDIAYSVLRFFAKGGTLVNYYMYY 306
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+ P+DEYG+ + PK+GHL++LH IK SR L G Q+
Sbjct: 307 GGTNFGRTGASYVLTGYYDEGPIDEYGMPKAPKYGHLRDLHNVIKSYSRAFLEGKQSFEL 366
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LGQ EA FE +C AF+ NN+ + TV+FR Y +P +S+SIL DCK V +NT
Sbjct: 367 LGQGYEARNFEIPEEKLCLAFISNNNTGEDGTVIFRGDKYYIPSRSVSILADCKHVVYNT 426
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+++RS K + WE + E I + T +R + L+Q + KD SDY WY
Sbjct: 427 KRVFVQHSERSFHKAEKATKNNVWEMFSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWY 486
Query: 389 TFRFHYNSS------NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F + + + + V+S H + FVN + G+ HGS FT + LR
Sbjct: 487 TTSFRLEADDLPIRGDIRPVIAVKSTAHAMVGFVNDAFAGNGHGSKKEKFFTFETPISLR 546
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-----WGYQVGLIGEKL 497
G N ALLS ++G+ DSG L G+ +Q + WG++ L GE
Sbjct: 547 LGVNHLALLSSSMGMKDSGGELVELKGGIQDCTIQGLNTGTLDLQINGWGHKAKLEGEVK 606
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+IY+ G+ V W S + +TWYK F P G+DP+ L++ SM KG +VNG+ +GR
Sbjct: 607 EIYTEKGMGAVKWVPAVS-GQAVTWYKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGR 665
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW S+KT SQ YH+PR FLK NLLV+ EEE G
Sbjct: 666 YWTSYKTPGKVASQA--------------------VYHIPRTFLKSKNNLLVVFEEELGK 705
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP--SCPLGKK 675
P GI + T+ +C ++ + + W H + IK + + +CP K
Sbjct: 706 PEGILIQTVRRDDICVFISEHNPAQIKPWDEHGGQ----IKLIAEDHNTRGFLNCPPKKI 761
Query: 676 ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPG 734
I ++VFASFGNP G C + VG+CH+ +++ +VE+ C+GK C +P+L ++G D CP
Sbjct: 762 IQEVVFASFGNPVGSCANFTVGTCHTPNAKEIVEKECLGKKGCVLPVLHTFYGADINCPT 821
Query: 735 IHKALLVDAQC 745
L V +C
Sbjct: 822 TTATLAVQVRC 832
>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
Full=Protein AR782; Flags: Precursor
gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 852
Score = 601 bits (1549), Expect = e-169, Method: Compositional matrix adjust.
Identities = 334/807 (41%), Positives = 461/807 (57%), Gaps = 79/807 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW+ HEP+K +Y+F GR D+++F+K GLYV LRIG
Sbjct: 62 MWPELIQKSKDGGLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+P+K
Sbjct: 122 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ W+A MA+ TGVPW MC+Q DAP P+IN CNG C
Sbjct: 182 LSQIENEYGNIDSAYGAAAKSYIKWSASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +G R +D+AF VA F + G++ NYYMYH
Sbjct: 242 DQFT--PNSNNKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ +I+ YD AP+DEYGL+R+PKWGHL++LH AIKLC L+ +
Sbjct: 300 GGTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTIT 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ SG CAAFL N D + TV F SY LP S+SILPDCK VAFNT
Sbjct: 360 SLGSNLEAAVYKTESGSCAAFLANVDTKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNT 419
Query: 329 ERV-----STQYNKRSKTSNLKFDSD--EKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++ ST + ++S + ++ +W +E I GLL+QI+ D
Sbjct: 420 AKINSATESTAFARQSLKPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTAD 479
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ R + ++A L ++S G +++AF+NG+ GS HG +L
Sbjct: 480 KSDYLWYSLRTDIKGDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGKQ---KISL 536
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-------FTNCSWGY 488
++L GTN LLSVTVGL + GAF + AG+ + + W Y
Sbjct: 537 DIPINLVTGTNTIDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTY 596
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKG 546
QVGL GE + + ++ W S PT+Q L WYKTTF AP+G++P+A++ GKG
Sbjct: 597 QVGLKGEDTGLAT---VDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKG 653
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQT-----QYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
AWVNGQSIGRYW + G +++ Y N + C T YHVPR++L
Sbjct: 654 IAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKC--LKNCGKPSQT-LYHVPRSWL 710
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
KP+GN+LVL EE G+P I+ T +C V+ SH PP+ +W D+ I
Sbjct: 711 KPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTS-----DSKISNR 765
Query: 661 GK-KPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
+ +P + CP+ + I I FASFG P G C + G C+SS S +V++ACIG C
Sbjct: 766 NRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSC 825
Query: 719 SIPLLSRYFGGDPCPGIHKALLVDAQC 745
++ + +R F G+PC G+ K+L V+A C
Sbjct: 826 NVEVSTRVF-GEPCRGVVKSLAVEASC 851
>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
sativus]
Length = 844
Score = 600 bits (1547), Expect = e-169, Method: Compositional matrix adjust.
Identities = 343/802 (42%), Positives = 456/802 (56%), Gaps = 70/802 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I K+K+GGLDVI+TYVFWNLHEP + QYDF GR D+++FIK + + GLYV +RIG
Sbjct: 57 MWPGIIQKSKDGGLDVIETYVFWNLHEPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V G+ FR+DN+P+K
Sbjct: 117 PYVCAEWNYGGFPVWLHFVPGVQFRTDNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ +F YV WAA MA +TGVPWVMC Q DAP P+IN CNG C
Sbjct: 177 LSQIENEYGNVQSSFGSAAKSYVQWAATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +GG R +D+AF VA F GS NYYMYH
Sbjct: 237 DQF--TPNSNNKPKMWTENWSGWFLSFGGALPYRPVEDLAFAVARFYQTGGSLQNYYMYH 294
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ F+ T Y AP+DEYGLVR+PKWGHL+++H AIK+C L++ V
Sbjct: 295 GGTNFGRTSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKMCEEALVSTDPAVT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ S C+AFL N D + TV F SY LP S+SILPDCK V NT
Sbjct: 355 SLGPNLEATVYKSGSQ-CSAFLANVDTQSDKTVTFNGNSYHLPAWSVSILPDCKNVVLNT 413
Query: 329 ERVSTQYNKRSKTSN-LKFDS------DEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++++ + S ++ LK D D W E I N GL +QI+ D
Sbjct: 414 AKINSVTTRPSFSNQPLKVDVSASEAFDSGWSWIDEPIGISKNNSFANLGLSEQINTTAD 473
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ Y ++ + L V S GH+LH F+N + GS GS + +L
Sbjct: 474 KSDYLWYSLSTDIKGDEPYLANGSNTVLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSL 533
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK------SFTNCSWGY 488
+ L G N LLS+TVGL + GAF E + AGV V++++ ++ W Y
Sbjct: 534 DIPITLVPGKNTIDLLSLTVGLQNYGAFFELRGAGVTGPVKLENXKNNITVDLSSGQWTY 593
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSP-TRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
Q+GL GE L + S G S P + LTWYKTTF APAG+DP+AL+ GKGE
Sbjct: 594 QIGLEGEDLGLPS--GSTSQWLSQPNLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGE 651
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTG 605
AW+NG SIGRYW S+ S S Y A + + C T YHVP+++LKPTG
Sbjct: 652 AWINGHSIGRYWPSYIASGQCTSYCDYKGAYSANKCLRNCGKPSQT-LYHVPQSWLKPTG 710
Query: 606 NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPT 665
N LVL EE +P +T + + +C HV+ SH PP+ W +D K+ P
Sbjct: 711 NTLVLFEEIGSDPTRLTFASKQLGSLCSHVSESHPPPVEMW-------SSDSKQQKTGPV 763
Query: 666 VQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
+ CP + IS I FASFG P G C ++ G C + ++ +V++ACIG CSI +
Sbjct: 764 LSLECPSPSQVISSIKFASFGTPRGTCGSFSHGQCSTRNALSIVQKACIGSKSCSIDVSI 823
Query: 725 RYFGGDPCPGIHKALLVDAQCR 746
+ F GDPC G K+L V+A C+
Sbjct: 824 KAF-GDPCRGKTKSLAVEAYCQ 844
>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 846
Score = 600 bits (1547), Expect = e-169, Method: Compositional matrix adjust.
Identities = 334/807 (41%), Positives = 461/807 (57%), Gaps = 79/807 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW+ HEP+K +Y+F GR D+++F+K GLYV LRIG
Sbjct: 56 MWPELIQKSKDGGLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ W+A MA+ TGVPW MC+Q DAP P+IN CNG C
Sbjct: 176 LSQIENEYGNIDSAYGAAAKSYIKWSASMALSLDTGVPWNMCQQTDAPDPMINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +G R +D+AF VA F + G++ NYYMYH
Sbjct: 236 DQFT--PNSNNKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ +I+ YD AP+DEYGL+R+PKWGHL++LH AIKLC L+ +
Sbjct: 294 GGTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ SG CAAFL N D + TV F SY LP S+SILPDCK VAFNT
Sbjct: 354 SLGSNLEAAVYKTESGSCAAFLANVDTKSDATVTFNGKSYNLPAWSVSILPDCKNVAFNT 413
Query: 329 ERV-----STQYNKRSKTSNLKFDSD--EKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++ ST + ++S + ++ +W +E I GLL+QI+ D
Sbjct: 414 AKINSATESTAFARQSLKPDGGSSAELGSQWSYIKEPIGISKADAFLKPGLLEQINTTAD 473
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ R + ++A L ++S G +++AF+NG+ GS HG +L
Sbjct: 474 KSDYLWYSLRTDIKGDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGKQ---KISL 530
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-------FTNCSWGY 488
++L GTN LLSVTVGL + GAF + AG+ + + W Y
Sbjct: 531 DIPINLVTGTNTIDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTY 590
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKG 546
QVGL GE + + ++ W S PT+Q L WYKTTF AP+G++P+A++ GKG
Sbjct: 591 QVGLKGEDTGLAT---VDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKG 647
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQT-----QYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
AWVNGQSIGRYW + G +++ Y N + C T YHVPR++L
Sbjct: 648 IAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKC--LKNCGKPSQT-LYHVPRSWL 704
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
KP+GN+LVL EE G+P I+ T +C V+ SH PP+ +W D+ I
Sbjct: 705 KPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTS-----DSKISNR 759
Query: 661 GK-KPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
+ +P + CP+ + I I FASFG P G C + G C+SS S +V++ACIG C
Sbjct: 760 NRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSC 819
Query: 719 SIPLLSRYFGGDPCPGIHKALLVDAQC 745
++ + +R F G+PC G+ K+L V+A C
Sbjct: 820 NVEVSTRVF-GEPCRGVVKSLAVEASC 845
>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
Length = 892
Score = 600 bits (1547), Expect = e-169, Method: Compositional matrix adjust.
Identities = 334/824 (40%), Positives = 453/824 (54%), Gaps = 89/824 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LIA++KEGG DVI+TY FWN HEP +GQY+F GR DI++F K + S GL++ +RIG
Sbjct: 67 MWPTLIARSKEGGADVIETYTFWNGHEPTRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIG 126
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG PIWL D+ GI FR+DN P+K
Sbjct: 127 PYACAEWNFGGFPIWLRDIPGIEFRTDNAPFKEEMERYVKKIVDLMISESLFSWQGGPII 186
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E F KG Y+ WAA+MAV GVPWVMC+Q DAP +I+ CN C
Sbjct: 187 LLQIENEYGNVESTFGPKGKLYMKWAAEMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYC 246
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PNS KP IWTE+W ++ WG + R ++DIAF +A F + GS NYYMY
Sbjct: 247 -DGFT-PNSEKKPKIWTENWNGWFADWGERLPYRPSEDIAFAIARFFQRGGSLQNYYMYF 304
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRTA IT Y APLDEYGL+R+PKWGHLK+LHAAIKLC L+ +
Sbjct: 305 GGTNFGRTAGGPTQITSYDYDAPLDEYGLLRQPKWGHLKDLHAAIKLCEPALVAADSPQY 364
Query: 268 ISLGQLQEAFVFEETS-----------GVCAAFLVNNDERKAVTVLFRNISYELPRKSIS 316
I LG QEA V+ TS G+CAAF+ N DE ++ TV F + LP S+S
Sbjct: 365 IKLGPKQEAHVYRGTSNNIGQYMSLNEGICAAFIANIDEHESATVKFYGQEFTLPPWSVS 424
Query: 317 ILPDCKTVAFNTERVSTQYNKRSKTSN----------LKFDSDEKWEEYREAILNFDNTL 366
ILPDC+ AFNT +V Q + ++ S+ L+ + K E + ++ + L
Sbjct: 425 ILPDCRNTAFNTAKVGAQTSIKTVGSDSVSVGNNSLFLQVITKSKLESFSQSWMTLKEPL 484
Query: 367 -------LRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGHI 411
++G+L+ ++ KD SDY WY R + + ++ +D+ S
Sbjct: 485 GVWGDKNFTSKGILEHLNVTKDQSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDF 544
Query: 412 LHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG- 470
+ FVNG+ GS G V V L QG ND LLS TVGL + GAFLE+ AG
Sbjct: 545 VRIFVNGQLAGSVKGKWIKVV----QPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGF 600
Query: 471 -----VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWY 523
+ + D + T W YQVGL GE L++Y W+ + T +WY
Sbjct: 601 KGQIKLTGCKSGDINLTTSLWTYQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWY 660
Query: 524 KTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTS 581
KT F AP G DP+AL+ SMGKG+AWVNG +GRYW + G Y A ++
Sbjct: 661 KTKFDAPGGTDPVALDFSSMGKGQAWVNGHHVGRYWTLVAPNNGCGRTCDYRGAYHSDKC 720
Query: 582 IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP 641
C I YH+PR++LK N+LV+ EE + P I++ T + +C V+ H P
Sbjct: 721 RTNCGEITQA-WYHIPRSWLKTLNNVLVIFEEIDKTPFDISISTRSTETICAQVSEKHYP 779
Query: 642 PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS 701
PL W D + K P + C G IS I FAS+G+P+G C++++ G CH+
Sbjct: 780 PLHKW--SHSEFDRKLSLMDKTPEMHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHA 837
Query: 702 SHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
++S VV +ACIG++ CSI + + F GDPC + K+L V A+C
Sbjct: 838 ANSLSVVSQACIGRTSCSIGISNGVF-GDPCRHVVKSLAVQAKC 880
>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
Length = 839
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 344/797 (43%), Positives = 448/797 (56%), Gaps = 66/797 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GG+DVI+TYVFWNLHEP +GQY+F GR D++ F+K + + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKAVAAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH +AGI FR++N+P+K
Sbjct: 116 PYVCAEWNYGGFPLWLHFIAGIKFRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ Y+ WAA MA TGVPW+MC+Q +AP P+IN CN C
Sbjct: 176 LSQIENEYGNIDTHDARAAKSYIDWAASMATSLDTGVPWIMCQQANAPDPIINTCNSFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 236 DQF--TPNSDNKPKMWTENWSGWFLAFGGAVPYRPVEDLAFAVARFFQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT I+ YD AP+DEYG +R+PKWGHLK+LH AIKLC L+ +
Sbjct: 294 GGTNFGRTTGGPFISTSYDYDAPIDEYGDIRQPKWGHLKDLHKAIKLCEEALIASDPTIT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S G E V+ +T VC+AFL N A TV F SY LP S+SILPDCK V NT
Sbjct: 354 SPGPNLETAVY-KTGAVCSAFLANIGMSDA-TVTFNGNSYHLPGWSVSILPDCKNVVLNT 411
Query: 329 ERVSTQYNKRS-KTSNLK------FDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
+V+T S T +LK S W E + GLL+QI+ D
Sbjct: 412 AKVNTASMISSFATESLKEKVDSLDSSSSGWSWISEPVGISTPDAFTKSGLLEQINTTAD 471
Query: 382 ASDYFWYTFRFHYNSSNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
SDY WY+ Y + P L ++S GH LHAFVNG+ GS GS N + +
Sbjct: 472 RSDYLWYSLSIVYEDNAGDQPVLHIESLGHALHAFVNGKLAGSKAGSSGNAKVNVDIPIT 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-------QDKSFTNCSWGYQVGLI 493
L G N LLS+TVGL + GAF + AG+ + T+ W YQVGL
Sbjct: 532 LVTGKNTIDLLSLTVGLQNYGAFYDTVGAGITGPVILKGLKNGSSVDLTSQQWTYQVGLQ 591
Query: 494 GEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE + + S N W+S + P Q LTWYKT F AP+G++P+A++ MGKGEAWVN
Sbjct: 592 GEFVGLSSG---NVGQWNSQSNLPANQPLTWYKTNFVAPSGSNPVAIDFTGMGKGEAWVN 648
Query: 552 GQSIGRYWVSFKT-SKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
GQSIGRYW ++ + + G Y S K + T YHVPRA+LKP N V
Sbjct: 649 GQSIGRYWPTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAWLKPDSNTFV 708
Query: 610 LLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS 669
L EE G+P I+ T I VC HVT SH PP+ +W + + +K G P +
Sbjct: 709 LFEESGGDPTKISFGTKQIESVCSHVTESHPPPVDTWNSNAESE----RKVG--PVLSLE 762
Query: 670 CPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
CP + IS I FASFG P G C Y GSC S+ + +V++ACIG S C+I + F
Sbjct: 763 CPYPNQAISSIKFASFGTPRGTCGNYNHGSCSSNRALSIVQKACIGSSSCNIGVSINTF- 821
Query: 729 GDPCPGIHKALLVDAQC 745
G+PC G+ K+L V+A C
Sbjct: 822 GNPCRGVTKSLAVEAAC 838
>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
Length = 838
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 311/792 (39%), Positives = 450/792 (56%), Gaps = 72/792 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AK GGL+ I+TYVFWN HEP+ G+Y F GR D+IRF+ I+ +Y +RIG
Sbjct: 66 MWDKLVKTAKMGGLNTIETYVFWNGHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N+P+K
Sbjct: 126 PFIQAEWNHGGLPYWLREIGHIIFRANNEPFKREMEKFVRFIVQKLKDAEMFAPQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ +G Y+ WAA+MA+ GVPWVMCKQ APG VI CNG C
Sbjct: 186 LSQIENEYGNIKKDRKVEGDKYLEWAAEMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ + NKP +WTE+WT+ ++ +G + RSA+DIA+ V F AK G+ VNYYMYH
Sbjct: 246 GDTWTLLDK-NKPRLWTENWTAQFRTFGDQLAQRSAEDIAYAVLRFFAKGGTLVNYYMYH 304
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+AP+DEYG+ +EPK+GHL++LH IK + L G Q+
Sbjct: 305 GGTNFGRTGASYVLTGYYDEAPMDEYGMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEI 364
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +E +C +FL NN+ + TV+FR + +P +S+SIL DCKTV +NT
Sbjct: 365 LGHGYEAHNYELPEDKLCLSFLSNNNTGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNT 424
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+++RS + + + WE Y EAI F T +R + L+Q + KD SDY WY
Sbjct: 425 KRVFVQHSERSFHTTDETSKNNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWY 484
Query: 389 TFRFHYNSS------NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F S + + + ++S H + F N + G+ GS SF + LR
Sbjct: 485 TTSFRLESDDLPFRRDIRPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLR 544
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
G N A+LS ++G+ DSG L G+ VQ + WG++ L GE
Sbjct: 545 VGINHIAMLSSSMGMKDSGGELVEVKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDK 604
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+IY+ G+ + W + +TWYK F P G+DPI +++ SM KG +VNG+ IGR
Sbjct: 605 EIYTEKGMAQFQWKPAENDL-PITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGR 663
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW SF T G+PSQ+ YH+PRAFLKP GNLL++ EEE G
Sbjct: 664 YWTSFITLAGHPSQS--------------------VYHIPRAFLKPKGNLLIIFEEELGK 703
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS--CPLGKK 675
P GI + T+ +C ++ + + +W + IK + + + + CP +
Sbjct: 704 PGGILIQTVRRDDICVFISEHNPAQIKTW----ESDGGQIKLIAEDTSTRGTLNCPPKRT 759
Query: 676 ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPG 734
I ++VFASFGNP+G C + G+CH+ ++ +VE+ C+GK C +P+++ +G D CP
Sbjct: 760 IQEVVFASFGNPEGACGNFTAGTCHTPDAKAIVEKECLGKESCVLPVVNTVYGADINCPA 819
Query: 735 IHKALLVDAQCR 746
L V +C+
Sbjct: 820 TTATLAVQVRCK 831
>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
Length = 919
Score = 598 bits (1542), Expect = e-168, Method: Compositional matrix adjust.
Identities = 343/825 (41%), Positives = 455/825 (55%), Gaps = 92/825 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAK KEGG DVI+TYVFWN HEP KGQY F R D+++F K + ++GL++ LRIG
Sbjct: 94 MWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYYFEERFDLVKFAKLVAAEGLFLFLRIG 153
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL D+ GI FR+DN+P+K
Sbjct: 154 PYACAEWNFGGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPII 213
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + + G Y+ WAA+MA+ TG+PWVMC+Q DAP +I+ CN C
Sbjct: 214 LQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYC 273
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS NKP+IWTEDW +Y WGG R A+D AF VA F + GS NYYMY
Sbjct: 274 -DGFK-PNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYF 331
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT--GTQN 266
GGTNF RTA IT Y AP+DEYG++R+PKWGHLK+LH AIKLC L+ G+
Sbjct: 332 GGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQ 391
Query: 267 VISLGQLQEAFVFE----ETSG-------VCAAFLVNNDERKAVTVLFRNISYELPRKSI 315
I LG +QEA V+ T+G +C+AFL N DE K +V SY LP S+
Sbjct: 392 YIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSV 451
Query: 316 SILPDCKTVAFNTERVSTQY------------NKRSKTSNLKFDS-----DEKWEEYREA 358
SILPDC+ VAFNT R+ Q + R K S L S W +E
Sbjct: 452 SILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKET 511
Query: 359 ILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGH 410
I + +G+L+ ++ KD SDY WYT R + ++S L +
Sbjct: 512 IGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRD 571
Query: 411 ILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG 470
+ FVNG+ GS G +L+ + L +G N+ LLS VGL + GAFLE+ AG
Sbjct: 572 VARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAG 627
Query: 471 VHRVRVQ-------DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTW 522
R +V D TN W YQVGL GE IY+ WS ++ + Q TW
Sbjct: 628 F-RGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQPFTW 686
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVT 580
YKT F P G DP+A++L SMGKG+AWVNG IGRYW G S Y A N
Sbjct: 687 YKTMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCSSSCYYPGAYNERK 746
Query: 581 SIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHL 640
C + N YH+PR +LK + NLLVL EE G+P I+++ + VC ++ ++
Sbjct: 747 CQSNCG-MPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYY 805
Query: 641 PPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCH 700
PPLS+W H G + P ++ C G IS+I FAS+G P G C ++ G+CH
Sbjct: 806 PPLSAW-SHLSSGRASVN--AATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCH 862
Query: 701 SSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+S + +V AC+G ++C+I + + F GDPC G+ K L V+A+C
Sbjct: 863 ASSTLDLVTEACVGNTKCAISVSNDVF-GDPCRGVLKDLAVEAKC 906
>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
Length = 851
Score = 598 bits (1542), Expect = e-168, Method: Compositional matrix adjust.
Identities = 337/804 (41%), Positives = 459/804 (57%), Gaps = 75/804 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWN HEP+K +Y+F GR D+++F+K GLYV LRIG
Sbjct: 63 MWPDLIQKSKDGGLDVIETYVFWNGHEPEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW YGG P+WLH V GI FR+DN+P+K
Sbjct: 123 PYACAEWNYGGFPVWLHFVPGIKFRTDNEPFKAEMQRFTAKIVDLMKQEKLYASQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ ++ G Y+ W+A MA+ TGVPW MC+Q DAP P+IN CNG C
Sbjct: 183 LSQIENEYGNIDSSYGAAGKSYMKWSASMALSLDTGVPWNMCQQGDAPDPIINTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTE+W+ ++ +G R +D+AF VA F + G++ NYYMYH
Sbjct: 243 DQFT--PNSNNKPKMWTENWSGWFLGFGEPSPYRPVEDLAFAVARFFQRGGTFQNYYMYH 300
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ +I+ YD AP+DEYGL+R+PKWGHL++LH AIKLC L+ +
Sbjct: 301 GGTNFERTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPKIT 360
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ ++G CAAFL N + TV F SY LP S+SILPDCK VAFNT
Sbjct: 361 SLGSNLEAAVYKTSTGSCAAFLANIGTKSDATVTFNGKSYRLPAWSVSILPDCKNVAFNT 420
Query: 329 ERV-----STQYNKRSKTSNLKFDSD--EKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++ ST + ++S N ++ +W +E + GLL+QI+ D
Sbjct: 421 AKINSATESTAFARQSLKPNADSSAELGSQWSYIKEPVGISKADAFVKPGLLEQINTTAD 480
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ R + ++A L VQS G +++AF+NG+ GS +G +L
Sbjct: 481 KSDYLWYSLRMDIKGDETFLDEGSKAVLHVQSIGQLVYAFINGKLAGSGNGKQ---KISL 537
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG----VHRVRVQDKSFTNCS---WGY 488
++L G N LLSVTVGL + G F + AG V + S T+ S W Y
Sbjct: 538 DIPINLVTGKNTIDLLSVTVGLANYGPFFDLTGAGITGPVSLKSAKTGSSTDLSSQQWTY 597
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGE 547
QVGL GE + S G + S+ PT Q L WYKTTF AP+G+DP+A++ GKG
Sbjct: 598 QVGLKGEDKGLGS--GDSSEWVSNSPLPTSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGI 655
Query: 548 AWVNGQSIGRYW-VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTG 605
AWVNGQSIGRYW S + G Y + ++ K + T YHVPR+++KP+G
Sbjct: 656 AWVNGQSIGRYWPTSIARTDGCVGSCDYRGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSG 715
Query: 606 NLLVLLEEENGNPLGITVDTIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK- 663
N LVLLEE G+P I+ T +C V+ SH P+ +W+ KF +
Sbjct: 716 NTLVLLEEMGGDPTKISFATKQTGSNLCLTVSQSHPAPVDTWISD--------SKFSNRT 767
Query: 664 -PTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIP 721
P + CP+ + IS I FASFG P G C ++ G C S+ S VV++AC+G C +
Sbjct: 768 SPVLSLKCPVSTQVISSIRFASFGTPTGTCGSFSYGHCSSARSLSVVQKACVGSRSCKVE 827
Query: 722 LLSRYFGGDPCPGIHKALLVDAQC 745
+ +R F G+PC G+ K+L V+A C
Sbjct: 828 VSTRVF-GEPCRGVVKSLAVEASC 850
>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 326/794 (41%), Positives = 452/794 (56%), Gaps = 61/794 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP +I K+KEGGLDVI+TYVFWN HEP +GQY F GR D++RF+K +Q GL+V LRIG
Sbjct: 66 VWPEIIRKSKEGGLDVIETYVFWNYHEPVRGQYYFEGRFDLVRFVKTVQEAGLFVHLRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW YGG P+WLH + G+ FR+ N +K
Sbjct: 126 PYACAEWNYGGFPLWLHFIPGVQFRTSNDIFKNAMKSFLTKIVDLMKDDNLFASQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY ++ A+ G YV WAA+ A+ +T VPWVMC Q+DAP PVIN CNG C
Sbjct: 186 LAQVENEYGNVQWAYGVGGELYVKWAAETAISLNTTVPWVMCVQEDAPDPVINTCNGFYC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNSP+KP +WTE+++ ++ +G R +D+AF VA F GS+ NYYMY
Sbjct: 246 DQF--TPNSPSKPKMWTENYSGWFLAFGYAVPYRPVEDLAFAVARFFEYGGSFQNYYMYF 303
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA ++ YD AP+DEYG +R+PKWGHL++LH+AIK C L++
Sbjct: 304 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRDLHSAIKQCEEYLVSSDPVHQ 363
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA V+ + S CAAFL N D V F +Y LP S+SIL DCK V FNT
Sbjct: 364 QLGNKLEAHVYYKHSNDCAAFLANYDSGSDANVTFNGNTYFLPAWSVSILADCKNVIFNT 423
Query: 329 ERVSTQYN------KRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
+V TQ + RS T + + W Y+E + + N GLL+QI+ KD
Sbjct: 424 AKVVTQRHIGDALFSRSTTVDGNLVAASPWSWYKEEVGIWGNNSFTKPGLLEQINTTKDT 483
Query: 383 SDYFWYTFRFHYNS-SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
SD+ WY+ + + + + L+++S GH FVN + +G+HD+ SF+L + L
Sbjct: 484 SDFLWYSTSLYVEAGQDKEHLLNIESLGHAALVFVNKRFVAFGYGNHDDASFSLTREISL 543
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-----KSFTNCSWGYQVGLIGEK 496
+G N +LS+ +G+ + G + + + AG+H V + D K ++ W YQVGL GE
Sbjct: 544 EEGNNTLDVLSMLIGVQNYGPWFDVQGAGIHSVFLVDLHKSKKDLSSGKWTYQVGLEGEY 603
Query: 497 LQIYSNLGLNKVLWSSIRS--PTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L + + N LWS S + L WYK T AP GN P+ALNL SMGKG+AW+NGQS
Sbjct: 604 LGLDNVSLANSSLWSQGTSLPVNKSLIWYKATIIAPEGNGPLALNLASMGKGQAWINGQS 663
Query: 555 IGRYWVSFKT-SKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
IGRYW ++ + S G Y A N+ C A YH+PR ++ P NLLVL
Sbjct: 664 IGRYWSAYLSPSAGCTDNCDYRGAYNSFKCQKKCG-QPAQTLYHIPRTWVHPGENLLVLH 722
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE G+P I++ T + +C V+ PP SW +++ + P V+ +C
Sbjct: 723 EELGGDPSQISLLTRTGQDICSIVSEDDPPPADSW-------KPNLEFMSQSPEVRLTCE 775
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G I+ I FASFG P+G C + G+CH+ +V++ACIG RCSIP+ + GDP
Sbjct: 776 HGWHIAAINFASFGTPEGKCGTFTPGNCHADMLT-IVQKACIGHERCSIPISAAKL-GDP 833
Query: 732 CPGIHKALLVDAQC 745
CPG+ K +V+A C
Sbjct: 834 CPGVVKRFVVEALC 847
>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
Length = 846
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 333/807 (41%), Positives = 458/807 (56%), Gaps = 79/807 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFW+ HEP+K +Y+F GR D+++F+K ++ GLYV LRIG
Sbjct: 56 MWPELIKKSKDGGLDVIETYVFWSGHEPEKNKYNFEGRYDLVKFVKLVEEAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEPFKEEMQRFTTKIVDLMKQEKLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ W+A MA+ TGVPW MC+Q DAP P+IN CNG C
Sbjct: 176 LSQIENEYGNIDSAYGAAAKIYIKWSASMALSLDTGVPWNMCQQADAPDPMINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP +WTE+W+ ++ +G R +D+AF VA F + G++ NYYMYH
Sbjct: 236 DQFT--PNSNSKPKMWTENWSGWFLGFGDPSPYRPVEDLAFAVARFYQRGGTFQNYYMYH 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ +I+ YD AP+DEYGL+R+PKWGHL++LH AIKLC L+ +
Sbjct: 294 GGTNFDRTSGGPLISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKLCEDALIATDPTIS 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ SG CAAFL N + TV F SY LP S+SILPDCK VAFNT
Sbjct: 354 SLGSNLEAAVYKTASGSCAAFLANVGTKSDATVSFNGESYHLPAWSVSILPDCKNVAFNT 413
Query: 329 ERVSTQYNKRS-KTSNLKFDS------DEKWEEYREAILNFDNTLLRAEGLLDQISAAKD 381
++++ + +LK D +W +E I GLL+QI+ D
Sbjct: 414 AKINSATEPTAFARQSLKPDGGSSAELGSEWSYIKEPIGISKADAFLKPGLLEQINTTAD 473
Query: 382 ASDYFWYTFRFH------YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
SDY WY+ R + ++A L ++S G +++AF+NG+ GS HG +L
Sbjct: 474 KSDYLWYSLRMDIKGDETFLDEGSKAVLHIESLGQVVYAFINGKLAGSGHGKQ---KISL 530
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-------FTNCSWGY 488
++L G N LLSVTVGL + GAF + AG+ + + W Y
Sbjct: 531 DIPINLAAGKNTVDLLSVTVGLANYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTY 590
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKG 546
QVGL GE + + ++ W S PT+Q L WYKTTF AP+G++P+A++ GKG
Sbjct: 591 QVGLKGEDTGLAT---VDSSEWVSKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKG 647
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQT-----QYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
AWVNGQSIGRYW + G + + Y N + C T YHVPR++L
Sbjct: 648 IAWVNGQSIGRYWPTSIAGNGGCTDSCDYRGSYRANKC--LKNCGKPSQT-LYHVPRSWL 704
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
KP+GN LVL EE G+P I+ T +C V+ SH PP+ +W D+ I
Sbjct: 705 KPSGNTLVLFEEMGGDPTQISFGTKQTGSNLCLMVSQSHPPPVDTWTS-----DSKISNR 759
Query: 661 GK-KPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
+ +P + CP+ + IS I FASFG P G C + G C+SS S VV++ACIG C
Sbjct: 760 NRTRPVLSLKCPVSTQVISSIKFASFGTPQGTCGSFTHGHCNSSRSLSVVQKACIGSRSC 819
Query: 719 SIPLLSRYFGGDPCPGIHKALLVDAQC 745
++ + +R F G+PC G+ K+L V+A C
Sbjct: 820 NVEVSTRVF-GEPCRGVIKSLAVEASC 845
>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 842
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 336/807 (41%), Positives = 452/807 (56%), Gaps = 77/807 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHE +GQYDF GR D+++F+K + GLYV LRIG
Sbjct: 52 MWPDLIQKSKDGGLDVIETYVFWNLHEAVRGQYDFGGRKDLVKFVKTVAEAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI R+DN+P+K
Sbjct: 112 PYVCAEWNYGGFPLWLHFIPGIQLRTDNEPFKAEMQRFTAKIVDMMKKEKLYASQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ Y+ WAA MAV TGVPWVMC+QDDAP VI+ CNG C
Sbjct: 172 LSQIENEYGNIDRAYGAAAQTYIKWAADMAVSLDTGVPWVMCQQDDAPPSVISTCNGFYC 231
Query: 150 GETFKGPNSPNK-PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
+ P P K P +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMY
Sbjct: 232 DQW--TPRLPEKRPKMWTENWSGWFLSFGGAVPQRPVEDLAFAVARFFQRGGTFQNYYMY 289
Query: 209 HGGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNFGR T F+ T Y AP+DEYGL+R+PKWGHLK++H AIKLC ++
Sbjct: 290 HGGTNFGRSTGGPFIATSYDYDAPIDEYGLLRQPKWGHLKDVHKAIKLCEEAMVATDPKY 349
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
S G EA V+ +T CAAFL N+D + TV F SY LP S+SILPDCK V N
Sbjct: 350 SSFGPNVEATVY-KTGSACAAFLANSDTKSDATVTFNGNSYHLPAWSVSILPDCKNVVLN 408
Query: 328 TERVST-----QYNKRSKTSNLKFDSDEK----WEEYREAILNFDNTLLRAEGLLDQISA 378
T ++++ + S ++ DS E W E + GLL+QI+
Sbjct: 409 TAKINSAAMIPSFMHHSVLDDI--DSSEALGSGWSWINEPVGISKKDAFTRVGLLEQINT 466
Query: 379 AKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVS 432
D SDY WY+ SS+ +Q L V+S GH LHAF+NG+ G + +N
Sbjct: 467 TADKSDYLWYSLSIDVTSSDTFLQDGSQTILHVESLGHALHAFINGKPAGRGIITANNGK 526
Query: 433 FTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS------- 485
++ V G N LLS+T+GL + GAF ++ AG+ VQ K N +
Sbjct: 527 ISVDIPVTFASGKNTIDLLSLTIGLQNYGAFFDKSGAGITG-PVQLKGLKNGTTTDLSSQ 585
Query: 486 -WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSM 543
W YQ+GL GE S + ++ P +Q LTWYK TF AP G++P+AL+ M
Sbjct: 586 RWTYQIGLQGEDSGFSSGSSSQWISQPTL--PKKQPLTWYKATFNAPDGSNPVALDFTGM 643
Query: 544 GKGEAWVNGQSIGRYWVSFKT-SKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAF 600
GKGEAWVNGQSIGRYW + + G P + ++ C + YHVPR++
Sbjct: 644 GKGEAWVNGQSIGRYWPTNNAPTSGCPDSCNFRGPYDSNKCRKNCG-KPSQELYHVPRSW 702
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
LKP+GN LVL EE G+P I+ T I +C HV+ SH P+ +W + G +K
Sbjct: 703 LKPSGNTLVLFEEIGGDPTQISFATRQIESLCSHVSESHPSPVDTWSSDSKAG----RKL 758
Query: 661 GKKPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
G P + CP + IS I FAS+G P G C ++ G C S+ + +V++AC+G CS
Sbjct: 759 G--PVLSLECPFPNQVISSIKFASYGKPQGTCGSFSHGQCKSTSALSIVQKACVGSKSCS 816
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQCR 746
I + + F GDPC G+ K+L V+A CR
Sbjct: 817 IEVSVKTF-GDPCKGVAKSLAVEASCR 842
>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 838
Score = 595 bits (1535), Expect = e-167, Method: Compositional matrix adjust.
Identities = 337/800 (42%), Positives = 451/800 (56%), Gaps = 74/800 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP +GQY+F GR D+++F+K + + GLYV LRIG
Sbjct: 57 MWPDLIQKSKDGGLDVIETYVFWNLHEPVQGQYNFEGRADLVKFVKAVAAAGLYVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+ +EW YGG P+WLH + GI FR+DNKP+
Sbjct: 117 PYACAEWNYGGFPLWLHFIPGIQFRTDNKPFEAEMKRFTVKIVDMMKQESLYASQGGPII 176
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
++ENEY I+ A+ Y+ WAA MA TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 177 LSQVENEYGNIDAAYGPAAKSYIKWAASMATSLDTGVPWVMCQQADAPDPIINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 237 DQF--TPNSNAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT I+ YD AP+D+YG++R+PKWGHLK++H AIKLC L+ +
Sbjct: 295 GGTNFGRTTGGPFISTSYDYDAPIDQYGIIRQPKWGHLKDVHKAIKLCEEALIATDPTIT 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S G EA V+ +T +CAAFL N A TV F SY LP S+SILPDCK V NT
Sbjct: 355 SPGPNIEAAVY-KTGSICAAFLANIATSDA-TVTFNGNSYHLPAWSVSILPDCKNVVLNT 412
Query: 329 ERVS--------TQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAK 380
+++ T + + + +L DS W E I + GLL+QI+
Sbjct: 413 AKINSASMISSFTTESFKEEVGSLD-DSGSGWSWISEPIGISKSDSFSKFGLLEQINTTA 471
Query: 381 DASDYFWYTFRFHYN-SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY+ S +Q L ++S GH LHAF+NG+ GS G+ + V
Sbjct: 472 DKSDYLWYSISIDVEGDSGSQTVLHIESLGHALHAFINGKIAGSGTGNSGKAKVNVDIPV 531
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS--------WGYQVG 491
L G N LLS+TVGL + GAF + AG+ + K N S W YQVG
Sbjct: 532 TLVAGKNSIDLLSLTVGLQNYGAFFDTWGAGITGPVIL-KGLKNGSTVDLSSQQWTYQVG 590
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRS-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
L E L + + W+S + PT Q L WYKT F AP+G++P+A++ MGKGEAW
Sbjct: 591 LKYEDLGPSNG---SSGQWNSQSTLPTNQSLIWYKTNFVAPSGSNPVAIDFTGMGKGEAW 647
Query: 550 VNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
VNGQSIGRYW ++ + G + + + A ++ + C T YH+PR++L+P N
Sbjct: 648 VNGQSIGRYWPTYVSPNGGCTDSCNYRGAYSSSKCLKNCGKPSQT-LYHIPRSWLQPDSN 706
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
LVL EE G+P I+ T I +C HV+ SH PP+ W + R K G P +
Sbjct: 707 TLVLFEESGGDPTQISFATKQIGSMCSHVSESHPPPVDLWNSDKGR------KVG--PVL 758
Query: 667 QPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
CP + IS I FASFG P G C + G C S+ + +V++ACIG S C I +
Sbjct: 759 SLECPYPNQLISSIKFASFGTPYGTCGNFKHGRCRSNKALSIVQKACIGSSSCRIGISIN 818
Query: 726 YFGGDPCPGIHKALLVDAQC 745
F GDPC G+ K+L V+A C
Sbjct: 819 TF-GDPCKGVTKSLAVEASC 837
>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
Length = 911
Score = 595 bits (1533), Expect = e-167, Method: Compositional matrix adjust.
Identities = 310/787 (39%), Positives = 447/787 (56%), Gaps = 72/787 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AK GGL+ I+TYVFWN HEP+ G+Y F GR D+IRF+ I+ +Y +RIG
Sbjct: 66 MWDKLVKTAKMGGLNTIETYVFWNGHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N+P+K
Sbjct: 126 PFIQAEWNHGGLPYWLREIGHIIFRANNEPFKREMEKFVRFIVQKLKDAEMFAPQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ +G Y+ WAA+MA+ GVPWVMCKQ APG VI CNG C
Sbjct: 186 LSQIENEYGNIKKDRKVEGDKYLEWAAEMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ + NKP +WTE+WT+ ++ +G + RSA+DIA+ V F AK G+ VNYYMYH
Sbjct: 246 GDTWTLLDK-NKPRLWTENWTAQFRTFGDQLAQRSAEDIAYAVLRFFAKGGTLVNYYMYH 304
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+AP+DEYG+ +EPK+GHL++LH IK + L G Q+
Sbjct: 305 GGTNFGRTGASYVLTGYYDEAPMDEYGMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEI 364
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +E +C +FL NN+ + TV+FR + +P +S+SIL DCKTV +NT
Sbjct: 365 LGHGYEAHNYELPEDKLCLSFLSNNNTGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNT 424
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+++RS + + + WE Y EAI F T +R + L+Q + KD SDY WY
Sbjct: 425 KRVFVQHSERSFHTTDETSKNNVWEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWY 484
Query: 389 TFRFHYNSS------NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F S + + + ++S H + F N + G+ GS SF + LR
Sbjct: 485 TTSFRLESDDLPFRRDIRPVIQIKSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLR 544
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
G N A+LS ++G+ DSG L G+ VQ + WG++ L GE
Sbjct: 545 VGINHIAMLSSSMGMKDSGGELVEVKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDK 604
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+IY+ G+ + W + +TWYK F P G+DPI +++ SM KG +VNG+ IGR
Sbjct: 605 EIYTEKGMAQFQWKPAENDL-PITWYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGR 663
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW SF T G+PSQ+ YH+PRAFLKP GNLL++ EEE G
Sbjct: 664 YWTSFITLAGHPSQS--------------------VYHIPRAFLKPKGNLLIIFEEELGK 703
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPS--CPLGKK 675
P GI + T+ +C ++ + + +W + IK + + + + CP +
Sbjct: 704 PGGILIQTVRRDDICVFISEHNPAQIKTW----ESDGGQIKLIAEDTSTRGTLNCPPKRT 759
Query: 676 ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPG 734
I ++VFASFGNP+G C + G+CH+ ++ +VE+ C+GK C +P+++ +G D CP
Sbjct: 760 IQEVVFASFGNPEGACGNFTAGTCHTPDAKAIVEKECLGKESCVLPVVNTVYGADINCPA 819
Query: 735 IHKALLV 741
L V
Sbjct: 820 TTATLAV 826
>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
Length = 835
Score = 595 bits (1533), Expect = e-167, Method: Compositional matrix adjust.
Identities = 310/794 (39%), Positives = 449/794 (56%), Gaps = 78/794 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ +AK+GGL+ I+TYVFWN HEP+ G+Y+F GR D+I+F+K IQ +Y +RIG
Sbjct: 63 MWPKLLDRAKDGGLNTIETYVFWNAHEPEPGKYNFEGRCDLIKFLKLIQDNDMYAVIRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N+PYK
Sbjct: 123 PFIQAEWNHGGLPYWLREIPHIIFRANNEPYKKEMEKFVRFIVQKLKDADMFASQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ G Y+ WAA+MA+ + G+PW+MCKQ APG VI CNG C
Sbjct: 183 LAQIENEYGNIKKDHITDGDKYLEWAAEMALSTNIGIPWIMCKQTTAPGVVIPTCNGRHC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ NKP +WTE+WT+ ++ +G + +RSA+DIA+ V F AK G+ VNYYMY+
Sbjct: 243 GDTWT-LRDKNKPRLWTENWTAQFRAFGDQAAVRSAEDIAYSVLRFFAKGGTLVNYYMYY 301
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+AP+DEYGL +EPK+GHL++LH IK + L G Q+
Sbjct: 302 GGTNFGRTGASYVLTGYYDEAPIDEYGLNKEPKFGHLRDLHKLIKSYHKAFLVGKQSFEL 361
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +E +C AF+ NN+ + TV+FR Y +P +S+SIL DC V +NT
Sbjct: 362 LGHGYEAHNYELPEENLCLAFISNNNTGEDGTVMFRGKKYYIPSRSVSILADCNHVVYNT 421
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+++RS + + + WE Y E I + T +R + L+Q + KD SDY WY
Sbjct: 422 KRVFVQHSERSFHTADESTKNNVWEMYSEPIPRYKVTSVRTKEPLEQYNLTKDKSDYLWY 481
Query: 389 TFRFHYNSS------NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F + + + + V+S H + FVN + GS GS + F + LR
Sbjct: 482 TTSFRLEADDLPFRRDIRPVVQVKSSAHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLR 541
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
G N ALLS ++G+ DSG L G+ +Q + WG+++ L GE
Sbjct: 542 IGINHLALLSSSMGMKDSGGELVEVKGGIQDCMIQGLNTGTLDLQGNGWGHKINLDGEDK 601
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+IY+ G+ V W + +TWY+ F P G+DP+ L++ SM KG +VNG+ +GR
Sbjct: 602 EIYTEKGMGTVKWKPAEN-GHAVTWYRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGR 660
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW S+KT G PSQ+ YH+PR FLK NLLV+ EEE G
Sbjct: 661 YWTSYKTIAGLPSQS--------------------LYHIPRPFLKSKKNLLVVFEEEIGK 700
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTD---IKKFGKKPTVQP--SCPL 672
P GI + T+ +C ++ + + +W D D IK + + + +CP
Sbjct: 701 PEGILIQTVRRDDICFLMSEHNPAQVKTW-------DADGGQIKLIAEDHSSRGILTCPH 753
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-P 731
K I ++VFASFGNP+G C + G+CH+ +++ V + C+GK C +PL+ +G D
Sbjct: 754 KKTIEEVVFASFGNPEGACGNFTAGTCHTPNAKEFVAKECLGKKSCVLPLIHTLYGADIN 813
Query: 732 CPGIHKALLVDAQC 745
CP L V +C
Sbjct: 814 CPTTTATLAVQVRC 827
>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
Length = 908
Score = 593 bits (1529), Expect = e-166, Method: Compositional matrix adjust.
Identities = 331/824 (40%), Positives = 458/824 (55%), Gaps = 89/824 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+IAK KEGG DVI+TY+FWN HEP KGQY F R D++RFIK + ++GL++ LRIG
Sbjct: 82 MWPSIIAKCKEGGADVIETYIFWNGHEPAKGQYYFEERFDLVRFIKLVAAEGLFLFLRIG 141
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL D+ GI FR+DN+PYK
Sbjct: 142 PYACAEWNFGGFPVWLRDIPGIEFRTDNEPYKAEMQTFVTKIVDMMKDEKLYSWQGGPII 201
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + + G Y+ WAA+MA+ TG+PWVMC+Q DAP +++ CN C
Sbjct: 202 LQQIENEYGNIQGKYGQAGKRYMQWAAQMALGLDTGIPWVMCRQTDAPEQILDTCNAFYC 261
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS NKP+IWTEDW +Y WGG R A+D AF VA F + GS NYYMY
Sbjct: 262 -DGFK-PNSYNKPTIWTEDWDGWYADWGGPLPHRPAEDSAFAVARFYQRGGSLQNYYMYF 319
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT--GTQN 266
GGTNF RTA IT Y AP++EYG++R+PKWGHLK+LH AIKLC L+ G+
Sbjct: 320 GGTNFARTAGGPLQITSYDYDAPINEYGMLRQPKWGHLKDLHTAIKLCEPALIAVDGSPQ 379
Query: 267 VISLGQLQEAFVFE--------ETSG---VCAAFLVNNDERKAVTVLFRNISYELPRKSI 315
+ LG +QEA ++ T+G +C+AFL N DE K V+V SY LP S+
Sbjct: 380 YVKLGSMQEAHIYSSAKVHTNGSTAGNAQICSAFLANIDEHKYVSVWIFGKSYNLPPWSV 439
Query: 316 SILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEK-----------------WEEYREA 358
SILPDC+ VAFNT RV Q + + S S + W +E
Sbjct: 440 SILPDCENVAFNTARVGAQTSVFTFESGSPSHSSRREPSVLLPGVRGSYLSSTWWTSKET 499
Query: 359 ILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGH 410
I + + +G+L+ ++ KD SDY WYT + ++S L +
Sbjct: 500 IGTWGDGSFATQGILEHLNVTKDISDYLWYTTSVNISDEDVAFWSSKGVLPSLIIDQIRD 559
Query: 411 ILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG 470
+ FVNG+ GS G +L+ + +G N+ LLS VGL + GAFLE+ AG
Sbjct: 560 VARVFVNGKLAGSQVGHW----VSLKQPIQFVRGLNELTLLSEIVGLQNYGAFLEKDGAG 615
Query: 471 VH-RVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTW 522
+V++ D TN +W YQVGL GE IY+ WS++++ Q TW
Sbjct: 616 FKGQVKLTGLSNGDTDLTNSAWTYQVGLKGEFSMIYTPEKQECAEWSAMQTDNIQSPFTW 675
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY-AVNTVTS 581
YKT AP G DP+A++L SMGKG+AWVNG+ IGRYW G PS Y + T
Sbjct: 676 YKTMVDAPEGTDPVAIDLGSMGKGQAWVNGRLIGRYWSLVAPESGCPSSCNYPGAYSETK 735
Query: 582 IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP 641
+ + YH+PR +L+ + NLLVL EE G+P I+++ + +C ++ ++ P
Sbjct: 736 CQSNCGMPTQSWYHIPREWLQESNNLLVLFEETGGDPSKISLEVHYTKTICSRISENYYP 795
Query: 642 PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS 701
PLS+W G + P + C G +IS+I FAS+G P G C+ ++ G CH+
Sbjct: 796 PLSAW-SWLDTGRVSVDSVA--PELLLRCDDGYEISRITFASYGTPSGGCQNFSKGKCHA 852
Query: 702 SHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ + V AC+GK++C+I + + F GDPC G+ K L V+A+C
Sbjct: 853 ASTLDFVTEACVGKNKCAISVSNDVF-GDPCRGVLKDLAVEAEC 895
>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 929
Score = 590 bits (1522), Expect = e-166, Method: Compositional matrix adjust.
Identities = 351/824 (42%), Positives = 456/824 (55%), Gaps = 91/824 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSL+ K+KEGG DV+Q+YVFWN HEP++GQY+F GR D+++FIK +Q GLY LRIG
Sbjct: 65 MWPSLVQKSKEGGADVVQSYVFWNGHEPKQGQYNFEGRYDLVKFIKVVQQAGLYFHLRIG 124
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P WL D+ GIVFR+DN+P+K
Sbjct: 125 PYVCAEWNFGGFPYWLKDIPGIVFRTDNEPFKVAMEGFVSKIVNLMKENQLFAWQGGPII 184
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE AF + G Y +WAA++A+ GVPWVMC+QDDAPG +IN CNG C
Sbjct: 185 MAQIENEYGNIEWAFGDGGKRYAMWAAELALGLDAGVPWVMCQQDDAPGNIINTCNGYYC 244
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK N+ KP+ WTEDW ++Q WG R +D AF +A F + GS+ NYYMY
Sbjct: 245 -DGFKA-NTATKPAFWTEDWNGWFQYWGQSVPHRPVEDNAFAIARFFQRGGSFQNYYMYF 302
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV- 267
GGTNF RTA FM T Y APLDEYGL+R+PKWGHL++LHAAIKLC P LT V
Sbjct: 303 GGTNFARTAGGPFMTTSYDYDAPLDEYGLIRQPKWGHLRDLHAAIKLC-EPALTAVDEVP 361
Query: 268 --ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
LG EA V+ G CAAFL N D K TV F+ +Y LP S+SILPDCK V
Sbjct: 362 LSTWLGPNVEAHVY-SGRGQCAAFLANIDSWKIATVQFKGKAYVLPPWSVSILPDCKNVV 420
Query: 326 FNTERVSTQYN------KRSKT-------SNLK--------FDSDEKWEEYREAILNFDN 364
FNT +V Q RSK SN+ S KWE E +
Sbjct: 421 FNTAQVGAQTTLTRMTIVRSKLEGEVVMPSNMLRKHAPESIVGSGLKWEASVEPVGIRGA 480
Query: 365 TLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGHILHAFV 416
L + LL+Q++ KD++DY WY+ + + +QA L + S +H FV
Sbjct: 481 ATLVSNRLLEQLNITKDSTDYLWYSISIKVSVEAVTALSKTKSQAILVLGSMRDAVHIFV 540
Query: 417 NGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH---R 473
N + GSA GS V V L++G ND LLS+TVGL + GA+LE AG+
Sbjct: 541 NRQLVGSAMGSDVQVV----QPVPLKEGKNDIDLLSMTVGLQNYGAYLETWGAGIRGSAL 596
Query: 474 VRVQDKSFTNCS---WGYQVGLIGEKLQIYSNLGLNKVLWSSIRS--PTRQLTWYKTTFR 528
+R + S W YQVG+ GE+ +++ + + W S S LTWYKTTF
Sbjct: 597 LRGLPSGVLDLSTERWSYQVGIQGEEKRLFETGTADGIQWDSSSSFPNASALTWYKTTFD 656
Query: 529 APAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCA 586
AP G DP+AL+L SMGKG+AWVNG +GRYW S S+ S Y A + C
Sbjct: 657 APKGTDPVALDLGSMGKGQAWVNGHHMGRYWPSVLASQSGCSTCDYRGAYDADKCRTNCG 716
Query: 587 I----IKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPP 642
+ + YH+PRA+L+ + NLLVL EE G+ +++ T + VC HV S PP
Sbjct: 717 KPSQRWQYVDMYHIPRAWLQLSNNLLVLFEEIGGDVSKVSLVTRSAPAVCTHVHESQPPP 776
Query: 643 LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS 702
+ W + D + G+ C G+ I I FASFGNP G C + G+CH+
Sbjct: 777 VLFWPANSSM-DAMSSRSGEAVL---ECIAGQHIRHIKFASFGNPKGSCGNFQRGTCHAM 832
Query: 703 HSQGVVERACIGKSRCSIPLLSRYFGG-DPCPGIHKALLVDAQC 745
S V +AC+G RCSIP+ + FG DPCP + K+L V C
Sbjct: 833 KSLEVARKACMGMHRCSIPVQWQTFGEFDPCPDVSKSLAVQVFC 876
>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 831
Score = 590 bits (1521), Expect = e-166, Method: Compositional matrix adjust.
Identities = 338/799 (42%), Positives = 462/799 (57%), Gaps = 75/799 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK+GGLDVIQTYVFW+ HEP +G Y+F+GR D+ +F++ + G+YV LRIG
Sbjct: 55 MWPGLIAKAKKGGLDVIQTYVFWSGHEPTQGVYNFAGRYDLPKFLRLVHEAGMYVNLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P WL + GI FR+DN+ +K
Sbjct: 115 PYVCAEWNFGGFPGWLRFLPGIEFRTDNESFKVHLSHSFTSSLISVYSRSFNIQLVICAQ 174
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
IENEY +I+ + E G Y+ W A MAV + VPW+MC Q DAP VI+ CNG C +
Sbjct: 175 IENEYGSIDAVYGEAGQKYLNWIANMAVATNISVPWIMCNQPDAPPSVIDTCNGFYC-DG 233
Query: 153 FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 212
F+ PNS KP++WTE+WT ++Q WG R QDIAF VA F K GS+++YYMYHGGT
Sbjct: 234 FR-PNSEGKPALWTENWTGWFQSWGEGAPTRPVQDIAFAVARFFQKGGSFMHYYMYHGGT 292
Query: 213 NFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV---IS 269
NF R+A + T Y AP+DEYG VR+PKWGHLK+LHAA+KLC L G V IS
Sbjct: 293 NFERSAMEGVTTNYDYDAPIDEYGDVRQPKWGHLKDLHAALKLCEL-CLVGVDTVPSEIS 351
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QEA V+ ++G CAAFL + + TVLF+ SY+LP S+SILPDCK+V FNT
Sbjct: 352 LGPYQEAHVYNSSTGACAAFLASWGTDDS-TVLFQGQSYDLPAWSVSILPDCKSVVFNTA 410
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
+V Q + S + + W YRE + + +T E L++QI+ KD +DY WYT
Sbjct: 411 KVGVQSMTMTMQSAIPVTN---WVSYREPLEPWGSTFSTNE-LVEQIATTKDTTDYLWYT 466
Query: 390 FRFHYNSSN-----AQAPLDVQSHGHILHAFVNGEYTG--SAHGSHDNVSFTLRNTVHLR 442
S+ AQA L + H FVN TG SAHGS + S + LR
Sbjct: 467 TNVEVAESDAPNGLAQATLVMSYLRDAAHIFVNKWLTGTKSAHGSEASQS------ISLR 520
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKS-----FTNCSWGYQVGLIGEK 496
G N +LS+T GL +G FLE++ AG+ +RV+ +W YQVGL GE
Sbjct: 521 PGINSVKVLSMTTGLQGTGPFLEKEKAGIQFGIRVEGLPSGAIIMQRNTWTYQVGLQGEN 580
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
+++ + G +WS+ + Q L+W+KTTF P N +AL+L SMGKG+ WVNG +
Sbjct: 581 NRLFESNGSLSAVWSTSTDVSNQMSLSWFKTTFDMPERNGTVALDLSSMGKGQVWVNGIN 640
Query: 555 IGRYWVS-FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLE 612
+GRYW S + G Y + S + + + YHVPR +L NLLVL E
Sbjct: 641 LGRYWSSCIAHTDGCVDNCDYRGSHSESKCLTKCGQPSQSWYHVPREWLLSKQNLLVLFE 700
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSH-LP-PLSSWLRHRQRGDTDIKKFGKKPTVQP-- 668
E+ GNP IT+ + +C ++ SH P PLSS + + T P + P
Sbjct: 701 EQEGNPEAITIAPRIPQHICSRMSESHPFPIPLSSSTKRGSQTST--------PPIAPLA 752
Query: 669 -SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
C G+ IS+I FAS+G P GDC + + SCH++ S+ V+ +AC+G+ +C +P++S
Sbjct: 753 LECADGQHISRISFASYGTPSGDCGDFKLSSCHANSSKDVLSKACVGRQKCLVPIVSSIC 812
Query: 728 GGDPCPGIHKALLVDAQCR 746
GGDPCPG+ K+L A+C+
Sbjct: 813 GGDPCPGMIKSLAATAECQ 831
>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 851
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 334/803 (41%), Positives = 437/803 (54%), Gaps = 69/803 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGG+DVI+TYVFWN HEP G Y F GR D+++F+K ++ G+++ LRIG
Sbjct: 59 MWPKLVQTAKEGGVDVIETYVFWNGHEPSPGNYYFGGRYDLVKFVKIVEQAGMHLILRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+P+WLH V G VFR++NKP+K
Sbjct: 119 PFVAAEWYFGGIPVWLHYVPGTVFRTENKPFKYHMQKFTTFIVDLMKQEKFFASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E + E G Y +WAA MAV + GVPW+MC+Q DAP VIN CN C
Sbjct: 179 LAQVENEYGYYEKDYGEGGKQYAMWAASMAVSQNIGVPWIMCQQFDAPESVINTCNSFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P NKP IWTE+W +++ +GG R A+DIAF VA F K GS NYYMYH
Sbjct: 239 DQF--TPIYQNKPKIWTENWPGWFKTFGGWNPHRPAEDIAFSVARFFQKGGSVHNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD +AP+DEYGL R PKWGHLK+LH AIKLC +L +
Sbjct: 297 GGTNFGRTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKQLHRAIKLCEHIMLNSQPTNV 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA VF +SG CAAF+ N D++ TV FRN+SY LP S+SILPDCK V FNT
Sbjct: 357 SLGPSLEADVFTNSSGACAAFIANMDDKNDKTVEFRNMSYHLPAWSVSILPDCKNVVFNT 416
Query: 329 ERVSTQYN---------KRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAA 379
+V +Q + + S S K D KW+ + E + GL+D I+
Sbjct: 417 AKVGSQSSVVEMLPESLQLSVGSADKSLKDLKWDVFVEKAGIWGEADFVKSGLVDHINTT 476
Query: 380 KDASDYFWYTFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
K +DY WYT + + L ++S GH +HAFVN E SA G+ + F
Sbjct: 477 KFTTDYLWYTTSILVGENEEFLKKGSSPVLLIESKGHAVHAFVNQELQASAAGNGTHFPF 536
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-------W 486
L+ + L++G ND ALLS+TVGL ++G+F E AG+ V++Q F N + W
Sbjct: 537 KLKAPISLKEGKNDIALLSMTVGLQNAGSFYEWVGAGLTSVKIQ--GFNNGTIDLSAYNW 594
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMG 544
Y++GL GE + G V W S P ++ LTWYK P G+DP+ L++ MG
Sbjct: 595 TYKIGLEGEHQGLDKEEGFGNVNWISASEPPKEQPLTWYKVIVDPPPGDDPVGLDMIHMG 654
Query: 545 KGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKP 603
KG AW+NG+ IGRYW G + Y + T YHVPR++ K
Sbjct: 655 KGLAWLNGEEIGRYWPRKGPLHGCVKECNYRGKFDPDKCNTGCGEPTQRWYHVPRSWFKQ 714
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP-PLSSWLRHRQRGDTDIKKFGK 662
+GN+LV+ EE+ G+P I I VC V ++ L SW G K
Sbjct: 715 SGNVLVIFEEKGGDPSKIEFSRRKITGVCALVAENYPSIDLESW----NDGSGSNKTVA- 769
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
T+ CP IS + FASFGNP G C Y G CH +S VVE+ C+ K+RC I L
Sbjct: 770 --TIHLGCPEDTHISSVKFASFGNPTGACRSYTQGDCHDPNSISVVEKVCLNKNRCDIEL 827
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
F C K L V+ QC
Sbjct: 828 TGENFNKGSCLSEPKKLAVEVQC 850
>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 859
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 341/786 (43%), Positives = 433/786 (55%), Gaps = 86/786 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LIAK+KEGG DV+QTYVFWN HEP KGQY+F GR D+++F+K I S GLY+ LRIG
Sbjct: 68 MWSDLIAKSKEGGADVVQTYVFWNGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR+DN+P+K
Sbjct: 128 PYVCAEWNFGGFPVWLRDIPGIEFRTDNEPFKKEMQKFVTKIVDLMREAKLFCWQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E ++ +KG YV WAA MA+ GVPWVMCKQ DAP +I+ACNG C
Sbjct: 188 MLQIENEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS KP +WTEDW +Y WGG R A+D+AF VA F + GS+ NYYMY
Sbjct: 248 -DGFK-PNSRTKPVLWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQRGGSFQNYYMYF 305
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRT+ F IT Y APLDEYGL EPKWGHLK+LHAAIKLC L+
Sbjct: 306 GGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADAPQY 365
Query: 268 ISLGQLQEAFVFE---ETSG-VCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKT 323
LG QEA ++ ET G VCAAFL N DE K+ V F SY LP S+SILPDC+
Sbjct: 366 RKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRH 425
Query: 324 VAFNTERVSTQYNKRS------------------KTSNLKFDSDEKWEEYREAILNFDNT 365
VAFNT +V Q + ++ + N+ + S + W +E I +
Sbjct: 426 VAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYIS-KSWMALKEPIGIWGEN 484
Query: 366 LLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGHILHAFVN 417
+GLL+ ++ KD SDY W+ R + + + + + S +L FVN
Sbjct: 485 NFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVN 544
Query: 418 GEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG------V 471
+ GS G V QG ND LL+ TVGL + GAFLE+ AG +
Sbjct: 545 KQLAGSIVGHW----VKAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKL 600
Query: 472 HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRA 529
+ D + SW YQVGL GE +IY+ K WS++ + WYKT F
Sbjct: 601 TGFKNGDLDLSKSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDP 660
Query: 530 PAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAI 587
PAG DP+ LNL+SMG+G+AWVNGQ IGRYW G Y A N+ C
Sbjct: 661 PAGTDPVVLNLESMGRGQAWVNGQHIGRYWNIISQKDGCDRTCDYRGAYNSDKCTTNCG- 719
Query: 588 IKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSW 646
K T T YHVPR++LKP+ NLLVL EE GNP I+V T+ +CG V+ SH PPL W
Sbjct: 720 -KPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLRKW 778
Query: 647 -LRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ 705
G I P V C G IS I FAS+G P G C+ +++G CH+S+S
Sbjct: 779 STPDYINGTMSINSVA--PEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASNSL 836
Query: 706 GVVERA 711
+V
Sbjct: 837 SIVSEV 842
>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
Length = 803
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 314/790 (39%), Positives = 444/790 (56%), Gaps = 102/790 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP L+ +AKEGGL+ I+TY+FWN HEP+ G+Y+F GR D+++F+K IQ G+Y +RIG
Sbjct: 66 VWPKLLDRAKEGGLNTIETYIFWNAHEPEPGKYNFEGRLDLVKFLKMIQEHGMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N PYK
Sbjct: 126 PFIQAEWNHGGLPYWLREIDHIIFRANNDPYKKEMEKWTRFVVQKLKDAELFASQGGPVI 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ +G Y+ WAA+MA+ TGVPW+MCKQ APG VI CNG C
Sbjct: 186 LTQIENEYGNIKKDHKIEGDKYLEWAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ NKP +WTE+WT ++ +G + +RSA+DIA+ V F AK GS VNYYMYH
Sbjct: 246 GDTWT-LRDKNKPMLWTENWTQQFRAYGDQLAMRSAEDIAYAVLRFFAKGGSMVNYYMYH 304
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+A++++TGYYD+APLDEYG+ +EPK+GHL++LH I+ + L+G +
Sbjct: 305 GGTNFGRTSASYVLTGYYDEAPLDEYGMYKEPKFGHLRDLHNVIRSYQKAFLSGKHSSEI 364
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +FE +C +FL NN+ + TV+FR + + +P +S+SIL CK V +NT
Sbjct: 365 LGHGYEAQIFELPEENLCLSFLSNNNTGEDGTVIFRGVKHYVPSRSVSILAGCKDVVYNT 424
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+++RS ++ + +WE Y E + + +T +R + L+Q + KDASDY WY
Sbjct: 425 KRVFVQHSERSYHTSEVTSKNNQWEMYSEMVPKYKDTKIRTKEPLEQYNQTKDASDYLWY 484
Query: 389 TFRFHYNSS------NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F S + + L V+S H + F N + GSA G+ F V L+
Sbjct: 485 TTSFRLESDDLPFRGDIRPVLQVKSSAHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLK 544
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSN 502
G N LLS T+G+ DSG L G+ +Q G G + LQ+
Sbjct: 545 AGVNHVVLLSSTMGMKDSGGELAEVKGGIQECLIQ---------GLNTGTL--DLQVNG- 592
Query: 503 LGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSF 562
W +K F P G+DPI L++ SM KG +VNG+ IGRYWVSF
Sbjct: 593 -------WG-----------HKRYFDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYWVSF 634
Query: 563 KTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGIT 622
+T G PSQ YH+PR FLKP NLLV+ EEE G P GI
Sbjct: 635 RTLAGTPSQA--------------------VYHIPRPFLKPKDNLLVVFEEEMGKPDGIL 674
Query: 623 VDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTD---IKKFGKKPTVQPS--CPLGKKIS 677
V T+ +C ++ + + +W DTD IK + +V+ + CP K I
Sbjct: 675 VQTVTRDDICLLISEHNPGQIKTW-------DTDGVKIKLIAEDHSVRGTLMCPPEKIIQ 727
Query: 678 KIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGIH 736
++VFASFGNPDG C + VG+CH+ +++ +VE+ C+GK C +P+ +G D C
Sbjct: 728 EVVFASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTT 787
Query: 737 KALLVDAQCR 746
L V +CR
Sbjct: 788 GTLGVQVRCR 797
>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
Length = 1036
Score = 587 bits (1514), Expect = e-165, Method: Compositional matrix adjust.
Identities = 313/764 (40%), Positives = 443/764 (57%), Gaps = 78/764 (10%)
Query: 32 QYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY 91
QYDF GR D+++FIK I +GLYV LR+GPFI++EW +GGLP WL +V + FR++N+P+
Sbjct: 80 QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPF 139
Query: 92 K-------------------------------IENEYQTIEPAFHEKGPPYVLWAAKMAV 120
K IENEY ++ A+ E G Y+ WAA +
Sbjct: 140 KEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVE 199
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
+ G+PWVMCKQ+DAPG +INACNG CG+TF GPN +KPS+WTE+WT+ ++V+G P
Sbjct: 200 SMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPP 259
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVRE 240
R+ +DIAF VA + +KNGS+VNYYMYHGGTNFGRT+A F+ T YYD APLDE+GL +
Sbjct: 260 TQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAHFVTTRYYDDAPLDEFGLEKA 319
Query: 241 PKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET-SGVCAAFLVNNDERKAV 299
PK+GHLK +H A++LC + L G +LG E +E+ + VCAAFL NN+ R
Sbjct: 320 PKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTN 379
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAI 359
T+ F+ Y LP +SISILPDCKTV +NT ++ Q++ R + K K+E + E I
Sbjct: 380 TIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSENI 439
Query: 360 LNFDNTLLRAEGLL--DQISAAKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHI 411
+LL + L+ + KD +DY WYT + + + L V S GH
Sbjct: 440 ----PSLLDGDSLIPGELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHA 495
Query: 412 LHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV 471
L +VNGEY G AHG H+ SF V+ + G N ++L V GLPDSG+++E + AG
Sbjct: 496 LIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGP 555
Query: 472 HRVRVQD-KSFT-----NCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKT 525
+ + KS T N WG+ GL GEK ++Y+ G KV W + LTWYKT
Sbjct: 556 RAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEK-DGKRKPLTWYKT 614
Query: 526 TFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFC 585
F P G + +A+ +++MGKG WVNG +GRYW+SF + G P+QT+
Sbjct: 615 YFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSPLGEPTQTE------------ 662
Query: 586 AIIKATNTYHVPRAFLK--PTGNLLVLLEEENGNPLGITVDTIAIRK--VCGHVTNSHLP 641
YH+PR+F+K N+LV+LEEE G L ++D + + + +C +V +
Sbjct: 663 --------YHIPRSFMKGEKKKNMLVILEEEPGVKLE-SIDFVLVNRDTICSNVGEDYPV 713
Query: 642 PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS 701
+ SW R + + K K ++ CP K++ ++ FASFG+P G C + +G C +
Sbjct: 714 SVKSWKREGPKIVSRSKDMRLKAVMR--CPPEKQMVEVQFASFGDPTGTCGNFTMGKCSA 771
Query: 702 SHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
S S+ VVE+ C+G++ CSI + FG CP I K L V +C
Sbjct: 772 SKSKEVVEKECLGRNYCSIVVARETFGDKGCPEIVKTLAVQVKC 815
>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
Length = 860
Score = 586 bits (1511), Expect = e-164, Method: Compositional matrix adjust.
Identities = 338/805 (41%), Positives = 462/805 (57%), Gaps = 72/805 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAK+GGLDVI+TYVFW++HEP +GQYDF GR D+ F+K + GLYV LRIG
Sbjct: 67 MWPGIIQKAKDGGLDVIETYVFWDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIG 126
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 127 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKTEMQRFTAKVVDTMKGAGLYASQGGPII 186
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MA+ TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 187 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAISLDTGVPWVMCQQTDAPDPLINTCNGFYC 246
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 247 DQFT--PNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYH 304
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTN R++ F+ T Y AP+DEYGLVREPKWGHL+++H AIKLC L+ +
Sbjct: 305 GGTNLDRSSGGPFIATSYDYDAPIDEYGLVREPKWGHLRDVHKAIKLCEPALIATDPSYT 364
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLGQ EA V+ +T VCAAFL N D + TV F Y LP S+SILPDCK V NT
Sbjct: 365 SLGQNAEAAVY-KTGSVCAAFLANIDGQSDKTVTFNGRMYRLPAWSVSILPDCKNVVLNT 423
Query: 329 ERVSTQYNKRS----KTSNLKFDSD--------EKWEEYREAI-LNFDNTLLRAEGLLDQ 375
++++Q ++SN+ D W E + + DN L +A GL++Q
Sbjct: 424 AQINSQVTSSEMRYLESSNMASDGSFITPELAVSGWSYAIEPVGITKDNALTKA-GLMEQ 482
Query: 376 ISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDN 430
I+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS +
Sbjct: 483 INTTADASDFLWYSTSITVKGDEPYLNGSQSNLVVNSLGHVLQVYINGKIAGSAQGSASS 542
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKS----FTNCS 485
+ + + L G N LLS TVGL + GAF + AG+ V++ + ++
Sbjct: 543 SLISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGTNGALDLSSAE 602
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMG 544
W YQ+GL GE L +Y + S+ P Q L WYKT F PAG+DP+A++ MG
Sbjct: 603 WTYQIGLRGEDLHLYDPSEASPEWVSANAYPINQPLIWYKTKFTPPAGDDPVAIDFTGMG 662
Query: 545 KGEAWVNGQSIGRYW---VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
KGEAWVNGQSIGRYW ++ ++ N + + N+ + C T YHVPR+FL
Sbjct: 663 KGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGSYNSNKCLKKCGQPSQT-LYHVPRSFL 721
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
+P N +VL E+ G+P I+ VC V+ H + SW +Q ++++G
Sbjct: 722 QPGSNDIVLFEQFGGDPSKISFVIRQTGSVCAQVSEEHPAQIDSWNSSQQT----MQRYG 777
Query: 662 KKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSI 720
P ++ CP G+ IS I FASFG P G C Y+ G C S+ + VV+ ACIG S CS+
Sbjct: 778 --PELRLECPKDGQVISSIKFASFGTPSGTCGSYSHGECSSTQALSVVQEACIGVSSCSV 835
Query: 721 PLLSRYFGGDPCPGIHKALLVDAQC 745
P+ S YF G+PC G+ K+L V+A C
Sbjct: 836 PVSSNYF-GNPCTGVTKSLAVEAAC 859
>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 830
Score = 583 bits (1504), Expect = e-164, Method: Compositional matrix adjust.
Identities = 335/783 (42%), Positives = 460/783 (58%), Gaps = 51/783 (6%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFW++HEP +GQYDF GR D+ F+K + GLYV LRIG
Sbjct: 60 MWPGLIQKAKDGGLDVIETYVFWDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY---------KIENEYQTIEPAFHEKGPPY 111
P++ +EW YGG P+WLH + GI FR+DN+P+ KIENEY I+ A+ G Y
Sbjct: 120 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMQRFTAKIENEYGNIDSAYGAPGKAY 179
Query: 112 VLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTS 171
+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C + PNS KP +WTE+W+
Sbjct: 180 MRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFT--PNSAAKPKMWTENWSG 237
Query: 172 FYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FMITGYYDQA 230
++ +GG R +D+AF VA F + G++ NYYMYHGGTN R++ F+ T Y A
Sbjct: 238 WFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDA 297
Query: 231 PLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFL 290
P+DEYGLVR+PKWGHL+++H AIKLC L+ + SLG EA V++ S VCAAFL
Sbjct: 298 PIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGS-VCAAFL 356
Query: 291 VNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYN----KRSKTSNLKF 346
N D + TV F Y LP S+SILPDCK V NT ++++Q + ++SN+
Sbjct: 357 ANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVAS 416
Query: 347 DSD--------EKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS- 396
D W E + + DN L +A GL++QI+ DASD+ WY+
Sbjct: 417 DGSFVTPELAVSDWSYAIEPVGITKDNALTKA-GLMEQINTTADASDFLWYSTSITVKGD 475
Query: 397 ----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLS 452
+ +Q+ L V S GH+L ++NG+ GSA GS + + + + L G N LLS
Sbjct: 476 EPYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLS 535
Query: 453 VTVGLPDSGAFLERKVAGVH-RVRVQDKS----FTNCSWGYQVGLIGEKLQIYSNLGLNK 507
TVGL + GAF + AG+ V++ + ++ W YQ+GL GE L +Y +
Sbjct: 536 ATVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEASP 595
Query: 508 VLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW---VSFK 563
S+ P L WYKT F PAG+DP+A++ MGKGEAWVNGQSIGRYW ++ +
Sbjct: 596 EWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQ 655
Query: 564 TSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
+ N + A ++ + C T YHVPR+FL+P N LVL E G+P I+
Sbjct: 656 SGCVNSCNYRGAYSSSKCLKKCGQPSQT-LYHVPRSFLQPGSNDLVLFEHFGGDPSKISF 714
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL-GKKISKIVFA 682
VC V+ +H + SW + ++++G P ++ CP G+ IS + FA
Sbjct: 715 VMRQTGSVCAQVSEAHPAQIDSWSSQQP-----MQRYG--PALRLECPKEGQVISSVKFA 767
Query: 683 SFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVD 742
SFG P G C Y+ G C S+ + +V+ ACIG S CS+P+ S YF G+PC G+ K+L V+
Sbjct: 768 SFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSSNYF-GNPCTGVTKSLAVE 826
Query: 743 AQC 745
A C
Sbjct: 827 AAC 829
>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 822
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 309/789 (39%), Positives = 435/789 (55%), Gaps = 83/789 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP LI +AKEGGL+ I+TY+FWN HEP+ G+Y+F GR D+I+++K IQ +Y +RIG
Sbjct: 66 VWPKLIERAKEGGLNTIETYIFWNAHEPEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N PYK
Sbjct: 126 PFIQAEWNHGGLPYWLREIDHIIFRANNDPYKKEMEKFVRFIVQKLKDAELFASQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ G Y+ WAA+MA+ TGVPW+MCKQ APG VI CNG C
Sbjct: 186 LTQIENEYGNIKKDHATDGDKYLEWAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ NKP +WTE+WT ++ +G + +RSA+DIA+ V F AK GS VNYYMYH
Sbjct: 246 GDTWT-LRDKNKPMLWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYH 304
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+AP+DEYG+ +EPK+GHL++LH I+ + L G +
Sbjct: 305 GGTNFGRTGASYVLTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEI 364
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +FE +C +FL NN+ + TV+FR + +P +S+SIL CK V +NT
Sbjct: 365 LGHGYEAHIFELPEENLCLSFLSNNNTGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNT 424
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+N+RS ++ + +WE Y E I + +T +R + L+Q + KDASDY WY
Sbjct: 425 KRVFVQHNERSYHTSEVTSKNNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWY 484
Query: 389 TFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F S ++ + L V+S H + F N + G A GS F V L+
Sbjct: 485 TTSFRLESDDLPFRNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLK 544
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-----WGYQVGLIGEKL 497
G N LLS T+G+ DSG L +G+ +Q + WG++ L GE
Sbjct: 545 VGVNHVVLLSSTMGMKDSGGELAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDK 604
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+IYS G+ KV W + R TWYK F P G+DP+ L++ SM KG +VNG+ +GR
Sbjct: 605 EIYSEKGVGKVQWKPAEN-GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGR 663
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YWVS++T G PSQ YH+PR FLK NLLV+ EEE G
Sbjct: 664 YWVSYRTLAGTPSQA--------------------LYHIPRPFLKSKDNLLVVFEEEMGK 703
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKIS 677
P GI V T+ +C ++ + + +W + + ++ T+ CP K I
Sbjct: 704 PDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLM--CPPEKTIQ 761
Query: 678 KIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGIH 736
++VFASFGNP+G C + C+GK C +P+ +G D C
Sbjct: 762 EVVFASFGNPEGMCGNFT---------------ECLGKPSCMLPVDHTVYGADINCQSTT 806
Query: 737 KALLVDAQC 745
L V +C
Sbjct: 807 ATLGVQVRC 815
>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
Length = 833
Score = 580 bits (1495), Expect = e-162, Method: Compositional matrix adjust.
Identities = 311/790 (39%), Positives = 445/790 (56%), Gaps = 74/790 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AK+GGL+ I+TYVFWN HEP+ G+Y+F GRND+I+F+K IQS +Y +RIG
Sbjct: 65 MWHKLLKTAKDGGLNTIETYVFWNAHEPEPGKYNFEGRNDLIKFLKLIQSHDMYALVRIG 124
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N+PYK
Sbjct: 125 PFIQAEWNHGGLPYWLREIPHIIFRANNEPYKKEMEKFVRFIVQKLKDAEMFASQGGPVI 184
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ +G Y+ WAA+MA+ +TGVPW+MCKQ APG VI CNG C
Sbjct: 185 LAQIENEYGNIKKDHIVEGDKYLEWAAQMAISTNTGVPWIMCKQSTAPGEVIPTCNGRHC 244
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM-Y 208
G+T+ + NKP +WTE+WT+ ++ +G + +RSA+DIA+ V F AK G+ VNYYM Y
Sbjct: 245 GDTWTLKDK-NKPRLWTENWTAQFRAFGDQLALRSAEDIAYSVLRFFAKGGTLVNYYMQY 303
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
+GGTNFGRT A++++TGYYD+ P+DE + + PK+GHL++LH IK SR L G Q+
Sbjct: 304 YGGTNFGRTGASYVLTGYYDEGPVDEC-MPKAPKYGHLRDLHNLIKSYSRAFLEGKQSFE 362
Query: 269 SLGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
L EA FE +C AF+ NN+ + TV FR Y +P +S+SIL DCK V +N
Sbjct: 363 LLAHGYEAHNFEIPEEKLCLAFISNNNTGEDGTVNFRGDKYYIPSRSVSILADCKHVVYN 422
Query: 328 TERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
T+RV Q+++RS + K WE Y E I + T +R + ++Q + KD SDY
Sbjct: 423 TKRVFVQHSERSFHTAQKLAKSNAWEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYL- 481
Query: 388 YTFRFHYNS----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
FR + + + + V+S H L FVN + G+ GS F ++LR
Sbjct: 482 -CFRLEADDLPFRGDIRPVVQVKSTSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRI 540
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-----WGYQVGLIGEKLQ 498
G N ALLS ++G+ DSG L G+ +Q + WG++V L GE +
Sbjct: 541 GINHLALLSSSMGMKDSGGELVEVKGGIQDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKE 600
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
IY+ G+ V W + R +TWYK F P G DP+ L++ SMGKG +VNG+ +GRY
Sbjct: 601 IYTEKGMGAVKWVPATT-GRAVTWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRY 659
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
W S++T G PSQ YH+PR FLKP NLLV+ EEE G P
Sbjct: 660 WPSYRTVGGVPSQAM--------------------YHIPRPFLKPKNNLLVIFEEELGKP 699
Query: 619 LGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP--SCPLGKKI 676
GI + T+ +C ++ + + +W + IK + + + CP K I
Sbjct: 700 EGILIQTVRRDDICVFISEHNPAQIKTW----DKDGGQIKLIAEDHSTRGILKCPPKKTI 755
Query: 677 SKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGI 735
++VFASFGNP+G C + G+CH+ +++ +V + C+GK C +P+L +G D CP
Sbjct: 756 QEVVFASFGNPEGSCANFTAGTCHTPNAKDIVAKECLGKKSCVLPVLHTVYGADINCPTT 815
Query: 736 HKALLVDAQC 745
L V +C
Sbjct: 816 TATLAVQVRC 825
>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
Length = 841
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 327/793 (41%), Positives = 443/793 (55%), Gaps = 61/793 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP +I K+KEGGLDVI+TYVFWN HEP KGQY F GR D++RF+K IQ GL V LRIG
Sbjct: 60 VWPDIIRKSKEGGLDVIETYVFWNYHEPVKGQYYFEGRFDLVRFVKTIQEAGLLVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW YGG P+WLH + GI FR+ N+ +K
Sbjct: 120 PYACAEWNYGGFPLWLHFIPGIQFRTTNELFKEEMKLFLTKIVNMMKEENLFASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E A+ G YV WAA+ AV +T VPWVMC Q DAP P+IN CNG C
Sbjct: 180 LAQVENEYGNVEWAYGAAGELYVKWAAETAVSLNTSVPWVMCAQVDAPDPIINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
PNSP+KP +WTE+++ ++ +G R +D+AF VA F G++ NYYMY
Sbjct: 240 DRF--SPNSPSKPKMWTENYSGWFLSFGYAIPYRPVEDLAFAVARFFETGGTFQNYYMYF 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA ++ YD AP+DEYG +R+PKWGHL++LH AIK C L++
Sbjct: 298 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRDLHKAIKQCEEHLISSDPIHQ 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA ++ ++S CAAFL N D V F Y LP S+SILPDCK V FNT
Sbjct: 358 QLGNNLEAHIYYKSSNDCAAFLANYDSSSDANVTFNGNIYFLPAWSVSILPDCKNVIFNT 417
Query: 329 ERV-----STQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDAS 383
+V + S + N W Y+E + + N A GLL+QI+ KD S
Sbjct: 418 AKVLILNLGDDFFAHSTSVNEIPLEQIVWSWYKEEVGIWGNNSFTAPGLLEQINTTKDIS 477
Query: 384 DYFWYTFRFHYNSSNAQ-APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
D+ WY+ N+ + L+++S GH FVN G +G+HD+ SF+L + L
Sbjct: 478 DFLWYSTSISVNADQVKDIILNIESLGHAALVFVNKVLVGK-YGNHDDASFSLTEKISLI 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKL 497
+G N LLS+ +G+ + G + + + AG++ V + +S ++ W YQVGL GE
Sbjct: 537 EGNNTLDLLSMMIGVQNYGPWFDVQGAGIYAVLLVGQSKVKIDLSSEKWTYQVGLEGEYF 596
Query: 498 QIYSNLGLNKVLWSSIRSP--TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
+ N LW+ SP + L WYK TF AP G P+ALNL MGKG+AWVNGQSI
Sbjct: 597 GLDKVSLANSSLWTQGASPPINKSLIWYKGTFVAPEGKGPLALNLAGMGKGQAWVNGQSI 656
Query: 556 GRYWVSFKT-SKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
GRYW ++ + S G Y A ++ + C A YH+PR ++ P NLLVL E
Sbjct: 657 GRYWPAYLSPSTGCNDSCDYRGAYDSFKCLKKCG-QPAQTLYHIPRTWVHPGENLLVLHE 715
Query: 613 EENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL 672
E G+P I+V T ++C V+ PP SW + ++ K + P V+ +C
Sbjct: 716 ELGGDPSKISVLTRTGHEICSIVSEDDPPPADSW-----KSSSEFKS--QNPEVRLTCEQ 768
Query: 673 GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
G I I FASFG P G C + GSCH+ +V++ACIG+ CSI + + GDPC
Sbjct: 769 GWHIKSINFASFGTPAGICGTFNPGSCHADMLD-IVQKACIGQEGCSISISAANL-GDPC 826
Query: 733 PGIHKALLVDAQC 745
PG+ K V+A+C
Sbjct: 827 PGVLKRFAVEARC 839
>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
Length = 739
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 323/688 (46%), Positives = 407/688 (59%), Gaps = 58/688 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK GGLD I TYVFWN+HEP G Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 58 MWEDLIRKAKGGGLDAIDTYVFWNVHEPSPGIYNFEGRYDLVRFIKTVQRVGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + G Y WAAKMAV +TGVPWVMCKQDDAP PVINACNG C
Sbjct: 178 LSQIENEYGSESKQLGGAGYAYTNWAAKMAVGLNTGVPWVMCKQDDAPDPVINACNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE W+ ++ +GG Y R QD+AF VA FI K GSY+NYYMYH
Sbjct: 238 --DYFSPNKPYKPTLWTESWSGWFTEFGGPIYQRPVQDLAFAVARFIQKGGSYINYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+REPK+GHL +LH AIK C R L++ V
Sbjct: 296 GGTNFGRSAGGPFITTSYDYDAPIDEYGLIREPKYGHLMDLHKAIKQCERALVSSDPTVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A VF +G CAAFL N A V F N Y+LP SISILPDCKT FNT
Sbjct: 356 SLGAYEQAHVFSSKNGACAAFLANYHSNSAARVTFNNRKYDLPPWSISILPDCKTDVFNT 415
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
RV Q K + SN K S WE Y E + + +++ + A GLL+Q++A +D SDY
Sbjct: 416 ARVRFQTTKIQMLPSNSKLFS---WETYDEDVSSLSESSKITASGLLEQLNATRDTSDYL 472
Query: 387 WYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY +SS + + + V S GH +H F+NG++ GSA G+ ++ S T V+
Sbjct: 473 WYITSVDISSSESFLRGGNKPSISVHSAGHAVHVFINGQFLGSAFGTSEDRSCTFNGPVN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGE 495
LR GTN ALLSV VGLP+ G E AG+ V + K T W YQ+GL GE
Sbjct: 533 LRAGTNKIALLSVAVGLPNVGFHFETWKAGITGVLLYGLDHGQKDLTWQKWSYQIGLKGE 592
Query: 496 KLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
+ + S G++ V W +RS + QL W+K F AP G +P+AL+L SMGKG+ W+N
Sbjct: 593 AMNLVSPNGVSSVDWVRDSLDVRSQS-QLKWHKAYFNAPDGVEPLALDLSSMGKGQVWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW+ + +KG + YA + + T YHVPR++LKPT NL+VL
Sbjct: 652 GQSIGRYWMVY--AKGACNSCNYAGTYRPAKCQLGCGQPTQQWYHVPRSWLKPTNNLIVL 709
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNS 638
LEE GNP I++ I NS
Sbjct: 710 LEELGGNPWKISLQKRIIHTPASSEPNS 737
>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
Length = 706
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 282/615 (45%), Positives = 395/615 (64%), Gaps = 45/615 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGL+ I+TYVFWN+HEP+KG+++F G+ND++RF + IQ +Y +R+G
Sbjct: 73 MWPELIAKAKEGGLNTIETYVFWNIHEPEKGEFNFEGQNDVVRFFQLIQEHDMYAMVRLG 132
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ IVFR++N+PYK
Sbjct: 133 PFIQAEWNHGGLPYWLREIPDIVFRTNNEPYKMHMETFVKIIIKRLKDANLFASQGGPII 192
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF ++G Y+ WAAKMA+ + G+PW+MCKQ AP VI CNG C
Sbjct: 193 LAQIENEYQHMEAAFKDEGTKYINWAAKMAISTNIGIPWIMCKQTKAPSDVIPTCNGRNC 252
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ GP + + P +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYMYH
Sbjct: 253 GDTWPGPTNKSMPLLWTENWTAQYRVFGDPPSQRSAEDIAFAVARFFSVGGTLANYYMYH 312
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+AAF++ YYD+APLDE+GL +EPKWGHL++LH A+KLC + LL GT +
Sbjct: 313 GGTNFGRTSAAFVMPKYYDEAPLDEFGLYKEPKWGHLRDLHQALKLCKKALLWGTPSTEK 372
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+ EA VFE VC AFL N++ + T+ FR Y +PR SIS+L DC+TV F T
Sbjct: 373 LGKQLEARVFEMPEQKVCVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGT 432
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYR-EAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+ V+ Q+N+R+ + + WE + E + + +R D + KD +DY W
Sbjct: 433 QHVNAQHNQRTFHFADQTAQNNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVW 492
Query: 388 YTFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT F + S+ + L+V SHGH AFVN ++ G HG+ N +FTL + L
Sbjct: 493 YTSSFKLEADDMPIRSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDL 552
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEK 496
++G N A+L+ ++G+ DSGA++E ++AGV RV++ + TN WG+ VGL+GE+
Sbjct: 553 KKGVNHVAVLASSMGMTDSGAYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGER 612
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
QIY++ G+ V W + R LTWYK F P+G DP+ L++ +MGKG +VNGQ IG
Sbjct: 613 KQIYTDKGMGSVTWKPAMN-DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIG 671
Query: 557 RYWVSFKTSKGNPSQ 571
RYW+S+K + G PSQ
Sbjct: 672 RYWISYKHALGRPSQ 686
>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
Length = 731
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 315/673 (46%), Positives = 401/673 (59%), Gaps = 57/673 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWNLHEP G Y+F GR D++RFIK + GLYV LRIG
Sbjct: 58 MWEGLIQKAKDGGLDVIDTYVFWNLHEPSPGNYNFDGRYDLVRFIKLVHEAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 118 PYICAEWNFGGFPVWLKYVPGISFRTDNEPFKSAMQKFTQKIVQMMKDENLFESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+ AF G Y+ WAA MA+ TGVPWVMCK+ DAP PVIN CNG C
Sbjct: 178 LSQIENEYEPESKAFGSPGHAYMTWAAHMAISMDTGVPWVMCKEFDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE WT ++ +GG + R A+D+AF VA FI K GS VNYYMYH
Sbjct: 238 --DYFSPNKPYKPTMWTEAWTGWFTDFGGPNHQRPAEDLAFAVARFIQKGGSLVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD AP+DEYGL+R+PK+GHLKELH AIKLC + LL V
Sbjct: 296 GGTNFGRTSGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHKAIKLCEKALLAADSTVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A VF SG CAAFL N + ++A V F NI Y LP SISILPDCK V FNT
Sbjct: 356 SLGSYEQAHVFSSDSGGCAAFLSNYNTKQAARVKFNNIQYSLPPWSISILPDCKNVVFNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSD-EKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
V Q S+ L DS+ WE + E I + D+ ++ GLL+Q++ +D SDY
Sbjct: 416 AHVGVQ---TSQVHMLPTDSELLSWETFNEDISSVDDDKMITVAGLLEQLNITRDTSDYL 472
Query: 387 WYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WYT H +SS + + P L VQS GH LH F+NGE +GSAHG+ + FT +
Sbjct: 473 WYTTSVHISSSESFLRGGRLPVLTVQSAGHALHVFINGELSGSAHGTREQRRFTFTEDMK 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
G N +LLSV VGLP++G E G+ H + + T W Y+VGL G
Sbjct: 533 FHAGKNRISLLSVAVGLPNNGPRFETWNTGILGPVTLHGLDEGQRDLTWQKWSYKVGLKG 592
Query: 495 EKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S ++ V W S + + LTWYK F +P G+DP+AL++ SMGKG+ W+N
Sbjct: 593 EDMNLRSRKSVSLVDWIQGSLMVGKQQPLTWYKAYFNSPKGDDPLALDMGSMGKGQVWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
G SIGRYW + ++GN S Y+ + + T YHVPR++LK T NLLVL
Sbjct: 653 GHSIGRYWTLY--AEGNCSGCSYSATFRPARCQLGCGQPTQKWYHVPRSWLKSTRNLLVL 710
Query: 611 LEEENGNPLGITV 623
EE G+ I++
Sbjct: 711 FEEIGGDASRISL 723
>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
Length = 766
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 312/691 (45%), Positives = 406/691 (58%), Gaps = 51/691 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFW+ HEP G+Y F GR D+++FIK ++ GLYV LRIG
Sbjct: 67 MWPDLIQKAKEGGLDVIQTYVFWDGHEPSPGKYYFEGRYDLVKFIKLVKQAGLYVNLRIG 126
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW GG P+WL + GI FR+DN+P+K
Sbjct: 127 PYICAEWNLGGFPVWLKYIPGISFRTDNEPFKRYMAGFTKKIVEMMKAESLFEPQGGPII 186
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA MAV+ +TGVPW+MCKQD+ P P+IN CNG C
Sbjct: 187 MSQIENEYGPVEWEIGAIGKVYTRWAASMAVNLNTGVPWIMCKQDEVPDPIINTCNGFYC 246
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PN KP +WTE WT ++ +GG R +D+A+ V FI K GS++NYYMYH
Sbjct: 247 -DWFK-PNKDYKPIMWTELWTGWFTAFGGPVPYRPVEDVAYAVVKFIQKGGSFINYYMYH 304
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL REPKWGHL++LH AIK+C L++ V
Sbjct: 305 GGTNFGRTAGGPFIATSYDYDAPLDEYGLKREPKWGHLRDLHRAIKMCEPALVSNDPTVT 364
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+G QEA VF+ SG C+AFL N DE V V F+ + YELP SISILPDC V +NT
Sbjct: 365 KIGDSQEAHVFKFESGACSAFLENKDETNFVKVTFQGMQYELPPWSISILPDCVNVVYNT 424
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV TQ + + S +++ W Y E +++ + EGL +QIS KD++DY Y
Sbjct: 425 GRVGTQTSMMTMLS--ASNNEFSWASYNEDTASYNEESMTIEGLSEQISITKDSTDYLRY 482
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + N + P L V S GH L FVNG+ +G+A+GS ++ T V L
Sbjct: 483 TTDVTIGQNEGFLKNGEYPVLTVNSAGHALQVFVNGQLSGTAYGSVNDPRLTFSGKVKLW 542
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS VGLP+ G E GV + + + + W Y+VG+IGE
Sbjct: 543 AGNNKISLLSSAVGLPNVGTHFETWNYGVLGPVTLNGLNEGKRDLSLQKWSYKVGVIGEA 602
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
LQ++S G + V W S S + TWYKTTF AP GNDP+AL++ +MGKG+ W+NGQSIG
Sbjct: 603 LQLHSPTGSSSVEWGSSTSKIQPFTWYKTTFNAPGGNDPLALDMNTMGKGQIWINGQSIG 662
Query: 557 RYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
RYW ++K + G S Y F + YH+PR++L PTGNLLV+ EE
Sbjct: 663 RYWPAYK-ANGKCSACHYTGWYDEKKCGFNCGEASQRWYHIPRSWLNPTGNLLVVFEEWG 721
Query: 616 GNPLGITVDTIAIRKVCGHVTNSHLPPLSSW 646
G+P GIT+ I C ++ H P + +W
Sbjct: 722 GDPTGITLVRRTIGSACAYINEWH-PTVKNW 751
>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
gi|219886857|gb|ACL53803.1| unknown [Zea mays]
gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
Length = 852
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 335/805 (41%), Positives = 460/805 (57%), Gaps = 73/805 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFW++HEP +GQYDF GR D+ F+K + GLYV LRIG
Sbjct: 60 MWPGLIQKAKDGGLDVIETYVFWDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 120 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 180 LSQIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 240 DQFT--PNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYH 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTN R++ F+ T Y AP+DEYGLVR+PKWGHL+++H AIKLC L+ +
Sbjct: 298 GGTNLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ S VCAAFL N D + TV F Y LP S+SILPDCK V NT
Sbjct: 358 SLGPNVEAAVYKVGS-VCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNT 416
Query: 329 ERVSTQYN----KRSKTSNLKFDSD--------EKWEEYREAI-LNFDNTLLRAEGLLDQ 375
++++Q + ++SN+ D W E + + DN L +A GL++Q
Sbjct: 417 AQINSQTTGSEMRYLESSNVASDGSFVTPELAVSDWSYAIEPVGITKDNALTKA-GLMEQ 475
Query: 376 ISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDN 430
I+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS +
Sbjct: 476 INTTADASDFLWYSTSITVKGDEPYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASS 535
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKS----FTNCS 485
+ + + L G N LLS TVGL + GAF + AG+ V++ + ++
Sbjct: 536 SLISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAE 595
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMG 544
W YQ+GL GE L +Y + S+ P L WYKT F PAG+DP+A++ MG
Sbjct: 596 WTYQIGLRGEDLHLYDPSEASPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMG 655
Query: 545 KGEAWVNGQSIGRYW---VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
KGEAWVNGQSIGRYW ++ ++ N + A ++ + C T YHVPR+FL
Sbjct: 656 KGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQT-LYHVPRSFL 714
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
+P N LVL E G+P I+ VC V+ +H + SW + ++++G
Sbjct: 715 QPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQP-----MQRYG 769
Query: 662 KKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSI 720
P ++ CP G+ IS + FASFG P G C Y+ G C S+ + +V+ ACIG S CS+
Sbjct: 770 --PALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSV 827
Query: 721 PLLSRYFGGDPCPGIHKALLVDAQC 745
P+ S YF G+PC G+ K+L V+A C
Sbjct: 828 PVSSNYF-GNPCTGVTKSLAVEAAC 851
>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 571 bits (1472), Expect = e-160, Method: Compositional matrix adjust.
Identities = 309/672 (45%), Positives = 405/672 (60%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP G+Y+F GR D+++FIK +Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GGLP+WL V+G+ FR+DN+P+K
Sbjct: 115 PYICAEWNFGGLPVWLKYVSGMEFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV T VPW+MCKQ+DAP PVI+ CNG C
Sbjct: 175 MAQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F+ PN P KP +WTE WT ++ +GG R A+DIAF VA F+ NGSY NYYMYH
Sbjct: 235 -EGFR-PNKPYKPKMWTEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD AP+DEYGL+ EPK+GHL+ELH AIK C L++ V
Sbjct: 293 GGTNFGRTSSGLFIATSYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ SG CAAFL N D + +V V F+N+ Y+LP SISILPDCKTV +NT
Sbjct: 353 SLGSNQEAHVYRSKSGACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+VS+Q + T W+ Y E D++ LRA GL +Q + +D+SDY W
Sbjct: 413 AKVSSQGSSIKMTPA---GGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLW 469
Query: 388 YTFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S S L V S GH+LH FVNG+ G+ +G+ DN T V L
Sbjct: 470 YMTDINIASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
G N +LLSV+VGLP+ G + AGV + + W Y+VGL GE
Sbjct: 530 NAGINKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGE 589
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S+ + T+ LTWYK TF AP GN+P+AL++ SMGKG+ W+NG+
Sbjct: 590 SLSLHTLSGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGE 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+GR+W + ++G+ S+ YA N C + YHVPR++LK +GNLLV+
Sbjct: 650 GVGRHWPGY-AAQGDCSKCSYAGTFNEKKCQTNCG-QPSQRWYHVPRSWLKTSGNLLVVF 707
Query: 612 EEENGNPLGITV 623
EE G+P GI++
Sbjct: 708 EEWGGDPTGISL 719
>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 309/672 (45%), Positives = 405/672 (60%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP G+Y+F GR D+++FIK +Q GLYV LRIG
Sbjct: 55 MWPDLIEKAKDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GGLP+WL V+G+ FR+DN+P+K
Sbjct: 115 PYICAEWNFGGLPVWLKYVSGMEFRTDNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV T VPW+MCKQ+DAP PVI+ CNG C
Sbjct: 175 MAQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F+ PN P KP +WTE WT ++ +GG R A+DIAF VA F+ NGSY NYYMYH
Sbjct: 235 -EGFR-PNKPYKPKMWTEVWTGWFTKFGGPIPQRPAEDIAFSVARFVQNNGSYFNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD AP+DEYGL+ EPK+GHL+ELH AIK C L++ V
Sbjct: 293 GGTNFGRTSSGLFIATSYDYDAPIDEYGLLNEPKYGHLRELHKAIKQCEPALVSSYPTVT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ SG CAAFL N D + +V V F+N+ Y+LP SISILPDCKTV +NT
Sbjct: 353 SLGSNQEAHVYRSKSGACAAFLSNYDAKYSVRVSFQNLPYDLPPWSISILPDCKTVVYNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+VS+Q + T W+ Y E D++ LRA GL +Q + +D+SDY W
Sbjct: 413 AKVSSQGSSIKMTPA---GGGLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLW 469
Query: 388 YTFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S S L V S GH+LH FVNG+ G+ +G+ DN T V L
Sbjct: 470 YMTDVNIASNEGFLKSGKDPYLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
G N +LLSV+VGLP+ G + AGV + + W Y+VGL GE
Sbjct: 530 NAGINKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGE 589
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S+ + T+ LTWYK TF AP GN+P+AL++ SMGKG+ W+NG+
Sbjct: 590 SLSLHTLSGSSSVEWVQGSLVARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGE 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+GR+W + ++G+ S+ YA N C + YHVPR++LK +GNLLV+
Sbjct: 650 GVGRHWPGY-AAQGDCSKCSYAGTFNEKKCQTNCG-QPSQRWYHVPRSWLKTSGNLLVVF 707
Query: 612 EEENGNPLGITV 623
EE G+P GI++
Sbjct: 708 EEWGGDPTGISL 719
>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
Length = 802
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 324/788 (41%), Positives = 437/788 (55%), Gaps = 80/788 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAKEGGLDVI+TYVFW+ HEP GQY F GR D+++F+K +Q GL V LRIG
Sbjct: 50 MWPGIIQKAKEGGLDVIETYVFWDRHEPSPGQYYFEGRYDLVKFVKLVQQAGLLVNLRIG 109
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW GG PIWL D+ IVFR+DN+P+K
Sbjct: 110 PYVCAEWNLGGFPIWLRDIPHIVFRTDNEPFKKYMQSFLTKIVNMMKEENLFASQGGPII 169
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY ++ + E G Y+ WAA+MA +TGVPW+MC Q P +I+ CNGM C
Sbjct: 170 LAQVENEYGNVDSHYGEAGVRYINWAAEMAQAQNTGVPWIMCAQSKVPEYIIDTCNGMYC 229
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
P KP++WTE +T ++ +G R +DIAF VA F + GS+ NYYMY
Sbjct: 230 DGW--NPTLYKKPTMWTESYTGWFTYYGWPLPHRPVEDIAFAVARFFERGGSFHNYYMYF 287
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ + YD APLDEYG+ PKWGHLK+LH +KL +L+
Sbjct: 288 GGTNFGRTSGGPYVASSYDYDAPLDEYGMQHLPKWGHLKDLHETLKLGEEVILSSEGQHS 347
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA V+ +G C AFL N D V FRN+SY LP S+SI+ DCKTVAFN+
Sbjct: 348 ELGPNQEAHVYSYGNG-CVAFLANVDSMNDTVVEFRNVSYSLPAWSVSIVLDCKTVAFNS 406
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+V +Q S + S W + E + + +A+ LL+Q+ KD SDY WY
Sbjct: 407 AKVKSQSAVVSMNPS---KSSLSWTSFDEPV-GISGSSFKAKQLLEQMETTKDTSDYLWY 462
Query: 389 TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDG 448
T R Y + L ++S ++H FVNG++ S H S + ++ + L G+N
Sbjct: 463 TTR--YATGTGSTWLSIESMRDVVHIFVNGQFQSSWHTSKSVLYNSVEAPIKLAPGSNTI 520
Query: 449 ALLSVTVGLPDSGAFLERKVAGVHRVRV------QDKSFTNCSWGYQVGLIGEKLQIYSN 502
ALLS TVGL + GAF+E AG+ + D++ + W YQVGL GE L++++
Sbjct: 521 ALLSATVGLQNFGAFIETWSAGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFTV 580
Query: 503 LGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSF 562
G V WS++ S + LTWY T F AP G+DP+AL+L SMGKG+AWVNGQSIGRYW ++
Sbjct: 581 EGSRSVNWSAV-STKKPLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAY 639
Query: 563 KTSKG-NPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPL 619
K + P Y + + + C + YHVPR+++KP GNLLVL EE G+P
Sbjct: 640 KAADSVCPESCDYRGSYDQNKCLTGCG-QSSQRWYHVPRSWMKPRGNLLVLFEETGGDPS 698
Query: 620 GITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKK-ISK 678
I T + +C V SH + W CP K+ IS+
Sbjct: 699 SIDFVTRSTNVICARVYESHPASVKLW-----------------------CPGEKQVISQ 735
Query: 679 IVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGI-HK 737
I FAS GNP+G C + GSCH++ VE+AC+G+ CS L+ F CPG+ K
Sbjct: 736 IRFASLGNPEGSCGSFKEGSCHTNDLSNTVEKACVGQRSCS---LAPDFTTSACPGVREK 792
Query: 738 ALLVDAQC 745
L V+A C
Sbjct: 793 FLAVEALC 800
>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
Length = 892
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 324/826 (39%), Positives = 441/826 (53%), Gaps = 93/826 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LIA++KEGG DVI+TY FWN HEP +GQY+F GR DI++F K + S GL++ +RIG
Sbjct: 67 MWPTLIARSKEGGADVIETYTFWNGHEPTRGQYNFEGRYDIVKFAKLVGSHGLFLFIRIG 126
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG PIWL D+ GI FR+DN P+K
Sbjct: 127 PYACAEWNFGGFPIWLRDIPGIEFRTDNAPFKEEMERYVKKIVDLMISESLFSWQGGPII 186
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E +F KG Y+ WAA+MAV GVPWVMC+Q DAP +I+ CN C
Sbjct: 187 LLQIENEYGNVESSFGPKGKLYMKWAAEMAVGLGAGVPWVMCRQTDAPEYIIDTCNAYYC 246
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PNS KP IWTE+W ++ WG + R ++DIAF +A F + GS NYYMY
Sbjct: 247 -DGFT-PNSEKKPKIWTENWNGWFADWGERLPYRPSEDIAFAIARFFQRGGSLQNYYMYF 304
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNV 267
GGTNFGRTA IT Y APLDEYGL+R+PKWGHLK+LHAAIKLC L+ +
Sbjct: 305 GGTNFGRTAGGPTQITSYDYDAPLDEYGLLRQPKWGHLKDLHAAIKLCEPALVAADSPQY 364
Query: 268 ISLGQLQEAFVFEETS-----------GVCAAFLVNNDERKAVTVLFRNISYELPRKSIS 316
I LG QEA V+ TS G+CAAF+ N DE ++ TV F + LP S+
Sbjct: 365 IKLGPKQEAHVYRGTSNNIGQYMSLNEGICAAFIANIDEHESATVKFYGQEFTLPPWSVV 424
Query: 317 ILPDCKT-------------------VAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYRE 357
+ + F + Y K S+ F + W +E
Sbjct: 425 FCQIAEIQLSTQLRWGHKLQSKQWAQILFQLGIILCFYKLSLKASSESF--SQSWMTLKE 482
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHG 409
+ + + ++G+L+ ++ KD SDY WY R + + ++ +D+ S
Sbjct: 483 PLGVWGDKNFTSKGILEHLNVTKDQSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMR 542
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
+ FVNG+ GS G V V L QG ND LLS TVGL + GAFLE+ A
Sbjct: 543 DFVRIFVNGQLAGSVKGKWIKVV----QPVKLVQGYNDILLLSETVGLQNYGAFLEKDGA 598
Query: 470 G------VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LT 521
G + + D + T W YQVGL GE L++Y W+ + T +
Sbjct: 599 GFKGQIKLTGCKSGDINLTTSLWTYQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFS 658
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTV 579
WYKT F AP G DP+AL+ SMGKG+AWVNG +GRYW + G Y A ++
Sbjct: 659 WYKTKFDAPGGTDPVALDFSSMGKGQAWVNGHHVGRYWTLVAPNNGCGRTCDYRGAYHSD 718
Query: 580 TSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSH 639
C I YH+PR++LK N+LV+ EE + P I++ T + +C V+ H
Sbjct: 719 KCRTNCGEITQA-WYHIPRSWLKTLNNVLVIFEETDKTPFDISISTRSTETICAQVSEKH 777
Query: 640 LPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSC 699
PPL W D + K P + C G IS I FAS+G+P+G C++++ G C
Sbjct: 778 YPPLHKW--SHSEFDRKLSLMDKTPEMHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKC 835
Query: 700 HSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
H+++S VV +ACIG++ CSI + + F GDPC + K+L V A+C
Sbjct: 836 HAANSLSVVSQACIGRTSCSIGISNGVF-GDPCRHVVKSLAVQAKC 880
>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 723
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 310/672 (46%), Positives = 404/672 (60%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP GQY F R +++RF+K +Q GLYV LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MA+ TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 176 LSQIENEYGPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F+ PN KP +WTE WT ++ +GG R +D+A+ VA FI GS +NYYMYH
Sbjct: 236 -ENFE-PNKAYKPKMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+R+PKWGHL++LH AIKLC L++ V
Sbjct: 294 GGTNFGRTAGGPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVS 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ SG CAAFL N D +V V F N Y+LP S+SILPDCKTV FNT
Sbjct: 354 SLGSKQEAHVYNTRSGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYRE--AILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V N S + S W Y E A D+T A GL++QIS +DA+DY
Sbjct: 414 AKV----NAPSYWPKMTPISSFSWHSYNEETASAYADDTTTMA-GLVEQISITRDATDYL 468
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY +S+ + Q P L + S GH LH F+NG+ +G+ +G DN T V+
Sbjct: 469 WYMTDIRIDSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVN 528
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR G N ++LSV VGLP+ G E AG+ + + + W Y+VGL G
Sbjct: 529 LRPGVNKLSMLSVAVGLPNVGVHFETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKG 588
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W S+ S + LTWYKTTF AP GN+P+AL++ SMGKG+ W+NG
Sbjct: 589 EALNLHTVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWING 648
Query: 553 QSIGRYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+SIGR+W ++ T++G+ + Y + T HF + YHVPRA+LKP+GN+LV+
Sbjct: 649 ESIGRHWPAY-TARGSCGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIF 707
Query: 612 EEENGNPLGITV 623
EE GNP GI++
Sbjct: 708 EEWGGNPDGISL 719
>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
Length = 874
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 331/836 (39%), Positives = 448/836 (53%), Gaps = 118/836 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LI AKEGGLD+I TYVFW+ HEP G Y+F GR D+IRF+K + GLYV LRIG
Sbjct: 53 MWPALIRNAKEGGLDMIDTYVFWDGHEPSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW +GG P WL + GI FR+ N+ +
Sbjct: 113 PYVCAEWNFGGFPAWLLKLPGIQFRTHNRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVL 172
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY ++ ++ G Y+LWAA+MA D TGVPW+MCKQ DAP +IN CNG C
Sbjct: 173 FSQIENEYGNVQGSYGTNGKTYMLWAARMAKDLETGVPWIMCKQPDAPDYIINTCNGYYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM-- 207
+ +K PNS +KP++WTE+W+ +YQ+WG R+ +D+AF VA F + G NYYM
Sbjct: 233 -DGWK-PNSRDKPAMWTENWSGWYQLWGEAAPYRTVEDVAFAVARFFQRGGVAQNYYMVR 290
Query: 208 ----------------YHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELH 250
Y GGTNFGRT+ IT YD APLDE+G++R+PKWGHLKELH
Sbjct: 291 MLHDLEQHLLMPERCQYFGGTNFGRTSGGPFITTSYDYDAPLDEFGMLRQPKWGHLKELH 350
Query: 251 AAIKLCSRPLLTGTQNVISLGQLQEAFV------------FEETSGVCAAFLVNNDERKA 298
AA+KLC L + +LG++QE F + CAAFL N D A
Sbjct: 351 AALKLCETALTSNDPLYYTLGRMQEMVQAHVYSDGSLEANFSNLATPCAAFLANIDTSSA 410
Query: 299 VTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEK------- 351
+V F Y LP S+SILPDC+ V FNT +VS Q + + K E+
Sbjct: 411 -SVKFGGNVYNLPPWSVSILPDCRNVVFNTAQVSAQTSVTKMVAVQKPSLIEEVSGSYTP 469
Query: 352 -------WEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAP-- 402
WE ++E + + A LL+QIS D++DY WY+ RF + +
Sbjct: 470 GLVEQLAWEWFQEPVGGSGINKILAHALLEQISTTNDSTDYLWYSTRFEISDQELKGGDP 529
Query: 403 -LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFT-LRNTVHLRQGTNDGALLSVTVGLPDS 460
L + S ++H FVNGE+ GS + ++ +HL+ G N A+LS TVGL +
Sbjct: 530 VLVITSMRDMVHIFVNGEFAGSTSTLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNY 589
Query: 461 GAFLERKVAG------VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIR 514
GA LE AG + + ++ T+ W +QVGL GE + + WSS
Sbjct: 590 GAHLETHGAGITGSVWIQGLSTGTRNLTSALWLHQVGLNGEH---------DAITWSSTT 640
Query: 515 S-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT-SKGNPSQ 571
S P Q L WYK F P G+DP+A++L SMGKG+AWVNG S+GR+W + S G +
Sbjct: 641 SLPFFQPLVWYKANFNIPDGDDPVAIHLGSMGKGQAWVNGHSLGRFWPAITAPSTGCSDR 700
Query: 572 TQYAVNTVTS--IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIR 629
Y +S + C + + YHVPR +L N LVLLEE GN G++ + +
Sbjct: 701 CDYRGTYYSSKCLSGCG-LPSQEWYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVD 759
Query: 630 KVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDG 689
+VC V+ LPP + +F P + SC G+ IS I FASFGNP G
Sbjct: 760 RVCAQVSEYSLPP--------------VAQFSSLPELGLSCSPGQFISSIFFASFGNPKG 805
Query: 690 DCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C + GSCH+ S+ +VE+ACIG+ CS + + FG DPCPG K L V+A C
Sbjct: 806 RCGAFQKGSCHALESETIVEKACIGRQSCSFEIFWKNFGTDPCPGKAKTLAVEAAC 861
>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 1225
Score = 568 bits (1463), Expect = e-159, Method: Compositional matrix adjust.
Identities = 310/672 (46%), Positives = 404/672 (60%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP GQY F R +++RF+K +Q GLYV LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIETYVFWNGHEPSPGQYYFEDRYELVRFVKLVQQAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTAKIVSMMKGEKLYHSQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MA+ TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 176 LSQIENEYGPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPMIDTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F+ PN KP +WTE WT ++ +GG R +D+A+ VA FI GS +NYYMYH
Sbjct: 236 -ENFE-PNKAYKPKMWTEAWTGWFTEFGGPVPYRPVEDLAYAVARFIQNRGSLINYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+R+PKWGHL++LH AIKLC L++ V
Sbjct: 294 GGTNFGRTAGGPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKLCEPALVSVDPTVS 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ SG CAAFL N D +V V F N Y+LP S+SILPDCKTV FNT
Sbjct: 354 SLGSKQEAHVYNTRSGECAAFLANYDPSTSVRVTFGNHPYDLPPWSVSILPDCKTVVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYRE--AILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V N S + S W Y E A D+T A GL++QIS +DA+DY
Sbjct: 414 AKV----NAPSYWPKMTPISSFSWHSYNEETASAYADDTTTMA-GLVEQISITRDATDYL 468
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY +S+ + Q P L + S GH LH F+NG+ +G+ +G DN T V+
Sbjct: 469 WYMTDIRIDSNEGFLKSGQWPLLTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVN 528
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR G N ++LSV VGLP+ G E AG+ + + + W Y+VGL G
Sbjct: 529 LRPGVNKLSMLSVAVGLPNVGVHFETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKG 588
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W S+ S + LTWYKTTF AP GN+P+AL++ SMGKG+ W+NG
Sbjct: 589 EALNLHTVSGSSSVEWMTGSLVSQKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWING 648
Query: 553 QSIGRYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+SIGR+W ++ T++G+ + Y + T HF + YHVPRA+LKP+GN+LV+
Sbjct: 649 ESIGRHWPAY-TARGSCGKCYYGGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIF 707
Query: 612 EEENGNPLGITV 623
EE GNP GI++
Sbjct: 708 EEWGGNPDGISL 719
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 221/499 (44%), Positives = 289/499 (57%), Gaps = 18/499 (3%)
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
I+ CNG C E FK PN KP IWTE+W+ +Y +GG R +D+AF VA FI G
Sbjct: 723 IDTCNGFYC-ENFK-PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGG 780
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPL 260
S VNYYMYHGGTNFGRT+ F+ T Y AP+DEYGL+REPKWGHL++LH AIKLC L
Sbjct: 781 SLVNYYMYHGGTNFGRTSGLFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPAL 840
Query: 261 LTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPD 320
++ LG+ QEA VF+ +SG CAAFL N D V V F N Y+LP SISILPD
Sbjct: 841 VSADPTSTWLGKDQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPD 900
Query: 321 CKTVAFNTERVSTQ---YNKRSKTSNLKFDSDEKWEEYREAILN-FDNTLLRAEGLLDQI 376
CKTV FNT RV + + + S W Y+E + + +GL++Q+
Sbjct: 901 CKTVTFNTARVRRDPKLFIPNLLMAKMTPISSFWWLSYKEEPASAYAKDTTTKDGLVEQV 960
Query: 377 SAAKDASDYFWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDN 430
S D +DY WY +S+ + Q P L V S GHILH F+NG+ +GS +GS ++
Sbjct: 961 SVTWDTTDYLWYMTDIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYGSLED 1020
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNC 484
T V+L+QG N ++LSVTVGLP+ G + AGV + + +
Sbjct: 1021 PRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKY 1080
Query: 485 SWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMG 544
W Y+VGL GE L +YS G N V W + LTWYKTTF PAGN+P+AL++ SM
Sbjct: 1081 KWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQKQPLTWYKTTFNTPAGNEPLALDMSSMS 1140
Query: 545 KGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPT 604
KG+ WVNG+SIGRY+ + S + T + + YH+PR +L P
Sbjct: 1141 KGQIWVNGRSIGRYFPGYIASGKCNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPN 1200
Query: 605 GNLLVLLEEENGNPLGITV 623
GNLL++LEE GNP GI++
Sbjct: 1201 GNLLIILEEIGGNPQGISL 1219
>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 782
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 309/668 (46%), Positives = 397/668 (59%), Gaps = 50/668 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD+I+TYVFWN HEP G+Y F R D++RFIK +Q GLYV LRIG
Sbjct: 114 MWPDLIQKAKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIG 173
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WL V GI FR+DN P+K
Sbjct: 174 PYVCAEWNYGGFPLWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPII 233
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 234 LSQIENEYGPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYC 293
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP IWTE+W+ +Y +GG R +D+AF VA FI GS VNYYMYH
Sbjct: 294 -ENFK-PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYH 351
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+ F+ T Y AP+DEYGL+REPKWGHL++LH AIKLC L++
Sbjct: 352 GGTNFGRTSGLFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALVSADPTSTW 411
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+ QEA VF+ +SG CAAFL N D V V F N Y+LP SISILPDCKTV FNT
Sbjct: 412 LGKNQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDCKTVTFNTG 471
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILN-FDNTLLRAEGLLDQISAAKDASDYFWY 388
S Q +S + + S W Y+E + + +GL++Q+S D +DY WY
Sbjct: 472 --SLQIGVKSYEAKMTPISSFWWLSYKEEPASAYAQDTTTKDGLVEQVSVTWDTTDYLWY 529
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+S+ + Q P L V S GHILH F+NG+ +GS +GS ++ T V+L+
Sbjct: 530 ILSIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVNLK 589
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG N ++LSVTVGLP+ G + AGV + + + W Y+VGL GE
Sbjct: 590 QGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLRGEI 649
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L +YS G N V W + LTWYKTTF PAGN+P+AL++ SM KG+ WVNG+SIG
Sbjct: 650 LNLYSVKGSNSVQWMKGSFQKQPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIG 709
Query: 557 RYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
RY+ + ++G ++ Y T + + YH+PR +L P GNLL++LEE
Sbjct: 710 RYFPGY-IARGKCNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIG 768
Query: 616 GNPLGITV 623
GNP GI++
Sbjct: 769 GNPQGISL 776
>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
Length = 803
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 325/797 (40%), Positives = 426/797 (53%), Gaps = 105/797 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ AKEGG+DVI+TYVFWN HEP Y F R D+++F+K +Q G+Y+ LRIG
Sbjct: 59 MWPELVQTAKEGGVDVIETYVFWNGHEPSPSNYYFEKRYDLVKFVKIVQQAGMYLILRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+P+WLH V G VFR+DN +K
Sbjct: 119 PFVAAEWNFGGVPVWLHYVPGTVFRTDNYNFKYHMQKFMTYIVNLMKKEKLFASQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E A+ E G Y +WAA+MAV + GVPW+MC+Q DAP VIN CN C
Sbjct: 179 LAQVENEYGFYESAYGEGGKRYAMWAAQMAVSQNIGVPWIMCQQFDAPNSVINTCNSFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK P P+KP IWTE+W ++Q +G R A+DIAF VA F K GS NYYMYH
Sbjct: 239 -DQFK-PIFPDKPKIWTENWPGWFQTFGAPNPHRPAEDIAFSVARFFQKGGSVQNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD +AP+DEYGL R PKW HLKELH AIKLC LL +
Sbjct: 297 GGTNFGRTSGGPFITTSYDYEAPIDEYGLARLPKWAHLKELHKAIKLCELTLLNSVPVNL 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ E SG CAAFL N DE+ TV+FRN+SY LP S+SILPDCK V FNT
Sbjct: 357 SLGPSQEADVYAEESGACAAFLANMDEKNDKTVVFRNMSYHLPAWSVSILPDCKNVVFNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-----KWEEYREAILNFDNTLLRAEGLLDQISAAKDAS 383
+V++Q + + SD+ KWE + E + + L G +D I+ KD +
Sbjct: 417 AKVNSQTSIVEMVPDDLRSSDKGTKALKWETFVENAGIWGTSDLVKNGFVDHINTTKDTT 476
Query: 384 DYFWYTFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WYT + + L ++S GH LHAFVN E G+A G+ + F +
Sbjct: 477 DYLWYTTSIFVGENEEFLKKGGRPVLLIESKGHALHAFVNQELQGTASGNGTHSPFKFKK 536
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-------WGYQV 490
V L G ND ALLS+TVGL ++G+F E AG+ V++ K F N + W Y++
Sbjct: 537 PVSLVAGKNDIALLSMTVGLQNAGSFYEWVGAGLTSVKM--KGFNNGTIDLSTFNWTYKI 594
Query: 491 GLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
GL GEKL +Y+ + + V W + P + LTWYK A
Sbjct: 595 GLQGEKLGMYNGIAVETVNWVATSKPPKDQPLTWYKRQIHAR------------------ 636
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
Q + W +N+ +I YHVPR++ KP+GN+L
Sbjct: 637 ----QMLNWMW---------------RINS-------EMILVWTRYHVPRSWFKPSGNIL 670
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
V+ EE+ G+P IT I VC V + L + G ++ K +V
Sbjct: 671 VIFEEKGGDPTKITFSRRKISGVCALVAEDYPMANLESLENAGSGSSNYKA-----SVHL 725
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
CP IS I FASFG+P G C Y+ G CH S VVE+ C+ K++C + + F
Sbjct: 726 KCPKSSIISAIKFASFGSPAGACGSYSEGECHDPKSISVVEKVCLNKNQCVVEVTEENFS 785
Query: 729 GDPCPGIHKALLVDAQC 745
CPG K L V+A C
Sbjct: 786 KGLCPGKMKKLAVEAVC 802
>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
Length = 724
Score = 566 bits (1459), Expect = e-158, Method: Compositional matrix adjust.
Identities = 311/672 (46%), Positives = 405/672 (60%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP G+Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR++N+P+K
Sbjct: 115 PYVCAEWNFGGFPVWLKYVPGMEFRTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 175 MAQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F+ PN P KP +WTE WT +Y +GG R A+DIAF VA F+ NGS+ NYYMYH
Sbjct: 235 -EGFR-PNKPYKPKMWTEVWTGWYTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD APLDEYGL+ EPK+GHL++LH AIKL L++ V
Sbjct: 293 GGTNFGRTSSGLFIATSYDYDAPLDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ SG CAAFL N D R +V V F+N Y LP SISILPDCKT +NT
Sbjct: 353 SLGSNQEAHVYRSKSGACAAFLSNYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+V++Q S W+ Y E D++ L A GL +Q + +D+SDY W
Sbjct: 413 AQVNSQ---SSSIKMTPAGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S+ N + P L V S GH+LH FVNG+ +G+ +G+ DN T V L
Sbjct: 470 YMTNVNIASNEGFLKNGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N +LLSV+VGLP+ G + AGV + ++ W Y+VGL GE
Sbjct: 530 RAGINKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGE 589
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L ++S G + V W S+ + + LTWYK TF AP GNDP+AL++ SMGKG+ W+NG+
Sbjct: 590 SLSLHSLSGSSSVEWVRGSLMAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGE 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+GR+W + ++G+ S+ YA N C + YHVPR++LKP+GNLLV+
Sbjct: 650 GVGRHWPGY-IAQGDCSKCSYAGTFNEKKCQTNCG-QPSQRWYHVPRSWLKPSGNLLVVF 707
Query: 612 EEENGNPLGITV 623
EE GNP GI++
Sbjct: 708 EEWGGNPTGISL 719
>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
Length = 874
Score = 565 bits (1456), Expect = e-158, Method: Compositional matrix adjust.
Identities = 332/836 (39%), Positives = 446/836 (53%), Gaps = 118/836 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LI AKEGGLD+I TYVFW+ HEP G Y+F GR D+IRF+K + GLYV LRIG
Sbjct: 53 MWPALIRNAKEGGLDMIDTYVFWDGHEPSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW +GG P WL + GI FR+ N+ +
Sbjct: 113 PYVCAEWNFGGFPAWLLKLPGIQFRTHNRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVL 172
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY ++ ++ G Y+LWAA+MA D TGVPW+MCKQ DAP +IN CNG C
Sbjct: 173 FSQIENEYGNVQGSYGINGKTYMLWAARMAKDLETGVPWIMCKQPDAPDYIINTCNGYYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM-- 207
+ +K PNS +KP++WTE+W+ +YQ WG R+ +D+AF VA F + G NYYM
Sbjct: 233 -DGWK-PNSRDKPAMWTENWSGWYQSWGEAAPYRTVEDVAFAVARFFQRGGVAQNYYMVR 290
Query: 208 ----------------YHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELH 250
Y GGTNFGRT+ IT YD APLDE+G++R+PKWGHLKELH
Sbjct: 291 TLHDLEQRLLMPERCQYFGGTNFGRTSGGPFITTSYDYDAPLDEFGMLRQPKWGHLKELH 350
Query: 251 AAIKLCSRPLLTGTQNVISLGQLQEAFV------------FEETSGVCAAFLVNNDERKA 298
AA+KLC L + +LG++QE F + CAAFL N D A
Sbjct: 351 AALKLCETALTSNDPVYYTLGRMQEMVQAHVYSDGSLEANFSNLATPCAAFLANIDTSSA 410
Query: 299 VTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEK------- 351
+V F Y LP S+SILPDC+ V FNT +VS Q + + K E+
Sbjct: 411 -SVKFGGKVYNLPPWSVSILPDCRNVVFNTAQVSAQTSVTKMVAVQKPSLIEEVSGSYTP 469
Query: 352 -------WEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAP-- 402
WE ++E + + A LL+QIS D++DY WY+ RF +
Sbjct: 470 GLVEQLAWEWFQEPVGGSGINKILAHALLEQISTTNDSTDYMWYSTRFEILDQELKGGDP 529
Query: 403 -LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFT-LRNTVHLRQGTNDGALLSVTVGLPDS 460
L + S ++H FVNGE+ GS + ++ +HL+ G N A+LS TVGL +
Sbjct: 530 VLVITSMRDMVHIFVNGEFAGSTSTLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNY 589
Query: 461 GAFLERKVAG------VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIR 514
GA LE AG + + ++ T+ W +QVGL GE + + WSS
Sbjct: 590 GAHLETHGAGITGSIWIQGLSTGTRNLTSALWLHQVGLNGEH---------DAITWSSTT 640
Query: 515 S-PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW-VSFKTSKGNPSQ 571
S P Q L WYK F P G+DP+A++L SMGKG+AWVNG S+GR+W V S G +
Sbjct: 641 SLPFFQPLVWYKANFNIPDGDDPVAIHLGSMGKGQAWVNGHSLGRFWPVITAPSTGCSDR 700
Query: 572 TQYAVNTVTS--IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIR 629
Y +S + C + + YHVPR +L N LVLLEE GN G++ + +
Sbjct: 701 CDYRGTYYSSKCLSSCG-LPSQEWYHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVD 759
Query: 630 KVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDG 689
+VC V+ LPP + +F P + SC G+ IS I FASFGNP G
Sbjct: 760 RVCAQVSEYSLPP--------------VAQFSSLPELGLSCSPGQFISSIFFASFGNPKG 805
Query: 690 DCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C + GSCH+ S+ +VE+ACIG+ CS + + FG DPCPG K L V+A C
Sbjct: 806 RCGAFQKGSCHALESETIVEKACIGRQSCSFEIFWKNFGTDPCPGKAKTLAVEAAC 861
>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
Length = 728
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 314/674 (46%), Positives = 393/674 (58%), Gaps = 62/674 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP GQY F GR D++RFIK Q GLYV LRIG
Sbjct: 59 MWPGLIQKAKEGGLDVIQTYVFWNGHEPSPGQYYFEGRYDLVRFIKLAQQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 119 LYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVNLMKSEKLFESQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 179 MSQIENEYGPVEWEIGAPGKAYTKWAAEMAVGLDTGVPWIMCKQEDAPDPIIDTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE WT +Y +GG + R +D+A+ VA FI NGS+VNYYMYH
Sbjct: 239 -EGFT-PNKNYKPKMWTEAWTGWYTEFGGPIHNRPVEDLAYSVARFIQNNGSFVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTAA + YD AP+DEYGL REPKWGHL++LH AIKLC L++ V
Sbjct: 297 GGTNFGRTAAGLFVATSYDYDAPIDEYGLPREPKWGHLRDLHKAIKLCEPSLVSAYPTVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G+ E VF+ S CAAFL N D V F+N+ Y+LP SISILPDCK FNT
Sbjct: 357 WPGKNLEVHVFKSKSS-CAAFLANYDPSSPAKVTFQNMQYDLPPWSISILPDCKNAVFNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDE----KWEEYREAILNFDNT-LLRAEGLLDQISAAKDAS 383
RVS SK+S +K W+ Y E ++ D++ + GL +QIS +D S
Sbjct: 416 ARVS------SKSSQMKMTPVSGGAFSWQSYIEETVSADDSDTIAKNGLWEQISITRDGS 469
Query: 384 DYFWYT--FRFHYNSS---NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WY H N N Q+P L V S GH LH F+NG+ G+ +GS +N T N
Sbjct: 470 DYLWYLTDVNIHPNEGFLKNGQSPVLTVMSAGHALHVFINGQLAGTVYGSLENPKLTFSN 529
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
V LR G N +LLS VGLP+ G E GV + + T W Y+VG
Sbjct: 530 NVKLRAGINKISLLSAAVGLPNVGLHFETWNTGVLGPVTLKGLNEGTRDLTKQKWSYKVG 589
Query: 492 LIGEKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
L GE L +++ G + V W S+ + + LTWYK TF AP GNDP+AL++ +MGKG+ W
Sbjct: 590 LKGEDLSLHTLSGSSSVEWVQGSLLAQKQPLTWYKATFNAPEGNDPLALDMNTMGKGQIW 649
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLL 608
+NG+SIGR+W +K S GN YA +A+ YHVPR++LKP+GN L
Sbjct: 650 INGESIGRHWPEYKAS-GNCGGCSYAGIYTEKKCLSNCGEASQRWYHVPRSWLKPSGNFL 708
Query: 609 VLLEEENGNPLGIT 622
V+ EE G+P GI+
Sbjct: 709 VVFEELGGDPTGIS 722
>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
Length = 450
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 285/484 (58%), Positives = 341/484 (70%), Gaps = 39/484 (8%)
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
IENEY IE AFHEKG YV WAAKMAVD TGVPW+MCKQ DAP PVIN CNGM+CGET
Sbjct: 1 IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKCGET 60
Query: 153 FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 212
F GPNSPNKPS+WTE+WTSFYQV+GG+PYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT
Sbjct: 61 FGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 120
Query: 213 NFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQ 272
NFGRTAAA++ITGYYDQAPLDEYGL+R+PKWGHLKELHA IK CS LL G Q +S+GQ
Sbjct: 121 NFGRTAAAYVITGYYDQAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTNLSVGQ 180
Query: 273 LQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVS 332
LQ+A++FE G C AFLVNND A TV FRN S+EL KSISILPDC + FNT +V+
Sbjct: 181 LQQAYMFEAQGGGCVAFLVNNDSVNA-TVGFRNKSFELLPKSISILPDCDNIIFNTAKVN 239
Query: 333 TQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRF 392
N+R TS+ K ++ WE+Y + I N+ ++ ++++ LL+ ++ KD SDY WYTF F
Sbjct: 240 AGSNRRITTSSKKLNT---WEKYIDVIPNYSDSTIKSDTLLEHMNTTKDKSDYLWYTFSF 296
Query: 393 HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHD-NVSFTLRNTVHLRQG--TNDGA 449
N S + L V+S H+ +AFVN +Y+GSAHGS + V F + + L +N+ +
Sbjct: 297 QPNLSCTKPLLHVESLAHVAYAFVNNKYSGSAHGSKNGKVPFIMEVPIVLDDDGLSNNIS 356
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVL 509
+LSV VGL VGL+GE LQ+Y L V
Sbjct: 357 ILSVLVGL-------------------------------SVGLLGETLQLYGKEHLEMVK 385
Query: 510 WSSIR-SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGN 568
WS S + LTW+K F P GNDP+ LNL +M KGEAWVNGQSIGRYW+SF TSKG+
Sbjct: 386 WSKADISIAQPLTWFKLEFDTPKGNDPVVLNLATMSKGEAWVNGQSIGRYWISFLTSKGH 445
Query: 569 PSQT 572
PSQT
Sbjct: 446 PSQT 449
>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
Length = 724
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 310/672 (46%), Positives = 404/672 (60%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN H P G+Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIETYVFWNGHGPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR++N+P+K
Sbjct: 115 PYVCAEWNFGGFPVWLKYVPGMEFRTNNQPFKVAMRGFVQKIVNMMKSENLFESQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 175 MAQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F+ PN P KP +WTE WT +Y +GG R A+DIAF VA F+ NGS+ NYYMYH
Sbjct: 235 -EGFR-PNKPYKPKMWTEVWTGWYTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD APLDEYGL+ EPK+GHL++LH AIKL L++ V
Sbjct: 293 GGTNFGRTSSGLFIATSYDYDAPLDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ SG CAAFL N D R +V V F+N Y LP SISILPDCKT +NT
Sbjct: 353 SLGSNQEAHVYRSKSGACAAFLSNYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+V++Q S W+ Y E D++ L A GL +Q + +D+SDY W
Sbjct: 413 AQVNSQ---SSSIKMTPAGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S+ N + P L V S GH+LH FVNG+ +G+ +G+ DN T V L
Sbjct: 470 YMTNVNIASNEGFLKNGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N +LLSV+VGLP+ G + AGV + ++ W Y+VGL GE
Sbjct: 530 RAGINKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGE 589
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L ++S G + V W S+ + + LTWYK TF AP GNDP+AL++ SMGKG+ W+NG+
Sbjct: 590 SLSLHSLSGSSSVEWVRGSLVAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGE 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+GR+W + ++G+ S+ YA N C + YHVPR++LKP+GNLLV+
Sbjct: 650 GVGRHWPGY-IAQGDCSKCSYAGTFNEKKCQTNCG-QPSQRWYHVPRSWLKPSGNLLVVF 707
Query: 612 EEENGNPLGITV 623
EE GNP GI++
Sbjct: 708 EEWGGNPTGISL 719
>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
Length = 736
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 309/670 (46%), Positives = 399/670 (59%), Gaps = 53/670 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK GGLDVI TYVFW++HEP G YDF GR D++RFIK +Q GLY LRIG
Sbjct: 60 MWEDLIWKAKHGGLDVIDTYVFWDVHEPSPGNYDFEGRYDLVRFIKTVQKVGLYANLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTI------EPAFHEKGPP-- 110
P++ +EW +GG+P+WL V G+ FR+DN+P+K ++ Q I E F +G P
Sbjct: 120 PYVCAEWNFGGIPVWLKYVPGVSFRTDNEPFKAAMQGFTQKIVQMMKSEKLFQSQGGPII 179
Query: 111 -------------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
YV WAA MAV TGVPWVMCK++DAP PVIN+CNG C +
Sbjct: 180 LSQIENEYGPESRGAAGRAYVNWAASMAVGLGTGVPWVMCKENDAPDPVINSCNGFYCDD 239
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
PN P KPS+WTE W+ ++ +GG + R +D++F VA FI K GSYVNYYMYHGG
Sbjct: 240 F--SPNKPYKPSMWTETWSGWFTEFGGPIHQRPVEDLSFAVARFIQKGGSYVNYYMYHGG 297
Query: 212 TNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
TNFGR+A IT YD AP+DEYGL+R+PK+ HLKELH AIK C L++ V+SL
Sbjct: 298 TNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYSHLKELHKAIKRCEHALVSLDPTVLSL 357
Query: 271 GQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTER 330
G L +A VF +G CAAFL N + + A TV F N Y+LP SISILPDCK FNT +
Sbjct: 358 GTLLQAHVFSSGTGTCAAFLANYNAQSAATVTFNNRHYDLPPWSISILPDCKIDVFNTAK 417
Query: 331 VSTQYNKRSKTSNLKFDSDE-KWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYFWY 388
V Q S+ L WE Y E + + +++ + A GLL+Q++ +D SDY WY
Sbjct: 418 VRVQ---PSQVKMLPVKPKLFSWESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWY 474
Query: 389 TFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+SS + Q P ++VQS GH +H FVNG+++GSA G+ + S T V LR
Sbjct: 475 ITSVDISSSESFLRGGQKPSINVQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLR 534
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLSVTVGL + G E AG+ H + K T W Y+VGL GE
Sbjct: 535 AGANKIALLSVTVGLQNVGRHYETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEA 594
Query: 497 LQIYSNLGLNKVLWSSIRSPTR---QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
+ + S G++ V W T+ QL WYK F AP G +P+AL+L+SMGKG+ W+NGQ
Sbjct: 595 MNLVSPNGVSSVDWVQESQATQSRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQ 654
Query: 554 SIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE 613
SIGRYW+++ N V C YHVPR++LKPT NL+V+ EE
Sbjct: 655 SIGRYWMAYAKGDCNSCTYSGTFRPVKCQLGCG-QPTQRWYHVPRSWLKPTKNLIVVFEE 713
Query: 614 ENGNPLGITV 623
GNP I++
Sbjct: 714 LGGNPWKISL 723
>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
Length = 724
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 309/672 (45%), Positives = 404/672 (60%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP G+Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIETYVFWNGHEPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR++N+P+K
Sbjct: 115 PYVCAEWNFGGFPVWLKYVPGMEFRTNNQPFKVAMQGFVQKIVNMMKSENLFESQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPW+MCK++DAP PVI+ CNG C
Sbjct: 175 MAQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLKTGVPWIMCKREDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F+ PN P KP +WTE WT +Y +GG R A+DIAF VA F+ NGS+ NYYMYH
Sbjct: 235 -EGFR-PNKPYKPKMWTEVWTGWYTKFGGPIPQRPAEDIAFSVARFVQNNGSFFNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD APLDEYGL+ EPK+GHL++LH AIKL L++ V
Sbjct: 293 GGTNFGRTSSGLFIATSYDYDAPLDEYGLLNEPKYGHLRDLHKAIKLSEPALVSSYAAVT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ SG CAAFL N D R +V V F+N Y LP SISILPDCKT +NT
Sbjct: 353 SLGSNQEAHVYRSKSGACAAFLSNYDSRYSVKVTFQNRPYNLPPWSISILPDCKTAVYNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+V++Q S W+ Y E D++ L A GL +Q + +D+SDY W
Sbjct: 413 AQVNSQ---SSSIKMTPAGGGLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + S+ N + P L V S GH+LH FVNG+ +G+ +G+ DN T V L
Sbjct: 470 YMTNVNIASNEGFLRNGKDPYLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N +LLSV+VGLP+ G + AGV + ++ W Y+VGL GE
Sbjct: 530 RAGINKISLLSVSVGLPNVGVHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGE 589
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L ++S G + V W S+ + + LTWYK TF AP GNDP+AL + SMGKG+ W+NG+
Sbjct: 590 SLSLHSLSGSSSVEWVRGSLVAQKQPLTWYKATFNAPGGNDPLALGMASMGKGQIWINGE 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+GR+W + ++G+ S+ YA N C + +HVPR++LKP+GNLLV+
Sbjct: 650 GVGRHWPGY-IAQGDCSKCSYAGTFNEKKCQTNCG-QPSQRWHHVPRSWLKPSGNLLVVF 707
Query: 612 EEENGNPLGITV 623
EE GNP GI++
Sbjct: 708 EEWGGNPTGISL 719
>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 725
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 304/671 (45%), Positives = 397/671 (59%), Gaps = 55/671 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP GQY+F R D++RF+K + GLYV LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIETYVFWNGHEPSPGQYNFEDRYDLVRFVKLVHQAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFKAAMQKFTEKIVGLMKGEKLYESQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MA+ +TGVPWVMCKQDDAP PVI+ CNG C
Sbjct: 176 LSQIENEYGPVEWEIGAPGKSYTKWAAQMALGLNTGVPWVMCKQDDAPDPVIDTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT ++ +GG R +D+A+ VA FI GS++NYYMYH
Sbjct: 236 -ENFK-PNKVYKPKMWTEAWTGWFTEFGGPAPYRPVEDMAYSVARFIQNGGSFINYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DEYGL+REPKW HL++LH AIKLC L++ V
Sbjct: 294 GGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWSHLRDLHKAIKLCEPALVSVDPTVS 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ SG CAAFL N D + TV F N Y+LP S+SILPDCK+V FNT
Sbjct: 354 YLGSNQEAHVFKTRSGSCAAFLANYDASSSATVTFGNNQYDLPPWSVSILPDCKSVIFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILN-FDNTLLRAEGLLDQISAAKDASDYFW 387
+V ++ T F W Y E + + GL++QIS +D++DY W
Sbjct: 414 AKVGAPTSQPKMTPVSSFS----WLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLW 469
Query: 388 YT--FRFHYNSS---NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y R N + Q P L V S GH LH F+NG+ +G+ +G +N T V+L
Sbjct: 470 YMTDIRIDPNEGFLKSGQWPLLTVFSAGHALHVFINGQLSGTTYGGSENYKLTFSKYVNL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ++LSV VGLP+ G E GV + + + W Y++GL GE
Sbjct: 530 RAGINKLSILSVAVGLPNGGLHYETWNTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGE 589
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L ++S G + V W S+ + + LTWYKTTF +P GN+P+AL++ SMGKG+ W+NGQ
Sbjct: 590 ALNLHSVSGSSSVEWVTGSLVAQKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQ 649
Query: 554 SIGRYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
SIGR+W ++ T+KG+ + Y + H + YHVPRA+LK +GN+LV+ E
Sbjct: 650 SIGRHWPAY-TAKGSCGKCNYGGIFNEKKCHSXCGEPSQRWYHVPRAWLKSSGNVLVIFE 708
Query: 613 EENGNPLGITV 623
E GNP GI++
Sbjct: 709 EWGGNPEGISL 719
>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
Length = 745
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 308/673 (45%), Positives = 399/673 (59%), Gaps = 58/673 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWN+HEP G Y+F GR D+++FIK +Q +GLYV LRIG
Sbjct: 59 MWEDLIQKAKDGGLDVIDTYVFWNVHEPSPGNYNFEGRYDLVQFIKTVQKKGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY A G Y WAAKMAV TGVPWVMCK+DDAP PVINACNG C
Sbjct: 179 LSQIENEYGPQGRALGASGHAYSNWAAKMAVGLGTGVPWVMCKEDDAPDPVINACNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP +WTE W+ ++ +GG R +D+AF VA FI K GS+ NYYMYH
Sbjct: 239 DDF--SPNKPYKPKLWTESWSGWFSEFGGSNPQRPVEDLAFAVARFIQKGGSFFNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+REPK+GHLK+LH AIK C L++ V
Sbjct: 297 GGTNFGRSAGGPFITTSYDYDAPIDEYGLLREPKYGHLKDLHKAIKQCEHALVSSDPTVT 356
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A VF + CAAFL N A V F N Y+LP SISILPDC+T FNT
Sbjct: 357 SLGAYEQAHVFSSGT-TCAAFLANYHSNSAARVTFNNRHYDLPPWSISILPDCRTDVFNT 415
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
R+ Q ++ + SN K S WE Y E + + +++ + A LL+QI A +D SDY
Sbjct: 416 ARMRFQPSQIQMLPSNSKLLS---WETYDEDVSSLAESSRITASRLLEQIDATRDTSDYL 472
Query: 387 WYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY +SS + + + V S G +H F+NG+++GSA G+ ++ SFT +
Sbjct: 473 WYITSVDISSSESFLRGRNKPSISVHSSGDAVHVFINGKFSGSAFGTREDRSFTFNGPID 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR GTN ALLSV VGLP+ G E +G+ H + K T W YQVGL G
Sbjct: 533 LRAGTNKIALLSVAVGLPNGGIHFESWKSGITGPVLLHDLDHGQKDLTGQKWSYQVGLKG 592
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTR---QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S ++ QL W+K F AP G +P+AL++ SMGKG+ W+N
Sbjct: 593 EAMNLVSPNGVSSVDWVSESLASQNQPQLKWHKAHFNAPNGVEPLALDMSSMGKGQVWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW+ + +KGN + YA + + T YHVPR++LKP NL+V+
Sbjct: 653 GQSIGRYWMVY--AKGNCNSCNYAGTYRQAKCQVGCGQPTQRWYHVPRSWLKPKNNLMVV 710
Query: 611 LEEENGNPLGITV 623
EE GNP I++
Sbjct: 711 FEELGGNPWKISL 723
>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
Length = 805
Score = 560 bits (1442), Expect = e-156, Method: Compositional matrix adjust.
Identities = 322/790 (40%), Positives = 437/790 (55%), Gaps = 81/790 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAKEGGLDVI+TYVFW+ HEP GQY F GR D+++F+K +Q GL + LRIG
Sbjct: 50 MWPGIIQKAKEGGLDVIETYVFWDRHEPSPGQYYFEGRYDLVKFVKLVQQAGLLMNLRIG 109
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW GG PIWL D+ IVFR+DN+P+K
Sbjct: 110 PYVCAEWNLGGFPIWLRDIPHIVFRTDNEPFKKYMQSFLTKIVNMMKEENLFASQGGPII 169
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY ++ + E G Y+ WAA+MA +TGVPW+MC Q P +I+ CNGM C
Sbjct: 170 LAQVENEYGNVDSHYGEAGVRYINWAAEMAQAQNTGVPWIMCAQSKVPEYIIDTCNGMYC 229
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM-- 207
P KP++WTE +T ++ +G R +DIAF VA F + GS+ NYYM
Sbjct: 230 DGW--NPILYKKPTMWTESYTGWFTYYGWPIPHRPVEDIAFAVARFFERGGSFHNYYMVW 287
Query: 208 YHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQN 266
Y GGTNFGRT+ + YD APLDEYG+ PKWGHLK+LH +KL +L+
Sbjct: 288 YFGGTNFGRTSGGPYVASSYDYDAPLDEYGMQHLPKWGHLKDLHETLKLGEEVILSSEGQ 347
Query: 267 VISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAF 326
LG QEA V+ +G C AFL N D V FRN+SY LP S+SIL DCKTVAF
Sbjct: 348 HSELGPNQEAHVYSYGNG-CVAFLANVDSMNDTVVEFRNVSYSLPAWSVSILLDCKTVAF 406
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
N+ +V +Q S + + S W + E + + +A+ LL+Q+ KD SDY
Sbjct: 407 NSAKVKSQSAVVSMSPS---KSTLSWTSFDEPV-GISGSSFKAKQLLEQMETTKDTSDYL 462
Query: 387 WYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
WYT + + L ++S ++H FVNG++ S H S + ++ + L G+N
Sbjct: 463 WYTTSVEATGTGSTW-LSIESMRDVVHIFVNGQFQSSWHTSKSVLYNSVEAPITLAPGSN 521
Query: 447 DGALLSVTVGLPDSGAFLERKVAGVHRVRV------QDKSFTNCSWGYQVGLIGEKLQIY 500
ALLS TVGL + GAF+E AG+ + D++ + W YQVGL GE L+++
Sbjct: 522 TIALLSATVGLQNFGAFIETWSAGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLF 581
Query: 501 SNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWV 560
+ G V WS++ S + LTWY T F AP G+DP+AL+L SMGKG+AWVNGQSIGRYW
Sbjct: 582 TVEGSRSVNWSAV-STEKPLTWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWP 640
Query: 561 SFKTSKG-NPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
++K + P Y + + + C + YHVPR+++KP GNLLVL EE G+
Sbjct: 641 AYKAADSVCPESCDYRGSYDQNKCLTGCG-QSSQRWYHVPRSWMKPRGNLLVLFEETGGD 699
Query: 618 PLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKK-I 676
P I T + +C V SH + W CP K+ I
Sbjct: 700 PSSIDFVTRSTNVICARVYESHPASVKLW-----------------------CPGEKQVI 736
Query: 677 SKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGI- 735
S+I FAS GNP+G C + GSCH++ VE+AC+G+ CS L+ F CPG+
Sbjct: 737 SQIRFASLGNPEGSCGSFKEGSCHTNDLSNTVEKACVGQRSCS---LAPDFTISACPGVR 793
Query: 736 HKALLVDAQC 745
K L V+A C
Sbjct: 794 EKFLAVEALC 803
>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
Length = 729
Score = 560 bits (1442), Expect = e-156, Method: Compositional matrix adjust.
Identities = 308/669 (46%), Positives = 397/669 (59%), Gaps = 58/669 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK GGLDVI TYVFW++HEP G YDF GR D++RFIK +Q GLY LRIG
Sbjct: 60 MWEDLIWKAKHGGLDVIDTYVFWDVHEPSPGNYDFEGRYDLVRFIKTVQKVGLYANLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTI------EPAFHEKGPP-- 110
P++ +EW +GG+P+WL V G+ FR+DN+P+K ++ Q I E F +G P
Sbjct: 120 PYVCAEWNFGGIPVWLKYVPGVSFRTDNEPFKAAMQGFTQKIVQMMKSEKLFQSQGGPII 179
Query: 111 -------------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
YV WAA MAV TGVPWVMCK++DAP PVIN+CNG C +
Sbjct: 180 LSQIENEYGPESRGAAGRAYVNWAASMAVGLGTGVPWVMCKENDAPDPVINSCNGFYCDD 239
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
PN P KPS+WTE W+ ++ +GG + R +D++F VA FI K GSYVNYYMYHGG
Sbjct: 240 F--SPNKPYKPSMWTETWSGWFTEFGGPIHQRPVEDLSFAVARFIQKGGSYVNYYMYHGG 297
Query: 212 TNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
TNFGR+A IT YD AP+DEYGL+R+PK+ HLKELH AIK C L++ V+SL
Sbjct: 298 TNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYSHLKELHKAIKRCEHALVSLDPTVLSL 357
Query: 271 GQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTER 330
G L +A VF +G CAAFL N + + A TV F N Y+LP SISILPDCK FNT +
Sbjct: 358 GTLLQAHVFSSGTGTCAAFLANYNAQSAATVTFNNRHYDLPPWSISILPDCKIDVFNTAK 417
Query: 331 VSTQYNKRSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYFWYT 389
V K S WE Y E + + +++ + A GLL+Q++ +D SDY WY
Sbjct: 418 VKMLPVKPKLFS---------WESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYI 468
Query: 390 FRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
+SS + Q P ++VQS GH +H FVNG+++GSA G+ + S T V LR
Sbjct: 469 TSVDISSSESFLRGGQKPSINVQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRA 528
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKL 497
G N ALLSVTVGL + G E AG+ H + K T W Y+VGL GE +
Sbjct: 529 GANKIALLSVTVGLQNVGRHYETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAM 588
Query: 498 QIYSNLGLNKVLWSSIRSPTR---QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
+ S G++ V W T+ QL WYK F AP G +P+AL+L+SMGKG+ W+NGQS
Sbjct: 589 NLVSPNGVSSVDWVQESQATQSRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQS 648
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
IGRYW+++ N V C YHVPR++LKPT NL+V+ EE
Sbjct: 649 IGRYWMAYAKGDCNSCTYSGTFRPVKCQLGCG-QPTQRWYHVPRSWLKPTKNLIVVFEEL 707
Query: 615 NGNPLGITV 623
GNP I++
Sbjct: 708 GGNPWKISL 716
>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 831
Score = 560 bits (1442), Expect = e-156, Method: Compositional matrix adjust.
Identities = 320/802 (39%), Positives = 437/802 (54%), Gaps = 87/802 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + YDFSG NDIIRF+K IQ GLY LRIG
Sbjct: 60 MWPELIQKAKEGGLDAIETYVFWNAHEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG+P+W+H++ + R+ N +
Sbjct: 120 PYVCAEWNYGGIPVWVHNLPDVEIRTANSVFMNEMQNFTTLIVDMLKKEKLFASQGGPII 179
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + + + G Y+ W A MA GVPW+MC++ DAP P+IN CNG C
Sbjct: 180 LTQIENEYGNVISQYGDAGKAYMNWCANMAESLKVGVPWIMCQESDAPQPMINTCNGWYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F+ PNS N P +WTE+W +++ WGG+ R+A+D+AF VA F G++ NYYMYH
Sbjct: 240 -DNFE-PNSFNSPKMWTENWIGWFKNWGGRDPHRTAEDVAFAVARFFQTGGTFQNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG + +PKWGHLKELH+A+K L +G +
Sbjct: 298 GGTNFGRTAGGPYITTSYDYDAPLDEYGNIAQPKWGHLKELHSALKAMEEALTSGNVSET 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG + ++ T+G + FL N + T+ FR +Y +P S+SILPDC+ +NT
Sbjct: 358 DLGNSVKVTIY-ATNGSSSCFLSNTNTTADATLTFRGNNYTVPAWSVSILPDCQHEEYNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTL-----LRAEGLLDQISAAKDAS 383
+V Q + +K N K + + ++ N D L + A LLDQ AA DAS
Sbjct: 417 AKVKEQTSVMTK-ENSKAEKEAAILKWVWRSENIDKALHGKSNVSAHRLLDQKDAANDAS 475
Query: 384 DYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
DY WY + H + L + GH++HAFVNGEY S ++ + +
Sbjct: 476 DYLWYMTKLHVKHDDPVWSENMTLRINGSGHVIHAFVNGEYIDSHWATYGIHNDKFEPKI 535
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAG----VHRVRVQD-----KSFTNCSWGYQV 490
L+ GTN +LLSVTVGL + GAF + AG + V V+ K+ ++ W Y++
Sbjct: 536 KLKHGTNTISLLSVTVGLQNYGAFFDTWHAGLVGPIELVSVKGEETIIKNLSSHKWSYKI 595
Query: 491 GLIGEKLQIYSNLG--LNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
GL G +++S+ + W S + PT R LTWYKTTF+AP G DP+ ++LQ MGKG
Sbjct: 596 GLHGWDHKLFSDDSPFAAQSKWESEKLPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGY 655
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPT 604
AWVNG++IGR W S+ + S S C T YHVPR++LK
Sbjct: 656 AWVNGKNIGRIWPSYNAEEDGCSDEPCDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDG 715
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
N LVL E GNP + T+ + VC + +
Sbjct: 716 ANTLVLFAELGGNPSLVNFQTVVVGNVCANAY-------------------------ENK 750
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS-SHSQGVVERACIGKSRCSIPLL 723
T++ SC G+KIS I FASFG+P G C + GSC S S++ +V++AC+GK CSI L
Sbjct: 751 TLELSCQ-GRKISAIKFASFGDPKGVCGAFTNGSCESKSNALPIVQKACVGKEACSIDLS 809
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
+ FG C + K L V+A C
Sbjct: 810 EKTFGATACGNLAKRLAVEAVC 831
>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 763
Score = 559 bits (1441), Expect = e-156, Method: Compositional matrix adjust.
Identities = 319/783 (40%), Positives = 437/783 (55%), Gaps = 90/783 (11%)
Query: 32 QYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY 91
QYDF GRND++RF+K GLYV LRIGP++ +EW YGG P+WLH + GI R+DN+P+
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 92 K-------------------------------IENEYQTIEPAFHEKGPPYVLWAAKMAV 120
K IENEY I ++ G Y+ WAA MAV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
TGVPWVMC+Q DAP P+IN CNG C + P+ P++P +WTE+W+ ++ +GG
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFT--PSLPSRPKLWTENWSGWFLSFGGAV 178
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVR 239
R +D+AF VA F + G+ NYYMYHGGTNFGR++ I+ YD AP+DEYGLVR
Sbjct: 179 PYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVR 238
Query: 240 EPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAV 299
+PKWGHL+++H AIK+C L+ + +SLGQ EA V++ S +CAAFL N D++
Sbjct: 239 QPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGS-LCAAFLANIDDQSDK 297
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSD---------- 349
TV F +Y+LP S+SILPDCK V NT ++++Q ++ NL F +
Sbjct: 298 TVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQV-ASTQMRNLGFSTQASDGSSVEAE 356
Query: 350 ---EKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS-----SNAQA 401
W E + L GL++QI+ DASD+ WY+ + +Q+
Sbjct: 357 LAASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQS 416
Query: 402 PLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSG 461
L V S GH+L F+NG+ GS+ GS + +L V L G N LLS TVGL + G
Sbjct: 417 NLPVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYG 476
Query: 462 AFLERKVAGVH-RVRVQDK----SFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRS- 515
AF + AG+ V++ ++ W YQ+GL GE L +Y N W S S
Sbjct: 477 AFFDLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLY-NPSEASPEWVSDNSY 535
Query: 516 PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY 574
PT LTWYK+ F APAG+DP+A++ MGKGEAWVNGQSIGRYW P+
Sbjct: 536 PTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYW---------PTNIAP 586
Query: 575 AVNTVTSIHFCAIIKATNT-----------YHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
+ V S ++ AT YHVPR+FL+P N +VL E+ GNP I+
Sbjct: 587 QSDCVNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISF 646
Query: 624 DTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPL-GKKISKIVFA 682
T VC HV+ H + SW+ +Q+ +++ G P ++ CP G+ IS I FA
Sbjct: 647 TTKQTESVCAHVSEDHPDQIDSWVSSQQK----LQRSG--PALRLECPKEGQVISSIKFA 700
Query: 683 SFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVD 742
SFG P G C Y+ G C SS + V + AC+G S CS+P+ ++ F GDPC G+ K+L+V+
Sbjct: 701 SFGTPSGTCGSYSHGECSSSQALAVAQEACVGVSSCSVPVSAKNF-GDPCRGVTKSLVVE 759
Query: 743 AQC 745
A C
Sbjct: 760 AAC 762
>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 846
Score = 559 bits (1440), Expect = e-156, Method: Compositional matrix adjust.
Identities = 297/760 (39%), Positives = 423/760 (55%), Gaps = 72/760 (9%)
Query: 32 QYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY 91
Q F GRND+I+F+K IQS +Y +RIGPFI++EW +GGLP WL ++ I+FR++N+PY
Sbjct: 105 QVQFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEPY 164
Query: 92 K-------------------------------IENEYQTIEPAFHEKGPPYVLWAAKMAV 120
K IENEY I+ +G Y+ WAA+MA+
Sbjct: 165 KKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMAI 224
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
+TGVPW+MCKQ APG VI CNG CG+T+ + NKP +WTE+WT+ ++ +G +
Sbjct: 225 STNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDK-NKPRLWTENWTAQFRAFGDQL 283
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVRE 240
+RSA+DIA+ V F AK G+ VNYYMY+GGTNFGRT A++++TGYYD+ P+DEYG+ +
Sbjct: 284 ALRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRTGASYVLTGYYDEGPVDEYGMPKA 343
Query: 241 PKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFE-ETSGVCAAFLVNNDERKAV 299
PK+GHL++LH IK SR L G Q+ L EA FE +C AF+ NN+ +
Sbjct: 344 PKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGEDG 403
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAI 359
TV FR Y +P +S+SIL DCK V +NT+RV Q+++RS + K WE Y E I
Sbjct: 404 TVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQKLAKSNAWEMYSEPI 463
Query: 360 LNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS------NAQAPLDVQSHGHILH 413
+ T +R + ++Q + KD SDY WYT F + + + + V+S H L
Sbjct: 464 PRYKLTSIRNKEPMEQYNLTKDDSDYLWYTTSFRLEADDLPFRGDIRPVVQVKSTSHALM 523
Query: 414 AFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHR 473
FVN + G+ GS F ++LR G N ALLS ++G+ DSG L G+
Sbjct: 524 GFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEVKGGIQD 583
Query: 474 VRVQDKSFTNCS-----WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFR 528
+Q + WG++V L GE +IY+ G+ V W + R +TWYK F
Sbjct: 584 CTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPATT-GRAVTWYKRYFD 642
Query: 529 APAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAII 588
P G DP+ L++ SMGKG +VNG+ +GRYW S++T G PSQ
Sbjct: 643 EPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVGGVPSQA---------------- 686
Query: 589 KATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLR 648
YH+PR FLKP NLLV+ EEE G P GI + T+ +C ++ + + +W
Sbjct: 687 ----MYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRRDDICVFISEHNPAQIKTW-- 740
Query: 649 HRQRGDTDIKKFGKKPTVQP--SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQG 706
+ IK + + + CP K I ++VFASFGNP+G C + GSCH+ +++
Sbjct: 741 --DKDGGQIKVIAEDHSTRGILKCPPKKTIQEVVFASFGNPEGSCANFTAGSCHTPNAKD 798
Query: 707 VVERACIGKSRCSIPLLSRYFGGD-PCPGIHKALLVDAQC 745
+V + C+GK C +P+L +G D CP L V +C
Sbjct: 799 IVAKECLGKKSCVLPVLHTVYGADINCPTTTATLAVQVRC 838
>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 310/799 (38%), Positives = 429/799 (53%), Gaps = 83/799 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI K+KEGGLDVI+TYVFWN+HEP GQYDFSG D++RFIK IQ+QGLY LRIG
Sbjct: 57 MWPSLIEKSKEGGLDVIETYVFWNVHEPHPGQYDFSGNLDLVRFIKTIQNQGLYAVLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ I FR++N +
Sbjct: 117 PYVCAEWNYGGFPVWLHNIPNIEFRTNNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPII 176
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY I ++ + G YV W A++A + GVPW+MC+Q DAP P+IN CNG C
Sbjct: 177 LAQIENEYGNIMGSYGQNGKEYVQWCAQLAQSYQIGVPWIMCQQSDAPDPLINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTEDWT ++ WGG R+A+D+AF V F G++ NYYMYH
Sbjct: 237 DQWH--PNSNNKPKMWTEDWTGWFMHWGGPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD APL+EYG + +PKWGHLK LH +K L G+ I
Sbjct: 295 GGTNFGRTSGGPYITTSYDYDAPLNEYGDLNQPKWGHLKRLHEVLKSVETTLTMGSSRNI 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G A +F +G FL N + F+N Y +P S+SILPDC T +NT
Sbjct: 355 DYGNQMTATIF-SYAGQSVCFLGNAHPSMDANINFQNTQYTIPAWSVSILPDCYTEVYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW--EEYREAILN---FDNTLLRAEGLLDQISAAKDAS 383
+V+ Q + + + + D +W E + E + + + + A LLDQ A D S
Sbjct: 414 AKVNAQTSIMTINNENSYALDWQWMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTS 472
Query: 384 DYFWYTFRFHYNSSNAQAPLD----VQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
DY WY + D V + GH+LH FVNG + GS + ++ +FT +
Sbjct: 473 DYLWYITSVDVKQGDPILSHDLKIRVNTKGHVLHVFVNGAHIGSQYATYGKYTFTFEADI 532
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLER---KVAGVHRVRVQD-----KSFTNCSWGYQVG 491
L+ G N+ +L+S TVGLP+ GA+ + V GV V D K + W Y+VG
Sbjct: 533 KLKLGKNEISLVSGTVGLPNYGAYFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVG 592
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
+ GE +++YS + +++ + WYKTTFR P G D + L+L+ +GKG+AWVN
Sbjct: 593 MHGENVKLYSPSRSTEEWFTNGLQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKP-TGNL 607
G +IGRYWVS+ + S T T S + C T YHVP +FL+ N
Sbjct: 653 GNNIGRYWVSYLAGEDGCSSTCDYRGTYRS-NKCTTNCGNPTQRWYHVPDSFLRDGLDNT 711
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQ 667
LV+ EE+ GNP + + T+ I K C H ++
Sbjct: 712 LVVFEEQGGNPFQVKIATVTIAKACAKAYEGH-------------------------ELE 746
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
+C + IS+I FASFG P+G+C + G C SS + +V+R C+GK +CSI + +
Sbjct: 747 LACKENQVISEIKFASFGVPEGECGSFKKGHCESSDTLSIVKRLCLGKQQCSIQVNEKML 806
Query: 728 GGDPCPGIHKALLVDAQCR 746
G C L +DA C+
Sbjct: 807 GPTGCRVPENRLAIDALCQ 825
>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
Length = 745
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 311/673 (46%), Positives = 396/673 (58%), Gaps = 57/673 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK GGLDVI TYVFWN+HEP Y+F GR D++RFIK +Q GLYV LRIG
Sbjct: 58 MWEDLIQKAKVGGLDVIDTYVFWNVHEPSPSNYNFEGRYDLVRFIKTVQKVGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY A G Y WAAKMAV TGVPWVMCK+DDAP PVIN+CNG C
Sbjct: 178 LSQIENEYGPQGRALGAVGHAYSNWAAKMAVGLGTGVPWVMCKEDDAPDPVINSCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP +WTE W+ ++ +GG R AQD+AF VA FI K GS+ NYYMYH
Sbjct: 238 DDF--SPNKPYKPKLWTESWSGWFSEFGGPVPQRPAQDLAFAVARFIQKGGSFFNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR+A IT YD AP+DEYGL+REPK+GHLK+LH AIK C L++ V
Sbjct: 296 GGTNFGRSAGGPFITTSYDYDAPIDEYGLLREPKYGHLKDLHKAIKQCEHALVSSDPTVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A VF + CAAFL N A V F N Y+LP SISILPDCKT FNT
Sbjct: 356 SLGAYEQAHVFSSGTQTCAAFLANYHSNSAARVTFNNRHYDLPPWSISILPDCKTDVFNT 415
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYF 386
RV Q +K + SN K S WE Y E + + +++ + A GLL+QI+A +D SDY
Sbjct: 416 ARVRFQNSKIQMLPSNSKLLS---WETYDEDVSSLAESSRITASGLLEQINATRDTSDYL 472
Query: 387 WYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + S + + + V S G +H F+NG+++GSA G+ + S T ++
Sbjct: 473 WYITSVDISPSESFLRGGNKPSISVHSSGDAVHVFINGKFSGSAFGTREQRSCTFNGPIN 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L GTN ALLSV VGLP+ G E G+ H + K T W YQVGL G
Sbjct: 533 LHAGTNKIALLSVAVGLPNGGIHFESWKTGITGPILLHGLDHGQKDLTWQKWSYQVGLKG 592
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
E + + S G++ V W S+ S + QL W+K F AP GN+ +AL++ MGKG+ W+N
Sbjct: 593 EAMNLVSPNGVSSVDWVRESLASQNQPQLKWHKAYFNAPDGNEALALDMSGMGKGQVWIN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQSIGRYW+ + +KGN + YA + + T YHVPR++LKPT NL+V+
Sbjct: 653 GQSIGRYWLVY--AKGNCNSCNYAGTYRQAKCQLGCGQPTQRWYHVPRSWLKPTNNLMVV 710
Query: 611 LEEENGNPLGITV 623
EE GNP I++
Sbjct: 711 FEELGGNPWKISL 723
>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
Length = 737
Score = 557 bits (1435), Expect = e-155, Method: Compositional matrix adjust.
Identities = 307/672 (45%), Positives = 391/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +G Y F R D++RFIK +Q GLYV LRIG
Sbjct: 69 MWPDLIQKAKDGGLDVIQTYVFWNGHEPTQGNYYFQDRYDLVRFIKLVQQAGLYVHLRIG 128
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WL V GI FR+DN P+K
Sbjct: 129 PYVCAEWNYGGFPVWLKYVPGIEFRTDNGPFKAAMHKFTEKIVSMMKAEKLFQTQGGPII 188
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV +TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 189 LSQIENEFGPVEWDIGAPGKAYAKWAAQMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYC 248
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE WT ++ +G R A+D+ F VA FI GS++NYYMYH
Sbjct: 249 -EKFV-PNQNYKPKMWTEAWTGWFTEFGSAVPTRPAEDLVFSVARFIQSGGSFINYYMYH 306
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+ F+ T Y AP+DEYGL+ EPKWGHL+ LH AIKLC L++ V S
Sbjct: 307 GGTNFGRTSGGFVATSYDYDAPIDEYGLLNEPKWGHLRGLHKAIKLCEPALVSVDPTVKS 366
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+ QEA VF SG CAAFL N D + V F N Y+LP SIS+LPDCKT FNT
Sbjct: 367 LGENQEAHVFNSISGKCAAFLANYDTTFSAKVSFGNAQYDLPPWSISVLPDCKTAVFNTA 426
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
RV Q +++ + S W+ Y E + D+ +GL +Q+ DASDY WY
Sbjct: 427 RVGVQSSQKKFVPVINAFS---WQSYIEETASSTDDNTFTKDGLWEQVYLTADASDYLWY 483
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ S+ N Q P L + S GH L F+NG+ +G+ +GS +N T V LR
Sbjct: 484 MTDVNIGSNEGFLKNGQDPLLTIWSAGHALQVFINGQLSGTVYGSLENPKLTFSKNVKLR 543
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N +LLS +VGLP+ G E+ AGV + + + W Y++GL GE
Sbjct: 544 AGVNKISLLSTSVGLPNVGTHFEKWNAGVLGPVTLKGLNEGTRDISKQKWTYKIGLKGEA 603
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L +++ G + V W+ S ++ +TWYKTTF P GNDP+AL++ +MGKG W+NGQS
Sbjct: 604 LSLHTVSGSSSVEWAQGASLAQKQPMTWYKTTFNVPPGNDPLALDMGAMGKGMVWINGQS 663
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIH---FCAIIKATNTYHVPRAFLKPTGNLLVLL 611
IGR+W + GN YA T T +C + YHVPR+ LKP+GNLLV+
Sbjct: 664 IGRHWPGY-IGNGNCGGCNYA-GTYTEKKCRTYCG-KPSQRWYHVPRSRLKPSGNLLVVF 720
Query: 612 EEENGNPLGITV 623
EE G P I++
Sbjct: 721 EEWGGEPHWISL 732
>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
Length = 723
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 305/672 (45%), Positives = 399/672 (59%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+L KAKEGGLDVIQTYVFWN HEP G+Y F R D+++FIK Q GLYV LRIG
Sbjct: 55 MWPALFQKAKEGGLDVIQTYVFWNGHEPSPGKYYFEDRFDLVKFIKLAQQAGLYVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 115 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAAMQKFTTKIVSMMKAENLFQNQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPW MCKQ+DAP PVI+ CNG C
Sbjct: 175 MSQIENEYGPVEWNIGAPGKAYTNWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE+W+ +Y +G R +D+A+ VA FI GS+VNYYMYH
Sbjct: 235 -ENFT-PNKNYKPKMWTENWSGWYTDFGNAICYRPVEDLAYSVARFIQNRGSFVNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD AP+DEYGL EPKW HL++LH AIK C L++ +
Sbjct: 293 GGTNFGRTSSGLFIATSYDYDAPIDEYGLTNEPKWSHLRDLHKAIKQCEPALVSVDPTIT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V+ + VCAAFL N D + A TV F N Y+LP S+SILPDCKT FNT
Sbjct: 353 SLGNKLEAHVYSTGTSVCAAFLANYDTKSAATVTFGNGKYDLPPWSVSILPDCKTDVFNT 412
Query: 329 ERVSTQYNKRSKTS-NLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V Q ++++ S N FD W+ Y E + ++ + AE L +QI+ +D+SDY
Sbjct: 413 AKVGAQSSQKTMISTNSTFD----WQSYIEEPAFSSEDDSITAEALWEQINVTRDSSDYL 468
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + + + N Q P L+V S GH+LH FVNG+ +G+ +G DN T N+V+
Sbjct: 469 WYLTDVNISPNEDFIKNGQYPILNVMSAGHVLHVFVNGQLSGTVYGVLDNPKLTFSNSVN 528
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L G N +LLSV VGLP+ G E GV + + + W Y+VGL G
Sbjct: 529 LTVGNNKISLLSVAVGLPNVGLHFETWNVGVLGPVTLKGLNEGTRDLSWQKWSYKVGLKG 588
Query: 495 EKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W+ S+ + + LTWYK TF APAGNDP+ L++ SMGKGE WVN
Sbjct: 589 ESLSLHTITGGSSVDWTQGSLLAKKQPLTWYKATFNAPAGNDPLGLDMSSMGKGEIWVND 648
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLL 611
QSIGR+W + + G+ YA + T T YH+PR++L PTGN+LV+L
Sbjct: 649 QSIGRHWPGY-IAHGSCGDCDYAGTFTNTKCRTNCGNPTQTWYHIPRSWLNPTGNVLVVL 707
Query: 612 EEENGNPLGITV 623
EE G+P GI++
Sbjct: 708 EEWGGDPSGISL 719
>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
Length = 721
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 311/672 (46%), Positives = 395/672 (58%), Gaps = 58/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F R D++RF+K Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFVKLAQQAGLYVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 115 PYICAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKEERLFQSQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP PVI+ CNG C
Sbjct: 175 LSQIENEYGPVEWEIGAPGKSYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE+WT +Y +GG IR A+D+AF VA FI GS+VNYYMYH
Sbjct: 235 -ENFK-PNKNTKPKMWTENWTGWYTDFGGASPIRPAEDLAFSVARFIQNGGSFVNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ I YD APLDEYGL EPKWGHL+ LH AIK L++ V
Sbjct: 293 GGTNFGRTSGGLFIATSYDYDAPLDEYGLQNEPKWGHLRALHKAIKQSEPALVSTDPKVT 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA VF T G CAAF+ N D + + F + Y+LP SISILPDCKTV +NT
Sbjct: 353 SLGYNLEAHVF-STPGACAAFIANYDTKSSAKATFGSGQYDLPPWSISILPDCKTVVYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYRE--AILNFDNTLLRAEGLLDQISAAKDASDYF 386
RV + K+ N F W+ Y E A + D++ + AE L +Q++ +D+SDY
Sbjct: 412 ARVGNGWVKKMTPVNSGF----AWQSYNEEPASSSQDDS-IAAEALWEQVNVTRDSSDYL 466
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + N + N ++P L V S GH+LH F+NG+ +G+ +G N T + V+
Sbjct: 467 WYMTDVYINGNEGFLKNGRSPVLTVMSAGHLLHVFINGQLSGTVYGGLGNPKLTFSDNVN 526
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR G N +LLSV VGLP+ G E AGV + + + W Y+VGL G
Sbjct: 527 LRVGNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKG 586
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W S+ + + LTWYK TF APAGNDP+AL+L SMGKGE WVNG
Sbjct: 587 EALNLHTESGSSSVEWIQGSLVAKKQPLTWYKATFSAPAGNDPLALDLGSMGKGEVWVNG 646
Query: 553 QSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
+SIGR+W + + G+ + YA T + YHVPR++L GN LV+
Sbjct: 647 RSIGRHWPGY-IAHGSCNACNYAGYYTDQKCRTNCGKPSQRWYHVPRSWLNSGGNSLVVF 705
Query: 612 EEENGNPLGITV 623
EE G+P GI +
Sbjct: 706 EEWGGDPNGIAL 717
>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
Length = 726
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 303/672 (45%), Positives = 400/672 (59%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GG+DVI+TYVFWN HEP +G+Y F R D+++FIK +Q GLYV LRIG
Sbjct: 58 MWPDLIQKAKDGGVDVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W ++MAV +TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 178 LSQIENEYGPVEWEIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE+WT +Y +G R A+D+AF VA F+ GSYVNYYMYH
Sbjct: 238 -ENFS-PNKNYKPKMWTENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD AP+DEYGL+ EPKWGHL++LH AIK C L++ V
Sbjct: 296 GGTNFGRTSSGLFIATSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVS 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G+ E +++ + G CAAFL N D V F N Y+LP SISILPDCKT FNT
Sbjct: 356 WPGKNLEVHLYKTSFGACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNT 415
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREA-ILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V RS T +N F+ W+ Y E + ++ A GLL+Q+S D SDY
Sbjct: 416 AKVRAPRVHRSMTPANSAFN----WQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYL 471
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + + + N Q P L S GH+LH F+NG++ G+A+GS DN T N+V
Sbjct: 472 WYMTDVNISPNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVK 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR G N +LLSV VGL + G E+ GV + + + W Y++GL G
Sbjct: 532 LRVGNNKISLLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKG 591
Query: 495 EKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W+ S S + LTWYKTTF APAGNDP+AL++ SMGKGE WVNG
Sbjct: 592 ESLNLHTTSGSSSVKWTQGSFLSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNG 651
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLL 611
QSIGR+W ++ ++GN YA + T YH+PR++L P+GN+LV+L
Sbjct: 652 QSIGRHWPAY-IARGNCGSCNYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVL 710
Query: 612 EEENGNPLGITV 623
EE G+P GI++
Sbjct: 711 EEWGGDPTGISL 722
>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 716
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 305/671 (45%), Positives = 401/671 (59%), Gaps = 58/671 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD+I+TYVFWN HEP +G+Y F R D++ FIK +Q GLYV LRIG
Sbjct: 52 MWPDLIQKAKDGGLDIIETYVFWNGHEPSEGKYYFEERYDLVGFIKLVQKAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG PIWL V GI FR+DN+P+K
Sbjct: 112 PYVCAEWNYGGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W A+MAVD TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 172 LSQIENEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP IWTE+W+ +Y +GG R +D+AF VA FI NGS VNYY+YH
Sbjct: 232 -ENFK-PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYH 289
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+ F+ T Y AP+DEYGL+REPKWGHL++LH AIK C L++ +
Sbjct: 290 GGTNFGRTSGLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKSCEPALVSADPTITW 349
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+ QEA VF+ +S CAAFL N D +V V F N Y+LP SISILPDC TV FNT
Sbjct: 350 LGKNQEARVFKSSSA-CAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCXTVTFNTA 408
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYRE--AILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +S + + S W Y+E A +T +A GL++Q+S D +DY W
Sbjct: 409 QVGV----KSYQAKMMPISSFGWLSYKEEPASAYAKDTTTKA-GLVEQVSITWDTTDYLW 463
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y +S+ + + P L V S GH+LH F+NG+ +GS +GS ++ + T V L
Sbjct: 464 YMQDISIDSTEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPAITFSKNVDL 523
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
+QG N ++LSVTVGLP+ G + AGV + + + W Y+VGL GE
Sbjct: 524 KQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLEGLNEGTRDMSKYKWSYKVGLSGE 583
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L +YS+ G N V W+ +Q LTWYKTTF+ PAGN+P+ L++ SM KG+ W+NGQS
Sbjct: 584 SLNLYSDKGSNSVQWTKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWINGQS 643
Query: 555 IGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
IGRY+ + + G + YA + C + YH+PR +L P+ NLLV+ E
Sbjct: 644 IGRYFPGY-IANGKCDKCSYAGLFTEKKCLGNCG-EPSQKWYHIPRDWLSPSDNLLVIFE 701
Query: 613 EENGNPLGITV 623
E G+P GI++
Sbjct: 702 EIGGSPDGISL 712
>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
Length = 725
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 296/670 (44%), Positives = 395/670 (58%), Gaps = 53/670 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV+QTYVFWN HEPQ+GQY F R D++RF+K + GL+V LRIG
Sbjct: 61 MWPDLLQKAKDGGLDVVQTYVFWNGHEPQQGQYYFGDRYDLVRFVKLAKQAGLFVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN P+K
Sbjct: 121 PYVCAEWNFGGFPVWLKYVPGVSFRTDNAPFKAAMQAFVEKIVSMMKAEGLFEWQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E PY WAAKMAV GVPWVMCKQDDAP PVIN CNG C
Sbjct: 181 LAQVENEYGPMESVMGGGAKPYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP++WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 241 --DYFSPNSNSKPTMWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYH 298
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ F+ T Y AP+DEYGL+R+PKWGHL++LH AIK L++G +
Sbjct: 299 GGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQ 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
++G ++A+V++ +SG CAAFL N A V+F Y+LP SIS+LPDC+T FNT
Sbjct: 359 TIGNYEKAYVYKSSSGACAAFLSNYHTNAAARVVFNGRRYDLPAWSISVLPDCRTAVFNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
VS+ T F W+ Y EA + D+ +GL++Q+S D SDY WY
Sbjct: 419 ATVSSPSAPARMTPAGGF----SWQSYSEATNSLDDRAFTKDGLVEQLSMTWDKSDYLWY 474
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + NS+ + Q P L + S GH L FVNG+ G+A+G +D+ T V +
Sbjct: 475 TTYVNINSNEQFLKSGQWPQLTIYSAGHALQVFVNGQSYGAAYGGYDSPKLTYSGYVKMW 534
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGLP+ G E GV + + +N W YQ+GL GE
Sbjct: 535 QGSNKISILSAAVGLPNQGTHYEAWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGES 594
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L ++S G + V W S + LTW+K F AP+GN P+AL++ SMGKG+AWVNG IG
Sbjct: 595 LGVHSVAGSSSVEWGSAAG-KQPLTWHKAYFNAPSGNAPVALDMSSMGKGQAWVNGHHIG 653
Query: 557 RYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
RYW S+K + G+ YA + T + YHVPR++L P+GNLLV+LEE
Sbjct: 654 RYW-SYKATGGSCGGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVVLEEFG 712
Query: 616 GNPLGITVDT 625
G+ G+ + T
Sbjct: 713 GDLSGVKLVT 722
>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
Length = 732
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 308/676 (45%), Positives = 393/676 (58%), Gaps = 63/676 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWN HEP G Y+F GR D++RFIK IQ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKAAMQGFTEKIVQMMKEHRFFASQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE++ G YV WAAKMAV +TGVPWVMCK+DDAP P+IN+CNG C
Sbjct: 181 LSQIENEFEPELKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINSCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE W+ ++ +GG R +D+AF VA FI K GSY+NYYMYH
Sbjct: 241 --DYFTPNKPYKPTMWTEAWSGWFTEFGGTIPKRPVEDLAFGVARFIQKGGSYINYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGLV+EPK+ HLK+LH AIK C L++ +V
Sbjct: 299 GGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +EA VF G C AFL N V+F N Y LP SISILPDC+ V FNT
Sbjct: 359 KLGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW----EEYREAILNF-DNTLLRAEGLLDQISAAKDAS 383
V+ +KTS+++ Y E I + D + A GLL+Q++ +D +
Sbjct: 419 ATVA------AKTSHVQMMPSGSILYSVARYDEDIATYGDRGTITARGLLEQVNVTRDTT 472
Query: 384 DYFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WYT +S + L V S GH +H FVNG + GSA G+ +N F+ +
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
V+LR G N ALLSV VGLP+ G E G+ H + +K + W YQ G
Sbjct: 533 QVNLRGGANRIALLSVAVGLPNVGPHFETWATGIVGSVVLHGLDEGNKDLSWQKWTYQAG 592
Query: 492 LIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE +++ S + V W S + + LTWYK F AP GN+P+AL+L+SMGKG+A
Sbjct: 593 LRGEAMKLVSPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQA 652
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNL 607
W+NGQSIGRYW++F +KGN YA + + T YHVPR++LKP GNL
Sbjct: 653 WINGQSIGRYWMAF--AKGNCGSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPRGNL 710
Query: 608 LVLLEEENGNPLGITV 623
LVL EE G+ ++V
Sbjct: 711 LVLFEELGGDISKVSV 726
>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 308/799 (38%), Positives = 428/799 (53%), Gaps = 83/799 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI K+KEGGLDVI+TYVFWN+HEP GQYDFSG D++RFIK IQ+QGL+ LRIG
Sbjct: 57 MWPSLIEKSKEGGLDVIETYVFWNVHEPHPGQYDFSGNLDLVRFIKTIQNQGLHAVLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ I FR++N +
Sbjct: 117 PYVCAEWNYGGFPVWLHNIPNIEFRTNNAIFEDEMKKFTTLIVDMMRHEKLFASQGGPII 176
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY I ++ + G YV W A++A + GVPW+MC+Q D P P+IN CNG C
Sbjct: 177 LAQIENEYGNIMGSYGQNGKEYVQWCAQLAQSYQIGVPWIMCQQSDTPDPLINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTEDWT ++ WGG R+A+D+AF V F G++ NYYMYH
Sbjct: 237 DQWH--PNSNNKPKMWTEDWTGWFMHWGGPTPHRTAEDVAFAVGRFFQYGGTFQNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD APL+EYG + +PKWGHLK LH +K L G+ I
Sbjct: 295 GGTNFGRTSGGPYITTSYDYDAPLNEYGDLNQPKWGHLKRLHEVLKSVETTLTMGSSRNI 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G A +F +G FL N + F+N Y +P S+SILPDC T +NT
Sbjct: 355 DYGNQMTATIF-SYAGQSVCFLGNAHPSMDANINFQNTQYTIPAWSVSILPDCYTEVYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW--EEYREAILN---FDNTLLRAEGLLDQISAAKDAS 383
+V+ Q + + + + D +W E + E + + + + A LLDQ A D S
Sbjct: 414 AKVNAQTSIMTINNENSYALDWQWMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTS 472
Query: 384 DYFWYTFRFHYNSSNAQAPLD----VQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
DY WY + D V + GH+LH FVNG + GS + ++ FT +
Sbjct: 473 DYLWYITSVDVKQGDPILSHDLKIRVNTKGHVLHVFVNGAHIGSQYATYGKYPFTFEADI 532
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLER---KVAGVHRVRVQD-----KSFTNCSWGYQVG 491
L+ G N+ +L+S TVGLP+ GA+ + V GV V D K + W Y+VG
Sbjct: 533 KLKLGKNEISLVSGTVGLPNYGAYFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVG 592
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
+ GE +++YS ++ +++ + WYKTTFR P G D + L+L+ +GKG+AWVN
Sbjct: 593 MHGENVKLYSPSRSSEEWFTNGLQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVN 652
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKP-TGNL 607
G +IGRYWVS+ + S T T S + C T YHVP +FL+ N
Sbjct: 653 GNNIGRYWVSYLAGEDGCSSTCDYRGTYRS-NKCTTNCGNPTQRWYHVPDSFLRDGLDNT 711
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQ 667
LV+ EE+ GNP + + T+ I K C H ++
Sbjct: 712 LVVFEEQGGNPFQVKIATVTIAKACAKAYEGH-------------------------ELE 746
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
+C + IS+I FASFG P+G+C + G C SS + +V+R C+GK +CSI + +
Sbjct: 747 LACKENQVISEIRFASFGVPEGECGSFKKGHCESSDTLSIVKRLCLGKQQCSIHVNEKML 806
Query: 728 GGDPCPGIHKALLVDAQCR 746
G C L +DA C+
Sbjct: 807 GPTGCRVPENRLAIDALCQ 825
>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
Length = 721
Score = 552 bits (1423), Expect = e-154, Method: Compositional matrix adjust.
Identities = 309/672 (45%), Positives = 391/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI AKEGGLDVIQTYVFWN HEP G Y F R D+++FIK + GLYV LRIG
Sbjct: 53 MWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN P+K
Sbjct: 113 PYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPII 172
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE G Y WAA+MAV TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 173 MSQIENEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN+ KP ++TE WT +Y +GG R A+D+A+ VA FI GS++NYYMYH
Sbjct: 233 -ENFM-PNANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYH 290
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL REPKWGHL++LH IKLC L++ V
Sbjct: 291 GGTNFGRTAGGPFIATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVT 350
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA VF T CAAFL N D + +V V F+N+ Y+LP S+SILPDCKTV FNT
Sbjct: 351 SLGSNQEAHVF-WTKTSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNT 409
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI--LNFDNTLLRAEGLLDQISAAKDASDYF 386
+V +Q S + +S W+ Y E N+D + +GL +QIS +DA+DY
Sbjct: 410 AKVVSQ---GSLAKMIAVNSAFSWQSYNEETPSANYDAVFTK-DGLWEQISVTRDATDYL 465
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY N Q P L V S GH LH FVNG+ +G+ +G +N V
Sbjct: 466 WYMTDVTIGPDEAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVK 525
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR G N +LLS+ VGLP+ G E AGV V + W Y++GL G
Sbjct: 526 LRAGVNKVSLLSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKG 585
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W S+ + + L WYKTTF AP GNDP+AL++ SMGKG+ W+NG
Sbjct: 586 EALSLHTVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWING 645
Query: 553 QSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
QSIGR+W +K ++G+ YA + H + YHVPR++L PT NLLV+
Sbjct: 646 QSIGRHWPGYK-ARGSCGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVF 704
Query: 612 EEENGNPLGITV 623
EE G+P I++
Sbjct: 705 EEWGGDPTKISL 716
>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
Length = 719
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 298/670 (44%), Positives = 396/670 (59%), Gaps = 54/670 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI AK+GGLD+I+TYVFWN HEP +G+Y F R D++RFIK +Q GLYV LRIG
Sbjct: 52 MWPSLIQNAKDGGLDIIETYVFWNGHEPTQGKYYFEDRYDLVRFIKLVQQAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG PIWL V GIVFR++N+P+K
Sbjct: 112 PYVCAEWNYGGFPIWLKHVPGIVFRTENEPFKAAMQKFTEKIVGMMKSEKLYESQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MA+ TGVPWVMCKQ+DAP PVI+ CNG C
Sbjct: 172 LSQIENEYGPVEWEIGAPGKSYTKWAAQMALGLDTGVPWVMCKQEDAPDPVIDTCNGFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN NKP IWTE W+ +Y +GG R A+D+AF VA F+ GS NYYMYH
Sbjct: 232 -ENFK-PNRENKPKIWTEVWSGWYTAFGGAVPYRPAEDLAFSVARFVQNGGSLFNYYMYH 289
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR++ F+ Y AP+DEYGL REPKW HL++LH AIKLC L++ NV
Sbjct: 290 GGTNFGRSSGLFIANSYDFDAPIDEYGLKREPKWEHLRDLHKAIKLCEPALVSADPNVTW 349
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+ EA VF+ +SG CAAFL N D + V F N Y+LP SISIL DCK+ FNT
Sbjct: 350 LGKNLEARVFKSSSGACAAFLANYDISTSSKVSFWNTQYDLPPWSISILSDCKSAIFNTA 409
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILN-FDNTLLRAEGLLDQISAAKDASDYFWY 388
R+ Q S + S W Y+E + + + +GL++Q++ D++DY WY
Sbjct: 410 RIGAQ----SAPMKMMLVSSFWWLSYKEEVASGYATDTTTKDGLVEQVNFTWDSTDYLWY 465
Query: 389 TFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+ + A Q P L++ S GH+LH FVNG+ +G+ +GS +N V+L+
Sbjct: 466 MTDIQIDPNEAFIKSGQWPLLNISSAGHVLHVFVNGQLSGTVYGSLENPKVAFSKYVNLK 525
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ++LSVTVGLP+ G E AGV + + + W ++VGL GE
Sbjct: 526 AGVNKLSMLSVTVGLPNVGLHFESWNAGVLGPVTLKGLNEGIRDMSGYKWSHKVGLKGEN 585
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
+ +++ G N V W+ ++ LTWYKT F PAGN+P+AL++ SMGKG+ W+NG+S
Sbjct: 586 MNLHTIGGSNSVQWAKGSGLVQKQPLTWYKTNFNTPAGNEPLALDMSSMGKGQIWINGRS 645
Query: 555 IGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE 613
IGRYW ++ S G+ + YA + T + YHVPR +L+ GN LV+ EE
Sbjct: 646 IGRYWPAYAAS-GSCGKCSYAGIFTEKKCLSNCGQPSQKWYHVPREWLESKGNFLVVFEE 704
Query: 614 ENGNPLGITV 623
GNP GI++
Sbjct: 705 LGGNPGGISL 714
>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
Length = 830
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 316/802 (39%), Positives = 438/802 (54%), Gaps = 87/802 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + +YDFSG ND+IRF+K IQ +GL+ LRIG
Sbjct: 57 MWPDLIKKAKEGGLDAIETYVFWNAHEPIRREYDFSGNNDLIRFLKTIQDEGLFAVLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG+P+W++++ G+ R+ NK +
Sbjct: 117 PYVCAEWNYGGIPVWVYNLPGVEIRTANKVFMNEMQNFTTLIVDMVRKEKLFASQGGPII 176
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + A+ ++G Y+ W A MA F+ GVPW+MC+Q DAP P+IN CNG C
Sbjct: 177 LSQIENEYGNVMSAYGDEGKAYINWCANMADSFNIGVPWIMCQQPDAPQPMINTCNGWYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F+ PN+PN P +WTE+W +++ WGGK R+A+DIA+ VA F G++ NYYMYH
Sbjct: 237 HD-FE-PNNPNSPKMWTENWVGWFKNWGGKDPHRTAEDIAYSVARFFETGGTFQNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG + +PKWGHLKELH +K L G + I
Sbjct: 295 GGTNFGRTAGGPYITTSYDYDAPLDEYGNIAQPKWGHLKELHLVLKSMENSLTNGNVSKI 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +A V+ T+ + FL N + TV F+ +Y +P S+SILPDC+T +NT
Sbjct: 355 DLGSYVKATVY-ATNDSSSCFLTNTNTTTDATVTFKGNTYNVPAWSVSILPDCQTEEYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILN--FDNTLLRAEGLLDQISAAKDASD 384
+V+ Q + K N D E KW E + N + + ++DQ AA D+SD
Sbjct: 414 AKVNVQTSIMVKRENKAEDEPEALKWVWRAENVHNSLIGKSSVSKNTIVDQKIAANDSSD 473
Query: 385 YFWYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
Y WY R N + L + GH++HAFVNGE+ GS ++ + +
Sbjct: 474 YLWYMTRLDINQKDPVWTNNTILRINGTGHVIHAFVNGEHIGSHWATYGIHNDQFETNIK 533
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLER---------KVAGVHRVRVQDKSFTNCSWGYQVG 491
L+ G ND +LLSVTVGL + G ++ ++ G K ++ W Y+VG
Sbjct: 534 LKHGRNDISLLSVTVGLQNYGKEYDKWQDGLVSPIELIGTKGDETIIKDLSSHKWTYKVG 593
Query: 492 LIGEKLQIYSN--LGLNKVLWSSIRSP-TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L G + + +S + W S P + LTWYKTTF+AP +DPI ++LQ MGKG A
Sbjct: 594 LHGWENKFFSQDTFFASSSKWESNELPINKMLTWYKTTFKAPLESDPIVVDLQGMGKGYA 653
Query: 549 WVNGQSIGRYWVSFKTSK----GNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPT 604
WVNG S+GRYW S+ + +P + N + C + YHVPR F++
Sbjct: 654 WVNGHSLGRYWPSYNADEDGCSDDPCDYRGEYNDTKCVSNCG-KPSQRWYHVPRDFIEDG 712
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
N LVL EE GNP I T+ + C + +
Sbjct: 713 VNTLVLFEEIGGNPSQINFQTVIVGSACANAY-------------------------ENK 747
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSH-SQGVVERACIGKSRCSIPLL 723
T++ SC G+ IS I FASFGNP G C + GSC S++ + +V++AC+GK CSI +
Sbjct: 748 TLELSCH-GRSISDIKFASFGNPQGTCGAFTKGSCESNNEALSLVQKACVGKESCSIDVS 806
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
+ FG C + K L V+A C
Sbjct: 807 EKTFGATNCGNMVKRLAVEAVC 828
>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
Length = 787
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 302/668 (45%), Positives = 390/668 (58%), Gaps = 50/668 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP KGQY FS R D+IRF+K ++ GLYV LRIG
Sbjct: 124 MWPGLIQKAKDGGLDVVQTYVFWNGHEPVKGQYYFSDRYDLIRFVKLVKQAGLYVHLRIG 183
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 184 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAEMQRFVEKIVSMMKSERLFEWQGGPII 243
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENE+ +E A PY WAAKMAV +TGVPWVMCKQ+DAP PVIN CNG C
Sbjct: 244 MSQVENEFGPMESAGGVGAKPYANWAAKMAVATNTGVPWVMCKQEDAPDPVINTCNGFYC 303
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN NKP++WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 304 --DYFTPNKKNKPAMWTEAWTGWFTSFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYH 361
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DE+GL+R+PKWGHL++LH AIK L++G +
Sbjct: 362 GGTNFGRTAGGPFVATSYDYDAPIDEFGLLRQPKWGHLRDLHKAIKQAEPTLVSGDPTIQ 421
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A+VF+ +G CAAFL N AV V F Y+LP SISILPDCKTV FNT
Sbjct: 422 SLGNYEKAYVFKSKNGACAAFLSNYHMNSAVKVRFNGRHYDLPAWSISILPDCKTVVFNT 481
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V ++F W+ Y E + D++ +GL++Q+S D SDY WY
Sbjct: 482 ATVKEPTLLPKMHPVVRF----TWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDYLWY 537
Query: 389 TFRFHYN----SSNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
T + S N Q P L V S GH + FVNG+ GS +G +N T V + Q
Sbjct: 538 TTFVNIGPGELSKNGQWPQLTVYSAGHSMQVFVNGKSYGSVYGGFENPKLTYDGHVKMWQ 597
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKL 497
G+N ++LS VGLP+ G ER GV + + ++ W YQVGL GE L
Sbjct: 598 GSNKISILSSAVGLPNVGDHFERWNVGVLGPVTLSGLSEGKRDLSHQKWTYQVGLKGESL 657
Query: 498 QIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
I++ G + V W S + LTW+K F AP+G+DP+AL++ SMGKG+ WVNG +GR
Sbjct: 658 GIHTVSGSSAVEWGGPGS-KQPLTWHKALFNAPSGSDPVALDMGSMGKGQMWVNGHHVGR 716
Query: 558 YWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
YW S+G + + YHVPR++LKP GNLLV+LEE G+
Sbjct: 717 YWSYKAPSRGCGGCSYAGTYREDKCRSSCGELSQRWYHVPRSWLKPGGNLLVVLEEYGGD 776
Query: 618 PLGITVDT 625
G+T+ T
Sbjct: 777 VAGVTLAT 784
>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 732
Score = 550 bits (1418), Expect = e-154, Method: Compositional matrix adjust.
Identities = 307/676 (45%), Positives = 391/676 (57%), Gaps = 63/676 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWN HEP G Y+F GR D++RFIK IQ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE++ G YV WAAKMAV +TGVPWVMCK+DDAP P+IN CNG C
Sbjct: 181 LSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE W+ ++ +GG R +D+AF VA FI K GSY+NYYMYH
Sbjct: 241 --DYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGLV+EPK+ HLK+LH AIK C L++ +V
Sbjct: 299 GGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +EA VF G C AFL N V+F N Y LP SISILPDC+ V FNT
Sbjct: 359 KLGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW----EEYREAILNFDNT-LLRAEGLLDQISAAKDAS 383
V+ +KTS+++ Y E I + N + A GLL+Q++ +D +
Sbjct: 419 ATVA------AKTSHVQMVPSGSILYSVARYDEDIATYGNPGTITARGLLEQVNVTRDTT 472
Query: 384 DYFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WYT +S + L V S GH +H FVNG + GSA G+ +N F+ +
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
V+LR G N ALLSV VGLP+ G E G+ H + +K + W YQ G
Sbjct: 533 QVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSVALHGLDEGNKDLSWQKWTYQAG 592
Query: 492 LIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE + + S + V W S + + LTWYK F AP GN+P+AL+L+SMGKG+A
Sbjct: 593 LRGESMNLVSPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQA 652
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNL 607
W+NGQSIGRYW++F +KG+ YA + + T YHVPR++LKP GNL
Sbjct: 653 WINGQSIGRYWMAF--AKGDCGSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNL 710
Query: 608 LVLLEEENGNPLGITV 623
LVL EE G+ ++V
Sbjct: 711 LVLFEELGGDISKVSV 726
>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
Length = 732
Score = 550 bits (1418), Expect = e-154, Method: Compositional matrix adjust.
Identities = 307/676 (45%), Positives = 391/676 (57%), Gaps = 63/676 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWN HEP G Y+F GR D++RFIK IQ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE++ G YV WAAKMAV +TGVPWVMCK+DDAP P+IN CNG C
Sbjct: 181 LSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE W+ ++ +GG R +D+AF VA FI K GSY+NYYMYH
Sbjct: 241 --DYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGLV+EPK+ HLK+LH AIK C L++ +V
Sbjct: 299 GGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +EA VF G C AFL N V+F N Y LP SISILPDC+ V FNT
Sbjct: 359 KLGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW----EEYREAILNFDNT-LLRAEGLLDQISAAKDAS 383
V+ +KTS+++ Y E I + N + A GLL+Q++ +D +
Sbjct: 419 ATVA------AKTSHVQMVPSGSILYSVARYDEDIATYGNRGTITARGLLEQVNVTRDTT 472
Query: 384 DYFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WYT +S + L V S GH +H FVNG + GSA G+ +N F+ +
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
V+LR G N ALLSV VGLP+ G E G+ H + +K + W YQ G
Sbjct: 533 QVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSVVLHGLDEGNKDLSWQKWTYQAG 592
Query: 492 LIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE + + S + V W S + + LTWYK F AP GN+P+AL+L+SMGKG+A
Sbjct: 593 LRGESMNLVSPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDAPRGNEPLALDLKSMGKGQA 652
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNL 607
W+NGQSIGRYW++F +KG+ YA + + T YHVPR++LKP GNL
Sbjct: 653 WINGQSIGRYWMAF--AKGDCGSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNL 710
Query: 608 LVLLEEENGNPLGITV 623
LVL EE G+ ++V
Sbjct: 711 LVLFEELGGDISKVSV 726
>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 826
Score = 550 bits (1418), Expect = e-154, Method: Compositional matrix adjust.
Identities = 321/804 (39%), Positives = 439/804 (54%), Gaps = 91/804 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + YDFSG NDIIRF+K IQ GLY LRIG
Sbjct: 55 MWPELIQKAKEGGLDAIETYVFWNAHEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG+P+W+H++ + R+ N Y
Sbjct: 115 PYVCAEWNYGGIPVWVHNLPDVEIRTANSVYMNEMQNFTTLIVDMVKKEKLFASQGGPII 174
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + + + G Y+ W A MA + GVPW+MC++ DAP +IN CNG C
Sbjct: 175 LTQIENEYGNVISHYGDAGKAYMNWCANMAESLNVGVPWIMCQESDAPQSMINTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F+ PN+P+ P +WTE+W +++ WGG+ R+A+D+AF VA F G++ NYYMYH
Sbjct: 235 -DNFE-PNNPSSPKMWTENWVGWFKNWGGRDPHRTAEDVAFAVARFFQTGGTFQNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA IT YD APLDEYG + +PKWGHLKELH +K L +G +
Sbjct: 293 GGTNFDRTAGGPYITTSYDYDAPLDEYGNIAQPKWGHLKELHNVLKSMEETLTSGNVSET 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G +A ++ T+G + FL + + T+ FR +Y +P S+SILPDC+ +NT
Sbjct: 353 DFGNSVKATIY-ATNGSSSCFLSSTNTTTDATLTFRGKNYTVPAWSVSILPDCEHEEYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTL-----LRAEGLLDQISAAKDAS 383
+V+ Q + K N K + + ++ N DN L + A LLDQ AA DAS
Sbjct: 412 AKVNVQTSVMVK-ENSKAEEEATALKWVWRSENIDNALHGKSNVSANRLLDQKDAANDAS 470
Query: 384 DYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
DY WY + H + L + S GH++HAFVNGE+ GS ++ + +
Sbjct: 471 DYLWYMTKLHVKHDDPVWGENMTLRINSSGHVIHAFVNGEHIGSHWATYGIHNDKFEPKI 530
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAG----VHRVRVQD-----KSFTNCSWGYQV 490
L+ GTN +LLSVTVGL + GAF + AG + V V+ K+ ++ W Y+V
Sbjct: 531 KLKHGTNTISLLSVTVGLQNYGAFFDTWHAGLVEPIELVSVKGDETIIKNLSSNKWSYKV 590
Query: 491 GLIGEKLQIYSN----LGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGK 545
GL G +++S+ NK W S + PT R LTWYKTTF AP G DP+ ++LQ MGK
Sbjct: 591 GLHGWDHKLFSDDSPFAAPNK--WESEKLPTDRMLTWYKTTFNAPLGTDPVVVDLQGMGK 648
Query: 546 GEAWVNGQSIGRYWVSFKTSKGNPSQ--TQYAVNTVTSIHFCAIIKATNT-YHVPRAFLK 602
G AWVNGQ+IGR W S+ + S Y S K T YHVPR++LK
Sbjct: 649 GYAWVNGQNIGRIWPSYNAEEDGCSDEPCDYRGEYTDSKCVTNCGKPTQRWYHVPRSYLK 708
Query: 603 PTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK 662
N LVL E GNP + T+ + VC + +
Sbjct: 709 DGANNLVLFAELGGNPSQVNFQTVVVGTVCANAY-------------------------E 743
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS-SHSQGVVERACIGKSRCSIP 721
T++ SC G+KIS I FASFG+P+G C + GSC S S++ +V++AC+GK CS
Sbjct: 744 NKTLELSCQ-GRKISAIKFASFGDPEGVCGAFTNGSCESKSNALSIVQKACVGKQACSFD 802
Query: 722 LLSRYFGGDPCPGIHKALLVDAQC 745
+ + FG C + K L V+A C
Sbjct: 803 VSEKTFGPTACGNVAKRLAVEAVC 826
>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
Length = 721
Score = 550 bits (1417), Expect = e-153, Method: Compositional matrix adjust.
Identities = 308/672 (45%), Positives = 390/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI AKEGGLDVIQTYVFWN HEP G Y F R D+++FIK + GLYV LRI
Sbjct: 53 MWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRIS 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN P+K
Sbjct: 113 PYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPII 172
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE G Y WAA+MAV TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 173 MSQIENEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN+ KP ++TE WT +Y +GG R A+D+A+ VA FI GS++NYYMYH
Sbjct: 233 -ENFM-PNANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYH 290
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL REPKWGHL++LH IKLC L++ V
Sbjct: 291 GGTNFGRTAGGPFIATSYDYDAPLDEYGLRREPKWGHLRDLHKTIKLCEPSLVSVDPKVT 350
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA VF T CAAFL N D + +V V F+N+ Y+LP S+SILPDCKTV FNT
Sbjct: 351 SLGSNQEAHVF-WTKTSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDCKTVVFNT 409
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI--LNFDNTLLRAEGLLDQISAAKDASDYF 386
+V +Q S + +S W+ Y E N+D + +GL +QIS +DA+DY
Sbjct: 410 AKVVSQ---GSLAKMIAVNSAFSWQSYNEETPSANYDAVFTK-DGLWEQISVTRDATDYL 465
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY N Q P L V S GH LH FVNG+ +G+ +G +N V
Sbjct: 466 WYMTDVTIGPDEAFLKNGQDPILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVK 525
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
LR G N +LLS+ VGLP+ G E AGV V + W Y++GL G
Sbjct: 526 LRAGVNKVSLLSIAVGLPNVGLHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKG 585
Query: 495 EKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W S+ + + L WYKTTF AP GNDP+AL++ SMGKG+ W+NG
Sbjct: 586 EALSLHTVSGSSSVEWVEGSLLAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWING 645
Query: 553 QSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
QSIGR+W +K ++G+ YA + H + YHVPR++L PT NLLV+
Sbjct: 646 QSIGRHWPGYK-ARGSCGACNYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVF 704
Query: 612 EEENGNPLGITV 623
EE G+P I++
Sbjct: 705 EEWGGDPTKISL 716
>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
Length = 732
Score = 550 bits (1417), Expect = e-153, Method: Compositional matrix adjust.
Identities = 302/668 (45%), Positives = 395/668 (59%), Gaps = 58/668 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWNLHEP G Y+F GRND+++FIK + GLYV LRIG
Sbjct: 58 MWEGLIQKAKDGGLDVIDTYVFWNLHEPSPGNYNFEGRNDLVQFIKLVHKAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL + G++FR+DN+P+K
Sbjct: 118 PYICGEWNFGGFPVWLKYIPGMIFRTDNEPFKLQMQKFTQKIVQMMKDEQLYESQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+ + AF G Y+ WAA MAV +TGVPWVMCK+ DAP PV+N CNG C
Sbjct: 178 LSQIENEYEPEDKAFGAAGHAYMTWAAHMAVSLNTGVPWVMCKEFDAPDPVVNTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP++WTE WT ++ +GG + R +D+AF VA FI K GS+VNYYMYH
Sbjct: 238 --DYFSPNKAYKPTMWTEAWTGWFTDFGGPIHQRPVEDLAFAVARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL+R+PK+GHLK+LH AIKLC R LL+ V
Sbjct: 296 GGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKDLHKAIKLCERALLSSDPVVT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG ++A VF SG CAAFL N + + V F N+ Y LP S+SILPDCK V FNT
Sbjct: 356 TLGSYEQAHVFSSNSGDCAAFLANYNPKATAKVTFNNMHYNLPPWSVSILPDCKNVVFNT 415
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNFDNTLL-RAEGLLDQISAAKDASDYF 386
V Q +K + + +F S WE E I + D+ + GLL+QI+ +DASDY
Sbjct: 416 AEVGVQPSKIQMLPTEARFLS---WEALSEDISSVDDDKIGTVAGLLEQINVTRDASDYL 472
Query: 387 WYTFRFHYNSSN-----AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV- 439
WYT H +SS Q P L V S GH +H FVNG+ +GS +G+ N + +
Sbjct: 473 WYTTGVHISSSETFLDGGQPPILKVISAGHGIHVFVNGQLSGSVYGTRGNRRISFSGELK 532
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLI 493
L G N +LLSV VGLP++G E GV H + + T W Y+VGL
Sbjct: 533 QLHAGRNRISLLSVAVGLPNNGPRFETWNTGVLGPVVIHGLDQGHRDLTWQKWSYKVGLK 592
Query: 494 GEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE L + S + + W S++ + + LTW++ F AP G+DP+AL++ SM KG+ W+
Sbjct: 593 GEDLNLGSPNSIPSINWMQESAMVAERQPLTWHRAFFDAPRGDDPLALDMSSMVKGQVWI 652
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLV 609
NG SIGRYW + + GN + Y+ ++ F YH+PR+ LKPT NLLV
Sbjct: 653 NGNSIGRYWTVY--ADGNCTACSYSGTFRPSTCQFGCGQPTQKWYHIPRSLLKPTENLLV 710
Query: 610 LLEEENGN 617
+ EE G+
Sbjct: 711 VFEEIGGD 718
>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 728
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 302/672 (44%), Positives = 393/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F R D+++FIK +Q GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+VFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE G Y W A+MA TGVPW+MCKQDDAP +IN CNG C
Sbjct: 179 LSQIENEYGPIEWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT ++ +GG R A+DIA VA FI GS++NYYMYH
Sbjct: 239 -ENFK-PNSDNKPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y APLDEYGL REPK+ HLK LH IKLC L++ V S
Sbjct: 297 GGTNFDRTAGEFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QEA VF+ S CAAFL N + A VLF +Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEAHVFKSKSS-CAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 RVSTQYNKRSKTSNLKF---DSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDY 385
+V R+ + ++K ++ W Y E I + DN +GL++QIS +D +DY
Sbjct: 416 KVQV----RTSSIHMKMVPTNTPFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDY 471
Query: 386 FWY----TFRFHYNSSNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
FWY T + P L + S GH LH FVNG+ G+A+GS + T +
Sbjct: 472 FWYLTDITISPDEKFLTGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIK 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L G N ALLS GLP+ G E GV + V T W Y++G G
Sbjct: 532 LHAGVNKLALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKG 591
Query: 495 EKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++ G + V W S+ + + LTWYK+TF +P GN+P+AL++ +MGKG+ W+NG
Sbjct: 592 EALSVHTLAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWING 651
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLL 611
Q+IGR+W ++ T++G + YA +A+ YHVPR++LKPT NL+++L
Sbjct: 652 QNIGRHWPAY-TARGKCERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVL 710
Query: 612 EEENGNPLGITV 623
EE G P GI++
Sbjct: 711 EEWGGEPNGISL 722
>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
Length = 732
Score = 549 bits (1414), Expect = e-153, Method: Compositional matrix adjust.
Identities = 306/676 (45%), Positives = 390/676 (57%), Gaps = 63/676 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVI TYVFWN HEP G Y+F GR D++RFIK IQ GLYV LRIG
Sbjct: 61 MWEDLIKKAKDGGLDVIDTYVFWNGHEPSPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 121 PYVCAEWNFGGFPVWLKYVDGISFRTDNGPFKSAMQGFTEKIVQMMKEHRFFASQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE++ G YV WAAKMAV +TGVPWVMCK+DDAP P+IN CNG C
Sbjct: 181 LSQIENEFEPDLKGLGPAGHSYVNWAAKMAVGLNTGVPWVMCKEDDAPDPIINTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P KP++WTE W+ ++ +GG R +D+AF VA FI K GSY+NYYMYH
Sbjct: 241 --DYFTPNKPYKPTMWTEAWSGWFTEFGGTVPKRPVEDLAFGVARFIQKGGSYINYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGLV+EPK+ HLK+LH AIK C L++ +V
Sbjct: 299 GGTNFGRTAGGPFITTSYDYDAPIDEYGLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +EA VF G C AFL N V+F N Y LP SISILPDC+ V FNT
Sbjct: 359 KLGNYEEAHVFTAGKGSCVAFLTNYHMNAPAKVVFNNRHYTLPAWSISILPDCRNVVFNT 418
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW----EEYREAILNFDNT-LLRAEGLLDQISAAKDAS 383
V+ +KTS+++ Y E I + N + A GLL+Q++ +D +
Sbjct: 419 ATVA------AKTSHVQMVPSGSILYSVARYDEDIATYGNRGTITARGLLEQVNVTRDTT 472
Query: 384 DYFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WYT +S + L V S GH +H FVNG + GSA G+ +N F+ +
Sbjct: 473 DYLWYTTSVDIKASESFLRGGKWPTLTVDSAGHAVHVFVNGHFYGSAFGTRENRKFSFSS 532
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
V+LR G N ALLSV VGLP+ G E G+ H + +K + W YQ G
Sbjct: 533 QVNLRGGANKIALLSVAVGLPNVGPHFETWATGIVGSVVLHGLDEGNKDLSWQKWTYQAG 592
Query: 492 LIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE + + S + V W S + + LTWYK F P GN+P+AL+L+SMGKG+A
Sbjct: 593 LRGESMNLVSPTEDSSVDWIKGSLAKQNKQPLTWYKAYFDVPRGNEPLALDLKSMGKGQA 652
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNL 607
W+NGQSIGRYW++F +KG+ YA + + T YHVPR++LKP GNL
Sbjct: 653 WINGQSIGRYWMAF--AKGDCGSCNYAGTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNL 710
Query: 608 LVLLEEENGNPLGITV 623
LVL EE G+ ++V
Sbjct: 711 LVLFEELGGDISKVSV 726
>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
Length = 724
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 307/673 (45%), Positives = 396/673 (58%), Gaps = 61/673 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVI+TYVFWN HEP GQY F R D+++FIK + GLYV LRIG
Sbjct: 59 MWPGLIQKAKEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W A+MA+ TGVPW+MCKQ+DAPGP+I+ CNG C
Sbjct: 179 LAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT +Y +GG R +DIA+ VA FI K GS VNYYMYH
Sbjct: 239 -EDFK-PNSINKPKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLVNYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA FM + Y APLDEYGL REPK+ HLK LH AIKL LL+ V S
Sbjct: 297 GGTNFDRTAGEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QEA+VF S CAAFL N DE A VLFR Y+LP S+SILPDCKT +NT
Sbjct: 357 LGAKQEAYVFWSKSS-CAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTA 415
Query: 330 RVSTQYNKRSKT-SNLKFDSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V+ R+ + KF W + EA N T R GL++QIS D SDYF
Sbjct: 416 KVNAPSVHRNMVPTGTKFS----WGSFNEATPTANEAGTFAR-NGLVEQISMTWDKSDYF 470
Query: 387 WYTFRFHYNS-----SNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S +P L V S GH LH FVNG+ +G+A+G D+ T +
Sbjct: 471 WYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIK 530
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L G N ALLSV VGLP+ G E+ GV V + W Y++G+ G
Sbjct: 531 LHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKG 590
Query: 495 EKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++N + V W+ S + + LTWYK+TF PAGN+P+AL++ +MGKG+ W+NG
Sbjct: 591 EALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWING 650
Query: 553 QSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVL 610
++IGR+W ++K ++G+ + YA + + C + YHVPR++LK + NL+V+
Sbjct: 651 RNIGRHWPAYK-AQGSCGRCNYAGTFDAKKCLSNCG-EASQRWYHVPRSWLK-SQNLIVV 707
Query: 611 LEEENGNPLGITV 623
EE G+P GI++
Sbjct: 708 FEELGGDPNGISL 720
>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
gi|194689400|gb|ACF78784.1| unknown [Zea mays]
gi|224030521|gb|ACN34336.1| unknown [Zea mays]
gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
Length = 722
Score = 547 bits (1410), Expect = e-153, Method: Compositional matrix adjust.
Identities = 295/670 (44%), Positives = 389/670 (58%), Gaps = 53/670 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV+QTYVFWN HEP +GQY F R D++RF+K + GLYV LRIG
Sbjct: 58 MWPGLLQKAKDGGLDVVQTYVFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E PY WAAKMAV GVPWVMCKQDDAP PVIN CNG C
Sbjct: 178 LAQVENEYGPMESVMGAGAKPYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP++WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 238 --DYFSPNSNSKPTMWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ F+ T Y AP+DEYGL+R+PKWGHL++LH AIK L++G +
Sbjct: 296 GGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQ 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A+VF+ + G CAAFL N A V+F Y+LP SIS+LPDCK FNT
Sbjct: 356 SLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
VS + S + + W+ Y EA + D +GL++Q+S D SDY WY
Sbjct: 416 ATVS----EPSAPARMSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWY 471
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + NS+ + Q P L + S GH L FVNG+ G+ +G +D+ T V +
Sbjct: 472 TTYVNINSNEQFLKSGQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMW 531
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGLP+ G E GV + + ++ W YQ+GL GE
Sbjct: 532 QGSNKISILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGES 591
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L + S G + V W S + LTW+K F AP+G+ P+AL++ SMGKG+AWVNG+ IG
Sbjct: 592 LGVQSVAGSSSVEWGSAAG-KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIG 650
Query: 557 RYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEEN 615
RYW S+K S YA + T + YHVPR++L P+GNLLV+LEE
Sbjct: 651 RYW-SYKASSSGCGGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFG 709
Query: 616 GNPLGITVDT 625
G+ G+ + T
Sbjct: 710 GDLSGVKLVT 719
>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
Length = 730
Score = 547 bits (1409), Expect = e-153, Method: Compositional matrix adjust.
Identities = 298/675 (44%), Positives = 399/675 (59%), Gaps = 61/675 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GG+DVIQTYVFWN HEP G Y F R D+++F+K +Q GLYV LRIG
Sbjct: 61 MWPDLIQKAKDGGVDVIQTYVFWNGHEPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 121 PYVCAEWNFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTAKIVSMMKAENLFESQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W ++MA+ TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 181 MSQIENEYGPVEWEIGAPGKAYTKWFSQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE+W+ +Y +G R AQD+AF VA FI GSYVNYYMYH
Sbjct: 241 -ENFT-PNKNYKPKMWTENWSGWYTDFGSAVPYRPAQDVAFSVARFIQNRGSYVNYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+A I YD AP+DEYGL+ EPKWGHL+ LH AIK C P+L +
Sbjct: 299 GGTNFGRTSAGLFIATSYDYDAPIDEYGLLSEPKWGHLRNLHKAIKQC-EPILVSVDPTV 357
Query: 269 SL-GQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
S G+ E V++ ++G CAAFL N D V F N Y+LP SISILPDCKT FN
Sbjct: 358 SWPGKNLEVHVYKTSTGACAAFLANYDTTSPAKVTFGNGQYDLPPWSISILPDCKTAVFN 417
Query: 328 TERVST--QYNKRSKTSNLKFDSDEKWEEYREAILN--FDNTLLRAEGLLDQISAAKDAS 383
T +V T ++++ + FD W+ Y EA + D++ A LL+QI +D+S
Sbjct: 418 TAKVGTVPSFHRKMTPVSSAFD----WQSYNEAPASSGIDDS-TTANALLEQIKVTRDSS 472
Query: 384 DYFWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WY + + + N Q P L S GH+LH FVNG+++G+A+G +N T N
Sbjct: 473 DYLWYMTDVNISPNEGFIKNGQYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSN 532
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
+V LR G N +LLSV VGL + G E GV + + + W Y++G
Sbjct: 533 SVKLRVGNNKISLLSVAVGLSNVGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIG 592
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
L GE L +++ +G + V W+ S ++ LTWYK TF APAGNDP+AL++ SMGKGE W
Sbjct: 593 LKGETLNLHTLIGSSSVQWTKGSSLVKKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIW 652
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLL 608
VNG+SIGR+W ++ ++G+ YA + + T YH+PR+++ P GN L
Sbjct: 653 VNGESIGRHWPAY-IARGSCGGCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFL 711
Query: 609 VLLEEENGNPLGITV 623
V+LEE G+P GI++
Sbjct: 712 VVLEEWGGDPSGISL 726
>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
Length = 724
Score = 546 bits (1408), Expect = e-152, Method: Compositional matrix adjust.
Identities = 306/673 (45%), Positives = 396/673 (58%), Gaps = 61/673 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVI+TYVFWN HEP GQY F R D+++FIK + GLYV LRIG
Sbjct: 59 MWPGLIQKAKEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W A+MA+ TGVPW+MCKQ+DAPGP+I+ CNG C
Sbjct: 179 LAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT +Y +GG R +DIA+ VA FI K GS +NYYMYH
Sbjct: 239 -EDFK-PNSINKPKMWTENWTGWYTDFGGAVPYRPVEDIAYSVARFIQKGGSLINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA FM + Y APLDEYGL REPK+ HLK LH AIKL LL+ V S
Sbjct: 297 GGTNFDRTAGEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSEPALLSADATVTS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QEA+VF S CAAFL N DE A VLFR Y+LP S+SILPDCKT +NT
Sbjct: 357 LGAKQEAYVFWSKSS-CAAFLSNKDENSAARVLFRGFPYDLPPWSVSILPDCKTEVYNTA 415
Query: 330 RVSTQYNKRSKT-SNLKFDSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V+ R+ + KF W + EA N T R GL++QIS D SDYF
Sbjct: 416 KVNAPSVHRNMVPTGTKFS----WGSFNEATPTANEAGTFAR-NGLVEQISMTWDKSDYF 470
Query: 387 WYTFRFHYNS-----SNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY S +P L V S GH LH FVNG+ +G+A+G D+ T +
Sbjct: 471 WYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIK 530
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L G N ALLSV VGLP+ G E+ GV V + W Y++G+ G
Sbjct: 531 LHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKG 590
Query: 495 EKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L +++N + V W+ S + + LTWYK+TF PAGN+P+AL++ +MGKG+ W+NG
Sbjct: 591 EALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWING 650
Query: 553 QSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVL 610
++IGR+W ++K ++G+ + YA + + C + YHVPR++LK + NL+V+
Sbjct: 651 RNIGRHWPAYK-AQGSCGRCNYAGTFDAKKCLSNCG-EASQRWYHVPRSWLK-SQNLIVV 707
Query: 611 LEEENGNPLGITV 623
EE G+P GI++
Sbjct: 708 FEELGGDPNGISL 720
>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 813
Score = 546 bits (1408), Expect = e-152, Method: Compositional matrix adjust.
Identities = 312/800 (39%), Positives = 431/800 (53%), Gaps = 88/800 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEPQ+ +YDF+GR D I+F + +Q GLYV +RIG
Sbjct: 42 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIG 101
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI FR+DN+ YK
Sbjct: 102 PYVCAEWNYGGFPLWLHNLPGIQFRTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 161
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + G Y+ W A+MA + G+PW+MC+Q DAP P+IN CNG C
Sbjct: 162 LAQIENEYGNVMTPYGNAGKSYINWCAQMAESLNIGIPWIMCQQSDAPQPIINTCNGFYC 221
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F PN+P P ++TE+W +++ WG K RS +D+AF VA F G + NYYMYH
Sbjct: 222 DYDFS-PNNPKSPKMFTENWVGWFKKWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMYH 280
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG + +PKWGHLK+LHA+IK+ + L T++
Sbjct: 281 GGTNFGRTAGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQ 340
Query: 269 SLGQLQEAFVFEE-TSGVCAAFLVNNDERKAVTV-LFRNISYELPRKSISILPDCKTVAF 326
L F TSG FL N D + T+ L + Y +P S+SIL C F
Sbjct: 341 KLXSFVTLTKFSNPTSGERFCFLSNTDNKNDATIDLQADGKYFVPAWSVSILDGCNKEVF 400
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKW----EEYREAILNFDNTLLRAEGLLDQISAAKDA 382
NT ++++Q + K N K ++ W E R+ + +A LL+Q D
Sbjct: 401 NTAKINSQTSMFVKVQNKKENAQFSWVWAPEPMRDTLQG--KGTFKANLLLEQKGTTVDF 458
Query: 383 SDYFWYTFRFHYNSSNA--QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
SDY WY N++++ L V + GH+LHAFVN Y GS S+ SF +
Sbjct: 459 SDYLWYMTNIDSNATSSLQNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQ-SFVFXKPIL 517
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVH----------RVRVQDKSFTNCSWGYQV 490
++ GTN LLS TVGL + AF + G+ V++ ++ W Y+V
Sbjct: 518 IKPGTNTITLLSATVGLKNYDAFYDTVPTGIDGGPIYLIGDGNVKI---DLSSNLWSYKV 574
Query: 491 GLIGEKLQIYSNLGLNKVLWSSI--RSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
GL GE Q+Y+ + + WS+I +S R++T YKT F+ P+G DP+ L++Q MGKG+A
Sbjct: 575 GLNGEMKQLYNPVFSQRTNWSTINQKSIGRRMTLYKTNFKTPSGIDPVTLDMQGMGKGQA 634
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG 605
WVNGQSIGR+W SF + S T + A N + C + YH+PR+FL
Sbjct: 635 WVNGQSIGRFWPSFIAGNDSCSTTCDYRGAYNPSKCVENCG-NPSQRWYHIPRSFLSDDT 693
Query: 606 NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPT 665
N LVL EE GNP ++V TI I +CG+ + T
Sbjct: 694 NTLVLFEEIGGNPQQVSVQTITIGTICGNAN-------------------------EGST 728
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
++ SC G IS+I FAS+GNP+G C + GS H +S +VE+ CIG CSI + ++
Sbjct: 729 LELSCQGGHIISEIQFASYGNPEGKCGSFKQGSWHVINSAILVEKLCIGMESCSIDVSAK 788
Query: 726 YFGGDPCPGIHKALLVDAQC 745
FG I L + A C
Sbjct: 789 SFGLGDVTNISARLAIQALC 808
>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 729
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 303/673 (45%), Positives = 393/673 (58%), Gaps = 58/673 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F R D+++FIK +Q GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+VFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE G Y W A+MA TGVPW+MCKQDDAP +IN CNG C
Sbjct: 179 LSQIENEYGPIEWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT ++ +GG R A+DIA VA FI GS++NYYMYH
Sbjct: 239 -ENFK-PNSDNKPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y APLDEYGL REPK+ HLK LH IKLC L++ V S
Sbjct: 297 GGTNFDRTAGEFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QEA VF+ S CAAFL N + A VLF +Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEAHVFKSKSS-CAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 RVSTQYNKRSKTSNLKF---DSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDY 385
+V R+ + ++K ++ W Y E I + DN +GL++QIS +D +DY
Sbjct: 416 KVQV----RTSSIHMKMVPTNTPFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDY 471
Query: 386 FWY----TFRFHYNSSNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
FWY T + P L + S GH LH FVNG+ G+A+GS + T +
Sbjct: 472 FWYLTDITISPDEKFLTGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIK 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGY-QVGLI 493
L G N ALLS GLP+ G E GV + V T W Y Q+G
Sbjct: 532 LHAGVNKLALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKQIGTK 591
Query: 494 GEKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE L +++ G + V W S+ + + LTWYK+TF +P GN+P+AL++ +MGKG+ W+N
Sbjct: 592 GEALSVHTLAGSSTVEWKEGSLVAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWIN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
GQ+IGR+W ++ T++G + YA +A+ YHVPR++LKPT NL+++
Sbjct: 652 GQNIGRHWPAY-TARGKCERCSYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIV 710
Query: 611 LEEENGNPLGITV 623
LEE G P GI++
Sbjct: 711 LEEWGGEPNGISL 723
>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 838
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 309/802 (38%), Positives = 433/802 (53%), Gaps = 90/802 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEPQ+ +YDF+GR D I+F + +Q GLYV +RIG
Sbjct: 67 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIG 126
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI FR+DN+ YK
Sbjct: 127 PYVCAEWNYGGFPLWLHNLPGIQFRTDNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 186
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + G Y+ W A+MA + G+PW+MC+Q+DAP P+IN CNG C
Sbjct: 187 LAQIENEYGNVMTPYGNAGKSYINWCAQMAESLNIGIPWIMCQQNDAPQPIINTCNGFYC 246
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F PN+P P ++TE+W +++ WG K RS +D+AF VA F G + NYYMYH
Sbjct: 247 DYDFS-PNNPKSPKMFTENWVGWFKKWGDKDPYRSPEDVAFAVARFFQSGGVFNNYYMYH 305
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG + +PKWGHLK+LHA+IK+ + L T++
Sbjct: 306 GGTNFGRTAGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQ 365
Query: 269 SLGQLQEAFVFEE-TSGVCAAFLVNNDERKAVTVLFR---NISYELPRKSISILPDCKTV 324
+ F TSG FL N D + T+ + +P S+SIL C
Sbjct: 366 KISSFVTLTKFSNPTSGERFCFLSNTDNKNDATIDLQADGKYFVPVPAWSVSILDGCNKE 425
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDEKW----EEYREAILNFDNTLLRAEGLLDQISAAK 380
FNT ++++Q + K N K ++ W E R+ + +A LL+Q
Sbjct: 426 VFNTAKINSQTSMFVKVQNKKENAQFSWVWAPEPMRDTLQG--KGTFKANLLLEQKGTTV 483
Query: 381 DASDYFWYTFRFHYNSSNA--QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
D SDY WY N++++ L V + GH+LHAFVN Y GS S+ SF
Sbjct: 484 DFSDYLWYMTNIDSNATSSLQNVTLQVNTKGHMLHAFVNRRYIGSQWRSNGQ-SFVFEKP 542
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH----------RVRVQDKSFTNCSWGY 488
+ ++ GTN LLS TVGL + AF + G+ V++ ++ W Y
Sbjct: 543 ILIKPGTNTITLLSATVGLKNYDAFYDTVPTGIDGGPIYLIGDGNVKID---LSSNLWSY 599
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSI--RSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
+VGL GE Q+Y+ + + WS+I +S R++TWYKT+F+ P+G D + L++Q MGKG
Sbjct: 600 KVGLNGEMKQLYNPVFSQRTNWSTINQKSIGRRMTWYKTSFKTPSGIDRVTLDMQGMGKG 659
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKP 603
+AWVNGQSIGR+W SF S + S T + A N + C + YH+PR+FL
Sbjct: 660 QAWVNGQSIGRFWPSFIASNDSCSTTCDYRGAYNPSKCVENCG-NPSQRWYHIPRSFLSD 718
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
N LVL EE GNP ++V TI I +CG+ +
Sbjct: 719 DTNTLVLFEEIGGNPQQVSVQTITIGTICGNAN-------------------------EG 753
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLL 723
T++ SC G IS+I FAS+GNP+G C + GS H +S +VE+ CIG+ CSI +
Sbjct: 754 STLELSCQGGHIISEIQFASYGNPEGKCGSFKQGSWHVINSAILVEKLCIGRESCSIDVS 813
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
++ FG + L + A C
Sbjct: 814 AKSFGLGDVTNLSARLAIQALC 835
>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
Length = 723
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 297/672 (44%), Positives = 389/672 (57%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV+QTYVFWN HEP +GQY F R D++RF+K + GLYV LRIG
Sbjct: 58 MWPGLLQKAKDGGLDVVQTYVFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E PY WAAKMAV GVPWVMCKQDDAP PVIN CNG C
Sbjct: 178 LAQVENEYGPMESVMGAGAKPYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP++WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 238 --DYFSPNSNSKPTMWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RT+ F+ T Y AP+DEYGL+R+PKWGHL++LH AIK L++G +
Sbjct: 296 GGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQ 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG ++A+VF+ + G CAAFL N A V+F Y+LP SIS+LPDCK FNT
Sbjct: 356 SLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNT 415
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
VS + S + + W+ Y EA + D +GL++Q+S D SDY WY
Sbjct: 416 ATVS----EPSAPARMSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWY 471
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + NS+ + Q P L V S GH L FVNG+ G+ +G +D+ T V +
Sbjct: 472 TTYVNINSNEQFLKSGQWPQLTVYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMW 531
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGLP+ G E GV + + +N W YQ+GL GE
Sbjct: 532 QGSNKISILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGES 591
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L + S G + V W S + LTW+K F AP+G+ P+AL++ SMGKG+AWVNG+ IG
Sbjct: 592 LGVQSVAGSSSVEWGSAAG-KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIG 650
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHF---CAIIKATNTYHVPRAFLKPTGNLLVLLEE 613
RYW S+K S T + C + + YHVPR++L P+GNLLVLLEE
Sbjct: 651 RYW-SYKASSSGGCGGCSYAGTYSETKCQTGCGDV-SQRYYHVPRSWLNPSGNLLVLLEE 708
Query: 614 ENGNPLGITVDT 625
G+ G+ + T
Sbjct: 709 FGGDLPGVKLVT 720
>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 721
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 307/671 (45%), Positives = 387/671 (57%), Gaps = 56/671 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F R D+++F+K +Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLVQQAGLYVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 115 PYICAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP PVI+ CNG C
Sbjct: 175 MSQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGYYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE+WT +Y +GG R A+D+AF VA FI GS+VNYYMYH
Sbjct: 235 -ENFK-PNKNTKPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ I YD APLDEYGL EPK+ HL+ LH AIK C L+ V
Sbjct: 293 GGTNFGRTSGGLFIATSYDYDAPLDEYGLQNEPKYEHLRNLHKAIKQCEPALVATDPKVQ 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA VF T G CAAF+ N D + F N Y+LP SISILPDCKTV +NT
Sbjct: 353 SLGYNLEAHVF-STPGACAAFIANYDTKSYAKATFGNGQYDLPPWSISILPDCKTVVYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+V + K+ N F W+ Y E + + A L +Q++ +D+SDY W
Sbjct: 412 AKVGNSWLKKMTPVNSAF----AWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLW 467
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + N++ N Q+P L S GH+LH F+N + G+ G N T + V L
Sbjct: 468 YMTDVYINANEGFLKNGQSPVLTAMSAGHVLHVFINDQLAGTVWGGLANPKLTFSDNVKL 527
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N +LLSV VGLP+ G E AGV + + ++ W Y+VGL GE
Sbjct: 528 RVGNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRDLSSQKWSYKVGLKGE 587
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S+ + + LTWYKTTF APAGNDP+AL+L SMGKGE WVNG+
Sbjct: 588 SLSLHTESGSSSVEWIRGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGR 647
Query: 554 SIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
SIGR+W + + G+ + YA T T + YHVPR++L GN LV+ E
Sbjct: 648 SIGRHWPGY-IAHGSCNACNYAGFYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFE 706
Query: 613 EENGNPLGITV 623
E G+P GI +
Sbjct: 707 EWGGDPNGIAL 717
>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 316/804 (39%), Positives = 428/804 (53%), Gaps = 91/804 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+KEGGLD I+TYVFWN HEP + QYDFS D++RFIK IQ++GLY LRIG
Sbjct: 56 MWPDLIKKSKEGGLDTIETYVFWNAHEPVRRQYDFSANLDLVRFIKTIQNEGLYAVLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGI-----------------------------VFRSDNKPY 91
P++ +EW YGG P+WLH++ GI +F S P
Sbjct: 116 PYVCAEWNYGGFPVWLHNLPGIEELRTTNPVFMNEMQNFTTLIVDMMKQENLFASQGGPI 175
Query: 92 ---KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
+IENEY + ++ + G YV W A MA + GVPW+MC+QDDAP P IN CNG
Sbjct: 176 ILAQIENEYGNVMTSYGDAGKAYVNWCANMADSQNVGVPWIMCQQDDAPEPTINTCNGWY 235
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + PN+ P +WTE+WT +++ WGG+ +R+ +D+AF VA F G++ NYYMY
Sbjct: 236 CDQF--TPNNAKSPKMWTENWTGWFKSWGGRDPVRTPEDLAFSVARFFQLGGTFQNYYMY 293
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNF R A IT YD APLDEYG + +PK+GHLK+LHAA+K + L++G
Sbjct: 294 HGGTNFDRMAGGPYITTTYDYNAPLDEYGNLNQPKFGHLKQLHAALKSIEKALVSGNVTT 353
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
L + G + F N +E V + + +P S+SILPDC+ +N
Sbjct: 354 TDLTDSVSITEYATDKGK-SCFFSNINETTDALVNYLGKDFNVPAWSVSILPDCQEEVYN 412
Query: 328 TERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEG------LLDQISAAKD 381
T +V+TQ + K N K +++ + E+ N DNT +G L+DQ AA D
Sbjct: 413 TAKVNTQTSVMVKKEN-KAENEPEVLEWMWRPENIDNTARLGKGQVTANKLIDQKDAAND 471
Query: 382 ASDYFWYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
ASDY WY + + + L + GHI+HAFVNGE+ GS S+D ++
Sbjct: 472 ASDYLWYMTSVNLKKKDPIWSNEMTLRINVSGHIVHAFVNGEHIGSQWASYDVYNYIFEQ 531
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV---------HRVRVQDKSFTNCSWGY 488
V L+ G N +LLS T+GL + GA + +G+ H K +N W Y
Sbjct: 532 EVKLKPGKNIISLLSATIGLKNYGAQYDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSY 591
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
+VGL G + +++S W S P R +TWYKTTF+ P G DP+ L+LQ +GKG
Sbjct: 592 EVGLHGFENRLFSPESRFATKWQSGNLPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGM 651
Query: 548 AWVNGQSIGRYWVSFKTSKG---NPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKP 603
AWVNG SIGRYW SF G P + + + C K T YHVPR++L
Sbjct: 652 AWVNGHSIGRYWPSFIAEDGCSDEPCDYRGSYTNTKCVRDCG--KPTQQWYHVPRSWLNE 709
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
N LVL EE GNP + TIA+ K CGH +K
Sbjct: 710 GDNTLVLFEEFGGNPSLVNFKTIAMEKACGHAY-------------------------EK 744
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSH-SQGVVERACIGKSRCSIPL 722
+++ SC GK+I+ I FASFG+P G C ++ GSC + + +VE CIGK C I +
Sbjct: 745 KSLELSCQ-GKEITGIKFASFGDPTGSCGNFSKGSCEGKNDAMKIVEDLCIGKESCVIDI 803
Query: 723 LSRYFGGDPCP-GIHKALLVDAQC 745
FG C G+ K L V+A C
Sbjct: 804 SEDTFGATNCALGVVKRLAVEAVC 827
>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 302/672 (44%), Positives = 392/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F R D+++FIK +Q GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKLVQQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V +VFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPDMVFRTDNEPFKAAMQKFTEKIVGMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE G Y W AKMA TGVPW+MCKQDDAP +IN CNG C
Sbjct: 179 LSQIENEYGPIEWEIGAPGKAYTKWVAKMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS KP +WTE+WT ++ +GG R A+DIA VA FI GS++NYYMYH
Sbjct: 239 -ENFK-PNSDKKPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y APLDEYGL REPK+ HLK LH IKLC L++ V S
Sbjct: 297 GGTNFDRTAGEFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QEA VF+ S CAAFL N + A V F +Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEAQVFKSQSS-CAAFLSNYNTSSAARVSFGGSTYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 RVSTQYNKRSKTSNLKF---DSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDY 385
+V R+ + ++K ++ W Y E I + DN +GL++QIS +D +DY
Sbjct: 416 KVQV----RTSSIHMKMVPTNTLFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDY 471
Query: 386 FWY----TFRFHYNSSNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
FWY T + P L++ S GH LH FVNG+ G+A+GS + T +
Sbjct: 472 FWYLTDITISPDEKFLTGEDPLLNIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIK 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
L G N ALLS+ GLP+ G E GV V + W Y++G G
Sbjct: 532 LHAGVNKLALLSIAAGLPNVGVHYETWNTGVLGPVTLKGVNSGTWDMSQWKWSYKIGTKG 591
Query: 495 EKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E L I++ G + V W S+ + + LTWYK+TF PAGN+P+AL++ +MGKG+ W+NG
Sbjct: 592 EALSIHTVTGSSTVEWKQGSLVATKQPLTWYKSTFDTPAGNEPLALDMNTMGKGQTWING 651
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLL 611
Q+IGR+W ++ T++G + YA + +A+ YHVPR++LKPT NL+V+L
Sbjct: 652 QNIGRHWPAY-TARGKCERCSYAGTFTENKCLSNCGEASQRWYHVPRSWLKPTNNLVVVL 710
Query: 612 EEENGNPLGITV 623
EE G P GI++
Sbjct: 711 EEWGGEPNGISL 722
>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis
sativus]
Length = 827
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 316/804 (39%), Positives = 428/804 (53%), Gaps = 91/804 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+KEGGLD I+TYVFWN HEP + QYDFS D++RFIK IQ++GLY LRIG
Sbjct: 56 MWPDLIKKSKEGGLDTIETYVFWNAHEPVRRQYDFSANLDLVRFIKTIQNEGLYAVLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGI-----------------------------VFRSDNKPY 91
P++ +EW YGG P+WLH++ GI +F S P
Sbjct: 116 PYVCAEWNYGGFPVWLHNLPGIEELRTTNPVFMNEMQNFTTLIVDMMKQENLFASQGGPI 175
Query: 92 ---KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
+IENEY + ++ + G YV W A MA + GVPW+MC+QDDAP P IN CNG
Sbjct: 176 ILAQIENEYGNVMTSYGDAGKAYVNWCANMADSQNVGVPWIMCQQDDAPEPTINTCNGWY 235
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + PN+ P +WTE+WT +++ WGG+ +R+ +D+AF VA F G++ NYYMY
Sbjct: 236 CDQF--TPNNAKSPKMWTENWTGWFKSWGGRDPVRTPEDLAFSVARFFQLGGTFQNYYMY 293
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNF R A IT YD APLDEYG + +PK+GHLK+LHAA+K + L++G
Sbjct: 294 HGGTNFDRMAGGPYITTTYDYNAPLDEYGNLNQPKFGHLKQLHAALKSIEKALVSGNVTT 353
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
L + G + F N +E V + + +P S+SILPDC+ +N
Sbjct: 354 TDLTDSVSITEYATDKGK-SCFFSNINETTDALVNYLGKDFNVPAWSVSILPDCQEEVYN 412
Query: 328 TERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEG------LLDQISAAKD 381
T +V+TQ + K N K +++ + E+ N DNT +G L+DQ AA D
Sbjct: 413 TAKVNTQTSVMVKKEN-KAENEPEVLEWMWRPENIDNTARLGKGQVTANKLIDQKDAAND 471
Query: 382 ASDYFWYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
ASDY WY + + + L + GHI+HAFVNGE+ GS S+D ++
Sbjct: 472 ASDYLWYMTSVNLKKKDPIWSNEMTLRINVSGHIVHAFVNGEHIGSQWASYDVYNYIXEQ 531
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV---------HRVRVQDKSFTNCSWGY 488
V L+ G N +LLS T+GL + GA + +G+ H K +N W Y
Sbjct: 532 EVKLKPGKNIISLLSATIGLKNYGAQYDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSY 591
Query: 489 QVGLIGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
+VGL G + +++S W S P R +TWYKTTF+ P G DP+ L+LQ +GKG
Sbjct: 592 EVGLHGFENRLFSPESRFATKWQSGNLPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGM 651
Query: 548 AWVNGQSIGRYWVSFKTSKG---NPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKP 603
AWVNG SIGRYW SF G P + + + C K T YHVPR++L
Sbjct: 652 AWVNGHSIGRYWPSFIAEDGCSDEPCDYRGSYTNTKCVRDCG--KPTQQWYHVPRSWLNE 709
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
N LVL EE GNP + TIA+ K CGH +K
Sbjct: 710 GDNTLVLFEEFGGNPSLVNFKTIAMEKACGHAY-------------------------EK 744
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSH-SQGVVERACIGKSRCSIPL 722
+++ SC GK+I+ I FASFG+P G C ++ GSC + + +VE CIGK C I +
Sbjct: 745 KSLELSCQ-GKEITGIKFASFGDPTGSCGNFSKGSCEGKNDAMKIVEDLCIGKESCVIDI 803
Query: 723 LSRYFGGDPCP-GIHKALLVDAQC 745
FG C G+ K L V+A C
Sbjct: 804 SEDTFGATNCALGVVKRLAVEAVC 827
>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
Length = 835
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 314/802 (39%), Positives = 424/802 (52%), Gaps = 93/802 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK GGLD I+TYVFWN+HEP + +YDFSG D+IRFI+ IQ++GLY LRIG
Sbjct: 70 MWPDLIRKAKAGGLDAIETYVFWNVHEPLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIG 129
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EWTYGG P+WLH++ GI FR+ NK +
Sbjct: 130 PYVCAEWTYGGFPMWLHNMPGIEFRTANKVFMNEMQNFTTLIVDMAKQEKLFASQGGPII 189
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY I + + G YV W A MA GVPW+MC+Q DAP P+IN CNG C
Sbjct: 190 IAQIENEYGNIMAPYGDAGKVYVDWCAAMANSLDIGVPWIMCQQSDAPQPMINTCNGWYC 249
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN+PN P +WTE+WT +++ WGGK R+A+D+++ VA F G++ NYYMYH
Sbjct: 250 -DSFT-PNNPNSPKMWTENWTGWFKNWGGKDPHRTAEDLSYSVARFFQTGGTFQNYYMYH 307
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR A IT YD APLDE+G + +PKWGHLK+LH +K L G I
Sbjct: 308 GGTNFGRVAGGPYITTSYDYDAPLDEFGNLNQPKWGHLKDLHTVLKSMEETLTEGNITTI 367
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+G E V+ T V + F N++ T + Y +P S+SILPDCK +NT
Sbjct: 368 DMGNSVEVTVY-ATQKVSSCFFSNSNTTNDATFTYGGTEYTVPAWSVSILPDCKKEVYNT 426
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILNFDNTLLRAEG------LLDQISAAK 380
+V+ Q + K N D + KW E I D+T + +G L+DQ
Sbjct: 427 AKVNAQTSVMVKNKNEAEDQPASLKWSWRPEMI---DDTAVLGKGQVSANRLIDQ-KTTN 482
Query: 381 DASDYFWYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
D SDY WY + + L V + GHILHA+VNGEY GS ++ ++
Sbjct: 483 DRSDYLWYMNSVDLSEDDLVWTDNMTLRVNATGHILHAYVNGEYLGSQWATNGIFNYVFE 542
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRV-----RVQD----KSFTNCSWG 487
V L+ G N ALLS T+G + GAF + +G+ R D K ++ W
Sbjct: 543 EKVKLKPGKNLIALLSATIGFQNYGAFYDLVQSGISGPVEIVGRKGDETIIKDLSSHKWS 602
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSP-TRQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
Y+VG+ G +++Y K W P R LTWYKTTF+AP G D + ++LQ +GKG
Sbjct: 603 YKVGMHGMAMKLYDPESPYK--WEEGNVPLNRNLTWYKTTFKAPLGTDAVVVDLQGLGKG 660
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPT 604
EAWVNGQS+GRYW S G + Y + C YHVPR+FL
Sbjct: 661 EAWVNGQSLGRYWPSSIAEDGCNATCDYRGPYTNTKCVRNCG-NPTQRWYHVPRSFLTAD 719
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
N LVL EE GNP + T+ I CG+ +++ L+ R
Sbjct: 720 ENTLVLFEEFGGNPSLVNFQTVTIGTACGNAYENNVLELACQNR---------------- 763
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSH-SQGVVERACIGKSRCSIPLL 723
IS I FASFG+P G C ++ GSC + + ++++AC+GK CS+ +
Sbjct: 764 ----------PISDIKFASFGDPQGSCGSFSKGSCEGNKDALDIIKKACVGKESCSLDVS 813
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
+ FG C I K L V+A C
Sbjct: 814 EKAFGSTSCGSIPKRLAVEAVC 835
>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
Length = 781
Score = 543 bits (1400), Expect = e-151, Method: Compositional matrix adjust.
Identities = 292/693 (42%), Positives = 400/693 (57%), Gaps = 63/693 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LI AKEGG+DVI+TYVFWN HE G Y F GR D+++F K +Q G+Y+ LRIG
Sbjct: 57 MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTI--------EPAFHEKGPP-- 110
PF+ +EW +GG+P+WLH + G VFR+ N+P+ E T E F +G P
Sbjct: 117 PFVAAEWNFGGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPII 176
Query: 111 ---------------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
Y LWAAKMAV +T VPW+MC+Q DAP PVI+ CN C
Sbjct: 177 LSQIENEYGYYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P SP +P +WTE+W +++ +GG+ R +D+AF VA F K GS NYYMYH
Sbjct: 237 DQF--TPTSPKRPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL R PKWGHLKELH AIKLC LL G I
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNI 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA ++ ++SG CAAF+ N D++ V+FRN SY LP S+SILPDCK V FNT
Sbjct: 355 SLGPSVEADIYTDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-----KWEEYREAILNFDNTLLRAEGLLDQISAAKDAS 383
+VS+ N + SD+ KW+ ++E + G +D I+ KD +
Sbjct: 415 AKVSSPTNIVAMIPEHLQQSDKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTT 474
Query: 384 DYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY W+T +++ ++ L ++S GH LHAFVN +Y G+ G+ + +FT +N
Sbjct: 475 DYLWHTTSILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKN 534
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFTNCSWGYQVGL 492
+ LR G N+ A+LS+TVGL +G F + AGV V++ + ++ +W Y++G+
Sbjct: 535 PISLRAGKNEIAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGV 594
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTR--QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
+GE L IY G+N V W+S P + LTWYK AP+G++P+ L++ MGKG AW+
Sbjct: 595 LGEHLSIYQGEGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWL 654
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAI---IKATNTYHVPRAFLKPTGNL 607
NG+ IGRYW K + + C + YHVPR++ KP+GN+
Sbjct: 655 NGEEIGRYWPRISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNV 714
Query: 608 LVLLEEENGNPLGITV--------DTIAIRKVC 632
LV+ EE+ G+P IT +I + KVC
Sbjct: 715 LVIFEEKGGDPTKITFVRHCHNPYSSIVVEKVC 747
>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
Length = 848
Score = 543 bits (1400), Expect = e-151, Method: Compositional matrix adjust.
Identities = 315/804 (39%), Positives = 415/804 (51%), Gaps = 91/804 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+KEGGLD I+TYVFWN HEP + QYDFSG D++RFIK IQ++GLY LRIG
Sbjct: 77 MWPDLIKKSKEGGLDAIETYVFWNSHEPSRRQYDFSGNLDLVRFIKTIQAEGLYAVLRIG 136
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ G R+ N +
Sbjct: 137 PYVCAEWNYGGFPMWLHNLPGCELRTANSVFMNEMQNFTSLIVDMMKDENLFASQGGPII 196
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
++ENEY + A+ G Y+ W + MA GVPW+MC+Q DAP P+IN CNG C
Sbjct: 197 LAQVENEYGNVMSAYGAAGKTYIDWCSNMAESLDIGVPWIMCQQSDAPQPMINTCNGWYC 256
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+ N P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMYH
Sbjct: 257 DQF--TPNNANSPKMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYH 314
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG + +PKWGHLK+LH + L G + I
Sbjct: 315 GGTNFGRTAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHDILHSMEYTLTHGNISTI 374
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
A ++ T A F N +E T++F+ Y +P S+SILPDC+ V +NT
Sbjct: 375 DYDNSVTATIY-ATDKESACFFGNANETSDATIVFKGTEYNVPAWSVSILPDCENVGYNT 433
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILNFDNTLLRAEG------LLDQISAAK 380
+V TQ K N D S KW E N T L +G L+DQ +AA
Sbjct: 434 AKVKTQTAIMVKQKNEAEDQPSSLKWSWIPE---NTHTTSLLGKGHAHARQLIDQKAAAN 490
Query: 381 DASDYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
DASDY WY H + + L V GH+LHA+VNG++ GS + S+
Sbjct: 491 DASDYLWYMTSLHIKKDDPVWSSDMSLRVNGSGHVLHAYVNGKHLGSQFAKYGVFSYVFE 550
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--------HRVRVQ-DKSFTNCSWG 487
++ LR G N +LLS TVGL + G + G+ HR + K ++ W
Sbjct: 551 KSLKLRPGKNVISLLSATVGLQNYGPMFDLVQTGIPGPVEIIGHRGDEKVVKDLSSHKWS 610
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
Y VGL G ++YS+ + W PT + + WYKTTF+AP G DP+ L+LQ MGKG
Sbjct: 611 YSVGLNGFHNELYSSNSRHASRWVEQDLPTNKMMIWYKTTFKAPLGKDPVVLDLQGMGKG 670
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKP 603
AWVNG +IGRYW SF + S + C T YHVPR+F
Sbjct: 671 FAWVNGNNIGRYWPSFLAEEDGCSTEVCDYRGAYDNNKCVTNCGKPTQRWYHVPRSFFND 730
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
N LVL EE GNP G+ T+ + KV G G+
Sbjct: 731 YENTLVLFEEFGGNPAGVNFQTVTVGKVSGSA-------------------------GEG 765
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ-GVVERACIGKSRCSIPL 722
T++ SC GK IS I FASFG+P G Y G+C S+ +V++AC+GK C +
Sbjct: 766 ETIELSCN-GKSISAIEFASFGDPQGTSGAYVKGTCEGSNDAFSIVQKACVGKETCKLEA 824
Query: 723 LSRYFGGDPC-PGIHKALLVDAQC 745
FG C + L V A C
Sbjct: 825 SKDVFGPTSCGSDVVNTLAVQATC 848
>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 832
Score = 543 bits (1398), Expect = e-151, Method: Compositional matrix adjust.
Identities = 312/806 (38%), Positives = 434/806 (53%), Gaps = 91/806 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI KAKEGGLDVI+TYVFWN HEPQ QYDFSG D+++FIK IQ +GLY LRIG
Sbjct: 52 MWPSLINKAKEGGLDVIETYVFWNAHEPQPRQYDFSGNLDLVKFIKTIQKEGLYAMLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ + FR++N Y
Sbjct: 112 PYVCAEWNYGGFPVWLHNMPNMEFRTNNTAYMNEMQTFTTLIVDKMRHENLFASQGGPII 171
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY I + E G YV W A++A + GVPWVMC+Q DAP P+IN CNG C
Sbjct: 172 LAQIENEYGNIMSEYGENGKQYVQWCAQLAESYKIGVPWVMCQQSDAPDPIINTCNGWYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP +WTE+WT +++ WGG R+A+D+A+ VA F G++ NYYMYH
Sbjct: 232 DQF--SPNSKSKPKMWTENWTGWFKNWGGPIPHRTARDVAYAVARFFQYGGTFQNYYMYH 289
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD APLDEYG +PKWGHLK+LH +K L GT N
Sbjct: 290 GGTNFGRTSGGPYITTSYDYDAPLDEYGNKNQPKWGHLKQLHELLKSMEDVLTQGTTNHT 349
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G L A V+ SG A FL N + T++F++ Y +P S+SILP+C +NT
Sbjct: 350 DYGNLLTATVY-NYSGKSACFLGNANSSNDATIMFQSTQYIVPAWSVSILPNCVNEVYNT 408
Query: 329 ERVSTQYNKRSKTSNLKFDSDEK------WEEYREAILNF-DNTLL-----RAEGLLDQI 376
+++ Q + N K D++E+ W+ E + D +L +A LLDQ
Sbjct: 409 AKINAQTSIMVMKDN-KSDNEEEPHSTLNWQWMHEPHVQMKDGQVLGSVSRKAAQLLDQK 467
Query: 377 SAAKDASDYFWYTFRFHYNSSNA-QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
D SDY WY + ++ + + V ++GH+LH FVNG G +G + SFT
Sbjct: 468 VVTNDTSDYLWYITSVDISENDPIWSKIRVSTNGHVLHVFVNGAQAGYQYGQNGKYSFTY 527
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG----VHRVRVQD-----KSFTNCSW 486
+ L++GTN+ +LLS TVGLP+ GA G V V +Q+ K TN +W
Sbjct: 528 EAKIKLKKGTNEISLLSGTVGLPNYGAHFSNVSVGVCGPVQLVALQNNTEVVKDITNNTW 587
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGK 545
Y+VGL GE +++Y N W++ PT R WYKT F++P G DP+ ++L+ + K
Sbjct: 588 NYKVGLHGEIVKLY--CPENNKGWNTNGLPTNRVFVWYKTLFKSPKGTDPVVVDLKGLKK 645
Query: 546 GEAWVNGQSIGRYWVSF-KTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKP 603
G+AWVNG +IGRYW + G + Y + + T YHVPR+FL+
Sbjct: 646 GQAWVNGNNIGRYWTRYLADDNGCTATCNYRGPYSSDKCITKCGRPTQRWYHVPRSFLRQ 705
Query: 604 TG-NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK 662
N LVL EE G+P + T+ + K+C + ++ L
Sbjct: 706 DNQNTLVLFEEFGGHPNEVKFATVMVEKICANSYEGNVLEL------------------- 746
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
SC + ISKI FASFG P+G+C + C S ++ ++ ++C+GK CS+ +
Sbjct: 747 ------SCREEQVISKIKFASFGVPEGECGSFKKSQCESPNALSILSKSCLGKQSCSVQV 800
Query: 723 LSRYFGGDPC--PGIHKALLVDAQCR 746
R G C P L ++A C
Sbjct: 801 SQRMLGPTGCRMPQNQNKLAIEAVCE 826
>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
Length = 729
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 296/667 (44%), Positives = 387/667 (58%), Gaps = 50/667 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY FS R D++RF+K ++ GLYV LRIG
Sbjct: 68 MWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN P+K
Sbjct: 128 PYVCAEWNFGGFPVWLKYVPGVSFRTDNGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENE+ +E PY WAAKMAV +TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 188 MSQVENEFGPMESVGGSGAKPYANWAAKMAVGTNTGVPWVMCKQDDAPDPVINTCNGFYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KPS+WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 248 --DYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVARFIQKGGSFVNYYMYH 305
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DE+GL+R+PKWGHL++LH AIK L++ +
Sbjct: 306 GGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRAIKQAEPVLVSADPTIE 365
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G ++A+VF+ +G CAAFL N AV V F Y LP SISILPDCKT FNT
Sbjct: 366 SIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPAWSISILPDCKTAVFNT 425
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V ++F W+ Y E + ++ +GL++Q+S D SDY WY
Sbjct: 426 ATVKEPTLMPKMNPVVRF----AWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWY 481
Query: 389 TFRFHYNSSN---AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
T + +++ Q+P L V S GH + FVNG+ GS +G +DN T V + QG
Sbjct: 482 TTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQG 541
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQ 498
+N ++LS VGLP+ G E GV + K ++ W YQVGL GE L
Sbjct: 542 SNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLG 601
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
+++ G + V W + LTW+K F APAGNDP+AL++ SMGKG+ WVNG +GRY
Sbjct: 602 LHTVTGSSAVEWGG-PGGYQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRY 660
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
W S+K S G + + YHVPR++LKP GNLLV+LEE G+
Sbjct: 661 W-SYKASGGCGGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDL 719
Query: 619 LGITVDT 625
G+++ T
Sbjct: 720 AGVSLAT 726
>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
Length = 729
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 296/667 (44%), Positives = 387/667 (58%), Gaps = 50/667 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY FS R D++RF+K ++ GLYV LRIG
Sbjct: 68 MWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN P+K
Sbjct: 128 PYVCAEWNFGGFPVWLKYVPGVSFRTDNGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENE+ +E PY WAAKMAV +TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 188 MSQVENEFGPMESVGGSGAKPYANWAAKMAVRTNTGVPWVMCKQDDAPDPVINTCNGFYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KPS+WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 248 --DYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVARFIQKGGSFVNYYMYH 305
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DE+GL+R+PKWGHL++LH AIK L++ +
Sbjct: 306 GGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRAIKQAEPVLVSADPTIE 365
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G ++A+VF+ +G CAAFL N AV V F Y LP SISILPDCKT FNT
Sbjct: 366 SIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPAWSISILPDCKTAVFNT 425
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V ++F W+ Y E + ++ +GL++Q+S D SDY WY
Sbjct: 426 ATVKEPTLMPKMNPVVRF----AWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWY 481
Query: 389 TFRFHYNSSN---AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
T + +++ Q+P L V S GH + FVNG+ GS +G +DN T V + QG
Sbjct: 482 TTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQG 541
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQ 498
+N ++LS VGLP+ G E GV + K ++ W YQVGL GE L
Sbjct: 542 SNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLG 601
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
+++ G + V W + LTW+K F APAGNDP+AL++ SMGKG+ WVNG +GRY
Sbjct: 602 LHTVTGSSAVEWGG-PGGYQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRY 660
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
W S+K S G + + YHVPR++LKP GNLLV+LEE G+
Sbjct: 661 W-SYKASGGCGGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDL 719
Query: 619 LGITVDT 625
G+++ T
Sbjct: 720 AGVSLAT 726
>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
Length = 825
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 310/803 (38%), Positives = 420/803 (52%), Gaps = 90/803 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+KEGGLD I+TYVFWN+HEP + QYDF G D++RFIK +Q +GLY LRIG
Sbjct: 55 MWPDLIKKSKEGGLDAIETYVFWNVHEPSRRQYDFGGNLDLVRFIKAVQDEGLYAVLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ GI R+ N +
Sbjct: 115 PYVCAEWNYGGFPVWLHNMPGIELRTANSIFMNEMQNFTSLIVDMMKQEQLFASQGGPII 174
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
++ENEY + ++ G Y+ W A MA + GVPW+MC+Q DAP P+IN CNG C
Sbjct: 175 IAQVENEYGNVMSSYGAAGKAYIDWCANMAESLNIGVPWIMCQQSDAPDPMINTCNGWYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P++PN P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMYH
Sbjct: 235 DQF--TPSNPNSPKMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFFQTGGTFQNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDE+G + +PKWGHLK+LH + L +GT + +
Sbjct: 293 GGTNFGRTAGGPYITTSYDYDAPLDEFGNLNQPKWGHLKQLHDVLHSMEEILTSGTVSSV 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
A ++ T + FL N +E T+ F+ +Y +P S+SILPDC V +NT
Sbjct: 353 DYDNSVTATIY-ATDKESSCFLSNANETSDATIEFKGTTYTIPAWSVSILPDCANVGYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILNFDNTLLRAEG------LLDQISAAK 380
+V TQ + K N D + W E N D T+L +G ++DQ + A
Sbjct: 412 AKVKTQTSVMVKRDNKAEDEPTSLNWSWRPE---NVDKTVLLGQGHIHAKQIVDQKAVAN 468
Query: 381 DASDYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
DASDY WY + + + GHILHA+VNGEY GS + ++
Sbjct: 469 DASDYLWYMTSVDLKKDDLIWSKDMSIRINGSGHILHAYVNGEYLGSQWSEYSVSNYVFE 528
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG----VHRV-RVQD----KSFTNCSWG 487
+V L+ G N LLS TVGL + GA + AG V V R D K +N W
Sbjct: 529 KSVKLKHGRNLITLLSATVGLANYGANYDLIQAGILGPVELVGRKGDETIIKDLSNNRWS 588
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
Y+VGL+G + ++Y + + W PT + LTWYKTTF+AP G DP+ L+LQ +GKG
Sbjct: 589 YKVGLLGLEDKLYLSDSKHASKWQEQELPTNKMLTWYKTTFKAPLGTDPVVLDLQGLGKG 648
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKP 603
AW+NG SIGRYW SF S + C T YHVPR+FL+
Sbjct: 649 MAWINGNSIGRYWPSFLAEDDGCSTDLCDYRGPYDNNKCVSNCGKPTQRWYHVPRSFLQD 708
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
N LVL EE GNP + T+ C GD +
Sbjct: 709 NENTLVLFEEFGGNPSQVNFQTVVTGVAC------------------VSGD-------EG 743
Query: 664 PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSH-SQGVVERACIGKSRCSIPL 722
V+ SC G+ IS + FASFG+P G C GSC + + +V++AC+G CS+ +
Sbjct: 744 EVVEISCN-GQSISAVQFASFGDPQGTCGSSVKGSCEGTEDALLIVQKACVGNESCSLEV 802
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
+ FG C L V+ C
Sbjct: 803 SHKLFGSTSCDNGVNRLAVEVLC 825
>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
Length = 740
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 296/672 (44%), Positives = 390/672 (58%), Gaps = 56/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY F+ R D++RF+K ++ GLYV LRIG
Sbjct: 75 MWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQYHFADRYDLVRFVKLVRQAGLYVHLRIG 134
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 135 PYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFKAAMQKFVEKIVSMMKSEGLFEWQGGPII 194
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENE+ +E PY WAA+MAV +TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 195 MAQVENEFGPMESVVGSGAKPYAHWAAQMAVGTNTGVPWVMCKQDDAPDPVINTCNGFYC 254
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP++WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 255 --DYFTPNRKYKPTMWTEAWTGWFTKFGGALPHRPVEDLAFAVARFIQKGGSFVNYYMYH 312
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DE+GL+R+PKWGHL++LH AIK L++G +
Sbjct: 313 GGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRAIKQAEPALISGDPTIQ 372
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G ++A++F+ +G CAAFL N + AV + F Y+LP SISILPDCKT FNT
Sbjct: 373 SIGNYEKAYIFKSKNGACAAFLSNYHMKTAVKIRFDGRHYDLPAWSISILPDCKTAVFNT 432
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V L F W+ Y E + D++ GL++Q+S D SDY WY
Sbjct: 433 ATVKEPTLLPKMNPVLHF----AWQSYSEDTNSLDDSAFTRNGLVEQLSLTWDKSDYLWY 488
Query: 389 TFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + S L V S GH + FVNG GS +G +DN T V +
Sbjct: 489 TTHVSIGGNEQFLKSGQWPQLTVYSAGHSMQVFVNGRSYGSVYGGYDNPKLTFNGHVKMW 548
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGLP++G E GV + + ++ W YQVGL GE
Sbjct: 549 QGSNKISILSSAVGLPNNGNHFELWNVGVLGPVTLSGLNEGKRDLSHQKWTYQVGLKGES 608
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L +++ G + V W+ + LTW+K F APAG+DP+AL++ SMGKG+ WVNG G
Sbjct: 609 LGLHTVTGSSAVEWAG-PGGKQPLTWHKALFNAPAGSDPVALDMGSMGKGQIWVNGHHAG 667
Query: 557 RYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
RYW S++ G+ + YA + C I + YHVPR++LKP+GNLLV+LEE
Sbjct: 668 RYW-SYRAYSGSCRRCSYAGTYREDQCLSNCGDI-SQRWYHVPRSWLKPSGNLLVVLEEY 725
Query: 615 NGNPL-GITVDT 625
G L G+T+ T
Sbjct: 726 GGGDLAGVTLAT 737
>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
Length = 823
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 305/803 (37%), Positives = 420/803 (52%), Gaps = 92/803 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ K++EGGLD I+TYVFW+ HEP + +YDFSG D+IRF+K IQ +GLY LRIG
Sbjct: 55 MWPDLVKKSREGGLDAIETYVFWDSHEPARREYDFSGNLDLIRFLKTIQDEGLYAVLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ G+ R+ N +
Sbjct: 115 PYVCAEWNYGGFPVWLHNMPGVQMRTANDVFMNEMRNFTTLIVNMVKQENLFASQGGPVI 174
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ ++G Y+ W A MA H GVPW+MC+Q DAP P+IN CNG C
Sbjct: 175 LAQIENEYGNVMSSYGDEGKAYIEWCANMAQSLHIGVPWLMCQQSDAPEPMINTCNGWYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMYH
Sbjct: 235 DQF--TPNRPTSPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFYQLGGTFQNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG + +PKWGHLKELH + L G + +
Sbjct: 293 GGTNFGRTAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKELHDVLHSMEDTLTRGNISSV 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G ++ G + FL N D R T+ F+ + YE+P S+SILPDC+ V +NT
Sbjct: 353 DFGNSVSGTIYSTEKG-SSCFLTNTDSRNDTTINFQGLDYEVPAWSVSILPDCQDVVYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDE----KWE-EYREAILNFDNTLLRAEGLLDQISAAKDAS 383
+VS Q + K N+ D W E + + F + +LDQ AA D S
Sbjct: 412 AKVSAQTSVMVKKKNVAEDEPAALTWSWRPETNDKSILFGKGEVSVNQILDQKDAANDLS 471
Query: 384 DYFWYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
DY +Y + L + G +LH FVNGE+ GS + + +
Sbjct: 472 DYLFYMTSVSLKEDDPIWGDNMTLRITGSGQVLHVFVNGEFIGSQWAKYGVFDYVFEQQI 531
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV---------HRVRVQDKSFTNCSWGYQV 490
L +G N LLS TVG + GA + AGV H + K ++ W Y+V
Sbjct: 532 KLNKGKNTITLLSATVGFANYGANFDLTQAGVRGPVELVGYHDDEIIIKDLSSHKWSYKV 591
Query: 491 GLIGEKLQIYSNLGLNKVLWSSIRSPTRQL-TWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GL G + +YS+ + W PT ++ TWYK TF+AP G DP+ ++L +GKG AW
Sbjct: 592 GLEGLRQNLYSS---DSSKWQQDNYPTNKMFTWYKATFKAPLGTDPVVVDLLGLGKGLAW 648
Query: 550 VNGQSIGRYWVSFKTSKG---NPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTG 605
VNG SIGRYW SF G +P + + + + C K T YHVPR+FL G
Sbjct: 649 VNGNSIGRYWPSFIAEDGCSLDPCDYRGSYDNNKCVTNCG--KPTQRWYHVPRSFLNNEG 706
Query: 606 -NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
N LVL EE G+P + T AI C + +K
Sbjct: 707 DNTLVLFEEFGGDPSSVNFQTTAIGSACVNAE-------------------------EKK 741
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ-GVVERACIGKSRCSIPLL 723
++ SC G+ IS I FASFGNP G C ++ G+C +S+ +V++AC+G+ C+I +
Sbjct: 742 KIELSCQ-GRPISAIKFASFGNPLGTCGSFSKGTCEASNDALSIVQKACVGQESCTIDVS 800
Query: 724 SRYFGGDPC-PGIHKALLVDAQC 745
FG C + K L V+A C
Sbjct: 801 EDTFGSTTCGDDVIKTLSVEAIC 823
>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
Length = 731
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 303/671 (45%), Positives = 388/671 (57%), Gaps = 56/671 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G Y F R D+++F+K +Q GLYV LRIG
Sbjct: 61 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 121 PYACAEWNFGGFPVWLKYVPGMSFRTDNEPFKAAMQKFTEKIVNMMKQEQLFEPQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE G Y WAA+MAV +TGVPW+ CKQ+DAP P+I+ CN C
Sbjct: 181 LSQIENEYGPIEWELKAPGKAYAQWAAQMAVGLNTGVPWIACKQEDAPDPLIDTCNAYYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE WT+++ WG R A+D AF V FI GSY NYYMYH
Sbjct: 241 -EKFT-PNKSYKPKMWTEAWTAWFTSWGNPVLYRPAEDQAFSVLKFIQSGGSYANYYMYH 298
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL +PK+ HLK +H AIK + L++ V
Sbjct: 299 GGTNFGRTAGGPFVATSYDYDAPLDEYGLTNDPKYTHLKHMHKAIKQSEKALVSADATVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QEA V+ +SG CAAFL N D +V V F + Y+LP SISILPDCKT +NT
Sbjct: 359 SLGTNQEAHVYSSSSG-CAAFLANYDVSYSVKVNFGSGQYDLPAWSISILPDCKTEVYNT 417
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILN-FDNTLLRAEGLLDQISAAKDASDYFW 387
+V + T F W+ Y + + + F + +GL +Q+ KD+SDY W
Sbjct: 418 AKVLAPRVHKKMTPLGGF----TWDSYIDEVASGFASDTTTEDGLWEQLYMTKDSSDYLW 473
Query: 388 YTFRFHYNS-----SNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S +N + P L+VQS GH L+ FVNG+ GSA+GS+DN T +V L
Sbjct: 474 YMQDVKIGSDEAFLTNGKDPFLNVQSAGHFLNVFVNGKLIGSAYGSNDNPKLTFSQSVKL 533
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
G N ALLS +VGL + G E GV + T W Y+VG+ GE
Sbjct: 534 NVGVNKIALLSASVGLANVGLHFENYNVGVLGPVTLTGLNQGTVDMTKWKWSYKVGVQGE 593
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
KLQ+ + G + V W S+ + + LTWYK+TF AP GNDP+AL++ SMGKG+ W+NGQ
Sbjct: 594 KLQLNTVAGSSSVEWVKGSMLAKKQPLTWYKSTFNAPEGNDPVALDMISMGKGQIWINGQ 653
Query: 554 SIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLE 612
IGRYW ++ T++GN Y + T YHVPR++LKPTGNLLV+ E
Sbjct: 654 GIGRYWPAY-TAQGNCGGCSYGGYFTEKKCLTGCGQPTQRWYHVPRSWLKPTGNLLVVFE 712
Query: 613 EENGNPLGITV 623
E G+P GI++
Sbjct: 713 EWGGDPTGISM 723
>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
Length = 730
Score = 540 bits (1390), Expect = e-150, Method: Compositional matrix adjust.
Identities = 305/729 (41%), Positives = 410/729 (56%), Gaps = 69/729 (9%)
Query: 70 GGLPIWLHDVAGIVFRSDNKPYK-------------------------------IENEYQ 98
GG P+WL V GI FR+DN P+K IENEY
Sbjct: 1 GGFPVWLKYVPGISFRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEYG 60
Query: 99 TIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS 158
A G Y+ WAAKMAV +TGVPWVMCK+DDAP PVINACNG C + F PN
Sbjct: 61 PESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYC-DGFS-PNK 118
Query: 159 PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTA 218
P KP +WTE W+ ++ +GG + R QD+AF VA FI K GSY NYYMYHGGTNFGRTA
Sbjct: 119 PYKPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQKGGSYFNYYMYHGGTNFGRTA 178
Query: 219 AAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAF 277
+T YD AP+DEYGL REPK+ HLKELH AIKL L++ + SLG ++A+
Sbjct: 179 GGPFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSAGPTITSLGTYEQAY 238
Query: 278 VFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNK 337
++ CAAFL N + + A VLF N Y LP SISILPDC+ VA+NT V Q
Sbjct: 239 IYNSGPRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALVGVQ--- 295
Query: 338 RSKTSNLKF----DSDEKWEEYREAILNFDN-TLLRAEGLLDQISAAKDASDYFWYTFRF 392
TS++ S WE Y E I + D + A GLL+QI+ +D SDY WY
Sbjct: 296 ---TSHVHMLPTGTSLLSWETYDEVISSLDERARMTAVGLLEQINVTRDTSDYLWYMTSV 352
Query: 393 HYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
+SS + Q P L+VQS GH + F+NG+++GSA G+ ++ FT V+LR G+N
Sbjct: 353 DISSSESFLRGGQKPTLNVQSAGHAVRVFINGQFSGSAFGTREHRQFTFTGPVNLRAGSN 412
Query: 447 DGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIY 500
+LLS+ VGLP+ G E GV + + + T W YQVGL GE + +
Sbjct: 413 KISLLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGKRDLTWQKWSYQVGLKGEAMNLV 472
Query: 501 SNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGR 557
+ G + W S + LTWYK F AP GN+P+AL+L+SMGKG+ +NGQSIGR
Sbjct: 473 TPEGASSADWVRGSLAARSVQPLTWYKAYFNAPNGNEPLALDLRSMGKGQVRINGQSIGR 532
Query: 558 YWVSFKTSKGNPSQTQYAVNT-VTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
YW ++ +KG+ Y ++ +++ YHVPR++LKP NLLV+ EE G
Sbjct: 533 YWTAY--AKGDCEACSYTGHSGRQNVNLVVASPTQRWYHVPRSWLKPKQNLLVIFEELGG 590
Query: 617 NPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKI 676
+ I + ++ VC + +H P ++ + Q G K+ TV C G+ I
Sbjct: 591 DASKIALLRRSLTNVCANAFENH-PSMAKYSTSSQDGSKV-----KEATVNLQCGPGQSI 644
Query: 677 SKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIH 736
S I FASFG P G C + +G+CH+ +S+ ++E+ C+G+ CS+ + + FG DPCP +
Sbjct: 645 SAIEFASFGTPSGTCGSFHIGTCHAPNSRSIIEKKCVGQKSCSVTISNSIFGADPCPNVL 704
Query: 737 KALLVDAQC 745
K L V+A C
Sbjct: 705 KRLTVEAVC 713
>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
Length = 721
Score = 540 bits (1390), Expect = e-150, Method: Compositional matrix adjust.
Identities = 307/671 (45%), Positives = 384/671 (57%), Gaps = 56/671 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F R D+++F+K Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW GG P+WL V GI FR+DN+P+K
Sbjct: 115 PYICAEWNLGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP PVI+ CNG C
Sbjct: 175 LSQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE+WT +Y +GG R A+D+AF VA FI GS+VNYYMYH
Sbjct: 235 -ENFK-PNKNTKPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ I YD APLDEYGL EPK+ HL+ LH AIK L+ V
Sbjct: 293 GGTNFGRTSGGLFIATSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATDPKVQ 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA VF G CAAF+ N D + F N Y+LP SISILPDCKTV +NT
Sbjct: 353 SLGYNLEAHVF-SAPGACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+V + K+ N F W+ Y E + + A L +Q++ +D+SDY W
Sbjct: 412 AKVGYGWLKKMTPVNSAF----AWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLW 467
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + N++ N Q+P L V S GH+LH F+NG+ G+ G N T + V L
Sbjct: 468 YMTDVNVNANEGFLKNGQSPLLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKL 527
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N +LLSV VGLP+ G E AGV + + + W Y+VGL GE
Sbjct: 528 RAGNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGE 587
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S+ + + LTWYKTTF APAGNDP+AL+L SMGKGE WVNG+
Sbjct: 588 SLSLHTESGSSSVEWIQGSLVAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGR 647
Query: 554 SIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
SIGR+W + + G+ + YA T T + YHVPR++L GN LV+ E
Sbjct: 648 SIGRHWPGY-IAHGSCNACNYAGYYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFE 706
Query: 613 EENGNPLGITV 623
E G+P GI +
Sbjct: 707 EWGGDPNGIAL 717
>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
Length = 715
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 303/670 (45%), Positives = 391/670 (58%), Gaps = 54/670 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY FS R D++RF+K ++ GLYV LRIG
Sbjct: 52 MWPDLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFSDRYDLVRFVKLVKQAGLYVNLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WL V GI FR+DN P+K
Sbjct: 112 PYVCAEWNYGGFPVWLKYVPGISFRTDNGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E YV WAAKMAV + GVPW+MCKQDDAP PVIN CNG C
Sbjct: 172 LAQVENEYGPMESVMGSGAKSYVDWAAKMAVATNAGVPWIMCKQDDAPDPVINTCNGFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PNS NKPS+WTE W+ ++ +GG R +D+AF VA FI K GS++NYYMYH
Sbjct: 232 -DDFT-PNSKNKPSMWTEAWSGWFTAFGGTVPQRPVEDLAFAVARFIQKGGSFINYYMYH 289
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA F+ T Y AP+DEYGL+R+PKWGHL LH AIK L+ G V
Sbjct: 290 GGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQPKWGHLTNLHKAIKQAETALVAGDPTVQ 349
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
++G ++A+VF +SG CAAFL N A V F Y+LP SIS+LPDC+T +NT
Sbjct: 350 NIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARVAFNGRRYDLPAWSISVLPDCRTAVYNT 409
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V+ S + + W+ Y EA + D T +GL++Q+S D SDY WY
Sbjct: 410 ATVTAA----SSPAKMNPAGGFTWQSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWY 465
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + +S + Q P L V S GH + FVNG+Y G+A+G +D T V +
Sbjct: 466 TTYVNIDSGEQFLKSGQWPQLTVYSAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMW 525
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGLP+ G E GV + + + W YQ+GL GEK
Sbjct: 526 QGSNKISILSSAVGLPNVGTHYETWNIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEK 585
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L ++S G + V W + + +TW++ F APAG P+AL+L SMGKG+AWVNG IG
Sbjct: 586 LGVHSVSGSSSVEWGGA-AGKQPVTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIG 644
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEEN 615
RYW S+K S GN YA A+ YHVPR++L P+GNL+VLLEE
Sbjct: 645 RYW-SYKAS-GNCGGCSYAGTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFG 702
Query: 616 GNPLGITVDT 625
G+ G+T+ T
Sbjct: 703 GDLSGVTLMT 712
>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 726
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 303/675 (44%), Positives = 393/675 (58%), Gaps = 63/675 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVI+TYVFWN HEP GQY F R D+++FIK + GLYV LRIG
Sbjct: 59 MWPGLIQKAKEGGLDVIETYVFWNGHEPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPII 178
Query: 93 -----IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGM 147
IENEY +E G Y W A+MA+ TGVPW+MCKQ+DAP P+I+ CNG
Sbjct: 179 LAQGQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDAPSPIIDTCNGY 238
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
C E FK PNS NKP +WTE+WT +Y +GG R +DIA+ VA FI K GS+VNYYM
Sbjct: 239 YC-EDFK-PNSSNKPKMWTENWTGWYTEFGGAVPYRPVEDIAYSVARFIQKGGSFVNYYM 296
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
YHGGTNF RTA FM + Y APLDEYGL REPK+ HLK LH IKL LL+ V
Sbjct: 297 YHGGTNFDRTAGEFMASSYDYDAPLDEYGLPREPKYSHLKALHKVIKLSEPALLSADATV 356
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
SLG QEA+VF S CAAFL N DE A V+FR Y LP S+SILPDCKT +N
Sbjct: 357 TSLGAKQEAYVFWSKSS-CAAFLSNKDESSAARVMFRGFPYVLPPWSVSILPDCKTEFYN 415
Query: 328 TERVSTQYNKRSKT-SNLKFDSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASD 384
T +V+ R+ + +F W + EA N T R GL++QIS D SD
Sbjct: 416 TAKVNAPSVHRNMVPTGARFS----WGSFNEATPTANEAGTFAR-NGLVEQISMTWDKSD 470
Query: 385 YFWYTFRFHYNS-----SNAQAPL-DVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
YFWY S PL V S GH LH FVNG+ +G+A+G D+ T
Sbjct: 471 YFWYLTDITIGSGETFLKTGDFPLFTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFTQK 530
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGL 492
+ L G N ALLSV VGLP+ G E+ GV V + W Y++G+
Sbjct: 531 IKLHAGVNKLALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGV 590
Query: 493 IGEKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE L ++++ + V W+ S + + LTWYK+TF PAGN+P+AL++ +MGKG+ W+
Sbjct: 591 KGEALSLHTDTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWI 650
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
NG++IGR+W ++K ++G+ + YA N + C + YHVPR++LK + NL+
Sbjct: 651 NGRNIGRHWPAYK-AQGSCGRCNYAGTFNAKKCLSNCG-EASQRWYHVPRSWLK-SQNLI 707
Query: 609 VLLEEENGNPLGITV 623
V+ EE G+P GI++
Sbjct: 708 VVFEEWGGDPNGISL 722
>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
Length = 717
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 303/670 (45%), Positives = 391/670 (58%), Gaps = 54/670 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY FS R D++RF+K ++ GLYV LRIG
Sbjct: 54 MWPDLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFSDRYDLVRFVKLVKQAGLYVNLRIG 113
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WL V GI FR+DN P+K
Sbjct: 114 PYVCAEWNYGGFPVWLKYVPGISFRTDNGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPII 173
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E YV WAAKMAV + GVPW+MCKQDDAP PVIN CNG C
Sbjct: 174 LAQVENEYGPMESVMGSGAKSYVDWAAKMAVATNAGVPWIMCKQDDAPDPVINTCNGFYC 233
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PNS NKPS+WTE W+ ++ +GG R +D+AF VA FI K GS++NYYMYH
Sbjct: 234 -DDFT-PNSKNKPSMWTEAWSGWFTAFGGTVPQRPVEDLAFAVARFIQKGGSFINYYMYH 291
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA F+ T Y AP+DEYGL+R+PKWGHL LH AIK L+ G V
Sbjct: 292 GGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQPKWGHLTNLHKAIKQAEPALVAGDPTVQ 351
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
++G ++A+VF +SG CAAFL N A V F Y+LP SIS+LPDC+T +NT
Sbjct: 352 NIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARVAFNGRRYDLPAWSISVLPDCRTAVYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V+ S + + W+ Y EA + D T +GL++Q+S D SDY WY
Sbjct: 412 ATVTAA----SSPAKMNPAGGFTWQSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWY 467
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + +S + Q P L V S GH + FVNG+Y G+A+G +D T V +
Sbjct: 468 TTYVNIDSGEQFLKSGQWPQLTVYSAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMW 527
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGLP+ G E GV + + + W YQ+GL GEK
Sbjct: 528 QGSNKISILSSAVGLPNVGTHYETWNIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEK 587
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L ++S G + V W + + +TW++ F APAG P+AL+L SMGKG+AWVNG IG
Sbjct: 588 LGVHSVSGSSSVEWGGA-AGKQPVTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIG 646
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEEN 615
RYW S+K S GN YA A+ YHVPR++L P+GNL+VLLEE
Sbjct: 647 RYW-SYKAS-GNCGGCSYAGTYSEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFG 704
Query: 616 GNPLGITVDT 625
G+ G+T+ T
Sbjct: 705 GDLSGVTLMT 714
>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
Length = 767
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 308/778 (39%), Positives = 419/778 (53%), Gaps = 122/778 (15%)
Query: 24 NLHEPQKG-QYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGI 82
++H P+ +++F G D+++FIK I GLY LRIGPFIE+EW +GG P WL +V I
Sbjct: 52 SIHYPRSTPEFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDI 111
Query: 83 VFRSDNKPYK-------------------------------IENEYQTIEPAFHEKGPPY 111
+FRS N+P+K IENEY +I+ A+ E G Y
Sbjct: 112 IFRSYNEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYKELGVQY 171
Query: 112 VLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTS 171
V WA KMAV GVPW+MCKQ DAP PVIN CNG CG+TF GPN PNKPS+WTE+WT+
Sbjct: 172 VQWAGKMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTA 231
Query: 172 FYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAP 231
Y+V+G P R+A+D+AF VA FI+KNG+ NYYMYHGGTNFGRT ++F+ T YYD+AP
Sbjct: 232 QYRVFGDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRTGSSFVTTRYYDEAP 291
Query: 232 LDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET-SGVCAAFL 290
LDEYGL REPKWGHLK+LH+A++LC + L TG+ V LG+ +E +E+ + +CAAFL
Sbjct: 292 LDEYGLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFL 351
Query: 291 VNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDE 350
NN R+A T+ FR Y LP SISILPDCKTV +NT+RV Q+N R+ + + +
Sbjct: 352 TNNHSREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARNFVKSKIANKNL 411
Query: 351 KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAP-------- 402
KWE +E I + + + ++ KD SDY W+ SN P
Sbjct: 412 KWEMSQEPIPVMTDMKILTKSPMELYXFLKDRSDYAWFVTSIEL--SNYDLPMKKDIIPV 469
Query: 403 LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGA 462
L + + GH + AFVNG + GSAHGS+ +F R V QG N +V DSG
Sbjct: 470 LQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKF-QGRNKLHCPAVY----DSG- 523
Query: 463 FLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT 517
G+H V++ + TN WG QVG+ GE ++ Y+ G ++V W++ +
Sbjct: 524 -----TTGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQGGSHRVQWTAAKGKG 578
Query: 518 RQLTWYKTTFRAPAGNDPIALNLQSMGKG--------EAWVNGQSIGRYWVSFKTSKGNP 569
+TWYKT F P GNDP+ L + SM KG AW+ V F+ + GNP
Sbjct: 579 PAMTWYKTYFDMPEGNDPVILRMTSMAKGNGLEYHVPRAWLKPSD--NLLVIFEETGGNP 636
Query: 570 SQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIR 629
+ + + +I C+I+
Sbjct: 637 EEIEXELVNRDTI--CSIV----------------------------------------- 653
Query: 630 KVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDG 689
T H P + SW RH + + + KP CP K I K+ FASFGNP G
Sbjct: 654 ------TEYHPPHVKSWQRHDSKIRAVVDEV--KPKGHLKCPNYKVIVKVDFASFGNPLG 705
Query: 690 DCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD--PCPGIHKALLVDAQC 745
C + +G+C + +S+ VVE+ C GK+ C IP+ + F G+ C I K L V +C
Sbjct: 706 ACGDFEMGNCTAPNSKKVVEQHCXGKTTCEIPMEAGIFXGNSGACSDITKTLAVQVRC 763
>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 826
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 307/798 (38%), Positives = 438/798 (54%), Gaps = 84/798 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEP + +YDFSG + I++ + IQ GLYV +RIG
Sbjct: 57 MWPDLIQKAKDGGLDAIETYIFWDRHEPHRRKYDFSGHLNFIKYFQLIQEAGLYVVMRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R++N+ YK
Sbjct: 117 PYVCAEWNYGGFPLWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 176
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + E G Y+ W A+MA + G+PW+MC+Q DAP P+IN CNG C
Sbjct: 177 LAQIENEYGNVMTPYGEAGKTYINWCAQMAESLNIGIPWIMCQQSDAPQPIINTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN+PN P ++TE+W +++ WG K R+A+D+AF VA F G NYYMYH
Sbjct: 237 -DNFT-PNNPNSPKMFTENWVGWFKKWGDKDPHRTAEDVAFSVARFFQSGGILNNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD APLDEYG + +PKWGHLK+LHA+IKL + L T++
Sbjct: 295 GGTNFGRTSGGPFITTSYDYDAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSDQ 354
Query: 269 SLGQLQEAFVFEE-TSGVCAAFLVNNDERK-AVTVLFRNISYELPRKSISILPDCKTVAF 326
G F +G FL N DE A+ + + Y LP S+SIL C F
Sbjct: 355 DFGSSVTFTKFSNLETGEKFCFLSNADENNDAIVDMLGDRKYFLPAWSVSILDGCNKEIF 414
Query: 327 NTERVSTQ----YNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
NT +VS+Q + K+++ N K + E R+ + + +A LL+Q A D+
Sbjct: 415 NTAKVSSQTSLFFKKQNEKENAKLSWNWASEPMRDTLQGYGT--FKANLLLEQKGATIDS 472
Query: 383 SDYFWYTFRFHYNSSNA--QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
SDY WY + N++++ L V + GH+LHAF+N Y GS GS+ SF +
Sbjct: 473 SDYLWYMTNVNSNTTSSLQNLTLQVNTKGHVLHAFINRRYIGSQWGSNGQ-SFVFEKPIQ 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHR---VRVQDKSFT----NCSWGYQVGLI 493
L+ GTN LLS TVGL + AF + G+ + D + T + W Y+VGL
Sbjct: 532 LKLGTNTITLLSATVGLKNYDAFYDTVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLN 591
Query: 494 GEKLQIYSNLGLNKVLWSSI--RSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE+ Q+Y+ + N+ WS++ +S R++TW+K TF+ P+G DP+ L++Q MGKG+AWVN
Sbjct: 592 GERKQLYNPMFSNRTKWSTLNKKSIGRRMTWFKATFKTPSGTDPVVLDMQGMGKGQAWVN 651
Query: 552 GQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
G+SIGR+W SF S + S+T + + N + C + YH+PR+F+ + N L
Sbjct: 652 GRSIGRFWPSFIASNDSCSETCDYKGSYNPNKCVRNCG-NSSQRWYHIPRSFMNDSINTL 710
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
+L EE GNP ++V TI I +CG+ + T++
Sbjct: 711 ILFEEIGGNPQMVSVQTITIGTICGNAN-------------------------EGSTLEL 745
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQG-VVERACIGKSRCSIPLLSRYF 727
SC G IS+I FAS+G+P+G C + G + S +VE+ACIG CSI + F
Sbjct: 746 SCQGGHVISEIQFASYGHPEGKCGSFQSGLWDVTKSTTIIVEKACIGMKNCSIDISPNLF 805
Query: 728 GGDPCPGIHKALLVDAQC 745
+ L V A C
Sbjct: 806 KLSKVAYPYAKLAVQALC 823
>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
Length = 827
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 321/810 (39%), Positives = 432/810 (53%), Gaps = 102/810 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QYDFSG D+IRFIK IQ +GLY LRIG
Sbjct: 55 MWPDLIRKAKEGGLDAIETYVFWNAHEPARRQYDFSGHLDLIRFIKTIQDEGLYAVLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIV-FRSDNKPY---------------------------- 91
P++ +EW YGG P+WLH++ G+ FR+ N+ +
Sbjct: 115 PYVCAEWNYGGFPVWLHNMPGVQEFRTVNEVFMNEMQNFTTLIVDMVKQEKLFASQGGPI 174
Query: 92 ---KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
+IENEY + + + G Y+ W AKMA GVPW+MC++ DAP P+IN CNG
Sbjct: 175 IIAQIENEYGNMISNYGDAGKVYIDWCAKMAESLDIGVPWIMCQESDAPQPMINTCNGWY 234
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C ++F PN PN P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMY
Sbjct: 235 C-DSFT-PNDPNSPKMWTENWTGWFKSWGGKDPHRTAEDLAFSVARFFQTGGTFQNYYMY 292
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNFGRT+ +T YD APLDE+G + +PKWGHLKELH +K + L G +
Sbjct: 293 HGGTNFGRTSGGPYLTTSYDYDAPLDEFGNLNQPKWGHLKELHTVLKAMEKTLTHGNVST 352
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
G A V+ G + F N + T+ F+ Y +P S+SILPDCKT A+N
Sbjct: 353 TDFGNSVTATVYATEEG-SSCFFGNANTTGDATITFQGSDYVVPAWSVSILPDCKTEAYN 411
Query: 328 TERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILNFDNTLLRAEG------LLDQISAA 379
T +V+TQ + K N + S KW EAI D +++ +G L+DQ
Sbjct: 412 TAKVNTQTSVIVKKPNQAENEPSSLKWVWRPEAI---DEPVVQGKGSFSASFLIDQ-KVI 467
Query: 380 KDASDYFWYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTL 435
DASDY WY + L V + G +LHAFVNGE+ GS +
Sbjct: 468 NDASDYLWYMTSVDLKPDDIIWSDNMTLRVNTTGIVLHAFVNGEHVGSQWTKYGVFKDVF 527
Query: 436 RNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV-----------HRVRVQDKSFTNC 484
+ V L G N +LLSVTVGL + G + AG+ ++D S C
Sbjct: 528 QQQVKLNPGKNQISLLSVTVGLQNYGPMFDMVQAGITGPVELIGQKGDETVIKDLS---C 584
Query: 485 -SWGYQVGLIG-EKLQIYSNLGLNKVL-WSSIRSPTR-QLTWYKTTFRAPAGNDPIALNL 540
W Y+VGL G E + YS N+ WS+ P+ ++TWYKTTF+AP GNDP+ L+L
Sbjct: 585 HKWTYEVGLTGLEDNKFYSKASTNETCGWSAENVPSNSKMTWYKTTFKAPLGNDPVVLDL 644
Query: 541 QSMGKGEAWVNGQSIGRYWVSFKTS----KGNPSQTQYAVNTVTSIHFCAIIKATNTYHV 596
Q MGKG AWVNG ++GRYW S+ +P + + + C + YHV
Sbjct: 645 QGMGKGFAWVNGYNLGRYWPSYLAEADGCSSDPCDYRGQYDNNKCVTNCG-QPSQRWYHV 703
Query: 597 PRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTD 656
PR+FL+ N LVL EE GNP + T+ + VCG N+H
Sbjct: 704 PRSFLQDGENTLVLFEEFGGNPWQVNFQTLVVGSVCG---NAH----------------- 743
Query: 657 IKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHS-QGVVERACIGK 715
+K T++ SC G+ IS I FASFG+P G C + G+C + V+++ C+GK
Sbjct: 744 -----EKKTLELSCN-GRPISAIKFASFGDPQGTCGSFQAGTCQTEQDILPVLQQECVGK 797
Query: 716 SRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
CSI + G C + K L V+A C
Sbjct: 798 ETCSIDISEDKLGKTNCGSVVKKLAVEAVC 827
>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
Length = 828
Score = 538 bits (1387), Expect = e-150, Method: Compositional matrix adjust.
Identities = 312/806 (38%), Positives = 432/806 (53%), Gaps = 95/806 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI+KAK+GGLD I+TYVFWN HEP + QYDFSG D++RFIK IQS GLY LRIG
Sbjct: 57 MWPDLISKAKDGGLDTIETYVFWNAHEPSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ + FR+ N +
Sbjct: 117 PYVCAEWNYGGFPVWLHNMPDMKFRTINPGFMNEMQNFTTKIVNMMKEESLFASQGGPII 176
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ +G Y+ W A MA GVPW+MC+Q AP P+I CNG C
Sbjct: 177 LAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWIMCQQPHAPQPMIETCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ +K P++P+ P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMYH
Sbjct: 237 -DQYK-PSNPSSPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR A IT YD APLDEYG + +PKWGHLK+LH +K +PL G + I
Sbjct: 295 GGTNFGRVAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHTLLKSMEKPLTYGNISTI 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG A V+ T+ + F+ N + V F+ Y +P S+S+LPDC A+NT
Sbjct: 355 DLGNSVTATVY-STNEKSSCFIGNVNATADALVNFKGKDYNVPAWSVSVLPDCDKEAYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLR------AEGLLDQISAAKDA 382
RV+TQ + ++ S D EK + T+L+ A+GL+DQ DA
Sbjct: 414 ARVNTQTSIITEDS---CDEPEKLKWTWRPEFTTQKTILKGSGDLIAKGLVDQKDVTNDA 470
Query: 383 SDYFWYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
SDY WY R H + + L V S+ H+LHA+VNG+Y G+ + +
Sbjct: 471 SDYLWYMTRVHLDKKDPIWSRNMSLRVHSNAHVLHAYVNGKYVGNQIVRDNKFDYRFEKK 530
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLER---------KVAGVHRVRVQDKSFTNCSWGYQ 489
V+L GTN ALLSV+VGL + G F E K+ G +K + W Y+
Sbjct: 531 VNLVHGTNHLALLSVSVGLQNYGPFFESGPTGINGPVKLVGYKGDETIEKDLSKHQWDYK 590
Query: 490 VGLIGEKLQIYS--NLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
+GL G +++S + G + WS+ + P R L+WYK F+AP G DP+ ++L +GKG
Sbjct: 591 IGLNGFNHKLFSMKSAGHHHRKWSTEKLPADRMLSWYKANFKAPLGKDPVIVDLNGLGKG 650
Query: 547 EAWVNGQSIGRYWVSFKTS-KGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLK 602
E W+NGQSIGRYW SF +S +G + Y + CA + T YHVPR+FL
Sbjct: 651 EVWINGQSIGRYWPSFNSSDEGCTEECDYRGEYGSDK--CAFMCGKPTQRWYHVPRSFLN 708
Query: 603 PTG-NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFG 661
G N + L EE G+P + T+ +VC K
Sbjct: 709 DKGHNTITLFEEMGGDPSMVKFKTVVTGRVCA-------------------------KAH 743
Query: 662 KKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ-GVVERACIGKSRCSI 720
+ V+ SC + IS + FASFGNP G C +A GSC + VV + C+GK C++
Sbjct: 744 EHNKVELSCN-NRPISAVKFASFGNPSGQCGSFAAGSCEGAKDAVKVVAKECVGKLNCTM 802
Query: 721 PLLSRYFGGD-PCPGIHKALLVDAQC 745
+ S FG + C K L V+ +C
Sbjct: 803 NVSSHKFGSNLDCGDSPKRLFVEVEC 828
>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
distachyon]
Length = 721
Score = 537 bits (1383), Expect = e-150, Method: Compositional matrix adjust.
Identities = 288/669 (43%), Positives = 388/669 (57%), Gaps = 50/669 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY F R D++RF+K + GLYV LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E PY WAAKMAV GVPWVMCKQDDAP PVIN CNG C
Sbjct: 176 LAQVENEYGPMESVMGGGAKPYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP++WTE W+ ++ +GG R +D+AF VA F+ K GS+VNYYMYH
Sbjct: 236 --DYFTPNSNGKPNMWTEAWSGWFTAFGGAVPHRPVEDLAFAVARFVQKGGSFVNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA F+ T Y AP+DEYGL+R+PKWGHL++LH AIK +++G +
Sbjct: 294 GGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQ 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G ++A+VF+ ++G CAAFL N V++ YELP SISILPDCKT +NT
Sbjct: 354 SIGNYEKAYVFKSSTGACAAFLSNYHTSSPAKVVYNGRRYELPAWSISILPDCKTAVYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V ++ ++ N W+ Y E + D++ +GL++Q+S D SD+ WY
Sbjct: 414 ATVRQKWKEKKLWMNPA--GGFSWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWY 471
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + +SS + Q P L + S GH L FVNG+ G+ +G +D+ + V +
Sbjct: 472 TTYVNIDSSEQFLKSGQWPQLTINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMW 531
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGL + G E GV + + +N W YQ+GL GE
Sbjct: 532 QGSNKISILSSAVGLANQGTHYENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGES 591
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L ++S G + V W S + LTW+K F APAG P+AL++ SMGKG+ WVNG++ G
Sbjct: 592 LGVHSITGSSSVEWGSANG-AQPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAG 650
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW S+K S S + + T + YHVPR++L P+GNLLV+LEE G
Sbjct: 651 RYW-SYKASGSCGSCSYTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGG 709
Query: 617 NPLGITVDT 625
+ G+ + T
Sbjct: 710 DLSGVKLMT 718
>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags:
Precursor
gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
Length = 741
Score = 536 bits (1382), Expect = e-149, Method: Compositional matrix adjust.
Identities = 289/682 (42%), Positives = 404/682 (59%), Gaps = 56/682 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSL+ AKEGG + I++YVFWN HEP G+Y F GR +I++FIK +Q G+++ LRIG
Sbjct: 62 MWPSLVQTAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW YGG+P+WLH V G VFR+DN+P+K
Sbjct: 122 PFVAAEWNYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E + E G Y W+A MAV + GVPW+MC+Q DAP VI+ CNG C
Sbjct: 182 LSQVENEYGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+P+KP IWTE+W +++ +GG+ R A+D+A+ VA F K GS NYYMYH
Sbjct: 242 DQF--TPNTPDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD +AP+DEYGL R PKWGHLK+LH AI L L++G
Sbjct: 300 GGTNFGRTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNF 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG EA V+ ++SG CAAFL N D++ V+FRN SY LP S+SILPDCKT FNT
Sbjct: 360 TLGHSLEADVYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNT 419
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+++ +K +LK S KWE + E + L+D I+ KD +DY W
Sbjct: 420 AKVTSKSSKVEMLPEDLKSSSGLKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLW 479
Query: 388 YTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT + + A +P L ++S GH LH F+N EY G+A G+ +V F L+ V L
Sbjct: 480 YTTSITVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVAL 539
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEK 496
+ G N+ LLS+TVGL ++G+F E AG+ V ++ + TN W Y++G+ GE
Sbjct: 540 KAGENNIDLLSMTVGLANAGSFYEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEH 599
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L+++ V W+ P ++ LTWYK P+G++P+ L++ SMGKG AW+NG+
Sbjct: 600 LELFKPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEE 659
Query: 555 IGRYW--VSFKTSKGNP--SQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
IGRYW ++ K S + + Y + + + YHVPR++ K +GN LV
Sbjct: 660 IGRYWPRIARKNSPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELV 719
Query: 610 LLEEENGNPLGITVDTIAIRKV 631
+ EE+ GNP+ I ++ RKV
Sbjct: 720 IFEEKGGNPMKI---KLSKRKV 738
>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
Length = 731
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 298/672 (44%), Positives = 393/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F R D+++FIK +Q GL+V LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 176 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT +Y +GG R A+D+AF VA FI GS++NYYMYH
Sbjct: 236 -ENFK-PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL REPKWGHL++LH AIK C L++ +V
Sbjct: 294 GGTNFGRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ S CAAFL N D + +V V F Y+LP SISILPDCKT +NT
Sbjct: 354 KLGSNQEAHVFKSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q S+ S W+ + E + + +GL +QI+ +D +DY W
Sbjct: 413 AKVGSQ---SSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S N ++P L + S GH L+ F+NG+ +G+ +GS +N + V+L
Sbjct: 470 YMTDITIGSDEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E AGV + + W Y+ GL GE
Sbjct: 530 RSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGE 589
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S ++ LTWYK TF AP G+ P+AL++ SMGKG+ W+NGQ
Sbjct: 590 ALGLHTVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQ 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
S+GR+W + ++G+ YA + C + YH+PR++L PTGNLLV+
Sbjct: 650 SVGRHWPGY-IARGSCGDCSYAGTYDDKKCRTHCG-EPSQRWYHIPRSWLTPTGNLLVVF 707
Query: 612 EEENGNPLGITV 623
EE G+P GI++
Sbjct: 708 EEWGGDPSGISL 719
>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
Length = 754
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 294/659 (44%), Positives = 380/659 (57%), Gaps = 50/659 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY FS R D++RF+K ++ GLYV LRIG
Sbjct: 68 MWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN P+K
Sbjct: 128 PYVCAEWNFGGFPVWLKYVPGVSFRTDNGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENE+ +E PY WAAKMAV +TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 188 MSQVENEFGPMESVGGSGAKPYANWAAKMAVGTNTGVPWVMCKQDDAPDPVINTCNGFYC 247
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KPS+WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 248 --DYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVARFIQKGGSFVNYYMYH 305
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y AP+DE+GL+R+PKWGHL++LH AIK L++ +
Sbjct: 306 GGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRAIKQAEPVLVSADPTIE 365
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G ++A+VF+ +G CAAFL N AV V F Y LP SISILPDCKT FNT
Sbjct: 366 SIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPAWSISILPDCKTAVFNT 425
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V ++F W+ Y E + ++ +GL++Q+S D SDY WY
Sbjct: 426 ATVKEPTLMPKMNPVVRF----AWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWY 481
Query: 389 TFRFHYNSSN---AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
T + +++ Q+P L V S GH + FVNG+ GS +G +DN T V + QG
Sbjct: 482 TTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQG 541
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQ 498
+N ++LS VGLP+ G E GV + K ++ W YQVGL GE L
Sbjct: 542 SNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLG 601
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
+ + G + V W + LTW+K F APAGNDP+AL++ SMGKG+ WVNG +GRY
Sbjct: 602 LQTVTGSSAVEWGGPGG-YQPLTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRY 660
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGN 617
W S+K S G + + YHVPR++LKP GNLLV+LEE N
Sbjct: 661 W-SYKASGGCGGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGAN 718
>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
distachyon]
Length = 719
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 288/669 (43%), Positives = 387/669 (57%), Gaps = 52/669 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY F R D++RF+K + GLYV LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E PY WAAKMAV GVPWVMCKQDDAP PVIN CNG C
Sbjct: 176 LAQVENEYGPMESVMGGGAKPYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP++WTE W+ ++ +GG R +D+AF VA F+ K GS+VNYYMYH
Sbjct: 236 --DYFTPNSNGKPNMWTEAWSGWFTAFGGAVPHRPVEDLAFAVARFVQKGGSFVNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA F+ T Y AP+DEYGL+R+PKWGHL++LH AIK +++G +
Sbjct: 294 GGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQ 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
S+G ++A+VF+ ++G CAAFL N V++ YELP SISILPDCKT +NT
Sbjct: 354 SIGNYEKAYVFKSSTGACAAFLSNYHTSSPAKVVYNGRRYELPAWSISILPDCKTAVYNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V + S + + W+ Y E + D++ +GL++Q+S D SD+ WY
Sbjct: 414 ATV----KEPSAPAKMNPAGGFSWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWY 469
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + +SS + Q P L + S GH L FVNG+ G+ +G +D+ + V +
Sbjct: 470 TTYVNIDSSEQFLKSGQWPQLTINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMW 529
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS VGL + G E GV + + +N W YQ+GL GE
Sbjct: 530 QGSNKISILSSAVGLANQGTHYENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGES 589
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L ++S G + V W S + LTW+K F APAG P+AL++ SMGKG+ WVNG++ G
Sbjct: 590 LGVHSITGSSSVEWGSANG-AQPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAG 648
Query: 557 RYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENG 616
RYW S+K S S + + T + YHVPR++L P+GNLLV+LEE G
Sbjct: 649 RYW-SYKASGSCGSCSYTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGG 707
Query: 617 NPLGITVDT 625
+ G+ + T
Sbjct: 708 DLSGVKLMT 716
>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 299/675 (44%), Positives = 388/675 (57%), Gaps = 63/675 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G Y F R D+++F K + GLY+ LRIG
Sbjct: 59 MWPDLIKKAKEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GIVFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGIVFRTDNEPFKIAMQRFTKKIVDMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W A+MA+ TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 179 LSQIENEYGPMEWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT ++ +GG R +DIAF VA FI GS++NYYMY+
Sbjct: 239 -EGFK-PNSDNKPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFLNYYMYY 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y APLDEYGL+REPK+ HLKELH IKLC L++ + S
Sbjct: 297 GGTNFDRTAGVFIATSYDYDAPLDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QE VF+ + CAAFL N D A ++FR Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEVHVFKSKTS-CAAFLSNYDTSSAARIMFRGFPYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 --RVSTQYNKRSKTSNLKFDSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASDY 385
R T K TS KF WE Y E N D T ++ +GL++QIS +D +DY
Sbjct: 416 KIRAPTILMKMVPTST-KFS----WESYNEGSPSSNDDGTFVK-DGLVEQISMTRDKTDY 469
Query: 386 FWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
FWY S + L + S GH LH FVNG G+++G+ N T +
Sbjct: 470 FWYLTDITIGSDESFLKTGDDPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQKI 529
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLI 493
L G N ALLS VGLP++G E GV V + W Y++G+
Sbjct: 530 KLSVGINKLALLSTAVGLPNAGVHYETWNTGVLGPVTLKGVNSGTWDMSKWKWSYKIGIR 589
Query: 494 GEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE + ++ G + V W S LTWYK++F P GN+P+AL++ +MGKG+ WV
Sbjct: 590 GEAMSFHTIAGSSAVKWWIKGSFVVKKEPLTWYKSSFDTPKGNEPLALDMNTMGKGQVWV 649
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
NG +IGR+W ++ T++GN + YA N + C + YHVPR++LKP GNLL
Sbjct: 650 NGHNIGRHWPAY-TARGNCGRCNYAGIYNEKKCLSHCG-EPSQRWYHVPRSWLKPFGNLL 707
Query: 609 VLLEEENGNPLGITV 623
V+ EE G+P GI++
Sbjct: 708 VIFEEWGGDPSGISL 722
>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
Length = 722
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 284/670 (42%), Positives = 392/670 (58%), Gaps = 50/670 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI+ AK GG+DVI+TYVFW+ H+P + Y+F GR D++ F+K + GLY LRIG
Sbjct: 54 MWSQLISNAKAGGIDVIETYVFWDGHQPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIG 113
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW GG P+WL DV GI FR++N+P+K
Sbjct: 114 PYVCAEWNLGGFPVWLKDVPGIEFRTNNQPFKAEMQAFVEKIVAMMKHDKLFAPQGGPII 173
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MA TGVPW+MC+Q DAP +++ CNG C
Sbjct: 174 LAQIENEYGNIDAAYGAAGKEYMEWAANMAQGLGTGVPWIMCQQSDAPDYILDTCNGFYC 233
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
PN+ KP +WTE+W+ ++Q WG R +D+AF VA F + GS+ NYYMY
Sbjct: 234 DAW--APNNKKKPKMWTENWSGWFQKWGEASPHRPVEDVAFAVARFFQRGGSFQNYYMYF 291
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR++ +T YD AP+DE+G++R+PKWGHLK+LHAAIKLC L + I
Sbjct: 292 GGTNFGRSSGGPYVTTSYDYDAPIDEFGVIRQPKWGHLKQLHAAIKLCEAALGSNDPTYI 351
Query: 269 SLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
SLGQLQEA V+ T SG CAAFL N D TV F + +Y LP S+SILPDCKTV+ N
Sbjct: 352 SLGQLQEAHVYGSTSSGACAAFLANIDSSSDATVKFNSRTYLLPAWSVSILPDCKTVSHN 411
Query: 328 TERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
T +V Q + ++ + WE Y E + + ++ + A LL+QI+ KD SDY W
Sbjct: 412 TAKVHVQTAMPTMKPSI---TGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLW 468
Query: 388 YTFRFHYNSSNA---QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
YT + ++A +A L ++S ++H FVNG+ GSA + + + L G
Sbjct: 469 YTTSLDISQADAASGKALLSLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASG 528
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDK------SFTNCSWGYQVGLIGEKLQ 498
N A+L TVGL + G F+E AG++ + T W +QVGL GE L
Sbjct: 529 HNSLAILCATVGLQNYGPFIETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLA 588
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
I++ G +V WSS + L WYK F +P+GNDP+AL+L+SMGKG+AW+NGQSIGR+
Sbjct: 589 IFTESGSQRVRWSSAVPQGQALVWYKAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRF 648
Query: 559 WVSFKT--SKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEEN 615
W S + + G P Y + +S + + YHVPR++L+ +GNL+VL EEE
Sbjct: 649 WPSLRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEG 708
Query: 616 GNPLGITVDT 625
G P G++ T
Sbjct: 709 GKPSGVSFVT 718
>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
Length = 725
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 300/667 (44%), Positives = 386/667 (57%), Gaps = 57/667 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK GGLDVIQTYVFWN HEP G+Y F R D+++FIK +Q GL+V LRIG
Sbjct: 56 MWPDLIQKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG PIWL V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTEGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV +TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 176 LSQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGYYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT +Y +GG R +D+AF VA FI GS+ NYYMYH
Sbjct: 236 -ENFK-PNKVYKPKMWTEVWTGWYTEFGGAIPTRPVEDLAFSVARFIQSGGSFFNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL+++PKWGHLK+LH AIK C L+ +V
Sbjct: 294 GGTNFGRTAGGPFMATSYDYDAPLDEYGLLQQPKWGHLKDLHKAIKSCEYALVAVDPSVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF SG CAAFL N D + V V F Y+LP SISILPDCKT FNT
Sbjct: 354 KLGNNQEAHVFNTKSG-CAAFLANYDTKYPVRVSFGQGQYDLPPWSISILPDCKTAVFNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+V+ K S+ S W+ + E D + +GL +QI +DA+DY W
Sbjct: 413 AKVTW---KTSQVQMKPVYSRLPWQSFIEETTTSDESGTTTLDGLYEQIYMTRDATDYLW 469
Query: 388 YTFRFHYNS-----SNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S +N + P L + S H LH F+NG+ +G+ +GS +N T V L
Sbjct: 470 YMTDITIGSDEAFLNNGKFPLLTIFSACHALHVFINGQLSGTVYGSLENPKLTFSQNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E AGV + + W Y++G+ GE
Sbjct: 530 RPGINKLALLSISVGLPNVGTHFETWNAGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGE 589
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W+ S ++ LTWYK TF AP G+ P+AL++ SMGKG+ W+NGQ
Sbjct: 590 ALGLHTVTGSSSVDWAEGPSMAKKQPLTWYKATFNAPPGHAPLALDMGSMGKGQIWINGQ 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYAVNTVTSI--HFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
S+GR+W + ++G+ YA +C + YH+PR++L PTGNLLV+
Sbjct: 650 SVGRHWPGY-IAQGSCGTCNYAGTFYDKKCRTYCG-KPSQRWYHIPRSWLTPTGNLLVVF 707
Query: 612 EEENGNP 618
EE G+P
Sbjct: 708 EEWGGDP 714
>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
Length = 730
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 301/670 (44%), Positives = 388/670 (57%), Gaps = 55/670 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFWN HEP G+Y F R D++ FIK +Q GL+V LRIG
Sbjct: 65 MWPDLIQKAKDGGLDVIETYVFWNGHEPSPGKYYFEDRFDLVGFIKLVQQAGLFVHLRIG 124
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 125 PFICAEWNFGGFPVWLKYVPGIAFRTDNEPFKEAMQKFTEKIVNIMKAEKLFQSQGGPII 184
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 185 LSQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPIIDTCNGFYC 244
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE+WT +Y +GG R A+DIAF VA FI GS NYYMYH
Sbjct: 245 -ENFT-PNKNYKPKLWTENWTGWYTAFGGATPYRPAEDIAFSVARFIQNRGSLFNYYMYH 302
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ + YD AP+DEYGL+ EPKWGHL+ELH AIK C L++ V
Sbjct: 303 GGTNFGRTSNGLFVATSYDYDAPIDEYGLLNEPKWGHLRELHRAIKQCESALVSVDPTVS 362
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G+ E ++ +T CAAFL N + + V F N Y+LP SISILPDCKT FNT
Sbjct: 363 WPGKNLEVHLY-KTESACAAFLANYNTDYSTQVKFGNGQYDLPPWSISILPDCKTEVFNT 421
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V++ R T +S W+ Y E + +N + L +Q+ +D+SDY W
Sbjct: 422 AKVNSPRLHRKMT---PVNSAFAWQSYNEEPASSSENDPVTGYALWEQVGVTRDSSDYLW 478
Query: 388 YTFRFHYNSSNAQ----APLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
Y + ++ + L S GH+L+ F+NG+Y G+A+GS D+ T +V+LR
Sbjct: 479 YLTDVNIGPNDIKDGKWPVLTAMSAGHVLNVFINGQYAGTAYGSLDDPRLTFSQSVNLRV 538
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKL 497
G N +LLSV+VGL + G E GV + + W Y++GL GE L
Sbjct: 539 GNNKISLLSVSVGLANVGTHFETWNTGVLGPVTLTGLSSGTWDLSKQKWSYKIGLKGESL 598
Query: 498 QIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
+++ G N V W S+ + + L WYKTTF APAGNDP+AL+L SMGKGE WVNGQSI
Sbjct: 599 SLHTEAGSNSVEWVQGSLVAKKQPLAWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGQSI 658
Query: 556 GRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAII--KATNTYHVPRAFLKPTGNLLVLLEE 613
GR+W K ++GN YA T T A + YHVPR++L+ GN LV+LEE
Sbjct: 659 GRHWPGNK-ARGNCGNCNYA-GTYTDTKCLANCGQPSQRWYHVPRSWLRSGGNYLVVLEE 716
Query: 614 ENGNPLGITV 623
G+P GI +
Sbjct: 717 WGGDPNGIAL 726
>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
Length = 724
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 297/672 (44%), Positives = 393/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F R D+++FIK +Q GL+V LRIG
Sbjct: 49 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIG 108
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 109 PYVCAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPII 168
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 169 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC 228
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT +Y +GG R A+D+AF VA FI GS++NYYMYH
Sbjct: 229 -ENFK-PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYH 286
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL REPKWGHL++LH AIK C L++ +V
Sbjct: 287 GGTNFGRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVT 346
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ S CAAFL N D + +V V F Y+LP SISILPDCKT +NT
Sbjct: 347 KLGSNQEAHVFKSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNT 405
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q S+ S W+ + E + + +GL +QI+ +D +DY W
Sbjct: 406 AKVGSQ---SSQVQMTPVHSGFPWQSFIEETTSSDETDTTYMDGLYEQINITRDTTDYLW 462
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S N ++P L + S GH L+ F+NG+ +G+ +GS +N + V+L
Sbjct: 463 YMTDITIGSDEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNL 522
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E AGV + + W Y+ GL GE
Sbjct: 523 RSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGE 582
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S ++ LTW+K TF AP G+ P+AL++ SMGKG+ W+NGQ
Sbjct: 583 ALGLHTVTGSSSVEWVEGPSMAKKQPLTWHKATFNAPPGDAPLALDMGSMGKGQIWINGQ 642
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
S+GR+W + ++G+ YA + C + YH+PR++L PTGNLLV+
Sbjct: 643 SVGRHWPGY-IARGSCGDCSYAGTYDDKKCRTHCG-EPSQRWYHIPRSWLTPTGNLLVVF 700
Query: 612 EEENGNPLGITV 623
EE G+P GI++
Sbjct: 701 EEWGGDPSGISL 712
>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 741
Score = 534 bits (1375), Expect = e-149, Method: Compositional matrix adjust.
Identities = 288/682 (42%), Positives = 403/682 (59%), Gaps = 56/682 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSL+ AKEGG + I++YVFWN HEP G+Y F GR +I++FIK +Q G+++ LRIG
Sbjct: 62 MWPSLVQTAKEGGCNAIESYVFWNGHEPSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW YGG+P+WLH V G VFR+DN+P+K
Sbjct: 122 PFVAAEWNYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKQEKLFAPQGGPII 181
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E + E G Y W+A MAV + GVPW+MC+Q DAP VI+ CNG C
Sbjct: 182 LSQVENEYGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYC 241
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+P+KP IWTE+W +++ +GG+ R A+D+A+ VA F K GS NYYMYH
Sbjct: 242 DQF--TPNTPDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYH 299
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD +AP+DEYGL R PKWGHLK+LH AI L L++G
Sbjct: 300 GGTNFGRTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLISGEHQNF 359
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG EA V+ ++SG CAAFL N D++ V+FRN SY LP S+SILPDCKT FNT
Sbjct: 360 TLGHSLEADVYTDSSGTCAAFLSNLDDKNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNT 419
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+++ +K +LK S KWE + E + L+D I+ KD +DY W
Sbjct: 420 AKVTSKSSKVEMLPEDLKSSSGLKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLW 479
Query: 388 YTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT + + A +P L ++S GH LH F+N EY G+A G+ +V F L+ V L
Sbjct: 480 YTTSITVSENEAFLKKGSSPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVAL 539
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEK 496
+ G + LLS+TVGL ++G+F E AG+ V ++ + TN W Y++G+ GE
Sbjct: 540 KAGETNIDLLSMTVGLANAGSFYEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEH 599
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L+++ V W+ P ++ LTWYK P+G++P+ L++ SMGKG AW+NG+
Sbjct: 600 LELFKPGNSGAVKWTVTTKPPKKQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEE 659
Query: 555 IGRYW--VSFKTSKGNP--SQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
IGRYW ++ K S + + Y + + + YHVPR++ K +GN LV
Sbjct: 660 IGRYWPRIARKNSPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELV 719
Query: 610 LLEEENGNPLGITVDTIAIRKV 631
+ EE+ GNP+ I ++ RKV
Sbjct: 720 IFEEKGGNPMKI---KLSKRKV 738
>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 725
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 299/667 (44%), Positives = 389/667 (58%), Gaps = 57/667 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK GGLDVIQTYVFWN HEP G+Y F R D+++FIK +Q GL+V LRIG
Sbjct: 56 MWPDLIQKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG PIWL V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 176 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT +Y +GG R A+D+AF VA FI GS+ NYYMYH
Sbjct: 236 -ENFK-PNKVYKPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL+++PKWGHL++LH AIK C L+ +V
Sbjct: 294 GGTNFGRTAGGPFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF SG CAAFL N+D + +V V F + Y+LP SISILPDCKT FNT
Sbjct: 354 KLGNNQEAHVFNSKSG-CAAFLANHDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+ K S+ S W+ + E + + +GL +QI +DA+DY W
Sbjct: 413 AKVAW---KASEVQMKPVYSRLPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S N + P L + S GH LH F+NG+ +G+ +GS +N T V L
Sbjct: 470 YMTDITIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E GV + + W Y++G+ GE
Sbjct: 530 RPGINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGE 589
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W+ S ++ LTWYK TF AP G+ P+AL++ SMGKG+ W+NGQ
Sbjct: 590 SLGLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQ 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
S+GR+W + ++G+ YA N +C + YH+PR++L PTGNLLV+
Sbjct: 650 SVGRHWPGY-IAQGSCGNCYYAGTFNDKKCRTYCG-KPSQRWYHIPRSWLTPTGNLLVVF 707
Query: 612 EEENGNP 618
EE G+P
Sbjct: 708 EEWGGDP 714
>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 803
Score = 533 bits (1373), Expect = e-148, Method: Compositional matrix adjust.
Identities = 307/799 (38%), Positives = 432/799 (54%), Gaps = 87/799 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEPQ+ +YDFSG + I+F + +Q GLY+ +RIG
Sbjct: 35 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIG 94
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R+DN+ YK
Sbjct: 95 PYVCAEWNYGGFPLWLHNMPGIQLRTDNQVYKNEMLTFTTKIVNMCKQANLFASQGGPII 154
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + G Y+ W A+MA + GVPW+MC+Q DAP P+IN CNG C
Sbjct: 155 LAQIENEYGNVMTPYGNAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYC 214
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN+P P ++TE+W +++ WG K RSA+D+AF VA F G + NYYMYH
Sbjct: 215 -DSFS-PNNPKSPKMFTENWVGWFKKWGDKDPYRSAEDVAFSVARFFQSGGVFNNYYMYH 272
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD APLDEYG + +PKWGHLK+LH++IKL + L GT +
Sbjct: 273 GGTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHSSIKLGEKILTNGTHSNK 332
Query: 269 SLGQLQEAFVFEE-TSGVCAAFLVNNDERKAVTV-LFRNISYELPRKSISILPDCKTVAF 326
+ G F T+ FL N D+ T+ L + Y +P S+SI+ CK F
Sbjct: 333 TFGSFVTLTKFSNPTTKERFCFLSNTDDTNDATIDLQADGKYFVPAWSVSIIDGCKKEVF 392
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEG------LLDQISAAK 380
NT ++++Q + K N K + W EA+ + L+ +G LL+Q
Sbjct: 393 NTAKINSQTSMFVKVQNEKENVKLSWVWAPEAM----SDTLQGKGTFKENLLLEQKGTTI 448
Query: 381 DASDYFWYTFRFHYN--SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
D+SDY WY N SS L V + GH+LHAFVN Y GS G++ SF
Sbjct: 449 DSSDYLWYMTNVETNGTSSIHNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQ-SFVFEKP 507
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV----QDKSFTNCS---WGYQVG 491
+ L+ GTN LLS TVGL + AF + G+ + TN S W Y+VG
Sbjct: 508 ILLKAGTNIITLLSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGNVTTNLSSNLWSYKVG 567
Query: 492 LIGEKLQIYSNLGLNKVLWSSI--RSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
L GE Q+Y+ + + W+++ S R++TWYKT+F+ P+G DP+ L++Q MGKGEAW
Sbjct: 568 LNGEIKQLYNPVFSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAW 627
Query: 550 VNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
+NGQSIGR+W SF N S+T + A + + C + YH+PR+FL N
Sbjct: 628 INGQSIGRFWPSFIAGNDNCSETCDYRGAYDPSKCVGNCG-NPSQRWYHIPRSFLSNNTN 686
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
LVL EE G+P ++V TI I +CG+ + T+
Sbjct: 687 TLVLFEEIGGSPQQVSVQTITIGTICGNAN-------------------------EGSTL 721
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
+ SC IS+I FAS+GNP G C + GS ++S ++E+ C CS+ + ++
Sbjct: 722 ELSCQGEYIISEIQFASYGNPKGKCGSFKQGSWDVTNSALLLEKTCKDMKSCSVDVSAKL 781
Query: 727 FGGDPCPGIHKALLVDAQC 745
FG + L+V A C
Sbjct: 782 FGLGDAVNLSARLVVQALC 800
>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
Length = 663
Score = 533 bits (1372), Expect = e-148, Method: Compositional matrix adjust.
Identities = 284/604 (47%), Positives = 362/604 (59%), Gaps = 54/604 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+G +DVIQTYVFWN HEP G+Y F R D++RFIK +Q GLYV LRIG
Sbjct: 64 MWPDLIQKAKDG-VDVIQTYVFWNGHEPSPGKYYFEDRYDLVRFIKLVQQAGLYVHLRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 123 PYVCAEWNFGGFPVWLKYVPGIEFRTDNEPFKAAMQKFTEKIVSMMKAEKLFETQGGPII 182
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 183 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQDDAPDPVINTCNGFYC 242
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN NKP +WTE+WT ++ +GG R A+D+AF VA FI GS+VNYYMYH
Sbjct: 243 -ENFV-PNQKNKPKMWTENWTGWFTAFGGPTPQRPAEDVAFSVARFIQNGGSFVNYYMYH 300
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL+REPKWGHL++LH AIKLC L++ V
Sbjct: 301 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKLCESALVSTDPTVT 360
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG QE VF SG CAAFL N D + V F+ + YELP SISILPDCKT FNT
Sbjct: 361 SLGNNQEVHVFNPKSGSCAAFLANYDTTSSAKVNFKIMQYELPPWSISILPDCKTAVFNT 420
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
R+ Q + + T F W+ Y E+ + D+ +GL +Q++ +DASDY W
Sbjct: 421 ARLGAQSSLKQMTPVSTF----SWQSYIEESASSSDDKTFTTDGLWEQLNVTRDASDYLW 476
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + +S+ N Q P L + S GH LH F+NG+ +G+ +G DN T V +
Sbjct: 477 YMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNVKM 536
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N +LLS++VGL + G E+ GV + + + W Y++GL GE
Sbjct: 537 RVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTRDLSKQQWSYKIGLKGE 596
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S + + LTWYKTTF APAGN+P+AL++ +MGKG W+N Q
Sbjct: 597 DLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWINSQ 656
Query: 554 SIGR 557
SIGR
Sbjct: 657 SIGR 660
>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 302/800 (37%), Positives = 421/800 (52%), Gaps = 127/800 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K KEGGLD I+TYVFWN HEP + QYDFSG D+IRF+K IQ +G+Y LRIG
Sbjct: 53 MWPDLIKKGKEGGLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQDEGMYGVLRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ G+ FR+ N +
Sbjct: 113 PYVCAEWNYGGFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPII 172
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ E G Y+ W A MA GVPW+MC+QDDAP P++N CNG C
Sbjct: 173 LAQIENEYGNVIGSYGEAGKAYIKWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN+PN P +WTE+WT +Y+ WGGK R+ +D+AF VA F + G++ NYYMYH
Sbjct: 233 -DNFT-PNNPNTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQRGGTFQNYYMYH 290
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA IT YD APLDE+G + +PK+GHLK+LH + + L G + +
Sbjct: 291 GGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTV 350
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G L A V++ G + F+ N +E + F+ Y++P S+SILPDCKT +NT
Sbjct: 351 DFGNLVTATVYKTEEG-SSCFIGNVNETSDAKINFQGTFYDVPAWSVSILPDCKTETYNT 409
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILNFDNTLLRAEG------LLDQISAAK 380
+++TQ + K +N + S KW E N DN LL+ +G L DQ +
Sbjct: 410 AKINTQTSVMVKKANEAENEPSTLKWSWRPE---NIDNVLLKGKGESTMRQLFDQKVVSN 466
Query: 381 DASDYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
D SDY WY + + L + S H+LHAFVNG++ G+ + +
Sbjct: 467 DESDYLWYMTTVNIKEQDPVWGKNMSLRINSTAHVLHAFVNGQHIGNYRAENGKFHYVFE 526
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV---------HRVRVQDKSFTNCSWG 487
G N LLS+TVGLP+ GAF E AG+ + K + W
Sbjct: 527 QDAKFNPGANVITLLSITVGLPNYGAFFENVPAGITGPVFIIGRNGDETIVKDLSTHKWS 586
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
Y+ GL G + Q++S+ SP +T+ AP G++P+ ++L +GKG
Sbjct: 587 YKTGLSGFENQLFSS-----------ESP--------STWSAPLGSEPVVVDLLGLGKGT 627
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG-N 606
AW+NG +IGRYW +F + I C+ YHVPR+FL G N
Sbjct: 628 AWINGNNIGRYWPAF----------------LADIDGCSA-----EYHVPRSFLNSDGDN 666
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
LVL EE GNP + TI + VC +V +K +
Sbjct: 667 TLVLFEEIGGNPSLVNFQTIGVGNVCANVY-------------------------EKNVL 701
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSH-SQGVVERACIGKSRCSIPLLSR 725
+ SC GK IS I FASFGNP G+C + G+C +S+ + ++ + C+GK +CSI + +
Sbjct: 702 ELSCN-GKPISSIKFASFGNPGGNCGSFEKGTCEASNDAAAILTQECVGKEKCSIDVSEK 760
Query: 726 YFGGDPCPGIHKALLVDAQC 745
FG C G+ K L V+A C
Sbjct: 761 KFGAADCGGLAKRLAVEAIC 780
>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
Length = 731
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 297/674 (44%), Positives = 393/674 (58%), Gaps = 61/674 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G Y F R D+++FIK +Q +GL+V LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 176 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT +Y +GG R A+D+AF VA FI GS++NYYMYH
Sbjct: 236 -ENFK-PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL REPKWGHL++LH AIK C L++ +V
Sbjct: 294 GGTNFGRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ S CAAFL N D + +V V F Y+LP SISILPDCKT +NT
Sbjct: 354 KLGSNQEAHVFKSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q S+ S W+ + E + + +GL +QI+ +D +DY W
Sbjct: 413 AKVGSQ---SSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S N ++P L + S GH L+ F+NG+ +G+ +GS +N + V+L
Sbjct: 470 YMTDITIGSDEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E AGV + + W Y+ GL GE
Sbjct: 530 RSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGE 589
Query: 496 KLQIYSNLGLNKVLWSSIRSPT----RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
L +++ G + V W + P+ + LTWYK TF AP G+ P+AL++ SMGKG+ W+N
Sbjct: 590 ALGLHTVTGSSSVEW--VEGPSMAEKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWIN 647
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLV 609
GQS+GR+W + ++G+ YA + C + YH+PR++L PTGNLLV
Sbjct: 648 GQSVGRHWPGY-IARGSCGDCSYAGTYDDKKCRTHCG-EPSQRWYHIPRSWLTPTGNLLV 705
Query: 610 LLEEENGNPLGITV 623
+ EE G+P I++
Sbjct: 706 VFEEWGGDPSRISL 719
>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
Length = 826
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 303/801 (37%), Positives = 425/801 (53%), Gaps = 88/801 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TYVFWN HEP++ +YDFSG D++RFIK IQ GLY LRIG
Sbjct: 58 MWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ + FR+ N +
Sbjct: 118 PYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVKMMKEEKLFASQGGPII 177
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ +G Y+ W A MA GVPW+MC+Q +AP P++ CNG C
Sbjct: 178 LAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P +P+ P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMYH
Sbjct: 238 DQY--EPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR A IT YD APLDE+G + +PKWGHLK+LH +K + L G + I
Sbjct: 296 GGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHTVLKSMEKSLTYGNISRI 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +A ++ G + F+ N + V F+ Y +P S+S+LPDC A+NT
Sbjct: 356 DLGNSIKATIYTTKEG-SSCFIGNVNATADALVNFKGKDYHVPAWSVSVLPDCDKEAYNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW--EEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V+TQ + ++ S+ + W E ++ IL L+ A+GL+DQ DASDY
Sbjct: 415 AKVNTQTSIMTEDSSKPERLEWTWRPESAQKMILKGSGDLI-AKGLVDQKDVTNDASDYL 473
Query: 387 WYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV-HL 441
WY R H + + L V S+ H+LHA+VNG+Y G+ + V HL
Sbjct: 474 WYMTRLHLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHL 533
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVH---------RVRVQDKSFTNCSWGYQVGL 492
GTN +LLSV+VGL + G F E G++ +K + W Y++GL
Sbjct: 534 VHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGL 593
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
G +++S + W++ + PT R LTWYK F+AP G +P+ ++L +GKGEAW+N
Sbjct: 594 NGYNDKLFSIKSVGHQKWANEKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWIN 653
Query: 552 GQSIGRYWVSFKTS-KGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTG-N 606
GQSIGRYW SF +S G + Y CA + T YHVPR+FL +G N
Sbjct: 654 GQSIGRYWPSFNSSDDGCKDECDY--RGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHN 711
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
+ L EE GNP + T+ + VC H V
Sbjct: 712 TITLFEEMGGNPSMVNFKTVVVGTVCARA-------------HEHN------------KV 746
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQG-VVERACIGKSRCSIPLLSR 725
+ SC + IS + FASFGNP G C +AVG+C V + C+GK C++ + S
Sbjct: 747 ELSCH-NRPISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKECVGKLNCTVNVSSD 805
Query: 726 YFGGD-PCPGIHKALLVDAQC 745
FG C K L V+ +C
Sbjct: 806 TFGSTLDCGDSPKKLAVELEC 826
>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 788
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 303/801 (37%), Positives = 425/801 (53%), Gaps = 88/801 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TYVFWN HEP++ +YDFSG D++RFIK IQ GLY LRIG
Sbjct: 20 MWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIG 79
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ + FR+ N +
Sbjct: 80 PYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVKMMKEEKLFASQGGPII 139
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ +G Y+ W A MA GVPW+MC+Q +AP P++ CNG C
Sbjct: 140 LAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYC 199
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P +P+ P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMYH
Sbjct: 200 DQY--EPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYH 257
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR A IT YD APLDE+G + +PKWGHLK+LH +K + L G + I
Sbjct: 258 GGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHTVLKSMEKSLTYGNISRI 317
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +A ++ G + F+ N + V F+ Y +P S+S+LPDC A+NT
Sbjct: 318 DLGNSIKATIYTTKEG-SSCFIGNVNATADALVNFKGKDYHVPAWSVSVLPDCDKEAYNT 376
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW--EEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V+TQ + ++ S+ + W E ++ IL L+ A+GL+DQ DASDY
Sbjct: 377 AKVNTQTSIMTEDSSKPERLEWTWRPESAQKMILKGSGDLI-AKGLVDQKDVTNDASDYL 435
Query: 387 WYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV-HL 441
WY R H + + L V S+ H+LHA+VNG+Y G+ + V HL
Sbjct: 436 WYMTRLHLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHL 495
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVH---------RVRVQDKSFTNCSWGYQVGL 492
GTN +LLSV+VGL + G F E G++ +K + W Y++GL
Sbjct: 496 VHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGL 555
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
G +++S + W++ + PT R LTWYK F+AP G +P+ ++L +GKGEAW+N
Sbjct: 556 NGYNDKLFSIKSVGHQKWANEKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWIN 615
Query: 552 GQSIGRYWVSFKTS-KGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTG-N 606
GQSIGRYW SF +S G + Y CA + T YHVPR+FL +G N
Sbjct: 616 GQSIGRYWPSFNSSDDGCKDECDY--RGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHN 673
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
+ L EE GNP + T+ + VC H V
Sbjct: 674 TITLFEEMGGNPSMVNFKTVVVGTVCARA-------------HEHN------------KV 708
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQG-VVERACIGKSRCSIPLLSR 725
+ SC + IS + FASFGNP G C +AVG+C V + C+GK C++ + S
Sbjct: 709 ELSCH-NRPISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKECVGKLNCTVNVSSD 767
Query: 726 YFGGD-PCPGIHKALLVDAQC 745
FG C K L V+ +C
Sbjct: 768 TFGSTLDCGDSPKKLAVELEC 788
>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
Length = 826
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 303/801 (37%), Positives = 425/801 (53%), Gaps = 88/801 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TYVFWN HEP++ +YDFSG D++RFIK IQ GLY LRIG
Sbjct: 58 MWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ + FR+ N +
Sbjct: 118 PYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVEMMKEEKLFASQGGPII 177
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ G Y+ W A MA GVPW+MC+Q +AP P++ CNG C
Sbjct: 178 LAQIENEYGNVISSYGAAGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P +P+ P +WTE+WT +++ WGGK R+A+D+AF VA F G++ NYYMYH
Sbjct: 238 DQY--EPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFFQTGGTFQNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR A IT YD AP+DE+G + +PKWGHLK+LH +K + L G + I
Sbjct: 296 GGTNFGRVAGGPYITTSYDYHAPIDEFGNLNQPKWGHLKQLHRVLKSMEKSLTYGNISRI 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG +A ++ G + F+ N + V F+ Y +P S+S+LP+C A+NT
Sbjct: 356 DLGNSIKATIYTTKEG-SSCFIGNVNATANALVNFKGKDYHVPAWSVSVLPECDKEAYNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKW--EEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V+TQ + ++ S+ + W E ++ IL L+ A+GL+DQ DASDY
Sbjct: 415 AKVNTQTSIMTEDSSKPEKLEWTWRPESAQKMILKSSGDLI-AKGLVDQKDVTNDASDYL 473
Query: 387 WYTFRFHYNSSNA----QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV-HL 441
WY R H + + L V S+ H+LHA+VNG+Y G+ + V HL
Sbjct: 474 WYMTRVHLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFEKKVNHL 533
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVH---------RVRVQDKSFTNCSWGYQVGL 492
GTN +LLSV+VGL + GAF E G++ +K + W Y++GL
Sbjct: 534 VHGTNHISLLSVSVGLQNYGAFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGL 593
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
G +++S + + W++ PT R LTWYK F+AP G +P+ ++ +GKGEAW+N
Sbjct: 594 NGYNNKLFSTKSVGHIKWANEMFPTSRMLTWYKAKFKAPLGKEPVIVDFNGLGKGEAWIN 653
Query: 552 GQSIGRYWVSFKTS-KGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTG-N 606
GQSIGRYW SF +S G + Y + CA + T YHVPR+FLK +G N
Sbjct: 654 GQSIGRYWPSFNSSDDGCKDECDYRGEYGSDK--CAFMCGEPTQRWYHVPRSFLKASGHN 711
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
+ L EE GNP + T+ + VC H V
Sbjct: 712 TITLFEEMGGNPSMVNFKTVVVGTVCARA-------------HEHN------------KV 746
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ-GVVERACIGKSRCSIPLLSR 725
+ SC IS + FASFGNP G C +AVG+C V + C+GK C+I + S
Sbjct: 747 ELSCH-NHPISAVKFASFGNPVGHCGTFAVGTCQGDKDAVKTVAKECVGKLNCTINVSSD 805
Query: 726 YFGGD-PCPGIHKALLVDAQC 745
FG C K L V+ +C
Sbjct: 806 TFGSTLDCGDSPKKLAVELEC 826
>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
Length = 740
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 283/682 (41%), Positives = 404/682 (59%), Gaps = 56/682 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSL+ AKEGG + I++YVFWN HEP +Y F GR +I++FIK +Q G+++ LRIG
Sbjct: 61 MWPSLVQTAKEGGCNAIESYVFWNGHEPSPRKYYFGGRYNIVKFIKIVQQAGMHMILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW YGG+P+WLH V G VFR+DN+P+K
Sbjct: 121 PFVAAEWNYGGVPVWLHYVPGTVFRADNEPWKHYMESFTTYIVNLLKKEKLFAPQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY E + E G Y W+A MAV + GVPW+MC+Q DAP VI+ CNG C
Sbjct: 181 LSQVENEYGYYEKDYGEGGKRYAQWSASMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYC 240
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+P+KP IWTE+W +++ +GG+ R A+D+A+ VA F K GS NYYMYH
Sbjct: 241 DQF--TPNTPDKPKIWTENWPGWFKTFGGRDPHRPAEDVAYSVARFFGKGGSVHNYYMYH 298
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD +AP+DEYGL R PKWGHLK+LH AI L L+ G
Sbjct: 299 GGTNFGRTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKDLHKAIMLSENLLINGEHQNF 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG EA V+ ++SG CAAFL N D++ TV+FRN SY LP S+SILPDCK FNT
Sbjct: 359 TLGHSLEADVYTDSSGTCAAFLSNLDDKNDKTVMFRNTSYHLPAWSVSILPDCKNEVFNT 418
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+++++K +L+ S KWE + E + L+D I+ KD +DY W
Sbjct: 419 AKVTSKFSKVEMLPEDLRSSSGLKWEVFSEKPGIWGEADFVKNELVDHINTTKDTTDYLW 478
Query: 388 YTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
YT +++ + L ++S GH LH F+N EY G+A G+ +V F L+ +V L
Sbjct: 479 YTTSITVSTNEEFLKKGSPPVLFIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKSVAL 538
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLIGEK 496
+ G N+ LLS+TVGL ++G+F E AG+ V ++ + TN W Y++G+ G
Sbjct: 539 KAGENNIDLLSMTVGLSNAGSFYEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVQGVH 598
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
L+++ V W+ P ++ LTWYK P+G++P+ L++ SMGKG AW+NG+
Sbjct: 599 LELFKPGDSGAVKWTVTTKPPKKQPLTWYKVVIDPPSGSEPVGLDMMSMGKGMAWLNGEE 658
Query: 555 IGRYW--VSFKTSKGNP--SQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLV 609
IGRYW ++ K++ + + Y + + + YHVPR++ K +GN LV
Sbjct: 659 IGRYWPRIARKSTPNDECVKECDYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELV 718
Query: 610 LLEEENGNPLGITVDTIAIRKV 631
+ EE+ G+P+ I T++ RKV
Sbjct: 719 IFEEKGGDPMKI---TLSKRKV 737
>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
Length = 727
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 291/675 (43%), Positives = 389/675 (57%), Gaps = 64/675 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G Y F R D+++F K + GLY+ LRIG
Sbjct: 59 MWPDLIKKAKEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+VFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ G Y W A+MA+ TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 179 LSQIENEYGPMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT ++ +GG R +DIAF VA FI GS++NYYMY+
Sbjct: 239 -EGFK-PNSDNKPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYY 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y AP+DEYGL+REPK+ HLKELH IKLC L++ + S
Sbjct: 297 GGTNFDRTAGVFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QE VF+ + CAAFL N D A V+FR Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEIHVFKSKTS-CAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 RVSTQYNKRSKTSNLKF---DSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASD 384
++ R+ T +K + WE Y E N T ++ +GL++QIS +D +D
Sbjct: 416 KI------RAPTILMKMIPTSTKFSWESYNEGSPSSNEAGTFVK-DGLVEQISMTRDKTD 468
Query: 385 YFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
YFWY S + L + S GH LH FVNG G+++G+ N T
Sbjct: 469 YFWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQN 528
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGL 492
+ L G N ALLS VGLP++G E G+ V + W Y++GL
Sbjct: 529 IKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGL 588
Query: 493 IGEKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE + +++ G + V W + LTWYK++F P GN+P+AL++ +MGKG+ WV
Sbjct: 589 RGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWV 648
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
NG +IGR+W ++ T++GN + YA N + C + YHVPR++LKP GNLL
Sbjct: 649 NGHNIGRHWPAY-TARGNCGRCNYAGIYNEKKCLSHCG-EPSQRWYHVPRSWLKPFGNLL 706
Query: 609 VLLEEENGNPLGITV 623
V+ EE G+P GI++
Sbjct: 707 VIFEEWGGDPSGISL 721
>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
Length = 779
Score = 530 bits (1365), Expect = e-147, Method: Compositional matrix adjust.
Identities = 302/800 (37%), Positives = 421/800 (52%), Gaps = 127/800 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K KEG LD I+TYVFWN HEP + QYDFSG D+IRF+K IQ++G+Y LRIG
Sbjct: 52 MWPDLIKKGKEGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ G+ FR+ N +
Sbjct: 112 PYVCAEWNYGGFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPII 171
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ E G Y+ W A MA GVPW+MC+QDDAP P++N CNG C
Sbjct: 172 LAQIENEYGNVIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN+PN P +WTE+WT +Y+ WGGK R+ +D+AF VA F K G++ NYYMYH
Sbjct: 232 -DNFS-PNNPNTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYH 289
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA IT YD APLDE+G + +PK+GHLK+LH + + L G + +
Sbjct: 290 GGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTV 349
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G L A V++ G + F+ N +E + F+ SY++P S+SILPDCKT +NT
Sbjct: 350 DFGNLVTATVYQTEEG-SSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNT 408
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILNFDNTLLRAEG------LLDQISAAK 380
+++TQ + K +N + S KW E N D+ LL+ +G L DQ +
Sbjct: 409 AKINTQTSVMVKKANEAENEPSTLKWSWRPE---NIDSVLLKGKGESTMRQLFDQKVVSN 465
Query: 381 DASDYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
D SDY WY + + L + S H+LHAFVNG++ G+ + +
Sbjct: 466 DESDYLWYMTTVNLKEQDPVLGKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFE 525
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV---------HRVRVQDKSFTNCSWG 487
G N LLS+TVGLP+ GAF E AG+ + K + W
Sbjct: 526 QDAKFNPGANVITLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWS 585
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
Y+ GL G + Q++S+ SP +T+ AP G++P+ ++L +GKG
Sbjct: 586 YKTGLSGFENQLFSS-----------ESP--------STWSAPLGSEPVVVDLLGLGKGT 626
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG-N 606
AW+NG +IGRYW +F ++ I C+ YHVPR+FL G N
Sbjct: 627 AWINGNNIGRYWPAF----------------LSDIDGCSA-----EYHVPRSFLNSEGDN 665
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
LVL EE GNP + TI + VC +V +K +
Sbjct: 666 TLVLFEEIGGNPSLVNFQTIGVGSVCANVY-------------------------EKNVL 700
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS-HSQGVVERACIGKSRCSIPLLSR 725
+ SC GK IS I FASFGNP GDC + G+C +S ++ ++ + C+GK +CSI +
Sbjct: 701 ELSCN-GKPISAIKFASFGNPGGDCGSFEKGTCEASNNAAAILTQECVGKEKCSIDVSED 759
Query: 726 YFGGDPCPGIHKALLVDAQC 745
FG C + K L V+A C
Sbjct: 760 KFGAAECGALAKRLAVEAIC 779
>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 846
Score = 530 bits (1365), Expect = e-147, Method: Compositional matrix adjust.
Identities = 307/809 (37%), Positives = 423/809 (52%), Gaps = 95/809 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVI+TYVFWN HEPQ+ QYDFS D++RFI+ IQ +GLY +RIG
Sbjct: 58 MWPYLIRKAKEGGLDVIETYVFWNAHEPQRRQYDFSENLDLVRFIRTIQKEGLYAMIRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I SEW YGGLP+WLH++ + FR+ N+ +
Sbjct: 118 PYISSEWNYGGLPVWLHNIPNMEFRTHNRAFMEEMKTFTRKIVDMMQDETLFAVQGGPII 177
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + A+ G Y+ W A++A F TGVPWVM +Q +AP +I++C+G C
Sbjct: 178 IAQIENEYGNVMHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F+ PN +KP IWTE+WT Y+ WG + R A+D+A+ VA F G++ NYYMYH
Sbjct: 238 -DQFQ-PNDNHKPKIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA +T YD APLDEYG + +PKWGHL++LH +K L G+
Sbjct: 296 GGTNFKRTAGGPYVTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQHT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G + A V+ G F+ N + K T+ FRN Y +P S+SILP+C + A+NT
Sbjct: 356 DYGNMVTATVY-TYDGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTL------LRAEGLLDQISAAKDA 382
+V+TQ K N + +W+ +E + + L A LLDQ D
Sbjct: 415 AKVNTQTTIMVKKDNEDLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDF 474
Query: 383 SDYFWYTFRFHYNSSN-----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
SDY WY + + L V + GH+LH FVNG++ G+ H + F +
Sbjct: 475 SDYLWYITSIDIKGDDDPSWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHES 534
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLE----------RKVAGVHRVRVQD----KSFTN 483
+ L G N+ +LLS TVGLP+ G F + + VA V D K +
Sbjct: 535 KIKLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSK 594
Query: 484 CSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSM 543
W Y+VGL GE YS K ++ R L WYKTTF++P G+DP+ ++L +
Sbjct: 595 NQWSYKVGLHGEHEMHYSYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGL 654
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPS-----QTQYAVNTVTSIHFCAIIKATNTYHVPR 598
GKG AWVNG SIGRYW S+ + S + Y N S+ CA + YHVPR
Sbjct: 655 GKGHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSNKCLSM--CA-QPSQRWYHVPR 711
Query: 599 AFLKPTG-NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
+FL+ N LVL EE G P + T+ + KVC + +
Sbjct: 712 SFLRDDDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGN------------------ 753
Query: 658 KKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSR 717
T++ +C + IS+I FASFG P G+C + G+C SS + ++ CIGK +
Sbjct: 754 -------TLELACNKNQVISEIKFASFGLPKGECGSFQKGNCESSEALSAIKAQCIGKDK 806
Query: 718 CSIPLLSRYFGGDPCP-GIHKALLVDAQC 745
CSI + R G C + L V+A C
Sbjct: 807 CSIQVSERALGPTRCRVAEDRRLAVEAVC 835
>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 530 bits (1364), Expect = e-147, Method: Compositional matrix adjust.
Identities = 302/779 (38%), Positives = 421/779 (54%), Gaps = 82/779 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK GGL+ I+TYVFWN HEPQ+GQYDFSG ND+++FIK +Q + LY LRIG
Sbjct: 46 MWPMLMKKAKNGGLNAIETYVFWNAHEPQRGQYDFSGNNDLVQFIKAVQKERLYAILRIG 105
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK-----------------------IENEY 97
P++ +EW YGG P+WLH++ GI FR++N+ YK IENE+
Sbjct: 106 PYVCAEWNYGGFPVWLHNLPGIKFRTNNQVYKVTFXFFFLTKNLKKINNMFLKNXIENEF 165
Query: 98 QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN 157
+E ++ ++G YV W A++A ++ PW+MC+Q DAP P++ C+ + PN
Sbjct: 166 GNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIVCNCDQFK-------PN 218
Query: 158 SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRT 217
+ N P +WTE W +++ WG + R+A+D+AF VA F GS NYYMYHGGTNFGR+
Sbjct: 219 NKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNFGRS 278
Query: 218 AAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEA 276
A IT YD APLDEYG + +PKWGHLK+LH I+ + L G I G A
Sbjct: 279 AGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHSTTA 338
Query: 277 FVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYN 336
+ G + F N E + F+ Y +P S+++LPDCKT +NT +V+TQ
Sbjct: 339 TSY-TYKGKSSCFF-GNPENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTT 396
Query: 337 KRSKTSNL--KFDSDEKWEEYREAIL------NFDNTLLRAEGLLDQISAAKDASDYFWY 388
R +L K KW+ E I + + + A L+DQ D+SDY WY
Sbjct: 397 IREMVPSLVGKHKKPLKWQWRNEKIEHLTHEGDISGSAITANSLIDQKMVTNDSSDYLWY 456
Query: 389 TFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV-HLRQ 443
FH N ++ + L V++ GHILHAFVN ++ G+ G + SFTL V +LR
Sbjct: 457 LTGFHLNGNDPLFGKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRNLRH 516
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGVH---RVRVQDKSFTNCS---WGYQVGLIGEKL 497
G N ALLS TVGLP+ GA+ E G++ + K+ + S W Y+VGL GEK
Sbjct: 517 GFNQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGKTIRDLSTNEWIYKVGLDGEKY 576
Query: 498 QIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
+ + + W S P Q TWYKT+F P G + + ++L MGKG+AWVNG+SIG
Sbjct: 577 EFFDPDHKFRKPWLSNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVNGKSIG 636
Query: 557 RYWVSF-KTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKP-TGNLLVLLEE 613
RYW S+ T G S Y S K T YH+PR+++ N L+L EE
Sbjct: 637 RYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTLILFEE 696
Query: 614 ENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLG 673
G PL I + T ++KVC V G K ++ +C
Sbjct: 697 FGGMPLNIEIKTTRVKKVCAKV-----------------------DLGSK--LELTCH-D 730
Query: 674 KKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPC 732
+ + +I+F FGNP G+C + GSCHSS + V+E+ C+ K +CSI + G C
Sbjct: 731 RTVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKLGLTGC 789
>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 731
Score = 530 bits (1364), Expect = e-147, Method: Compositional matrix adjust.
Identities = 295/672 (43%), Positives = 392/672 (58%), Gaps = 57/672 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP G+Y F R D+++FIK +Q GL+V LRIG
Sbjct: 56 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 176 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT +Y +GG R A+D+AF VA FI GS++NYYMYH
Sbjct: 236 -ENFK-PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQSGGSFLNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL+REPKWGHL++LH AIK C L++ +V
Sbjct: 294 GGTNFGRTAGGPFMATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKSCESALVSVDPSVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF+ S CAAFL N D + +V V F Y+LP SISILPDCKT ++T
Sbjct: 354 KLGSNQEAHVFKSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYST 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q S+ S W+ + E + + +GL +QI+ +D +DY W
Sbjct: 413 AKVGSQ---SSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S N ++P L + S GH L+ F+NG+ +G+ +GS +N + V+L
Sbjct: 470 YMTDITIGSDEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E AGV + + W Y+ GL GE
Sbjct: 530 RSGINKLALLSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGE 589
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W S ++ LTWYK TF AP G+ P+AL++ SMGKG+ W+NGQ
Sbjct: 590 ALGLHTVTGSSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQ 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLL 611
S+GR+W + ++G+ YA + C + YH+PR++L P GNLLV+
Sbjct: 650 SVGRHWPGY-IARGSCGDCSYAGTYDDKKCRTHCG-EPSQRWYHIPRSWLTPNGNLLVVF 707
Query: 612 EEENGNPLGITV 623
EE G+P I++
Sbjct: 708 EEWGGDPSRISL 719
>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
Length = 725
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 299/668 (44%), Positives = 389/668 (58%), Gaps = 59/668 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK GGLDVIQTYVFWN HEP G+Y F R D+++FIK +Q GL+V LRIG
Sbjct: 56 MWPDLIQKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG PIWL V GI FR+DN+P+K
Sbjct: 116 PYVCAEWNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C
Sbjct: 176 LSQIENEFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE WT +Y +GG R A+D+AF VA FI GS+ NYYMYH
Sbjct: 236 -ENFK-PNKVYKPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFIQSGGSFFNYYMYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA FM T Y APLDEYGL+++PKWGHL++LH AIK C L+ +V
Sbjct: 294 GGTNFGRTAGGPFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHALVAVDPSVT 353
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA VF SG CAAFL N D + +V V F + Y+LP SISILPDCKT FNT
Sbjct: 354 KLGNNQEAHVFNSKSG-CAAFLANYDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNT 412
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V+ K S+ S W+ + E + + +GL +QI +DA+DY W
Sbjct: 413 AKVAW---KASEVQMKPVYSRLPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLW 469
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y S N + P L + S GH LH F+NG+ +G+ +GS +N T V L
Sbjct: 470 YMTDITIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKL 529
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLS++VGLP+ G E GV + + W Y++G+ GE
Sbjct: 530 RPGINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGE 589
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + V W+ S ++ LTWYK TF AP G+ P+AL++ SMGKG+ W+NGQ
Sbjct: 590 SLGLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQ 649
Query: 554 SIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTY-HVPRAFLKPTGNLLVL 610
S+GR+W + ++G+ YA N +C K + + H+PR++L PTGNLLV+
Sbjct: 650 SVGRHWPGY-IAQGSCGNCYYAGTFNDKKCRTYCG--KPSQRWCHIPRSWLTPTGNLLVV 706
Query: 611 LEEENGNP 618
EE G+P
Sbjct: 707 FEEWGGDP 714
>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 716
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 292/671 (43%), Positives = 392/671 (58%), Gaps = 56/671 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY F+ R D++RF+K + GLYV LRIG
Sbjct: 53 MWPDLIQKAKDGGLDVIQTYVFWNGHEPARGQYHFADRYDLVRFVKLARQAGLYVHLRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 113 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAEMQRFVEKIVSMMKSEGLFEWQGGPII 172
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E A PY WAA MAV GVPWVMCKQDDAP PVIN CNG C
Sbjct: 173 LAQVENEYGPMESAMGAGAKPYANWAANMAVATDAGVPWVMCKQDDAPDPVINTCNGFYC 232
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP++WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 233 --DYFTPNSNSKPTMWTEAWTGWFTAFGGPVPHRPVEDMAFAVARFIQKGGSFVNYYMYH 290
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA F+ T Y AP+DEYGL+R+PKWGHL++LH AIK L++G +
Sbjct: 291 GGTNFDRTAGGPFIATSYDYDAPIDEYGLIRQPKWGHLRDLHKAIKQAEPALVSGDPTIQ 350
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+G ++A+VF+ ++G CAAFL N A +++ Y+LP SISILPDCKT FNT
Sbjct: 351 RIGNYEKAYVFKSSTGACAAFLSNYHTSSAARIVYNGRRYDLPAWSISILPDCKTAVFNT 410
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V + + + + W+ Y E D++ +GL++Q+S D SDY WY
Sbjct: 411 ATV----KEPTAPAKMNPAGGFAWQSYSEDTNALDSSAFTKDGLVEQLSMTWDKSDYLWY 466
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + +SS Q P L + S GH + FVNG+ G A+G +++ T V +
Sbjct: 467 TTYVNIDSSEQFLKTGQWPQLTINSAGHSVQVFVNGQSFGVAYGGYNSPKLTYSKPVKMW 526
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG+N ++LS +GLP+ G E GV + + +N W YQ+GL GE
Sbjct: 527 QGSNKISILSSAMGLPNQGTHYEAWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGES 586
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIG 556
L + S + + + S S + LTW+K F APAG+ P+AL++ SMGKG+ WVNG + G
Sbjct: 587 LGVNS-ISGSSSVEWSSASGAQPLTWHKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAG 645
Query: 557 RYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
RYW S++ S G+ YA + C I + YHVPR++LKP+GNLLV+LEE
Sbjct: 646 RYW-SYRAS-GSCGGCSYAGTFSEAKCQTNCGDI-SQRWYHVPRSWLKPSGNLLVVLEEF 702
Query: 615 NGNPLGITVDT 625
G+ G+T+ T
Sbjct: 703 GGDLSGVTLMT 713
>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 923
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 306/809 (37%), Positives = 423/809 (52%), Gaps = 95/809 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVI+TYVFWN HEPQ+ QY+FS D++RFI+ IQ +GLY +RIG
Sbjct: 58 MWPYLIRKAKEGGLDVIETYVFWNAHEPQRRQYEFSENLDLVRFIRTIQKEGLYAMIRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I SEW YGGLP+WLH++ + FR+ N+ +
Sbjct: 118 PYISSEWNYGGLPVWLHNIPNMEFRTHNRAFMEEMKTFTTKIVDMMQDETLFAVQGGPII 177
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + A+ G Y+ W A++A F TGVPWVM +Q +AP +I++C+G C
Sbjct: 178 IAQIENEYGNVMHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F+ PN +KP IWTE+WT Y+ WG + R A+D+A+ VA F G++ NYYMYH
Sbjct: 238 DQ-FQ-PNDNHKPKIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFFQFGGTFQNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA +T YD APLDEYG + +PKWGHL++LH +K L G+
Sbjct: 296 GGTNFKRTAGGPYVTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQNT 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G + A V+ G F+ N + K T+ FRN Y +P S+SILP+C + A+NT
Sbjct: 356 DYGNMVTATVY-TYDGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDN------TLLRAEGLLDQISAAKDA 382
+V+TQ K N + +W+ +E + + L A LLDQ D
Sbjct: 415 AKVNTQTTIMVKKDNEDLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDF 474
Query: 383 SDYFWYTFRFHYNSSN-----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
SDY WY + + L V + GH+LH FVNG++ G+ H + F +
Sbjct: 475 SDYLWYITSIDIKGDDDPSWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHES 534
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLE----------RKVAGVHRVRVQD----KSFTN 483
+ L G N+ +LLS TVGLP+ G F + + VA V D K +
Sbjct: 535 KIKLTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSK 594
Query: 484 CSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSM 543
W Y+VGL GE YS K ++ R L WYKTTF++P G+DP+ ++L +
Sbjct: 595 NQWSYKVGLHGEHEMHYSYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGL 654
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPS-----QTQYAVNTVTSIHFCAIIKATNTYHVPR 598
GKG AWVNG SIGRYW S+ + S + Y N S+ CA + YHVPR
Sbjct: 655 GKGHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSNKCLSM--CA-QPSQRWYHVPR 711
Query: 599 AFLKPTG-NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
+FL+ N LVL EE G P + T+ + KVC + +
Sbjct: 712 SFLRDNDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGN------------------ 753
Query: 658 KKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSR 717
T++ +C + IS+I FASFG P G+C + G+C SS + ++ CIGK +
Sbjct: 754 -------TLELACNKNQVISEIKFASFGLPKGECGSFQKGNCESSEALSAIKAQCIGKDK 806
Query: 718 CSIPLLSRYFGGDPCP-GIHKALLVDAQC 745
CSI + R G C + L V+A C
Sbjct: 807 CSIQVSERTLGPTRCRVAEDRRLAVEAVC 835
>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 306/806 (37%), Positives = 433/806 (53%), Gaps = 101/806 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEPQ+ +YDFSG + I+F + +Q GLY+ +RIG
Sbjct: 35 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIG 94
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R+DN+ YK
Sbjct: 95 PYVCAEWNYGGFPLWLHNMPGIQLRTDNQVYKNEMLTFTTKIVNMCKQANLFASQGGPII 154
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + G Y+ W A+MA F+ GVPW+MC+Q DAP P+IN CNG C
Sbjct: 155 LAQIENEYGNVMTPYGNAGKAYINWCAQMAESFNIGVPWIMCQQSDAPQPIINTCNGFYC 214
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN+P P ++TE+W +++ WG K RSA+D+AF VA F G + NYYMYH
Sbjct: 215 -DSFS-PNNPKSPKMFTENWVGWFKKWGDKDPYRSAEDVAFSVARFFQSGGVFNNYYMYH 272
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ IT YD APLDEYG + +PKWGHLK+LH++IKL + L GT +
Sbjct: 273 GGTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHSSIKLGEKILTNGTHSNK 332
Query: 269 SLGQLQEAFVFEETSGVCAAFL-VNNDERKAVTVLFRNI-----SYELPRKSISILPDCK 322
+ G +FV +T G +N K N Y +P S+SI+ CK
Sbjct: 333 TFG----SFVTFKTFGSFVTLTKFSNPTTKERFCFLSNTXKADGKYFVPAWSVSIIDGCK 388
Query: 323 TVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEG------LLDQI 376
FNT ++++Q + K N K + W EA+ + L+ +G LL+Q
Sbjct: 389 KEVFNTAKINSQTSIFVKVQNEKENVKLSWVWAPEAM----SDTLQGKGTFKENLLLEQK 444
Query: 377 SAAKDASDYFWYTFRFHYN--SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFT 434
D+SDY WY N SS L V + GH+LHAFVN Y GS G++ SF
Sbjct: 445 GTTIDSSDYLWYMTNVETNGTSSIHNVTLQVNTKGHVLHAFVNTRYIGSQWGNNGQ-SFV 503
Query: 435 LRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH----------RVRVQDKSFTNC 484
+ L+ GTN LLS TVGL + AF + G+ V++ ++
Sbjct: 504 FEKPILLKAGTNIITLLSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGNVKID---LSSN 560
Query: 485 SWGYQVGLIGEKLQIYSNLGLNKVLWSSI--RSPTRQLTWYKTTFRAPAGNDPIALNLQS 542
W Y+VGL GE Q+Y+ + + W+++ S R++TWYKT+F+ P+G DP+ L++Q
Sbjct: 561 LWSYKVGLNGEIKQLYNPVFSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQG 620
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRA 599
MGKGEAW+NGQSIGR+W SF N S+T + A + + C + YH+PR+
Sbjct: 621 MGKGEAWINGQSIGRFWPSFIAGNDNCSETCDYRGAYDPSKCVGNCG-NPSQRWYHIPRS 679
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
FL N LVL EE G+P ++V TI I +CG+
Sbjct: 680 FLSNNTNTLVLFEEIGGSPQQVSVQTITIGTICGNAN----------------------- 716
Query: 660 FGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
+ T++ SC IS+I FAS+GNP G C + GS ++S ++E+ C G CS
Sbjct: 717 --EGSTLELSCQGEYIISEIQFASYGNPKGKCGSFKQGSWDVTNSALLLEKTCKGMKSCS 774
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
+ + ++ FG + L+V A C
Sbjct: 775 VDVSAKLFGLGDAVNLSARLVVQALC 800
>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
Length = 741
Score = 527 bits (1357), Expect = e-146, Method: Compositional matrix adjust.
Identities = 284/687 (41%), Positives = 394/687 (57%), Gaps = 67/687 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI+ AK GG+DVI+TYVFW+ H+P + Y+F GR D++ F+K + GLY LRIG
Sbjct: 56 MWSQLISNAKAGGIDVIETYVFWDGHQPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW GG P+WL DVAGI FR++N+P+K
Sbjct: 116 PYVCAEWNLGGFPVWLKDVAGIEFRTNNQPFKAEMQTFVEKIVAMMKHDKLFAPQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y++WAA M+ TGVPW+MC+Q DAP +++ CNG C
Sbjct: 176 LAQIENEYGNIDAAYGAAGKEYMVWAANMSQGLGTGVPWIMCQQSDAPDYILDTCNGFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
PN+ KP +WTE+W+ ++Q WG R +D+AF VA F + GS+ NYYMY
Sbjct: 236 DAW--APNNKKKPKMWTENWSGWFQKWGEASPHRPVEDVAFAVARFFQRGGSFQNYYMYF 293
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGR++ +T YD AP+DE+G++R+PKWGHLK+LHAAIKLC L + I
Sbjct: 294 GGTNFGRSSGGPYVTTSYDYDAPIDEFGVIRQPKWGHLKQLHAAIKLCEAALGSNDPTYI 353
Query: 269 SLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
SLGQLQEA V+ T SG CAAFL N D TV F + +Y LP S+SILPDCKTV+ N
Sbjct: 354 SLGQLQEAHVYGSTSSGACAAFLANIDSSSDATVKFNSRTYLLPAWSVSILPDCKTVSHN 413
Query: 328 TERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
T +V Q + ++ + WE Y E + + ++ + A LL+QI+ KD SDY W
Sbjct: 414 TAKVDVQTAMPTMKPSI---TGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLW 470
Query: 388 YTFRFHYNSSNA---QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
YT + ++A +A L ++S ++H FVNG+ GSA + + + L G
Sbjct: 471 YTTSLDISQADAASGKALLYLESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASG 530
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDK------SFTNCSWGYQVGLIGEKLQ 498
N A+L TVGL + G F+E AG++ + T W +QVGL GE L
Sbjct: 531 HNSLAILCATVGLQNYGPFIETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLA 590
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFR-----------------APAGNDPIALNLQ 541
I++ G +V WSS + L WYK F+ +P+GNDP+AL+L+
Sbjct: 591 IFTESGSQRVRWSSAVPQGQALVWYKVIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLE 650
Query: 542 SMGKGEAWVNGQSIGRYWVSFKT--SKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPR 598
SMGKG+AW+NGQSIGR+W S + + G P Y + +S + + YHVPR
Sbjct: 651 SMGKGQAWINGQSIGRFWPSLRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPR 710
Query: 599 AFLKPTGNLLVLLEEENGNPLGITVDT 625
++L+ GNL+VL EEE G P G++ T
Sbjct: 711 SWLQDGGNLVVLFEEEGGKPSGVSFVT 737
>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
Length = 727
Score = 526 bits (1356), Expect = e-146, Method: Compositional matrix adjust.
Identities = 290/675 (42%), Positives = 388/675 (57%), Gaps = 64/675 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G Y F R D+++F K + GLY+ LRIG
Sbjct: 59 MWPDLIKKAKEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+VFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ G Y W A+MA+ TGVPW+M KQ+DAP P+I+ CNG C
Sbjct: 179 LSQIENEYGPMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMSKQEDAPYPIIDTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT ++ +GG R +DIAF VA FI GS++NYYMY+
Sbjct: 239 -EGFK-PNSDNKPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYY 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y AP+DEYGL+REPK+ HLKELH IKLC L++ + S
Sbjct: 297 GGTNFDRTAGVFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QE VF+ + CAAFL N D A V+FR Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEIHVFKSKTS-CAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 RVSTQYNKRSKTSNLKF---DSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASD 384
++ R+ T +K + WE Y E N T ++ +GL++QIS +D +D
Sbjct: 416 KI------RAPTILMKMIPTSTKFSWESYNEGSPSSNEAGTFVK-DGLVEQISMTRDKTD 468
Query: 385 YFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
YFWY S + L + S GH LH FVNG G+++G+ N T
Sbjct: 469 YFWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQN 528
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGL 492
+ L G N ALLS VGLP++G E G+ V + W Y++GL
Sbjct: 529 IKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGL 588
Query: 493 IGEKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE + +++ G + V W + LTWYK++F P GN+P+AL++ +MGKG+ WV
Sbjct: 589 RGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWV 648
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
NG +IGR+W ++ T++GN + YA N + C + YHVPR++LKP GNLL
Sbjct: 649 NGHNIGRHWPAY-TARGNCGRCNYAGIYNEKKCLSHCG-EPSQRWYHVPRSWLKPFGNLL 706
Query: 609 VLLEEENGNPLGITV 623
V+ EE G+P GI++
Sbjct: 707 VIFEEWGGDPSGISL 721
>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
Length = 771
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 292/726 (40%), Positives = 400/726 (55%), Gaps = 96/726 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LI AKEGG+DVI+TYVFWN HE G Y F GR D+++F K +Q G+Y+ LRIG
Sbjct: 14 MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIG 73
Query: 61 PFIESEWTYGG---------------------------------LPIWLHDVAGIVFRSD 87
PF+ +EW +GG +P+WLH + G VFR+
Sbjct: 74 PFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVPVWLHYIPGTVFRTY 133
Query: 88 NKPYKIENEYQTI--------EPAFHEKGPP-----------------------YVLWAA 116
N+P+ E T E F +G P Y LWAA
Sbjct: 134 NQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYENYYKEDGKKYALWAA 193
Query: 117 KMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVW 176
KMAV +T VPW+MC+Q DAP PVI+ CN C + P SP +P +WTE+W +++ +
Sbjct: 194 KMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQF--TPTSPKRPKMWTENWPGWFKTF 251
Query: 177 GGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEY 235
GG+ R +D+AF VA F K GS NYYMYHGGTNFGRTA IT YD AP+DEY
Sbjct: 252 GGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 311
Query: 236 GLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDE 295
GL R PKWGHLKELH AIKLC LL G ISLG EA ++ ++SG CAAF+ N D+
Sbjct: 312 GLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSSGACAAFISNVDD 371
Query: 296 RKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDE----- 350
+ V+FRN SY LP S+SILPDCK V FNT +VS+ N + SD+
Sbjct: 372 KNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPEHLQQSDKGQKTL 431
Query: 351 KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSN------AQAPLD 404
KW+ ++E + G +D I+ KD +DY W+T +++ ++ L
Sbjct: 432 KWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILIDANEEFLKKGSKPALL 491
Query: 405 VQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFL 464
++S GH LHAFVN +Y G+ G+ + +FT +N + LR G N+ A+LS+TVGL +G F
Sbjct: 492 IESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIAILSLTVGLQTAGPFY 551
Query: 465 ERKVAGVHRVRV-----QDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTR- 518
+ AGV V++ + ++ +W Y++G++GE L IY G+N V W+S P +
Sbjct: 552 DFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGEGMNSVKWTSTSEPPKG 611
Query: 519 -QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVN 577
LTWYK AP+G++P+ L++ MGKG AW+NG+ IGRYW K +
Sbjct: 612 QALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWPRISEFKKEDCVQECDYR 671
Query: 578 TVTSIHFCAI---IKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV--------DTI 626
+ C + YHVPR++ KP+GN+LV+ EE+ G+P IT +I
Sbjct: 672 GKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGDPTKITFVRHCHNPYSSI 731
Query: 627 AIRKVC 632
+ KVC
Sbjct: 732 VVEKVC 737
>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
Length = 722
Score = 524 bits (1349), Expect = e-146, Method: Compositional matrix adjust.
Identities = 299/671 (44%), Positives = 379/671 (56%), Gaps = 57/671 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV+QTYVFWN HEP G+Y F R D+++FIK Q GLYV LRIG
Sbjct: 57 MWPDLLQKAKDGGLDVLQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLAQQHGLYVHLRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I +EW +GG P+WL V GI FR+DN+P+
Sbjct: 117 PYICAEWNFGGFPVWLKYVPGIAFRTDNRPFMAAMEKFTQKIVYMMKAERLFQTQGGPII 176
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY +E G Y WAAKMAV +TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 177 LSQIENEYGPVEWEIGAPGKSYTQWAAKMAVGLNTGVPWVMCKQEDAPDPIIDTCNGFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE WT +Y +GG R AQD+AF VA FI GS+ NYYMYH
Sbjct: 237 -ENFT-PNKNYKPKMWTEIWTGWYTEFGGAVPTRPAQDLAFSVARFIQNGGSFANYYMYH 294
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGL REPK+ HLK +H AIK+ LL V
Sbjct: 295 GGTNFGRTAGGPFIATSYDYDAPLDEYGLPREPKYSHLKYMHKAIKMAEPALLATDAAVS 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QEA V++ SG CAAFL N D + V V F N Y LP SISILPDCKT FNT
Sbjct: 355 KLGNNQEAHVYQSRSG-CAAFLANYDTKYPVRVTFWNKQYNLPPWSISILPDCKTEVFNT 413
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAI-LNFDNTLLRAEGLLDQISAAKDASDYFW 387
RV +S + + + W+ Y E + + D+ + GL +QIS D +DY W
Sbjct: 414 ARVG-----QSPPTKMTPVAHLSWQAYIEDVATSADDNAFTSVGLREQISLTWDNTDYLW 468
Query: 388 YTFRF------HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + + L V S GH LH F+NG+ +GSA+G+ V L
Sbjct: 469 YMTDITIGPNEQFLRTGKYPTLKVDSAGHALHVFINGQLSGSAYGTLAFPKLEFNQGVKL 528
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N ALLSV+VGL + G E GV V T W Y++G+ GE
Sbjct: 529 RAGINKLALLSVSVGLANVGLHFETWNTGVLGPVTLAGVNSGTWDMTRWQWTYKIGMRGE 588
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
+ +++ G + V W S+ + R LTWYK AP GN P+AL++ SMGKG+ W+NGQ
Sbjct: 589 DMSLHTVSGSSSVEWVQGSLLAQYRPLTWYKAILNAPPGNAPLALDMGSMGKGQMWINGQ 648
Query: 554 SIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLE 612
SIGR+W ++K + G+ YA T + YHVPR++LK +GNLLV+ E
Sbjct: 649 SIGRHWPAYK-AHGSCGACYYAGTYTENKCRTNCGQPSQRWYHVPRSWLKSSGNLLVVFE 707
Query: 613 EENGNPLGITV 623
E G+P I++
Sbjct: 708 EWGGDPTKISL 718
>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 833
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 303/806 (37%), Positives = 421/806 (52%), Gaps = 93/806 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGL+ I TYVFW+LHEPQ+ QYDF+G D++RFIK IQ+QGLY LRIG
Sbjct: 60 MWPDLIQKSKDGGLNTIDTYVFWDLHEPQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EWTYGG P+WLH+ I R++N Y
Sbjct: 120 PYVCAEWTYGGFPVWLHNQPSIQLRTNNTVYMSEMQTFTTMIVDMMKKEQLFASQGGPII 179
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + A+H+ G Y+ W A+MA TGVPW+MC+QD+AP P+IN CNG C
Sbjct: 180 ISQIENEYGNVMRAYHDAGVQYINWCAQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+PN P +WTE+W+ +Y+ WGG R+A+D+AF VA F G++ NYYMYH
Sbjct: 240 DQF--TPNNPNSPKMWTENWSGWYKNWGGSDPHRTAEDLAFSVARFYQLGGTFQNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APL+EYG +PKWGHL++LH + + L G +
Sbjct: 298 GGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNV 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
L A ++ G + F N++ + VT+ + ++Y +P S+SILPDC +NT
Sbjct: 358 DYETLTSATIY-SYQGKSSCFFGNSNADRDVTINYGGVNYTIPAWSVSILPDCSNEVYNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V++QY+ K + + +W E I A LLDQ + A+D SDY
Sbjct: 417 AKVNSQYSTFVKKGSEAENEPNSLQWTWRGETIQYITPGRFTASELLDQKTVAEDTSDYL 476
Query: 387 WYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+Y ++ + L V + GHILHAFVNGE+ G + F R +V L+
Sbjct: 477 YYMTTVDISNDDPIWGKDLTLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQ 536
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVH----------RVRVQDKSFTNCSWGYQVGL 492
G N+ LLS TVGL + G + G+H + N W Y+ GL
Sbjct: 537 LGKNEITLLSATVGLTNYGPDFDMVNQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGL 596
Query: 493 IGEKLQIYSNLGLNKV-LWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE +I+ LG + W S P R WYK TF AP G DP+ ++L +GKGEAWV
Sbjct: 597 NGEDKKIF--LGRARYNQWKSDNLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWV 654
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTGNL 607
NG S+GRYW S+ ++G + C + YHVPR+FL T N
Sbjct: 655 NGHSLGRYWPSY-IARGEGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNR 713
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQ 667
LVL EE GNP +T T+ + C + + T++
Sbjct: 714 LVLFEEFGGNPSSVTFQTVTVGNACANAREGY-------------------------TLE 748
Query: 668 PSCPLGKKISKIVFASFGNPDGDCER--------YAVGSCHSSHSQGVVERACIGKSRCS 719
SC G+ IS I FASFG+P G C + + G+C ++ S ++++ C+GK CS
Sbjct: 749 LSCQ-GRAISGIKFASFGDPQGTCGKPFATGSQVFEKGTCEAADSLSIIQKLCVGKYSCS 807
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
I + + G C K L V+A C
Sbjct: 808 IDVSEQILGPAGCTADTKRLAVEAIC 833
>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 303/802 (37%), Positives = 418/802 (52%), Gaps = 89/802 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGL+ I TYVFW+LHEPQ+ QYDF+G D++RFIK IQ+QGLY LRIG
Sbjct: 60 MWPDLIQKSKDGGLNTIDTYVFWDLHEPQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EWTYGG P+WLH+ I R++N Y
Sbjct: 120 PYVCAEWTYGGFPVWLHNQPSIQLRTNNTVYMSEMQTFTTMIVDMMKKEQLFASQGGPII 179
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + A+H+ G Y+ W A+MA TGVPW+MC+QD+AP P+IN CNG C
Sbjct: 180 ISQIENEYGNVMRAYHDAGVQYINWCAQMAAALDTGVPWIMCQQDNAPQPMINTCNGYYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+PN P +WTE+W+ +Y+ WGG R+A+D+AF VA F G++ NYYMYH
Sbjct: 240 DQF--TPNNPNSPKMWTENWSGWYKNWGGSDPHRTAEDLAFSVARFYQLGGTFQNYYMYH 297
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APL+EYG +PKWGHL++LH + + L G +
Sbjct: 298 GGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNV 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
L A ++ G + F N++ + VT+ + ++Y +P S+SILPDC +NT
Sbjct: 358 DYETLTSATIY-SYQGKSSCFFGNSNADRDVTINYGGVNYTIPAWSVSILPDCSNEVYNT 416
Query: 329 ERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V++QY+ K + + +W E I A LLDQ + A+D SDY
Sbjct: 417 AKVNSQYSTFVKKGSEAENEPNSLQWTWRGETIQYITPGRFTASELLDQKTVAEDTSDYL 476
Query: 387 WYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
+Y L V + GHILHAFVNGE+ G + F R +V L+ G N
Sbjct: 477 YYMTTNDDPIWGKDLTLSVNTSGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKN 536
Query: 447 DGALLSVTVGLPDSGAFLERKVAGVH----------RVRVQDKSFTNCSWGYQVGLIGEK 496
+ LLS TVGL + G + G+H + N W Y+ GL GE
Sbjct: 537 EITLLSATVGLTNYGPDFDMVNQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGED 596
Query: 497 LQIYSNLGLNKV-LWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
+I+ LG + W S P R WYK TF AP G DP+ ++L +GKGEAWVNG S
Sbjct: 597 KKIF--LGRARYNQWKSDNLPVNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHS 654
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTGNLLVLL 611
+GRYW S+ ++G + C + YHVPR+FL T N LVL
Sbjct: 655 LGRYWPSY-IARGEGCSPECDYRGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLF 713
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE GNP +T T+ + C + + T++ SC
Sbjct: 714 EEFGGNPSSVTFQTVTVGNACANAREGY-------------------------TLELSCQ 748
Query: 672 LGKKISKIVFASFGNPDGDCER--------YAVGSCHSSHSQGVVERACIGKSRCSIPLL 723
G+ IS I FASFG+P G C + + G+C ++ S ++++ C+GK CSI +
Sbjct: 749 -GRAISGIKFASFGDPQGTCGKPFATGSQVFEKGTCEAADSLSIIQKLCVGKYSCSIDVS 807
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
+ G C K L V+A C
Sbjct: 808 EQILGPAGCTADTKRLAVEAIC 829
>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
Length = 683
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 293/674 (43%), Positives = 395/674 (58%), Gaps = 36/674 (5%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENEY A G Y+ WAAKMAV TGVPWVMCK+DDAP P+INACNG C +
Sbjct: 13 QIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPMINACNGFYC-D 71
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
F PN P KP++WTE W+ ++ +GG + R QD+AF VA FI K GSY+NYYMYHGG
Sbjct: 72 GFS-PNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQKGGSYINYYMYHGG 130
Query: 212 TNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
TNFGRTA IT YD P+DEYGL+R+PK+GHLKELH AIKLC L++ V SL
Sbjct: 131 TNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEHALVSSDPTVTSL 190
Query: 271 GQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTER 330
G Q+A+VF CAAFL +N + F N+ Y+LP SISILPDC+ V FNT +
Sbjct: 191 GAYQQAYVFNSGPRRCAAFL-SNFHSTGARMTFNNMHYDLPAWSISILPDCRNVVFNTAK 249
Query: 331 VSTQYNK-RSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYFWY 388
V Q ++ + +N + S W+ Y E + + + + + A GLL+QI+ +D SDY WY
Sbjct: 250 VGVQTSRVQMIPTNSRLFS---WQTYDEDVSSLHERSSIAAGGLLEQINVTRDTSDYLWY 306
Query: 389 TFRFHYNSSNAQA----PLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
+SS + L VQS GH LH FVNG+++GSA G+ ++ FT VHLR G
Sbjct: 307 MTNVDISSSELRGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREHRQFTFAKPVHLRAG 366
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIGEKLQ 498
N ALLS+ VGLP+ G E G+ D K T W +VGL GE +
Sbjct: 367 INKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAMD 426
Query: 499 IYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
+ S G + V W S+ + T+Q L WYK F AP G++P+AL+++SMGKG+ W+NGQSI
Sbjct: 427 LVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSI 486
Query: 556 GRYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEE 614
G+YW+++ + G+ S Y T YHVPR++LKPT NL+V+ EE
Sbjct: 487 GKYWMAY--ANGDCSLCSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTQNLVVVFEEL 544
Query: 615 NGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK---KPTVQPSCP 671
G+P IT+ ++ VC + H + ++ D D + K + V C
Sbjct: 545 GGDPSKITLVKRSVAGVCADLQEHH--------PNAEKLDIDSHEESKTLHQAQVHLQCV 596
Query: 672 LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP 731
G+ IS I FASFG P G C + G+CH+++S +VE+ CIG+ C + + + FG DP
Sbjct: 597 PGQSISSIKFASFGTPTGTCGSFQQGTCHATNSHAIVEKNCIGRESCLVTVSNSIFGTDP 656
Query: 732 CPGIHKALLVDAQC 745
CP + K L V+A C
Sbjct: 657 CPNVLKRLSVEAVC 670
>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 785
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 296/723 (40%), Positives = 401/723 (55%), Gaps = 108/723 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP +GQY F+ R D++RF+K ++ GLYV LR+G
Sbjct: 70 MWPGLIQKAKDGGLDVVQTYVFWNGHEPAQGQYYFADRYDLVRFVKLVRQAGLYVHLRVG 129
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 130 PYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFKAAMQKFVEKIVSMMKSEGLFEWQGGPII 189
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENE+ +E G PY WAA+MAV + GVPWVMCKQDDAP PVIN CNG C
Sbjct: 190 MAQVENEFGPMESVVGSGGKPYAHWAAQMAVGTNAGVPWVMCKQDDAPDPVINTCNGFYC 249
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+ +KP++WTE WT ++ +GG R +D+AF VA F+ K GS+VNYYMYH
Sbjct: 250 --DYFTPNNKHKPTMWTEAWTGWFTKFGGAAPHRPVEDLAFAVARFVQKGGSFVNYYMYH 307
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYG-------------------------------- 236
GGTNFGRTA F+ T Y AP+DE+G
Sbjct: 308 GGTNFGRTAGGPFIATSYDYDAPIDEFGMQWLLPSLINLNSHRLPRDICRKSSQCGFYLS 367
Query: 237 -----------------LVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVF 279
L+R+PKWGHL+ +H AIK L++G + S+G ++A+VF
Sbjct: 368 VVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHRAIKQAEPALVSGDPTIRSIGNYEKAYVF 427
Query: 280 EETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVS--TQYNK 337
+ +G CAAFL N + AV + F Y+LP SISILPDCKT FNT V T K
Sbjct: 428 KSKNGACAAFLSNYHVKSAVRIRFDGRHYDLPAWSISILPDCKTAVFNTATVKEPTLLPK 487
Query: 338 RSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS 397
S + +F W+ Y E + D++ +GL++Q+S D SDY WYT + S+
Sbjct: 488 MSPVMH-RF----AWQSYSEDTNSLDDSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSN 542
Query: 398 -----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALL 451
+ Q P L V S GH + FVNG GS +G +DN T V + QG+N ++L
Sbjct: 543 ERFLKSGQWPQLSVYSAGHSMQVFVNGRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISIL 602
Query: 452 SVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGL 505
S VGLP++G E GV + + ++ W YQVGL GE L +++ G
Sbjct: 603 SSAVGLPNNGDHFELWNVGVLGPVTLSGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGS 662
Query: 506 NKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTS 565
+ V W+ T+ LTW+K F APAG+DP+AL++ SMGKG+ WVNG+ GRYW S
Sbjct: 663 SAVEWAGPGGGTQPLTWHKALFNAPAGSDPVALDMGSMGKGQVWVNGRHAGRYWSYRAHS 722
Query: 566 KGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGIT 622
+G + Y + TS C + + YHVPR++LKP+GNLLV+LEE G+ G++
Sbjct: 723 RGCGRCSYAGTYREDQCTSN--CGDL-SQRWYHVPRSWLKPSGNLLVVLEEYGGDLAGVS 779
Query: 623 VDT 625
+ T
Sbjct: 780 LAT 782
>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 826
Score = 520 bits (1339), Expect = e-144, Method: Compositional matrix adjust.
Identities = 308/780 (39%), Positives = 412/780 (52%), Gaps = 84/780 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAK+GGLD I++YVFW+ HEP + +YDFSG D I+F + IQ GLY LRIG
Sbjct: 58 MWPDIIQKAKDGGLDAIESYVFWDRHEPVRREYDFSGNLDFIKFFQIIQEAGLYAILRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WLH++ GI R+DN YK
Sbjct: 118 PYVCAEWNFGGFPLWLHNMPGIELRTDNPIYKNEMQIFTTKIVNMAKEAKLFASQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I + E G Y+ W A+MA+ + GVPW+MC+Q DAP P+IN CNG C
Sbjct: 178 LAQIENEYGNIMTDYGEAGKTYIKWCAQMALAQNIGVPWIMCQQHDAPQPMINTCNGHYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F+ PN+P P ++TE+W ++Q WG + RSA+D AF VA F G NYYMYH
Sbjct: 238 -DSFQ-PNNPKSPKMFTENWIGWFQKWGERVPHRSAEDSAFSVARFFQNGGILNNYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA +M T Y APLDEYG + +PKWGHLK+LHAAIKL + + GT+
Sbjct: 296 GGTNFGRTAGGPYMTTSYEYDAPLDEYGNLNQPKWGHLKQLHAAIKLGEKIITNGTRTDK 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVN-NDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
G + T+G FL N ND + A L ++ +Y LP S++IL C FN
Sbjct: 356 DFGNEVTLTTYTHTNGERFCFLSNTNDSKDANVDLQQDGNYFLPAWSVTILDGCNKEVFN 415
Query: 328 TERVSTQYNKRSKTSNLKFDSDEK----WEEYREAILNFDNTLLRAEGLLDQISAAKDAS 383
T +V++Q + K S+ D+ K W ++ + LL+Q D S
Sbjct: 416 TAKVNSQTSIMVKKSD---DASNKLTWAWIPEKKKDTMHGKGNFKVNQLLEQKELTFDVS 472
Query: 384 DYFWYTFRFHYNSSN--AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
DY WY N ++ + A L V + GH L A+VNG + G S +FT V L
Sbjct: 473 DYLWYMTSVDINDTSIWSNATLRVNTRGHTLRAYVNGRHVGYKF-SQWGGNFTYEKYVSL 531
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-------WGYQVGLIG 494
++G N LLS TVGLP+ GA ++ G+ VQ N + W Y++GL G
Sbjct: 532 KKGLNVITLLSATVGLPNYGAKFDKIKTGIAGGPVQLIGNNNETIDLSTNLWSYKIGLNG 591
Query: 495 EKLQIYSNLGLNKVLWSSIRSP---TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
EK ++Y V W + SP R LTWYK F AP+GNDP+ ++L +GKGEAWVN
Sbjct: 592 EKKRLYDPQPRIGVSWRT-NSPYPIGRSLTWYKADFVAPSGNDPVVVDLLGLGKGEAWVN 650
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---YHVPRAFLKPTGNLL 608
GQSIGRYW S+ T+ S T C + YHVPR+FLK N L
Sbjct: 651 GQSIGRYWTSWITATNGCSDTCDYRGKYVPAQKCNTNCGNPSQRWYHVPRSFLKNDKNTL 710
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
VL EE GNP ++ T+ +C V L L
Sbjct: 711 VLFEEIGGNPQNVSFQTVITGTICAQVQEGALLEL------------------------- 745
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
SC GK IS+I F+SFGNP G+C + G+ ++ Q VVE AC+G++ C + FG
Sbjct: 746 SCQGGKTISQIQFSSFGNPTGNCGSFKKGTWEATDGQSVVEAACVGRNSCGFMVTKEAFG 805
>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
Length = 773
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 298/775 (38%), Positives = 410/775 (52%), Gaps = 87/775 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGL+ I TYVFW+LHEPQ+ QYDF+G D++RFIK IQ+QGLY LRIG
Sbjct: 56 MWPDLIQKSKDGGLNTIDTYVFWDLHEPQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKMAV 120
P++ +EWTYGG P+WLH+ I R++N Y IENEY + A+H+ G Y+ W A+MA
Sbjct: 116 PYVCAEWTYGGFPVWLHNQPSIQLRTNNTVYMIENEYGNVMRAYHDAGVQYINWCAQMAA 175
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
TGVPW+MC+QD+AP P+IN CNG C + PN+PN P +WTE+W+ +Y+ WGG
Sbjct: 176 ALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFT--PNNPNSPKMWTENWSGWYKNWGGSD 233
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVR 239
R+A+D+AF VA F G++ NYYMYHGGTNFGRTA IT YD APL+EYG
Sbjct: 234 PHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKN 293
Query: 240 EPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAV 299
+PKWGHL++LH + + L G + L A ++ G + F N++ + V
Sbjct: 294 QPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIY-SYQGKSSCFFGNSNADRDV 352
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYN----KRSKTSNLKFDSDEKWEEY 355
T+ + ++Y +P S+SILPDC +NT +V++QY+ K S+ N W
Sbjct: 353 TINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGSEAENEPNSLQWTW--- 409
Query: 356 REAILNFDNTLLRAEGLLDQISAAKDAS--DYFWYTFRFHYNSSNAQAPLDVQSHGHILH 413
R E + + D S D W L V + GHILH
Sbjct: 410 ------------RGETIQYITPGSVDISNDDPIW----------GKDLTLSVNTSGHILH 447
Query: 414 AFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH- 472
AFVNGE+ G + F R ++ L+ G N+ LLSVTVGL + G + G+H
Sbjct: 448 AFVNGEHIGYQYALLGQFEFQFRRSITLQLGKNEITLLSVTVGLTNYGPDFDMVNQGIHG 507
Query: 473 ---------RVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKV-LWSSIRSPT-RQLT 521
+ N W Y+ GL GE +I+ LG + W S P R
Sbjct: 508 PVQIIASNGSADIIKDLSNNNQWAYKAGLNGEDKKIF--LGRARYNQWKSDNLPVNRSFV 565
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTS 581
WYK TF AP G DP+ ++L +GKGEAWVNG S+GRYW S+ ++G +
Sbjct: 566 WYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSY-IARGEGCSPECDYRGPYK 624
Query: 582 IHFCAIIKATNT---YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNS 638
C + YHVPR+FL T N LVL EE GNP +T T+ + C +
Sbjct: 625 AEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFXGNPSSVTFQTVTVGNACANAREG 684
Query: 639 HLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCER----- 693
+ T++ SC G+ IS I FASFG+P G C +
Sbjct: 685 Y-------------------------TLELSCQ-GRAISXIKFASFGDPQGTCGKPFATG 718
Query: 694 ---YAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ G+C ++ S ++++ C+GK CSI + + G C K L V+A C
Sbjct: 719 SQVFEKGTCEAADSLSIIQKLCVGKYSCSIDVSEQILGPAGCTADTKRLAVEAIC 773
>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
Length = 763
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 304/763 (39%), Positives = 405/763 (53%), Gaps = 94/763 (12%)
Query: 67 WTYG-GLPIWLHDVAGIVFRSDNKPYK-------------------------------IE 94
W Y G P+WL DV GI FR+DN P+K +E
Sbjct: 1 WDYCRGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVE 60
Query: 95 NEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFK 154
NEY IE ++ ++G Y+ W MA+ VPWVMC+Q DAP +IN+CNG C + FK
Sbjct: 61 NEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYC-DGFK 119
Query: 155 GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNF 214
NSP+KP WTE+W ++ WG + R +D+AF VA F + GS+ NYYMY GGTNF
Sbjct: 120 A-NSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNF 178
Query: 215 GRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-TQNVISLGQ 272
GRTA F IT Y +P+DEYGL+REPKWGHLK+LH A+KLC L++ + I LG
Sbjct: 179 GRTAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGP 238
Query: 273 LQEAFVFEETSGV-------------CAAFLVNNDERKAVTVLFRNISYELPRKSISILP 319
QEA V+ S C+AFL N DERKAV V F +Y LP S+SILP
Sbjct: 239 KQEAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILP 298
Query: 320 DCKTVAFNTERVSTQ--------YNKRSKTSNLKFDSDEK---------WEEYREAILNF 362
DC+ V FNT +V+ Q Y S +LK + ++ W +E I +
Sbjct: 299 DCQNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIW 358
Query: 363 DNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGHILHA 414
+ +G+L+ ++ KD SDY WY R H + N + + S +
Sbjct: 359 SDQNFTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRV 418
Query: 415 FVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRV 474
FVNG+ TGSA G V F V +G ND LLS +GL +SGAF+E+ AG+ R
Sbjct: 419 FVNGKLTGSAIGQW--VKFV--QPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGI-RG 473
Query: 475 RVQDKSFTNCS-------WGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKT 525
R++ F N W YQVGL GE L YS K W+ S+ + TWYK
Sbjct: 474 RIKLTGFKNGDIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKA 533
Query: 526 TFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIH 583
F +P G DP+A+NL SMGKG+AWVNG IGRYW G P + Y A N+
Sbjct: 534 YFSSPDGTDPVAINLGSMGKGQAWVNGHHIGRYWSVVSPKDGCPRKCDYRGAYNSGKCAT 593
Query: 584 FCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPP 642
C + T + YH+PR++LK + NLLVL EE GNPL I V + +CG V+ SH P
Sbjct: 594 NCG--RPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTGVICGQVSESHYPS 651
Query: 643 LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS 702
L L + D + P + C G IS + FAS+G P G C +++ G CH++
Sbjct: 652 LRK-LSNDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRGPCHAT 710
Query: 703 HSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+S VV +AC+GK+ C++ + + FGGDPC I K L V+A+C
Sbjct: 711 NSLSVVSQACLGKNSCTVEISNSAFGGDPCHSIVKTLAVEARC 753
>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 774
Score = 514 bits (1323), Expect = e-143, Method: Compositional matrix adjust.
Identities = 307/753 (40%), Positives = 406/753 (53%), Gaps = 88/753 (11%)
Query: 71 GLPIWLHDVAGIVFRSDNKPYK-------------------------------IENEYQT 99
G P+WL DV GI FR+DN+PYK IENEY
Sbjct: 19 GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 78
Query: 100 IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSP 159
I+ + + G Y+LWAA+MA+ TGVPWVMC+Q DAP ++N CN C + FK PNS
Sbjct: 79 IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYC-DGFK-PNSY 136
Query: 160 NKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAA 219
NKP+IWTEDW +Y WG R AQD AF VA F + GS NYYMY GGTNF RTA
Sbjct: 137 NKPTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAG 196
Query: 220 A-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPL--LTGTQNVISLGQLQEA 276
IT Y AP+DEYG++R+PKWGHLK+LHAAIKLC L + G+ + + LG +QEA
Sbjct: 197 GPLQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEA 256
Query: 277 FVFEE-----------TSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
V+ S C+AFL N DE K +V SY LP S+SILPDC+TVA
Sbjct: 257 HVYSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVA 316
Query: 326 FNTERVSTQ------------YNKRSKTSNLKFDS----DEKWEEYREAILNFDNTLLRA 369
FNT RV TQ Y+ R K L W ++E + + + A
Sbjct: 317 FNTARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTA 376
Query: 370 EGLLDQISAAKDASDYFWYTFR--------FHYNSSNAQAPLDVQSHGHILHAFVNGEYT 421
+G+L+ ++ KD SDY YT R ++NS L + + FVNG+
Sbjct: 377 QGILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLA 436
Query: 422 GSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQ--- 477
GS G +L + L QG N+ LLS VGL + GAFLE+ AG +V++
Sbjct: 437 GSKVGHW----VSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLS 492
Query: 478 --DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRS--PTRQLTWYKTTFRAPAGN 533
D TN W YQ+GL GE +IYS WSS+++ TW+KT F AP GN
Sbjct: 493 NGDIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGN 552
Query: 534 DPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT 593
P+ ++L SMGKG+AWVNG IGRYW G PS YA S AT +
Sbjct: 553 GPVTIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCPSSCNYAGTYSDSKCRSNCGIATQS 612
Query: 594 -YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQR 652
YH+PR +L+ +GNLLVL EE G+P I+++ + +C ++ ++ PPLS+W R
Sbjct: 613 WYHIPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSR-AAN 671
Query: 653 GDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERAC 712
G + P ++ C G ISKI FAS+G P G C+ ++VG+CH+S + +V AC
Sbjct: 672 GRPSVNTVA--PELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVEAC 729
Query: 713 IGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK+RC+I + + F GDPC + K L V+A+C
Sbjct: 730 EGKNRCAISVTNEVF-GDPCRKVVKDLAVEAEC 761
>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
Length = 718
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 294/673 (43%), Positives = 385/673 (57%), Gaps = 63/673 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L KAK+GGLDVIQTYVFWN HEP G Y R D ++ K Q L V LR+
Sbjct: 55 MWPDLFQKAKDGGLDVIQTYVFWNGHEPSPGNYTLKDRLDWVKLSKLAQQAVLNVHLRMV 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P T+ G P+WL V G+ FR+DN+P+K
Sbjct: 115 P------TFVGFPVWLKYVPGMAFRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPII 168
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPW MCKQ+DAP PVI+ CNG C
Sbjct: 169 MSQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYC 228
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE+W+ +Y +GG R +D+A+ VA FI GS+VNYYMYH
Sbjct: 229 -ENFT-PNENFKPKMWTENWSGWYTDFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYH 286
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD AP+DEYGL EPKW HLK LH AIK C L++ V
Sbjct: 287 GGTNFGRTSSGLFIATSYDYDAPIDEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVT 346
Query: 269 SLGQLQ-EAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
LG EA V+ + +CAAFL N D + A TV F N Y+LP S+SILPDCKTV FN
Sbjct: 347 WLGNKNLEAHVYYVNTSICAAFLANYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFN 406
Query: 328 TERVSTQ-YNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDY 385
T V+ ++KR FD W+ Y E + D+ + A L +QI+ +D+SDY
Sbjct: 407 TATVNGHSFHKRMTPVETTFD----WQSYSEEPAYSSDDDSIIANALWEQINVTRDSSDY 462
Query: 386 FWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WY + + S N Q P L + S GH+LH FVNG+ +G+ +G DN T +V
Sbjct: 463 LWYLTDVNISPSESFIKNGQFPTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESV 522
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLER---KVAGVHRVRVQDKSFTNCS---WGYQVGLI 493
+L+ G N +LLSV VGLP+ G E V G R++ D+ + S W Y+VGL
Sbjct: 523 NLKVGNNKISLLSVAVGLPNVGLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLK 582
Query: 494 GEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE L +++ G + + W+ S ++ LTWYKTTF AP+GNDP+AL++ SMGKGE W+N
Sbjct: 583 GESLSLHTITGSSSIDWTQGSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWIN 642
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVL 610
QSIGR+W ++ + GN + YA + T YH+PR++L +GN+LV+
Sbjct: 643 DQSIGRHWPAY-IAHGNCDECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVV 701
Query: 611 LEEENGNPLGITV 623
LEE G+P GI++
Sbjct: 702 LEEWGGDPTGISL 714
>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
Length = 786
Score = 509 bits (1311), Expect = e-141, Method: Compositional matrix adjust.
Identities = 294/799 (36%), Positives = 410/799 (51%), Gaps = 141/799 (17%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K KEG LD I+TYVFWN HEP + QYDFSG D+IRF+K IQ++G+Y LRIG
Sbjct: 75 MWPDLIKKGKEGSLDAIETYVFWNAHEPTRRQYDFSGNLDLIRFLKTIQNEGMYGVLRIG 134
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P+WLH++ G+ FR+ N +
Sbjct: 135 PYVCAEWNYGGFPVWLHNMPGMEFRTTNTAFMNEMQNFTTMIVEMVKKEKLFASQGGPII 194
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY + ++ E G Y+ W A MA GVPW+MC+QDDAP P++N CNG C
Sbjct: 195 LAQIENEYGNVIGSYGEAGKAYIQWCANMANSLDVGVPWIMCQQDDAPQPMLNTCNGYYC 254
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ F PN+PN P +WTE+WT +Y+ WGGK R+ +D+AF VA F K G++ NYYMYH
Sbjct: 255 -DNFS-PNNPNTPKMWTENWTGWYKNWGGKDPHRTTEDVAFAVARFFQKEGTFQNYYMYH 312
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA IT YD APLDE+G + +PK+GHLK+LH + + L G + +
Sbjct: 313 GGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVLHAMEKTLTYGNISTV 372
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G L A V++ G + F+ N +E + F+ SY++P S+SILPDCKT +NT
Sbjct: 373 DFGNLVTATVYQTEEG-SSCFIGNVNETSDAKINFQGTSYDVPAWSVSILPDCKTETYNT 431
Query: 329 ERVSTQYNKRSKTSNLKFD--SDEKWEEYREAILNFDNTLLRAEG------LLDQISAAK 380
+++TQ + K +N + S KW E N D+ LL+ +G L DQ +
Sbjct: 432 AKINTQTSVMVKKANEAENEPSTLKWSWRPE---NIDSVLLKGKGESTMRQLFDQKVVSN 488
Query: 381 DASDYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLR 436
D SDY WY + + L + S H+LHAFVNG++ G+ + +
Sbjct: 489 DESDYLWYMTTVNLKEQDPVLGKNMSLRINSTAHVLHAFVNGQHIGNYRVENGKFHYVFE 548
Query: 437 NTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV---------HRVRVQDKSFTNCSWG 487
G N LLS+TVGLP+ GAF E AG+ + K + W
Sbjct: 549 QDAKFNPGANVITLLSITVGLPNYGAFFENFSAGITGPVFIIGRNGDETIVKDLSTHKWS 608
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
Y+ GL G + Q++S+ SP +T+ AP G++P+ ++L +GKG
Sbjct: 609 YKTGLSGFENQLFSS-----------ESP--------STWSAPLGSEPVVVDLLGLGKGT 649
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNL 607
AW+NG +IGRYW +F + I NT
Sbjct: 650 AWINGNNIGRYWPAFLSD----------------------IDGDNT-------------- 673
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQ 667
LVL EE GNP + TI + VC +V +K ++
Sbjct: 674 LVLFEEIGGNPSLVNFQTIGVGSVCANVY-------------------------EKNVLE 708
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS-HSQGVVERACIGKSRCSIPLLSRY 726
SC GK IS I FASFGNP GDC + G+C +S ++ ++ + C+GK +CSI +
Sbjct: 709 LSCN-GKPISAIKFASFGNPGGDCGSFEKGTCEASNNAAAILTQECVGKEKCSIDVSEDK 767
Query: 727 FGGDPCPGIHKALLVDAQC 745
FG C + K L V+A C
Sbjct: 768 FGAAECGALAKRLAVEAIC 786
>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
Length = 1078
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 281/666 (42%), Positives = 380/666 (57%), Gaps = 65/666 (9%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENEYQ +E AF E G Y+ WAAKMA+ +TGVPW+MCKQ APG VI CNG CG+
Sbjct: 454 QIENEYQHLEVAFKEAGTKYINWAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGD 513
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
T+ GP KP +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYMYHGG
Sbjct: 514 TWPGPADKKKPLLWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMYHGG 573
Query: 212 TNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLG 271
TNFGR AAF++ YYD+APLDE+GL +EPKWGHL++LH A++ C + LL G +V LG
Sbjct: 574 TNFGRNGAAFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLG 633
Query: 272 QLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTER 330
+L EA VFE + VC AFL N++ ++ TV FR Y + R+SISIL DCKTV F+T+
Sbjct: 634 KLYEARVFEMKEKNVCVAFLSNHNTKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQH 693
Query: 331 VSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFWYT 389
V++Q+N+R+ + D WE Y E I + T +R + L+Q + KD +DY WYT
Sbjct: 694 VNSQHNQRTFHFADQTVQDNVWEMYSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYT 753
Query: 390 FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGA 449
F + + +V+ +L G+ G SFT+ + L+ G N A
Sbjct: 754 TSFRLETDDLPYRKEVKP---VLE--------GAGTGRRSTRSFTMEKAMDLKVGVNHVA 802
Query: 450 LLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVL 509
+LS T+GL DSG++LE ++AGV+ V ++ G G L L
Sbjct: 803 ILSSTLGLMDSGSYLEHRMAGVYTVTIR---------GLNTG----------TLDLTTNG 843
Query: 510 WSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGN 568
W + Q LTWY+ F P+G DP+ ++L MGKG +VNG+ +GRYWVS+ + G
Sbjct: 844 WGHVPGKDNQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHALGK 903
Query: 569 PSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAI 628
PSQ YHVPR+ L+P GN L+ EEE G P I + T+
Sbjct: 904 PSQY--------------------LYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKR 943
Query: 629 RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK--------KPTVQPSCPLGKKISKIV 680
+C +T + P W + D+ K KPT SCP K I +V
Sbjct: 944 DNICTFMTEKN-PAHVRW--SWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSVV 1000
Query: 681 FASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP-CPGIHKAL 739
FAS+GNP G C Y VGSCH+ ++ VVE+ACIG+ CS+ + S +GGD CPG L
Sbjct: 1001 FASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDVHCPGTTGTL 1060
Query: 740 LVDAQC 745
V A+C
Sbjct: 1061 AVQAKC 1066
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 170/402 (42%), Positives = 226/402 (56%), Gaps = 95/402 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP LI+KAKEGGL+VI++YVFWN HEP++G Y+F GR D+I+F K IQ + +Y +RIGP
Sbjct: 64 WPDLISKAKEGGLNVIESYVFWNGHEPEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGP 123
Query: 62 FIESEWTYGGL-PIWLHDVAGIVFRSDNKPYK---------------------------- 92
F+++EW +G + I ++ I+FR++N+P+K
Sbjct: 124 FVQAEWNHGFVCHIGSGEIPDIIFRTNNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPII 183
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEYQ +E AF E G Y+ WAAKMA+ +TGVPW+MCKQ APG VI CNG C
Sbjct: 184 LAQIENEYQHLEVAFKEAGTKYINWAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHC 243
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM-- 207
G+T+ GP KP +WTE+WT+ Y+V+G P RSA+DIAF VA F + G+ NYYM
Sbjct: 244 GDTWPGPADKKKPLLWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMVV 303
Query: 208 --------------------------------YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
YHGGTNFGR AAF++ YYD+APLDE+
Sbjct: 304 LNSNSNLFLTKKRDEISDRTDTGGFTCVNNQQYHGGTNFGRNGAAFVMPRYYDEAPLDEF 363
Query: 236 GLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDE 295
GL +EPKWGHL++LH A++ C + LL G +V LG+L
Sbjct: 364 GLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLT--------------------- 402
Query: 296 RKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNK 337
R Y + R+SISIL DCKTV + + V+ NK
Sbjct: 403 --------RGQKYFVARRSISILADCKTVKYMKQFVTLIVNK 436
>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
Full=SR12 protein; Flags: Precursor
gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
Length = 731
Score = 506 bits (1304), Expect = e-140, Method: Compositional matrix adjust.
Identities = 290/665 (43%), Positives = 377/665 (56%), Gaps = 54/665 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP +I KAK+ LDVIQTYVFWN HEP +G+Y F GR D+++FIK I GL+V LRIG
Sbjct: 61 MWPDIIEKAKDSQLDVIQTYVFWNGHEPSEGKYYFEGRYDLVKFIKLIHQAGLFVHLRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF +EW +GG P+WL V GI FR+DN P+K
Sbjct: 121 PFACAEWNFGGFPVWLKYVPGIEFRTDNGPFKEKMQVFTTKIVDMMKAEKLFHWQGGPII 180
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNGMR 148
IENEY +E G Y WAA+MA + GVPW+MCKQD D P VI+ CNG
Sbjct: 181 LNQIENEYGPVEWEIGAPGKAYTHWAAQMAQSLNAGVPWIMCKQDSDVPDNVIDTCNGFY 240
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C E F P +KP +WTE+WT +Y +G R A+D+AF VA FI GS++NYYM+
Sbjct: 241 C-EGFV-PKDKSKPKMWTENWTGWYTEYGKPVPYRPAEDVAFSVARFIQNGGSFMNYYMF 298
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTNF TA F+ T Y APLDEYGL REPK+ HLK LH AIK+C L++ V
Sbjct: 299 HGGTNFETTAGRFVSTSYDYDAPLDEYGLPREPKYTHLKNLHKAIKMCEPALVSSDAKVT 358
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+LG QEA V+ SG CAAFL N D + +V V F + +ELP SISILPDCK +NT
Sbjct: 359 NLGSNQEAHVYSSNSGSCAAFLANYDPKWSVKVTFSGMEFELPAWSISILPDCKKEVYNT 418
Query: 329 ERVSTQYNK-RSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYF 386
RV+ K SK + + S+ W+ Y + + D+ R + L +QI+ D SDY
Sbjct: 419 ARVNEPSPKLHSKMTPVI--SNLNWQSYSDEVPTADSPGTFREKKLYEQINMTWDKSDYL 476
Query: 387 WYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + + + L V S GH+LH FVNG+ G A+GS T V
Sbjct: 477 WYMTDVVLDGNEGFLKKGDEPWLTVNSAGHVLHVFVNGQLQGHAYGSLAKPQLTFSQKVK 536
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIG 494
+ G N +LLS VGL + G ER GV + + T W Y++G G
Sbjct: 537 MTAGVNRISLLSAVVGLANVGWHFERYNQGVLGPVTLSGLNEGTRDLTWQYWSYKIGTKG 596
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
E+ Q+Y++ G + V W + + L WYKTTF AP GNDP+AL+L SMGKG+AW+NGQS
Sbjct: 597 EEQQVYNSGGSSHVQWGP-PAWKQPLVWYKTTFDAPGGNDPLALDLGSMGKGQAWINGQS 655
Query: 555 IGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT--YHVPRAFLKPTGNLLVLLE 612
IGR+W S +KG+ + T T + ++ YHVPR++L+P GNLLV+ E
Sbjct: 656 IGRHW-SNNIAKGSCNDNCNYAGTYTETKCLSDCGKSSQKWYHVPRSWLQPRGNLLVVFE 714
Query: 613 EENGN 617
E G+
Sbjct: 715 EWGGD 719
>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 643
Score = 506 bits (1303), Expect = e-140, Method: Compositional matrix adjust.
Identities = 280/642 (43%), Positives = 370/642 (57%), Gaps = 55/642 (8%)
Query: 30 KGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNK 89
K Y+F R D++RF+K + GLYV LRIGP++ +EW +GG P+WL V GI FR+DN
Sbjct: 3 KIMYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNG 62
Query: 90 PYK-------------------------------IENEYQTIEPAFHEKGPPYVLWAAKM 118
P+K IENEY +E G Y WAA+M
Sbjct: 63 PFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQM 122
Query: 119 AVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGG 178
A+ TGVPWVMCKQDDAP PVI+ CNG C E FK PN KP +WTE WT ++ +GG
Sbjct: 123 ALGLDTGVPWVMCKQDDAPDPVIDTCNGFYC-ENFK-PNKVYKPKMWTEAWTGWFTEFGG 180
Query: 179 KPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGL 237
R +D+A+ VA FI GS++NYYMYHGGTNFGRTA F+ T Y AP+DEYGL
Sbjct: 181 PAPYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGL 240
Query: 238 VREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERK 297
+REPKW HL++LH AIKLC L++ V LG QEA VF+ SG CAAFL N D
Sbjct: 241 LREPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASS 300
Query: 298 AVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYRE 357
+ TV F N Y+LP S+SILPDCK+V FNT +V ++ T F W Y E
Sbjct: 301 SATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMTPVSSFS----WLSYNE 356
Query: 358 AILN-FDNTLLRAEGLLDQISAAKDASDYFWYT--FRFHYNSS---NAQAP-LDVQSHGH 410
+ + GL++QIS +D++DY WY R N + Q P L V S GH
Sbjct: 357 ETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLLTVFSAGH 416
Query: 411 ILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG 470
LH F+NG+ +G+ +G +N T V+LR G N ++LSV VGLP+ G E G
Sbjct: 417 ALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHYETWNTG 476
Query: 471 V------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLW--SSIRSPTRQLTW 522
V + + + W Y++GL GE L ++S G + V W S+ + + LTW
Sbjct: 477 VLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSLVAQKQPLTW 536
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY-AVNTVTS 581
YKTTF +P GN+P+AL++ SMGKG+ W+NGQSIGR+W ++ T+KG+ + Y +
Sbjct: 537 YKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAY-TAKGSCGKCNYGGIFNEKK 595
Query: 582 IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
H + YHVPRA+LK +GN+LV+ EE GNP GI++
Sbjct: 596 CHSNCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISL 637
>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
Length = 828
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 300/780 (38%), Positives = 415/780 (53%), Gaps = 84/780 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLD I+TY+FW+ HE +G+Y+FSG D ++F K IQ GLY +RIG
Sbjct: 55 MWPDLVQKAKDGGLDAIETYIFWDRHEQVRGRYNFSGNLDFVKFFKTIQEAGLYGIIRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW YGG P+WLH + GI R+DN YK
Sbjct: 115 PYSCAEWNYGGFPVWLHQIPGIEMRTDNAAYKNEMQIFVTKIINVAKEANLFASQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I F E G Y+ WAA+MA+ + GVPW MC+Q+DAP P+IN CNG C
Sbjct: 175 LAQIENEYGDIMWNFKEPGKAYIKWAAQMALAQNIGVPWFMCQQNDAPQPIINTCNGYYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
FK PN+P P ++TE+W ++Q WG + R+A+D A+ VA F G + NYYMYH
Sbjct: 235 -HNFK-PNNPKSPKMFTENWIGWFQKWGERAPHRTAEDSAYAVARFFQNGGVFNNYYMYH 292
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGT-QNV 267
GGTNFGRT+ ++IT Y AP++EYG + +PK+GHLK LH AIKL + L T +N
Sbjct: 293 GGTNFGRTSGGPYIITSYDYDAPINEYGNLNQPKYGHLKFLHEAIKLGEKVLTNYTSRND 352
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNI-SYELPRKSISILPDCKTVAF 326
LG + + G FL N+ + V +N Y +P S++IL C F
Sbjct: 353 KDLGNGITLTTYTNSVGARFCFLSNDKDNTDGNVDLQNDGKYFVPAWSVTILDGCNKEVF 412
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWE---EYREAILNFDNTLLRAEGLLDQISAAKDAS 383
NT +V++Q + K + + W E ++ +N + ++A LL+Q DAS
Sbjct: 413 NTAKVNSQTSIMEKKIDNSSTNKLTWAWIMEPKKDTMNGRGS-IKAHQLLEQKELTLDAS 471
Query: 384 DYFWYTFRFHYNSSN--AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
DY WY N ++ + A L V++ GH LH +VN Y G H N +FT V L
Sbjct: 472 DYLWYMTSVDINDTSNWSNANLHVETSGHTLHGYVNKRYIGYGHSQFGN-NFTYEKQVSL 530
Query: 442 RQGTNDGALLSVTVGLPDSGAFLER----------KVAGVHRVRVQDKSFTNCSWGYQVG 491
+ GTN LLS TVGL + GA + K+ G + V + + +W ++VG
Sbjct: 531 KNGTNIITLLSATVGLANYGARFDEIKTGISDGPVKLVGQNSVTID---LSTGNWSFKVG 587
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
L GEK + Y + V W++ PT + LTWYKT F++P G +PI ++LQ +GKG AWV
Sbjct: 588 LNGEKRRFYDLQPRSGVAWNTSSYPTGKPLTWYKTQFKSPLGPNPIVVDLQGLGKGHAWV 647
Query: 551 NGQSIGRYWVSFKTSKGNPSQT-QYAVN-TVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
NG+SIGRYW S+ TS S T Y N + + YHVPR+FL N L
Sbjct: 648 NGKSIGRYWTSWITSTAGCSDTCDYRGNYKKEKCNTGCASPSQRWYHVPRSFLNDDMNTL 707
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
+L EE GNP ++ T + +C +V GK ++
Sbjct: 708 ILFEEIGGNPQNVSFLTETTKTICANVYEG----------------------GK---LEL 742
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
SC G+ I+ I FASFGNP G C + GS S +SQ ++E +CIGK+ C + FG
Sbjct: 743 SCQNGQVITSINFASFGNPQGQCGSFKKGSWESLNSQSMMETSCIGKTGCGFTVTRDMFG 802
>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 712
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 285/670 (42%), Positives = 383/670 (57%), Gaps = 60/670 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD+I+TYVFWN HEP +G+ + + + + + +V L
Sbjct: 52 MWPDLIQKAKDGGLDIIETYVFWNGHEPSEGKVTW----EDFLYEQILYINCFHVALFXF 107
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P + G PIWL V GI FR+DN+P+K
Sbjct: 108 PPYFXFQKFSGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPII 167
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W A+MAVD TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 168 LSQIENEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYC 227
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP IWTE+W+ +Y +GG R +D+AF VA FI NGS VNYY+YH
Sbjct: 228 -ENFK-PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNNGSLVNYYVYH 285
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT+ F+ T Y AP+DEYGL+REPKWGHL++LH AIKLC L++
Sbjct: 286 GGTNFGRTSGLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKLCEPALVSADPTSTW 345
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG+ QEA VF+ +S CAAFL N D +V V F N Y+LP SISILPDCKTV FNT
Sbjct: 346 LGKNQEARVFKSSSA-CAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCKTVTFNTA 404
Query: 330 RVSTQYNKRSKTSNLKFDSDEKWEEYREAILN-FDNTLLRAEGLLDQISAAKDASDYFWY 388
++ +S + + S W Y+E + + +GL++Q+S D +DY WY
Sbjct: 405 QIGV----KSYEAKMMPISSFGWLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDYLWY 460
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
+S+ + + P L V S GH+LH F+NG+ +GS +GS ++ T V+L+
Sbjct: 461 MQDISIDSTEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPRITFSKYVNLK 520
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
QG N ++LSVTVGLP+ G + AGV + + + W Y+VGL GE
Sbjct: 521 QGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLSGES 580
Query: 497 LQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSI 555
L +YS+ G N V W+ +Q LTWYKTTF+ PAGN+P+ L++ SM KG+ WVNG+SI
Sbjct: 581 LNLYSDKGSNSVQWTKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWVNGRSI 640
Query: 556 GRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE 613
GRY+ + + G + YA + C + YH+PR +L P+ NLLV+ EE
Sbjct: 641 GRYFPGY-IANGKCDKCSYAGLFTEKKCLGNCG-EPSQKWYHIPRDWLSPSDNLLVIFEE 698
Query: 614 ENGNPLGITV 623
G+P GI++
Sbjct: 699 IGGSPDGISL 708
>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
Length = 806
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 304/798 (38%), Positives = 412/798 (51%), Gaps = 84/798 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEP + +Y+FSG D ++F + IQ GLY +RIG
Sbjct: 40 MWPDLIQKAKDGGLDAIETYIFWDRHEPVRREYNFSGNLDFVKFFQLIQKAGLYAIMRIG 99
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P WLH++ GI R++N YK
Sbjct: 100 PYACAEWNFGGFPSWLHNMPGIELRTNNSVYKNEMQNFTTEIVNVVKEAKLFASQGGPII 159
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I + + G YV WAA+MA+ + GVPW+MC+Q DAP P+IN CNG C
Sbjct: 160 LAQIENEYGDIMWNYKDAGKAYVQWAAQMALAQNIGVPWIMCQQQDAPQPIINTCNGYYC 219
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F+ PN+P P I+TE+W ++Q WG + RSA+D AF VA F G NYYMYH
Sbjct: 220 -HNFQ-PNNPKSPKIFTENWIGWFQKWGERVPHRSAEDSAFSVARFFQNGGVLNNYYMYH 277
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT-GTQNV 267
GGTNFGRTA IT YD AP+DEYG + +PKWGHLK LHAAIKL L +
Sbjct: 278 GGTNFGRTAGGPYITTSYDYDAPIDEYGNLNQPKWGHLKNLHAAIKLGENVLTNYSARKD 337
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNND--ERKAVTVLFRNISYELPRKSISILPDCKTVA 325
LG + +SG FL NN+ + A L + Y +P S+SI+ C
Sbjct: 338 EDLGNGLTLTTYTNSSGARFCFLSNNNNTDLGARVDLKNDGVYIVPAWSVSIINGCNQEV 397
Query: 326 FNTERVSTQYNKRSKTSNLKFDSDEKW----EEYREAILNFDNTLLRAEGLLDQISAAKD 381
FNT +V++Q + K S+ ++ W E R+ I N L+A+ LL+Q D
Sbjct: 398 FNTAKVNSQTSMMVKKSDNVSSTNLTWEWKVEPKRDTI--HGNGSLKAQKLLEQKELTLD 455
Query: 382 ASDYFWYTFRFHYNSSN--AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
ASDY WY N ++ + A L V + GH LH +VN Y G + N FT V
Sbjct: 456 ASDYLWYMTSADINDTSIWSNATLRVNTSGHSLHGYVNQRYVGYQFSQYGN-QFTYEKQV 514
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCS-------WGYQVGL 492
L+ GTN LLS TVGL + GA+ + K G+ V+ N + W Y++GL
Sbjct: 515 SLKNGTNIITLLSATVGLANYGAWFDDKKTGISGGPVELIGKNNVTMDLSTNLWSYKIGL 574
Query: 493 IGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE+ +Y V W SS + L WY+ F++P G +PI ++LQ +GKG AW
Sbjct: 575 NGERRHLYDAQQNVSVAWHTNSSYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAW 634
Query: 550 VNGQSIGRYWVSFKT-SKGNPSQTQYAVNTV-TSIHFCAIIKATNTYHVPRAFLKPTGNL 607
VNG SIGRYW S+ + S G Y N V + + YHVPR+FL N
Sbjct: 635 VNGHSIGRYWSSWISPSDGCSDTCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNT 694
Query: 608 LVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQ 667
LVL EE GNP + T+ +C +V + +
Sbjct: 695 LVLFEEIGGNPQSVQFQTVTTGTICANVY-------------------------EGAQFE 729
Query: 668 PSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYF 727
SC G+ +S+I FAS+GNP+G C + G+ +++SQ VVE +C+GK+ C + F
Sbjct: 730 LSCQSGQVMSQIQFASYGNPEGQCGSFKKGNFDAANSQSVVEASCVGKNNCGFNVTKEMF 789
Query: 728 GGDPCPGIHKALLVDAQC 745
G I + L V C
Sbjct: 790 GVTNVSSIPR-LAVQVTC 806
>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 702
Score = 496 bits (1277), Expect = e-137, Method: Compositional matrix adjust.
Identities = 288/684 (42%), Positives = 393/684 (57%), Gaps = 41/684 (5%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C +
Sbjct: 29 QIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQ 88
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
PNS +KP +WTE+W+ ++ +GG R A+D+AF VA F + G++ NYYMYHGG
Sbjct: 89 FT--PNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGG 146
Query: 212 TNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
TNFGR T F+ T Y AP+DEYG+VR+PKWGHL+++H AIKLC L+ + SL
Sbjct: 147 TNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSL 206
Query: 271 GQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
GQ EA V++ + +CAAFL N D + TV F +Y+LP S+SILPDCK V NT
Sbjct: 207 GQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTA 266
Query: 330 RVSTQYNK---RSKTSNLKFDSDEK----------WEEYREAILNFDNTLLRAEGLLDQI 376
++++Q RS S+++ D+D+ W E + L GL++QI
Sbjct: 267 QINSQVTTSEMRSLGSSIQ-DTDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQI 325
Query: 377 SAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNV 431
+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS +
Sbjct: 326 NTTADASDFLWYSTSIVVKGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSS 385
Query: 432 SFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDK----SFTNCSW 486
+L+ V L G N LLS TVGL + GAF + AGV V++ + ++ W
Sbjct: 386 LISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDW 445
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGK 545
YQ+GL GE L +Y+ + S PT Q L WYKT F APAG+DP+A++ MGK
Sbjct: 446 TYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGK 505
Query: 546 GEAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLK 602
GEAWVNGQSIGRYW + G + Y A ++ + C T YHVPR+FL+
Sbjct: 506 GEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQT-LYHVPRSFLQ 564
Query: 603 PTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK 662
P N LVL E+ G+P I+ T +C HV+ H + SW+ +Q T +
Sbjct: 565 PGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQT------Q 618
Query: 663 KPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIP 721
P ++ CP G+ IS I FASFG P G C Y G C SS + VV+ AC+G + CS+P
Sbjct: 619 GPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVP 678
Query: 722 LLSRYFGGDPCPGIHKALLVDAQC 745
+ S F GDPC G+ K+L+V+A C
Sbjct: 679 VSSNNF-GDPCSGVTKSLVVEAAC 701
>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
Length = 809
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 295/705 (41%), Positives = 381/705 (54%), Gaps = 88/705 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAK+GGLDVIQTYVFWN HEP G ND S G++
Sbjct: 83 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG-------ND---------SDGIFFRFEQY 126
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
F ES G P+WL V GI FR+DN+P+K
Sbjct: 127 YFEES-----GFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 181
Query: 93 ------------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV 140
IENEY F G Y+ WAAKMAV TGVPWVMCK++DAP PV
Sbjct: 182 LSQASIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGTGVPWVMCKEEDAPDPV 241
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
INACNG C + F PN P KP++WTE W+ ++ +GG R +D+AF VA F+ K G
Sbjct: 242 INACNGFYC-DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGG 299
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
S++NYYMYHGGTNFGRTA IT YD AP+DEYGLVREPK HLKELH A+KLC +
Sbjct: 300 SFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQA 359
Query: 260 LLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILP 319
L++ + +LG +QEA VF+ SG CAAFL N + V+F N Y LP SISILP
Sbjct: 360 LVSVDPAITTLGTMQEARVFQSPSG-CAAFLANYNSNSYAKVVFNNEQYSLPPWSISILP 418
Query: 320 DCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISA 378
DCK V FN+ V Q ++ + S WE Y E + + LL GLL+Q++
Sbjct: 419 DCKNVVFNSATVGVQTSQMQMWGDGA--SSMTWERYDEEVDSLAAAPLLTTTGLLEQLNV 476
Query: 379 AKDASDYFWYTFRFHYNSSN-------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNV 431
+D+SDY WY +SS L VQS GH LH FVNG+ GSA+G+ ++
Sbjct: 477 TRDSSDYLWYITSVDISSSENFLQGGGKPLSLSVQSAGHALHVFVNGQLQGSAYGTREDR 536
Query: 432 SFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCS 485
LR GTN ALLSV GLP+ G E GV H + + T +
Sbjct: 537 RIKYNGNASLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLDEGSRDLTWQT 596
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQS 542
W YQVGL GE++ + S G + V W S I + L WY+ F P+G++P+AL++ S
Sbjct: 597 WSYQVGLKGEQMNLNSIEGSSSVEWMQGSLIAQNQQPLAWYRAYFETPSGDEPLALDMGS 656
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFL 601
MGKG+ W+NGQSIGRYW ++ + G+ + Y + T YHVP+++L
Sbjct: 657 MGKGQIWINGQSIGRYWTAY--ADGDCKECSYTGTFRAPKCQSGCGQPTQRWYHVPKSWL 714
Query: 602 KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSW 646
+PT NLLV+ EE G+ I + ++ VC V+ H P + +W
Sbjct: 715 QPTRNLLVVFEELGGDSSKIALVKRSVSSVCADVSEDH-PNIKNW 758
>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
Length = 824
Score = 494 bits (1271), Expect = e-137, Method: Compositional matrix adjust.
Identities = 304/797 (38%), Positives = 416/797 (52%), Gaps = 85/797 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW LI KAKEGGLD I+TY+FWN HE ++ +Y+F+G D ++F +++Q GLY LRIG
Sbjct: 60 MWSDLIQKAKEGGLDTIETYIFWNAHERRRREYNFTGNLDFVKFFQKVQEAGLYGILRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW YGG P+WLH++ I FR+DN+ +K
Sbjct: 120 PYACAEWNYGGFPVWLHNIPEIKFRTDNEIFKNEMQTFTTKIVNMAKEAKLFASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + E G YV W A+MAV + GVPW+MC+Q DAP VIN CNG C
Sbjct: 180 LAQIENEYGNVMGPYGEAGKSYVQWCAQMAVAQNIGVPWIMCQQSDAPSSVINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TF PNSP P +WTE+WT +Y+ WG K R+A+D+AF VA F NG NYYMY+
Sbjct: 240 -DTFT-PNSPKSPKMWTENWTGWYKKWGQKDPHRTAEDLAFSVARFFQYNGVLQNYYMYY 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ F+ T Y APLDEYG + +PKWGHLK LHAA+KL + L T
Sbjct: 298 GGTNFGRTSGGPFIATSYDYDAPLDEYGNLNQPKWGHLKNLHAALKLGEKILTNSTVKTT 357
Query: 269 --SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAF 326
S G ++ G FL N L ++ Y +P S+SIL DC +
Sbjct: 358 KYSDGWVELTTYTSNIDGERLCFLSNTKMDGLDVDLQQDGKYFVPAWSVSILQDCNKETY 417
Query: 327 NTERVSTQYN---KRSKTSNLKFDSDEKWE-EYREAILNFDNTLLRAEGLLDQISAAKDA 382
NT +V+ Q + K+ ++ +W E +A L+ +A LL+Q +A D
Sbjct: 418 NTAKVNVQTSLIVKKLHENDTPLKLSWEWAPEPTKAPLHGQGG-FKATQLLEQKAATYDE 476
Query: 383 SDYFWYTFRFHYN-SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
SDY WY N +++ L V+ G LHAFVNG+ GS HG +FT L
Sbjct: 477 SDYLWYMTSVDNNGTASKNVTLRVKYSGQFLHAFVNGKEIGSQHG----YTFTFEKPALL 532
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-------DKSFTNCSWGYQVGLIG 494
+ GTN +LLS TVGL + G F + G+ V+ ++ W Y+VGL G
Sbjct: 533 KPGTNIISLLSATVGLQNYGEFFDEGPEGIAGGPVELIDSGNTTTDLSSNEWSYKVGLNG 592
Query: 495 EKLQIYS-NLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
E + Y G K + ++R R +TWYKTTF+AP+G +P+ ++LQ MGKG AWVNG
Sbjct: 593 EGGRFYDPTSGRAKWVSGNLRV-GRAMTWYKTTFQAPSGTEPVVVDLQGMGKGHAWVNGN 651
Query: 554 SIGRYW-VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLL 611
S+GR+W + G + Y T YHVPR+FL N L+L
Sbjct: 652 SLGRFWPILTADPNGCDGKCDYRGQYKEGKCLSNCGNPTQRWYHVPRSFLNNGSNTLILF 711
Query: 612 EEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCP 671
EE GNP ++ A +CG+ + T++ SC
Sbjct: 712 EEIGGNPSDVSFQITATETICGNTY-------------------------EGTTLELSCN 746
Query: 672 LGKK-ISKIVFASFGNPDG-DCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGG 729
G++ IS I +ASFG+P G C + GS +S S VE+AC+GK CSI + FG
Sbjct: 747 GGRRIISDIQYASFGDPQGSSCGSFQRGSVEASRSFSAVEKACMGKESCSINVSKATFGV 806
Query: 730 DPCPGI-HKALLVDAQC 745
+ G+ + L+V A C
Sbjct: 807 EDSFGVDNNRLVVQAVC 823
>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 494 bits (1271), Expect = e-136, Method: Compositional matrix adjust.
Identities = 292/773 (37%), Positives = 410/773 (53%), Gaps = 92/773 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L KAKEGG+D I+TY+FW+ HEP + QY FSG DI++F K Q GL+V LRIG
Sbjct: 1 MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW+YGG P+WLH++ GI R+DN+ YK
Sbjct: 61 PYVCAEWSYGGFPMWLHNIPGIELRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPII 120
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + + G YV W A+MAV + GVPW+MC+Q +AP P+IN CNG C
Sbjct: 121 LAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFYC 180
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PN+P P +WTE+W+ ++++WGG+ R+A+D+AF VA FI G +YYMYH
Sbjct: 181 -DQFK-PNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFIQNGGVLNSYYMYH 238
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD APLDEYG + +PKWGHLK+LH AIK R L GT
Sbjct: 239 GGTNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTSK 298
Query: 269 SL-GQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
+ G + + + +G FL N + +A L ++ Y LP S++IL DC +N
Sbjct: 299 NFWGGVDQTTYTNQGTGERFCFLSNTNMEEANVDLGQDGKYSLPAWSVTILQDCNKEIYN 358
Query: 328 TERVSTQYN---KRSKTSNLKFDSDEKWE-EYREAILNFDNTLLRAEGLLDQISAAKDAS 383
T +V+TQ + K+ + W E + +L RA LL+Q D +
Sbjct: 359 TAKVNTQTSIMVKKLHEEDKPVQLSWTWAPEPMKGVLQ-GKGRFRATELLEQKETTVDTT 417
Query: 384 DYFWYTFRFHYNSSNAQ----APLDVQSHGHILHAFVNGEYTGSAHGSH---------DN 430
DY WY + N + + L V + GH LHA+VN + G+ D+
Sbjct: 418 DYLWYMTSVNLNETTLKKWTNVTLRVGTRGHTLHAYVNKKEIGTQFSKQANAQQSVKGDD 477
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ----DKSF---TN 483
SF V L GTN +LLS TVGL + G + ++K G+ VQ K F T+
Sbjct: 478 YSFLFEKPVTLTSGTNTISLLSATVGLANYGQYYDKKPVGIAEGPVQLVANGKPFMDLTS 537
Query: 484 CSWGYQVGLIGEKLQIYS-NLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQ 541
W Y++GL GE + N +S PT R +TWYKTTF +P+G +P+ ++L
Sbjct: 538 YQWSYKIGLSGEAKRYNDPNSPHASKFTASDNLPTGRAMTWYKTTFASPSGTEPVVVDLL 597
Query: 542 SMGKGEAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPR 598
MGKG AWVNG+S+GR+W +KG P Y + N + C + YH+PR
Sbjct: 598 GMGKGHAWVNGKSLGRFWPTQIADAKGCPDTCDYRGSYNGDKCVTNCG-NPSQRWYHIPR 656
Query: 599 AFLKPTG-NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
++L G N L+L EE GNP ++ +A+ +CG+
Sbjct: 657 SYLNKDGQNTLILFEEVGGNPTNVSFQIVAVETICGNAYEGS------------------ 698
Query: 658 KKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVER 710
T++ SC G+ IS I FAS+G+P+G C + GS +++ S VVE+
Sbjct: 699 -------TLELSCEGGRTISDIQFASYGDPEGTCGAFMKGSFYATRSAAVVEK 744
>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
Length = 829
Score = 490 bits (1261), Expect = e-135, Method: Compositional matrix adjust.
Identities = 302/801 (37%), Positives = 408/801 (50%), Gaps = 87/801 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP+ QY+F+G DI+RF KEIQ+ G+Y LRIG
Sbjct: 60 MWPDLIKKAKEGGLDAIETYVFWNGHEPRPRQYNFAGNYDIVRFFKEIQNAGMYAILRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N+P+
Sbjct: 120 PYICGEWNYGGLPAWLRDIPGMQFRMHNQPFEHEMETFTTLIVNKLKDANMFAGQGGPII 179
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY I + Y+ W A MA + GVPW+MC+QD D P VIN CNG
Sbjct: 180 LSQIENEYGNIMANLTDAQSASEYIHWCAAMANKQNVGVPWIMCQQDADVPPNVINTCNG 239
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F P + P IWTE+WT +++ W + RSAQDIAF VA+F K GS NYY
Sbjct: 240 FYCHDWF--PKRTDIPKIWTENWTGWFKAWDKPDFHRSAQDIAFAVAMFFQKRGSLQNYY 297
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRTA IT YD APLDEYG +REPK+GHLK+LHA +K + L+ G
Sbjct: 298 MYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNIREPKYGHLKDLHAVLKSMEKILVHGDF 357
Query: 266 NVISLGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ I+ G+ + + S VC F+ N + + ++ +P S+S+LPDCK V
Sbjct: 358 SDINYGRNVTVTKYTLDGSSVC--FISNQFDDRDANATIDGTTHVVPAWSVSVLPDCKAV 415
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWE---EYREAILNFDNTLLRAEGLLDQISAA 379
A+NT ++ Q + K N E KW E+ + + + R LL+QI+ +
Sbjct: 416 AYNTAKIKAQTSVMVKKPNTVEQEPENLKWSWMPEHLKPFMTDEKGSFRKNELLEQITTS 475
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY F + A+ L V + GH ++AFVNG+ G H + F L + V
Sbjct: 476 TDQSDYLWYRTSFEHKGE-AKYKLSVNTTGHQIYAFVNGKLAGRQHSPNGAFIFQLESPV 534
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + GA E AG+ V++ D + +N SW Y+ GL
Sbjct: 535 KLHDGKNYLSLLSATMGLKNYGALFELMPAGIVGGPVKLVDNNGSTIDLSNSSWSYKAGL 594
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSP-TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE QI+ + K + P R TWYK TF+APAG + + +L + KG AWVN
Sbjct: 595 AGEHRQIHLDKPGYKWHGDNGTIPINRAFTWYKATFQAPAGEEAVVADLMGLNKGVAWVN 654
Query: 552 GQSIGRYWVSFKTSK-GNPSQTQYAVNTVTSIHFCAIIKATNT-----YHVPRAFLKP-T 604
G ++GRYW S+ ++ G Y + N YHVPR FL+
Sbjct: 655 GNNLGRYWPSYVAAEMGGCHHCDYRGAFKAEGDGLKCLTGCNEPAQRFYHVPRVFLRAGE 714
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
N +VL EE G+P + T+A+ VC + ++GD G+
Sbjct: 715 PNTVVLFEEAGGDPSRVGFHTVAVGPVC--------------VEAAEKGDNVTLSCGQHK 760
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLS 724
G+ IS + AS+G G C Y G C S + AC+GK C++
Sbjct: 761 --------GRTISSVDLASYGVTRGQCGAYQ-GGCESKAAYEAFAEACVGKESCTVQHTD 811
Query: 725 RYFGGDPCPGIHKALLVDAQC 745
+ G G+ L V A C
Sbjct: 812 AFSGAGCQSGV---LTVQATC 829
>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
Length = 828
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 317/817 (38%), Positives = 417/817 (51%), Gaps = 121/817 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGL+ I+TYVFWN HEP++ +++F G D++RF KEIQ+ G+Y LRIG
Sbjct: 61 MWPDLIKKAKEGGLNAIETYVFWNGHEPRRREFNFEGNYDVVRFFKEIQNAGMYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP+WL D+ GI FR NKP+
Sbjct: 121 PYICGEWNYGGLPVWLRDIPGIKFRLHNKPFENEMEAFTTLIVKKMKDANMFAGQGGPII 180
Query: 92 --KIENEY--QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY ++P + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGYTMLQPENIQSAHEYIHWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C E F N + P +WTE+WT +Y+ W + R +DIAF VA+F GS NYY
Sbjct: 241 FYCHEWFS--NRTSIPKMWTENWTGWYRDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRTA IT YD APLDEYG +R+PK+GHLKELH+ + + LL G
Sbjct: 299 MYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNLRQPKYGHLKELHSVLMSMEKILLHG-- 356
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN--DERKAVTVLFRNISYELPRKSISILPDCKT 323
+ I V + T +A +NN D+R V V ++ LP S+SILPDCKT
Sbjct: 357 DYIDTNYGDNVTVTKYTLNATSACFINNRFDDRD-VNVTLDGTTHFLPAWSVSILPDCKT 415
Query: 324 VAFNTERVSTQYNKR-SKTSNLKFDSDE-KWEEYREAILNF---DNTLLRAEGLLDQISA 378
VAFN+ ++ TQ +KTS ++ ++ KW E + F + R LL+QI
Sbjct: 416 VAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVT 475
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
D SDY WY + + L V + GH L+AFVNG+ G + ++N +F L++
Sbjct: 476 TTDQSDYLWYRTSLEHKGEGSYV-LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSP 534
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVG 491
V L G N +LLS TVGL + G E AG+ V++ D S +N SW Y+ G
Sbjct: 535 VKLHDGKNYISLLSGTVGLRNYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAG 594
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSP---TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE +IY + NK W S S R TWYKTTF+APAG D + ++L + KG A
Sbjct: 595 LAGEYRKIYLDKPGNK--WRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVA 652
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFC---AIIKA--------------- 590
WVNG S+GRYW S Y + H C + KA
Sbjct: 653 WVNGNSLGRYWPS------------YVAADMPGCHHCDYRGVFKAEVEAQKCLTGCGEPS 700
Query: 591 TNTYHVPRAFL-KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRH 649
YHVPR+FL K N L+L EE G+P + V T+ VC
Sbjct: 701 QQLYHVPRSFLHKGEPNTLILFEEAGGDPSEVAVRTVVEGSVCASA-------------- 746
Query: 650 RQRGDTDIKKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVV 708
+ GD TV SC G+ IS + ASFG G C Y G C S +
Sbjct: 747 -ELGD----------TVTLSCGAHGRTISSVDVASFGVARGRCGSYD-GGCDSKVAYDAF 794
Query: 709 ERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
AC+GK C++ L++ F C + L V A C
Sbjct: 795 AAACVGKESCTV-LVTDAFANAGC--VSGVLTVQATC 828
>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
Length = 828
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 307/812 (37%), Positives = 405/812 (49%), Gaps = 111/812 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DI+RF KEIQ+ GLY LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPGMQFRLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPII 180
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY I + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGNIMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 299 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHG-- 356
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN-DERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ V + T G +A +NN ++ K + V ++ LP S+SILPDCKTV
Sbjct: 357 EYVDTNYSDNVTVTKYTLGSTSACFINNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTV 416
Query: 325 AFNTERVSTQYNKRSKTSNL--KFDSDEKWEEYREAILNF---DNTLLRAEGLLDQISAA 379
AFN+ ++ Q K +N+ K + KW RE + F + R LL+QI +
Sbjct: 417 AFNSAKIKAQTTIMVKKANMVEKEPENLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTS 476
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY + A L V + GH L+AFVNG G H + + F L + V
Sbjct: 477 TDQSDYLWYRTSLDHKGE-ASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAV 535
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ GL
Sbjct: 536 KLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGL 595
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W + R TWYKTTF+APAG D + ++L + KG AW
Sbjct: 596 AGEYRQIH--LDKPGYRWDNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAW 653
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------------Y 594
VNG ++GRYW PS T + + + +A Y
Sbjct: 654 VNGNNLGRYW---------PSYTAAEMGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRYY 704
Query: 595 HVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
HVPR+FLK N L+L EE G+P + ++ VC + G
Sbjct: 705 HVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVAGSVC---------------VSAEVG 749
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
D G+ K IS I SFG G C Y G C S + AC+
Sbjct: 750 DAITLSCGQHS---------KTISTIDVTSFGVARGQCGAYE-GGCESKAAYKAFTEACL 799
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK C++ +++ G G+ L V A C
Sbjct: 800 GKESCTVQIINALTGSGCLSGV---LTVQASC 828
>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
Length = 812
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 303/781 (38%), Positives = 401/781 (51%), Gaps = 110/781 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+G LD I+TY+FW+LHEP + +YDFSG D I+F+K Q QGLYV LRIG
Sbjct: 56 MWPDLIMKAKDGDLDAIETYIFWDLHEPVRRKYDFSGNLDFIKFLKIAQEQGLYVVLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R+DN +K
Sbjct: 116 PYVCAEWNYGGFPMWLHNMPGIQLRTDNAVFKEEMKIFTTKIVTMCKEAGLFAPQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY + + E G Y+ W A+MA+ + GVPW+MCKQ +AP +I+ CNG C
Sbjct: 176 LAQIENEYGDVISHYGEAGNSYIKWCAEMALAQNIGVPWIMCKQKNAPATIIDTCNGYYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+TFK PN+P P I+TE+W ++Q WG + R+A+D AF VA F G+ NYY+YH
Sbjct: 236 -DTFK-PNNPKSPKIFTENWVGWFQKWGERRPHRTAEDSAFSVARFFQNGGALQNYYLYH 293
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+IT Y APLDEYG + EPK+GHLK LHAAIKL + L GT
Sbjct: 294 GGTNFGRTAGGPFIITTYDYDAPLDEYGNLIEPKYGHLKRLHAAIKLGEKVLTNGTATWE 353
Query: 269 SLGQ-LQEAFVFEETSGVCAAFLVNNDERKAVTV-LFRNISYELPRKSISILPDCKTVAF 326
S G L + +G FL N+ K V L ++ Y +P S+S+L DC +
Sbjct: 354 SHGDSLWMTTYTNKGTGQKFCFLSNSHTSKDAEVDLQQDGKYYVPAWSMSLLQDCNKEVY 413
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF--DNTLLRAEGLLDQISAAKDASD 384
NT + Q N K + K + +W + + + A LLDQ S ASD
Sbjct: 414 NTAKTEAQTNIYMKQLDQKLGNSPEWSWTSDPMEDTFQGKGTFTASQLLDQKSVTVGASD 473
Query: 385 YFWYTFRFHYNSSN--AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
Y WY N +N +A + V + GHIL+ F+NG TG+ HG+ F + L
Sbjct: 474 YLWYMTEVVVNDTNTWGKAKVQVNTTGHILYLFINGFLTGTQHGTVSQPGFIHEGNISLN 533
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTN---------CSWGYQVGLI 493
QGTN +LLSVTVG + GAF + + G+ V+ S N +W Y+VG+
Sbjct: 534 QGTNIISLLSVTVGHANYGAFFDMQETGIVGGPVKLFSIENPNNVLDLSKSTWSYKVGIN 593
Query: 494 GEKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
G + Y V W SI P +TWYKTTF+ P G +P+ L+L + KGEAW
Sbjct: 594 GMTKKFYDPKTTIGVQWKTNNVSIGVP---MTWYKTTFKTPDGTNPVVLDLIGLQKGEAW 650
Query: 550 VNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
VNGQSIGRYW +KG Y N + C + YHVPR+FL N
Sbjct: 651 VNGQSIGRYWPAMLAENKGCSDTCDYRGEYNADKCLSGCG-EPSQRFYHVPRSFLNNDVN 709
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
LVL EE +G D F K
Sbjct: 710 TLVLFEE-----MGF----------------------------------DATPFNGK--- 727
Query: 667 QPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRY 726
+S+I FAS+G+P+G C + +G S +S+ VVE+ACIGK CSI + S
Sbjct: 728 --------TMSEIQFASYGDPEGSCGSFKIGEWESRYSKTVVEKACIGKQSCSINVTSST 779
Query: 727 F 727
F
Sbjct: 780 F 780
>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
Length = 828
Score = 487 bits (1253), Expect = e-134, Method: Compositional matrix adjust.
Identities = 316/817 (38%), Positives = 417/817 (51%), Gaps = 121/817 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGL+ I+TYVFWN HEP++ +++F G D++RF KEIQ+ G+Y LRIG
Sbjct: 61 MWPDLIKKAKEGGLNAIETYVFWNGHEPRRREFNFEGNYDVVRFFKEIQNAGMYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP+WL D+ GI FR NKP+
Sbjct: 121 PYICGEWNYGGLPVWLRDIPGIKFRLHNKPFENGMEAFTTLIVKKMKDANMFAGQGGPII 180
Query: 92 --KIENEY--QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY ++P + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGYTMLQPENIQSAHEYIHWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C E F N + P +WTE+WT +Y+ W + R +DIAF VA+F GS NYY
Sbjct: 241 FYCHEWFS--NRTSIPKMWTENWTGWYRDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRTA IT YD APLDEYG +R+PK+GHLKELH+ + + LL G
Sbjct: 299 MYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNLRQPKYGHLKELHSVLMSMEKILLHG-- 356
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN--DERKAVTVLFRNISYELPRKSISILPDCKT 323
+ I V + T +A +NN D+R V V ++ LP S+SILP+CKT
Sbjct: 357 DYIDTNYGDNVTVTKYTLNATSACFINNRFDDRD-VNVTLDGTTHFLPAWSVSILPNCKT 415
Query: 324 VAFNTERVSTQYNKR-SKTSNLKFDSDE-KWEEYREAILNF---DNTLLRAEGLLDQISA 378
VAFN+ ++ TQ +KTS ++ ++ KW E + F + R LL+QI
Sbjct: 416 VAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVT 475
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
D SDY WY + + L V + GH L+AFVNG+ G + ++N +F L++
Sbjct: 476 TTDQSDYLWYRTSLEHKGEGSYV-LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSP 534
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVG 491
V L G N +LLS TVGL + G E AG+ V++ D S +N SW Y+ G
Sbjct: 535 VKLHDGKNYISLLSGTVGLRNYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAG 594
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSP---TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE +IY + NK W S S R TWYKTTF+APAG D + ++L + KG A
Sbjct: 595 LAGEYRKIYLDKPGNK--WRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVA 652
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFC---AIIKA--------------- 590
WVNG S+GRYW S Y + H C + KA
Sbjct: 653 WVNGNSLGRYWPS------------YVAADMPGCHHCDYRGVFKAEVEAQKCLTGCGEPS 700
Query: 591 TNTYHVPRAFL-KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRH 649
YHVPR+FL K N L+L EE G+P + V T+ VC
Sbjct: 701 QQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVRTVVEGSVCASA-------------- 746
Query: 650 RQRGDTDIKKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVV 708
+ GD TV SC G+ IS + ASFG G C Y G C S +
Sbjct: 747 -EVGD----------TVTLSCGAHGRTISSVDVASFGVARGRCGSYD-GGCESKVAYDAF 794
Query: 709 ERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
AC+GK C++ L++ F C + L V A C
Sbjct: 795 AAACVGKESCTV-LVTDAFANAGC--VSGVLTVQATC 828
>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
Length = 828
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 307/813 (37%), Positives = 403/813 (49%), Gaps = 113/813 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DI+RF KEIQ+ GLY LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPGMQFRLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPII 180
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY I + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGNIMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-- 263
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 299 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEY 358
Query: 264 TQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKT 323
S + + TS A F+ N ++ V V ++ LP S+SILPDCKT
Sbjct: 359 VDTNYSDKVTVTKYTLDSTS---ACFINNRNDNMDVNVTLDGTTHLLPAWSVSILPDCKT 415
Query: 324 VAFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISA 378
VAFN+ ++ Q +N+ E KW RE + F + R LL+QI
Sbjct: 416 VAFNSAKIKAQTTVMVNKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVT 475
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
+ D SDY WY ++ A L V + GH L+AFVNG G H + + F L +
Sbjct: 476 STDQSDYLWYRTSINHKGE-ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESP 534
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVG 491
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ G
Sbjct: 535 AKLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAG 594
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE QI+ L W + + TWYKTTF+APAG D + ++L + KG A
Sbjct: 595 LAGEYRQIH--LDKPGCTWDNNNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVA 652
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT--------------- 593
WVNG ++GRYW PS T + + + +A
Sbjct: 653 WVNGNNLGRYW---------PSYTAAEMGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRF 703
Query: 594 YHVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQR 652
YHVPR+FLK N L+L EE G+P ++ T+A VC +
Sbjct: 704 YHVPRSFLKNGEPNTLILFEEAGGDPSHVSFRTVAAGSVCASA---------------EV 748
Query: 653 GDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERAC 712
GDT G+ K IS I SFG G C Y G C S + AC
Sbjct: 749 GDTITLSCGQH---------SKTISAINMTSFGVARGQCGAYK-GGCESKAAYKAFTEAC 798
Query: 713 IGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+GK C++ ++ G C + L V A C
Sbjct: 799 LGKESCTVQ-ITNAVTGSGC--LSNVLTVQASC 828
>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
Length = 824
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 306/812 (37%), Positives = 403/812 (49%), Gaps = 111/812 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DIIRF KEIQ+ GLY LRIG
Sbjct: 57 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ + FR N P+
Sbjct: 117 PYICGEWNYGGLPAWLRDIPQMQFRMHNAPFENEMENFTTLIINKMKDANMFAGQGGPII 176
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY + + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 177 LAQIENEYGNVMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 236
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 237 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 294
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 295 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHG-- 352
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN-DERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ V + T G +A +NN ++ K + V ++ LP S+SILPDCKTV
Sbjct: 353 EYVDANYSDNVTVTKYTLGSTSACFINNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTV 412
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISAA 379
AFN+ ++ Q K +N+ E KW RE + F + R LL+QI +
Sbjct: 413 AFNSAKIKAQTTIMVKKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTS 472
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY + A L V + GH L+AFVNG G H + + F L + V
Sbjct: 473 TDQSDYLWYRTSLDHKGE-ASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAV 531
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ GL
Sbjct: 532 KLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGL 591
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W + R TWYKTTF+APAG D + ++L + KG AW
Sbjct: 592 AGEYRQIH--LDKPGYRWDNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAW 649
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------------Y 594
VNG ++GRYW PS T + + + +A Y
Sbjct: 650 VNGNNLGRYW---------PSYTAAEMGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRYY 700
Query: 595 HVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
HVPR+FLK N L+L EE G+P + ++ VC + G
Sbjct: 701 HVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVAGSVC---------------VSAEVG 745
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
D G+ K IS I SFG G C Y G C S + AC+
Sbjct: 746 DAITLSCGQHS---------KTISTIDVTSFGVARGQCGAYE-GGCESKAAYKAFTEACL 795
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK C++ +++ G G+ L V A C
Sbjct: 796 GKESCTVQIINALTGSG---GLSGVLTVQASC 824
>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
Length = 824
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 306/812 (37%), Positives = 404/812 (49%), Gaps = 111/812 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DIIRF KEIQ+ GLY LRIG
Sbjct: 57 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ + FR N P+
Sbjct: 117 PYICGEWNYGGLPAWLRDIPQMQFRMHNAPFENEMENFTTLIINKMKDANMFAGQGGPII 176
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY + + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 177 LAQIENEYGNVMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 236
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 237 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 294
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 295 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHG-- 352
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN-DERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ V + T G +A +NN ++ K + V ++ LP S+SILPDCKTV
Sbjct: 353 EYVDTNYSDNVTVTKYTLGSTSACFINNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTV 412
Query: 325 AFNTERVSTQYNKRSKTSNL--KFDSDEKWEEYREAILNF---DNTLLRAEGLLDQISAA 379
AFN+ ++ Q K +N+ K + KW RE + F + R LL+QI +
Sbjct: 413 AFNSAKIKAQTTIMVKKANMVEKEPENLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTS 472
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY + A L V + GH L+AFVNG G H + + F L + V
Sbjct: 473 TDQSDYLWYRTSLDHKGE-ASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAV 531
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ GL
Sbjct: 532 KLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGL 591
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W + R TWYKTTF+APAG D + ++L + KG AW
Sbjct: 592 AGEYRQIH--LDKPGYRWDNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAW 649
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------------Y 594
VNG ++GRYW PS T + + + +A Y
Sbjct: 650 VNGNNLGRYW---------PSYTAAEMGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRYY 700
Query: 595 HVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
HVPR+FLK N L+L EE G+P + ++ VC + G
Sbjct: 701 HVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVAGSVC---------------VSAEVG 745
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
D G+ K IS I SFG G C Y G C S + AC+
Sbjct: 746 DAITLSCGQHS---------KTISTIDVTSFGVARGQCGAYE-GGCESKAAYKAFTEACL 795
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK C++ +++ G G+ L V A C
Sbjct: 796 GKESCTVQIINALTGSGCLSGV---LTVQASC 824
>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
Length = 828
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 306/812 (37%), Positives = 403/812 (49%), Gaps = 111/812 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DIIRF KEIQ+ GLY LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ + FR N P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPQMQFRMHNAPFENEMENFTTLIINKMKDANMFAGQGGPII 180
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY + + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGNVMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 299 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHG-- 356
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN-DERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ V + T G +A +NN ++ K + V ++ LP S+SILPDCKTV
Sbjct: 357 EYVDANYSDNVTVTKYTLGSTSACFINNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTV 416
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISAA 379
AFN+ ++ Q K +N+ E KW RE + F + R LL+QI +
Sbjct: 417 AFNSAKIKAQTTIMVKKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTS 476
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY + A L V + GH L+AFVNG G H + + F L + V
Sbjct: 477 TDQSDYLWYRTSLDHKGE-ASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAV 535
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ GL
Sbjct: 536 KLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGL 595
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W + R TWYKTTF+APAG D + ++L + KG AW
Sbjct: 596 AGEYRQIH--LDKPGYRWDNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAW 653
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------------Y 594
VNG ++GRYW PS T + + + +A Y
Sbjct: 654 VNGNNLGRYW---------PSYTAAEMGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRYY 704
Query: 595 HVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
HVPR+FLK N L+L EE G+P + ++ VC + G
Sbjct: 705 HVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVAGSVC---------------VSAEVG 749
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
D G+ K IS I SFG G C Y G C S + AC+
Sbjct: 750 DAITLSCGQHS---------KTISTIDVTSFGVARGQCGAYE-GGCESKAAYKAFTEACL 799
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK C++ +++ G G+ L V A C
Sbjct: 800 GKESCTVQIINALTGSGCLSGV---LTVQASC 828
>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 824
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 306/812 (37%), Positives = 403/812 (49%), Gaps = 111/812 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DIIRF KEIQ+ GLY LRIG
Sbjct: 57 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ + FR N P+
Sbjct: 117 PYICGEWNYGGLPAWLRDIPQMQFRMHNAPFENEMENFTTLIINKMKDANMFAGQGGPII 176
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY + + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 177 LAQIENEYGNVMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 236
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 237 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 294
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 295 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHG-- 352
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN-DERKAVTVLFRNISYELPRKSISILPDCKTV 324
+ V + T G +A +NN ++ K + V ++ LP S+SILPDCKTV
Sbjct: 353 EYVDANYSDNVTVTKYTLGSTSACFINNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTV 412
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISAA 379
AFN+ ++ Q K +N+ E KW RE + F + R LL+QI +
Sbjct: 413 AFNSAKIKAQTTIMVKKANMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTS 472
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY + A L V + GH L+AFVNG G H + + F L + V
Sbjct: 473 TDQSDYLWYRTSLDHKGE-ASYTLFVNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAV 531
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ GL
Sbjct: 532 KLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGL 591
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W + R TWYKTTF+APAG D + ++L + KG AW
Sbjct: 592 AGEYRQIH--LDKPGYRWDNNNGTVPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAW 649
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------------Y 594
VNG ++GRYW PS T + + + +A Y
Sbjct: 650 VNGNNLGRYW---------PSYTAAEMGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRYY 700
Query: 595 HVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
HVPR+FLK N L+L EE G+P + ++ VC + G
Sbjct: 701 HVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVAGSVC---------------VSAEVG 745
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
D G+ K IS I SFG G C Y G C S + AC+
Sbjct: 746 DAITLSCGQH---------SKTISTIDVTSFGVARGQCGAYE-GGCESKAAYKAFTEACL 795
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK C++ +++ G G+ L V A C
Sbjct: 796 GKESCTVQIINALTGSGCLSGV---LTVQASC 824
>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
Length = 625
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 270/633 (42%), Positives = 368/633 (58%), Gaps = 25/633 (3%)
Query: 129 VMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDI 188
V+CKQDDAP P+INACNG C + PN KP +WTE WT ++ +GG R A+D+
Sbjct: 1 VLCKQDDAPDPIINACNGFYC--DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDM 58
Query: 189 AFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLK 247
AF VA FI K GS++NYYMYHGGTNFGRTA F+ T Y APLDEYGL R+PKWGHLK
Sbjct: 59 AFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLK 118
Query: 248 ELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNIS 307
+LH AIKLC L++G + LG QEA V++ SG C+AFL N + + V F N
Sbjct: 119 DLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNH 178
Query: 308 YELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLL 367
Y LP SISILPDCK +NT RV Q R K + W+ Y E + +
Sbjct: 179 YNLPPWSISILPDCKNTVYNTARVGAQ-TSRMKMVRVPVHGGLSWQAYNEDPSTYIDESF 237
Query: 368 RAEGLLDQISAAKDASDYFWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYT 421
GL++QI+ +D SDY WY +++ N P L V S GH +H F+NG+ +
Sbjct: 238 TMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLS 297
Query: 422 GSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVR 475
GSA+GS D+ T R V+LR G N A+LS+ VGLP+ G E AGV + +
Sbjct: 298 GSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLN 357
Query: 476 VQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGN 533
+ + W Y+VGL GE L ++S G + V W+ + + + LTWYKTTF APAG+
Sbjct: 358 GGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGD 417
Query: 534 DPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT 593
P+A+++ SMGKG+ W+NGQS+GR+W ++K + G+ S+ Y +A+
Sbjct: 418 SPLAVDMGSMGKGQIWINGQSLGRHWPAYK-AVGSCSECSYTGTFREDKCLRNCGEASQR 476
Query: 594 -YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQR 652
YHVPR++LKP+GNLLV+ EE G+P GIT+ + VC + S+ + ++
Sbjct: 477 WYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQ----STLVNYQLH 532
Query: 653 GDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERAC 712
+ K P C G+KI+ + FASFG P+G C Y GSCH+ HS + C
Sbjct: 533 ASGKVNK-PLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLC 591
Query: 713 IGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+G++ CS+ + FGGDPCP + K L V+A C
Sbjct: 592 VGQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVC 624
>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
Length = 831
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 301/802 (37%), Positives = 403/802 (50%), Gaps = 91/802 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGL+ I+TYVFWN HEP+ QY+F G DI+RF KE+Q G+Y LRIG
Sbjct: 63 MWPDLIQKAKDGGLNTIETYVFWNGHEPRPRQYNFEGNYDIMRFFKEVQKAGMYAILRIG 122
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ + FR N+P+
Sbjct: 123 PYICGEWNYGGLPAWLRDIPDMQFRLHNEPFEREMETFTTLIVNKMKDANMFAGQGGPII 182
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQ-DDAPGPVINACNG 146
+IENEY ++ E Y+ W A MA + GVPW+MC+Q +D P VI CNG
Sbjct: 183 LTQIENEYGNVQSNLPDQESATKYIHWCADMANKQNVGVPWIMCQQSNDVPPNVIETCNG 242
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + FK P N P IWTE+WT +++ W Y R A+D+A+ VA+F GS NYY
Sbjct: 243 FYCHD-FK-PKGSNMPKIWTENWTGWFKAWDKPDYHRPAEDVAYAVAMFFQNRGSVQNYY 300
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK LH + + L+ G Q
Sbjct: 301 MYHGGTNFGRTSGGPYITTTYDYDAPLDEYGNIRQPKYGHLKALHTVLTSMEKHLVYGQQ 360
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
N +L +A + G A F+ N+ + K V V F +Y++P S+S+LPDCKTVA
Sbjct: 361 NETNLDDKVKATKYTLDDGSSACFISNSHDNKDVNVTFEGSAYQVPAWSVSVLPDCKTVA 420
Query: 326 FNTERVSTQYNKRSKTSNLKFDSDEKWE---EYREAILNFDNTLLRAEGLLDQISAAKDA 382
+NT +V TQ + K + KW E+ ++ LL+QI D
Sbjct: 421 YNTAKVKTQTSVMVKKESAA-KGGLKWSWLPEFLRPSFTDSYGSFKSNELLEQIVTGADE 479
Query: 383 SDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
SDY WY Q L V + GH L+AFVNGE G H + F V L+
Sbjct: 480 SDYLWYKTSLT-RGPKEQFTLYVNTTGHELYAFVNGELAGYKHAVNGPYLFQFEAPVTLK 538
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-------FTNCSWGYQVGLIGE 495
G N +LLS TVGL + GA E AG+ V+ S +N +W Y+ GL GE
Sbjct: 539 PGKNYISLLSATVGLKNYGASFELMPAGIVGGPVKLVSAHGNTIDLSNNTWTYKTGLFGE 598
Query: 496 KLQIYSNLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQS 554
+ QI+ L + WS PT R TWYK TF+APAG + + ++L + KG +VNG +
Sbjct: 599 QKQIH--LDKPGLRWSPFAVPTNRPFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNGHN 656
Query: 555 IGRYWVSFKTSKGNPS-----QTQYAV--NTVTSIHFCAIIKATNTYHVPRAFLKPTG-- 605
+GRYW S+ + + +Y N + C + YHVPR+FL
Sbjct: 657 LGRYWPSYVAGDMDGCHRCDYRGEYVTWNNQEKCLTGCGEV-GQRFYHVPRSFLNAAHGA 715
Query: 606 -NLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKP 664
N +VL EE G+P + T+A+ VC ++GD
Sbjct: 716 PNTVVLFEEAGGDPAKVNFRTVAVGPVCADA---------------EKGD---------- 750
Query: 665 TVQPSCPLGKKISKIVFASFGNPDGDCERYAVGS-CHSSHSQGVVERACIGKSRCSIPLL 723
V +C G+ IS + ASFG G C Y GS C S + + AC+GK C++
Sbjct: 751 AVTLACAHGRTISSVDTASFGVSGGQCGAYEGGSGCESKPALEAITAACVGKKWCTVSYT 810
Query: 724 SRYFGGDPCPGIHKALLVDAQC 745
+ D C G L V A C
Sbjct: 811 DAFDSAD-CKG-SGVLTVQATC 830
>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
Length = 822
Score = 484 bits (1245), Expect = e-133, Method: Compositional matrix adjust.
Identities = 297/814 (36%), Positives = 420/814 (51%), Gaps = 97/814 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGL+ I+TYVFWN HEP + QYDFSG D+IRFIK I+ +GLY LRIG
Sbjct: 37 MWPQLIRKAKEGGLNTIETYVFWNAHEPHQRQYDFSGNLDLIRFIKTIRDEGLYAILRIG 96
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R++N+ YK
Sbjct: 97 PYVCAEWNYGGFPVWLHNLPGIQIRTNNEVYKNEMEIFTTLIVNMMKDGKLFASQGGPII 156
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ ++ ++G YV W A +A F GVPW+MC+Q DAP P+I++CNG C
Sbjct: 157 LSQIENEYGNVQSSYGDEGKEYVKWCANLAESFKVGVPWIMCQQSDAPSPMIDSCNGFYC 216
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ + N+ + P IWTE+WT ++Q WG K RSA+D+AF VA F GS +NYYMYH
Sbjct: 217 DQYYS--NNKSLPKIWTENWTGWFQDWGQKNPHRSAEDVAFAVARFFQLGGSVMNYYMYH 274
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFG T IT YD APLDEYG +R+PKWGHL++LH+ + + L G
Sbjct: 275 GGTNFGTTGGGPYITASYDYDAPLDEYGNLRQPKWGHLRDLHSVLNSMEQTLTYGESKNS 334
Query: 269 SLGQLQEAFV-FEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
+ F+ G + F + D K T+ F Y LP S+SILPDC T +N
Sbjct: 335 NYPDNNNIFITIFAYQGKRSCFFSSID-YKDQTISFEGTDYFLPAWSVSILPDCFTEVYN 393
Query: 328 TERVSTQY----NKRSKTSNLKFDSDEKWEEYREAIL------NFDNTLLRAEGLLDQIS 377
T V+ Q NK + + + + +W+ E I +F L A L+DQ +
Sbjct: 394 TATVNVQTSIMENKANAADSFREPNSLQWKWRPEKIRGLSLQGDFVGNTLVANELMDQKA 453
Query: 378 AAKDASDYFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDN- 430
SDY W + +N +++ L V ++GH++HAFVNG++ GS S ++
Sbjct: 454 VTNGTSDYLWIMTNYDHNMNDSLWGAGKDIILQVHTNGHVVHAFVNGKHVGSQSASIESG 513
Query: 431 -VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-------RVRVQDK--- 479
F + + L++G N +L+SV+VGL + GA + G++ R ++ ++
Sbjct: 514 RFDFVFESKIKLKRGINRISLVSVSVGLQNYGANFDTAPTGINGPITIIGRSKLGNQPDV 573
Query: 480 --SFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPI 536
++ W Y+ GL GE + ++ + + Q WYKT+F AP G DP+
Sbjct: 574 TVDISSNRWVYKTGLHGEDQGFQAVRPRHRRQFYTKHVLINQPFVWYKTSFNAPLGQDPV 633
Query: 537 ALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT--- 593
++L +GKG AWVNG++IGR+W + C T
Sbjct: 634 VVDLLGLGKGTAWVNGRNIGRFWPKALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRY 693
Query: 594 YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
YH+PR +LKP N LVL EE G P ++V T+ + KVC H H
Sbjct: 694 YHIPRDWLKPEDNKLVLFEELGGTPDFVSVQTVTVGKVCVHGYEGH-------------- 739
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ--GVVERA 711
TV+ SC G+K SKI FASFG P G C + + H H+ +VE+A
Sbjct: 740 -----------TVELSCQHGRKFSKITFASFGLPQGKCGSFTPSNNHDCHADVSTIVEKA 788
Query: 712 CIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C+GK RCSI + + C L V+A C
Sbjct: 789 CVGKERCSIDISEKALAPIHCDARIYRLAVEAVC 822
>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 301/803 (37%), Positives = 406/803 (50%), Gaps = 94/803 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TY+FWN HEP + QY+F G D++RF KEIQ+ G+Y LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYIFWNGHEPHRRQYNFEGNYDVVRFFKEIQNAGMYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N+P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPGMQFRLHNEPFENEMETFTTLIVNKMKDSKMFAEQGGPII 180
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQ-DDAPGPVINACNG 146
+IENEY I ++ Y+ W A MA + GVPW+MC+Q DD P V+N CNG
Sbjct: 181 LAQIENEYGNIMGKLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDDDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLKELH+ +K + L+ G
Sbjct: 299 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKELHSVLKSMEKTLVHGEY 358
Query: 266 NVISLGQ--LQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKT 323
+ G + + +S A F+ N + K V V ++ LP S+SILPDCKT
Sbjct: 359 FDTNYGDNITVTKYTLDSSS---ACFINNRFDDKDVNVTLDGATHLLPAWSVSILPDCKT 415
Query: 324 VAFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISA 378
VAFN+ ++ TQ + K N E KW E + F + R LL+QI
Sbjct: 416 VAFNSAKIKTQTSVMVKKPNTAEQEQESLKWSWMPENLSPFMTDEKGNFRKNELLEQIVT 475
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
+ D SDY WY ++ + L V + GH L+AFVNG+ G H + + F L +
Sbjct: 476 STDQSDYLWYRTSLNHKGEGSYK-LYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESP 534
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVG 491
V L G N +LLS TVGL + G E+ G+ V++ D + +N SW Y+ G
Sbjct: 535 VKLHDGKNYISLLSATVGLKNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAG 594
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSP-TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
L E QI+ + K ++ P R TWYK TF AP+G D + ++L + KG AWV
Sbjct: 595 LASEYRQIHLDKPGYKWNGNNGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWV 654
Query: 551 NGQSIGRYWVSFKTSKGNPSQT-------QYAVNTVTSIHFCAIIKATNTYHVPRAFLKP 603
NG ++GRYW S+ ++ Q + + C + YHVPR+FL
Sbjct: 655 NGNNLGRYWPSYTAAEMAGCHRCDYRGAFQAEGDGTRCLTGCG-EPSQRYYHVPRSFLAA 713
Query: 604 -TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK 662
N L+L EE G+P G+ + T+ VC + GD
Sbjct: 714 GEPNTLLLFEEAGGDPSGVALRTVVPGAVC---------------TSGEAGD-------- 750
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
V SC G +S + ASFG G C Y G C S + AC+GK C++ +
Sbjct: 751 --AVTLSCGGGHAVSSVDVASFGVGRGRCGGYE-GGCESKAAYEAFTAACVGKESCTVEI 807
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
+ G G+ L V A C
Sbjct: 808 TGAFAGAGCLSGV---LTVQATC 827
>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
[Cucumis sativus]
Length = 635
Score = 480 bits (1235), Expect = e-132, Method: Compositional matrix adjust.
Identities = 264/639 (41%), Positives = 367/639 (57%), Gaps = 28/639 (4%)
Query: 126 VPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSA 185
VPWVMCKQDDAP P+IN CNG C + PN P KP+ WTE WT+++ +GG + R
Sbjct: 3 VPWVMCKQDDAPDPMINTCNGFYC--DYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPV 60
Query: 186 QDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWG 244
+D+AF VA FI K GS VNYYMYHGGTNFGRTA IT YD AP+DEYGL+R+PK+G
Sbjct: 61 EDLAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFG 120
Query: 245 HLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFR 304
HLK LH A+KLC + LLTG + +L Q+A VF +SG CAAFL N V F
Sbjct: 121 HLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTFN 180
Query: 305 NISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF-D 363
Y LP SISILPDCK+V +NT +V Q N+ S K +S WE Y E I + +
Sbjct: 181 GRHYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSFLPT-KVES-FSWETYNENISSIEE 238
Query: 364 NTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQ------APLDVQSHGHILHAFVN 417
++ + +GLL+Q++ KD SDY WYT + + + + L S GH +H F+N
Sbjct: 239 DSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGHGMHVFIN 298
Query: 418 GEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------ 471
G+ GS+ G+HDN FT ++L+ G N +LLS+ GLP++G E + GV
Sbjct: 299 GKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMGVLGPVAI 358
Query: 472 HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQ-LTWYKTTFR 528
H + + W Y+VGL GE + + S + V W+ S++ Q LTWYK F
Sbjct: 359 HGLDXGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQENAQPLTWYKAYFD 418
Query: 529 APAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAI 587
AP G++P+AL++ SM KG+ W+NGQ++GRYW T+ GN + Y+ F
Sbjct: 419 APEGDEPLALDMGSMQKGQVWINGQNVGRYWTI--TANGNCTDCSYSGTYRPRKCQFGCG 476
Query: 588 IKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWL 647
YHVPR++L PT NL+V+ EE GNP I++ ++ +C + + P + +
Sbjct: 477 QPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICTEAS-QYRPVIKNVH 535
Query: 648 RHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGV 707
H+ G+ + + K + C G+ IS I FASFG P G C + G+CHS S V
Sbjct: 536 MHQNNGELNEQNVLK---INLHCAAGQFISAIKFASFGTPSGACGSHKQGTCHSPKSDYV 592
Query: 708 VERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQCR 746
+++ C+G+ RC + + FG DPCP + K L + C+
Sbjct: 593 LQKLCVGRQRCLATIPTSIFGEDPCPNLRKKLSAEVVCQ 631
>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
Length = 620
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 274/625 (43%), Positives = 358/625 (57%), Gaps = 59/625 (9%)
Query: 48 IQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--------------- 92
+ GLYV LRIGP++ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 2 VHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMK 61
Query: 93 ----------------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDA 136
IENEY +E G Y W A+MA+ TGVPW+MCKQ+DA
Sbjct: 62 AEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDA 121
Query: 137 PGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
PGP+I+ CNG C E FK PNS NKP +WTE+WT +Y +GG R +DIA+ VA FI
Sbjct: 122 PGPIIDTCNGYYC-EDFK-PNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFI 179
Query: 197 AKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLC 256
K GS VNYYMYHGGTNF RTA FM + Y APLDEYGL REPK+ HLK LH AIKL
Sbjct: 180 QKGGSLVNYYMYHGGTNFDRTAGEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLS 239
Query: 257 SRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSIS 316
LL+ V SLG QEA+VF S CAAFL N DE A VLFR Y+LP S+S
Sbjct: 240 EPALLSADATVTSLGAKQEAYVFWSKSS-CAAFLSNKDENSAARVLFRGFPYDLPPWSVS 298
Query: 317 ILPDCKTVAFNTERVSTQYNKRSKT-SNLKFDSDEKWEEYREA--ILNFDNTLLRAEGLL 373
ILPDCKT +NT +V+ R+ + KF W + EA N T R GL+
Sbjct: 299 ILPDCKTEVYNTAKVNAPSVHRNMVPTGTKFS----WGSFNEATPTANEAGTFAR-NGLV 353
Query: 374 DQISAAKDASDYFWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGS 427
+QIS D SDYFWY S +P L V S GH LH FVNG+ +G+A+G
Sbjct: 354 EQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGG 413
Query: 428 HDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSF 481
D+ T + L G N ALLSV VGLP+ G E+ GV V
Sbjct: 414 LDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDM 473
Query: 482 TNCSWGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALN 539
+ W Y++G+ GE L +++N + V W+ S + + LTWYK+TF PAGN+P+AL+
Sbjct: 474 SKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALD 533
Query: 540 LQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPR 598
+ +MGKG+ W+NG++IGR+W ++K ++G+ + YA +A+ YHVPR
Sbjct: 534 MNTMGKGQVWINGRNIGRHWPAYK-AQGSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPR 592
Query: 599 AFLKPTGNLLVLLEEENGNPLGITV 623
++LK + NL+V+ EE G+P GI++
Sbjct: 593 SWLK-SQNLIVVFEELGGDPNGISL 616
>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 830
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 297/806 (36%), Positives = 400/806 (49%), Gaps = 92/806 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP++ QY+F G DI+RF KE+Q G+Y LRIG
Sbjct: 56 MWPDLIRKAKEGGLDAIETYVFWNGHEPRRRQYNFEGSYDIVRFFKEVQDAGMYAILRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D++G+ FR N P+
Sbjct: 116 PYICGEWNYGGLPAWLRDISGMQFRMHNHPFEQEMETFTTLIVDKLKEAKMFAGQGGPII 175
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQ-DDAPGPVINACNG 146
+IENEY I +E Y+ W A MA + GVPW+MC+Q DD P VIN NG
Sbjct: 176 LSQIENEYGNIMGKLNNNESASEYIHWCAAMANKQNVGVPWIMCQQDDDVPSNVINTWNG 235
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F P + P IWTE+WT +++ W + RSA+DIAF VA+F GS NYY
Sbjct: 236 FYCHDWF--PKRTDIPKIWTENWTGWFKAWDKPDFHRSAEDIAFSVAMFFQTRGSLQNYY 293
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH +K + LL G
Sbjct: 294 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHNVLKSMEKILLHGDY 353
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRN-ISYELPRKSISILPDCKTV 324
++G A F+ N + K V V N ++ +P S+SILPDCKTV
Sbjct: 354 KDTTMGNTNVTVTKYTLDNSSACFISNKFDDKEVNVTLDNGATHTVPAWSVSILPDCKTV 413
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNF---DNTLLRAEGLLDQISAAK 380
A+N+ ++ TQ + K + +D W E + F + R LL+QI+ +
Sbjct: 414 AYNSAKIKTQTSVMVKRPGAETVTDGLAWSWMPENLQPFMTDEKGNFRKNELLEQIATSG 473
Query: 381 DASDYFWYTFRF-HYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY F H SN + L V + GH L+AFVNG+ G + + +F + V
Sbjct: 474 DQSDYLWYRTSFEHKGESNYK--LHVNTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPV 531
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDK-------SFTNCSWGYQV 490
L G N +LLS T+GL + GA E AG+ V++ D +N SW Y+
Sbjct: 532 KLHSGKNYISLLSATIGLKNYGALFEMMPAGIVGGPVKLVDTVTNTTAYDLSNSSWSYKA 591
Query: 491 GLIGEKLQIYSNLGLNKVLWSSIRSPT----RQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
GL GE + + + ++ WS + T R TWYK TF APAG +P+ +L +GKG
Sbjct: 592 GLAGEYRETHLDKANDRSQWSGGLNGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKG 651
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQ-TQYAVNTVTSIHFCAIIKATNT-----YHVPRAF 600
WVNG ++GRYW S+ + + Q Y + N YHVPR+F
Sbjct: 652 VVWVNGNNLGRYWPSYVAADMDGCQRCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSF 711
Query: 601 LKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
+K N +VL EE G+P ++ T+A+ +
Sbjct: 712 IKAGEPNTMVLFEEAGGDPTRVSFHTVAVGA------------------------ACAEA 747
Query: 660 FGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
V +C G+ IS + AS G G C Y G C S + AC+GK C+
Sbjct: 748 AEVGDEVALACSHGRTISSVDVASLGVARGKCGAYQ-GGCESKAALAAFTAACVGKESCT 806
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
+ + G C L V A C
Sbjct: 807 VRHTEDFRAGSGCDS--GVLTVQATC 830
>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 851
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 289/811 (35%), Positives = 407/811 (50%), Gaps = 106/811 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L A+AK GLDVIQTY+FW++++P G++ + R D +RFIK Q GL V RIG
Sbjct: 80 MWPELFARAKANGLDVIQTYLFWDVNQPTPGEFVMTDRFDYVRFIKLAQQAGLMVNFRIG 139
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P WL ++GIVFR ++KP+
Sbjct: 140 PYVCAEWNYGGFPAWLRQISGIVFRDNDKPWLDVVGPYITKTVQVLKDNKLLAADGGPVI 199
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY IE ++ GP YV W ++A + G W+MC+QDDAP I CNG C
Sbjct: 200 LLQIENEYGNIEDSY-AGGPAYVQWCGQLAASLNAGAQWIMCQQDDAPANTIATCNGFYC 258
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+P +WTE+W ++Q WG R AQD+AF A F AK G+Y++YYMYH
Sbjct: 259 DNYVP---HKGQPMMWTENWPGWFQTWGQPSPHRPAQDVAFAAARFYAKGGTYMSYYMYH 315
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV- 267
GGTNFGRTA IT YD LDEYG+ EPK+ HL LHA + ++ + NV
Sbjct: 316 GGTNFGRTAGGPGITTSYDYDVALDEYGMPSEPKYSHLGSLHAVLHANEHIIM--SMNVP 373
Query: 268 --ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
ISLG+ EA VF +SG C AFL N D V F ++ELP S+SIL +C
Sbjct: 374 APISLGKNLEAHVFNSSSG-CVAFLSNIDSSVDAEVQFNGRTFELPAWSVSILHNCAFAI 432
Query: 326 FNTERVSTQYNKRSKT-----------------SNLKFDSDEK------WEEYREAILNF 362
+NT VS N R T S K + E+ + Y E I
Sbjct: 433 YNTAAVSAPLNARRMTPLVVHEDAVSDAADHRRSLSKGEGQERVGAFSTFASYAETIGRR 492
Query: 363 DNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQ----APLDVQSHGHILHAFVNG 418
+ +QI+ D +DY WYT ++ S+ +Q + ++ + ++ FV
Sbjct: 493 AEEAVYFTSPQEQINTTNDTTDYLWYTTTYNSASATSQVLSISNVNDVVYVYVNRQFVTM 552
Query: 419 EYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQ 477
++GS + V L GTN +LS T GL + G FLE+ G+ V++
Sbjct: 553 SWSGS-----------VNKAVPLMAGTNVIDVLSTTFGLQNYGTFLEQVTRGIQGTVKLG 601
Query: 478 DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAP-AGNDPI 536
T W +QVGL+GE+L I+ + V W++ + R LTWY+++F P + P+
Sbjct: 602 STDLTQNGWWHQVGLLGEELGIFLPQNASNVPWATPATTNRGLTWYRSSFDLPQSSQAPL 661
Query: 537 ALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTY 594
AL++ MGKG WVNG ++GRYW S Y A + C I + Y
Sbjct: 662 ALDMTGMGKGFVWVNGHNLGRYWPSRIADSMACDDCDYRGAYDDSRCRQGCN-IPSQRYY 720
Query: 595 HVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGD 654
HVPR +L+PT NL+V+LEE GNP I++ CG V +
Sbjct: 721 HVPREWLQPTNNLIVMLEEIGGNPALISLVEREEDISCGAVGEDYP-------------- 766
Query: 655 TDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIG 714
+V C L + I ++ FASFG P G C ++++GSC++++S +VE C+G
Sbjct: 767 ------ADDLSVVLGCGLHQTIRRVEFASFGTPVGTCRQFSLGSCNAANSTAIVESLCLG 820
Query: 715 KSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ C +P+ +F GDPCP K L V C
Sbjct: 821 RQACHVPVAINHF-GDPCPDTTKRLFVQVSC 850
>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 613
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 261/610 (42%), Positives = 352/610 (57%), Gaps = 51/610 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEPQ+ +YDFSGR D I+F + IQ GLYV +RIG
Sbjct: 1 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R++N+ YK
Sbjct: 61 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 120
Query: 93 ---IENEY-QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
IENEY + PA+ + G Y+ W A+MA + GVPW+MC+Q DAP P+IN CNG
Sbjct: 121 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 180
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + F PN+P P ++TE+W +++ WG K R+A+D+AF VA F G + NYYMY
Sbjct: 181 C-DNFT-PNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMY 238
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNFGRT+ IT YD APLDEYG + +PKWGHLK+LHA+IKL + L T++
Sbjct: 239 HGGTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSN 298
Query: 268 ISLGQLQEAFVFEE-TSGVCAAFLVNNDERKAVTV-LFRNISYELPRKSISILPDCKTVA 325
+ G F T+G FL N D + T+ L + Y +P S+SIL C
Sbjct: 299 QNFGSSVTLTKFSNPTTGERFCFLSNTDGKNDATIDLQEDGKYFVPAWSVSILDGCNKEV 358
Query: 326 FNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF--DNTLLRAEGLLDQISAAKDAS 383
+NT +V++Q + K N K ++ W E + + N A LL+Q D S
Sbjct: 359 YNTAKVNSQTSMFVKEQNEKENAQLSWAWAPEPMKDTLQGNGKFAANLLLEQKRVTVDFS 418
Query: 384 DYFWYTFRFHYN--SSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
DYFWY + N SS L V + GH+LHAFVN Y GS GS+ SF + L
Sbjct: 419 DYFWYMTKVDTNGTSSLQNVTLQVNTKGHVLHAFVNKRYIGSKWGSNGQ-SFVFEKPILL 477
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHR---VRVQDKSFT----NCSWGYQVGLIG 494
+ G N LLS TVGL + AF + G+ + D + T + W Y+VGL G
Sbjct: 478 KSGINTITLLSATVGLKNYDAFYDMVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLNG 537
Query: 495 EKLQIYSNLGLNKVLWSSI--RSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
E QIY+ + + W + +S R++TWYKT+F+ PAG DP+ L++Q MGKG+AWVNG
Sbjct: 538 EMKQIYNPVFSQRTNWIPLNQKSIGRRMTWYKTSFKTPAGIDPVVLDMQGMGKGQAWVNG 597
Query: 553 QSIGRYWVSF 562
QSIGR+W SF
Sbjct: 598 QSIGRFWPSF 607
>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
Length = 628
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 249/572 (43%), Positives = 337/572 (58%), Gaps = 52/572 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP+LI AKEGG+DVI+TYVFWN HE G Y F GR D+++F K +Q G+Y+ LRIG
Sbjct: 57 MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTI--------EPAFHEKGPP-- 110
PF+ +EW +GG+P+WLH + G VFR+ N+P+ E T E F +G P
Sbjct: 117 PFVAAEWNFGGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPII 176
Query: 111 ---------------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
Y LWAAKMAV +T VPW+MC+Q DAP PVI+ CN C
Sbjct: 177 LSQIENEYGYYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYC 236
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P SP +P +WTE+W +++ +GG+ R +D+AF VA F K GS NYYMYH
Sbjct: 237 DQF--TPTSPKRPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYH 294
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA IT YD AP+DEYGL R PKWGHLKELH AIKLC LL G I
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNI 354
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA ++ ++SG CAAF+ N D++ V+FRN SY LP S+SILPDCK V FNT
Sbjct: 355 SLGPSVEADIYTDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNT 414
Query: 329 ERVSTQYNKRSKTSNLKFDSDE-----KWEEYREAILNFDNTLLRAEGLLDQISAAKDAS 383
+VS+ N + SD+ KW+ ++E + G +D I+ KD +
Sbjct: 415 AKVSSPTNIVAMIPEHLQQSDKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTT 474
Query: 384 DYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY W+T +++ ++ L ++S GH LHAFVN +Y G+ G+ + +FT +N
Sbjct: 475 DYLWHTTSILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKN 534
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFTNCSWGYQVGL 492
+ LR G N+ A+LS+TVGL +G F + AGV V++ + ++ +W Y++G+
Sbjct: 535 PISLRAGKNEIAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGV 594
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTR--QLTW 522
+GE L IY G+N V W+S P + LTW
Sbjct: 595 LGEHLSIYQGEGMNSVKWTSTSEPPKGQALTW 626
>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
vinifera]
Length = 722
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 292/764 (38%), Positives = 388/764 (50%), Gaps = 148/764 (19%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRN--DIIRFIKEIQSQGLYVCLR 58
MWP +I KA+ GGL+VI TY FWNLHEP + R D++ K I SQG
Sbjct: 86 MWPDIIXKARHGGLNVIHTYAFWNLHEPVQDHMKRFTRMIIDMMSKEKXIASQG------ 139
Query: 59 IGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM 118
GP I + L D A AF E G V WA M
Sbjct: 140 -GPII----------LALVDSA---------------------IAFKEMGTRCVHWAGTM 167
Query: 119 AVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGG 178
AV TG+P VMCKQ DAP PVIN C G CG+TF GPN PNK S+ + Y+V+G
Sbjct: 168 AVGLKTGIPXVMCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSV-SNHXLGMYRVFGD 226
Query: 179 KPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLV 238
P R+A+D+AF + FI+KNG+ NYYMY+ TNFGRT ++F T YYD+APLDEYGL
Sbjct: 227 PPSQRAAEDLAF--SXFISKNGTLANYYMYYSVTNFGRTTSSFATTCYYDEAPLDEYGLP 284
Query: 239 REPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET-SGVCAAFLVNNDERK 297
RE KWGHL++LHAA++L + LL G + LG+ EA ++E+ S +CA FL+NN R
Sbjct: 285 RETKWGHLRDLHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNNITRT 344
Query: 298 AVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYRE 357
T R Y LP+ SIS LPDCKTV FNT+ V +QY+ + + +W ++
Sbjct: 345 PTTTTLRGSKYYLPQHSISNLPDCKTVVFNTQTVVSQYS---------VNKNLQWXMSQD 395
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLD------VQSHGHI 411
A+ ++ + + ++ ++ KD +DY WYT + D V + GH+
Sbjct: 396 ALPTYEECPTKTKSPVELMTMTKDTTDYLWYTTNIELARTGLPFRKDVLRVPQVSNLGHV 455
Query: 412 LHAFVNGEY-----TGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLER 466
+HAF+NGEY TG+ HGS+ SF + L+ G N A L TVGLPDSG+++E
Sbjct: 456 MHAFLNGEYMEFYLTGTRHGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEH 515
Query: 467 KVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTW-YKT 525
++AGVH V +Q GLN +I P W +K
Sbjct: 516 RLAGVHNVAIQ--------------------------GLNT---RTIDLPKNG--WGHKA 544
Query: 526 TFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFC 585
F AP G+ P+AL L +M KG AW+NG+SI YWVS+ + G PSQ+
Sbjct: 545 YFDAPEGDVPVALELSTMAKGMAWINGKSIDXYWVSYLSPLGKPSQS------------- 591
Query: 586 AIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSS 645
YHVPRAFLK + NLLVL EE NP GI + T+ +C +++ H + S
Sbjct: 592 -------VYHVPRAFLKTSDNLLVLFEETGRNPDGIEILTLNRDTICCYISEHHPTHVRS 644
Query: 646 WLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQ 705
W R+ D I FG+P G C + G+C + +S
Sbjct: 645 W--KREASDIQI--------------------------FGDPTGTCXEFIPGNCAAPNSX 676
Query: 706 GVVERACIGKSRCSIPLLSRYFGGDPC----PGIHKALLVDAQC 745
VVE+ C+GKS CSIP+ D GI KAL V C
Sbjct: 677 KVVEKHCLGKSSCSIPVEQEIVSKDGISISGSGITKALAVQVLC 720
>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
Length = 775
Score = 467 bits (1202), Expect = e-128, Method: Compositional matrix adjust.
Identities = 292/776 (37%), Positives = 403/776 (51%), Gaps = 101/776 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TYVFW+ HEP + QYDFSG DI++F + IQ GLYV LRIG
Sbjct: 55 MWPELINKAKDGGLDAIETYVFWDRHEPVRRQYDFSGNLDIVKFFRVIQEAGLYVILRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKMAV 120
P++ +EW YGG P+WLH+ G+ R+DN+ YK+ P +++ V
Sbjct: 115 PYVCAEWNYGGFPMWLHNTPGVELRTDNEIYKV----------------PLLIFFVSNNV 158
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
+ IN CNG C +TFK PN+P P ++TE+W+ +Y++WGGK
Sbjct: 159 RIVSQ---------------INTCNGYYC-DTFK-PNNPKSPKMFTENWSGWYKLWGGKT 201
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVR 239
R+A+D+AF VA F+ G + NYYMY+GGTNFGRTA IT YD +PLDEYG +
Sbjct: 202 SYRTAEDMAFSVARFVQAGGVFNNYYMYYGGTNFGRTAGGPYITASYDYDSPLDEYGNLN 261
Query: 240 EPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCA----AFLVNNDE 295
+PKWGHLK+LHA+IKL + + GT +++ Q + FL N +
Sbjct: 262 QPKWGHLKQLHASIKLGEKIITNGT---VTIKNFQAGVDLTAYTNNATRERFCFLSNINI 318
Query: 296 RKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYN-------KRSKTSNLKFDS 348
A L ++ +Y +P S+SIL +C FNT +V+TQ + + K +NL +
Sbjct: 319 ADAHIDLQQDGNYTIPAWSVSILQNCSKEIFNTAKVNTQTSLMVKKLYENDKPTNLSW-- 376
Query: 349 DEKW--EEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQ---APL 403
W E ++ +L R LLDQ DASDY WY F N + Q L
Sbjct: 377 --VWAPEPMKDTLLG--KGRFRTSQLLDQKETTVDASDYLWYMTSFDMNKNTLQWTNVTL 432
Query: 404 DVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAF 463
V S GH+LHA+VN + + FT V L+ G N +LLS TVGL + G+F
Sbjct: 433 RVTSRGHVLHAYVNKKLIVGSQLVIQG-EFTFEKPVTLKPGNNVISLLSATVGLANYGSF 491
Query: 464 LERKVAGVHRVRVQ----DKSFTNCS---WGYQVGLIGEKLQIYSNLGLNKVLWSSIR-- 514
++ G+ VQ K + S W Y++GL GE + Y + WS+
Sbjct: 492 FDKTPVGIVDGPVQLMANGKPVMDLSSNLWSYKIGLNGEAKRFYDPTSRHNK-WSAANGV 550
Query: 515 SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVS-FKTSKGNPSQTQ 573
S R +TWYKTTF +P+G DP+ ++LQ MGKG AW NG+S+GRYW S + G
Sbjct: 551 STARPMTWYKTTFSSPSGTDPVVVDLQGMGKGHAWANGKSLGRYWPSQIANANGCSGTCD 610
Query: 574 Y--AVNTVTSIHFCAIIKATNTYHVPRAFLKPTG-NLLVLLEEENGNPLGITVDTIAIRK 630
Y N C I YHVPR+FL G N L+L EE G+P GI+ +
Sbjct: 611 YRGPYNAGKCTRNCG-IPTQRWYHVPRSFLNSNGKNTLILFEEVGGDPSGISFQIVTTET 669
Query: 631 VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGD 690
+CG+ + T++ SC G+ IS+I FAS+GNP G
Sbjct: 670 ICGNAY-------------------------EGSTLELSCQGGRTISEIQFASYGNPQGT 704
Query: 691 CERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGI-HKALLVDAQC 745
C + GS + +S +V++ C+GK CSI F + GI +K L V A C
Sbjct: 705 CSSFKKGSFDAMNSVQMVQKECVGKDSCSIIASDETFMVNEPQGISNKRLAVQAHC 760
>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
gi|223947135|gb|ACN27651.1| unknown [Zea mays]
gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
Length = 822
Score = 466 bits (1200), Expect = e-128, Method: Compositional matrix adjust.
Identities = 304/808 (37%), Positives = 402/808 (49%), Gaps = 102/808 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGL+ I+TYVFWN HEP++ QY+F G DIIRF KEIQ+ G++ LRIG
Sbjct: 53 MWPDLINKAKEGGLNTIETYVFWNGHEPRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIG 112
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N P+
Sbjct: 113 PYICGEWNYGGLPAWLRDIPGMQFRLHNAPFEREMETFTTLIVNKMKDVNMFAGQGGPII 172
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY I ++ Y+ W A MA GVPW+MC+QD D P VIN CNG
Sbjct: 173 LAQIENEYGNIMGQLKNNQSASQYIHWCADMANKQEVGVPWIMCQQDNDVPHNVINTCNG 232
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 233 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSVHNYY 290
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH I+ + L+ G
Sbjct: 291 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGKY 350
Query: 266 NVISLGQ-LQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV 324
N S G+ + S VC F+ N + + V ++ +P S+SILP+CKTV
Sbjct: 351 NDTSYGKNVTVTKYMYGGSSVC--FINNQFVDRDMKVTLGGETHLVPAWSVSILPNCKTV 408
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISAA 379
A+NT ++ TQ + K +N E +W E + F R LL+QI+ +
Sbjct: 409 AYNTAKIKTQTSVMVKKANSVEKEPETMRWSWMPENLKPFMTDHRGSFRQSQLLEQIATS 468
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY + + L V + GH ++AFVNG G H + F L++ V
Sbjct: 469 TDQSDYLWYRTSLEHKGEGSYT-LYVNTSGHEMYAFVNGRLVGQNHSADGAFVFQLQSPV 527
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-------DKSFTNCSWGYQVGL 492
L G N +LLS TVGL + G E AG+ V+ T SW Y+ GL
Sbjct: 528 KLHSGKNYVSLLSGTVGLKNYGPSFELVPAGIAGGPVKLVGTNGTAIDLTKSSWSYKSGL 587
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSP---TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W S R TWYKTTF APAG + + ++L + KG AW
Sbjct: 588 AGELRQIH--LDKPGYKWQSHNGTIPVNRPFTWYKTTFEAPAGEEAVVVDLLGLNKGVAW 645
Query: 550 VNGQSIGRYWVSFKTS----------KGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRA 599
VNG S+GRYW S+ + +G + +T C A YHVPR+
Sbjct: 646 VNGNSLGRYWPSYTAAEMPGCHVCDYRGKFIAEGDGIRCLTG---CG-EPAQRFYHVPRS 701
Query: 600 FLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIK 658
FL+ N L+L EE G+P T+A+ VC + + GD
Sbjct: 702 FLRAGEPNTLILFEEAGGDPTRAAFHTVAVGPVC--------------VAAVELGD---- 743
Query: 659 KFGKKPTVQPSC-PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSR 717
V SC G+ ++ + ASFG G C Y G C S + AC+G+
Sbjct: 744 ------DVTLSCGGHGRVVASVDVASFGVARGSCGAYK-GGCESKAALKAFTDACVGRES 796
Query: 718 CSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C++ + + G G AL V A C
Sbjct: 797 CTVKYTAAFAGAGCQSG---ALTVQATC 821
>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
Length = 830
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 305/810 (37%), Positives = 405/810 (50%), Gaps = 102/810 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGL+ I+TYVFWN HEP++ QY+F G DI+RF KEIQ+ G++ LRIG
Sbjct: 58 MWPDLINKAKEGGLNTIETYVFWNGHEPRRRQYNFEGNYDIVRFFKEIQNAGMHAILRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N P+
Sbjct: 118 PYICGEWNYGGLPAWLRDIPGMQFRLHNDPFEREMETFTTLIVNKMKDANMFAGQGGPII 177
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY I ++ Y+ W A MA GVPW+MC+QD D P VIN CNG
Sbjct: 178 LAQIENEYGNIMGKLENNQSASQYIHWCADMANKQKIGVPWIMCQQDNDVPHNVINTCNG 237
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 238 FYCYDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSVHNYY 295
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH +K + L+ G
Sbjct: 296 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHNLLKSMEKILVHGEY 355
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
S G+ + G F+ N + + V V ++ +P S+SILPDCKTVA
Sbjct: 356 KDTSHGKNVTVTKY-TYGGSSVCFISNQFDDRDVNVTLAG-THLVPAWSVSILPDCKTVA 413
Query: 326 FNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISAAK 380
+NT ++ TQ + K +N E +W E + F D+ R LL+QI+ +
Sbjct: 414 YNTAKIKTQTSVMVKKANSVEKEPEALRWSWMPENLKPFMTDDHGSFRQSRLLEQIATST 473
Query: 381 DASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
D SDY WY + + L V + GH ++AFVNG+ G S+ F L++ V
Sbjct: 474 DQSDYLWYRTSLEHKGEGSYT-LYVNTTGHKIYAFVNGKLVGQNQSSNGAFVFQLQSPVK 532
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAG-----VHRVRVQDKS--FTNCSWGYQVGLI 493
L G N +LLS TVGL + G E AG V V D + T+ SW Y+ GL
Sbjct: 533 LHSGKNYVSLLSGTVGLKNYGPLFELVPAGIAGGPVKLVGANDTAIDLTHSSWSYKSGLA 592
Query: 494 GEKLQIYSNLGLNKVLWSSIRSP-----TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
GE QI+ L W S R TWYKTTF APAG++ + ++L + KG A
Sbjct: 593 GEHRQIH--LDKPGYKWRSHNGSGSIPVNRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAA 650
Query: 549 WVNGQSIGRYWVSFKTSK--GNPSQTQY------AVNTVTSIHFCAIIKATNTYHVPRAF 600
WVNG S+GRYW S+ ++ G Y + + + C + YHVPR+F
Sbjct: 651 WVNGNSLGRYWPSYTAAEMGGCHGACDYRGKFKAEGDGIRCLTGCG-EPSQRFYHVPRSF 709
Query: 601 LKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
L+ N LVL EE G+P T+A+ VC + + GD
Sbjct: 710 LRAGEPNTLVLFEEAGGDPARAAFHTVAVGHVC--------------VAAAEVGD----- 750
Query: 660 FGKKPTVQPSCPLGKK---ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKS 716
V SC G ++ + ASFG G C Y G C S + AC+G+
Sbjct: 751 -----DVTLSCGGGLGGGVVASVDVASFGVTRGGCGDYQ-GGCESKAALKAFRDACVGRE 804
Query: 717 RCSIPLLSRYFGGDPCPGIHKA-LLVDAQC 745
C++ + G PG L V A C
Sbjct: 805 SCTVKYTPAFAG----PGCQSGKLTVQATC 830
>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 636
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 260/570 (45%), Positives = 323/570 (56%), Gaps = 53/570 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F R D+++FIK +Q GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+VFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGMVFRTDNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY IE G Y W A+MA TGVPW+MCKQDDAP +IN CNG C
Sbjct: 179 LSQIENEYGPIEWEIGAPGKAYTKWVAEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT ++ +GG R A+DIA VA FI GS++NYYMYH
Sbjct: 239 -ENFK-PNSDNKPKMWTENWTGWFTEFGGAVPYRPAEDIALSVARFIQNGGSFINYYMYH 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y APLDEYGL REPK+ HLK LH IKLC L++ V S
Sbjct: 297 GGTNFDRTAGEFIATSYDYDAPLDEYGLPREPKYSHLKRLHKVIKLCEPALVSADPTVTS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QEA VF+ S CAAFL N + A VLF +Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEAHVFKSKSS-CAAFLSNYNTSSAARVLFGGSTYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 RVST-QYNKRSKTSNLKFDSDEKWEEYREAILNF-DNTLLRAEGLLDQISAAKDASDYFW 387
+V T + + +N F W Y E I + DN +GL++QIS +D +DYFW
Sbjct: 416 KVRTSSIHMKMVPTNTPFS----WGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFW 471
Query: 388 Y----TFRFHYNSSNAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
Y T + P L + S GH LH FVNG+ G+A+GS + T + L
Sbjct: 472 YLTDITISPDEKFLTGEDPLLTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLH 531
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEK 496
G N ALLS GLP+ G E GV + V T W Y++G GE
Sbjct: 532 AGVNKLALLSTAAGLPNVGVHYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEA 591
Query: 497 LQIYSNLGLNKVLWS--SIRSPTRQLTWYK 524
L +++ G + V W S+ + + LTWYK
Sbjct: 592 LSVHTLAGSSTVEWKEGSLVAKKQPLTWYK 621
>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 808
Score = 463 bits (1191), Expect = e-127, Method: Compositional matrix adjust.
Identities = 306/817 (37%), Positives = 406/817 (49%), Gaps = 141/817 (17%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGL+ I+TYVFWN HEP++ +++F G D++RF KEIQ+ G+Y LRIG
Sbjct: 61 MWPDLIKKAKEGGLNAIETYVFWNGHEPRRREFNFEGNYDVVRFFKEIQNAGMYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP+WL D+ GI FR NKP+
Sbjct: 121 PYICGEWNYGGLPVWLRDIPGIKFRLHNKPFENGMEAFTTLIVKKMKDANMFAGQGGPII 180
Query: 92 --KIENEY--QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY ++P + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGYTMLQPENIQSAHEYIHWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C E F N + P +WTE+WT +Y+ W + R +DIAF VA+F GS NYY
Sbjct: 241 FYCHEWFS--NRTSIPKMWTENWTGWYRDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRTA IT YD APLDEYG +R+PK+GHLKELH+ + + LL G
Sbjct: 299 MYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNLRQPKYGHLKELHSVLMSMEKILLHG-- 356
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNN--DERKAVTVLFRNISYELPRKSISILPDCKT 323
+ I V + T +A +NN D+R V V ++ LP S+SILP+CKT
Sbjct: 357 DYIDTNYGDNVTVTKYTLNATSACFINNRFDDRD-VNVTLDGTTHFLPAWSVSILPNCKT 415
Query: 324 VAFNTERVSTQYNKR-SKTSNLKFDSDE-KWEEYREAILNF---DNTLLRAEGLLDQISA 378
VAFN+ ++ TQ +KTS ++ ++ KW E + F + R LL+QI
Sbjct: 416 VAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVT 475
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
D SDY WY + + L V + GH L+AFVNG+ G + ++N +F L++
Sbjct: 476 TTDQSDYLWYRTSLEHKGEGSYV-LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKS- 533
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVG 491
P+ G E AG+ V++ D S +N SW Y+ G
Sbjct: 534 -------------------PNYGGSFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAG 574
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSP---TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE +IY + NK W S S R TWYKTTF+APAG D + ++L + KG A
Sbjct: 575 LAGEYRKIYLDKPGNK--WRSHNSTIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVA 632
Query: 549 WVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFC---AIIKA--------------- 590
WVNG S+GRYW S Y + H C + KA
Sbjct: 633 WVNGNSLGRYWPS------------YVAADMPGCHHCDYRGVFKAEVEAQKCLTGCGEPS 680
Query: 591 TNTYHVPRAFL-KPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRH 649
YHVPR+FL K N L+L EE G+P + V T+ VC
Sbjct: 681 QQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVRTVVEGSVCASA-------------- 726
Query: 650 RQRGDTDIKKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVV 708
+ GD TV SC G+ IS + ASFG G C Y G C S +
Sbjct: 727 -EVGD----------TVTLSCGAHGRTISSVDVASFGVARGRCGSYD-GGCESKVAYDAF 774
Query: 709 ERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
AC+GK C++ L++ F C + L V A C
Sbjct: 775 AAACVGKESCTV-LVTDAFANAGC--VSGVLTVQATC 808
>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
Length = 837
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 273/690 (39%), Positives = 368/690 (53%), Gaps = 65/690 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TY+FWN HEP + QY+F G D++RF KEIQ+ G+Y LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYIFWNGHEPHRRQYNFEGNYDVVRFFKEIQNAGMYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N+P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPGMQFRLHNEPFENEMETFTTLIVNKMKDSKMFAEQGGPII 180
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQ-DDAPGPVINACNG 146
+IENEY I ++ Y+ W A MA + GVPW+MC+Q DD P V+N CNG
Sbjct: 181 LAQIENEYGNIMGKLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDDDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLKELH+ +K + L+ G
Sbjct: 299 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKELHSVLKSMEKTLVHGEY 358
Query: 266 NVISLGQ--LQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKT 323
+ G + + +S A F+ N + K V V ++ LP S+SILPDCKT
Sbjct: 359 FDTNYGDNITVTKYTLDSSS---ACFINNRFDDKDVNVTLDGATHLLPAWSVSILPDCKT 415
Query: 324 VAFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISA 378
VAFN+ ++ TQ + K N E KW E + F + R LL+QI
Sbjct: 416 VAFNSAKIKTQTSVMVKKPNTAEQEQESLKWSWMPENLSPFMTDEKGNFRKNELLEQIVT 475
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
+ D SDY WY ++ + L V + GH L+AFVNG+ G H + + F L +
Sbjct: 476 STDQSDYLWYRTSLNHKGEGSYK-LYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESP 534
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVG 491
V L G N +LLS TVGL + G E+ G+ V++ D + +N SW Y+ G
Sbjct: 535 VKLHDGKNYISLLSATVGLKNYGPSFEKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAG 594
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSP-TRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
L E QI+ + K ++ P R TWYK TF AP+G D + ++L + KG AWV
Sbjct: 595 LASEYRQIHLDKPGYKWNGNNGTIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWV 654
Query: 551 NGQSIGRYWVSFKTSKGNPSQT-------QYAVNTVTSIHFCAIIKATNTYHVPRAFLKP 603
NG ++GRYW S+ ++ Q + + C + YHVPR+FL
Sbjct: 655 NGNNLGRYWPSYTAAEMAGCHRCDYRGAFQAEGDGTRCLTGCG-EPSQRYYHVPRSFLAA 713
Query: 604 -TGNLLVLLEEENGNPLGITVDTIAIRKVC 632
N L+L EE G+P G+ + T+ VC
Sbjct: 714 GEPNTLLLFEEAGGDPSGVALRTVVPGPVC 743
>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
Length = 677
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 279/683 (40%), Positives = 385/683 (56%), Gaps = 42/683 (6%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
KIENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C +
Sbjct: 7 KIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQ 66
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYHGG
Sbjct: 67 FT--PNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGG 124
Query: 212 TNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
TN R++ F+ T Y AP+DEYGLVR+PKWGHL+++H AIKLC L+ + SL
Sbjct: 125 TNLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSL 184
Query: 271 GQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTER 330
G EA V++ S VCAAFL N D + TV F Y LP S+SILPDCK V NT +
Sbjct: 185 GPNVEAAVYKVGS-VCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQ 243
Query: 331 VSTQYN----KRSKTSNLKFDSD--------EKWEEYREAI-LNFDNTLLRAEGLLDQIS 377
+++Q + ++SN+ D W E + + DN L +A GL++QI+
Sbjct: 244 INSQTTGSEMRYLESSNVASDGSFVTPELAVSDWSYAIEPVGITKDNALTKA-GLMEQIN 302
Query: 378 AAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVS 432
DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS +
Sbjct: 303 TTADASDFLWYSTSITVKGDEPYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSL 362
Query: 433 FTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKS----FTNCSWG 487
+ + + L G N LLS TVGL + GAF + AG+ V++ + ++ W
Sbjct: 363 ISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWT 422
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKG 546
YQ+GL GE L +Y + S+ P L WYKT F PAG+DP+A++ MGKG
Sbjct: 423 YQIGLRGEDLHLYDPSEASPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKG 482
Query: 547 EAWVNGQSIGRYW-VSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRAFLKP 603
EAWVNGQSIGRYW + G + Y A ++ + C T YHVPR+FL+P
Sbjct: 483 EAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQT-LYHVPRSFLQP 541
Query: 604 TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKK 663
N LVL E G+P I+ VC V+ +H + SW + ++++G
Sbjct: 542 GSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQP-----MQRYG-- 594
Query: 664 PTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
P ++ CP G+ IS + FASFG P G C Y+ G C S+ + +V+ ACIG S CS+P+
Sbjct: 595 PALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPV 654
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
S YF G+PC G+ K+L V+A C
Sbjct: 655 SSNYF-GNPCTGVTKSLAVEAAC 676
>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 650
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 257/570 (45%), Positives = 320/570 (56%), Gaps = 54/570 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP GQY F R D+++F+K Q GLYV LRIG
Sbjct: 55 MWPDLIQKAKDGGLDVIQTYVFWNGHEPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +EW GG P+WL V GI FR+DN+P+K
Sbjct: 115 PYICAEWNLGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPII 174
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP PVI+ CNG C
Sbjct: 175 LSQIENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYC 234
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP +WTE+WT +Y +GG R A+D+AF VA FI GS+VNYYMYH
Sbjct: 235 -ENFK-PNKNTKPKMWTENWTGWYTDFGGAVPRRPAEDLAFSVARFIQNGGSFVNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT+ I YD APLDEYGL EPK+ HL+ LH AIK L+ V
Sbjct: 293 GGTNFGRTSGGLFIATSYDYDAPLDEYGLENEPKYEHLRALHKAIKQSEPALVATDPKVQ 352
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA VF G CAAF+ N D + F N Y+LP SISILPDCKTV +NT
Sbjct: 353 SLGYNLEAHVF-SAPGACAAFIANYDTKSYAKAKFGNGQYDLPPWSISILPDCKTVVYNT 411
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFW 387
+V + K+ N F W+ Y E + + A L +Q++ +D+SDY W
Sbjct: 412 AKVGYGWLKKMTPVNSAF----AWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLW 467
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + N++ N Q+P L V S GH+LH F+NG+ G+ G N T + V L
Sbjct: 468 YMTDVNVNANEGFLKNGQSPLLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKL 527
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
R G N +LLSV VGLP+ G E AGV + + + W Y+VGL GE
Sbjct: 528 RAGNNKLSLLSVAVGLPNVGVHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGE 587
Query: 496 KLQIYSNLGLNKVLW--SSIRSPTRQLTWY 523
L +++ G + V W S+ + + LTWY
Sbjct: 588 SLSLHTESGSSSVEWIQGSLVAKKQPLTWY 617
>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
Length = 579
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 240/534 (44%), Positives = 307/534 (57%), Gaps = 50/534 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVIQTYVFWN HEP +GQY FS R D++RF+K ++ GLYV LRIG
Sbjct: 52 MWPDLIQKAKDGGLDVIQTYVFWNGHEPVQGQYYFSDRYDLVRFVKLVKQAGLYVNLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WL V GI FR+DN P+K
Sbjct: 112 PYVCAEWNYGGFPVWLKYVPGISFRTDNGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E YV WAAKMAV + GVPW+MCKQDDAP PVIN CNG C
Sbjct: 172 LAQVENEYGPMESVMGSGAKSYVDWAAKMAVATNAGVPWIMCKQDDAPDPVINTCNGFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKPS+WTE W+ ++ +GG R +D+AF VA FI K GS++NYYMYH
Sbjct: 232 DDF--TPNSKNKPSMWTEAWSGWFTAFGGTVPQRPVEDLAFAVARFIQKGGSFINYYMYH 289
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNF RTA F+ T Y AP+DEYGL+R+PKWGHL LH AIK L+ G V
Sbjct: 290 GGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQPKWGHLTNLHKAIKQAETALVAGDPTVQ 349
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
++G ++A+VF +SG CAAFL N A V F Y+LP SIS+LPDC+T +NT
Sbjct: 350 NIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARVAFNGRRYDLPAWSISVLPDCRTAVYNT 409
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
V+ S + + W+ Y EA + D T +GL++Q+S D SDY WY
Sbjct: 410 ATVTAA----SSPAKMNPAGGFTWQSYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWY 465
Query: 389 TFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T + +S + Q P L V S GH + FVNG+Y G+A+G +D T V +
Sbjct: 466 TTYVNIDSGEQFLKSGQWPQLTVYSAGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMW 525
Query: 443 QGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQV 490
QG+N ++LS VGLP+ G E GV + + + W YQV
Sbjct: 526 QGSNKISILSSAVGLPNVGTHYETWNIGVLGPVTLSGLNEGKRDLSKQKWTYQV 579
>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 616
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 248/586 (42%), Positives = 335/586 (57%), Gaps = 64/586 (10%)
Query: 32 QYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY 91
QYDF GRND++RF+K GLYV LRIGP++ +EW YGG P+WLH + GI R+DN+P+
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 92 K-------------------------------IENEYQTIEPAFHEKGPPYVLWAAKMAV 120
K IENEY I ++ G Y+ WAA MAV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 121 DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKP 180
TGVPWVMC+Q DAP P+IN CNG C + P+ P++P +WTE+W+ ++ +GG
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFT--PSLPSRPKLWTENWSGWFLSFGGAV 178
Query: 181 YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVR 239
R +D+AF VA F + G+ NYYMYHGGTNFGR++ I+ YD AP+DEYGLVR
Sbjct: 179 PYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVR 238
Query: 240 EPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAV 299
+PKWGHL+++H AIK+C L+ + +SLGQ EA V++ S +CAAFL N D++
Sbjct: 239 QPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGS-LCAAFLANIDDQSDK 297
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSD---------- 349
TV F +Y+LP S+SILPDCK V NT ++++Q ++ NL F +
Sbjct: 298 TVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQV-ASTQMRNLGFSTQASDGSSVEAE 356
Query: 350 ---EKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRF-------HYNSSNA 399
W E + L GL++QI+ DASD+ WY+ + N S
Sbjct: 357 LAASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGS-- 414
Query: 400 QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPD 459
Q+ L V S GH+L F+NG+ GS+ GS + +L V L G N LLS TVGL +
Sbjct: 415 QSNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTN 474
Query: 460 SGAFLERKVAGVH-RVRVQDK----SFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIR 514
GAF + AG+ V++ ++ W YQ+GL GE L +Y+ + S
Sbjct: 475 YGAFFDLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEASPEWVSDNS 534
Query: 515 SPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
PT LTWYK+ F APAG+DP+A++ MGKGEAWVNGQSIGRYW
Sbjct: 535 YPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYW 580
>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
Length = 592
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 250/598 (41%), Positives = 345/598 (57%), Gaps = 23/598 (3%)
Query: 164 IWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FM 222
+WTE WT ++ +GG R A+D+AF VA FI K GS++NYYMYHGGTNFGRTA F+
Sbjct: 1 MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 60
Query: 223 ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET 282
T Y APLDEYGL R+PKWGHLK+LH AIKLC L++G + LG QEA V++
Sbjct: 61 ATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSK 120
Query: 283 SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTS 342
SG C+AFL N + + V F N Y LP SISILPDCK +NT RV Q R K
Sbjct: 121 SGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQ-TSRMKMV 179
Query: 343 NLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS----- 397
+ W+ Y E + + GL++QI+ +D SDY WY +++
Sbjct: 180 RVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLR 239
Query: 398 NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVG 456
N P L V S GH +H F+NG+ +GSA+GS D+ T R V+LR G N A+LS+ VG
Sbjct: 240 NGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVG 299
Query: 457 LPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLW 510
LP+ G E AGV + + + + W Y+VGL GE L ++S G + V W
Sbjct: 300 LPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEW 359
Query: 511 S--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGN 568
+ + + + LTWYKTTF APAG+ P+A+++ SMGKG+ W+NGQS+GR+W ++K + G+
Sbjct: 360 AEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYK-AVGS 418
Query: 569 PSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIA 627
S+ Y +A+ YHVPR++LKP+GNLLV+ EE G+P GIT+
Sbjct: 419 CSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRRE 478
Query: 628 IRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNP 687
+ VC + S+ + ++ + K P C G+KI+ + FASFG P
Sbjct: 479 VDSVCADIYEWQ----STLVNYQLHASGKVNK-PLHPKAHLQCGPGQKITTVKFASFGTP 533
Query: 688 DGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+G C Y GSCH+ HS + C+G++ CS+ + FGGDPCP + K L V+A C
Sbjct: 534 EGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVC 591
>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
Length = 811
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 288/812 (35%), Positives = 389/812 (47%), Gaps = 128/812 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DI+RF KEIQ+ GLY LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPGMQFRLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPII 180
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY I + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGNIMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K G
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRG------ 292
Query: 207 MYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG--T 264
++ T Y APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 293 ------------GPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYV 340
Query: 265 QNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV 324
S + + TS A F+ N ++ V V ++ LP S+SILPDCKTV
Sbjct: 341 DTNYSDKVTVTKYTLDSTS---ACFINNRNDNMDVNVTLDGTTHLLPAWSVSILPDCKTV 397
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISAA 379
AFN+ ++ Q + + E KW RE + F + R LL+QI +
Sbjct: 398 AFNSAKIKAQTTVMVNKAKMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTS 457
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY ++ A L V + GH L+AFVNG G H + + F L +
Sbjct: 458 TDQSDYLWYRTSINHKGE-ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPA 516
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ GL
Sbjct: 517 KLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGL 576
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W + + TWYKTTF+APAG D + ++L + KG AW
Sbjct: 577 AGEYRQIH--LDKPGCTWDNNNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAW 634
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------------Y 594
VNG ++GRYW S+ ++ T+ H+ + +A Y
Sbjct: 635 VNGNNLGRYWPSYTAARS-------MRRLPTTAHYRGVFQAEGDGQKCLTGCGEPSQRFY 687
Query: 595 HVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
HVPR+FLK N ++L EE G+P ++ T+A VC + G
Sbjct: 688 HVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTVAAGSVCASA---------------EVG 732
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
DT G+ K IS I SFG G C Y G C S + AC+
Sbjct: 733 DTITLSCGQH---------SKTISAINVTSFGVARGQCGAYK-GGCESKAAYKAFTEACL 782
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK C++ ++ G C + L V A C
Sbjct: 783 GKESCTVQ-ITNAVTGSGC--LSNVLTVQASC 811
>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
Flags: Precursor
gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 809
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 288/812 (35%), Positives = 386/812 (47%), Gaps = 130/812 (16%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DI+RF KEIQ+ GLY LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPGMQFRLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPII 180
Query: 92 --KIENEYQTIEPAFH--EKGPPYVLWAAKMAVDFHTGVPWVMCKQD-DAPGPVINACNG 146
+IENEY I + + Y+ W A MA + GVPW+MC+QD D P V+N CNG
Sbjct: 181 LAQIENEYGNIMGQLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K G
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRG------ 292
Query: 207 MYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG--T 264
++ T Y APLDEYG +R+PK+GHLK+LH+ IK + L+ G
Sbjct: 293 ------------GPYITTSYDYDAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYV 340
Query: 265 QNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV 324
S + + TS A F+ N ++ V V ++ LP S+SILPDCKTV
Sbjct: 341 DTNYSDKVTVTKYTLDSTS---ACFINNRNDNMDVNVTLDGTTHLLPAWSVSILPDCKTV 397
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISAA 379
AFN+ ++ Q + + E KW RE + F + R LL+QI +
Sbjct: 398 AFNSAKIKAQTTVMVNKAKMVEKEPESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTS 457
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
D SDY WY ++ A L V + GH L+AFVNG G H + + F L +
Sbjct: 458 TDQSDYLWYRTSINHKGE-ASYTLFVNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPA 516
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV--HRVRVQDKS-----FTNCSWGYQVGL 492
L G N +LLS T+GL + G E+ AG+ V++ D + +N SW Y+ GL
Sbjct: 517 KLHDGKNYISLLSATIGLKNYGPLFEKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGL 576
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
GE QI+ L W + + TWYKTTF+APAG D + ++L + KG AW
Sbjct: 577 AGEYRQIH--LDKPGCTWDNNNGTVPINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAW 634
Query: 550 VNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT---------------Y 594
VNG ++GRYW PS T + + + +A Y
Sbjct: 635 VNGNNLGRYW---------PSYTAAEMGGCHHCDYRGVFQAEGDGQKCLTGCGEPSQRFY 685
Query: 595 HVPRAFLKP-TGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
HVPR+FLK N ++L EE G+P ++ T+A VC + G
Sbjct: 686 HVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTVAAGSVCASA---------------EVG 730
Query: 654 DTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
DT G+ K IS I SFG G C Y G C S + AC+
Sbjct: 731 DTITLSCGQH---------SKTISAINVTSFGVARGQCGAYK-GGCESKAAYKAFTEACL 780
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK C++ ++ G C + L V A C
Sbjct: 781 GKESCTVQ-ITNAVTGSGC--LSNVLTVQASC 809
>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 700
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 249/620 (40%), Positives = 335/620 (54%), Gaps = 102/620 (16%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP +GQY F+ R D++RF+K ++ GLYV LR+G
Sbjct: 70 MWPGLIQKAKDGGLDVVQTYVFWNGHEPAQGQYYFADRYDLVRFVKLVRQAGLYVHLRVG 129
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 130 PYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFKAAMQKFVEKIVSMMKSEGLFEWQGGPII 189
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENE+ +E G PY WAA+MAV + GVPWVMCKQDDAP PVIN CNG C
Sbjct: 190 MAQVENEFGPMESVVGSGGKPYAHWAAQMAVGTNAGVPWVMCKQDDAPDPVINTCNGFYC 249
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN+ +KP++WTE WT ++ +GG R +D+AF VA F+ K GS+VNYYMYH
Sbjct: 250 --DYFTPNNKHKPTMWTEAWTGWFTKFGGAAPHRPVEDLAFAVARFVQKGGSFVNYYMYH 307
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEY--------------------------------- 235
GGTNFGRTA F+ T Y AP+DE+
Sbjct: 308 GGTNFGRTAGGPFIATSYDYDAPIDEFGMQWLLPSLINLNSHRLPRDICRKSSQCGFYLS 367
Query: 236 ----------------GLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVF 279
GL+R+PKWGHL+ +H AIK L++G + S+G ++A+VF
Sbjct: 368 VVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHRAIKQAEPALVSGDPTIRSIGNYEKAYVF 427
Query: 280 EETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVS--TQYNK 337
+ +G CAAFL N + AV + F Y+LP SISILPDCKT FNT V T K
Sbjct: 428 KSKNGACAAFLSNYHVKSAVRIRFDGRHYDLPAWSISILPDCKTAVFNTATVKEPTLLPK 487
Query: 338 RSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS 397
S + +F W+ Y E + D++ +GL++Q+S D SDY WYT + S+
Sbjct: 488 MSPVMH-RF----AWQSYSEDTNSLDDSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSN 542
Query: 398 -----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALL 451
+ Q P L V S GH + FVNG GS +G +DN T V + QG+N ++L
Sbjct: 543 ERFLKSGQWPQLSVYSAGHSMQVFVNGRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISIL 602
Query: 452 SVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGL 505
S VGLP++G E GV + + ++ W YQVGL GE L +++ G
Sbjct: 603 SSAVGLPNNGDHFELWNVGVLGPVTLSGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGS 662
Query: 506 NKVLWSSIRSPTRQLTWYKT 525
+ V W+ T+ LTW+K
Sbjct: 663 SAVEWAGPGGGTQPLTWHKV 682
>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 578
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 247/583 (42%), Positives = 336/583 (57%), Gaps = 41/583 (7%)
Query: 188 IAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHL 246
+AF VA FI K GS+VNYYMYHGGTNFGRTA +T YD AP+DEYGL+R+PK+GHL
Sbjct: 1 LAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHL 60
Query: 247 KELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNI 306
KELH AIK+C + L++ V S+G Q+A V+ SG C+AFL N D A VLF N+
Sbjct: 61 KELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNV 120
Query: 307 SYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDN- 364
Y LP SISILPDC+ FNT +V Q S+ L D+ +WE Y E + + D+
Sbjct: 121 HYNLPPWSISILPDCRNAVFNTAKVGVQ---TSQMEMLPTDTKNFQWESYLEDLSSLDDS 177
Query: 365 TLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNG 418
+ GLL+QI+ +D SDY WY S + + P L +QS GH +H FVNG
Sbjct: 178 STFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHAVHIFVNG 237
Query: 419 EYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------H 472
+ +GSA G+ N FT + ++L GTN ALLSV VGLP+ G E G+ H
Sbjct: 238 QLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALH 297
Query: 473 RVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFR 528
+ + W YQVGL GE + + + W +++ P + LTW+KT F
Sbjct: 298 GLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKP-QPLTWHKTYFD 356
Query: 529 APAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAII 588
AP GN+P+AL+++ MGKG+ WVNG+SIGRYW +F T G+ S Y +
Sbjct: 357 APEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFAT--GDCSHCSYTGTYKPNKCQTGCG 414
Query: 589 KATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWL 647
+ T YHVPRA+LKP+ NLLV+ EE GNP +++ ++ VC V+ H P + +W
Sbjct: 415 QPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNW- 472
Query: 648 RHRQRGDTDIKKFGK-----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS 702
I+ +GK +P V C G+ I+ I FASFG P G C Y G CH++
Sbjct: 473 --------QIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAA 524
Query: 703 HSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
S ++ER C+GK+RC++ + + FG DPCP + K L V+A C
Sbjct: 525 TSYAILERKCVGKARCAVTISNSNFGKDPCPNVLKRLTVEAVC 567
>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 568
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 247/592 (41%), Positives = 334/592 (56%), Gaps = 54/592 (9%)
Query: 183 RSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREP 241
R A+DIAF VA FI K GS+VNYYMYHGGTNFGRTA F+ T Y AP+DEYGL+REP
Sbjct: 3 RPAEDIAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 62
Query: 242 KWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTV 301
KWGHL++LH AIKLC L++G V S+G Q++ VF +G CAAFL N D V
Sbjct: 63 KWGHLRDLHRAIKLCEPALVSGDPTVTSIGHYQQSHVFRSKAGACAAFLSNYDSGSYARV 122
Query: 302 LFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEK--WEEYREAI 359
+F I Y++P SISILPDCKT FNT R+ Q TS LK + K WE Y E
Sbjct: 123 VFNGIHYDIPPWSISILPDCKTTVFNTARIGAQ------TSQLKMEWAGKFSWESYNEDT 176
Query: 360 LNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS-----NAQAP-LDVQSHGHILH 413
+FD+ GL++QIS +D +DY WYT + + N P L V S GH +H
Sbjct: 177 NSFDDRSFTKVGLVEQISMTRDNTDYLWYTTYVNIGENEGFLKNGHYPVLTVNSAGHSMH 236
Query: 414 AFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV-- 471
++NG+ TG+ +G+ +N T +V L G+N ++LSV VGLP+ G E GV
Sbjct: 237 IYINGQLTGTIYGALENPKLTYTGSVKLWAGSNKISILSVAVGLPNIGGHFETWNTGVLG 296
Query: 472 ----HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTF 527
+ + + W YQ+GL GE L +++ G + V W S + LTWYKT+F
Sbjct: 297 PVTLSGLNEGKRDLSWQKWIYQIGLKGEALNLHTLSGSSSVEWGG-PSQKQSLTWYKTSF 355
Query: 528 RAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTS--------KGNPSQTQYAVNTV 579
APAGNDP+AL++ SMGKG+ W+NGQS+GRYW ++K S +G ++ + N
Sbjct: 356 NAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPAYKASGSCGGCDYRGTYNEKKCQSNCG 415
Query: 580 TSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSH 639
S YHVPR++L PTGNLLV+ EE G+P GI++ + VC +
Sbjct: 416 ESTQ--------RWYHVPRSWLNPTGNLLVVFEEWGGDPSGISMVRRKVESVCAEI---- 463
Query: 640 LPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSC 699
+ W + + +G+ SC G+K++ I FASFG P G C ++ G+C
Sbjct: 464 ----AEWQPNMD--NVHTGNYGRS-KAHLSCAPGQKMTNIKFASFGTPQGTCGAFSEGTC 516
Query: 700 HSSHSQGVVERA-----CIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQCR 746
H+ S E+ CIG+ C++ + FGGDPCPG K L V+A C
Sbjct: 517 HAHKSYDAFEKESLLQNCIGQQSCAVLVAPEVFGGDPCPGTMKKLAVEAICE 568
>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
Length = 589
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 247/551 (44%), Positives = 329/551 (59%), Gaps = 26/551 (4%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENEY +E G Y WAA+MAV TGVPW MCKQ+DAP PVI+ CNG C E
Sbjct: 42 QIENEYGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYC-E 100
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
F PN KP +WTE+W+ +Y +GG R +D+A+ VA FI GS+VNYYMYHGG
Sbjct: 101 NFT-PNENFKPKMWTENWSGWYTDFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGG 159
Query: 212 TNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL 270
TNFGRT++ I YD AP+DEYGL EPKW HLK LH AIK C L++ V L
Sbjct: 160 TNFGRTSSGLFIATSYDYDAPIDEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWL 219
Query: 271 GQLQ-EAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
G EA V+ + +CAAFL N D + A TV F N Y+LP S+SILPDCKTV FNT
Sbjct: 220 GNKNLEAHVYYVNTSICAAFLANYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTA 279
Query: 330 RVSTQ-YNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
V+ ++KR FD W+ Y E + D+ + A L +QI+ +D+SDY W
Sbjct: 280 TVNGHSFHKRMTPVETTFD----WQSYSEEPAYSSDDDSIIANALWEQINVTRDSSDYLW 335
Query: 388 YTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
Y + + S N Q P L + S GH+LH FVNG+ +G+ +G DN T +V+L
Sbjct: 336 YLTDVNISPSESFIKNGQFPTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNL 395
Query: 442 RQGTNDGALLSVTVGLPDSGAFLER---KVAGVHRVRVQDKSFTNCS---WGYQVGLIGE 495
+ G N +LLSV VGLP+ G E V G R++ D+ + S W Y+VGL GE
Sbjct: 396 KVGNNKISLLSVAVGLPNVGLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGE 455
Query: 496 KLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
L +++ G + + W+ S ++ LTWYKTTF AP+GNDP+AL++ SMGKGE W+N Q
Sbjct: 456 SLSLHTITGSSSIDWTQGSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQ 515
Query: 554 SIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLE 612
SIGR+W ++ + GN + YA + T YH+PR++L +GN+LV+LE
Sbjct: 516 SIGRHWPAY-IAHGNCDECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLE 574
Query: 613 EENGNPLGITV 623
E G+P GI++
Sbjct: 575 EWGGDPTGISL 585
>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
Length = 607
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 236/511 (46%), Positives = 304/511 (59%), Gaps = 46/511 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GG+DVI+TYVFWN HEP +G+Y F R D+++FIK +Q GLYV LRIG
Sbjct: 58 MWPDLIQKAKDGGVDVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+ FR+DN+P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y W ++MAV +TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 178 LSQIENEYGPVEWEIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E F PN KP +WTE+WT +Y +G R A+D+AF VA F+ GSYVNYYMYH
Sbjct: 238 -ENFS-PNKNYKPKMWTENWTGWYTDFGTAVPYRPAEDLAFSVARFVQNRGSYVNYYMYH 295
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRT++ I YD AP+DEYGL+ EPKWGHL++LH AIK C L++ V
Sbjct: 296 GGTNFGRTSSGLFIATSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVS 355
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
G+ E +++ + G CAAFL N D V F N Y+LP SISILPDCKT FNT
Sbjct: 356 WPGKNLEVHLYKTSFGACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNT 415
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREA-ILNFDNTLLRAEGLLDQISAAKDASDYF 386
+V RS T +N F+ W+ Y E + ++ A GLL+Q+S D SDY
Sbjct: 416 AKVRAPRVHRSMTPANSAFN----WQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYL 471
Query: 387 WYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
WY + + + N Q P L S GH+LH F+NG++ G+A+GS DN T N+V
Sbjct: 472 WYMTDVNISPNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVK 531
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGV 471
LR G N +LLSV VGL + G E+ GV
Sbjct: 532 LRVGNNKISLLSVAVGLSNVGVHYEKWNVGV 562
>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
Length = 705
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 255/611 (41%), Positives = 330/611 (54%), Gaps = 85/611 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAK KEGG DVI+TYVFWN HEP KGQY F R D+++F K + ++GL++ LRIG
Sbjct: 94 MWPSLIAKFKEGGADVIETYVFWNGHEPAKGQYYFEERFDLVKFAKLVAAEGLFLFLRIG 153
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+ +EW +GG P+WL D+ GI FR+DN+P+K
Sbjct: 154 PYACAEWNFGGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPII 213
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + + G Y+ WAA+MA+ TG+PWVMC+Q DAP +I+ CN C
Sbjct: 214 LQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYC 273
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS NKP+IWTEDW +Y WGG R A+D AF VA F + GS NYYMY
Sbjct: 274 -DGFK-PNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYF 331
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL--TGTQN 266
GGTNF RTA IT Y AP+DEYG++R+PKWGHLK+LH AIKLC L+ G+
Sbjct: 332 GGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVVGSPQ 391
Query: 267 VISLGQLQEAFVFE----ETSG-------VCAAFLVNNDERKAVTVLFRNISYELPRKSI 315
I LG +QEA V+ T+G +C+AFL N DE K +V SY LP S+
Sbjct: 392 YIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSV 451
Query: 316 SILPDCKTVAFNTERVSTQY------------NKRSKTSNLKFDS-----DEKWEEYREA 358
SILPDC+ VAFNT R+ Q + R K S L S W +E
Sbjct: 452 SILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKET 511
Query: 359 ILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGH 410
I + +G+L+ ++ KD SDY WYT R + ++S L +
Sbjct: 512 IGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRD 571
Query: 411 ILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG 470
+ FVNG+ GS G +L+ + L +G N+ LLS VGL + GAFLE+ AG
Sbjct: 572 VARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAG 627
Query: 471 VHRVRVQ-------DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTW 522
R +V D TN W YQVGL GE IY+ WS ++ + Q TW
Sbjct: 628 F-RGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQPFTW 686
Query: 523 YKTTFRAPAGN 533
YK G+
Sbjct: 687 YKNICNQSVGD 697
>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 592
Score = 431 bits (1107), Expect = e-117, Method: Compositional matrix adjust.
Identities = 214/495 (43%), Positives = 299/495 (60%), Gaps = 39/495 (7%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP LI +AKEGGL+ I+TY+FWN HEP+ G+Y+F GR D+I+++K IQ +Y +RIG
Sbjct: 66 VWPKLIERAKEGGLNTIETYIFWNAHEPEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N PYK
Sbjct: 126 PFIQAEWNHGGLPYWLREIDHIIFRANNDPYKKEMEKFVRFIVQKLKDAELFASQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ G Y+ WAA+MA+ TGVPW+MCKQ APG VI CNG C
Sbjct: 186 LTQIENEYGNIKKDHATDGDKYLEWAAQMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ NKP +WTE+WT ++ +G + +RSA+DIA+ V F AK GS VNYYMYH
Sbjct: 246 GDTWT-LRDKNKPMLWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYH 304
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+AP+DEYG+ +EPK+GHL++LH I+ + L G +
Sbjct: 305 GGTNFGRTGASYVLTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEI 364
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +FE +C +FL NN+ + TV+FR + +P +S+SIL CK V +NT
Sbjct: 365 LGHGYEAHIFELPEENLCLSFLSNNNTGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNT 424
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
+RV Q+N+RS ++ + +WE Y E I + +T +R + L+Q + KDASDY WY
Sbjct: 425 KRVFVQHNERSYHTSEVTSKNNQWEMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWY 484
Query: 389 TFRFHYNS------SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLR 442
T F S ++ + L V+S H + F N + G A GS F V L+
Sbjct: 485 TTSFRLESDDLPFRNDIRPVLQVKSSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLK 544
Query: 443 QGTNDGALLSVTVGL 457
G N LLS T+G+
Sbjct: 545 VGVNHVVLLSSTMGM 559
>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 641
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 244/584 (41%), Positives = 331/584 (56%), Gaps = 60/584 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDVI+TYVFW++HEP +GQYDF GR D+ F+K + GLYV LRIG
Sbjct: 60 MWPGLIQKAKDGGLDVIETYVFWDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 120 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPII 179
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ A+ G Y+ WAA MAV TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 180 LSQIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYC 239
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS KP +WTE+W+ ++ +GG R +D+AF VA F + G++ NYYMYH
Sbjct: 240 DQF--TPNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYH 297
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTN R++ F+ T Y AP+DEYGLVR+PKWGHL+++H AIKLC L+ +
Sbjct: 298 GGTNLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYT 357
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SLG EA V++ S VCAAFL N D + TV F Y LP S+SILPDCK V NT
Sbjct: 358 SLGPNVEAAVYKVGS-VCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNT 416
Query: 329 ERVSTQYN----KRSKTSNLKFDSD--------EKWEEYREAI-LNFDNTLLRAEGLLDQ 375
++++Q + ++SN+ D W E + + DN L +A GL++Q
Sbjct: 417 AQINSQTTGSEMRYLESSNVASDGSFVTPELAVSDWSYAIEPVGITKDNALTKA-GLMEQ 475
Query: 376 ISAAKDASDYFWYTFRFHYNS-----SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDN 430
I+ DASD+ WY+ + +Q+ L V S GH+L ++NG+ GSA GS +
Sbjct: 476 INTTADASDFLWYSTSITVKGDEPYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASS 535
Query: 431 VSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKS----FTNCS 485
+ + + L G N LLS TVGL + GAF + AG+ V++ + ++
Sbjct: 536 SLISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAE 595
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFR 528
W YQ+GL GE L +Y + S+ P L WYK +
Sbjct: 596 WTYQIGLRGEDLHLYDPSEASPEWVSANAYPINHPLIWYKVSME 639
>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
Length = 569
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 231/513 (45%), Positives = 296/513 (57%), Gaps = 52/513 (10%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G Y F R D+++F K + GLY+ LRIG
Sbjct: 59 MWPDLIKKAKEGGLDVIQTYVFWNGHEPSPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V G+VFR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGMVFRTDNEPFKIAMQKFTKKIVDMMKEEKLFETQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ G Y W A+MA+ TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 179 LSQIENEYGPMQWEMGAAGKAYSKWTAEMALGLSTGVPWIMCKQEDAPYPIIDTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PNS NKP +WTE+WT ++ +GG R +DIAF VA FI GS++NYYMY
Sbjct: 239 -EGFK-PNSDNKPKLWTENWTGWFTEFGGAIPNRPVEDIAFSVARFIQNGGSFMNYYMYX 296
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNF RTA F+ T Y AP+DEYGL+REPK+ HLKELH IKLC L++ + S
Sbjct: 297 GGTNFDRTAGVFIATSYDYDAPIDEYGLLREPKYSHLKELHKVIKLCEPALVSVDPTITS 356
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
LG QE VF+ + CAAFL N D A V+FR Y+LP S+SILPDCKT +NT
Sbjct: 357 LGDKQEIHVFKSKTS-CAAFLSNYDTSSAARVMFRGFPYDLPPWSVSILPDCKTEYYNTA 415
Query: 330 RVSTQYNKRSKTSNLKF---DSDEKWEEYREA--ILNFDNTLLRAEGLLDQISAAKDASD 384
++ R+ T +K + WE Y E N T ++ +GL++QIS +D +D
Sbjct: 416 KI------RAPTILMKMIPTSTKFSWESYNEGSPSSNEAGTFVK-DGLVEQISMTRDKTD 468
Query: 385 YFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
YFWY S + L + S GH LH FVNG G+++G+ N T
Sbjct: 469 YFWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQN 528
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV 471
+ L G N ALLS VGLP++G E G+
Sbjct: 529 IKLSVGINKLALLSTAVGLPNAGVHYETWNTGI 561
>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 713
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 255/619 (41%), Positives = 330/619 (53%), Gaps = 93/619 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRN--------DIIRFIKEIQSQG 52
MWPSLIAK KEGG DVI+TYVFWN HEP KGQY F R D+++F K + ++G
Sbjct: 94 MWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYYFEERFDLVKFAKIDLVKFAKLVAAEG 153
Query: 53 LYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK-------------------- 92
L++ LRIGP+ +EW +GG P+WL D+ GI FR+DN+P+K
Sbjct: 154 LFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLY 213
Query: 93 -----------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVI 141
IENEY I+ + + G Y+ WAA+MA+ TG+PWVMC+Q DAP +I
Sbjct: 214 SWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEII 273
Query: 142 NACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
+ CN C + FK PNS NKP+IWTEDW +Y WGG R A+D AF VA F + GS
Sbjct: 274 DTCNAFYC-DGFK-PNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGS 331
Query: 202 YVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPL 260
NYYMY GGTNF RTA IT Y AP+DEYG++R+PKWGHLK+LH AIKLC L
Sbjct: 332 LQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPAL 391
Query: 261 LT--GTQNVISLGQLQEAFVFE----ETSG-------VCAAFLVNNDERKAVTVLFRNIS 307
+ G+ I LG +QEA V+ T+G +C+AFL N DE K +V S
Sbjct: 392 IAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKS 451
Query: 308 YELPRKSISILPDCKTVAFNTERVSTQY------------NKRSKTSNLKFDS-----DE 350
Y LP S+SILPDC+ VAFNT R+ Q + R K S L S
Sbjct: 452 YSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSS 511
Query: 351 KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAP 402
W +E I + +G+L+ ++ KD SDY WYT R + ++S
Sbjct: 512 TWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPS 571
Query: 403 LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGA 462
L + + FVNG+ GS G +L+ + L +G N+ LLS VGL + GA
Sbjct: 572 LTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGA 627
Query: 463 FLERKVAGVHRVRVQ-------DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRS 515
FLE+ AG R +V D TN W YQVGL GE IY+ WS ++
Sbjct: 628 FLEKDGAGF-RGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQK 686
Query: 516 PTRQ-LTWYKTTFRAPAGN 533
+ Q TWYK G+
Sbjct: 687 DSVQPFTWYKNICNQSVGD 705
>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
Length = 580
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 225/595 (37%), Positives = 331/595 (55%), Gaps = 36/595 (6%)
Query: 164 IWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMI 223
+WTE+WT ++ +G + +RSA+DIA+ V F AK GS VNYYMYHGGTNFGRT A++++
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRTGASYVL 61
Query: 224 TGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFE-ET 282
TGYYD+AP+DEYG+ +EPK+GHL++LH I+ + L G + LG EA +FE
Sbjct: 62 TGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELPE 121
Query: 283 SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTS 342
+C +FL NN+ + TV+FR + +P +S+SIL CK V +NT+RV Q+++RS +
Sbjct: 122 EKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSFHT 181
Query: 343 NLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS------ 396
+ + +WE + E I + +T +R + L+Q + KD +DY WYT F S
Sbjct: 182 SDVTSKNNQWEMFSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRLESDDLPFR 241
Query: 397 SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVG 456
++ + L V+S H + F N + G A G+ F V L+ G N LLS T+G
Sbjct: 242 NDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHVVLLSSTMG 301
Query: 457 LPDSGAFLERKVAGVHRVRVQDKSFTNCS-----WGYQVGLIGEKLQIYSNLGLNKVLWS 511
+ DSG L G+ +Q + WG++ L GE +IYS GL KV W
Sbjct: 302 MKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYSEKGLGKVQWK 361
Query: 512 SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQ 571
+ R TWYK F P G+DP+ L++ SM KG +VNG+ +GRYWVS++T G PSQ
Sbjct: 362 PAEN-DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWVSYRTLAGTPSQ 420
Query: 572 TQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKV 631
YH+PR FLK NLLV+ EEE G P GI V T+ +
Sbjct: 421 A--------------------VYHIPRPFLKSKDNLLVIFEEEMGKPDGILVQTVTRDDI 460
Query: 632 CGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDC 691
C ++ + + +W + + ++ T+ +CP K I ++VFASFGNPDG C
Sbjct: 461 CLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTL--TCPPEKTIQEVVFASFGNPDGMC 518
Query: 692 ERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGIHKALLVDAQC 745
+ VG+CH+ +++ +VE+ C+GK C +P+ +G D C L V +C
Sbjct: 519 GNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 573
>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 830
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 281/808 (34%), Positives = 389/808 (48%), Gaps = 98/808 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L A+AK G+DVIQTY+FWN + P G++ S R D +RF++ Q GLYV RIG
Sbjct: 57 MWPELFARAKANGIDVIQTYLFWNTNVPTPGEFVMSDRFDYVRFVQLAQEAGLYVNFRIG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
PF+ +EWTYGGLP WL + I+FR ++P+
Sbjct: 117 PFVCAEWTYGGLPAWLRQIPDIMFRDYDQPWLQVAGEYITKTVQILKDNRLLAGQGGPII 176
Query: 92 --KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+IENEY E + + GP YV W ++A + W+MC Q DAP +I CN C
Sbjct: 177 LLQIENEYGGTE-SRYAGGPQYVEWCGQLAANLTDAAQWIMCSQPDAPANIIATCNAFYC 235
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ P +PS+WTE+W ++Q WG R AQD+A+ V + K GSY+NYYMYH
Sbjct: 236 DDFVP---HPGQPSMWTENWPGWFQKWGDPTPHRPAQDVAYAVTRYYIKGGSYMNYYMYH 292
Query: 210 GGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL-TGTQNV 267
GGTNF RTA IT YD A LDEYG+ EPK+ HL +HA + ++
Sbjct: 293 GGTNFERTAGGPFITTNYDYDASLDEYGMPNEPKYSHLGSMHAVLHDNEAIMMAVPAPKP 352
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
ISLG EA ++ + G C AFL NN+ + V V F +YELP S+S+L C T +N
Sbjct: 353 ISLGTNLEAHIYNSSVG-CVAFLSNNNNKTDVEVQFNGRTYELPAWSVSVLHGCVTAIYN 411
Query: 328 T----------------ERVSTQYNKRSKTSNLKFDSDEKWEEYREAIL--------NFD 363
T R S + R K + + R L
Sbjct: 412 TAVCRAHQRAPHDAACCARESRRVCDRLPPLRPKARAPCQSGRIRHLCLVVLTSIGPQAP 471
Query: 364 NTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGS 423
T + L+QI D +DY WY+ + +SS A L + + + +VNG++
Sbjct: 472 ATKYWNKTPLEQIDQTLDHTDYLWYSTSY-VSSSATYAQLSLPQITDVAYVYVNGKFVTV 530
Query: 424 AHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG-VHRVRVQDKSFT 482
+ NVS TV L G N +LS+T+GL + G L G + V + + T
Sbjct: 531 SWSG--NVS----ATVSLVAGPNTIDILSLTMGLDNGGDILSEYNCGLLGGVYLGSVNLT 584
Query: 483 NCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGND-PIALNLQ 541
W +Q G++GE+ I+ L KV W++ LTWYK++F P + P+AL+L
Sbjct: 585 ENGWWHQTGVVGERNAIFLPENLKKVAWTTPAVLNTGLTWYKSSFDVPRDSQAPLALDLT 644
Query: 542 SMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHF---CAIIKATNTYHVPR 598
MGKG WVNG ++GRYW + + Y T + H C + T+ YHVPR
Sbjct: 645 GMGKGYVWVNGHNLGRYWPTILATNWPCDVCDYR-GTYDAPHCKQGCNMPSQTH-YHVPR 702
Query: 599 AFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIK 658
+L+ N+LVLLEE GNP I + CG V +
Sbjct: 703 EWLQAENNVLVLLEEMGGNPSKIALVEREEYVSCGVVGEDYP------------------ 744
Query: 659 KFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
V C + I+ + FAS+G P G C Y GSCH+S+S +V C GK C
Sbjct: 745 --ADDLAVVLGCGTHQTIAGVDFASYGTPMGSCRSYQQGSCHASNSTEIVLSLCHGKQAC 802
Query: 719 SIPLLSRYFGGDPCPGI-HKALLVDAQC 745
SIP+ + F G+PCP + +K L V C
Sbjct: 803 SIPVSAAMF-GNPCPDVTNKRLAVQVAC 829
>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
gi|224029591|gb|ACN33871.1| unknown [Zea mays]
Length = 580
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 225/595 (37%), Positives = 330/595 (55%), Gaps = 36/595 (6%)
Query: 164 IWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMI 223
+WTE+WT ++ +G + +RSA+DIA+ V F AK GS VNYYMYHGGTNFGRT A++++
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRTGASYVL 61
Query: 224 TGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFE-ET 282
TGYYD+AP+DEYG+ +EPK+GHL++LH I+ + L G + LG EA +FE
Sbjct: 62 TGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELPE 121
Query: 283 SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTS 342
+C +FL NN+ + TV+FR + +P +S+SIL CK V +NT+RV Q+++RS +
Sbjct: 122 EKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSFHT 181
Query: 343 NLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS------ 396
+ + +WE E I + +T +R + L+Q + KD +DY WYT F S
Sbjct: 182 SDVTSKNNQWEMSSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRLESDDLPFR 241
Query: 397 SNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVG 456
++ + L V+S H + F N + G A G+ F V L+ G N LLS T+G
Sbjct: 242 NDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHVVLLSSTMG 301
Query: 457 LPDSGAFLERKVAGVHRVRVQDKSFTNCS-----WGYQVGLIGEKLQIYSNLGLNKVLWS 511
+ DSG L G+ +Q + WG++ L GE +IYS GL KV W
Sbjct: 302 MKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYSEKGLGKVQWK 361
Query: 512 SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQ 571
+ R TWYK F P G+DP+ L++ SM KG +VNG+ +GRYWVS++T G PSQ
Sbjct: 362 PAEN-DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWVSYRTLAGTPSQ 420
Query: 572 TQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKV 631
YH+PR FLK NLLV+ EEE G P GI V T+ +
Sbjct: 421 A--------------------VYHIPRPFLKSKDNLLVIFEEEMGKPDGILVQTVTRDDI 460
Query: 632 CGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDC 691
C ++ + + +W + + ++ T+ +CP K I ++VFASFGNPDG C
Sbjct: 461 CLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTL--TCPPEKTIQEVVFASFGNPDGMC 518
Query: 692 ERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGIHKALLVDAQC 745
+ VG+CH+ +++ +VE+ C+GK C +P+ +G D C L V +C
Sbjct: 519 GNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 573
>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
Length = 706
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 252/664 (37%), Positives = 351/664 (52%), Gaps = 54/664 (8%)
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
IENE+ +E ++ ++G YV W A++A ++ PW+MC+Q DAP P+IN CNG C +
Sbjct: 1 IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYC-DQ 59
Query: 153 FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 212
FK PN+ N P +WTE W +++ WG + R+A+D+AF VA F GS NYYMYHGGT
Sbjct: 60 FK-PNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGT 118
Query: 213 NFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLG 271
NFGR+A IT YD APLDEYG + +PKWGHLK+LH I+ + L G I G
Sbjct: 119 NFGRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTG 178
Query: 272 QLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERV 331
A + G + F N E + F+ Y +P S+++LPDCKT +NT +V
Sbjct: 179 HSTTATSY-TYKGKSSCFF-GNPENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKV 236
Query: 332 STQYNKRSKTSNL--KFDSDEKWEEYREAIL------NFDNTLLRAEGLLDQISAAKDAS 383
+TQ R +L K KW+ E I + + + A L+DQ D+S
Sbjct: 237 NTQTTIREMVPSLVGKHKKPLKWQWRNEKIEHLTHEGDISGSAITANSLIDQKMVTNDSS 296
Query: 384 DYFWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
DY WY FH N ++ + L V++ GHILHAFVN ++ G+ G + SFTL V
Sbjct: 297 DYLWYLTGFHLNGNDPLFGKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKV 356
Query: 440 -HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH---RVRVQDKSFTNCS---WGYQVGL 492
+LR G N ALLS TVGLP+ GA+ E G++ + K+ + S W Y+VGL
Sbjct: 357 RNLRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGKTIRDLSTNEWIYKVGL 416
Query: 493 IGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GEK + + + W S P Q TWYKT+F P G + + ++L MGKG+AWVN
Sbjct: 417 DGEKYEFFDPDHKFRKPWLSNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVN 476
Query: 552 GQSIGRYWVSF-KTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLKP-TGNLL 608
G+SIGRYW S+ T G S Y S K T YH+PR+++ N L
Sbjct: 477 GKSIGRYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTL 536
Query: 609 VLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQP 668
+L EE G PL I + T ++KVC V G K ++
Sbjct: 537 ILFEEFGGMPLNIEIKTTRVKKVCAKV-----------------------DLGSK--LEL 571
Query: 669 SCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
+C + + +I+F FGNP G+C + GSCHSS + V+E+ C+ K +CSI + G
Sbjct: 572 TCH-DRTVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKLG 630
Query: 729 GDPC 732
C
Sbjct: 631 LTGC 634
>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
Length = 713
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 245/608 (40%), Positives = 325/608 (53%), Gaps = 76/608 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TY+FWN HEP + QY+F G D++RF KEIQ+ G+Y LRIG
Sbjct: 61 MWPDLIKKAKEGGLDAIETYIFWNGHEPHRRQYNFEGNYDVVRFFKEIQNAGMYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I EW YGGLP WL D+ G+ FR N+P+
Sbjct: 121 PYICGEWNYGGLPAWLRDIPGMQFRLHNEPFENEMETFTTLIVNKMKDSKMFAEQGGPII 180
Query: 92 --KIENEYQTIEPAF--HEKGPPYVLWAAKMAVDFHTGVPWVMCKQ-DDAPGPVINACNG 146
+IENEY I ++ Y+ W A MA + GVPW+MC+Q DD P V+N CNG
Sbjct: 181 LAQIENEYGNIMGKLNNNQSASEYIHWCADMANKQNVGVPWIMCQQDDDVPHNVVNTCNG 240
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
C + F PN P IWTE+WT +++ W + RSA+DIAF VA+F K GS NYY
Sbjct: 241 FYCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQKRGSLQNYY 298
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLKELH+ +K + L+ G
Sbjct: 299 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNLRQPKYGHLKELHSVLKSMEKTLVHGEY 358
Query: 266 NVISLGQ--LQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKT 323
+ G + + +S A F+ N + K V V ++ LP S+SILPDCKT
Sbjct: 359 FDTNYGDNITVTKYTLDSSS---ACFINNRFDDKDVNVTLDGATHLLPAWSVSILPDCKT 415
Query: 324 VAFNTERVSTQYNKRSKTSNLKFDSDE--KWEEYREAILNF---DNTLLRAEGLLDQISA 378
VAFN+ ++ TQ + K N E KW E + F + R LL+QI
Sbjct: 416 VAFNSAKIKTQTSVMVKKPNTAEQEQESLKWSWMPENLSPFMTDEKGNFRKNELLEQIVT 475
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNT 438
+ D SDY WY ++ + L V + GH L+AFVNG+ G H + + F L +
Sbjct: 476 STDQSDYLWYRTSLNHKGEGSYK-LYVNTTGHELYAFVNGKLIGKNHSADGDFVFQLESP 534
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQ 498
V L G N +LLS TVGL + G E+ G+ G V LI
Sbjct: 535 VKLHDGKNYISLLSATVGLKNYGPSFEKMPTGIV--------------GGPVKLIDSNG- 579
Query: 499 IYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
+ + L+ WS YK TF AP+G DP+ ++L + KG AWVNG ++GRY
Sbjct: 580 --TAIDLSNSSWS-----------YKATFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGRY 626
Query: 559 WVSFKTSK 566
W S+ ++
Sbjct: 627 WPSYTAAE 634
>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
Length = 2260
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 216/460 (46%), Positives = 279/460 (60%), Gaps = 44/460 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP KGQYDF GR D+++F+K + GLYV LRIG
Sbjct: 52 MWPDLIQKSKDGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ SEW YGG P+WLH + GI FR+DN+P+K
Sbjct: 112 PYVCSEWNYGGFPLWLHFIPGIKFRTDNEPFKVEMKRFTTKIVDLMKQEKLYASQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGP-VINACNGMR 148
IENEY I+ A+ G Y+ WAAKMA TGVPWVMC+Q DAP P VIN CNG
Sbjct: 172 LSQIENEYGDIDSAYGSAGKSYINWAAKMATSLDTGVPWVMCQQADAPDPIVINTCNGFY 231
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + PNS KP +WTE+W+++Y ++GG R +D+AF VA F + G++ NYYMY
Sbjct: 232 CDQF--TPNSKTKPKLWTENWSAWYLLFGGGFPHRPVEDLAFAVARFFQRGGTFQNYYMY 289
Query: 209 HGGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNF R T F+ T Y AP+DEYG++R+PKWGHLK++H AIKLC L+ +
Sbjct: 290 HGGTNFDRSTGGPFIATSYDFDAPIDEYGVIRQPKWGHLKDVHKAIKLCEEALIAAEPKI 349
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
LG EA V+ +T VCAAFL N D + TV F SY LP S+SILPDCK V N
Sbjct: 350 TYLGPNLEAAVY-KTGSVCAAFLANVDAKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLN 408
Query: 328 TERVSTQYNKRS-KTSNLKFD------SDEKWEEYREAILNFDNTLLRAEGLLDQISAAK 380
T ++++ + T +LK D S KW E + + +L GLL+QI+
Sbjct: 409 TAKINSASTISNFVTESLKEDISSSETSRSKWSWINEPVGISKDDILSKTGLLEQINITA 468
Query: 381 DASDYFWYTFRFHY-NSSNAQAPLDVQSHGHILHAFVNGE 419
D SDY WY+ + +Q L ++S GH LHAF+NG+
Sbjct: 469 DRSDYLWYSLSVDLKDDPGSQTVLHIESLGHALHAFINGK 508
Score = 204 bits (518), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 130/336 (38%), Positives = 181/336 (53%), Gaps = 21/336 (6%)
Query: 422 GSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVR 475
GS G+ + + + G N LLS+TVGL + GAF + AG+ ++
Sbjct: 1933 GSQTGNKEKPKLNEDIPITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLK 1992
Query: 476 VQDKSFTNCS--WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTWYKTTFRAPAG 532
+K+ S W YQVGL GE L + S G + S P +Q L WYKT F AP+G
Sbjct: 1993 NGNKTLDLSSRKWTYQVGLKGEDLGLSS--GSSGAWNSKTTFPKKQPLIWYKTNFDAPSG 2050
Query: 533 NDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQT--QYAVNTVTSIHFCAIIKA 590
++P+ ++ MGKGEAWVNGQSIGRYW ++ S + + + T T H +
Sbjct: 2051 SNPVVIDFTGMGKGEAWVNGQSIGRYWPTYVASNVDCTDSCNYRGPFTQTKCHMNCGKPS 2110
Query: 591 TNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHR 650
YHVP++FLKP GN LVL EE G+P I+ T I VC HV++SH P + W +
Sbjct: 2111 QTLYHVPQSFLKPNGNTLVLFEESGGDPTQISFATKQIGSVCAHVSDSHPPQIDLWNQDT 2170
Query: 651 QRGDTDIKKFGKKPTVQPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVE 709
+ G K G P + +CP + IS I FAS+G P G C + G C S+ + +V+
Sbjct: 2171 ESGG----KVG--PALLLNCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKTLSIVK 2224
Query: 710 RACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ACIG CSI + + F GDPC G+ K+L V+A C
Sbjct: 2225 KACIGSRSCSIGVSTDTF-GDPCKGVPKSLAVEATC 2259
>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
Length = 514
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 221/465 (47%), Positives = 274/465 (58%), Gaps = 42/465 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLDVIQTYVFWN HEP G+Y F G D++RFIK ++ GLYV LRIG
Sbjct: 51 MWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIG 110
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR++N P+K
Sbjct: 111 PYVCAEWNFGGFPVWLKYIPGIAFRTNNGPFKAYMQRFTKKIVDMMKAEGLFESQGGPII 170
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQDDAP P+IN+CNG C
Sbjct: 171 LSQIENEYGPMEYELGAAGRAYSQWAAQMAVGLGTGVPWVMCKQDDAPDPIINSCNGFYC 230
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN KP +WTE WT ++ +GG R +D+AF VA FI K GS++NYYMYH
Sbjct: 231 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPVEDLAFSVARFIQKGGSFINYYMYH 288
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFGRTA F+ T Y APLDEYGLVR+PKWGHLK+LH AIKLC L++G +V+
Sbjct: 289 GGTNFGRTAGGPFIATSYDYDAPLDEYGLVRQPKWGHLKDLHRAIKLCEPALVSGDPSVM 348
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG+ QEA VF+ G CAAFL N + R V F N+ Y LP SISILPDCK +NT
Sbjct: 349 PLGRFQEAHVFKSKYGHCAAFLANYNPRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNT 408
Query: 329 ERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
RV Q + R K + W+ Y EA + GL++QI+ +D SDY W
Sbjct: 409 ARVGAQ-SARMKMVPVPIHGAFSWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLW 467
Query: 388 YTFRFHYN------SSNAQAPLDVQSHGHILHAFVNGEYTGSAHG 426
Y+ + + L V S GH LH FVN + + + G
Sbjct: 468 YSTDVKIDPDEGFLKTGKYPTLTVLSAGHALHVFVNDQLSVARDG 512
>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 206/430 (47%), Positives = 276/430 (64%), Gaps = 29/430 (6%)
Query: 207 MYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQN 266
MYHGGTNFGRT++++ ITGYYDQAPLDEYGL+R+PK+GHLKELHAAIK + PLL G Q
Sbjct: 1 MYHGGTNFGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQT 60
Query: 267 VISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAF 326
++SLG +Q+A+VFE+ + C AFLVNND KA + FRN +Y L KSI IL +CK + +
Sbjct: 61 ILSLGPMQQAYVFEDANNGCVAFLVNNDA-KASQIQFRNNAYSLSPKSIGILQNCKNLIY 119
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
T +V+ + N R T F+ + W +RE I F T L+ LL+ + KD +DY
Sbjct: 120 ETAKVNVKMNTRVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYL 179
Query: 387 WYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
WYT F +S + +S GH++H FVN GS HGS D L+ V L G N
Sbjct: 180 WYTSSFKLDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQN 239
Query: 447 DGALLSVTVGLPDSGAFLERKVAGVHRVRV-----QDKSFTNCSWGYQVGLIGEKLQIYS 501
+ ++LS VGLPDSGA++ER+ G+ +V++ + + WGY VGL+GEK+++Y
Sbjct: 240 NISILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQ 299
Query: 502 NLGLNKVLWSSIRS---PTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
LN+V WS ++ R L WYKTTF P G+ P+ L++ SMGKGE WVNG+SIGRY
Sbjct: 300 WKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRY 359
Query: 559 WVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
WVSF T G PSQ+ YH+PRAFLKP+GNLLV+ EEE G+P
Sbjct: 360 WVSFLTPAGQPSQS--------------------IYHIPRAFLKPSGNLLVVFEEEGGDP 399
Query: 619 LGITVDTIAI 628
LGI+++TI++
Sbjct: 400 LGISLNTISV 409
>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
Length = 532
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 225/534 (42%), Positives = 318/534 (59%), Gaps = 25/534 (4%)
Query: 118 MAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWG 177
MAV + GVPW+MC+Q DAP VI+ CNG C + PN+P+KP IWTE+W +++ +G
Sbjct: 1 MAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQF--TPNTPDKPKIWTENWPGWFKTFG 58
Query: 178 GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYG 236
G+ R A+D+A+ VA F K GS NYYMYHGGTNFGRT+ IT YD +AP+DEYG
Sbjct: 59 GRDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYG 118
Query: 237 LVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDER 296
L R PKWGHLK+LH AI L L++G +LG EA V+ ++SG CAAFL N D++
Sbjct: 119 LPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDK 178
Query: 297 KAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNK-RSKTSNLKFDSDEKWEEY 355
V+FRN SY LP S+SILPDCKT FNT +V+++ +K +LK S KWE +
Sbjct: 179 NDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSSKVEMLPEDLKSSSGLKWEVF 238
Query: 356 REAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNA-----QAP-LDVQSHG 409
E + L+D I+ KD +DY WYT + + A +P L ++S G
Sbjct: 239 SEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFIESKG 298
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
H LH F+N EY G+A G+ +V F L+ V L+ G N+ LLS+TVGL ++G+F E A
Sbjct: 299 HTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYEWVGA 358
Query: 470 GVHRVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTW 522
G+ V ++ + TN W Y++G+ GE L+++ V W+ P ++ LTW
Sbjct: 359 GLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQPLTW 418
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW--VSFKTSKGNP--SQTQYAVNT 578
YK P+G++P+ L++ SMGKG AW+NG+ IGRYW ++ K S + + Y
Sbjct: 419 YKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDYRGKF 478
Query: 579 VTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKV 631
+ + + YHVPR++ K +GN LV+ EE+ GNP+ I ++ RKV
Sbjct: 479 MPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKI---KLSKRKV 529
>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 621
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 240/653 (36%), Positives = 340/653 (52%), Gaps = 57/653 (8%)
Query: 118 MAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWG 177
MA GVPW+MC+Q +AP P++ CNG C + P +P+ P +WTE+WT +++ WG
Sbjct: 1 MANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQY--EPTNPSTPKMWTENWTGWFKNWG 58
Query: 178 GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYG 236
GK R+A+D+AF VA F G++ NYYMYHGGTNFGR A IT YD APLDE+G
Sbjct: 59 GKHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFG 118
Query: 237 LVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDER 296
+ +PKWGHLK+LH +K + L G + I LG +A ++ G + F+ N +
Sbjct: 119 NLNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEG-SSCFIGNVNAT 177
Query: 297 KAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKW--EE 354
V F+ Y +P S+S+LPDC A+NT +V+TQ + ++ S+ + W E
Sbjct: 178 ADALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPES 237
Query: 355 YREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNA----QAPLDVQSHGH 410
++ IL L+ A+GL+DQ DASDY WY R H + + L V S+ H
Sbjct: 238 AQKMILKGSGDLI-AKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHSNAH 296
Query: 411 ILHAFVNGEYTGSAHGSHDNVSFTLRNTV-HLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
+LHA+VNG+Y G+ + V HL GTN +LLSV+VGL + G F E
Sbjct: 297 VLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPT 356
Query: 470 GVH---------RVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT-RQ 519
G++ +K + W Y++GL G +++S + W++ + PT R
Sbjct: 357 GINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWANEKLPTGRM 416
Query: 520 LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTS-KGNPSQTQYAVNT 578
LTWYK F+AP G +P+ ++L +GKGEAW+NGQSIGRYW SF +S G + Y
Sbjct: 417 LTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDKCDY--RG 474
Query: 579 VTSIHFCAIIKATNT---YHVPRAFLKPTG-NLLVLLEEENGNPLGITVDTIAIRKVCGH 634
CA + T YHVPR+FL +G N + L EE GNP + T+ + VC
Sbjct: 475 AYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCAR 534
Query: 635 VTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERY 694
H V+ SC + IS + FASFGNP G C +
Sbjct: 535 A-------------HEHN------------KVELSCH-NRPISAVKFASFGNPLGHCGSF 568
Query: 695 AVGSCHSSHSQG-VVERACIGKSRCSIPLLSRYFGGD-PCPGIHKALLVDAQC 745
AVG+C V + C+GK C++ + S FG C K L V+ +C
Sbjct: 569 AVGTCQGDKDAAKTVAKECVGKLNCTVNVSSDTFGSTLDCGDSPKKLAVELEC 621
>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
Length = 740
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 245/611 (40%), Positives = 312/611 (51%), Gaps = 105/611 (17%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLIAK KEGG DVI+TYVFWN HEP KGQY F R D ++F K +
Sbjct: 149 MWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYYFEERFDPVKFEKHV------------ 196
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
G P+WL D+ GI FR+DN+P+K
Sbjct: 197 --------IFGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPII 248
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + + G Y+ WAA+MA+ TG+PWVMC+Q DAP +I+ CN C
Sbjct: 249 LQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYC 308
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ FK PNS NKP+IWTEDW +Y WGG R A+D AF VA F + GS NYYMY
Sbjct: 309 -DGFK-PNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYF 366
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT--GTQN 266
GGTNF RTA IT Y AP+DEYG++R+PKWGHLK+LH AIKLC L+ G+
Sbjct: 367 GGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQ 426
Query: 267 VISLGQLQEAFVFE----ETSG-------VCAAFLVNNDERKAVTVLFRNISYELPRKSI 315
I LG +QEA V+ T+G +C+AFL N DE K +V SY LP S+
Sbjct: 427 YIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSV 486
Query: 316 SILPDCKTVAFNTERVSTQ------------YNKRSKTSNLKFDS-----DEKWEEYREA 358
SILPDC+ VAFNT R+ Q + R K S L S W +E
Sbjct: 487 SILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKET 546
Query: 359 ILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGH 410
I + +G+L+ ++ KD SDY WYT R + ++S L +
Sbjct: 547 IGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRD 606
Query: 411 ILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG 470
+ FVNG+ GS G +L+ + L +G N+ LLS VGL + GAFLE+ AG
Sbjct: 607 VARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAG 662
Query: 471 VHRVRVQ-------DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ-LTW 522
R +V D TN W YQVGL GE IY+ WS ++ + Q TW
Sbjct: 663 F-RGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQPFTW 721
Query: 523 YKTTFRAPAGN 533
YK G+
Sbjct: 722 YKNICNQSVGD 732
>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
Length = 338
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/306 (61%), Positives = 217/306 (70%), Gaps = 56/306 (18%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI+KAK GGLDVI+TYVFWNLHEP+ GQYDF GR++I+RFI+EIQ+ GLY +RIG
Sbjct: 58 MWPSLISKAKHGGLDVIETYVFWNLHEPRHGQYDFKGRHNIVRFIREIQAHGLYAFIRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFIE+EWTYGGLP WLHDV GIV+RSDN+P+K
Sbjct: 118 PFIEAEWTYGGLPFWLHDVPGIVYRSDNEPFKYHMQNFTTKIVNLFKSEGLYAPQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY+ E AFHEKGPPYV WAA MAV TGVPWVMCKQDDAP PVIN CNG C
Sbjct: 178 LQQIENEYKNAERAFHEKGPPYVQWAAAMAVGLQTGVPWVMCKQDDAPDPVINTCNGRTC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
GETF GPNSPNKP+IWT++WTS KNGS+VNYYMYH
Sbjct: 238 GETFVGPNSPNKPAIWTDNWTSL-------------------------KNGSFVNYYMYH 272
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT +AF++T YYD+AP+DEYGL+R+PKWGHLK+LH+ IK CS+ LL G +V
Sbjct: 273 GGTNFGRTGSAFVLTSYYDEAPIDEYGLIRQPKWGHLKQLHSVIKSCSQTLLHGVISVSP 332
Query: 270 LGQLQE 275
LGQ QE
Sbjct: 333 LGQQQE 338
>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
Length = 500
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 217/505 (42%), Positives = 289/505 (57%), Gaps = 19/505 (3%)
Query: 132 KQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFH 191
KQDDAP PVIN CNG C + PN KPS+WTE WT ++ +GG R +D+AF
Sbjct: 1 KQDDAPDPVINTCNGFYC--DYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFA 58
Query: 192 VALFIAKNGSYVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELH 250
VA FI K GS+VNYYMYHGGTNFGRTA F+ T Y AP+DE+GL+R+PKWGHL++LH
Sbjct: 59 VARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLH 118
Query: 251 AAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYEL 310
AIK L++ + S+G ++A+VF+ +G CAAFL N AV V F Y L
Sbjct: 119 RAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNL 178
Query: 311 PRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAE 370
P SISILPDCKT FNT V ++F W+ Y E + ++ +
Sbjct: 179 PAWSISILPDCKTAVFNTATVKEPTLMPKMNPVVRF----AWQSYSEDTNSLSDSAFTKD 234
Query: 371 GLLDQISAAKDASDYFWYTFRFHYNSSN---AQAP-LDVQSHGHILHAFVNGEYTGSAHG 426
GL++Q+S D SDY WYT + +++ Q+P L V S GH + FVNG+ GS +G
Sbjct: 235 GLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVNGKSYGSVYG 294
Query: 427 SHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKS 480
+DN T V + QG+N ++LS VGLP+ G E GV + K
Sbjct: 295 GYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGTKD 354
Query: 481 FTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNL 540
++ W YQVGL GE L +++ G + V W + LTW+K F APAGNDP+AL++
Sbjct: 355 LSHQKWTYQVGLKGETLGLHTVTGSSAVEWGG-PGGYQPLTWHKAFFNAPAGNDPVALDM 413
Query: 541 QSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAF 600
SMGKG+ WVNG +GRYW S+K S G + + YHVPR++
Sbjct: 414 GSMGKGQLWVNGHHVGRYW-SYKASGGCGGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSW 472
Query: 601 LKPTGNLLVLLEEENGNPLGITVDT 625
LKP GNLLV+LEE G+ G+++ T
Sbjct: 473 LKPGGNLLVVLEEYGGDLAGVSLAT 497
>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
Length = 585
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 235/583 (40%), Positives = 302/583 (51%), Gaps = 53/583 (9%)
Query: 207 MYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG-T 264
MY GGTNFGRT+ F IT Y APLDEYGL EPKWGHLK+LHAAIKLC L+
Sbjct: 1 MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADA 60
Query: 265 QNVISLGQLQEAFVFE---ETSG-VCAAFLVNNDERKAVTVLFRNISYELPRKSISILPD 320
LG QEA ++ ET G VCAAFL N DE K+ V F SY LP S+SILPD
Sbjct: 61 PQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPD 120
Query: 321 CKTVAFNTERVSTQYNKRS------------------KTSNLKFDSDEKWEEYREAILNF 362
C+ VAFNT +V Q + ++ + N+ + S + W +E I +
Sbjct: 121 CRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYIS-KSWMALKEPIGIW 179
Query: 363 DNTLLRAEGLLDQISAAKDASDYFWYTFRFH--------YNSSNAQAPLDVQSHGHILHA 414
+GLL+ ++ KD SDY W+ R + + + + + S +L
Sbjct: 180 GENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRV 239
Query: 415 FVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG---- 470
FVN + GS G V QG ND LL+ TVGL + GAFLE+ AG
Sbjct: 240 FVNKQLAGSIVGHW----VKAVQPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGK 295
Query: 471 --VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTT 526
+ + D + SW YQVGL GE +IY+ K WS++ + WYKT
Sbjct: 296 AKLTGFKNGDLDLSKSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKTY 355
Query: 527 FRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHF 584
F PAG DP+ LNL+SMG+G+AWVNGQ IGRYW G Y A N+
Sbjct: 356 FDPPAGTDPVVLNLESMGRGQAWVNGQHIGRYWNIISQKDGCDRTCDYRGAYNSDKCTTN 415
Query: 585 CAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPL 643
C K T T YHVPR++LKP+ NLLVL EE GNP I+V T+ +CG V+ SH PPL
Sbjct: 416 CG--KPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPL 473
Query: 644 SSW-LRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSS 702
W G I P V C G IS I FAS+G P G C+ +++G CH+S
Sbjct: 474 RKWSTPDYINGTMSINSVA--PEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHAS 531
Query: 703 HSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+S +V AC G++ C I + + F DPC G K L V ++C
Sbjct: 532 NSLSIVSEACKGRNSCFIEVSNTAFISDPCSGTLKTLAVMSRC 574
>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
vinifera]
Length = 563
Score = 365 bits (936), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 204/524 (38%), Positives = 287/524 (54%), Gaps = 51/524 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AKEGG+DVI+TYVF N HE Y F G D+++F+K +Q G+Y+ L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ +EW +GG+PIWLH V +F++++KP+K
Sbjct: 61 PFVATEWNFGGVPIWLHYVPRTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 120
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY + + + G PYV+WAA M + + GVPW+MC+ + P+IN CN C
Sbjct: 121 LTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFYC 180
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNSP+K +WTE+W +++ +G R +DIAF VALF NYYMYH
Sbjct: 181 DQF--TPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFFFPKSX--NYYMYH 236
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GGTNFG T+ F+ T Y AP+DEYGL R PK GHLKEL AIK C LL G +
Sbjct: 237 GGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPINL 296
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG QE V+ ++ G AAF+ N DE++ ++F+N SY +P S+SILPDCK V FNT
Sbjct: 297 XLGPSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVVFNT 356
Query: 329 ERVSTQYNKRS------KTSNLKFDSDEK---WEEYREAILNFDNTLLRAEGLLDQISAA 379
+V +Q ++ + S + + D K W+ + E + G +D I+
Sbjct: 357 AKVVSQISQVEMVLEDLQPSLVPSNKDLKGLXWKTFVEKAGIWGEADFVKNGFVDHINTT 416
Query: 380 KDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
KD +D WYT S +Q L V+S GH LHAFVN + GSA G+ + F
Sbjct: 417 KDTTDXLWYTVSITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNGSHSPF 476
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ 477
+ L+ G N+ +LS+TVGL + F E A + V+++
Sbjct: 477 KFECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLTSVKIK 520
>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
Length = 473
Score = 354 bits (909), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 199/476 (41%), Positives = 274/476 (57%), Gaps = 20/476 (4%)
Query: 164 IWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FM 222
+WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYHGGTNF RT+ F+
Sbjct: 1 MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFI 60
Query: 223 ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET 282
T Y AP+DEYGL+R+PKWGHL++LH AIK L++G + SLG ++A+VF+ +
Sbjct: 61 ATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSS 120
Query: 283 SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTS 342
G CAAFL N A V+F Y+LP SIS+LPDCK FNT VS + S +
Sbjct: 121 GGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVS----EPSAPA 176
Query: 343 NLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS----- 397
+ W+ Y EA + D +GL++Q+S D SDY WYT + NS+
Sbjct: 177 RMSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLK 236
Query: 398 NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVG 456
+ Q P L + S GH L FVNG+ G+ +G +D+ T V + QG+N ++LS VG
Sbjct: 237 SGQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVG 296
Query: 457 LPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLW 510
LP+ G E GV + + ++ W YQ+GL GE L + S G + V W
Sbjct: 297 LPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEW 356
Query: 511 SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPS 570
S + + LTW+K F AP+G+ P+AL++ SMGKG+AWVNG+ IGRYW S+K S
Sbjct: 357 GSA-AGKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSGCG 414
Query: 571 QTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDT 625
YA + T + YHVPR++L P+GNLLV+LEE G+ G+ + T
Sbjct: 415 GCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVT 470
>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 655
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 208/532 (39%), Positives = 288/532 (54%), Gaps = 48/532 (9%)
Query: 236 GLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDE 295
GL+REPKWGHLKELH AIKLC L+ G V SLG Q+A VF ++ C AFL N D+
Sbjct: 149 GLLREPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDK 208
Query: 296 RKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEY 355
V F + Y+LP SISILPDCKT +NT V +Q ++ +++ W+ Y
Sbjct: 209 VSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQISQM----KMEWAGGFTWQSY 264
Query: 356 REAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS-----SNAQAP-LDVQSHG 409
E I + + GLL+QI+ +D +DY WYT SN + P L V S G
Sbjct: 265 NEDINSLGDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVMSAG 324
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVA 469
H LH FVNG+ TG+ +GS ++ T V L G+N + LS+ VGLP+ G E A
Sbjct: 325 HALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFETWNA 384
Query: 470 GVHRVRVQD------KSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ--LT 521
G+ D + T W Y+VGL GE L ++S G + V W P ++ L+
Sbjct: 385 GILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEALSLHSLSGSSSVEWG---EPVQKQPLS 441
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTS--------KGNPSQTQ 573
WYK F AP G++P+AL++ SMGKG+ W+NGQ IGRYW +K S +G + +
Sbjct: 442 WYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGTCGICDYRGEYDEKK 501
Query: 574 YAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCG 633
N S + YHVPR++L PTGNLLV+ EE G+P GI++ +C
Sbjct: 502 CQTNCGDS--------SQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRIAGSICA 553
Query: 634 HVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCER 693
V+ P +++W R +G +K V C G+K++ I FASFG P G C
Sbjct: 554 DVSEWQ-PSMANW---RTKGY-------EKAKVHLQCDHGRKMTHIKFASFGTPQGSCGS 602
Query: 694 YAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
Y+ G CH+ S + ++CIG+ RC + ++ FGGDPCPG K +V+A C
Sbjct: 603 YSEGGCHAHKSYDIFWKSCIGQERCGVSVVPDAFGGDPCPGTMKRAVVEAIC 654
>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 493
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 183/425 (43%), Positives = 246/425 (57%), Gaps = 39/425 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEPQ+ +YDFSGR D I+F + IQ GLYV +RIG
Sbjct: 52 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R++N+ YK
Sbjct: 112 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 171
Query: 93 ---IENEY-QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
IENEY + PA+ + G Y+ W A+MA + GVPW+MC+Q DAP P+IN CNG
Sbjct: 172 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFY 231
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + F PN+P P ++TE+W +++ WG K R+A+D+AF VA F G + NYYMY
Sbjct: 232 C-DNFT-PNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMY 289
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
HGGTNFGRT+ IT YD APLDEYG + +PKWGHLK+LHA+IKL + L GT
Sbjct: 290 HGGTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNGTHTN 349
Query: 268 ISLG-QLQEAFVFEETSGVCAAFLVNNDERKAVTV-LFRNISYELPRKSISILPDCKTVA 325
+ G + F T+G FL N D + T+ L + Y +P S+SIL C
Sbjct: 350 QNFGSSVTLTKFFNPTTGERFCFLSNTDGKNDATIDLQADGKYFVPAWSVSILDGCNKEV 409
Query: 326 FNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF--DNTLLRAEGLLDQISAAKDAS 383
+NT +V++Q + K N K ++ W E + + N A L+Q D S
Sbjct: 410 YNTAKVNSQTSMFVKEQNEKENAQLSWAWAPEPMKDTLQGNGKFAANLFLEQKRVTADFS 469
Query: 384 DYFWY 388
DYFWY
Sbjct: 470 DYFWY 474
>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
Length = 446
Score = 345 bits (884), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 170/363 (46%), Positives = 233/363 (64%), Gaps = 33/363 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AK GGL+ I+TYVFWN HEP+ G+Y F GR D+IRF+ I+ +Y +RIG
Sbjct: 66 MWDKLVKTAKMGGLNTIETYVFWNGHEPEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIG 125
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL ++ I+FR++N+P+K
Sbjct: 126 PFIQAEWNHGGLPYWLREIGHIIFRANNEPFKREMEKFVRFIVQKLKDAEMFAPQGGPII 185
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ +G Y+ WAA+MA+ GVPWVMCKQ APG VI CNG C
Sbjct: 186 LSQIENEYGNIKKDRKVEGDKYLEWAAEMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHC 245
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+T+ + NKP +WTE+WT+ ++ +G + RSA+DIA+ V F AK G+ VNYYMYH
Sbjct: 246 GDTWTLLDK-NKPRLWTENWTAQFRTFGDQLAQRSAEDIAYAVLRFFAKGGTLVNYYMYH 304
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT A++++TGYYD+AP+DEYG+ +EPK+GHL++LH IK + L G Q+
Sbjct: 305 GGTNFGRTGASYVLTGYYDEAPMDEYGMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEI 364
Query: 270 LGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
LG EA +E +C +FL NN+ + TV+FR + +P +S+SIL DCKTV +NT
Sbjct: 365 LGHGYEAHNYELPEDKLCLSFLSNNNTGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNT 424
Query: 329 ERV 331
+RV
Sbjct: 425 KRV 427
>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 402
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 174/382 (45%), Positives = 243/382 (63%), Gaps = 14/382 (3%)
Query: 203 VNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT 262
NYYMYHGGTNFGRT+AAF++ YYD+APLDE+GL +EPKWGHL++LH A+KLC + LL
Sbjct: 2 TNYYMYHGGTNFGRTSAAFVMPKYYDEAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61
Query: 263 GTQNVISLGQLQEAFVFE-ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDC 321
G + LG+ EA VFE VC AFL N++ + VT+ FR SY +PR SISIL DC
Sbjct: 62 GKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADC 121
Query: 322 KTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAK 380
KTV F T+ V+ Q+N+R+ + + W+ + E + + + +R D + K
Sbjct: 122 KTVVFGTQHVNAQHNQRTFHFADQTTQNNVWQMFDEEKVPKYKQSKIRLRKAGDLYNLTK 181
Query: 381 DASDYFWYTFRFHYNSSNA------QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFT 434
D +DY WYT F + + + L+V SHGH AFVN ++ G HG+ N +FT
Sbjct: 182 DKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFT 241
Query: 435 LRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQ 489
L + L++G N A+L+ T+G+ DSGA+LE ++AGV RV+++ + TN WG+
Sbjct: 242 LEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHI 301
Query: 490 VGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
VGL+GE+ QIY++ G+ V W + R LTWYK F P+G DPI L++ +MGKG +
Sbjct: 302 VGLVGEQKQIYTDKGMGSVTWKPAVND-RPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMF 360
Query: 550 VNGQSIGRYWVSFKTSKGNPSQ 571
VNGQ IGRYW+S+K + G PSQ
Sbjct: 361 VNGQGIGRYWISYKHALGRPSQ 382
>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
gi|194695440|gb|ACF81804.1| unknown [Zea mays]
Length = 467
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 178/474 (37%), Positives = 264/474 (55%), Gaps = 34/474 (7%)
Query: 285 VCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNL 344
VC AFL N++ + T+ FR Y +PR SIS+L DC+TV F T+ V+ Q+N+R+
Sbjct: 6 VCVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFAD 65
Query: 345 KFDSDEKWEEYR-EAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNS------S 397
+ + WE + E + + +R D + KD +DY WYT F + S
Sbjct: 66 QTAQNNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRS 125
Query: 398 NAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGL 457
+ + L+V SHGH AFVN ++ G HG+ N +FTL + L++G N A+L+ ++G+
Sbjct: 126 DIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGM 185
Query: 458 PDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSS 512
DSGA++E ++AGV RV++ + TN WG+ VGL+GE+ QIY++ G+ V W
Sbjct: 186 TDSGAYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWKP 245
Query: 513 IRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQT 572
+ R LTWYK F P+G DP+ L++ +MGKG +VNGQ IGRYW+S+K + G PSQ
Sbjct: 246 AMN-DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISYKHALGRPSQ- 303
Query: 573 QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVC 632
YHVPR+FL+ N+LVL EEE G P I + T+ +C
Sbjct: 304 -------------------QLYHVPRSFLRQKDNMLVLFEEEFGRPDAIMILTVKRDNIC 344
Query: 633 GHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCE 692
++ + + SW R + + +CP K I ++VFAS+GNP G C
Sbjct: 345 TFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLIQQVVFASYGNPAGICG 404
Query: 693 RYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP-CPGIHKALLVDAQC 745
Y VGSCH+ ++ VVE+AC+GK C++P+ + +GGD C G L V A+C
Sbjct: 405 NYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDANCSGTTATLAVQAKC 458
>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
Length = 450
Score = 334 bits (857), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 192/454 (42%), Positives = 260/454 (57%), Gaps = 23/454 (5%)
Query: 188 IAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHL 246
+AF VA FI K GS+VNYYMYHGGTNF RT+ F+ T Y AP+DEYGL+R+PKWGHL
Sbjct: 1 MAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHL 60
Query: 247 KELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNI 306
++LH AIK L++G + SLG ++A+VF+ + G CAAFL N A V+F
Sbjct: 61 RDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGR 120
Query: 307 SYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTL 366
Y+LP SIS+LPDCK FNT VS + S + + W+ Y EA + D
Sbjct: 121 RYDLPAWSISVLPDCKAAVFNTATVS----EPSAPARMSPAGGFSWQSYSEATNSLDGRA 176
Query: 367 LRAEGLLDQISAAKDASDYFWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEY 420
+GL++Q+S D SDY WYT + NS+ + Q P L V S GH L FVNG+
Sbjct: 177 FTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHSLQVFVNGQS 236
Query: 421 TGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRV 474
G+ +G +D+ T V + QG+N ++LS VGLP+ G E GV +
Sbjct: 237 YGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVLGPVTLSGL 296
Query: 475 RVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGND 534
+ +N W YQ+GL GE L + S G + V W S + + LTW+K F AP+G+
Sbjct: 297 NEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSA-AGKQPLTWHKAYFSAPSGDA 355
Query: 535 PIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHF---CAIIKAT 591
P+AL++ SMGKG+AWVNG+ IGRYW S+K S T + C + +
Sbjct: 356 PVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSGGCGGCSYAGTYSETKCQTGCGDV-SQ 413
Query: 592 NTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDT 625
YHVPR++L P+GNLLVLLEE G+ G+ + T
Sbjct: 414 RYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVT 447
>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
Length = 534
Score = 333 bits (855), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 215/544 (39%), Positives = 291/544 (53%), Gaps = 53/544 (9%)
Query: 236 GLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDE 295
GL+R+PKWGHL++LH AIKLC L+ + SLG EA V++ SG CAAFL N
Sbjct: 9 GLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGT 68
Query: 296 RKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRS-KTSNLKFDS------ 348
+ TV F SY LP S+SILPDCK VAFNT ++++ + +LK D
Sbjct: 69 KSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFARQSLKPDGGSSAEL 128
Query: 349 DEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFH------YNSSNAQAP 402
+W +E I GLL+QI+ D SDY WY+ R + ++A
Sbjct: 129 GSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAV 188
Query: 403 LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGA 462
L ++S G +++AF+NG+ GS HG +L ++L G N LLSVTVGL + GA
Sbjct: 189 LHIESLGQVVYAFINGKLAGSGHGKQK---ISLDIPINLVAGKNTVDLLSVTVGLANYGA 245
Query: 463 FLERKVAGVHRVRVQDKSFTNCS--------WGYQVGLIGEKLQIYSNLGLNKVLWSSIR 514
F + AG+ V KS S W YQVGL GE + GL V S
Sbjct: 246 FFDLVGAGITG-PVTLKSAKGGSSIDLASQQWTYQVGLKGE------DTGLGAVDSSEWV 298
Query: 515 S----PTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNP 569
S PT+Q L WYKTTF AP+G++P+A++ KG AWVNGQSIGRYW + G
Sbjct: 299 SKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIGRYWPTSIAGNGGC 358
Query: 570 SQT-----QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVD 624
+ + Y N + C T YHVPR++LKP+GN LVL EE G+P I+
Sbjct: 359 TDSCDYRGSYRANKC--LKNCGKPSQT-LYHVPRSWLKPSGNTLVLFEEMGGDPTQISFG 415
Query: 625 TIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK-KPTVQPSCPLGKK-ISKIVF 681
T +C V+ SH PP+ +W D+ I + +P + CP+ + IS I F
Sbjct: 416 TKQTGSNLCLTVSQSHPPPVDTWTS-----DSKISNRNRTRPVLSLQCPVSTQVISSIKF 470
Query: 682 ASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLV 741
ASFG P G C + GSC+SS S +V++ACIG C+I + +R F G+PC G+ K+L V
Sbjct: 471 ASFGTPKGTCGSFTSGSCNSSRSLSLVQKACIGSRSCNIEVSTRVF-GEPCRGVVKSLAV 529
Query: 742 DAQC 745
+A C
Sbjct: 530 EASC 533
>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
Length = 1064
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 160/311 (51%), Positives = 202/311 (64%), Gaps = 35/311 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAK+KEGG DVIQTYVFWN HEP + QY+F GR DI++F+K + S GLY+ LRIG
Sbjct: 59 MWPDLIAKSKEGGADVIQTYVFWNGHEPVRRQYNFEGRYDIVKFVKLVGSSGLYLHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL D+ GI FR+DN P+K
Sbjct: 119 PYVCAEWNFGGFPVWLRDIPGIEFRTDNAPFKDEMQRFVKKIVDLMQKEMLFSWQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E +F ++G YV WAA+MA++ GVPWVMC+Q DAP +INACNG C
Sbjct: 179 MLQIENEYGNVESSFGQRGKDYVKWAARMALELDAGVPWVMCQQADAPDIIINACNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS NKP +WTEDW ++ WGG+ R +DIAF VA F + GS+ NYYMY
Sbjct: 239 DAFW--PNSANKPKLWTEDWNGWFASWGGRTPKRPVEDIAFAVARFFQRGGSFHNYYMYF 296
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL-TGTQNV 267
GGTNFGR++ F +T Y AP+DEYGL+ +PKWGHLKELHAAIKLC L+ +
Sbjct: 297 GGTNFGRSSGGPFYVTSYDYDAPIDEYGLLSQPKWGHLKELHAAIKLCEPALVAVDSPQY 356
Query: 268 ISLGQLQEAFV 278
I LG +QE V
Sbjct: 357 IKLGPMQEVGV 367
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 195/524 (37%), Positives = 274/524 (52%), Gaps = 39/524 (7%)
Query: 246 LKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSG---VCAAFLVNNDERKAVTVL 302
LK + + + + ++ T+ + +++E+ ++ SG C+AFL N DE K +V
Sbjct: 545 LKPANILVLISTFAMVMDTKQTAHVYRVKES-LYSTQSGNGSSCSAFLANIDEHKTASVT 603
Query: 303 FRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNF 362
F Y+LP S+SILPDC+T FNT +V Q + KT+ + + + W +E I +
Sbjct: 604 FLGQIYKLPPWSVSILPDCRTTVFNTAKVGAQTS--IKTNKISY-VPKTWMTLKEPISVW 660
Query: 363 DNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS-------NAQAP-LDVQSHGHILHA 414
+G+L+ ++ KD SDY W R + ++ N +P L + S ILH
Sbjct: 661 SENNFTIQGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHI 720
Query: 415 FVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRV 474
FVNG+ GS G V + L QG ND LLS TVGL + GAFLE+ AG +
Sbjct: 721 FVNGQLIGSVIGHWVKVV----QPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGF-KG 775
Query: 475 RVQDKSFTN-------CSWGYQVGLIGEKLQIYSNLGLNKVLWSSIR---SPTRQLTWYK 524
+V+ F N SW YQVGL GE +IY K W+ + SP+ TWYK
Sbjct: 776 QVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLTPDASPS-TFTWYK 834
Query: 525 TTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHF 584
T F AP G +P+AL+L SMGKG+AWVNG IGRYW G + Y + TS
Sbjct: 835 TFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWTRVAPKDGC-GKCDYRGHYHTSK-- 891
Query: 585 CAIIKATNT---YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP 641
CA T YH+PR++L+ + NLLVL EE G P I+V + + + +C V+ SH P
Sbjct: 892 CATNCGNPTQIWYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTICAEVSESHYP 951
Query: 642 PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS 701
L +W K P + C G IS I FAS+G P G C+ ++ G CH+
Sbjct: 952 SLQNWSPSDFIDQNSKNKM--TPEMHLQCDDGHTISSIEFASYGTPQGSCQMFSQGQCHA 1009
Query: 702 SHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+S +V +AC GK C I +L+ FGGDPC GI K L V+A+C
Sbjct: 1010 PNSLALVSKACQGKGSCVIRILNSAFGGDPCRGIVKTLAVEAKC 1053
>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
Length = 504
Score = 328 bits (840), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 199/512 (38%), Positives = 274/512 (53%), Gaps = 35/512 (6%)
Query: 255 LCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKS 314
+C + L++ V SLG Q+A+V+ SG C+AFL N D + + V+F N+ Y LP S
Sbjct: 1 MCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWS 60
Query: 315 ISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDE-KWEEYREAILNFDNTLLRAEGLL 373
+SILPDC+ FNT +V Q S+ L +S+ WE + E + T + A GLL
Sbjct: 61 VSILPDCRNAVFNTAKVGVQ---TSQMQMLPTNSERFSWESFEEDTSSSSATTITASGLL 117
Query: 374 DQISAAKDASDYFWYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGS 427
+QI+ +D SDY WY SS + + P L VQS GH +H F+NG +GSA+G+
Sbjct: 118 EQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFINGRLSGSAYGT 177
Query: 428 HDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLER---KVAGVHRVRVQDKSFTNC 484
++ F V+LR GTN ALLSV VGLP+ G E + G + DK +
Sbjct: 178 REDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHGLDKGKLDL 237
Query: 485 SW---GYQVGLIGEKLQIYSNLGLNKVLW---SSIRSPTRQLTWYKTTFRAPAGNDPIAL 538
SW YQVGL GE + + S G++ V W + + + LTW+KT F AP G +P+AL
Sbjct: 238 SWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPEGEEPLAL 297
Query: 539 NLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPR 598
++ MGKG+ W+NG SIGRYW + T N + C YHVPR
Sbjct: 298 DMDGMGKGQIWINGISIGRYWTAIATGSCNDCNYAGSFRPPKCQLGCGQ-PTQRWYHVPR 356
Query: 599 AFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIK 658
++LK NLLV+ EE G+P I++ ++ VC V+ H P L +W I
Sbjct: 357 SWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVCADVSEYH-PNLKNW---------HID 406
Query: 659 KFGKK-----PTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACI 713
+GK P V C G+ IS I FASFG P G C Y G+CHSS S ++E+ CI
Sbjct: 407 SYGKSENFRPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQGACHSSSSYDILEQKCI 466
Query: 714 GKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
GK RC + + + FG DPCP + K L V+A C
Sbjct: 467 GKPRCIVTVSNSNFGRDPCPNVLKRLSVEAVC 498
>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 727
Score = 321 bits (823), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 213/662 (32%), Positives = 326/662 (49%), Gaps = 58/662 (8%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW ++ K G+D+I+TY FWNLHEP G Y+F G ++ F+ GLYV +R G
Sbjct: 73 MWRPVLEATKAAGIDLIETYTFWNLHEPTPGTYNFEGNANVTAFLDICAELGLYVTVRFG 132
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG P WL ++ GIVFR N+P+
Sbjct: 133 PYVCAEWNYGGFPFWLKEIDGIVFRDYNQPFMDQMSNWMTYIVNYLRPYYASNGGPIILA 192
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
++ENEY +E A+ G Y LWAA+ A G+PW+MC QDD VIN CNG C +
Sbjct: 193 QVENEYGWLEAAYGASGTKYALWAAQFANSLDIGIPWIMCSQDDI-ATVINTCNGFYCHD 251
Query: 152 --TFKGPNSPNKPSIWTEDWTSFYQVW-GGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
PN+P+ WTE+W ++Q W GG P+ R QD+ + VA +IA GS +NYYM+
Sbjct: 252 WIDVHWTAYPNQPAFWTENWPGWFQNWEGGVPH-RPVQDVLYSVARWIAYGGSMMNYYMW 310
Query: 209 HGGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT-GTQN 266
GGT FGR T F+ T Y +DEYG EPK+ E H I +L+
Sbjct: 311 FGGTTFGRWTGGPFITTSYDYDGAIDEYGYPYEPKYSQSLEFHTIIHAYEHIILSMNPPK 370
Query: 267 VISLGQ-LQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
I LG+ ++ + + +G +FL N TV + I++++ S+ +L + ++
Sbjct: 371 PILLGENVEISHFYSVETGESFSFLANFGATGVQTVQWNGITFKVQPWSVQLLYNNVSI- 429
Query: 326 FNTERVSTQYNKRSKTSNLK-FDSDEKWEEYREAILNFDNTLLR-AEGLLDQISAAKDAS 383
F+T + + +K F++ +W E +FD T +E ++Q+S +D +
Sbjct: 430 FDTSATPIGSPVPKQFTPIKSFENIGQWSE------SFDLTFTNYSETPMEQLSLTRDQT 483
Query: 384 DYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQ 443
DY WY + N AQ L + + ++H FV+ +Y + G + TL +T+ +
Sbjct: 484 DYLWYVTKIEVNRVGAQ--LSLPNISDMVHVFVDNQYIATGRGP---TNITLNSTIGV-- 536
Query: 444 GTNDGALLSVTVGLPDSGAFLERKVAGVHR-VRVQDKSFTNCSWGYQVGLIGEKLQIYSN 502
G + +L VGL + +E VAG+ V + ++ W + + GE LQ+Y+
Sbjct: 537 GGHTLQVLHTKVGLVNYAEHMEATVAGIFEPVTLDSVDISSNGWSMKPFVQGETLQLYNP 596
Query: 503 LGLNKVLWSSIRSPTRQLTWYKTTFRAP-AGNDPIALNLQSMGKGEAWVNGQSIGRYWVS 561
V W+++ LTWYK F + N +AL++ M KG +VNG +IGRYW++
Sbjct: 597 NHSGSVQWTNVTG-NPPLTWYKFNFNLELSSNMSLALDMLGMTKGMIFVNGYNIGRYWLA 655
Query: 562 FKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGI 621
NP Q + C + YHVP +L N +V+ EE GNP I
Sbjct: 656 LAYGC-NPCTYQGGYSPSMCQLGCG-EPSQQYYHVPTDWLMNGENEIVIFEEVYGNPEAI 713
Query: 622 TV 623
T+
Sbjct: 714 TL 715
>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
Length = 377
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 138/261 (52%), Positives = 184/261 (70%), Gaps = 31/261 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I +AK+GGL+ IQTYVFWN+HEPQ+G+++FSGR D+++FIK IQ G+YV LR+G
Sbjct: 71 MWPSIIKRAKQGGLNTIQTYVFWNVHEPQQGKFNFSGRADLVKFIKLIQKNGMYVTLRLG 130
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EWT+GGLP WL +V GI FR+DNK +K
Sbjct: 131 PFIQAEWTHGGLPYWLREVPGIFFRTDNKQFKEHTERYVRMILDKMKEERLFASQGGPII 190
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY ++ A+ + G Y+ WA+ + G+PWVMCKQ+DAP P+INACNG C
Sbjct: 191 LGQIENEYSAVQRAYKQDGLNYIKWASNLVDSMKLGIPWVMCKQNDAPDPMINACNGRHC 250
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
G+TF GPN NKPS+WTE+WT+ ++V+G P RS +DIA+ VA F +KNG++VNYYMYH
Sbjct: 251 GDTFPGPNRENKPSLWTENWTTQFRVFGDPPTQRSVEDIAYSVARFFSKNGTHVNYYMYH 310
Query: 210 GGTNFGRTAAAFMITGYYDQA 230
GGTNFGRT+A ++ T YY+ A
Sbjct: 311 GGTNFGRTSAHYVTTRYYEDA 331
>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
Length = 425
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 170/417 (40%), Positives = 242/417 (58%), Gaps = 22/417 (5%)
Query: 227 YDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVC 286
YD AP+DEYGL R PKWGHLK+LH AIKLC LL G +SLG EA V+ ++SG C
Sbjct: 1 YD-APVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVNVSLGPSVEADVYTDSSGAC 59
Query: 287 AAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKF 346
AAF+ N D++ TV FRN SY +P S+SILPDCK V +NT +V+TQ NK +
Sbjct: 60 AAFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTNKIAMIPEKLQ 119
Query: 347 DSDE-----KWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSN--- 398
SD+ KW+ ++E + G +D I+ KD +DY W+T + +
Sbjct: 120 QSDKGQKTFKWDVWKENPGIWGKPDFVINGFVDHINTTKDTTDYLWHTTSISIDENEELL 179
Query: 399 ---AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTV 455
++ L ++S GH LHAFVN +Y G+A+G+ + +FT +N + L+ G N+ ALLS+TV
Sbjct: 180 KKGSKPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKNPISLKAGKNEIALLSLTV 239
Query: 456 GLPDSGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLGLNKVLW 510
GL +G F + AGV V+++ + ++ +W Y++G+ GE L+IY GLN V W
Sbjct: 240 GLQTAGPFYDFVGAGVTSVKIKGLNNKTIDLSSNAWTYKIGVQGEHLKIYQGNGLNSVSW 299
Query: 511 SSIRSPTR--QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGN 568
+S P + LTWYK AP G++P+ L++ MGKG AW+NG+ IGRYW K
Sbjct: 300 TSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAWLNGEGIGRYWPRISEFKKE 359
Query: 569 PSQTQYAVNTVTSIHFCAI---IKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGIT 622
+ + C + YHVPR++ KP+GN+LV EE+ G+P IT
Sbjct: 360 DCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVFFEEKGGDPTKIT 416
>gi|297734971|emb|CBI17333.3| unnamed protein product [Vitis vinifera]
Length = 447
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 172/429 (40%), Positives = 249/429 (58%), Gaps = 33/429 (7%)
Query: 130 MCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIA 189
MCKQ DAP PVIN C G CG+TF GPN PNK S+ TE + P+++ Q I
Sbjct: 1 MCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSVSTE--------YLETPHLKGQQKIL 52
Query: 190 FHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+LFI+KNG+ NYYMY+ TNFGRT ++F T YYD+APLDEYGL RE KWGHL++L
Sbjct: 53 H--SLFISKNGTLANYYMYYSVTNFGRTTSSFATTCYYDEAPLDEYGLPRETKWGHLRDL 110
Query: 250 HAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISY 308
HAA++L + LL G + LG+ EA ++E+ S +CA FL+NN R T R Y
Sbjct: 111 HAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGSKY 170
Query: 309 ELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLR 368
LP+ SIS LPDCKTV FNT+ V++ Y + FDS + +A+ ++ +
Sbjct: 171 YLPQHSISNLPDCKTVVFNTQTVASNYLIFPFS---MFDSLNEPNMKTDALPTYEECPTK 227
Query: 369 AEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEY------TG 422
+ ++ ++ KD +DY WYT + + P V + GH++HAF+NGEY TG
Sbjct: 228 TKSPVELMTMTKDTTDYLWYTTK----KDVLRVP-QVSNLGHVMHAFLNGEYVMEFYLTG 282
Query: 423 SAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS-- 480
+ HGS+ SF + L+ G N A L TVGLPDSG+++E ++AGVH V +Q +
Sbjct: 283 TRHGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHRLAGVHNVAIQGLNTR 342
Query: 481 ---FTNCSWGYQVGLIGEKLQIYS---NLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGND 534
WG++VGL G+KL +++ + + V + +++ L ++ T R P G +
Sbjct: 343 TIDLPKNGWGHKVGLNGDKLHLFTQPPSQSVYHVPRAFLKTSDNLLVLFEETGRNPDGIE 402
Query: 535 PIALNLQSM 543
+ LN ++
Sbjct: 403 ILTLNRDTI 411
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 48/88 (54%), Gaps = 6/88 (6%)
Query: 582 IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP 641
+H + + YHVPRAFLK + NLLVL EE NP GI + T+ +C +++ H
Sbjct: 362 LHLFTQPPSQSVYHVPRAFLKTSDNLLVLFEETGRNPDGIEILTLNRDTICCYISEHHPT 421
Query: 642 PLSSWLRHRQRGDTDIKKF--GKKPTVQ 667
+ SW +R +DI+ F G KP +
Sbjct: 422 HVRSW----KREASDIQMFVDGVKPKAK 445
>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 338
Score = 298 bits (763), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 142/289 (49%), Positives = 187/289 (64%), Gaps = 35/289 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD I+TY+FW+ HEPQ+ +YDFSGR D I+F + IQ GLYV +RIG
Sbjct: 52 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH++ GI R++N+ YK
Sbjct: 112 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 171
Query: 93 ---IENEY-QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR 148
IENEY + PA+ + G Y+ W A+MA + GVPW+MC+Q DAP P+IN CNG
Sbjct: 172 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 231
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
C + F PN+P P ++TE+W +++ WG K R+A+D+AF VA F G + NYYMY
Sbjct: 232 C-DNFT-PNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMY 289
Query: 209 HGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLC 256
HGGTNFGRT+ IT YD APLDEYG + +PKWGHLK+LHA+I +C
Sbjct: 290 HGGTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIXIC 338
>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
Length = 446
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 159/461 (34%), Positives = 242/461 (52%), Gaps = 39/461 (8%)
Query: 300 TVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAI 359
TV+FR + +P +S+SIL DCKTV +NT+RV Q+++RS + + + WE Y EAI
Sbjct: 4 TVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNVWEMYSEAI 63
Query: 360 LNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS------NAQAPLDVQSHGHILH 413
F T +R + L+Q + KD SDY WYT F S + + + ++S H +
Sbjct: 64 PKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIKSTAHAMI 123
Query: 414 AFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHR 473
F N + G+ GS SF + LR G N A+LS ++G+ DSG L G+
Sbjct: 124 GFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVEVKGGIQD 183
Query: 474 VRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFR 528
VQ + WG++ L GE +IY+ G+ + W + +TWYK F
Sbjct: 184 CVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAENDL-PITWYKRYFD 242
Query: 529 APAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAII 588
P G+DPI +++ SM KG +VNG+ IGRYW SF T G+PSQ+
Sbjct: 243 EPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLAGHPSQS---------------- 286
Query: 589 KATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLR 648
YH+PRAFLKP GNLL++ EEE G P GI + T+ +C ++ + + +W
Sbjct: 287 ----VYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFISEHNPAQIKTW-- 340
Query: 649 HRQRGDTDIKKFGKKPTVQPS--CPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQG 706
+ IK + + + + CP + I ++VFASFGNP+G C + G+CH+ ++
Sbjct: 341 --ESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAGTCHTPDAKA 398
Query: 707 VVERACIGKSRCSIPLLSRYFGGD-PCPGIHKALLVDAQCR 746
+VE+ C+GK C +P+++ +G D CP L V +C+
Sbjct: 399 IVEKECLGKESCVLPVVNTVYGADINCPATTATLAVQVRCK 439
>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
Length = 735
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 205/684 (29%), Positives = 327/684 (47%), Gaps = 86/684 (12%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQK-GQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
W ++ +K G+D+I+TY+FWN+H+P ++ +I F+ + L+V LRIG
Sbjct: 73 WNEILKSSKLAGVDIIETYIFWNVHQPNTPNEFYLEDNANITLFLDLCKENELFVNLRIG 132
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW YGG PIWL ++ GIVFR N+P+
Sbjct: 133 PYVCAEWNYGGFPIWLKNIEGIVFRDYNQPFMDAMSTWVTMVVDKLQDYFAPNGGPIIIA 192
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENEY +E + G Y LWA A + G+PW+MC Q+D IN CNG C +
Sbjct: 193 QIENEYGWLENEYGASGREYALWAINFAKSLNIGIPWIMCAQEDIDS-AINTCNGFYCHD 251
Query: 152 TF-KGPNS-PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ N+ P++P+ WTE+W +++ WG R QD+ F A FIA GS NYYM+
Sbjct: 252 WIDRHWNAFPDQPAFWTENWVGWFENWGQAVPKRPVQDMLFSSARFIAYGGSLFNYYMWF 311
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAI-KLCSRPLLTGTQNV 267
GGTNFGR+ ++IT Y APLDE+G EPK+ + H I K S +
Sbjct: 312 GGTNFGRSVGGPWIITSYEYDAPLDEFGFPNEPKYSMSTQFHFVIHKYESIIMGMDPPTP 371
Query: 268 ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFN 327
+ L + EA + E F + D + ++ +Y L S+ I+ +V F+
Sbjct: 372 VPLSNISEAHPYGEDLVFLTNFGLVID-----YIQWQGTNYTLQPWSVVIVY-SGSVVFD 425
Query: 328 TERVSTQYNKRSKTSNLK-------FDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAK 380
T V +Y K S K +DS + E+ ++ + ++ ++ E L+QI+
Sbjct: 426 TSYVPDEYIKPSTRDQFKDVPNAINYDSILSFSEWGQSDI-INDCIINNESPLEQINLTN 484
Query: 381 DASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSA---------HGSHDNV 431
D +DY WYT N + L +++ H F+NG Y G+ ++ N+
Sbjct: 485 DTTDYLWYTTNITLNETTT---LTIENMYDFCHVFLNGAYQGNGWSPVAYITLEPTNGNI 541
Query: 432 SFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAG-VHRVRVQDKSFTNCSWGYQV 490
++ L+ +L++T+GL + A +E G + + + + TN W +
Sbjct: 542 NYQLQ-------------ILTMTMGLENYAAHMESYSRGLLGSISLGQTNITNNQWSMKP 588
Query: 491 GLIGEKLQIYSNLGLNKVLWSSIR-SPTRQLTWYKTTFRAPA-GNDP----IALNLQSMG 544
G++GEKLQIY+ +KV W S T+ +TWY+ +DP LN+ SM
Sbjct: 589 GILGEKLQIYNEYSSSKVNWQPYNPSATQSMTWYQFNISLDGLSSDPSSNAYVLNMTSMN 648
Query: 545 KGEAWVNGQSIGRYWVSFKT-SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKP 603
KG +VNG +IGRY++ T S Q + T ++ + + YH+P +L
Sbjct: 649 KGFVYVNGFNIGRYFLMEATQSNCTLKQDYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFL 708
Query: 604 TGN----LLVLLEEENGNPLGITV 623
+ ++L EE NG+P I +
Sbjct: 709 QQDKQYATVILFEEVNGDPTKIQL 732
>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
Length = 383
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 170/406 (41%), Positives = 228/406 (56%), Gaps = 46/406 (11%)
Query: 230 APLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET-SGVCAA 288
PLDE+GL REPKWGHLK++H A+ LC R L G + LG Q+A V+++ + CAA
Sbjct: 4 GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAA 63
Query: 289 FLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDS 348
L NN+ R A V FR LP +SIS+LPDCKTV FNT+ V+TQ+N R+ + +
Sbjct: 64 LLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSRNFVRSEIANK 123
Query: 349 DEKWEEYREAI---LNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYN------SSNA 399
+ WE YRE L F + R + KD +DY WYT N
Sbjct: 124 NFNWEMYREVPPVGLGFKFDVPR-----ELFHLTKDTTDYAWYTTSLLLGRRDLPMKKNV 178
Query: 400 QAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPD 459
+ L V S GH +HA+VNGEY GSAHGS SF R L++G N ALL VGLPD
Sbjct: 179 RPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRELSSLKEGENHIALLGYLVGLPD 238
Query: 460 SGAFLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIR 514
SGA++E++ AG + + + + WG+QVG GEK ++++ G V W+
Sbjct: 239 SGAYMEKRFAGPRSITILGLNTGTLDISQNGWGHQVGTDGEKKKLFTEEGSKSVQWT--- 295
Query: 515 SPTR--QLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQT 572
P + LTWYK F AP G++P+A+ + MGKG WVNG+SIGRYW ++ + P+Q+
Sbjct: 296 KPDQGGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSPLKKPTQS 355
Query: 573 QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNP 618
+ YH+PRA+LKP NL+VLLEEE GNP
Sbjct: 356 E--------------------YHIPRAYLKPK-NLIVLLEEEGGNP 380
>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
Length = 347
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 163/352 (46%), Positives = 202/352 (57%), Gaps = 39/352 (11%)
Query: 70 GGLPIWLHDVAGIVFRSDNKPYK-------------------------------IENEYQ 98
GG P+WL V GI FR+DN+P+K IENE+
Sbjct: 1 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 60
Query: 99 TIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS 158
+E G Y WAA+MAV TGVPW+MCKQ+DAP PVI+ CNG C E FK PN
Sbjct: 61 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFK-PNK 118
Query: 159 PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTA 218
KP +WTE WT +Y +GG R A+D+AF VA FI GS++NYYMYHGGTNFGRTA
Sbjct: 119 DYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQGGGSFLNYYMYHGGTNFGRTA 178
Query: 219 AA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAF 277
FM T Y APLDEYGL REPKWGHL++LH AIK C L++ +V LG QEA
Sbjct: 179 GGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAH 238
Query: 278 VFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNK 337
VF+ S CAAFL N D + +V V F Y+LP SISILPDCKT +NT +V +Q
Sbjct: 239 VFKSESD-CAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQ--- 294
Query: 338 RSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDYFWY 388
S+ S W+ + E + + +GL +QI+ +D +DY WY
Sbjct: 295 SSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWY 346
>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 486
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 150/291 (51%), Positives = 180/291 (61%), Gaps = 35/291 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD+I+TYVFWN HEP G+Y F R D++RFIK +Q GLYV LRIG
Sbjct: 52 MWPDLIQKAKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG PIWL V GI FR+DN P+K
Sbjct: 112 PYVCAEWNYGGFPIWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQ+DAP P+I+ CNG C
Sbjct: 172 LSQIENEYGPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
E FK PN KP IWTE+W+ +Y +GG R +D+AF VA FI GS VNYYMYH
Sbjct: 232 -ENFK-PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSLVNYYMYH 289
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWG--HLKELHAAIKLCSR 258
GGTNFGRT+ F+ T Y AP+DEYGL+REP G LK L+ + S+
Sbjct: 290 GGTNFGRTSGLFVTTSYDFDAPIDEYGLLREPILGPVTLKGLNEGTRDMSK 340
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/146 (43%), Positives = 88/146 (60%), Gaps = 2/146 (1%)
Query: 479 KSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIAL 538
+ + W Y+VGL GE L +YS G N V W + LTWYKTTF PAGN+P+AL
Sbjct: 336 RDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQKQPLTWYKTTFNTPAGNEPLAL 395
Query: 539 NLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTYHVP 597
++ SM KG+ WVNG+SIGRY+ + ++G ++ Y T + + YH+P
Sbjct: 396 DMSSMSKGQIWVNGRSIGRYFPGY-IARGKCNKCSYTGFFTEKKCLWNCGGPSQKWYHIP 454
Query: 598 RAFLKPTGNLLVLLEEENGNPLGITV 623
R +L P GNLL++LEE GNP GI++
Sbjct: 455 RDWLSPNGNLLIILEEIGGNPQGISL 480
>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
Length = 827
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 204/683 (29%), Positives = 327/683 (47%), Gaps = 79/683 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP ++ + K G++ I+TY+FWNLH+P YDF G +D+ F+ + +G +V +R G
Sbjct: 62 MWPDILKRTKAAGINTIETYIFWNLHQPTPDTYDFEGSSDVKHFLDLCKEEGFHVIVRFG 121
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P++ +EW GGLP WL V GIV+R+ N+P+
Sbjct: 122 PYVCAEWNNGGLPSWLKAVPGIVYRTHNEPFMREMKKWMDYIVHYLSDYYAPNGGPIIMA 181
Query: 92 KIENEYQTIEPAFHEKG-PPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
+IENEY +E + E+G P YV WA K+A ++TG+PW+MC+Q+ VIN CNG C
Sbjct: 182 QIENEYGWLEYEYREQGGPEYVDWAVKLAKSYNTGIPWIMCQQNTR-SDVINTCNGFYCH 240
Query: 151 E--TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
+ + P++P+ +TE WT + Q + R D+ + A F ++ G VNYYM+
Sbjct: 241 DWLQYHQRTFPDQPAFFTELWTGWPQYFEEGFPTRPTVDVLYSAARFYSRGGGMVNYYMW 300
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL------- 261
HGGT FGR + F+ T Y APLDEYG +EPK+ L +LH ++ S +L
Sbjct: 301 HGGTTFGRFTSPFLTTSYDYDAPLDEYGFPQEPKYSMLTKLHVTLEKYSSVILHDPNVPP 360
Query: 262 -----TGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERK------AVTVLFRN----I 306
T +I + E+ VF A V+ + + +V + + N
Sbjct: 361 PYVFPDNTVEMIEYKKDAESVVFLVNWDDTFAKQVDMNGKNVKINQWSVQIYYNNELVFD 420
Query: 307 SYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTL 366
++E+P P K +A + + R+ NL +E + + L ++ +
Sbjct: 421 TFEIPANLTRPNPPFKPIAKTSLDATAAATSRTGLVNLVSSWNEPF-----SFLTYNAS- 474
Query: 367 LRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHG 426
++ Q+ D SDY WY + + L + + FV+G++ G
Sbjct: 475 --SQTPTAQLKLTGDNSDYIWYETEI--DLTKTDEILYLYKSYDFSYVFVDGQFLYWHRG 530
Query: 427 SHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKSFTNCS 485
S F + V G + +L +G+P GA +E+ G+ + + K+ T+
Sbjct: 531 SPIQAYFNGKFPV----GKHTLQILCAAMGVPSYGAHIEQHERGLTGDIFLGSKNITDNG 586
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT--RQLTWYKTTFRAPAGND--PIALNLQ 541
W + L GE L ++++ + V WS + T +TWYK + P+ D AL+L+
Sbjct: 587 WKMRPFLSGELLGLHAS--PSTVKWSPVSKGTAGSGVTWYKFNVKTPSFEDGPAFALDLK 644
Query: 542 SMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFL 601
SM KG +VNG SIGRYWV+ + +QT N + C + YHVP+ FL
Sbjct: 645 SMWKGLVFVNGNSIGRYWVAKGWCEEKCNQTGLYDNYGCREN-CG-ESSQRYYHVPKDFL 702
Query: 602 KPTG-NLLVLLEEENGNPLGITV 623
K + N +++ EE G+P I +
Sbjct: 703 KESSDNEVIIFEELQGDPYSIEL 725
>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
Length = 774
Score = 285 bits (729), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 180/494 (36%), Positives = 246/494 (49%), Gaps = 54/494 (10%)
Query: 269 SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
SL A V+ + SG C AFL N D K V F++ SY+LP S+SILPDCK VAFNT
Sbjct: 317 SLQNYYVADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNT 376
Query: 329 ERVSTQYNKRSKT-SNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+V +Q +NL+ + W +RE + N L G +D I+ KD++DY W
Sbjct: 377 AKVRSQTLMMDMVPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLW 436
Query: 388 YTFRFHYNSSN---AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQG 444
YT F + S+ L ++S GH + AF+N E GSA+G+ +F++ V+LR G
Sbjct: 437 YTTSFDVDGSHLAGGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAG 496
Query: 445 TNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLG 504
N +LLS+TVGL + G E AG+ V++ G+ + + SN
Sbjct: 497 KNKLSLLSMTVGLQNGGPMYEWAGAGITSVKIS-------------GMENRIIDLSSNK- 542
Query: 505 LNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW--VSF 562
W YK P G+DP+ L++QSMGKG AW+NG +IGRYW +S
Sbjct: 543 -----WE-----------YKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISP 586
Query: 563 KTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGIT 622
+ + S + YHVPR++ P+GN LV+ EE+ G+P IT
Sbjct: 587 VSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKIT 646
Query: 623 VDTIAIRKVCGHVTNSHLPP--LSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIV 680
+ VC V+ H P L SW R+ Q D K VQ SCP GK IS +
Sbjct: 647 FSRRTVASVCSFVSE-HYPSIDLESWDRNTQNDGRDAAK------VQLSCPKGKSISSVK 699
Query: 681 FASFGNPDGDCERYAVGSCHSSHSQGVVE---------RACIGKSRCSIPLLSRYFGGDP 731
F SFGNP G C Y GSCH +S VVE RAC+ + C++ L FG D
Sbjct: 700 FVSFGNPSGTCRSYQQGSCHHPNSISVVEKGTLGWAHRRACLNMNGCTVSLSDEGFGEDL 759
Query: 732 CPGIHKALLVDAQC 745
CPG+ K L ++A C
Sbjct: 760 CPGVTKTLAIEADC 773
Score = 223 bits (568), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 114/273 (41%), Positives = 152/273 (55%), Gaps = 53/273 (19%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQ--------------------YDFSGRND 40
MWP L+A+AK+GG D ++TYVFWN HEP +GQ Y F R D
Sbjct: 68 MWPKLVAEAKDGGADCVETYVFWNGHEPAQGQVRAASPKFVMDLACSIRDKPYYFEERFD 127
Query: 41 IIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK-------- 92
++RF K ++ GLY+ LRIGPF+ +EWT+GG+P+WLH G VFR++N+P+K
Sbjct: 128 LVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTT 187
Query: 93 -----------------------IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWV 129
+ENEY +E A+ PY +WAA MA+ +TGVPW+
Sbjct: 188 YIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAASMALAQNTGVPWI 247
Query: 130 MCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIA 189
MC+Q DAP PVIN CN C + FK PNSP KP WTE+W ++Q +G R +D+A
Sbjct: 248 MCQQYDAPDPVINTCNSFYC-DQFK-PNSPTKPKFWTENWPGWFQTFGESNPHRPPEDVA 305
Query: 190 FHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFM 222
F VA F K GS NYY+ T+ AF+
Sbjct: 306 FSVARFFGKGGSLQNYYVADVYTDQSGGCVAFL 338
>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
Flags: Precursor
gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
Length = 761
Score = 280 bits (715), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 187/624 (29%), Positives = 302/624 (48%), Gaps = 76/624 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQK-GQYDFSGRNDIIRFIKEIQSQGLYVCLRI 59
MWP ++ ++K+ G+D+I TY+FWN+H+P +Y F G +I +F+ + LYV LRI
Sbjct: 70 MWPIILKQSKDAGIDIIDTYIFWNIHQPNSPSEYYFDGNANITKFLDLCKEFDLYVNLRI 129
Query: 60 GPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY---------------------------- 91
GP++ +EWTYGG PIWL ++ IV+R N+ +
Sbjct: 130 GPYVCAEWTYGGFPIWLKEIPNIVYRDYNQQWMNEMSIWMEFVVKYLDNYFAPNGGPIIL 189
Query: 92 -KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
++ENEY +E + G Y W+ A + G+PW+MC+Q+D IN CNG C
Sbjct: 190 AQVENEYGWLEQEYGINGTEYAKWSIDFAKSLNIGIPWIMCQQNDIES-AINTCNGYYCH 248
Query: 151 ETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
+ PN+PS WTE+W +++ WG R QDI + A FIA GS +NYYM+
Sbjct: 249 DWISSHWEQFPNQPSFWTENWIGWFENWGQAKPKRPVQDILYSNARFIAYGGSLINYYMW 308
Query: 209 HGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGT--Q 265
GGTNFGRT+ ++IT Y APLDE+G EPK+ + H + LL +
Sbjct: 309 FGGTNFGRTSGGPWIITSYDYDAPLDEFGQPNEPKFSLSSKFHQVLHAIESDLLNNQPPK 368
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVL-FRNISYELPRKSISILPDCKTV 324
+ L Q E + G+ +F+ N ++ + N +Y + S+ I+ + + +
Sbjct: 369 SPTFLSQFIEVHQY----GINLSFITNYGTSTTPKIIQWMNQTYTIQPWSVLIIYNNE-I 423
Query: 325 AFNTERV--STQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTL---------LRAEGLL 373
F+T + +T +N + + + + ++ + N ++ + + +
Sbjct: 424 LFDTSFIPPNTLFNNNTINNFKPINQNIIQSIFQISDFNLNSGGGGGDGDGNSVNSVSPI 483
Query: 374 DQISAAKDASDYFWY-----TFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSH 428
+Q+ KD SDY WY T YN L + +H F++ EY GSA
Sbjct: 484 EQLLITKDTSDYCWYSTNVTTTSLSYNEK-GNIFLTITEFYDYVHIFIDNEYQGSAFSP- 541
Query: 429 DNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKSFTNCSWG 487
++ N ++ T +LS+T+GL + + +E G+ + + ++ TN W
Sbjct: 542 -SLCQLQLNPIN-NSTTFQLQILSMTIGLENYASHMENYTRGILGSILIGSQNLTNNQWL 599
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPT------RQLTWYKTTFR-----APAGNDPI 536
+ GLIGE ++I++N N + W + S + + LTWYK +
Sbjct: 600 MKSGLIGENIKIFNN--DNTINWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVY 657
Query: 537 ALNLQSMGKGEAWVNGQSIGRYWV 560
AL++ SM KG WVNG SIGRYW+
Sbjct: 658 ALDMSSMNKGMIWVNGYSIGRYWL 681
>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
Length = 744
Score = 278 bits (711), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 205/728 (28%), Positives = 313/728 (42%), Gaps = 135/728 (18%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP ++ ++ GL+ ++TY+FWNLHE ++G DFSGR D++RF + Q++GL V LRIG
Sbjct: 33 MWPRILRHMRQSGLNTVETYIFWNLHERRRGVLDFSGRLDLVRFCRLAQAEGLNVILRIG 92
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +E YGGLP WL DV I R+DN+ +K
Sbjct: 93 PYICAETNYGGLPGWLRDVPDIRMRTDNEAFKREKARWVRLVAEVIRPLCAPNGGPVILA 152
Query: 93 -IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMC--------KQDDA---PGPV 140
IENEY I + E G Y+ W+ ++A G+PWV C + DA G
Sbjct: 153 QIENEYDNIAATYGEDGRRYLRWSVELAQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDS 212
Query: 141 INACNGMRC----GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
+ N R G+ F+ P +P++WTE+W +YQ WGG R +++A+ A F
Sbjct: 213 LETLNAFRAHEIIGQHFR--EHPEQPALWTENWAGWYQTWGGVLPKREPEELAYATARFF 270
Query: 197 AKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLC 256
A GS VNY+++HGGTNFGR + T Y PLDEYGL K HL L+ A+ C
Sbjct: 271 AAGGSGVNYFLWHGGTNFGRDGMYLLTTAYEFGGPLDEYGLP-TTKARHLARLNKALAAC 329
Query: 257 SRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSIS 316
+ +L + G+ F+ +SG+ F ++ R ++
Sbjct: 330 ADKILASERPRAITGERNGLLKFQYSSGLT--FWCDDVAR-----------------TVR 370
Query: 317 ILPDCKTVAFNTERVSTQYNKRSKTSNLKFDS-DEKWEEYREAILNFDNTLLRAEGLLDQ 375
I+ V +++ + K S ++F + E A + + A L+Q
Sbjct: 371 IVGKNGEVLYDSSARVAPVRRTWKASGVRFAPWGWRAEPLPAAWPAEAQSAVTARKPLEQ 430
Query: 376 ISAAKDASDYFWYTFRFHYNSS-------------------------------------- 397
+ KD +DY WY S
Sbjct: 431 LLLTKDETDYCWYETAIVVEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASE 490
Query: 398 ---NAQAPLDVQSHGHILHAFVNGEYTGSA-------HGSHDNVSFTLRNTVHLRQ---- 443
N L + I+H F++G + + G D FT + L+
Sbjct: 491 VPANTVNTLRLTRVADIVHVFIDGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRIT 550
Query: 444 -GTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTN-----CSWGYQVGLIGEKL 497
G + +LL +GL + + + + + F N W +Q GL+GE+
Sbjct: 551 PGKHRLSLLCCALGLIKGDWMIGYENMALEKKGLWAPVFWNGKKLEGEWRHQPGLLGERC 610
Query: 498 QIYSNLGLNKVLWSSIRSPT-----RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNG 552
+ + W + ++ T R L W++TTF P G+ P AL+L MGKG AW+NG
Sbjct: 611 GFADPAAGSLLAWKTAKAATGRGARRPLRWWRTTFTRPKGHGPWALDLGGMGKGMAWING 670
Query: 553 QSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG--NLLVL 610
IGRYW+ T P + ++T+ + YHVP +L+ G + LVL
Sbjct: 671 HCIGRYWLLADTDPMGPWMA-WMKGSLTAAPSSGPTQ--RYYHVPDDWLRTDGGPDTLVL 727
Query: 611 LEEENGNP 618
EE G+P
Sbjct: 728 FEELGGDP 735
>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
Length = 286
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/288 (49%), Positives = 171/288 (59%), Gaps = 34/288 (11%)
Query: 66 EWTYGGLPIWLHDVAGIVFRSDNKPYK-------------------------------IE 94
EW +GG P+WL V GI FR+DN+P+K IE
Sbjct: 1 EWNFGGFPVWLKFVPGISFRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQIE 60
Query: 95 NEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFK 154
NEY+ F G Y+ WAA+MA +TGVPWVMCK+ DAP PVIN CNG C +
Sbjct: 61 NEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKF-- 118
Query: 155 GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNF 214
PN P KP +WTE WT ++ +GG Y R +D+AF VA FI GS+VNYYMYHGGTNF
Sbjct: 119 SPNKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFIQAGGSFVNYYMYHGGTNF 178
Query: 215 GRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQL 273
GRTA IT YD AP+DEYGL+R PK+ HLKELH A+KLC LL V+SLG
Sbjct: 179 GRTAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYADPYVMSLGNY 238
Query: 274 QEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDC 321
++A VF TSG CAAFL N + + + V F + LP SISILPDC
Sbjct: 239 EQAHVFSSTSGGCAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286
>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 326
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 137/268 (51%), Positives = 167/268 (62%), Gaps = 34/268 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV+QTYVFWN HEP +GQY F R D++RF+K + GLYV LRIG
Sbjct: 58 MWPGLLQKAKDGGLDVVQTYVFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN P+K
Sbjct: 118 PYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPII 177
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
+ENEY +E PY WAAKMAV GVPWVMCKQDDAP PVIN CNG C
Sbjct: 178 LAQVENEYGPMESVMGAGAKPYANWAAKMAVATGAGVPWVMCKQDDAPDPVINTCNGFYC 237
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PNS +KP++WTE WT ++ +GG R +D+AF VA FI K GS+VNYYMYH
Sbjct: 238 --DYFSPNSNSKPTMWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYH 295
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYG 236
GGTNF RT+ F+ T Y AP+DEYG
Sbjct: 296 GGTNFDRTSGGPFIATSYDYDAPIDEYG 323
>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 342
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 143/289 (49%), Positives = 179/289 (61%), Gaps = 33/289 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP + QY F GR D++ FIK ++ GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVVQTYVFWNGHEPSRRQYYFEGRYDLVHFIKLVKQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------I 93
P++ +EW +GG P+WL V GI FR+DN+P+K I
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKNFTTKIVDMMKSEGLFEWQGGPIILSQI 178
Query: 94 ENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETF 153
ENE+ +E E Y WAA MAV +T VPWVMCK+DDAP P+IN CNG C +
Sbjct: 179 ENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDAPDPIINTCNGFYC--DW 236
Query: 154 KGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTN 213
PN P+KP++WTE WTS+Y +G R +D+A+ VA FI K GS+VNYYMYHGGTN
Sbjct: 237 FSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTN 296
Query: 214 FGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL 261
FGRTA F+ T Y AP+DEYG + +G + HA L PL+
Sbjct: 297 FGRTAGGPFIATSYDYDAPIDEYGELNTFYFG---KRHALYSLHQPPLM 342
>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
Length = 317
Score = 271 bits (693), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 196/315 (62%), Gaps = 12/315 (3%)
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQ-----DKSFTNCSWGYQVGLI 493
+ L GTND ALLSV VGLP+SG ERK+AG+ V ++ + + W YQ+GL+
Sbjct: 6 ISLIPGTNDIALLSVMVGLPNSGGHFERKIAGISTVTLRGFKDGTRDLSQELWTYQIGLL 65
Query: 494 GEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQ 553
GE IYS++G V W+S +P LTWYK P G++P+ L+L SMGKG+AW+NG+
Sbjct: 66 GEMSTIYSDVGFISVNWTSSSTPNPPLTWYKAVIDVPDGDEPVILDLSSMGKGQAWINGE 125
Query: 554 SIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAI---IKATNTYHVPRAFLKPTGNLLVL 610
IGRYW+SF G+ S+ Y N S+H CA + YHVPR++L+PTGNLLVL
Sbjct: 126 HIGRYWISFLAPLGDCSKCDYRGN--YSLHKCATNCGQPSQTLYHVPRSWLRPTGNLLVL 183
Query: 611 LEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSC 670
EE G+P +++ T +I VC H +H P + SW + + ++++ + +P++Q C
Sbjct: 184 FEETGGDPSKVSLLTRSIDSVCAHAFETHPPSIQSW--QKTKVNSEVLRENVEPSLQLDC 241
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD 730
+G++IS I FASFGNP G C + G+CHS S+ VE+AC+G+ CSI + FGGD
Sbjct: 242 SVGRRISSIKFASFGNPKGVCGNFMKGTCHSVESEKAVEKACLGQHGCSITNSPKEFGGD 301
Query: 731 PCPGIHKALLVDAQC 745
C G K+L V+A C
Sbjct: 302 ACVGTVKSLAVEATC 316
>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
gi|194699714|gb|ACF83941.1| unknown [Zea mays]
gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 346
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 143/293 (48%), Positives = 179/293 (61%), Gaps = 37/293 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP + QY F GR D++ FIK ++ GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVVQTYVFWNGHEPSRRQYYFEGRYDLVHFIKLVKQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI FR+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQNFTTKIVDMMKSEGLFEWQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MAV +T VPWVMCK+DDAP P+IN CNG C
Sbjct: 179 LSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDAPDPIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WTS+Y +G R +D+A+ VA FI K GS+VNYYMYH
Sbjct: 239 --DWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYH 296
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL 261
GGTNFGRTA F+ T Y AP+DEYG + +G + HA L PL+
Sbjct: 297 GGTNFGRTAGGPFIATSYDYDAPIDEYGELNTFYFG---KRHALYSLHQPPLM 346
>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
Length = 735
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 201/678 (29%), Positives = 321/678 (47%), Gaps = 89/678 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L++KAKE GL+ IQTYVFWN+HE ++G YDFSGR ++ F++E + GL+V LR+G
Sbjct: 64 MWPYLMSKAKEQGLNTIQTYVFWNMHEQKRGTYDFSGRANLSLFLQEAANAGLFVNLRLG 123
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE----------------------YQ 98
P++ +EW YG LP+WL+++ I FRS N +K E +
Sbjct: 124 PYVCAEWDYGALPVWLNNIPNIAFRSSNDAWKSEMKRFLSDIIVYVDGFLAKNGGPIILA 183
Query: 99 TIEPAFHEKGPPYVLWAAKMAV-DF-HTGVPWVMCKQDDAPGPVINACNGMRCGE----T 152
IE + YV W + DF T +PW+MC A I CNG C +
Sbjct: 184 QIENEYGGNDRAYVDWCGSLVSNDFASTQIPWIMCN-GLAANSTIETCNGCNCFDDGWMD 242
Query: 153 FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 212
PN+P ++TE+W ++Q WG IR+ +D+A+ VA + A G+Y YYM+HGG
Sbjct: 243 RHRRTYPNQPLLFTENW-GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGN 301
Query: 213 NFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI---- 268
++GRT + + T Y D L G EPK+ HL L + ++ LL+ +
Sbjct: 302 HYGRTGGSGLTTAYSDDVILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSARLPIPY 361
Query: 269 ------SLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCK 322
S+G Q + + + F++ N ++ VLF + + +S+ I + +
Sbjct: 362 WDGKQWSVGTQQMVYSYPPS----IQFVI-NQAAFSLFVLFNKQNISIAGQSVQIYDNNE 416
Query: 323 TVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
+ +N+ VS + + + + W+ Y E L+ D ++ A L+Q++ D
Sbjct: 417 HLLWNSADVSGIFRNNTFLVPIVVGPLD-WQVYSEPFLS-DLPVIVASTPLEQLNLTNDE 474
Query: 383 SDYFWYTFRFHYNSSNAQAPLDVQS-HGHILHAFVNGEYTG------SAHGSHDNVSFTL 435
+ Y WY + +AQ + VQ+ + L F++ ++ G A G+ NV+ TL
Sbjct: 475 TIYLWYRRNVSLSQPSAQTIVQVQTRRANSLIFFMDRQFVGYFDDHSHAQGT-INVNITL 533
Query: 436 RNTVHLRQGTNDGALLSVTVGLPD----SGAFLERKVAGVHRVRVQDKSFTNCS-WGYQV 490
+ L +LSV++G+ + G+F + + G + Q S W +Q
Sbjct: 534 NLSQFLPNQQYLFEILSVSLGIDNFNIGPGSFEYKGIVGNVSLGGQSLVGDEASIWEHQK 593
Query: 491 GLIGEKLQIYSNLGLNKVLWSS--IRSPTRQLTWYKTTF------RAPAGNDPIALNLQS 542
GL GE QIY+ G V W+ + + +TW++T F R +P+ L+
Sbjct: 594 GLFGEAYQIYTEQGSKTVEWNPRWTTAINKSVTWFQTRFDLNHLVREDLNANPVLLDAFG 653
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-------YH 595
+ +G A+VNG IG YW+ T + C + TN YH
Sbjct: 654 LNRGHAFVNGNDIGLYWLIEGTCQNKLC--------------CCLQNQTNCQQPSQRYYH 699
Query: 596 VPRAFLKPTGNLLVLLEE 613
+P +LKPT NLL + EE
Sbjct: 700 IPSDWLKPTNNLLTVFEE 717
>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
Length = 759
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 212/689 (30%), Positives = 324/689 (47%), Gaps = 105/689 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQ-YDFSGRNDIIRFIKEIQSQGLYVCLRI 59
MWPSLI K+K+ G+++I+TYVFWNLH+P Q Y+F G +I F+ Q +GLYV LRI
Sbjct: 76 MWPSLIKKSKDAGINMIETYVFWNLHQPNNSQEYNFEGNANITHFLDLCQQEGLYVHLRI 135
Query: 60 GPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY---------------------------- 91
GP++ +EW YGG+P WL ++ GIVFR N+P+
Sbjct: 136 GPYVCAEWNYGGIPSWLRNIPGIVFRDYNQPWMTEMASWMTFIVNYLKPYFASNGGPIIL 195
Query: 92 -KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
++ENEY +E + + G Y WA A + G+PW MC+Q+D IN CNG C
Sbjct: 196 AQVENEYGWLENEYGDSGKLYAEWAISFAKSLNIGIPWTMCQQNDIDD-AINTCNGFYCH 254
Query: 151 E--TFKGPNSPNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ + PN+P+ +TE+W + Q + G P+ R +D+ + VA + ++ GS +NYYM
Sbjct: 255 DWIQYHFQVYPNQPAFFTENWAGWIQYYSEGVPH-RPTEDLLYSVARWFSRGGSLMNYYM 313
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG---- 263
+HGGT F R ++ F+ Y A LDEYG EPK+ L +LH+ + S LL+
Sbjct: 314 WHGGTTFARYSSTFLTNSYDYDAALDEYGYEAEPKYSALAQLHSVLSQYSYILLSSGEVA 373
Query: 264 ---------TQNVISLGQLQ-------EAFVFEETSGVCAAFLVN-NDERKAVTVLFRNI 306
T N I + Q E F GV ++ V N + +TV
Sbjct: 374 RPVNISNITTCNTIEIIQYNTTINGTLETITFVTNFGVSSSAPVQLNWNGQTITV----- 428
Query: 307 SYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAI--LNFDN 364
S+ IL + +TV +T V QY+ + + K + + E I N+ N
Sbjct: 429 ----NPWSVLILYNNQTV-IDTSYVKQQYSAQKEFYQSKRVKNVLVSSWTEPIGVGNYSN 483
Query: 365 TLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSA 424
++ A +Q+ D +DY NA +++ +++GEY +
Sbjct: 484 -VVTANLPSEQLDLTLDQTDYL----------CNAD---------DMIYIYIDGEYQSWS 523
Query: 425 HGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVH-RVRVQDKSFTN 483
GS F L + GT+ ++LS+T+GL G+ E G++ V + + TN
Sbjct: 524 RGSP--AHFVLDTKFGI--GTHKLSILSLTMGLISYGSHFESYKRGLNGTVTLGTQDITN 579
Query: 484 CSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPA---GNDPIALNL 540
W + L+GE I SN L ++ S + LTWYK + AL++
Sbjct: 580 NGWSMRPYLVGEMQGIQSNPHLTSWSINNELSINQPLTWYKLNLIIQSEIQDTSSFALDM 639
Query: 541 QSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAI---IKATNTYHVP 597
M KG VNG SIGRYW++ G S Y + + C + YHVP
Sbjct: 640 IGMNKGFIIVNGNSIGRYWLTLGWGCG--SGCNYTGDGYQG-YLCRTGCGEPSERYYHVP 696
Query: 598 R--AFLKPTG-NLLVLLEEENGNPLGITV 623
+L+P N +++ EE +G+P I +
Sbjct: 697 NDYLYLEPNQLNEIIVFEELSGDPNSIQL 725
>gi|293334807|ref|NP_001170541.1| uncharacterized protein LOC100384558 [Zea mays]
gi|238005922|gb|ACR33996.1| unknown [Zea mays]
Length = 345
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/349 (40%), Positives = 204/349 (58%), Gaps = 29/349 (8%)
Query: 403 LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGA 462
L+V SHGH AFVN ++ G HG+ N +FTL + L++G N A+L+ T+G+ DSGA
Sbjct: 11 LEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGA 70
Query: 463 FLERKVAGVHRVRVQDKS-----FTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT 517
+LE ++AGV RV+++ + TN WG+ VGL+GE+ QIY++ G+ V W +
Sbjct: 71 YLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAVN-D 129
Query: 518 RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVN 577
R LTWYK F P+G DPI L++ +MGKG +VNGQ IGRYW+S+K + G PSQ
Sbjct: 130 RPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISYKHALGRPSQ------ 183
Query: 578 TVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
YH+PR+FL+ N+LVL EEE G P I + T+ +C ++
Sbjct: 184 --------------QLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVKRDNICTFISE 229
Query: 638 SHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVG 697
+ + SW R+ + KP +C K I ++VFAS+GNP G C Y +G
Sbjct: 230 RNPAHIKSW--ERKDSQITVTAADLKPRATLTCSPKKLIQQVVFASYGNPMGICGNYTIG 287
Query: 698 SCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDP-CPGIHKALLVDAQC 745
SCH+ ++ +VE+AC+GK C++P+ + +GGD CPG L V A+C
Sbjct: 288 SCHTPRAKELVEKACLGKRICTLPVSADVYGGDVNCPGTTATLAVQAKC 336
>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
Length = 346
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 142/293 (48%), Positives = 178/293 (60%), Gaps = 37/293 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLDV+QTYVFWN HEP + QY F GR D++ FIK ++ GLYV LRIG
Sbjct: 59 MWPDLIQKAKDGGLDVVQTYVFWNGHEPSRRQYYFEGRYDLVHFIKLVKQAGLYVHLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL V GI R+DN+P+K
Sbjct: 119 PYVCAEWNFGGFPVWLKYVPGISLRTDNEPFKAEMQNFTTKIVDMMKSEGLFEWQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENE+ +E E Y WAA MAV +T VPWVMCK+DDAP P+IN CNG C
Sbjct: 179 LSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDAPDPIINTCNGFYC 238
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ PN P+KP++WTE WTS+Y +G R +D+A+ VA FI K GS+VNYYMYH
Sbjct: 239 --DWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYH 296
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL 261
GGTNFGRTA F+ T Y AP+DEYG + +G + HA L PL+
Sbjct: 297 GGTNFGRTAGGPFIATSYDYDAPIDEYGELNTFYFG---KRHALYSLHQPPLM 346
>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
Length = 825
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 208/700 (29%), Positives = 325/700 (46%), Gaps = 117/700 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +L+ AK GL+ I+ YVFWNLHE ++G ++F+G + RF + GL++ +R GP
Sbjct: 118 WETLLRAAKRDGLNHIEMYVFWNLHEQERGVFNFAGNANATRFYELAAEVGLFLHVRFGP 177
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE----------------------YQT 99
++ +EW+ GGLP+WL+ + G+ RS N P++ E E
Sbjct: 178 YVCAEWSNGGLPLWLNWIPGMKVRSSNAPWQWEMERFVTYMVELSRPFLAKNGGPIIMAQ 237
Query: 100 IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE--TFKGPN 157
IE F P YV W + T +PWVMC + A ++ +CNG C +
Sbjct: 238 IENEFAMHDPEYVEWCGDLVKRLDTSIPWVMCYANAAENTIL-SCNGNDCVDFAVKHVKE 296
Query: 158 SPNKPSIWTEDWTSFYQVWGGKPY------IRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
P+ P +WTED ++Q W R+A+D+A+ VA + A G+ NYYMYHGG
Sbjct: 297 RPSDPLVWTED-EGWFQTWAKDKKNPLPNDQRTAEDMAYAVARWFAVGGAAHNYYMYHGG 355
Query: 212 TNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLG 271
NFGR A+A + T Y D L GL EPK HL++LH A+ C+ L+ + ++
Sbjct: 356 NNFGRAASAGVTTKYADGVNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDRQLLHPH 415
Query: 272 QL--------------QEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISI 317
+L Q AF++ G + N K VTV+FR+ YEL S+ I
Sbjct: 416 ELAPTHGETAEASSLQQRAFIYGAEDGPNQVAFLENQADKKVTVVFRDNKYELAPTSMMI 475
Query: 318 LPDCKTVAFNTERVSTQYN---KRSKTSNLKFDSDEKWEEYREAILNFDNTLLR----AE 370
+ D + FNT V + R+ T ++ + +WE + E LN + R AE
Sbjct: 476 IKD-GALLFNTADVRKSFPGTVHRAYTPIVQ-AATLQWETWSE--LNVSSLTPRRRVVAE 531
Query: 371 GLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILH----------AFVNGEY 420
++Q+ D SDY Y F + A P+D+ S + AFV+G
Sbjct: 532 RPVEQLRLTADRSDYLTYETTFTVDP--ADTPIDIDSDASTVKVTSCEASSIIAFVDGWL 589
Query: 421 TGSAHGSH------DNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRV 474
G + ++ F+L + + + + L+SV++G+ G+ + + G +V
Sbjct: 590 IGERNLAYPGGNCSKEFRFSLPTNIDVTR-QHSLKLVSVSLGIYSLGSNHTKGLTG--KV 646
Query: 475 RVQDKSFTNC-SWGYQVGLIGEKLQIYSNLGLNKVLWS---SIRSPTRQL-TWYKTT--- 526
RV K+ W L+GE+L+IY L+ V W+ + + RQL +WY T+
Sbjct: 647 RVGRKNLAKGHQWEMYPTLVGEQLEIYRPEWLSSVPWTPVPRVVASGRQLMSWYWTSFSY 706
Query: 527 --FRAPAGNDPIA------LNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNT 578
F PA DP++ L+ + +G A++NG +GRYW+ +G Q
Sbjct: 707 PAFELPAEADPVSEPFSILLDCIGLTRGRAYINGHDLGRYWLV--NDEGEFVQ------- 757
Query: 579 VTSIHFCAIIKATNTYHVPRAFL-KPTGNLLVLLEEENGN 617
YHVPR +L K N+LV+ +E G+
Sbjct: 758 -------------RYYHVPRDWLVKDQANVLVVFDELGGS 784
>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
Length = 285
Score = 266 bits (680), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 144/288 (50%), Positives = 174/288 (60%), Gaps = 35/288 (12%)
Query: 66 EWTYGGLPIWLHDVAGIVFRSDNKPYK-------------------------------IE 94
EW +GG P+WL V GI FR+DN P+K IE
Sbjct: 1 EWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQIE 60
Query: 95 NEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFK 154
NEY IE G Y WAA+MAV TGVPW+MCKQ+DAP P+I+ CNG C E F
Sbjct: 61 NEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYC-ENFM 119
Query: 155 GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNF 214
PN+ KP ++TE WT +Y +GG R A+D+A+ VA FI GS++NYYMYHGGTNF
Sbjct: 120 -PNANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNF 178
Query: 215 GRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQL 273
GRTA F+ T Y APLDEYGL REPKWGHL++LH IKLC L++ V SLG
Sbjct: 179 GRTAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSN 238
Query: 274 QEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDC 321
QEA VF T CAAFL N D + +V V F+N+ Y+LP S+SILPDC
Sbjct: 239 QEAHVF-WTKTSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285
>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
Length = 743
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 208/734 (28%), Positives = 309/734 (42%), Gaps = 148/734 (20%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP ++ ++ GL+ ++TY+FWNLHE ++G DFSGR D++RF + Q++GL V LRIG
Sbjct: 33 MWPRILRHMRQSGLNTVETYIFWNLHERRRGVLDFSGRLDLVRFCRLAQAEGLNVILRIG 92
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I +E YGGLP WL DV I R+DN+ +K
Sbjct: 93 PYICAETNYGGLPGWLRDVPDIRMRTDNEAFKREKARWVRLVAEVIRPLCAPNGGPVILA 152
Query: 93 -IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMC--------KQDDA---PGPV 140
IENEY I + E G Y+ W+ ++A G+PWV C + DA G
Sbjct: 153 QIENEYDNIAATYGEDGRRYLRWSVELAQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDS 212
Query: 141 INACNGMRC----GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
+ N R G+ F+ P +P++WTE+W +YQ WGG R +++A+ A F
Sbjct: 213 LETLNAFRAHEIIGQHFR--EHPEQPALWTENWAGWYQTWGGVLPKREPEELAYATARFF 270
Query: 197 AKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLC 256
A GS VNY+++HGGTNFGR + T Y PLDEYGL K H A
Sbjct: 271 AAGGSGVNYFLWHGGTNFGRDGMYLLTTAYEFGGPLDEYGLP------TTKARHLARLNA 324
Query: 257 SRPLLTGTQNVISLGQLQEAFVFEETSGVC------AAFLVNNDERKAVTVLFRNISYEL 310
+ G L + V E++SGV V +D +AV ++
Sbjct: 325 ALAACAG-----ELLASERPGVVEKSSGVVEYHYDSGLVFVCDDTARAVRIV-------- 371
Query: 311 PRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDS-DEKWEEYREAILNFDNTLLRA 369
+KS +L D R K+S ++F + E A + + A
Sbjct: 372 -KKSGEVLYDSSVRVAPVRRA-------WKSSGVRFAPWGWRAEPLPAAWPAEAQSAVTA 423
Query: 370 EGLLDQISAAKDASDYFWYTFRFHYNSS-------------------------------- 397
L+Q+ KD +DY WY S
Sbjct: 424 RKPLEQLLPTKDETDYCWYETAIVVEGSGDVLVAGRDGSPAGLERGALARVGRRGRRPSI 483
Query: 398 ---------NAQAPLDVQSHGHILHAFVNGEYTGSA-------HGSHDNVSFTLRNTVHL 441
N L + I+H F++G + + G D FT + L
Sbjct: 484 AGLASEVPANTVNTLRLTRVADIVHVFIDGTFVATTPTPLRERRGKMDAGLFTQTFELDL 543
Query: 442 RQ-----GTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTN-----CSWGYQVG 491
+ G + +LL +GL + + + + + F N W +Q G
Sbjct: 544 KALRITPGKHRLSLLCCALGLIKGDWMIGYENMALEKKGLWAPVFWNGKKLEGEWRHQPG 603
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPT-----RQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
L+GE+ + + W + ++ T R L W++TTF P G+ P AL+L MGKG
Sbjct: 604 LLGERCGFADPAAGSLLAWKTAKAATGRGARRPLNWWRTTFTRPKGHGPWALDLGGMGKG 663
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG- 605
W+NG IGRYW+ T P + ++T+ + YHVP +L+ G
Sbjct: 664 FCWINGHCIGRYWLLPDTDPMGPWMA-WMKGSLTAAPSGGPTQ--RYYHVPDDWLRTDGG 720
Query: 606 -NLLVLLEEENGNP 618
+ LVL EE G+P
Sbjct: 721 PDTLVLFEELGGDP 734
>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
Length = 735
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 201/677 (29%), Positives = 316/677 (46%), Gaps = 87/677 (12%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L++KAKE GL+ IQTYVFWN+HE ++G YDFSGR ++ F++E + GL+V LR+G
Sbjct: 64 MWPYLMSKAKEQGLNTIQTYVFWNIHEQKRGTYDFSGRANLSLFLQEAANAGLFVNLRLG 123
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE----------------------YQ 98
P++ +EW YG LP+WL+++ I FRS N +K E +
Sbjct: 124 PYVCAEWDYGALPVWLNNIPNIAFRSSNDAWKSEMKRFLSDIIVYVDGFLAKNGGPIILA 183
Query: 99 TIEPAFHEKGPPYVLWAAKMAV-DF-HTGVPWVMCKQDDAPGPVINACNGMRCGE----T 152
IE + YV W + DF T +PW+MC A I CNG C +
Sbjct: 184 QIENEYGGNDRAYVDWCGSLVSNDFASTQIPWIMCN-GLAANSTIETCNGCNCFDDGWMD 242
Query: 153 FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 212
PN+P ++TE+W ++Q WG IR+ +D+A+ VA + A G+Y YYM+HGG
Sbjct: 243 RHRRTYPNQPLLFTENW-GWFQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGN 301
Query: 213 NFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISL-- 270
++GRT + + T Y D L G EPK+ HL L + ++ LL+ N +S+
Sbjct: 302 HYGRTGGSGLTTAYSDDVILRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSNRLSIPY 361
Query: 271 --------GQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCK 322
G Q + + + F++ N ++ VLF + + +S+ I +
Sbjct: 362 WNGKQWTVGTQQMVYSYPPS----VQFVI-NQAAFSLFVLFNKQNISIAGQSVQIYDYNE 416
Query: 323 TVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
+ +N+ VS + + + W+ Y E + D ++ A L+Q++ D
Sbjct: 417 HLLWNSADVSGISRNNTFLVPIVVGPLD-WQVYSEPFTS-DLPVIVASTPLEQLNLTNDE 474
Query: 383 SDYFWYTFRFHYNSSNAQAPLDVQS-HGHILHAFVNGEYTGSAHG-SHD----NVSFTLR 436
+ Y WY + + Q + VQ+ + L F++ ++ G SH NV+ TL
Sbjct: 475 TIYLWYRRNVSLSQPSVQTIVQVQTRRANSLLFFMDRQFVGYFDDHSHTQGTINVNITLN 534
Query: 437 NTVHLRQGTNDGALLSVTVGLPD----SGAFLERKVAGVHRVRVQDKSFTNCS-WGYQVG 491
+ L +LSV++G+ + G+F + + G + Q S W +Q G
Sbjct: 535 LSQFLPNQQYIFEILSVSLGIDNFNIGPGSFEYKGIVGNVSLGGQSLVGDEASIWEHQKG 594
Query: 492 LIGEKLQIYSNLGLNKVLWSSIRSPT--RQLTWYKTTF------RAPAGNDPIALNLQSM 543
L GE QIY+ G V W+ + + +TW++T F R +PI L+
Sbjct: 595 LFGEAHQIYTEQGSKTVEWNPKWTTVINKPVTWFQTRFDLNHLAREDLNANPILLDAFGF 654
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-------YHV 596
+G A+VNG IG YW+ T + N C + TN YH+
Sbjct: 655 NRGHAFVNGNDIGLYWLIEGTCQNNLC--------------CCLQNQTNCQQPSQRYYHI 700
Query: 597 PRAFLKPTGNLLVLLEE 613
+LKPT NLL + EE
Sbjct: 701 SSDWLKPTNNLLTVFEE 717
>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
Length = 811
Score = 258 bits (660), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 194/646 (30%), Positives = 311/646 (48%), Gaps = 87/646 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W SL+AKAKE GL+++Q Y+FWN HEP++G + F+ R ++ F + + + GL+V LR GP
Sbjct: 130 WDSLLAKAKEDGLNLVQLYIFWNFHEPRRGSFYFADRGNLTHFFERVVAHGLFVHLRFGP 189
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE----------------------YQT 99
++ +EW GGLP+WL + G+ RS+++ ++ E
Sbjct: 190 YVCAEWNRGGLPLWLDRIPGMKVRSNSESWRQEMNRIILIMINLARPYFSVNGGPIIMAQ 249
Query: 100 IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS- 158
IE ++ P YV W +++ G+PW MC A I+ CN C + F N+
Sbjct: 250 IENEYNGHDPTYVAWLSQLVRKLGIGIPWTMCNGASAVN-TISTCNDNDCFQ-FAEKNAK 307
Query: 159 --PNKPSIWTEDWTSFYQVWG-------GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
P++P +WTE+ ++Y+ W G+ RS + +A+ VA + A G+ NYYMYH
Sbjct: 308 VFPSQPLVWTEN-EAWYEKWATKNIAQDGQNDQRSPEQVAYVVARWFAVGGAMHNYYMYH 366
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GG NFGRTA+A + T Y D A L GL EPK HL++LH + C++ LL+ + +
Sbjct: 367 GGNNFGRTASAGVTTMYADGAILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNERQLNH 426
Query: 270 LGQL---------QEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPD 320
L Q A+++ G C +FL N ++ Y LP ++I IL D
Sbjct: 427 AKPLGPEGKNAYTQRAYIY----GNC-SFLENTHAIHRACFRYQLKEYCLPPQTIVIL-D 480
Query: 321 CKTVAFNTERVSTQYNKRSKTSN---LKF-DSDEK-WEEYREAILNFDNTLLRAEGLLDQ 375
V +NT VS RS S ++F SD K W E+ N + ++ + L+Q
Sbjct: 481 HNNVLYNTSDVSGTLGSRSTRSFSPLIRFRKSDWKIWSEWDVNPHNVRDQIVN-DSPLEQ 539
Query: 376 ISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILH----------AFVNGEYTGSAH 425
+ +D +DY Y + S+ P + IL F+NGE+ G H
Sbjct: 540 LLVTQDTTDYLMYQNEVRWGSN---GPTKNKMKSSILKFISCDANSFLVFINGEFIGEQH 596
Query: 426 GSH--DNVSFTLRNTVHL--RQGTN-DGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKS 480
++ D+ S R + + G N ++LS+++G+ G ++ + V V++ ++S
Sbjct: 597 LAYPGDDCSNIFRFDLGPLGKYGANLTLSILSISLGIHSLGEKHQKGI--VSDVQIDERS 654
Query: 481 FT---NCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT---RQLTWYKTTFRAPA--- 531
+ W GLIGE L++Y + N V W ++ T R WY T F
Sbjct: 655 LVYGPHERWVMFSGLIGELLKLYDPMWSNSVPWRNLNVQTDRKRTSKWYMTKFVLKQLDW 714
Query: 532 -GNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAV 576
+ L+ + M +G ++NG +GRYW+ + S G Q Y +
Sbjct: 715 DTETSVLLDCKGMNRGRIYLNGHDLGRYWL-IRRSDGAYVQRYYTI 759
>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
Length = 721
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 186/677 (27%), Positives = 311/677 (45%), Gaps = 78/677 (11%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW +++ +A E GL++IQ Y FWNLHEP KGQY++ G DI F+++ +GL+V +RIG
Sbjct: 65 MWDTILDQAVEDGLNLIQIYTFWNLHEPVKGQYNWEGIADIRLFLQKCADRGLFVNMRIG 124
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-NEYQTI-----EPAFHEKGPP---- 110
P++ +EW GG+P+W++ + G+ R++N +K E ++ + F ++G P
Sbjct: 125 PYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTRDFFADRGGPIIFS 184
Query: 111 ------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFK---- 154
Y+ W + A VPW+MC D + INACNG C +
Sbjct: 185 QIENELWGGAREYIDWCGEFAESLELNVPWMMCNGDTSE-KTINACNGNDCSSYLESHGQ 243
Query: 155 -GPNSPNKPSIWTEDWTSFYQVWGGKPY---------IRSAQDIAFHVALFIAKNGSYVN 204
G ++P WTE+ ++Q+ G RSA+D F+V F+ + GSY N
Sbjct: 244 SGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGGSYHN 302
Query: 205 YYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGT 264
YYM+ GG ++G+ A M Y + + L EPK H ++H + + LL
Sbjct: 303 YYMWFGGNHYGKWAGNGMTNWYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLLNDK 362
Query: 265 QNVISLGQL--QEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCK 322
V + L FE G V N++ A V++R+I YELP S+ +L +
Sbjct: 363 AQVNNQKHLNCDNCNAFEYRYGDRLVSFVENNKGSADKVIYRDIVYELPAWSMIVLDEYD 422
Query: 323 TVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
V F T V R K + E W E + ++ + +Q++ +D
Sbjct: 423 NVLFETNNVKPVNKHRVYHCEEKLEF-EYWNEPVSTLSQEAPRVVVSPKANEQLNMTRDL 481
Query: 383 SDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGS--AHGSHDNVSFTLRNTVH 440
+++ +Y + + + + A+V+ + GS H HD T+ +
Sbjct: 482 TEFLYYETEVEFPQDECTLSIG-GTDANAFVAYVDDHFVGSDDEHTHHDGWH-TMNINMK 539
Query: 441 LRQGTNDGALLSVTVGLPD------SGAFLERKVAGV-HRVRVQDKSFTNCSWGYQVGLI 493
+G + LLS ++G+ + ++ ++ G+ +++ N W + GL+
Sbjct: 540 SGKGKHKLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWIKLCGNDIFNQEWKHYPGLV 599
Query: 494 GEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAG---NDPIALNLQSMGKGEAWV 550
GE Q++++ G+ V W S L WY++TF+ P G + L + M +G+A+V
Sbjct: 600 GEAKQVFTDEGMKTVTWKSDVENADNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRGQAYV 659
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG--NLL 608
NG +IGRYW+ GN TQ YH+P+ +LK G N+L
Sbjct: 660 NGHNIGRYWM---IKDGNGEYTQ------------------GYYHIPKDWLKGEGEENVL 698
Query: 609 VLLEEENGNPLGITVDT 625
VL E + +T+ T
Sbjct: 699 VLGETLGASDPSVTICT 715
>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 707
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 176/567 (31%), Positives = 272/567 (47%), Gaps = 55/567 (9%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W ++A +K G+++I TYVFW+LHEPQ+G Y+F G ++ F+ Q GL+V LRIG
Sbjct: 138 IWKKVLALSKNSGINMIDTYVFWDLHEPQRGVYNFEGNANLKHFLDLCQQNGLFVNLRIG 197
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I +EW YGGLPIWL D+ GI R N Y
Sbjct: 198 PYICAEWNYGGLPIWLKDIPGIKMRDFNTQYMEEVERWMKFIVDYLHGYFAPQGGPIVLA 257
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENEY ++ + E G + W A +A G+PW+MC+QDD P VIN CNG C E
Sbjct: 258 QIENEYNWVQWRYQESGRKFAHWCADLANRLDIGIPWIMCQQDDIPT-VINTCNGYYCHE 316
Query: 152 --TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F N ++P ++TE+W+ ++ W R D+ + A + A G+ +NYYM+H
Sbjct: 317 WINFHWNNFKDQPPLFTENWSGWFNNWVNAVRHRPVADLLYSAARWFASGGALMNYYMWH 376
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGR + + Y APL+EYG R PK+ ++ + I LL+
Sbjct: 377 GGTNFGRKSGPMIALSYDYDAPLNEYGNPRNPKYSQTRDFNKLILSLEDILLSQYPPTPI 436
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTE 329
+ + A+F++N++E V+F SY S+ IL + +V F++
Sbjct: 437 FLANNISVIHYRNGNNSASFIINSNENGNSKVMFEGRSYFSYAYSVQILKNYVSV-FDSS 495
Query: 330 RVSTQYNKRSKTS--NLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFW 387
+ Y S N+ F ++ ++ E +F+ +L L++Q++ KD +DY W
Sbjct: 496 QNPRNYTDTVVESEPNIPF-ANSIISKHVER-FDFEESLYDNR-LMEQLNLTKDETDYIW 552
Query: 388 YTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTND 447
YT +++ L V + I+H FV+ Y G+ D+++ T + G +
Sbjct: 553 YTTMINHDQDGEI--LKVINKTDIVHVFVDSYYVGTIMS--DSLAIT-----GVPLGPST 603
Query: 448 GALLSVTVGLPDSGAFLERKVAGV-HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLN 506
LL +G+ +E AG+ V D TN WG + + EK+ I +
Sbjct: 604 LQLLHTKMGIQHYELHMENTKAGILGPVYYGDIEITNQMWGSKPFVSSEKV-ITDPIQSK 662
Query: 507 KVLWSSI-RSPTR-----QLTWYKTTF 527
V WS + R P LTWYK F
Sbjct: 663 FVRWSPLDRKPNEVFYSVPLTWYKFIF 689
>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
Length = 263
Score = 249 bits (635), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 132/241 (54%), Positives = 157/241 (65%), Gaps = 7/241 (2%)
Query: 83 VFRSDNKPY---KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGP 139
+F+S P +IENE+ +E G Y WAA+MAV +TGVPW+MCKQ+DAP P
Sbjct: 26 LFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAARMAVGLNTGVPWIMCKQEDAPDP 85
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
VI+ CNG C E F PN KP +WTE WT +Y +GG R A+D+AF +A I K
Sbjct: 86 VIDTCNGFYC-ENFT-PNKNYKPKMWTEVWTGWYTEFGGAVPTRPAEDLAFSIARLIQKG 143
Query: 200 GSYVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
GS+VNYYMYHGGTNFGRTA FM T Y APLDEYGL REPKWGHL++LH AIK
Sbjct: 144 GSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSES 203
Query: 259 PLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISIL 318
L++ +V SLG QEA VF+ SG CAAFL N D + + V F N YELP SISIL
Sbjct: 204 ALVSAEPSVTSLGNSQEAHVFKSKSG-CAAFLANYDTKSSAKVSFGNGQYELPPWSISIL 262
Query: 319 P 319
P
Sbjct: 263 P 263
>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
Length = 362
Score = 248 bits (632), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/363 (39%), Positives = 202/363 (55%), Gaps = 21/363 (5%)
Query: 267 VISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAF 326
V SLG QE VF SG CAAFL N D + V F+N+ YELP SISILPDCKT F
Sbjct: 2 VTSLGNNQEVHVFNPKSGSCAAFLANYDTTSSAKVNFQNMQYELPPWSISILPDCKTAVF 61
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEY-REAILNFDNTLLRAEGLLDQISAAKDASDY 385
NT R+ Q + + T F W+ Y E+ + D+ +GL +Q++ +DASDY
Sbjct: 62 NTARLGAQSSLKQMTPVSTF----SWQSYIEESASSSDDKTFTTDGLWEQLNVTRDASDY 117
Query: 386 FWYTFRFHYNSS-----NAQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WY + +S+ N Q P L + S GH LH F+NG+ +G+ +G DN T V
Sbjct: 118 LWYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNV 177
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLI 493
+R G N +LLS++VGL + G E+ GV + + + W Y++GL
Sbjct: 178 KMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTRDLSKQQWSYKIGLK 237
Query: 494 GEKLQIYSNLGLNKVLW--SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVN 551
GE L +++ G + V W S + + LTWYKTTF APAGN+P+AL++ +MGKG W+N
Sbjct: 238 GEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWIN 297
Query: 552 GQSIGRYWVSFKTSKGNPSQTQYA-VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVL 610
QSIGR+W + + G+ + YA T H + YHVPR++L PTGNLLV+
Sbjct: 298 SQSIGRHWPGY-IAHGSCGECNYAGTYTDKKCHTNCGQPSQRWYHVPRSWLNPTGNLLVV 356
Query: 611 LEE 613
L+
Sbjct: 357 LKR 359
>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
Length = 263
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/241 (54%), Positives = 155/241 (64%), Gaps = 7/241 (2%)
Query: 83 VFRSDNKPY---KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGP 139
+F+S P +IENE+ +E G Y WAA+MAV +TGVPW+MCKQ+DAP P
Sbjct: 26 LFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAARMAVGLNTGVPWIMCKQEDAPDP 85
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
VI+ CNG C E F PN KP +WTE WT +Y +GG R A+D+AF +A FI K
Sbjct: 86 VIDTCNGFYC-ENFT-PNKNYKPKMWTEVWTGWYTEFGGAVPTRPAEDLAFSIARFIQKG 143
Query: 200 GSYVNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
GS VNYYMYHGGTNFGRTA FM T Y APLDEYGL REPKWGHL+ LH AIK
Sbjct: 144 GSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRNLHKAIKSSES 203
Query: 259 PLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISIL 318
L++ +V SLG QEA F+ SG CAAFL N D + + V F N YELP SISIL
Sbjct: 204 ALVSAEPSVTSLGNSQEAHAFKSKSG-CAAFLANYDTKSSAKVSFGNGQYELPPWSISIL 262
Query: 319 P 319
P
Sbjct: 263 P 263
>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
Length = 418
Score = 239 bits (610), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 173/300 (57%), Gaps = 53/300 (17%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP + KAK Q++F G D+I+FIK I G+ +C++
Sbjct: 22 MWPDIFKKAK---------------------QFNFEGNYDLIKFIKMI---GIMICMQ-- 55
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------KIENE 96
+E + LPIWL ++ I+FRSDN+P+ +IENE
Sbjct: 56 -HLELVHSLKELPIWLREIPNIIFRSDNQPFMYHMEQFTKMIIKKMRDEKFFPRKQIENE 114
Query: 97 YQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGP 156
+ ++ A+ E G YV W MAV TGVPW+MCKQ +A GPV+N CNG CG+TF GP
Sbjct: 115 HTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNALGPVMNTCNGRYCGDTFSGP 174
Query: 157 NSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGR 216
N + +I + Y+ +G P R+A+DIA VA F +K G+ NYYMY+GGTNFGR
Sbjct: 175 NKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFFSKKGTMANYYMYYGGTNFGR 232
Query: 217 TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEA 276
T+++F+ T YYD+AP+ EYGL REPKWGH ++LH A+KLC + LL GTQ V LG+ E
Sbjct: 233 TSSSFVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKLCQKALLWGTQPVQMLGKDLEV 292
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 66/117 (56%), Gaps = 4/117 (3%)
Query: 594 YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPP-LSSWLRHRQR 652
YH PRA L+P N LV+LEE G GI + T+ +C + H PP + +W R++
Sbjct: 305 YHTPRAILQPKNNFLVVLEEMGGKLDGIEILTVNRDTICS-IAGEHYPPNVETWSRYKGV 363
Query: 653 GDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVE 709
T++ KP C K I+++ FAS+G+P G+C + +G C++ +SQ +VE
Sbjct: 364 IRTNVDT--PKPAANLVCLDNKTITQVDFASYGDPVGNCGHFILGKCNAPNSQKIVE 418
>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
Length = 376
Score = 239 bits (610), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 140/356 (39%), Positives = 196/356 (55%), Gaps = 23/356 (6%)
Query: 403 LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGA 462
L VQS GH LH FVNG+++GSA G+ + FT VHLR G N ALLS+ VGLP+ G
Sbjct: 18 LTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRAGINKIALLSIAVGLPNVGL 77
Query: 463 FLERKVAGVHRVRVQD------KSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLW--SSIR 514
E G+ D K T W +VGL GE + + S G + V W S+
Sbjct: 78 HYESWKTGILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAMDLVSPNGGSSVDWIRGSLA 137
Query: 515 SPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQ 573
+ T+Q L WYK F AP G++P+AL+++SMGKG+ W+NGQSIGRYW+++ + G+ S
Sbjct: 138 TQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSIGRYWMAY--ANGDCSLCS 195
Query: 574 Y-AVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVC 632
Y T YHVPR++LKPT NL+V+ EE G+P IT+ ++ VC
Sbjct: 196 YIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKNLMVMFEELGGDPSKITLVKRSVAGVC 255
Query: 633 GHVTNSHLPPLSSWLRHRQRGDTDIKKFGK---KPTVQPSCPLGKKISKIVFASFGNPDG 689
+ H + ++ D D + K + V C G+ IS I FASFG P G
Sbjct: 256 ADLQEHH--------PNAEKFDIDSHEESKTLHQAQVHLQCVPGQSISSIKFASFGTPTG 307
Query: 690 DCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C + G+CH+++S +VE+ CIG+ C + + + FG DPCP + K L V+A C
Sbjct: 308 TCGSFQQGTCHATNSHAIVEKNCIGRESCLVTVSNSIFGTDPCPNVLKRLSVEAVC 363
>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
gi|217314871|gb|ACK36970.1| lectin [Glycine max]
Length = 447
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 145/415 (34%), Positives = 204/415 (49%), Gaps = 35/415 (8%)
Query: 350 EKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSN--------AQA 401
+ W +E + + + EG+ + ++ KD SDY WY+ R + + S+
Sbjct: 33 KSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSDSDILFWEENDVHP 92
Query: 402 PLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSG 461
L + IL F+NG+ + + + G ND S+ + G
Sbjct: 93 KLTIDGVRDILRVFINGQLIVKDE--------QFKAVISVSIGKNDCTAGSIN----NYG 140
Query: 462 AFLERKVAGVH-RVRVQ-----DKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRS 515
AFLE+ AG+ ++++ D + W YQVGL GE L+ YS N W +
Sbjct: 141 AFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQGEFLKFYSEENENSE-WVELTP 199
Query: 516 PT--RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQ 573
TWYKT F P G DP+AL+ +SMGKG+AWVNGQ IGRYW G
Sbjct: 200 DAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYWTRVSPKSGCQQVCD 259
Query: 574 Y--AVNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRK 630
Y A N+ C K T T YHVPR++LK T NLLV+LEE GNP I+V + R
Sbjct: 260 YRGAYNSDKCSTNCG--KPTQTLYHVPRSWLKATNNLLVILEETGGNPFEISVKLHSSRI 317
Query: 631 VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGD 690
+C V+ S+ PPL + G+ ++ P + C G IS + FASFG P G
Sbjct: 318 ICAQVSESNYPPLQKLVNADLIGE-EVSANNMIPELHLHCQQGHTISSVAFASFGTPGGS 376
Query: 691 CERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C+ ++ G+CH+ S +V AC GK CSI + FG DPCPG+ K L V+A+C
Sbjct: 377 CQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAFGVDPCPGVVKTLSVEARC 431
>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 611
Score = 232 bits (592), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 159/540 (29%), Positives = 266/540 (49%), Gaps = 19/540 (3%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
++ENEY ++ + E G Y W+A++A + GVPW+MC+QDD VIN CNG C +
Sbjct: 27 QVENEYGWVQERYGESGTKYAQWSARLAQSLNVGVPWIMCQQDDIDS-VINTCNGFYCHD 85
Query: 152 TFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+G PN+P+ +TE+W ++Q W R +D+ + V + A+ GS +NYYM+H
Sbjct: 86 WIEGHWARYPNQPAFFTENWPGWFQQWKQSTPHRPVEDVLYAVGNWFARGGSLMNYYMWH 145
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GGTNFGRT++ ++ Y A LDEYG EPK+ H + + ++ S L + S
Sbjct: 146 GGTNFGRTSSPMVVNSYDYDAALDEYGNPSEPKYSHAAKFNNLLQKYSHIFLNAPEIPRS 205
Query: 270 LGQLQEAFVFEET-SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV--AF 326
+ ++ T G +FL+NN E +++ ++ + S+ +L + TV +
Sbjct: 206 EYLGGSSSIYHYTFGGESLSFLINNHESALNDIVWNGQNHIIKPWSVHLLYNNHTVFDSA 265
Query: 327 NTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYF 386
T VS + S + ++ ++ E I D+T + L+Q+S D +DY
Sbjct: 266 ATPEVSKLAMTSKRFSPVNSFNNAYISQWVEEIDMTDSTW--SSKPLEQLSLTHDKTDYL 323
Query: 387 WYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
WY + A+ + + +LHA+++G+Y + ++ F +++ + L G +
Sbjct: 324 WYVTEINLQVRGAE--VFTTNVSDVLHAYIDGKYQSTIWSAN---PFNIKSDIPL--GWH 376
Query: 447 DGALLSVTVGLPDSGAFLERKVAG-VHRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGL 505
+L+ +G+ +E+ G + + V TN W + + GE+L IY+ +
Sbjct: 377 KLQILNSKLGVQHYTVDMEKVTGGLLGNIWVGGTDITNNGWSMKPYVNGERLAIYNPNNI 436
Query: 506 NKVLWSSIRSPTRQLTWYKTTF-RAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT 564
KV WSS + LTWYK F + N +LN+ M KG W+NG+ + RYW++ K
Sbjct: 437 FKVDWSSFSGVQQPLTWYKINFLHELSPNKHYSLNMSGMNKGMIWLNGKHVARYWIT-KG 495
Query: 565 SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVD 624
N Q C N YH+P+ +L NLLV+ EE GNP I ++
Sbjct: 496 WGCNGCSYQGGYTDQLCSTNCGEPSQIN-YHLPQDWLIEGANLLVIFEEVGGNPKSIKLE 554
>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
Length = 283
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 128/285 (44%), Positives = 162/285 (56%), Gaps = 12/285 (4%)
Query: 118 MAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWG 177
MA TGVPW+MC+Q +AP P+IN CN C + PNS NKP +WTE+W+ ++ +G
Sbjct: 1 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQF--TPNSDNKPKMWTENWSGWFLAFG 58
Query: 178 GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYG 236
G R +D+AF VA F + G++ NYYMYHGGTNFGRT I+ YD AP+DEYG
Sbjct: 59 GAVPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYG 118
Query: 237 LVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDER 296
+R+PKWGHLK+LH AIKLC L+ + S G E V+ +T VC+AFL N
Sbjct: 119 DIRQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVY-KTGAVCSAFLANIGMS 177
Query: 297 KAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRS-KTSNLK------FDSD 349
A TV F SY LP S+SILPDCK V NT +V+T S T +LK S
Sbjct: 178 DA-TVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLKEKVDSLDSSS 236
Query: 350 EKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHY 394
W E + GLL+QI+ D SDY WY+ Y
Sbjct: 237 SGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVY 281
>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
Length = 281
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/288 (43%), Positives = 160/288 (55%), Gaps = 39/288 (13%)
Query: 66 EWTYGGLPIWLHDVAGIVFRSDNKPYK-------------------------------IE 94
EW +GG P+WL V GI FR+DN P+K IE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 95 NEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFK 154
NEY +E Y+ WAA+MAV +T VPWVMCKQDDAP PVINACNG C +
Sbjct: 61 NEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYC--DYF 118
Query: 155 GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNF 214
PN P KP++WTE WT ++ + G P + +D A+ + + V + GTNF
Sbjct: 119 SPNKPYKPTMWTEAWTGWFTGFRG-PVLTDCEDC---FAVQVIRRWILVT-TIVPWGTNF 173
Query: 215 GRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQL 273
GRTA I+ YD AP+DEYGL+R+PKWGHL++LH AIK+C L++G V LG
Sbjct: 174 GRTAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNY 233
Query: 274 QEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDC 321
QEA V+ SG CAAFL N + +V F + Y +P SISILPDC
Sbjct: 234 QEAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281
>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
Length = 268
Score = 221 bits (564), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 104/209 (49%), Positives = 132/209 (63%), Gaps = 33/209 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GGLDVI+TYVFWNLHEP KGQYDF GR D+++F+K + GLYV LRIG
Sbjct: 52 MWPDLIQKSKDGGLDVIETYVFWNLHEPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIG 111
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH + GI FR+DN+P+K
Sbjct: 112 PYVCAEWNYGGFPLWLHFIPGIKFRTDNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPII 171
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I+ + G Y+ WAAKMA TGVPWVMC+Q DAP P+IN CNG C
Sbjct: 172 LSQIENEYGNIDSHYGSAGKSYINWAAKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYC 231
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGG 178
+ PNS KP +WTE+W+ ++ +GG
Sbjct: 232 DQF--TPNSNTKPKMWTENWSGWFLSFGG 258
>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
Length = 244
Score = 218 bits (556), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 103/171 (60%), Positives = 117/171 (68%), Gaps = 31/171 (18%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK GGLDVIQTYVFWN+HEP +GQY+F GR D+++FI+EIQ+QGLYV LRIG
Sbjct: 59 MWPKLIAKAKNGGLDVIQTYVFWNVHEPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIG 118
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+E+EW YGG P WLHDV I FRSDN+P+K
Sbjct: 119 PFVEAEWKYGGFPFWLHDVPSITFRSDNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPII 178
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV 140
IENEYQ IEPAF GP YV WAA MAV TGVPW+MCKQ+DAP PV
Sbjct: 179 ISQIENEYQMIEPAFGASGPRYVRWAAAMAVGLQTGVPWMMCKQNDAPDPV 229
>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
Length = 282
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/292 (42%), Positives = 155/292 (53%), Gaps = 46/292 (15%)
Query: 66 EWTYGGLPIWLHDVAGIVFRSDNKPYK-------------------------------IE 94
EW +GG P+WL V GI FR+DN P+K IE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 95 NEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFK 154
NEY +E Y+ WAA+MAV +TGVPWVMCKQDDAP PVINA NG C + F
Sbjct: 61 NEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYC-DYF- 118
Query: 155 GPNSPNKPSIW----TEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
SPN + DW +R+ + + +I + NYYMYHG
Sbjct: 119 ---SPNSLKTFFGGLKLDWLVPVSGSSSSQTVRTGFCVQVYTEGWIFR-----NYYMYHG 170
Query: 211 GTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
GTNFGRTA I+ YD AP+DEY L+R+PKWGHL++LH AIK+C L++G V
Sbjct: 171 GTNFGRTAGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTK 230
Query: 270 LGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDC 321
LG QEA V+ SG CAAFL N + +V F + Y +P SISILPDC
Sbjct: 231 LGNYQEAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282
>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
max]
Length = 482
Score = 216 bits (551), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 113/287 (39%), Positives = 152/287 (52%), Gaps = 39/287 (13%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP++ + K GGLD I++Y+FW+ HEP + +YD SG D I F+K IQ LY LRIG
Sbjct: 39 LWPAIFKRXKYGGLDAIESYIFWDRHEPVRREYDCSGNLDFIDFLKLIQEAELYFILRIG 98
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ W +GG +WLH++ I R DN K
Sbjct: 99 PYVCEXWNFGGFSLWLHNMPEIELRIDNPIXKNEMQIFTTKIVNMAKEAKLFAPXGGPII 158
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY I + E PY+ W A+MA+ + GVPW+MC DAP P+IN CNG C
Sbjct: 159 LTPIENEYGNIMTDYREARKPYIKWCAQMALTQNIGVPWIMCXXRDAPQPMINTCNGHYC 218
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
++F PN+P ++ +Q WG + +SA++ F VA F G NYYMYH
Sbjct: 219 -DSFX-PNNPKSSKMFRX-----FQKWGERVPHKSAEESTFSVARFFQSGGILNNYYMYH 271
Query: 210 GGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
GGTNFG +M Y APLDEYG + +PKW H K+LH +
Sbjct: 272 GGTNFGHMVGGPYMTASYEYDAPLDEYGNLNKPKWEHFKQLHKELTF 318
Score = 67.0 bits (162), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 42/61 (68%)
Query: 666 VQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSR 725
+ PSC +GK IS+I FASFGNP+G+C + G+ ++ SQ VVE ACIG++ C + R
Sbjct: 421 LDPSCQIGKTISQIQFASFGNPEGNCGSFKGGTWEATDSQSVVEVACIGRNSCGFTVTKR 480
Query: 726 Y 726
+
Sbjct: 481 H 481
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 29/39 (74%)
Query: 527 FRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTS 565
F AP G DP+ ++LQ GK +AWVNG+SIG YW S+ T+
Sbjct: 363 FEAPFGIDPMVMDLQDSGKRQAWVNGKSIGCYWSSWITN 401
>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 249
Score = 212 bits (539), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 98/173 (56%), Positives = 118/173 (68%), Gaps = 31/173 (17%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK+GGLDVIQTYVFWN HEP +GQ++F GR D+++FI+EI +QGLYV LRIG
Sbjct: 68 MWPDLIAKAKKGGLDVIQTYVFWNAHEPVQGQFNFEGRYDLVKFIREIHAQGLYVSLRIG 127
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PF+ESEW YGGLP WL + I FRSDN+P+K
Sbjct: 128 PFVESEWKYGGLPFWLRGIPNITFRSDNEPFKRHMQKFVTKIVNLMKDERLFYPQGGPII 187
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY+ +E AFH KG YV WAA MAV+ TGVPW+MCKQDDAP P+++
Sbjct: 188 ISQIENEYKLVEAAFHSKGSSYVHWAAAMAVNLQTGVPWMMCKQDDAPDPIVS 240
>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
Length = 219
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 107/220 (48%), Positives = 131/220 (59%), Gaps = 33/220 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI +AK+GGLDVIQTYVFWN HEP G+Y F D+++FIK +Q GLYV LRIG
Sbjct: 2 MWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIG 61
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW +GG P+WL + GI FR+DN P+K
Sbjct: 62 PYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPII 121
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC 149
IENEY +E G Y WAA+MAV TGVPWVMCKQDDAP PVINACNG C
Sbjct: 122 LSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYC 181
Query: 150 GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIA 189
+ PN KP +WTE WT ++ +GG R A+D+A
Sbjct: 182 --DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219
>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
Length = 1171
Score = 209 bits (533), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 144/487 (29%), Positives = 225/487 (46%), Gaps = 69/487 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W LI AKE G++ I+TYVFWN HE +KG YDFSGR D+ FI+ I GLY LRIGP
Sbjct: 493 WQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGFIRTIAKAGLYALLRIGP 552
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
+I +E +GG P WL D+ GI FR+ N+P+
Sbjct: 553 YICAETHFGGFPHWLRDIDGIEFRTQNEPFQRESSRWVRFLVEKLNSNNCFYSQGGPIVM 612
Query: 92 -KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
+ ENEY+ I + E G Y+ W +++A D VP MCK + V+ N
Sbjct: 613 VQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCK--GSIENVLETINDFYGH 670
Query: 151 ETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
+ + + PN+P+IWTE WT +Y VWG +IR +D+ + V F A+ G +NYYM+
Sbjct: 671 QEMENHHREYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYAVLRFFAQGGKGINYYMF 730
Query: 209 HGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
HGGTN+ + A T Y AP+DEYG + K+ L+ +H ++ L + I
Sbjct: 731 HGGTNYDQLAMYLQTTSYDYDAPIDEYGR-KTKKYFGLQYIHRQLEQHFASLALKLEAPI 789
Query: 269 SLGQLQE---AFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
+ F++EE C F N+ V ++ Y L S+ ++ D +
Sbjct: 790 AHSYEDNYVWIFIWEEQGSNC-IFFCNDHPTSTKQVQWKEQEYCLAPLSVQMVVDHHRLI 848
Query: 326 FNTERVSTQYNKRSKTSN-LKFDSDE-KWEEYREAILNFD----------------NTLL 367
++++ K + ++E W+ Y+E I D NT +
Sbjct: 849 LKSDQLFVDEELIQKELKPISVTTEEWTWQYYKENIPTTDITSSASQSSSISSLSSNTEI 908
Query: 368 RAEGLLDQISAAKDASDYFWYTFRFH------YNSSNAQ----APLDVQSHGHILHAFVN 417
+ ++ + A+DY WY + + S +A +D+++ ++ +VN
Sbjct: 909 ETQVPVEMLRYTGTATDYAWYIAHYQIDPQIEWTSDDALEWVGGQVDLEAADYV-QVYVN 967
Query: 418 GEYTGSA 424
G Y S+
Sbjct: 968 GVYKTSS 974
>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
Length = 480
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/331 (36%), Positives = 170/331 (51%), Gaps = 23/331 (6%)
Query: 422 GSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD--- 478
G+ +GS D+ T V L G+N + LS+ VGLP+ G E AG+ D
Sbjct: 165 GTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLN 224
Query: 479 ---KSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDP 535
+ T W YQVGL GE ++S G + V W + + F AP G++P
Sbjct: 225 EGRRDLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNASNMAF----FNAPDGDEP 280
Query: 536 IALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY-AVNTVTSIHFCAIIKATNTY 594
+AL++ SMGKG+ W+NGQ IGRYW +K S GN Y T + Y
Sbjct: 281 LALDMSSMGKGQIWINGQGIGRYWPGYKAS-GNCGTCDYRGEYDETKCQTNCGDSSQRWY 339
Query: 595 HVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGD 654
HVPR++L PTGNLLV+ EE G+P GI++ +I VC V+ P + +W
Sbjct: 340 HVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNW-------- 390
Query: 655 TDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIG 714
K + +K V C G+KI++I FASFG P G C Y G CH+ S + + C+G
Sbjct: 391 -HTKDY-EKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIFWKNCVG 448
Query: 715 KSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+ RC + ++ FGGDPCPG K +V+A C
Sbjct: 449 QERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 479
Score = 122 bits (306), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 88/151 (58%), Gaps = 4/151 (2%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENE+ +E E Y WAA MAV +T VPW+MCK+DDAP P+IN CNG C
Sbjct: 29 QIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYC-- 86
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
+ PN P+KP++WTE WT++Y +G R +D+A+ VA FI K GS+VNYYM+
Sbjct: 87 DWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMFLNL 146
Query: 212 TNFGRTAAAFMITGYYDQAPLDEYGLVREPK 242
F + T + + YG V +PK
Sbjct: 147 RGFTKRRPHCNFTWKCSEGTV--YGSVDDPK 175
>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
Length = 706
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 166/562 (29%), Positives = 258/562 (45%), Gaps = 79/562 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W L+ +AK GL+ I+ YVFWNLHE ++G ++F+G +I RF + GL++ +R GP
Sbjct: 116 WEQLLREAKRDGLNHIEMYVFWNLHEQERGVFNFAGNANITRFYELAAEVGLFLHVRFGP 175
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE----------------------YQT 99
++ +EW GGLP+WL+ + G+ RS N P++ E E
Sbjct: 176 YVCAEWNNGGLPLWLNWIPGMEVRSSNAPWQREMERFIRYMVELSRPFLAKNGGPIIMAQ 235
Query: 100 IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE--TFKGPN 157
IE F P Y+ W + T +PWVMC + A ++ +CN C +
Sbjct: 236 IENEFAWHDPEYIAWCGNLVKQLDTSIPWVMCYANAAENTIL-SCNDDDCVDFAVKHVKE 294
Query: 158 SPNKPSIWTEDWTSFYQVWGGKPY------IRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
P+ P +WTED ++Q W RS +D+A+ VA + A G+ NYYMYHGG
Sbjct: 295 RPSDPLVWTED-EGWFQTWQKDKKNPLPNDQRSPEDVAYAVARWFAVGGAAHNYYMYHGG 353
Query: 212 TNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLG 271
N+GR A+A + T Y D L GL EPK HL++LH A+ C+ LL + V++
Sbjct: 354 NNYGRAASAGVTTMYADGVNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRNDRQVLNPR 413
Query: 272 QLQEAFVFEET---SGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
+L V E+T S AF+ + P + +IL D V +
Sbjct: 414 EL--PLVDEQTVKASSQQRAFVYGPEAE--------------PNQDGAILFDTADVRKSF 457
Query: 329 E-RVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLR----AEGLLDQISAAKDAS 383
R Y K S L + + W E LN +T R A+ ++Q+ D S
Sbjct: 458 PGRQHRTYTPLVKASALAWKA---WSE-----LNVSSTTPRRRVVADQPIEQLRLTADQS 509
Query: 384 DYFWYTFRFH----YNSSNAQAPLDVQS-HGHILHAFVNGEYTGSAHGSH------DNVS 432
DY Y F + + + V S + A V+G G + ++ S
Sbjct: 510 DYLTYETTFTPKQLSDVDDDMWTVKVTSCEASSIIALVDGWLIGERNLAYPGGNCSKEFS 569
Query: 433 FTLRNTVHL-RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVG 491
F L ++ + RQ +D L+SV++G+ G+ + V G R+ +D + W
Sbjct: 570 FHLPASIEVGRQ--HDLKLVSVSLGIYSLGSNHSKGVTGSVRIGHKDLA-RGQRWEMYPS 626
Query: 492 LIGEKLQIYSNLGLNKVLWSSI 513
LIGE+L+IY + ++ V W+ +
Sbjct: 627 LIGEQLEIYRSQWIDAVPWTPV 648
>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
Length = 448
Score = 203 bits (516), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 100/237 (42%), Positives = 134/237 (56%), Gaps = 53/237 (22%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPS+I KA+ GGL+ IQTYVFWN+HEP+ +YDF GR D++ FIK IQ +GLYV LR+G
Sbjct: 72 MWPSIIDKARIGGLNTIQTYVFWNVHEPEHRKYDFKGRFDLVTFIKLIQEKGLYVTLRLG 131
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
PFI++EW +GGLP WL +V + FR+DN+P+K
Sbjct: 132 PFIQAEWNHGGLPYWLREVPEVYFRTDNEPFKEHTERYVRKILGMMKEEKLLASQRRSHH 191
Query: 93 --IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
ENE ++ A+ E G Y+ WAA + G+PWVMCKQ++A +INACNG C
Sbjct: 192 LGTENECNAVQLAYKENGERYIKWAANLVESMKLGIPWVMCKQNNASDNLINACNGRHC- 250
Query: 151 ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
++ G I ++DIAF VA + +KNGS+VNYYM
Sbjct: 251 ----------------------FEFLGILQLIEQSEDIAFSVARYFSKNGSHVNYYM 285
Score = 75.5 bits (184), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 69/124 (55%), Gaps = 7/124 (5%)
Query: 591 TNTYHVPRAFLKP--TGNLLVLLEEENGNPLGITVDTIAIRK--VCGHVTNSHLPPLSSW 646
+ YH+PR+F+K N+LV+LEEE G L +D + + + +C +V + + SW
Sbjct: 287 VDRYHIPRSFMKEEKKKNMLVILEEEPGVKLE-AIDFVLVNRDTICSYVGEDYPVSVKSW 345
Query: 647 LRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQG 706
R R + + K K ++ CP K++ + FASFG+P G C + +G C +S S+
Sbjct: 346 KRERPKIASRSKDMRLKAVMK--CPPEKQMVAVEFASFGDPTGTCGNFTMGKCSASKSKE 403
Query: 707 VVER 710
VVE+
Sbjct: 404 VVEK 407
>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
Length = 317
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 119/295 (40%), Positives = 158/295 (53%), Gaps = 26/295 (8%)
Query: 461 GAFLERKVAGVHRVRVQDKSFTNC-------SWGYQVGLIGEKLQIYSNLGLNKVLWSSI 513
GAFLE+ AG + +V+ F N SW YQVGL GE +IY K W+ +
Sbjct: 28 GAFLEKDGAGF-KGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDL 86
Query: 514 R---SPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPS 570
SP+ TWYKT F AP G +P+AL+L SMGKG+AWVNG IGRYW G
Sbjct: 87 TPDASPST-FTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWTRVAPKDGC-G 144
Query: 571 QTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRK 630
+ Y + TS YH+PR++L+ + NLLVL EE G P I+V + + +
Sbjct: 145 KCDYRGHYHTS-----------KYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQT 193
Query: 631 VCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGD 690
+C V+ SH P L +W K P + C G IS I FAS+G P G
Sbjct: 194 ICAEVSESHYPSLQNWSPSDFIDQNSKNKM--TPEMHLQCDDGHTISSIEFASYGTPQGS 251
Query: 691 CERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C+ ++ G CH+ +S +V +AC GK C I +L+ FGGDPC GI K L V+A+C
Sbjct: 252 CQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFGGDPCRGIVKTLAVEAKC 306
>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
Length = 267
Score = 199 bits (505), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 113/268 (42%), Positives = 150/268 (55%), Gaps = 14/268 (5%)
Query: 207 MYHGGTNFGR-TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHGGTNF R T F+ T Y AP+DEYG++R+ KWGHLK+++ AIKLC L+T
Sbjct: 1 MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTDP 60
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
+ SLGQ EA V+ +T VCAAFL N D + TV F SY LP S+S+LPDCK V
Sbjct: 61 KISSLGQNLEAAVY-KTGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPDCKNVV 119
Query: 326 FNTERVSTQYNKRSKTSNLKFD-------SDEKWEEYREAILNFDNTLLRAEGLLDQISA 378
NT ++ N S SN + S KW E + + +L GLL+QI+
Sbjct: 120 LNTAKI----NSASAISNFVTEDISSLETSSSKWSWINEPVGISKDDILSKTGLLEQINT 175
Query: 379 AKDASDYFWYTFRFHY-NSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
D SDY WY+ + +Q L ++S GH LHAF+NG+ G+ G+ D +
Sbjct: 176 TADRSDYLWYSLSLDLADDPGSQTVLHIESLGHTLHAFINGKLAGNQAGNSDKSKLNVDI 235
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLE 465
+ L G N LLS+TVGL + GAF +
Sbjct: 236 PIALVSGKNKIDLLSLTVGLQNYGAFFD 263
>gi|62529271|gb|AAX84941.1| beta-galactosidase [Prunus persica]
Length = 287
Score = 198 bits (504), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 117/288 (40%), Positives = 156/288 (54%), Gaps = 17/288 (5%)
Query: 221 FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFE 280
FM T Y APLDEYGL REPKWGHL++LH AIK L++ +V SLG QEA VF+
Sbjct: 3 FMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVFK 62
Query: 281 ETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSK 340
SG CAAFL N D + + V F N YELP SISILPDCKT +NT R+ +Q ++
Sbjct: 63 SKSG-CAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCKTAVYNTARLGSQSSQMKM 121
Query: 341 TSNLKFDSDEKWEEYREAILNFDNT-LLRAEGLLDQISAAKDASDYFWYTFRFHYNSSN- 398
T S W+ + E + D + +GL +QI+ +D +DY WY +
Sbjct: 122 T---PVKSALPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEG 178
Query: 399 ----AQAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSV 453
++P L + S GH LH F+NG+ +G+ +G+ +N T V LR G N ALLS+
Sbjct: 179 FIKRGESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGINKLALLSI 238
Query: 454 TVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGE 495
+VGLP+ G E AGV + + W Y+ GL GE
Sbjct: 239 SVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKTGLKGE 286
>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
Length = 601
Score = 196 bits (497), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 159/621 (25%), Positives = 269/621 (43%), Gaps = 78/621 (12%)
Query: 57 LRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-NEYQTI-----EPAFHEKGPP 110
+RIGP++ +EW GG+P+W++ + G+ R++N +K E ++ + F ++G P
Sbjct: 1 MRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTRDFFADRGGP 60
Query: 111 ----------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFK 154
Y+ W + A VPW+MC D + INACNG C +
Sbjct: 61 IIFSQIENELWGGAREYIDWCGEFAESLELNVPWMMCNGDTSE-KTINACNGNDCSSYLE 119
Query: 155 GPNSP-----NKPSIWTEDWTSFYQVWGGKPY---------IRSAQDIAFHVALFIAKNG 200
++P WTE+ ++Q+ G RSA+D F+V F+ + G
Sbjct: 120 SHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGG 178
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPL 260
SY NYYM+ GG ++G+ A M Y + + L EPK H ++H + + L
Sbjct: 179 SYHNYYMWFGGNHYGKWAGNGMTNWYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVL 238
Query: 261 LTGTQNVISLGQL--QEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISIL 318
L V + L FE G V N + A V++R+I YELP S+ +L
Sbjct: 239 LNDKAQVNNQKHLNCDNCNAFEYRYGDRLVSFVENSKGSADKVIYRDIVYELPAWSMIVL 298
Query: 319 PDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISA 378
+ V F T V R K + E W E + ++ + +Q++
Sbjct: 299 DEYDNVLFETNNVKPVNKHRVYHCEEKLEF-EYWNEPVSTLSQEAPRVVVSPKANEQLNM 357
Query: 379 AKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTGS--AHGSHDNVSFTLR 436
+D +++ +Y + + + + A+V+ + GS H HD T+
Sbjct: 358 TRDLTEFLYYETEVEFPQDECTLSIG-GTDANAFVAYVDDHFVGSDDEHTHHDGW-HTMN 415
Query: 437 NTVHLRQGTNDGALLSVTVGLP---DSG---AFLERKVAGV-HRVRVQDKSFTNCSWGYQ 489
+ +G + LLS ++G+ DS ++ ++ G+ +++ N W +
Sbjct: 416 INMKSGKGKHKLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWIKLCGNDIFNQEWKHY 475
Query: 490 VGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAG---NDPIALNLQSMGKG 546
GL+GE Q++++ G+ V W S L WY++TF+ P G + L + M +G
Sbjct: 476 PGLVGEAKQVFTDEGMKTVTWKSDVENADNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRG 535
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG- 605
+A+ NG +IGRYW+ GN TQ YH+P+ +LK G
Sbjct: 536 QAYANGHNIGRYWM---IKDGNGEYTQ------------------GFYHIPKDWLKGEGE 574
Query: 606 -NLLVLLEEENGNPLGITVDT 625
N+LVL E + +T+ T
Sbjct: 575 ENVLVLGETLGASDPSVTICT 595
>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
Length = 203
Score = 194 bits (492), Expect = 2e-46, Method: Composition-based stats.
Identities = 99/199 (49%), Positives = 114/199 (57%), Gaps = 32/199 (16%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI AKEGGLDVIQTYVFWN HEP G Y F R D ++FIK + GLYV LRIG
Sbjct: 7 MWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLYVHLRIG 66
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P+I EW +GG P+WL V GI FR+DN P+K
Sbjct: 67 PYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIM 126
Query: 93 --IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG 150
IE EY I G Y WAA+MAV TGVPW+MCKQ+DAP P+I+ CNG C
Sbjct: 127 SQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYC- 185
Query: 151 ETFKGPNSPNKPSIWTEDW 169
E F PN+ KP +WTE W
Sbjct: 186 ENFM-PNANYKPKMWTEAW 203
>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 275
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 107/270 (39%), Positives = 154/270 (57%), Gaps = 23/270 (8%)
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLWS----SIRSPTRQLTWYKTTFRAPAGNDPIALNLQ 541
W YQVGL GE + + + W +++ P + LTW+KT F AP GN+P+AL+++
Sbjct: 8 WTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKP-QPLTWHKTYFDAPEGNEPLALDME 66
Query: 542 SMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAF 600
MGKG+ WVNG+SIGRYW +F T G+ S Y + + T YHVPRA+
Sbjct: 67 GMGKGQIWVNGESIGRYWTAFAT--GDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAW 124
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
LKP+ NLLV+ EE GNP +++ ++ VC V+ H P + +W I+ +
Sbjct: 125 LKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNW---------QIESY 174
Query: 661 GK-----KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGK 715
GK +P V C G+ I+ I FASFG P G C Y G CH++ S ++ER C+GK
Sbjct: 175 GKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGK 234
Query: 716 SRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
+RC++ + + FG DPCP + K L V+A C
Sbjct: 235 ARCAVTISNSNFGKDPCPNVLKRLTVEAVC 264
>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
Length = 172
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/161 (59%), Positives = 111/161 (68%), Gaps = 2/161 (1%)
Query: 118 MAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWG 177
MA+ TGVPW+MCKQ+DAPGP+I+ CNG C E FK PNS NKP +WTE+WT +Y +G
Sbjct: 1 MALGLSTGVPWIMCKQEDAPGPIIDTCNGYYC-EDFK-PNSINKPKMWTENWTGWYTDFG 58
Query: 178 GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGL 237
G R +DIA+ VA FI K GS VNYYMYHGGTNF RTA FM + Y APLDEYGL
Sbjct: 59 GAVPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTAGEFMASSYDYDAPLDEYGL 118
Query: 238 VREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFV 278
REPK+ HLK LH AIKL LL+ V SLG QE +
Sbjct: 119 PREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEVTI 159
>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
Length = 307
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 113/289 (39%), Positives = 165/289 (57%), Gaps = 19/289 (6%)
Query: 352 WEEYREAILN--FDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS-----NAQAP-L 403
W+ Y EA + D++ A LL+QI +D+SDY WY + + + N Q P L
Sbjct: 17 WQSYNEAPASSGIDDST-TANALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNGQYPVL 75
Query: 404 DVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAF 463
S GH+LH FVNG+++G+A+G +N T N+V LR G N +LLSV VGL + G
Sbjct: 76 TAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSNVGLH 135
Query: 464 LERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPT 517
E GV + + + W Y++GL GE L +++ +G + V W+ S
Sbjct: 136 YETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTKGSSLV 195
Query: 518 RQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYA 575
+ LTWYK TF APAGNDP+AL++ SMGKGE WVNG+SIGR+W ++ ++G+ YA
Sbjct: 196 EKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAY-IARGSCGGCNYA 254
Query: 576 VNTVTSIHFCAIIKATNT-YHVPRAFLKPTGNLLVLLEEENGNPLGITV 623
+ + T YH+PR+++ P GN LV+LEE G+P GI++
Sbjct: 255 GTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISL 303
>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
Length = 752
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 133/497 (26%), Positives = 221/497 (44%), Gaps = 74/497 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + AKE GL+ + YVFWN+HE ++G + F+ DI RF++ GL V LR+GP
Sbjct: 38 WNNTLKLAKECGLNFLDIYVFWNVHEKKRGIFTFTEEADIFRFLQMAHQHGLLVMLRLGP 97
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
+I +E +YGG P WL ++ GI FR+ N P+
Sbjct: 98 YICAETSYGGFPCWLREIPGIQFRTYNDPFMREVKRWLFYITTLLKEKRLFFPQGGPIVL 157
Query: 92 -KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR-- 148
++ENEY + KG Y+ W ++ + VP +MC+ +P V C+ +
Sbjct: 158 VQLENEYDLVSKIQLSKGEQYLNWYNELYRELAFDVPLIMCR--SSPEEVGEFCSCSKEP 215
Query: 149 ----------CGETFKG-----------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQD 187
C ETF P++P +WTE W +Y +W P RS +D
Sbjct: 216 ELSTIASVETCIETFNSFYGHKKIADLRRRKPHQPILWTEFWIGWYDIWTSAPRKRSTED 275
Query: 188 IAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGH-- 245
+ + FIA+ G+ +YYM+HGGT+F A T YY +P+DEYG P +
Sbjct: 276 VIYAALRFIAQGGAGFSYYMFHGGTHFNNLAMYSQTTSYYFDSPIDEYG---RPSFLFYM 332
Query: 246 LKELHAAIKLCSRPLLTGTQ-NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFR 304
LK ++ + S LL+ V+ L AF+++E S + + ND + ++F+
Sbjct: 333 LKRINHILHQFSSHLLSQDHPQVLHLLPQVVAFIWQEHSSQQSLSFLCNDSEQIAYIMFQ 392
Query: 305 NISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDN 364
++ S+++ + + F++ S+ Y+ + + K + E + L+
Sbjct: 393 QSMMKMNPLSVAVFLE-NELLFDS---SSGYDWQIPFRDFKPLERAYFRELKTFQLDIPI 448
Query: 365 TLLRA----EGLLDQISAAKDASDYFWY----TFRFHYNSSNAQAPLDVQSHGHILHAFV 416
L + L D +S +D +DY WY T + L ++H F+
Sbjct: 449 PPLSSSCDFSQLPDMLSVTQDETDYMWYISSATLPVSSKEFTCEKVLLQIEMADLIHLFI 508
Query: 417 NGEYTGSAHGSHDNVSF 433
N +Y GS+ D+ F
Sbjct: 509 NQQYMGSSWIKIDDERF 525
>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
Length = 270
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/263 (39%), Positives = 154/263 (58%), Gaps = 9/263 (3%)
Query: 486 WGYQVGLIGEKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSM 543
W Y+VGL GE L ++S G + V W+ + + + LTWYKTTF APAG+ P+A+++ SM
Sbjct: 13 WTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSM 72
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAFLK 602
GKG+ W+NGQS+GR+W ++K + G+ S+ Y +A+ YHVPR++LK
Sbjct: 73 GKGQIWINGQSLGRHWPAYK-AVGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLK 131
Query: 603 PTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGK 662
P+GNLLV+ EE G+P GIT+ + VC + S+ + ++ + K
Sbjct: 132 PSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQ----STLVNYQLHASGKVNK-PL 186
Query: 663 KPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPL 722
P C G+KI+ + FASFG P+G C Y GSCH+ HS + C+G++ CS+ +
Sbjct: 187 HPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTV 246
Query: 723 LSRYFGGDPCPGIHKALLVDAQC 745
FGGDPCP + K L V+A C
Sbjct: 247 APEMFGGDPCPNVMKKLAVEAVC 269
>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
Length = 652
Score = 184 bits (467), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 136/496 (27%), Positives = 223/496 (44%), Gaps = 73/496 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP + AK+ GL+ ++ Y+FWN+HE +KG Y F +I RF++ Q +GL V LR+GP
Sbjct: 37 WPQALELAKDCGLNCLEVYIFWNVHEKKKGVYHFEREGNIFRFLQLAQERGLKVILRMGP 96
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
+I +E +YGG P WL ++ GI FR+ N+P+
Sbjct: 97 YICAETSYGGFPYWLREIPGIEFRTYNEPFMKEMKRWLTDINRMLKENKLYHQKGGPIIL 156
Query: 92 -KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD--------DAPGPVIN 142
+IENEY + + G Y+ W ++ + W+ K D IN
Sbjct: 157 VQIENEYDIVSSIYGAAGQKYLHWCYELYKE--GASEWLTSKDSEYFRVASIDKSIETIN 214
Query: 143 ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G R ++ K P++P +WTE W +Y +W G R D+ + A FIA+ GS
Sbjct: 215 DFYGHRRIDSLKALK-PHQPLLWTEFWIGWYNIWRGAQRQRPVDDVIYAAARFIAQGGSG 273
Query: 203 VNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT 262
+NYYM+HGGT+FG A TGY AP+D YG E K+ LK+L+ + LL+
Sbjct: 274 MNYYMFHGGTHFGNLAMYGQTTGYDFDAPVDSYGRPTE-KFERLKQLNHCLSNLEYILLS 332
Query: 263 GTQ-NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDC 321
+ V L + +++ V ND+R V+ + L S+ I +
Sbjct: 333 QDEPEVQKLTPNVNVYRWKDIESGDECSFVCNDQRSQSYVIVAERAVCLKPLSVKIYLNH 392
Query: 322 KTVAFNTERVSTQYNKRS-----------KTSNLKFDSDEKWEEYREAILNFDNTLLRAE 370
+ V F++ + S +++S KT + S EK ++ ++
Sbjct: 393 EEV-FDSSQNSYNVSQKSYHRLDYVCNEWKTMQIPIPSKEKKDK--------EHFEFSFP 443
Query: 371 GLLDQISAAKDASDYFWYT--------FRFHYNSSNAQAPLDVQSHGHILHAFVNGEYTG 422
+ D + +D +DY WYT F+ + +++++ ++ H F+N +Y G
Sbjct: 444 HIPDMLHITQDETDYMWYTGVGTIYCPFKGENTPHCLKIHMELEAADYV-HVFLNRKYVG 502
Query: 423 SAHGSHDNVSFTLRNT 438
S + FT R +
Sbjct: 503 SCRSPCYDERFTGRRS 518
>gi|414888317|tpg|DAA64331.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 284
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/295 (34%), Positives = 150/295 (50%), Gaps = 29/295 (9%)
Query: 457 LPDSGAFLERKVAGVHRVRVQDKSFTNCS-----WGYQVGLIGEKLQIYSNLGLNKVLWS 511
L DSG L +G+ +Q + WG++ L GE +IYS G+ KV W
Sbjct: 6 LQDSGGELAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWK 65
Query: 512 SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQ 571
+ R TWYK F P G+DP+ L++ SM KG +VNG+ +GRYWVS++T G PSQ
Sbjct: 66 PAEN-GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSYRTLAGTPSQ 124
Query: 572 TQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKV 631
YH+PR FLK NLLV+ EEE G P GI V T+ +
Sbjct: 125 A--------------------LYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRDDI 164
Query: 632 CGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDC 691
C ++ + + +W + + ++ T+ CP K I ++VFASFGNP+G C
Sbjct: 165 CLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLM--CPPEKTIQEVVFASFGNPEGMC 222
Query: 692 ERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFGGD-PCPGIHKALLVDAQC 745
+ VG+CH+ +++ +VE+ C+GK C +P+ +G D C L V +C
Sbjct: 223 GNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 277
>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
Length = 296
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/287 (38%), Positives = 157/287 (54%), Gaps = 15/287 (5%)
Query: 352 WEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSS-----NAQAP-LDV 405
W+ Y EA + D +GL++Q+S D SDY WYT + NS+ + Q P L +
Sbjct: 9 WQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTI 68
Query: 406 QSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLE 465
S GH L FVNG+ G+ +G +D+ T V + QG+N ++LS VGLP+ G E
Sbjct: 69 YSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYE 128
Query: 466 RKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQ 519
GV + + ++ W YQ+GL GE L + S G + V W S + +
Sbjct: 129 TWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSA-AGKQP 187
Query: 520 LTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYA-VNT 578
LTW+K F AP+G+ P+AL++ SMGKG+AWVNG+ IGRYW S+K S YA +
Sbjct: 188 LTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYW-SYKASSSGCGGCSYAGTYS 246
Query: 579 VTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDT 625
T + YHVPR++L P+GNLLV+LEE G+ G+ + T
Sbjct: 247 ETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVT 293
>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
Length = 770
Score = 183 bits (465), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 177/660 (26%), Positives = 281/660 (42%), Gaps = 135/660 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQ-----------KGQYDFSGRNDIIRFIKEIQS 50
W ++ + GL+ +Q YVFWN HEP+ + +YDFSGR D++ FI+
Sbjct: 82 WEPMLEEMGRDGLNHVQLYVFWNYHEPRPPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAK 141
Query: 51 QGLYVCLRIGPFIESEWTYGGLPIWLHDVAGIVFRS---------DNKPY---------- 91
+ L+V LRIGP++ +EW +GGLP+WL DV G+ FRS KP+
Sbjct: 142 KDLFVSLRIGPYVCAEWAFGGLPLWLRDVEGMCFRSICGYNGSPGKCKPWEGGKFRSCDP 201
Query: 92 --------------------------------KIENEYQTIEPAFHEKGPPYVLWAAKMA 119
++ENEY A G Y+ W +++
Sbjct: 202 WRKYMADFVMEIGRMVKEANLMAAQGGPVILGQLENEYGHHSDA----GRAYIDWVGELS 257
Query: 120 VDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS---PNKPSIWTEDWTSFYQVW 176
VPWVMC A G +N CNG C + +K + P++P WTE+ ++ W
Sbjct: 258 FGLGLDVPWVMCNGISANG-TLNVCNGDDCADEYKTDHDKRWPDEPLGWTEN-EGWFDTW 315
Query: 177 GGKP--YIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDE 234
GG RSA+++A+ +A ++A GS+ NYYM++GG + + AA + Y D
Sbjct: 316 GGAVGNSKRSAEEMAYVLAKWVAVGGSHHNYYMWYGGNHLAQWGAASLTNAYADGVNFHS 375
Query: 235 YGLVREPKWGHLKELHAAI-KLCSRPLLTGTQNVISLGQLQEAF-VFEETSGVCAAFLVN 292
GL EPK HL+ LH + KL + ++ + QL+ V+E T+G+ AFL
Sbjct: 376 NGLPNEPKRSHLQRLHEVLGKLNGELMQVEDRHSVMPVQLENGVEVYEWTAGL--AFL-- 431
Query: 293 NDERKA-----VTVLFRNISYELP-RKSISILPDCKTVAFNTERVSTQYN-KRSKTSNLK 345
R A V V + +Y + R+ + + P TV F T V R + L
Sbjct: 432 --HRPACSGSPVEVHYAKATYSIACREVLVVDPSSSTVLFATASVEPPPELVRRVVATLT 489
Query: 346 FDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDV 405
D +W +E +L+ T+ E ++ + + +DY Y L++
Sbjct: 490 AD---RWSMRKEELLHGMATVEGREP-VEHLRVSGLDTDYVTYKTTVTATEGVTNVSLEI 545
Query: 406 QSH-GHILHAFVNGEYTGSA---HGSHDNVSFTLRNTVH-LRQG-TNDGALLSVTVGLPD 459
S + H V+ + +A + N +T +H L G T D +LS ++G+ +
Sbjct: 546 DSRISQVFHVSVDNASSLAATVMDVNKGNTEWTAVAQLHNLTAGRTYDLWILSESLGVEN 605
Query: 460 SGAF---------LERKVAGVHRVRVQDKSFTNCSWGYQVGLIGE--------KLQIYSN 502
+ L++ + G +R+ +KS W GL GE +L +
Sbjct: 606 GMLYGAPAATEPSLQKGIFG--DIRLNEKSIRKGRWSMVKGLDGEVDGGQGKAELPCCDS 663
Query: 503 LG----LNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
LG + S+RS + LT + L L G W+NG IGR+
Sbjct: 664 LGPAWFVAGFTLHSVRSKSISLT--------------LPLGLPQQAGGHIWLNGVDIGRW 709
>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 288
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 147/272 (54%), Gaps = 11/272 (4%)
Query: 183 RSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREP 241
R +D+AF VA F + G++ NYYM+HGGTNFGRT I+ YD P+DEYG++R+P
Sbjct: 16 RPVEDLAFAVARFYQRGGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTPIDEYGIIRQP 75
Query: 242 KWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTV 301
KW HLK +H AIKLC + LL + LG EA V+ V AAFL N + A V
Sbjct: 76 KWDHLKNVHKAIKLCEKALLATGPTITYLGPNIEAAVY-NIGAVSAAFLANIAKTDA-KV 133
Query: 302 LFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRS-KTSNLKF------DSDEKWEE 354
F SY LP +S LPDCK+V NT ++++ S T +LK DS W
Sbjct: 134 SFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMISSFTTESLKEEVGSLDDSGSGWSW 193
Query: 355 YREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILHA 414
E I LL+QI+ D SDY WY+ +++ + L ++S GH LHA
Sbjct: 194 ISEPIGISKAHSFSKFWLLEQINTTADRSDYLWYSSSIDLDAAT-ETVLHIESLGHALHA 252
Query: 415 FVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTN 446
FVNG+ GS G+H+ VS + + L G N
Sbjct: 253 FVNGKLAGSGTGNHEKVSVKVDIPITLVYGKN 284
>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
Length = 288
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 107/246 (43%), Positives = 136/246 (55%), Gaps = 13/246 (5%)
Query: 99 TIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS 158
IE + + G Y WAAK A+ GVPWVMC+Q DAP +I+ CN C + FK PNS
Sbjct: 43 AIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTCNAYYC-DGFK-PNS 100
Query: 159 PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTA 218
NKP++WTE+W +Y WG + R +D+AF VA F + GS+ NYYMY G TNFGRTA
Sbjct: 101 HNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQRGGSFQNYYMYFGRTNFGRTA 160
Query: 219 AA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL-TGTQNVISLGQLQEA 276
IT Y A +DEYG +REPKWGHLK+LHAA+KLC L+ T + I LG QE
Sbjct: 161 GGPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPNQEI 220
Query: 277 FV-------FEETSGVCAAFLVNNDER-KAVTVLFRNISYELPRKSISILPDCKTVAFNT 328
F+ G LV D++ K RN+ L K + LP+
Sbjct: 221 GTLSMLRSRFQSLPGAFNTCLVPFDKKQKGRFSSQRNLLRLLQAKEMK-LPNLHNYGMRL 279
Query: 329 ERVSTQ 334
VST+
Sbjct: 280 FAVSTR 285
>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
Length = 777
Score = 175 bits (444), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 179/667 (26%), Positives = 271/667 (40%), Gaps = 117/667 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHE---PQ----KGQYDFSGRNDIIRFIKEIQSQGLY 54
WP + + GL+ ++TYVFW HE P+ + + DFSG D++RF++ + GL
Sbjct: 41 WPQIFRCMRRDGLNTVETYVFWGDHEFEPPEMPDAEPRADFSGPRDLVRFLRCAKLHGLN 100
Query: 55 VCLRIGPFIESEWTYGGLPIWLHDVAG------IVFRSDNKPY----------------- 91
LR+GP++ +E YGG P WL V + FR+ + Y
Sbjct: 101 AILRLGPYVCAEVNYGGFPWWLRQVCEKGSSKPVRFRTWDPAYCAQVERWLKYLVDHVLK 160
Query: 92 ---------------KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMC--KQD 134
+IENEY I ++ G Y+ W A +A GVP VMC
Sbjct: 161 PARVFAPQGGPVILAQIENEYAMIAESYGPDGQQYLDWIASLANQLALGVPLVMCYGASQ 220
Query: 135 DAPGPVINACNGMRCGETF----KGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAF 190
G VI N E + + +P +WTE WT +Y VWG + R A D+A+
Sbjct: 221 RESGRVIETINAFYAHEHVESLRRAQGANPQPLLWTECWTGWYDVWGAPHHRRDAADLAY 280
Query: 191 HVALFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKEL 249
V F+A G+ +NYYMY GGTN+ R ++ YD APL+EY ++ K HL+ L
Sbjct: 281 AVLRFLAAGGAGINYYMYFGGTNWRRENTMYLQATSYDYDAPLNEY-VMETTKSRHLRRL 339
Query: 250 HAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYE 309
H +I+ P L+ V+ + +L E VFE G A L ER V+ + S E
Sbjct: 340 HESIQ----PFLSDRDGVLDMSRL-ELKVFE---GERRAILY---ERSTVSGDADHRSEE 388
Query: 310 LPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRA 369
R +A + R +L++ + R A+ + TL
Sbjct: 389 SVRCVFDSADIRVHLALELREIIVNAASRDTGQDLRWRMLPEPPPLRAALSDTSATLATI 448
Query: 370 EGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHGHILH--AFVNGE------YT 421
L+D A SDY WY R + L+V G + A G+
Sbjct: 449 PDLVD---ATAGTSDYAWYILRCPTAQGSGLLQLEVADFGRVWRRKAVDQGDDAERQPLE 505
Query: 422 GSAHGSHDNVSFTLRNT------------VHLRQGTNDGALLSVTVGL--------PDSG 461
+A G V N V + +L ++G+ P G
Sbjct: 506 WAAAGPEPPVEDRFPNAWNSTEYGYGIVEVGAIDCHEEYVVLVSSLGMVKGDWQLPPGYG 565
Query: 462 AFLERKVAGVHRVRVQ-DKSFTNCSW------GYQVGLIGEKLQ--IYSNLGLNKVLWSS 512
ERK G+ R + D +F + W G+ GL GE+++ I + LW+
Sbjct: 566 MARERK--GLLRASYRSDVTFADDEWRDALVVGFAAGLRGERIRSVIEGDADAYPYLWTP 623
Query: 513 IRSPT--RQLT---WYKTTFRAPAGN----DPIALNLQSMGKGEAWV--NGQSIGRYWVS 561
++ R+ + WY+ + P N + I L+L G + W+ NG+ GR+W
Sbjct: 624 QKAALSGRRFSWPRWYRASLAIPPPNADETEGIILDLYESGVEKGWIYMNGEPCGRHWRV 683
Query: 562 FKTSKGN 568
T N
Sbjct: 684 HGTMPKN 690
>gi|242077941|ref|XP_002443739.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
gi|241940089|gb|EES13234.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
Length = 111
Score = 168 bits (426), Expect = 1e-38, Method: Composition-based stats.
Identities = 73/92 (79%), Positives = 79/92 (85%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAKEGGLDVIQTYVFWN+HEP +GQY+F GR D +RFIKEIQ QGLYV LRIG
Sbjct: 2 MWPKLIAKAKEGGLDVIQTYVFWNVHEPVQGQYNFEGRYDFVRFIKEIQGQGLYVNLRIG 61
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK 92
PFIESEW YGG P WLHDV I FRSDN+P+K
Sbjct: 62 PFIESEWKYGGFPFWLHDVPNITFRSDNEPFK 93
>gi|222616996|gb|EEE53128.1| hypothetical protein OsJ_35926 [Oryza sativa Japonica Group]
Length = 314
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 128/224 (57%), Gaps = 7/224 (3%)
Query: 524 KTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTS 581
+T F P G DP+A++L SMGKG+AWVNG IGRYW G S Y A N
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCSSSCYYPGAYNERKC 142
Query: 582 IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP 641
C + N YH+PR +LK + NLLVL EE G+P I+++ + VC ++ ++ P
Sbjct: 143 QSNCGM-PTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201
Query: 642 PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS 701
PLS+W H G + P ++ C G IS+I FAS+G P G C ++ G+CH+
Sbjct: 202 PLSAW-SHLSSGRASVN--AATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHA 258
Query: 702 SHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
S + +V AC+G ++C+I + + F GDPC G+ K L V+A+C
Sbjct: 259 SSTLDLVTEACVGNTKCAISVSNDVF-GDPCRGVLKDLAVEAKC 301
>gi|125536445|gb|EAY82933.1| hypothetical protein OsI_38150 [Oryza sativa Indica Group]
Length = 314
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 128/224 (57%), Gaps = 7/224 (3%)
Query: 524 KTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTS 581
+T F P G DP+A++L SMGKG+AWVNG IGRYW G S Y A N
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCSSSCYYPGAYNERKC 142
Query: 582 IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP 641
C + N YH+PR +LK + NLLVL EE G+P I+++ + VC ++ ++ P
Sbjct: 143 QSNCGM-PTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKAVCSRISENYYP 201
Query: 642 PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS 701
PLS+W H G + P ++ C G IS+I FAS+G P G C ++ G+CH+
Sbjct: 202 PLSAW-SHLSSGRASVN--AATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHA 258
Query: 702 SHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
S + +V AC+G ++C+I + + F GDPC G+ K L V+A+C
Sbjct: 259 SSTLDLVTEACVGNTKCAISVSNDVF-GDPCRGVLKDLAVEAKC 301
>gi|77554857|gb|ABA97653.1| Galactose binding lectin domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 317
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 128/224 (57%), Gaps = 7/224 (3%)
Query: 524 KTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTS 581
+T F P G DP+A++L SMGKG+AWVNG IGRYW G S Y A N
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCSSSCYYPGAYNERKC 142
Query: 582 IHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLP 641
C + N YH+PR +LK + NLLVL EE G+P I+++ + VC ++ ++ P
Sbjct: 143 QSNCGM-PTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201
Query: 642 PLSSWLRHRQRGDTDIKKFGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHS 701
PLS+W H G + P ++ C G IS+I FAS+G P G C ++ G+CH+
Sbjct: 202 PLSAW-SHLSSGRASVN--AATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHA 258
Query: 702 SHSQGVVERACIGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
S + +V AC+G ++C+I + + F GDPC G+ K L V+A+C
Sbjct: 259 SSTLDLVTEACVGNTKCAISVSNDVF-GDPCRGVLKDLAVEAKC 301
>gi|388518087|gb|AFK47105.1| unknown [Lotus japonicus]
Length = 220
Score = 159 bits (402), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 114/206 (55%), Gaps = 6/206 (2%)
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY--AVNTVTSIHFCAIIKATNT-YHVPRA 599
MGKG+AWVNG IGRYW G Y A N+ C K T T YHVPR+
Sbjct: 1 MGKGQAWVNGHHIGRYWTRVSPKSGCEQVCDYRGAYNSDKCTTNCG--KPTQTLYHVPRS 58
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
+LK + NLLV+ EE GNP I+V + R VC V+ SH PL + G ++
Sbjct: 59 WLKASDNLLVIFEETGGNPFRISVKLHSARIVCAKVSESHYQPLHKLMNADLIGH-EVSA 117
Query: 660 FGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
P + C G+ IS I FAS+GNP+G C+ ++ G+CH+ S +V +AC GK CS
Sbjct: 118 NSMIPELHLRCQDGRIISSITFASYGNPEGSCQSFSRGNCHAPSSMAIVSKACQGKRSCS 177
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
I + FGGDPC G+ K L V+A+C
Sbjct: 178 IKISDTIFGGDPCQGVMKTLSVEARC 203
>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
Length = 172
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 82/174 (47%), Positives = 99/174 (56%), Gaps = 33/174 (18%)
Query: 70 GGLPIWLHDVAGIVFRSDNKPYK-------------------------------IENEYQ 98
GG P+WL V GI FR+DN+P+K IENEY
Sbjct: 1 GGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYG 60
Query: 99 TIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS 158
+ G YV WAA MAV TGVPWVMCK++DAP PVIN CNG C ++F PN
Sbjct: 61 PQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC-DSFS-PNR 118
Query: 159 PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGT 212
P KP+IWTE W+ ++ +GG + R QD+AF VA FI K GS+ NYYMYHGGT
Sbjct: 119 PYKPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQKGGSFFNYYMYHGGT 172
>gi|166092020|gb|ABY82047.1| beta-galactosidase [Hymenaea courbaril var. stilbocarpa]
Length = 138
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 76/137 (55%), Positives = 89/137 (64%), Gaps = 2/137 (1%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
+IENEY +E G Y WAAKMAV +TGVPWVMCKQDDAP PVI+ CNG C E
Sbjct: 1 QIENEYGPVEWEIRAPGKAYTAWAAKMAVGLNTGVPWVMCKQDDAPDPVIDTCNGYYC-E 59
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 211
F PN KP +WTE+W+ +Y +GG R +DIA+ V FI GS+VNYYMYHGG
Sbjct: 60 NFT-PNKNYKPKMWTENWSGWYTEYGGAVPKRPVEDIAYSVTRFIQNGGSFVNYYMYHGG 118
Query: 212 TNFGRTAAAFMITGYYD 228
TNFGRT + I YD
Sbjct: 119 TNFGRTYSGLFIATSYD 135
>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Length = 469
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 156/358 (43%), Gaps = 70/358 (19%)
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
MYHG TNF RTA IT YD APLDE+G + +PK+GHLK+LH + L G
Sbjct: 23 MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLTYGNI 82
Query: 266 NVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTVA 325
+ G L V++ G + F+ N + + + F+ SY++P +SILPDCKT +
Sbjct: 83 STADFGNLVMTTVYQTEEG-SSCFIGNVNAK----INFQGTSYDVPAWYVSILPDCKTES 137
Query: 326 FNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDASDY 385
+NT + + +TS L F N + D SD+
Sbjct: 138 YNTAK-----RMKLRTS-----------------LRFKN-------------VSNDESDF 162
Query: 386 FWYTFRFHYNSSN----AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHL 441
WY + + L + S H+LH FVNG++TG+ + +
Sbjct: 163 LWYMTTVNLKEQDPAWGKNMSLRINSTAHVLHGFVNGQHTGNYRVENGKFHYVFEQDAKF 222
Query: 442 RQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIYS 501
G N LLSVTV LP+ GAF E AG+ G I
Sbjct: 223 NPGVNVITLLSVTVDLPNYGAFFENVPAGI---------------------TGPVFIIGR 261
Query: 502 NLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
N V + S + +L T F+AP G++P+ ++L GKG+A +N GRYW
Sbjct: 262 NGDETVVKYLSTHNGATKL----TIFKAPLGSEPVVVDLLGFGKGKASINENYTGRYW 315
>gi|217075719|gb|ACJ86219.1| unknown [Medicago truncatula]
Length = 200
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 114/206 (55%), Gaps = 10/206 (4%)
Query: 543 MGKGEAWVNGQSIGRYWVSF-KTSKGNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAF 600
MGKGEAWVNGQSIGRYW ++ + G Y S K + T YHVPRA+
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAW 60
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
LKP N VL EE G+P I+ T I VC HVT SH PP+ +W + + +K
Sbjct: 61 LKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTESHPPPVDTWNSNAE----SERKV 116
Query: 661 GKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
G P + CP + IS I FASFG P C Y GSC S+ + +V++ACIG S C+
Sbjct: 117 G--PVLSLECPYPNQAISSIKFASFGTPRRTCGNYNHGSCSSNRALSIVQKACIGSSSCN 174
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
I + F G+PC G+ K+L V+A C
Sbjct: 175 IGVSINTF-GNPCRGVTKSLAVEAAC 199
>gi|217070908|gb|ACJ83814.1| unknown [Medicago truncatula]
Length = 200
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 117/206 (56%), Gaps = 10/206 (4%)
Query: 543 MGKGEAWVNGQSIGRYWVSFKTSK-GNPSQTQYAVNTVTSIHFCAIIKATNT-YHVPRAF 600
MGKGEAWVNGQSIGRYW ++ S G Y +S K + T YHVPR+F
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSF 60
Query: 601 LKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKF 660
LKP GN LVL EE G+P I+ T + VC HV++SH P + W + + G K
Sbjct: 61 LKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGG----KV 116
Query: 661 GKKPTVQPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
G P + SCP + IS I FAS+G P G C + G C S+ + +V++ACIG CS
Sbjct: 117 G--PALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCS 174
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
+ + + F GDPC G+ K+L V+A C
Sbjct: 175 VGVSTDTF-GDPCRGVPKSLAVEATC 199
>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
Length = 208
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 69/150 (46%), Positives = 91/150 (60%), Gaps = 31/150 (20%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI K+K+GG+DVI+TYVFWNLHEP +GQY+F GR D++ F+K + + GLYV LRIG
Sbjct: 56 MWPDLIQKSKDGGIDVIETYVFWNLHEPVRGQYNFEGRGDLVGFVKVVAAAGLYVHLRIG 115
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK---------------------------- 92
P++ +EW YGG P+WLH +AGI FR++N+P+K
Sbjct: 116 PYVCAEWNYGGFPLWLHFIAGIKFRTNNEPFKAEMKRFTAKIVDMMKQENLYASQGGPII 175
Query: 93 ---IENEYQTIEPAFHEKGPPYVLWAAKMA 119
IENEY I+ Y+ WAA MA
Sbjct: 176 LSQIENEYGNIDTHDARAAKSYIDWAASMA 205
>gi|449532986|ref|XP_004173458.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 213
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 115/207 (55%), Gaps = 6/207 (2%)
Query: 423 SAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRV 476
S +GS ++ T V+L+QG N ++LSVTVGLP+ G + AGV +
Sbjct: 1 SVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNE 60
Query: 477 QDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPI 536
+ + W Y+VGL GE L +YS G N V W + LTWYKTTF PAGN+P+
Sbjct: 61 GTRDMSKYKWSYKVGLKGEILNLYSVKGSNSVQWMKGSFQKQPLTWYKTTFNTPAGNEPL 120
Query: 537 ALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHV 596
AL++ SM KG+ WVNG+SIGRY+ + S + T + + YH+
Sbjct: 121 ALDMSSMSKGQIWVNGRSIGRYFPGYIASGKCNKCSYTGFFTEKKCLWNCGGPSQKWYHI 180
Query: 597 PRAFLKPTGNLLVLLEEENGNPLGITV 623
PR +L P GNLL++LEE GNP GI++
Sbjct: 181 PRDWLSPNGNLLIILEEIGGNPQGISL 207
>gi|388493008|gb|AFK34570.1| unknown [Lotus japonicus]
Length = 189
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/209 (36%), Positives = 112/209 (53%), Gaps = 25/209 (11%)
Query: 540 LQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRA 599
+ MGKG WVNG+SIGR+WVSF + G P+Q +Y H+PRA
Sbjct: 1 MTGMGKGMIWVNGRSIGRHWVSFLSPLGLPTQAEY--------------------HIPRA 40
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
+L P NLLV+LEE+ G P I + + VC + S P ++SW+ + +
Sbjct: 41 YLNPKDNLLVILEEDQGTPEKIEIMNVNRDTVCSIIEESDPPNVNSWVSSHGQFRPRVSN 100
Query: 660 FGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
+ ++ SC GKKI + FASFGNP G C + +G C+++ +Q +VE+ C+GK C+
Sbjct: 101 VATQASL--SCGSGKKIVAVEFASFGNPSGSCGKLVLGDCNAAATQQIVEQQCLGKGSCN 158
Query: 720 IPLLSRYF---GGDPCPGIHKALLVDAQC 745
+ L F G D CPG+ K L + +C
Sbjct: 159 VDLNRATFIKNGKDACPGLVKKLAIQVKC 187
>gi|255602598|ref|XP_002537886.1| beta-galactosidase, putative [Ricinus communis]
gi|223514710|gb|EEF24497.1| beta-galactosidase, putative [Ricinus communis]
Length = 91
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 62/70 (88%), Positives = 67/70 (95%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWPSLI KAKEGGLDVIQTYVFWNLHEPQ GQYDFSGR D+++F+KEIQ+QGLYVCLRIG
Sbjct: 18 MWPSLIGKAKEGGLDVIQTYVFWNLHEPQPGQYDFSGRYDLVKFVKEIQAQGLYVCLRIG 77
Query: 61 PFIESEWTYG 70
PFIESEWTYG
Sbjct: 78 PFIESEWTYG 87
>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
Length = 173
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 91/163 (55%), Gaps = 33/163 (20%)
Query: 79 VAGIVFRSDNKPYK-------------------------------IENEYQTIEPAFHEK 107
V GI FR+DN P+K IENEY +E
Sbjct: 11 VPGIAFRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIENEYGPVEWEIGAP 70
Query: 108 GPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTE 167
G Y WAA+MAV +TGVPW+MCKQ+DAP PVI+ CNG C E F+ PN KP +WTE
Sbjct: 71 GKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC-EGFR-PNKNYKPKMWTE 128
Query: 168 DWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
+WT +Y +GG R +D+AF VA FI NGS+VNYYMYHG
Sbjct: 129 NWTGWYTKFGGPAPYRPVEDLAFSVARFIQNNGSFVNYYMYHG 171
>gi|359496328|ref|XP_003635211.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
gi|296080974|emb|CBI18606.3| unnamed protein product [Vitis vinifera]
Length = 198
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 117/206 (56%), Gaps = 13/206 (6%)
Query: 543 MGKGEAWVNGQSIGRYWVSF-KTSKGNPSQTQY--AVNTVTSIHFCAIIKATNTYHVPRA 599
MGKG+AWVNGQSIGRYW ++ S G + Y A + + C A YH+PR
Sbjct: 1 MGKGQAWVNGQSIGRYWPAYLAPSTGCTTNCDYRGAYDASKCLRNCGQ-PAQTLYHIPRT 59
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
++ NLLVL EE G+P I++ T ++VC HV+ + PP SW +++
Sbjct: 60 WVHSGKNLLVLHEELGGDPSKISLLTRTGQEVCAHVSEADPPPADSW-------QPNLEF 112
Query: 660 FGKKPTVQPSCPLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCS 719
+ V+ +C G IS I FASFG P G C + G+CH ++ VV++ACIG+ C+
Sbjct: 113 MSQSSQVRLTCEQGWHISMINFASFGTPRGHCGTFNPGNCH-ANVLSVVQQACIGQEGCA 171
Query: 720 IPLLSRYFGGDPCPGIHKALLVDAQC 745
IP+ + GDPCPG+ K+L ++A C
Sbjct: 172 IPVSTARL-GDPCPGVLKSLAIEALC 196
>gi|125556151|gb|EAZ01757.1| hypothetical protein OsI_23786 [Oryza sativa Indica Group]
Length = 101
Score = 137 bits (346), Expect = 2e-29, Method: Composition-based stats.
Identities = 59/92 (64%), Positives = 69/92 (75%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGLD I+TYVFWN HEP + QY+F G DI+RF KEIQ+ GLY LRIG
Sbjct: 1 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK 92
P+I EW YGGLP WL D+ G+ FR N P++
Sbjct: 61 PYICGEWNYGGLPAWLRDIPGMQFRLHNAPFE 92
>gi|1669595|dbj|BAA13685.1| AR782 [Arabidopsis thaliana]
Length = 206
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 17/210 (8%)
Query: 544 GKGEAWVNGQSIGRYWVSFKTSKGNPSQT-----QYAVNTVTSIHFCAIIKATNTYHVPR 598
GKG AWVNGQSIGRYW + G +++ Y N + C T YHVPR
Sbjct: 5 GKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKC--LKNCGKPSQT-LYHVPR 61
Query: 599 AFLKPTGNLLVLLEEENGNPLGITVDTIAI-RKVCGHVTNSHLPPLSSWLRHRQRGDTDI 657
++LKP+GN+LVL EE G+P I+ T +C V+ SH PP+ +W D+ I
Sbjct: 62 SWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTS-----DSKI 116
Query: 658 KKFGK-KPTVQPSCPLGKK-ISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGK 715
+ +P + CP+ + I I FASFG P G C + G C+SS S +V++ACIG
Sbjct: 117 SNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGL 176
Query: 716 SRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
C++ + +R F G+PC G+ K+L V+A C
Sbjct: 177 RSCNVEVSTRVF-GEPCRGVVKSLAVEASC 205
>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 154
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 64/121 (52%), Positives = 82/121 (67%), Gaps = 8/121 (6%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAK+GGLD+I+TYVFWN HEP +Y F R D++RFIK +Q GLYV LRIG
Sbjct: 32 MWPDLIQKAKDGGLDIIETYVFWNGHEPSPDKYYFEERYDLVRFIKLVQQAGLYVHLRIG 91
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE---YQTI-----EPAFHEKGPPYV 112
P++ +EW YGG P+WL V GI FR+DN P+K + Y+ + E FH +G P +
Sbjct: 92 PYVCAEWNYGGFPLWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPII 151
Query: 113 L 113
L
Sbjct: 152 L 152
>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
Length = 857
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 159/371 (42%), Gaps = 51/371 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++I KA+ GG + I+TY+ WN HE + Q+DFSG D+ F +G+YV +R GP
Sbjct: 33 WAAVIRKARLGGCNAIETYIAWNYHETAEEQWDFSGDKDLAAFFAICHDEGMYVIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP +L++ GI +R N Y +
Sbjct: 93 YICAEWDFGGLPYYLNNTDGIEYRCSNAAYEQAVRRYFERIMPIIRRYQLGSGGSIIMVQ 152
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMC-KQDDAPGPVINACNGMRCGE 151
IENEY AF +K ++ + ++ F VP V C + N +G
Sbjct: 153 IENEYH----AFGKKDLAHIRFLEELTRGFGITVPLVSCYGAGRNTVEMRNFWSGAERAA 208
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYI-RSAQDIAFHVALFIAKNGSYVNYYMYHG 210
+P E W + + WGG+P + A+ + H + + NYYMY G
Sbjct: 209 AVLRERQSGQPLGIMEFWIGWVEHWGGEPQKHKPAEAVLSHCFEALKSGFVFFNYYMYFG 268
Query: 211 GTNF----GRTAAA---FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTG 263
G+NF GRT A FM Y APLDE+G E K+ L LH I L G
Sbjct: 269 GSNFGSWGGRTIGAHKIFMTQSYDYDAPLDEFGFETE-KYRLLAVLHTFIAWLENDLTAG 327
Query: 264 TQNVISLGQLQEAFVFEETSGVCAAFLV--NNDERKAVTVLFRNISYELPRKSISILPDC 321
+ +I E V + C + ER+ V++ N Y+ SI P+
Sbjct: 328 SL-LIQEQAEHELSVTKAEYPSCRVYYYAHTGKERRQVSLTLDNEEYDF-----SIQPEF 381
Query: 322 KTVAFNTERVS 332
T ++++
Sbjct: 382 CTPVITEKKIT 392
>gi|357455525|ref|XP_003598043.1| Beta-galactosidase [Medicago truncatula]
gi|355487091|gb|AES68294.1| Beta-galactosidase [Medicago truncatula]
Length = 309
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/299 (32%), Positives = 140/299 (46%), Gaps = 53/299 (17%)
Query: 351 KWEEYREAILNFDNTLL-----RAEGLLDQISAAKDASDYFWYTFRFHYNSSN--AQAPL 403
KWE E + +TLL A LL+Q + ASDY WY N + +A L
Sbjct: 27 KWEWASEPM---QDTLLGKGTFTASKLLNQKNVTAGASDYLWYMTEVVVNDTKIWGKARL 83
Query: 404 DVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAF 463
V + G IL++++NG + G GS F V L+QG N +LLSVT+G + +
Sbjct: 84 HVDTKGPILYSYINGFWWGVEGGSPSKPGFVYEEDVSLKQGANIISLLSVTLGKSNCSGY 143
Query: 464 LERKVAGV--HRVRVQDKSFTN-------CSWGYQVGLIGEKLQIYSNLGLNKVLWS--- 511
++ K G+ ++ + N +W Y+VG+ G + Y N V W
Sbjct: 144 IDMKETGIVGGPAKLISTEYPNNVLDLSKSTWSYKVGMNGVARKFYDPKSTNVVPWQTRN 203
Query: 512 -SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPS 570
SI P +TWYKTTF+ P G++ + L+L + +G+AWVNGQSIGRYW+ G S
Sbjct: 204 VSIEGP---MTWYKTTFKTPEGSNLVVLDLIGLQRGKAWVNGQSIGRYWI------GENS 254
Query: 571 QTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE--ENGNPLGITVDTIA 627
++ Y VPR FL N LVL EE P ++VD ++
Sbjct: 255 SFRF-------------------YAVPRPFLNKDVNTLVLFEELGLGEGPFNVSVDIVS 294
>gi|223942939|gb|ACN25553.1| unknown [Zea mays]
Length = 199
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 83/207 (40%), Positives = 118/207 (57%), Gaps = 13/207 (6%)
Query: 543 MGKGEAWVNGQSIGRYW---VSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRA 599
MGKGEAWVNGQSIGRYW ++ ++ N + A ++ + C T YHVPR+
Sbjct: 1 MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQT-LYHVPRS 59
Query: 600 FLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKK 659
FL+P N LVL E G+P I+ VC V+ +H + SW + +++
Sbjct: 60 FLQPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQ-----PMQR 114
Query: 660 FGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRC 718
+G P ++ CP G+ IS + FASFG P G C Y+ G C S+ + +V+ ACIG S C
Sbjct: 115 YG--PALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSC 172
Query: 719 SIPLLSRYFGGDPCPGIHKALLVDAQC 745
S+P+ S YF G+PC G+ K+L V+A C
Sbjct: 173 SVPVSSNYF-GNPCTGVTKSLAVEAAC 198
>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
Length = 315
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 55/94 (58%), Positives = 71/94 (75%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP +I K+KEGGLDVI+TYVFWN HEP +G+Y F GR D++RF+K +Q GL V LRIG
Sbjct: 190 VWPEIIRKSKEGGLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIG 249
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE 94
P+ +EW YGG P+WLH + GI FR+ N +K E
Sbjct: 250 PYACAEWNYGGFPVWLHFIPGIQFRTTNDLFKNE 283
>gi|115361550|gb|ABI95864.1| beta-galactosidase [Planococcus sp. L4]
Length = 552
Score = 128 bits (322), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 134/290 (46%), Gaps = 36/290 (12%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TY+ WN HEP+KGQ+ FSG DI FI+ GLYV LR P
Sbjct: 11 WEDRLQKLKALGLNTVETYIPWNFHEPKKGQFHFSGMADIEGFIELAHRLGLYVILRPAP 70
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF----HEKGPPYVLWA 115
+I +EW GGLP WL +V RS + + +E+ + + P F ++ G P +
Sbjct: 71 YICAEWEMGGLPSWLMKDKNLVLRSSDPAFLGHVEDYFAELLPKFTKHLYQNGGPVIAMQ 130
Query: 116 AK----------MAVDF------HTGVPWVMCKQD--------DAPGPVINACNGMRCGE 151
+ +DF H G+ + D P G R E
Sbjct: 131 IENEYGAYGNDSAYLDFFKAQYEHHGLNTFLFTSDGPDFITQGSMPDVTTTLNFGSRVDE 190
Query: 152 TFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+F+ ++ P+ P + E W ++ W G+ +RS D+A + KN S VN+YM+H
Sbjct: 191 SFQALDAFKPDSPKMVAEFWIGWFDYWSGEHTVRSGDDVASVFKEIMEKNIS-VNFYMFH 249
Query: 210 GGTNFGRTAAAFMITGYYDQAPLDEY-GLVREPKWGHLKELHAAIKLCSR 258
GGTNFG A YY +Y L+ E G + E + A+K R
Sbjct: 250 GGTNFGFMNGANHYDIYYPTITSYDYDSLLTEG--GAITEKYKAVKEVLR 297
>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 127 bits (320), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 52/94 (55%), Positives = 72/94 (76%), Gaps = 1/94 (1%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQK-GQYDFSGRNDIIRFIKEIQSQGLYVCLRI 59
MWP L+ AKEGG+DVI+TYVFWN+H+P +Y F GR D+++FI +Q G+Y+ LRI
Sbjct: 51 MWPELVKTAKEGGVDVIETYVFWNVHQPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRI 110
Query: 60 GPFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKI 93
GPF+ +EW +GG+P+WLH V G VFR+DN +K+
Sbjct: 111 GPFVAAEWNFGGIPVWLHYVNGTVFRTDNYNFKV 144
>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
Length = 177
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 55/94 (58%), Positives = 71/94 (75%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+WP +I K+KEGGLDVI+TYVFWN HEP +G+Y F GR D++RF+K +Q GL V LRIG
Sbjct: 55 VWPEIIRKSKEGGLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIG 114
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE 94
P+ +EW YGG P+WLH + GI FR+ N +K E
Sbjct: 115 PYACAEWNYGGFPVWLHFIPGIQFRTTNDLFKNE 148
>gi|255550369|ref|XP_002516235.1| beta-galactosidase, putative [Ricinus communis]
gi|223544721|gb|EEF46237.1| beta-galactosidase, putative [Ricinus communis]
Length = 451
Score = 127 bits (318), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 149/361 (41%), Gaps = 71/361 (19%)
Query: 207 MYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKL---CSRPLLT 262
MYHGGTNF R + MI YD APLDEYG + +PKWGHL++LH I L SR L
Sbjct: 38 MYHGGTNFRRMSGGPMIVTSYDYDAPLDEYGNLNQPKWGHLRDLHVRILLHLSQSRGLGF 97
Query: 263 GTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCK 322
T ++L +G FL N + + +L + I +P
Sbjct: 98 ATVYALNL-----TTYINNATGERFCFLSNTKTNEDANI-------DLQQDGIFFVP--- 142
Query: 323 TVAFNTERVSTQYNKRSKTSNLKFDSDEKWEEYREAILNFDNTLLRAEGLLDQISAAKDA 382
W Y + + +G Q A D
Sbjct: 143 ----------------------------AWIYYYSSRVQ--------QGNFQQCKATSDE 166
Query: 383 SDYFWYTFRFH--YNSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVH 440
+DY Y R+ + S Q + + ++ G++ + L+ H
Sbjct: 167 TDYLRYITRYFDFFTVSVKDVHSRCQQCNNTEEHDLACDFFGTSPACSCQSAARLQQVFH 226
Query: 441 LRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSWGYQVGLIGEKLQIY 500
++ ++T G + G F + G+ ++ W Y++GL GE ++Y
Sbjct: 227 --------SIYNLTSGKQNYGEFFDEGPEGI----AGAADLSSNQWAYKIGLGGEAKRLY 274
Query: 501 S-NLGLNKVLWSSIRSPT-RQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRY 558
N G V +S P R +TWYKTTF P+G DP+ LNLQ MGKG AWVNG S+GR+
Sbjct: 275 DPNSGHRDVFRTSAILPVGRAMTWYKTTFHVPSGTDPLVLNLQGMGKGHAWVNGHSLGRF 334
Query: 559 W 559
W
Sbjct: 335 W 335
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 34/58 (58%)
Query: 671 PLGKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGKSRCSIPLLSRYFG 728
P G+ IS I FASFGNP+G C G ++++ VE+AC+GK CS+ + G
Sbjct: 379 PNGRIISVIQFASFGNPEGTCGSLQKGDFEAAYTAFAVEKACVGKESCSLGVSESTLG 436
>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
Length = 586
Score = 126 bits (316), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 81/248 (32%), Positives = 115/248 (46%), Gaps = 35/248 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WNLHEP++GQ+ F G DI+RFIK + GL+V +R GP
Sbjct: 34 WEDRLLKLKACGFNTVETYVAWNLHEPEEGQFVFEGIADIVRFIKTAEKVGLHVIVRPGP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYV-LW 114
FI +EW +GG P WL V I R N+PY + ++ + P G P + L
Sbjct: 94 FICAEWEFGGFPYWLLTVPNIKLRCFNQPYLEKVDAYFDVLFERLRPLLSSNGGPIIALQ 153
Query: 115 AAKMAVDFHTGVPWVMCKQD--------------DAPGP----------VINACN-GMRC 149
F ++ +D D P P + N G R
Sbjct: 154 IENEYGSFGNDQKYLQYLRDGIKKRVGNELLFTSDGPEPSMLSGGMIEGIFETVNFGSRA 213
Query: 150 GETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
F PN P + E W ++ WG + + RSA+ + + + +NGS VN+YM
Sbjct: 214 ESAFAQLKQYQPNAPLMCMEFWHGWFDHWGEEHHTRSAESVVETLEEILKQNGS-VNFYM 272
Query: 208 YHGGTNFG 215
HGGTNFG
Sbjct: 273 AHGGTNFG 280
>gi|297841097|ref|XP_002888430.1| hypothetical protein ARALYDRAFT_338750 [Arabidopsis lyrata subsp.
lyrata]
gi|297334271|gb|EFH64689.1| hypothetical protein ARALYDRAFT_338750 [Arabidopsis lyrata subsp.
lyrata]
Length = 470
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 131/467 (28%), Positives = 193/467 (41%), Gaps = 107/467 (22%)
Query: 215 GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP--LLTGTQNVISLGQ 272
G F+ TG Q L G ++ + LKEL +K ++ G+ +++
Sbjct: 53 GSLDVVFVDTGSLQQEVLG--GALKSSRISQLKELTRILKAADGDWRIVVGSDPLLAYNL 110
Query: 273 LQEAFVFEETSGVCAAF------------------LVNNDERKAVTVLFRN---ISYELP 311
+EA EE G+ + F + E T F+N + E+
Sbjct: 111 TKEA---EEAKGIASTFDQIMTKYGVVEHCADAKVIYKFLELMLCTWEFKNKVKTAKEIF 167
Query: 312 RKSISILPDCKTVAFNTERVSTQYNKRSK-------TSNLKFDSDEKWEEYREAILNFDN 364
IS D + N R K+ K + LKF E + E +IL+ D+
Sbjct: 168 NLGISRFTDHGILNQNHVRTDELMKKQKKIVKSEKTSKGLKF---EMFSEDIPSILDGDS 224
Query: 365 TLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNG 418
+L G L ++ KD +DY WYT + + L V GH L +VNG
Sbjct: 225 LIL---GELYYLT--KDKTDYAWYTTSIKIEDDDIPDQKGQKTILRVAGLGHTLIVYVNG 279
Query: 419 EYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD 478
EY ++LR N ++L V GLPDSG+++E AG V +
Sbjct: 280 EYA-----------------INLRTRDNCISILGVLTGLPDSGSYMEHTYAGPRGVSIIG 322
Query: 479 -KSFT-----NCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAG 532
KS T N WG+ V Y+ G KV W + LTWYKT F P G
Sbjct: 323 LKSGTRDLIENNEWGHLV---------YTEEGSKKVKWEKY-GEHKPLTWYKTYFETPEG 372
Query: 533 NDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATN 592
+ +A+ ++ MGKG WVNG +GRYW+SF + G P QT+
Sbjct: 373 ENAVAIRMKGMGKGLIWVNGIGVGRYWMSFVSPLGEPIQTE------------------- 413
Query: 593 TYHVPRAFLK--PTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
YH+PR+F+K ++LV+LEEE P+ V T + K+ + N
Sbjct: 414 -YHIPRSFMKEEKKKSMLVILEEE---PVAKMVPTSSPTKMINDLLN 456
>gi|414879450|tpg|DAA56581.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 154
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 52/70 (74%), Positives = 62/70 (88%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LIAKAK+GGLDVIQTYVFWN HEP +GQ++F GR D+++FI+EI +QGLYV LRIG
Sbjct: 68 MWPDLIAKAKKGGLDVIQTYVFWNAHEPVQGQFNFEGRYDLVKFIREIHAQGLYVSLRIG 127
Query: 61 PFIESEWTYG 70
PF+ESEW YG
Sbjct: 128 PFVESEWKYG 137
>gi|62321383|dbj|BAD94714.1| beta-galactosidase [Arabidopsis thaliana]
Length = 199
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 115/195 (58%), Gaps = 13/195 (6%)
Query: 439 VHLRQGTNDGALLSVTVGLPDSGAFLERKVAG------VHRVRVQDKSFTNCSWGYQVGL 492
+ L G N ALLSV VGLP+ G E+ G + V + W Y++G+
Sbjct: 4 IKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGALGPVTLKGVNSGTWDMSKWKWSYKIGV 63
Query: 493 IGEKLQIYSNLGLNKVLWS--SIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWV 550
GE L +++N + V W+ S + + LTWYK+TF PAGN+P+AL++ +MGKG+ W+
Sbjct: 64 KGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWI 123
Query: 551 NGQSIGRYWVSFKTSKGNPSQTQYA--VNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLL 608
NG++IGR+W ++K ++G+ + YA + + C + YHVPR++LK + NL+
Sbjct: 124 NGRNIGRHWPAYK-AQGSCGRCNYAGTFDAKKCLSNCGEA-SQRWYHVPRSWLK-SQNLI 180
Query: 609 VLLEEENGNPLGITV 623
V+ EE G+P GI++
Sbjct: 181 VVFEELGGDPNGISL 195
>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
domestica]
Length = 673
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 140/304 (46%), Gaps = 55/304 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TY+ WNLHEP++G+++FSG D+ F++ GL+V LR GP
Sbjct: 114 WKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSGNLDVEAFVQMAADIGLWVILRPGP 173
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW GGLP WL + + R+ + +
Sbjct: 174 YICSEWDLGGLPSWLLQDSSMELRTTYVGFIKAVDLYFNQLIPRVVPLQYTQGGPIIAVQ 233
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + ++K P Y+ + KMA+ G+ ++ D+ G G+
Sbjct: 234 VENEYGS-----YDKDPNYMPY-IKMAL-LKRGIVELLMTSDNKDGLSGGYVEGVLATIN 286
Query: 153 FKGPNS----------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
K +S NKP++ TE WT ++ WGG +I A D+ V+ I + G+
Sbjct: 287 LKNVDSIIFNYLQSFQDNKPTMVTEFWTGWFDTWGGPHHIVDADDVMVSVSSII-QMGAS 345
Query: 203 VNYYMYHGGTNFGRTAAAFMITGYY-DQAPLDEYGLVRE-----PKWGHLKELHAAIKLC 256
+N YM+HGGTNFG A T Y D D ++ E PK+ L+E + L
Sbjct: 346 LNLYMFHGGTNFGFMNGAQHFTDYQADVTSYDYDAILTEAGDYTPKFFKLREYFST--LI 403
Query: 257 SRPL 260
PL
Sbjct: 404 DNPL 407
>gi|297840773|ref|XP_002888268.1| hypothetical protein ARALYDRAFT_338522 [Arabidopsis lyrata subsp.
lyrata]
gi|297334109|gb|EFH64527.1| hypothetical protein ARALYDRAFT_338522 [Arabidopsis lyrata subsp.
lyrata]
Length = 246
Score = 122 bits (307), Expect = 5e-25, Method: Composition-based stats.
Identities = 89/272 (32%), Positives = 123/272 (45%), Gaps = 64/272 (23%)
Query: 380 KDASDYFWYTFRFHY------NSSNAQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSF 433
KD +DY WYT + + L V GH L +VNGEY
Sbjct: 25 KDKTDYAWYTTSIKIEDDDIPDQKGQKTILRVAGLGHALIVYVNGEYA------------ 72
Query: 434 TLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-KSFT-----NCSWG 487
++LR N ++L V GLPDSG+++E AG V + KS T N WG
Sbjct: 73 -----INLRTRDNCISILGVLTGLPDSGSYMEHTYAGPRGVSIIGLKSGTRDLIENNEWG 127
Query: 488 YQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGE 547
+ V Y+ G KV W + LTWYKT F P G + +A+ ++ MGKG
Sbjct: 128 HLV---------YTEEGSKKVKWEKY-GEHKPLTWYKTYFETPEGENAVAIRMKGMGKGL 177
Query: 548 AWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLK--PTG 605
WVNG +GRYW+SF + G P QT+ YH+PR+F+K
Sbjct: 178 IWVNGIGVGRYWMSFVSPLGEPIQTE--------------------YHIPRSFMKEEKKK 217
Query: 606 NLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
++LV+LEEE P+ V T + K+ + N
Sbjct: 218 SMLVILEEE---PVAKMVPTSSPTKMINDLLN 246
>gi|414880685|tpg|DAA57816.1| TPA: putative RAN GTPase activating family protein [Zea mays]
Length = 598
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 133/292 (45%), Gaps = 52/292 (17%)
Query: 206 YMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGT 264
+ YHGGTNFGRT+ IT YD APLDEYG +R+PK+GHLK+LH I+ + L+ G
Sbjct: 308 FKYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGK 367
Query: 265 QNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSISILPDCKTV 324
N S G+ A V+ D V V ++ +P S+SILPDCKTV
Sbjct: 368 YNDTSYGK--------------NAIFVDRD----VKVTLSGGTHLVPAWSVSILPDCKTV 409
Query: 325 AFNTERVSTQYNKRSKTSNLKFDSDE--KWE---EYREAILNFDNTLLRAEGLLDQISAA 379
A+NT ++ TQ + K +N E +W E + + R LL+QI+ +
Sbjct: 410 AYNTAKIKTQTSVMVKKANSVEKEPEALRWSWMPENLKPFMTDHRDSFRHSQLLEQITTS 469
Query: 380 KDASDYFWYTFRFHYNSSNAQAPLDVQSHGH-----------ILHAFVNGE--------- 419
D SDY WY + + L V + GH L A V+GE
Sbjct: 470 TDQSDYLWYRTSLEHKGEGSYT-LYVNTSGHEMAKLLGRWSVRLPAPVSGEAPLRKELRF 528
Query: 420 -------YTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFL 464
G + + F L++ V L G N +LLS TVGL + +
Sbjct: 529 SPQRHSRTQGQNYSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKSAKTLV 580
>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
Length = 582
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 121/277 (43%), Gaps = 42/277 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W I KA+ GL+ I+TYV WN H P++G +D G D+ RF++++ + GLY +R G
Sbjct: 34 LWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTDGMLDLGRFLEQVAAAGLYAIVRPG 93
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKG------ 108
P+I +EW GGLP WL G+ R + +E + P ++G
Sbjct: 94 PYICAEWDNGGLPAWLFQEPGVGVRRYEPRFLAAVEQYLEQVLDLVRPLQVDQGGPVLLL 153
Query: 109 ------------PPYVLWAAKMAVDFHTGVPWVMCKQDDAP--------GPVINACNGMR 148
P Y+ A M VP V Q G + G R
Sbjct: 154 QVENEYGAFGNDPEYLEAVAGMIRKAGITVPLVTVDQPTGEMLAAGGLDGVLRTGSFGSR 213
Query: 149 CGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
E + P P + E W ++ WGG + S +D A + +A G+ VN Y
Sbjct: 214 SAERLATLREHQPTGPLMCMEFWDGWFDHWGGPHHTTSVEDAARELDALLAA-GASVNIY 272
Query: 207 MYHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
M+HGGTNFG T+ A +T Y APLDE G
Sbjct: 273 MFHGGTNFGLTSGADDKGVFRPTVTSYDYDAPLDEAG 309
>gi|357450861|ref|XP_003595707.1| Beta-galactosidase [Medicago truncatula]
gi|355484755|gb|AES65958.1| Beta-galactosidase [Medicago truncatula]
Length = 308
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/301 (31%), Positives = 137/301 (45%), Gaps = 54/301 (17%)
Query: 351 KWEEYREAILNFDNTLL-----RAEGLLDQISAAKDASDYFWYTFRFHYNSSN--AQAPL 403
KWE E + +TLL A LLDQ + ASDY WY N + ++ L
Sbjct: 27 KWEWASEPM---QDTLLGQGTFTASKLLDQKNVTAGASDYLWYMTEVVVNDTTVWGKSTL 83
Query: 404 DVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAF 463
V + G I+++++NG + G SF + L++GTN +LLSVT+G + F
Sbjct: 84 QVNAKGPIIYSYINGFWWGVYDSVPSTRSFVYDEDISLKRGTNIISLLSVTLGKSNCSGF 143
Query: 464 LERKVAGVHRVRVQDKS---------FTNCSWGYQVGLIGEKLQIYSNLGLNKVLW---- 510
++ K G+ V+ S + +W Y+VG+ G + Y N V W
Sbjct: 144 IDMKETGIVGGHVKLISIEYPDNVLDLSKSTWSYKVGMNGMARKFYDPKS-NGVPWIPRN 202
Query: 511 SSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPS 570
SI P +TWYKTTF+ P G++ + L+L + +G+AWVNGQ IGRY G S
Sbjct: 203 VSIGVP---MTWYKTTFKTPEGSNLVVLDLIGLQRGKAWVNGQCIGRY------RLGENS 253
Query: 571 QTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEE--ENGNPLGITVDTIAI 628
+Y Y VPR F N LVL EE P ++VD I+I
Sbjct: 254 SFRY-------------------YAVPRPFFNKDVNTLVLFEELGLGKGPFNVSVDIISI 294
Query: 629 R 629
Sbjct: 295 E 295
>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
Length = 867
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/301 (30%), Positives = 133/301 (44%), Gaps = 35/301 (11%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ KAK GG + I+TY+ WN HE ++G++DFSG D+ F++ ++GLYV R GP
Sbjct: 33 WDDVLEKAKAGGCNTIETYIPWNFHEMKEGEWDFSGDKDLAHFLQLCANKGLYVIARPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------KIENEYQ----------T 99
+I +EW +GG P WL I +RS + I +EYQ
Sbjct: 93 YICAEWDFGGFPWWLSTKKDIQYRSAQPSFLHYVDQYFDQVISIIDEYQLTKNGSVIMVQ 152
Query: 100 IEPAFHEKGPP---YVLWAAKMAVDFHTGVPWVMC-KQDDAPGPVINACNGMRCGETFKG 155
IE F G P Y+ + + VP+V C D N +G
Sbjct: 153 IENEFQAYGKPDKKYMEYLRDGMIARGIEVPFVTCYGAVDGAVEFRNFWSGANRAAEILD 212
Query: 156 PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG-SYVNYYMYHGGTNF 214
++P E W +++ WGG + + + +NG + +NYYMY GGTNF
Sbjct: 213 ERFADQPKGVMEFWIGWFEHWGGNKANQKTPEQLERECYQLLRNGFTTINYYMYFGGTNF 272
Query: 215 ----GRTAA--AFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNVI 268
GRT + F T Y +DEY L K+ LK H +K PL T +
Sbjct: 273 DHWGGRTVSEQVFCTTTYDYDVAIDEY-LQPTRKYEVLKRYHLFVKWLE-PLFTNAEQAN 330
Query: 269 S 269
S
Sbjct: 331 S 331
>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
Length = 786
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 163/372 (43%), Gaps = 51/372 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE ++G++DF+G ND+ FI+ Q GLYV +R GP
Sbjct: 67 WEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDFTGNNDVAEFIRLAQENGLYVIVRPGP 126
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY-------QTIEPAFHEKGPPYVLW 114
++ +EW GGLP WL I R + PY +E + I EKG P ++
Sbjct: 127 YVCAEWEMGGLPWWLLKKKDIRLREQD-PYFMERYRIFAQKLGEQIGDLTIEKGGPIIMV 185
Query: 115 AAKMAV-DFHTGVPWVMCKQD--------------------------DAPGPVINACNGM 147
+ + P+V +D D +N G
Sbjct: 186 QVENEYGSYGEDKPYVSAIRDIIRDSGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGTGA 245
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
FK G P P + +E W+ ++ WGG+ R ++++ + + K S+ +
Sbjct: 246 NIENEFKKLGELRPESPQMCSEFWSGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SL 304
Query: 206 YMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI---KLC 256
YM HGGT++G A A +T Y AP++E G V PK+ L+E+ A KL
Sbjct: 305 YMTHGGTSWGHWAGANSPGFSPDVTSYDYDAPINEAGQVT-PKYMELREMLAGYSDKKLP 363
Query: 257 SRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDE---RKAVTVLFRNISYELPRK 313
S P NV + + A +FE A+ + E + ++L+R + +P +
Sbjct: 364 SIPKEIPVINVPKIQFTEVAPLFENLPAPHASMDIQTMEALNQGWGSILYRTKTPAVPTQ 423
Query: 314 SISILPDCKTVA 325
S+ + D A
Sbjct: 424 SVLTITDAHDFA 435
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 7/52 (13%)
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY 574
Y+ TF D LNL++ GKG+ +VNG +IGR+W K P QT Y
Sbjct: 541 YRATFNLKKTGDTF-LNLETWGKGQVYVNGHAIGRFW------KIGPQQTLY 585
>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
Length = 638
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 138/302 (45%), Gaps = 57/302 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WNLHEP+KG++DF+G DI +++E + GL+V R GP
Sbjct: 42 WRDRMLKLKACGLNTLETYVCWNLHEPEKGKFDFTGMLDIAAYLREAANLGLWVIFRPGP 101
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYV--- 112
+I +EW YGGLP WL + R+ +PY +E + ++P +++G P +
Sbjct: 102 YICAEWDYGGLPSWLLRDPNMQVRTTYQPYMEAVERFFDALLPIVKPFQYKEGGPIIAMQ 161
Query: 113 --------------LWAAKMAVDFHTGVPWVMCKQDDA----------PGPVINACNGMR 148
L A K A+ G+ ++ D PG ++ A
Sbjct: 162 VENEYGSYARDDKYLTAVKQAIQ-KRGIEELLLTSDGGQIERLERGCIPGVLMTANFNFN 220
Query: 149 CGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF------IAKNG 200
+ PN+P + E W+ ++ WG + HV F I +
Sbjct: 221 PKKQLGALKKLQPNRPQMVMEFWSGWFDHWGRDHH-------KLHVEKFEQLLGDILRFP 273
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY------YD-QAPLDEYGLVREPKWGHLKELHAAI 253
S VN+YM+HGGTNFG A I GY YD APL E G PK+ +EL +
Sbjct: 274 SSVNFYMFHGGTNFGFMNGANYINGYKPDVTSYDYDAPLSEAG-DPTPKYYKTRELLKTL 332
Query: 254 KL 255
+
Sbjct: 333 AM 334
>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
Length = 591
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 130/285 (45%), Gaps = 58/285 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WNLHEP+ GQ+ F G D++RF++ GL+V +R P
Sbjct: 35 WRDRLLKLKACGFNTVETYIPWNLHEPKPGQFRFDGLADVVRFVEIAGEVGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL G+ R ++PY +
Sbjct: 95 YICAEWEFGGLPAWLLADPGMRVRCMHRPYLDRVDAYYDVLLPLLKPLLCTNGGPIIAMQ 154
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY + ++ A ++G +L+ + F M + PG +
Sbjct: 155 IENEYGSYGNDRAYLVYLKDAMLQRGMDVLLFTSDGPEHF-------MLQGGMIPGVLET 207
Query: 143 ACNGMRCGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G R E F+ P+ P + E W ++ WG + + R A+D+A V + + G
Sbjct: 208 VNFGSRAEEAFEMLRKYQPDGPIMCMEYWNGWFDHWGEQHHTRDAKDVA-DVFDDMLRLG 266
Query: 201 SYVNYYMYHGGTNFGRTAAAF---------MITGYYDQAPLDEYG 236
+ VN+YM+HGGTNFG + A IT Y PL+E G
Sbjct: 267 ASVNFYMFHGGTNFGYMSGANCPQRDHYEPTITSYDYDVPLNESG 311
>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
Length = 786
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 164/372 (44%), Gaps = 51/372 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE ++G++DF+G ND+ FI+ Q GLYV +R GP
Sbjct: 67 WEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDFTGNNDVAEFIRLAQENGLYVIVRPGP 126
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY-------QTIEPAFHEKGPPYVLW 114
++ +EW GGLP WL I R + PY +E + I EKG P ++
Sbjct: 127 YVCAEWEMGGLPWWLLKKKDIRLREQD-PYFMERYRIFAKKLGEQIGDLTIEKGGPIIMV 185
Query: 115 AAKMAV-DFHTGVPWVMCKQD--------------------------DAPGPVINACNGM 147
+ + P+V +D D +N G
Sbjct: 186 QVENEYGSYGEDKPYVSGIRDIIRDSGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGTGA 245
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
FK G P P + +E W+ ++ WGG+ R ++++ + + K S+ +
Sbjct: 246 NIENEFKKLGELRPESPQMCSEFWSGWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SL 304
Query: 206 YMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL---HAAIKLC 256
YM HGGT++G A A +T Y AP++E G V PK+ L+E+ ++ KL
Sbjct: 305 YMTHGGTSWGHWAGANSPGFSPDVTSYDYDAPINEAGQVT-PKYMELREMLSGYSDKKLP 363
Query: 257 SRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDE---RKAVTVLFRNISYELPRK 313
S P NV + + A +FE A+ + E + ++L+R + +P +
Sbjct: 364 SIPKEFPVINVPKIQFTEVAPLFENLPAPHASMDIQTMEAFNQGWGSILYRTKTPAVPTQ 423
Query: 314 SISILPDCKTVA 325
SI + D A
Sbjct: 424 SILTITDAHDFA 435
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 7/52 (13%)
Query: 523 YKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY 574
Y+ TF D LNL++ GKG+ +VNG +IGR+W K P QT Y
Sbjct: 541 YRATFNLKKTGDTF-LNLETWGKGQVYVNGHAIGRFW------KIGPQQTLY 585
>gi|223982755|ref|ZP_03632983.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
gi|223965255|gb|EEF69539.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
Length = 592
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 173/410 (42%), Gaps = 69/410 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN HEP+KGQ+DFSGR D+ RF+++ Q+ GL+V LR P
Sbjct: 34 WQDRLEKLKNMGCNCVETYIPWNYHEPKKGQFDFSGRKDVARFVRKAQALGLWVILRPTP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+I +EW +GGLP WL + RS +PY + ++ I P F G P +
Sbjct: 94 YICAEWEFGGLPAWLLADDSMRVRSTYQPYLDAVDAYYAELFKVIRPLFFTHGGPVLMCQ 153
Query: 113 --------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVINACN------------G 146
L A K ++ H G M D V++A G
Sbjct: 154 IENEYGSFGNDKQYLKAIKRLMEKH-GCDVPMFTSDGGWREVLDAGTLLNEGVLPTANFG 212
Query: 147 MRCGE------TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
R E F N + P + E W ++ WG R A++ A + + + G
Sbjct: 213 SRTDEQIGALRQFMNDNDIHGPLMCMEFWIGWFNNWGSPLKTRDAKEAADELDAML-RQG 271
Query: 201 SYVNYYMYHGGTNFG-------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
S VN YM+HGGTN IT Y APL E+G E K+ +E+ A
Sbjct: 272 S-VNIYMFHGGTNPEFYNGCSYHNGMDPQITSYDYAAPLTEWGTEAE-KYAAFREVIAKY 329
Query: 254 KLCSRPLLTGTQNVISLGQLQ---EAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYEL 310
+ L+ S G+L+ + +F S + + D + + L + Y L
Sbjct: 330 NPITPVPLSTPITFKSYGELRCENKVSLFNTLSSLAQP--IETDIPQPMEKLGQGYGYIL 387
Query: 311 PRKSI--------SILPDCK---TVAFNTERVSTQYNKRSKTSNLKFDSD 349
R + + L DC V N + ++TQY K + SN+ D
Sbjct: 388 YRAHVGKARELAKAKLADCDDRAQVFVNQKLIATQY-KETMGSNIPLTLD 436
>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
Length = 615
Score = 119 bits (298), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 132/305 (43%), Gaps = 54/305 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ + TYV WN HE + G+ F G D+ RF++ Q GL V +R GP
Sbjct: 56 WADRLDRLAALGLNTVDTYVPWNFHERRPGEARFDGWRDLARFVRLAQRAGLDVMVRPGP 115
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL G+ R+ ++PY +
Sbjct: 116 YICAEWDNGGLPAWLTGTPGMRLRAGHQPYLDAVARWFDALVPRVAELQAVHGGPVVAVQ 175
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVD------FHT--GVPWVMCKQDDAPGPVINAC 144
IENEY + + YV W VD +T G +M PG + A
Sbjct: 176 IENEYGS-----YGDDHAYVRWVRDALVDRGITELLYTADGPTPLMLDGGTVPGELAAAT 230
Query: 145 NGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G R E S P +P + E W ++ WG K ++RS A V + GS
Sbjct: 231 FGSRAAEAAALLRSRRPGEPFLCAEFWNGWFDHWGEKHHVRSRDGAAQEVEEILDAGGS- 289
Query: 203 VNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
V+ YM HGGTNFG A A +T Y AP+ E+G + PK+ L+E AA+
Sbjct: 290 VSLYMAHGGTNFGLWAGANHDGGVLRPTVTSYDSDAPVSEHGAL-TPKFHALRERFAALA 348
Query: 255 LCSRP 259
+ P
Sbjct: 349 GRTAP 353
>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
Length = 920
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 125/283 (44%), Gaps = 54/283 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KAK GL+ I TYVFWNLHEPQKG+YDFSG NDI F+K Q +GL+V LR P
Sbjct: 371 WRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFSGNNDIAAFVKTAQEEGLWVILRPSP 430
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL ++ G+ RS Y +
Sbjct: 431 YVCAEWEFGGYPYWLQNIKGLEVRSKEPQYLQAYKNYIMQVGKQLAPLQVNHGGNILMVQ 490
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVD------FHTGVPWVMCKQDDAPGPVINACNG 146
+ENEY + Y+ ++ ++ +T P + + PG + + NG
Sbjct: 491 VENEYGA-----YGSDREYLDINRRLFIEAGFDGLLYTCDPEPFLAKGNLPGKLFTSING 545
Query: 147 M----RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+ R + K N P E + +++ WG + + A+ + ++ G
Sbjct: 546 LDKPARIKQLIKQNNEGKGPYFVAEWYPAWFDWWGTQHHKVPAEKYTPGLDSVLSA-GMS 604
Query: 203 VNYYMYHGGTNFGRTAAAF---------MITGYYDQAPLDEYG 236
VN YM+HGGT A I+ Y APLDE G
Sbjct: 605 VNMYMFHGGTTRDFMNGANYNDQNPYEPQISSYDYDAPLDEAG 647
>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 633
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 157/365 (43%), Gaps = 60/365 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + AK GL+ + TY+FWN+HEP+ G YDFSG +D+ F+K Q +GL V LR GP
Sbjct: 73 WRARLQMAKAMGLNTVATYIFWNVHEPKPGVYDFSGNHDVAAFVKMAQEEGLNVILRAGP 132
Query: 62 FIESEWTYGGLPIWLHD--VAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYVL 113
+ +EW +GG P WL G RS+++ Y I+ Q + P G P V
Sbjct: 133 YACAEWEFGGYPSWLMKDPKMGSALRSNDEVYMAPVERWIKRLGQEMVPLLISNGGPIVA 192
Query: 114 ---------------WAAKMAVDFH-TGVPWVMCKQDDAPGPVIN-ACNGMRCGETFKGP 156
+ A M F G D ++N + G+ G F
Sbjct: 193 VQVENEYGDFGGDKKYLAHMLEIFQNAGFKDSFLYTVDPSKALVNGSLEGLPSGVNFGVG 252
Query: 157 NS-----------PNKPSIWTEDWTSFYQVWG----GKPYIRSAQDIAFHVALFIAKNGS 201
N+ P +P +E W ++ WG +P +DIA+ + + S
Sbjct: 253 NAERGLTALAHLRPGQPLFASEYWPGWFDHWGHPHETRPIPPQLKDIAYTL-----DHKS 307
Query: 202 YVNYYMYHGGTNFGRTAAAFM--------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
+N YM+HGGT+FG + A +T Y APLDE G PK+ ++L A
Sbjct: 308 SINIYMFHGGTSFGFMSGASWTGGEYLPDVTSYDYDAPLDEAGH-PTPKFYAYRDLMAKY 366
Query: 254 KLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTV--LFRNISYELP 311
PL+ VI++ + F S + V K +T+ + ++ Y L
Sbjct: 367 VKTPLPLVPAVPEVIAVPE----FTVGRASSLWDHLPVPVKSEKPLTMEAMDQSYGYALY 422
Query: 312 RKSIS 316
RK +S
Sbjct: 423 RKQLS 427
>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
Length = 618
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 141/325 (43%), Gaps = 55/325 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWN HE + G+++FSG D+ +FIK Q GLYV +R GP
Sbjct: 58 WKHRLQMMKSMGLNTVTTYVFWNYHEEEPGKWNFSGEKDLKKFIKTAQEAGLYVIIRPGP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLWA 115
++ +EW +GG P WL + R+DNK + I + I P G P ++
Sbjct: 118 YVCAEWEFGGYPWWLQKDKNLEIRTDNKAFLKQCENYINELAKQIIPLQINNGGPVIMVQ 177
Query: 116 AK---------------------------------MAVDFHTGVPWVMCKQDDAPGPVIN 142
A+ + V F T + K+ G +
Sbjct: 178 AENEFGSYVAQRKDISLEQHKKYSHKIKDFLVKSGITVPFFTSDGSWLFKEGSIEGALPT 237
Query: 143 ACNGMRCGETFKGPNSPNK---PSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIAK 198
A K N N P + E + + W +P+++ S +D+ L+I K
Sbjct: 238 ANGEGDVDNLRKKINEFNNGKGPYMVAEYYPGWLDHW-AEPFVKVSTEDVVKQTELYI-K 295
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
NG NYYM HGGTNFG T+ A +T Y AP++E G V PK+ L+++
Sbjct: 296 NGISFNYYMIHGGTNFGFTSGANYDKNHDIQPDLTSYDYDAPINEAGWVT-PKFNALRDI 354
Query: 250 HAAIKLCSRPLLTGTQNVISLGQLQ 274
I P + VI++ +++
Sbjct: 355 FQKINRQRLPEVPKPMKVITIPEIK 379
>gi|38699452|gb|AAR27062.1| beta-galactosidase 2 [Ficus carica]
Length = 177
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 99/177 (55%), Gaps = 13/177 (7%)
Query: 386 FWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTV 439
WY + + + +Q L V+S GH LHAFVN E GSA G+ + + + +
Sbjct: 1 LWYMTSIYVDENEGFLKNGSQPILLVESKGHALHAFVNQELQGSASGNGTHSPYKFKKPI 60
Query: 440 HLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-----KSFTNCSWGYQVGLIG 494
L+ G N+ ALLS+TVGL ++G+F E AG+ V + + +N +W Y++GL G
Sbjct: 61 SLKAGKNEIALLSMTVGLQNAGSFYEWVGAGLTNVEISGFKNGPVNLSNSTWTYKIGLQG 120
Query: 495 EKLQIYSNLGLNKVLWSSIRSPTRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
E+L IY G+ KV W + +P ++ L WYK P G++P+ L++ MGKG+ W
Sbjct: 121 EQLGIYKEDGVAKVNWIATSNPPKKQPLIWYKAVIDPPLGDEPVGLDMLHMGKGQIW 177
>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
Length = 664
Score = 117 bits (294), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 140/301 (46%), Gaps = 45/301 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEPQ+G++DFSG D+ F+ GL+V LR GP
Sbjct: 115 WRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 174
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-KIENEY------QTIEPAFHEKGP----- 109
+I SE GGLP WL ++ R+ K + + N+Y + + + ++GP
Sbjct: 175 YICSEMDLGGLPSWLLQDPKMILRTTYKGFVEAVNKYFDHLISRVVPLQYRKRGPIIAVQ 234
Query: 110 ------------PYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG---ETFK 154
Y+ + K ++ G+ ++ DDA + G+ TF+
Sbjct: 235 VENEYGSFAEDKDYMPYIQKALLE--RGIVELLMTSDDAKHMLKGYIEGVLATINMNTFQ 292
Query: 155 GPN-------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ NKP + E W ++ WGGK I++A+D+ V+ FI S+ N YM
Sbjct: 293 INDFKQLSQVQRNKPIMVMEFWVGWFDTWGGKHMIKNAEDVEDTVSKFITSEISF-NVYM 351
Query: 208 YHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPL 260
+HGGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 352 FHGGTNFGFMNGATYFGKHRGVVTSYDYDAVLTEAGDYTE-KYFKLRKLFGSVVAVHLPP 410
Query: 261 L 261
L
Sbjct: 411 L 411
>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
Length = 940
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 156/349 (44%), Gaps = 62/349 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ K+KE G + I+TYV WN HE ++GQ+DFSG D+ F+ +GLYV +R GP
Sbjct: 37 WAEVLDKSKEAGCNCIETYVPWNWHEEEEGQWDFSGDKDLGAFLDLCAERGLYVIVRPGP 96
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL + +R ++ + +
Sbjct: 97 YICAEWDMGGLPYWLERKPDMQYRKFHREFLHYVDLYWDRLVPVVLPRLLSNSGTVIMVQ 156
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINAC-------N 145
+ENE+Q A + Y+ + ++ VP V C G V A +
Sbjct: 157 VENEFQ----ALGKPDKAYMEYLRDGLIERGIDVPLVTCY-----GAVDGAVEFRNFWSH 207
Query: 146 GMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGG-KPYIRSAQDIAFHVALFIAKNGSYVN 204
T + ++P E W +++ WGG + ++A + I + + +N
Sbjct: 208 AEEHARTLE-ERFADQPKGVLEFWIGWFEQWGGPRANQKTASQVERKTYELIREGFTAIN 266
Query: 205 YYMYHGGTNF----GRTAA--AFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
YYM+ GGTNF GRT FM T Y A LDEY L K+ LK +H ++
Sbjct: 267 YYMFFGGTNFGHWGGRTIGEHTFMTTSYDYDAALDEY-LRPTAKYKALKLVHDFVRWME- 324
Query: 259 PLL---TGTQNVISLGQLQEAFVFEETSGVCAAFL-VNNDERKAVTVLF 303
PLL TG+ I LG+ A ++ SG L ++ND+ + + +
Sbjct: 325 PLLTETTGSTAFIPLGKHSSA---KKKSGPQGTILFIHNDDTERLNGML 370
>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 781
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 133/294 (45%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I +K G++ I YVFWN HEP++G+YDF+G+ DI F + Q G+YV +R GP
Sbjct: 59 WEHRIKMSKALGMNTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRMAQENGMYVIVRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R + Y +
Sbjct: 119 YVCAEWEMGGLPWWLLKKEDIKLREQDPYYMERVKLFMNEVGKQLADLQISKGGNIIMVQ 178
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY + PY+ M TGVP C +++A + +N
Sbjct: 179 VENEYGSF-----GIDKPYIAAIRDMVKQAGFTGVPLFQCDWNSNFENNALDDLLWTVNF 233
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + F+ PN P + +E W+ ++ WG K RSA+++ + + +N S
Sbjct: 234 GTGANIDQQFERLKELRPNTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLDRNIS 293
Query: 202 YVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT+FG A T Y AP++E G V PK+ +++L
Sbjct: 294 F-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKFLEVRDL 345
>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
51196]
Length = 664
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 134/289 (46%), Gaps = 65/289 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + AK GL+ I TYVFWNLHEP+ G++DFSG D+ +FI++ Q GL V LR GP
Sbjct: 61 WKARLQMAKAMGLNTIATYVFWNLHEPEPGKFDFSGNADLAQFIRDAQQTGLKVLLRAGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGI--VFRSDNKPY---------------------------- 91
+ +EW +GG P WL + RS++ +
Sbjct: 121 YSCAEWEFGGFPAWLMKNPKMQTALRSNDPEFMKPAEQWILRLGREVAPLQVGYGGPIIG 180
Query: 92 -KIENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPG 138
+IENEY + ++ F + G +L+ A + G +P V + APG
Sbjct: 181 VQIENEYGDFGGDAAYLEHLKKIFLKAGFTQSLLYTANPSRALVRGSIPGVYSAVNFAPG 240
Query: 139 PVINACNG---MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF 195
A + +R G+ P + +E WT ++ W G+P+ + +
Sbjct: 241 HAAQALDSLAQLRAGQ----------PLLSSEYWTGWFDHW-GEPHQSKPLSLQVKDFNY 289
Query: 196 IAKNGSYVNYYMYHGGTNFGRTAAA------FM--ITGYYDQAPLDEYG 236
I ++G+ VN YM+HGGT+FG + + F+ +T Y APLDE G
Sbjct: 290 ILRHGAGVNLYMFHGGTSFGMMSGSSWTKHQFLPDVTSYDYGAPLDEAG 338
>gi|432894411|ref|XP_004075980.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oryzias
latipes]
Length = 640
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 125/283 (44%), Gaps = 56/283 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G +DF G D+ ++ S G++V LR GP
Sbjct: 79 WEDRLLKLKACGLNTLTTYVPWNLHEPERGVFDFEGELDLEAYLGLAASLGIWVILRPGP 138
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN------------------KPY-----------K 92
+I +EW GGLP WL + R+ PY +
Sbjct: 139 YICAEWDLGGLPSWLLRDQNMRLRTTYPGFTAAVDSYFDHLIKKVAPYQYSRGGPIIAVQ 198
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + A E+ P++ A G+ ++ D+ G + G
Sbjct: 199 VENEYGSY--AMDEEYMPFIKEAL-----LSRGITELLVTSDNKDGLKLGGVKGALETIN 251
Query: 153 FKGPN----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
F+ + P KP + E W+ ++ +WGG ++ A+++ V I K
Sbjct: 252 FQKLDPEEIKYLEKIQPQKPKMVMEYWSGWFDLWGGLHHVFPAEEM-MAVVTEILKLDMS 310
Query: 203 VNYYMYHGGTNFGRTAAAF---------MITGYYDQAPLDEYG 236
+N YM+HGGTNFG + AF M+T Y APL E G
Sbjct: 311 INLYMFHGGTNFGFMSGAFAVGRPSPAPMVTSYDYDAPLSEAG 353
>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon
queenslandica]
Length = 689
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 129/298 (43%), Gaps = 53/298 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP G++DFSG +I FIK S L V +R GP
Sbjct: 102 WTDRLKKLKAMGLNTVDTYVSWNLHEPMPGEFDFSGLLNIHEFIKIAHSLELNVIVRPGP 161
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW GGLP WL + RS+ KPY +
Sbjct: 162 YICSEWDNGGLPAWLLHDPNMKIRSNYKPYQDAVKRFFTKLFEILTPLQSSYGGPIIAFQ 221
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK-QDDAPGPVINACNGMRCGE 151
+ENEY P + G ++ + A + ++ Q+D A N
Sbjct: 222 VENEYAAYGPR-NATGRHHMQYLANLMRSLGAVELFITSDGQNDIKASSDMAPNNALLTV 280
Query: 152 TFKGPNS----------PNKPSIWTEDWTSFYQVWGGKPYIR--SAQDIAFHVALFIAKN 199
F+ S PNKP + E WT ++ WG + R S + ++ +
Sbjct: 281 NFQNDPSEALNKLLLVQPNKPPLVMEYWTGWFDHWGRRHLERTLSPSQLIVNIGTILQMG 340
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM--------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
GS+ N YM+HGGTNFG A + +T Y APL E G + + K+ L+EL
Sbjct: 341 GSF-NLYMFHGGTNFGFMNGANIEGGEYRPDVTSYDYDAPLSEAGDITK-KYTLLREL 396
>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
Length = 779
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 132/294 (44%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HEP++G+YDF+G+ DI F + Q G+YV +R GP
Sbjct: 59 WEHRIKMCKALGMNTICLYVFWNFHEPEEGRYDFAGQKDIAAFCRLAQENGMYVIVRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R + Y +
Sbjct: 119 YVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVKLFLNEVGKQLADLQISKGGNIIMVQ 178
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY AF PY+ M TGVP C +++A + IN
Sbjct: 179 VENEYG----AFG-IDKPYISEIRDMVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINF 233
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G E FK P+ P + +E W+ ++ WG K RSA+++ + + +N S
Sbjct: 234 GTGANIDEQFKRLKELRPDTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLDRNIS 293
Query: 202 YVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT+FG A T Y AP++E G V PK+ ++ L
Sbjct: 294 F-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYLEVRNL 345
>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
Length = 788
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/299 (29%), Positives = 136/299 (45%), Gaps = 59/299 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE ++G++DF+G ND+ F + Q G+YV +R GP
Sbjct: 63 WEHRIKMCKALGMNTVCLYVFWNIHEQEEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R + PY
Sbjct: 123 YVCAEWEMGGLPWWLLKKKDIRLREQD-PYFMQRVEIFEKEVGKQLAPLTIQNGGPIIMV 181
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINA-------- 143
++ENEY + + K PYV +A + +G V Q D +N
Sbjct: 182 QVENEYGS-----YGKDKPYV--SAIRDIVRKSGFDKVSLFQCDWSSNFLNNGLDDLTWT 234
Query: 144 ---CNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + FK G PN P + +E W+ ++ WG + R A+D+ + ++K
Sbjct: 235 MNFGTGANIDQQFKRLGEVRPNAPKMCSEFWSGWFDKWGARHETRPAKDMVEGMDEMLSK 294
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHA 251
S+ + YM HGGT+FG A A +T Y AP++E+GL PK+ L+++ A
Sbjct: 295 GISF-SLYMTHGGTSFGHWAGANSPGFQPDVTSYDYDAPINEWGLAT-PKFYELQKMMA 351
>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
Length = 574
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 125/278 (44%), Gaps = 46/278 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I AK GL+ I+TYV WN HEP +G++D +G ND+ RF+ I ++GL+ +R GP
Sbjct: 35 WADRIRTAKAMGLNTIETYVAWNAHEPVRGEWDATGWNDLGRFLDLIAAEGLHAIVRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-------NEYQTIEPAFHEKGPPYVLW 114
+I +EW GGLP+WL GI R ++P +E Y+ + P ++G VL
Sbjct: 95 YICAEWHNGGLPVWLTSTPGIGIRR-SEPQFVEAVSEYLRRVYEIVAPRQIDRGGNVVLV 153
Query: 115 A------------------------AKMAVDFHT---GVPWVMCKQDDAPGPVINACNGM 147
A + V T +PW M + P + G
Sbjct: 154 QIENEYGAYGSDKEYLRELVRVTKDAGITVPLTTVDQPMPW-MLEAGSLPELHLTGSFGS 212
Query: 148 RCGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
R E + P P + +E W ++ WG + A + + +A G+ VN
Sbjct: 213 RSAERLATLREHQPTGPLMCSEFWDGWFDWWGSIHHTTDPAASAHDLDVLLAA-GASVNI 271
Query: 206 YMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
YM HGGTNFG T A ++T Y AP+DE G
Sbjct: 272 YMVHGGTNFGTTNGANDKGRFDPIVTSYDYDAPIDESG 309
>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 584
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 132/298 (44%), Gaps = 57/298 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WN HEP++G++ F G D+ +FI GLY +R P
Sbjct: 35 WRDRLLKLKACGFNTVETYVPWNFHEPEEGRFVFEGMADLEKFIALAGELGLYAIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL G+ R KP+ +
Sbjct: 95 YICAEWEFGGLPAWLLKDPGMRLRCSYKPFLDKADAYYDELIPRLTPFLSTKGGPLIAMQ 154
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY + ++ A ++G +L+ + DF M + G
Sbjct: 155 IENEYGSYGNDKTYLNYLKEALVKRGVDVLLFTSDGPEDF-------MLQGGMVEGVWET 207
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G R E F P++P + E W ++ WG + R A D+A + +A G
Sbjct: 208 VNFGSRSAEAFAKLQEYQPDQPLMCMEFWNGWFDHWGETHHTRGAADVALVLDEMLAA-G 266
Query: 201 SYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+ VN+YM+HGGTNFG + A +T Y +PL E G + E K+ ++E+ A
Sbjct: 267 ASVNFYMFHGGTNFGFFSGANYTDRLLPTVTSYDYDSPLSESGELTE-KYYAVREVIA 323
>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
Length = 588
Score = 116 bits (291), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 125/275 (45%), Gaps = 40/275 (14%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + KA+ GL+ ++TYV WNLH+P+ ++ G D+ RF+ ++GL+V LR G
Sbjct: 39 LWRDRLHKARLMGLNTVETYVPWNLHQPRPDEFRMDGGLDLPRFLDLAAAEGLHVLLRPG 98
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFHEK-----GP---- 109
P+I +EW GGLP WL + RS + + +++ ++ + P H++ GP
Sbjct: 99 PYICAEWEGGGLPSWLLADPAMRLRSRDPNFLAAVDDYFRRLLPPLHDRLASRGGPVLAV 158
Query: 110 -------------PYVLWAAKMAVDFHTGVPWVMCKQ------DDAPGPVINACNGMRCG 150
Y+ A VP C Q G + A G R
Sbjct: 159 QVENEYGAYGDDTAYLEHLADSLRRHGVDVPLFTCDQPADLERGALAGVLATANFGSRPA 218
Query: 151 ETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
+ P+ P + TE W ++ WGG +R A+ + + +A G+ VN+YM+
Sbjct: 219 AHLATLRTARPSAPLLCTEFWIGWFDRWGGNHVVRDAEQASQELDELLA-TGASVNFYMF 277
Query: 209 HGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
HGGTNFG A +T Y APLDE G
Sbjct: 278 HGGTNFGFMNGANDKHTYRPTVTSYDYDAPLDEAG 312
>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
Length = 612
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/292 (32%), Positives = 130/292 (44%), Gaps = 49/292 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K K GL+ ++TYV WNLHE +G ++F DI+ FIK Q LYV +R GP
Sbjct: 73 WEDRIVKLKAMGLNTVETYVSWNLHEEIQGDFNFKDGLDIVEFIKTAQKHDLYVIMRPGP 132
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNK-----------------------------PYK 92
+I +EW GGLP WL I RS + ++
Sbjct: 133 YICAEWDLGGLPSWLLHNPNIYLRSLDPIFMKATLRFFDELIPRLIDYQYSNGGPIIAWQ 192
Query: 93 IENEYQTIE--PAFHEKGPPYVLWAAKMAVDFHTGVPWVMC--KQDDAPGPVINACNGMR 148
IENEY + + A+ K ++ + F + W M K+ PG V+ N R
Sbjct: 193 IENEYLSYDNSSAYMRKLQQEMVIRGVKELLFTSDGIWQMQIEKKYSLPG-VLKTVNFQR 251
Query: 149 CGET--FKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
ET KG PN P + TE W+ ++ WG ++ + + A I K S +N
Sbjct: 252 -NETNILKGLRKLQPNMPLMVTEFWSGWFDHWGEDKHVLTVEKAAERTK-NILKMESSIN 309
Query: 205 YYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVREPKWGHLKE 248
YYM HGGTNFG A IT Y AP+ E G + PK+ L+E
Sbjct: 310 YYMLHGGTNFGFMNGANAENGKYKPTITSYDYDAPISESGDI-TPKYRELRE 360
>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 782
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 131/294 (44%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HEP++G+YDF+G+ DI F + Q G+YV +R GP
Sbjct: 59 WEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R + Y +
Sbjct: 119 YVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVKLFMNEVGKQLTDLQISKGGNIIMVQ 178
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY + PY+ + TGVP C +++A + IN
Sbjct: 179 VENEYGSF-----GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINF 233
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P+ P + +E W+ ++ WG K RSA+D+ + + +N S
Sbjct: 234 GTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNIS 293
Query: 202 YVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT+FG A T Y AP++E G V PK+ ++ L
Sbjct: 294 F-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
Length = 786
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 130/301 (43%), Gaps = 67/301 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE Q+G+++F+G ND+ F + Q GLYV +R GP
Sbjct: 61 WEHRIRMCKALGMNTICLYVFWNIHEQQEGKFNFTGNNDVAAFCRLAQKHGLYVIVRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIEN----EYQT---IEPAFHEKGPPYVL- 113
++ +EW GGLP WL I R + PY +E E Q + P +KG P ++
Sbjct: 121 YVCAEWEMGGLPWWLLKKKDIRLR-ERDPYFMERVKVFEQQVGNQLAPLTIDKGGPIIMV 179
Query: 114 -------------------------------------WAAKMAVDFHTGVPWVMCKQDDA 136
WA+ + + W M
Sbjct: 180 QVENEYGSYGVDKEYVSQIRDIVRSSGFDKVALFQCDWASNFEKNGLDDLIWTM------ 233
Query: 137 PGPVINACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
N G E FK G P P + +E W+ ++ WG + R A+++ +
Sbjct: 234 -----NFGTGANIDEQFKRLGELRPQSPKMCSEFWSGWFDKWGARHETRPAKNMVAGIDE 288
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKE 248
+ K S+ + YM HGGT+FG A A +T Y AP++EYGL PK+ L+
Sbjct: 289 MLTKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGLA-TPKYYELRA 346
Query: 249 L 249
+
Sbjct: 347 M 347
Score = 42.7 bits (99), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 65/147 (44%), Gaps = 22/147 (14%)
Query: 505 LNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKT 564
+ +++WS P ++ +Y+ F D LN+++ GKG+ ++NG +IGR+W
Sbjct: 529 MKEIVWSKT-IPQDKIGYYRGYFNLKKVGDTF-LNMEAFGKGQVYINGYAIGRFW----- 581
Query: 565 SKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGNLLVLLEEENGNPLGITVD 624
P QT Y + C + K N V + P GN ++ +++ +D
Sbjct: 582 -NIGPQQTLY-------VPGCWLKKGQNEVIV-LDMVGPKGNPVLFAQDKP------ELD 626
Query: 625 TIAIRKVCGHVTNSHLPPLSSWLRHRQ 651
+ + K H + P L+S H Q
Sbjct: 627 KLNLEKSNKHNNPGNRPDLNSKTPHAQ 653
>gi|373953405|ref|ZP_09613365.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890005|gb|EHQ25902.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 608
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 87/287 (30%), Positives = 126/287 (43%), Gaps = 63/287 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + AK GL+ I TYVFWNLHEPQKG++DF+G ND+ F++ + +GL+V LR P
Sbjct: 58 WRARMKMAKAMGLNTIGTYVFWNLHEPQKGKFDFTGNNDVAEFVRIAKQEGLWVILRPSP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + G+V RS Y +
Sbjct: 118 YVCAEWEFGGYPYWLQNEKGLVVRSKEAQYLKEYESYIKEVGKQLAPLQINHGGNILMVQ 177
Query: 93 IENEYQTI----------EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY + + F E G +L+ A D G PG ++
Sbjct: 178 IENEYGSYGSDKDYLAINQKLFKEAGFDGLLYTCDPAADLVNG---------HLPG-LLP 227
Query: 143 ACNGMRCGETFKGPNSPNK----PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
A NG+ + K S N P E + +++ WG K + A + + +A
Sbjct: 228 AVNGIDNPDKVKQIISQNHNGKGPYYIAEWYPAWFDWWGTKHHTVPAAEYTGRLDSVLAA 287
Query: 199 NGSYVNYYMYHGGTNFGRTAAAF---------MITGYYDQAPLDEYG 236
G +N YM+HGGT G A ++ Y APLDE G
Sbjct: 288 -GISINMYMFHGGTTRGFMNGANYKDTSPYEPQVSSYDYDAPLDEAG 333
>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 131/294 (44%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HEP++G+YDF+G+ DI F + Q G+YV +R GP
Sbjct: 59 WEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R + Y +
Sbjct: 119 YVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVKLFMNEVGKQLADLQISKGGNIIMVQ 178
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY + PY+ + TGVP C +++A + IN
Sbjct: 179 VENEYGSF-----GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINF 233
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P+ P + +E W+ ++ WG K RSA+D+ + + +N S
Sbjct: 234 GTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNIS 293
Query: 202 YVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT+FG A T Y AP++E G V PK+ ++ L
Sbjct: 294 F-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
>gi|332187631|ref|ZP_08389367.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
gi|332012379|gb|EGI54448.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
Length = 613
Score = 116 bits (290), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 134/302 (44%), Gaps = 59/302 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KAK GL+ I TY FWN HEP+ G YDF+G+NDI FI++ Q++GL V LR GP
Sbjct: 61 WRDRLRKAKAMGLNTITTYSFWNAHEPRPGTYDFTGQNDIAAFIRDAQAEGLDVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GG P WL ++ RS + Y +
Sbjct: 121 YVCAEWELGGYPSWLLKDRNLLLRSTDPKYTAAVDRWLARLGQEVKPLLLRNGGPIVAIQ 180
Query: 93 IENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
+ENEY + ++ ++ G VL+ + A D G +P V + G
Sbjct: 181 LENEYGAFGSDKAYLEGLKASYQRAGLADGVLFTSNQAGDLAKGSLPEVPSVVNFGSGGA 240
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
NA + E F+ P+ + E W ++ WG + + A + F+ K G
Sbjct: 241 QNAVAKL---EAFR----PDGLRMVGEYWAGWFDKWGEDHHETDGKKEAEELG-FMLKRG 292
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITG---------YYDQAPLDEYGLVREPKWGHLKELHA 251
V+ YM+HGGT FG A TG Y APLDE G R K+G L + A
Sbjct: 293 YSVSLYMFHGGTTFGWMNGADSHTGTDYHPDTTSYDYNAPLDEAGNPRY-KYGLLASVIA 351
Query: 252 AI 253
+
Sbjct: 352 EV 353
>gi|296081427|emb|CBI16778.3| unnamed protein product [Vitis vinifera]
Length = 242
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 65/124 (52%), Positives = 74/124 (59%), Gaps = 7/124 (5%)
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
IN CN C + PNSPNKP +WTE+W + + +G +DI F VA F K
Sbjct: 120 INTCNSFYCDQF--TPNSPNKPKMWTENWPGWSKTFGALDPHGPREDIVFSVARFFWK-- 175
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYD-QAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
VNYYM HGGTNFGRT+ IT YD AP+DEYGL R PK GHLKEL AIK C
Sbjct: 176 --VNYYMDHGGTNFGRTSGGPFITTTYDYNAPIDEYGLARLPKCGHLKELRRAIKSCEHV 233
Query: 260 LLTG 263
LL G
Sbjct: 234 LLYG 237
>gi|255550379|ref|XP_002516240.1| beta-galactosidase, putative [Ricinus communis]
gi|223544726|gb|EEF46242.1| beta-galactosidase, putative [Ricinus communis]
Length = 216
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 79/130 (60%), Gaps = 8/130 (6%)
Query: 108 GPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTE 167
G Y+ W + MA GVPW++C+Q DAP P+IN C G C + PN+ N P WTE
Sbjct: 57 GKAYLDWCSDMAESLDIGVPWIICQQRDAPQPMINTCYGWYCDQF--TPNTANSPKKWTE 114
Query: 168 DWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAA-FMITGY 226
+WT +++ WG K R+A+ +AF VA F + N YMYHGGTNFGRTA + T
Sbjct: 115 NWTGWFKSWGDKDPHRTAEGVAFAVARFF----QFQNCYMYHGGTNFGRTAGGPYSTTTS 170
Query: 227 YD-QAPLDEY 235
+D APLDE+
Sbjct: 171 HDYDAPLDEH 180
>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 116 bits (290), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 131/294 (44%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HEP++G+YDF+G+ DI F + Q G+YV +R GP
Sbjct: 59 WEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R + Y +
Sbjct: 119 YVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVKLFMNEVGKQLTDLQINKGGNIIMVQ 178
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY + PY+ + TGVP C +++A + IN
Sbjct: 179 VENEYGSF-----GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINF 233
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P+ P + +E W+ ++ WG K RSA+D+ + + +N S
Sbjct: 234 GTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNIS 293
Query: 202 YVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT+FG A T Y AP++E G V PK+ ++ L
Sbjct: 294 F-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
Length = 782
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 131/294 (44%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HEP++G+YDF+G+ DI F + Q G+YV +R GP
Sbjct: 59 WEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDFTGQKDIAAFCRLAQENGMYVIVRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R + Y +
Sbjct: 119 YVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVKLFMNEVGKQLTDLQISKGGNIIMVQ 178
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY + PY+ + TGVP C +++A + IN
Sbjct: 179 VENEYGSF-----GIDKPYIAEIRDIVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINF 233
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P+ P + +E W+ ++ WG K RSA+D+ + + +N S
Sbjct: 234 GTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWGAKHETRSAEDLVKGMKEMLDRNIS 293
Query: 202 YVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT+FG A T Y AP++E G V PK+ ++ L
Sbjct: 294 F-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYFEVRNL 345
>gi|5566254|gb|AAD45349.1| beta-galactosidase [Vitis vinifera]
Length = 181
Score = 115 bits (289), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 70/181 (38%), Positives = 100/181 (55%), Gaps = 15/181 (8%)
Query: 384 DYFWYTFRFHYNSSNA-----QAP-LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRN 437
DY WY R SS + + P L +Q+ GH +H F+NG+ TGSA G+ + FT
Sbjct: 1 DYLWYMTRIDIGSSESFLRGGELPTLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTE 60
Query: 438 TVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV------HRVRVQDKSFTNCSWGYQVG 491
V+L GTN ALLSV VGLP+ G E G+ H + + W Y+VG
Sbjct: 61 KVNLHAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVG 120
Query: 492 LIGEKLQIYSNLGLNKVLW--SSIRSPTRQ-LTWYKTTFRAPAGNDPIALNLQSMGKGEA 548
L GE + + S G++ V W S+ + +Q LTW+K F AP G++P+AL+++ MGKG+
Sbjct: 121 LKGEAMNLVSPNGISSVDWMQGSLAAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQI 180
Query: 549 W 549
W
Sbjct: 181 W 181
>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
Length = 789
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 137/299 (45%), Gaps = 60/299 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE ++G++DFSG +D+ F + Q G+Y+ +R GP
Sbjct: 63 WDHRIKMCKALGMNTICLYVFWNIHEQREGEFDFSGNSDVAAFCRLTQKNGMYIIVRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R ++ PY
Sbjct: 123 YVCAEWEMGGLPWWLLKKKDIRLR-ESDPYFMERVEIFEQKVAEQLAPLTIQNGGPIIMV 181
Query: 92 KIENEY--------------QTIEPAFHEKGPPYVLWAAKMAVDFH-TGVPWVMCKQDDA 136
++ENEY + ++ G L+ A +F G+ ++ +
Sbjct: 182 QVENEYGSYGEDKKYVGQIRDVLRKYWYTNGRGPALFQCDWASNFEKNGLEDLIWTMNFG 241
Query: 137 PGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
G I+A MR GE P+ P + +E W+ ++ WG + R A+D+ + +
Sbjct: 242 TGANIDA-QFMRLGEL-----RPDAPKMCSEFWSGWFDKWGARHETRPAKDMVAGIDEML 295
Query: 197 AKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+K S+ + YM HGGT+FG A A +T Y AP++EYG V PK+ L+++
Sbjct: 296 SKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQV-TPKFWELRKM 352
>gi|326922161|ref|XP_003207320.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Meleagris
gallopavo]
Length = 643
Score = 115 bits (289), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 136/316 (43%), Gaps = 45/316 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD IQTYV WN HE Q G YDFSG D+ F++ GL V LR GP
Sbjct: 49 WKDRLLKMKMAGLDAIQTYVPWNYHETQMGVYDFSGDRDLEYFLQLASETGLLVILRAGP 108
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y E ++P ++ G P ++
Sbjct: 109 YICAEWDMGGLPAWLLEKESIVLRSSDSDYLTAVEKWMGVLLPKMKPHLYQNGGPIIMVQ 168
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPG------------PVINACN 145
+ + A D+ H G V+ D A ++
Sbjct: 169 VENEYGSYFACDYDYLRSLLKIFRQHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAP 228
Query: 146 GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G F S P P + +E +T + WG + + +Q IA + +A+ G+ V
Sbjct: 229 GGNVTAAFLAQRSSEPTGPLVNSEFYTGWLDHWGHRHAVVPSQTIAKTLNEILAR-GANV 287
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
N YM+ GGTNF A M T Y APL E G + E K+ L+E+
Sbjct: 288 NLYMFIGGTNFAYWNGANMPYMSQPTSYDYDAPLSEAGDLTE-KYFALREVIGMYNQLPE 346
Query: 259 PLLTGTQNVISLGQLQ 274
L+ T + + G ++
Sbjct: 347 GLIPPTTSKFAYGNVR 362
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 115 bits (289), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 86/326 (26%), Positives = 131/326 (40%), Gaps = 58/326 (17%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQK-GQYDFSGRNDIIRFIKEIQSQGLYVCLRI 59
MWP L A+A+ GL+ I++Y FWN H + G YD+ D+ F+ L+V R
Sbjct: 1068 MWPKLFAEARANGLNAIESYAFWNKHSATRYGAYDYGFNGDVDLFLSLAAEHDLFVLWRF 1127
Query: 60 GPFIESEWTYGGLP------------IWLHDVAGIVFRSDNKPYKIE------NEYQTIE 101
GP++ +EW GG+P W+HDV G+ R++N + E + + IE
Sbjct: 1128 GPYVCAEWPAGGIPARAPRRAVFASNAWIHDVPGMKTRTNNTAWLNETGRWMRDHFAVIE 1187
Query: 102 PAFHEKGPPYVL------------------WAAKMAVDFHTGVPWVMCKQDDAPGP---- 139
P G + +A + W+MC P
Sbjct: 1188 PHLSRNGASNRIENEYGGSKSDAAAVAYVDALDALADAVAPELVWMMCGFVSLVAPDALH 1247
Query: 140 VINAC---NGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
N C G P P+ +TED +Y WG R D+A+ VA ++
Sbjct: 1248 TGNGCPHDQGPASAHVVVPPAPGADPAWYTED-ELWYDAWGLPSLARPPADVAYGVASYV 1306
Query: 197 AKNGSYVNYYMYHGGTNFGRTAAAFMITG-------------YYDQAPLDEYGLVREPKW 243
A G+ N+YM+HGG ++G + A G Y + APL G EP +
Sbjct: 1307 ATGGAMHNFYMWHGGNHYGNWSTATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLF 1366
Query: 244 GHLKELHAAIKLCSRPLLTGTQNVIS 269
HL +H + + LL T ++
Sbjct: 1367 SHLAAVHGTLDAYAEVLLGATPEALA 1392
>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
Length = 586
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 128/277 (46%), Gaps = 42/277 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W I KA+ GL+ I+TYV WN H PQ+G++ G D+ RF++ ++++G+ +R G
Sbjct: 31 LWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTDGALDLERFLRLVEAEGMLAIVRPG 90
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY-KIENEY-----QTIEPAFHEKGPPYVLW 114
P+I +EW GGLP WL + R D Y + +EY + P ++G P VL
Sbjct: 91 PYICAEWDNGGLPGWLFRDPAVGVRRDEPLYMEAVSEYLGTVLDLVAPFQVDRGGPVVLV 150
Query: 115 AAK----------------MAVDFHTGVPWVMCKQDDAPGPVI--NACNGMRCGETFKGP 156
+ MA+ G+ + D G ++ + +G+ +F
Sbjct: 151 QVENEYGAYGSDHVYLEKLMALTRSHGITVPLTSIDQPSGTMLADGSIDGLHRTGSFGSR 210
Query: 157 NS----------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
++ P P + E W ++ WG + SAQD A + +A G+ VN Y
Sbjct: 211 SAERLATLREHQPTGPLMCAEFWDGWFDHWGAHHHTTSAQDAARELDELLAA-GASVNIY 269
Query: 207 MYHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
M+HGGTNFG T+ A T Y APL E G
Sbjct: 270 MFHGGTNFGFTSGANDKGVYQPTTTSYDYDAPLAEDG 306
>gi|194213013|ref|XP_001503036.2| PREDICTED: LOW QUALITY PROTEIN: galactosidase, beta 1-like 2 [Equus
caballus]
Length = 663
Score = 115 bits (288), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 82/268 (30%), Positives = 120/268 (44%), Gaps = 52/268 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 106 WRDRLLKMKACGLNTLTTYVPWNLHEPERGRFDFSGNLDLEAFVLTAAEIGLWVILRPGP 165
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL +G+ R+ K + +
Sbjct: 166 YICSEIDLGGLPSWLLQDSGMRLRTTYKGFTNAVDLYFDHLMPRVVPLQYKHGGPIIAVQ 225
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G A +G
Sbjct: 226 VENEYGS-----YNKDPTYMPYIKKALED--RGIEELLLTSDNKDGLSSGAVDGVLATIN 278
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 279 LQSQHDLQLLSTFLFTVQGARPKMVMEYWTGWFDSWGGTHNILDSSEVLKTVSAIIDA-G 337
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYD 228
S +N YM+HGGTNFG A YYD
Sbjct: 338 SSINLYMFHGGTNFGFINGAMH---YYD 362
>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
Length = 579
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 126/277 (45%), Gaps = 42/277 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W I KA+ GL+ I+TY WNLHEP +G YDF+G D+ RF++ + G++ +R G
Sbjct: 34 LWADRIEKARLMGLNTIETYTPWNLHEPVEGAYDFTGMLDLERFLRLVADAGMHAIVRPG 93
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVL- 113
P+I +EW GGLP WL+ + R Y + Y + P ++G P VL
Sbjct: 94 PYICAEWDNGGLPAWLYRDPEVGVRRSEPRYLGAVSAYLRRVYDVVTPLQIDRGGPVVLV 153
Query: 114 -------------WAAKMAVDF--HTGVPWVMCKQDDAPGPVINACN----------GMR 148
+ + VD G+ + D +++ + G R
Sbjct: 154 QIENEYGAYGSDKFYLRHLVDLTRECGITVPLTTVDQPTDEMLSQGSLDCLHRTGSFGSR 213
Query: 149 CGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
E + P P + +E W ++ WG + + SA+D A + +A S VN Y
Sbjct: 214 ATERLATLRRHQPTGPLMCSEFWNGWFDHWGDRHHTTSAEDSAAELDALLAAGAS-VNIY 272
Query: 207 MYHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
M+HGGTNFG T+ A IT Y APLDE G
Sbjct: 273 MFHGGTNFGLTSGANDKGVYQPTITSYDYDAPLDEAG 309
>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
Length = 385
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 132/302 (43%), Gaps = 45/302 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ G YDFSG D+ F++ GL V LR GP
Sbjct: 58 WKDRLLKMKMAGLNAIQTYVPWNYHEPQMGVYDFSGDRDLEYFLQLASETGLLVILRAGP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y E ++P + G P ++
Sbjct: 118 YICAEWDMGGLPAWLLEKESIVLRSSDSDYLTAVEKWMGVLLPKMKPHLYHNGGPIIMVQ 177
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPG------------PVINACN 145
+ + A D+ H G V+ D A ++
Sbjct: 178 VENEYGSYFACDYDYLRSLLKIFRQHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAP 237
Query: 146 GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G F S P P + +E +T + WG + + ++ IA + +A+ G+ V
Sbjct: 238 GGNVTAAFLAQRSSEPTGPLVNSEFYTGWLDHWGHRHIVVPSETIAKTLNEILAR-GANV 296
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
N YM+ GGTNF A M T Y APL E G + E K+ L+E+ + + S
Sbjct: 297 NLYMFIGGTNFAYWNGANMPYMSQPTSYDYDAPLSEAGDLTE-KYFALREVIGMVSIPST 355
Query: 259 PL 260
L
Sbjct: 356 CL 357
>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
Length = 651
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 131/299 (43%), Gaps = 56/299 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + K GL+ ++TYV WNLHE G++ F+G DI RF+ + GL V LR GP
Sbjct: 87 WLDRLTRMKAAGLNTVETYVPWNLHEEIHGEFVFTGMLDIRRFVAIAEKVGLLVILRPGP 146
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
FI SEW +GGLP WL + RS +P+ +
Sbjct: 147 FICSEWEFGGLPSWLLRDPQMDVRSTYRPFMDAARSYMRSLISELEDMQYQYGGPIIAMQ 206
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
IENEY + + Y+ + D +GV ++ D+ G G+
Sbjct: 207 IENEYGS-----YSDDVNYMQELKNIMTD--SGVIEILFTSDNKHGLQPGRVPGVFMTTN 259
Query: 153 FKGPNS------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
FK N P KP + E W+ ++ W K + S ++ A V +I + G
Sbjct: 260 FKNTNEGGRMFDKLHELQPGKPLMVMEFWSGWFDHWEEKHHTMSLEEYASAVE-YILQQG 318
Query: 201 SYVNYYMYHGGTNFGRTAAAF------MITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
S +N YM+HGGTNFG A +T Y +PL E G V + K+ ++L A +
Sbjct: 319 SSINLYMFHGGTNFGFLNGANTEPYLPTVTSYDYDSPLSEAGDVTD-KFMMTRQLFAPL 376
>gi|295689222|ref|YP_003592915.1| beta-galactosidase [Caulobacter segnis ATCC 21756]
gi|295431125|gb|ADG10297.1| Beta-galactosidase [Caulobacter segnis ATCC 21756]
Length = 617
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 134/302 (44%), Gaps = 59/302 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KAK GL+ I TY FWN+HEP+ G YDF+G+ND+ FI+ Q++GL V LR GP
Sbjct: 65 WRDRLQKAKTMGLNTITTYAFWNVHEPRPGVYDFTGQNDLAAFIRAAQAEGLDVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ SEW GG P WL ++ RS Y +
Sbjct: 125 YVCSEWELGGYPSWLLKDRNVLLRSTEPQYAAAVERWMARLGREVKPLLLKNGGPIVAIQ 184
Query: 93 IENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
+ENEY + +E + G VL+ + A D G +P + + G
Sbjct: 185 LENEYGAFGDDKAYLEGLEATYRRAGLADGVLFTSNQASDLAKGSLPHLPSMVNFGSGGA 244
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ETF+ P+ + E W ++ WG + + + A + F+ + G
Sbjct: 245 EKSVAQL---ETFR----PDGLRMVGEYWAGWFDKWGEEHHETDGRKEAEELR-FMLQRG 296
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITG---------YYDQAPLDEYGLVREPKWGHLKELHA 251
V+ YM+HGGT+FG A TG Y APLDE G R K+G L + A
Sbjct: 297 YSVSLYMFHGGTSFGWMNGADSHTGKDYHPDTTSYDYDAPLDEAGAPRY-KYGLLASVIA 355
Query: 252 AI 253
+
Sbjct: 356 EV 357
>gi|373953412|ref|ZP_09613372.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890012|gb|EHQ25909.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 610
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 124/286 (43%), Gaps = 61/286 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + AK GL+ I TYVFWNLHEPQKG +DFSG ND+ F+K + +GL+V LR P
Sbjct: 59 WRARMKMAKAMGLNTIGTYVFWNLHEPQKGHFDFSGNNDVAEFVKIAKEEGLWVILRPSP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + G+V RS Y +
Sbjct: 119 YVCAEWEFGGYPYWLQNEKGLVVRSMEAQYIAEYRKYINEVGKQLAPLQINHGGNILMVQ 178
Query: 93 IENEYQTIEP-----AFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG--PVINACN 145
IENEY + A +++ + AA +T P K PG P IN +
Sbjct: 179 IENEYGSYGSDKAYLALNQQ----LFKAAGFDGLLYTCDPGADVKNGHLPGLMPAINGVD 234
Query: 146 GMRCGETFKGPNSPNKPSIWTEDW-TSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
+ N K + +W +++ WG + +A+ + +A G +N
Sbjct: 235 DPAKVKKIINENHNGKGPYYIAEWYPAWFDWWGASHHTVAAEKYVGRLDTVLAA-GISIN 293
Query: 205 YYMYHGGTNFGRTAAAFM--------------ITGYYDQAPLDEYG 236
YM+HGG T AFM IT Y APLDE G
Sbjct: 294 MYMFHGG-----TTRAFMNGANYKDETPYEPQITSYDYDAPLDEAG 334
>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
Length = 570
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/299 (28%), Positives = 132/299 (44%), Gaps = 55/299 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WNLHE + + F DI++F+K Q GLYV +R GP
Sbjct: 5 WKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVIIRPGP 64
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF----HEKGPPYVLWA 115
+I +EW GGLP WL + R+ P+ ++ +Q + P + +G P + W
Sbjct: 65 YICAEWDLGGLPSWLLSDPEMKLRTSYGPFMEAVDRYFQKLFPLLTPLQYCQGGPIIAWQ 124
Query: 116 --------------------AKMAVDFHTGVPWVMCKQDD----APGPV------INACN 145
KM V GV ++ D+ P+ IN
Sbjct: 125 IENEYSSFDKKVDMTYMELLQKMMV--KNGVTEMLLMSDNLFSMKTHPINLVLKTINLQK 182
Query: 146 GMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
++ P+KP + TE W ++ VWG K +I + + + + G+ +N+
Sbjct: 183 NVKDALLQLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKDLFSL-GASINF 241
Query: 206 YMYHGGTNFG-RTAAAFM--------------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM+HGGTNFG A+F IT Y APL E G + PK+ L++
Sbjct: 242 YMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESGDI-TPKYKALRKF 299
>gi|348508362|ref|XP_003441723.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 605
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 127/297 (42%), Gaps = 57/297 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G ++F + D+ ++ GL+V LR GP
Sbjct: 38 WEDRLLKMKACGLNTLTTYVPWNLHEPERGTFNFQDQLDLKAYVSLAAQLGLWVILRPGP 97
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLWA 115
+I +EW GGLP WL + R+ + + I+P E G P + A
Sbjct: 98 YICAEWDLGGLPSWLLQDEEMQLRTTYPGFVNAVNLYFDKLISVIKPLMFEGGGPII--A 155
Query: 116 AKMAVDFHTGVPWVMCKQDDAPGPVINAC----------------NGMRCGET---FKGP 156
++ ++ + +DD P I C G+RCG K
Sbjct: 156 VQVENEYGS------FAKDDKYMPFIKNCLQSRGIKELLMTSDNWEGLRCGGVEGALKTV 209
Query: 157 N---------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
N P KP + E W+ ++ VWG ++ A+D+ V I G
Sbjct: 210 NLQRLSFGAIQHLADIQPQKPLMVMEYWSGWFDVWGEHHHVFYAEDM-LAVVSEILDRGV 268
Query: 202 YVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+N YM+HGGT FG A +T Y APL E G PK+ HL+ L +
Sbjct: 269 SINLYMFHGGTTFGFMNGAMDFGTYKSQVTSYDYDAPLSEAGDC-TPKYHHLRNLFS 324
>gi|449493221|ref|XP_002196735.2| PREDICTED: beta-galactosidase [Taeniopygia guttata]
Length = 636
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 150/360 (41%), Gaps = 48/360 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD IQTYV WN HEPQ G YDF G D+ F++ GL V LR GP
Sbjct: 42 WKDRLLKMKMAGLDAIQTYVPWNYHEPQMGTYDFFGGKDLQYFLQLANDTGLLVILRAGP 101
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y E + P ++ G P ++
Sbjct: 102 YICAEWDMGGLPAWLLEKKSIVLRSSDSDYLEAVERWMGVLLPKMRPYLYQNGGPIIMVQ 161
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPG------------PVINACN 145
+ + A D+ H G V+ D A ++
Sbjct: 162 VENEYGSYFACDYNYLRFLLKLFRLHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAP 221
Query: 146 GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G F S P P + +E +T + WG + AQ IA + +A +G+ V
Sbjct: 222 GANVTAAFLAQRSSEPKGPLVNSEFYTGWLDHWGHHHSVVPAQTIAKTLNEILA-SGANV 280
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
N YM+ GGTNF A M T Y APL E G + E K+ L+++ K
Sbjct: 281 NLYMFIGGTNFAYWNGANMPYMPQPTSYDYDAPLSEAGDLTE-KYFALRKVIGMYKQLPE 339
Query: 259 PLLTGTQNVISLGQ--LQEA-FVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKSI 315
L T + G+ LQ+A V E G+ + V + L + Y L R ++
Sbjct: 340 GLTPPTTPKFAYGKVRLQKAGTVLEVLDGLSRSGPVRSTYPLTFVELKQYFGYVLYRTTL 399
>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
Length = 587
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/405 (27%), Positives = 168/405 (41%), Gaps = 77/405 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYV WNLH+P+ G G D+ RF++ ++GL V LR GP
Sbjct: 35 WADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLDGLLDLPRFLRLAHAEGLKVLLRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL + + RS + + +
Sbjct: 95 YICAEWDGGGLPHWLMSESDVQLRSSDPKFTAIIDRYLDLLLPPLLPHMAESGGPVIAVQ 154
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHT---GVPWVMCKQDDAPGP 139
+ENEY + + AF +G +L+ H +P V+ G
Sbjct: 155 VENEYGAYGNDAEYLKYLVEAFRSRGIEELLFTCDQVNPEHQQAGSIPGVLSTGTFG-GK 213
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
+ A +R + P P + E W ++ WGG + R D+A + +A
Sbjct: 214 IETALATLRA-------HQPEGPLMCAEFWIGWFDHWGGPHHTRDTADVAADLDKLLAA- 265
Query: 200 GSYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
G+ VN YM+HGGTNFG T A IT Y APL E G PK+ +E+ A
Sbjct: 266 GASVNIYMFHGGTNFGLTNGANHHHTYAPTITSYDYDAPLTENG-DPGPKYHAFREVIAK 324
Query: 253 IKLCSRPLLTGTQNV-ISLGQLQEAF----VFEETSG--VCAAFLVNNDE--RKAVTVLF 303
L T + + ++ +L E E SG V + DE +A VL+
Sbjct: 325 YAPVPEELPTPSAKLPVTEVELTERAPLLPYLSELSGRTVRTETPITADELGMRAGYVLY 384
Query: 304 RNISYELPRKSISIL------PDCKTVAFNTERVSTQYNKRSKTS 342
R+ LP+ + +L D V + V N+R +TS
Sbjct: 385 RS---SLPKNGLGVLRFEGGVGDRAQVYVDGAPVGVLENERRETS 426
>gi|297788786|ref|XP_002862437.1| hypothetical protein ARALYDRAFT_359611 [Arabidopsis lyrata subsp.
lyrata]
gi|297307951|gb|EFH38695.1| hypothetical protein ARALYDRAFT_359611 [Arabidopsis lyrata subsp.
lyrata]
Length = 256
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/273 (32%), Positives = 122/273 (44%), Gaps = 68/273 (24%)
Query: 379 AKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHILHAFVNGEYTGSAHGSHDNVS 432
KD +DY WYT + + L V GH L +VNGEY
Sbjct: 24 TKDKTDYAWYTTSIKIEDDDIPDQKGQKTILRVAGLGHALIVYVNGEYA----------- 72
Query: 433 FTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQD-KSFT-----NCSW 486
++LR N ++L V GLPDSG+++E AG V + KS T N W
Sbjct: 73 ------INLRTRDNCISILGVLTGLPDSGSYMEHTYAGPRGVSIIGLKSGTRDLIENNEW 126
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKG 546
G+ V Y+ G KV W + LTWYKT P G + +A+ ++ MGKG
Sbjct: 127 GHLV---------YTEEGSKKVKWEKY-GEHKPLTWYKT----PEGENAVAIRMKGMGKG 172
Query: 547 EAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKP--T 604
WVNG +GRYW+SF + G P QT+ YH+PR+F+K
Sbjct: 173 LIWVNGIGVGRYWMSFVSPLGEPIQTE--------------------YHIPRSFMKEEKK 212
Query: 605 GNLLVLLEEENGNPLGITVDTIAIRKVCGHVTN 637
++LV+LEEE P+ V T + K+ + N
Sbjct: 213 KSMLVILEEE---PVAKMVPTSSPTKMINDLLN 242
>gi|297204198|ref|ZP_06921595.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197714112|gb|EDY58146.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 588
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 126/283 (44%), Gaps = 56/283 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ I+TY+ WNLHEP+ G G D+ R+++ Q +GL+V LR GP
Sbjct: 38 WTDRLRKARLMGLNTIETYLPWNLHEPEPGTLVLDGFLDLPRWLRLAQDEGLHVLLRPGP 97
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN------------------KPY-----------K 92
FI +EW GGLP WL I RS + +P+ +
Sbjct: 98 FICAEWDDGGLPAWLLADPDIRLRSSDPRFTGAFDGYLDQLLPALRPFMAAHGGPVIAVQ 157
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
+ENEY + + A ++G +L+ A H PG +
Sbjct: 158 VENEYGAYGDDTAYLKHVHQALRDRGVEELLYTCDQASAEH-------LAAGTLPGTLAT 210
Query: 143 ACNGMRCGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A G R E + P P + +E W ++ WGG ++RSA D A + ++ G
Sbjct: 211 ATFGSRVEENLAALRTHQPEGPLMCSEFWVGWFDHWGGPHHVRSAADAAADLDRLLSA-G 269
Query: 201 SYVNYYMYHGGTNFGRTAAA-------FMITGYYDQAPLDEYG 236
+ VN YM+HGGTNFG T A +T Y APL E G
Sbjct: 270 ASVNIYMFHGGTNFGFTNGANHKHAYEPTVTSYDYDAPLTESG 312
>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
Length = 589
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/254 (28%), Positives = 114/254 (44%), Gaps = 42/254 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WNLHEP++G++DF G D++ FIK+ Q L V +R P
Sbjct: 34 WEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQGIKDVVSFIKKAQEMELMVIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF----HEKGPPYVLWA 115
+I +EW +GGLP WL + RSD Y K++N Y+ + P +G P ++
Sbjct: 94 YICAEWEFGGLPAWLLTYDNLHLRSDCPRYLEKVKNYYEVLLPMLTSLQSTQGGPIIMMQ 153
Query: 116 A------------------KMAVDFHTGVPWVMC----KQDDAPGPVIN----------- 142
K+ +D VP +Q G +I+
Sbjct: 154 VENEFGSFSNNKTYLKKLKKIMLDLGVEVPLFTSDGSWQQALESGSLIDDDVLVTANFGS 213
Query: 143 -ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
+ + E F + P + E W ++ WG + R AQD+A V + +
Sbjct: 214 HSHENLDVLEQFMANHQKKWPLMSMEFWDGWFNRWGEEIITRDAQDLANCVKELLTRGS- 272
Query: 202 YVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 273 -INLYMFHGGTNFG 285
>gi|433461907|ref|ZP_20419504.1| beta-galactosidase [Halobacillus sp. BAB-2008]
gi|432189486|gb|ELK46587.1| beta-galactosidase [Halobacillus sp. BAB-2008]
Length = 579
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 118/258 (45%), Gaps = 33/258 (12%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WNLHEP++G+++FSG DI FI+ GLYV +R P
Sbjct: 33 WEDRLEKLKALGLNTVETYVPWNLHEPRRGEFEFSGLADIEGFIQTAADLGLYVIVRPAP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF----HEKGPPYV--- 112
+I +EW GGLP WL +V RS + Y +E+ Y+ + P F ++ G P +
Sbjct: 93 YICAEWEMGGLPSWLLKDKDVVMRSSDPVYLSYVESYYKELLPKFVPHLYQNGGPIIAMQ 152
Query: 113 --------------LWAAKMAVDFHTGVPWVMC-------KQDDAPGPVINACNGMRCGE 151
L K + H ++ +Q P G + +
Sbjct: 153 IENEYGAYGNDQKYLTFLKKQYEQHGLDTFLFTSDGPDFIEQGSLPDVTTTLNFGSKVEQ 212
Query: 152 TFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
F+ ++ P + E W ++ W G+ + R A D A + + S VN+YM+H
Sbjct: 213 AFERLDAFKTGSPKMVAEFWIGWFDYWTGEHHTRDAGDAAAVFRELMERKAS-VNFYMFH 271
Query: 210 GGTNFGRTAAAFMITGYY 227
GGTNFG A YY
Sbjct: 272 GGTNFGFMNGANHYDVYY 289
>gi|156375241|ref|XP_001629990.1| predicted protein [Nematostella vectensis]
gi|156217002|gb|EDO37927.1| predicted protein [Nematostella vectensis]
Length = 578
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 133/291 (45%), Gaps = 44/291 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WNLHE K + F DI++F+ Q GL+V +R GP
Sbjct: 5 WADRLKKLKAMGLNTVETYVAWNLHEQVKENFKFKDEVDIVKFVNLAQELGLHVIIRPGP 64
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVLW- 114
+I SEW GGLP WL + + RS P+ E + + P +G P + W
Sbjct: 65 YICSEWDLGGLPSWLLNDPNMRLRSTYGPFMEAVEKYFSKLFALLTPLQFSRGGPIIAWQ 124
Query: 115 ------AAKMAVDFH-----------TGVPWVMCKQDDA----PGPV-INACNGMRCGET 152
+ + VD H G ++ DD P+ ++ M +
Sbjct: 125 VENEYASVQEEVDNHYMELLHKLMLKNGATELLFTSDDVGYTKRYPIKLDGGKYMSFNKW 184
Query: 153 F--KGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
F P+KP + TE W+ ++ WG K ++ + + + I G+ +N+YM+HG
Sbjct: 185 FCLFLHFQPDKPIMVTEYWSGWFDHWGEKHHVLNTERKMINEVKDILDMGASINFYMFHG 244
Query: 211 GTNFG-----RTAAAFMITGY------YD-QAPLDEYGLVREPKWGHLKEL 249
GTNFG TA + GY YD APL E G + PK+ L++L
Sbjct: 245 GTNFGFMNGANTAGNRIDDGYQPDVTSYDYDAPLSEAGDIT-PKYKALRKL 294
>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
Length = 867
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 132/309 (42%), Gaps = 43/309 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ KAK GG + I+TY+ WN HE +G++DFSG D+ F + + LYV R GP
Sbjct: 33 WNEVLDKAKAGGCNTIETYIPWNFHEMNEGEWDFSGDKDLAHFFQLCADKELYVIARPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL I +RS + +
Sbjct: 93 YICAEWDFGGFPWWLSTKKDIQYRSAQPAFLHYVDQYFDRVIPIIDEYQLTKNGTVIMVQ 152
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMC-KQDDAPGPVINACNGMRCGE 151
+ENE+Q A+ + PY+ + VP V C + N + +
Sbjct: 153 VENEFQ----AYGKPDKPYMEYIRDGMKARGIDVPLVTCYGAVEGAVEFRNFWSHSKHAA 208
Query: 152 TFKGPNSPNKPSIWTEDWTSFYQVWGG-KPYIRSAQDIAFHVALFIAKNGSYVNYYMYHG 210
P++P E W +++ WGG K ++ + + ++ + +NYYMY G
Sbjct: 209 AILDERFPDQPKGVMEFWIGWFEQWGGNKADQKTPEQLERECYQLLSNGFTAINYYMYFG 268
Query: 211 GTNF----GRTAA--AFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGT 264
GTNF GRT T Y +DEY L K+ LK H+ +K PL T
Sbjct: 269 GTNFDHWGGRTVGEQTLCTTTYDYDVAIDEY-LQPTRKYEVLKRYHSFVKWL-EPLFTDA 326
Query: 265 QNVISLGQL 273
+ V S +L
Sbjct: 327 EKVASDMKL 335
>gi|410100792|ref|ZP_11295748.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
gi|409214073|gb|EKN07084.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
Length = 779
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 147/365 (40%), Gaps = 76/365 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y FWN+HE + G++DFSG+NDI F + Q G+Y+ LR GP
Sbjct: 63 WEHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKNGMYIMLRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ SEW GGLP WL I R+ N PY
Sbjct: 123 YVCSEWEMGGLPWWLLKKEDIQLRT-NDPYFIERTRIYMNEIGKQLADRQITRGGNIIMV 181
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCKQD--------DAPGPVIN 142
++ENEY + + Y+ + D T VP C D +N
Sbjct: 182 QVENEYGS-----YATDKSYIAKNRDILRDAGFTDVPLFQCDWSSNFLNNALDDLVWTVN 236
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G E FK PN P + +E W+ ++ WG K R A+ + + + +N
Sbjct: 237 FGTGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMIAGLRDMLDRNI 296
Query: 201 SYVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
S+ + YM HGGT FG A + M + Y AP+ E G PK+ L+E A
Sbjct: 297 SF-SLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWAT-PKYHKLREFMA--- 351
Query: 255 LCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYELPRKS 314
N ++ G++Q E + E K +LF N+ P+ S
Sbjct: 352 -----------NYMAPGEVQ-----PEIPDAFPVIEIPEFELKETALLFENLPE--PKTS 393
Query: 315 ISILP 319
I P
Sbjct: 394 HDIKP 398
Score = 39.7 bits (91), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 26/38 (68%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+ TF D + L++Q+ GKG WVNG+++GR+W
Sbjct: 533 YYRATFNLETPGD-VFLDMQTWGKGMVWVNGKAMGRFW 569
>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 610
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 122/286 (42%), Gaps = 53/286 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + AK GL+ I TYVFWN+HEP+KGQYDFSG NDI F+K + + L+V LR P
Sbjct: 57 WRDRMKMAKAMGLNTIGTYVFWNVHEPEKGQYDFSGNNDIAAFVKMAKEEDLWVVLRPSP 116
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL ++ G+ RS Y +
Sbjct: 117 YVCAEWEFGGYPYWLQEIKGLKVRSKEPQYLEAYRNYIMAVGKQLSPLLVTHGGNILMVQ 176
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVD------FHTGVPWVMCKQDDAPG--PVINAC 144
IENEY + + Y+ KM V+ +T P K PG P IN
Sbjct: 177 IENEYGS-----YSDDKDYLDINRKMFVEAGFDGLLYTCDPKAAIKNGHLPGLLPAINGV 231
Query: 145 NGMRCGETFKGPNSPNKPSIWTEDW-TSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
+ + N K + +W +++ WG K + + + +A G +
Sbjct: 232 DDPLQVKQLINENHSGKGPYYIAEWYPAWFDWWGTKHHTVPYRQYLGKLDSVLAA-GISI 290
Query: 204 NYYMYHGGTNFGRTAAAF---------MITGYYDQAPLDEYGLVRE 240
N YM+HGGT G A I+ Y APLDE G E
Sbjct: 291 NMYMFHGGTTRGFMNGANANDADPYEPQISSYDYDAPLDEAGNATE 336
>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
Length = 586
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 127/287 (44%), Gaps = 59/287 (20%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + K K GL+ ++TYV WNLHEP GQ+ + G D+ FI+ +S GLYV +R G
Sbjct: 38 LWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFRYEGGLDLAAFIRLAESLGLYVIVRPG 97
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
PFI +EW +GGLP WL + R +PY
Sbjct: 98 PFICAEWEFGGLPAWLLADPYMEVRCCYQPYLEAVRRFYDDLLPRLLPLQIQRGGPILAM 157
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVI---------- 141
++ENEY + + Y+ W ++ +D GV ++ D A ++
Sbjct: 158 QVENEYGS-----YGSDQLYLTWLRRLMLD--GGVETLLFTSDGATDHMLKHGTLAQVWK 210
Query: 142 NACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
+A G R E F P+ P + E W ++ WG + R A D A + +A
Sbjct: 211 SANFGSRAEEEFAKLREYQPDGPLMCMEFWNGWFDHWGEPHHTRDAADAADALERIMA-C 269
Query: 200 GSYVNYYMYHGGTNFGRTAAAF----------MITGYYDQAPLDEYG 236
G++VN YM+HGGTNFG A + Y APLDE G
Sbjct: 270 GAHVNVYMFHGGTNFGFMNGANTDLLTRDYQPTVNSYDYDAPLDETG 316
>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 84/163 (51%), Gaps = 42/163 (25%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AKEGG+DVI+TYVF N HE Y F G D+++F+K +Q G+Y+ L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKI--------------------------- 93
PF+ +EW + G +F++++KP+K
Sbjct: 61 PFVATEWNF-----------GTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 109
Query: 94 ----ENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK 132
+NEY + + + G PYV+WAA M + + GVPW+MC+
Sbjct: 110 LTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQ 152
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 53/105 (50%), Gaps = 29/105 (27%)
Query: 203 VNYYMYHGGTNFGRTAAA-FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL 261
VNYYMYHGGTNFG T+ F+ T Y AP+DEYGL R PK C
Sbjct: 237 VNYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK-------------CPS--- 280
Query: 262 TGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNI 306
QE V+ ++ G AAF+ N DE++ ++F+N+
Sbjct: 281 ------------QEVDVYADSLGGYAAFISNVDEKEDKMIVFQNV 313
>gi|327283884|ref|XP_003226670.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Anolis
carolinensis]
Length = 584
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 118/254 (46%), Gaps = 48/254 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHE +G++DFSG D+ FIK + GL+V LR GP
Sbjct: 44 WKDRLMKMKACGLNTVTTYVPWNLHEAIRGKFDFSGNLDLQVFIKMAEEVGLWVILRPGP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW GGLP WL + R+ + + +
Sbjct: 104 YICSEWDLGGLPSWLLQDPEMQLRTTYRGFTEAVDNYFDRLIPQVVPLQYKYGGPIIAVQ 163
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + + P Y+ + KMA+ V +M D+ G V +G
Sbjct: 164 VENEYGS-----YAQDPSYMTY-IKMALTSRKIVEMLMTS-DNHDGLVSGTVDGALATIN 216
Query: 153 FKGPNSP----------NK-PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
F+ ++ NK P + E WT ++ WGG ++ A D+ V I K G+
Sbjct: 217 FQKLDTAIMVFLSTDQRNKMPKMVMEYWTGWFDSWGGLHHVFDADDMVQTVGKVI-KLGA 275
Query: 202 YVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 276 SINLYMFHGGTNFG 289
>gi|311264379|ref|XP_003130137.1| PREDICTED: galactosidase, beta 1-like 2 [Sus scrofa]
Length = 635
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 130/303 (42%), Gaps = 55/303 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 77 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDMEAFILLAAEVGLWVILRPGP 136
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL + + R+ + + +
Sbjct: 137 YICSEIDLGGLPSWLLQDSSMKLRTTYEGFTKAVDLYFDHLMARVVPLQYKNGGPIIAVQ 196
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN 142
+ENEY + + K P Y+ + K D G+ ++ D+ G IN
Sbjct: 197 VENEYGS-----YNKDPAYMPYIKKALED--RGIVELLLTSDNEDGLSKGTVDGVLATIN 249
Query: 143 --ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ N +R F +P + E WT ++ WGG +I ++ V+ I G
Sbjct: 250 LQSQNELRLLHNFLQSVQGVRPKMVMEYWTGWFDSWGGPHHILDTSEVLRTVSAII-DAG 308
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLV------REPKWGHLKELHAAIK 254
+ +N YM+HGGTNFG A Y +Y V PK+ L+EL +I
Sbjct: 309 ASINLYMFHGGTNFGFINGAMHFQDYMSDVTSYDYDAVLTEAGDYTPKYIRLRELFGSIS 368
Query: 255 LCS 257
S
Sbjct: 369 GAS 371
>gi|327282153|ref|XP_003225808.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 649
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 134/297 (45%), Gaps = 47/297 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD IQTYV WN HEP++G Y+F+G D+ F++ Q GL V LR GP
Sbjct: 63 WKDRLLKMKMAGLDAIQTYVPWNFHEPERGVYNFTGDRDLEYFLQLAQEVGLLVILRAGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + ++P ++ G P ++
Sbjct: 123 YICAEWDMGGLPAWLLEKESIVLRSSDPDYLTAVGSWMGIFLPKMKPHLYQNGGPIIMVQ 182
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + A DF + G V+ D A + A G+ F G
Sbjct: 183 VENEYGSYFACDFDYLRYLQNLFRQYLGDEVVLFTTDGASMFYLRCGALQGLYSTVDF-G 241
Query: 156 P-------------NSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P P P + +E +T + WG + A +A ++ +A +G+
Sbjct: 242 PGRNVTAAFSTQRHTEPKGPLVNSEFYTGWLDHWGHRHITVPASIVAKSLSEILA-SGAN 300
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
VN YM+ GGTNFG A M T Y APL E G + E K+ ++E+ K
Sbjct: 301 VNMYMFIGGTNFGYWNGANMPYMAQPTSYDYDAPLSEAGDLTE-KYFAIREVIGMFK 356
>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
Length = 774
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 123/285 (43%), Gaps = 63/285 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+ GL+ I YVFWN HE Q G++DFSG+ D+ F++ Q +GLYV LR GP
Sbjct: 60 WRDRLKRARAMGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+ +EW +GG P WL +V+RS + + +
Sbjct: 120 YACAEWDFGGYPSWLLKEKDMVYRSKDPRFLEYCERYIKALGKQLAPLTVNNGGNILMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M D VP C D G V +
Sbjct: 180 VENEYGS-----YAADKEYLAALRDMIKDAGFNVPLFTC---DGGGQVEAGHIDGALPTL 231
Query: 145 NGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + P P E + +++ VWG + Y R A+ + + +
Sbjct: 232 NGVFSEDIFKIIDKYHPGGPYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLG----- 286
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYG 236
G V+ YM+HGGTNF A GY Q APL E+G
Sbjct: 287 QGVSVSMYMFHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWG 331
>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 774
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 123/285 (43%), Gaps = 63/285 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+ GL+ I YVFWN HE Q G++DFSG+ D+ F++ Q +GLYV LR GP
Sbjct: 60 WRDRLKRARAMGLNTISVYVFWNFHERQPGEFDFSGQADVAEFVRLAQEEGLYVILRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+ +EW +GG P WL +V+RS + + +
Sbjct: 120 YACAEWDFGGYPSWLLKEKDMVYRSKDPRFLEYCERYIKALGKQLAPLTVNNGGNILMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M D VP C D G V +
Sbjct: 180 VENEYGS-----YAADKEYLAALRDMIKDAGFNVPLFTC---DGGGQVEAGHIDGALPTL 231
Query: 145 NGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + P P E + +++ VWG + Y R A+ + + +
Sbjct: 232 NGVFSEDIFKIIDKYHPGGPYFVAEFYPAWFDVWGQRHSTVDYKRPAEQLDWMLG----- 286
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYG 236
G V+ YM+HGGTNF A GY Q APL E+G
Sbjct: 287 QGVSVSMYMFHGGTNFWYMNGANTAGGYRPQPTSYDYDAPLGEWG 331
>gi|395846556|ref|XP_003795969.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Otolemur
garnettii]
Length = 633
Score = 113 bits (282), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 130/300 (43%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEPQ+G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPQRGKFDFSGNLDLEAFVLLAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + K P Y+ + K D G+ ++ D+ G +G+
Sbjct: 198 VENEYGS-----YYKDPAYMPYVKKALED--RGIVELLFTSDNKDGLRKGIIHGVLATIN 250
Query: 153 FKGPNSPN------------KPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ P +P + TE WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSPQELQLLTTLLVSIQGVQPKMVTEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDTG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
S +N YM+HGGTNFG A IT Y A L E G PK+ L++ ++
Sbjct: 310 SSINLYMFHGGTNFGFINGAMHFQDYRSDITSYDYDAVLTEAG-DYTPKYIKLRDFFDSL 368
>gi|423220237|ref|ZP_17206732.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
gi|392623314|gb|EIY17417.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
Length = 778
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 131/294 (44%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY + + PYV + + T VP C ++A + +N
Sbjct: 180 VENEYSS-----YATDKPYVAAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNF 234
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G E K+ L++L
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 346
Score = 40.0 bits (92), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+YKTTF+ D L++ + GKG WVNG ++GR+W
Sbjct: 532 YYKTTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFW 568
>gi|153808925|ref|ZP_01961593.1| hypothetical protein BACCAC_03226 [Bacteroides caccae ATCC 43185]
gi|149128258|gb|EDM19477.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 778
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 131/294 (44%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKTLGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---INA 143
+ENEY + + PYV + + T VP C ++A + +N
Sbjct: 180 VENEYSS-----YATDKPYVAAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNF 234
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G E K+ L++L
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 346
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+YKTTF+ D L++ + GKG WVNG ++GR+W
Sbjct: 532 YYKTTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFW 568
>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 586
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 117/276 (42%), Gaps = 42/276 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I KA+ GL+ I+TYV WN H P+ G +D G D+ RF++ ++ G+Y +R GP
Sbjct: 35 WADRIEKARLMGLNTIETYVPWNAHSPRPGVFDTDGILDLPRFLRLVKDAGMYAIVRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPP----- 110
FI +EW GGLP WL G+ R + E E + P + G P
Sbjct: 95 FICAEWDNGGLPPWLFREPGVGIRRHEPRFLDEVEKYLHQVLALVRPHQVDLGGPVLLVQ 154
Query: 111 -------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN 157
Y+ A M VP V Q +G+ +F +
Sbjct: 155 VENEYGAYGDDRDYLQAVADMIRGAGIDVPLVTVDQPVDAMLAAGGLDGVLRTSSFGSDS 214
Query: 158 S----------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P P + E W ++ WGG+ + + A + +A G+ VN YM
Sbjct: 215 ANRLRTLRDHQPTGPLMCMEFWDGWFDHWGGRHHTTPVEQAAEELDALLAA-GASVNVYM 273
Query: 208 YHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
+HGGTNFG T+ A +T Y APLDE G
Sbjct: 274 FHGGTNFGLTSGANDKGIYRPTVTSYDYDAPLDEAG 309
>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
Length = 612
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 124/275 (45%), Gaps = 54/275 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL E ++GQ+DF+G NDI F++E SQGL V LR GP
Sbjct: 59 WKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDISAFVREAASQGLNVILRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GG P WL + RS + + +
Sbjct: 119 YVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEALGTQVRPLLNGNGGPIIAVQ 178
Query: 93 IENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
+ENEY Q + F + G +L+ A A G +P V+ + APG
Sbjct: 179 VENEYGSYGDDHGYLQAVRALFIKAGLGGALLFTADGAQMLGNGTLPDVLAAVNVAPGEA 238
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A + + TF P +P + E W ++ W GKP+ ++ ++ + G
Sbjct: 239 KQALDKL---ATFH----PGQPQLVGEYWAGWFDQW-GKPHAQTDAKQQADEIEWMLRQG 290
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+N YM+ GGT+FG FM + P D Y
Sbjct: 291 HSINLYMFVGGTSFG-----FMNGANFQGGPSDHY 320
>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
Length = 859
Score = 112 bits (280), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 135/300 (45%), Gaps = 56/300 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE ++GQ+DF+G+ND+ F + Q G+YV +R GP
Sbjct: 125 WEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGQNDVAAFCRLAQQNGMYVIVRPGP 184
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-------NEYQTIEPAFHEKGPPYVLW 114
++ +EW GGLP WL I R + PY +E + + P +G P ++
Sbjct: 185 YVCAEWEMGGLPWWLLKKKDIRLREQD-PYFMERVELFEQKVAEQLAPLTIRRGGPIIMV 243
Query: 115 AAKMAV-DFHTGVPWVMCKQD----------------DAPGPVINACN------------ 145
+ + +V +D +A P++ C+
Sbjct: 244 QVENEYGSYGEDKAYVSQIRDVLRRYWSLSPTGEGRGEAASPLMFQCDWSSNFTRNGLDD 303
Query: 146 ---------GMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
G + F+ G P+ P + +E W+ ++ WG + R A+D+ +
Sbjct: 304 LVWTMNFGTGANINDQFRRLGELRPDAPKMCSEFWSGWFDKWGARHETRPARDMVAGIDE 363
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKE 248
++K S+ + YM HGGT+FG A A +T Y AP++EYG PK+ L++
Sbjct: 364 MLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKFWELRK 421
>gi|355690250|gb|AER99094.1| galactosidase, beta 1 [Mustela putorius furo]
Length = 648
Score = 112 bits (279), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 119/283 (42%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ GQY FSG D+ FIK GL V LR GP
Sbjct: 54 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGQYKFSGEQDVEYFIKLAHELGLLVILRPGP 113
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 114 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPRMKPLLYQNGGPIITVQ 173
Query: 113 ---------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
L + +H G ++ D A P + A G+ F G
Sbjct: 174 VENEYGSYFTCDYDYLRFLQKLFHYHLGKDVLLFTTDGALEPFLQCGALQGLYATVDF-G 232
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P + +E +T + W G+P+ ++ I G+
Sbjct: 233 PGANITAAFEVQRKSEPKGPLVNSEFYTGWLDHW-GQPHSTVKTEVVASSLHDILARGAN 291
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF A M T Y APL E G + E
Sbjct: 292 VNLYMFIGGTNFAYWNGANMPYKAQPTSYDYDAPLSEAGDLTE 334
>gi|351700626|gb|EHB03545.1| Beta-galactosidase-1-like protein 2 [Heterocephalus glaber]
Length = 654
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 136/319 (42%), Gaps = 56/319 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLLAAEVGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +E GGLP WL G+ R+ K + +
Sbjct: 138 YVCAEIDLGGLPSWLLQDPGMKLRTTYKGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC--- 149
+ENEY + + + P Y+ + K D G+ ++ D+ G +G+
Sbjct: 198 VENEYGS-----YNRDPAYMPYVKKALED--RGIIELLLTSDNKDGLQKGVVHGVLATIN 250
Query: 150 ---------GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
TF N+P + E WT ++ WG I + ++ V+ I G
Sbjct: 251 LQSQQELQLLTTFLLSVQGNQPKMVMEYWTGWFDSWGSPHNILDSSEVLETVSA-IVNAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGH--LKELHAAIKLCSR 258
S +N YM+HGGTNFG A Y ++ + YG + WG L++LH +
Sbjct: 310 SSINLYMFHGGTNFGFINGAMHFNEY--KSDVTSYG---KQFWGQGRLRQLHGCLADYDA 364
Query: 259 PLLTGTQNVISLGQLQEAF 277
L G+L++ F
Sbjct: 365 VLTEAGDYTAKYGKLRDFF 383
>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
Length = 595
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 128/310 (41%), Gaps = 60/310 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ + TYV WN HE G F G D+ RFI+ Q +GL V +R GP
Sbjct: 37 WADRLRRLAALGLNAVDTYVPWNFHERTAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGP 96
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL G+ R+ + PY +
Sbjct: 97 YICAEWDNGGLPAWLTGTPGMRLRTSHGPYLEAVDRWFDALVPRIAELQAGRGGPVVAVQ 156
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY + I A +G +L+ A G +M PG +
Sbjct: 157 IENEYGSYGDDRAYVRHIRDALVARGITELLYTAD-------GPTPLMQDGGALPGELAA 209
Query: 143 ACNGMRC--GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A G R P +P E W ++ WG K ++R A A + + + G
Sbjct: 210 ATFGSRPDRAAALLRSRRPAEPFFCAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGG 269
Query: 201 SYVNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
S V+ YM HGGTNFG A A +T Y AP+ E G + PK+ L++ A
Sbjct: 270 S-VSLYMAHGGTNFGLWAGANHEGGTIRPTVTSYDSDAPIAENGAL-TPKFFALRDRLTA 327
Query: 253 IKLCS--RPL 260
+ + RPL
Sbjct: 328 LGTAATRRPL 337
>gi|71731106|gb|EAO33173.1| Beta-galactosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 612
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 138/321 (42%), Gaps = 61/321 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL E ++GQ+DF+G NDI F++E SQGL V LR GP
Sbjct: 59 WKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDIGAFVREAASQGLNVILRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GG P WL + RS + + +
Sbjct: 119 YVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEALGTQVRPLLNGNGGPIIAVQ 178
Query: 93 IENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
+ENEY Q + F + G +L+ A A G +P V+ + APG
Sbjct: 179 VENEYGSYGDDHGYLQAVHALFIKAGLGGALLFTADGAQMLGNGTLPDVLAAVNFAPGEA 238
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A + + TF P +P + E W ++ W GKP+ ++ ++ + G
Sbjct: 239 KQALDKL---ATFH----PGQPQLVGEYWAGWFDQW-GKPHAQTDAKQQADEIEWMLRQG 290
Query: 201 SYVNYYMYHGGTNFGRTAAAFM-----------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+N YM+ GGT+FG A T Y A LDE G PK+ +++
Sbjct: 291 HSINLYMFVGGTSFGFMNGANFQGGPGDHYSPQTTSYDYDAVLDEAGRPM-PKFALFRDV 349
Query: 250 HAAIKLCSRPLLTGTQNVISL 270
+ P L G I L
Sbjct: 350 ITRVTGLQPPPLPGASRFIDL 370
>gi|119588246|gb|EAW67842.1| hypothetical protein BC008326, isoform CRA_a [Homo sapiens]
Length = 643
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 124/279 (44%), Gaps = 51/279 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN 142
+ENEY + + K P Y+ + K D G+ ++ D+ G IN
Sbjct: 198 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 250
Query: 143 --ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVR 239
S +N YM+HGGTNFG A Y ++ + YG R
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY--KSDVTSYGKAR 346
>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
Length = 797
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 135/300 (45%), Gaps = 56/300 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE ++GQ+DF+G+ND+ F + Q G+YV +R GP
Sbjct: 63 WEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGQNDVAAFCRLAQQNGMYVIVRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-------NEYQTIEPAFHEKGPPYVLW 114
++ +EW GGLP WL I R + PY +E + + P +G P ++
Sbjct: 123 YVCAEWEMGGLPWWLLKKKDIRLREQD-PYFMERVELFEQKVAEQLAPLTIRRGGPIIMV 181
Query: 115 AAKMAV-DFHTGVPWVMCKQD----------------DAPGPVINACN------------ 145
+ + +V +D +A P++ C+
Sbjct: 182 QVENEYGSYGEDKAYVSQIRDVLRRYWSLSPTGEGRGEAASPLMFQCDWSSNFTRNGLDD 241
Query: 146 ---------GMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
G + F+ G P+ P + +E W+ ++ WG + R A+D+ +
Sbjct: 242 LVWTMNFGTGANINDQFRRLGELRPDAPKMCSEFWSGWFDKWGARHETRPARDMVAGIDE 301
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKE 248
++K S+ + YM HGGT+FG A A +T Y AP++EYG PK+ L++
Sbjct: 302 MLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKFWELRK 359
>gi|76636681|ref|XP_597358.2| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|297483828|ref|XP_002693892.1| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|296479483|tpg|DAA21598.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 758
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 116/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + GL+ + TYV WNLHEP++G +DFSG D+ FI GL+V LR GP
Sbjct: 200 WRDRLLKLRACGLNTLTTYVPWNLHEPERGTFDFSGNLDLEAFILLAAEVGLWVILRPGP 259
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL + R+ K + +
Sbjct: 260 YICSEVDLGGLPSWLLRDPDMRLRTTYKGFTEAVDLYFDHLMLRVVPLQYKHGGPIIAVQ 319
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + K P Y+ + K D G+ ++ D+ G +G+
Sbjct: 320 VENEYGS-----YNKDPAYMPYIKKALQD--RGIAELLLTSDNQGGLKSGVLDGVLATIN 372
Query: 153 FKGPNS------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ++P + E WT ++ WGG YI + ++ V+ I K G
Sbjct: 373 LQSQSELQLFTTILLGAQGSQPKMVMEYWTGWFDSWGGPHYILDSSEVLNTVSA-IVKAG 431
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 432 SSINLYMFHGGTNFGFIGGAMHFQDY 457
>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
Length = 673
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/312 (28%), Positives = 131/312 (41%), Gaps = 56/312 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ I+TYV WN HEP GQY FSG D+ F++ + GL V LR GP
Sbjct: 94 WKDRLFKMKMAGLNAIETYVPWNFHEPFPGQYQFSGEQDLEYFLQLVHEVGLLVILRPGP 153
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP+WL + I RS + Y +E ++P ++ G P +
Sbjct: 154 YICAEWDMGGLPVWLLEKKSIFLRSSDPDYLKAVDKWLEVLLPKMKPYLYQNGGPIITVQ 213
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVINACNGMRCGE------ 151
+ + A D+ H G V+ D A N ++CG
Sbjct: 214 VENEYGSYFACDYNYLRFLLKVFRQHLGEEVVLFTTDGA------GENYLKCGTLQDLYA 267
Query: 152 --------------TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
+ P P + +E +T + WG S ++I + ++
Sbjct: 268 TVDFGTSSNITQAFMIQRKVEPKGPLVNSEFYTGWLDHWGESHQTVSTKNIVASLTDMLS 327
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
+ G+ VN YM+ GGTNFG A M T Y APL E G + E + + +
Sbjct: 328 R-GANVNLYMFIGGTNFGFWNGANMPYLPQPTSYDYDAPLSEAGDLTEKYYAVREAIGKF 386
Query: 253 IKLCSRPLLTGT 264
KL P+ T
Sbjct: 387 EKLPEGPIPPST 398
>gi|298481696|ref|ZP_06999887.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
gi|298272237|gb|EFI13807.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
Length = 778
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 125/285 (43%), Gaps = 52/285 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F K Q G+YV +R GP
Sbjct: 60 WSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVRE 240
+ + YM HGGT FG A + M + Y AP+ E G E
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE 338
>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
Length = 602
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 132/316 (41%), Gaps = 65/316 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ + TY+ WN HE + G++ F G DI RF++ Q GL V +R GP
Sbjct: 40 WHDRLERLAAMGLNTVDTYIAWNFHERRTGEHRFDGWRDIERFVRTAQRTGLDVIVRPGP 99
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL D G+ RS PY +
Sbjct: 100 YICAEWDNGGLPAWLTDRPGMRPRSSYAPYLDEVARWFDVLIPRIADLQAARGGPVVAVQ 159
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
+ENEY + + A +G +L+ A + +M PG +
Sbjct: 160 VENEYGSYGDDHAYMRWVHDALAGRGVTELLYTADGPTE-------LMLDGGSLPGVLAT 212
Query: 143 ACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A G R + + +P + E W ++ WG K + RS A + +AK G
Sbjct: 213 ATLGSRADQAAQLLRTRRSGEPFLCAEFWNGWFDHWGEKHHTRSVGSAAAALDEILAKGG 272
Query: 201 SYVNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVREPKWGHLKE-LHA 251
S V+ Y HGGTNFG A A +T Y AP+ E+G PK+ ++ L A
Sbjct: 273 S-VSLYPAHGGTNFGLWAGANHADGALQPTVTSYDSDAPIAEHG-APTPKFHAFRDRLLA 330
Query: 252 AIKLC------SRPLL 261
A SRPLL
Sbjct: 331 ATGAAERELPRSRPLL 346
>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
Length = 584
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 125/278 (44%), Gaps = 44/278 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W I KA+ GL+ I+TYV WN H P+ G +D SG D+ RF++ + G+Y +R G
Sbjct: 34 LWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLSGGLDLDRFLRLVADAGMYAIVRPG 93
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLW 114
P+I +EW GGLP WL + R Y + Y+ + P ++G P +L
Sbjct: 94 PYICAEWDNGGLPAWLFRDPSVGVRRYEPKYLDAVREYLTKVYEVVVPHQIDRGGPVLLV 153
Query: 115 AAK-------------MAVDFHT---GVPWVMCKQDDAPGPVI---NACNGMRCGETFKG 155
+ A+ HT GV V D P P + + +G+ +F
Sbjct: 154 QVENEYGAFGDDKRYLKALAEHTREAGVT-VPLTTVDQPTPEMLEAGSLDGLHRTASFGS 212
Query: 156 ----------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ P P + +E W ++ WG + SA D A + +A S VN
Sbjct: 213 GAEARLAILRAHQPTGPLMCSEFWNGWFDHWGAHHHTTSAADSAAELDALLAAGAS-VNL 271
Query: 206 YMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
YM+HGGTNFG T A +IT Y APLDE G
Sbjct: 272 YMFHGGTNFGLTNGANDKGVYQPLITSYDYDAPLDEAG 309
>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
Length = 781
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 143/323 (44%), Gaps = 56/323 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE ++G+++F+G ND+ F + Q G+YV +R GP
Sbjct: 62 WEHRIKMCKALGMNAICIYVFWNIHEQKEGEFNFTGNNDVAEFCRLAQKNGMYVIVRPGP 121
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-------NEYQTIEPAFHEKGPPYVLW 114
++ +EW GGLP WL I R + PY +E + + P ++G P ++
Sbjct: 122 YVCAEWEMGGLPWWLLKKKDIKLR-ERDPYFMERVKIFEDKVAEQLAPLTIQRGGPIIMV 180
Query: 115 AAK-----MAVDF-HTGVPWVMCKQD---------------------DAPGPVINACNGM 147
+ +D + G M +Q D +N G
Sbjct: 181 QVENEYGSYGIDKQYVGEIRDMLRQGWGNDVKMFQCDWSSNFTHNGLDDLIWTMNFGTGA 240
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
FK S P+ P + +E W+ ++ WG + R AQD+ ++ ++K S+ +
Sbjct: 241 NIDNQFKKLKSLRPDAPLMCSEFWSGWFDKWGARHETRPAQDMVNNIDEMLSKGISF-SL 299
Query: 206 YMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYG-------LVREPKWGHLKELHAA 252
YM HGGT+FG A A +T Y AP++EYG L+R + + A
Sbjct: 300 YMTHGGTSFGHWAGANSPGFQPDVTSYDYDAPINEYGQATAKYQLLRNTLQKYSDKRLPA 359
Query: 253 IKLCSRPLLTGTQNVISLGQLQE 275
+ PL+ + L QLQE
Sbjct: 360 VPQAPAPLIR-----VPLFQLQE 377
>gi|332264034|ref|XP_003281053.1| PREDICTED: beta-galactosidase-1-like protein 2 [Nomascus
leucogenys]
Length = 679
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 115/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 121 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 180
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 181 YICSELDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 240
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G G
Sbjct: 241 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGVVQGVLATIN 293
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 294 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 352
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 353 SSINLYMFHGGTNFGFMNGAMHFHDY 378
>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
Length = 787
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 127/299 (42%), Gaps = 67/299 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE Q+G++DF+G ND+ F + Q GLYV +R GP
Sbjct: 61 WEHRIKMCKALGMNTVCLYVFWNIHEQQEGRFDFTGNNDVAEFCRLAQRNGLYVIVRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R + PY
Sbjct: 121 YVCAEWEMGGLPWWLLKKKDIRLREPD-PYFMERVKLFERKVGEQLASLTIQNGGPIIMV 179
Query: 92 KIENEY----------QTIEPAFHEKGPPYVL-----WAAKMAVDFHTGVPWVMCKQDDA 136
++ENEY I + G V WA+ + + W M
Sbjct: 180 QVENEYGSYGENKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTM------ 233
Query: 137 PGPVINACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
N G + F+ G PN P + +E W+ ++ WG + R A+ + +
Sbjct: 234 -----NFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKAMVEGIDE 288
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLK 247
++K S+ + YM HGGT+FG A A +T Y AP++EYG PK+ L+
Sbjct: 289 MLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKYWELR 345
>gi|242078611|ref|XP_002444074.1| hypothetical protein SORBIDRAFT_07g006936 [Sorghum bicolor]
gi|241940424|gb|EES13569.1| hypothetical protein SORBIDRAFT_07g006936 [Sorghum bicolor]
Length = 147
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 88/153 (57%), Gaps = 8/153 (5%)
Query: 594 YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
YHVP FL+P N +VL E+ G+P I+ R VC V+ H + SW +Q
Sbjct: 1 YHVPCLFLQPGSNDIVLFEQFGGDPSKISFVIRQTRSVCAQVSEEHPAQIDSWNSSQQ-- 58
Query: 654 DTDIKKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERAC 712
++++ +P ++ CP G+ IS I FASFG P G C Y+ G C S+ + VV+ AC
Sbjct: 59 --TMQRY--RPELRLECPKDGQVISSIKFASFGTPSGTCGSYSHGECSSTQAISVVQEAC 114
Query: 713 IGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
IG S CS+P+ S YF G+P G+ K+L V+A C
Sbjct: 115 IGVSNCSVPVSSNYF-GNPWTGVTKSLAVEAAC 146
>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
Length = 595
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 128/310 (41%), Gaps = 60/310 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ + TYV WN HE G F G D+ RFI+ Q +GL V +R GP
Sbjct: 37 WADRLRRLAALGLNAVDTYVPWNFHERTAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGP 96
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL G+ R+ + PY +
Sbjct: 97 YICAEWDNGGLPAWLTGTPGMRLRTSHGPYLEAVDRWFDALVPRIAELQAGRGGPVVAVQ 156
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY + I A +G +L+ A G +M PG +
Sbjct: 157 IENEYGSYGDDRAYVRHIRDALVARGITELLYTAD-------GPTPLMQDGGALPGELAA 209
Query: 143 ACNGMRC--GETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A G R P +P E W ++ WG K ++R A A + + + G
Sbjct: 210 ATFGSRPDRAAALLRSRRPAEPFFCAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGG 269
Query: 201 SYVNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
S V+ YM HGGTNFG A A +T Y AP+ E G + PK+ L++ A
Sbjct: 270 S-VSLYMAHGGTNFGLWAGANHEGGTIRPTVTSYDSDAPIAENGAL-TPKFFALRDRLTA 327
Query: 253 IKLCS--RPL 260
+ + RPL
Sbjct: 328 LGTVAARRPL 337
>gi|28199702|ref|NP_780016.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182682446|ref|YP_001830606.1| beta-galactosidase [Xylella fastidiosa M23]
gi|386083781|ref|YP_006000063.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|417557800|ref|ZP_12208811.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
gi|28057823|gb|AAO29665.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182632556|gb|ACB93332.1| Beta-galactosidase [Xylella fastidiosa M23]
gi|307578728|gb|ADN62697.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|338179583|gb|EGO82518.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
Length = 612
Score = 111 bits (278), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 124/275 (45%), Gaps = 54/275 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL E ++GQ+DF+G NDI F++E SQGL V LR GP
Sbjct: 59 WKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDIGAFVREAASQGLNVILRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GG P WL + RS + + +
Sbjct: 119 YVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEALGTQVRPLLNGNGGPIIAVQ 178
Query: 93 IENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
+ENEY Q + F + G +L+ A A G +P V+ + APG
Sbjct: 179 VENEYGSYGDDHGYLQAVRALFIKAGLGGALLFTADGAQMLGNGTLPDVLAAVNVAPGEA 238
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A + + TF P +P + E W ++ W GKP+ ++ ++ + G
Sbjct: 239 KQALDKL---ATFH----PGQPQLVGEYWAGWFDQW-GKPHAQTDAKQQADEIEWMLRQG 290
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+N YM+ GGT+FG FM + P D Y
Sbjct: 291 HSINLYMFVGGTSFG-----FMNGANFQGGPSDHY 320
>gi|444724418|gb|ELW65022.1| Beta-galactosidase-1-like protein 2 [Tupaia chinensis]
Length = 656
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/280 (28%), Positives = 126/280 (45%), Gaps = 51/280 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G++ + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 94 WRDRLLKMKACGMNTLTTYVPWNLHEPERGKFDFSGNLDLEAFILLAAELGLWVILRPGP 153
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ SE GGLP WL G+ R+ K + +
Sbjct: 154 YVCSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 213
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDD--------APGPV---- 140
+ENEY + + K P Y+ + K D G+ ++ D+ PG +
Sbjct: 214 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGVVPGALATIN 266
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + + ++ TF +P + E WT ++ WGG +I + ++ V+ + G
Sbjct: 267 LQSQHELQLLNTFLVNAQVVQPKMVMEYWTGWFDSWGGPHHILDSSEVLKTVSALV-DAG 325
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVRE 240
S +N YM+HGGTNFG A Y A + YG V +
Sbjct: 326 SSINLYMFHGGTNFGFMNGAMHFHDY--SADVTSYGDVAD 363
>gi|336404675|ref|ZP_08585368.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
gi|335941579|gb|EGN03432.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
Length = 778
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 125/285 (43%), Gaps = 52/285 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F K Q G+YV +R GP
Sbjct: 60 WSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVDKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVRE 240
+ + YM HGGT FG A + M + Y AP+ E G E
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE 338
>gi|148231352|ref|NP_001080304.1| galactosidase, beta 1-like 2 [Xenopus laevis]
gi|28422231|gb|AAH46858.1| Loc89944-prov protein [Xenopus laevis]
Length = 634
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 131/294 (44%), Gaps = 55/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G++ + TYV WNLHEP+KG++DFS DI F+ GL+V LR GP
Sbjct: 75 WRDRMKKMKACGINTLTTYVPWNLHEPRKGKFDFSKDLDISEFLAIASEMGLWVILRPGP 134
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL + R+ + + +
Sbjct: 135 YICAEWDLGGLPSWLLRDKDMKLRTTYRGFTEATEAYLDELIPRIAKYQYSNGGPIIAVQ 194
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG-------PVINACN 145
+ENEY + + K Y+ + V+ G+ ++ D+ G V+ N
Sbjct: 195 VENEYGS-----YAKDANYMEFIKNALVE--KGIVELLLTSDNKDGLSSGSLENVLATVN 247
Query: 146 GMRCGET-FKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+ F NS NKP + E WT ++ WGGK +I ++ V+ + + G+
Sbjct: 248 FQKIEPVLFSYLNSIQSNKPVMVMEFWTGWFDYWGGKHHIFDVDEMISTVSEVLNR-GAS 306
Query: 203 VNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+N YM+HGGTNFG A IT Y APL E G K+ L+EL
Sbjct: 307 INLYMFHGGTNFGFMNGALHFHEYRPDITSYDYDAPLTEAGDYTS-KYFKLREL 359
>gi|313231409|emb|CBY08524.1| unnamed protein product [Oikopleura dioica]
Length = 493
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 117/260 (45%), Gaps = 42/260 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++ YV WNLHEP G+++FSG D++RFI+ GL+V R GP
Sbjct: 87 WLDRLTKLKYAGLNTVELYVSWNLHEPYSGEFNFSGDLDVVRFIEMAGELGLHVLFRPGP 146
Query: 62 FIESEWTYGGLPIW-LHD--------------------------VAGIVFRSDNK--PYK 92
+I +EW +GG P W LHD V +++R+ +
Sbjct: 147 YICAEWEWGGHPYWLLHDTDMKVRTTYPGYLEAVEKFYSELFGRVNHLMYRNGGPIIAVQ 206
Query: 93 IENEYQTIEPAFH--EKGPPYVLWAAKMAVD-------FHTGVPWVMCKQDDAPGPV-IN 142
IENEY AF P ++ W + D F + W K + P +N
Sbjct: 207 IENEYAGFADAFEIGPLDPGFLTWLRQTIKDQQCEELLFTSDGGWDFYKYELEGDPYGLN 266
Query: 143 ACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ +R N P KP + E W+ ++ WG +A ++ +++N
Sbjct: 267 FDDVLRANYWLNILENNQPGKPKMVMEWWSGWFDFWGYHHQGTTADSFEENLRAILSQNA 326
Query: 201 SYVNYYMYHGGTNFGRTAAA 220
S VNYYM+HGGTNFG A
Sbjct: 327 S-VNYYMFHGGTNFGYMNGA 345
>gi|38699441|gb|AAR27061.1| beta-galactosidase 1 [Ficus carica]
Length = 176
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 84/155 (54%), Gaps = 10/155 (6%)
Query: 403 LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGA 462
L V S GH L FVNG+ TG A+GS D+ T + LR G N ALLSV VGLP+ G
Sbjct: 24 LTVYSAGHALLVFVNGQLTGKAYGSLDSPKLTFTQNIKLRVGVNKLALLSVAVGLPNVGL 83
Query: 463 FLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSP 516
E AGV + + W Y+ GL GE L + S G + V W+
Sbjct: 84 HFETWNAGVLGPVTLKGLNSGTWDMSKWKWSYKTGLEGEDLSLQS--GSSSVQWAQGSFF 141
Query: 517 TRQ--LTWYKTTFRAPAGNDPIALNLQSMGKGEAW 549
T+Q LTWY TTF AP GN P+AL++ SMGKG+ W
Sbjct: 142 TKQQPLTWYTTTFNAPGGNGPLALDMNSMGKGQIW 176
>gi|295086466|emb|CBK67989.1| Beta-galactosidase [Bacteroides xylanisolvens XB1A]
Length = 778
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 125/285 (43%), Gaps = 52/285 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F K Q G+YV +R GP
Sbjct: 60 WSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVDKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVRE 240
+ + YM HGGT FG A + M + Y AP+ E G E
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE 338
>gi|423301385|ref|ZP_17279409.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
gi|408471986|gb|EKJ90515.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
Length = 779
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 142/334 (42%), Gaps = 65/334 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 61 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 121 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 180
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 181 VENEYGS-----YGINKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 235
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 236 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 295
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYG-------LVRE------PK 242
+ + YM HGGT FG A + M + Y AP+ E G L+R+ P
Sbjct: 296 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTEKYFLLRDLLKNYLPA 354
Query: 243 WGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEA 276
L E+ AA+ + P T+ L EA
Sbjct: 355 GAALPEVPAALPVMEIPEFHFTKVAPLFSNLPEA 388
>gi|29349062|ref|NP_812565.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383124327|ref|ZP_09944991.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
gi|29340969|gb|AAO78759.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839176|gb|EES67260.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
Length = 778
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 128/294 (43%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 60 WDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCKQD--------DAPGPVINA 143
+ENEY + + PYV + + T VP C D IN
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINF 234
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G E K+ L++L
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYYLLRDL 346
>gi|380693434|ref|ZP_09858293.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 778
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 128/294 (43%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 60 WDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCKQD--------DAPGPVINA 143
+ENEY + + PYV + + T VP C D IN
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINF 234
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G E K+ L++L
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 346
>gi|426371167|ref|XP_004052524.1| PREDICTED: beta-galactosidase-1-like protein 2 [Gorilla gorilla
gorilla]
Length = 678
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 118/266 (44%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 120 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 179
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 180 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 239
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN 142
+ENEY + + K P Y+ + K D G+ ++ D+ G IN
Sbjct: 240 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 292
Query: 143 --ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 293 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 351
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 352 SSINLYMFHGGTNFGFMNGAMHFHDY 377
>gi|255692586|ref|ZP_05416261.1| beta-galactosidase [Bacteroides finegoldii DSM 17565]
gi|260621643|gb|EEX44514.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 779
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/334 (27%), Positives = 139/334 (41%), Gaps = 65/334 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 61 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 121 YVCAEWEMGGLPWWLLKKRDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 180
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 181 VENEYGS-----YGINKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 235
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 236 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 295
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVRE-------------PK 242
+ + YM HGGT FG A + M + Y AP+ E G E P
Sbjct: 296 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTEKYFLLRDLLKNYLPA 354
Query: 243 WGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEA 276
L E+ AA+ + P T+ L EA
Sbjct: 355 GAALPEVPAALPVIEIPEFHFTKVAPLFSNLPEA 388
>gi|402895882|ref|XP_003911041.1| PREDICTED: beta-galactosidase-1-like protein 2 [Papio anubis]
Length = 636
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 115/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G G
Sbjct: 198 VENEYGS-----YNKDPAYMAYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 250
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTRELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY 335
>gi|413954159|gb|AFW86808.1| putative RAN GTPase activating family protein [Zea mays]
Length = 449
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/235 (33%), Positives = 117/235 (49%), Gaps = 9/235 (3%)
Query: 236 GLVREPKWGHLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNND- 294
G +R+PK+GHLK+LH I+ + L+ G N S G+ A V + T G + +NN
Sbjct: 200 GNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYGK--NAIVTKYTYGGSSVCFINNQF 257
Query: 295 ERKAVTVLFRNISYELPRKSISILPDCKTVAFNTERVSTQYNKRSKTSNLKFDSDE--KW 352
+ V V ++ +P S+SILPDCKTVA+NT ++ TQ + K +N E +W
Sbjct: 258 VDRDVKVTLGGGTHLVPAWSVSILPDCKTVAYNTAKIKTQTSVMVKKANSVEKELEALRW 317
Query: 353 E---EYREAILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSNAQAPLDVQSHG 409
E + + R LL+QI+ + D SDY WY + + L V + G
Sbjct: 318 SWMPENLKPFMTDHRDSFRQSQLLEQIATSTDQSDYLWYRTSLEHKGEGSYT-LYVNTSG 376
Query: 410 HILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFL 464
H ++ FVNG G + + F L++ V L G N +LLS TVGL + +
Sbjct: 377 HEMYVFVNGRLVGQNYSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKSAKTLV 431
>gi|298386767|ref|ZP_06996322.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|298260441|gb|EFI03310.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 778
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 128/294 (43%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 60 WDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCKQD--------DAPGPVINA 143
+ENEY + + PYV + + T VP C D IN
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTRNALDDLIWTINF 234
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKEMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G E K+ L++L
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYYLLRDL 346
>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
Length = 778
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 126/299 (42%), Gaps = 67/299 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE Q+G++DF+G ND+ F + Q GLYV +R GP
Sbjct: 52 WEHRIKMCKALGMNTVCLYVFWNIHEQQEGKFDFTGNNDVAEFCRLAQRNGLYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R + PY
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIRLREPD-PYFMERVKLFERKVGEQLASLTIQNGGPIIMV 170
Query: 92 KIENEY----------QTIEPAFHEKGPPYVL-----WAAKMAVDFHTGVPWVMCKQDDA 136
++ENEY I G V WA+ + + W M
Sbjct: 171 QVENEYGSYGKNKAYVSAIRDIVRRSGFDKVTLFQCDWASNFEKNGLDDLVWTM------ 224
Query: 137 PGPVINACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
N G + F+ G PN P + +E W+ ++ WG + R A+ + +
Sbjct: 225 -----NFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKAMVEGIDE 279
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLK 247
++K S+ + YM HGGT+FG A A +T Y AP++EYG PK+ L+
Sbjct: 280 MLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKYWELR 336
>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
Length = 823
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 124/288 (43%), Gaps = 43/288 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE Q+G++DF+G ND+ F + Q G+YV +R GP
Sbjct: 100 WEQRIKMCKSLGMNTVCLYVFWNIHEQQEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGP 159
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN-------KPYKIENEYQTIEPAFHEKGPPYVLW 114
++ +EW GGLP WL I R D+ K ++ E Q GP ++
Sbjct: 160 YVCAEWEMGGLPWWLLKKKDIRLREDDPYFMARVKAFEAEVGRQLAPLTIQNGGPIIMVQ 219
Query: 115 AAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS---------------- 158
+ +V +D + +C N+
Sbjct: 220 VENEYGSYGVNKKYVSQIRDIVKASGFDKVTLFQCDWASNFENNGLDDLVWTMNFGTGSN 279
Query: 159 ------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
P+ P + +E W+ ++ WG + R A+ + + ++KN S+ + Y
Sbjct: 280 IDAQFKRLKQLRPDAPLMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLSKNISF-SLY 338
Query: 207 MYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKE 248
M HGGT+FG A A +T Y AP++EYG PK+ L++
Sbjct: 339 MTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGHA-TPKFWELRK 385
>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 133/298 (44%), Gaps = 64/298 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+ GL+ + YVFWN HE Q G++DFSG+ DI FI+ Q +GLYV LR GP
Sbjct: 63 WRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + +RS + + +
Sbjct: 123 YVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSPLTINNGGNIIMVQ 182
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + +KG Y+ M + VP C D G V +
Sbjct: 183 VENEYGSYAA---DKG--YLAAIRDMIKEAGFNVPLFTC---DGGGQVEAGHTEGALPTL 234
Query: 145 NGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + K P E + +++ WG + Y R A+ + + ++
Sbjct: 235 NGVFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLS----- 289
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYGLVREPKWGHLKEL 249
+G V+ YM+HGGTNF T A GY Q APL E+G PK+ +E+
Sbjct: 290 HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNCY-PKYHAFREV 346
>gi|160885481|ref|ZP_02066484.1| hypothetical protein BACOVA_03481 [Bacteroides ovatus ATCC 8483]
gi|423290348|ref|ZP_17269197.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
gi|156109103|gb|EDO10848.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
gi|392665735|gb|EIY59258.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
Length = 778
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 125/285 (43%), Gaps = 52/285 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVRE 240
+ + YM HGGT FG A + M + Y AP+ E G E
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE 338
>gi|297727459|ref|NP_001176093.1| Os10g0340600 [Oryza sativa Japonica Group]
gi|255679317|dbj|BAH94821.1| Os10g0340600 [Oryza sativa Japonica Group]
Length = 143
Score = 111 bits (277), Expect = 2e-21, Method: Composition-based stats.
Identities = 45/78 (57%), Positives = 59/78 (75%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP LI KAKEGGL+ I+TYVFWN HEP++ +++F G D++RF KEIQ+ G+Y LRIG
Sbjct: 61 MWPDLIKKAKEGGLNAIETYVFWNGHEPRRREFNFEGNYDVVRFFKEIQNAGMYAILRIG 120
Query: 61 PFIESEWTYGGLPIWLHD 78
P+I EW YG +P+ D
Sbjct: 121 PYICGEWNYGYMPMLYLD 138
>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 777
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 133/298 (44%), Gaps = 64/298 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+ GL+ + YVFWN HE Q G++DFSG+ DI FI+ Q +GLYV LR GP
Sbjct: 63 WRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + +RS + + +
Sbjct: 123 YVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSPLTINNGGNIIMVQ 182
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + +KG Y+ M + VP C D G V +
Sbjct: 183 VENEYGSYAA---DKG--YLAAIRDMIKEAGFNVPLFTC---DGGGQVEAGHTEGALPTL 234
Query: 145 NGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + K P E + +++ WG + Y R A+ + + ++
Sbjct: 235 NGVFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLS----- 289
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYGLVREPKWGHLKEL 249
+G V+ YM+HGGTNF T A GY Q APL E+G PK+ +E+
Sbjct: 290 HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNCY-PKYHAFREV 346
>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
Length = 592
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 135/306 (44%), Gaps = 50/306 (16%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + + GL+ ++TYV WN HE +G+ DF+G D+ RFI GL V +R G
Sbjct: 40 LWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLARFISLAGDLGLDVIVRPG 99
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQ----TIEPAFHEKGPPYVLW 114
P+I +EW +GGLP WL GI R+ + + +++ + I P G P V
Sbjct: 100 PYICAEWDFGGLPAWLMTEPGIALRTSDPAFLAAVDDWFDAVVPVIRPLLTTAGGPVV-- 157
Query: 115 AAKMAVDFHT------------------GVPWVMCKQDDAPGP----------VINACN- 145
A ++ ++ + G+ V+ D PGP V+ N
Sbjct: 158 AVQVENEYGSYGDDAAYLEHCRKGLLDRGID-VLLFTSDGPGPDWLDNGTIPGVLATVNF 216
Query: 146 GMRCGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G R E F P P + E W ++ WG ++R D A V + + G V
Sbjct: 217 GSRTDEAFAELRKVQPAGPDMVMEYWNGWFDHWGEPHHVRDVDDAA-GVLDDVLRAGGSV 275
Query: 204 NYYMYHGGTNFGRTAAAFM--------ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
N+YM HGGTNFG + A + +T Y A + E G + PK+ +E+ + +
Sbjct: 276 NFYMAHGGTNFGLWSGANVEDGKLQPTVTSYDYDAAVGEAGEL-TPKFHAFREVISRYAV 334
Query: 256 CSRPLL 261
+ P L
Sbjct: 335 TALPEL 340
>gi|22760570|dbj|BAC11247.1| unnamed protein product [Homo sapiens]
Length = 636
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 118/266 (44%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN 142
+ENEY + + K P Y+ + K D G+ ++ D+ G IN
Sbjct: 198 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 250
Query: 143 --ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY 335
>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
Length = 775
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 131/298 (43%), Gaps = 64/298 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+ GL+ + YVFWN HE Q G++DFSG+ DI FI+ Q +GLYV LR GP
Sbjct: 61 WRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + +RS + + +
Sbjct: 121 YVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSPLTINNGGNIIMVQ 180
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M + VP C D G V +
Sbjct: 181 VENEYGS-----YAADKEYLAAIRDMIKEAGFNVPLFTC---DGGGQVEAGHVEGALPTL 232
Query: 145 NGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + K P E + +++ WG + Y R A+ + + ++
Sbjct: 233 NGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLS----- 287
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYGLVREPKWGHLKEL 249
+G V+ YM+HGGTNF T A GY Q APL E+G PK+ +E+
Sbjct: 288 HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNCY-PKYHAFREV 344
>gi|31543093|ref|NP_612351.2| beta-galactosidase-1-like protein 2 precursor [Homo sapiens]
gi|74728154|sp|Q8IW92.1|GLBL2_HUMAN RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|26251705|gb|AAH40641.1| Galactosidase, beta 1-like 2 [Homo sapiens]
gi|119588247|gb|EAW67843.1| hypothetical protein BC008326, isoform CRA_b [Homo sapiens]
gi|119588248|gb|EAW67844.1| hypothetical protein BC008326, isoform CRA_b [Homo sapiens]
Length = 636
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 118/266 (44%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN 142
+ENEY + + K P Y+ + K D G+ ++ D+ G IN
Sbjct: 198 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 250
Query: 143 --ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY 335
>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
Length = 655
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 113/259 (43%), Gaps = 46/259 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WN HE +G +DFSG D+ RFI+ Q GLYV LR GP
Sbjct: 35 WRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFSGILDLRRFIQIAQDVGLYVLLRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW +GGLP WL + R+ PY +
Sbjct: 95 YICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVDAYLAKILPLVNDLQMSKGGPIIAVQ 154
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPVI 141
+ENEY + ++ F + G +L+ + G +P V+ +
Sbjct: 155 LENEYGSYGDDLDYKLFLKNQFIKYGIEELLFTSDNGTGIQNGPIPGVLATTN-----FQ 209
Query: 142 NACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G E + P P + E W+ ++ WG + + + V +I GS
Sbjct: 210 EQEQGYLMFEYLRNIKQPGLPMMVMEFWSGWFDHWGEQHNLCHHAEF-IDVFKWILLEGS 268
Query: 202 YVNYYMYHGGTNFGRTAAA 220
VN+YM+HGGTNFG A A
Sbjct: 269 SVNFYMFHGGTNFGFMAGA 287
>gi|37182117|gb|AAQ88861.1| HYDRL-14 [Homo sapiens]
Length = 636
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 118/266 (44%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN 142
+ENEY + + K P Y+ + K D G+ ++ D+ G IN
Sbjct: 198 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 250
Query: 143 --ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY 335
>gi|348508360|ref|XP_003441722.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 648
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 124/283 (43%), Gaps = 56/283 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G + F + D+ +++ S GL+V LR GP
Sbjct: 88 WEDRLLKMKACGLNTLTTYVPWNLHEPERGVFKFDDQLDLEAYLRLAASLGLWVILRPGP 147
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL + R+ + +
Sbjct: 148 YICAEWDLGGLPSWLLRDPQMKLRTTYSGFTYAVNSFFDEVIKKAVPHQYSKGGPIIAVQ 207
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + A E P++ A G+ ++ D+ G + G
Sbjct: 208 VENEYGSY--ATDENYMPFIKEAL-----LSRGITELLLTSDNKDGLKLGGVKGALETIN 260
Query: 153 FKGPN----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
F+ + P +P + E W+ ++ +WGG ++ +A+++ V I K
Sbjct: 261 FQKLDPDEIKYLEQIQPQQPKMVMEYWSGWFDLWGGLHHVYTAEEM-IPVVTEILKLDMS 319
Query: 203 VNYYMYHGGTNFGRTAAAF---------MITGYYDQAPLDEYG 236
+N YM+HGGTNFG + AF M+T Y APL E G
Sbjct: 320 INLYMFHGGTNFGFMSGAFAVGLPAPKPMVTSYDYDAPLSEAG 362
>gi|403304858|ref|XP_003942999.1| PREDICTED: beta-galactosidase-1-like protein 2 [Saimiri boliviensis
boliviensis]
Length = 636
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 116/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFILMASEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G +G
Sbjct: 198 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVHGVLATIN 250
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY 335
>gi|380512533|ref|ZP_09855940.1| beta-galactosidase [Xanthomonas sacchari NCPPB 4393]
Length = 616
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 123/281 (43%), Gaps = 66/281 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EP+ GQ+DFSG NDI F+ E +QGL V LR GP
Sbjct: 65 WKDRLQKARAMGLNTVETYVFWNLVEPRPGQFDFSGNNDIAAFVDEAAAQGLNVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN------------------KP-----------YK 92
++ +EW GG P WL G+ RS + KP +
Sbjct: 125 YVCAEWEAGGYPAWLFAEPGMRVRSQDPRFLAASQAYLDALAAQVKPRLNGNGGPIVAVQ 184
Query: 93 IENEYQT---------------IEPAFHEKGPPYVLWAAKMAVDFHTG-VPWVMCKQDDA 136
+ENEY + ++ F + +L+ A G +P + + A
Sbjct: 185 VENEYGSYGDDHAYMRLNRAMFVQAGFDKA----LLFTADGPDVLANGTLPDTLAVVNFA 240
Query: 137 PGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF- 195
PG +A N F+ P +P + E W ++ WG K +A D + F
Sbjct: 241 PG---DAKNAFETLAKFR----PGQPQMVGEYWAGWFDQWGEK---HAATDATKQASEFE 290
Query: 196 -IAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
I + G N YM+ GGT+FG FM + + P D Y
Sbjct: 291 WILRQGHSANIYMFVGGTSFG-----FMNGANFQKNPSDHY 326
>gi|423215069|ref|ZP_17201597.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692332|gb|EIY85570.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
Length = 778
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 125/285 (43%), Gaps = 52/285 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DF+G+NDI F K Q G+YV +R GP
Sbjct: 60 WSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFAGQNDIAAFCKLAQQHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL + R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVGIFMKEVGKQLAPLQVDKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGTDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVRE 240
+ + YM HGGT FG A + M + Y AP+ E G E
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE 338
>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
Length = 587
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 132/316 (41%), Gaps = 58/316 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WN+HE +KG Y F+G DI FI+ QS L+V +R P
Sbjct: 35 WKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIELAQSLELFVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL G+ R+ KP+ +
Sbjct: 95 YICAEWEFGGLPAWLLKDPGMKVRTVYKPFMKHVKEYFEVLFKILAPLQIDQDGPIILMQ 154
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------------QDDAPGPV 140
IENEY ++ Y+ K+ DF T VP V D P
Sbjct: 155 IENEY-----GYYGNDKEYLSTLLKIMRDFGTTVPVVTSDGPWGEALDAGSLLADVSLPT 209
Query: 141 INACNGMRCG-ETFKGPNSPNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAK 198
+N G + E FK NKP + E W ++ WG + + R A D A + +
Sbjct: 210 MNFGTGAKEHIENFK-EKYVNKPVMCMEFWVGWFDAWGDDRHHTRDASDAANELRDIL-- 266
Query: 199 NGSYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHA 251
N VN YM+HGGTNFG A +T Y A L E G + E + K +
Sbjct: 267 NEGSVNIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILTECGDLTEKYYEFKKVISE 326
Query: 252 AIKLCSRPLLTGTQNV 267
++ LL T +
Sbjct: 327 FTEIKEVELLPQTHKI 342
>gi|374312360|ref|YP_005058790.1| glycoside hydrolase family protein [Granulicella mallensis
MP5ACTX8]
gi|358754370|gb|AEU37760.1| glycoside hydrolase family 35 [Granulicella mallensis MP5ACTX8]
Length = 627
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 123/291 (42%), Gaps = 46/291 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA GL+ I YVFWN+HEP YDFSG+ND+ F++E Q +GLYV LR GP
Sbjct: 70 WRDRLRKAHAMGLNAITIYVFWNIHEPTPEVYDFSGQNDVAEFVREAQQEGLYVILRPGP 129
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY------QTIEPAFHEKGPP----- 110
++ +EW GG P WL + RS +K Q + P +G P
Sbjct: 130 YVCAEWDLGGYPAWLLKDHEMKLRSLQPEFKAAATRWMLRLGQELTPLQASRGGPILAVQ 189
Query: 111 -------------YVLWAAKMAVD-------FHTGVPWVMCKQDDAP----GPVINACNG 146
Y+ W ++ + +TG + KQ P G +
Sbjct: 190 VENEYGSFGDDHEYMKWVHELVLQAGFGGSLLYTGDGADVLKQGTLPSVFAGIDFGTGDA 249
Query: 147 MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
R + +K P P E W ++ WG K + A + + + G ++ Y
Sbjct: 250 ARSIKLYKA-FRPQTPVYVAEYWDGWFDHWGEKHQLTDAAKQETEIRSML-EQGDSISLY 307
Query: 207 MYHGGTNFGRTAAAFM--------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
M HGGT+FG A ++ Y APLDE G R PK+ L+ +
Sbjct: 308 MVHGGTSFGWMNGANNDHDGYQPDVSSYDYDAPLDESGRPR-PKYFRLRNI 357
>gi|393780989|ref|ZP_10369190.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
gi|392677324|gb|EIY70741.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
CL02T12C01]
Length = 776
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 127/294 (43%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE ++GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 58 WEHRIKMCKALGMNTICLYVFWNIHEQEEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 118 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKKVGEQLVPLQITRGGNIIMVQ 177
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCKQD--------DAPGPVINA 143
+ENEY + + PYV M T VP C D +N
Sbjct: 178 VENEYGS-----YGTDKPYVSAIRDMVRGAGFTEVPLFQCDWSSNFTNNALDDLLWTVNF 232
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 233 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGLKDMLDRNIS 292
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G E K+ L++L
Sbjct: 293 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTE-KYFLLRDL 344
Score = 39.3 bits (90), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+YK TF+ +D L++ + GKG WVNG ++GR+W
Sbjct: 530 YYKATFKLSKTDDTF-LDMSTWGKGMVWVNGHAMGRFW 566
>gi|281337336|gb|EFB12920.1| hypothetical protein PANDA_005061 [Ailuropoda melanoleuca]
Length = 655
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 77/251 (30%), Positives = 113/251 (45%), Gaps = 33/251 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFS D+ F+ GL+V LR GP
Sbjct: 100 WRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENLDLEAFVLMAAEIGLWVILRPGP 159
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLWA 115
+I SE GGLP WL ++ R+ K + ++ + P + KG P +
Sbjct: 160 YICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYFDHLISRVVPLQYHKGGPIIAVQ 219
Query: 116 AK-----MAVD-----------FHTGVPWVMCKQDDAPGPVINACNGMRCG---ETFKGP 156
+ AVD G+ ++ DDA G+ TF+
Sbjct: 220 VENEYGSFAVDKDYMPYVRKALLERGIVELLVTSDDAENLQKGYLEGVLATINMNTFEKS 279
Query: 157 N-------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
NKP + E W ++ WGGK + +A+D+ V+ FI S+ N YM+H
Sbjct: 280 AFEQLSQLQRNKPIMVMEYWVGWFDTWGGKHMVNNAEDVEETVSKFITSEISF-NVYMFH 338
Query: 210 GGTNFGRTAAA 220
GGTNFG A
Sbjct: 339 GGTNFGFMNGA 349
>gi|221129758|ref|XP_002162955.1| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
Length = 620
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 121/286 (42%), Gaps = 57/286 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KAK GL+ IQ+YV WN+HE +G YDF+ DII FI Q L V LR GP
Sbjct: 58 WNDSMKKAKSMGLNTIQSYVAWNIHEINEGHYDFNDDKDIINFINLAQQNDLLVILRPGP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I++EW +GG P W+ + S +K Y +
Sbjct: 118 YIDAEWEFGGFPWWMAKSNMTMRTSGDKSYMKYVSNWFSILLPMINQYLYKNGGPIIAVQ 177
Query: 93 IENEYQTIEPAFHEK------------GPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV 140
+ENEY HE G VL+ D ++ C +
Sbjct: 178 VENEYGNYYACDHEYMKELKNLFQLHLGNDVVLFTTDGYTD-----DYLKCGTIPSLFTT 232
Query: 141 INACNGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
I+ + E FK + K P + +E +T + WG R+A++IA H+ +
Sbjct: 233 IDFGTEISAVEAFKLLRNHQKKGPLVNSEFYTGWLDYWGKNHQKRNARNIALHLDEILKL 292
Query: 199 NGSYVNYYMYHGGTNFGRTAAA------FMI--TGYYDQAPLDEYG 236
N S VN YM+ GGTNFG A F+I T Y AP+ E G
Sbjct: 293 NAS-VNLYMFQGGTNFGYMNGADMSDGQFLISPTSYDYDAPISEAG 337
>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
Length = 606
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 125/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A+ GL+ + TYV WN HE G F G D+ RF++ Q GL V +R GP
Sbjct: 48 WADRLARLAALGLNTVDTYVPWNFHERTPGDVRFDGWRDLDRFVRLAQETGLDVIVRPGP 107
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL G+ R+ + P+ +
Sbjct: 108 YICAEWDNGGLPAWLTGTPGMRPRTSHPPFLAAVARWFDQLIPRIAALQAGRGGPVVAVQ 167
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY + + A +G +L+ A + +M G +
Sbjct: 168 IENEYGSYGDDGDYVRWVRDALTARGVTELLYTADGPTE-------LMLDAGAVEGELAA 220
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A G R + + S P +P E W ++ WG + ++R A+ A V + G
Sbjct: 221 ATFGSRPEQAARLLRSRRPEEPFFCAEFWNGWFDHWGEQHHVRPARSAADDVGRILGAGG 280
Query: 201 SYVNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
S ++ YM HGGTNFG A A +T Y AP+ E+G + E + EL AA
Sbjct: 281 S-LSLYMAHGGTNFGLWAGANHDGDRLQPTVTSYDSDAPVAEHGALTEKFFALRDELTAA 339
>gi|326933328|ref|XP_003212758.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Meleagris
gallopavo]
Length = 656
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 136/318 (42%), Gaps = 55/318 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHE +G++DFS D+ F+ GL+V LR GP
Sbjct: 97 WEDRMLKMKACGLNTLTTYVPWNLHEQTRGKFDFSENLDLEAFLSLAAKNGLWVILRPGP 156
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW GGLP WL + R+ K + +
Sbjct: 157 YICSEWDLGGLPSWLLQDPEMQLRTTYKGFTEAVDAYFDHLMPIVVPLQYKRGGPIIAVQ 216
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + K P Y+ + KMA+ G+ ++ D+ G G
Sbjct: 217 VENEYGS-----YAKDPNYMAYV-KMAL-LSRGIVELLMTSDNKNGLSFGLVEGALATVN 269
Query: 153 FKGPN----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
F+ ++P + E WT ++ WGG Y+ A ++ VA I K G+
Sbjct: 270 FQKLEPGVLKYLDTVQRDQPKMVMEYWTGWFDNWGGPHYVFDADEMVNTVA-SILKLGAS 328
Query: 203 VNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
+N YM+HGGTNFG A +T Y A L E G K+ L++L + I
Sbjct: 329 INLYMFHGGTNFGFMNGALKTDEYKSDVTSYDYDAVLTEAGDYTS-KFFKLRQLFSTIIG 387
Query: 256 CSRPLLTGTQNVISLGQL 273
PL ++ S G +
Sbjct: 388 QPLPLPPMIESKASYGAI 405
>gi|296399387|gb|ADH10509.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 121/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD IQTYV WN HEP+ G YDF G D+ F++ GL V LR GP
Sbjct: 40 WKDRLLKMKMAGLDAIQTYVPWNYHEPRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGP 99
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y E + P ++ G P ++
Sbjct: 100 YICAEWDMGGLPAWLLEKKSIVLRSSDSDYLEAVERWMGVLLPKMRPYLYQNGGPIIMVQ 159
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPG------------PVINACN 145
+ + A D+ H G V+ D A ++
Sbjct: 160 VENEYGSYFACDYDYLRFLLKLFRLHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAP 219
Query: 146 GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G F S P P + +E +T + WG + + A+ +A + +A+ G+ V
Sbjct: 220 GGNVTAAFLAQRSSEPMGPLVNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILAR-GANV 278
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF A M T Y APL E G + E
Sbjct: 279 NLYMFIGGTNFAYWNGANMPYMPQPTSYDYDAPLSEAGDLTE 320
>gi|298204831|emb|CBI25664.3| unnamed protein product [Vitis vinifera]
Length = 118
Score = 110 bits (275), Expect = 3e-21, Method: Composition-based stats.
Identities = 44/91 (48%), Positives = 65/91 (71%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AKEGG+DVI+TYVFWN HE G Y F G D+++F+K +Q G+Y+ LR G
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFWNGHELSPGNYYFGGWYDLLKFVKIVQQDGMYLILRFG 60
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY 91
PF+ +EW + G+ +WLH + G VF ++++P+
Sbjct: 61 PFVVAEWNFSGVLVWLHYMPGTVFWTNSEPF 91
>gi|297835700|ref|XP_002885732.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297331572|gb|EFH61991.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 336
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 120/271 (44%), Gaps = 76/271 (28%)
Query: 358 AILNFDNTLLRAEGLLDQISAAKDASDYFWYTFRFHYNSSN------AQAPLDVQSHGHI 411
+IL+ D+ +L G L ++ KD +DY WYT + + L V GH
Sbjct: 8 SILDGDSLIL---GELYYLT--KDKTDYAWYTTSIKIEDDDIPDQKGQKTILRVAGLGHA 62
Query: 412 LHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGV 471
L +VNGEY +AHGSH+ + DSG+++E AG
Sbjct: 63 LIVYVNGEYASNAHGSHE---------------------------MKDSGSYMEHTYAGP 95
Query: 472 HRVRVQD-KSFT-----NCSWGYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTRQLTWYKT 525
V + KS T N WG+ V Y G KV W + LTWYKT
Sbjct: 96 RGVSIIGLKSGTRDLIENNEWGHLV---------YIEEGSKKVKWEKY-GEHKPLTWYKT 145
Query: 526 TFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFC 585
F P G + +A+ ++ MGKG WV+G +GRYW+SF + G P QT+
Sbjct: 146 YFETPEGENAVAIRMKGMGKGLIWVHGIGVGRYWMSFVSPLGEPIQTE------------ 193
Query: 586 AIIKATNTYHVPRAFLKP--TGNLLVLLEEE 614
YH+PR+F+K ++ V+LEEE
Sbjct: 194 --------YHIPRSFMKEEKKKSMFVILEEE 216
>gi|296399420|gb|ADH10537.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 121/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD IQTYV WN HEP+ G YDF G D+ F++ GL V LR GP
Sbjct: 40 WKDRLLKMKMAGLDAIQTYVPWNYHEPRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGP 99
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y E + P ++ G P ++
Sbjct: 100 YICAEWDMGGLPAWLLEKKSIVLRSSDSDYLEAVERWMGVLLPKMRPYLYQNGGPIIMVQ 159
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPG------------PVINACN 145
+ + A D+ H G V+ D A ++
Sbjct: 160 VENEYGSYFACDYDYLRFLLKLFRLHLGHEVVLFTTDGASQFHLKCGALQGLYATVDFAP 219
Query: 146 GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G F S P P + +E +T + WG + + A+ +A + +A+ G+ V
Sbjct: 220 GGNVTAAFLAQRSSEPMGPLVNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILAR-GANV 278
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF A M T Y APL E G + E
Sbjct: 279 NLYMFIGGTNFAYWNGANMPYMPQPTSYDYDAPLSEAGDLTE 320
>gi|301763006|ref|XP_002916929.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Ailuropoda
melanoleuca]
Length = 1209
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 121/274 (44%), Gaps = 40/274 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFS D+ F+ GL+V LR GP
Sbjct: 521 WRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENLDLEAFVLMAAEIGLWVILRPGP 580
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLWA 115
+I SE GGLP WL ++ R+ K + ++ + P + KG P +
Sbjct: 581 YICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYFDHLISRVVPLQYHKGGPIIAVQ 640
Query: 116 AK-----MAVD-----------FHTGVPWVMCKQDDAPGPVINACNGMRCG---ETFKGP 156
+ AVD G+ ++ DDA G+ TF+
Sbjct: 641 VENEYGSFAVDKDYMPYVRKALLERGIVELLVTSDDAENLQKGYLEGVLATINMNTFEKS 700
Query: 157 N-------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
NKP + E W ++ WGGK + +A+D+ V+ FI S+ N YM+H
Sbjct: 701 AFEQLSQLQRNKPIMVMEYWVGWFDTWGGKHMVNNAEDVEETVSKFITSEISF-NVYMFH 759
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
GGTNFG A ++T Y A L E G
Sbjct: 760 GGTNFGFMNGATYFGIHRAVVTSYDYDALLTEAG 793
>gi|237719727|ref|ZP_04550208.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|229450996|gb|EEO56787.1| beta-galactosidase [Bacteroides sp. 2_2_4]
Length = 778
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 124/281 (44%), Gaps = 52/281 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGIDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYG 236
+ + YM HGGT FG A + M + Y AP+ E G
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|322703307|gb|EFY94918.1| beta-calactosidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 645
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 125/296 (42%), Gaps = 65/296 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + AK GL+ I +YVFWN EP +G +DF GRNDI RF++ Q +GLYV LR GP
Sbjct: 65 WTQRLQMAKAMGLNTIFSYVFWNNIEPTEGSWDFDGRNDIARFLRLAQQEGLYVVLRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I E +GG P WL + G+ R +NKP+ +
Sbjct: 125 YICGEHEWGGFPSWLAQIPGMAVRQNNKPFLDASRNYLEQLGKHLAATHISQGGPVLMTQ 184
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPW-----------------VMCKQDD 135
+ENEY + K Y+ A M G + ++ + D
Sbjct: 185 LENEYGSF-----GKDKAYLRAMADMLKANFDGFLYTNDGGGKSYLDGGSLHGILAETDG 239
Query: 136 APGPVINACNGMRCGETFKGPNSPNKPSI-WTEDWTSF--YQVWGGKPYIRSAQDIAFHV 192
P A + T GP + + W +DW+S YQ G+P + + + +
Sbjct: 240 DPKTGFAARDQYVTDPTMLGPQLDGEYYVTWIDDWSSNSPYQYTSGRP--DATKRVLDDL 297
Query: 193 ALFIAKNGSYVNYYMYHGGTNFGRTAAAFMI--------TGYYDQAPLDEYGLVRE 240
+A N S+ + YM+HGGTN+G + T Y APLDE G E
Sbjct: 298 DWILAGNNSF-SIYMFHGGTNWGFENGGIWVDNRLNAVTTSYDYGAPLDESGRATE 352
Score = 39.3 bits (90), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 51/123 (41%), Gaps = 38/123 (30%)
Query: 521 TWYKTTFRAPAG--ND---PIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYA 575
+YK TF PAG ND L+L + KG WVNG +GRYWV P Q+ Y
Sbjct: 546 VFYKGTFGLPAGVGNDLSGDTFLSLPNGVKGSVWVNGHHLGRYWVV------GPQQSLY- 598
Query: 576 VNTVTSIHFCAIIKATNTYHVPRAFL----KPTGNLLVLLEEENGNPLGITVDTIAIRKV 631
VP A+L KP N +V+LE E G+ +A R+
Sbjct: 599 --------------------VPGAYLYGGNKP--NHVVVLELEPKAGAGMVARGLATREW 636
Query: 632 CGH 634
H
Sbjct: 637 ANH 639
>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
Length = 725
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 84/298 (28%), Positives = 134/298 (44%), Gaps = 64/298 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+ GL+ + YVFWN HE Q G++DF+G+ DI F++ Q +GLYV LR GP
Sbjct: 11 WRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEEGLYVILRPGP 70
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL +++RS + + +
Sbjct: 71 YVCAEWDFGGYPSWLLKEKDMIYRSKDPRFLSYCERYIKELGKQLSSLTINNGGNIIMVQ 130
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M + VP C D G V +
Sbjct: 131 VENEYGS-----YAADKEYLAAIRDMIKEAGFNVPLFTC---DGGGQVEAGHIEGALPTL 182
Query: 145 NGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK ++ +K P E + +++ WG + Y R A+ + + ++
Sbjct: 183 NGVFGEDIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLS----- 237
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYGLVREPKWGHLKEL 249
+G V+ YM+HGGTNF T A GY Q APL E+G PK+ +E+
Sbjct: 238 HGVSVSMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWGNCY-PKYHAFREV 294
>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
Length = 777
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 130/298 (43%), Gaps = 64/298 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A GL+ + YVFWN HE Q G++DFSG+ DI FI+ Q +GLYV LR GP
Sbjct: 63 WRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + +RS + + +
Sbjct: 123 YVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSPLTINNGGNIIMVQ 182
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M + VP C D G V +
Sbjct: 183 VENEYGS-----YAADKEYLAAIRDMIKEAGFNVPLFTC---DGGGQVEAGHVEGALPTL 234
Query: 145 NGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + K P E + +++ WG + Y R A+ + + ++
Sbjct: 235 NGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLS----- 289
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYGLVREPKWGHLKEL 249
+G V+ YM+HGGTNF T A GY Q APL E+G PK+ +E+
Sbjct: 290 HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNCY-PKYHAFREV 346
>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
Length = 591
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 123/276 (44%), Gaps = 42/276 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WNLHEPQ+G++ F G D+ RFI+ GL+V +R P
Sbjct: 35 WEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMADLERFIRLAGRLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVL-- 113
+I +EW +GGLP WL G+ R + Y K++ Y + P G P +L
Sbjct: 95 YICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYDELIPRLVPLLCTSGGPVILVQ 154
Query: 114 -------WAAKMAVDFH---------TGVPW--------VMCKQDDAPGPVINACNGMRC 149
+ + A H VP M + PG + G R
Sbjct: 155 VENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGPTDAMLQGGSLPGVLATVNFGSRT 214
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E+F P P + E W ++ W + + R A D A V + + G+ VN+YM
Sbjct: 215 AESFAKLREYQPQGPLMCMEYWNGWFDHWMEEHHQRDAADAA-RVFGEMLEAGASVNFYM 273
Query: 208 YHGGTNFGRTAAAFMITGY------YD-QAPLDEYG 236
+HGGTNFG A I Y YD +PL E+G
Sbjct: 274 FHGGTNFGFYNGANHIKTYEPTITSYDYDSPLTEWG 309
>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
Length = 612
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 124/275 (45%), Gaps = 54/275 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL E ++GQ+DF+G NDI F++E SQGL V LR GP
Sbjct: 59 WKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTGNNDIGAFVREAASQGLNVILRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GG P WL + RS + + +
Sbjct: 119 YVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQRYLEALGTQVRPLLNSNGGPIIAMQ 178
Query: 93 IENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
+ENEY Q + F + G +L+ + A G +P V+ + APG
Sbjct: 179 VENEYGSYGDDHGYLQAVRALFIKAGLGGALLFTSDGAQMLGNGTLPDVLAAVNVAPGEA 238
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
A + + TF P +P + E W ++ W GKP+ ++ ++ + G
Sbjct: 239 KQALDKL---ATFH----PGQPQLVGEYWAGWFDQW-GKPHAQTDAKQQADEIEWMLRQG 290
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+N YM+ GGT+FG FM + P D Y
Sbjct: 291 HSINLYMFVGGTSFG-----FMNGANFQGGPGDHY 320
>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
Length = 775
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 130/298 (43%), Gaps = 64/298 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A GL+ + YVFWN HE Q G++DFSG+ DI FI+ Q +GLYV LR GP
Sbjct: 61 WRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFSGQADIAEFIRTAQEEGLYVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + +RS + + +
Sbjct: 121 YVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCERYIKELGKQLSPLTINNGGNIIMVQ 180
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M + VP C D G V +
Sbjct: 181 VENEYGS-----YAADKEYLAAIRDMIKEAGFNVPLFTC---DGGGQVEAGHVEGALPTL 232
Query: 145 NGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + K P E + +++ WG + Y R A+ + + ++
Sbjct: 233 NGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWGRRHSSVAYERPAEQLDWMLS----- 287
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQ-------APLDEYGLVREPKWGHLKEL 249
+G V+ YM+HGGTNF T A GY Q APL E+G PK+ +E+
Sbjct: 288 HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYDYDAPLGEWGNCY-PKYHAFREV 344
>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
Length = 591
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 123/276 (44%), Gaps = 42/276 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WNLHEPQ+G++ F G D+ RFI+ GL+V +R P
Sbjct: 35 WEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMADLERFIRLAGRLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVL-- 113
+I +EW +GGLP WL G+ R + Y K++ Y + P G P +L
Sbjct: 95 YICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYDELIPRLVPLLCTSGGPVILVQ 154
Query: 114 -------WAAKMAVDFH---------TGVPWV--------MCKQDDAPGPVINACNGMRC 149
+ + A H VP M + PG + G R
Sbjct: 155 VENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGPTDSMLQGGSLPGVLATVNFGSRT 214
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E+F P P + E W ++ W + + R A D A V + + G+ VN+YM
Sbjct: 215 AESFAKLREYQPQGPLMCMEYWNGWFDHWMEEHHQRDAADAA-RVFGEMLEAGASVNFYM 273
Query: 208 YHGGTNFGRTAAAFMITGY------YD-QAPLDEYG 236
+HGGTNFG A I Y YD +PL E+G
Sbjct: 274 FHGGTNFGFHNGANHIKTYEPTITSYDYDSPLTEWG 309
>gi|313241117|emb|CBY33414.1| unnamed protein product [Oikopleura dioica]
Length = 608
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 127/298 (42%), Gaps = 57/298 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ +QTY+ WNLHEP++G + F D+ F+K + GLYV +R GP
Sbjct: 34 WRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDELDVSEFLKIAKDVGLYVIMRPGP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFR-SDNKPY----------------------------- 91
+I +EW +GG P WL ++ R + ++ Y
Sbjct: 94 YICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQNWFTVLFSQLRDHQWSRGGPIISI 153
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDD--------APGPVINA 143
++ENEY A + K Y+ W + D + + + P + A
Sbjct: 154 QVENEY-----ASYNKDSEYLPWVKNLLTDVGKCFLLKIINETNFFLKGAHLLPDTFLTA 208
Query: 144 CNGMRCGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
N G F+ + PN+P + TE W ++ WG + + + I GS
Sbjct: 209 -NFQSVGNAFEVLDKLQPNRPKMVTEFWAGWFDHWGQQGHSTLSPTTFNKTMREILNAGS 267
Query: 202 YVNYYMYHGGTNFGRTAAAFMI----------TGYYDQAPLDEYGLVREPKWGHLKEL 249
VN YM+HGGT+FG A + + T Y APL E G + E KW +E+
Sbjct: 268 SVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTTSYDYDAPLSESGDLTE-KWNVTREI 324
>gi|83415088|ref|NP_001032730.1| beta-galactosidase precursor [Canis lupus familiaris]
gi|94730362|sp|Q9TRY9.3|BGAL_CANFA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|76470548|gb|ABA43388.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 668
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 126/292 (43%), Gaps = 47/292 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ GQY FSG D+ FIK GL V LR GP
Sbjct: 66 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGQYQFSGEQDVEYFIKLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITMQ 185
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + D+ H G ++ D A + A G+ F G
Sbjct: 186 VENEYGSYFTCDYDYLRFLQKLFHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDF-G 244
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P + +E +T + W G+P+ ++ I +G+
Sbjct: 245 PGANITAAFQIQRKSEPKGPLVNSEFYTGWLDHW-GQPHSTVRTEVVASSLHDILAHGAN 303
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKEL 249
VN YM+ GGTNF A M T Y APL E G + E K+ L+E+
Sbjct: 304 VNLYMFIGGTNFAYWNGANMPYQAQPTSYDYDAPLSEAGDLTE-KYFALREV 354
>gi|390469877|ref|XP_002807335.2| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Callithrix jacchus]
Length = 718
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 115/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI GL+ LR GP
Sbjct: 160 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFILMASEIGLWXILRPGP 219
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 220 YICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 279
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G +G
Sbjct: 280 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVHGVLATIN 332
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 333 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 391
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 392 SSINLYMFHGGTNFGFMNGAMHFHDY 417
>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
Length = 591
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 123/276 (44%), Gaps = 42/276 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WNLHEPQ+G++ F G D+ RFI+ GL+V +R P
Sbjct: 35 WEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMADLERFIRLAGRLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVL-- 113
+I +EW +GGLP WL G+ R + Y K++ Y + P G P +L
Sbjct: 95 YICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYDELIPRLVPLLCTSGGPVILVQ 154
Query: 114 -------WAAKMAVDFH---------TGVPWV--------MCKQDDAPGPVINACNGMRC 149
+ + A H VP M + PG + G R
Sbjct: 155 VENEYGSYGSDKAYLEHLRDGLVRRGIDVPLFTSDGPTDSMLQGGSLPGVLATVNFGSRT 214
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E+F P P + E W ++ W + + R A D A V + + G+ VN+YM
Sbjct: 215 AESFAKLREYQPQGPLMCMEYWNGWFDHWMEEHHQRDAADAA-RVFGEMLEAGASVNFYM 273
Query: 208 YHGGTNFGRTAAAFMITGY------YD-QAPLDEYG 236
+HGGTNFG A I Y YD +PL E+G
Sbjct: 274 FHGGTNFGFYNGANHIKTYEPTITSYDYDSPLTEWG 309
>gi|423294349|ref|ZP_17272476.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
gi|392675540|gb|EIY68981.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
Length = 778
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 124/281 (44%), Gaps = 52/281 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGIDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYG 236
+ + YM HGGT FG A + M + Y AP+ E G
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|383110805|ref|ZP_09931623.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
gi|313694380|gb|EFS31215.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
Length = 778
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 124/281 (44%), Gaps = 52/281 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + T VP C ++A +I N
Sbjct: 180 VENEYGS-----YGIDKPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYG 236
+ + YM HGGT FG A + M + Y AP+ E G
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|3025876|gb|AAC12775.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 662
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 126/292 (43%), Gaps = 47/292 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ GQY FSG D+ FIK GL V LR GP
Sbjct: 60 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGQYQFSGEQDVEYFIKLAHELGLLVILRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 120 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITMQ 179
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + D+ H G ++ D A + A G+ F G
Sbjct: 180 VENEYGSYFTCDYDYLRFLQKLFHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDF-G 238
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P + +E +T + W G+P+ ++ I +G+
Sbjct: 239 PGANITAAFQIQRKSEPKGPLVNSEFYTGWLDHW-GQPHSTVRTEVVASSLHDILAHGAN 297
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKEL 249
VN YM+ GGTNF A M T Y APL E G + E K+ L+E+
Sbjct: 298 VNLYMFIGGTNFAYWNGANMPYQAQPTSYDYDAPLSEAGDLTE-KYFALREV 348
>gi|354472811|ref|XP_003498630.1| PREDICTED: beta-galactosidase [Cricetulus griseus]
Length = 681
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 125/289 (43%), Gaps = 58/289 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI GL V LR GP
Sbjct: 78 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEYFIHLAHKLGLLVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-NEYQTI-----EPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y +++ T+ +P ++ G P +
Sbjct: 138 YICAEWDMGGLPAWLLEKESIVLRSSDPDYLAAVDKWLTVLLPKMKPLLYQNGGPIITVQ 197
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN 157
+ + A D +H G ++ D A N +RCG T +G
Sbjct: 198 VENEYGSYFACDYDYLRFLAHRFRYHLGNDVLLFTTDGA------NENFLRCG-TLQGLY 250
Query: 158 S---------------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
+ P P I +E +T + WG Y + +A + +
Sbjct: 251 ATVDFGAVKNITQAFLIQRKFEPKGPLINSEFYTGWLDHWGEPHYTVKTEIVAASLYDLL 310
Query: 197 AKNGSYVNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
A+ G+ VN YM+ GGTNF A T Y APL E G + E
Sbjct: 311 AR-GASVNLYMFIGGTNFAYWNGANIPYAAQPTSYDYDAPLSEAGDLTE 358
>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 587
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 120/278 (43%), Gaps = 42/278 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + GL+ ++TY+ WNLHEP++GQ+ F G D+ RF++ GL+V LR P
Sbjct: 36 WEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVRIAGDLGLHVILRPSP 95
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVLWA 115
+I +EW +GGLP WL I R + Y K++ Y + P KG P +
Sbjct: 96 YICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRLVPLLTSKGGPVIAMQ 155
Query: 116 ----------------------AKMAVD---FHTGVPWV-MCKQDDAPGPVINACNGMRC 149
K VD F + P M + PG + G R
Sbjct: 156 IENEYGSYGNDTAYLEYLKDGLIKRGVDVLLFTSDGPTDGMLQGGAVPGVLATVNFGSRT 215
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E F P P + E W ++ W + R A+D A + N S VN+YM
Sbjct: 216 KEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDAAAVFKEMLDLNAS-VNFYM 274
Query: 208 YHGGTNFG-RTAAAF------MITGYYDQAPLDEYGLV 238
+HGGTNFG A F +T Y APL E G V
Sbjct: 275 FHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECGDV 312
>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
Length = 791
Score = 109 bits (273), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 126/299 (42%), Gaps = 67/299 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE Q+G++DF+ ND+ F + Q GLYV +R GP
Sbjct: 65 WEHRIKMCKALGMNTVCLYVFWNIHEQQEGKFDFTDNNDVAEFCRLAQRNGLYVIVRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R + PY
Sbjct: 125 YVCAEWEMGGLPWWLLKKKDIRLREPD-PYFMERVKLFERKVGEQLASLTIQNGGPIIMV 183
Query: 92 KIENEY----------QTIEPAFHEKGPPYVL-----WAAKMAVDFHTGVPWVMCKQDDA 136
++ENEY I + G V WA+ + + W M
Sbjct: 184 QVENEYGSYGENKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTM------ 237
Query: 137 PGPVINACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
N G + F+ G PN P + +E W+ ++ WG + R A+ + +
Sbjct: 238 -----NFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKTMVEGIDE 292
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLK 247
++K S+ + YM HGGT+FG A A +T Y AP++EYG PK+ L+
Sbjct: 293 MLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKYWELR 349
>gi|426371159|ref|XP_004052521.1| PREDICTED: beta-galactosidase-1-like protein 3 [Gorilla gorilla
gorilla]
Length = 653
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 135/304 (44%), Gaps = 55/304 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMGAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL ++ R+ NK + +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 223
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCG-- 150
+ENEY + +K Y+L+ K + G+ ++ D + G+
Sbjct: 224 VENEYGSF-----KKDKTYMLYLHKALL--RRGIVELLLTSDGEKHVLSGHTKGVLAAIN 276
Query: 151 ------ETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+TF + +KP + E W ++ WG K +++ A+++ V+ FI S+
Sbjct: 277 LQKLHQDTFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF 336
Query: 203 VNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
N YM+HGGTNFG A ++T Y A L E G E K+ L++L ++
Sbjct: 337 -NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSVSA 394
Query: 256 CSRP 259
P
Sbjct: 395 TPLP 398
>gi|313238883|emb|CBY13879.1| unnamed protein product [Oikopleura dioica]
Length = 601
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 127/298 (42%), Gaps = 57/298 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ +QTY+ WNLHEP++G + F D+ F+K + GLYV +R GP
Sbjct: 34 WRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDELDVSEFLKIAKDVGLYVIMRPGP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFR-SDNKPY----------------------------- 91
+I +EW +GG P WL ++ R + ++ Y
Sbjct: 94 YICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQNWFTVLFSQLRDHQWSRGGPIISI 153
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDD--------APGPVINA 143
++ENEY A + K Y+ W + D + + + P + A
Sbjct: 154 QVENEY-----ASYNKDSEYLPWVKNLLTDVGKCFLLKIINETNFFLKGAHLLPDTFLTA 208
Query: 144 CNGMRCGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
N G F+ + PN+P + TE W ++ WG + + + I GS
Sbjct: 209 -NFQSVGNAFEVLDKLQPNRPKMVTEFWAGWFDHWGQQGHSLLSPTTFNKTMREILNAGS 267
Query: 202 YVNYYMYHGGTNFGRTAAAFMI----------TGYYDQAPLDEYGLVREPKWGHLKEL 249
VN YM+HGGT+FG A + + T Y APL E G + E KW +E+
Sbjct: 268 SVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTTSYDYDAPLSESGDLTE-KWNVTREI 324
>gi|344248604|gb|EGW04708.1| Beta-galactosidase [Cricetulus griseus]
Length = 650
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 125/289 (43%), Gaps = 58/289 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI GL V LR GP
Sbjct: 47 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEYFIHLAHKLGLLVILRPGP 106
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIE-NEYQTI-----EPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y +++ T+ +P ++ G P +
Sbjct: 107 YICAEWDMGGLPAWLLEKESIVLRSSDPDYLAAVDKWLTVLLPKMKPLLYQNGGPIITVQ 166
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN 157
+ + A D +H G ++ D A N +RCG T +G
Sbjct: 167 VENEYGSYFACDYDYLRFLAHRFRYHLGNDVLLFTTDGA------NENFLRCG-TLQGLY 219
Query: 158 S---------------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
+ P P I +E +T + WG Y + +A + +
Sbjct: 220 ATVDFGAVKNITQAFLIQRKFEPKGPLINSEFYTGWLDHWGEPHYTVKTEIVAASLYDLL 279
Query: 197 AKNGSYVNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
A+ G+ VN YM+ GGTNF A T Y APL E G + E
Sbjct: 280 AR-GASVNLYMFIGGTNFAYWNGANIPYAAQPTSYDYDAPLSEAGDLTE 327
>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
Length = 784
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 126/300 (42%), Gaps = 67/300 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE Q+ +YDF+G ND+ F + Q G+YV +R GP
Sbjct: 61 WDQRIKMCKALGMNTICLYVFWNIHEQQESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R D+ PY
Sbjct: 121 YVCAEWEMGGLPWWLLKKKDIRLREDD-PYFLARVKAFEAEVGRQLAPLTIQNGGPIIMV 179
Query: 92 KIENEYQT----------IEPAFHEKGPPYVL-----WAAKMAVDFHTGVPWVMCKQDDA 136
++ENEY + I G V WA+ + + W M
Sbjct: 180 QVENEYGSYGVNKQYVSQIRDIVKASGFDKVTLFQCDWASNFEKNGLDDLLWTM------ 233
Query: 137 PGPVINACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
N G FK P P + +E W+ ++ WG + R A+ + +
Sbjct: 234 -----NFGTGSNIDAQFKRLKQLRPETPLMCSEFWSGWFDKWGARHETRPAKAMVEGINE 288
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKE 248
++KN S+ + YM HGGT+FG A A +T Y AP++EYG PK+ L++
Sbjct: 289 MLSKNISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGHA-TPKFWELRK 346
>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 651
Score = 109 bits (272), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 123/278 (44%), Gaps = 58/278 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 101 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 160
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPP----- 110
+ +EW GG P WL I RS + + ++ + ++P + G P
Sbjct: 161 YACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQVQPLLNHNGGPIIAVQ 220
Query: 111 ---------------------YVLWAAKMAVDFHTG---------VPWVMCKQDDAPGPV 140
YV A+ F + +P + + APG
Sbjct: 221 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 280
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF--IAK 198
+A + + F+ P++P + E W ++ W GKP+ +A D F I +
Sbjct: 281 KSAFDKLIA---FR----PDQPRMVGEYWAGWFDHW-GKPH--AATDATQQAEEFEWILR 330
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEYG 236
G N YM+ GGT+FG FM + P D Y
Sbjct: 331 QGHSANLYMFIGGTSFG-----FMNGANFQNNPSDHYA 363
>gi|26345448|dbj|BAC36375.1| unnamed protein product [Mus musculus]
Length = 682
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 45/296 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI+ GL V LR GP
Sbjct: 66 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEHFIQLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLVAVDKWLAVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--------------A 143
+ + A D +H G ++ D A ++
Sbjct: 186 VENEYGSYFACDYDYLRFLVHRFRYHLGNDVILFTTDGASEKMLKCGTLQDLYATVDFGT 245
Query: 144 CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
N + + P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 246 GNNITQAFLVQRKFEPKGPLINSEFYTGWLDHWGKPHSTVKTKTLATSLYNLLAR-GANV 304
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
N YM+ GGTNF T T Y APL E G + + K+ L+E+ K
Sbjct: 305 NLYMFIGGTNFAYWNGANTPYEPQPTSYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|363742521|ref|XP_003642647.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Gallus gallus]
Length = 637
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 136/321 (42%), Gaps = 60/321 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHE +G++DFS D+ F+ GL+V LR GP
Sbjct: 77 WEDRMLKMKACGLNTLTTYVPWNLHEQTRGKFDFSENLDLQAFLSLAAKNGLWVILRPGP 136
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW GGLP WL + R+ K + +
Sbjct: 137 YICSEWDLGGLPSWLLQDPEMQLRTTYKGFTEAVDAYFDHLMPIVVPLQYKRGGPIIAVQ 196
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + K P Y+ + + + G+ ++ D+ G G
Sbjct: 197 VENEYGS-----YAKDPNYMAYVKRALLS--RGIVELLMTSDNKNGLSFGLVEGALATVN 249
Query: 153 FKGPNSP-------------NKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
F+ N P ++P + E WT ++ WGG Y+ A ++ VA I K
Sbjct: 250 FQ--NLPLSILTLFLFXVQRDQPKMVMEYWTGWFDNWGGPHYVFDADEMVNTVAS-ILKL 306
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
G+ +N YM+HGGTNFG A +T Y A L E G K+ L++L +
Sbjct: 307 GASINLYMFHGGTNFGFMNGALKTDEYKSDVTSYDYDAVLTEAGDYTS-KFFKLRQLFST 365
Query: 253 IKLCSRPLLTGTQNVISLGQL 273
I PL ++ S G +
Sbjct: 366 IIGQPLPLPPMIESKASYGAI 386
>gi|345800024|ref|XP_546385.3| PREDICTED: galactosidase, beta 1-like 3 [Canis lupus familiaris]
Length = 808
Score = 109 bits (272), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 131/291 (45%), Gaps = 41/291 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 259 WGDRLRKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDMEAFVLLAAEMGLWVILRPGP 318
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-KIENEY-----QTIEPAFHEKGPPYVLWA 115
+I SE GGLP WL +V R+ + K ++Y + P + +G P +
Sbjct: 319 YICSEIDLGGLPSWLLQDPKMVLRTTYSGFVKAVDKYFDHLISRVVPLQYRRGGPIIAVQ 378
Query: 116 AK-----MAVD-----------FHTGVPWVMCKQDDAPGPVINACNGMRCG---ETFKGP 156
+ A D G+ ++ DDA + G+ +F+
Sbjct: 379 VENEYGSFAEDRGYMPYLQKALLERGIVELLVTSDDAENLLKGHIKGVLATINMNSFQES 438
Query: 157 N-------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
+ NKP + E W ++ WG + +++ +D+ V FIA S+ N YM+H
Sbjct: 439 DFKLLSYVQSNKPIMVMEFWVGWFDTWGSEHKVKNPKDVEETVTKFIASEISF-NVYMFH 497
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
GGTNFG A ++T Y A L E G E K+ L+ L ++
Sbjct: 498 GGTNFGFMNGATDFGIHRGVVTSYDYDAVLTEAGDYTE-KYFKLRRLFGSV 547
>gi|22760724|dbj|BAC11309.1| unnamed protein product [Homo sapiens]
Length = 636
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 80/266 (30%), Positives = 117/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDQEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN 142
+ENEY + + K P Y+ + K D G+ ++ D+ G IN
Sbjct: 198 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 250
Query: 143 --ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + ++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY 335
>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 587
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/278 (31%), Positives = 120/278 (43%), Gaps = 42/278 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + GL+ ++TY+ WNLHEP++GQ+ F G D+ RF++ GL+V LR P
Sbjct: 36 WEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVRIAGDLGLHVILRPSP 95
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVLWA 115
+I +EW +GGLP WL I R + Y K++ Y + P KG P +
Sbjct: 96 YICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRLVPLLTSKGGPVIAMQ 155
Query: 116 ----------------------AKMAVD---FHTGVPWV-MCKQDDAPGPVINACNGMRC 149
K VD F + P M + PG + G R
Sbjct: 156 IENEYGSYGNDTAYLEYLKDGLIKRGVDVLLFTSDGPTDGMLQGGAVPGVLATVNFGSRT 215
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E F P P + E W ++ W + R A+D A + N S VN+YM
Sbjct: 216 KEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDAAAVFKEMLDLNAS-VNFYM 274
Query: 208 YHGGTNFG-RTAAAF------MITGYYDQAPLDEYGLV 238
+HGGTNFG A F +T Y APL E G V
Sbjct: 275 FHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECGDV 312
>gi|26339346|dbj|BAC33344.1| unnamed protein product [Mus musculus]
Length = 756
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 45/296 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI+ GL V LR GP
Sbjct: 66 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEHFIQLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLVAVDKWLAVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--------------A 143
+ + A D +H G ++ D A ++
Sbjct: 186 VENEYGSYFACDYDYLRFLVHRFRYHLGNDVILFTTDGASEKMLKCGTLQDLYATVDFGT 245
Query: 144 CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
N + + P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 246 GNNITQAFLVQRKFEPKGPLINSEFYTGWLDHWGKPHSTVKTKTLATSLYNLLAR-GANV 304
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
N YM+ GGTNF T T Y APL E G + + K+ L+E+ K
Sbjct: 305 NLYMFIGGTNFAYWNGANTPYEPQPTSYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|449489521|ref|XP_004174618.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein 2
[Taeniopygia guttata]
Length = 635
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 76/259 (29%), Positives = 116/259 (44%), Gaps = 47/259 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + GL+ + TYV WNLHE ++G++DFS D+ + GL+V LR GP
Sbjct: 77 WEDRMLKMRACGLNTLTTYVPWNLHEKERGKFDFSKNLDLRYVAQTALXNGLWVILRPGP 136
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW GGLP WL + R+ K + +
Sbjct: 137 YICSEWDLGGLPSWLLQDPEMQLRTTYKGFTEAVDAYFDRLMRVVVPLQYKKGGPIIAVQ 196
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + K P Y+ + KMA+ + G+ ++ D+ G G
Sbjct: 197 VENEYGS-----YAKDPNYMTYV-KMAL-LNRGIVELLMTSDNKNGLSFGLVEGALATVN 249
Query: 153 FKGPN----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
F+ ++P + E WT ++ WGG Y+ A ++ VA I K G+
Sbjct: 250 FQKLEPGLLKYLDTVQKDQPKMVMEYWTGWFDNWGGPHYVFDADEMVNTVAS-ILKTGAS 308
Query: 203 VNYYMYHGGTNFGRTAAAF 221
+N YM+HGGTNFG + A
Sbjct: 309 INLYMFHGGTNFGFMSGAL 327
>gi|148677363|gb|EDL09310.1| galactosidase, beta 1, isoform CRA_b [Mus musculus]
Length = 669
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 45/296 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI+ GL V LR GP
Sbjct: 81 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEHFIQLAHELGLLVILRPGP 140
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + ++P ++ G P +
Sbjct: 141 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLVAVDKWLAVLLPKMKPLLYQNGGPIITVQ 200
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--------------A 143
+ + A D +H G ++ D A ++
Sbjct: 201 VENEYGSYFACDYDYLRFLVHRFRYHLGNDVILFTTDGASEKMLKCGTLQDLYATVDFGT 260
Query: 144 CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
N + + P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 261 GNNITQAFLVQRKFEPKGPLINSEFYTGWLDHWGKPHSTVKTKTLATSLYNLLAR-GANV 319
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
N YM+ GGTNF T T Y APL E G + + K+ L+E+ K
Sbjct: 320 NLYMFIGGTNFAYWNGANTPYEPQPTSYDYDAPLSEAGDLTK-KYFALREVIQMFK 374
>gi|1352080|sp|P48982.1|BGAL_XANMN RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|1045034|gb|AAC41485.1| beta-galactosidase [Xanthomonas axonopodis pv. manihotis]
Length = 598
Score = 108 bits (271), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 115/268 (42%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F+KE +QGL V LR GP
Sbjct: 61 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVKEAAAQGLNVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + ++P + G P +
Sbjct: 121 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALAKQVQPLLNHNGGPIIAVQ 180
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 181 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 240
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 241 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSANLYM 299
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM + P D Y
Sbjct: 300 FIGGTSFG-----FMNGANFQNNPSDHY 322
>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 621
Score = 108 bits (271), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 90/325 (27%), Positives = 142/325 (43%), Gaps = 55/325 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWN HE G+++FSG D+ +FIK Q GLYV +R GP
Sbjct: 62 WKHRLEMMKAMGLNTVTTYVFWNYHEEAPGKWNFSGEKDLQKFIKTAQETGLYVIIRPGP 121
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYVLWA 115
++ +EW +GG P WL + R DNK + I + I P G P ++
Sbjct: 122 YVCAEWEFGGYPWWLQKNKELEIRRDNKAFSEECWKYISQLAKQITPMQITNGGPVIMVQ 181
Query: 116 AK------------MAVDFH-------------TGVPWVMCKQDDAP----GPVINA--- 143
A+ + ++ H +G+ + D + G V A
Sbjct: 182 AENEFGSYVAQRKDIPLEEHRKYSHKIKEMLLKSGISVPLFTSDGSSLFKGGSVEGALPT 241
Query: 144 CNGMRCGETFKGP----NSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIAK 198
NG + K N P + E + + W +P+++ S +++ L+I +
Sbjct: 242 ANGESDIDVLKKSINEYNGGKGPYMIAEYYPGWLDHW-AEPFVKVSTEEVVKQTNLYI-E 299
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
NG NYYM HGGTNFG T+ A +T Y AP+ E G PK+ L+++
Sbjct: 300 NGVSFNYYMIHGGTNFGFTSGANYDKDHDIQPDLTSYDYDAPISEAGWAT-PKYNALRKI 358
Query: 250 HAAIKLCSRPLLTGTQNVISLGQLQ 274
I P + VI++ +++
Sbjct: 359 FQKIHKNKLPDVPKPIKVITIPEIE 383
>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 613
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 123/277 (44%), Gaps = 58/277 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + ++P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQVQPLLNHNGGPIIAVQ 182
Query: 113 -------------------------------LWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
L+ + A G +P + + APG
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGAEMLANGTLPDTLAVVNFAPGEA 242
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF--IAK 198
+A + + F+ P++P + E W ++ W GKP+ +A D F I +
Sbjct: 243 KSAFDKLIA---FR----PDQPRMVGEYWAGWFDHW-GKPH--AATDATQQAEEFEWILR 292
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
G N YM+ GGT+FG FM + P D Y
Sbjct: 293 QGHSANLYMFIGGTSFG-----FMNGANFQNNPSDHY 324
>gi|329927841|ref|ZP_08281902.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938242|gb|EGG34637.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 619
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 129/278 (46%), Gaps = 46/278 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN+HEPQ+G++ FSG D+ FI+ GL+V +R P
Sbjct: 35 WEDRLLKLKACGFNTVETYIAWNVHEPQEGKFSFSGMADVASFIELAGKLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVLWA 115
FI +EW +GGLP WL I R + Y K+++ Y + P G P + A
Sbjct: 95 FICAEWEFGGLPGWLLGYGEIRLRCSDPLYLSKVDHYYDELIPRLVPLLSSNGGP--ILA 152
Query: 116 AKMAVDF------HTGVPW-----------VMCKQDDAP------GPVINACN-----GM 147
++ ++ H + + V+ D P G +N + G
Sbjct: 153 VQVENEYGSYGNDHAYLDYLRAGLVRRGIDVLLFTSDGPTDEMLLGGTLNDVHATVNFGS 212
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
R E+F+ +P + E W ++ W ++R A D+A + + K GS +N
Sbjct: 213 RVEESFRKYREYRTEEPLMVMEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEK-GSSMNM 271
Query: 206 YMYHGGTNFGRTAAAFMITGY------YD-QAPLDEYG 236
YM+HGGTNFG + A I Y YD APL E+G
Sbjct: 272 YMFHGGTNFGFYSGANHIQTYEPTTTSYDYDAPLTEWG 309
>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 613
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 123/277 (44%), Gaps = 58/277 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVQEAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPP----- 110
+ +EW GG P WL I RS + + ++ + ++P + G P
Sbjct: 123 YACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQAYLDAVAKQVQPLLNHNGGPIIAVQ 182
Query: 111 ---------------------YVLWAAKMAVDFHTG---------VPWVMCKQDDAPGPV 140
YV A+ F + +P + + APG
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF--IAK 198
+A + + F+ P++P + E W ++ W GKP+ +A D F I +
Sbjct: 243 KSAFDKLIA---FR----PDQPRMVGEYWAGWFDHW-GKPH--AATDATQQAEEFEWILR 292
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
G N YM+ GGT+FG FM + P D Y
Sbjct: 293 QGHSANLYMFIGGTSFG-----FMNGANFQNNPSDHY 324
>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
Length = 628
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 114/418 (27%), Positives = 167/418 (39%), Gaps = 80/418 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWN HE G++++SG D+ +FIK Q GLYV +R GP
Sbjct: 60 WKHRLQMMKAMGLNAVTTYVFWNYHEENPGKWNWSGEKDLKKFIKTAQEVGLYVIIRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVLWA 115
++ +EW +GG P WL ++ G+ R DN + E + Y ++ G P ++
Sbjct: 120 YVCAEWEFGGYPWWLQNIKGLKIREDNNLFLAETQKYITQLYNQVKDLQITNGGPVIMVQ 179
Query: 116 A---------------------------KMAVDFHTGVPWVMCKQDDA----PGPVINA- 143
A K D VP M D + G V+ A
Sbjct: 180 AENEFGSFVAQRKDIPLASHRTYNAKIVKQLKDAGFSVP--MFTSDGSWLFEGGSVVGAL 237
Query: 144 --CNGMRCGETFKGP----NSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
NG E K N+ P + E + + W K A +A ++
Sbjct: 238 PTANGEDNIENLKKIVNQYNNNQGPYMVAEFYPGWLAHWAEKFPRVDAGTVARQTDKYL- 296
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKE 248
KN NYYM HGGTNFG T A +T Y AP+ E G R PK+ L+
Sbjct: 297 KNDVSFNYYMVHGGTNFGFTNGANYDKNHDIQPDLTSYDYDAPITEAGW-RTPKYDSLRA 355
Query: 249 L---HAAIKLCSRPLLTGTQNV--ISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLF 303
+ H KL P ++ I L +L F + E V A D+ + L
Sbjct: 356 VISKHTKAKLPEVPAPIKVIDIKDIKLSKLYNFFNYAEGQQVVKA-----DKPLSFEDLN 410
Query: 304 RNISYELPRK----------SISILPDCKTVAFNTERV---STQYNKRSKTSNLKFDS 348
+ Y L R+ + L D T+ N E+V + YN + ++ F+S
Sbjct: 411 QGHGYVLYRRHFNQPISGTLDLKGLRDYATIYINGEKVGELNRYYNHYTMPIDIPFNS 468
>gi|336410484|ref|ZP_08590961.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
gi|335944314|gb|EGN06136.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
Length = 769
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 136/330 (41%), Gaps = 57/330 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLREARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPL-------DEYGLVRE------PKWGHL 246
YM HGGT FG A M + Y AP+ D+Y L+R+ P L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTDKYFLLRDLLKNYLPAGEQL 349
Query: 247 KELHAAIKLCSRPLLTGTQNVISLGQLQEA 276
E+ A + P + TQ L EA
Sbjct: 350 PEIPEAFPVIEIPEVEFTQVAPLFSNLPEA 379
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|281352249|gb|EFB27833.1| hypothetical protein PANDA_007660 [Ailuropoda melanoleuca]
Length = 626
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 122/283 (43%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ+YV WN HEPQ GQY FSG +D+ FIK GL V LR GP
Sbjct: 39 WKDRLLKMKMAGLNAIQSYVPWNFHEPQPGQYQFSGEHDVEYFIKLAHELGLLVILRPGP 98
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 99 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 158
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + + D +H G ++ D A + A G+ F G
Sbjct: 159 VENEYGSYFSCDYDHLRFLQKLFHYHLGNDVLLFTTDGAHEMFLKCGALQGLYATVDF-G 217
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P + +E +T + W G+P+ + ++ I G+
Sbjct: 218 PGANITAAFEIQRKSEPRGPLVNSEFYTGWLDHW-GQPHSTAKTEVVASALHEILSRGAN 276
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF A M T Y APL E G + E
Sbjct: 277 VNLYMFIGGTNFAYWNGANMPYQAQPTSYDYDAPLSEAGDLTE 319
>gi|301767332|ref|XP_002919083.1| PREDICTED: beta-galactosidase-like [Ailuropoda melanoleuca]
Length = 668
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 122/283 (43%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ+YV WN HEPQ GQY FSG +D+ FIK GL V LR GP
Sbjct: 66 WKDRLLKMKMAGLNAIQSYVPWNFHEPQPGQYQFSGEHDVEYFIKLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + + D +H G ++ D A + A G+ F G
Sbjct: 186 VENEYGSYFSCDYDHLRFLQKLFHYHLGNDVLLFTTDGAHEMFLKCGALQGLYATVDF-G 244
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P + +E +T + W G+P+ + ++ I G+
Sbjct: 245 PGANITAAFEIQRKSEPRGPLVNSEFYTGWLDHW-GQPHSTAKTEVVASALHEILSRGAN 303
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF A M T Y APL E G + E
Sbjct: 304 VNLYMFIGGTNFAYWNGANMPYQAQPTSYDYDAPLSEAGDLTE 346
>gi|299147339|ref|ZP_07040404.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298514617|gb|EFI38501.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 124/281 (44%), Gaps = 52/281 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + + VP C ++A +I N
Sbjct: 180 VENEYGS-----YGIDKPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYG 236
+ + YM HGGT FG A + M + Y AP+ E G
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|411007376|ref|ZP_11383705.1| beta-galactosidase [Streptomyces globisporus C-1027]
Length = 606
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 100/346 (28%), Positives = 149/346 (43%), Gaps = 61/346 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A GL+ ++TYV WNLHEP++G+ G + RF+ ++ GL+ +R GP
Sbjct: 35 WEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG--ALGRFLDAVERAGLWAIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAFHE----KGPPYVL-- 113
+I +EW GGLP+W+ G R+ + Y+ +E ++ + P + +G P +L
Sbjct: 93 YICAEWENGGLPVWVTGRFGRRVRTRDAEYRAVVERWFRELLPQVVQRQVVRGGPVILVQ 152
Query: 114 ----------------WAAKMAVDFHTGVPWV--------MCKQDDAPGPVINA--CNGM 147
W A + + VP M PG + A +G
Sbjct: 153 AENEYGSFGSDAVYLEWLAGLLRECGVTVPLFTSDGPEDHMLTGGSVPGLLATANFGSGA 212
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
R G + P P + E W ++ WG +P +R A++ A + I + G+ VN YM
Sbjct: 213 REGFEVLRRHQPKGPLMCMEFWCGWFDHWGAEPVLRDAEEAAGALRE-ILECGASVNVYM 271
Query: 208 YHGGTNFGRTAAAF------------MITGYYDQAPLDEYG-----------LVREPKWG 244
HGGTNF A A +T Y AP+DEYG ++RE G
Sbjct: 272 AHGGTNFAGWAGANRGGPLQDGEFQPTVTSYDYDAPVDEYGRATEKFHLFRKVLREYAEG 331
Query: 245 HLKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEET-SGVCAAF 289
L EL K + P+ LG + EA ET SGV AF
Sbjct: 332 PLPELPPEPKGLAVPVRAELTGWTGLGDVLEALGDPETESGVPPAF 377
>gi|346725882|ref|YP_004852551.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
gi|346650629|gb|AEO43253.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
Length = 611
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 117/270 (43%), Gaps = 44/270 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 61 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + ++P + G P +
Sbjct: 121 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQSYLDALAKQVQPLLNHNGGPIIAVQ 180
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 181 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 240
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF--IAKNGSYVNY 205
+ P++P + E W ++ W GKP+ +A D F I + G N
Sbjct: 241 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPH--AATDARQQAEEFEWILRQGHSANL 297
Query: 206 YMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
YM+ GGT+FG FM + P D Y
Sbjct: 298 YMFIGGTSFG-----FMNGANFQNNPSDHY 322
>gi|114641374|ref|XP_001157987.1| PREDICTED: galactosidase, beta 1-like 2 isoform 2 [Pan troglodytes]
Length = 636
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 114/266 (42%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++ ++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERSKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 138 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G G
Sbjct: 198 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 250
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFMNGAMHFHDY 335
>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
Length = 787
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 130/290 (44%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE ++GQ+DF+ ND+ F + Q G+YV +R GP
Sbjct: 55 WEHRIKMCKALGMNTLCIYVFWNIHEQREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGP 114
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY-------QTIEPAFHEKGPPYVLW 114
++ +EW GGLP WL I R + PY +E + + P + G P ++
Sbjct: 115 YVCAEWEMGGLPWWLLKKKDIRLR-ERDPYFLERVKIFEQKVGEQLAPLTIQNGGPIIMV 173
Query: 115 AAKMAV-DFHTGVPWVMCKQDDAPGPVINACNGMRC--GETFK---------------GP 156
+ + P+V +D G +C F+ G
Sbjct: 174 QVENEYGSYGEDKPYVSEIRDCLRGIYGEKLTLFQCDWSSNFERNGLDDLVWTMNFGTGA 233
Query: 157 N-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
N PN P + +E W+ ++ WG R A+D+ + ++KN S+ +
Sbjct: 234 NIDHEFARLKQLRPNAPLMCSEFWSGWFDKWGANHETRPAKDMVDGMDEMLSKNISF-SL 292
Query: 206 YMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT+FG A A +T Y AP++EYG E K+ L+++
Sbjct: 293 YMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGGTTE-KFFQLRKM 341
>gi|73954410|ref|XP_848226.1| PREDICTED: galactosidase, beta 1-like 2 isoform 1 [Canis lupus
familiaris]
Length = 636
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 116/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDMEAFVLLAAEMGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL +G+ R+ K + +
Sbjct: 138 YICSEIDLGGLPSWLLQDSGMRLRTTYKGFTEAVDLYFDHLMARVVPLQYKHGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G +G
Sbjct: 198 VENEYGS-----YNKDPAYMPYIKKALED--RGIVELLLTSDNKDGLQKGVLDGALATIN 250
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ F +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSQHELQLLTNFLVSVQRVQPRMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-ILDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 310 SSINLYMFHGGTNFGFINGAMHFHEY 335
>gi|293370654|ref|ZP_06617206.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|292634388|gb|EFF52925.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 778
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 124/281 (44%), Gaps = 52/281 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + + VP C ++A +I N
Sbjct: 180 VENEYGS-----YGIDKPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKRLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYG 236
+ + YM HGGT FG A + M + Y AP+ E G
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
Length = 823
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 142/346 (41%), Gaps = 57/346 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWNLHEP+ G++DF+G+ND+ F + Q +YV LR GP
Sbjct: 99 WEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFTGQNDLAAFCRLCQQNDMYVILRPGP 158
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R + PY
Sbjct: 159 YVCAEWEMGGLPWWLLKKKDIRLREAD-PYFIERVNIFEQEVARQVGGLTIQNGGPIIMV 217
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPV--INA 143
++ENEY + + + YV + V C ++ P + IN
Sbjct: 218 QVENEYGS-----YGESKEYVSLIRDIVRTNFGDVTLFQCDWASNFTKNALPDLLWTINF 272
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + F G P+ P + +E W+ ++ WG R A D+ + ++K S
Sbjct: 273 GTGANIDQQFAGLKKLRPDSPLMCSEFWSGWFDKWGANHETRPASDMIAGIDEMLSKGIS 332
Query: 202 YVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
+ + YM HGGTN+G A A +T Y AP+ E G W K L +
Sbjct: 333 F-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTTPKYWALRKTLGKYMNG 391
Query: 256 CSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTV 301
+ + +S+ AF F E + + A ++ ++ T+
Sbjct: 392 EKQTKVPDMIKSVSI----PAFQFTEVAPLFANLPISKKDKNIRTM 433
>gi|192185|gb|AAA37292.1| acid beta-galactosidase [Mus musculus]
gi|148677364|gb|EDL09311.1| galactosidase, beta 1, isoform CRA_c [Mus musculus]
Length = 647
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 45/296 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI+ GL V LR GP
Sbjct: 66 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEHFIQLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLVAVDKWLAVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--------------A 143
+ + A D +H G ++ D A ++
Sbjct: 186 VENEYGSYFACDYDYLRFLVHRFRYHLGNDVILFTTDGASEKMLKCGTLQDLYATVDFGT 245
Query: 144 CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
N + + P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 246 GNNITQAFLVQRKFEPKGPLINSEFYTGWLDHWGKPHSTVKTKTLATSLYNLLAR-GANV 304
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
N YM+ GGTNF T T Y APL E G + + K+ L+E+ K
Sbjct: 305 NLYMFIGGTNFAYWNGANTPYEPQPTSYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|297194972|ref|ZP_06912370.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|297152570|gb|EFH31854.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 599
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 123/288 (42%), Gaps = 49/288 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A + GL+ ++TYV WNLHEP+ G+Y G + RF+ + + G++ +R GP
Sbjct: 42 WGHRLAMLRAMGLNCVETYVPWNLHEPEPGRYADDG--ALGRFLDAVHAAGMWAIVRPGP 99
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFHEK-----GP----- 109
+I +EW GGLP WL G R+++ Y +E + + P E+ GP
Sbjct: 100 YICAEWENGGLPFWLTGRVGRRVRTEDPEYLGHVERWFTRLLPQVVEREITRGGPVVMVQ 159
Query: 110 ------------PYVLWAAKMAVDFHTGVPWV--------MCKQDDAPGPVINACNGMRC 149
Y+ ++ GVP M PG + G
Sbjct: 160 VENEYGSYGSDGGYLRQLVELLRSCGVGVPLFTSDGPEDHMLSGGSVPGVLATVNFGSGA 219
Query: 150 GETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
GE F + P P + E W +++ WG +P R A+D A I + G+ VN YM
Sbjct: 220 GEAFAALRRHRPTGPLMCMEFWCGWFEHWGAEPARRDAEDAA-RALREILEAGASVNVYM 278
Query: 208 YHGGTNFGRTAAAF------------MITGYYDQAPLDEYGLVREPKW 243
HGGT+FG A A +T Y AP+DE G E W
Sbjct: 279 AHGGTSFGGWAGANRSGELHDGVLEPTVTSYDYDAPVDEAGRPTEKFW 326
>gi|397498763|ref|XP_003820147.1| PREDICTED: beta-galactosidase-1-like protein 2 [Pan paniscus]
Length = 720
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 114/266 (42%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++ ++DFSG D+ F+ GL+V LR GP
Sbjct: 162 WRDRLLKMKACGLNTLTTYVPWNLHEPERSKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 221
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL G+ R+ K + +
Sbjct: 222 YICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQ 281
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G G
Sbjct: 282 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATIN 334
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 335 LQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAG 393
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 394 SSINLYMFHGGTNFGFMNGAMHFHDY 419
>gi|218260271|ref|ZP_03475643.1| hypothetical protein PRABACTJOHN_01305, partial [Parabacteroides
johnsonii DSM 18315]
gi|218224641|gb|EEC97291.1| hypothetical protein PRABACTJOHN_01305 [Parabacteroides johnsonii
DSM 18315]
Length = 539
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 128/297 (43%), Gaps = 55/297 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y FWN+HE + G++DFSG+NDI F + Q +Y+ LR GP
Sbjct: 63 WEHRIQLCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKYDMYIMLRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ SEW GGLP WL I R+ N PY
Sbjct: 123 YVCSEWEMGGLPWWLLKKDDIKLRT-NDPYFLERTKLFMNEIGKQLADLQITKGGNIIMV 181
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---IN 142
++ENEY + + Y+ + T VP C Q++A + IN
Sbjct: 182 QVENEYGS-----YATDKEYIANIRDIVKGAGFTDVPLFQCDWSSNFQNNALDDLVWTIN 236
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G E FK PN P + +E W+ ++ WG K R A+ + + + +
Sbjct: 237 FGTGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLDRGI 296
Query: 201 SYVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHA 251
S+ + YM HGGT FG A + M + Y AP+ E G PK+ L+EL A
Sbjct: 297 SF-SLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWTT-PKYFKLRELLA 351
>gi|6753190|ref|NP_033882.1| beta-galactosidase precursor [Mus musculus]
gi|114944|sp|P23780.1|BGAL_MOUSE RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|192187|gb|AAA37293.1| beta-galactosidase [Mus musculus]
gi|74143070|dbj|BAE42549.1| unnamed protein product [Mus musculus]
Length = 647
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 45/296 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI+ GL V LR GP
Sbjct: 66 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEHFIQLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLVAVDKWLAVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--------------A 143
+ + A D +H G ++ D A ++
Sbjct: 186 VENEYGSYFACDYDYLRFLVHRFRYHLGNDVILFTTDGASEKMLKCGTLQDLYATVDFGT 245
Query: 144 CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
N + + P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 246 GNNITQAFLVQRKFEPKGPLINSEFYTGWLDHWGKPHSTVKTKTLATSLYNLLAR-GANV 304
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
N YM+ GGTNF T T Y APL E G + + K+ L+E+ K
Sbjct: 305 NLYMFIGGTNFAYWNGANTPYEPQPTSYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|348575339|ref|XP_003473447.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cavia
porcellus]
Length = 740
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 124/289 (42%), Gaps = 58/289 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ G Y+FSG +D+ F++ GL V LR GP
Sbjct: 142 WADRLLKMKMAGLNAIQTYVPWNFHEPQPGHYEFSGDHDVEYFLQLAHKLGLLVILRPGP 201
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + ++P ++ G P +
Sbjct: 202 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLASVDKWLGVLLPKMKPLLYQNGGPIITVQ 261
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVINACNGMRCGETFKG-- 155
+ + A D +H G ++ D GP +RCG T +G
Sbjct: 262 VENEYGSYFACDYNYLRFLQKHFHYHLGDDVLLFTTD---GP---RQEYLRCG-TLQGLY 314
Query: 156 -------------------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
P P I +E +T + WG + + + + ++ +
Sbjct: 315 ATVDFGVGSNITDAFLVQRKAEPKGPLINSEFYTGWLDHWGERHWTVKTEAVVSSLSDML 374
Query: 197 AKNGSYVNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
A+ G VN YM+ GGTNF T A T Y APL E G + E
Sbjct: 375 AQ-GXNVNMYMFIGGTNFAYWNGANTPYAAQPTSYDYDAPLSEAGDLTE 422
>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
Length = 583
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 132/299 (44%), Gaps = 61/299 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WNLHEP +G++ F +I R+I+ GLYV +R GP
Sbjct: 35 WKDRLLKLKAMGLNTVETYVAWNLHEPHEGEFHFGDWLNIERYIELAGELGLYVIVRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL + R +PY +
Sbjct: 95 YICAEWEMGGLPAWLLKDPQMKLRCMYQPYLDAVGEYFSQLMHRLVPLQSTRGGPIIAMQ 154
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
+ENEY + +E + G +L+ A D M + P +
Sbjct: 155 VENEYGSYGNDTRYLKYLEELLRQCGVDVLLFTADGVAD-------EMMQYGSLPH-LFK 206
Query: 143 ACN-GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
A N G R G+ F+ P + E W ++ WG + + RSA ++A V +
Sbjct: 207 AVNFGNRPGDAFEKLREYQTGGPLLVAEFWDGWFDHWGERHHTRSAGEVA-RVLDDLLSE 265
Query: 200 GSYVNYYMYHGGTNFG--RTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKEL 249
G+ VN YM+HGGTNFG A AF +T Y APL E G + PK+ ++E+
Sbjct: 266 GASVNLYMFHGGTNFGFMNGANAFPSPHYTPTVTSYDYDAPLSECGNIT-PKYEAMREV 323
>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
Length = 584
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 116/287 (40%), Gaps = 50/287 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I KA+ GL+ I+TYV WN H P + ++ G D+ RF+ IQ +GL +R GP
Sbjct: 35 WRDRIRKARLMGLNTIETYVAWNFHAPSRDEFHTDGARDLGRFLDIIQEEGLRAIVRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY------QTIEPAFHEKGPPYVLWA 115
+I +EW GGLP WL IV RS + Y E E +EP G P +L
Sbjct: 95 YICAEWDNGGLPTWLTATPDIVVRSSDPTYLTEVERYLEHLAPIVEPRQINHGGPIIL-- 152
Query: 116 AKMAVDFHTG----------------------VPWVMCKQ--DDA------PGPVINACN 145
M V+ G VP Q DD P
Sbjct: 153 --MQVENEYGAYGNDRAYLTHLTNVYRNLGFVVPLTTVDQPMDDMLAHGTLPDLHTTGSF 210
Query: 146 GMRCGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G R E + P + +E W ++ WG + D A + + G+ V
Sbjct: 211 GSRIDERLATLREHQTTGPLMCSEFWIGWFDHWGAHHHTTDVADAANALDRLLGA-GASV 269
Query: 204 NYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKW 243
N YM+HGGTNFG T A ++T Y APL E G E W
Sbjct: 270 NIYMFHGGTNFGFTNGANDKGVYQPLVTSYDYDAPLAEDGYPTEKYW 316
>gi|22137334|gb|AAH28875.1| Galactosidase, beta 1 [Mus musculus]
Length = 647
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 126/296 (42%), Gaps = 45/296 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY+FSG D+ FI+ GL V LR GP
Sbjct: 66 WEDRLLKMKMAGLNAIQMYVPWNFHEPQPGQYEFSGDRDVEHFIQLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLVAVDKWLAVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--------------A 143
+ + A D +H G ++ D A ++
Sbjct: 186 VENEYGSYFACDYDYLRFLVHRFRYHLGNDVILFTTDGASEKMLKCGTLQDLYATVDFGT 245
Query: 144 CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
N + + P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 246 GNNITQAFLVQRKFEPKGPLINSEFYTGWLDHWGKPHSTVKTKTLATSLYNLLAR-GANV 304
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
N YM+ GGTNF T T Y APL E G + + K+ L+E+ K
Sbjct: 305 NLYMFIGGTNFAYWNGANTPYEPQPTSYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
18170]
gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
Length = 784
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 125/302 (41%), Gaps = 66/302 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I + K G++ I YVFWN HE + G++DF+G+ D+ F + Q +YV LR GP
Sbjct: 64 WEHRIKQCKALGMNTICLYVFWNFHEEKPGEFDFTGQKDLAEFCRLCQKNDMYVILRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R D+ PY
Sbjct: 124 YVCAEWEMGGLPWWLLKKKDIRLREDD-PYFLERVAIFEKEVANQVAGLTIQKGGPIIMV 182
Query: 92 KIENEY--------------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAP 137
++ENEY + F + WA+ ++ + W M
Sbjct: 183 QVENEYGSYGESKEYVAKIRDIVRGNFGDVTLFQCDWASNFQLNALDDLVWTM------- 235
Query: 138 GPVINACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF 195
N G E F P+ P + +E W+ ++ WG R+A D+ +
Sbjct: 236 ----NFGTGANIDEQFAPLKKVRPDSPLMCSEFWSGWFDKWGANHETRAADDMIAGIDEM 291
Query: 196 IAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
++K S+ + YM HGGTN+G A A +T Y AP+ E G + PK+ L+E
Sbjct: 292 LSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGKIT-PKYEKLRET 349
Query: 250 HA 251
A
Sbjct: 350 LA 351
>gi|336415312|ref|ZP_08595652.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
gi|335940908|gb|EGN02770.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
3_8_47FAA]
Length = 778
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 124/281 (44%), Gaps = 52/281 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN+HE ++G++DFSG+NDI F + Q G+YV +R GP
Sbjct: 60 WEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R+ + Y +
Sbjct: 120 YVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPVINACN- 145
+ENEY + + PYV + + + VP C ++A +I N
Sbjct: 180 VENEYGS-----YGIDKPYVSAVRDLVRESGFSDVPLFQCDWSSNFTNNALDDLIWTVNF 234
Query: 146 --GMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G + FK P P + +E W+ ++ WG K R A+D+ + + +N S
Sbjct: 235 GTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRLAKDMVQGIKDMLDRNIS 294
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYG 236
+ + YM HGGT FG A + M + Y AP+ E G
Sbjct: 295 F-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
Length = 571
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 131/302 (43%), Gaps = 48/302 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ I Y+ WNLHE ++G +DF G D++ F GL V R GP
Sbjct: 39 WKHRLQSVVDCGLNTIDVYIPWNLHEKERGNFDFGGELDLVEFFTIAAEMGLKVLCRPGP 98
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAF----HEKGPPYVLWA 115
+I SEW +GGLP WL + RS+ Y+ + + + + P H G P + +
Sbjct: 99 YICSEWDWGGLPSWLLKDPKMHIRSNYCGYQAAVSSYFSKLLPLLAPLQHSNGGPIIAFQ 158
Query: 116 AKMAVDFHTG-----VPWV--MCKQD--------DAPGPVINACNGMRCGETFKGPNS-- 158
+ + +PW+ + K G I N ++ T P S
Sbjct: 159 VENEYGDYVDKDNEHLPWLADLMKSHGLFELFFISDGGHTIRKANMLKL--TKSTPISLK 216
Query: 159 ---PNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNF 214
PNKP + TE W ++ WG G+ + + D+ I K G+ VN+YM+HGGTNF
Sbjct: 217 SLQPNKPMLVTEFWAGWFDYWGHGRNLLNN--DVFEKTLKEILKRGASVNFYMFHGGTNF 274
Query: 215 GRTAAAFMI-TGYYD--------QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQ 265
G A + GYY P+DE G R KW IK C T ++
Sbjct: 275 GFMNGAIELEKGYYTADVTSYDYDCPVDESG-NRTEKW-------EIIKRCLDVQKTSSE 326
Query: 266 NV 267
NV
Sbjct: 327 NV 328
>gi|78048770|ref|YP_364945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78037200|emb|CAJ24945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
Length = 650
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 117/270 (43%), Gaps = 44/270 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 100 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRPGP 159
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + ++P + G P +
Sbjct: 160 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQSYLDALAKQVQPLLNHNGGPIIAVQ 219
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 220 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 279
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF--IAKNGSYVNY 205
+ P++P + E W ++ W GKP+ +A D F I + G N
Sbjct: 280 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPH--AATDARQQAEEFEWILRQGHSANL 336
Query: 206 YMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
YM+ GGT+FG FM + P D Y
Sbjct: 337 YMFIGGTSFG-----FMNGANFQNNPSDHY 361
>gi|432108623|gb|ELK33326.1| Beta-galactosidase [Myotis davidii]
Length = 739
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 121/283 (42%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY FS +D+ FI+ GL V LR GP
Sbjct: 70 WQDRLLKMKMAGLNAIQIYVPWNFHEPQPGQYQFSEEHDVEHFIQLAHELGLLVILRPGP 129
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + ++P ++ G P +
Sbjct: 130 YICAEWEMGGLPAWLLEKENIVLRSSDPDYLAAVDTWLGVILPKMKPLLYQNGGPIITVQ 189
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + + D +H G V+ D ++ A G+ F G
Sbjct: 190 VENEYGSYFSCDYDYLRFLQKRFHYHLGNDVVLFTTDGEMEKLMQCGALQGLYATVDF-G 248
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P I +E +T + W G+P+ ++ I G+
Sbjct: 249 PGANITKAFLIQRKYEPKGPLINSEFYTGWLDHW-GQPHSTVKTEVVASSLQDILARGAN 307
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNFG A M T Y APL E G + E
Sbjct: 308 VNLYMFIGGTNFGYWNGANMPYQPQPTSYDYDAPLSEAGDLTE 350
>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
Length = 780
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 127/303 (41%), Gaps = 67/303 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN+HE ++GQ+DF+G ND+ F + G+YV +R GP
Sbjct: 59 WEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFTGNNDVAAFCRLAHKNGMYVIVRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL + R D+ PY
Sbjct: 119 YVCAEWEMGGLPWWLLKKKDVRLREDD-PYFMARVKAFEAEVGRQLAPLTIQNGGPIIMV 177
Query: 92 KIENEYQT----------IEPAFHEKGPPYVL-----WAAKMAVDFHTGVPWVMCKQDDA 136
++ENEY + I G V WA+ + + W M
Sbjct: 178 QVENEYGSYGINKKYVSEIRDIVKASGFDKVTLFQCDWASNFEHNGLDDLVWTM------ 231
Query: 137 PGPVINACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
N G E F+ P P + +E W+ ++ WG + R A+D+ +
Sbjct: 232 -----NFGTGANIDEQFRRLKQLRPEAPLMCSEFWSGWFDKWGARHETRPAKDMVEGIDE 286
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKE 248
+ K S+ + YM HGGT+FG A A +T Y AP++EYG+ PK+ L+
Sbjct: 287 MLRKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGM-PTPKFFALRN 344
Query: 249 LHA 251
A
Sbjct: 345 TMA 347
Score = 39.7 bits (91), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 1/50 (2%)
Query: 510 WSSIRSPTRQLTWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
++ +R P + + +Y+ F D LNL+ GKG+ +VNG ++GR+W
Sbjct: 528 FAPVRLPKQNIGYYRGYFDLKKTGDTF-LNLEQWGKGQVYVNGHALGRFW 576
>gi|325925751|ref|ZP_08187124.1| beta-galactosidase [Xanthomonas perforans 91-118]
gi|325543808|gb|EGD15218.1| beta-galactosidase [Xanthomonas perforans 91-118]
Length = 611
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 115/268 (42%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 61 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + ++P + G P +
Sbjct: 121 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQSYLDALAKQVQPLLNHNGGPIIAVQ 180
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 181 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 240
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 241 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSANLYM 299
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM + P D Y
Sbjct: 300 FIGGTSFG-----FMNGANFQNNPSDHY 322
>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
Length = 613
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 114/269 (42%), Gaps = 40/269 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ ++P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 243 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSANLYM 301
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEYG 236
+ GGT+FG FM + P D Y
Sbjct: 302 FIGGTSFG-----FMNGANFQNNPSDHYA 325
>gi|189096261|pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
Thetaiotaomicron
Length = 612
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 122/293 (41%), Gaps = 51/293 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G + I YVFWN HEP++G+YDF+G+ DI F + Q G YV +R GP
Sbjct: 39 WEHRIKXCKALGXNTICLYVFWNFHEPEEGRYDFAGQKDIAAFCRLAQENGXYVIVRPGP 98
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL I R + Y +
Sbjct: 99 YVCAEWEXGGLPWWLLKKKDIKLREQDPYYXERVKLFLNEVGKQLADLQISKGGNIIXVQ 158
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD--------DAPGPVINAC 144
+ENEY AF P + TGVP C + D IN
Sbjct: 159 VENEYG----AFGIDKPYISEIRDXVKQAGFTGVPLFQCDWNSNFENNALDDLLWTINFG 214
Query: 145 NGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G E FK P+ P +E W+ ++ WG K RSA+++ + +N S+
Sbjct: 215 TGANIDEQFKRLKELRPDTPLXCSEFWSGWFDHWGAKHETRSAEELVKGXKEXLDRNISF 274
Query: 203 VNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ Y HGGT+FG A T Y AP++E G V PK+ ++ L
Sbjct: 275 -SLYXTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKVT-PKYLEVRNL 325
>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
Length = 613
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/314 (28%), Positives = 130/314 (41%), Gaps = 47/314 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ ++P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 243 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSANLYM 301
Query: 208 YHGGTNFG-RTAAAF----------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLC 256
+ GGT+FG A F T Y A LDE G PK+ +++ A +
Sbjct: 302 FIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYDAILDEAGHP-TPKFALMRDAIARVTGI 360
Query: 257 SRPLLTGTQNVISL 270
P L T +L
Sbjct: 361 QPPALPATIATTTL 374
>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 613
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 114/268 (42%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ ++P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 243 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSANLYM 301
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM + P D Y
Sbjct: 302 FIGGTSFG-----FMNGANFQNNPSDHY 324
>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
Length = 613
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 114/268 (42%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ ++P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 243 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSANLYM 301
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM + P D Y
Sbjct: 302 FIGGTSFG-----FMNGANFQNNPSDHY 324
>gi|119588243|gb|EAW67839.1| hCG1729998, isoform CRA_d [Homo sapiens]
Length = 653
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 132/297 (44%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYFDHLIPRVIPLQYRQAGPVIAVQ 223
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCG--------E 151
F+ G+ ++ D + G+ +
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQD 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSVSATPLP 398
>gi|395520729|ref|XP_003764476.1| PREDICTED: beta-galactosidase-1-like protein 2 [Sarcophilus
harrisii]
Length = 704
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 134/298 (44%), Gaps = 55/298 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TY+ WNLHEP++G+++FSG D+ F++ GL+V LR GP
Sbjct: 146 WRDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSGNLDVEAFVQMAADIGLWVILRPGP 205
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW GGLP WL + + R+ + +
Sbjct: 206 YICSEWDLGGLPSWLLQDSSMELRTTYAGFLKAVDRYFNHLIPRVVPLQYKQGGPIIAVQ 265
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + ++K Y+ + K + G+ ++ D+ G G+
Sbjct: 266 VENEYGS-----YDKDSNYMPYIKKALMS--RGINELLMTSDNKDGLSGGYLEGVLATVN 318
Query: 153 FKGPNS----------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
K +S NKP++ TE WT ++ WGG I A D+ V+ I + G+
Sbjct: 319 LKHVDSMIFNYLHSFQENKPTMVTEYWTGWFDTWGGPHNIVDADDVVVTVSSII-QMGAS 377
Query: 203 VNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
+N YM+HGGTNFG A +T Y A L E G PK+ L+E + I
Sbjct: 378 LNLYMFHGGTNFGFMNGAQHFGEYLADVTSYDYDAILTEAG-DYTPKFFKLREFFSTI 434
>gi|397498227|ref|XP_003819886.1| PREDICTED: beta-galactosidase-1-like protein 3 [Pan paniscus]
Length = 653
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 132/297 (44%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 223
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCG--------E 151
F+ G+ ++ D + G+ +
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQD 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFNQLHKIQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSVSATPLP 398
>gi|348573619|ref|XP_003472588.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Cavia
porcellus]
Length = 880
Score = 107 bits (268), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 130/304 (42%), Gaps = 57/304 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 322 WRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLLAAEIGLWVILRPGP 381
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E GGLP WL G+ R+ + + +
Sbjct: 382 YICAEIDLGGLPSWLLQDPGMKLRTTYQGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 441
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + + P Y+ + K D G+ ++ D+ G +G+
Sbjct: 442 VENEYGS-----YNRDPAYMPYIKKALED--RGIIELLLTSDNKDGLQKGVVHGVLATIN 494
Query: 153 FKGPNS------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ N+P + E WT ++ WGG I + ++ V+ I G
Sbjct: 495 LQSQQELQSLTTSLLSVQGNQPKMVMEYWTGWFDSWGGPHNILDSSEVLDTVSA-ITNAG 553
Query: 201 SYVNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
S +N YM+HGGTNFG A +T Y A L E G K+G L++ ++
Sbjct: 554 SSINLYMFHGGTNFGFINGAMHFNDYKSDVTSYDYDAVLTEAGDYTA-KYGKLRDFFGSL 612
Query: 254 KLCS 257
S
Sbjct: 613 SGAS 616
>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 613
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 114/268 (42%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ ++P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 243 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSANLYM 301
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM + P D Y
Sbjct: 302 FIGGTSFG-----FMNGANFQNNPSDHY 324
>gi|296475022|tpg|DAA17137.1| TPA: galactosidase, beta 1 precursor [Bos taurus]
Length = 653
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 131/291 (45%), Gaps = 45/291 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HE Q G+Y+FSG +D+ FI+ GL V LR GP
Sbjct: 64 WKDRLLKMKMAGLNAIQTYVAWNFHELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + P ++ G P +
Sbjct: 124 YICAEWDMGGLPAWLLEKKSIVLRSSDPDYLAAVDKWLGVLLPKMRPLLYKNGGPIITVQ 183
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFK- 154
+ + ++ D+ H G ++ D ++ A G+ F
Sbjct: 184 VENEYGSYLSCDYDYLRFLQKRFHDHLGEDVLLFTTDGVNERLLQCGALQGLYATVDFSP 243
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P + +E +T + WG + S++ +AF + +A G+ V
Sbjct: 244 GTNLTAAFMLQRKFEPTGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLAL-GANV 302
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GGTNF A + T Y APL E G + E K+ L+++
Sbjct: 303 NMYMFIGGTNFAYWNGANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
intestinalis]
Length = 658
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/308 (29%), Positives = 135/308 (43%), Gaps = 56/308 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ I+TYV WNLHEP G+Y+F+G D++ FI YV LR GP
Sbjct: 89 WRDRLMKMKACGLNTIETYVPWNLHEPIPGKYNFTGDLDLVHFILLAHKLEFYVLLRPGP 148
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW +GGLP WL + R+ PY +
Sbjct: 149 YICSEWEFGGLPSWLLRDPKMKVRTMYPPYIAAVTKYFNYLLPFVKPLQYQYGGPIIAFQ 208
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
++NEY + ++ KG +L+ + D G+ +Q PG V+
Sbjct: 209 LDNEYGSYFKDADYLPYLKEFLQNKGIIELLFIS----DSIEGL-----RQQTIPG-VLK 258
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
N R F ++ P+ P + E WT ++ WG K +I + Q+ + ++ G
Sbjct: 259 TVNFKRMENHFTDLSNMQPDAPLMVMEFWTGWFDWWGEKHHILTVQEFGETLNEIFSQGG 318
Query: 201 SYVNYYMYHGGTNFGRTAAAFMI-TGYY-DQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
S VN+YM+ GGTNFG A+ TG++ D D L+ E G L E + K
Sbjct: 319 S-VNFYMFFGGTNFGFMNGAYKDGTGFHADITSYDYDALIAEN--GDLTEKYFKAKQIIE 375
Query: 259 PLLTGTQN 266
GT +
Sbjct: 376 HYFPGTTD 383
>gi|301763008|ref|XP_002916930.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Ailuropoda
melanoleuca]
Length = 688
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 116/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 130 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 189
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL +G+ R+ K + +
Sbjct: 190 YICSEIDLGGLPSWLLQDSGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 249
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + + P Y+ + K D G+ ++ D+ G +G+
Sbjct: 250 VENEYGS-----YNRDPAYMPYIKKALED--RGIVELLLTSDNKDGLQKGVMDGVLATIN 302
Query: 153 FKGPNSPN------------KPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 303 LQSQHELQLLTNFLLSVQRVQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-ILDAG 361
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 362 SSINLYMFHGGTNFGFINGAMHFHEY 387
>gi|158455090|gb|AAI40686.2| Galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 131/291 (45%), Gaps = 45/291 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HE Q G+Y+FSG +D+ FI+ GL V LR GP
Sbjct: 64 WKDRLLKMKMAGLNAIQTYVAWNFHELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + P ++ G P +
Sbjct: 124 YICAEWDMGGLPAWLLEKKSIVLRSSDPDYLAAVDKWLGVLLPKMRPLLYKNGGPIITVQ 183
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFK- 154
+ + ++ D+ H G ++ D ++ A G+ F
Sbjct: 184 VENEYGSYLSCDYDYLRFLQKRFHDHLGEDVLLFTTDGVNERLLQCGALQGLYATLDFSP 243
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P + +E +T + WG + S++ +AF + +A G+ V
Sbjct: 244 GTNLTAAFMLQRKFEPTGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLAL-GANV 302
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GGTNF A + T Y APL E G + E K+ L+++
Sbjct: 303 NMYMFIGGTNFAYWNGANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|157824103|ref|NP_001101662.1| beta-galactosidase precursor [Rattus norvegicus]
gi|149018351|gb|EDL76992.1| galactosidase, beta 1 (mapped) [Rattus norvegicus]
Length = 647
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/286 (30%), Positives = 117/286 (40%), Gaps = 52/286 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD IQTYV WN HEPQ GQYDFSG D+ FI+ GL V LR GP
Sbjct: 66 WEDRLLKMKMAGLDAIQTYVPWNFHEPQPGQYDFSGDRDVEHFIQLAHQLGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPA----FHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y ++ + P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLEKESIVLRSSDPDYLAAVDKWLAVLLPKMKRLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAP------------------GP 139
+ + A D +H G ++ D A G
Sbjct: 186 VENEYGSYFACDYNYLRFLEHRFRYHLGNDIILFTTDGAAEKLLKCGTLQDLYATVDFGT 245
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
N F+ P P I +E +T + W G+P+ + +
Sbjct: 246 TGNITRAFLIQRNFE----PKGPLINSEFYTGWLDHW-GQPHSKVNTKKLVASLYNLLAY 300
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
G+ VN YM+ GGTNF A M T Y APL E G + E
Sbjct: 301 GASVNLYMFIGGTNFAYWNGANMPYAPQPTSYDYDAPLSEAGDLTE 346
>gi|423342145|ref|ZP_17319860.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
gi|409219016|gb|EKN11981.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
Length = 779
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 128/297 (43%), Gaps = 55/297 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y FWN+HE + G++DFSG+NDI F + Q +Y+ LR GP
Sbjct: 63 WEHRIQLCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKYDMYIMLRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ SEW GGLP WL I R+ N PY
Sbjct: 123 YVCSEWEMGGLPWWLLKKDDIKLRT-NDPYFLERTKLFMNEIGKQLADLQITKGGNIIMV 181
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDF-HTGVPWVMCK-----QDDAPGPV---IN 142
++ENEY + + Y+ + T VP C Q++A + IN
Sbjct: 182 QVENEYGS-----YATDKEYIANIRDIVKGAGFTDVPLFQCDWSSNFQNNALDDLVWTIN 236
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G E FK PN P + +E W+ ++ WG K R A+ + + + +
Sbjct: 237 FGTGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLDRGI 296
Query: 201 SYVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHA 251
S+ + YM HGGT FG A + M + Y AP+ E G PK+ L+EL A
Sbjct: 297 SF-SLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWTT-PKYFKLRELLA 351
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 31/48 (64%), Gaps = 4/48 (8%)
Query: 515 SPTRQL---TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+P ++L +Y+ TF D + L++Q+ GKG WVNG++IGR+W
Sbjct: 523 APGKKLDGPAYYRATFNLEEAGD-VFLDMQTWGKGMVWVNGKAIGRFW 569
>gi|332838248|ref|XP_001156615.2| PREDICTED: galactosidase, beta 1-like 3 [Pan troglodytes]
Length = 653
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/297 (27%), Positives = 132/297 (44%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 223
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCG--------E 151
F+ G+ ++ D + G+ +
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQD 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSVSATPLP 398
>gi|261406481|ref|YP_003242722.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261282944|gb|ACX64915.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 619
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 130/278 (46%), Gaps = 46/278 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN+HEPQ+G+++FSG D+ FI+ GL+V +R P
Sbjct: 35 WEDRLLKLKACGFNTVETYIAWNVHEPQEGEFNFSGMADVASFIELAGKLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVLWA 115
FI +EW +GGLP WL I R + Y K+++ Y + P G P + A
Sbjct: 95 FICAEWEFGGLPGWLLGYGEIRLRCSDPLYLSKVDHYYDELIPQLVPLLSTHGGP--ILA 152
Query: 116 AKMAVDF------HTGVPW-----------VMCKQDDAP------GPVINACN-----GM 147
++ ++ H + + V+ D P G ++ + G
Sbjct: 153 VQVENEYGSYGNDHAYLEYLREGLVRRGVDVLLFTSDGPTDEMLLGGTLSDVHATVNFGS 212
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
R E+F+ +P + E W ++ W ++R A D+A V + + GS +N
Sbjct: 213 RVEESFRKYREYRAEEPLMVMEFWNGWFDHWMEDHHVRDAADVA-GVLDEMLEMGSSMNM 271
Query: 206 YMYHGGTNFGRTAAAFMITGY------YD-QAPLDEYG 236
YM+HGGTNFG + A I Y YD APL E+G
Sbjct: 272 YMFHGGTNFGFYSGANHIQAYEPTTTSYDYDAPLTEWG 309
>gi|78042544|ref|NP_001030215.1| beta-galactosidase precursor [Bos taurus]
gi|75057630|sp|Q58D55.1|BGAL_BOVIN RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|61554628|gb|AAX46589.1| galactosidase, beta 1 [Bos taurus]
gi|148839051|dbj|BAF64285.1| galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 131/291 (45%), Gaps = 45/291 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HE Q G+Y+FSG +D+ FI+ GL V LR GP
Sbjct: 64 WKDRLLKMKMAGLNAIQTYVAWNFHELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + P ++ G P +
Sbjct: 124 YICAEWDMGGLPAWLLEKKSIVLRSSDPDYLAAVDKWLGVLLPKMRPLLYKNGGPIITVQ 183
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFK- 154
+ + ++ D+ H G ++ D ++ A G+ F
Sbjct: 184 VENEYGSYLSCDYDYLRFLQKRFHDHLGEDVLLFTTDGVNERLLQCGALQGLYATVDFSP 243
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P + +E +T + WG + S++ +AF + +A G+ V
Sbjct: 244 GTNLTAAFMLQRKFEPTGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLAL-GANV 302
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GGTNF A + T Y APL E G + E K+ L+++
Sbjct: 303 NMYMFIGGTNFAYWNGANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|57619080|ref|NP_001009860.1| beta-galactosidase precursor [Felis catus]
gi|5915775|sp|O19015.1|BGAL_FELCA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|2547317|gb|AAB81350.1| lysosomal beta-galactosidase [Felis catus]
Length = 669
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 121/279 (43%), Gaps = 46/279 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ GQY FSG +D+ F+K GL V LR GP
Sbjct: 66 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGQYQFSGEHDVEYFLKLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + D+ H G ++ D A + A G+ F G
Sbjct: 186 VENEYGSYFTCDYDYLRFLQRRFRDHLGGDVLLFTTDGAHEKFLQCGALQGIYATVDF-G 244
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P++ P P + +E +T + W G+P+ R ++ + +G+
Sbjct: 245 PDANITAAFQIQRKSEPRGPLVNSEFYTGWLDHW-GQPHSRVRTEVVASSLHDVLAHGAN 303
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYG 236
VN YM+ GGTNF A + T Y APL E G
Sbjct: 304 VNLYMFIGGTNFAYWNGANIPYQPQPTSYDYDAPLSEAG 342
>gi|320109257|ref|YP_004184847.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319927778|gb|ADV84853.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 640
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 143/331 (43%), Gaps = 66/331 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KAK GL+ I TYVFWN+HEP+ G YDF+G+ND+ ++ Q GL V LR GP
Sbjct: 57 WDDAMQKAKALGLNAITTYVFWNVHEPRPGVYDFTGQNDLGEYLAAAQRAGLKVILRPGP 116
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN------------------KPY-----------K 92
+ +EW +GG P WL +V RS + +PY +
Sbjct: 117 YACAEWEFGGYPAWLIKDPTVVVRSSDPKFMKPVAKWFHRLGQEVQPYLAANGGPIIAVQ 176
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAA------KMAVD-------------FHTGVPWVMC 131
+ENEY + + A+ E+ V+ + K AVD +T V
Sbjct: 177 VENEYGSFGNDHAYMEQMKDLVISSGIGGKNPKKAVDEDGKNVPQDTGTMLYTADGGVQL 236
Query: 132 KQDDAP--GPVINACNGMRCGETFKGPN-SPNKPSIWTEDWTSFYQVWGGK-PYIRSAQD 187
P V+N G E + PN P + E W ++ WG +A+
Sbjct: 237 PNGTLPELPAVVNFGGGQAKSELARYEAFRPNGPRMVGEYWAGWFDHWGNNHQKTNAAEQ 296
Query: 188 IAFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLV 238
+A + ++ K G V+ YM +GGT+FG A A +T Y AP+DE G
Sbjct: 297 VAEYE--YMLKRGYSVSLYMLYGGTSFGWMAGANSGDKAPYEPDVTSYDYDAPIDERG-N 353
Query: 239 REPKWGHLKELHAAIKLCSRPLLTGTQNVIS 269
PK+ L+E+ + + P + T ++
Sbjct: 354 PTPKYFALREVIQRVTGITPPPVPETAATVA 384
>gi|281337337|gb|EFB12921.1| hypothetical protein PANDA_005062 [Ailuropoda melanoleuca]
Length = 609
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 116/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 52 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL +G+ R+ K + +
Sbjct: 112 YICSEIDLGGLPSWLLQDSGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 171
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + + P Y+ + K D G+ ++ D+ G +G+
Sbjct: 172 VENEYGS-----YNRDPAYMPYIKKALED--RGIVELLLTSDNKDGLQKGVMDGVLATIN 224
Query: 153 FKGPNSPN------------KPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 225 LQSQHELQLLTNFLLSVQRVQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-ILDAG 283
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 284 SSINLYMFHGGTNFGFINGAMHFHEY 309
>gi|344291569|ref|XP_003417507.1| PREDICTED: beta-galactosidase-1-like protein 2 [Loxodonta africana]
Length = 650
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 114/266 (42%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 92 WRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIWMAAELGLWVILRPGP 151
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL + R+ K + +
Sbjct: 152 YICSEIDLGGLPSWLLQDPNMKLRTTYKGFTEAVDLYFDHLIARVVPLQYKLGGPIIAVQ 211
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + K D G+ ++ D+ G +G
Sbjct: 212 VENEYGS-----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGVIHGVLATIN 264
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 265 LQSQQELHLLTTFLLNAQGIQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSAIIDA-G 323
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 324 SSINLYMFHGGTNFGFINGAMHFNEY 349
>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 775
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/314 (28%), Positives = 135/314 (42%), Gaps = 70/314 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+ GL+ + YVFWN HE Q G +DFSG+ DI F++ Q +GLYV LR GP
Sbjct: 61 WRDRLHRARAMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + +RS + + +
Sbjct: 121 YVCAEWDFGGYPSWLLKEKDLTYRSKDPRFMSYCERYIKELGKQLAPLTINNGGNIIMVQ 180
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M + VP C D G V +
Sbjct: 181 VENEYGS-----YAADKEYLAAIRDMLQEAGFNVPLFTC---DGGGQVEAGHIAGALPTL 232
Query: 145 NGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + P P E + +++ WG + Y R A+ + + +
Sbjct: 233 NGVFGEDIFKIVDKYHPGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLG----- 287
Query: 199 NGSYVNYYMYHGGTNF-----GRTAAAF--MITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+G V+ YM+HGGTNF T+ F T Y APL E+G PK+ HA
Sbjct: 288 HGVSVSMYMFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWGNCY-PKY------HA 340
Query: 252 AIKLCSRPLLTGTQ 265
++ + L GTQ
Sbjct: 341 FREIIQKYLPEGTQ 354
>gi|291410639|ref|XP_002721600.1| PREDICTED: galactosidase, beta 1-like [Oryctolagus cuniculus]
Length = 635
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 116/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 78 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL +G+ R+ K + +
Sbjct: 138 YICSEIDLGGLPSWLLQDSGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K P Y+ + + D G+ ++ D+ G G
Sbjct: 198 VENEYGS-----YNKDPAYMPYIKRALED--RGIVELLLTSDNKDGLSKGVVPGVMATIN 250
Query: 147 ------MRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ TF +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 251 LQSHAELQSLTTFLLSVKGIQPKMVMEYWTGWFDSWGGPHNILDSSEVLQTVSA-IVDAG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
+ +N YM+HGGTNFG A Y
Sbjct: 310 ASINLYMFHGGTNFGFINGAMHFQEY 335
>gi|2623150|gb|AAB86405.1| mutant lysosomal beta-galactosidase [Felis catus]
Length = 669
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 121/279 (43%), Gaps = 46/279 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ GQY FSG +D+ F+K GL V LR GP
Sbjct: 66 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGQYQFSGEHDVEYFLKLAHELGLLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 126 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 185
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + D+ H G ++ D A + A G+ F G
Sbjct: 186 VENEYGSYFTCDYDYLRFLQRRFRDHLGGDVLLFTTDGAHEKFLQCGALQGIYATVDF-G 244
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P++ P P + +E +T + W G+P+ R ++ + +G+
Sbjct: 245 PDANITAAFQIQRKSEPRGPLVNSEFYTGWLDHW-GQPHSRVRTEVVASSLHDVLAHGAN 303
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYG 236
VN YM+ GGTNF A + T Y APL E G
Sbjct: 304 VNLYMFIGGTNFAYWNGANIPYQPQPTSYDYDAPLSEAG 342
>gi|62510424|sp|Q60HF6.1|BGAL_MACFA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|52782225|dbj|BAD51959.1| galactosidase, beta 1 [Macaca fascicularis]
gi|67970838|dbj|BAE01761.1| unnamed protein product [Macaca fascicularis]
Length = 682
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 120/283 (42%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNTIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKEAILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + A DF H G V+ D A + A G+ F G
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFHHHLGDDVVLFTTDGAHETFLQCGALQGLYTTVDF-G 243
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P S P P I +E +T + W G+P+ ++ I G+
Sbjct: 244 PGSNITDAFQIQRKCEPKGPLINSEFYTGWLDHW-GQPHSTIKTEVVASSLYDILARGAS 302
Query: 203 VNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF + A T Y APL E G + E
Sbjct: 303 VNLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
Length = 584
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 135/307 (43%), Gaps = 50/307 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + ++TYV WN+HEPQ+G++DFS D+ RFI+ Q GLYV LR P
Sbjct: 34 WRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQLAQEVGLYVILRPAP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL + R D P+ +
Sbjct: 94 YICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQVSDLQITQEGPILMMQ 153
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD----DAPGPVINACNG 146
+ENEY + + ++ K + F + PW+ ++ D P IN G
Sbjct: 154 VENEYGSYGNDKSYLRKSAELMRHNGIDVSLFTSDGPWLDMLENGSIKDIALPTINC--G 211
Query: 147 MRCGETFKGPNS---PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
E F+ +P + E W ++ WG + ++ A + + GS V
Sbjct: 212 SDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHTTSVTDAANELRDCLEAGS-V 270
Query: 204 NYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI-KL 255
N YM+HGGTNFG A +T Y A L E+G V PK+ +++ I ++
Sbjct: 271 NIYMFHGGTNFGFMNGANYYEKLSPDVTSYDYDALLSEWGDVT-PKYEAFQQVIGEITEI 329
Query: 256 CSRPLLT 262
S PL T
Sbjct: 330 PSFPLTT 336
>gi|332264040|ref|XP_003281056.1| PREDICTED: beta-galactosidase-1-like protein 3 [Nomascus
leucogenys]
Length = 655
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 137/297 (46%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNMDLEAFVLMAAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPQLLLRTTNKGFIEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 223
Query: 115 AAKMAVDFH---TGVPWV-----------MCKQDDAPGPVIN--------ACNGMRCGE- 151
F+ T +P++ + D V++ A N + +
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQN 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFSQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATYFGKHTGIVTSYDYDAVLTEAGDYTE-KYFKLQKLFESVSATPLP 398
>gi|1911627|gb|AAB50770.1| beta-galactosidase [dogs, spleen, Peptide Partial, 667 aa]
Length = 667
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 125/292 (42%), Gaps = 47/292 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ GQY FSG D+ FIK GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGQYQFSGEQDVEYFIKLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWDMGGLPAWLLLKESIILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITMQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + D+ H G ++ D A + A G+ F G
Sbjct: 185 VENEYGSYFTCDYDYLRFLQKLFHHHLGNDVLLFTTDGANELFLQCGALQGLYATVDF-G 243
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P + +E +T + W G+P+ ++ I +G+
Sbjct: 244 PGANITAAFQIQRKSEPKGPLVNSEFYTGWLDHW-GQPHSTVRTEVVASSLHDILAHGAN 302
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKEL 249
VN YM+ GGTNF A M T Y APL E + E K+ L+E+
Sbjct: 303 VNLYMFIGGTNFAYWNGANMPYQAQPTSYDYDAPLSEAADLTE-KYFALREV 353
>gi|440904150|gb|ELR54700.1| Beta-galactosidase, partial [Bos grunniens mutus]
Length = 659
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 131/291 (45%), Gaps = 45/291 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HE Q G+Y+FSG +D+ FI+ GL V LR GP
Sbjct: 70 WKDRLLKMKMAGLNAIQTYVAWNFHELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGP 129
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + P ++ G P +
Sbjct: 130 YICAEWDMGGLPAWLLEKKSIVLRSSDPDYLAAVDKWLGVLLPKMRPLLYKNGGPIITVQ 189
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFK- 154
+ + ++ D+ H G ++ D ++ A G+ F
Sbjct: 190 VENEYGSYLSCDYDYLRFLQKRFHDHLGEDVLLFTTDGVNERLLQCGALQGLYATVDFSP 249
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P + +E +T + WG + S++ +AF + +A G+ V
Sbjct: 250 GTNLTAAFMLQRKFEPTGPLVNSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLAL-GANV 308
Query: 204 NYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GGTNF A + T Y APL E G + E K+ L+++
Sbjct: 309 NMYMFIGGTNFAYWNGANIPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 358
>gi|149027890|gb|EDL83350.1| similar to Hypothetical protein MGC47419 (predicted) [Rattus
norvegicus]
Length = 394
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 133/298 (44%), Gaps = 48/298 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 94 WRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIWLAAKIGLWVILRPGP 153
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYVLWA 115
+I SE GGLP WL + R+ + ++ + P ++ G P + A
Sbjct: 154 YICSEIDLGGLPSWLLQDPDMKLRTTYPGFTKAVDLYFDHLMSRVVPLQYKHGGPII--A 211
Query: 116 AKMAVDF------HTGVPWV------------MCKQDDAPGPVINACNG------MRCGE 151
++ ++ H +P++ + D+ G +G ++ +
Sbjct: 212 VQVENEYGSYNGDHAYMPYIKKALEDRGIIEMLLTSDNKDGLEKGVVDGVLATINLQSQQ 271
Query: 152 TFKGPNS------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
NS +P + E WT ++ WGG I + ++ V+ I K+GS +N
Sbjct: 272 ELVALNSILLSIQGIQPKMVMEYWTGWFDSWGGSHNILDSSEVLQTVSAII-KDGSSINL 330
Query: 206 YMYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVR---EPKWGHLKELHAAIKLCSRPL 260
YM+HGGTNFG A Y +A + YG +R + W LH I SR L
Sbjct: 331 YMFHGGTNFGFINGAMHFGDY--KADVTSYGKLRCYIDRGW----RLHCQIHQASRTL 382
>gi|219847209|ref|YP_002461642.1| beta-galactosidase [Chloroflexus aggregans DSM 9485]
gi|219541468|gb|ACL23206.1| Beta-galactosidase [Chloroflexus aggregans DSM 9485]
Length = 898
Score = 107 bits (266), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 122/274 (44%), Gaps = 35/274 (12%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W L+ +A+ GL+ I T + WN HEPQ G +DF+ D+ F+ GL V +R GP
Sbjct: 36 WRPLLEQARWAGLNTIDTVIPWNRHEPQPGVFDFADEADLGAFLDLCHDLGLKVIVRPGP 95
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF----HEKGPPYVL-- 113
+I +EW GGLP WL + R+++ + + + T+ P H +G P +L
Sbjct: 96 YICAEWENGGLPAWLTANGDLRLRTNDPVFLSAVLRWFDTLMPILVPRQHTRGGPIILCQ 155
Query: 114 -----WA-------------AKMAVDFHTGVPWVMCKQDDAPGPVI-NACNGMRCGETFK 154
WA A+ A + VP C P N +G+
Sbjct: 156 IENEHWASGVYGADEHQQTLARAAFERGIEVPQYTCMGATPGYPEFRNGWSGIAEKLVQT 215
Query: 155 GPNSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIAKNGSYVNYYMYHGGTN 213
P+ P I +E W+ ++ WGG R SA + + A + +++M+ GGTN
Sbjct: 216 RQLWPDNPLIVSELWSGWFDNWGGHRQTRKSAAKLDMILHQLTAVGCAGFSHWMWAGGTN 275
Query: 214 F----GRTAAA---FMITGYYDQAPLDEYGLVRE 240
F GRT M TGY AP+DEYG + E
Sbjct: 276 FGYWGGRTVGGDLIHMTTGYDYDAPIDEYGRLTE 309
>gi|33338028|gb|AAQ13636.1|AF173889_1 MSTP114 [Homo sapiens]
gi|22760318|dbj|BAC11149.1| unnamed protein product [Homo sapiens]
Length = 552
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 115/258 (44%), Gaps = 49/258 (18%)
Query: 10 KEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTY 69
K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP+I SE
Sbjct: 2 KACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDL 61
Query: 70 GGLPIWLHDVAGIVFRSDNKPY-----------------------------KIENEYQTI 100
GGLP WL G+ R+ K + ++ENEY +
Sbjct: 62 GGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKRGGPIIAVQVENEYGS- 120
Query: 101 EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG----------PVIN--ACNGMR 148
+ K P Y+ + K D G+ ++ D+ G IN + + ++
Sbjct: 121 ----YNKDPAYMPYVKKALED--RGIVELLLTSDNKDGLSKGIVQGVLATINLQSTHELQ 174
Query: 149 CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
TF +P + E WT ++ WGG I + ++ V+ I GS +N YM+
Sbjct: 175 LLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-IVDAGSSINLYMF 233
Query: 209 HGGTNFGRTAAAFMITGY 226
HGGTNFG A Y
Sbjct: 234 HGGTNFGFMNGAMHFHDY 251
>gi|167750408|ref|ZP_02422535.1| hypothetical protein EUBSIR_01382 [Eubacterium siraeum DSM 15702]
gi|167656559|gb|EDS00689.1| glycosyl hydrolase family 35 [Eubacterium siraeum DSM 15702]
Length = 579
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 137/323 (42%), Gaps = 61/323 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WN HE +KG ++++G +DI RFI+ GLY+ +R P
Sbjct: 34 WQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWNGMHDICRFIELADKLGLYMIIRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW +GGLP WL + R KPY +
Sbjct: 94 YICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDSYYSVLMPKLAPYQIDNGGNIIMMQ 153
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
IENEY ++ Y+ + + VP+V D P +GM G
Sbjct: 154 IENEY-----GYYGNDTSYLEFLRDTMRKYGITVPFVTS---DGPWSEFVFKSGMVDGAL 205
Query: 153 FKGPN---------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
G +KP + E W ++ VWG + I + + A + + +
Sbjct: 206 PTGNFGSSAEWQFGEMRRFIGEDKPLMCMEFWNGWFDVWGEEHNITAPEKAAQELDILL- 264
Query: 198 KNGSYVNYYMYHGGTNFGRTAA------AFMITGYYDQAPLDEYGLVREPKWGHLKELHA 251
KNGS +N+YM+ GGTNFG + ++T Y APL E G + E K+ KE+ +
Sbjct: 265 KNGS-MNFYMFEGGTNFGFMSGKNNEKKTGIVTSYDYDAPLTEDGRITE-KYEKCKEVIS 322
Query: 252 AIKLCSRPLLTGTQNVISLGQLQ 274
+ LT + G+++
Sbjct: 323 RYTDINEVPLTTQIRRLEYGEIR 345
>gi|332672111|ref|YP_004455119.1| beta-galactosidase [Cellulomonas fimi ATCC 484]
gi|332341149|gb|AEE47732.1| Beta-galactosidase [Cellulomonas fimi ATCC 484]
Length = 583
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 122/277 (44%), Gaps = 45/277 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A+E GL+ I+TY+ WN H P +G++ G D+ RF+ E+ +QG++ +R GP
Sbjct: 35 WRDRLTRARELGLNTIETYIPWNAHSPARGEFRTDGILDLGRFLDEVAAQGMWAIVRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYVL-- 113
+I +EWT GGLP WL AG R Y I++ Y+ + P ++G P VL
Sbjct: 95 YICAEWTGGGLPGWLF-TAGAAVRRHEPTYLAAIQDYYEAVAGIVAPRQVDRGGPVVLVQ 153
Query: 114 --------------WAAKMAVDFHTGV-----------PWVMCKQDDAPGPVINACNGMR 148
A + + +G+ PW M + P G R
Sbjct: 154 VENEYGAYGDDKDYLRALVKLLRESGITTPLTTIDQPEPW-MLENGSLPELHKTGSFGSR 212
Query: 149 CGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
E + P P + E W ++ WG + A A + +A G+ VN Y
Sbjct: 213 AAERLATLREHQPTGPLMCAEFWDGWFDSWGLHHHTTDAAASAHELDTLLAA-GASVNLY 271
Query: 207 MYHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
M GGTNFG T A ++T Y APLDE G
Sbjct: 272 MVCGGTNFGFTNGANDKGTYVPIVTSYDYDAPLDEAG 308
>gi|301617189|ref|XP_002938028.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Xenopus (Silurana) tropicalis]
Length = 620
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 121/281 (43%), Gaps = 54/281 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G++ + TYV WNLHEP KG YDF+ DI F+ GL+V LR GP
Sbjct: 61 WRDRMKKMKACGINTLTTYVPWNLHEPGKGTYDFNNGLDISEFLAVAGEMGLWVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFR--------------------------SDNKP---YK 92
+I +EW GGLP WL + R S+ P +
Sbjct: 121 YICAEWDLGGLPSWLLRDKDMKLRTTYPGFTEAVDDYFNELIPRVAKYQYSNGGPIIAVQ 180
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + K Y+ + ++ G+ ++ D+ G + G+
Sbjct: 181 VENEYGS-----YAKDANYMEFIKNALIE--RGIVELLLTSDNKDGISYGSLEGVLATVN 233
Query: 153 FKGPN----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
F+ P KP + E WT ++ WGG ++ + + ++ + + G+
Sbjct: 234 FQKIEPVLFSYLNSIQPKKPIMVMEFWTGWFDYWGGDHHLFDVESMMSTISEVLNR-GAN 292
Query: 203 VNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYG 236
+N YM+HGGTNFG + A IT Y APL E G
Sbjct: 293 INLYMFHGGTNFGFMSGALHFHEYRPDITSYDYDAPLTEAG 333
>gi|423251759|ref|ZP_17232772.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
CL03T00C08]
gi|423255080|ref|ZP_17236010.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
CL03T12C07]
gi|392649184|gb|EIY42863.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
CL03T00C08]
gi|392652521|gb|EIY46180.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
CL03T12C07]
Length = 769
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
Length = 584
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 135/307 (43%), Gaps = 50/307 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + ++TYV WN+HEPQ+G++DFS D+ RFI+ Q GLYV LR P
Sbjct: 34 WRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQLAQEVGLYVILRPAP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL + R D P+ +
Sbjct: 94 YICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQVSDLQITQEGPILMMQ 153
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD----DAPGPVINACNG 146
+ENEY + + ++ K + F + PW+ ++ D P IN G
Sbjct: 154 VENEYGSYGNDKSYLRKSAELMRHNGIDVPLFTSDGPWLDMLENGSIKDIALPTINC--G 211
Query: 147 MRCGETFKGPNS---PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
E F+ +P + E W ++ WG + ++ A + + GS V
Sbjct: 212 SDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHTTSVTDAANELRDCLEAGS-V 270
Query: 204 NYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI-KL 255
N YM+HGGTNFG A +T Y A L E+G V PK+ +++ I ++
Sbjct: 271 NIYMFHGGTNFGFMNGANYYEKLLPDVTSYDYDALLSEWGDVT-PKYEAFQQVIGEITEI 329
Query: 256 CSRPLLT 262
S PL T
Sbjct: 330 PSFPLTT 336
>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
adhaerens]
Length = 543
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 132/293 (45%), Gaps = 50/293 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WNLHEP GQ+D++G ++ +FI Q G YV LR GP
Sbjct: 28 WRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFDYTGILNVRKFILLAQELGFYVILRPGP 87
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYVLWA 115
+I +EW +GG+P WL + RS KP+K + I+ KG P + A
Sbjct: 88 YICAEWEFGGMPSWLLSDKNMQVRSTYKPFKDAVNRFFDGFIPEIKSLQASKGGPII--A 145
Query: 116 AKMAVDF------------------HTGVPWVMCKQDDAPGPVINACNGMRCGETFKG-- 155
++ ++ + G+ ++ D++ G G+ F+G
Sbjct: 146 VQVENEYGSYGSDEEYMQFIRDALINRGIVELLVTSDNSEGIKHGGAPGVLKTYNFQGHA 205
Query: 156 -------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF--IAKNGSYVNYY 206
+ PSI E W+ ++ WG K + IA F I + N+Y
Sbjct: 206 KSHLSILERLQDAPSIVMEFWSGWFDHWGEKNH--QVHTIAHVTNTFKDILDCDASFNFY 263
Query: 207 MYHGGTNFG-RTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
++HGGTNFG A F+ +T Y APL E G + E K+ L+++
Sbjct: 264 VFHGGTNFGFMNGANFIDFFSYYLPTVTSYDYDAPLSEAGDITE-KYMELRKI 315
>gi|62859689|ref|NP_001015958.1| galactosidase, beta 1-like precursor [Xenopus (Silurana)
tropicalis]
gi|89271933|emb|CAJ82193.1| galactosidase, beta 1 [Xenopus (Silurana) tropicalis]
Length = 648
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 117/283 (41%), Gaps = 54/283 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD I TYV WN HE + G Y+FSG +DI F+K GL V LR GP
Sbjct: 63 WKDRLLKMKMAGLDAIYTYVPWNFHETKPGVYNFSGDHDIESFLKLANEIGLLVILRAGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL IV RS + Y +
Sbjct: 123 YICAEWDMGGLPAWLLAKESIVLRSSDPDYLQAVDNWMGVFLPKMKPFLYHNGGPIISVQ 182
Query: 93 IENEY------------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV 140
+ENEY ++ H G VL+ +G+ +V C
Sbjct: 183 VENEYGSYFTCDYNYLRHLLQLFRHHLGDEVVLFTTD-----GSGLQYVRCGTIQGLYTT 237
Query: 141 INACNGMRCGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
++ G ETF P P + +E +T + WG +P+ A ++ I
Sbjct: 238 VDFGPGSNVTETFSVQRYCEPKGPLVNSEFYTGWLDHWG-EPHSVVATEMVTKSLDEILA 296
Query: 199 NGSYVNYYMYHGGTNFG-----RTAAAFMITGYYDQAPLDEYG 236
+G+ VN YM+ GGTNFG T A T Y APL E G
Sbjct: 297 HGANVNMYMFIGGTNFGYWNGANTPYAPQPTSYDYDAPLSEAG 339
>gi|265767009|ref|ZP_06094838.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263253386|gb|EEZ24862.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 769
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|383116237|ref|ZP_09936989.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
gi|251945420|gb|EES85858.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
Length = 769
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYISAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|423270210|ref|ZP_17249181.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
CL05T00C42]
gi|423276168|ref|ZP_17255110.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
CL05T12C13]
gi|392698134|gb|EIY91316.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
CL05T00C42]
gi|392699308|gb|EIY92489.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
CL05T12C13]
Length = 769
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLREARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
Length = 601
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 81/276 (29%), Positives = 124/276 (44%), Gaps = 42/276 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WN+HEP++G++DF G D+I F++ GL+V +R P
Sbjct: 35 WRDRLLKMKACGCNTVETYVAWNVHEPEEGKFDFGGIADVIAFVELAGELGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFH----EKGPPYV--- 112
+I +EW +GGLP WL + + R + + K++ Y + P F G P +
Sbjct: 95 YICAEWEFGGLPAWLLKDSEMQLRCSDPKFLAKVDAYYDVLLPKFVPLLCTNGGPIIAMQ 154
Query: 113 ---------------------LWAAKMAVDFHT--GVPWVMCKQDDAPGPVINACNGMRC 149
+ A + V T G M + P + G R
Sbjct: 155 VENEYGSYGNDKAYLGYLRDGMIARGIDVLLFTSDGPTDEMLQGGTLPDVLATVNFGSRP 214
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E+F P++P + E W ++ W + + R +D A V + G+ VN+YM
Sbjct: 215 EESFAKFREYRPDEPLMCMEFWNGWFDHWMEEHHTRDGEDAA-RVLDDMLGAGASVNFYM 273
Query: 208 YHGGTNFGRTAAAFMITGY------YD-QAPLDEYG 236
+HGGTNFG + A I Y YD APL E G
Sbjct: 274 FHGGTNFGFYSGANHIKTYEPTVTSYDYDAPLTERG 309
>gi|375359947|ref|YP_005112719.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
gi|301164628|emb|CBW24187.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
Length = 769
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|60683116|ref|YP_213260.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
gi|60494550|emb|CAH09349.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
Length = 769
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|224027078|ref|ZP_03645444.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
gi|224020314|gb|EEF78312.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
Length = 783
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 122/295 (41%), Gaps = 55/295 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y FWN+HE + G++DF G+ND+ RF + Q G+Y+ LR GP
Sbjct: 64 WEHRIEMCKALGMNTICIYAFWNIHEQRPGEFDFEGQNDVARFCRLAQKHGMYIMLRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ SEW GGLP WL I R+ + PY
Sbjct: 124 YVCSEWEMGGLPWWLLKKKDIALRTSD-PYFLERTKIFMNELGKQLADLQAPRGGNIIMV 182
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK---------QDDAPGPVIN 142
++ENEY A+ E + T VP C DD IN
Sbjct: 183 QVENEYG----AYAEDKEYIASIRDIVRGAGFTDVPLFQCDWASTFQRNGLDDLLW-TIN 237
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G + FK P P + +E W+ ++ WG K R A + + + +N
Sbjct: 238 FGTGADIDQQFKALREARPETPLMCSEYWSGWFDHWGRKHETRPADVMVKGIKDMMDRNI 297
Query: 201 SYVNYYMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
S+ + YM HGGT FG A M + Y AP+ E G PK+ L++L
Sbjct: 298 SF-SLYMTHGGTTFGHWGGANSPSYSAMCSSYDYDAPISEAGWAT-PKYYQLRDL 350
>gi|423285593|ref|ZP_17264475.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
615]
gi|404579108|gb|EKA83826.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
615]
Length = 769
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLREARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 919
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 107/410 (26%), Positives = 171/410 (41%), Gaps = 65/410 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ KAK G++ + TY WN+HEP++G+++F G ND F+ GL+V R GP
Sbjct: 49 WREVLVKAKLAGMNCVDTYFAWNVHEPEEGEWNFEGDNDCGAFLDLCHELGLWVIARPGP 108
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
FI +EW +GG P WL+ + FR+ + Y +
Sbjct: 109 FICAEWDFGGFPYWLNTKKDMKFRAFDMQYLTYVDRYMDRIIPIIRDREINAGGSVILVQ 168
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--INACNGMRCG 150
+ENEY + A E Y+L + +D VP + C A G V N +G
Sbjct: 169 VENEYGYL--ASDEVARDYMLHLRDVMLDRGVMVPLITCV-GGAEGTVEGANFWSGADHH 225
Query: 151 ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG-SYVNYYM-- 207
P+ P I TE WT +++ WG + + L + G + V++YM
Sbjct: 226 YNNLVQKQPDTPKIVTEFWTGWFEHWGAPAATQKTAALYEKRMLESLRAGFTGVSHYMFF 285
Query: 208 --YHGGTNFGRTAAA---FMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLT 262
+ G GRT A FM+T Y APL EYG V + K+ K + ++ LL
Sbjct: 286 GGTNFGGYGGRTVGASDIFMVTSYDYDAPLSEYGRVTD-KYNTAKRMSYFVQATESVLLN 344
Query: 263 GTQNVISLGQLQEAF---VFEETSGVCAAFLVNNDERKAVTVLF---RNISYELPRKSIS 316
+ +L L + F V E+ + + DER+ ++ R I + ++
Sbjct: 345 AVEGAAALAALPQGFSARVREKGNERIWFVESSKDERETTSMTLPDGRTIPVTVGPHAVV 404
Query: 317 ILPD-----------CKTVAFNTERVSTQ-----YNKRSKTSNLKFDSDE 350
+ D C T ER+ Q Y + + S ++ +SD+
Sbjct: 405 PVIDRLQLEPGVYLTCNTYLIANERIDGQHTLIVYAENGQRSYIELESDQ 454
>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
Length = 608
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/299 (28%), Positives = 137/299 (45%), Gaps = 60/299 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TYV WNLHEP+K Y+F G D+ R++ GL+V LR GP
Sbjct: 54 WRDRMLKMKAAGLNTLETYVPWNLHEPEKYTYNFEGILDLGRYLDIAHEVGLWVILRPGP 113
Query: 62 FIESEWTYGGLPIWLHDV-----------------------AGIVFR--SDNKP---YKI 93
+I +EW +GG+P WL V A +V R ++ P +I
Sbjct: 114 YICAEWEFGGIPGWLAYVKEHVRTTRPMFIDPVEVWFGRLLAEVVPRQYTNGGPIIAVQI 173
Query: 94 ENEY----------QTIEPAFHEKGPPYVLWAAK-MAVDFHTGVPWVMCKQDDAPGPVIN 142
ENEY + ++ +G +L+ + G+P V+ + N
Sbjct: 174 ENEYGGFSNSTEYMERLKKILESRGIVELLFTSDGKGALISGGIPGVLKTVNFQN----N 229
Query: 143 ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAF-HVALFIAKNGS 201
A + ++ + + P++P + E WT ++ WG ++ + +F H +I G+
Sbjct: 230 ASDKLQKLKEIQ----PDRPMMVMEYWTGWFDHWGEDHHLYRLESESFVHSVFYILDAGA 285
Query: 202 YVNYYMYHGGTNFGRTAAAF-----------MITGYYDQAPLDEYGLVREPKWGHLKEL 249
VN+YM+HGGTNFG A IT Y AP+ E G + PK+ ++E+
Sbjct: 286 SVNFYMFHGGTNFGFMNGANTRYKSGGRTLPTITSYDYDAPISETGDLT-PKYFKIREI 343
>gi|53715181|ref|YP_101173.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218046|dbj|BAD50639.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 769
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLREARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|224135029|ref|XP_002327549.1| predicted protein [Populus trichocarpa]
gi|222836103|gb|EEE74524.1| predicted protein [Populus trichocarpa]
Length = 643
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/405 (26%), Positives = 167/405 (41%), Gaps = 68/405 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +AK GL+ IQTYV WNLHEPQ G+ F G D++ F+K + V LR GP
Sbjct: 40 WEDRLVRAKALGLNTIQTYVPWNLHEPQPGKLVFEGIADLVSFLKLCHKLDILVMLRPGP 99
Query: 62 FIESEWTYGGLPIWLHDV-AGIVFRSDNKPY--KIENEY----QTIEPAFHEKGPPYVLW 114
+I EW GG P WL + + RS + Y ++N + + P + G P ++
Sbjct: 100 YICGEWDLGGFPAWLLAIEPPLKLRSSDPAYLRLVDNWWGILLPKVAPFLYNNGGPIIM- 158
Query: 115 AAKMAVDF-------------------HTGVPWVMCKQD--------------DAPGPVI 141
++ +F H G ++ D DA +
Sbjct: 159 -VQIENEFGSYGDDKAYLHHLVKLARGHLGDGIILYTTDGGSRENLEKGTIRGDAVFSTV 217
Query: 142 NACNGMRCGETFKGP---NSPNK-PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
+ G FK N+P K P + +E +T + WG K A A + ++
Sbjct: 218 DFTTGDDPWPIFKLQKEFNAPGKSPPLSSEFYTGWLTHWGEKNAKTGADFTASALEKILS 277
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAFM----------ITGYYDQAPLDEYGLVREPKWGHLK 247
+NGS V YM HGGTNFG A IT Y AP+ E G V K+ L+
Sbjct: 278 QNGSAV-LYMVHGGTNFGFYNGANTGVDESDYKPDITSYDYDAPISESGDVENAKFNALR 336
Query: 248 ---ELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFR 304
ELH A L S P G + + AF+F+ + A +V ++ ++ + +
Sbjct: 337 RVIELHTAASLPSVPSDNGKMGYGPIQLQKTAFLFDLLDNINPADVVESENPLSMESVGQ 396
Query: 305 NISYEL------PR--KSISILPDCKTVAFNTERVSTQYNKRSKT 341
+ L P+ KS+ ++P+ A ++ N R T
Sbjct: 397 MFGFLLYVSEYTPKDDKSVLLIPEVHDRAQVFTLCHSEDNSRRPT 441
>gi|423260608|ref|ZP_17241530.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
CL07T00C01]
gi|423266742|ref|ZP_17245744.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
CL07T12C05]
gi|387775162|gb|EIK37271.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
CL07T00C01]
gi|392699974|gb|EIY93143.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
CL07T12C05]
Length = 769
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +GQ+DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLREARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPSYSAMCSSYDYDAPISEPGWTTD-KYFQLRDL 338
Score = 40.0 bits (92), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 522 WYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTFR D L++ + GKG WVNG +IGR+W
Sbjct: 523 YYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFW 559
>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
Length = 1106
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 127/293 (43%), Gaps = 47/293 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN HEPQ G YDF+ +ND+ F + Q +YV LR GP
Sbjct: 381 WDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFTEQNDLAEFCRLCQQNDMYVILRPGP 440
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEK--------GPPYVL 113
++ +EW GGLP WL I R ++ PY IE E A ++ G P ++
Sbjct: 441 YVCAEWEMGGLPWWLLKKKDIRLR-ESDPYFIE-RVNLFEEAVAKQVKDLTIANGGPIIM 498
Query: 114 WAA-----------------KMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETF-KG 155
+ V H G + + D A +N + + F G
Sbjct: 499 VQVENEYGSYGADKGYVSQIRDIVRTHFGNDIALFQCDWASNFTLNGLDDLIWTMNFGTG 558
Query: 156 PN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
N PN P + +E W+ ++ WG R A+D+ + +++ S+ +
Sbjct: 559 ANVDQQFAKLKKLRPNSPLMCSEFWSGWFDKWGANHETRPAEDMIKGIDDMLSRGISF-S 617
Query: 205 YYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHA 251
YM HGGTN+G A A +T Y AP+ E G PK+ L+E A
Sbjct: 618 LYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTT-PKYWKLREAMA 669
>gi|291557570|emb|CBL34687.1| Beta-galactosidase [Eubacterium siraeum V10Sc8a]
Length = 579
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/323 (26%), Positives = 137/323 (42%), Gaps = 61/323 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WN HE +KG ++++G +DI RFI+ GLY+ +R P
Sbjct: 34 WQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWNGMHDICRFIELADKLGLYMIIRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW +GGLP WL + R KPY +
Sbjct: 94 YICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDSYYSVLMPKLAPYQIDNGGNIIMMQ 153
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
IENEY ++ Y+ + + VP+V D P +GM G
Sbjct: 154 IENEY-----GYYGNDTSYLEFLRDTMRKYGITVPFVTS---DGPWSEFVFKSGMVDGAL 205
Query: 153 FKGPN---------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
G +KP + E W ++ VWG + I + + A + + +
Sbjct: 206 PTGNFGSSAEWQFGEMRRFIGEDKPLMCMEFWNGWFDVWGEEHNITAPEKAAQELDILL- 264
Query: 198 KNGSYVNYYMYHGGTNFGRTAA------AFMITGYYDQAPLDEYGLVREPKWGHLKELHA 251
KNGS +N+YM+ GGTNFG + ++T Y APL E G + E K+ KE+ +
Sbjct: 265 KNGS-MNFYMFEGGTNFGFMSGKNNEKKTGIVTSYDYDAPLTEDGRITE-KYEKCKEVIS 322
Query: 252 AIKLCSRPLLTGTQNVISLGQLQ 274
+ LT + G+++
Sbjct: 323 RYTDINEVPLTTQIRRLEYGKIR 345
>gi|390336578|ref|XP_792349.2| PREDICTED: beta-galactosidase-like [Strongylocentrotus purpuratus]
Length = 671
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 115/259 (44%), Gaps = 53/259 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ +QTYV WN HE + G+++F G +DI+ F+K+ GL V LR GP
Sbjct: 62 WQDRLDKMKMAGLNAVQTYVIWNFHELKPGEFNFDGDHDILSFLKKANDTGLAVILRPGP 121
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN------------------KPY-----------K 92
+I EW GGLP WL ++ GIV RS N +PY +
Sbjct: 122 YICGEWDLGGLPAWLLNIPGIVLRSSNDLYMAHVTEWMNFFLPKLRPYLYVNGGPIIMVQ 181
Query: 93 IENEYQTIEPAFHE-KGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMR--- 148
+ENEY + + H+ + Y L+ A + D V+ D PG + C ++
Sbjct: 182 VENEYGSYQTCDHQYQRQLYHLFRANLGPD-------VVLFTTDGPGDHLLQCGTLQDMY 234
Query: 149 -CGETFKGPNS-----------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
+ G NS P P + +E +T + W + + +
Sbjct: 235 ATIDFGAGSNSTGMFQEMRKFEPKGPLVNSEYYTGWLDHWEHPHQTVKTAAVCTSLDQML 294
Query: 197 AKNGSYVNYYMYHGGTNFG 215
A G+ VN YM+ GGTNFG
Sbjct: 295 AL-GANVNMYMFEGGTNFG 312
>gi|256393561|ref|YP_003115125.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256359787|gb|ACU73284.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 584
Score = 106 bits (264), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 121/282 (42%), Gaps = 56/282 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ I TY+ WNLHE + G +DF G D+ F+ ++GL+V LR GP
Sbjct: 35 WSDRLRKARLMGLNTIDTYIPWNLHERRPGTFDFGGILDLAAFLDAAAAEGLHVLLRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I EW GGLP WL + RS + + +
Sbjct: 95 YICGEWEGGGLPSWLLADPDLALRSTDPAFLQAVEAYLDAIMPIVLPRLGTRGGPVIAVQ 154
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPVI 141
+ENEY + + A +G + + D G +P V+ + G V
Sbjct: 155 VENEYGAYGSDTAYMERLYEALTSRGIDVPFFTSDQPNDLADGALPGVLATANFG-GKVT 213
Query: 142 NACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
+ +R P P + E W ++ WGG RSA+D + + + G+
Sbjct: 214 ASLAALRA-------QQPTGPLMCAEFWNGWFDYWGGTHAQRSAEDAGAALEEML-QAGA 265
Query: 202 YVNYYMYHGGTNFGRTAAA-------FMITGYYDQAPLDEYG 236
VN+YM+HGGTNFG T A +T Y +PLDE G
Sbjct: 266 SVNFYMFHGGTNFGFTNGANDKGTYRATVTSYDYDSPLDEAG 307
>gi|423278914|ref|ZP_17257828.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
gi|404585906|gb|EKA90510.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
610]
Length = 769
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 127/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P+ P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPDTPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A + M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338
>gi|313149116|ref|ZP_07811309.1| beta-galactosidase [Bacteroides fragilis 3_1_12]
gi|313137883|gb|EFR55243.1| beta-galactosidase [Bacteroides fragilis 3_1_12]
Length = 769
Score = 106 bits (264), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 127/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P+ P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPDTPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A + M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338
>gi|123788298|sp|Q3UPY5.1|GLBL2_MOUSE RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|74224567|dbj|BAE25259.1| unnamed protein product [Mus musculus]
Length = 636
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 130/300 (43%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI+ GL+V LR GP
Sbjct: 78 WRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQLAAKIGLWVILRPGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL + R+ + +
Sbjct: 138 YICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVELYFDHLMSRVVPLQYKHGGPIIAVQ 197
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K Y+ + K D G+ ++ D+ G +G
Sbjct: 198 VENEYGS-----YNKDRAYMPYIKKALED--RGIIEMLLTSDNKDGLEKGVVDGVLATIN 250
Query: 147 MRCGETFKGPNSP------NKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ + N+ +P + E WT ++ WGG I + ++ V+ I K+G
Sbjct: 251 LQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHNILDSSEVLQTVSAII-KDG 309
Query: 201 SYVNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
S +N YM+HGGTNFG A +T Y A L E G K+ L+EL +
Sbjct: 310 SSINLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTEAG-DYTAKYTKLRELFGTV 368
>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
Length = 591
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 123/277 (44%), Gaps = 44/277 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I KA+ GL+ I+TYV WN HEP +GQ+ + G D+ F+K + +G++ +R P
Sbjct: 35 WADRIHKARLMGLNTIETYVAWNAHEPVEGQWSWEGGLDLAAFLKAVADEGMHAIVRPAP 94
Query: 62 FIESEWTYGGLPIWL--HDVAGI-----VFRSDNKPYKIENEYQTIEPAFHEKGPPYVL- 113
+I +EW GGLP WL AG+ VF + + Y + Y+ IEP G P +L
Sbjct: 95 YICAEWDNGGLPAWLFGEKAAGVRRDEPVFMAAVQAY-LRRVYEVIEPLQIHHGGPVILV 153
Query: 114 -----WAA--------KMAVDFHTG----VPWVMCKQDD--------APGPVINACNGMR 148
+ A + VD + VP Q + PG + G R
Sbjct: 154 QIENEYGAYGSDPEYLRKLVDITSSAGITVPLTTVDQPEDGMLAAGSLPGLLRTGSFGSR 213
Query: 149 CGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
E + P P + E W ++ WG + A+ A + + +G+ VN Y
Sbjct: 214 SPERLATLRRHQPTGPLMCMEYWNGWFDDWGTPHHTTDAEASAADLDALLG-SGASVNLY 272
Query: 207 MYHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
M GGTNFG T A ++T Y APLDE G
Sbjct: 273 MLCGGTNFGLTNGANDKGTYEPIVTSYDYDAPLDEAG 309
>gi|24418925|ref|NP_722498.1| beta-galactosidase-1-like protein 2 [Mus musculus]
gi|23512349|gb|AAH38479.1| Galactosidase, beta 1-like 2 [Mus musculus]
gi|148693361|gb|EDL25308.1| cDNA sequence BC038479, isoform CRA_b [Mus musculus]
Length = 652
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 130/300 (43%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI+ GL+V LR GP
Sbjct: 94 WRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQLAAKIGLWVILRPGP 153
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL + R+ + +
Sbjct: 154 YICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 213
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K Y+ + K D G+ ++ D+ G +G
Sbjct: 214 VENEYGS-----YNKDRAYMPYIKKALED--RGIIEMLLTSDNKDGLEKGVVDGVLATIN 266
Query: 147 MRCGETFKGPNSP------NKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ + N+ +P + E WT ++ WGG I + ++ V+ I K+G
Sbjct: 267 LQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHNILDSSEVLQTVSAII-KDG 325
Query: 201 SYVNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
S +N YM+HGGTNFG A +T Y A L E G K+ L+EL +
Sbjct: 326 SSINLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTEAG-DYTAKYTKLRELFGTV 384
>gi|395816938|ref|XP_003781939.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Otolemur
garnettii]
Length = 669
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 124/286 (43%), Gaps = 52/286 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ G+Y FS +D+ FI+ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGKYQFSEDHDVEYFIQLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL + ++ RS + Y +
Sbjct: 125 YICAEWDMGGLPAWLLEKESMILRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIISVQ 184
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKM--------AVDFHT-GV--PWVMCKQDDAPGPVI 141
+ENEY + H+ Y+ + K V F T G+ ++ C +
Sbjct: 185 VENEYGSYFTCDHD----YMRFLLKRFRYYLGDDVVLFTTDGIFEKYLNCGALQGLYATV 240
Query: 142 NACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
+ G+ FK + P P I +E +T + WG +D+AF + +A+
Sbjct: 241 DFGTGVNITAAFKLQRKSEPKGPLINSEFYTGWLDHWGQPHSTVKTEDVAFSLFDILAR- 299
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
G+ VN YM+ GGTNF A + T Y APL E G + E
Sbjct: 300 GASVNLYMFTGGTNFAYWNGANIPYSAQPTSYDYDAPLSEAGDLTE 345
>gi|329960238|ref|ZP_08298680.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328532911|gb|EGF59688.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 778
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 121/296 (40%), Gaps = 57/296 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y FWN+HE + G++DF G+NDI F + Q G+Y+ LR GP
Sbjct: 62 WEHRIQMCKALGMNTICIYAFWNIHEQRPGEFDFKGQNDIAEFCRLAQKNGMYIMLRPGP 121
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ SEW GGLP WL I R+ N PY
Sbjct: 122 YVCSEWEMGGLPWWLLKKKDIQLRT-NDPYFLERTKLFMNEIGKQLADLQAPRGGNIIMV 180
Query: 92 KIENEY--QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD--------DAPGPVI 141
++ENEY + + V A T VP C D I
Sbjct: 181 QVENEYGGYAVNKEYIANVRDIVRGAG------FTDVPLFQCDWSSTFQLNGLDDLLWTI 234
Query: 142 NACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
N G FK P+ P + +E W+ ++ WG K R A+ + + + +N
Sbjct: 235 NFGTGANIDAQFKSLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLDRN 294
Query: 200 GSYVNYYMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
S+ + YM HGGT FG A M + Y AP+ E G PK+ L+E+
Sbjct: 295 ISF-SLYMAHGGTTFGHWGGANCPPYSAMCSSYDYDAPISEAGWAT-PKYYKLREM 348
Score = 40.0 bits (92), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 7/54 (12%)
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQY 574
+Y+ +F D + L++Q+ GKG WVNG++IGR+W + P QT Y
Sbjct: 531 AYYRASFNLKETGD-VFLDMQTWGKGMVWVNGKAIGRFW------EIGPQQTLY 577
>gi|424664993|ref|ZP_18102029.1| hypothetical protein HMPREF1205_00868 [Bacteroides fragilis HMW
616]
gi|404575526|gb|EKA80269.1| hypothetical protein HMPREF1205_00868 [Bacteroides fragilis HMW
616]
Length = 769
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 127/290 (43%), Gaps = 45/290 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN+HE +G++DF+G+NDI F + Q G+YV +R GP
Sbjct: 52 WEHRIEMCKALGMNTICLYVFWNIHEQTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL IV R+ + PY +E ++ + P + +
Sbjct: 112 YVCAEWEMGGLPWWLLKKKDIVLRTLD-PYFMERTAIFMKEVGKQLAPLQITRGGNIIMV 170
Query: 119 ---------AVDF--------------HTGVPWVMCKQD--------DAPGPVINACNGM 147
AVD T VP C D IN G
Sbjct: 171 QVENEYGAYAVDKPYVSAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGA 230
Query: 148 RCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
+ FK P+ P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 231 NIEQQFKRLKEARPDTPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SL 289
Query: 206 YMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A + M + Y AP+ E G + K+ L++L
Sbjct: 290 YMAHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338
>gi|218117864|dbj|BAH03319.1| beta-galactosidase [Cucumis melo var. cantalupensis]
Length = 166
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/148 (39%), Positives = 84/148 (56%), Gaps = 8/148 (5%)
Query: 403 LDVQSHGHILHAFVNGEYTGSAHGSHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGA 462
L + S GH LH F+NG+ +G+ +G DN T V+LR G N ++LSV VGLP+ G
Sbjct: 19 LTIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGL 78
Query: 463 FLERKVAGV------HRVRVQDKSFTNCSWGYQVGLIGEKLQIYSNLGLNKVLW--SSIR 514
E AG+ + + + W Y+VGL GE + +++ G + V W S+
Sbjct: 79 HFETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKGESMNLHTISGSSSVEWMTGSLV 138
Query: 515 SPTRQLTWYKTTFRAPAGNDPIALNLQS 542
S + LTWYKTTF AP GN+P+AL++ S
Sbjct: 139 SQKQPLTWYKTTFNAPGGNEPLALDMGS 166
>gi|239986962|ref|ZP_04707626.1| putative beta-galactosidase [Streptomyces roseosporus NRRL 11379]
Length = 606
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 126/285 (44%), Gaps = 49/285 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A GL+ ++TYV WNLHEP++G+ G + RF+ ++ GL+ +R GP
Sbjct: 35 WGHRLAVLAAMGLNCVETYVPWNLHEPREGEVRDVG--ALGRFLDAVERAGLWAIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAFHE----KGPPYVL-- 113
+I +EW GGLP+W+ G R+ + Y+ +E ++ + P E +G P +L
Sbjct: 93 YICAEWENGGLPVWVTGRFGRRVRTRDAEYRAVVERWFRELLPQVVERQVVRGGPVILVQ 152
Query: 114 ----------------WAAKMAVDFHTGVPWV--------MCKQDDAPGPVINA--CNGM 147
W A + + VP M PG + A +G
Sbjct: 153 AENEYGSFGSDAVYLEWLAGLLRECGVTVPLFTSDGPEDHMLTGGSVPGLLATANFGSGA 212
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
R G + P P + E W ++ WG +P +R A++ A + I + G+ VN YM
Sbjct: 213 REGFAVLRRHQPKGPLMCMEFWCGWFDHWGAEPVLRDAEEAAGALRE-ILECGASVNIYM 271
Query: 208 YHGGTNFGRTAAAF------------MITGYYDQAPLDEYGLVRE 240
HGGTNF A A +T Y AP+DEYG E
Sbjct: 272 AHGGTNFAGWAGANRGGPLQDGEFQPTVTSYDYDAPVDEYGRATE 316
>gi|156398646|ref|XP_001638299.1| predicted protein [Nematostella vectensis]
gi|156225418|gb|EDO46236.1| predicted protein [Nematostella vectensis]
Length = 675
Score = 105 bits (263), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 114/279 (40%), Gaps = 45/279 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ +QTYV WNLHEP+ G YDF G ND+ FIK QS GL V LR GP
Sbjct: 60 WKDRLQKMKFAGLNAVQTYVAWNLHEPEIGTYDFEGENDLEEFIKIAQSVGLLVILRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQ-------TIEPAFHEKGPPYVL- 113
+I EW GG P WL IV RS ++ + I P + G P +
Sbjct: 120 YICGEWELGGFPPWLLKNTSIVLRSSKDQVYMDAVDKWMGVLLPKIRPLLYNNGGPVITV 179
Query: 114 -----WAAKMAVDF------------HTGVPWVMCKQDD--------APGPVINACNGMR 148
+ + D H G V+ D P +
Sbjct: 180 QVENEYGSYFTCDHDYMSHLENLFRSHLGKDVVLFTTDGFAKSMLDCGTLPSLFTTVDFG 239
Query: 149 CGETFKGPNS------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G K P S PN P + +E + + WG K + + ++ + +A N S
Sbjct: 240 AGVDPKVPFSILRKYQPNGPLVNSEFYPGWLDHWGEKHSTVNPAVMTQYLDMILAMNAS- 298
Query: 203 VNYYMYHGGTNFGRTAAAF-----MITGYYDQAPLDEYG 236
VN YM+ GGT+FG A T Y APL E G
Sbjct: 299 VNLYMFEGGTSFGYMNAKSSQYQPQPTSYDYDAPLSEAG 337
>gi|402861842|ref|XP_003895286.1| PREDICTED: beta-galactosidase-like [Papio anubis]
Length = 373
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 120/283 (42%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNTIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKEAILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + A DF H G V+ D A + A G+ F G
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFHHHLGDDVVLFTTDGAHETFLQCGALQGLYATVDF-G 243
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P S P P I +E +T + W G+P+ ++ I G+
Sbjct: 244 PGSNITDAFQIQRKCEPKGPLINSEFYTGWLDHW-GQPHSTIKTEVVASSLYDILARGAS 302
Query: 203 VNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF + A T Y APL E G + E
Sbjct: 303 VNLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|291530918|emb|CBK96503.1| Beta-galactosidase [Eubacterium siraeum 70/3]
Length = 579
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 135/318 (42%), Gaps = 51/318 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WN HE +KG +++ G +DI RFI+ GLY+ +R P
Sbjct: 34 WQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWDGMHDICRFIELADKLGLYMIIRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP----------------- 102
+I SEW +GGLP WL + R KPY ++N Y + P
Sbjct: 94 YICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDNYYSVLMPKLAPYQIDNGGNIIMMQ 153
Query: 103 -----AFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN 157
++ Y+ + + VP+V D P +GM G G
Sbjct: 154 IENEYGYYGNDTSYLEFLRDTMRKYGITVPFVTS---DGPWSEFVFKSGMVDGALPTGNF 210
Query: 158 SPN---------------KPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+ KP + E W ++ VWG + I + + A + + KNGS
Sbjct: 211 GSSAEWQLGEMRRFIGEGKPLMCMEFWNGWFDVWGEEHNITAPEKAAQELDTLL-KNGS- 268
Query: 203 VNYYMYHGGTNFGRTAA------AFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLC 256
+N+YM+ GGTNFG + ++T Y APL E G + E K+ KE+ +
Sbjct: 269 MNFYMFEGGTNFGFMSGKNNEKKTGIVTSYDYDAPLTEDGRITE-KYEKCKEVISRYNDI 327
Query: 257 SRPLLTGTQNVISLGQLQ 274
+ LT + G+++
Sbjct: 328 NEVPLTTQIRRLEYGEIR 345
>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 105 bits (263), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 89/314 (28%), Positives = 134/314 (42%), Gaps = 70/314 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + +A GL+ + YVFWN HE Q G +DFSG+ DI F++ Q +GLYV LR GP
Sbjct: 61 WRDRLHRAHAMGLNTVSAYVFWNFHERQPGVFDFSGQADIAEFVRIAQEEGLYVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW +GG P WL + +RS + + +
Sbjct: 121 YVCAEWDFGGYPSWLLKEKDLTYRSKDPRFMSYCERYIKELGKQLAPLTINNGGNIIMVQ 180
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV--------INAC 144
+ENEY + + Y+ M + VP C D G V +
Sbjct: 181 VENEYGS-----YAADKEYLAAIRDMLQEAGFNVPLFTC---DGGGQVEAGHIAGALPTL 232
Query: 145 NGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHVALFIAK 198
NG+ + FK + P P E + +++ WG + Y R A+ + + +
Sbjct: 233 NGVFGEDIFKIVDKYHPGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDWMLG----- 287
Query: 199 NGSYVNYYMYHGGTNF-----GRTAAAF--MITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+G V+ YM+HGGTNF T+ F T Y APL E+G PK+ HA
Sbjct: 288 HGVSVSMYMFHGGTNFWYMNGANTSGGFRPQPTSYDYDAPLGEWGNCY-PKY------HA 340
Query: 252 AIKLCSRPLLTGTQ 265
++ + L GTQ
Sbjct: 341 FREIIQKYLPEGTQ 354
>gi|290956543|ref|YP_003487725.1| glycosyl hydrolase family 42 [Streptomyces scabiei 87.22]
gi|260646069|emb|CBG69162.1| putative glycosyl hydrolase (family 42) [Streptomyces scabiei
87.22]
Length = 591
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 130/300 (43%), Gaps = 57/300 (19%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQ-KGQYDFSGRNDIIRFIKEIQSQGLYVCLRI 59
+W + KA+ GL+ ++TYV WNLH+P G D+ R++ +++GL+V LR
Sbjct: 36 LWADRLRKARLMGLNTVETYVPWNLHQPDPDSPLVLDGLLDLPRYLSLARAEGLHVLLRP 95
Query: 60 GPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY---------------------------- 91
GP+I +EW GGLP WL GI RS + +
Sbjct: 96 GPYICAEWDGGGLPSWLTSDPGIRLRSSDPRFTDALDGYLDILLPPLLPYMAANGGPVIA 155
Query: 92 -KIENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPV 140
++ENEY + + A +G +L+ A H PG +
Sbjct: 156 VQVENEYGAYGDDTAYLKHVHQALRARGVEELLFTCDQAGSGH------HLAAGSLPGVL 209
Query: 141 INACNGMRCGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
A G + E+ + P P + +E W ++ WG + ++R A+ A + +A
Sbjct: 210 STATFGGKIEESLAALRAHMPEGPLMCSEFWIGWFDHWGEEHHVRDAESAAADLDKLLAA 269
Query: 199 NGSYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHA 251
G+ VN YM+HGGTNFG T A ++T Y A L E G PK+ +E+ A
Sbjct: 270 -GASVNIYMFHGGTNFGFTNGANHDQCYAPIVTSYDYDAALTESG-DPGPKYHAFREVIA 327
>gi|355747127|gb|EHH51741.1| hypothetical protein EGM_11177 [Macaca fascicularis]
Length = 373
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 120/283 (42%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNTIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKEAILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + A DF H G V+ D A + A G+ F G
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFHHHLGDDVVLFTTDGAHETFLQCGALQGLYTTVDF-G 243
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P S P P I +E +T + W G+P+ ++ I G+
Sbjct: 244 PGSNITDAFQIQRKCEPKGPLINSEFYTGWLDHW-GQPHSTIKTEVVASSLYDILARGAS 302
Query: 203 VNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF + A T Y APL E G + E
Sbjct: 303 VNLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
Length = 630
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 110/412 (26%), Positives = 171/412 (41%), Gaps = 70/412 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWN+HEP+ G++DF+G ++ +IK +GL V LR GP
Sbjct: 59 WRHRMQMLKAMGLNAVATYVFWNIHEPEPGKWDFTGDKNLAEYIKIAGEEGLMVILRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLWA 115
++ +EW +GG P WL +V G+ R DN+ + I Y+ + KG P V+
Sbjct: 119 YVCAEWEFGGYPWWLQNVEGLELRRDNEQFLKYTQLYINRLYKEVGNLQITKGGPIVMVQ 178
Query: 116 AKMA----VDFHTGVPW---------VMCKQDDA--------------------PGPVIN 142
A+ V +P ++ + DA PG +
Sbjct: 179 AENEFGSYVSQRKDIPLEEHRRYNAKIVQQLKDAGFDVPSFTSDGSWLFEGGAVPGALPT 238
Query: 143 ACNGMRCGETFKGP----NSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
A NG E K N P + E + + W SA IA ++
Sbjct: 239 A-NGESNIENLKKAVDKYNGGQGPYMVAEFYPGWLAHWLEPHPQISATSIARQTEKYLQN 297
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
N S +NYYM HGGTNFG T+ A +T Y AP+ E G V PK+ L+ +
Sbjct: 298 NVS-INYYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPISEAGWVT-PKYDSLRNV 355
Query: 250 HAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRNISYE 309
S P + VI + ++ + T + +V N++ L + Y
Sbjct: 356 IKKYVNYSLPKVPAAIPVIEIPSIKLDKI--ATLDGLNSKVVENNKPMTFEQLNQGYGYV 413
Query: 310 LPRK----------SISILPDCKTVAFNTERV---STQYNKRSKTSNLKFDS 348
L +K I+ L D + N E+V + +N+ S ++ F+S
Sbjct: 414 LYKKHFNQPISGTLKINGLRDYAIIYANDEKVGELNRYFNQDSIDVDIPFNS 465
>gi|354585216|ref|ZP_09004105.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
gi|353188942|gb|EHB54457.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
Length = 619
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 129/278 (46%), Gaps = 46/278 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN+HEP +G+++FSG D+ FI+ GL+V +R P
Sbjct: 35 WEDRLLKLKACGFNTVETYIAWNVHEPTEGEFNFSGMADVGSFIELAGKLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTI----EPAFHEKGPPYVLWA 115
FI +EW +GGLP WL I R + Y K+++ Y + P G P + A
Sbjct: 95 FICAEWEFGGLPGWLLGYGEIRLRCSDPLYLSKVDHYYDELIPRMVPLLSSNGGP--ILA 152
Query: 116 AKMAVDF------HTGVPW-----------VMCKQDDAP------GPVINACN-----GM 147
++ ++ H + + V+ D P G I+ + G
Sbjct: 153 VQVENEYGSYGNDHAYLEYLRAGLVRRGVDVLLFTSDGPTDEMLLGGSIDHVHATVNFGS 212
Query: 148 RCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
R E+F ++P + E W ++ W ++R A D+A + + K GS +N
Sbjct: 213 RVEESFGKYREYRTDEPLMVMEFWNGWFDHWMEDHHVRDAADVAGVLDEMLEK-GSSINM 271
Query: 206 YMYHGGTNFGRTAAAFMITGY------YD-QAPLDEYG 236
YM+HGGTNFG + A I Y YD APL E+G
Sbjct: 272 YMFHGGTNFGFYSGANHIKTYEPTTTSYDYDAPLTEWG 309
>gi|188990653|ref|YP_001902663.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
B100]
gi|167732413|emb|CAP50607.1| exported beta-galactosidase [Xanthomonas campestris pv. campestris]
Length = 680
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 113/269 (42%), Gaps = 40/269 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DF+ ND+ F++E +QGL V LR GP
Sbjct: 130 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 189
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY------QTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + + P + G P +
Sbjct: 190 YACAEWETGGYPAWLFGKDNIRVRSRDPRFLAASQAYLDAVSKQVHPLLNHNGGPIIAVQ 249
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 250 VENEYGSYDDDHAYMADNRAMYVKAGFDDALLFTSDGADMLANGTLPDTLAVVNFAPGEA 309
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 310 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHASTDAKQQTEELEWILRQGHSANLYM 368
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEYG 236
+ GGT+FG FM + P D Y
Sbjct: 369 FIGGTSFG-----FMNGANFQGNPSDHYA 392
>gi|194221516|ref|XP_001490197.2| PREDICTED: beta-galactosidase-like [Equus caballus]
Length = 641
Score = 105 bits (262), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 122/286 (42%), Gaps = 52/286 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEPQ GQY FS +D+ FI+ GL V LR GP
Sbjct: 44 WKDRLLKMKMAGLNAIQTYVPWNFHEPQPGQYQFSEDHDVEYFIQLAHELGLLVILRPGP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP WL + IV RS + Y +
Sbjct: 104 YICAEWDMGGLPAWLLEKQSIVLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 163
Query: 93 IENEYQT-----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVI 141
+ENEY + ++ FH+ VL + F ++ C +
Sbjct: 164 VENEYGSYFTCDYDYLRFLQKLFHQHLGDDVLLFTTDGI-FQK---FLKCGALQGLYATV 219
Query: 142 NACNGMRCGETF--KGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
+ +G+ F + + P P I +E +T + WG + + ++ D+ I +
Sbjct: 220 DFGSGINVTAAFQIQRKSEPRGPLINSEFYTGWLDHWGQR-HSKAKTDVVASTLYDILAS 278
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
G+ VN YM+ GGTNF A + T Y APL E G + E
Sbjct: 279 GANVNMYMFIGGTNFAYWNGANLPYQPQPTSYDYDAPLSEAGDLTE 324
>gi|21232326|ref|NP_638243.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
gi|21114096|gb|AAM42167.1| beta-galactosidase [Xanthomonas campestris pv. campestris str. ATCC
33913]
Length = 613
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/327 (26%), Positives = 135/327 (41%), Gaps = 50/327 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DF+ ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ Q + P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSYLDAVAQQVRPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYDDDHAYMADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAPGEA 242
Query: 153 FKGPN-----SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W G P+ + +I + G N YM
Sbjct: 243 KSAFDKLIKFQPDQPRMVGEYWAGWFDHW-GTPHASTNAKQQTEELEWILRQGHSANLYM 301
Query: 208 YHGGTNFG-RTAAAF----------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLC 256
+ GGT+FG A F T Y A LDE G PK+ ++++ +
Sbjct: 302 FIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFALMRDVITRVTGV 360
Query: 257 SRPLLTGTQNVISLGQLQEAFVFEETS 283
P L I++ L++A + E S
Sbjct: 361 QPPALPAP---IAMAALKDAPLRESAS 384
>gi|440800373|gb|ELR21412.1| lysosomal betagalactosidase, partial [Acanthamoeba castellanii str.
Neff]
Length = 604
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 118/261 (45%), Gaps = 49/261 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP+ + + GL+ + TYV WNLHEP GQYDFSGR DI+RFI+ Q +G V +R P
Sbjct: 57 WPARLRTLRSCGLNTVTTYVPWNLHEPTPGQYDFSGRLDIVRFIEAAQQEGFLVIVRPPP 116
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E +GGLP WL + G+ R + Y +
Sbjct: 117 YICAELEFGGLPAWLLNEEGLQLRCSDPKYLKRVDSFLDHFLPMLATYQYSRGGPIIAMQ 176
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVD--FHTG-VPWVMCKQDDAPGP 139
+ENEY + +E F + +L+++ A D F G +P ++ + G
Sbjct: 177 VENEYGSYGNDHLYLRHLELKFRQHQIDAILFSSNGAGDQMFVGGALPSLLRTVNFGTGA 236
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
+ ++ ++ P+ P TE W ++ WG + + + + ++ N
Sbjct: 237 DVEG--NLKVLRKYQ----PSGPLFVTEFWDGWFDHWGEEHHTTTPTQSMKTLEAILSNN 290
Query: 200 GSYVNYYMYHGGTNFGRTAAA 220
S VN YM GGTNFG T A
Sbjct: 291 AS-VNLYMAFGGTNFGFTNGA 310
>gi|410972397|ref|XP_003992646.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Felis catus]
Length = 703
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/266 (28%), Positives = 115/266 (43%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 145 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 204
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL +G+ R+ K + +
Sbjct: 205 YICSEIDLGGLPSWLLQDSGMRLRTTYKGFTEAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 264
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + + + P Y+ + K D G+ ++ D+ G +G+
Sbjct: 265 VENEYGS-----YNRDPAYMPYIKKALED--RGIVELLLTSDNKDGLQKGVMDGVLATIN 317
Query: 153 FKGPNSPN------------KPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+ + +P + E WT ++ WGG I + ++ V+ I G
Sbjct: 318 LQSQHELQLLTNFLLSVQRVQPKMVMEYWTGWFDSWGGPHNILDSSEVLKTVSA-ILDAG 376
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
+N YM+HGGTNFG A Y
Sbjct: 377 FSINLYMFHGGTNFGFINGAMHFHDY 402
>gi|207029277|ref|NP_001126295.1| beta-galactosidase precursor [Pongo abelii]
Length = 677
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLQLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKCFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF T A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANTPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 591
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 130/304 (42%), Gaps = 53/304 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WN+HEP++G ++F G D++++++ Q GL V LR P
Sbjct: 34 WDHSLYNLKALGCNTVETYVPWNMHEPKEGVFNFEGIADLVKYVQLAQKYGLMVILRPTP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFH----EKGPP----- 110
+I +EW +GGLP WL I RS+ + K+EN Y+ + P E G P
Sbjct: 94 YICAEWEFGGLPAWLLKYRDIRVRSNTNLFLNKVENFYKVLLPLVTSLQVENGGPIIMMQ 153
Query: 111 -------------YVLWAAKMAVDFHTGVPWVMC----KQDDAPGPVIN----------- 142
YV K+ D VP ++ G +I+
Sbjct: 154 VENEYGSFGNDKEYVRSIKKLMRDLGVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGS 213
Query: 143 -ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
+ + E+F N P + E W ++ WG + R + ++A V + + +
Sbjct: 214 RSNENLNALESFIKENKKEWPLMCMEFWDGWFNRWGMEIIRRDSSELAEEVKELLKR--A 271
Query: 202 YVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
+N+YM+ GGTNFG IT Y A L E+G EP + A
Sbjct: 272 SINFYMFQGGTNFGFMNGCSSRENVDLPQITSYDYDALLTEWG---EPTPKYYAVQRAIK 328
Query: 254 KLCS 257
++CS
Sbjct: 329 EVCS 332
>gi|326779952|ref|ZP_08239217.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
gi|326660285|gb|EGE45131.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
Length = 648
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/347 (29%), Positives = 144/347 (41%), Gaps = 63/347 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A GL+ ++TYV WNLHEP++G+ G + RF+ ++ GL+ +R GP
Sbjct: 35 WEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG--ALGRFLDAVERAGLWAIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAFHE----KGPPYVL-- 113
+I +EW GGLP+W+ G R+ + Y+ +E ++ + P +G P VL
Sbjct: 93 YICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWFRELLPQVVRRQVSRGGPVVLVQ 152
Query: 114 ----------------WAAKMAVDFHTGVPWV--------MCKQDDAPGPVINACNGMRC 149
W A + VP M PG + A G
Sbjct: 153 AENEYGSYGSDAVYLEWLAGLLRQCGVTVPLFTSDGPEDHMLTGGSVPGLLATANFGSGA 212
Query: 150 GETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E FK + P P + E W ++ WG +P R + A + I + G+ VN YM
Sbjct: 213 REGFKVLRRHQPGGPLMCMEFWCGWFDHWGAEPVRRDPEQAAGALRE-ILECGASVNVYM 271
Query: 208 YHGGTNFGRTAAAF------------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
HGGTNFG A A +T Y AP+DEYG E K+ +E+ A
Sbjct: 272 AHGGTNFGGWAGANRSGPHQDESFQPTVTSYDYDAPVDEYGRATE-KFRLFREVLEAYAE 330
Query: 256 CSRPL-------LTGTQNV-----ISLGQLQEAFVFEET-SGVCAAF 289
P L G V SLG + E ET SGV A F
Sbjct: 331 GPLPALPPEPVGLAGPVRVELAEWASLGDVLEVLGDPETESGVPATF 377
>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
Length = 613
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 114/268 (42%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E +QGL + LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAAQGLNIILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ ++P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALANQVQPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G + YM
Sbjct: 243 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHAATDARQQAEEFEWILRQGHSASLYM 301
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM + P D Y
Sbjct: 302 FIGGTSFG-----FMNGANFQNNPSDHY 324
>gi|66767541|ref|YP_242303.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
gi|66572873|gb|AAY48283.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
8004]
Length = 613
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 140/334 (41%), Gaps = 64/334 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DF+ ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ Q + P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIRIRSRDPRFLAASQSYLDAVAQQVRPLLNHNGGPIIAVQ 182
Query: 113 -------------------------------LWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
L+ + A G +P + + APG
Sbjct: 183 VENEYGSYDDDHAYIADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAPGEA 242
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+A + + + P++P + E W ++ W G P+ + +I + G
Sbjct: 243 KSAFDKLIKFQ-------PDQPRMVGEYWAGWFDHW-GTPHASTNAKQQTEELEWILRQG 294
Query: 201 SYVNYYMYHGGTNFG-RTAAAF----------MITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GGT+FG A F T Y A LDE G PK+ ++++
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFALMRDV 353
Query: 250 HAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETS 283
+ P L I++ L++A + E S
Sbjct: 354 ITRVTGVQPPALPAP---IAMAALKDAPLRESAS 384
>gi|75041447|sp|Q5R7P4.1|BGAL_PONAB RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|55730998|emb|CAH92216.1| hypothetical protein [Pongo abelii]
Length = 677
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLQLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKCFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF T A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANTPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|154490061|ref|ZP_02030322.1| hypothetical protein PARMER_00290 [Parabacteroides merdae ATCC
43184]
gi|423723056|ref|ZP_17697209.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
gi|154089210|gb|EDN88254.1| glycosyl hydrolase family 35 [Parabacteroides merdae ATCC 43184]
gi|409241481|gb|EKN34249.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
Length = 780
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 124/294 (42%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y FWN+HE + G++DF G+NDI F + Q +G+Y+ LR GP
Sbjct: 64 WQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFKGQNDIAAFCRLAQKEGMYIMLRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ SEW GGLP WL I R+ N PY
Sbjct: 124 YVCSEWEMGGLPWWLLKKEDIKLRT-NDPYFLERTKLFMNEIGKQLADLQVTRGGNIIMV 182
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD--------DAPGPVINA 143
++ENEY +K + A A F T VP C D IN
Sbjct: 183 QVENEYGAYAT---DKAYIANIRDAVKAAGF-TDVPLFQCDWSSTFQLNGLDDLVWTINF 238
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G FK P+ P + +E W+ ++ WG K R A + + + ++ S
Sbjct: 239 GTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAGVMVSGIKDMLDRHIS 298
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G PK+ L+EL
Sbjct: 299 F-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWAT-PKYYKLREL 350
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTF D + L++Q+ GKG WVNG+++GR+W
Sbjct: 533 AYYRTTFELDEVGD-VFLDMQTWGKGMVWVNGKAMGRFW 570
>gi|294903093|ref|XP_002777496.1| Beta-galactosidase precursor, putative [Perkinsus marinus ATCC
50983]
gi|239885192|gb|EER09312.1| Beta-galactosidase precursor, putative [Perkinsus marinus ATCC
50983]
Length = 396
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/253 (31%), Positives = 123/253 (48%), Gaps = 25/253 (9%)
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
++ENEY + G Y+ W +++ VPWVMC A G +N CNG C
Sbjct: 29 QLENEYGH----HSDAGRAYIDWVGELSFGLGLDVPWVMCNGMSANG-TLNVCNGDDCAA 83
Query: 152 TFKGPNS---PNKPSIWTEDWTSFYQVWGGK--PYIRSAQDIAFHVALFIAKNGSYVNYY 206
+K + P++P WTE+ ++ WGG RSA+++A+ +A ++A GS+ NYY
Sbjct: 84 EYKADHDKQWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVAVGGSHHNYY 142
Query: 207 MYHGGTNFGRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAI-KLCSRPLLTGTQ 265
M++GG + + AA + Y D GL EPK HL+ LH + KL + +
Sbjct: 143 MWYGGNHMAQWGAASLTNAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNGELMQVEDR 202
Query: 266 NVISLGQLQEAF-VFEETSGVCAAFLVNNDERKA-----VTVLFRNISYELP-RKSISIL 318
+ + QL+ V+E T+G+ AFL R A V V + +Y + R+ + +
Sbjct: 203 HSVMPVQLENGVEVYEWTAGL--AFL----HRPACSGSPVEVHYAKATYSIACREVLVVD 256
Query: 319 PDCKTVAFNTERV 331
P TV F T V
Sbjct: 257 PSSSTVLFATASV 269
>gi|432954511|ref|XP_004085513.1| PREDICTED: beta-galactosidase-like [Oryzias latipes]
Length = 653
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 128/297 (43%), Gaps = 57/297 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ IQTY+ WN HE G Y+FSG D+ F+K Q GL V LR GP
Sbjct: 61 WKDRLVKMYMAGLNAIQTYIPWNYHEESPGMYNFSGDRDVEYFLKLAQDIGLLVILRPGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL IV RS + Y + ++P ++ G P +
Sbjct: 121 YICAEWEMGGLPAWLLSKKDIVLRSSDPDYVAAVDTWMGKLLPMMKPYLYQNGGPIITVQ 180
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVINACNGMRCGETFK--- 154
+ + A D+ H G V+ D A N ++CG
Sbjct: 181 VENEYGSYFACDYNYMRHLTKLFRSHLGEDVVLFTTDGA------GLNYLKCGAIQGLYA 234
Query: 155 ----GPNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
GP S P+ P + +E +T + WG + + S +A + +A
Sbjct: 235 TVDFGPGSNITAAFEAQRHAEPHGPLVNSEFYTGWLDHWGSRHSVVSPDLVAKSLNQQLA 294
Query: 198 KNGSYVNYYMYHGGTNFG-----RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
G+ VN YM+ GGTNFG + + T Y APL E G + E K+ ++E+
Sbjct: 295 M-GANVNMYMFIGGTNFGYWNGANSPYSAQPTSYDYDAPLTEAGDLTE-KYFAIREV 349
>gi|297194215|ref|ZP_06911613.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|197722531|gb|EDY66439.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 590
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 116/296 (39%), Gaps = 62/296 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP + + GLD ++TYV WNLHEP+ G+YDF G D+ RF+ + GL+ +R P
Sbjct: 33 WPHRLRMLRAMGLDTVETYVPWNLHEPRPGEYDFDGIADLDRFLHATREAGLHAIVRPSP 92
Query: 62 FIESEWTYGGLPIW-LHDVAGIVFRSDNKPY----------------------------- 91
+I +EW GGLP W L D R + Y
Sbjct: 93 YICAEWENGGLPWWLLADPEVGALRCQDPAYLAHVDRWFDRLIPVVAAHQVSRGGNVLMV 152
Query: 92 KIENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVI 141
++ENEY + + +G L+ + DF PG +
Sbjct: 153 QVENEYGSYGTDTGYLEHLAAGLRARGIDVPLFTSDGPDDF-------FLTGGALPGHLA 205
Query: 142 NACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
G R E P+ P++ E W ++ WG +R D A + +A
Sbjct: 206 TVNFGSRPKEALADLARLRPDDPAMCMEFWCGWFDHWGTDHVVRDPADAAGVLEELLAA- 264
Query: 200 GSYVNYYMYHGGTNFGRTAAAF------------MITGYYDQAPLDEYGLVREPKW 243
G+ VN YM HGGTNF A A +T Y AP+DE G E W
Sbjct: 265 GASVNVYMAHGGTNFSTWAGANTEDPAAGTGYRPTVTSYDYDAPVDERGAATEKFW 320
>gi|355567243|gb|EHH23622.1| hypothetical protein EGK_07120 [Macaca mulatta]
Length = 653
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 131/297 (44%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 223
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCG--------E 151
F+ G+ ++ D + G+
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKNVLSGHTKGVLAAINLQKVQRN 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFNQLHKVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATNFGKHTGIVTSYDYDAVLTEAGDYTE-KYFKLQKLLESVSATPLP 398
>gi|423346501|ref|ZP_17324189.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
gi|409219652|gb|EKN12612.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
Length = 780
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 124/294 (42%), Gaps = 53/294 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y FWN+HE + G++DF G+NDI F + Q +G+Y+ LR GP
Sbjct: 64 WQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFKGQNDIAAFCRLAQKEGMYIMLRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ SEW GGLP WL I R+ N PY
Sbjct: 124 YVCSEWEMGGLPWWLLKKEDIKLRT-NDPYFLERTKLFMNEIGKQLADLQVTRGGNIIMV 182
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQD--------DAPGPVINA 143
++ENEY +K + A A F T VP C D IN
Sbjct: 183 QVENEYGAYAT---DKAYIANIRDAVKAAGF-TDVPLFQCDWSSTFQLNGLDDLVWTINF 238
Query: 144 CNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G FK P+ P + +E W+ ++ WG K R A + + + ++ S
Sbjct: 239 GTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAGVMVSGIKDMLDRHIS 298
Query: 202 YVNYYMYHGGTNFGR------TAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
+ + YM HGGT FG A + M + Y AP+ E G PK+ L+EL
Sbjct: 299 F-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWAT-PKYYKLREL 350
Score = 41.2 bits (95), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYW 559
+Y+TTF D + L++Q+ GKG WVNG+++GR+W
Sbjct: 533 AYYRTTFELDEVGD-VFLDMQTWGKGMVWVNGKAMGRFW 570
>gi|242078605|ref|XP_002444071.1| hypothetical protein SORBIDRAFT_07g006925 [Sorghum bicolor]
gi|241940421|gb|EES13566.1| hypothetical protein SORBIDRAFT_07g006925 [Sorghum bicolor]
Length = 147
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/153 (39%), Positives = 85/153 (55%), Gaps = 8/153 (5%)
Query: 594 YHVPRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRG 653
YHVP FL+P N +VL E+ G+P I+ V V+ H + SW +Q
Sbjct: 1 YHVPCLFLQPGNNDIVLFEQFGGDPSKISFVIRQTGSVIAQVSEEHPAQIDSWNSSQQT- 59
Query: 654 DTDIKKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERAC 712
++++G P ++ CP G+ IS I FASFG P G C Y+ G C S + VV+ AC
Sbjct: 60 ---MQRYG--PELRLECPKDGQVISSIKFASFGTPSGTCRSYSHGECSSIQAISVVQEAC 114
Query: 713 IGKSRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
IG S CS+P+ S YF G+P G+ K+L V+A C
Sbjct: 115 IGVSNCSVPVSSNYF-GNPWTGVTKSLAVEAAC 146
>gi|344288159|ref|XP_003415818.1| PREDICTED: beta-galactosidase-like [Loxodonta africana]
Length = 570
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/292 (29%), Positives = 126/292 (43%), Gaps = 47/292 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTY+ WN HEP GQY FS +D+ FI+ GL V LR GP
Sbjct: 49 WKDRLLKMKMAGLNAIQTYIPWNFHEPLPGQYQFSDDHDVEHFIQLTHEIGLLVILRPGP 108
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + ++P ++ G P +
Sbjct: 109 YICAEWDMGGLPAWLLEKQSIVLRSSDPYYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 168
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + D+ H G ++ D A ++ G+ F G
Sbjct: 169 VENEYGSYFTCDYDYLRFLQKCFHSHLGDDVLLFTTDGARESLLQCGTLQGLYATVDF-G 227
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P S P P + +E +T + W G+P+ R + + + G+
Sbjct: 228 PVSNITAAFQTQRRTEPRGPLVNSEFYTGWLDHW-GQPHSRVSTEAVTSALYNMLALGAN 286
Query: 203 VNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
VN YM+ GGTNF T A T Y APL E G + E K+ ++E+
Sbjct: 287 VNLYMFTGGTNFAYWNGANTPYAAQPTSYDYDAPLTEAGDLTE-KYFAVREI 337
>gi|426249767|ref|XP_004018620.1| PREDICTED: beta-galactosidase [Ovis aries]
Length = 634
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 128/291 (43%), Gaps = 45/291 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HE Q G+Y+FSG +D+ FI+ GL V LR GP
Sbjct: 52 WKDRLLKMKMAGLNAIQTYVAWNFHELQPGRYNFSGDHDVEHFIQLAHELGLLVILRPGP 111
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + + P ++ G P +
Sbjct: 112 YICAEWDMGGLPAWLLEKKSIVLRSSDPDYLAAVDKWLGVLLPKMRPLLYKNGGPIITVQ 171
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFK- 154
+ + + D+ H G ++ D + A G+ F
Sbjct: 172 VENEYGSYYSCDYDYLRFLQKRFQDHLGEDVLLFTTDGVNEEFLQCGALQGLYATVDFST 231
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + S++ +AF + +A G+ V
Sbjct: 232 GSNLTAAFMLQRKFEPRGPLINSEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLAL-GANV 290
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GG+NF T T Y APL E G + E K+ L+++
Sbjct: 291 NMYMFIGGSNFAYWNGANTPYQPQPTSYDYDAPLSEAGDLTE-KYFALRDI 340
>gi|384939972|gb|AFI33591.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
gi|387541294|gb|AFJ71274.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
Length = 653
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 131/297 (44%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 223
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCG--------E 151
F+ G+ ++ D + G+
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKNVLSGHTKGVLAAINLQKVQRN 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFNQLHKVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATNFGKHTGIVTSYDYDAVLTEAGDYTE-KYFKLQKLLESVSATPLP 398
>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
Length = 587
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 130/311 (41%), Gaps = 60/311 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TY+ WN HEP +G+++FSG DI FI GL+V +R P
Sbjct: 35 WEDRLLKLKACGLNTVETYIPWNWHEPDEGRFNFSGMADIEAFITLAGKLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL + R + + +
Sbjct: 95 YICAEWEFGGLPAWLLQDPHMQLRCLDPKFLKKVDAYYDELIPRLVPLLSTNGGPIIAVQ 154
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY Q ++ A +G +L+ + D M + PG
Sbjct: 155 IENEYGSYGNDTAYLQYLQEALIARGVDVLLFTSDGPTD-------GMLQGGTVPGVTAT 207
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G R E F P + E W ++ W + R ++D A A +A G
Sbjct: 208 VNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMKPHHTRDSEDAASVFAEMLAL-G 266
Query: 201 SYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKEL---H 250
+ VN+YM+HGGTNFG A IT Y APL E G V K+ ++++ H
Sbjct: 267 ASVNFYMFHGGTNFGFYNGANYHDKYEPTITSYDYDAPLSECGDVTT-KYEAVRQVIAKH 325
Query: 251 AAIKLCSRPLL 261
++L P L
Sbjct: 326 QGVELGDLPAL 336
>gi|390476463|ref|XP_003735126.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Callithrix
jacchus]
Length = 657
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/320 (28%), Positives = 134/320 (41%), Gaps = 48/320 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNTIQTYVPWNFHEPYPGQYQFSEEHDVEYFLRLAHELGLLVVLRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHEKFLRCGALQGLYATVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A +G+ V
Sbjct: 245 GSNVTDAFQTQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLHDILA-HGASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
N YM+ GGTNF + A T Y APL E G + E + + K+
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTEKYFALRDVIRKFEKVPEG 363
Query: 259 PLLTGTQNV----ISLGQLQ 274
P+ T ++LG+L+
Sbjct: 364 PIPPSTPKFAYGKVTLGKLK 383
>gi|402895880|ref|XP_003911040.1| PREDICTED: beta-galactosidase-1-like protein 3 [Papio anubis]
Length = 653
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 131/297 (44%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 104 WRDRLLKLRACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGP 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKGFTEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 223
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCG--------E 151
F+ G+ ++ D + G+
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKNVLSGHTKGVLAAINLQKVQRN 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFNQLHKVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVERAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATNFGKHTGIVTSYDYDAVLTEAGDYTE-KYFKLQKLLESVSATPLP 398
>gi|62897743|dbj|BAD96811.1| galactosidase, beta 1 variant [Homo sapiens]
Length = 677
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 121/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A ++ A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTLLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|315647882|ref|ZP_07900983.1| Beta-galactosidase [Paenibacillus vortex V453]
gi|315276528|gb|EFU39871.1| Beta-galactosidase [Paenibacillus vortex V453]
Length = 587
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 119/276 (43%), Gaps = 42/276 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WNLHEP++GQ+ F G D+ F+++ GL+V LR P
Sbjct: 36 WEDRLMKLKACGFNTVETYIPWNLHEPKEGQFTFDGIADLEGFVQKAGHLGLHVILRPSP 95
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYV--- 112
+I +EW +GGLP WL I R + Y K+++ Y I P KG P +
Sbjct: 96 YICAEWEFGGLPAWLLQYPDIHLRCMDPVYLEKVDHYYDELIPRIVPLLTSKGGPVIAIQ 155
Query: 113 ---------------------LWAAKMAVDFHT--GVPWVMCKQDDAPGPVINACNGMRC 149
L A + V T G M + P + G R
Sbjct: 156 IENEYGSYGNDTAYLEYLKDGLSARGVDVLLFTSDGPTDGMLQGGTVPNVLATVNFGSRP 215
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
GE F P + E W ++ W + RS++++A + N S VN+YM
Sbjct: 216 GEAFAKLREYRTEDPLMCMEYWNGWFDHWLKPHHTRSSEEVAQVFEEMLRLNAS-VNFYM 274
Query: 208 YHGGTNFGRTAAAF-------MITGYYDQAPLDEYG 236
+HGGTNFG A +T Y APL E G
Sbjct: 275 FHGGTNFGFYNGANDQEKYEPTVTSYDYDAPLSECG 310
>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
Length = 580
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 113/260 (43%), Gaps = 50/260 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN+HEP+ GQ++F G D++ FI+ Q L V +R P
Sbjct: 35 WEDRLRKVKAMGCNCVETYIAWNVHEPRDGQFNFDGIADVVEFIRIAQRVDLLVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN------------------KPY-----------K 92
+I +EW +GG+P WL I R + KP +
Sbjct: 95 YICAEWEFGGMPAWLLK-EDIRLRCSDPRFLEKVSAYYDALIPQLKPLLSTSGGPIIAVQ 153
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY Q + E+G +L+ + D M + G +
Sbjct: 154 IENEYGSYGNDQAYLQALRNMLVERGIDVLLFTSDGPAD-------DMLQGGMTEGVLAT 206
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
G R E F PN P + E W ++ W + + RSA+D A V + G
Sbjct: 207 VNFGSRPKEAFGKLEEYQPNAPLMCMEYWNGWFDHWFEEHHTRSAEDAA-QVLDEMLSMG 265
Query: 201 SYVNYYMYHGGTNFGRTAAA 220
+ VN+YM HGGTNFG ++ A
Sbjct: 266 ASVNFYMLHGGTNFGFSSGA 285
>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1106
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 127/295 (43%), Gaps = 51/295 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN HEPQ G YDF+ +ND+ F + Q +YV LR GP
Sbjct: 381 WDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQNDLAEFCRLCQQNDMYVILRPGP 440
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEK--------GPPYVL 113
++ +EW GGLP WL + R ++ PY IE E A ++ G P ++
Sbjct: 441 YVCAEWEMGGLPWWLLKKKDVRLR-ESDPYFIE-RVALFEEAVAKQVKDLTIANGGPIIM 498
Query: 114 WAAK-------------------MAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETF- 153
+ + +F G+ C D A +N + + F
Sbjct: 499 VQVENEYGSYGEDKGYVSQIRDIVRANFGNGIALFQC--DWASNFTLNGLDDLIWTMNFG 556
Query: 154 KGPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G N PN P + +E W+ ++ WG R A D+ + +++ S+
Sbjct: 557 TGANVDQQFAKLKQLRPNSPLMCSEFWSGWFDKWGANHETRPAADMIKGIDDMLSRGISF 616
Query: 203 VNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+ YM HGGTN+G A A +T Y AP+ E G PK+ L+E A
Sbjct: 617 -SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTT-PKYWALREAMA 669
>gi|336428330|ref|ZP_08608312.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336005980|gb|EGN36021.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 583
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 119/271 (43%), Gaps = 49/271 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TYV WN+HEPQKG++ F G DI RFI Q GLYV +R P
Sbjct: 36 WRDRLEKLKAMGANTVETYVPWNMHEPQKGKFVFEGMLDISRFILLAQELGLYVIVRPSP 95
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF-----HEKGP----- 109
+I +EW +GGLP WL G+ R +P+ + Y + P H GP
Sbjct: 96 YICAEWEFGGLPAWLLKEDGMRLRGCYEPFLEAVREYYSVLFPILVPLQIHHGGPVILMQ 155
Query: 110 ------------PYVLWAAKMAVDFHTGVPWVMCK--QDDA------PGPVINACNGMRC 149
Y+ ++ +D VP V D++ PG + G +
Sbjct: 156 VENEYGYYGDDTRYMETMKQLMLDNGAEVPLVTSDGPMDESLSCGRLPGVLPTGNFGSKT 215
Query: 150 GETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIR-----SAQDIAFHVALFIAKNGSY 202
E F+ + P + TE W ++ WG ++R S +D+ + + +
Sbjct: 216 EERFEVLKKYTEGGPLMCTEFWVGWFDHWGNGGHMRGNLEESTKDLDKMLEM------GH 269
Query: 203 VNYYMYHGGTNFGRTAAAFMITGYYDQAPLD 233
VN YM+ GGTNFG + YYD+ D
Sbjct: 270 VNIYMFEGGTNFGFMNG----SNYYDELTPD 296
>gi|354490770|ref|XP_003507529.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Cricetulus griseus]
Length = 689
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 118/266 (44%), Gaps = 49/266 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI+ GL+V LR GP
Sbjct: 131 WRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQLAAKIGLWVILRPGP 190
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL + R+ + +
Sbjct: 191 YICSEIDLGGLPSWLLQDPNMKLRTTYYGFTKAVDLYFDHLMSRVVPLQYKHGGPIIAVQ 250
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNG------ 146
+ENEY + + K Y+ + K D G+ ++ D+ G +G
Sbjct: 251 VENEYGS-----YYKDHAYMPYIKKALED--RGIIEMLLTSDNKDGLQKGVVSGVLATIN 303
Query: 147 MRCGETFKGPNSP------NKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
++ + K +S +P + E WT ++ WGG I + ++ V+ I K+G
Sbjct: 304 LQSQQELKALSSVLLSIQGIQPKMVMEYWTGWFDSWGGPHNILDSSEVLQTVSAII-KSG 362
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGY 226
S +N YM+HGGTNFG A Y
Sbjct: 363 SSINLYMFHGGTNFGFINGAMHFNDY 388
>gi|256424388|ref|YP_003125041.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256039296|gb|ACU62840.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 586
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 120/277 (43%), Gaps = 43/277 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRN-DIIRFIKEIQSQGLYVCLRIG 60
W I AK G + I YVFWN HE ++G++DF+ N DI+ FIK +Q +G++V LR G
Sbjct: 43 WRHRIQMAKAMGCNTIAAYVFWNYHEQEEGKFDFTSENRDIVAFIKMVQEEGMWVMLRPG 102
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY------QTIEPAFHEKGPP---- 110
P++ +EW +GGLP +L + I R + Y E + ++P G P
Sbjct: 103 PYVCAEWEFGGLPPYLLRIPDIKVRCMDPRYIAATERYIKALSEEVKPLQITNGGPIVMV 162
Query: 111 --------------YVLWAAKMAVDFHTGVPW--------VMCKQDDAPGPVINACNGMR 148
Y+L M V VP+ + + PG I +G
Sbjct: 163 QVENEYGSFGNDREYMLKVKDMWVQNGINVPFYTADGPVSALLEAGSVPGAAIGLDSGSS 222
Query: 149 CGETFKGP-NSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
G+ +P+ PS +E + + WG K I V + S+ N Y+
Sbjct: 223 EGDFAAAEKQNPDVPSFSSESYPGWLTHWGEKWARPDKAGIVKEVKFLMDTKRSF-NLYV 281
Query: 208 YHGGTNFGRTAAAFM--------ITGYYDQAPLDEYG 236
HGGTNFG TA A +T Y AP++E G
Sbjct: 282 IHGGTNFGFTAGANSGGKGYEPDLTSYDYDAPINEQG 318
>gi|241156773|ref|XP_002407847.1| beta-galactosidase precursor, putative [Ixodes scapularis]
gi|215494239|gb|EEC03880.1| beta-galactosidase precursor, putative [Ixodes scapularis]
Length = 388
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 125/291 (42%), Gaps = 66/291 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ +QTY+ W+ HEP+ GQYDF G+ DI++FIK + G V LR GP
Sbjct: 66 WEDRLTTMKTAGLNTLQTYIEWSSHEPENGQYDFEGQEDIVKFIKIAERLGFLVILRPGP 125
Query: 62 FIESEWTYGGLPIWLHDVAGIV-FRSDNKPY----------------------------- 91
FI++E GG P WL V RS ++ Y
Sbjct: 126 FIDAERDMGGFPYWLLSEDNTVRLRSSDQRYLKYVDRYFSKLLPLLKPLLYSNGGPVLML 185
Query: 92 KIENEYQTIEPAFHE----------------KGPPYVLWAAKMAVDFHTGVPWVMCKQDD 135
++ENEY + +HE GP +L+ G ++ C ++D
Sbjct: 186 QVENEYGS----YHECDFVYTAHLKDLMRRHLGPDVLLYTTD-----GNGDRYLKCGKND 236
Query: 136 APGPVINACNGMRCGETFKGP--NSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVA 193
++ G +F + P + +E ++ + WG K + +A +A +
Sbjct: 237 GAYTTVDFGPGSDVVASFAAQRRHQDRGPLMNSEFYSGWLDNWGDKHWEGNASAVAETLR 296
Query: 194 LFIAKNGSYVNYYMYHGGTNFGRTAAAFMITGYYD--------QAPLDEYG 236
+ N S VN Y++HGG++FG TA A + G Y AP++E G
Sbjct: 297 EMLTMNAS-VNIYVFHGGSSFGCTAGANLDKGVYSPNPTSYDYDAPMNEAG 346
>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
Length = 651
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 130/300 (43%), Gaps = 48/300 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ IQTYV WN HE G Y+FSG D+ F+K Q GL V LR GP
Sbjct: 59 WKDRLLKMYMAGLNAIQTYVPWNYHEEVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGP 118
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL IV RS + Y + I+P ++ G P +
Sbjct: 119 YICAEWDMGGLPAWLLKKKDIVLRSTDPDYIAAVDKWMGKLLPMIKPYLYQNGGPIITVQ 178
Query: 114 ----WAAKMAVDFH------------------------TGVPWVMCKQDDAPGPVINACN 145
+ + A D++ G+ ++ C ++
Sbjct: 179 VENEYGSYFACDYNYMRHLSKLFRSYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGP 238
Query: 146 GMRCGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G F+ P+ P + +E +T + WG + + S +A ++ + G+ V
Sbjct: 239 GANVTAAFEPQRQVQPHGPLVNSEFYTGWLDHWGSRHSVVSPTQVAKALSEMLLM-GANV 297
Query: 204 NYYMYHGGTNFG-----RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSR 258
N YM+ GGTNFG T A T Y APL E G + E K+ ++E+ IK+ S+
Sbjct: 298 NLYMFIGGTNFGYWNGANTPYAAQPTSYDYDAPLTEAGDLTE-KYFAIREV---IKMYSK 353
>gi|346320352|gb|EGX89953.1| beta-calactosidase, putative [Cordyceps militaris CM01]
Length = 633
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 124/288 (43%), Gaps = 57/288 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + A+ GL+ I +Y++WNLHEP+ G +DFSGRND+ RF + Q +GL V LR GP
Sbjct: 60 WTHRLKMARAMGLNTIFSYLYWNLHEPRPGAWDFSGRNDVARFFRLAQQEGLRVVLRPGP 119
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I E +GG P WL V G+ R +N+P+ +
Sbjct: 120 YICGERDWGGFPAWLSQVPGMAVRQNNRPFLDAAKSYIDRLGKELGQLQITQGGPILMAQ 179
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHT--------GVPWVMCKQDDAPGPVI--N 142
+ENEY + F AA + +F G ++ Q VI +
Sbjct: 180 LENEYGS----FGTDKTYLAALAAMLRENFDVFLYTNDGGGQSYLEGGQLHGVLAVIDGD 235
Query: 143 ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGK-PYIR---SAQDIAFHVALF--I 196
+ +G + + + P + E + S+ WG P+ + S D+A VA
Sbjct: 236 SQSGFAARDKYVTDPTSLGPQLNGEYYISWIDQWGSDYPHQQIAGSQADVAKAVADLDWT 295
Query: 197 AKNGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYG 236
G + YM+HGGTNFG A M T Y APLDE G
Sbjct: 296 LAGGYSFSIYMFHGGTNFGFENGGIRDDGPLAAMTTSYDYGAPLDESG 343
>gi|410036675|ref|XP_003950098.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Pan
troglodytes]
gi|410223432|gb|JAA08935.1| galactosidase, beta 1 [Pan troglodytes]
gi|410267410|gb|JAA21671.1| galactosidase, beta 1 [Pan troglodytes]
gi|410289952|gb|JAA23576.1| galactosidase, beta 1 [Pan troglodytes]
gi|410336943|gb|JAA37418.1| galactosidase, beta 1 [Pan troglodytes]
Length = 677
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|325922356|ref|ZP_08184130.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
gi|325547138|gb|EGD18218.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
Length = 613
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 112/268 (41%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DF+G ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFAGNNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY------QTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + + P + G P +
Sbjct: 123 YTCAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQAYLDAVSKQVHPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYDDDHAYMADNRAMYVKAGFDDALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
P +P + E W ++ W GKP+ + +I + G N YM
Sbjct: 243 KTAFEKLIKFRPEQPRMVGEYWAGWFDHW-GKPHASTDAKQQTEEFEWILRQGHSANLYM 301
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM + P D Y
Sbjct: 302 FIGGTSFG-----FMNGANFQGNPSDHY 324
>gi|119584849|gb|EAW64445.1| galactosidase, beta 1, isoform CRA_d [Homo sapiens]
Length = 500
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|384428898|ref|YP_005638258.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
gi|341938001|gb|AEL08140.1| beta-galactosidase [Xanthomonas campestris pv. raphani 756C]
Length = 613
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 140/334 (41%), Gaps = 64/334 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DF+ ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ Q + P + G P +
Sbjct: 123 YACAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQSYLDAVAQQVRPLLNHNGGPIIAVQ 182
Query: 113 -------------------------------LWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
L+ + A G +P + + APG
Sbjct: 183 VENEYGSYDDDHAYMADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAPGEA 242
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+A + + + P++P + E W ++ W G P+ + +I + G
Sbjct: 243 KSAFDKLIKFQ-------PDQPRMVGEYWAGWFDHW-GTPHASTNAKQQTEELEWILRQG 294
Query: 201 SYVNYYMYHGGTNFG-RTAAAF----------MITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GGT+FG A F T Y A LDE G PK+ ++++
Sbjct: 295 HSANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYDAILDEAGRP-TPKFALMRDV 353
Query: 250 HAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETS 283
+ P L I++ L++A + E S
Sbjct: 354 ITRVTGVQPPALPAP---IAMAALKDAPLRESAS 384
>gi|326332570|ref|ZP_08198838.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325949571|gb|EGD41643.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 603
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 118/289 (40%), Gaps = 58/289 (20%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + + G + + TYV WN HEP +G DF+G D+ RF+ GL V +R G
Sbjct: 35 LWEDRLRRVAATGFNTVDTYVAWNFHEPDEGSPDFTGPRDLARFVTIAGDLGLDVIVRPG 94
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I +EWT GGLP WL RS + Y
Sbjct: 95 PYICAEWTNGGLPSWL-TARTRAPRSSDPVYQDAVTRWLDVLLPRLVPLQAGHGGPVVAV 153
Query: 92 KIENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVI 141
++ENEY + + A ++G +L+ A D VM G +
Sbjct: 154 QLENEYGSYGDDAAHLVWLRQALLDRGVTELLYTADGPTD-------VMLDAGMVEGTLA 206
Query: 142 NACNGMRCGE--TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
A G R E T P +P + E W ++ WG ++RS + A + +
Sbjct: 207 AATFGSRATEAATKLSARRPGEPFLCAEFWNGWFDHWGENHHVRSPESAAATLREIVDLG 266
Query: 200 GSYVNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVRE 240
GS V+ YM HGGTNFG A + +T Y AP+ E G V E
Sbjct: 267 GS-VSVYMAHGGTNFGLWAGSNHDGRRIQPTVTSYDSDAPVGEDGRVSE 314
>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
Length = 1104
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 120/289 (41%), Gaps = 43/289 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HEPQ G +DF+G+ND+ F + + +YV LR GP
Sbjct: 380 WDQRIKLCKALGMNTICLYVFWNSHEPQPGVFDFTGQNDLAEFCRLCRQNDMYVILRPGP 439
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIEN--------EYQTIEPAFHEKGPPYVL 113
++ +EW GGLP WL I R ++ PY IE Q + GP ++
Sbjct: 440 YVCAEWEMGGLPWWLLKKKDIRLR-ESDPYFIERVGIFEKAVAEQVADMTIQNGGPIIMV 498
Query: 114 WAAKMAVDFHTGVPWVMCKQD----DAPGPVINACN---------------------GMR 148
+ +V +D + PG + C+ G
Sbjct: 499 QVENEYGSYGEDKGYVSQIRDIVRANYPGVTLFQCDWASNFTKNGLHDLVWTMNFGTGAN 558
Query: 149 CGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYY 206
+ F P+ P + +E W+ ++ WG R A D+ + ++K S+ + Y
Sbjct: 559 IDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLY 617
Query: 207 MYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
M HGGTN+G A A +T Y AP+ E G W K L
Sbjct: 618 MTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTTPKYWELRKTL 666
>gi|350588684|ref|XP_003130139.3| PREDICTED: galactosidase, beta 1-like 3 [Sus scrofa]
Length = 656
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/299 (29%), Positives = 126/299 (42%), Gaps = 41/299 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 106 WRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDMEAFILLAAEVGLWVILRPGP 165
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-KIENEY------QTIEPAFHEKGPPYVLW 114
+I SE GGLP L R+ N + + +EY + + + + GP +
Sbjct: 166 YICSEIDLGGLPSRLLQDPTSQLRTTNHSFIEAVDEYLDHLIARVVPLQYRKGGPIIAVQ 225
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS- 158
FH G+ ++ D+ + G+ K
Sbjct: 226 VENEYGSFHKDEAYMPYLHKALLKRGIVELLLTSDNTNEVLKGHIKGVLATVNMKSFKEG 285
Query: 159 ---------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
NKP + E W ++ WG K +R A D+ + FI S+ N YM+H
Sbjct: 286 EFKDLYQVQSNKPILIMEFWVGWFDTWGNKHAVRDAIDVENTIFDFIRLEISF-NVYMFH 344
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLL 261
GGTNFG A ++T Y A L E G PK+ L+EL +I + P L
Sbjct: 345 GGTNFGFMNGATYFEQHRGVVTSYDYDAVLTEAG-DYTPKFFKLRELFKSIFVTPLPAL 402
>gi|32709094|gb|AAP86763.1| beta-galactosidase Gal35I [Xanthomonas campestris pv. campestris]
Length = 613
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 113/269 (42%), Gaps = 40/269 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DF+ ND+ F++E +QGL V LR GP
Sbjct: 63 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNANNDVAAFVREAAAQGLNVILRPGP 122
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEY------QTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + + P + G P +
Sbjct: 123 YACAEWETGGYPAWLFGKDNIRVRSRDPRFLAASQAYLDAVSKQVHPLLNHNGGPIIAVQ 182
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 183 VENEYGSYDDDHAYMADNRAMYVKAGFDDALLFTSDGADMLANGTLPDTLAVVNFAPGEA 242
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ P++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 243 KSAFDKLIKFRPDQPRMVGEYWAGWFDHW-GKPHASTDAKQQTEELEWILRQGHSANLYM 301
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEYG 236
+ GGT+FG FM + P D Y
Sbjct: 302 FIGGTSFG-----FMNGANFQGNPSDHYA 325
>gi|426339862|ref|XP_004033858.1| PREDICTED: beta-galactosidase isoform 1 [Gorilla gorilla gorilla]
Length = 677
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRRHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|332215477|ref|XP_003256871.1| PREDICTED: beta-galactosidase isoform 1 [Nomascus leucogenys]
Length = 677
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLECGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|431919325|gb|ELK17922.1| Beta-galactosidase-1-like protein 3 [Pteropus alecto]
Length = 1113
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 133/313 (42%), Gaps = 69/313 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEPQ+G +DFS D+ F+ GL+V LR GP
Sbjct: 653 WRDRLLKLKACGFNTVTTYVPWNLHEPQRGAFDFSENLDLEAFVLMAAEIGLWVILRPGP 712
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL + + R+ ++ + +
Sbjct: 713 YICSEIDLGGLPSWLLQDSNVRLRTTDQGFVEAVDKYFDHLIARVVPLQYRQGGPIIAVQ 772
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTG-VPWVMCK------QDD 135
+ENEY + I+ A ++G +L + + G + V+ Q+D
Sbjct: 773 VENEYGSFDKDKYYMPYIQQALLKRGIVELLLTSDAKTEVLKGYIKGVLAAINIEKFQND 832
Query: 136 APGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF 195
A P+ N NKP + E W ++ WG + ++ AQD+ V+ F
Sbjct: 833 AFEPLYNI--------------QKNKPILVMEYWVGWFDKWGDEHNVKDAQDVENTVSEF 878
Query: 196 IAKNGSYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKE 248
I S+ N YM+HGGTNFG A + T Y A L E G E K+ L++
Sbjct: 879 IKFEISF-NVYMFHGGTNFGFINGATNFGKHKSIATSYDYDAVLTEAGDYTE-KYFKLRK 936
Query: 249 LHAAIKLCSRPLL 261
L ++ P L
Sbjct: 937 LFGSVLALPLPHL 949
Score = 70.1 bits (170), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 105/259 (40%), Gaps = 37/259 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + +V W+ HEPQ+ ++ F+G D+ FI ++GL+V L GP
Sbjct: 80 WKDRLLKLKACGFNTVTMHVPWSHHEPQRHKFYFTGDLDLRAFISIASNEGLWVILCPGP 139
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-KIENEY-----QTIEPAFHEKGPP----- 110
+I S+ GGLP WL + R+ K + K N+Y I P +E P
Sbjct: 140 YIGSDLDLGGLPSWLLQDPKMKLRTTYKGFTKAVNQYFDQLIPRIAPFQYENYGPIIAVQ 199
Query: 111 -------------YVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRC-------- 149
Y+ + K V G+ ++ DD + N +
Sbjct: 200 VENEYGSYHLDKRYMSYVKKALVK--RGIKAMLMTADDGQEIIRGYLNKVIATVHMKNIK 257
Query: 150 GETFKGPNSPN--KPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
ET+K S P + TS WG + + + +V S+ N+YM
Sbjct: 258 KETYKNLFSIQGLSPILMMVYTTSSSDSWGHSHHTLDSHVLMKNVHEMFNLRFSF-NFYM 316
Query: 208 YHGGTNFGRTAAAFMITGY 226
+HGGTNFG A + Y
Sbjct: 317 FHGGTNFGFIGGASSLNSY 335
>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
Length = 588
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 114/258 (44%), Gaps = 49/258 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ ++TY+ WN+HEPQ+GQ+ F R DI +F+K QS GLYV LR P
Sbjct: 36 WRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVFEDRYDIGKFVKLAQSIGLYVILRPSP 95
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQ----TIEPAFHEKGPPYVLWA 115
+I +EW +GGLP WL +V RS+ + K+ N Y+ + P G P ++
Sbjct: 96 YICAEWEFGGLPAWLLRYPDMVVRSNTPRFMEKVANYYEALFKVLVPLQITHGGPVLM-- 153
Query: 116 AKMAVDFHTG----------------------VPWVMC----KQDDAPGPVIN------A 143
M V+ G VP +Q G +I A
Sbjct: 154 --MQVENEYGSFGNDKAYLRHVKSLMETNGVDVPLFTADGSWQQALKAGSLIEDDVFVTA 211
Query: 144 CNGMRCGET------FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
G + E F + N P + E W ++ W + RSA +A +
Sbjct: 212 NFGSKSRENLAELRQFMLMHHKNWPLMCMEFWDGWFNRWQEEIVTRSADSFQTDLAELVK 271
Query: 198 KNGSYVNYYMYHGGTNFG 215
+ S+ N YM+ GGTNFG
Sbjct: 272 EQASF-NLYMFRGGTNFG 288
>gi|164519026|ref|NP_001073876.2| beta-galactosidase-1-like protein 3 [Homo sapiens]
gi|269849685|sp|Q8NCI6.3|GLBL3_HUMAN RecName: Full=Beta-galactosidase-1-like protein 3
Length = 653
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 131/297 (44%), Gaps = 41/297 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR G
Sbjct: 104 WRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGR 163
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEP-----AFHEKGPPYVLW 114
+I SE GGLP WL ++ R+ NK + +E + + P + + GP +
Sbjct: 164 YICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYFDHLIPRVIPLQYRQAGPVIAVQ 223
Query: 115 AAKMAVDFHT---------------GVPWVMCKQDDAPGPVINACNGMRCG--------E 151
F+ G+ ++ D + G+ +
Sbjct: 224 VENEYGSFNKDKTYMPYLHKALLRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQD 283
Query: 152 TFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYH 209
TF + +KP + E W ++ WG K +++ A+++ V+ FI S+ N YM+H
Sbjct: 284 TFNQLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFIKYEISF-NVYMFH 342
Query: 210 GGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRP 259
GGTNFG A ++T Y A L E G E K+ L++L ++ P
Sbjct: 343 GGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSVSATPLP 398
>gi|357450859|ref|XP_003595706.1| Beta-galactosidase [Medicago truncatula]
gi|355484754|gb|AES65957.1| Beta-galactosidase [Medicago truncatula]
Length = 240
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 118/266 (44%), Gaps = 56/266 (21%)
Query: 369 AEGLLDQISAAKDASDYFWYTFRFHYNSSN--AQAPLDVQSHGHILHAFVNGEYTGSAHG 426
A LLDQ + ASDY WY N + ++ L V + G I+++++NG + G
Sbjct: 13 ASKLLDQKNVTAGASDYLWYMTEVVVNDTTVWGKSTLQVNAKGPIIYSYINGFWWGVYDS 72
Query: 427 SHDNVSFTLRNTVHLRQGTNDGALLSVTVGLPDSGAFLERKVAGVHRVRVQDKSFTNCSW 486
SF + L++GTN +LLSVT+G + F++ K
Sbjct: 73 IPSTHSFVYDEDISLKRGTNIISLLSVTLGKSNCSGFIDMK------------------- 113
Query: 487 GYQVGLIGEKLQIYSNLGLNKVLWSSIRSPTR-QLTWYKTTFRAPAGNDPIALNLQSMGK 545
+ G++G S N V W T +TWYKTTF+ P G++ + L+L + +
Sbjct: 114 --ETGIVGG-----SYPRSNGVPWIPRNVSTGVPMTWYKTTFKTPKGSNLVVLDLIGLQR 166
Query: 546 GEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTG 605
G+AWVNGQSIGRY + G S +Y Y VPR F
Sbjct: 167 GKAWVNGQSIGRYQL------GENSSFRY-------------------YAVPRPFFNKDV 201
Query: 606 NLLVLLEE--ENGNPLGITVDTIAIR 629
N LVL EE P ++VD I+I
Sbjct: 202 NTLVLFEELGLGEGPFNVSVDIISIE 227
>gi|242078615|ref|XP_002444076.1| hypothetical protein SORBIDRAFT_07g006945 [Sorghum bicolor]
gi|241940426|gb|EES13571.1| hypothetical protein SORBIDRAFT_07g006945 [Sorghum bicolor]
Length = 144
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/150 (39%), Positives = 85/150 (56%), Gaps = 8/150 (5%)
Query: 597 PRAFLKPTGNLLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTD 656
P FL+P N +VL E+ G+P I+ R VC V+ H + SW +Q
Sbjct: 1 PCLFLQPGSNDIVLFEQFGGDPSKISFVIRQTRSVCAQVSEEHPAQIDSWNSSQQ----T 56
Query: 657 IKKFGKKPTVQPSCPL-GKKISKIVFASFGNPDGDCERYAVGSCHSSHSQGVVERACIGK 715
++++ +P ++ CP G+ IS I FASFG P G C Y+ G C S+ + VV+ ACIG
Sbjct: 57 MQRY--RPELRLECPKDGQVISSIKFASFGTPSGTCGSYSHGECSSTQAISVVQEACIGV 114
Query: 716 SRCSIPLLSRYFGGDPCPGIHKALLVDAQC 745
S CS+P+ S YF G+P G+ K+L V+A C
Sbjct: 115 SNCSVPVSSNYF-GNPWTGVTKSLAVEAAC 143
>gi|359545989|pdb|3THC|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
With Galactose
gi|359545990|pdb|3THC|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
With Galactose
gi|359545991|pdb|3THC|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
With Galactose
gi|359545992|pdb|3THC|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
With Galactose
gi|359545995|pdb|3THD|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
With 1- Deoxygalactonojirimycin
gi|359545996|pdb|3THD|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
With 1- Deoxygalactonojirimycin
gi|359545997|pdb|3THD|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
With 1- Deoxygalactonojirimycin
gi|359545998|pdb|3THD|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
With 1- Deoxygalactonojirimycin
Length = 654
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 42 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 101
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 102 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 161
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 162 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 221
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 222 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 280
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 281 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 322
>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
Length = 603
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 120/284 (42%), Gaps = 53/284 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I KA+ GL+ ++TYV WN+H P++G +D SGR D+ RF+ + ++GL+ +R GP
Sbjct: 35 WADRIRKARLLGLNTVETYVAWNVHSPERGVFDTSGRRDLARFLDLVAAEGLHAIVRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EWT GGLP WL + R + +
Sbjct: 95 YICAEWTGGGLPAWLFADPEVGVRRAEPRFLEAIGEYYAALLPIVAERQVTRGGPVLMVQ 154
Query: 93 IENEYQTI---EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDD--------APGPVI 141
+ENEY P E+ Y+ A M VP Q + P +
Sbjct: 155 VENEYGAYGDDPPVERER---YLRALADMIRAQGIDVPLFTSDQANDHHLSRGSLPELLT 211
Query: 142 NACNGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
A G R E + P P + E W ++ G + + A + +A
Sbjct: 212 TANFGSRATERLAILRKHQPTGPLMCMEFWDGWFDSAGLHHHTTPPEANARDLDDLLAA- 270
Query: 200 GSYVNYYMYHGGTNFGRTAAA------FMITGYYD-QAPLDEYG 236
G+ VN YM HGGTNFG T+ A IT YD APL E+G
Sbjct: 271 GASVNLYMLHGGTNFGLTSGANDKGVYRPITTSYDYDAPLSEHG 314
>gi|179419|gb|AAA51822.1| beta-galactosidase precursor (EC 3.2.1.23) [Homo sapiens]
Length = 677
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLAFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|325914137|ref|ZP_08176490.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
gi|325539640|gb|EGD11283.1| beta-galactosidase [Xanthomonas vesicatoria ATCC 35937]
Length = 635
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 117/275 (42%), Gaps = 54/275 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFS ND+ F++E +QGL V LR GP
Sbjct: 85 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSANNDVAAFVREAAAQGLNVILRPGP 144
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+ +EW GG P WL I RS + + +
Sbjct: 145 YACAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQAYLDAVAKQVQPLLNHNGGPIIAVQ 204
Query: 93 IENEYQTIE----------PAFHEKGPPYVLWAAKMAVDF--HTGVPWVMCKQDDAPGPV 140
+ENEY + + F + G L D + +P + + APG
Sbjct: 205 VENEYGSYDDDHAYMADNRAMFVKAGFDKALLFTSDGADMLANGTLPGTLAVVNFAPGEA 264
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+A + + F+ P +P + E W ++ W G P+ + +I + G
Sbjct: 265 KSAFDKL---IKFR----PEQPRMVGEYWAGWFDHW-GTPHASTDAKQQTEELEWILRQG 316
Query: 201 SYVNYYMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
N YM+ GGT+FG FM + P D Y
Sbjct: 317 HSANLYMFIGGTSFG-----FMNGANFQGNPSDHY 346
>gi|179401|gb|AAA51819.1| beta-D-galactosidase precursor (EC 3.2.1.23) [Homo sapiens]
gi|179423|gb|AAA51823.1| beta-galactosidase precursor (EC 3.2.1.23) [Homo sapiens]
gi|13960104|gb|AAH07493.1| Galactosidase, beta 1 [Homo sapiens]
gi|30583133|gb|AAP35811.1| galactosidase, beta 1 [Homo sapiens]
gi|60655993|gb|AAX32560.1| galactosidase beta 1 [synthetic construct]
gi|123979572|gb|ABM81615.1| galactosidase, beta 1 [synthetic construct]
gi|123994391|gb|ABM84797.1| galactosidase, beta 1 [synthetic construct]
gi|189066575|dbj|BAG35825.1| unnamed protein product [Homo sapiens]
Length = 677
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 586
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 120/282 (42%), Gaps = 52/282 (18%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W I KA+ GL+ I+TYV WN H P++G +D +G D+ RF+ + ++GL+ +R G
Sbjct: 34 LWADRIRKARLMGLNTIETYVAWNAHAPERGVFDLTGNLDLGRFLDLVAAEGLHAIVRPG 93
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+I +EW GGLP WL G+ R+ Y
Sbjct: 94 PYICAEWDNGGLPAWLMATPGVGVRTAEPQYLEAIAGYYDEILAVVAPRQVTRGGPVLMV 153
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQ--DDAPG----PVIN--A 143
++ENEY + Y+ M + VP C Q D+ G P ++ A
Sbjct: 154 QVENEYGA-----YGDDADYLRALVTMMRERGIEVPLTTCDQANDEMLGRGGLPELHKTA 208
Query: 144 CNGMRCGETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
G R E + + P P + E W ++ WG + + + A + G+
Sbjct: 209 TFGSRSPERLETLRRHQPTGPLMCMEYWDGWFDSWGEQHHT-TDAAEAAADLDLLLSQGA 267
Query: 202 YVNYYMYHGGTNFGRTAAA------FMITGYYD-QAPLDEYG 236
N YM+HGGTN G T A IT YD APL E G
Sbjct: 268 SANLYMFHGGTNLGFTNGANDKGTYLPITTSYDYDAPLAEDG 309
>gi|119372308|ref|NP_000395.2| beta-galactosidase isoform a preproprotein [Homo sapiens]
gi|215273939|sp|P16278.2|BGAL_HUMAN RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName: Full=Elastin
receptor 1; Flags: Precursor
gi|119584847|gb|EAW64443.1| galactosidase, beta 1, isoform CRA_b [Homo sapiens]
Length = 677
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|397511636|ref|XP_003826176.1| PREDICTED: beta-galactosidase [Pan paniscus]
Length = 647
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 35 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 95 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 154
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 155 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 214
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 215 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 273
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 274 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 315
>gi|119372312|ref|NP_001073279.1| beta-galactosidase isoform b [Homo sapiens]
Length = 647
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 35 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 95 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 154
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 155 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 214
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 215 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 273
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 274 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 315
>gi|221043328|dbj|BAH13341.1| unnamed protein product [Homo sapiens]
Length = 725
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 113 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 172
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 173 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 232
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 233 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 292
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 293 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 351
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 352 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 393
>gi|30584585|gb|AAP36545.1| Homo sapiens galactosidase, beta 1 [synthetic construct]
gi|60652911|gb|AAX29150.1| galactosidase beta 1 [synthetic construct]
Length = 678
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|62897085|dbj|BAD96483.1| galactosidase, beta 1 variant [Homo sapiens]
Length = 677
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNAIQTYVPWNFHEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 244
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 245 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 303
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 304 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
Length = 598
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 129/304 (42%), Gaps = 53/304 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WN+HEP++G ++F G D++++++ Q GL V LR P
Sbjct: 34 WDHSLYNLKALGCNTVETYVPWNMHEPKEGIFNFEGIADLVKYVQLAQKYGLMVILRPTP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFH----EKGPP----- 110
+I +EW +GGLP WL I RS+ + K+EN Y+ + P E G P
Sbjct: 94 YICAEWEFGGLPAWLLKYKDIRVRSNTNLFLNKVENFYKVLLPMVTPLQVENGGPIIMMQ 153
Query: 111 -------------YVLWAAKMAVDFHTGVPWVMC----KQDDAPGPVIN----------- 142
YV K+ D VP ++ G +I+
Sbjct: 154 VENEYGSFGNDKEYVRNIKKLMRDLGVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGS 213
Query: 143 -ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
+ + E+F N P + E W ++ WG + R ++A V + + +
Sbjct: 214 RSNENLNELESFIKENKKEWPLMCMEFWDGWFNRWGMEIIRRDGSELAEEVKELLKR--A 271
Query: 202 YVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
+N+YM+ GGTNFG IT Y A L E+G EP + A
Sbjct: 272 SINFYMFQGGTNFGFMNGCSSRENVDLPQITSYDYDALLTEWG---EPTSKYYAVQRAIK 328
Query: 254 KLCS 257
++CS
Sbjct: 329 EVCS 332
>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
Length = 583
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 115/253 (45%), Gaps = 36/253 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + I+TYV WNLHEP++G++ F G +D+ F++ GLYV +R P
Sbjct: 35 WEDRLRKIKAMGCNCIETYVAWNLHEPREGEFHFEGMSDVAEFVRLAGELGLYVIVRPSP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYV--- 112
+I +EW +GGLP WL + R ++ + K+ Y + P KG P +
Sbjct: 95 YICAEWEFGGLPAWLLK-DDMRLRCNDPRFLEKVAAYYDALLPQLTPLLATKGGPIIAVQ 153
Query: 113 -------------LWAAKMAVDFHTGVPWVMCK----QDD------APGPVINACNGMRC 149
A+ A+ GV ++ QDD A G + G R
Sbjct: 154 IENEYGSYGNDQAYLQAQRAMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRP 213
Query: 150 GETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E F P+ P + E W ++ W + + R A+D A V + G+ VN+YM
Sbjct: 214 KEAFDKLKEYQPDGPLMCMEYWNGWFDHWFEQHHTRDAEDAA-RVLDDMLGMGASVNFYM 272
Query: 208 YHGGTNFGRTAAA 220
HGGTNFG + A
Sbjct: 273 VHGGTNFGFGSGA 285
>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
Length = 604
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 126/308 (40%), Gaps = 57/308 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLVNGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A+ F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTALFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
Query: 251 AAIKLCSR 258
S+
Sbjct: 338 EEYPALSQ 345
>gi|313240094|emb|CBY32448.1| unnamed protein product [Oikopleura dioica]
Length = 677
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/254 (30%), Positives = 115/254 (45%), Gaps = 32/254 (12%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ I Y+ WNLHE ++G +DF G D++ F GL V R GP
Sbjct: 39 WKHRLQSVVDCGLNTIDVYIPWNLHEKERGNFDFGGELDLVEFFTIAAEMGLKVLCRPGP 98
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAF----HEKGPPYVLWA 115
+I SEW +GGLP WL + RS+ Y+ + + + + P H G P + +
Sbjct: 99 YICSEWDWGGLPSWLLKDPKMHIRSNYCGYQAAVSSYFSKLLPLLAPLQHSNGGPIIAFQ 158
Query: 116 AKMAVDFHTG-----VPWV--MCKQD--------DAPGPVINACNGMRCGETFKGPNS-- 158
+ + +PW+ + K G I N ++ T P S
Sbjct: 159 VENEYGDYVDKDNEHLPWLADLMKSHGLFELFFISDGGHTIRKANMLKL--TKSTPISLK 216
Query: 159 ---PNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNF 214
PNKP + TE W ++ WG G+ + + D+ I K G+ VN+YM+HGGTNF
Sbjct: 217 SLQPNKPMLVTEFWAGWFDYWGHGRNLLNN--DVFEKTLKEILKRGASVNFYMFHGGTNF 274
Query: 215 GRTAAAFMI-TGYY 227
G A + GYY
Sbjct: 275 GFMNGAIELEKGYY 288
>gi|313238701|emb|CBY13726.1| unnamed protein product [Oikopleura dioica]
Length = 645
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 82/313 (26%), Positives = 138/313 (44%), Gaps = 63/313 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ GL+ + TYV WN HE +G+++F G ++ ++IK + GL V LR+GP
Sbjct: 78 WDQRMSNFPAAGLNTLSTYVPWNFHETYEGEFNFDGFQNLRKYIKTAEKHGLNVLLRVGP 137
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRS-------------------------DNKPYKIENE 96
+I +EW +GGLP WL G+ RS P ++ENE
Sbjct: 138 YICAEWEWGGLPAWLLTKKGMKIRSTQDEFLKATKKWLKRLIKEVEDLQFSQAPIQVENE 197
Query: 97 YQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG-----PVINACNGMRCGE 151
Y +E+ Y+ ++ +D GV ++ DD+ G P+ +A + E
Sbjct: 198 Y-----GVYEQDSSYLPSLKQILID--AGVTELLYTCDDSNGLALGTPLKDALLTINLQE 250
Query: 152 TFKGPNS------PNKPSIWTEDWTSFYQVWGGKPYI-------RSAQDIAFHVALFIAK 198
S PNKP++ E WT ++ WG K + + A D + +
Sbjct: 251 NPVDTISSLRIHQPNKPAMVAEYWTGWFDWWGEKHHTLGFPWKNKFALDKFVGTTKDLIE 310
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFM-----------ITGYYDQAPLDEYGLVREPKWGHLK 247
+ N +M+HGGTNFG + IT Y A + E G ++ PK+ ++
Sbjct: 311 QEASFNLFMFHGGTNFGFWNGGIIQGGKDNNYIPDITSYDYDALVGENGDLK-PKFMRMQ 369
Query: 248 E-LHAAIKLCSRP 259
+ + + +K+ + P
Sbjct: 370 QVMRSTLKISALP 382
>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
Length = 607
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 118/285 (41%), Gaps = 48/285 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ + Y FWN HE ++G +DF+G+ DI F++ Q +GL+V LR GP
Sbjct: 57 WRDRLRKARAMGLNAVTVYAFWNFHEEEEGHFDFTGQRDIAEFVRIAQQEGLFVILRPGP 116
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GG P WL + RS + Y +
Sbjct: 117 YVCAEWDLGGYPSWLLKSPAVNLRSLDSRYIAAADKWMKALGQQLAPLQAAKGGPILAVQ 176
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVD--FHTGVPWVMCKQDD-APGPVINACNGMRC 149
+ENEY + + Y+ +M +D F + + D A G + G+
Sbjct: 177 VENEYGSFPDSAQPNAQAYLDRVHQMVLDAGFKDSLLYTGDGADVLARGTFADLTAGIDY 236
Query: 150 GETFKGPN-------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G + PN E W ++ WG K + A I + +G
Sbjct: 237 GTGDSARSIALYKKFRPNTNIYTAEYWDGWFDHWGAKHEVVDAS-IHLKEVHDVLTSGGS 295
Query: 203 VNYYMYHGGTNFGRTAAAFM--------ITGYYDQAPLDEYGLVR 239
++ YM HGGT+FG A + +T Y AP+DE G +R
Sbjct: 296 ISLYMLHGGTSFGWMNGANIDHNHYEPDVTSYDYDAPIDEAGQLR 340
>gi|182439300|ref|YP_001827019.1| beta-galactosidase [Streptomyces griseus subsp. griseus NBRC 13350]
gi|178467816|dbj|BAG22336.1| putative beta-galactosidase [Streptomyces griseus subsp. griseus
NBRC 13350]
Length = 630
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 122/285 (42%), Gaps = 49/285 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A GL+ ++TYV WNLHEP++G+ G + RF+ ++ GL+ +R GP
Sbjct: 35 WEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG--ALGRFLDAVERAGLWAIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAFHE----KGPPYVL-- 113
+I +EW GGLP+W+ G R+ + Y+ +E ++ + P +G P VL
Sbjct: 93 YICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWFRELLPQVVRRQVSRGGPVVLVQ 152
Query: 114 ----------------WAAKMAVDFHTGVPWV--------MCKQDDAPGPVINA--CNGM 147
W A + VP M PG + A +G
Sbjct: 153 AENEYGSYGSDAVYLEWLAGLLRQCGVTVPLFTSDGPEDHMLTGGSVPGLLATANFGSGA 212
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
R G + P P + E W ++ WG +P R + A + I + G+ VN YM
Sbjct: 213 REGFAVLRRHQPGGPLMCMEFWCGWFDHWGAEPVRRDPEQAAGALRE-ILECGASVNVYM 271
Query: 208 YHGGTNFGRTAAAF------------MITGYYDQAPLDEYGLVRE 240
HGGTNFG A A +T Y AP+DEYG E
Sbjct: 272 AHGGTNFGGWAGANRSGPHQDESFQPTVTSYDYDAPVDEYGRATE 316
>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
Length = 589
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 74/254 (29%), Positives = 114/254 (44%), Gaps = 42/254 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHE ++GQ+DF+G D++ F+K+ + GL V LR GP
Sbjct: 34 WEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTGGKDLVSFVKKAEEIGLMVILRPGP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF----HEKGPP----- 110
+I +EW GGLP WL + + R D++ + K+EN ++ + P KG P
Sbjct: 94 YICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVENYFKVLLPLIVPLQVTKGGPVIMVQ 153
Query: 111 -------------YVLWAAKMAVDFHTGVP-------W---VMCKQDDAPGPVINACNGM 147
Y+ KM D VP W +M ++ A G
Sbjct: 154 VENEYGSFSNDKLYLRALKKMIEDAGIDVPLFTSDGAWEQALMSGTLIEEEVLVTANFGS 213
Query: 148 RCGETFKGPNSPNK------PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
R E F S + P + E W ++ W +R A ++ + + +
Sbjct: 214 RGNENFDVLQSFMEKHDKKWPLMCMEFWCGWFNRWNEDIILRDADEVMTCMKELLQRGS- 272
Query: 202 YVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 273 -LNLYMFHGGTNFG 285
>gi|289670687|ref|ZP_06491762.1| beta-galactosidase [Xanthomonas campestris pv. musacearum NCPPB
4381]
Length = 612
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 115/270 (42%), Gaps = 44/270 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E + GL V LR GP
Sbjct: 62 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAALGLNVILRPGP 121
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + ++P + G P +
Sbjct: 122 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALAKQVQPLLNHNGGPIIAVQ 181
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 182 VENEYGSYADDHAYMAENRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 241
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF--IAKNGSYVNY 205
+ ++P + E W ++ W GKP+ +A D F I + G N
Sbjct: 242 KSAFDKLIKFRSDQPRMVGEYWAGWFDHW-GKPH--AATDARQQADEFEWILRQGHSANL 298
Query: 206 YMYHGGTNFGRTAAAFMITGYYDQAPLDEY 235
YM+ GGT+FG FM Y P D Y
Sbjct: 299 YMFIGGTSFG-----FMNGANYQNNPSDHY 323
>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 632
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 125/299 (41%), Gaps = 53/299 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWNLHEP+ G++DFSG ++ +I+ +GL V LR GP
Sbjct: 58 WRHRMKMLKAMGLNAVATYVFWNLHEPEPGKWDFSGDRNLAEYIRIAGEEGLMVILRPGP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPP----- 110
++ +EW +GG P WL +V G+ R DN+ + +E Y+ + +G P
Sbjct: 118 YVCAEWEFGGYPWWLQNVEGMELRRDNEQFLKYTKLYLERLYKEVGKLQITQGGPIIMVQ 177
Query: 111 -------YVLWAAKMAVDFHTGVPWVMCKQDDAPG-----------------------PV 140
YV + ++ H + KQ G P
Sbjct: 178 GENEFGSYVSQRKDITLEEHRAYNAKIIKQLKEVGFDVPMFTSDGSWLFEGGYVPGALPT 237
Query: 141 INACNGMR-CGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
N N + + N P + E + + W A IA ++A N
Sbjct: 238 ANGENNIENLKKVVNQYNGGQGPYMVAEFYPGWLAHWCEPHPQVKASTIARQTEKYLA-N 296
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
G NYYM HGGTNFG T+ A +T Y AP+ E G V PK+ ++ +
Sbjct: 297 GVSFNYYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPISEAGWVT-PKFDSIRNV 354
>gi|313237466|emb|CBY12653.1| unnamed protein product [Oikopleura dioica]
Length = 948
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 134/325 (41%), Gaps = 73/325 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ I Y+ WNLHE ++G +DF G D++ F GL V R GP
Sbjct: 25 WKHRLQSVVDCGLNTIDVYIPWNLHEKERGNFDFGGELDLVEFFTIAAEMGLKVLCRPGP 84
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAF----HEKGPPYVLWA 115
+I SEW +GGLP WL + RS+ Y+ + + + + P H G P + +
Sbjct: 85 YICSEWDWGGLPSWLLKDPKMHIRSNYCGYQAAVSSYFSKLLPLLAPLQHSNGGPIIAFQ 144
Query: 116 AKMAVDFHTG-----VPWV--MCKQ---------DDAPGPVINA---------------- 143
+ + +PW+ + K D G ++
Sbjct: 145 VENEYGDYVDKDNEHLPWLADLMKSHGLFELFFISDGEGVILGGYKMPQNLLKTINFKYL 204
Query: 144 -----------CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFH 191
C+ ++ ++ + PNKP + TE W ++ WG G+ + + D+
Sbjct: 205 NVEKLTKSTPICDNLQALKSLQ----PNKPMLVTEFWAGWFDYWGHGRNLLNN--DVFEK 258
Query: 192 VALFIAKNGSYVNYYMYHGGTNFGRTAAAFMI-TGYYD--------QAPLDEYGLVREPK 242
I K G+ VN+YM+HGGTNFG A + GYY P+DE G R K
Sbjct: 259 TLKEILKRGASVNFYMFHGGTNFGFMNGAIELEKGYYTADVTSYDYDCPVDESG-NRTEK 317
Query: 243 WGHLKELHAAIKLCSRPLLTGTQNV 267
W IK C T ++NV
Sbjct: 318 W-------EIIKRCLDVQKTSSENV 335
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 56/119 (47%), Gaps = 20/119 (16%)
Query: 159 PNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRT 217
PNKP + TE W ++ WG G+ + + ++ I K G+ VN+YM+HGGTNFG
Sbjct: 556 PNKPMLVTEFWAGWFDYWGHGRNLLNN--EVFEKTLKEILKRGASVNFYMFHGGTNFGFM 613
Query: 218 AAAFMI-TGYYD--------QAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGTQNV 267
A + GYY P+DE G R KW I+ C T ++NV
Sbjct: 614 NGAIELEKGYYTADVTSYDYDCPVDESG-NRTEKW-------EIIRRCLNVQKTSSENV 664
>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
Length = 583
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/312 (27%), Positives = 132/312 (42%), Gaps = 56/312 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ I Y+ WNLHE ++G +DF+G D++ F GL V R GP
Sbjct: 39 WKHRLQSVVDCGLNTIDVYIPWNLHEKERGNFDFAGELDLVEFFTIAAEMGLKVLCRPGP 98
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAF----HEKGPPYVLWA 115
+I SEW +GGLP WL + RS+ Y+ + + + + P H G P + +
Sbjct: 99 YICSEWDWGGLPSWLLKDPKMHIRSNYCGYQAAVSSYFSKLLPLLAPLQHSNGGPIIAFQ 158
Query: 116 AKMAVDFHTG-----VPWV--MCKQD--------DAPGPVINACNGMRCGETFKGPN--- 157
+ + +PW+ + K G I N ++ T + +
Sbjct: 159 VENEYGDYVDKDNEHLPWLADLMKSHGLFELFFISDGGHTIRKANMLKVRSTAQLNSGSF 218
Query: 158 ------------SPNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAKNGSYVN 204
PNKP + TE W ++ WG G+ + + ++ I K G+ VN
Sbjct: 219 QLLAKAFSLKSLQPNKPMLVTEFWAGWFDYWGHGRNLLNN--EVFEKTLKEILKRGASVN 276
Query: 205 YYMYHGGTNFGRTAAAFMI-TGYYD--------QAPLDEYGLVREPKWGHLKELHAAIKL 255
+YM+HGGTNFG A + GYY P+DE G R KW I+
Sbjct: 277 FYMFHGGTNFGFMNGAIELEKGYYTADVTSYDYDCPVDESG-NRTEKW-------EIIRR 328
Query: 256 CSRPLLTGTQNV 267
C T ++NV
Sbjct: 329 CLNVQKTSSENV 340
>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
Length = 589
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 74/254 (29%), Positives = 114/254 (44%), Gaps = 42/254 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHE ++GQ+DF+G D++ F+K+ + GL V LR GP
Sbjct: 34 WEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTGGKDLVSFVKKAEEIGLMVILRPGP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF----HEKGPP----- 110
+I +EW GGLP WL + + R D++ + K+EN ++ + P KG P
Sbjct: 94 YICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVENYFKVLLPLIVPLQVTKGGPVIMVQ 153
Query: 111 -------------YVLWAAKMAVDFHTGVP-------W---VMCKQDDAPGPVINACNGM 147
Y+ KM D VP W +M ++ A G
Sbjct: 154 VENEYGSFSNDKLYLRALKKMIEDAGIDVPLFTSDGAWEQALMSGTLIEEEVLVTANFGS 213
Query: 148 RCGETFKGPNSPNK------PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
R E F S + P + E W ++ W +R A ++ + + +
Sbjct: 214 RGNENFDVLQSFMEKHDKKWPLMCMEFWCGWFNRWNEDIILRDADEVMTCMKELLQRGS- 272
Query: 202 YVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 273 -LNLYMFHGGTNFG 285
>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
25986]
gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
Length = 598
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 126/290 (43%), Gaps = 63/290 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEP+ G +DFSG D+ F+ E S GLY +R P
Sbjct: 34 WHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSGSIDLAAFLDEAASLGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWL------------------------HDVAGIVFRSDNK-----PYK 92
FI +EW +GG+P WL H + +V R +K +
Sbjct: 94 FICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQYYDHLMPILVSRQIDKGGNIIMMQ 153
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDA------PGPVIN---A 143
+ENEY + + + Y+ ++ V+ VP +C D G +I+
Sbjct: 154 VENEYGS-----YCEDKDYLRAIRRLMVERGVSVP--LCTSDGPWRGCLRAGTLIDDDVL 206
Query: 144 CN---GMRCGETFKGPNSPNK------PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
C G E F+ ++ +K P + E W ++ +G R +D+A V
Sbjct: 207 CTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWDGWFNRYGENVIRRDPEDLASCVRE 266
Query: 195 FIAKNGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYG 236
+ GS +N YM+HGGTNFG T +T Y APLDE G
Sbjct: 267 VLELGGS-LNLYMFHGGTNFGFMNGCSARHTHDLHQVTSYDYDAPLDEQG 315
>gi|163848976|ref|YP_001637020.1| beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
gi|163670265|gb|ABY36631.1| Beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
Length = 897
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 120/270 (44%), Gaps = 35/270 (12%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W L+ +A+ GL+ I T + WN HEPQ G++DFS D+ F+ GL +R GP
Sbjct: 36 WRPLLEQARWAGLNTIDTVIPWNRHEPQPGEFDFSEEADLGAFLDLCHELGLKAIVRPGP 95
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAF----HEKGPPYVL-- 113
+I +EW GGLP WL + RSD+ ++ + + T+ P + G P +L
Sbjct: 96 YICAEWENGGLPAWLTASGDMRLRSDDPAFRDAVLRWFDTLMPILVPRQYPHGGPIILCQ 155
Query: 114 -----WA-------------AKMAVDFHTGVPWVMCKQDDAPGPVI-NACNGMRCGETFK 154
WA A+ A++ VP C P N +G+
Sbjct: 156 IENEHWASGVYGADTHQQTLAQAALERGIVVPQYTCVGAMPGYPEFRNGWSGIAEKLVQT 215
Query: 155 GPNSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIAKNGSYVNYYMYHGGTN 213
P+ P I +E W+ ++ WGG R +A + + A + +++M+ GGTN
Sbjct: 216 RQLWPDNPLIVSELWSGWFDNWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTN 275
Query: 214 F----GRTAAA---FMITGYYDQAPLDEYG 236
F GRT M T Y AP+DEYG
Sbjct: 276 FGFWGGRTVGGDLIHMTTSYDYDAPVDEYG 305
>gi|289664883|ref|ZP_06486464.1| beta-galactosidase [Xanthomonas campestris pv. vasculorum NCPPB
702]
Length = 582
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 113/268 (42%), Gaps = 40/268 (14%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EPQ+GQ+DFSG ND+ F++E + GL V LR GP
Sbjct: 32 WKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGNNDVAAFVREAAALGLNVILRPGP 91
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+ +EW GG P WL I RS + + ++ + ++P + G P +
Sbjct: 92 YACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDALAKQVQPLLNHNGGPIIAVQ 151
Query: 113 -------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVIN-------ACNGMRCGET 152
A A+ G + D + N A GE
Sbjct: 152 VENEYGSYADDHAYMAENRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA 211
Query: 153 FKGPNS-----PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
+ ++P + E W ++ W GKP+ + +I + G N YM
Sbjct: 212 KSAFDKLIKFRSDQPRMVGEYWAGWFDHW-GKPHAATDARQQADEFEWILRQGHSANLYM 270
Query: 208 YHGGTNFGRTAAAFMITGYYDQAPLDEY 235
+ GGT+FG FM Y P D Y
Sbjct: 271 FIGGTSFG-----FMNGANYQNNPSDHY 293
>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
Length = 594
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 126/308 (40%), Gaps = 57/308 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLVNGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A+ F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTALFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
Query: 251 AAIKLCSR 258
S+
Sbjct: 328 EEYPALSQ 335
>gi|334348881|ref|XP_001378605.2| PREDICTED: beta-galactosidase-like [Monodelphis domestica]
Length = 658
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 94/336 (27%), Positives = 138/336 (41%), Gaps = 59/336 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HEP G Y FS D+ F++ GL V LR GP
Sbjct: 81 WKDRLLKMKMAGLNAIQTYVPWNFHEPLPGVYRFSDDYDLEYFLQLAHEIGLLVILRPGP 140
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL IV RS + Y E E ++P ++ G P +
Sbjct: 141 YICAEWDMGGLPAWLLTKKSIVLRSSDPDYLAETEKWLGVLLPKMKPYLYQNGGPIITVQ 200
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVINACNGMRCG------- 150
+ + D+ H G V+ D A + + ++CG
Sbjct: 201 VENEYGSYFTCDYNYLRFLQQLFHKHLGEEVVLFTTDGA------SEDYLKCGTLQGLYA 254
Query: 151 -----------ETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
E F+ P P + +E +T + WG + I + ++
Sbjct: 255 TVDFGTNHNITEAFQSQRKTEPKGPLVNSEFYTGWLDHWGEAHETVDTKAIISSLNDMLS 314
Query: 198 KNGSYVNYYMYHGGTNFG-----RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
+ G+ VN YM+ GGTNFG A T Y APL E G + E K+ L+EL
Sbjct: 315 Q-GANVNMYMFIGGTNFGFWNGANIPYAAQPTSYDYDAPLSEAGDLTE-KYFALRELIGK 372
Query: 253 IKLCSRPLLTGTQNVISLGQ--LQEAFVFEETSGVC 286
+ L+ T + G+ +++ EE+ V
Sbjct: 373 FEKLPEGLIPPTTPKFAYGKVAMKKVNTLEESLDVL 408
>gi|313246754|emb|CBY35624.1| unnamed protein product [Oikopleura dioica]
Length = 599
Score = 103 bits (256), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 82/313 (26%), Positives = 138/313 (44%), Gaps = 63/313 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ GL+ + TYV WN HE +G+++F G ++ ++IK + GL V LR+GP
Sbjct: 32 WDQRMSNFPAAGLNTLSTYVPWNFHETYEGEFNFDGFQNLRKYIKTAEKHGLNVLLRVGP 91
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRS-------------------------DNKPYKIENE 96
+I +EW +GGLP WL G+ RS P ++ENE
Sbjct: 92 YICAEWEWGGLPAWLLTKKGMKIRSTQDEFLKATKKWLKRLIKEVEDLQYSQAPIQVENE 151
Query: 97 YQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPG-----PVINACNGMRCGE 151
Y +E+ Y+ ++ +D GV ++ DD+ G P+ +A + E
Sbjct: 152 Y-----GVYEQDSSYLPSLKQILID--AGVTELLYTCDDSNGLALGTPLKDALLTINLQE 204
Query: 152 TFKGPNS------PNKPSIWTEDWTSFYQVWGGKPYI-------RSAQDIAFHVALFIAK 198
S PNKP++ E WT ++ WG K + + A D + +
Sbjct: 205 NPVDTISSLRIHQPNKPAMVAEYWTGWFDWWGEKHHTLGFPWKNKFALDKFVGTTKDLIE 264
Query: 199 NGSYVNYYMYHGGTNFGRTAAAFM-----------ITGYYDQAPLDEYGLVREPKWGHLK 247
+ N +M+HGGTNFG + IT Y A + E G ++ PK+ ++
Sbjct: 265 QEASFNLFMFHGGTNFGFWNGGIIQGGKDNNYIPDITSYDYDALVGENGDLK-PKFMRMQ 323
Query: 248 E-LHAAIKLCSRP 259
+ + + +K+ + P
Sbjct: 324 QVMRSTLKISALP 336
>gi|313245457|emb|CBY40184.1| unnamed protein product [Oikopleura dioica]
Length = 620
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 116/277 (41%), Gaps = 66/277 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +AK K GL+ + TYV WNLHEP+ G++ FSG DI+ FI ++ L+V LR GP
Sbjct: 41 WYDRLAKLKSAGLNGVTTYVPWNLHEPEPGEFSFSGELDIVHFINIARTLDLFVILRPGP 100
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW +GGLP WL + + R++ Y +
Sbjct: 101 YICSEWEWGGLPAWLLRDSFMKVRTNYSGYITAVKRFFGQLIPLIKYQQSKYGGPIVAVQ 160
Query: 93 IENEYQT---------------------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMC 131
+ENEY +EP F G +W + + G+ V
Sbjct: 161 VENEYGMYAGQDGAHLNTLAELLKNEGIVEPLFTSDGSS--VWDNEKNTIYEDGLKSVNF 218
Query: 132 KQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFH 191
K + P + + G + P +P E W ++ WG + D +
Sbjct: 219 KSN--PEKHLKSLRG----------HFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKN 266
Query: 192 VALFIAKNGSYVNYYMYHGGTNFGRTAAAFMIT-GYY 227
+ + + S +N+YM+HGGTNFG T I GYY
Sbjct: 267 LDVILDHKAS-LNFYMFHGGTNFGFTNGGLTIARGYY 302
>gi|222526932|ref|YP_002571403.1| beta-galactosidase [Chloroflexus sp. Y-400-fl]
gi|222450811|gb|ACM55077.1| Beta-galactosidase [Chloroflexus sp. Y-400-fl]
Length = 917
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 120/270 (44%), Gaps = 35/270 (12%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W L+ +A+ GL+ I T + WN HEPQ G++DFS D+ F+ GL +R GP
Sbjct: 56 WRPLLEQARWAGLNTIDTVIPWNRHEPQPGEFDFSEEADLGAFLDLCHELGLKAIVRPGP 115
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAF----HEKGPPYVL-- 113
+I +EW GGLP WL + RSD+ ++ + + T+ P + G P +L
Sbjct: 116 YICAEWENGGLPAWLTASGDMRLRSDDPAFRDAVLRWFDTLMPILVPRQYPHGGPIILCQ 175
Query: 114 -----WA-------------AKMAVDFHTGVPWVMCKQDDAPGPVI-NACNGMRCGETFK 154
WA A+ A++ VP C P N +G+
Sbjct: 176 IENEHWASGVYGADTHQQTLAQAALERGIVVPQYTCVGAMPGYPEFRNGWSGIAEKLVQT 235
Query: 155 GPNSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIAKNGSYVNYYMYHGGTN 213
P+ P I +E W+ ++ WGG R +A + + A + +++M+ GGTN
Sbjct: 236 RQLWPDNPLIVSELWSGWFDNWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTN 295
Query: 214 F----GRTAAA---FMITGYYDQAPLDEYG 236
F GRT M T Y AP+DEYG
Sbjct: 296 FGFWGGRTVGGDLIHMTTSYDYDAPVDEYG 325
>gi|423259078|ref|ZP_17240001.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|423263951|ref|ZP_17242954.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
gi|387776658|gb|EIK38758.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
CL07T00C01]
gi|392706217|gb|EIY99340.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
CL07T12C05]
Length = 773
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 129/291 (44%), Gaps = 47/291 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN HE Q+G++DFSG ++ +F K Q G+Y+ LR GP
Sbjct: 57 WEHRILMCKALGMNTICLYMFWNYHEQQEGKFDFSGEKNVAKFCKLAQKHGMYIILRPGP 116
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
++ +EW GGLP WL + RS N PY +E ++ + P + +
Sbjct: 117 YVCAEWEMGGLPWWLLKEKDMKVRSLN-PYFMERTEIFMKELGKQLAPLQLANGGNIIMV 175
Query: 119 ---------AVD--FHTGVPWVMCKQ---------------------DDAPGPVINACNG 146
VD + T + ++C+ DD +N G
Sbjct: 176 QVENEFGGYGVDKPYMTAIRDIVCRAGFDKSVLFQCDWDSTFELNALDDLLW-TLNFGTG 234
Query: 147 MRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
+ FK ++ P+ P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 235 ANIDKEFKKLSTVRPDTPLMCSEFWSGWFDHWGRKHETRPAEKMVEGIKDMLDRNISF-S 293
Query: 205 YYMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G PK+ L+EL
Sbjct: 294 LYMTHGGTTFGHWGGANSPTYSAMCSSYDYDAPISEAGWTT-PKYYLLQEL 343
>gi|257876100|ref|ZP_05655753.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
gi|257810266|gb|EEV39086.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
Length = 591
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 113/256 (44%), Gaps = 47/256 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WNLHEP++G YDF G DI F+K+ Q+ GL V LR
Sbjct: 34 WTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMKDICAFVKQAQALGLMVILRPSV 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF-----HEKGP----- 109
+I +EW +GGLP WL + + RS + + K+ N +Q + P GP
Sbjct: 94 YICAEWEFGGLPAWLLN-EPMRLRSTDPRFMAKVRNYFQVLLPKLVPLQITHGGPVIMMQ 152
Query: 110 ------------PYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACN------------ 145
Y+ ++ ++ VP + D A V++A
Sbjct: 153 VENEYGSYGMEKAYLRQTKELMEEYGIDVP--LFTSDGAWEEVLDAGTLIEDDVFVTGNF 210
Query: 146 GMRCGET------FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
G R E F + N P + E W ++ WG R+ QD+A V +A
Sbjct: 211 GSRSKENAAVMKEFMAKHGKNWPIMCMEYWDGWFNRWGEPIIKRAGQDLANEVKEMLAVG 270
Query: 200 GSYVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 271 S--LNLYMFHGGTNFG 284
>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
Length = 624
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 133/303 (43%), Gaps = 61/303 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWNLHE + G++DFSG ++ +I+ +G+ V LR GP
Sbjct: 55 WRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNLAEYIRIAGEEGMMVILRPGP 114
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVL-- 113
++ +EW +GG P WL ++ G+ R DN + I+ YQ + P KG P ++
Sbjct: 115 YVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDRLYQEVGPLQCTKGGPIIMVQ 174
Query: 114 ----------------------WAAKMA---VDFHTGVP-------WVM---CKQDDAPG 138
+ AK+ D VP W+ C P
Sbjct: 175 CENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPT 234
Query: 139 P--VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALF 195
+ N + + G P + + W S + G+P+ + SA +IA +
Sbjct: 235 ANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLSHW----GEPFPQVSASEIARQTEAY 290
Query: 196 IAKNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHL 246
+ N S+ N+YM HGGTNFG T+ A +T Y AP+ E G + PK+ +
Sbjct: 291 LQNNVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWI-TPKYDSI 348
Query: 247 KEL 249
+ +
Sbjct: 349 RSV 351
>gi|354466872|ref|XP_003495895.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Cricetulus
griseus]
Length = 761
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 133/306 (43%), Gaps = 55/306 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TY+ WNLHE +G +DFS D+ ++ + GL+V LR GP
Sbjct: 210 WKDRLLKLQACGFNTVTTYIPWNLHEQNRGTFDFSEILDLEAYVSLAATLGLWVILRPGP 269
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E GGLP WL + R+ + + +
Sbjct: 270 YICAEVDLGGLPSWLLGYPELQLRTTQQEFLDAVDKYFDHLIPRILPLQYLRGGPVIAVQ 329
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
IENEY + I+ A ++G +L + D H G+ K IN
Sbjct: 330 IENEYGSFSKDGDYMEYIKEALQKRGIVELL----LTSDNHKGIQTGSVK---GALTTIN 382
Query: 143 ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+ + +KP + E WT ++ WG + ++SA++I + V+ FI K G
Sbjct: 383 MASFEKDSFIKLLQMQNDKPIMVMEYWTGWFDTWGREHNVKSAEEIRYTVSRFI-KYGIS 441
Query: 203 VNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
N YM+HGGTNFG AF ++T Y A L E G E K+ L++L A+ +
Sbjct: 442 FNMYMFHGGTNFGFINGAFHYDKHSSVVTSYDYDAVLTEAGDYTE-KYFKLRKLFASASV 500
Query: 256 CSRPLL 261
P L
Sbjct: 501 GFLPRL 506
>gi|300122119|emb|CBK22693.2| unnamed protein product [Blastocystis hominis]
Length = 599
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 135/299 (45%), Gaps = 48/299 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + I K GGL+ +QTYV WN+HEP+KG+++F G ++ RF+ + +YV LR GP
Sbjct: 54 WENTIKKMANGGLNAVQTYVAWNIHEPRKGEFNFDGIANLDRFLSIAEKYNMYVILRPGP 113
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTI----EPAFHEKGPPYVLWA 115
+I +EW +GGLP WL GI R+ + Y+ +E+ ++ + P ++ G +
Sbjct: 114 YICAEWDFGGLPYWLIREEGIKIRTSDPVYQKHVEDYFRVLLNIARPHLYKNGGSIISVQ 173
Query: 116 AKMAVDFHTG-----VPWVMCKQDDAPGPVI-------NACNGMRCGET----------- 152
+ F+ + W++ + G + + + + CG
Sbjct: 174 IENEYGFYPACDKDHLRWLLNLNKEILGDDVVYFTVDTPSDDALSCGTLPEEIYVTVDFG 233
Query: 153 FKGPN---------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
+ P+ + P + TE + + W K + A+ IA + +A N S V
Sbjct: 234 VRDPSGAWDMQMKYAKQGPKVNTEFYPGWLDHWREKHHTVDAKSIADCLDQMMAVNAS-V 292
Query: 204 NYYMYHGGTNFGRTAAAFMITGYYD--------QAPLDEYGLVREPKWGHLKELHAAIK 254
N+YMY GGTN A A + YY APL E + E KW +++ A +
Sbjct: 293 NFYMYFGGTNHHFFAGANGDSNYYQSDPTSYDYDAPLSEAADMTE-KWAIIRDTIAKYR 350
>gi|313231869|emb|CBY08981.1| unnamed protein product [Oikopleura dioica]
Length = 664
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 116/277 (41%), Gaps = 66/277 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +AK K GL+ + TYV WNLHEP+ G++ FSG DI+ FI ++ L+V LR GP
Sbjct: 85 WYDRLAKLKSAGLNGVTTYVPWNLHEPEPGEFSFSGELDIVHFINIARTLDLFVILRPGP 144
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SEW +GGLP WL + + R++ Y +
Sbjct: 145 YICSEWEWGGLPPWLLRDSFMKVRTNYSGYITAVKRFFGQLIPLIKYQQSKYGGPIVAVQ 204
Query: 93 IENEYQT---------------------IEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMC 131
+ENEY +EP F G +W + + G+ V
Sbjct: 205 VENEYGMYAGQDGAHLNTLAELLKNEGIVEPLFTSDGSS--VWDNEKNTIYEDGLKSVNF 262
Query: 132 KQDDAPGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFH 191
K + P + + G + P +P E W ++ WG + D +
Sbjct: 263 KSN--PEKHLKSLRG----------HFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKN 310
Query: 192 VALFIAKNGSYVNYYMYHGGTNFGRTAAAFMIT-GYY 227
+ + + S +N+YM+HGGTNFG T I GYY
Sbjct: 311 LDVILDHKAS-LNFYMFHGGTNFGFTNGGLTIARGYY 346
>gi|296082584|emb|CBI21589.3| unnamed protein product [Vitis vinifera]
Length = 83
Score = 102 bits (254), Expect = 7e-19, Method: Composition-based stats.
Identities = 42/79 (53%), Positives = 57/79 (72%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MW L+ AKEGG+ V +TYVFWN HE G Y F GR D+++F+K +Q G+Y+ L IG
Sbjct: 1 MWSGLVRIAKEGGIVVFETYVFWNGHELSPGNYYFGGRYDLLKFVKIVQQAGMYLILCIG 60
Query: 61 PFIESEWTYGGLPIWLHDV 79
PF+ +EW +GG+P+WLH V
Sbjct: 61 PFVAAEWNFGGVPVWLHYV 79
>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 591
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 121/283 (42%), Gaps = 50/283 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WN+HEP++G ++F G D++++++ Q GL V LR P
Sbjct: 34 WDHSLYNLKALGCNTVETYVPWNIHEPKEGVFNFEGIADLVKYVQLAQKYGLMVILRPTP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFH----EKGPP----- 110
+I +EW +GGLP WL I RS+ + K+EN Y+ + P E G P
Sbjct: 94 YICAEWEFGGLPAWLLKYKDIRVRSNTNLFLDKVENFYKVLLPMVTPLQVENGGPIIMMQ 153
Query: 111 -------------YVLWAAKMAVDFHTGVPWVMC----KQDDAPGPVIN----------- 142
YV K+ D VP ++ G +I+
Sbjct: 154 VENEYGSFGNDKEYVRSIKKIMRDLDVTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGS 213
Query: 143 -ACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
+ + E+F N P + E W ++ WG + R ++A V + + +
Sbjct: 214 RSNENLNELESFIKENKKEWPLMCMEFWDGWFNRWGMEIIRRDGSELAEEVKELLKR--A 271
Query: 202 YVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYG 236
+N+YM+ GGTNFG IT Y A L E+G
Sbjct: 272 SINFYMFQGGTNFGFMNGCSSRENVDLPQITSYDYDALLTEWG 314
>gi|225872977|ref|YP_002754436.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
gi|225792973|gb|ACO33063.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
Length = 619
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 121/291 (41%), Gaps = 68/291 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ I YVFWN+ EP +GQ+DFSG+ D+ RFI+ Q GLYV LR GP
Sbjct: 69 WGDRLRKARAMGLNAISVYVFWNVQEPHRGQWDFSGQYDVARFIRMAQQAGLYVILRPGP 128
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+ +EW+ GG P WL + RS + Y +
Sbjct: 129 YACAEWSMGGYPAWLWKDGRVKIRSSDPAYLHAAQDYMDHLGQQLKPLLWTHGGPIIAVQ 188
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTG---------VPWVMCKQDDAPGPVI 141
+ENEY + A+ E+ V A V +T +P + D PG V
Sbjct: 189 VENEYGSFGKSRAYLEEVRRMVAGAGLGGVVLYTADGPGLWSGSLPELPEAIDVGPGGVE 248
Query: 142 NACNGMRCGETFKGPNSPNKPSIWT-EDWTSFYQVWG-----GKPYIRSAQDIAFHVALF 195
N + P+ ++ E + ++ WG G P +D+ +
Sbjct: 249 NGVKQLLA-------YRPHSKLVYVAEYYPGWFDQWGQPHHHGAPLKEQLKDLR-----W 296
Query: 196 IAKNGSYVNYYMYHGGTNFG----------RTAAAFMITGYYDQAPLDEYG 236
I G VN YM+HGGT++G T A T Y APL+E G
Sbjct: 297 ILSRGYSVNLYMFHGGTDWGFMNGANDNAADTDYAPQTTSYDYAAPLNEAG 347
>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
Length = 604
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFDMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|109052835|ref|XP_001097877.1| PREDICTED: beta-galactosidase-like [Macaca mulatta]
Length = 373
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 119/283 (42%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN HE GQY FS +D+ F++ GL V LR GP
Sbjct: 65 WKDRLLKMKMAGLNTIQTYVPWNFHESWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 124
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 125 YICAEWEMGGLPAWLLEKEAILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQ 184
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + A DF H G V+ D A + A G+ F G
Sbjct: 185 VENEYGSYFACDFDYLRFLQKRFHHHLGDDVVLFTTDGAHETFLQCGALQGLYTTVDF-G 243
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P S P P I +E +T + W G+P+ ++ I G+
Sbjct: 244 PGSNITDAFQIQRKCEPKGPLINSEFYTGWLDHW-GQPHSTIKTEVVASSLYDILARGAS 302
Query: 203 VNYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF + A T Y APL E G + E
Sbjct: 303 VNLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 345
>gi|334347175|ref|XP_003341899.1| PREDICTED: beta-galactosidase-1-like protein [Monodelphis
domestica]
Length = 646
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 81/280 (28%), Positives = 122/280 (43%), Gaps = 46/280 (16%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + K + GL+ +Q YV WN HEPQ G Y+F G D++ F+K ++ L V LR G
Sbjct: 79 LWSDRLHKMRMSGLNAVQVYVPWNYHEPQPGVYNFQGNRDLVAFLKAAANEDLLVILRPG 138
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPA-----FHEKGPPYVL 113
P+I +EW GGLP WL IV R+ + + +++ + + P +H G +
Sbjct: 139 PYICAEWEMGGLPAWLLQNPEIVLRTSDPDFLAAVDSWFHVLMPMVQPWLYHNGGNIISV 198
Query: 114 -----WAAKMAVDFH------------TGVPWVMCKQDDAPGPVINACNGMRCGETFKGP 156
+ + A DF G + D G G+ F GP
Sbjct: 199 QVENEYGSYFACDFRYMRHLAGLFRALLGDQIFLFTTDGPRGFSCGTLQGLYSTVDF-GP 257
Query: 157 N-------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
+ PN P + +E +T + WGG + +A + + + G+ V
Sbjct: 258 DDNMTEIFAMQQKYEPNGPLVNSEYYTGWLDYWGGNHSKWDTKTLANGLQNML-ELGANV 316
Query: 204 NYYMYHGGTNFGRTAAAFM------ITGYYD-QAPLDEYG 236
N YM+HGGTNFG + A +T YD APL E G
Sbjct: 317 NMYMFHGGTNFGYWSGADFKKIYQPVTTSYDYDAPLSEAG 356
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 43/100 (43%), Gaps = 28/100 (28%)
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVT 580
+Y TF+ P L L KG+ W+NG ++GRYW T +G P Q+ Y
Sbjct: 556 AFYSATFQLPGPPWDTFLYLPGWTKGQVWINGFNLGRYW----TRRG-PQQSLY------ 604
Query: 581 SIHFCAIIKATNTYHVPRAFLKPTG--NLLVLLEEENGNP 618
VP L PTG N++ LLE E+ P
Sbjct: 605 ---------------VPGPLLLPTGTPNIITLLELEHAPP 629
>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
15897]
gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
Length = 577
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 133/315 (42%), Gaps = 62/315 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K+ G + ++TY+ WNLHEP KG++DF G+ D+ F++ + GLYV +R P
Sbjct: 34 WEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDFDGQKDVCAFLELAKKLGLYVIIRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAF--------------- 104
+I SEW GGLP WL + I R+++ Y +E Y + P
Sbjct: 94 YICSEWELGGLPAWLLKDSDIRLRTNDSVYMKHLEEYYAVLLPMIAKYQINREGTIILAQ 153
Query: 105 -------HEKGPPYVLWAAKMAVDFHTGVP-------W-------VMCKQDDAPGPVI-- 141
+ + Y+ KM ++ VP W + ++D P
Sbjct: 154 LENEYGSYNQDKDYLKALLKMMREYGIEVPIFTADGTWEEALEAGSLFEEDVFPTGNFGS 213
Query: 142 NACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
NA + + F + P + E W ++ W + R +++ A + GS
Sbjct: 214 NAKENIAVLKEFMKKHQIVAPIMCMEFWDGWFNRWNMEIVKRDPEELV-QSAKEMIDLGS 272
Query: 202 YVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
+N+YM+HGGTNFG + IT Y A L EYG E HL
Sbjct: 273 -INFYMFHGGTNFGWMNGCSARKEHDLPQITSYDYDAILTEYGAKTEKY--HL------- 322
Query: 254 KLCSRPLLTGTQNVI 268
R ++TG Q+++
Sbjct: 323 ---LRKMITGKQDIL 334
>gi|344291571|ref|XP_003417508.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Loxodonta africana]
Length = 770
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 129/296 (43%), Gaps = 54/296 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 224 WRDRLLKLKACGFNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIWMAAELGLWVILRPGP 283
Query: 62 FIESEWTYGGLPIWL---------------------HDVAGIVFRSDNK-----PYKIEN 95
+I SE GGLP WL H + +V ++ ++EN
Sbjct: 284 YICSEIDLGGLPSWLLQDPDLNWRHTXLVTQXSLFDHLIPRVVPLQYHRGGPIIAVQVEN 343
Query: 96 EYQT----------IEPAFHEKGPPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPVINAC 144
EY + ++ A ++G +L + D G + V+ +N
Sbjct: 344 EYGSYNKDKDYMPYVQQALLQRGIVELLLTSDNERDVLKGYIKGVLA--------TVNMK 395
Query: 145 NGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
R + KP + E W ++ WG + ++R A+++ V FI S+ N
Sbjct: 396 TLSRDAFSLLNKAQSEKPIMIMEFWVGWFDTWGNQHFLRDAKEVEHTVLEFIKAEISF-N 454
Query: 205 YYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
YM+HGGTNFG A ++T Y A L E G E K+ L++L ++
Sbjct: 455 AYMFHGGTNFGFMNGATYLGKHRGVVTSYDYDAVLTEAGDYTE-KYFKLRKLFGSV 509
>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 640
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 132/303 (43%), Gaps = 63/303 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K K GL+ I TYV W+LHEP G Y+F G D+ FIK IQ +G+Y+ LR GP
Sbjct: 62 WKDRIQKIKAAGLNAITTYVEWSLHEPFPGTYNFEGMADLEYFIKLIQDEGMYLLLRPGP 121
Query: 62 FIESEWTYGGLPIWLHDVAGI-VFRSDNKPYK---------------------------- 92
+I +E +GG P WL +V R+++ YK
Sbjct: 122 YICAERDFGGFPYWLLNVTPKGSLRTNDSSYKKYVSQWFSVLMKKMQPHLYGNGGNIIMV 181
Query: 93 -IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWV----MCKQDD---APGPVINA- 143
+ENEY +++ Y LW + + + +C+Q D P P + A
Sbjct: 182 QVENEYG----SYYACDSDYKLWLRDLLKGYVEDKALLYTIDICRQRDFDCGPIPEVYAT 237
Query: 144 ------CNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
N C + K PS+ +E + + W ++ D+ H+ ++
Sbjct: 238 VDFGISVNAATCFDFLKNYQK-GGPSVNSEFYPGWLAHWQEPHPKVNSDDVVNHMKSMLS 296
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAF------------MITGYYDQAPLDEYGLVREPKWGH 245
N S+ ++YM+HGGTNFG T+ A +T Y AP+ E G + E K+
Sbjct: 297 LNASF-SFYMFHGGTNFGFTSGANTNESDANIGYLPQLTSYDYDAPITEAGDLTE-KYFK 354
Query: 246 LKE 248
+K+
Sbjct: 355 IKQ 357
>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 604
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 83/302 (27%), Positives = 133/302 (44%), Gaps = 67/302 (22%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WNLHEP++G + F G D+ RFI+ GL+V +R P
Sbjct: 35 WEDRLLKLKACGFNTVETYIPWNLHEPREGSFRFDGFADVARFIETAGRLGLHVIVRPSP 94
Query: 62 FIESEWTYGGLPIW-LHDVAGIVFRSDNKPYKIENEYQT----IEPAFHEKGPPYVLWAA 116
+I +EW +GGLP W L G+ + K++ Y + P +G P + A
Sbjct: 95 YICAEWEFGGLPAWLLKSSMGLRCMDNEYLEKVDRYYDELIPRLLPLLDSRGGPII--AV 152
Query: 117 KMAVDFHT------------------GVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS 158
++ ++ + GV ++ D GP + M G T +G ++
Sbjct: 153 QVENEYGSYGNDTAYLAYLRDGLIRRGVDCLLFTSD---GP----TDEMLLGGTVEGLHA 205
Query: 159 P-------------------NKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
++P + E W ++ W ++R A D+A +V + +
Sbjct: 206 TVNFGSRVAESLAKYREYRQDEPLMVMEYWLGWFDHWRKPHHVREAGDVA-NVLDEMLEQ 264
Query: 200 GSYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
G+ VN YM+HGGTNFG + A IT Y APL E WG + E + A
Sbjct: 265 GASVNLYMFHGGTNFGFYSGANYGEHYEPTITSYDYDAPLTE--------WGDITEKYKA 316
Query: 253 IK 254
I+
Sbjct: 317 IR 318
>gi|420261585|ref|ZP_14764229.1| glycosyl hydrolase [Enterococcus sp. C1]
gi|394771519|gb|EJF51280.1| glycosyl hydrolase [Enterococcus sp. C1]
Length = 591
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 112/256 (43%), Gaps = 47/256 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WNLHEP++G YDF G DI F+K+ Q+ GL V LR
Sbjct: 34 WTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMKDICAFVKQAQTIGLMVILRPSV 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF-----HEKGP----- 109
+I +EW +GGLP WL + + RS + + K+ N +Q + P GP
Sbjct: 94 YICAEWEFGGLPAWLLN-EPMRLRSTDPRFMAKVRNYFQVLLPKLVPLQITHGGPVIMMQ 152
Query: 110 ------------PYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACN------------ 145
Y+ ++ ++ VP + D A V++A
Sbjct: 153 VENEYGSYGMEKAYLRQTKELMEEYGIDVP--LFTSDGAWEEVLDAGTLIEDDIFVTGNF 210
Query: 146 GMRCGET------FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
G R E F + N P + E W ++ WG R QD+A V +A
Sbjct: 211 GSRSKENAAVMKEFMAKHGKNWPIMCMEYWDGWFNRWGEPIIKRDGQDLANEVKEMLAVG 270
Query: 200 GSYVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 271 S--LNLYMFHGGTNFG 284
>gi|325567414|ref|ZP_08144081.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325158847|gb|EGC70993.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 591
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 112/256 (43%), Gaps = 47/256 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WNLHEP++G YDF G DI F+K+ Q+ GL V LR
Sbjct: 34 WTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMKDICAFVKQAQTLGLMVILRPSV 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAF-----HEKGP----- 109
+I +EW +GGLP WL + + RS + + K+ N +Q + P GP
Sbjct: 94 YICAEWEFGGLPAWLLN-EPMRLRSTDPRFMAKVRNYFQVLLPKLVPLQITHGGPVIMMQ 152
Query: 110 ------------PYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACN------------ 145
Y+ ++ ++ VP + D A V++A
Sbjct: 153 VENEYGSYGMEKAYLRQTKELMEEYGIDVP--LFTSDGAWEEVLDAGTLIEDDIFVTGNF 210
Query: 146 GMRCGET------FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
G R E F + N P + E W ++ WG R QD+A V +A
Sbjct: 211 GSRSKENAAVMKEFMAKHGKNWPIMCMEYWDGWFNRWGEPIIKRDGQDLANEVKEMLAVG 270
Query: 200 GSYVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 271 S--LNLYMFHGGTNFG 284
>gi|296216696|ref|XP_002807336.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Callithrix jacchus]
Length = 652
Score = 102 bits (254), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 133/310 (42%), Gaps = 58/310 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TYV WNLHEP++G++DFSG D+ F+ GL+V LR GP
Sbjct: 103 WRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSGNLDLEAFVLMASEIGLWVILRPGP 162
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I SE GGLP WL ++ R+ NK + +
Sbjct: 163 YICSEIDLGGLPSWLLQDPQLLLRTTNKGFIEAVEKYFDHLIPRVIPLQYRQGGPVIAVQ 222
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + +K PY+ A G+ ++ D + G+
Sbjct: 223 VENEYGSFNKD--KKYMPYLHKAM-----LRRGIVELLLTSDGEKNVLSGHTKGVLATIN 275
Query: 153 FKGPN----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+ + +KP + E W ++ W K ++ A++I V+ FI S+
Sbjct: 276 LQKLHRNTFSQLHKVQRDKPLLNMEYWVGWFDRWXDKHHVTDAKEIEHTVSEFIKYEISF 335
Query: 203 VNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLKEL---HAA 252
N YM+HGGTNFG A ++T Y A L E G E K+ L++L +A
Sbjct: 336 -NVYMFHGGTNFGFLNGATYFGKHAGVVTSYDYDAVLTEAGDYTE-KYFKLQKLFGSFSA 393
Query: 253 IKLCSRPLLT 262
I L P LT
Sbjct: 394 IPLPRVPKLT 403
>gi|300795929|ref|NP_001178947.1| beta-galactosidase-1-like protein 2 [Rattus norvegicus]
Length = 652
Score = 102 bits (254), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 130/294 (44%), Gaps = 47/294 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G++DFSG D+ FI GL+V LR GP
Sbjct: 94 WRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIWLAAKIGLWVILRPGP 153
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYVLWA 115
+I SE GGLP WL + R+ + ++ + P ++ G P + A
Sbjct: 154 YICSEIDLGGLPSWLLQDPDMKLRTTYPGFTKAVDLYFDHLMSRVVPLQYKHGGPII--A 211
Query: 116 AKMAVDF------HTGVPWV------------MCKQDDAPGPVINACNG------MRCGE 151
++ ++ H +P++ + D+ G +G ++ +
Sbjct: 212 VQVENEYGSYNGDHAYMPYIKKALEDRGIIEMLLTSDNKDGLEKGVVDGVLATINLQSQQ 271
Query: 152 TFKGPNS------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNY 205
NS +P + E WT ++ WGG I + ++ V+ I K+GS +N
Sbjct: 272 ELVALNSILLSIQGIQPKMVMEYWTGWFDSWGGSHNILDSSEVLQTVSAII-KDGSSINL 330
Query: 206 YMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
YM+HGGTNFG A +T Y A L E G K+ L+EL
Sbjct: 331 YMFHGGTNFGFINGAMHFGDYKADVTSYDYDAILTEAG-DYTAKYTKLRELFGT 383
>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM
17393]
gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
Length = 1106
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 126/295 (42%), Gaps = 51/295 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN HEPQ G YDF+ +ND+ F + Q +YV LR GP
Sbjct: 381 WDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQNDLAEFCRLCQQNDMYVILRPGP 440
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEK--------GPPYVL 113
++ +EW GGLP WL + R ++ PY IE E A ++ G P ++
Sbjct: 441 YVCAEWEMGGLPWWLLKKKDVRLR-ESDPYFIE-RVALFEEAVAKQVKDLTIANGGPIIM 498
Query: 114 WAAK-------------------MAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETF- 153
+ + +F + C D A +N + + F
Sbjct: 499 VQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALFQC--DWASNFTLNGLDDLIWTMNFG 556
Query: 154 KGPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G N PN P + +E W+ ++ WG R A D+ + +++ S+
Sbjct: 557 TGANVDQQFAKLKQLRPNSPLMCSEFWSGWFDKWGANHETRPAADMIKGIDDMLSRGISF 616
Query: 203 VNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+ YM HGGTN+G A A +T Y AP+ E G PK+ L+E A
Sbjct: 617 -SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTT-PKYWALREAMA 669
>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1106
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 126/295 (42%), Gaps = 51/295 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ + YVFWN HEPQ G YDF+ +ND+ F + Q +YV LR GP
Sbjct: 381 WDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFTEQNDLAEFCRLCQQNDMYVILRPGP 440
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEK--------GPPYVL 113
++ +EW GGLP WL + R ++ PY IE E A ++ G P ++
Sbjct: 441 YVCAEWEMGGLPWWLLKKKDVRLR-ESDPYFIE-RVALFEEAVAKQVKNLTIANGGPIIM 498
Query: 114 WAAK-------------------MAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETF- 153
+ + +F + C D A +N + + F
Sbjct: 499 VQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALFQC--DWASNFTLNGLDDLIWTMNFG 556
Query: 154 KGPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G N PN P + +E W+ ++ WG R A D+ + +++ S+
Sbjct: 557 TGANVDQQFAKLKQLRPNSPLMCSEFWSGWFDKWGANHETRPAADMIKGIDDMLSRGISF 616
Query: 203 VNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+ YM HGGTN+G A A +T Y AP+ E G PK+ L+E A
Sbjct: 617 -SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTT-PKYWALREAMA 669
>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
Length = 604
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 98/336 (29%), Positives = 135/336 (40%), Gaps = 61/336 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLVNGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
Query: 251 AAIKLCSR--PLLTGT--QNVISLGQLQEAFVFEET 282
S+ PL+ + Q I L F ET
Sbjct: 338 EEYPALSQAEPLVKDSFAQTAIPLTNKVSLFATLET 373
>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
Length = 604
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 98/336 (29%), Positives = 135/336 (40%), Gaps = 61/336 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
Query: 251 AAIKLCSR--PLLTGT--QNVISLGQLQEAFVFEET 282
S+ PL+ + Q I L F ET
Sbjct: 338 EEYPALSQAEPLVKDSFAQTAIPLTNKVSLFATLET 373
>gi|156380756|ref|XP_001631933.1| predicted protein [Nematostella vectensis]
gi|156218982|gb|EDO39870.1| predicted protein [Nematostella vectensis]
Length = 652
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 135/310 (43%), Gaps = 64/310 (20%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
W + K K G++ IQTYV WNLHEP G+Y+F G D++ F++ S L +R G
Sbjct: 57 FWKDRLLKMKAAGMNAIQTYVPWNLHEPTPGKYNFDGGADLLSFLELAHSLDLVAIVRAG 116
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDN-------------------KPY---------- 91
P+I +EW +GGLP WL + I RS K Y
Sbjct: 117 PYICAEWDFGGLPAWLLKNSSITLRSSKDQAYMSAVDSWMGVLLPKLKAYLYEHGGPVIM 176
Query: 92 -KIENEYQTIEPAFHEK------------GPPYVLWAAKMAVDFHT--GVPWVMCKQDDA 136
++ENEY HE G +L+ + ++ G + D
Sbjct: 177 VQVENEYGNYYTCDHEYMNHLEITFRQHLGSNVILFTTDPPIPYNLKCGTLLSLFTTIDF 236
Query: 137 PGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
GP I+ F+ P P + +E +T + WG + ++++ ++ ++ +
Sbjct: 237 -GPGIDPAAAFNIQRQFQ----PKGPFVNSEYYTGWLDHWGEQHQTKTSESVSQYLDKIL 291
Query: 197 AKNGSYVNYYMYHGGTNFG--------RTAAAF--MITGYYDQAPLDEYGLVREPKWGHL 246
A N S VN YM+ GGTNFG A++F + T Y APL E G E K+ +
Sbjct: 292 ALNAS-VNLYMFEGGTNFGFWNGANANAGASSFQPVPTSYDYDAPLTEAGDPTE-KYFAI 349
Query: 247 KEL---HAAI 253
+E+ HA++
Sbjct: 350 REVVGKHASL 359
>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
Length = 594
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/336 (29%), Positives = 135/336 (40%), Gaps = 61/336 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
Query: 251 AAIKLCSR--PLLTGT--QNVISLGQLQEAFVFEET 282
S+ PL+ + Q I L F ET
Sbjct: 328 EEYPALSQAEPLVKDSFAQTAIPLTNKVSLFATLET 363
>gi|413922057|gb|AFW61989.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 139
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/70 (61%), Positives = 55/70 (78%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
MWP L+ KAK+GGLDV+QTYVFWN HEP +GQY F R D++RF+K + GLYV LRIG
Sbjct: 58 MWPGLLQKAKDGGLDVVQTYVFWNGHEPVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIG 117
Query: 61 PFIESEWTYG 70
P++ +EW +G
Sbjct: 118 PYVCAEWNFG 127
>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 604
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|156408171|ref|XP_001641730.1| predicted protein [Nematostella vectensis]
gi|156228870|gb|EDO49667.1| predicted protein [Nematostella vectensis]
Length = 647
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 127/295 (43%), Gaps = 49/295 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G++ +QTYV WNLHEP QY+F+G ++ F++ QS L V LR GP
Sbjct: 53 WKDRLLKLKASGMNTVQTYVPWNLHEPIPKQYNFAGNANLTSFLEIAQSLDLLVILRPGP 112
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE-------YQTIEPAFHEKGPPYVLW 114
+I +EW +GGLP WL IV RS +E ++P +E G P ++
Sbjct: 113 YICAEWDFGGLPGWLLKDPSIVIRSSQGKAYMEAVDAWMSVLLPLVKPFLYENGGPVIMV 172
Query: 115 AA------------------KMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET---F 153
+ +H ++ DD C + T F
Sbjct: 173 QVENEYGDYIHCDHQYMLHLQQLFRYHLTDDIILFTTDDGSNLTAIECGTLPSLYTTVDF 232
Query: 154 KGPNSPN------------KPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
P+ P + +E +T + WG R+++ +A + +A N S
Sbjct: 233 GANTDPSIPFANQRKLQQKGPLVNSEFYTGWLDYWGTPHQTRTSKVVADALDKILALNAS 292
Query: 202 YVNYYMYHGGTNFGR-TAAAF------MITGYYDQAPLDEYGLVREPKWGHLKEL 249
VN YM+ GGTNFG + A F + T Y APL E G + E K+ ++E+
Sbjct: 293 -VNLYMFEGGTNFGFWSGADFHGQYQPVPTSYDYDAPLTEAGDLTE-KYHAIREV 345
>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
Length = 594
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 125/308 (40%), Gaps = 57/308 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLVNGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
Query: 251 AAIKLCSR 258
S+
Sbjct: 328 EEYPALSQ 335
>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
Length = 604
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
Length = 594
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/336 (29%), Positives = 135/336 (40%), Gaps = 61/336 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
Query: 251 AAIKLCSR--PLLTGT--QNVISLGQLQEAFVFEET 282
S+ PL+ + Q I L F ET
Sbjct: 328 EEYPALSQAEPLVKDSFAQTAIPLTNKVSLFATLET 363
>gi|440896703|gb|ELR48559.1| Beta-galactosidase-1-like protein 2, partial [Bos grunniens mutus]
Length = 542
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 108/249 (43%), Gaps = 49/249 (19%)
Query: 19 TYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTYGGLPIWLHD 78
+YV WNLHEP++G +DFSG D+ FI GL+V LR GP+I SE GGLP WL
Sbjct: 1 SYVPWNLHEPERGTFDFSGNLDLEAFILLAAEVGLWVILRPGPYICSEVDLGGLPSWLLR 60
Query: 79 VAGIVFRSDNKPY-----------------------------KIENEYQTIEPAFHEKGP 109
+ R+ K + ++ENEY + + K P
Sbjct: 61 DPDMRLRTTYKGFTEAVDLYFDHLMLRVVPLQYKHGGPIIAVQVENEYGS-----YNKDP 115
Query: 110 PYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPNS----------- 158
Y+ + K D G+ ++ D+ G +G+ + +
Sbjct: 116 AYMPYIKKALQD--RGIAELLLTSDNQGGLESGVLDGVLATINLQSQSELQLFTTILLGA 173
Query: 159 -PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGGTNFGRT 217
++P + E WT ++ WGG YI + ++ V+ I K GS +N YM+HGGTNFG
Sbjct: 174 QGSQPKMVMEYWTGWFDSWGGPHYILDSSEVLNTVSA-IVKAGSSINLYMFHGGTNFGFI 232
Query: 218 AAAFMITGY 226
A Y
Sbjct: 233 GGAMHFQDY 241
>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
Length = 592
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 116/289 (40%), Gaps = 56/289 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN+ EP+KG++ F G D +F+ Q GLY +R P
Sbjct: 34 WQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFDGLCDFEKFLDLAQKLGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW GGLP W+ V G+ R N+PY +
Sbjct: 94 YICAEWELGGLPSWIFTVPGLEPRCKNEPYYQNVRDYYKVLLPRLVNHQIDKGGNIILMQ 153
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
IENEY ++ K Y+ + + + VP+V + C+G
Sbjct: 154 IENEY-----GYYGKDMSYMHFLEGLMREGGITVPFVTSDGPWGKMFIHGQCDGALPTGN 208
Query: 153 FKGPNSP--------------NKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
F P P + E W ++ WG K + S ++ K
Sbjct: 209 FGSHARPLFANMKRMMKKTGNRGPLMCMEFWIGWFDAWGNKEHKTSKLKRNIKDLNYMLK 268
Query: 199 NGSYVNYYMYHGGTNFG-------RTAAAFMITGYYDQAPLDEYGLVRE 240
G+ VN+YM+HGGTNFG T T Y APL E G + E
Sbjct: 269 KGN-VNFYMFHGGTNFGFMNGSNYFTKLTPDTTSYDYDAPLSEDGKITE 316
>gi|384248639|gb|EIE22122.1| hypothetical protein COCSUDRAFT_1093, partial [Coccomyxa
subellipsoidea C-169]
Length = 632
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 124/304 (40%), Gaps = 63/304 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + K GL+ + YV WNLHEP GQY++ G D+ ++ Q QGLYV LR GP
Sbjct: 50 WKDRMLRTKALGLNTLSVYVPWNLHEPFPGQYNWDGFADLEAYLALAQEQGLYVLLRPGP 109
Query: 62 FIESEWTYGGLPIWLHDVAG---------IVFRSDNKPY--------------------- 91
+I +EW +GG P WL + RSD+ Y
Sbjct: 110 YICAEWDFGGFPWWLASSKAGLCSTSSHSVTLRSDDPAYLELVDRWWKVLLPKIGRFLYS 169
Query: 92 --------KIENEYQTIEPAFHEKGPPYVLWAAKMAVD----FHTGVPWVMCKQDDAPGP 139
++ENE+ + P +EK +++ + ++ +T P + PG
Sbjct: 170 RGGNILMVQVENEFGFVGP--NEKYMRHLVGTVRASLGDDALIYTTDPPPNIAKGTLPGD 227
Query: 140 VINACNGMRCG--------ETFKGPNSPNK-PSIWTEDWTSFYQVWGGKPYIRSAQDI-- 188
+ + G + N+P K P + +E +T + WG K S
Sbjct: 228 EVLSVVDFGAGWFDLNWAFSQQRAMNAPGKSPPMCSEFYTGWLTRWGEKMANTSVDQFLD 287
Query: 189 AFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFM--------ITGYYDQAPLDEYGLVRE 240
H L A N VN YM HGGTNFG TA + IT Y AP+ E G +
Sbjct: 288 TLHGVLGFANNTGSVNLYMVHGGTNFGFTAGGSIDNGVYWACITSYDYDAPISEAGDTGQ 347
Query: 241 PKWG 244
P G
Sbjct: 348 PGIG 351
>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|189217683|ref|NP_001121284.1| galactosidase, beta 1-like precursor [Xenopus laevis]
gi|115527881|gb|AAI24928.1| LOC100158367 protein [Xenopus laevis]
Length = 645
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 118/284 (41%), Gaps = 56/284 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GLD I TYV WN HE + G Y+FSG +DI F+K GL V LR GP
Sbjct: 61 WKDRLLKMKMAGLDAIYTYVPWNFHETKPGVYNFSGDHDIESFLKLANEIGLLVILRAGP 120
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPA-----FHEKGPPYVL- 113
+I +EW GGLP WL IV RS + Y ++N P +H GP +
Sbjct: 121 YICAEWDMGGLPAWLLAKESIVLRSSDPDYLQAVDNWMGVFLPKMKPLLYHNGGPIISVQ 180
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVINACNGMRCGETFK--- 154
+ + D+ H G ++ D + A +RCG
Sbjct: 181 VENEYGSYFTCDYNYLRHLLQLFRHHLGDEVILFTTDGS------ALQLVRCGTIQGLYT 234
Query: 155 ----GPNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
GP S P P I +E +T + WG + + + + + +A
Sbjct: 235 TVDFGPGSNITETFLVQRHCEPKGPLINSEFYTGWLDHWGEPHSVVATERVTKSLDEILA 294
Query: 198 KNGSYVNYYMYHGGTNFG-----RTAAAFMITGYYDQAPLDEYG 236
G+ VN YM+ GGTNFG T A T Y APL E G
Sbjct: 295 I-GASVNMYMFIGGTNFGYWNGANTPYAPQPTSYDYDAPLSEAG 337
>gi|444509211|gb|ELV09205.1| Beta-galactosidase [Tupaia chinensis]
Length = 600
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 130/299 (43%), Gaps = 46/299 (15%)
Query: 10 KEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGPFIESEWTY 69
+ GL+ IQTYV WN HEPQ GQY FS +D+ FI+ GL V LR GP+I +EW
Sbjct: 2 RMAGLNAIQTYVPWNFHEPQPGQYRFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDM 61
Query: 70 GGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL------WAAK 117
GGLP WL + IV RS + Y + ++P ++ G P + +
Sbjct: 62 GGLPAWLLEKESIVLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPIITVQVENEYGRY 121
Query: 118 MAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-KGPN----- 157
+ D+ H G ++ D A ++ A G+ F G N
Sbjct: 122 FSCDYDYLRFLQKLFRHHLGDDALLFTTDGAREKLLQCGALQGLYATVDFGAGENVTAAF 181
Query: 158 ------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALF-IAKNGSYVNYYMYHG 210
P P + +E +T + W G+P+ + Q A +L+ I +G+ VN YM+ G
Sbjct: 182 QIQRMSEPKGPLVNSEFYTGWLDHW-GQPH-STVQTEAVASSLYDILAHGANVNLYMFIG 239
Query: 211 GTNF-----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELHAAIKLCSRPLLTGT 264
GTNF T A T Y APL E G + E + K + K+ P+ T
Sbjct: 240 GTNFAYWNGANTPYAPQPTSYDYDAPLSEAGDLTEKYFALRKVIQKFAKIPEGPIPPST 298
>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 594
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
Length = 649
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 129/297 (43%), Gaps = 55/297 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TY+ WNLHE ++G++DFS D+ ++ ++ GL+V LR GP
Sbjct: 80 WKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGP 139
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E GGLP WL R+ NK + +
Sbjct: 140 YICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIPKILPLQYRHGGPVIAVQ 199
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + +K Y+ + K + G+ ++ DD G I + NG
Sbjct: 200 VENEYGSF-----QKDRNYMNYLKKALLK--RGIVELLLTSDDKDGIQIGSVNGALTTIN 252
Query: 153 FKG----------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+KP + E WT +Y WG K +SA++I V FI+ S+
Sbjct: 253 MNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF 312
Query: 203 VNYYMYHGGTNFGRTAAA-------FMITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
N YM+HGGTNFG ++T Y A L E G E K+ L++L A+
Sbjct: 313 -NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAGDYTE-KYFKLRKLFAS 367
>gi|408677368|ref|YP_006877195.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
gi|328881697|emb|CCA54936.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
Length = 611
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 127/297 (42%), Gaps = 50/297 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + GL+ ++TYV WNLHEP+ G+Y + + RF+ + G++ +R GP
Sbjct: 35 WEHRLGMLRAMGLNCVETYVPWNLHEPEPGRY--ADVAALGRFLDAVARAGMWAIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFHE----KGPPYVL-- 113
+I +EW GGLP WL G RS + + +E ++ + P E +G P VL
Sbjct: 93 YICAEWENGGLPHWLTGPLGRRVRSFDPEFLAPVEAWFRRLLPQVVERQIDRGGPVVLVQ 152
Query: 114 ----------------WAAKMAVDFHTGVPWV--------MCKQDDAPGPVINA--CNGM 147
W A++ VP M PG + A +G
Sbjct: 153 VENEYGSYGSDRAYLEWLAELLRGCGVAVPLFTSDGPEDHMLTGGSVPGVLATANFGSGA 212
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
R G + P+ P + E W ++ WG + +R A D A I + G+ VN YM
Sbjct: 213 REGFATLRRHQPSGPLMCMEFWCGWFDHWGTEHAVRDAADAA-EALREILECGASVNVYM 271
Query: 208 YHGGTNFGRTAAA------------FMITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
HGGTNFG A A +T Y AP+DE G E W +E+ AA
Sbjct: 272 AHGGTNFGGFAGANRAGELHDGPLRATVTSYDYDAPVDEAGRPTEKFW-RFREVLAA 327
>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 594
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
Length = 662
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 129/297 (43%), Gaps = 55/297 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TY+ WNLHE ++G++DFS D+ ++ ++ GL+V LR GP
Sbjct: 93 WKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGP 152
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E GGLP WL R+ NK + +
Sbjct: 153 YICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIPKILPLQYRHGGPVIAVQ 212
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + +K Y+ + K + G+ ++ DD G I + NG
Sbjct: 213 VENEYGSF-----QKDRNYMNYLKKALLK--RGIVELLLTSDDKDGIQIGSVNGALTTIN 265
Query: 153 FKG----------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+KP + E WT +Y WG K +SA++I V FI+ S+
Sbjct: 266 MNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF 325
Query: 203 VNYYMYHGGTNFGRTAAA-------FMITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
N YM+HGGTNFG ++T Y A L E G E K+ L++L A+
Sbjct: 326 -NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAGDYTE-KYFKLRKLFAS 380
>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQVFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
Length = 594
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Cavia porcellus]
Length = 679
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 129/306 (42%), Gaps = 55/306 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + + TY+ WNLHEPQ+G++ FSG D+ F+ GL+V LR GP
Sbjct: 126 WRDRLLKLKACGFNTVTTYIPWNLHEPQRGKFVFSGNLDLEAFVLLAAEIGLWVILRPGP 185
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E GGLP WL R+ + + +
Sbjct: 186 YICAEIDLGGLPSWLLQNPKTQLRTTERTFVDAVDAYFDHLMRRMVPLQYHHGGPVIAVQ 245
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK--QDDAPGPVINACNGMRCG 150
+ENEY + F+ G Y+ + + + C +D G + + G
Sbjct: 246 VENEYGS----FNRDG-QYMAYLKEALLKRGIVELLFTCDYYKDVVNGSLKGVLATVNLG 300
Query: 151 ETFKGPNS--------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G NS +KP + E W +Y WG +SA ++A V+ FI KNG
Sbjct: 301 SL--GKNSFYQLLQVQSHKPILIMEYWVGWYDSWGLPHANKSAAEVAHTVSTFI-KNGIS 357
Query: 203 VNYYMYHGGTNFGRTAAAFMITG-------YYDQAPLDEYGLVREPKWGHLKELHAAIKL 255
N YM+HGGTNFG AA ++ G Y A L E G E K+ L+EL +
Sbjct: 358 FNVYMFHGGTNFGFINAAGIVEGRRSVTTSYDYDAVLSEAGDYTE-KYFKLRELLGSFSA 416
Query: 256 CSRPLL 261
P L
Sbjct: 417 VPLPHL 422
>gi|365860016|ref|ZP_09399844.1| putative beta-galactosidase [Streptomyces sp. W007]
gi|364010544|gb|EHM31456.1| putative beta-galactosidase [Streptomyces sp. W007]
Length = 645
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 123/285 (43%), Gaps = 49/285 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A GL+ ++TYV WNLHEP++G+ G + RF+ ++ GL+ +R GP
Sbjct: 35 WEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG--ALGRFLDAVERAGLWAIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK--IENEYQTIEPAFHE----KGPPYVL-- 113
+I +EW GGLP+W+ G R+ + Y+ +E ++ + P + +G P +L
Sbjct: 93 YICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWFRELLPQVVQRQVSRGGPVILVQ 152
Query: 114 ----------------WAAKMAVDFHTGVPWV--------MCKQDDAPGPVINACNGMRC 149
W A + VP M PG + A G
Sbjct: 153 AENEYGSYGSDAVYLEWLAGLLRQCGVTVPLFTSDGPEDHMLTGGSVPGLLATANFGSGA 212
Query: 150 GETFKG--PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
E F+ + P P + E W ++ WG +P R + A + + + G+ VN YM
Sbjct: 213 REGFEVLLRHQPRGPLMCMEFWCGWFDHWGAEPVRRDPEQAAGALREVL-ECGASVNIYM 271
Query: 208 YHGGTNFGRTAAAF------------MITGYYDQAPLDEYGLVRE 240
HGGTNFG A A +T Y AP+DEYG E
Sbjct: 272 AHGGTNFGGWAGANRSGPHQDESFQPTVTSYDYDAPVDEYGRATE 316
>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
Length = 688
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 129/297 (43%), Gaps = 55/297 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TY+ WNLHE ++G++DFS D+ ++ ++ GL+V LR GP
Sbjct: 119 WKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGP 178
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E GGLP WL R+ NK + +
Sbjct: 179 YICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIPKILPLQYRHGGPVIAVQ 238
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + +K Y+ + K + G+ ++ DD G I + NG
Sbjct: 239 VENEYGSF-----QKDRNYMNYLKKALLK--RGIVELLLTSDDKDGIQIGSVNGALTTIN 291
Query: 153 FKG----------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+KP + E WT +Y WG K +SA++I V FI+ S+
Sbjct: 292 MNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF 351
Query: 203 VNYYMYHGGTNFGRTAAA-------FMITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
N YM+HGGTNFG ++T Y A L E G E K+ L++L A+
Sbjct: 352 -NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAGDYTE-KYFKLRKLFAS 406
>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
Length = 604
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|221043038|dbj|BAH13196.1| unnamed protein product [Homo sapiens]
Length = 647
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 120/282 (42%), Gaps = 44/282 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQTYV WN +EP GQY FS +D+ F++ GL V LR GP
Sbjct: 35 WKDRLLKMKMAGLNAIQTYVPWNFYEPWPGQYQFSEDHDVEYFLRLAHELGLLVILRPGP 94
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + I+ RS + Y + ++P ++ G P +
Sbjct: 95 YICAEWEMGGLPAWLLEKESILLRSSDPDYLAAVDKWLGVLLPKMKPLLYQNGGPVITVQ 154
Query: 114 ----WAAKMAVDF------------HTGVPWVMCKQDDAPGPVIN--ACNGMRCGETF-K 154
+ + A DF H G V+ D A + A G+ F
Sbjct: 155 VENEYGSYFACDFDYLRFLQKRFRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGT 214
Query: 155 GPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYV 203
G N P P I +E +T + WG + +A + +A+ G+ V
Sbjct: 215 GSNITDAFLSQRKCEPKGPLINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILAR-GASV 273
Query: 204 NYYMYHGGTNF-----GRTAAAFMITGYYDQAPLDEYGLVRE 240
N YM+ GGTNF + A T Y APL E G + E
Sbjct: 274 NLYMFIGGTNFAYWNGANSPYAAQPTSYDYDAPLSEAGDLTE 315
>gi|321478650|gb|EFX89607.1| hypothetical protein DAPPUDRAFT_303198 [Daphnia pulex]
Length = 651
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 130/313 (41%), Gaps = 63/313 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP + K + GL+V++TYV W HEPQ G Y F G DI + + Q L V LR GP
Sbjct: 62 WPDRMRKMRAAGLNVLETYVEWASHEPQPGVYAFEGNLDIEYYFELAQHFNLSVILRPGP 121
Query: 62 FIESEWTYGGLPIWLHDV-AGIVFRSDNKPY----------------------------- 91
FI++E GGLP WL V I R+ +K Y
Sbjct: 122 FIDAERDMGGLPFWLLSVDPSIKLRTSDKSYVTHVEKWFSVLLSKIKPYLYNNGGPIVTV 181
Query: 92 KIENEYQTIEPAFHEKGPPYVLWA--------AKMAVDFHT---GVPWVMCKQDDAPGPV 140
++ENEY + P + Y W K V F T G ++ C +
Sbjct: 182 QVENEYGSYSPCDRD----YTSWLRDFIRQHLGKDVVLFSTDGDGDGYLQCGKIPGVYAT 237
Query: 141 INACNGMRCGETFKGPNSPNK------PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
++ G E+FK P + P + +E + + +WG +D+ +
Sbjct: 238 VDFGAGSNAVESFK----PQRHFELAGPRVNSEFYPGWLDMWGEPHSTVDKEDVVKTLDD 293
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAF-------MITGYYDQAPLDEYGLVREPKWGHLK 247
+A N S V+ YM+HGGT+FG T+ A IT Y APL+E G E + K
Sbjct: 294 MLAINAS-VSMYMFHGGTSFGFTSGALPSNTYTPCITSYDYDAPLNEAGDPTEKYFSIRK 352
Query: 248 ELHAAIKLCSRPL 260
+ + L P+
Sbjct: 353 VISKYLPLPDFPV 365
>gi|302526862|ref|ZP_07279204.1| beta-galactosidase [Streptomyces sp. AA4]
gi|302435757|gb|EFL07573.1| beta-galactosidase [Streptomyces sp. AA4]
Length = 609
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 116/289 (40%), Gaps = 63/289 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +++ K GL+ ++TYV WN H+P G+ DF G D+ FI+ G V +R P
Sbjct: 64 WHDRLSRLKALGLNTVETYVAWNFHQPTPGRADFRGDRDLPAFIRTAGELGFQVIVRPSP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL + R + Y +
Sbjct: 124 YICAEWEFGGLPAWLLADRNMELRCADPAYLKAVDAWYDQLIPQLTPLEAQHGGPIVAVQ 183
Query: 93 IENEYQT----------IEPAFHEKGPPYVLWAAKMAVDFHT---GVPWVM--CKQDDAP 137
IENEY + + + +G +L+ A A +F +P + D P
Sbjct: 184 IENEYGSYGNDTSYLAHLRDSLRSRGITSLLFVADGASEFFMRFGELPGTLEAGTGDGDP 243
Query: 138 GPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
P I A R P P + E W ++ WG + Q A H+ +A
Sbjct: 244 APSIAALKAFR----------PGAPVMMAEYWDGWFDHWGEPHHTTDPQQTAAHIDQLLA 293
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLV 238
G+ VN YM GGTN+G TA A +T Y +P+ E G V
Sbjct: 294 -TGASVNLYMACGGTNYGFTAGANTSGLQYQPTVTSYDYDSPVGEAGDV 341
>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
Length = 586
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 129/297 (43%), Gaps = 55/297 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + + TY+ WNLHE ++G++DFS D+ ++ ++ GL+V LR GP
Sbjct: 17 WKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYVLLAKTIGLWVILRPGP 76
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +E GGLP WL R+ NK + +
Sbjct: 77 YICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIPKILPLQYRHGGPVIAVQ 136
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGET 152
+ENEY + +K Y+ + K + G+ ++ DD G I + NG
Sbjct: 137 VENEYGSF-----QKDRNYMNYLKKALLK--RGIVELLLTSDDKDGIQIGSVNGALTTIN 189
Query: 153 FKG----------PNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
+KP + E WT +Y WG K +SA++I V FI+ S+
Sbjct: 190 MNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKSAEEIRHTVYKFISYGLSF 249
Query: 203 VNYYMYHGGTNFGRTAAA-------FMITGYYDQAPLDEYGLVREPKWGHLKELHAA 252
N YM+HGGTNFG ++T Y A L E G E K+ L++L A+
Sbjct: 250 -NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAGDYTE-KYFKLRKLFAS 304
>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
Length = 594
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 597
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 120/288 (41%), Gaps = 60/288 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WN HE +G++DFSG DI RFI ++ GLYV +R P
Sbjct: 34 WEHSLYNLKALGFNAVETYVPWNFHETVEGEFDFSGTKDIKRFIHTAEAIGLYVIIRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL + RS + + +
Sbjct: 94 YICAEWEFGGLPAWLLTKPNLRVRSRDPQFLEYVERYYDRLFEILTPLQIDHHGPILMMQ 153
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVP-------WVMC-------KQDDAPG 138
+ENEY + + + Y+ A+M D VP W C + D P
Sbjct: 154 VENEYGS-----YGEDKTYLSALARMMRDRGVTVPLFTSDGSWQQCLEAGSLAEADIIPT 208
Query: 139 PVINACNGMRCGETFKGPNSPNK--PSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
+ + R K K P + E W ++ WG + R + ++ + +
Sbjct: 209 GNFGSKSQKRLDNLHKFHQQFGKTWPLMSMEFWDGWFNRWGDRIITRQSDELIDEIGE-V 267
Query: 197 AKNGSYVNYYMYHGGTNFG-------RTAAAF-MITGYYDQAPLDEYG 236
K GS +N YM+HGGTNFG R +T Y APLDE G
Sbjct: 268 LKRGS-INLYMFHGGTNFGFWNGCSARGRIDLPQVTSYDYDAPLDEAG 314
>gi|395803570|ref|ZP_10482814.1| beta-galactosidase [Flavobacterium sp. F52]
gi|395434124|gb|EJG00074.1| beta-galactosidase [Flavobacterium sp. F52]
Length = 617
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 142/314 (45%), Gaps = 59/314 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDF-SGRNDIIRFIKEIQSQGLYVCLRIG 60
W + K GL+ + TYVFWN HE + G +DF +G D+ F++ +S+GLYV LR G
Sbjct: 58 WRHRLQMLKAMGLNTVATYVFWNYHEIEPGVWDFKTGNRDLAEFLRIAKSEGLYVILRPG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLW 114
P+ EW +GG P WL + +V R++NK + +E+ Y ++ F +G P ++
Sbjct: 118 PYACGEWEFGGYPWWLQNNPDLVIRTNNKAFLDACKTYLEHLYAVVKGNFANQGGPIIMV 177
Query: 115 AAK------------MAVDFHTGVP---WVMCKQDDAPGP-----------------VIN 142
A+ ++ + H + + K+ P P V+
Sbjct: 178 QAENEFGSYVSQRTDISAEDHKAYKTAIYNILKETGFPEPFFTSDGSWLFEGGMVEGVLP 237
Query: 143 ACNGMRCGETFKGP----NSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIA 197
NG E K + P + E + + W +P+++ +++IA ++
Sbjct: 238 TANGESNIENLKKQVDKYHKGQGPYMVAEFYPGWLDHW-AEPFVKIGSEEIASQTKKYLD 296
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKE 248
S+ NYYM HGGTNFG T+ A IT Y AP+ E G PK+ +++
Sbjct: 297 AGVSF-NYYMAHGGTNFGFTSGANYNEESDIQPDITSYDYDAPISEAGWAT-PKFMAIRD 354
Query: 249 L---HAAIKLCSRP 259
+ ++ KL + P
Sbjct: 355 VMQKYSKTKLAAIP 368
>gi|257866484|ref|ZP_05646137.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257873001|ref|ZP_05652654.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
gi|257800442|gb|EEV29470.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257807165|gb|EEV35987.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
Length = 591
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 112/254 (44%), Gaps = 43/254 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WNLHEP++G YDF G DI F+K+ Q+ GL V LR
Sbjct: 34 WADSLYNLKALGANTVETYIPWNLHEPREGVYDFEGMKDIFAFVKQAQALGLMVILRPSV 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFH----EKGPPYVL-- 113
+I +EW +GGLP WL + + RS + + K+ N +Q + P G P ++
Sbjct: 94 YICAEWEFGGLPAWLLN-EPMRLRSTDPRFMAKVRNYFQVLLPKLVPLQITHGGPVIMMQ 152
Query: 114 -------WAAKMAVDFHT-------GVPWVMCKQDDAPGPVINACN------------GM 147
+ + A T G+ + D A V++A G
Sbjct: 153 VENEYGSYGMEKAYLRQTKELMEECGIDVPLFTSDGAWEEVLDAGTLIEDDVFVTGNFGS 212
Query: 148 RCGET------FKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
R E F + N P + E W ++ WG R QD+A V +A
Sbjct: 213 RSKENAAVMKEFMAKHGKNWPIMCMEYWDGWFNRWGEPIIKRDGQDLANEVKEMLAVGS- 271
Query: 202 YVNYYMYHGGTNFG 215
+N YM+HGGTNFG
Sbjct: 272 -LNLYMFHGGTNFG 284
>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 594
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
Length = 588
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 117/271 (43%), Gaps = 49/271 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN+HEP+KG++ F G DI RF+K Q GLYV LR P
Sbjct: 41 WQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGMLDIERFVKTAQELGLYVILRPSP 100
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEY----QTIEPAFHEKGPPYVLWA 115
+I +EW +GGLP WL G+ R P+ +++ Y + I P G P +L
Sbjct: 101 YICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYYDVLLKKIVPYQINYGGPVILMQ 160
Query: 116 AKMAVDFHTG-VPWVMCKQDD------------APGPVINACNGMRCGETFKGPNSPNK- 161
+ ++ +++ +D + GP NG N +K
Sbjct: 161 VENEYGYYANDREYLLAMRDKMQKGGVVVPLVTSDGPFEENLNGGHLEGALPTGNFGSKT 220
Query: 162 --------------PSIWTEDWTSFYQVWGGKPYI-----RSAQDIAFHVALFIAKNGSY 202
P + TE W ++ WG ++ S +D+ + L +
Sbjct: 221 EERFEVLKKYTDGGPLMCTEFWVGWFDHWGNGGHMTGNLEESVKDLDKMLEL------GH 274
Query: 203 VNYYMYHGGTNFGRTAAAFMITGYYDQAPLD 233
VN YM+ GGTNFG + YYD+ D
Sbjct: 275 VNIYMFEGGTNFGFMNGS----NYYDELTPD 301
>gi|354490996|ref|XP_003507642.1| PREDICTED: beta-galactosidase-1-like protein [Cricetulus griseus]
Length = 648
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 124/283 (43%), Gaps = 52/283 (18%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + K + GL+ +Q YV WN HEP+ G Y+F+G D+I F+ E L V LR G
Sbjct: 60 LWADRLLKMRLSGLNAVQFYVPWNYHEPEPGVYNFNGSRDLIAFLDEATRVNLLVILRPG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFHEKGPPYVL----- 113
P+I +EW GGLP WL I R+ + + +++ ++ + P + PY+
Sbjct: 120 PYICAEWEMGGLPSWLLRKPNIHLRTSDPAFLSAVDSWFKVLLPKIY----PYLYHNGGN 175
Query: 114 ---------WAAKMAVDF----HTGVPWVMCKQDDAPGPVINACNGMRCGETFK------ 154
+ + A D+ H + D+ + G+RCG
Sbjct: 176 IISIQVENEYGSYRACDYKYMRHLAGLFRTLLGDEILLFTTDGPQGLRCGSLQGLYTTID 235
Query: 155 -GPN-------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
GP P+ P + +E +T + WG +R++ IA + + + G
Sbjct: 236 FGPADNMTRIFSLLRDYEPHGPLVNSEYYTGWLDYWGQNHSMRTSSAIAQGLEKML-RIG 294
Query: 201 SYVNYYMYHGGTNFGRTAAA------FMITGYYD-QAPLDEYG 236
+ VN YM+HGGTNFG A IT YD AP+ E G
Sbjct: 295 ASVNMYMFHGGTNFGYWNGADEKGRFLPITTSYDYDAPISEAG 337
>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 594
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|449664450|ref|XP_002165261.2| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
Length = 589
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 126/290 (43%), Gaps = 63/290 (21%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++K ++ GL+ IQTY+ WN HEP +G + F G+ ++ +F+K Q L V LR GP
Sbjct: 56 WEDRLSKIRKAGLNAIQTYIPWNFHEPTEGNFQFGGQQNVFKFLKLAQKYDLLVILRPGP 115
Query: 62 FIESEWTYGGLPIWLHDVAG---IVFRSDNKPY--KIENEYQT----IEPAFHEKGPPYV 112
+I +EW +GG P WL G + R+ + Y K+EN + P +E G P +
Sbjct: 116 YICAEWEFGGFPYWLLKKVGNKTMQLRTSDNLYLQKVENYMSVLLSGLRPYLYENGGPII 175
Query: 113 L---------------WAAKMAVDF--HTGVPWVMCKQDDAPGPVINACNGMRCGETFK- 154
+ K+ F + G ++ D A + ++CG T K
Sbjct: 176 TVQVENEYGSYGCDHEYMYKLESIFRKYLGENVILFTTDGA------GDSYLKCG-TIKP 228
Query: 155 -------GPNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
GP + P P + +E +T + WGG+ S +D+ +
Sbjct: 229 LFATVDFGPTAEPKLYFDIQRKYQPLGPLVNSEFYTGWLDHWGGQHAHTSLEDVTDTLDK 288
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFMI--------TGYYDQAPLDEYG 236
++ N S VN YM+ GGTNFG A T Y APL E G
Sbjct: 289 MLSLNAS-VNMYMFEGGTNFGFMNGANQDSNSLQPQPTSYDYDAPLSEAG 337
>gi|423248537|ref|ZP_17229553.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
gi|423253485|ref|ZP_17234416.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392657385|gb|EIY51022.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
CL03T12C07]
gi|392659750|gb|EIY53368.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
CL03T00C08]
Length = 773
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 128/291 (43%), Gaps = 47/291 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I Y+FWN HE Q+G++DFSG ++ +F K Q G+Y+ LR GP
Sbjct: 57 WEHRILMCKALGMNTICLYMFWNYHEQQEGKFDFSGEKNVAKFCKLAQKHGMYIILRPGP 116
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENEYQTIEPAFHEKGPPYVLWAAKM--- 118
+ +EW GGLP WL + RS N PY +E ++ + P + +
Sbjct: 117 YACAEWEMGGLPWWLLKEKDMKVRSLN-PYFMERTEIFMKELGKQLAPLQLANGGNIIMV 175
Query: 119 ---------AVD--FHTGVPWVMCKQ---------------------DDAPGPVINACNG 146
VD + T + ++C+ DD +N G
Sbjct: 176 QVENEFGGYGVDKPYMTAIRDIVCRAGFDKSVLFQCDWDSTFELNALDDLLW-TLNFGTG 234
Query: 147 MRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
+ FK ++ P+ P + +E W+ ++ WG K R A+ + + + +N S+ +
Sbjct: 235 ANIDKEFKKLSTVRPDTPLMCSEFWSGWFDHWGRKHETRPAEKMVEGIKDMLDRNISF-S 293
Query: 205 YYMYHGGTNFGRTAAA------FMITGYYDQAPLDEYGLVREPKWGHLKEL 249
YM HGGT FG A M + Y AP+ E G PK+ L+EL
Sbjct: 294 LYMTHGGTTFGHWGGANSPTYSAMCSSYDYDAPISEAGWTT-PKYYLLQEL 343
>gi|332375542|gb|AEE62912.1| unknown [Dendroctonus ponderosae]
Length = 454
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 133/317 (41%), Gaps = 74/317 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDF----SGRNDII---RFIKEIQSQGLY 54
W + K + GL+ ++TYV WNLHEP+ G++DF S D + F+ + + L+
Sbjct: 58 WRDRLRKIRAAGLNTVETYVPWNLHEPENGKFDFGEGGSEFEDFLHLEEFLNAAKEEDLF 117
Query: 55 VCLRIGPFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------- 91
V LR GP+I SE+ GG P WL + FR+ + Y
Sbjct: 118 VILRTGPYICSEYNSGGFPSWLLREKPMGFRTSEENYMKFVTRFFNVVLTLLAAFQFQLG 177
Query: 92 ------KIENEYQTIE--PAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINA 143
++ENEY +E AF P V + G+ ++ D P+
Sbjct: 178 GPVIAFQVENEYGNLENGAAFQ---PDKVYMEELRQLFLKNGIVELLTSAD---SPLWKG 231
Query: 144 CNGMRCGETFKGPN---------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDI 188
+G GE F+ N P +P + E W ++ GG+ ++S +D
Sbjct: 232 TSGTLPGELFQTANFGDNAVNQLNKLEEFQPGRPLMVMEYWIGWFDNVGGEHSVKSDEDS 291
Query: 189 AFHVALFIAKNGSYVNYYMYHGGTNFGRTAAAFM------------ITGYYD-QAPLDEY 235
+ +KN S+ N YM+HGGTNF A + IT YD AP+ E
Sbjct: 292 RRVLEDIFSKNASF-NAYMFHGGTNFWFNNGANLDNDLMDNSGYTAITTSYDYDAPISES 350
Query: 236 GLVREPKWGHLKELHAA 252
G R K+ +KEL AA
Sbjct: 351 GGYRN-KYFIVKELVAA 366
>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 594
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
Length = 594
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
Length = 624
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 133/303 (43%), Gaps = 61/303 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWNLHE + G++DFSG ++ +I+ +G+ V LR GP
Sbjct: 55 WRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNLAEYIRIAGEEGMMVILRPGP 114
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVL-- 113
++ +EW +GG P WL ++ G+ R DN + I+ YQ + P KG P ++
Sbjct: 115 YVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDRLYQEVGPLQCTKGGPIIMVQ 174
Query: 114 ----------------------WAAKMA---VDFHTGVP-------WVM---CKQDDAPG 138
+ AK+ D VP W+ C P
Sbjct: 175 CENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPT 234
Query: 139 P--VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALF 195
+ N + + G P + + W S + G+P+ + SA +IA +
Sbjct: 235 ANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLSHW----GEPFPQVSASEIARQTEAY 290
Query: 196 IAKNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHL 246
+ + S+ N+YM HGGTNFG T+ A +T Y AP+ E G + PK+ +
Sbjct: 291 LQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWI-TPKYDSI 348
Query: 247 KEL 249
+ +
Sbjct: 349 RSV 351
>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
Length = 594
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|86142033|ref|ZP_01060557.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
gi|85831596|gb|EAQ50052.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
Length = 620
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 152/371 (40%), Gaps = 62/371 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDF-SGRNDIIRFIKEIQSQGLYVCLRIG 60
W I K GL+ I TYVFWN H P G +DF SG ++ FIK + + ++V LR G
Sbjct: 60 WRHRIQMMKAMGLNTIATYVFWNYHNPAPGVWDFESGNRNVAEFIKIAKEEEMFVILRPG 119
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY----------------------------- 91
P+ EW +GG P +L ++ G+ R +N +
Sbjct: 120 PYACGEWEFGGYPWFLQNIPGLKVRENNAQFLAACKEYINELAKQVAPLQVNNGGNIIMT 179
Query: 92 KIENEY-------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK-----QDDAPGP 139
++ENE+ + I P H+ Y KM D P+ + +
Sbjct: 180 QVENEFGSYVAQREDIAPEDHKA---YKEAIFKMLKDAGFQAPFFTSDGAWLFEGGSLEG 236
Query: 140 VINACNGMRCGETFKGP----NSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVAL 194
V+ NG + K N+ P + E + + W +P+++ SA DIA +
Sbjct: 237 VLPTANGEGNIDNLKKVVNKFNNNEGPYMVAEFYPGWLDHW-AEPFVKISASDIAKQTEV 295
Query: 195 FIAKNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGH 245
++ KNG N+YM HGGTNFG T+ A IT Y AP+ E G V PK+
Sbjct: 296 YL-KNGVNFNFYMAHGGTNFGFTSGANYNDEHDIQPDITSYDYDAPISEAGWVT-PKYDS 353
Query: 246 LKELHAAIKLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAVTVLFRN 305
++ L P + VI + Q+Q A + + + V +D L +
Sbjct: 354 IRALMQKYAPYEIPAVPEQIPVIEIPQIQLAKTTDALTFIKKQKPVTSDSPLTFEQLEQG 413
Query: 306 ISYELPRKSIS 316
Y L +K +
Sbjct: 414 FGYVLYKKRFT 424
Score = 39.7 bits (91), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 34/78 (43%), Gaps = 27/78 (34%)
Query: 538 LNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVTSIHFCAIIKATNTYHVP 597
LN+ MGKG +VNG ++GRYW K P QT Y VP
Sbjct: 550 LNMSEMGKGIVFVNGHNLGRYW------KVGPQQTLY---------------------VP 582
Query: 598 RAFLKPTGNLLVLLEEEN 615
+LK GN + + E+ N
Sbjct: 583 GCWLKKKGNTITIFEQLN 600
>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
Length = 594
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 94 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 152
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 153 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 209
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 210 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 269
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 270 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 327
>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
Length = 581
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 117/271 (43%), Gaps = 49/271 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K G + ++TY+ WN+HEP+KG++ F G DI RF+K Q GLYV LR P
Sbjct: 34 WQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGMLDIERFVKTAQELGLYVILRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEY----QTIEPAFHEKGPPYVLWA 115
+I +EW +GGLP WL G+ R P+ +++ Y + I P G P +L
Sbjct: 94 YICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYYDVLLKKIVPYQINYGGPVILMQ 153
Query: 116 AKMAVDFHTG-VPWVMCKQDD------------APGPVINACNGMRCGETFKGPNSPNK- 161
+ ++ +++ +D + GP NG N +K
Sbjct: 154 VENEYGYYANDREYLLAMRDKMQKGGVVVPLVTSDGPFEENLNGGHLEGALPTGNFGSKT 213
Query: 162 --------------PSIWTEDWTSFYQVWGGKPYI-----RSAQDIAFHVALFIAKNGSY 202
P + TE W ++ WG ++ S +D+ + L +
Sbjct: 214 EERFEVLKKYTDGGPLMCTEFWVGWFDHWGNGGHMTGNLEESVKDLDKMLEL------GH 267
Query: 203 VNYYMYHGGTNFGRTAAAFMITGYYDQAPLD 233
VN YM+ GGTNFG + YYD+ D
Sbjct: 268 VNIYMFEGGTNFGFMNGS----NYYDELTPD 294
>gi|383128326|gb|AFG44819.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128328|gb|AFG44820.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128336|gb|AFG44824.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128338|gb|AFG44825.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 87/156 (55%), Gaps = 8/156 (5%)
Query: 550 VNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
VNG+SIGRYW S+ S+G + + + A ++ + C + YHVPR++++PTGN
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQ-PSQKLYHVPRSWIQPTGN 59
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
+LVL EE G+P I+ ++ VC V+ +HLPP+ SW + + +K K +
Sbjct: 60 VLVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSW---KSSATSGLKVNKPKAEL 116
Query: 667 QPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHS 701
Q CP G I I FASFG P G C + G C++
Sbjct: 117 QLHCPSSGHLIKSIKFASFGTPTGHCGSFTYGHCNT 152
>gi|361068121|gb|AEW08372.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128330|gb|AFG44821.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
gi|383128334|gb|AFG44823.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 87/156 (55%), Gaps = 8/156 (5%)
Query: 550 VNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
VNG+SIGRYW S+ S+G + + + A ++ + C + YHVPR++++PTGN
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQ-PSQKLYHVPRSWIQPTGN 59
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
+LVL EE G+P I+ ++ VC V+ +HLPP+ SW + + +K K +
Sbjct: 60 VLVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSW---KSSATSGLKVNKPKAEL 116
Query: 667 QPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHS 701
Q CP G I I FASFG P G C + G C++
Sbjct: 117 QLHCPSSGHLIKSIKFASFGTPTGRCGSFTYGHCNT 152
>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
Length = 624
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 133/303 (43%), Gaps = 61/303 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWNLHE + G++DFSG ++ +I+ +G+ V LR GP
Sbjct: 55 WRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNLAEYIRIAGEEGMMVILRPGP 114
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVL-- 113
++ +EW +GG P WL ++ G+ R DN + I+ YQ + P KG P ++
Sbjct: 115 YVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDRLYQEVGPLQCTKGGPIIMVQ 174
Query: 114 ----------------------WAAKMA---VDFHTGVP-------WVM---CKQDDAPG 138
+ AK+ D VP W+ C P
Sbjct: 175 CENEFGSYVSQRKDISFEEHRSYNAKIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPT 234
Query: 139 P--VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALF 195
+ N + + G P + + W S + G+P+ + SA +IA +
Sbjct: 235 ANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLSHW----GEPFPQVSASEIARQTEAY 290
Query: 196 IAKNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHL 246
+ + S+ N+YM HGGTNFG T+ A +T Y AP+ E G + PK+ +
Sbjct: 291 LQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWI-TPKYDSI 348
Query: 247 KEL 249
+ +
Sbjct: 349 RSV 351
>gi|340372779|ref|XP_003384921.1| PREDICTED: beta-galactosidase-like [Amphimedon queenslandica]
Length = 659
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 125/301 (41%), Gaps = 61/301 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++K GL+ +QTYV WN HEP G Y+F G +D++ F+K Q GL V LR GP
Sbjct: 68 WRDRLSKMYYAGLNAVQTYVPWNFHEPFPGVYNFEGDHDLVGFLKTAQDVGLLVILRAGP 127
Query: 62 FIESEWTYGGLPIW-LHDVAGIVFRSDNKPY----------------------------- 91
+I EW GG P W L + RS + Y
Sbjct: 128 YICGEWEMGGFPSWTLRNQPPPTLRSSDPSYLSLVDAWMGKLLPLVKPLLYENGGPIITV 187
Query: 92 KIENEYQT-----------IEPAFHEK-GPPYVLWAAKMAVDFHT---GVPWVMCKQDDA 136
++ENEY + +E F + GP VL+ A D + +P + D
Sbjct: 188 QVENEYGSFYTCDQKYMNHLESTFRQYLGPNVVLFTTDGAGDGYLKCGTIPSLYATVD-- 245
Query: 137 PGPVINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFI 196
A + F+ P P + +E +T + WG R+ IA + +
Sbjct: 246 ----FGATDNPEGYFAFQRKYEPKGPLVNSEFYTGWLDHWGQAHQTRNGDQIASSLDKIL 301
Query: 197 AKNGSYVNYYMYHGGTNFGRTAAAF--------MITGYYDQAPLDEYGLVREPKWGHLKE 248
A N S VN YM+ GGTNFG A T Y APL+E G + + K+G L+
Sbjct: 302 ALNAS-VNMYMFEGGTNFGFWNGANCGGQSYQPQPTSYDYDAPLNERGEMTD-KFGLLRS 359
Query: 249 L 249
+
Sbjct: 360 V 360
>gi|270295887|ref|ZP_06202087.1| beta-galactosidase [Bacteroides sp. D20]
gi|270273291|gb|EFA19153.1| beta-galactosidase [Bacteroides sp. D20]
Length = 1106
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/444 (25%), Positives = 172/444 (38%), Gaps = 70/444 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HE Q G +DF+G+ND+ F + Q +YV LR GP
Sbjct: 382 WDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGP 441
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R ++ PY
Sbjct: 442 YVCAEWEMGGLPWWLLKKKDIRLR-ESDPYFMERVGIFEKAVAEQVAGMTIQNGGPIIMV 500
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
++ENEY + +KG YV + + GV C D A N + +
Sbjct: 501 QVENEYGSYG---EDKG--YVSQIRDIVRANYPGVALFQC--DWASNFTKNGLHDLVWTM 553
Query: 152 TF-KGPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
F G N P+ P + +E W+ ++ WG R A D+ + ++K
Sbjct: 554 NFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHETRPAADMIAGIDEMLSKG 613
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKELHAAI 253
S+ + YM HGGTN+G A A +T Y AP+ E G W K L +
Sbjct: 614 ISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTTPKYWELRKALSKYM 672
Query: 254 ---KLCSRPLLTGTQNVISLGQLQEAFVFEETSGVCAAFLVNNDERKAV---TVLFRNIS 307
K P L + S + A +F+ + E ++L+R
Sbjct: 673 NGEKQAKVPALIKPIRIPSFQFTEMAPLFDNLPAAKKDRNIRTMEEYNQGFGSILYRTTL 732
Query: 308 YELPRKSISILPDCKTVA--FNTERVSTQYNKRSKTSNLKFDSDEKWEEYR---EAI--L 360
E+ S+ + D A F + + ++R+ L+F + K EA+ +
Sbjct: 733 PEMKTPSLLTVNDAHDYAQVFLDGKYIGKLDRRNGEKQLEFPACPKGARLDILVEAMGRI 792
Query: 361 NFDNTLLRAEGLLDQISAAKDASD 384
NF + +G+ + D D
Sbjct: 793 NFGRAIKDFKGITQSVELTVDIDD 816
>gi|326676244|ref|XP_001339426.3| PREDICTED: galactosidase, beta 1-like [Danio rerio]
Length = 301
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 70/261 (26%), Positives = 115/261 (44%), Gaps = 35/261 (13%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ + TYV WNLHEP++G Y F + D+ +I+ L+V LR GP
Sbjct: 38 WRDRLLKLKACGLNTLTTYVPWNLHEPERGVYVFQDQLDLEAYIRLAAELDLWVILRPGP 97
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYK------IENEYQTIEPAFHEKGPPYV--- 112
+I +EW GGLP WL + R+ + + I P ++KG P +
Sbjct: 98 YICAEWDLGGLPSWLLQDKKMKLRTTYSGFTSAVNSFFDKLIPRITPLQYKKGGPIIAVQ 157
Query: 113 --------------LWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN- 157
L K A+ G+ ++ D+ G +G+ + +
Sbjct: 158 VENEYGSYAKDEQYLSVVKEAL-MSRGISELLMTSDNREGLKCGGVDGVLQTVNLQKLSY 216
Query: 158 ---------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYMY 208
P KP + E W+ ++ VWG ++ SAQ++ + + G +N+YM+
Sbjct: 217 GDVQHLAELQPQKPLMVMEYWSGWFDVWGELHHVFSAQEM-ISIVRELLDRGVSINFYMF 275
Query: 209 HGGTNFGRTAAAFMITGYYDQ 229
HGG++FG + A + Y Q
Sbjct: 276 HGGSSFGFMSGAVDLGTYKPQ 296
>gi|417403754|gb|JAA48674.1| Putative beta-galactosidase [Desmodus rotundus]
Length = 669
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 119/283 (42%), Gaps = 46/283 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY FS +D+ FI+ L V LR GP
Sbjct: 73 WKDRLLKMKMAGLNAIQIYVPWNFHEPQPGQYQFSEDHDVECFIQLAHELELLVVLRPGP 132
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYVL-- 113
+I +EW GGLP WL + IV RS + Y + ++P ++ G P +
Sbjct: 133 YICAEWEMGGLPAWLLEKENIVLRSSDPDYLAAVDKWLGVILPKMKPLLYQNGGPIITVQ 192
Query: 114 ----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVIN--ACNGMRCGETFKG 155
+ + + D +H G ++ D + ++ A G+ F G
Sbjct: 193 VENEYGSYFSCDYDYLRFLQKRFHYHLGNDVILFTTDGSNEKLVQCGALQGLYATVDF-G 251
Query: 156 PNS-------------PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
P + P P I +E +T + W G+P+ + I G+
Sbjct: 252 PGANITDAFLIQRKYEPKGPLINSEFYTGWLDHW-GQPHSTVKTEAVVSSLQNILARGAN 310
Query: 203 VNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYGLVRE 240
VN YM+ GGTNF A M T Y APL E G + E
Sbjct: 311 VNLYMFIGGTNFAYWNGANMPYQAQPTSYDYDAPLSEAGDLTE 353
>gi|317479674|ref|ZP_07938798.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
gi|316904175|gb|EFV26005.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
Length = 1106
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 122/296 (41%), Gaps = 57/296 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HE Q G +DF+G+ND+ F + Q +YV LR GP
Sbjct: 382 WDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGP 441
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R ++ PY
Sbjct: 442 YVCAEWEMGGLPWWLLKKKDIRLR-ESDPYFMERVGIFEKAVAEQVAGMTIQNGGPIIMV 500
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
++ENEY + +KG YV + + GV C D A N + +
Sbjct: 501 QVENEYGSYG---EDKG--YVSQIRDIVRANYPGVALFQC--DWASNFTKNGLHDLVWTM 553
Query: 152 TF-KGPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
F G N P+ P + +E W+ ++ WG R A D+ + ++K
Sbjct: 554 NFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHETRPAADMIAGIDEMLSKG 613
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
S+ + YM HGGTN+G A A +T Y AP+ E G W K L
Sbjct: 614 ISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTTPKYWELRKAL 668
>gi|160890905|ref|ZP_02071908.1| hypothetical protein BACUNI_03350 [Bacteroides uniformis ATCC 8492]
gi|156859904|gb|EDO53335.1| glycosyl hydrolase family 35 [Bacteroides uniformis ATCC 8492]
Length = 1106
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 122/296 (41%), Gaps = 57/296 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HE Q G +DF+G+ND+ F + Q +YV LR GP
Sbjct: 382 WDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGP 441
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R ++ PY
Sbjct: 442 YVCAEWEMGGLPWWLLKKKDIRLR-ESDPYFMERVGIFEKAVAEQVAGMTIQNGGPIIMV 500
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
++ENEY + +KG YV + + GV C D A N + +
Sbjct: 501 QVENEYGSYG---EDKG--YVSQIRDIVRANYPGVALFQC--DWASNFTKNGLHDLVWTM 553
Query: 152 TF-KGPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
F G N P+ P + +E W+ ++ WG R A D+ + ++K
Sbjct: 554 NFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHETRPAADMIAGIDEMLSKG 613
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
S+ + YM HGGTN+G A A +T Y AP+ E G W K L
Sbjct: 614 ISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTTPKYWELRKAL 668
>gi|156552637|ref|XP_001603160.1| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 629
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/296 (29%), Positives = 134/296 (45%), Gaps = 48/296 (16%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W ++ K + GGL+ + TYV W++HEP+ Q+ + G DI+ FIK Q + L+V LR GP
Sbjct: 64 WRGILRKMRAGGLNAVSTYVEWSMHEPEFDQWVWDGDADIVEFIKIAQEEDLFVILRPGP 123
Query: 62 FIESEWTYGGLPIW-LHDVAGIVFRSDNKPY-----KIENE-YQTIEPAFHEKGPPYVL- 113
+I +E +GG P W L V I R+ ++ Y + NE + +P G P ++
Sbjct: 124 YICAERDFGGFPYWLLSRVPDIKLRTKDERYVFYAERFLNEILRRTKPLLRGNGGPIIMV 183
Query: 114 ---------------WAAKMAVDFH------------TGVPWVMCKQDDAPG--PVINAC 144
+ +KM FH G M K PG I+
Sbjct: 184 QVENEYGSFYACDDQYKSKMYEIFHRHVKNDAVLFTTDGSARSMLKCGSIPGVYATIDFG 243
Query: 145 NGMRCGETFKGPN--SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
NG +K SP P + +E + + WG ++ ++A + +A N S
Sbjct: 244 NGANVPFNYKIMREFSPKGPLVNSEYYPGWLTHWGESFQRVNSHNVAKTLDEMLAYNVS- 302
Query: 203 VNYYMYHGGTNFGRTAAAFM-------ITGYYDQAPLDEYGLVREPKWGHLKELHA 251
VN YMY+GGTNF T+ A + +T Y APL E G PK+ L+++ A
Sbjct: 303 VNIYMYYGGTNFAFTSGANINEHYWPQLTSYDYDAPLTEAG-DPTPKYFELRDVIA 357
>gi|423303842|ref|ZP_17281841.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|423307438|ref|ZP_17285428.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
gi|392687173|gb|EIY80470.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|392690047|gb|EIY83318.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
Length = 1106
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 122/296 (41%), Gaps = 57/296 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W I K G++ I YVFWN HE Q G +DF+G+ND+ F + Q +YV LR GP
Sbjct: 382 WDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFTGQNDLAEFCRLCQQNDMYVILRPGP 441
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------------------------------ 91
++ +EW GGLP WL I R ++ PY
Sbjct: 442 YVCAEWEMGGLPWWLLKKKDIRLR-ESDPYFMERVGIFEKAVAEQVAGMTIQNGGPIIMV 500
Query: 92 KIENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVINACNGMRCGE 151
++ENEY + +KG YV + + GV C D A N + +
Sbjct: 501 QVENEYGSYG---EDKG--YVSQIRDIVRANYPGVALFQC--DWASNFTKNGLHDLVWTM 553
Query: 152 TF-KGPN-----------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
F G N P+ P + +E W+ ++ WG R A D+ + ++K
Sbjct: 554 NFGTGANIDQQFAPLKKLRPDSPLMCSEFWSGWFDKWGANHETRPAADMIAGIDEMLSKG 613
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM------ITGYYDQAPLDEYGLVREPKWGHLKEL 249
S+ + YM HGGTN+G A A +T Y AP+ E G W K L
Sbjct: 614 ISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYDAPISESGQTTPKYWELRKAL 668
>gi|358341338|dbj|GAA49044.1| beta-galactosidase [Clonorchis sinensis]
Length = 604
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 136/313 (43%), Gaps = 65/313 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KAK GLD IQ Y+ WN HEP++G+Y+FS D+ F+ IQ + +R+GP
Sbjct: 3 WFDRLKKAKAAGLDAIQIYIPWNFHEPEEGEYNFSDDRDVEHFLDLIQQLDMLAIVRVGP 62
Query: 62 FIESEWTYGGLPIW-LHDVAGIVFRSDNKPY--KIENEYQTIEPA----FHEKGPPYVL- 113
+I +EW +GGLP W L + RS + Y ++ + + P + +G P ++
Sbjct: 63 YICAEWAFGGLPPWLLRKNPTMKLRSSDYSYYREVVKWFGVLLPKLRKHLYTEGGPIIMV 122
Query: 114 -----WAAKMAVD------------FHTGVPWVMCKQDDAPGPVINACNGMRCGE----- 151
+ A D +H G ++ D N+ +RCG
Sbjct: 123 QLENEYGYSTACDRDYMSMLYDLARYHLGQEVILFTTDG------NSLQILRCGSPDQRY 176
Query: 152 --------TFKGPN---------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVAL 194
T PN P +P + +E +T +Y WG K R A+ + +L
Sbjct: 177 LATVDFAPTTIPPNVSFDAVEKFRPGQPLVNSEFYTGWYDTWGSKHAHRPAELV--QESL 234
Query: 195 FIAKNGS---YVNYYMYHGGTNF----GRTAAAFMITGYYDQAPLDEYGLVREPKWGHLK 247
N S VN Y++HGGT+F G+ T Y APL E G + K+ L+
Sbjct: 235 IDLMNYSPRVNVNIYVFHGGTSFGFWSGKPNDVAATTSYDFDAPLSEAGDITY-KYELLR 293
Query: 248 ELHAAIKLCSRPL 260
+ A K +RPL
Sbjct: 294 K--AIHKFRNRPL 304
>gi|383128340|gb|AFG44826.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
Length = 157
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/156 (37%), Positives = 87/156 (55%), Gaps = 8/156 (5%)
Query: 550 VNGQSIGRYWVSFKTSKGNPSQT---QYAVNTVTSIHFCAIIKATNTYHVPRAFLKPTGN 606
VNG+SIGRYW S+ S+G + + + A ++ + C + YHVPR++++PTGN
Sbjct: 1 VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGK-PSQKLYHVPRSWIQPTGN 59
Query: 607 LLVLLEEENGNPLGITVDTIAIRKVCGHVTNSHLPPLSSWLRHRQRGDTDIKKFGKKPTV 666
+LVL EE G+P I+ ++ VC V+ +HLPP+ SW + + +K K +
Sbjct: 60 VLVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSW---KSSATSGLKVNKPKGEL 116
Query: 667 QPSCP-LGKKISKIVFASFGNPDGDCERYAVGSCHS 701
Q CP G I I FASFG P G C + G C++
Sbjct: 117 QLHCPSSGHLIKSIKFASFGTPTGHCGSFTYGHCNT 152
>gi|149711136|ref|XP_001493207.1| PREDICTED: galactosidase, beta 1-like [Equus caballus]
Length = 651
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 119/283 (42%), Gaps = 52/283 (18%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + K + GL+ +Q YV WN HEP+ G Y+F G D+I F+ E L V LR G
Sbjct: 61 LWADRLFKMRMSGLNAVQFYVPWNYHEPEPGVYNFHGSRDLIAFLNEAAIANLLVILRPG 120
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFHEKGPPYVL----- 113
P+I +EW GGLP WL I R+ + + +++ ++ + P H P++
Sbjct: 121 PYICAEWDMGGLPAWLLRKPKIHLRTSDPDFLAAVDSWFKVLLPKIH----PWLYHNGGN 176
Query: 114 ---------WAAKMAVDF----HTGVPWVMCKQDDAPGPVINACNGMRCGE--------- 151
+ + A DF H + D+ + G++CG
Sbjct: 177 IISIQVENEYGSYRACDFNYMRHLAGLFRAILGDEILLFTTDGPEGLKCGSLEGLYTTVD 236
Query: 152 -----------TFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
T P+ P + +E +T + WG RS + + + K G
Sbjct: 237 FGPADNMTKIFTLLRKYEPHGPLVNSEYYTGWLDYWGQNHSTRSVHSVTNGLENML-KLG 295
Query: 201 SYVNYYMYHGGTNFGRTAAA------FMITGYYD-QAPLDEYG 236
+ VN YM+HGGTNFG A IT YD AP+ E G
Sbjct: 296 ASVNMYMFHGGTNFGYWNGADEKGRFLPITTSYDYDAPISEAG 338
Score = 44.3 bits (103), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 41/100 (41%), Gaps = 28/100 (28%)
Query: 521 TWYKTTFRAPAGNDPIALNLQSMGKGEAWVNGQSIGRYWVSFKTSKGNPSQTQYAVNTVT 580
T+Y T F + L L KG+ W+NG ++GRYW +K P QT Y
Sbjct: 538 TFYSTMFAILGSSGDTFLYLPGWTKGQVWINGFNLGRYW-----TKRGPQQTLY------ 586
Query: 581 SIHFCAIIKATNTYHVPRAFLKPTG--NLLVLLEEENGNP 618
VPR L P G N + LLE EN P
Sbjct: 587 ---------------VPRPLLYPRGALNKITLLELENAPP 611
>gi|421767985|ref|ZP_16204697.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
gi|421773235|ref|ZP_16209883.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411182327|gb|EKS49478.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP3]
gi|411186672|gb|EKS53794.1| Beta-galactosidase 3 [Lactobacillus rhamnosus LRHMDP2]
Length = 656
Score = 100 bits (249), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 121/291 (41%), Gaps = 67/291 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHE ++G++DFSG DI RF+K + GLY +R P
Sbjct: 97 WYHSLYNLKALGFNTVETYVPWNLHEYREGEFDFSGILDIERFLKTAEDLGLYAIVRPSP 156
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + R+D+ Y +
Sbjct: 157 YICAEWEFGGFPAWLL-TKKMRLRTDDPAYLVAIDRYYTALMPHLVDHQVTHGGNVIMMQ 215
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGP-VINACNGMRCG- 150
+ENEY + + + Y+ AK+ VP D P P +NA + + G
Sbjct: 216 VENEYGS-----YGEDQDYLAAVAKLMQQHGVDVPLFTS---DGPWPATLNAGSMIDAGI 267
Query: 151 -----------------ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVA 193
F + + P + E W ++ W G+P IR D
Sbjct: 268 LATGNFGSAADKNFDRLAAFHQEHGRDWPLMCMEFWDGWFNRW-GEPIIRRDPDETAEDL 326
Query: 194 LFIAKNGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYG 236
+ K GS VN YM+HGGTNFG + +T Y APL+E G
Sbjct: 327 RAVIKRGS-VNLYMFHGGTNFGFMNGTSARKDHDLPQVTSYDYDAPLNEQG 376
>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
Length = 593
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 113/258 (43%), Gaps = 50/258 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TY+ WN+HEP +G++DF G DI +FIK + GLYV LR P
Sbjct: 34 WGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIKDIEKFIKISEKLGLYVILRPTP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRS--DNKPYKIENEYQTIEPAFHE----KGPPYVLWA 115
+I +EW +GGLP WL I RS DN K+ N Y + P + KG P ++
Sbjct: 94 YICAEWEFGGLPAWLLKDKEIKLRSSDDNFIEKLRNYYNDLLPRLVKYQVTKGGPVLM-- 151
Query: 116 AKMAVDFHTG----------VPWVMCKQDDAPGPVINA----CNGMRCG----------- 150
M V+ G + + K++ P+ + + CG
Sbjct: 152 --MQVENEYGSYGNEKEYLRIVASIMKENGVDVPLFTSDGTWIEALECGSLIEDDIFVSG 209
Query: 151 -------------ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIA 197
+ F N P + E W ++ WG R + D+A V +
Sbjct: 210 NFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDGWFNRWGEDIIRRDSIDLAEDVKEML- 268
Query: 198 KNGSYVNYYMYHGGTNFG 215
K GS +N YM+ GGTNFG
Sbjct: 269 KIGS-INLYMFRGGTNFG 285
>gi|453049630|gb|EME97211.1| beta-galactosidase [Streptomyces mobaraensis NBRC 13819 = DSM
40847]
Length = 584
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 115/286 (40%), Gaps = 59/286 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP +A + GL+ ++TYV WN HEP +G+ G ++ RF+ + GLY +R GP
Sbjct: 35 WPHRLAMLRAMGLNCVETYVPWNRHEPVEGRLHDVG--ELGRFLDAAGAAGLYAIVRPGP 92
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
++ +EW GGLP WL G R+ + + +
Sbjct: 93 YVCAEWENGGLPHWLTGRLGRRVRTSDPEFLRAVDGWLEAVGAELTGRQFGRGGPVVLVQ 152
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWV--------MCKQDDAPGPVINAC 144
+ENEY + + PY+ D VP V M PG
Sbjct: 153 VENEYGS-----YGSDQPYLEHLVGRLRDSGVVVPLVTSDGPEDHMLTGGTVPGATATVN 207
Query: 145 NGMRCGETFK--GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSY 202
G E F+ + P P + E W ++ WGG P R A + A + + G+
Sbjct: 208 FGSGAREAFRVLRRHRPAGPLMCMEFWCGWFAHWGGAPAARDAGEAA-EALREVLECGAS 266
Query: 203 VNYYMYHGGTNFG------------RTAAAFMITGYYDQAPLDEYG 236
VN YM HGGTNFG R A T Y AP+DEYG
Sbjct: 267 VNVYMAHGGTNFGGWAGANRAGAEHRGALRPTTTSYDYDAPVDEYG 312
>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 628
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 139/326 (42%), Gaps = 57/326 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWNLHEP+ G++DF+G ++ FIK +G+ V LR GP
Sbjct: 58 WRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVL-- 113
++ +EW +GG P WL +V G+ R DN + I+ Y+ + KG P V+
Sbjct: 118 YVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQ 177
Query: 114 ----------------------WAAKMA---VDFHTGVPWV------MCKQDDAPGPVIN 142
+ AK+ D VP + + PG +
Sbjct: 178 CENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADVGFNVPLFTSDGSWLFEGGATPGALPT 237
Query: 143 ACNGMRCGETFKGP----NSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIA 197
A NG E K + P + E + + W +P+ + A IA ++
Sbjct: 238 A-NGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW-AEPFPQIGASGIARQTEKYLQ 295
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKE 248
+ S+ N+YM HGGTNFG T+ A +T Y AP+ E G V PK+ ++
Sbjct: 296 NDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRN 353
Query: 249 LHAAIKLCSRPLLTGTQNVISLGQLQ 274
+ + P VI + +Q
Sbjct: 354 VIKKYVKYTIPEAPAPNPVIEIPSIQ 379
>gi|169604026|ref|XP_001795434.1| hypothetical protein SNOG_05023 [Phaeosphaeria nodorum SN15]
gi|111066294|gb|EAT87414.1| hypothetical protein SNOG_05023 [Phaeosphaeria nodorum SN15]
Length = 638
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 125/285 (43%), Gaps = 50/285 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
WP + AK GL+ I +YV+W E GQ+DF+ +NDI + +EIQ G+ LR GP
Sbjct: 68 WPQRLQMAKSMGLNTILSYVYWQDIEQHPGQFDFTDKNDIAAWFQEIQKAGMKAVLRPGP 127
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-KIENEYQT-----IEPAFHEKGPPYVL-- 113
++ +E +GG+P WL ++G+ RS+N P+ N+Y T ++P G P ++
Sbjct: 128 YVCAERDWGGMPGWLPQISGMKHRSNNGPFLDATNKYLTKVGAQLQPLLIANGGPILMVQ 187
Query: 114 ------WAA-------KMAVDFHTGVPWVMCKQDDA-----------PGPV-----INAC 144
WA K+A P +DA PG + +
Sbjct: 188 VENEYGWAGSDHTYTNKLADILKANFPNTKLYTNDANNAGALKNGQVPGALAVFDGTDMK 247
Query: 145 NGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGK----PYIRSAQDIAFHV--ALFIAK 198
NG+ + S P++ E W ++ WG K Y R + ++
Sbjct: 248 NGVTTLRSAITDASSIGPAMNGEYWIRWFDNWGPKNGHSSYDRDTNGMQGRANDLDWMLT 307
Query: 199 NGSYVNYYMYHGGTNF------GRTAAAFMITGYYD-QAPLDEYG 236
NG + + +M+HGGT+F G T T YD APLDE G
Sbjct: 308 NGHHFSIFMFHGGTSFAFGAGSGDTTPRTPFTTSYDYGAPLDETG 352
>gi|433679946|ref|ZP_20511609.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
gi|430814938|emb|CCP42238.1| beta-galactosidase [Xanthomonas translucens pv. translucens DSM
18974]
Length = 615
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 134/312 (42%), Gaps = 61/312 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + KA+ GL+ ++TYVFWNL EP++GQ+DFSG ND+ FI +QGL V LR GP
Sbjct: 64 WKDRLQKARAMGLNTVETYVFWNLVEPRQGQFDFSGNNDLAAFIDAAAAQGLNVILRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDN------------------KP-----------YK 92
++ +EW GG P WL G+ RS + KP +
Sbjct: 124 YVCAEWEAGGYPAWLFAQPGLRVRSQDPRFLAASQAYLDAVAAQVKPKLNRNGGPVIAVQ 183
Query: 93 IENEY----------QTIEPAFHEKG-PPYVLWAAKMAVDFHTG-VPWVMCKQDDAPGPV 140
+ENEY Q F + G +L+ A A G +P + + PG
Sbjct: 184 VENEYGSYDDDHVYMQANRTMFVKAGFDKALLFTADGADVLANGTLPDTLAVVNFGPG-- 241
Query: 141 INACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
+A + F+ P +P + E W ++ WG K A+ A +I + G
Sbjct: 242 -DAEKAFQTLSKFR----PGQPQMVGEYWAGWFDQWGDKHANTDAKKQASEFE-WILRQG 295
Query: 201 SYVNYYMYHGGTNFG--------RTAA---AFMITGYYDQAPLDEYGLVREPKWGHLKEL 249
N YM+ GGT+FG + A+ A T Y A LDE G PK+ ++
Sbjct: 296 HSANIYMFVGGTSFGFMNGANFQKNASDHYAPQTTSYDYDAVLDEAGRP-TPKFALFRDA 354
Query: 250 HAAIKLCSRPLL 261
A I P L
Sbjct: 355 IARITGVQPPAL 366
>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
Length = 624
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/299 (27%), Positives = 128/299 (42%), Gaps = 54/299 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + A+ GL+ + TY FW+ HEP+ GQ+ FSG+ND+ FIK +GL V LR GP
Sbjct: 64 WRERLRMARAMGLNTVTTYAFWSQHEPEPGQWSFSGQNDLRTFIKTAAEEGLNVVLRPGP 123
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVLWA 115
++ +E +GG P WL G+ RS + Y + Q + +G P ++
Sbjct: 124 YVCAEVDFGGFPAWLMRTQGLRVRSMDARYLAASARYFKRLAQEVADLQSSRGGPILMLQ 183
Query: 116 AKMAV-------DFHTGVPWVMCKQDDAPGPVINACNGMRCGETFKGPN----------- 157
+ D+ V M +Q P+ + G G F+G
Sbjct: 184 LENEYGSYGRDHDYLRAVRTQM-RQAGFDAPLFTSDGG--AGRLFEGGTLADVPAVVNFG 240
Query: 158 ----------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGS 201
P+ P + E W ++ WG + + +S ++ A V +++ S
Sbjct: 241 GGADDAQASVQELAAWRPHGPRMAGEYWAGWFDHWGEQHHTQSPEEAARTVERMLSQGVS 300
Query: 202 YVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKELHA 251
+ N YM+HGGT+FG A A T Y A LDE G PK+ L+++ A
Sbjct: 301 F-NLYMFHGGTSFGWLAGANYSGSEPYQPDTTSYDYDAALDEAGRP-TPKYFALRDVIA 357
>gi|26325854|dbj|BAC26681.1| unnamed protein product [Mus musculus]
Length = 646
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 123/279 (44%), Gaps = 44/279 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + K + GL+ +Q YV WN HEP+ G Y+F+G D+I F+ E L V LR G
Sbjct: 58 LWADRLLKMQLSGLNAVQFYVPWNYHEPEPGIYNFNGSRDLIAFLNEAAKVNLLVILRPG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPA-----FHEKGPPYVL 113
P+I +EW GGLP WL I R+ + + +++ ++ + P +H G +
Sbjct: 118 PYICAEWEMGGLPSWLLRNPNIHLRTSDPAFLEAVDSWFKVLLPKIYPFLYHNGGNIISI 177
Query: 114 -----WAAKMAVDF----HTGVPWVMCKQDDAPGPVINACNGMRCGETFK-------GPN 157
+ + A DF H + D + +G+RCG GP
Sbjct: 178 QVENEYGSYKACDFKYMRHLAGLFRALLGDKILLFTTDGPHGLRCGSLQGLYTTIDFGPA 237
Query: 158 -------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
P+ P + +E +T + WG RS+ +A + + K G+ VN
Sbjct: 238 DNVTRIFSLLREYEPHGPLVNSEYYTGWLDYWGQNHSTRSSPAVAQGLEKML-KLGASVN 296
Query: 205 YYMYHGGTNFGRTAAA------FMITGYYD-QAPLDEYG 236
YM+HGGTNFG A IT YD AP+ E G
Sbjct: 297 MYMFHGGTNFGYWNGADEKGRFLPITTSYDYDAPISEAG 335
>gi|134096920|ref|YP_001102581.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|291006638|ref|ZP_06564611.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|133909543|emb|CAL99655.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
Length = 594
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 76/290 (26%), Positives = 120/290 (41%), Gaps = 59/290 (20%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + + + + GL+ + TYV WN HEP++G+ DF+G D++RF++ GL V +R GP
Sbjct: 48 WRNRLDRMRALGLNSVDTYVAWNFHEPRRGEVDFTGWRDVVRFVETAAEAGLKVIIRPGP 107
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GGLP WL + R + Y +
Sbjct: 108 YICAEWDFGGLPAWLLESGNPPLRCSDPAYTELTLRWFDELLPRLAPLQATRGGPVLAFQ 167
Query: 93 IENEY----------QTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGPVIN 142
+ENEY + + E+G +L+ + D+ M + + P +
Sbjct: 168 VENEYGSYGNDQTHLEQLRAGMLERGIDSLLFCSNGPSDY-------MLRGGNLPDTLAT 220
Query: 143 ACNGMRCGETFKGPNS--PNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNG 200
F+ P P TE W ++ WG + + + A HV +A G
Sbjct: 221 VNFAGDPTAPFEALREYQPEGPLWCTEFWDGWFDHWGEEHHTTDPVETAGHVDRMLAA-G 279
Query: 201 SYVNYYMYHGGTNFGRTAAAF----------MITGYYDQAPLDEYGLVRE 240
+ V+ YM GGTNFG A A IT Y +P+ E G + E
Sbjct: 280 ASVSLYMAVGGTNFGWWAGANYDTSKDQYQPTITSYDYDSPIGEAGELTE 329
>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
Length = 604
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 123/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV W+LHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|254675347|ref|NP_083286.1| beta-galactosidase-1-like protein precursor [Mus musculus]
gi|81879201|sp|Q8VC60.1|GLB1L_MOUSE RecName: Full=Beta-galactosidase-1-like protein; Flags: Precursor
gi|18256820|gb|AAH21773.1| Glb1l protein [Mus musculus]
gi|148667965|gb|EDL00382.1| mCG133890 [Mus musculus]
Length = 646
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 123/279 (44%), Gaps = 44/279 (15%)
Query: 1 MWPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIG 60
+W + K + GL+ +Q YV WN HEP+ G Y+F+G D+I F+ E L V LR G
Sbjct: 58 LWADRLLKMQLSGLNAVQFYVPWNYHEPEPGIYNFNGSRDLIAFLNEAAKVNLLVILRPG 117
Query: 61 PFIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPA-----FHEKGPPYVL 113
P+I +EW GGLP WL I R+ + + +++ ++ + P +H G +
Sbjct: 118 PYICAEWEMGGLPSWLLRNPNIHLRTSDPAFLEAVDSWFKVLLPKIYPFLYHNGGNIISI 177
Query: 114 -----WAAKMAVDF----HTGVPWVMCKQDDAPGPVINACNGMRCGETFK-------GPN 157
+ + A DF H + D + +G+RCG GP
Sbjct: 178 QVENEYGSYKACDFKYMRHLAGLFRALLGDKILLFTTDGPHGLRCGSLQGLYTTIDFGPA 237
Query: 158 -------------SPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVN 204
P+ P + +E +T + WG RS+ +A + + K G+ VN
Sbjct: 238 DNVTRIFSLLREYEPHGPLVNSEYYTGWLDYWGQNHSTRSSPAVAQGLEKML-KLGASVN 296
Query: 205 YYMYHGGTNFGRTAAA------FMITGYYD-QAPLDEYG 236
YM+HGGTNFG A IT YD AP+ E G
Sbjct: 297 MYMFHGGTNFGYWNGADEKGRFLPITTSYDYDAPISEAG 335
>gi|431919435|gb|ELK17954.1| Beta-galactosidase [Pteropus alecto]
Length = 675
Score = 100 bits (248), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 115/282 (40%), Gaps = 52/282 (18%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K K GL+ IQ YV WN HEPQ GQY FS +D+ FI+ L V LR GP
Sbjct: 85 WKDRLLKMKMAGLNAIQVYVPWNFHEPQPGQYQFSEDHDVEHFIQLAHELTLLVILRPGP 144
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPYKIENE------YQTIEPAFHEKGPPYV--- 112
+I +EW GGLP WL GI+ RS + Y + ++P ++ G P +
Sbjct: 145 YICAEWEMGGLPAWLLQKEGIILRSSDPDYLEAVDKWLGVILPKMKPFLYQNGGPIITVQ 204
Query: 113 ---------------LWAAKMAVDFHTGVPWVMCKQD----DAP--------------GP 139
L + + +H G ++ D D P GP
Sbjct: 205 VENEYGSYFTCDYDYLRFLQKSFRYHLGNDVILFTTDGVYKDLPHCGTLQGLYSTVDFGP 264
Query: 140 VINACNGMRCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKN 199
N + ++ P P I +E +T + W G+P+ + I +
Sbjct: 265 GANITDAFLLQRKYE----PKGPLINSEFYTGWLDHW-GQPHSTVTTEAVVSSLHDILAH 319
Query: 200 GSYVNYYMYHGGTNFGRTAAAFM-----ITGYYDQAPLDEYG 236
G+ VN YM+ GGTNF A + T Y APL E G
Sbjct: 320 GANVNLYMFIGGTNFAYWNGANIPYQAQPTSYDYDAPLSEAG 361
>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
Length = 585
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 110/251 (43%), Gaps = 39/251 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + ++TYV WNLHE Q+G Y F G D+ RFI+ Q GLYV LR P
Sbjct: 34 WQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQTAQEVGLYVILRPAP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFHE----KGPPYVLWA 115
+I +EW +GGLP WL + R D P+ KI + + P + +G P ++
Sbjct: 94 YICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQVRDLQITQGGPIIMMQ 153
Query: 116 AK----------------MAVDFHTGVPWVMCKQD-------------DAPGPVINACNG 146
+ +A GV + D D P IN +
Sbjct: 154 VENEYGSYANDKEYLRKMVAAMRQHGVETPLVTSDGPWHDMLENGSIKDLALPTINCGSN 213
Query: 147 MRCG-ETFKGPNSPNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAKNGSYVN 204
++ E + + +P + E W ++ WG + + S QD + +A VN
Sbjct: 214 IKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDQHHTTSTQDAVKELQDCLALGS--VN 271
Query: 205 YYMYHGGTNFG 215
YM+HGGTNFG
Sbjct: 272 IYMFHGGTNFG 282
>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 604
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 125/300 (41%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNF----GRTAAAFM----ITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGGTNF G +A + IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGTNFEFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
Length = 585
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 75/251 (29%), Positives = 112/251 (44%), Gaps = 39/251 (15%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K + G + ++TYV WNLHE Q+G Y F G D+ RFI+ Q GLYV LR P
Sbjct: 34 WQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQTAQEVGLYVILRPAP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEYQTIEPAFHE----KGPPYVLWA 115
+I +EW +GGLP WL + R D P+ KI + + P + +G P ++
Sbjct: 94 YICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQVRDLQITQGGPIIMMQ 153
Query: 116 AK----------------MAVDFHTGV---------PWVMCKQD----DAPGPVINACNG 146
+ +A GV PW ++ D P IN +
Sbjct: 154 VENEYGSYANDKEYLRKMVAAMRQHGVETPLVTSDGPWHDMLENGSIKDLALPTINCGSN 213
Query: 147 MRCG-ETFKGPNSPNKPSIWTEDWTSFYQVWG-GKPYIRSAQDIAFHVALFIAKNGSYVN 204
++ E + + +P + E W ++ WG + + S QD + +A VN
Sbjct: 214 IKENFEKLRKFHGEKRPLMVMEFWIGWFDAWGDDQHHTTSIQDAVKELQDCLALGS--VN 271
Query: 205 YYMYHGGTNFG 215
YM+HGGTNFG
Sbjct: 272 IYMFHGGTNFG 282
>gi|258507331|ref|YP_003170082.1| beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|385827042|ref|YP_005864814.1| beta-galactosidase [Lactobacillus rhamnosus GG]
gi|257147258|emb|CAR86231.1| Beta-galactosidase (GH35) [Lactobacillus rhamnosus GG]
gi|259648687|dbj|BAI40849.1| beta-galactosidase [Lactobacillus rhamnosus GG]
Length = 593
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 121/291 (41%), Gaps = 67/291 (23%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHE ++G++DFSG DI RF+K + GLY +R P
Sbjct: 34 WYHSLYNLKALGFNTVETYVPWNLHEYREGEFDFSGILDIERFLKTAEDLGLYAIVRPSP 93
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + R+D+ Y +
Sbjct: 94 YICAEWEFGGFPAWLL-TKKMRLRTDDPAYLAAIDRYYTALMPHLVDHQVTHGGNVIMMQ 152
Query: 93 IENEYQTIEPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCKQDDAPGP-VINACNGMRCG- 150
+ENEY + + + Y+ AK+ VP D P P +NA + + G
Sbjct: 153 VENEYGS-----YGEDQDYLAAVAKLMQQHGVDVPLFTS---DGPWPATLNAGSMIDAGI 204
Query: 151 -----------------ETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVA 193
F + + P + E W ++ W G+P IR D
Sbjct: 205 LATGNFGSAADKNFDRLAAFHQEHGRDWPLMCVEFWDGWFNRW-GEPIIRRDPDETAEDL 263
Query: 194 LFIAKNGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYG 236
+ K GS VN YM+HGGTNFG + +T Y APL+E G
Sbjct: 264 RAVIKRGS-VNLYMFHGGTNFGFMNGTSARKDHDLPQVTSYDYDAPLNEQG 313
>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 604
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 122/300 (40%), Gaps = 57/300 (19%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K G + ++TYV WNLHEPQKG + F G D+ RF+K Q GLY +R P
Sbjct: 44 WYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSP 103
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY-----------------------------K 92
+I +EW +GG P WL + G + RS+N Y +
Sbjct: 104 YICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHVAEYYDVLMEKIVPHQLANGGNILMIQ 162
Query: 93 IENEYQTI--EPAFHEKGPPYVLWAAKMAVDFHTGVPWVMCK------QDDAPGPVINAC 144
IENEY + E A+ ++ A F + PW +DD ++
Sbjct: 163 IENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTSDGPWRATLRAGSMIEDDI---LVTGN 219
Query: 145 NGMRCGETFK------GPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAK 198
G + E F + P + E W ++ W R Q++A V +A
Sbjct: 220 FGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL 279
Query: 199 NGSYVNYYMYHGGTNFG--------RTAAAFMITGYYDQAPLDEYGLVREPKWGHLKELH 250
+N YM+HGG NFG T IT Y APLDE G E + K LH
Sbjct: 280 GS--INLYMFHGGINFGFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLH 337
>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 628
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 139/326 (42%), Gaps = 57/326 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W + K GL+ + TYVFWNLHEP+ G++DF+G ++ FIK +G+ V LR GP
Sbjct: 58 WRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGP 117
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY------KIENEYQTIEPAFHEKGPPYVL-- 113
++ +EW +GG P WL +V G+ R DN + I+ Y+ + KG P V+
Sbjct: 118 YVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQ 177
Query: 114 ----------------------WAAKMA---VDFHTGVPWV------MCKQDDAPGPVIN 142
+ AK+ D VP + + PG +
Sbjct: 178 CENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPT 237
Query: 143 ACNGMRCGETFKGP----NSPNKPSIWTEDWTSFYQVWGGKPYIR-SAQDIAFHVALFIA 197
A NG E K + P + E + + W +P+ + A IA ++
Sbjct: 238 A-NGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW-AEPFPQIGASGIARQTEKYLQ 295
Query: 198 KNGSYVNYYMYHGGTNFGRTAAAFM---------ITGYYDQAPLDEYGLVREPKWGHLKE 248
+ S+ N+YM HGGTNFG T+ A +T Y AP+ E G V PK+ ++
Sbjct: 296 NDVSF-NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVT-PKYDSIRN 353
Query: 249 LHAAIKLCSRPLLTGTQNVISLGQLQ 274
+ + P VI + +Q
Sbjct: 354 VIKKYVKYTIPEAPAPNPVIEIPSIQ 379
>gi|297198988|ref|ZP_06916385.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|297147253|gb|EDY55124.2| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 601
Score = 99.8 bits (247), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 134/334 (40%), Gaps = 58/334 (17%)
Query: 2 WPSLIAKAKEGGLDVIQTYVFWNLHEPQKGQYDFSGRNDIIRFIKEIQSQGLYVCLRIGP 61
W +A GL+ ++TYV WNLHEP G D + RF+ + GL+ +R GP
Sbjct: 41 WGHRLAMLGAMGLNCVETYVPWNLHEPHPG--DVRDVEALGRFLDAAREAGLWAIVRPGP 98
Query: 62 FIESEWTYGGLPIWLHDVAGIVFRSDNKPY--KIENEY-----QTIEPAFHEKGP----- 109
+I +EW GGLP WL A R+ ++ Y ++E + Q +E GP
Sbjct: 99 YICAEWENGGLPHWLKGHA----RTSDEVYLGQVERWFGRLLPQVVERQIDRGGPVIMVQ 154
Query: 110 ------------PYVLWAAKMAVDFHTGVPWV--------MCKQDDAPG--PVINACNGM 147
Y+L ++ VP M PG +N +G
Sbjct: 155 AENEYGSYGSDAAYLLRLTELLRAQGITVPLFTSDGPEDHMLTGGSVPGVLATVNFGSGA 214
Query: 148 RCGETFKGPNSPNKPSIWTEDWTSFYQVWGGKPYIRSAQDIAFHVALFIAKNGSYVNYYM 207
R P+ P + E W +++ WGG+P +R A+D A I + G+ VN YM
Sbjct: 215 RTAFEALRRYRPDGPLMCMEFWCGWFEHWGGEPVVRDAEDAA-EALREILECGASVNLYM 273
Query: 208 YHGGTNFGRTAAAFM-------------ITGYYDQAPLDEYGLVREPKWGHLKELHAAIK 254
HGGTNF A A +T Y AP+DEYG E W + L A
Sbjct: 274 AHGGTNFAGWAGANRGGGALHDGPLEPDVTSYDYDAPIDEYGRPTEKFWRFREVLSAYGP 333
Query: 255 LCSRP----LLTGTQNVISLGQLQEAFVFEETSG 284
+ P +L +V + V EE G
Sbjct: 334 VAELPPAPEVLGAVSDVDLTAWASLSAVLEERGG 367
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.431
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,976,243,888
Number of Sequences: 23463169
Number of extensions: 588663660
Number of successful extensions: 1104288
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2102
Number of HSP's successfully gapped in prelim test: 332
Number of HSP's that attempted gapping in prelim test: 1093516
Number of HSP's gapped (non-prelim): 5826
length of query: 746
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 596
effective length of database: 8,839,720,017
effective search space: 5268473130132
effective search space used: 5268473130132
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)