BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003612
(807 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
Length = 798
Score = 1198 bits (3099), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/798 (71%), Positives = 658/798 (82%), Gaps = 19/798 (2%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
GG+NVTYD RSL+ING KI+FSGSIHYPRSTPQMWP LI+KA+ GGLD + T VFWNLH
Sbjct: 4 GGSNVTYDSRSLVINGKHKIIFSGSIHYPRSTPQMWPYLISKARAGGLDAIDTYVFWNLH 63
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EPQ GQ+DFSGR+DLVRFIKEV AQGLYVCLRIGPFIE EW YGGLPFWLHDVPGIVFRS
Sbjct: 64 EPQQGQYDFSGRKDLVRFIKEVHAQGLYVCLRIGPFIESEWTYGGLPFWLHDVPGIVFRS 123
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
DN+PFK+HM+RYA MIV M+KA +LYASQGGPIILSQIENEYG VE +F EKGPPYV+WA
Sbjct: 124 DNKPFKYHMERYAKMIVKMLKAEKLYASQGGPIILSQIENEYGNVEAAFHEKGPPYVKWA 183
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
AK+AV L TGVPWVMCKQDDAPDPVINACNG +CGETF+GPNSP KPAIWTENWTS YQ
Sbjct: 184 AKMAVGLHTGVPWVMCKQDDAPDPVINACNGLRCGETFSGPNSPRKPAIWTENWTSVYQT 243
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDE 325
YG E R RSAEDIA+H ALFIAK GS+VNYYMYHGGTNFGRTA+ YV T YYDQAPLDE
Sbjct: 244 YGKETRSRSAEDIAFHAALFIAK-GGSFVNYYMYHGGTNFGRTAAEYVPTSYYDQAPLDE 302
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKD 384
YGLLRQPK GHLKELH+A+KLC KP+LS ++ + +LQEAF F+ S ECAAFLVN D
Sbjct: 303 YGLLRQPKHGHLKELHAAIKLCRKPLLSRKWINFSLGQLQEAFAFERNSDECAAFLVNHD 362
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA---------------KLDSVEQWEE 429
R+NATV+F Y+LPP SISILP CKTVAFNTA K DS+EQW+E
Sbjct: 363 GRSNATVHFKGSSYKLPPKSISILPHCKTVAFNTAQVSTQYGTRLATRRHKFDSIEQWKE 422
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHA 489
YKE IP++D++SLRAN LLE MNTTKD+SDYLWY FRF + S++ SVL V+SLGH LHA
Sbjct: 423 YKEYIPSFDKSSLRANTLLEHMNTTKDSSDYLWYTFRFHQNSSNAHSVLTVNSLGHNLHA 482
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNV 549
F+NGEF+GSAHG H +KSFTL++ + L GTN VSLLSVM GLPD+GAYLERRVAGLR V
Sbjct: 483 FVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNYVSLLSVMTGLPDAGAYLERRVAGLRRV 542
Query: 550 SIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
+IQ EL DF+++ WGY+VGL GE +Q+ + S WSRY SS+ +PLTWYK++FDA
Sbjct: 543 TIQRQHELHDFTTYLWGYKVGLSGENIQLHRNNASVKAYWSRYASSS-RPLTWYKSIFDA 601
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLV 669
P G+DPVA+NL SMGKGEAWVNG+SIGRYWVSFL G P Q+W HIPRSFLKP+GNLLV
Sbjct: 602 PAGNDPVALNLASMGKGEAWVNGRSIGRYWVSFLDSDGNPYQTWNHIPRSFLKPSGNLLV 661
Query: 670 LLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQI 729
+LEEE G P GIS+ T+S+T +CGHVS SH PPVISW+ +NQ T KR GRRPKVQ+
Sbjct: 662 ILEEERGNPLGISLGTMSITKVCGHVSISHPPPVISWQGENQIN-GTRKRKYGRRPKVQL 720
Query: 730 RCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFY 789
RCP GRKIS +LF+S+G P+G+CE YAIGSCH+SNSRA VEKACLGK C++PV ++ F
Sbjct: 721 RCPRGRKISSVLFSSFGTPSGDCETYAIGSCHASNSRATVEKACLGKERCSIPVSSKNFK 780
Query: 790 GDPCPGIPKALLVDAQCT 807
GDPCPGI K+LLVDA+C
Sbjct: 781 GDPCPGIAKSLLVDAKCA 798
>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
Length = 817
Score = 1170 bits (3026), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 558/798 (69%), Positives = 646/798 (80%), Gaps = 22/798 (2%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G VTYDGRSLIING RKILFSGSIHYPRSTP+MWP LI++AK+GG+DV++T VFWN HE
Sbjct: 25 GGEVTYDGRSLIINGQRKILFSGSIHYPRSTPEMWPSLISQAKQGGIDVIETYVFWNQHE 84
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PGQ+DFSGRRD+VRFI+EVQAQGLY CLRIGPFI+ EW YGG PFWLHDVPGIV+R+D
Sbjct: 85 PKPGQYDFSGRRDIVRFIREVQAQGLYACLRIGPFIQAEWNYGGFPFWLHDVPGIVYRTD 144
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFKF+M+ + T IV +MK+ LYASQGGPIIL QIENEY VE +F E G YV WAA
Sbjct: 145 NEPFKFYMRNFTTKIVEIMKSENLYASQGGPIILQQIENEYKTVEANFGEAGKRYVLWAA 204
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+AV L+TGVPWVMCKQDDAPDPVIN+CNGR CGETFAGPNSP+KPAIWTENWTS Y ++
Sbjct: 205 NMAVGLETGVPWVMCKQDDAPDPVINSCNGRLCGETFAGPNSPNKPAIWTENWTSSYPLF 264
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++AR R EDIA+HVALF+AKM GS++NYYMYHGGTNFGRTASAYV T YYD+APLDEY
Sbjct: 265 GEDARPRPVEDIAFHVALFVAKMNGSFINYYMYHGGTNFGRTASAYVQTAYYDEAPLDEY 324
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF-SKLQEAFIFQGSS-ECAAFLVNKD 384
GL++QP WGHLKELH+AVKLC + +L G +++ +KLQEA++F+G S +CAAFLVN D
Sbjct: 325 GLIQQPTWGHLKELHAAVKLCSETLLQGAQSNLSLGTKLQEAYVFRGQSGKCAAFLVNND 384
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA---------------KLDSVEQWEE 429
R + TV F N YELP SISILPDCK AFNTA K +S EQWEE
Sbjct: 385 SRTDVTVVFQNTSYELPRKSISILPDCKNEAFNTAKASFRPGLISIQTVTKFNSTEQWEE 444
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHA 489
YKE+I +D+TS RAN LLE MNTTKDASDYLWY FR+ +DPS+ +SVL +S H LHA
Sbjct: 445 YKESILNFDDTSSRANTLLEHMNTTKDASDYLWYTFRYNNDPSNGQSVLSTNSRAHALHA 504
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNV 549
FING GS HG S+ SF+L+ V G NNVSLLSVMVGLPDSGAYLERRVAGLR V
Sbjct: 505 FINGRHTGSQHGSSSNLSFSLDNTVSFRAGINNVSLLSVMVGLPDSGAYLERRVAGLRRV 564
Query: 550 SIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
IQ LKDF++ WGYQVGLLGEKLQI+TD GS+ V WS++GSST LTWYKTVFDA
Sbjct: 565 RIQSNGSLKDFTNNPWGYQVGLLGEKLQIYTDVGSQKVQWSKFGSSTSGLLTWYKTVFDA 624
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLV 669
P G++PVA+NL+SM KGE WVNGQSIGRYWVSFLTP G PSQ WYHIPRSFLKPTGNLLV
Sbjct: 625 PAGNEPVALNLVSMRKGEVWVNGQSIGRYWVSFLTPSGKPSQIWYHIPRSFLKPTGNLLV 684
Query: 670 LLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQI 729
LLEEE G+P GISI VS+ +CGHVS+SHLPPVIS + K H+ GRRPKVQ+
Sbjct: 685 LLEEETGHPVGISIGKVSIPKICGHVSESHLPPVIS-----RVIYKKHENHHGRRPKVQL 739
Query: 730 RCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFY 789
RCPS R IS+ILFAS+G P+G+C++YA+GSCHSSNSR+ VEKACLGK C+VP+ ++F
Sbjct: 740 RCPSNRNISRILFASFGTPSGDCQSYAVGSCHSSNSRSNVEKACLGKGMCSVPLSYKRFG 799
Query: 790 GDPCPGIPKALLVDAQCT 807
GDPCPG PKALLVD QCT
Sbjct: 800 GDPCPGTPKALLVDVQCT 817
>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
Length = 764
Score = 1121 bits (2900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 534/795 (67%), Positives = 634/795 (79%), Gaps = 47/795 (5%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYDGRSLIING KILFSGSIHYPRSTP MW LI+KAK GG+DV+QT VFWNLHEPQ
Sbjct: 1 NVTYDGRSLIINGQHKILFSGSIHYPRSTPDMWSSLISKAKAGGIDVIQTYVFWNLHEPQ 60
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQF F+GR DLVRF+KE+QAQGLY CLRIGPFIE EW YGGLPFWLHD+PG+V+RSDN+
Sbjct: 61 QGQFYFNGRADLVRFVKEIQAQGLYACLRIGPFIESEWTYGGLPFWLHDIPGMVYRSDNQ 120
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK+HMKR+ + IV+MMK+ +LYASQGGPIILSQ+ENEY VE +F EKGP YVRWAA +
Sbjct: 121 PFKYHMKRFVSRIVSMMKSEKLYASQGGPIILSQVENEYKNVEAAFHEKGPSYVRWAALM 180
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV+LQTGVPWVMCKQDDAPDPVIN+CNG +CGETFAGPNSP+KP+IWTE+WTSFYQVYG+
Sbjct: 181 AVNLQTGVPWVMCKQDDAPDPVINSCNGMRCGETFAGPNSPNKPSIWTEDWTSFYQVYGE 240
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
E +RSA+DIA+HVALFIAK GSYVNYYMYHGGTNFGRTASA+ +T YYDQAPLDEYGL
Sbjct: 241 ETYMRSAQDIAFHVALFIAKT-GSYVNYYMYHGGTNFGRTASAFTITSYYDQAPLDEYGL 299
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
+RQPKWGHLKELH+A+K C K +L G + + LQ+A++FQG+S +CAAFLVN D +
Sbjct: 300 IRQPKWGHLKELHAAIKSCSKLLLHGAHKTFSLGPLQQAYVFQGNSGQCAAFLVNNDGKQ 359
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------DSVEQWEEYKE 432
V F + Y+LP SISILPDCKT+ FNTAK+ +SV +WEEY E
Sbjct: 360 EVEVLFQSNSYKLPQKSISILPDCKTMTFNTAKVNAQYTTRSMKPNQKFNSVGKWEEYNE 419
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFIN 492
IP +D+TSLRAN LLE M+TTKD SDYLWY FRF+ + +++SV S GHVLHA++N
Sbjct: 420 PIPEFDKTSLRANRLLEHMSTTKDTSDYLWYTFRFQQNLPNAQSVFNAQSHGHVLHAYVN 479
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ 552
G G HG H + SF+L+ V L NGTN+V+LLS VGLPDSGAYLERRVAGLR V IQ
Sbjct: 480 GVHAGFGHGSHQNTSFSLQTTVRLKNGTNSVALLSATVGLPDSGAYLERRVAGLRRVRIQ 539
Query: 553 GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTG 612
KDF++++WGYQVGLLGE+LQI+T+ GS V W++ G T++PL WYKT+FDAP G
Sbjct: 540 N----KDFTTYTWGYQVGLLGERLQIYTENGSNKVKWNKLG--TNRPLMWYKTLFDAPAG 593
Query: 613 SDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
+DPVA+NL SMGKGEAWVNGQSIGRYWVSF T QG+PSQ+WY+IPR+FLKPTGNLLVLLE
Sbjct: 594 NDPVALNLGSMGKGEAWVNGQSIGRYWVSFHTSQGSPSQTWYNIPRAFLKPTGNLLVLLE 653
Query: 673 EENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP 732
EE GYPPGI++DTVSVT +CG+ S+SHL VQ+ CP
Sbjct: 654 EEKGYPPGITVDTVSVTKVCGYASESHL------------------------SAVQLSCP 689
Query: 733 SGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDP 792
R IS I+FAS+G P+GNCE+YAIG+CHSS+S+A VEKAC+GKRSC++P F GDP
Sbjct: 690 LKRNISSIIFASFGTPSGNCESYAIGNCHSSSSKANVEKACIGKRSCSIPQSNHFFGGDP 749
Query: 793 CPGIPKALLVDAQCT 807
CPGIPK LLV+A+CT
Sbjct: 750 CPGIPKVLLVEAKCT 764
>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
Length = 821
Score = 1120 bits (2896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/796 (66%), Positives = 625/796 (78%), Gaps = 20/796 (2%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +VTYDGRSLIING R++LFSGSIHYPRSTP+MWP LI+KAKEGG+DV++T FWN HE
Sbjct: 29 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 88
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ GQ+DFSGR D+V+F KEVQAQGLY CLRIGPFIE EW YGGLPFWLHDVPGI++RSD
Sbjct: 89 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 148
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFKF+M+ + T IVN+MK+ LYASQGGPIILSQIENEY VE +F EKGPPYVRWAA
Sbjct: 149 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 208
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+AVDLQTGVPWVMCKQDDAPDPVINACNG +CGETFAGPN P+KPAIWTENWTS Y+VY
Sbjct: 209 KMAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVY 268
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++ R R+AED+A+ VALFIAK GS++NYYMYHGGTNFGRT+S+YVLT YYDQAPLDEY
Sbjct: 269 GEDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYDQAPLDEY 328
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDK 385
GL+RQPKWGHLKELH+ +KLC +L GV + + +LQEA++F+ S +CAAFLVN DK
Sbjct: 329 GLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 388
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD---------------SVEQWEEY 430
R N TV F N YEL SISILPDCK +AFNTAK+ S +QW EY
Sbjct: 389 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQTRATFGSTKQWSEY 448
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAF 490
+E IP++ T L+A+ LLE M TTKDASDYLWY RF + S+++ VL+V SL HVLHAF
Sbjct: 449 REGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRFIQNSSNAQPVLRVDSLAHVLHAF 508
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVS 550
+NG+++ SAHG H + SF+L V L +G N +SLLSVMVGLPD+G YLE +VAG+R V
Sbjct: 509 VNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEHKVAGIRRVE 568
Query: 551 IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAP 610
IQ + KDFS WGYQVGL+GEK QI+T GS+ V W GS PLTWYKT+FDAP
Sbjct: 569 IQDGGDSKDFSKHPWGYQVGLMGEKSQIYTSPGSQKVQWHGLGSHGRGPLTWYKTLFDAP 628
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVL 670
G+DPV + SMGKGEAWVNGQSIGRYWVS+LTP G PSQ+WY++PR+FL P GNLLV+
Sbjct: 629 PGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPSGEPSQTWYNVPRAFLNPKGNLLVV 688
Query: 671 LEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIR 730
EEE+G P ISI TVSVT +CGHV+DSH PP+ISW + + H +I PKVQ+R
Sbjct: 689 QEEESGDPLKISIGTVSVTNVCGHVTDSHPPPIISWTTSDDGNESHHGKI----PKVQLR 744
Query: 731 CPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYG 790
CP ISKI FAS+G P G CE+YAIGSCHS NS A+ EKACLGK C++P + F
Sbjct: 745 CPPSSNISKITFASFGTPVGGCESYAIGSCHSPNSLAVAEKACLGKNMCSIPHSLKSFGD 804
Query: 791 DPCPGIPKALLVDAQC 806
DPCPG PKALLV AQC
Sbjct: 805 DPCPGTPKALLVAAQC 820
>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
Length = 813
Score = 1118 bits (2893), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/796 (66%), Positives = 625/796 (78%), Gaps = 20/796 (2%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +VTYDGRSLIING R++LFSGSIHYPRSTP+MWP LI+KAKEGG+DV++T FWN HE
Sbjct: 21 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 80
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ GQ+DFSGR D+V+F KEVQAQGLY CLRIGPFIE EW YGGLPFWLHDVPGI++RSD
Sbjct: 81 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 140
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFKF+M+ + T IVN+MK+ LYASQGGPIILSQIENEY VE +F EKGPPYVRWAA
Sbjct: 141 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 200
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+AVDLQTGVPWVMCKQDDAPDPVINACNG +CGETFAGPN P+KPAIWTENWTS Y+VY
Sbjct: 201 KMAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVY 260
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++ R R+AED+A+ VALFIAK GS++NYYMYHGGTNFGRT+S+YVLT YYDQAPLDEY
Sbjct: 261 GEDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYDQAPLDEY 320
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDK 385
GL+RQPKWGHLKELH+ +KLC +L GV + + +LQEA++F+ S +CAAFLVN DK
Sbjct: 321 GLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 380
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD---------------SVEQWEEY 430
R N TV F N YEL SISILPDCK +AFNTAK+ S +QW EY
Sbjct: 381 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQTRATFGSTKQWSEY 440
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAF 490
+E IP++ T L+A+ LLE M TTKDASDYLWY RF + S+++ VL+V SL HVLHAF
Sbjct: 441 REGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRFIQNSSNAQPVLRVDSLAHVLHAF 500
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVS 550
+NG+++ SAHG H + SF+L V L +G N +SLLSVMVGLPD+G YLE +VAG+R V
Sbjct: 501 VNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEHKVAGIRRVE 560
Query: 551 IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAP 610
IQ + KDFS WGYQVGL+GEK QI+T GS+ V W GS PLTWYKT+FDAP
Sbjct: 561 IQDGGDSKDFSKHPWGYQVGLMGEKSQIYTSPGSQKVQWHGLGSHGRGPLTWYKTLFDAP 620
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVL 670
G+DPV + SMGKGEAWVNGQSIGRYWVS+LTP G PSQ+WY++PR+FL P GNLLV+
Sbjct: 621 PGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPSGEPSQTWYNVPRAFLNPKGNLLVV 680
Query: 671 LEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIR 730
EEE+G P ISI TVSVT +CGHV+DSH PP+ISW + + H +I PKVQ+R
Sbjct: 681 QEEESGDPLKISIGTVSVTNVCGHVTDSHPPPIISWTTSDDGNESHHGKI----PKVQLR 736
Query: 731 CPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYG 790
CP ISKI FAS+G P G CE+YAIGSCHS NS A+ EKACLGK C++P + F
Sbjct: 737 CPPSSNISKITFASFGTPVGGCESYAIGSCHSPNSLAVAEKACLGKNMCSIPHSLKSFGD 796
Query: 791 DPCPGIPKALLVDAQC 806
DPCPG PKALLV AQC
Sbjct: 797 DPCPGTPKALLVAAQC 812
>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
lyrata]
Length = 818
Score = 1114 bits (2882), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 543/826 (65%), Positives = 641/826 (77%), Gaps = 27/826 (3%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M Q F +L+ I D NVTYDGRSLII+G KILFSGSIHY RSTPQM
Sbjct: 1 MTTFQYSLAFFVLMAVIVARDAA-----NVTYDGRSLIIDGQHKILFSGSIHYTRSTPQM 55
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LIAKAK GG+DV+ T VFWN+HEPQ GQFDFSGRRD+V+FIKEV+A GLYVCLRIGP
Sbjct: 56 WPSLIAKAKSGGIDVIDTYVFWNIHEPQQGQFDFSGRRDIVKFIKEVKAHGLYVCLRIGP 115
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
FI+GEW YGGLPFWLH+V GIVFR+DNEPFK+HMKRYA MIV +MK+ LYASQGGPIIL
Sbjct: 116 FIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAQMIVKLMKSENLYASQGGPIIL 175
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYGMV +F + G YV+WAAKLAV+L TGVPWVMCKQDDAPDP++NACNGRQCG
Sbjct: 176 SQIENEYGMVARAFRQDGKSYVKWAAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCG 235
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
ETF GPNSP+KPAIWTENWTSFYQ YG+E IRSAEDIA+HVALFIAK GS+VNYYMYH
Sbjct: 236 ETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAK-NGSFVNYYMYH 294
Query: 301 GGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
GGTNFGR AS +V+T YYDQAPLDEYGLLRQPKWGHLKELH+AVKLC +P+LSG+ +++
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 361 FSKLQEAFIF-QGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
KLQ AF+F + ++ CAA LVN+DK + TV F N Y L P SIS+LPDCK VAFNTA
Sbjct: 355 LGKLQTAFVFGKKANLCAALLVNQDK-CDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTA 413
Query: 420 K---------------LDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
K L S WE++ E +P++ ETS+R+ LLE MNTT+D SDYLW
Sbjct: 414 KVNAQYNTRTRKPRQNLSSPHMWEKFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
RF+ + SVLKV+ LGHVLHAF+N F+GS HG SF LEK + L NGTNN++
Sbjct: 474 TRFEQS-EGAPSVLKVNHLGHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTNNMA 532
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGS 584
LLSVMVGLP+SGA+LERRV G R+V+I F+++SWGYQVGL GEK ++T+ G+
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVNIWNGSYQLFFNNYSWGYQVGLKGEKYHVYTEDGA 592
Query: 585 RIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
+ V W +Y S QPLTWYK FD P G DPVA+NL SMGKGEAWVNGQSIGRYWVSF T
Sbjct: 593 KKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFYT 652
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
+G PSQ WYHIPRSFLKP NLLV+LEEE GYP GI+IDTVSVT +CGHVS++H PV
Sbjct: 653 SKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGYPLGITIDTVSVTEVCGHVSNTHPHPV 712
Query: 704 ISWRSQ--NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
IS R + N+ + K R+PKVQ++CP+GRKISK+LFA++GNPNG+C +Y++GSCH
Sbjct: 713 ISPRKKGHNRNEQRHLKYRYDRKPKVQLQCPTGRKISKVLFATFGNPNGSCGSYSVGSCH 772
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S NS A+V+KACL K C+VPVW++ F GD CP K+LLV AQC+
Sbjct: 773 SPNSLAVVQKACLRKSRCSVPVWSKTFGGDLCPQTVKSLLVRAQCS 818
>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 801
Score = 1108 bits (2866), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 528/795 (66%), Positives = 623/795 (78%), Gaps = 24/795 (3%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+ TYDGRSLI+NG K+LFSGSIHYPRSTP MWP LIAKAKEGG+DV+QT VFWNLHEPQ
Sbjct: 15 SATYDGRSLIVNGEHKLLFSGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQ 74
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G ++FSGRRD+VRF+KE+QAQGLY CLRIGPFIE EW YGGLPFWLHDV GIV+RSDNE
Sbjct: 75 QGTYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNE 134
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK HM+ + T IVNMMK+ LYASQGGPIILSQIENEY +VE +F EKGPPYV+WAAK+
Sbjct: 135 PFKLHMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKM 194
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV LQTGVPW MCKQ+DAPDPVIN CNG +CGETF GPNSP+KP+IWTENWTSFYQ YG+
Sbjct: 195 AVSLQTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 254
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
E IRSAE+IA+HVALFIA G+YVNYYMYHGGTNFGR+ASA+++TGYYDQ+PLDEYGL
Sbjct: 255 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYDQSPLDEYGL 314
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKRN 387
R+PKWGHLKELH+AVKLC P+L+G + + + EA +F+ S+ECAAFLVN+
Sbjct: 315 TREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVNRGAI- 373
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD--------------SVEQWEEYKEA 433
++ V F N+ YELP SISILPDCK VAFNT ++ + +WEE+KE
Sbjct: 374 DSNVLFQNVTYELPLGSISILPDCKNVAFNTRRVSVQHNTRSMMAVQKFDLLEWEEFKEP 433
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFING 493
IP D+T LRAN LLE M TTKD SDYLWY FR + D DS+ L+V S H LHAF+NG
Sbjct: 434 IPNIDDTELRANELLEHMGTTKDRSDYLWYTFRVQQDSPDSQQTLEVDSRAHALHAFVNG 493
Query: 494 EFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQG 553
++ GSAHG + +K F+L K + L NG NN+SLLSVMVGLPDSGA+LE RVAGLR V IQG
Sbjct: 494 DYAGSAHGIYKEKGFSLAKNITLRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGIQG 553
Query: 554 AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGS 613
+DFS WGY+VGL GE+ QIF D GS V WSR G+S+ QPLTWYKT FDAP G
Sbjct: 554 ----EDFSEQHWGYKVGLSGEQSQIFLDTGSSNVQWSRLGNSS-QPLTWYKTQFDAPPGD 608
Query: 614 DPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEE 673
DP+A+NL SMGKG WVNG+ IGRYWVSFLTP+G PSQ WY++PRSFLKPT N LV+LEE
Sbjct: 609 DPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPKGEPSQKWYNVPRSFLKPTDNQLVILEE 668
Query: 674 ENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWR-SQNQRTLKTHKRIPGRRPKVQIRCP 732
E G P IS+D+V +T CG VS+SH P V SW ++ Q+ + R RRPKVQ+ CP
Sbjct: 669 ETGNPVEISLDSVLITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRT--RRPKVQLSCP 726
Query: 733 SGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDP 792
S +KIS ILFAS+G P+G+C++YAIG CHS NSRAIVE ACLG+ C++P+ F GDP
Sbjct: 727 SKKKISNILFASFGTPSGDCQSYAIGLCHSPNSRAIVEHACLGRAKCSIPISNLNFRGDP 786
Query: 793 CPGIPKALLVDAQCT 807
CP + K LLVDAQCT
Sbjct: 787 CPHVTKTLLVDAQCT 801
>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
Length = 828
Score = 1105 bits (2858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 527/815 (64%), Positives = 629/815 (77%), Gaps = 35/815 (4%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
GG G +VTYDGRSLI++G RK+LFSGSIHYPRSTP+MW LIAKAKEGGLDV+ T VFW
Sbjct: 17 GGARGGDVTYDGRSLIVDGQRKLLFSGSIHYPRSTPEMWQSLIAKAKEGGLDVIDTYVFW 76
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
NLHEPQPGQ+DFSGRRD+VRFIKEVQAQGLYVCLRIGPFI+GEW YGGLPFWLHD+PGIV
Sbjct: 77 NLHEPQPGQYDFSGRRDIVRFIKEVQAQGLYVCLRIGPFIQGEWSYGGLPFWLHDIPGIV 136
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FRSDNEPFK M+ + T IV MM++ +LY SQGGPIILSQIENEYG VE ++ EKGP YV
Sbjct: 137 FRSDNEPFKVQMQGFTTKIVTMMQSEKLYVSQGGPIILSQIENEYGTVEEAYHEKGPAYV 196
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
+WAA++AV L TGVPWVMCKQ+DAPDPVINACNG +C ETF GPNSP+KPAIWTENWT+
Sbjct: 197 KWAAQMAVGLNTGVPWVMCKQNDAPDPVINACNGLRCAETFVGPNSPNKPAIWTENWTTR 256
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAP 322
Y + G+ RIRS EDIA+ V FI KGS+VNYYMYHGGTNFGRTASA+V T YYDQAP
Sbjct: 257 YVITGENIRIRSVEDIAFQVTQFIVAKKGSFVNYYMYHGGTNFGRTASAFVPTSYYDQAP 316
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLV 381
+DEYGL+RQPKWGHLKE+H+A+KLCL P+LSG V+++ + Q+AF+F G S ECAAFL+
Sbjct: 317 IDEYGLIRQPKWGHLKEMHAAIKLCLTPLLSGGQVTISLGQQQQAFVFTGLSGECAAFLL 376
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------LDSVEQ 426
N D N A+V F N Y+LPP SISILPDCKTVAFNTAK LD ++
Sbjct: 377 NNDTANTASVQFRNASYDLPPNSISILPDCKTVAFNTAKVSTQYTTRSMTRSKLLDGEDK 436
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHV 486
W +Y+EAI +DETS+++ +LEQM+TTKDASDYLWY FRF+ + SD+++VL V SLGHV
Sbjct: 437 WVQYQEAIVNFDETSVKSEAILEQMSTTKDASDYLWYTFRFQQESSDTQAVLNVRSLGHV 496
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
LHAF+NG+ VG A G H + FTL+ V L G NNVSLLSVMVG+PDSGAY+ERR AGL
Sbjct: 497 LHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVNNVSLLSVMVGMPDSGAYMERRAAGL 556
Query: 547 RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
R V IQ + K+F+++SWGYQVGLLGEKLQIFTD GS V W+ + + PLTWYKT+
Sbjct: 557 RKVKIQEKEGNKEFTNYSWGYQVGLLGEKLQIFTDQGSSQVQWANFSKNALNPLTWYKTL 616
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSW------------- 653
FDAP PVA+NL SMGKGEAWVNGQSIGRYW S+ G+ SQ W
Sbjct: 617 FDAPLEDAPVALNLGSMGKGEAWVNGQSIGRYWPSYRASDGS-SQIWYAYFNTGAIFRAV 675
Query: 654 -YHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQR 712
Y++PRSFLKP GNLLV+LEE G P IS+DT S++ +C HV+ SHLP V SW ++R
Sbjct: 676 RYNVPRSFLKPKGNLLVVLEESGGNPLQISVDTASISKICSHVTASHLPLVSSW---SKR 732
Query: 713 TLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC-ENYAIGSCHSSNSRAIVEK 771
T + RP+V++ CPS KIS ILFASYG P G C + YA+G CHSS+S AIV+K
Sbjct: 733 TNTDNNNSLQARPRVKLDCPSNTKISNILFASYGTPEGTCGDAYAVGMCHSSSSEAIVQK 792
Query: 772 ACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
ACLG+ C++PV ++ F GDPC K+LLV A+C
Sbjct: 793 ACLGQMRCSIPVSSKYFGGDPCSANEKSLLVVAEC 827
>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
Precursor
gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
Length = 815
Score = 1103 bits (2852), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 542/826 (65%), Positives = 640/826 (77%), Gaps = 30/826 (3%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M Q +F +L+ I D NVTYDGRSLII+G KILFSGSIHY RSTPQM
Sbjct: 1 MTTFQYSLVFLVLMAVIVAGDVA-----NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQM 55
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LIAKAK GG+DVV T VFWN+HEPQ GQFDFSG RD+V+FIKEV+ GLYVCLRIGP
Sbjct: 56 WPSLIAKAKSGGIDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGP 115
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
FI+GEW YGGLPFWLH+V GIVFR+DNEPFK+HMKRYA MIV +MK+ LYASQGGPIIL
Sbjct: 116 FIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIIL 175
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYGMV +F ++G YV+W AKLAV+L TGVPWVMCKQDDAPDP++NACNGRQCG
Sbjct: 176 SQIENEYGMVGRAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCG 235
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
ETF GPNSP+KPAIWTENWTSFYQ YG+E IRSAEDIA+HVALFIAK GS+VNYYMYH
Sbjct: 236 ETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAK-NGSFVNYYMYH 294
Query: 301 GGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
GGTNFGR AS +V+T YYDQAPLDEYGLLRQPKWGHLKELH+AVKLC +P+LSG+ +++
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 361 FSKLQEAFIF-QGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
KLQ AF+F + ++ CAA LVN+DK +TV F N Y L P S+S+LPDCK VAFNTA
Sbjct: 355 LGKLQTAFVFGKKANLCAAILVNQDK-CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTA 413
Query: 420 K---------------LDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
K L S + WEE+ E +P++ ETS+R+ LLE MNTT+D SDYLW
Sbjct: 414 KVNAQYNTRTRKARQNLSSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
RF+ + SVLKV+ LGH LHAF+NG F+GS HG F LEK + L NGTNN++
Sbjct: 474 TRFQQS-EGAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLA 532
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGS 584
LLSVMVGLP+SGA+LERRV G R+V I + F+++SWGYQVGL GEK ++T+ GS
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGS 592
Query: 585 RIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
V W +Y S QPLTWYK FD P G DPVA+NL SMGKGEAWVNGQSIGRYWVSF T
Sbjct: 593 AKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFHT 652
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
+G PSQ WYHIPRSFLKP NLLV+LEEE G P GI+IDTVSVT +CGHVS+++ PV
Sbjct: 653 YKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSNTNPHPV 712
Query: 704 ISWRSQ--NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
IS R + N++ L T++ R+PKVQ++CP+GRKISKILFAS+G PNG+C +Y+IGSCH
Sbjct: 713 ISPRKKGLNRKNL-TYRY--DRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIGSCH 769
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S NS A+V+KACL K C+VPVW++ F GD CP K+LLV AQC+
Sbjct: 770 SPNSLAVVQKACLKKSRCSVPVWSKTFGGDSCPHTVKSLLVRAQCS 815
>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
Length = 820
Score = 1082 bits (2797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 531/807 (65%), Positives = 627/807 (77%), Gaps = 30/807 (3%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M Q +F +L+ I D NVTYDGRSLII+G KILFSGSIHY RSTPQM
Sbjct: 1 MTTFQYSLVFLVLMAVIVAGDVA-----NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQM 55
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LIAKAK GG+DVV T VFWN+HEPQ GQFDFSG RD+V+FIKEV+ GLYVCLRIGP
Sbjct: 56 WPSLIAKAKSGGIDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGP 115
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
FI+GEW YGGLPFWLH+V GIVFR+DNEPFK+HMKRYA MIV +MK+ LYASQGGPIIL
Sbjct: 116 FIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIIL 175
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYGMV +F ++G YV+W AKLAV+L TGVPWVMCKQDDAPDP++NACNGRQCG
Sbjct: 176 SQIENEYGMVGRAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCG 235
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
ETF GPNSP+KPAIWTENWTSFYQ YG+E IRSAEDIA+HVALFIAK GS+VNYYMYH
Sbjct: 236 ETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAK-NGSFVNYYMYH 294
Query: 301 GGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
GGTNFGR AS +V+T YYDQAPLDEYGLLRQPKWGHLKELH+AVKLC +P+LSG+ +++
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 361 FSKLQEAFIF-QGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
KLQ AF+F + ++ CAA LVN+DK +TV F N Y L P S+S+LPDCK VAFNTA
Sbjct: 355 LGKLQTAFVFGKKANLCAAILVNQDK-CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTA 413
Query: 420 K---------------LDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
K L S + WEE+ E +P++ ETS+R+ LLE MNTT+D SDYLW
Sbjct: 414 KVNAQYNTRTRKARQNLSSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
RF+ + SVLKV+ LGH LHAF+NG F+GS HG F LEK + L NGTNN++
Sbjct: 474 TRFQQS-EGAPSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLA 532
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGS 584
LLSVMVGLP+SGA+LERRV G R+V I + F+++SWGYQVGL GEK ++T+ GS
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGS 592
Query: 585 RIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
V W +Y S QPLTWYK FD P G DPVA+NL SMGKGEAWVNGQSIGRYWVSF T
Sbjct: 593 AKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFHT 652
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
+G PSQ WYHIPRSFLKP NLLV+LEEE G P GI+IDTVSVT +CGHVS+++ PV
Sbjct: 653 YKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSNTNPHPV 712
Query: 704 ISWRSQ--NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
IS R + N++ L T++ R+PKVQ++CP+GRKISKILFAS+G PNG+C +Y+IGSCH
Sbjct: 713 ISPRKKGLNRKNL-TYRY--DRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIGSCH 769
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKF 788
S NS A+V+KACL K C+VPVW++ F
Sbjct: 770 SPNSLAVVQKACLKKSRCSVPVWSKTF 796
>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
Length = 756
Score = 1056 bits (2730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 504/764 (65%), Positives = 596/764 (78%), Gaps = 24/764 (3%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MWP LIAKAKEGG+DV+QT VFWNLHEPQ G ++FSGRRD+VRF+KE+QAQGLY CLRIG
Sbjct: 1 MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
PFIE EW YGGLPFWLHDV GIV+RSDNEPFK HM+ + T IVNMMK+ LYASQGGPII
Sbjct: 61 PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 120
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQC 239
LSQIENEY +VE +F EKGPPYV+WAAK+AV LQTGVPW MCKQ+DAPDPVIN CNG +C
Sbjct: 121 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 180
Query: 240 GETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMY 299
GETF GPNSP+KP+IWTENWTSFYQ YG+E IRSAE+IA+HVALFIA G+YVNYYMY
Sbjct: 181 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 240
Query: 300 HGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
HGGTNFGR+ASA+++TGYYDQ+PLDEYGL R+PKWGHLKELH+AVKLC P+L+G +
Sbjct: 241 HGGTNFGRSASAFMITGYYDQSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSNF 300
Query: 360 NFSKLQEAFIFQG-SSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
+ + EA +F+ S+ECAAFLVN+ ++ V F N+ YELP SISILPDCK VAFNT
Sbjct: 301 SLGQSVEAIVFKTESNECAAFLVNRGAI-DSNVLFQNVTYELPLGSISILPDCKNVAFNT 359
Query: 419 AKLD--------------SVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
++ + +WEE+KE IP D+T LRAN LLE M TTKD SDYLWY
Sbjct: 360 RRVSVQHNTRSMMAVQKFDLLEWEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYLWYT 419
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
FR + D DS+ L+V S H LHAF+NG++ GSAHG + +K F+L K + L NG NN+S
Sbjct: 420 FRVQQDSPDSQQTLEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGINNIS 479
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGS 584
LLSVMVGLPDSGA+LE RVAGLR V IQG +DFS WGY+VGL GE+ QIF D GS
Sbjct: 480 LLSVMVGLPDSGAFLETRVAGLRRVGIQG----EDFSEQHWGYKVGLSGEQSQIFLDTGS 535
Query: 585 RIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
V WSR G+S+ QPLTWYKT FDAP G DP+A+NL SMGKG WVNG+ IGRYWVSFLT
Sbjct: 536 SNVQWSRLGNSS-QPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLT 594
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVI 704
P+G PSQ WY++PRSFLKPT N LV+LEEE G P IS+D+V +T CG VS+SH P V
Sbjct: 595 PKGEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSESHYPLVA 654
Query: 705 SWR-SQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
SW ++ Q+ + R RRPKVQ+ CPS +KIS ILFAS+G P+G+C++YAIG CHS
Sbjct: 655 SWMGAKKQKVRRVKNRT--RRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIGLCHSP 712
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NSRAIVE ACLG+ C++P+ F GDPCP + K LLVDAQCT
Sbjct: 713 NSRAIVEHACLGRAKCSIPISNLNFRGDPCPHVTKTLLVDAQCT 756
>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 1056 bits (2730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 505/812 (62%), Positives = 612/812 (75%), Gaps = 37/812 (4%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
++L L +L IG G G +VTYDGRSLII+G RKI+FSGSIHYPRSTP+MWP L
Sbjct: 3 RVLFLVAAVLAVIG--SGSAVRGGDVTYDGRSLIIDGQRKIVFSGSIHYPRSTPEMWPSL 60
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
IAKAKEGGLD ++T VFWN+HEPQPG +DFSG D+VRFIKEVQAQGLY CLRIGPFI+
Sbjct: 61 IAKAKEGGLDAIETYVFWNVHEPQPGHYDFSGGHDIVRFIKEVQAQGLYACLRIGPFIQS 120
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW YGGLPFWLHD+PGIVFRSDNEPFK +M+ + +V+MM++ LYASQGGPIILSQIE
Sbjct: 121 EWSYGGLPFWLHDIPGIVFRSDNEPFKVYMQNFTAKVVSMMQSENLYASQGGPIILSQIE 180
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG V+ ++ ++G YV+WAA++A LQTGVPWVMCKQ++AP VIN+CNG +CG+TF
Sbjct: 181 NEYGTVQKAYGQEGLAYVQWAAQMAEGLQTGVPWVMCKQNNAPGHVINSCNGMKCGQTFV 240
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
GPNSP+KP+IWTENWT+ +SAEDIA+HV LFIA KGS+VNYYMYHGGTN
Sbjct: 241 GPNSPNKPSIWTENWTT-----------QSAEDIAFHVTLFIAAKKGSFVNYYMYHGGTN 289
Query: 305 FGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
FGRTASA+V T YYDQAPLDEYGL QPKWGHLKELH+A+KLC P+LSGV V++
Sbjct: 290 FGRTASAFVTTSYYDQAPLDEYGLTTQPKWGHLKELHAAIKLCSTPLLSGVQVNLYLGPQ 349
Query: 365 QEAFIFQG-SSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK--- 420
Q+A+IF S ECAAFL+N D N A+V F N Y+LPP+SISILPDCK V+
Sbjct: 350 QQAYIFNAVSGECAAFLINNDSSNAASVPFRNASYDLPPMSISILPDCKNVSTQYTTRTM 409
Query: 421 -----LDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE 475
LD+ + W+E+ EAIP +D TS R+ LLEQMNTTKD+SDYLWY FRF+H+ SD++
Sbjct: 410 GRGEVLDAADVWQEFTEAIPNFDSTSTRSETLLEQMNTTKDSSDYLWYTFRFQHESSDTQ 469
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
++L VSSLGH LHAF+NG+ VGS G + F E V L G NNVSLLSVMVG+PDS
Sbjct: 470 AILDVSSLGHALHAFVNGQAVGSVQGSRKNPRFKFETSVSLSKGINNVSLLSVMVGMPDS 529
Query: 536 GAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS 595
GA+LE R AGLR V I+ ++ DF+++SWGYQ+GL GE LQI+T+ GS V W ++ S+
Sbjct: 530 GAFLENRAAGLRTVMIRDKQDNNDFTNYSWGYQIGLQGETLQIYTEQGSSQVQWKKF-SN 588
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYH 655
PLTWYKT DAP G PV +NL SMGKGEAWVNGQSIGRYW S YH
Sbjct: 589 AGNPLTWYKTQVDAPPGDVPVGLNLASMGKGEAWVNGQSIGRYWPS------------YH 636
Query: 656 IPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLK 715
+PRSFLKPTGNLLVL EEE G P +S+DTV+++ +CGHV+ SHL PV SW NQR K
Sbjct: 637 VPRSFLKPTGNLLVLQEEEGGNPLQVSLDTVTISQVCGHVTASHLAPVSSWIEHNQR-YK 695
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCEN-YAIGSCHSSNSRAIVEKACL 774
++ GRRPKV + CPS KIS+I FASYG P GNC N A+G+CHS NS+A+VE+ACL
Sbjct: 696 NPAKVSGRRPKVLLACPSKSKISRISFASYGTPLGNCRNSMAVGTCHSQNSKAVVEEACL 755
Query: 775 GKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
GK C++PV +F GDPCP K+L+V A+C
Sbjct: 756 GKMKCSIPVSVRQFGGDPCPAKAKSLMVVAEC 787
>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
Length = 780
Score = 1041 bits (2691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 516/798 (64%), Positives = 610/798 (76%), Gaps = 47/798 (5%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYDGRSLII+G KILFSGSIHY RSTPQMWP LIAKAK GG+DVV T VFWN+HEPQ
Sbjct: 11 NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEPQ 70
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQFDFSG RD+V+FIKEV+ GLYVCLRIGPFI+GEW YGGLPFWLH+V GIVFR+DNE
Sbjct: 71 QGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDNE 130
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK+HMKRYA MIV +MK+ LYASQGGPIILSQIENEYGMV +F ++G YV+W AKL
Sbjct: 131 PFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAKL 190
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV+L TGVPWVMCKQDDAPDP++NACNGRQCGETF GPNSP+KPAIWTENWTS
Sbjct: 191 AVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSL------ 244
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
SAEDIA+HVALFIAK GS+VNYYMYHGGTNFGR AS +V+T YYDQAPLDEYGL
Sbjct: 245 -----SAEDIAFHVALFIAK-NGSFVNYYMYHGGTNFGRNASQFVITSYYDQAPLDEYGL 298
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKRN 387
LRQPKWGHLKELH+AVKLC +P+LSG+ +++ KLQ AF+F + ++ CAA LVN+DK
Sbjct: 299 LRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQDK-C 357
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------LDSVEQWEEYKE 432
+TV F N Y L P S+S+LPDCK VAFNTAK L S + WEE+ E
Sbjct: 358 ESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYNTRTRKARQNLSSPQMWEEFTE 417
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFIN 492
+P++ ETS+R+ LLE MNTT+D SDYLW RF+ + SVLKV+ LGH LHAF+N
Sbjct: 418 TVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQQS-EGAPSVLKVNHLGHALHAFVN 476
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ 552
G F+GS HG F LEK + L NGTNN++LLSVMVGLP+SGA+LERRV G R+V I
Sbjct: 477 GRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVVGSRSVKIW 536
Query: 553 GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTG 612
+ F+++SWGYQVGL GEK ++T+ GS V W +Y S QPLTWYK FD P G
Sbjct: 537 NGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRDSKSQPLTWYKASFDTPEG 596
Query: 613 SDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
DPVA+NL SMGKGEAWVNGQSI + S YHIPRSFLKP NLLV+LE
Sbjct: 597 EDPVALNLGSMGKGEAWVNGQSIAMF-----------SYFRYHIPRSFLKPNSNLLVILE 645
Query: 673 EE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ--NQRTLKTHKRIPGRRPKVQI 729
EE G P GI+IDTVSVT +CGHVS+++ PVIS R + N++ L T++ R+PKVQ+
Sbjct: 646 EEREGNPLGITIDTVSVTEVCGHVSNTNPHPVISPRKKGLNRKNL-TYRY--DRKPKVQL 702
Query: 730 RCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFY 789
+CP+GRKISKILFAS+G PNG+C +Y+IGSCHS NS A+V+KACL K C+VPVW++ F
Sbjct: 703 QCPTGRKISKILFASFGTPNGSCGSYSIGSCHSPNSLAVVQKACLKKSRCSVPVWSKTFG 762
Query: 790 GDPCPGIPKALLVDAQCT 807
GD CP K+LLV AQC+
Sbjct: 763 GDSCPHTVKSLLVRAQCS 780
>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
Length = 771
Score = 1030 bits (2664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 513/821 (62%), Positives = 594/821 (72%), Gaps = 82/821 (9%)
Query: 4 CQLLCL-FGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
C L L F L I G G NVTYDGRSLIING +ILFSGSIHYPRSTP+
Sbjct: 16 CMLFWLGFAFLSMAIITVQGKAG---NVTYDGRSLIINGEHRILFSGSIHYPRSTPE--- 69
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFI 122
+DF GR+DLV+F+ EVQAQGLY LRIGPFI
Sbjct: 70 -----------------------------YDFDGRKDLVKFLLEVQAQGLYAALRIGPFI 100
Query: 123 EGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
EGEW YGGLPFWLHDV GIVFRSDNEPFK HM+R+ T IVNMMK +LYASQGGPII+SQ
Sbjct: 101 EGEWTYGGLPFWLHDVSGIVFRSDNEPFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQ 160
Query: 183 IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET 242
IENEY VE +F EKG YV WAA +AV L TGVPWVMCKQ DAPDPVIN CNG +CGET
Sbjct: 161 IENEYQNVETAFHEKGSRYVHWAANMAVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGET 220
Query: 243 FAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
FAGPNSP+KP++WTENWTSFYQV+G E IR+AEDIA+HVALFIA+ GSYVNYYMYHGG
Sbjct: 221 FAGPNSPNKPSMWTENWTSFYQVFGGEPYIRTAEDIAFHVALFIAR-NGSYVNYYMYHGG 279
Query: 303 TNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFS 362
TNFGRT SA+V T YYDQAPLDEYGL+RQPKWGHLK+LH+ +K C K ++ G +
Sbjct: 280 TNFGRTGSAFVTTSYYDQAPLDEYGLIRQPKWGHLKDLHAKIKSCSKTLIRGTHQTFPLG 339
Query: 363 KLQEAFIF-QGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL 421
+LQEA++F + S +C AFLVN D R + TV F N YELP SISILPDCK++ FNTAK+
Sbjct: 340 RLQEAYVFREKSGDCVAFLVNNDGRRDVTVRFQNRSYELPHKSISILPDCKSITFNTAKV 399
Query: 422 D---------------SVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR 466
+ SV +WEEYKE + T+D TSLRA LL+ ++TTKD SDYLWY FR
Sbjct: 400 NTQYATRSATLSQEFSSVGKWEEYKETVATFDSTSLRAKTLLDHLSTTKDTSDYLWYTFR 459
Query: 467 FKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLL 526
F++ S +S L+ S GHVLHA++NG + GSAHG H SFTLE V L NGTNNV+LL
Sbjct: 460 FQNHFSRPQSTLRAYSRGHVLHAYVNGVYAGSAHGSHESTSFTLENSVRLKNGTNNVALL 519
Query: 527 SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
SV VGLPDSGAYLERRVAGL V IQ KDF+++SWGYQVGLLGEKLQI+TD G
Sbjct: 520 SVTVGLPDSGAYLERRVAGLHRVRIQN----KDFTTYSWGYQVGLLGEKLQIYTDNGLNK 575
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
V W+ + +T QPLTWYKT FDAP GSDP+A+NL SMGKGEAWVNGQSIGRYWVSF T +
Sbjct: 576 VSWNEFRGTT-QPLTWYKTQFDAPAGSDPIALNLHSMGKGEAWVNGQSIGRYWVSFSTSK 634
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW 706
G PSQ+ YHIP+SF+KPTGNLLVLLEEE GYPPGI++D++S++ +CGHVS+SH
Sbjct: 635 GNPSQTRYHIPQSFVKPTGNLLVLLEEEKGYPPGITVDSISISKVCGHVSESH------- 687
Query: 707 RSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSR 766
+ VQ+ CP R IS+ILF+S+G P GNC YAIG CHSSNSR
Sbjct: 688 -----------------KSVVQLSCPPNRNISRILFSSFGTPEGNCNQYAIGKCHSSNSR 730
Query: 767 AIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
AIVEKAC+GK C + F GDPCPGI K LLVDA+CT
Sbjct: 731 AIVEKACIGKTKCIILRSNRFFGGDPCPGIRKGLLVDAKCT 771
>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
Length = 766
Score = 996 bits (2575), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 486/796 (61%), Positives = 579/796 (72%), Gaps = 67/796 (8%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +VTYDGRSLIING R++LFSGSIHYPRSTP+MWP LI+KAKEGG+DV++T FWN HE
Sbjct: 21 GGSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHE 80
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ GQ+DFSGR D+V+F KEVQAQGLY CLRIGPFIE EW YGGLPFWLHDVPGI++RSD
Sbjct: 81 PKQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSD 140
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFKF+M+ + T IVN+MK+ LYASQGGPIILSQIENEY VE +F EKGPPYVRWAA
Sbjct: 141 NEPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAA 200
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+AVDLQT + + Y
Sbjct: 201 KMAVDLQTAM-----------------------------------------------RYY 213
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++ R R+AED+A+ VALFIAK GS++NYYMYHGGTNFGRT+S+YVLT YYDQAPLDEY
Sbjct: 214 GEDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYDQAPLDEY 273
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDK 385
GL+RQPKWGHLKELH+ +KLC +L GV + + +LQEA++F+ S +CAAFLVN DK
Sbjct: 274 GLIRQPKWGHLKELHAVIKLCSDTLLXGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 333
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD---------------SVEQWEEY 430
R N TV F N YEL SISILPDCK +AFNTAK+ S +QW EY
Sbjct: 334 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNTRSVQTRATFGSTKQWSEY 393
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAF 490
+E IP++ T L+A+ LLE M TTKDASDYLWY RF H+ S+++ VL+V SL HVL AF
Sbjct: 394 REGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRFIHNSSNAQPVLRVDSLAHVLLAF 453
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVS 550
+NG+++ SAHG H + SF+L V L +G N +SLLSVMVGLPD+G YLE +VAG+R V
Sbjct: 454 VNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEHKVAGIRRVE 513
Query: 551 IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAP 610
IQ KDFS WGYQVGL+GEKLQI+T GS+ V W GS PLTWYKT+FDAP
Sbjct: 514 IQDGGXSKDFSKHPWGYQVGLMGEKLQIYTSPGSQKVQWYGLGSHGRGPLTWYKTLFDAP 573
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVL 670
G+DPV + SMGKGEAWVNGQSIGRYWVS+LTP G PSQ+WY++PR+FL P GNLLV+
Sbjct: 574 RGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPSGEPSQTWYNVPRAFLNPKGNLLVV 633
Query: 671 LEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIR 730
EEE+G P ISI TVSVT +CGHV+DSH PP+ISW + + H +I PKVQ+R
Sbjct: 634 QEEESGDPLKISIGTVSVTNVCGHVTDSHPPPIISWTTSDDGNESHHGKI----PKVQLR 689
Query: 731 CPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYG 790
CP ISKI FAS+G P G CE+YAIGSCHS NS A+ EKACLGK C++P + F
Sbjct: 690 CPPSSNISKITFASFGTPVGGCESYAIGSCHSPNSLAVAEKACLGKNXCSIPHSLKSFGD 749
Query: 791 DPCPGIPKALLVDAQC 806
DPCPG PKALLV AQC
Sbjct: 750 DPCPGTPKALLVAAQC 765
>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 697
Score = 967 bits (2499), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/696 (66%), Positives = 553/696 (79%), Gaps = 25/696 (3%)
Query: 10 FGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAK 69
F + T G+ GG NVTYDGRSLII+G KILFSGSIHYPRSTPQMWP LIAKAK
Sbjct: 11 FAFISTVFIGTTVYGG---NVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAK 67
Query: 70 EGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYG 129
EGGLDV+QT VFWNLHEPQ GQ+DF G R++VRFIKE+QAQGLYV LRIGP+IE E YG
Sbjct: 68 EGGLDVIQTYVFWNLHEPQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYG 127
Query: 130 GLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM 189
GLP WLHD+PGIVFRSDNE FKFHM++++ IVN+MK+A L+ASQGGPIILSQIENEYG
Sbjct: 128 GLPLWLHDIPGIVFRSDNEQFKFHMQKFSAKIVNLMKSANLFASQGGPIILSQIENEYGN 187
Query: 190 VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSP 249
VE +F EKG Y+RWAA++AV LQTGVPWVMCKQD+APDPVIN CNG QCG+TF GPNSP
Sbjct: 188 VEGAFHEKGLSYIRWAAQMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSP 247
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
+KP++WTENWTSFYQV+G+ IRSAEDIAY+VALFIAK +GSYVNYYMYHGGTNF R A
Sbjct: 248 NKPSLWTENWTSFYQVFGEVPYIRSAEDIAYNVALFIAK-RGSYVNYYMYHGGTNFDRIA 306
Query: 310 SAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI 369
SA+V+T YYD+APLDEYGL+R+PKWGHLKELH+A+K C +L G S + Q A++
Sbjct: 307 SAFVITAYYDEAPLDEYGLVREPKWGHLKELHAAIKSCSNSILHGTQTSFSLGTQQNAYV 366
Query: 370 FQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------- 421
F+ SS ECAAFL N + + + T+ F N+ Y+LPP SISILPDCK VAFNTAK+
Sbjct: 367 FKRSSIECAAFLENTEDQ-SVTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVSIQNARA 425
Query: 422 -------DSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS 474
+S E W+ YKEAIP++ +TSLRAN LL+Q++TTKD SDYLWY FR + ++
Sbjct: 426 MKSQLEFNSAETWKVYKEAIPSFGDTSLRANTLLDQISTTKDTSDYLWYTFRLYDNSPNA 485
Query: 475 ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPD 534
+S+L S GHVLHAF+NG VGS HG H + SF +E ++LING NN+S LS VGLP+
Sbjct: 486 QSILSAYSHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNNISFLSATVGLPN 545
Query: 535 SGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
SGAYLERRVAGLR++ +QG +DF++ +WGYQ+GLLGEKLQI+T GS V W + S
Sbjct: 546 SGAYLERRVAGLRSLKVQG----RDFTNQAWGYQIGLLGEKLQIYTASGSSKVQWESFQS 601
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWY 654
ST +PLTWYKT FDAP G+DPV +NL SMGKG W+NGQ IGRYWVSF TPQGTPSQ WY
Sbjct: 602 ST-KPLTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYWVSFHTPQGTPSQKWY 660
Query: 655 HIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
HIPRS LK TGNLLVLLEEE G P GI++DTV +T+
Sbjct: 661 HIPRSLLKSTGNLLVLLEEETGNPLGITLDTVYITS 696
>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
Length = 758
Score = 963 bits (2490), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/692 (66%), Positives = 545/692 (78%), Gaps = 18/692 (2%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G VTYDGRSLII+GHRKILFSGSIHYPRSTPQMW LIAKAKEGG+DV+QT VFWN HE
Sbjct: 59 GAQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHE 118
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQPGQ+DF+GR DL +FIKE+QAQGLY CLRIGPFIE EW YGGLPFWLHDV GIV+R+D
Sbjct: 119 PQPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTD 178
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFKF+M+ + T IVN+MK+ LYASQGGPIILSQIENEY +E +F EKGP YVRWAA
Sbjct: 179 NEPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAA 238
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+AV+LQTGVPWVMCKQ DAPDPVIN CNG +CG+TF GPNSP+KP++WTENWTSFY+V+
Sbjct: 239 KMAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVF 298
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G E +RSAEDIA+HVALFIA+ GSYVNYYMYHGGTNFGR +SAY+ T YYDQAPLDEY
Sbjct: 299 GGETYLRSAEDIAFHVALFIAR-NGSYVNYYMYHGGTNFGRASSAYIKTSYYDQAPLDEY 357
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDK 385
GL+RQPKWGHLKELH+A+ LC P+L+GV +++ +LQEA++FQ C AFLVN D+
Sbjct: 358 GLIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDE 417
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------DSVEQWEEY 430
NN+TV F N+ EL P SISILPDCK V FNTAK+ D+V++WEEY
Sbjct: 418 GNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYNERIATSSQSFDAVDRWEEY 477
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAF 490
K+AIP + +TSL++N +LE MN TKD SDYLWY FRF+ + S +E +L + SL H +HAF
Sbjct: 478 KDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPNSSCTEPLLHIESLAHAVHAF 537
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVS 550
+N +VG+ HG H K FT + + L N NN+S+LSVMVG PDSGAYLE R AGL V
Sbjct: 538 VNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSGAYLESRFAGLTRVE 597
Query: 551 IQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
IQ K + DF++++WGYQVGL GEKL I+ + V W + ST+QPLTWYK VF+
Sbjct: 598 IQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRKTEISTNQPLTWYKIVFNT 657
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLV 669
P+G DPVA+NL +MGKGEAWVNGQSIGRYWVSF +G PSQ+ YH+PR+FLK + NLLV
Sbjct: 658 PSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSKGDPSQTLYHVPRAFLKTSENLLV 717
Query: 670 LLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
LLEE NG P IS++T+S T L HV HLP
Sbjct: 718 LLEEANGDPLHISLETISRTDLPDHVLYHHLP 749
>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
Length = 696
Score = 963 bits (2490), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/679 (68%), Positives = 546/679 (80%), Gaps = 22/679 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+NVTYDGRSLII+G KILFSGSIHYPRSTPQMWP LIAKAKEGGLDV+QT VFWNLHE
Sbjct: 24 GDNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHE 83
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ GQ+DF G R++VRFIKE+QAQGLYV LRIGP+IE E YGGLP WLHD+PGIVFRSD
Sbjct: 84 PQQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIVFRSD 143
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NE FKFHM+R+ IVN+MK+A L+ASQGGPIILSQIENEYG VE +F EKG Y+RWAA
Sbjct: 144 NEQFKFHMQRFTAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAA 203
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++AV LQTGVPWVMCKQD+APDPVIN CNG QCG+TF GPNSP+KP++WTENWTSFYQV+
Sbjct: 204 QMAVGLQTGVPWVMCKQDNAPDPVINTCNGMQCGKTFKGPNSPNKPSLWTENWTSFYQVF 263
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G+ IRSAEDIAY+VALFIAK +GSYVNYYMYHGGTNF R ASA+V+T YYD+APLDEY
Sbjct: 264 GEVPYIRSAEDIAYNVALFIAK-RGSYVNYYMYHGGTNFDRIASAFVVTAYYDEAPLDEY 322
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDK 385
GL+R+PKWGHLKELH A+K C +L G S + Q A++F+ SS ECAAFL N +
Sbjct: 323 GLVREPKWGHLKELHEAIKSCSNSLLYGTQTSFSLGTQQNAYVFRRSSIECAAFLENTED 382
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------------DSVEQWEEYK 431
R + T+ F N+ Y+LPP SISILPDCK VAFNTAK+ +S E+W+ Y+
Sbjct: 383 R-SVTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVRAQNARAMKSQLQFNSAEKWKVYR 441
Query: 432 EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFI 491
EAIP++ +TSLRAN LL+Q++T KD SDYLWY FR + ++++S+L S GHVLHAF+
Sbjct: 442 EAIPSFADTSLRANTLLDQISTAKDTSDYLWYTFRLYDNSANAQSILSAYSHGHVLHAFV 501
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSI 551
NG VGS HG H + SF +E ++LI+G NN+S LS VGLP+SGAYLE RVAGLR++ +
Sbjct: 502 NGNLVGSKHGSHKNVSFVMENKLNLISGMNNISFLSATVGLPNSGAYLEGRVAGLRSLKV 561
Query: 552 QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPT 611
QG +DF++ +WGYQVGLLGEKLQI+T GS V W + SST +PLTWYKT FDAP
Sbjct: 562 QG----RDFTNQAWGYQVGLLGEKLQIYTASGSSKVKWESFLSST-KPLTWYKTTFDAPV 616
Query: 612 GSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLL 671
G+DPV +NL SMGKG WVNGQ IGRYWVSF TPQGTPSQ WYHIPRS LK TGNLLVLL
Sbjct: 617 GNDPVVLNLGSMGKGYTWVNGQGIGRYWVSFHTPQGTPSQKWYHIPRSLLKSTGNLLVLL 676
Query: 672 EEENGYPPGISIDTVSVTT 690
EEE G P GI++DTV +T+
Sbjct: 677 EEETGNPLGITLDTVYITS 695
>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
Length = 729
Score = 960 bits (2482), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/699 (65%), Positives = 545/699 (77%), Gaps = 25/699 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G VTYDGRSLII+GHRKILFSGSIHYPRSTPQMW LIAKAKEGG+DV+QT VFWN HE
Sbjct: 23 GAQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHE 82
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQPGQ+DF+GR DL +FIKE+QAQGLY CLRIGPFIE EW YGGLPFWLHDV GIV+R+D
Sbjct: 83 PQPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTD 142
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFKF+M+ + T IVN+MK+ LYASQGGPIILSQIENEY +E +F EKGP YVRWAA
Sbjct: 143 NEPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAA 202
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+AV+LQTGVPWVMCKQ DAPDPVIN CNG +CG+TF GPNSP+KP++WTENWTSFY+V+
Sbjct: 203 KMAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVF 262
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G E +RSAEDIA+HVALFIA+ GSYVNYYMYHGGTNFGR +SAY+ T YYDQAPLDEY
Sbjct: 263 GGETYLRSAEDIAFHVALFIAR-NGSYVNYYMYHGGTNFGRASSAYIKTSYYDQAPLDEY 321
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDK 385
GL+RQPKWGHLKELH+A+ LC P+L+GV +++ +LQEA++FQ C AFLVN D+
Sbjct: 322 GLIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDE 381
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL----------------------DS 423
NN+TV F N+ EL P SISILPDCK V FNTAK+ D+
Sbjct: 382 GNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKVCSSSRQSAYKIQELSRSCIQSFDA 441
Query: 424 VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSL 483
V++WEEYK+AIP + +TSL++N +LE MN TKD SDYLWY FRF+ + S +E +L + SL
Sbjct: 442 VDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPNSSCTEPLLHIESL 501
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H +HAF+N +VG+ HG H K FT + + L N NN+S+LSVMVG PDSGAYLE R
Sbjct: 502 AHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSGAYLESRF 561
Query: 544 AGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
AGL V IQ K + DF++++WGYQVGL GEKL I+ + V W + ST+QPLTW
Sbjct: 562 AGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRKTEISTNQPLTW 621
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK VF+ P+G DPVA+NL +MGKGEAWVNGQSIGRYWVSF +G PSQ+ YH+PR+FLK
Sbjct: 622 YKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSKGDPSQTLYHVPRAFLK 681
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
+ NLLVLLEE NG P IS++T+S T L HV HLP
Sbjct: 682 TSENLLVLLEEANGDPLHISLETISRTDLPDHVLYHHLP 720
>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
Length = 715
Score = 937 bits (2423), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 457/709 (64%), Positives = 547/709 (77%), Gaps = 21/709 (2%)
Query: 12 LLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEG 71
++LT D G GG+ VTYDGRSLII+G RKILFSGSIHYPRSTP+MWP L+AKA+EG
Sbjct: 8 VVLTVAVIRDIGVRGGD-VTYDGRSLIIDGQRKILFSGSIHYPRSTPEMWPSLVAKAREG 66
Query: 72 GLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGL 131
G+DV+QT VFWNLHEP+PG++DFSGR DLVRFIKE+QAQGLYVCLRIGPFIE EW YGG
Sbjct: 67 GVDVIQTYVFWNLHEPRPGEYDFSGRNDLVRFIKEIQAQGLYVCLRIGPFIESEWTYGGF 126
Query: 132 PFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVE 191
PFWLHDVP IV+RSDNEPFKF+M+ + T IVNMMK+ LYASQGGPIILSQIENEY VE
Sbjct: 127 PFWLHDVPDIVYRSDNEPFKFYMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYQNVE 186
Query: 192 HSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK 251
+F +KGPPYV WAAK+AV+LQTGVPWVMCKQ DAPDPVIN CNG +CGETF GPNSP K
Sbjct: 187 AAFRDKGPPYVIWAAKMAVELQTGVPWVMCKQTDAPDPVINTCNGMRCGETFGGPNSPTK 246
Query: 252 PAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
P++WTENWTSFYQVYG E IRSAEDIA+HV LFIAK GSY+NYYM+HGGTNFGRTASA
Sbjct: 247 PSLWTENWTSFYQVYGGEPYIRSAEDIAFHVTLFIAK-NGSYINYYMFHGGTNFGRTASA 305
Query: 312 YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ 371
YV+T YYDQAPLDEYGL+RQPKWGHLKELH+A+K C +L GV + + +LQ+A+IF+
Sbjct: 306 YVITSYYDQAPLDEYGLIRQPKWGHLKELHAAIKSCSSTILEGVQSNFSLGQLQQAYIFE 365
Query: 372 GS-SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------- 421
+ CAAFLVN D++NNATV F N+ +EL P SIS+LPDC+ + FNTAK+
Sbjct: 366 EEGAGCAAFLVNNDQKNNATVEFRNITFELLPKSISVLPDCENIIFNTAKVNAKGNEITR 425
Query: 422 ------DSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE 475
D ++WE Y + IP + +T+L+++ LLE MNTTKD SDYLWY F F + S +E
Sbjct: 426 TSSQLFDDADRWEAYTDVIPNFADTNLKSDTLLEHMNTTKDKSDYLWYTFSFLPNSSCTE 485
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKS-FTLEKMVHLINGTNNVSLLSVMVGLPD 534
+L V SL HV AF+N ++ GSAHG K FT+E + L + N +S+LS MVGL D
Sbjct: 486 PILHVESLAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQMNTISILSTMVGLQD 545
Query: 535 SGAYLERRVAGLRNVSIQGA-KELKDFS-SFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
SGA+LERR AGL V I+ A +E+ +F+ ++ WGYQ GL GE L I+ + WS
Sbjct: 546 SGAFLERRYAGLTRVEIRCAQQEIYNFTNNYEWGYQAGLSGESLNIYMREHLDNIEWSEV 605
Query: 593 GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQS 652
S+T QPL+W+K FDAPTG+DPV +NL +MGKGEAWVNGQSIGRYW+SFLT +G PSQ+
Sbjct: 606 VSATDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVNGQSIGRYWLSFLTSKGQPSQT 665
Query: 653 WYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
YHIPR+FL +GNLLVLLEE G P IS+DTVS T L H S H P
Sbjct: 666 LYHIPRAFLNSSGNLLVLLEESGGDPLHISLDTVSRTGLQEHASRYHPP 714
>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
Length = 694
Score = 936 bits (2418), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/703 (64%), Positives = 538/703 (76%), Gaps = 26/703 (3%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
MG+ L L+LT + G NVTYD SL+INGH KILFSGSIHYPRSTPQM
Sbjct: 1 MGEWWRFLLHALILTVSLCTVHGA----NVTYDRTSLVINGHHKILFSGSIHYPRSTPQM 56
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI+KAKEGGLDV+QT VFWNLHEPQ GQ++F+GR DLV FIKE+QAQGLYV LRIGP
Sbjct: 57 WPDLISKAKEGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGP 116
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
+IE E YGGLP WLHDVPGIVFR+DN+ FKFHM+R+ T IVNMMK+A L+ASQGGPIIL
Sbjct: 117 YIESECTYGGLPLWLHDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIIL 176
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG ++ F G PY+ WAA++AV LQTGVPW+MCKQDDAPDPVINACNG QCG
Sbjct: 177 SQIENEYGSIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCG 236
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
F GPNSP+KP++WTENWTSF Q +G +RSA DIAY+VALFIAK KGSYVNYYMYH
Sbjct: 237 RNFKGPNSPNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAK-KGSYVNYYMYH 295
Query: 301 GGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
GGTNF R ASA+++T YYD+APLDEYGL+RQPKWGHLKELH+++K C +P+L G + +
Sbjct: 296 GGTNFDRLASAFIITAYYDEAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFS 355
Query: 361 FSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK 420
Q+A++F+ S+ECAAFL N R + T+ F N+ YELP SISILP CK V FNT K
Sbjct: 356 LGSEQQAYVFRSSTECAAFLENSGPR-DVTIQFQNISYELPGKSISILPGCKNVVFNTGK 414
Query: 421 L---------------DSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNF 465
+ +S E W+ Y EAIP + TS RA+ LL+Q++T KD SDY+WY F
Sbjct: 415 VSIQNNVRAMKPRLQFNSAENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYTF 474
Query: 466 RFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSL 525
RF + +++SVL + S G VLH+FING GSAHG ++ T++K V+LING NN+S+
Sbjct: 475 RFNNKSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNNISI 534
Query: 526 LSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSR 585
LS VGLP+SGA+LE RVAGLR V +QG +DFSS+SWGYQVGLLGEKLQIFT GS
Sbjct: 535 LSATVGLPNSGAFLESRVAGLRKVEVQG----RDFSSYSWGYQVGLLGEKLQIFTVSGSS 590
Query: 586 IVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTP 645
V W + SST +PLTWY+T F AP G+DPV +NL SMGKG AWVNGQ IGRYWVSF P
Sbjct: 591 KVQWKSFQSST-KPLTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRYWVSFHKP 649
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
GTPSQ WYHIPRSFLK TGNLLV+LEEE G P GI++DTV +
Sbjct: 650 DGTPSQQWYHIPRSFLKSTGNLLVILEEETGNPLGITLDTVYI 692
>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
Length = 821
Score = 933 bits (2412), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/806 (55%), Positives = 573/806 (71%), Gaps = 35/806 (4%)
Query: 21 DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLV 80
+G G VTYDGR+L++NG R++LFSG +HY RSTP+MWP++IAKA++GG+DV+QT V
Sbjct: 30 EGEDAGRGEVTYDGRALLLNGTRRMLFSGEMHYTRSTPEMWPKIIAKARKGGIDVIQTYV 89
Query: 81 FWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPG 140
FWN+HEP G+++F GR ++V+FI+E+QAQGLYV LRIGPFIE EW YGG PFWLH+VP
Sbjct: 90 FWNVHEPVQGKYNFEGRYNIVKFIREIQAQGLYVSLRIGPFIEAEWKYGGFPFWLHEVPN 149
Query: 141 IVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP 200
I FR+DNEPFK HM+ + T +VNMMK LY QGGPII+SQIENEY MVE +F GP
Sbjct: 150 ITFRTDNEPFKQHMQGFVTHMVNMMKNEGLYYPQGGPIIISQIENEYQMVEPAFGPGGPR 209
Query: 201 YVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
YV+WAA LAV LQTGVPW+MCKQ+DAPDP+IN CNG CGETF GPNSP+KPA+WTENWT
Sbjct: 210 YVQWAASLAVGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPNKPALWTENWT 269
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ 320
+ Y +YG++ ++RS DI + VALFIA+ GS+V+YYMYHGGTNFGR AS+YV T YYD
Sbjct: 270 TRYPIYGNDTKLRSTGDITFAVALFIARKGGSFVSYYMYHGGTNFGRFASSYVTTSYYDG 329
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFL 380
APLDEYGL+ QP WGHLKELH+AVKL +P+L G + + + QEA +F+ +C AFL
Sbjct: 330 APLDEYGLIWQPTWGHLKELHAAVKLSSEPLLYGTYSNFSLGEDQEAHVFETKLKCVAFL 389
Query: 381 VNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------LDSVE 425
VN DK TV F N+ +L P SISIL DC+TV F T K L+
Sbjct: 390 VNFDKHQRPTVIFRNISLQLAPKSISILSDCRTVVFETGKVNAQHGSRTAEVVQSLNDTH 449
Query: 426 QWEEYKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES--VLKVSS 482
W+ +KE+IP + + L E ++TTKD +DYLWY +++ PSD +L V S
Sbjct: 450 TWKAFKESIPQDISKAAYTGKQLFEHLSTTKDETDYLWYIASYEYRPSDDSHLVLLNVES 509
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKM-VHLINGTNNVSLLSVMVGLPDSGAYLER 541
H+LHAF+NGEFVGS HG H + + + M + L G N +SLL+VMVG PDSGA++ER
Sbjct: 510 QAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKEGQNTISLLNVMVGSPDSGAHMER 569
Query: 542 RVAGLRNVSI-QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
R G+ VSI QG L ++ WGYQVGL GE +I+T GS V W+ + T+ PL
Sbjct: 570 RSFGIHKVSIQQGQHALHLLNNELWGYQVGLFGEGNRIYTQEGSHSVEWTDVNNLTYLPL 629
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWY+T F P G+D V +NL SMGKGE W+NG+SIGRYWVSF TP G PSQS YHIP+ F
Sbjct: 630 TWYQTTFATPMGNDAVTLNLTSMGKGEVWINGESIGRYWVSFKTPSGQPSQSLYHIPQHF 689
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
LK T NLLVL+EE G P I+++TVS+TT+C V++ PPV Q+Q
Sbjct: 690 LKNTDNLLVLVEEMGGNPLQITVNTVSITTVCSSVNELSAPPV-----QSQ--------- 735
Query: 721 PGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
G+ P+V++RC G+ IS + FASYGNP G+C + IGSCH+ +S ++V++AC+GKRSC+
Sbjct: 736 -GKDPEVRLRCQKGKHISAVEFASYGNPAGDCRTFTIGSCHAESSESVVKQACIGKRSCS 794
Query: 781 VPVWTEKFYGDPCPGIPKALLVDAQC 806
+PV F GDPCPGI K+LLV A C
Sbjct: 795 IPVGPGSFGGDPCPGIQKSLLVVAHC 820
>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
Length = 811
Score = 933 bits (2412), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/800 (56%), Positives = 564/800 (70%), Gaps = 35/800 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYDGR+L+++G R++ FSG +HY RSTP+MWP+LIAKAK GGLDV+QT VFWN+HE
Sbjct: 26 GREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHE 85
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P GQ++F GR DLV+FI+E+QAQGLYV LRIGPF+E EW YGG PFWLHDVP I FRSD
Sbjct: 86 PIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSD 145
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK HM+ + T IV MMK LY QGGPII+SQIENEY M+E +F GP YVRWAA
Sbjct: 146 NEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAA 205
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+AV LQTGVPW+MCKQ+DAPDPVIN CNG CGETF GPNSP+KPA+WTENWTS Y +Y
Sbjct: 206 AMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIY 265
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++ ++R EDIA+ VAL+IA+ KGS+V+YYMYHGGTNFGR A++YV T YYD APLDEY
Sbjct: 266 GNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAASYVTTSYYDGAPLDEY 325
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
GL+ QP WGHL+ELH AVK +P+L G + + + QEA +F+ +C AFLVN D+
Sbjct: 326 GLIWQPTWGHLRELHCAVKQSSEPLLFGSYSNFSLGQQQEAHVFETDFKCVAFLVNFDQH 385
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------LDSVEQWEEYK 431
N V F N+ EL P SIS+L DC+ V F TAK L+ + W+ +
Sbjct: 386 NTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWKAFI 445
Query: 432 EAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV--LKVSSLGHVLH 488
E +P +++ N L EQ+ TTKD +DYLWY +K+ SD + L V SL H+LH
Sbjct: 446 EPVPQDLSKSTYTGNQLFEQLPTTKDETDYLWYIVSYKNRASDGNQIARLYVKSLAHILH 505
Query: 489 AFINGEFVGSAHGKHSD-KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
AF+N E+VGS HG H ++ L + L G N +SLLSVMVG PDSGAY+ERR G++
Sbjct: 506 AFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFGIQ 565
Query: 548 NVSI-QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V I QG + + ++ WGYQVGL GEK I+T G V W + + PLTWYKT
Sbjct: 566 TVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMDINNLIYHPLTWYKTT 625
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
F P G+D V +NL SMGKGE WVNG+SIGRYWVSF P G PSQS YHIPR FL P N
Sbjct: 626 FSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPSGQPSQSLYHIPRGFLTPKDN 685
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
LLVL+EE G P I+++T+SVTT+CG+V + +PP+ S G+ PK
Sbjct: 686 LLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS---------------RGKVPK 730
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
V+I C G++IS I FASYGNP G+C ++ IGSCH+ +S ++V+++C+G+R C++PV
Sbjct: 731 VRIWCQGGKRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAA 790
Query: 787 KFYGDPCPGIPKALLVDAQC 806
KF GDPCPGI K+LLV A C
Sbjct: 791 KFGGDPCPGIQKSLLVVADC 810
>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 932 bits (2408), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/698 (64%), Positives = 531/698 (76%), Gaps = 18/698 (2%)
Query: 21 DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLV 80
+G G VTYDGRSLII+G RKILFSGSIHYPRSTPQMWP LIAKAK+GGLDV+QT V
Sbjct: 18 EGFGVEAEEVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPDLIAKAKQGGLDVIQTYV 77
Query: 81 FWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPG 140
FWNLHEPQPG +DFSGR DLV FIKE+QAQGLYVCLRIGPFIE EW YGG PFWLHDVPG
Sbjct: 78 FWNLHEPQPGMYDFSGRYDLVGFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPG 137
Query: 141 IVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP 200
IV+R+DNEPFKF+M+ + T IVNMMK LYASQGGPIILSQIENEY ++ +F G
Sbjct: 138 IVYRTDNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQ 197
Query: 201 YVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
YV+WAAK+AV L TGVPW+MCKQ DAPDPVIN CNG +CGETF GPNSP+KPA+WTENWT
Sbjct: 198 YVQWAAKMAVGLDTGVPWIMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWT 257
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ 320
SFYQVYG IRSAEDIA+HV LFIA+ GSYVNYYMYHGGTNFGRT SAYV+TGYYDQ
Sbjct: 258 SFYQVYGGLPYIRSAEDIAFHVTLFIAR-NGSYVNYYMYHGGTNFGRTGSAYVITGYYDQ 316
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAF 379
APLDEYGLLRQPKWGHLK+LH +K C +L GV + +L E ++F+ EC AF
Sbjct: 317 APLDEYGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFTLGQLLEVYVFEEEKGECVAF 376
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD---------------SV 424
L+N D+ N ATV F N YEL P SISILPDC+ V F+TA ++ SV
Sbjct: 377 LINNDRDNKATVQFRNSSYELLPKSISILPDCQNVTFSTANVNTTSNRRIISPKQNFSSV 436
Query: 425 EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLG 484
+ W+++++ I +D TSL+++ LLEQMNTTKD SDYLWY RF+++ S S+ L V S
Sbjct: 437 DDWQQFQDVISNFDNTSLKSDSLLEQMNTTKDKSDYLWYTLRFEYNLSCSKPTLSVQSAA 496
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
HV HAF+N ++G HG H KSFTLE V + GTNN+S+LSVMVGLPDSGA+LERR A
Sbjct: 497 HVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSVMVGLPDSGAFLERRFA 556
Query: 545 GLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWY 603
GL +V +Q +E + ++ +WGYQVGL+GE+LQ++ + + WS+ G+ Q L WY
Sbjct: 557 GLISVELQCSEQESLNLTNSTWGYQVGLMGEQLQVYKEQNNSDTGWSQLGNVMEQTLFWY 616
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKP 663
KT FD P G DPV ++L SMGKGEAWVNG+SIGRYW+ F +G PSQS YH+PRSFLK
Sbjct: 617 KTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYWILFHDSKGNPSQSLYHVPRSFLKD 676
Query: 664 TGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
+GN+LVLLEE G P GIS+DTVSVT L + S LP
Sbjct: 677 SGNVLVLLEEGGGNPLGISLDTVSVTDLQQNFSKLSLP 714
>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 717
Score = 926 bits (2392), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 451/707 (63%), Positives = 530/707 (74%), Gaps = 18/707 (2%)
Query: 12 LLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEG 71
LLL +G G VTYDGRSLII+G RKILFSG IHYPRSTPQMWP LIAKAK+G
Sbjct: 9 LLLVFWKIREGFGVKAEEVTYDGRSLIIDGQRKILFSGLIHYPRSTPQMWPDLIAKAKQG 68
Query: 72 GLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGL 131
GLDV+QT VFWNLHEPQPG +DF GR DLV FIKE+QAQGLYVCLRIGPFI+ EW YGG
Sbjct: 69 GLDVIQTYVFWNLHEPQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIGPFIQSEWKYGGF 128
Query: 132 PFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVE 191
PFWLHDVPGIV+R+DNE FKF+M+ + T IVNMMK LYASQGGPIILSQIENEY ++
Sbjct: 129 PFWLHDVPGIVYRTDNESFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQ 188
Query: 192 HSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK 251
+F G YV+WAAK+AV L TGVPWVMCKQ DAPDPVIN CNG +CGETF GPNSP+K
Sbjct: 189 KAFGTAGSQYVQWAAKMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFTGPNSPNK 248
Query: 252 PAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
PA+WTENWTSFYQVYG IRSAEDIA+HV LFIA+ GSYVNYYMYHGGTNFGRTASA
Sbjct: 249 PALWTENWTSFYQVYGGLPYIRSAEDIAFHVTLFIAR-NGSYVNYYMYHGGTNFGRTASA 307
Query: 312 YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ 371
YV+TGYYDQAPLDEYGLLRQPKWGHLK+LH +K C +L GV + + +LQE ++F+
Sbjct: 308 YVITGYYDQAPLDEYGLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFSLGQLQEGYVFE 367
Query: 372 GSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------- 422
EC AFL N D+ N TV F N YEL P SISILPDC+ VAFNTA ++
Sbjct: 368 EEKGECVAFLKNNDRDNKVTVQFRNRSYELLPRSISILPDCQNVAFNTANVNTTSNRRII 427
Query: 423 -------SVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE 475
S++ W+++++ IP +D TSLR++ LLEQMNTTKD SDYLWY RF+++ S +
Sbjct: 428 SPKQNFSSLDDWKQFQDVIPYFDNTSLRSDSLLEQMNTTKDKSDYLWYTLRFEYNLSCRK 487
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
L V S HV HAFIN ++G HG H KSFTLE V + GTNN+S+LS MVGLPDS
Sbjct: 488 PTLSVQSAAHVAHAFINNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSAMVGLPDS 547
Query: 536 GAYLERRVAGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
GA+LERR AGL +V +Q +E + ++ +WGYQVGLLGE+LQ++ + + WS+ G+
Sbjct: 548 GAFLERRFAGLISVELQCSEQESLNLTNSTWGYQVGLLGEQLQVYKKQNNSDIGWSQLGN 607
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWY 654
Q L WYKT FD P G DPV ++L SMGKGEAWVN QSIGRYW+ F +G PSQS Y
Sbjct: 608 IMEQLLIWYKTTFDTPEGDDPVVLDLSSMGKGEAWVNEQSIGRYWILFHDSKGNPSQSLY 667
Query: 655 HIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
H+PRSFLK TGN+LVL+EE G P GIS+DTVSV L + S LP
Sbjct: 668 HVPRSFLKDTGNVLVLVEEGGGNPLGISLDTVSVIDLQQNFSKLTLP 714
>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
Length = 719
Score = 925 bits (2390), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 448/714 (62%), Positives = 542/714 (75%), Gaps = 21/714 (2%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
+CL ++L I G G VTYDGRSLIING R ILFSGSIHYPRSTPQMWP LIA
Sbjct: 5 VCLM-MMLVAILELSFGVKGAEEVTYDGRSLIINGQRNILFSGSIHYPRSTPQMWPGLIA 63
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KAK+GGLDV+QT VFWNLHEPQPG++DFSGR DLV FIKE+ AQGLYV LRIGPFIE EW
Sbjct: 64 KAKQGGLDVIQTYVFWNLHEPQPGKYDFSGRNDLVGFIKEIHAQGLYVSLRIGPFIESEW 123
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
YGG PFWLHDVPGIV+R+DNEPFKF+M+ + T IVNMMK LYASQGGPIILSQIENE
Sbjct: 124 NYGGFPFWLHDVPGIVYRTDNEPFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENE 183
Query: 187 YGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP 246
YG ++ +F G YV WAAK+AV L TGVPWVMCKQ DAPDPVIN CNG +CGETF GP
Sbjct: 184 YGNIQKAFGTAGSQYVEWAAKMAVGLNTGVPWVMCKQPDAPDPVINTCNGMRCGETFTGP 243
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
NSP+KPA+WTENWTSFYQVYG IRSAEDIA+HV LF+A+ GS+VNYYMYHGGTNFG
Sbjct: 244 NSPNKPAMWTENWTSFYQVYGGVPYIRSAEDIAFHVTLFVAR-NGSFVNYYMYHGGTNFG 302
Query: 307 RTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQE 366
RT+SAY++TGYYDQAPLDEYGL RQPKWGHLKELH+A+K C +L GV + + +LQE
Sbjct: 303 RTSSAYMITGYYDQAPLDEYGLFRQPKWGHLKELHAAIKSCSTTLLQGVQRNFSLGELQE 362
Query: 367 AFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD--- 422
++F+ + +CAAFL+N DK N TV F+N Y+L P SISILPDC+ VAFNTA L+
Sbjct: 363 GYVFEEENGKCAAFLINNDKGNTVTVQFNNSSYKLLPKSISILPDCQNVAFNTAHLNTTS 422
Query: 423 ------------SVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD 470
SV+ W+++++ IP +D+TSLR++ LLEQMNTTKD SDYLWY R +++
Sbjct: 423 NRRIITSRQNFSSVDDWKQFQDVIPNFDDTSLRSDSLLEQMNTTKDKSDYLWYTLRLENN 482
Query: 471 PSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMV 530
S ++ +L V S HV +AF+N ++G HG H KSFTLE + L TNN+S+LS MV
Sbjct: 483 LSCNDPILHVQSSAHVAYAFVNNTYIGGEHGNHDVKSFTLELPITLNERTNNISILSGMV 542
Query: 531 GLPDSGAYLERRVAGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW 589
GLPDSGA+LE+R AGL NV +Q +E + ++ +WGYQVGLLGE+L+++T+ S + W
Sbjct: 543 GLPDSGAFLEKRFAGLNNVELQCSEQESLNLNNSTWGYQVGLLGEQLKVYTEQNSTDIKW 602
Query: 590 SRYGSST--HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQG 647
++ G+ T LTWYKT FD P G DP+A++L SM KGEAWVNGQSIGRYW+ FL +G
Sbjct: 603 TQLGNITIDEVTLTWYKTTFDTPKGDDPIALDLSSMAKGEAWVNGQSIGRYWILFLDSKG 662
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
PSQS YH+PRSFLK + N LVLL+E G P IS++TVSVT L + S P
Sbjct: 663 NPSQSLYHVPRSFLKDSENSLVLLDEGGGNPLDISLNTVSVTDLQDNFSKLPFP 716
>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
Length = 673
Score = 918 bits (2373), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/677 (63%), Positives = 520/677 (76%), Gaps = 23/677 (3%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGRSLII+G RKILFSGSIHYPRSTPQMWP LI+KAKEGGLDV+QT VFWNLHEPQ
Sbjct: 4 VTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPALISKAKEGGLDVIQTYVFWNLHEPQF 63
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+DFSGR DLVRFIKE+Q QGLYVCLRIGP+IE EW YGG PFWLHDVP IV+R+DN+P
Sbjct: 64 GQYDFSGRYDLVRFIKEIQVQGLYVCLRIGPYIESEWTYGGFPFWLHDVPAIVYRTDNQP 123
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK +M+ + T IV+MM++ LYASQGGPIILSQIENEY VE +F E G YV+WAA++A
Sbjct: 124 FKLYMQNFTTKIVSMMQSEGLYASQGGPIILSQIENEYQNVEKAFGEDGSRYVQWAAEMA 183
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L+TGVPW+MCKQ DAPDP+IN CNG +CGETF GPNSP+KPA WTENWTSFYQVYG E
Sbjct: 184 VGLKTGVPWLMCKQTDAPDPLINTCNGMRCGETFTGPNSPNKPAFWTENWTSFYQVYGGE 243
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
IRSAEDIA+HV LFIA+ GSYVNYYMYHGGTN GRT+S+YV+T YYDQAPLDEYGLL
Sbjct: 244 PYIRSAEDIAFHVTLFIARKNGSYVNYYMYHGGTNLGRTSSSYVITSYYDQAPLDEYGLL 303
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
RQPKWGHLKELH+A+K C +L G + + +LQE ++F+ +C AFLVN D
Sbjct: 304 RQPKWGHLKELHAAIKSCSTTLLEGKQSNFSLGQLQEGYVFEEEGKCVAFLVNNDHVKMF 363
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLD---------------SVEQWEEYKEAI 434
TV F N YELP SISILPDC+ V FNTA ++ S ++WE++++ I
Sbjct: 364 TVQFRNRSYELPSKSISILPDCQNVTFNTATVNTKSNRRMTSTIQTFSSADKWEQFQDVI 423
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGE 494
P +D+T+L +N LLEQMN TKD SDYLWY SES L S HV HAF +G
Sbjct: 424 PNFDQTTLISNSLLEQMNVTKDKSDYLWYTL--------SESKLTAQSAAHVTHAFADGT 475
Query: 495 FVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGA 554
++G AHG H KSFT + + L GTNN+S+LSVMVGLPD+GA+LERR AGL V IQ +
Sbjct: 476 YLGGAHGSHDVKSFTTQVPLKLNEGTNNISILSVMVGLPDAGAFLERRFAGLTAVEIQCS 535
Query: 555 KELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSD 614
+E D ++ +WGYQVGLLGE+L+I+ + + + WS G++ +Q LTWYKT FD+P G +
Sbjct: 536 EESYDLTNSTWGYQVGLLGEQLEIYEEKSNSSIQWSPLGNTCNQTLTWYKTAFDSPKGDE 595
Query: 615 PVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
PVA+NL SMGKG+AWVNG+SIGRYW+SF +G PSQ+ YH+PRSFLK GN LVL EEE
Sbjct: 596 PVALNLESMGKGQAWVNGESIGRYWISFHDSKGQPSQTLYHVPRSFLKDIGNSLVLFEEE 655
Query: 675 NGYPPGISIDTVSVTTL 691
G P IS+DT+S T +
Sbjct: 656 GGNPLHISLDTISSTNI 672
>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
Length = 706
Score = 905 bits (2339), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 444/714 (62%), Positives = 530/714 (74%), Gaps = 36/714 (5%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
MG+ L L+LT + G NVTYD SL+INGH KILFSGSIHYPRSTPQM
Sbjct: 1 MGEWWRFLLHALILTVSLCTVHGA----NVTYDRTSLVINGHHKILFSGSIHYPRSTPQM 56
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI+KAKEGGLDV+QT VFWNLHEPQ GQ++F+GR DLV FIKE+QAQGLYV LRIGP
Sbjct: 57 WPDLISKAKEGGLDVIQTYVFWNLHEPQQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGP 116
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
+IE E YGGLP WLHDVPGIVFR+DN+ FKFHM+R+ T IVNMMK+A L+ASQGGPIIL
Sbjct: 117 YIESECTYGGLPLWLHDVPGIVFRTDNDQFKFHMQRFTTKIVNMMKSANLFASQGGPIIL 176
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG ++ F G PY+ WAA++AV LQTGVPW+MCKQDDAPDPVINACNG QCG
Sbjct: 177 SQIENEYGSIQSKFRANGLPYIHWAAQMAVGLQTGVPWMMCKQDDAPDPVINACNGMQCG 236
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
F GPNSP+KP++WTENWTSF Q +G +RSA DIAY+VALFIAK KGSYVNYYMYH
Sbjct: 237 RNFKGPNSPNKPSLWTENWTSFLQAFGGAPYMRSASDIAYNVALFIAK-KGSYVNYYMYH 295
Query: 301 GGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
GGTNF R ASA+++T YYD+APLDEYGL+RQPKWGHLKELH+++K C +P+L G + +
Sbjct: 296 GGTNFDRLASAFIITAYYDEAPLDEYGLVRQPKWGHLKELHASIKSCSQPLLDGTQTTFS 355
Query: 361 FSKLQEAFIFQGSSECAAFLVNKDKRN-----------NATVYFSNLMYELPPLSISILP 409
Q+ + S + ++ +N + T+ F N+ YELP SISILP
Sbjct: 356 LGSEQQVIKNESSWTYFPLMFSEVPQNVLLSWKISGPRDVTIQFQNISYELPGKSISILP 415
Query: 410 DCKTVAFNTAKL---------------DSVEQWEEYKEAIPTYDETSLRANFLLEQMNTT 454
CK V FNT K+ +S E W+ Y EAIP + TS RA+ LL+Q++T
Sbjct: 416 GCKNVVFNTGKVSIQNNVRAMKPRLQFNSAENWKVYTEAIPNFAHTSKRADTLLDQISTA 475
Query: 455 KDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV 514
KD SDY+WY FRF + +++SVL + S G VLH+FING GSAHG ++ T++K V
Sbjct: 476 KDTSDYMWYTFRFNNKSPNAKSVLSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNV 535
Query: 515 HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGE 574
+LING NN+S+LS VGLP+SGA+LE RVAGLR V +QG +DFSS+SWGYQVGLLGE
Sbjct: 536 NLINGMNNISILSATVGLPNSGAFLESRVAGLRKVEVQG----RDFSSYSWGYQVGLLGE 591
Query: 575 KLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQS 634
KLQIFT GS V W + SST +PLTWY+T F AP G+DPV +NL SMGKG AWVNGQ
Sbjct: 592 KLQIFTVSGSSKVQWKSFQSST-KPLTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQG 650
Query: 635 IGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
IGRYWVSF P GTPSQ WYHIPRSFLK TGNLLV+LEEE G P GI++DTV +
Sbjct: 651 IGRYWVSFHKPDGTPSQQWYHIPRSFLKSTGNLLVILEEETGNPLGITLDTVYI 704
>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
Length = 710
Score = 888 bits (2295), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/685 (62%), Positives = 515/685 (75%), Gaps = 45/685 (6%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G VTYDGRSLII+GHRKILFSGSIHYPRSTPQMW LIAKAKEGG+DV+QT VFWN HE
Sbjct: 23 GAQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHE 82
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQPGQ+DF+GR DL +FIKE+QAQGLY CLRIGPFIE EW YGGLPFWLHDV GIV+R+D
Sbjct: 83 PQPGQYDFNGRYDLXKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTD 142
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFKF+M+ + T IVN+MK+ LYASQGGPIILSQIENEY +E +F EKGP YVRWAA
Sbjct: 143 NEPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAA 202
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+AV+LQTGVPWVMCKQ DAPDPVIN CNG +CG+TF GPNSP+KP++WTENWTSFY+V+
Sbjct: 203 KMAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVF 262
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G E +RSAEDIA+HVALFIA+ GSYVNYYM
Sbjct: 263 GGETYLRSAEDIAFHVALFIAR-NGSYVNYYMV--------------------------- 294
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDK 385
L+RQPKWGHLKELH+A+ LC P+L+GV +++ +LQEA++FQ C AFLVN D+
Sbjct: 295 SLIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDE 354
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------DSVEQWEEY 430
NN+TV F N+ EL P SISILPDCK V FNTAK+ D+V++WEEY
Sbjct: 355 GNNSTVLFQNVSIELLPKSISILPDCKNVIFNTAKINTGYNERITTSSQSFDAVDRWEEY 414
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAF 490
K+AIP + +TSL++N +LE MN TKD SDYLWY FRF+ + S +E +L + SL H +HAF
Sbjct: 415 KDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYTFRFQPNSSCTEPLLHIESLAHAVHAF 474
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVS 550
+N +VG+ HG H K FT + + L N NN+S+LSVMVG PDSGAYLE R AGL V
Sbjct: 475 VNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSGAYLESRFAGLTRVE 534
Query: 551 IQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
IQ K + DF++++WGYQVGL GEKL I+ + V W + ST+QPLTWYK VF+
Sbjct: 535 IQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRKTEISTNQPLTWYKIVFNT 594
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLV 669
P+G DPVA+NL +MGKGEAWVNGQSIGRYWVSF +G PSQ+ YH+PR+FLK + NLLV
Sbjct: 595 PSGDDPVALNLSTMGKGEAWVNGQSIGRYWVSFHNSKGDPSQTLYHVPRAFLKTSENLLV 654
Query: 670 LLEEENGYPPGISIDTVSVTTLCGH 694
LLEE NG P IS++T+S T L H
Sbjct: 655 LLEEANGDPLHISLETISRTDLPDH 679
>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
thaliana]
Length = 636
Score = 872 bits (2254), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 424/644 (65%), Positives = 495/644 (76%), Gaps = 24/644 (3%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M Q +F +L+ I D NVTYDGRSLII+G KILFSGSIHY RSTPQM
Sbjct: 1 MTTFQYSLVFLVLMAVIVAGDVA-----NVTYDGRSLIIDGEHKILFSGSIHYTRSTPQM 55
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LIAKAK GG+DVV T VFWN+HEPQ GQFDFSG RD+V+FIKEV+ GLYVCLRIGP
Sbjct: 56 WPSLIAKAKSGGIDVVDTYVFWNVHEPQQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGP 115
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
FI+GEW YGGLPFWLH+V GIVFR+DNEPFK+HMKRYA MIV +MK+ LYASQGGPIIL
Sbjct: 116 FIQGEWSYGGLPFWLHNVQGIVFRTDNEPFKYHMKRYAKMIVKLMKSENLYASQGGPIIL 175
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYGMV +F ++G YV+W AKLAV+L TGVPWVMCKQDDAPDP++NACNGRQCG
Sbjct: 176 SQIENEYGMVGRAFRQEGKSYVKWTAKLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCG 235
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
ETF GPNSP+KPAIWTENWTSFYQ YG+E IRSAEDIA+HVALFIAK GS+VNYYMYH
Sbjct: 236 ETFKGPNSPNKPAIWTENWTSFYQTYGEEPLIRSAEDIAFHVALFIAK-NGSFVNYYMYH 294
Query: 301 GGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
GGTNFGR AS +V+T YYDQAPLDEYGLLRQPKWGHLKELH+AVKLC +P+LSG+ +++
Sbjct: 295 GGTNFGRNASQFVITSYYDQAPLDEYGLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTIS 354
Query: 361 FSKLQEAFIF-QGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
KLQ AF+F + ++ CAA LVN+DK +TV F N Y L P S+S+LPDCK VAFNTA
Sbjct: 355 LGKLQTAFVFGKKANLCAAILVNQDK-CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTA 413
Query: 420 K---------------LDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
K L S + WEE+ E +P++ ETS+R+ LLE MNTT+D SDYLW
Sbjct: 414 KVNAQYNTRTRKARQNLSSPQMWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQT 473
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
RF+ + SVLKV+ LGH LHAF+NG F+GS HG F LEK + L NGTNN++
Sbjct: 474 TRFQQSEG-APSVLKVNHLGHALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLA 532
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGS 584
LLSVMVGLP+SGA+LERRV G R+V I + F+++SWGYQVGL GEK ++T+ GS
Sbjct: 533 LLSVMVGLPNSGAHLERRVVGSRSVKIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGS 592
Query: 585 RIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
V W +Y S QPLTWYK FD P G DPVA+NL SMGKGEA
Sbjct: 593 AKVQWKQYRDSKSQPLTWYKASFDTPEGEDPVALNLGSMGKGEA 636
>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
Length = 765
Score = 872 bits (2252), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/800 (53%), Positives = 535/800 (66%), Gaps = 81/800 (10%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYDGR+L+++G R++ FSG +HY RSTP+MWP+LIAKAK GGLDV+QT VFWN+HE
Sbjct: 26 GREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHE 85
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P GQ++F GR DLV+FI+E+QAQGLYV LRIGPF+E EW YGG PFWLHDVP I FRSD
Sbjct: 86 PIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSD 145
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK HM+ + T IV MMK LY QGGPII+SQIENEY M+E +F GP YVRWAA
Sbjct: 146 NEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAA 205
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+AV LQTGVPW+MCKQ+DAPDPVIN CNG CGETF GPNSP+KPA+WTENWTS Y +Y
Sbjct: 206 AMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIY 265
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++ ++R+ EDIA+ VALFIA+ KGS+V+YYMYHGGTNFGR A++YV T YYD APLDEY
Sbjct: 266 GNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAASYVTTSYYDGAPLDEY 325
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
+C AFLVN D+
Sbjct: 326 DF----------------------------------------------KCVAFLVNFDQH 339
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------LDSVEQWEEYK 431
N V F N+ EL P SIS+L DC+ V F TAK L+ + W+ +
Sbjct: 340 NTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWKAFI 399
Query: 432 EAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV--LKVSSLGHVLH 488
E +P +++ N L EQ+ TTKD +DYLWY +K+ SD + L V SL H+LH
Sbjct: 400 EPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWYIVSYKNRASDGNQIAHLYVKSLAHILH 459
Query: 489 AFINGEFVGSAHGKHSD-KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
AF+N E+VGS HG H ++ L + L G N +SLLSVMVG PDSGAY+ERR G++
Sbjct: 460 AFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFGIQ 519
Query: 548 NVSI-QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V I QG + + ++ WGYQVGL GEK I+T G+ V W + + PLTWYKT
Sbjct: 520 TVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSVRWMDINNLIYHPLTWYKTT 579
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
F P G+D V +NL SMGKGE WVNG+SIGRYWVSF P G PSQS YHIPR FL P N
Sbjct: 580 FSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPSGQPSQSLYHIPRGFLTPKDN 639
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
LLVL+EE G P I+++T+SVTT+CG+V + +PP+ S G+ PK
Sbjct: 640 LLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS---------------RGKVPK 684
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
V+I C G +IS I FASYGNP G+C ++ IGSCH+ +S ++V+++C+G+R C++PV
Sbjct: 685 VRIWCQGGNRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAA 744
Query: 787 KFYGDPCPGIPKALLVDAQC 806
KF GDPCPGI K+LLV A C
Sbjct: 745 KFGGDPCPGIQKSLLVVADC 764
>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
Length = 761
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/800 (53%), Positives = 534/800 (66%), Gaps = 81/800 (10%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYDGR+L+++G R++ FSG +HY RSTP+MWP+LIAKAK GGLDV+QT VFWN+HE
Sbjct: 22 GREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHE 81
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P GQ++F GR DLV+FI+E+QAQGLYV LRIGPF+E EW YGG PFWLHDVP I FRSD
Sbjct: 82 PIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSD 141
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK HM+ + T IV MMK LY QGGPII+SQIENEY M+E +F GP YVRWAA
Sbjct: 142 NEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAA 201
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+AV LQTGVPW+MCKQ+DAPDPVIN CNG CGETF GPNSP+KPA+WTENWTS Y +Y
Sbjct: 202 AMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPIY 261
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++ ++R EDIA+ VAL+IA+ KGS+V+YYMYHGGTNFGR A++YV T YYD APLDEY
Sbjct: 262 GNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAASYVTTSYYDGAPLDEY 321
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
+C AFLVN D+
Sbjct: 322 DF----------------------------------------------KCVAFLVNFDQH 335
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------LDSVEQWEEYK 431
N V F N+ EL P SIS+L DC+ V F TAK L+ + W+ +
Sbjct: 336 NTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWKAFI 395
Query: 432 EAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV--LKVSSLGHVLH 488
E +P +++ N L EQ+ TTKD +DYLWY +K+ SD + L V SL H+LH
Sbjct: 396 EPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWYIVSYKNRASDGNQIARLYVKSLAHILH 455
Query: 489 AFINGEFVGSAHGKHSD-KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
AF+N E+VGS HG H ++ L + L G N +SLLSVMVG PDSGAY+ERR G++
Sbjct: 456 AFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAYMERRTFGIQ 515
Query: 548 NVSI-QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V I QG + + ++ WGYQVGL GEK I+T G V W + + PLTWYKT
Sbjct: 516 TVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMDINNLIYHPLTWYKTT 575
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
F P G+D V +NL SMGKGE WVNG+SIGRYWVSF P G PSQS YHIPR FL P N
Sbjct: 576 FSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPSGQPSQSLYHIPRGFLTPKDN 635
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
LLVL+EE G P I+++T+SVTT+CG+V + +PP+ S G+ PK
Sbjct: 636 LLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS---------------RGKVPK 680
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
V+I C G++IS I FASYGNP G+C ++ IGSCH+ +S ++V+++C+G+R C++PV
Sbjct: 681 VRIWCQGGKRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAA 740
Query: 787 KFYGDPCPGIPKALLVDAQC 806
KF GDPCPGI K+LLV A C
Sbjct: 741 KFGGDPCPGIQKSLLVVADC 760
>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
Length = 775
Score = 865 bits (2234), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/810 (52%), Positives = 535/810 (66%), Gaps = 91/810 (11%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYDGR+L+++G R++ FSG +HY RSTP+MWP+LIAKAK GGLDV+QT VFWN+HE
Sbjct: 26 GREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHE 85
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P GQ++F GR DLV+FI+E+QAQGLYV LRIGPF+E EW YGG PFWLHDVP I FRSD
Sbjct: 86 PIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSD 145
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK HM+ + T IV MMK LY QGGPII+SQIENEY M+E +F GP YVRWAA
Sbjct: 146 NEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAA 205
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTS----- 261
+AV LQTGVPW+MCKQ+DAPDPVIN CNG CGETF GPNSP+KPA+WTENWTS
Sbjct: 206 AMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSNGQ 265
Query: 262 -----FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTG 316
Y +YG++ ++R+ EDIA+ VALFIA+ KGS+V+YYMYHGGTNFGR A++YV T
Sbjct: 266 NNSAFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAASYVTTS 325
Query: 317 YYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSEC 376
YYD APLDEY +C
Sbjct: 326 YYDGAPLDEYDF----------------------------------------------KC 339
Query: 377 AAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------L 421
AFLVN D+ N V F N+ EL P SIS+L DC+ V F TAK L
Sbjct: 340 VAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSL 399
Query: 422 DSVEQWEEYKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV--L 478
+ + W+ + E +P +++ N L EQ+ TTKD +DYLWY +K+ SD + L
Sbjct: 400 NDINNWKAFIEPVPQDLSKSTYTGNQLFEQLTTTKDETDYLWYIVSYKNRASDGNQIAHL 459
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSD-KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
V SL H+LHAF+N E+VGS HG H ++ L + L G N +SLLSVMVG PDSGA
Sbjct: 460 YVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGA 519
Query: 538 YLERRVAGLRNVSI-QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
Y+ERR G++ V I QG + + ++ WGYQVGL GEK I+T G+ V W +
Sbjct: 520 YMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSVRWMDINNLI 579
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
+ PLTWYKT F P G+D V +NL SMGKGE WVNG+SIGRYWVSF P G PSQS YHI
Sbjct: 580 YHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPSGQPSQSLYHI 639
Query: 657 PRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKT 716
PR FL P NLLVL+EE G P I+++T+SVTT+CG+V + +PP+ S
Sbjct: 640 PRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGNVDEFSVPPLQS----------- 688
Query: 717 HKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGK 776
G+ PKV+I C G +IS I FASYGNP G+C ++ IGSCH+ +S ++V+++C+G+
Sbjct: 689 ----RGKVPKVRIWCQGGNRISSIEFASYGNPVGDCRSFRIGSCHAESSESVVKQSCIGR 744
Query: 777 RSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
R C++PV KF GDPCPGI K+LLV A C
Sbjct: 745 RGCSIPVMAAKFGGDPCPGIQKSLLVVADC 774
>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 718
Score = 865 bits (2234), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/704 (59%), Positives = 512/704 (72%), Gaps = 25/704 (3%)
Query: 9 LFGLLLTTIGGS----DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+FGL L I G+ GG VTYDGRSLII+G RK+LFSGSIHYPRSTP+MWP L
Sbjct: 7 VFGLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSL 66
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I KAKEGG+DV+QT VFWNLHEP+ GQ+DFSGR DLV+FIKE+++QGLYVCLRIGPFIE
Sbjct: 67 IKKAKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEA 126
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW YGGLPFWL DVPG+V+R+DNEPFKFHM+++ IV++MK+ LYASQGGPIILSQIE
Sbjct: 127 EWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIE 186
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEY VE +F EKG Y++WA ++AV L+TGVPW+MCK DAPDPVIN CNG +CGETF
Sbjct: 187 NEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFP 246
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
GPNSP+KP +WTE+WTSF+QVYG E IRSAEDIA+H ALF+AK GSY+NYYMYHGGTN
Sbjct: 247 GPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAK-NGSYINYYMYHGGTN 305
Query: 305 FGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
FGRT+S+Y +TGYYDQAPLDEYGLLRQPK+GHLKELH+A+K P+L G ++ +
Sbjct: 306 FGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPM 365
Query: 365 QEAFIFQGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS 423
Q+A++F+ ++ C AFLVN D + + + F N Y L P SI IL +CK + + TAK++
Sbjct: 366 QQAYVFEDANNGCVAFLVNNDAK-ASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNV 424
Query: 424 V---------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFK 468
+ W ++E IP + TSL+ N LLE N TKD +DYLWY FK
Sbjct: 425 KMNTRVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFK 484
Query: 469 HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSV 528
D + + S GHV+H F+N GS HG + L+ V LING NN+S+LS
Sbjct: 485 LDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSG 544
Query: 529 MVGLPDSGAYLERRVAGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIV 587
MVGLPDSGAY+ERR GL V I G + D S WGY VGLLGEK++++ V
Sbjct: 545 MVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRV 604
Query: 588 PWS--RYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTP 645
WS + G ++PL WYKT FD P G PV +++ SMGKGE WVNG+SIGRYWVSFLTP
Sbjct: 605 KWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTP 664
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G PSQS YHIPR+FLKP+GNLLV+ EEE G P GIS++T+SV
Sbjct: 665 AGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTISVV 708
>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
Length = 716
Score = 863 bits (2229), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/707 (59%), Positives = 511/707 (72%), Gaps = 27/707 (3%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
G C L L G+ L GG+ G VTYDGRSLII+G RK+LFSGSIHYPRSTP+M
Sbjct: 7 FGLC--LILVGMFLVFPGGATAAKG----VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEM 60
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI K KEGG+DV+QT VFWNLHEP+ GQ+DFSGR DLV+FIKE+++QGLYVCLRIGP
Sbjct: 61 WPSLIKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGP 120
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
FIE EW YGGLPFWL DVPG+V+R+DNEPFKFHM+++ T IVN+MK+ LYASQGGPIIL
Sbjct: 121 FIEAEWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTTKIVNLMKSEGLYASQGGPIIL 180
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEY VE +F EKG Y++WA ++AV L+TGVPW+MCK DAPDPVIN CNG +CG
Sbjct: 181 SQIENEYANVEAAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMRCG 240
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
ETF GPNSP+KP +WTE+WTSF+QVYG E IRSAEDIA+H LFIAK GSY+NYYMYH
Sbjct: 241 ETFPGPNSPNKPKMWTEDWTSFFQVYGTEPYIRSAEDIAFHAVLFIAK-NGSYINYYMYH 299
Query: 301 GGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
GGTNFGRT+S+Y +TGYYDQAPLDEYGLLRQPK+GHLKELH+A+K P+L G ++
Sbjct: 300 GGTNFGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILS 359
Query: 361 FSKLQEAFIFQ-GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
+Q+A++F+ SS C AFLVN D + + + F Y L P SI IL +CK + + TA
Sbjct: 360 LGPMQQAYVFEDASSGCVAFLVNNDAK-VSQIQFRKSSYSLSPKSIGILQNCKNLIYETA 418
Query: 420 KLDSV---------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
K++ E+WE ++E IP + TSL+AN LLE N TKD +DYLWY
Sbjct: 419 KVNVEKNKRVTTPVQVFNVPEKWEGFRETIPAFSGTSLKANALLEHTNLTKDKTDYLWYT 478
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
FK D + + + S GHV+H F+N GS HG K L+ L NG N++S
Sbjct: 479 SSFKPDSPCTNPSIYIESSGHVVHVFVNNALAGSGHGSRDIKVVKLQVPASLTNGQNSIS 538
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYG 583
+LS MVGLPDSGAY+ER+ GL V I G + D S WGY VGLLGEK+++
Sbjct: 539 ILSGMVGLPDSGAYMERKSYGLTKVQISCGGTKPIDLSGSQWGYSVGLLGEKVRLQQWRN 598
Query: 584 SRIVPWS--RYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVS 641
V WS G ++PL WYKT+FD P G PV +N+ SMGKGE WVNG+SIGRYWVS
Sbjct: 599 LNRVKWSMNNAGLIKNRPLIWYKTIFDGPNGDGPVGLNMSSMGKGEIWVNGESIGRYWVS 658
Query: 642 FLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
FLTP G PSQS YHIPR FLKP+GNLLV+ EEE G P GIS++T+SV
Sbjct: 659 FLTPSGHPSQSIYHIPREFLKPSGNLLVVFEEEGGDPLGISLNTISV 705
>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
Length = 718
Score = 863 bits (2229), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 417/704 (59%), Positives = 511/704 (72%), Gaps = 25/704 (3%)
Query: 9 LFGLLLTTIGGS----DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+FGL L I G+ GG VTYDGRSLII+G RK+LFSGSIHYPRSTP+MWP L
Sbjct: 7 VFGLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSL 66
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I K KEGG+DV+QT VFWNLHEP+ GQ+DFSGR DLV+FIKE+++QGLYVCLRIGPFIE
Sbjct: 67 IKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEA 126
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW YGGLPFWL DVPG+V+R+DNEPFKFHM+++ IV++MK+ LYASQGGPIILSQIE
Sbjct: 127 EWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIE 186
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEY VE +F EKG Y++WA ++AV L+TGVPW+MCK DAPDPVIN CNG +CGETF
Sbjct: 187 NEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFP 246
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
GPNSP+KP +WTE+WTSF+QVYG E IRSAEDIA+H ALF+AK GSY+NYYMYHGGTN
Sbjct: 247 GPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAK-NGSYINYYMYHGGTN 305
Query: 305 FGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
FGRT+S+Y +TGYYDQAPLDEYGLLRQPK+GHLKELH+A+K P+L G ++ +
Sbjct: 306 FGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPM 365
Query: 365 QEAFIFQGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS 423
Q+A++F+ ++ C AFLVN D + + + F N Y L P SI IL +CK + + TAK++
Sbjct: 366 QQAYVFEDANNGCVAFLVNNDAK-ASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNV 424
Query: 424 V---------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFK 468
+ W ++E IP + TSL+ N LLE N TKD +DYLWY FK
Sbjct: 425 KMNTRVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFK 484
Query: 469 HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSV 528
D + + S GHV+H F+N GS HG + L+ V LING NN+S+LS
Sbjct: 485 LDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSG 544
Query: 529 MVGLPDSGAYLERRVAGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIV 587
MVGLPDSGAY+ERR GL V I G + D S WGY VGLLGEK++++ V
Sbjct: 545 MVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRV 604
Query: 588 PWS--RYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTP 645
WS + G ++PL WYKT FD P G PV +++ SMGKGE WVNG+SIGRYWVSFLTP
Sbjct: 605 KWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTP 664
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G PSQS YHIPR+FLKP+GNLLV+ EEE G P GIS++T+SV
Sbjct: 665 AGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTISVV 708
>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 718
Score = 857 bits (2214), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/704 (58%), Positives = 508/704 (72%), Gaps = 25/704 (3%)
Query: 9 LFGLLLTTIGGS----DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+FGL L I G+ GG VTYDGRSLII+G RK+LFSGSIHYPRSTP+MWP L
Sbjct: 7 VFGLCLILIVGTFLEFSGGATAAKGVTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSL 66
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I K KEGG+DV+QT VFWNLHEP+ GQ+DFSGR DLV+FIKE+++QGLYVCLRIGPFIE
Sbjct: 67 IKKTKEGGIDVIQTYVFWNLHEPKLGQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEA 126
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW YGGLPFWL DVPG+V+R+DNEPFKFHM+++ IV++MK+ LYASQGGPIILSQIE
Sbjct: 127 EWNYGGLPFWLRDVPGMVYRTDNEPFKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIE 186
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEY VE +F EKG Y++WA ++AV L+TGVPW+MCK DAPDPVIN CNG +CGETF
Sbjct: 187 NEYANVEGAFHEKGASYIKWAGQMAVGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFP 246
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
GPNSP+KP +WTE+WTSF+QVYG E IRSAEDIA+H ALF+AK GSY+NYYMYHGGTN
Sbjct: 247 GPNSPNKPKMWTEDWTSFFQVYGKEPYIRSAEDIAFHAALFVAK-NGSYINYYMYHGGTN 305
Query: 305 FGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
FGRT+S+Y +TGYYDQAPLDEYGLLRQPK+GHLKELH+A+K P+L G ++ +
Sbjct: 306 FGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPM 365
Query: 365 QEAFIFQGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS 423
Q+A++F+ ++ C AFLVN D + + + F N Y L P SI IL +CK + + TAK++
Sbjct: 366 QQAYVFEDANNGCVAFLVNNDAK-ASQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNV 424
Query: 424 V---------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFK 468
+ W ++E IP L+ N LLE N TKD +DYLWY FK
Sbjct: 425 KMNTRVTTPVQVFNVPDNWNLFRETIPASQAHLLKTNALLEHTNLTKDKTDYLWYTSSFK 484
Query: 469 HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSV 528
D + + S GHV+H F+N GS HG + L+ V LING NN+S+LS
Sbjct: 485 LDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSG 544
Query: 529 MVGLPDSGAYLERRVAGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIV 587
MVGLPDSGAY+ERR GL V I G + D S WGY VGLLGEK++++ V
Sbjct: 545 MVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRV 604
Query: 588 PWS--RYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTP 645
WS + G ++PL WYKT FD P G PV +++ SMGKGE WVNG+SIGRYWVSFLTP
Sbjct: 605 KWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTP 664
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G PSQS YHIPR+FLKP+GNLLV+ EEE G P GIS++T+SV
Sbjct: 665 AGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTISVV 708
>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 850 bits (2197), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 418/810 (51%), Positives = 545/810 (67%), Gaps = 33/810 (4%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G VTYDGRSLIING R++LFSGSIHYPRSTP+MWP LI KAK GGL+V+QT VFWN
Sbjct: 25 GDKKKGVTYDGRSLIINGKRELLFSGSIHYPRSTPEMWPELIQKAKRGGLNVIQTYVFWN 84
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
+HEP+ G+F+F G DLV+FIK + G+ +R+GPFI+ EW +GGLP+WL ++P I+F
Sbjct: 85 IHEPEQGKFNFEGSYDLVKFIKTIGENGMSATIRLGPFIQAEWNHGGLPYWLREIPDIIF 144
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
RSDN PFK HM+R+ TMI+N +K +L+ASQGGPIIL+QIENEY V+ ++ G YV+
Sbjct: 145 RSDNAPFKLHMERFVTMIINKLKEEKLFASQGGPIILAQIENEYNTVQLAYRNLGVSYVQ 204
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WA +A+ L+TGVPWVMCKQ DAP PVIN CNGR CG+TF GPNSPDKP++WTENWT+ +
Sbjct: 205 WAGNMALGLKTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNSPDKPSLWTENWTAQF 264
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPL 323
+V+GD RSAED A+ VA + +K GS VNYYMYHGGTNF RTA+++V T YYD+APL
Sbjct: 265 RVFGDPPSQRSAEDTAFSVARWFSK-NGSLVNYYMYHGGTNFDRTAASFVTTRYYDEAPL 323
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG--SSECAAFLV 381
DEYGL R+PKWGHLK+LH A+ LC K +L G S EA F+ +++CAAFL
Sbjct: 324 DEYGLQREPKWGHLKDLHRALNLCKKALLWGTPNVQRLSADVEARFFEQPRTNDCAAFLA 383
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA---------------KLDSVEQ 426
N + ++ TV F Y LP SISILPDCKTV +NT K D +
Sbjct: 384 NNNTKDPETVTFRGKKYYLPAKSISILPDCKTVVYNTMTVVSQHNSRNFVKSRKTDGKLE 443
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES------VLKV 480
W+ + E IP+ + + E N TKD +DY W+ D +D + VL+V
Sbjct: 444 WKMFSETIPS--NLLVDSRIPRELYNLTKDKTDYAWFTTTINVDRNDLSARKDINPVLRV 501
Query: 481 SSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLE 540
+SLGH + AFINGEF+GSAHG +KSF L+ V L G N V+LL +VGLPDSGAY+E
Sbjct: 502 ASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLKPGINFVTLLGSLVGLPDSGAYME 561
Query: 541 RRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQP 599
R AG R VSI G D SS WG+QV L GE ++FT G R V W++ P
Sbjct: 562 HRYAGPRGVSILGLNTGTLDLSSNGWGHQVALSGETAKVFTKEGGRKVTWTKVNKDG-PP 620
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRS 659
+TWYKT FDAP G PVA+ + M KG W+NG+SIGRYW+++++P G P+QS YHIPRS
Sbjct: 621 VTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGRYWMNYISPLGEPTQSEYHIPRS 680
Query: 660 FLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKR 719
+LKPT NL+V+LEEE P I I TV+ T+C +V++ H P V SW +N++
Sbjct: 681 YLKPTNNLMVILEEEGASPEKIEILTVNRDTICSYVTEYHPPNVRSWERKNKKFTPVADD 740
Query: 720 IPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSC 779
+P +++CP+ +KI + FAS+G+P+G C N+A+G+C S S+ +VE+ CLGK SC
Sbjct: 741 A---KPAARLKCPNKKKIVAVQFASFGDPSGTCGNFAVGTCDSPISKQVVEQHCLGKTSC 797
Query: 780 TVPVWTEKFYG--DPCPGIPKALLVDAQCT 807
+P+ F G D CP + K L V +C+
Sbjct: 798 DIPMDKGLFNGKKDNCPNLTKNLAVQVKCS 827
>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
Length = 835
Score = 844 bits (2181), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/809 (51%), Positives = 539/809 (66%), Gaps = 32/809 (3%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG VTYD RSLIING R++LFSGSIHYPRSTP MWP LI KAK GGL+V+QT VFWN
Sbjct: 25 GGKQVGVTYDERSLIINGKRELLFSGSIHYPRSTPDMWPELILKAKRGGLNVIQTYVFWN 84
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
+HEP+ G+F+F G DLV+FIK + G++ LR+GPFI+ EW +GGLP+WL ++P I+F
Sbjct: 85 IHEPEQGKFNFEGPYDLVKFIKTIGENGMFATLRLGPFIQAEWNHGGLPYWLREIPDIIF 144
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
RSDN PFK HM+++ T I++MMK +L+ASQGGPIILSQIENEY V+ ++ G Y++
Sbjct: 145 RSDNAPFKHHMEKFVTKIIDMMKEEKLFASQGGPIILSQIENEYNTVQLAYKNLGVSYIQ 204
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WA +A+ L TGVPWVMCKQ DAP PVIN CNGR CG+TF GPN P+KP++WTENWT+ +
Sbjct: 205 WAGNMALGLNTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNKPNKPSLWTENWTAQF 264
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPL 323
+V+GD RSAED A+ VA + +K GS VNYYMYHGGTNF RTA+++V T YYD+APL
Sbjct: 265 RVFGDPPSQRSAEDTAFSVARWFSK-NGSLVNYYMYHGGTNFDRTAASFVTTRYYDEAPL 323
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLV 381
DEYGL R+PKWGHLK+LH A+ LC K +L G S EA ++ G+ CAAFL
Sbjct: 324 DEYGLQREPKWGHLKDLHRALNLCKKALLWGNPNVQKLSADVEARFYEQPGTKVCAAFLA 383
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QW 427
+ + + TV F Y LP SISILPDCKTV +NT + S +W
Sbjct: 384 SNNSKEAETVKFRGQEYYLPARSISILPDCKTVVYNTMTVVSQHNSRNFVKSRKTNKLEW 443
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES------VLKVS 481
Y E IP + + ++ E N TKD +DY+W+ D D VL+V+
Sbjct: 444 NMYSETIPA--QLQVDSSLPKELYNLTKDKTDYVWFTTTINVDRRDMNERKRINPVLRVA 501
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
SLGH + AF+NGEF+GSAHG +KSF L+ V L G N V+LL +VGLPDSGAY+E
Sbjct: 502 SLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHSVDLKPGINFVTLLGTLVGLPDSGAYMEH 561
Query: 542 RVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
R AG R VSI G D +S WG+QVGL GE ++FT G V W++ P+
Sbjct: 562 RYAGPRGVSILGLNTGTLDLTSNGWGHQVGLSGETAKLFTKEGGGKVTWTKV-QKAGPPV 620
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWYKT FDAP G PVA+ + M KG W+NG+SIGRYW+++++P G P+QS YHIPRS+
Sbjct: 621 TWYKTHFDAPEGKSPVAVRMTGMNKGMIWINGKSIGRYWMTYVSPLGEPTQSEYHIPRSY 680
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
LKPT NL+V+ EEE P I I TV+ T+C +V++ H P V SW +N + +
Sbjct: 681 LKPTDNLMVIFEEEEANPEKIEILTVNRDTICSYVTEYHPPSVKSWERKNNKFTPV---V 737
Query: 721 PGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
+P ++CP+ +KI + FAS+G+P G C +YA+G+CHS S+ +VE+ CLGK SC
Sbjct: 738 DNAKPAAHLKCPNQKKIIAVQFASFGDPLGTCGDYAVGTCHSLVSKQVVEEHCLGKTSCD 797
Query: 781 VPVWTEKFYG--DPCPGIPKALLVDAQCT 807
+P+ F G D CPGI K L V +C+
Sbjct: 798 IPIDKGLFAGKKDDCPGISKTLAVQVKCS 826
>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
Length = 847
Score = 838 bits (2164), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/842 (49%), Positives = 546/842 (64%), Gaps = 45/842 (5%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGG-NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
L L LL + D G G NNVTYDG+SL +NG R++LFSGSIHY RSTP WP +
Sbjct: 10 LSILLVLLPAIVAAHDHGRVAGINNVTYDGKSLFVNGRRELLFSGSIHYTRSTPDAWPDI 69
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
+ KA+ GGL+V+QT VFWN HEP+ G+F+F G DLV+FI+ VQ++G+YV LR+GPFI+
Sbjct: 70 LDKARHGGLNVIQTYVFWNAHEPEQGKFNFEGNNDLVKFIRLVQSKGMYVTLRVGPFIQA 129
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW +GGLP+WL +VPGI+FRSDNEP+K +MK Y + I+ MMK +L+A QGGPIIL+QIE
Sbjct: 130 EWNHGGLPYWLREVPGIIFRSDNEPYKKYMKAYVSKIIQMMKDEKLFAPQGGPIILAQIE 189
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEY ++ ++ EKG YV+WAA +AV L GVPW+MCKQ DAPDPVINACNGR CG+TF+
Sbjct: 190 NEYNHIQLAYEEKGDSYVQWAANMAVALDIGVPWIMCKQKDAPDPVINACNGRHCGDTFS 249
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
GPN P KP++WTENWT+ Y+V+GD RSAEDIA+ VA F +K G+ VNYYMYHGGTN
Sbjct: 250 GPNKPYKPSLWTENWTAQYRVFGDPVSQRSAEDIAFSVARFFSK-NGNLVNYYMYHGGTN 308
Query: 305 FGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
FGRT SA+ T YYD+APLDEYG+ RQPKW HL++ H A+ LC K +L GV +
Sbjct: 309 FGRTTSAFTTTRYYDEAPLDEYGMERQPKWSHLRDAHKALLLCRKAILGGVPTVQKLNDY 368
Query: 365 QEAFIFQ--GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA--- 419
E IF+ G+S C+AF+ N AT+ F Y LP SIS+LPDCKTV +NT
Sbjct: 369 HEVRIFEKPGTSTCSAFITNNHTNQAATISFRGSNYFLPAHSISVLPDCKTVVYNTQNVM 428
Query: 420 ------KLDSVE------------------------QWEEYKEAIPTYDETSLRANFLLE 449
KL S +WE + EAIP+ + LE
Sbjct: 429 NQLVYYKLISSHLIIKLIVSQHNKRNFVKSAVANNLKWELFLEAIPSSKKLESNQKIPLE 488
Query: 450 QMNTTKDASDYLWYNFRFKHDPSD---SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDK 506
KD +DY WY F+ P D ++L++ SLGH L AF+NG+++G+ HG H +K
Sbjct: 489 LYTLLKDTTDYGWYTTSFELGPEDLPKKSAILRIMSLGHTLSAFVNGQYIGTDHGTHEEK 548
Query: 507 SFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELK-DFSSFSW 565
SF E+ + GTN +S+L+ VGLPDSGAY+E R AG +++SI G + K + + W
Sbjct: 549 SFEFEQPANFKVGTNYISILATTVGLPDSGAYMEHRYAGPKSISILGLNKGKLELTKNGW 608
Query: 566 GYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGK 625
G++VGL GE+L++FT+ GS+ V W T + L+W KT F P G PVAI + MGK
Sbjct: 609 GHRVGLRGEQLKVFTEEGSKKVQWDPVTGET-RALSWLKTRFATPEGRGPVAIRMTGMGK 667
Query: 626 GEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
G WVNG+SIGR+W+SFL+P G PSQ YHIPR +L NLLV+LEEE G P I I
Sbjct: 668 GMIWVNGKSIGRHWMSFLSPLGQPSQEEYHIPRDYLNAKDNLLVVLEEEKGSPEKIEIMI 727
Query: 686 VSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASY 745
V T+C +++++ V SW S+N K P+ ++CPSG+KI + FAS+
Sbjct: 728 VDRDTICSYITENSPANVNSWGSKNGEFRSVGKN---SGPQASLKCPSGKKIVAVEFASF 784
Query: 746 GNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQ 805
GNP+G C ++A+G+C+ ++ +VEKACLGK C V V F G C G L + A+
Sbjct: 785 GNPSGYCGDFALGNCNGGAAKGVVEKACLGKEECLVEVNRANFNGQGCAGSVNTLAIQAK 844
Query: 806 CT 807
C+
Sbjct: 845 CS 846
>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
Length = 825
Score = 835 bits (2156), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/825 (49%), Positives = 549/825 (66%), Gaps = 30/825 (3%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
L LF + L +I +TYDGRSL+++G ++ FSGSIHYPRSTP MWP ++
Sbjct: 5 LKLFSITLFSIITIVCAQNAAQTITYDGRSLLLDGKGELFFSGSIHYPRSTPDMWPDILD 64
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KA+ GGL+++QT VFWN HEP+ + +F GR DLV+F+K VQ +G+YV LRIGPFI+ EW
Sbjct: 65 KARRGGLNLIQTYVFWNGHEPEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIGPFIQAEW 124
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
+GGLP+WL +VP I+FRS+NEPFK +MK Y ++++N MK +L+A QGGPIIL+QIENE
Sbjct: 125 NHGGLPYWLREVPDIIFRSNNEPFKKYMKEYVSIVINRMKEEKLFAPQGGPIILAQIENE 184
Query: 187 YGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP 246
Y ++ ++ G YV+WAAK+AV L GVPWVMCKQ DAPDPVINACNGR CG+TF GP
Sbjct: 185 YNHIQLAYEADGDNYVQWAAKMAVSLYNGVPWVMCKQKDAPDPVINACNGRHCGDTFTGP 244
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
N P KP IWTENWT+ Y+V+GD RSAEDIA+ VA F +K GS VNYYMYHGGTNFG
Sbjct: 245 NKPYKPFIWTENWTAQYRVFGDPPSQRSAEDIAFSVARFFSK-HGSLVNYYMYHGGTNFG 303
Query: 307 RTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQE 366
RT SA+ T YYD+APLDE+GL R+PKW HL++ H AV LC K +L+GV + S+ E
Sbjct: 304 RTTSAFTTTRYYDEAPLDEFGLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQKISQYHE 363
Query: 367 AFIFQG--SSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
+++ S+ CAAF+ N + T+ F Y LPP SISILPDCKTV FNT + S
Sbjct: 364 VIVYEKKESNLCAAFITNNHTQTAKTLSFRGSDYFLPPRSISILPDCKTVVFNTQNIASQ 423
Query: 425 E--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+WE + E IP+ E + E + KD +DY WY +
Sbjct: 424 HSSRHFEKSKTGNDFKWEVFSEPIPSAKELPSKQKLPAELYSLLKDKTDYGWYTTSVELG 483
Query: 471 P------SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
P SD VL++ SLGH L AF+NGE++GS HG H +K F +K V+ G N ++
Sbjct: 484 PEDIPKKSDVAPVLRILSLGHSLQAFVNGEYIGSKHGSHEEKGFEFQKPVNFKVGVNQIA 543
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYG 583
+L+ +VGLPDSGAY+E R AG + ++I G D +S WG+QVGL GE IFT+ G
Sbjct: 544 ILANLVGLPDSGAYMEHRYAGPKTITILGLMSGTIDLTSNGWGHQVGLQGENDSIFTEKG 603
Query: 584 SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
S+ V W + G ++WYKT FD P G++PVAI + M KG WVNG+SIGR+W+S+L
Sbjct: 604 SKKVEW-KDGKGKGSTISWYKTNFDTPEGTNPVAIGMEGMAKGMIWVNGESIGRHWMSYL 662
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
+P G P+QS YHIPRSFLKP NLLV+ EEE P I+I TV+ T+C ++++H P +
Sbjct: 663 SPLGKPTQSEYHIPRSFLKPKDNLLVIFEEEAISPDKIAILTVNRDTICSFITENHPPNI 722
Query: 704 ISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
S+ S+NQ+ + + + P+ I CP +KI+ + FAS+G+P+G C ++ +G C++
Sbjct: 723 RSFASKNQKLERVGENL---TPEAFITCPDQKKITAVEFASFGDPSGFCGSFIMGKCNAP 779
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYG--DPCPGIPKALLVDAQC 806
+S+ IVE+ CLGK +C+VP+ F G D CP + K L + +C
Sbjct: 780 SSKKIVEQLCLGKPTCSVPMVKATFTGGNDGCPDVVKTLAIQVKC 824
>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
Length = 806
Score = 830 bits (2145), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/802 (49%), Positives = 536/802 (66%), Gaps = 30/802 (3%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGRSLIING R++LFSGSIHYPRSTP+ W ++ KA++GG++VVQT VFWN+HE +
Sbjct: 9 VTYDGRSLIINGRRELLFSGSIHYPRSTPEEWAGILDKARQGGINVVQTYVFWNIHETEK 68
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++ + D ++FIK +Q +G+YV LR+GPFI+ EW +GGLP+WL +VP I+FRS+NEP
Sbjct: 69 GKYSIEPQYDYIKFIKLIQKKGMYVTLRVGPFIQAEWNHGGLPYWLREVPEIIFRSNNEP 128
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK HMK+Y + ++ +K A L+A QGGPIIL+QIENEY ++ +F E+G YV+WAAK+A
Sbjct: 129 FKKHMKKYVSTVIKTVKDANLFAPQGGPIILAQIENEYNHIQRAFREEGDNYVQWAAKMA 188
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L GVPW+MCKQ DAPDPVINACNGR CG+TF+GPN P KPAIWTENWT+ Y+V+GD
Sbjct: 189 VSLDIGVPWIMCKQTDAPDPVINACNGRHCGDTFSGPNKPYKPAIWTENWTAQYRVFGDP 248
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
RSAEDIA+ VA F +K GS VNYYMYHGGTNFGRT+SA+ T YYD+APLDEYG+
Sbjct: 249 PSQRSAEDIAFSVARFFSK-NGSLVNYYMYHGGTNFGRTSSAFTTTRYYDEAPLDEYGMQ 307
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
R+PKW HL+++H A+ LC + + +G S+ E +F+ GS+ CAAF+ N +
Sbjct: 308 REPKWSHLRDVHRALSLCKRALFNGASTVTKMSQHHEVIVFEKPGSNLCAAFITNNHTKV 367
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV--------------EQWEEYKEA 433
T+ F Y +PP SISILPDCKTV FNT + S +WE Y E
Sbjct: 368 PTTISFRGTDYYMPPRSISILPDCKTVVFNTQCIASQHSSRNFKRSMAANDHKWEVYSET 427
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVL 487
IPT + +E + KD SDY WY + P +D ++L++ SLGH L
Sbjct: 428 IPTTKQIPTHEKNPIELYSLLKDTSDYAWYTTSVELRPEDLPKKNDIPTILRIMSLGHSL 487
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
AF+NGEF+GS HG H +K F +K V L G N +++L+ VGLPDSGAY+E R AG +
Sbjct: 488 LAFVNGEFIGSNHGSHEEKGFEFQKPVTLKVGVNQIAILASTVGLPDSGAYMEHRFAGPK 547
Query: 548 NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
++ I G K D +S WG++VG+ GEKL IFT+ GS+ V W + ++WYKT
Sbjct: 548 SIFILGLNSGKMDLTSNGWGHEVGIKGEKLGIFTEEGSKKVQW-KEAKGPGPAVSWYKTN 606
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
F P G+DPVAI + MGKG W+NG+SIGR+W+S+L+P G P+QS YHIPR++ P N
Sbjct: 607 FATPEGTDPVAIRMTGMGKGMVWINGKSIGRHWMSYLSPLGQPTQSEYHIPRTYFNPKDN 666
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
LLV+ EEE P + I TV+ T+C V+++H P V SW +++ K + P
Sbjct: 667 LLVVFEEEIANPEKVEILTVNRDTICSFVTENHPPNVKSWAIKSE---KFQAVVNDLVPS 723
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
++CP R I + FAS+G+P G C +A+G C++ + IVEK CLGK SC VP+ +
Sbjct: 724 ASLKCPHQRTIKAVEFASFGDPAGACGAFALGKCNAPAIKQIVEKQCLGKASCLVPIDKD 783
Query: 787 KFYG--DPCPGIPKALLVDAQC 806
F D CP + KAL + +C
Sbjct: 784 AFTKGQDACPNVTKALAIQVRC 805
>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 829 bits (2142), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/832 (48%), Positives = 549/832 (65%), Gaps = 34/832 (4%)
Query: 2 GQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
GQ + + LL++ + G G VTYDGRSLI+NG R++LFSGSIHYPRSTP+MW
Sbjct: 5 GQALIAAVLSLLVS-YAAAHGIAKGAKTVTYDGRSLIVNGRRELLFSGSIHYPRSTPEMW 63
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
P ++ KAK GGL+++QT VFWN+HEP GQF+F G DLV+FIK + GLY LRIGPF
Sbjct: 64 PDILQKAKHGGLNLIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGDYGLYATLRIGPF 123
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILS 181
IE EW +GG P+WL +VP I+FRS NEPFK+HM++Y+ MI+ MMK A+L+A QGGPIIL+
Sbjct: 124 IEAEWNHGGFPYWLREVPDIIFRSYNEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILA 183
Query: 182 QIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGE 241
QIENEY ++ ++ E G YV+WA K+AV L GVPW+MCKQ DAPDPVIN CNGR CG+
Sbjct: 184 QIENEYNSIQLAYRELGVQYVQWAGKMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGD 243
Query: 242 TFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHG 301
TF GPN P+KP++WTENWT+ Y+V+GD R+AED+A+ VA FI+K G+ NYYMYHG
Sbjct: 244 TFTGPNRPNKPSLWTENWTAQYRVFGDPPSQRAAEDLAFSVARFISK-NGTLANYYMYHG 302
Query: 302 GTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
GTNFGRT S++V T YYD+APLDEYGL R+PKWGHLK+LHSA++LC K + +G
Sbjct: 303 GTNFGRTGSSFVTTRYYDEAPLDEYGLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKL 362
Query: 362 SKLQEAFIFQ--GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
K +E ++ G+ CAAFL N R AT+ F Y LPP SISILPDCKTV +NT
Sbjct: 363 GKDKEVRFYEKPGTHICAAFLTNNHSREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQ 422
Query: 420 KLDSVE---------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY- 463
++ + +WE +E IP + + +E N KD SDY W+
Sbjct: 423 RVVAQHNARNFVKSKIANKNLKWEMSQEPIPVMTDMKILTKSPMELYNFLKDRSDYAWFV 482
Query: 464 ------NFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
N+ D VL++S+LGH + AF+NG F+GSAHG + +K+F K V
Sbjct: 483 TSIELSNYDLPMK-KDIIPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFK 541
Query: 518 NGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKL 576
GTN ++LL + VGLP+SGAY+E R AG+ +V I G D ++ WG QVG+ GE +
Sbjct: 542 AGTNYIALLCMTVGLPNSGAYMEHRYAGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHV 601
Query: 577 QIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
+ +T GS V W+ +TWYKT FD P G+DPV + + SM KG AWVNG++IG
Sbjct: 602 KAYTQGGSHRVQWTA-AKGKGPAMTWYKTYFDMPEGNDPVILRMTSMAKGMAWVNGKNIG 660
Query: 637 RYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
RYW+S+L+P PSQS YH+PR++LKP+ NLLV+ EE G P I ++ V+ T+C V+
Sbjct: 661 RYWLSYLSPLEKPSQSEYHVPRAWLKPSDNLLVIFEETGGNPEEIEVELVNRDTICSIVT 720
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYA 756
+ H P V SW+ + + + +PK ++CP+ + I K+ FAS+GNP G C ++
Sbjct: 721 EYHPPHVKSWQRHDSKIRAVVDEV---KPKGHLKCPNYKVIVKVDFASFGNPLGACGDFE 777
Query: 757 IGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGD--PCPGIPKALLVDAQC 806
+G+C + NS+ +VE+ C+GK +C +P+ F G+ C I K L V +C
Sbjct: 778 MGNCTAPNSKKVVEQHCMGKTTCEIPMEAGIFDGNSGACSDITKTLAVQVRC 829
>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
Length = 844
Score = 827 bits (2137), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 415/842 (49%), Positives = 544/842 (64%), Gaps = 45/842 (5%)
Query: 6 LLCLFGLLLTTIGGSDGGG--------------GGGNNVTYDGRSLIINGHRKILFSGSI 51
+L L LL +I G + GG NVTYDG+SL ING R+ILFSGS+
Sbjct: 8 ILILMTLLSISIAGGNAGGLQHHKGRHGKHGRHMSARNVTYDGKSLFINGRREILFSGSV 67
Query: 52 HYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQG 111
HY RSTP MWP ++ KA+ GGL+V+QT VFWN HEP+PG+F+F G DLV+FI+ VQA+G
Sbjct: 68 HYTRSTPDMWPDILDKARRGGLNVIQTYVFWNAHEPEPGKFNFQGNYDLVKFIRLVQAKG 127
Query: 112 LYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLY 171
++V LR+GPFI+ EW +GGLP+WL +VPGI+FRSDNEP+KFHMK + + I+ MMK +L+
Sbjct: 128 MFVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNEPYKFHMKAFVSKIIQMMKDEKLF 187
Query: 172 ASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVI 231
A QGGPIIL+QIENEY ++ ++ EKG YV+WAA +AV GVPW+MCKQ DAPDPVI
Sbjct: 188 APQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANMAVATDIGVPWLMCKQRDAPDPVI 247
Query: 232 NACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKG 291
NACNGR CG+TFAGPN P KPAIWTENWT+ Y+V+GD RSAEDIA+ VA F +K G
Sbjct: 248 NACNGRHCGDTFAGPNKPYKPAIWTENWTAQYRVHGDPPSQRSAEDIAFSVARFFSK-NG 306
Query: 292 SYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPM 351
+ VNYYMYHGGTNFGRT+S + T YYD+APLDEYGL R+PKW HL+++H A+ LC + +
Sbjct: 307 NLVNYYMYHGGTNFGRTSSVFSTTRYYDEAPLDEYGLPREPKWSHLRDVHKALLLCRRAI 366
Query: 352 LSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILP 409
L GV + E F+ G++ CAAF+ N AT+ F Y LPP SISILP
Sbjct: 367 LGGVPSVQKLNHFHEVRTFERVGTNMCAAFITNNHTMEPATINFRGTNYFLPPHSISILP 426
Query: 410 DCKTVAFNTAKLDSVE--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTK 455
DCKTV FNT ++ S WE + EAIPT + + E + K
Sbjct: 427 DCKTVVFNTQQIVSQHNSRNYERSPAANNFHWEMFNEAIPTAKKMPINLPVPAELYSLLK 486
Query: 456 DASDYLWYNFRFKHDPSDSE------SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFT 509
D +DY WY F+ D VL+V SLGH + AF+NG+ VG+AHG H +KSF
Sbjct: 487 DTTDYAWYTTSFELSQEDMSMKPGVLPVLRVMSLGHSMVAFVNGDIVGTAHGTHEEKSFE 546
Query: 510 LEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGA-KELKDFSSFSWGYQ 568
+ V L GTN +SLLS VGLPDSGAY+E R AG ++++I G + D + WG++
Sbjct: 547 FQTPVLLRVGTNYISLLSSTVGLPDSGAYMEHRYAGPKSINILGLNRGTLDLTRNGWGHR 606
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
VGL GE ++F++ GS V W G + + L+WY+T F P G+ PVAI + M KG
Sbjct: 607 VGLKGEGKKVFSEEGSTSVKWKPLG-AVPRALSWYRTRFGTPEGTGPVAIRMSGMAKGMV 665
Query: 629 WVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
WVNG +IGRYW+S+L+P G P+QS YHIPRSFL P NLLV+ EEE P + I V+
Sbjct: 666 WVNGNNIGRYWMSYLSPLGKPTQSEYHIPRSFLNPQDNLLVIFEEEARVPAQVEILNVNR 725
Query: 689 TTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNP 748
T+C V + V SW S R H + + C +G++I + FAS+GNP
Sbjct: 726 DTICSVVGERDPANVNSWVS---RRGNFHPVVKSVGAAASMACATGKRIVAVEFASFGNP 782
Query: 749 NGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYG---DPCPGIPKALLVDAQ 805
+G C ++A+GSC+++ S+ IVE+ CLG+ +CT+ + F D CP + K L V +
Sbjct: 783 SGYCGDFAMGSCNAAASKQIVERECLGQEACTLALDRAVFNNNGVDACPDLVKQLAVQVR 842
Query: 806 CT 807
C
Sbjct: 843 CA 844
>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
Length = 830
Score = 824 bits (2129), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/803 (49%), Positives = 537/803 (66%), Gaps = 30/803 (3%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGRS+I+NG R++LFSGSIHYPR P+MWP +I KAKEGGL+V+QT VFWN+HEP
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPEIIRKAKEGGLNVIQTYVFWNIHEPVQ 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQF+F G DLV+FIK + QGLYV LRIGP+IE EW GG P+WL +VP I FRS NEP
Sbjct: 88 GQFNFEGNYDLVKFIKAIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F HMK+Y+ M+++++K +L+A QGGPII++QIENEY V+ ++ + G Y+ WAA +A
Sbjct: 148 FIHHMKKYSEMVIDLVKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYIEWAANMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L GVPW+MCKQ DAP VIN CNGR C +TF GPN P+KP++WTENWT+ Y+ +GD
Sbjct: 208 TSLYNGVPWIMCKQKDAPPQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R+AEDIA+ VA F AK G+ NYYMY+GGTN+GRT+S++V T YYD+APLDE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAK-NGTLTNYYMYYGGTNYGRTSSSFVTTRYYDEAPLDEFGLY 326
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
R+PKW HL++LH A++L + +L G ++ E +F+ GS++CAAFL N
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPTVQKINQDLEITVFEKPGSTDCAAFLTNNHTTQ 386
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKEA 433
+T+ F Y LP S+SILPDCKTV +NT + S +WE Y+E
Sbjct: 387 PSTIKFRGKDYYLPEKSVSILPDCKTVVYNTQTIVSQHNSRNFITSEKSKNLKWEMYQEK 446
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHD---PSDSESVLKVSSLGHVL 487
+PT + L+ LE + TKD SDY WY+ +HD D VL+++S+GH L
Sbjct: 447 VPTIADLPLKNREPLELYSLTKDTSDYAWYSTSITLERHDLPMRPDILPVLQIASMGHAL 506
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
AF+NGE+VG HG + +KSF +K + L GTN +++L+ VG P+SGAY+E+R AG R
Sbjct: 507 AAFVNGEYVGFGHGNNIEKSFVFQKPIILKPGTNTITILAETVGFPNSGAYMEKRFAGPR 566
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V+IQG D + +WG++VG+ GEK ++FT+ G++ V W+ +TWYKT
Sbjct: 567 GVTIQGLMAGTLDITQNNWGHEVGVFGEKQELFTEEGAKKVQWTPVTGPPKGAVTWYKTY 626
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
FDAP G++PVA+ + M KG WVNG+S+GRYW SFL+P G P+Q+ YHIPR++LKPT N
Sbjct: 627 FDAPEGNNPVALKMDKMEKGMMWVNGKSLGRYWTSFLSPLGQPTQAEYHIPRAYLKPTNN 686
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
LLV+ EE G+P I + TV+ T+C +++ H P V SW + + + +
Sbjct: 687 LLVIFEETGGHPTNIEVQTVNRDTICSIITEYHPPHVKSWERSGTDFVAVVEDL---KSG 743
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
+ CP + I K+ FASYGNP+G C N G+C+S+NS +VE+ CLGK +CT+P+ E
Sbjct: 744 AHLTCPDNKIIEKVEFASYGNPDGACGNLFNGNCNSANSLKVVEQHCLGKNTCTIPIERE 803
Query: 787 KF---YGDPCPGIPKALLVDAQC 806
+ DPCP I K L V +C
Sbjct: 804 IYDEPSKDPCPNIFKTLAVQVKC 826
>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
Length = 784
Score = 815 bits (2105), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/800 (51%), Positives = 518/800 (64%), Gaps = 71/800 (8%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G V+ D R+L+++G R++LF+G +HY RSTP+MWP+LIAKAKEGGLD++QT VFWN+
Sbjct: 37 GAPRQVSLDARALVVDGTRRLLFAGEMHYTRSTPEMWPKLIAKAKEGGLDMIQTYVFWNV 96
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEP GQ++F GR DLVRFIKE+QAQGLYV LRIGPFIE EW YGG PFWLHDVP I FR
Sbjct: 97 HEPVQGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFR 156
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
SDNEPFK HM+R+ T IVNMMK LY QGGPII SQIENEY MVEH+F G YV W
Sbjct: 157 SDNEPFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEHAFGSSGQRYVSW 216
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA +AVD QTGVPW MCKQ+DAPDPV+ G +S P + N + Y
Sbjct: 217 AAAMAVDRQTGVPWTMCKQNDAPDPVV-------------GIHSHTIPLDFP-NASRNYL 262
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLD 324
+YG++ ++RS EDIA+ V FIA+ GSYV+YYMYHGGTNFGR AS+YV T YYD APLD
Sbjct: 263 IYGNDTKLRSPEDIAFAVVYFIARKNGSYVSYYMYHGGTNFGRFASSYVTTSYYDAAPLD 322
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKD 384
EYGL+ QP WGHL+ELH+AVK +P+L G ++ + QEA IF+ S+C AFLVN D
Sbjct: 323 EYGLIWQPTWGHLRELHAAVKQSSEPLLFGTYSYLSLGQEQEAHIFETESQCVAFLVNFD 382
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS---------------VEQWEE 429
+ + + V F N+ EL P SISIL DCK V F TAK+ + + W
Sbjct: 383 RHHISEVVFRNISLELAPKSISILSDCKRVVFETAKVTAQHGSRTAEEVQSFSDINTWTA 442
Query: 430 YKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLH 488
+KE IP + N L E ++TTKD +DYLWY H+
Sbjct: 443 FKEPIPQDVSKAMYSGNRLFEHLSTTKDDTDYLWYIVGLFHN------------------ 484
Query: 489 AFINGEFVGSAHGKHSD-KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
+G HG H + L + L G N +SLLS MVG PDSGA++ERRV GL+
Sbjct: 485 ------ILGRIHGSHGGPANIILNTNISLKEGPNTISLLSAMVGSPDSGAHMERRVFGLQ 538
Query: 548 NVSIQGAKELKD-FSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
VSIQ +E ++ ++ WGYQVGL GE+ I+T GS+ V W+ + + PLTWYKT
Sbjct: 539 KVSIQQGQEPENLLNNELWGYQVGLFGERNSIYTQEGSKSVEWTTIYNLAYSPLTWYKTT 598
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
F P G+D V +NL MGKGE WVNG+SIGRYWVSF P G PSQS YHIPR FL P N
Sbjct: 599 FSTPAGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPSGNPSQSLYHIPRQFLNPQDN 658
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
+LVL EE G P I+++TVSVT +C +V++ P + + + P
Sbjct: 659 ILVLFEEMGGNPQQITVNTVSVTRVCVNVNELSAPSL---------------QYKNKEPA 703
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
V +RC G++IS I FASYGNP G+C+ GSCH+ +S ++V++ACLGK C++P+
Sbjct: 704 VDLRCQEGKQISAIEFASYGNPIGDCKKIRFGSCHAGSSESVVKQACLGKSGCSIPITPI 763
Query: 787 KFYGDPCPGIPKALLVDAQC 806
KF GDPCPGI K+LLV A C
Sbjct: 764 KFGGDPCPGIKKSLLVVANC 783
>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
Length = 759
Score = 814 bits (2103), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/793 (52%), Positives = 526/793 (66%), Gaps = 72/793 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTY+ R+L+++G R++LF+G +HYPRSTP+MWP+LIAKAKEGGLDV+QT VFWN+HEP
Sbjct: 18 VTYEQRALVLDGARRMLFAGEMHYPRSTPEMWPKLIAKAKEGGLDVIQTYVFWNVHEPIQ 77
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++F GR DLVRFIKE+QAQGLYV LRIGPFIE EW YGG PFWLHDVP I FRSDNEP
Sbjct: 78 GQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDNEP 137
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK HM+R+ T IVNMMK LY QGGPII SQIENEY MVE +F G YV WAA +A
Sbjct: 138 FKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEPAFGSSGQRYVSWAAAMA 197
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
VDLQTGVPW MCKQ+DAPDPV+ G +S P + +N + Y +YG++
Sbjct: 198 VDLQTGVPWTMCKQNDAPDPVV-------------GIHSYTIPVNF-QNDSRNYLIYGND 243
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
++RS +DI + VALFIA+ GSYV+YYMYHGGTNFGR AS+YV T YYD APLDEYGL+
Sbjct: 244 TKLRSPQDITFAVALFIARKNGSYVSYYMYHGGTNFGRFASSYVTTSYYDGAPLDEYGLI 303
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
QP WGHL+ELH+AVK +P+L G +++ + QEA IF+ ++C AFLVN D+ + +
Sbjct: 304 WQPTWGHLRELHAAVKQSSEPLLFGTYSNLSIGQEQEAHIFETETQCVAFLVNFDQHHIS 363
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS---------------VEQWEEYKEAI 434
V F N+ EL P SISIL DCK V F TAK+++ + W+ +KE I
Sbjct: 364 EVVFRNISLELAPKSISILLDCKQVVFETAKVNAQHGSRTAEEVQSFSDISTWKAFKEPI 423
Query: 435 PT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFING 493
P +++ N L E ++TTKDA+DYLWY ++ F+N
Sbjct: 424 PQDVSKSAYSGNRLFEHLSTTKDATDYLWY----------------------IVGLFLN- 460
Query: 494 EFVGSAHGKHSD-KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ 552
+G HG H + + L G N +SLLS MVG PDSGA++ERRV G+R VSIQ
Sbjct: 461 -ILGRIHGSHGGPANIIFSTNISLQEGPNTISLLSAMVGSPDSGAHMERRVFGIRKVSIQ 519
Query: 553 GAKELKD-FSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPT 611
+E ++ ++ WGYQVGL GE+ I+T S+I W+ + T+ PLTWYKT F P
Sbjct: 520 QGQEPENLLNNELWGYQVGLFGERNNIYTQ-DSKITEWTTIDNLTYSPLTWYKTTFSTPV 578
Query: 612 GSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLL 671
G+D V +NL MGKGE WVNG+SIGRYWVSF P G PSQS YHIPR FL P N LVL
Sbjct: 579 GNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPSGNPSQSLYHIPREFLNPQDNTLVLF 638
Query: 672 EEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRC 731
EE G P I+++T+SV+ +CG+V++ P + + + P V + C
Sbjct: 639 EEMGGNPQLITVNTMSVSRVCGNVNELSAPSL---------------QYKDKEPAVDLWC 683
Query: 732 PSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGD 791
P G+ IS I FASYG P G+C+ + G CH+ +S ++V++ACLGK C+VPV KF GD
Sbjct: 684 PEGKHISAIEFASYGGPTGDCKKFGFGRCHAGSSESVVKQACLGKSGCSVPVTPIKFGGD 743
Query: 792 PCPGIPKALLVDA 804
PCPGI K+LLV A
Sbjct: 744 PCPGIQKSLLVVA 756
>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
Length = 843
Score = 810 bits (2091), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/811 (49%), Positives = 536/811 (66%), Gaps = 35/811 (4%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
GG VTYD RSLIING R++LFSG+IHYPRSTP MWP LI KAK+GG++ ++T VFW
Sbjct: 42 GGQKALGVTYDARSLIINGKRELLFSGAIHYPRSTPDMWPDLIKKAKQGGINAIETYVFW 101
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
N HEP GQ++F G DLV+FIK + LY +R+GPFI+ EW +GGLP+WL +VPGI+
Sbjct: 102 NGHEPVEGQYNFEGEFDLVKFIKLIHEHKLYAVVRVGPFIQAEWNHGGLPYWLREVPGII 161
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FRSDNEPFK HMKR+ T+IV+ +K +L+A QGGPIIL+QIENEY ++ +F EKG YV
Sbjct: 162 FRSDNEPFKKHMKRFVTLIVDKLKQEKLFAPQGGPIILAQIENEYNTIQRAFREKGDSYV 221
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
+WA KLA+ L VPW+MCKQ DAPDP+IN CNGR CG+TF GPN +KPA+WTENWT+
Sbjct: 222 QWAGKLALSLNANVPWIMCKQRDAPDPIINTCNGRHCGDTFYGPNKRNKPALWTENWTAQ 281
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAP 322
Y+V+GD RSAED+AY VA F +K GS VNYYM++GGTNFGRT++++ T YYD+ P
Sbjct: 282 YRVFGDPPSQRSAEDLAYSVARFFSK-NGSMVNYYMHYGGTNFGRTSASFTTTRYYDEGP 340
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFL 380
LDE+GL R+PKWGHLK++H A+ LC + + G ++ Q+A ++Q G+S CAAFL
Sbjct: 341 LDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAAFL 400
Query: 381 VNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------- 425
N + R V F LP SIS+LPDCKTV FNT + +
Sbjct: 401 ANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSRNFVRSEIANKNF 460
Query: 426 QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHD---PSDSESVLK 479
WE +E P + + E + TKD +DY WY + D + VL+
Sbjct: 461 NWEMCREVPPV--GLGFKFDVPRELFHLTKDTTDYAWYTTSLLLGRRDLPMKKNVRPVLR 518
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V+SLGH +HA++NGE+ GSAHG +KSF L++ V L G N+++LL +VGLPDSGAY+
Sbjct: 519 VASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAVSLKEGENHIALLGYLVGLPDSGAYM 578
Query: 540 ERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
E+R AG R+++I G D S WG+QVG+ GEK ++FT+ GS+ V W++
Sbjct: 579 EKRFAGPRSITILGLNTGTLDISQNGWGHQVGIDGEKKKLFTEEGSKSVQWTK--PDQGG 636
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPR 658
PLTWYK FDAP G +PVAI + MGKG WVNG+SIGRYW ++L+P P+QS YHIPR
Sbjct: 637 PLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSPLKKPTQSEYHIPR 696
Query: 659 SFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHK 718
++LKP NL+VLLEEE G P + I TV+ T+C VS+ H P + ++N
Sbjct: 697 AYLKPK-NLIVLLEEEGGNPKDVHIVTVNRDTICSAVSEIHPPSPRLFETKNG---SLQA 752
Query: 719 RIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRS 778
++ +P+ +++CP ++I + FASYG+P G C Y IG+C + S+ +VEK CLGK S
Sbjct: 753 KVNDLKPRAELKCPGKKQIVAVEFASYGDPFGACGAYFIGNCTAPESKQVVEKYCLGKPS 812
Query: 779 CTVPVWTEKF--YGDPCPGIPKALLVDAQCT 807
C +P+ + F D C + K L V +C
Sbjct: 813 CQIPLDSIPFSNQNDACTHLRKTLAVQLKCA 843
>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
Length = 848
Score = 797 bits (2058), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/804 (48%), Positives = 531/804 (66%), Gaps = 32/804 (3%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLIING+R++L+SGSIHYPRSTP+MWP +I +AK+GGL+ +QT VFWN+HEP+
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSGR DLV+FIK ++ G+YV LR+GPFI+ EW +GGLP+WL +VPGI FR+DN P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNTP 163
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK H +RY +I++ MK +L+ASQGGPIIL QIENEY V+ ++ E G Y++WA+KL
Sbjct: 164 FKEHTERYVKVILDKMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ G+PWVMCKQ+DAPDP+INACNGR CG+TF GPN +KP++WTENWT+ ++VYGD
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVYGDP 283
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
RS EDIAY VA F +K G++VNYYMYHGGTNFGRT++ YV T YYD APLDEYGL
Sbjct: 284 PAQRSVEDIAYSVARFFSK-NGTHVNYYMYHGGTNFGRTSAHYVTTRYYDDAPLDEYGLE 342
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
R+PK+GHLK LH+A+ LC K +L G S E ++ G+ CAAFL N + +
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTES 402
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS---VEQWEEYKEAIPTYD------ 438
+ F Y +P SISILPDCKTV +NT ++ S + + K+A +D
Sbjct: 403 AEKIKFKGKEYIIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKNFDFKVFTE 462
Query: 439 --ETSLRANFLL--EQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ ++ + + E TKD +DY WY FK D +D S+ L+++SLGH LH
Sbjct: 463 TVPSKIKGDSYIPVELYGLTKDETDYGWYTTSFKIDDNDLSKKKGSKPTLRIASLGHALH 522
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
++NGE++G+ HG H +KSF +K + L G N++++L V+ G PDSG+Y+E R G R+
Sbjct: 523 VWLNGEYLGNGHGSHEEKSFVFQKPISLKEGENHLTMLGVLTGFPDSGSYMEHRYTGPRS 582
Query: 549 VSI--QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
VSI G+ L WG +VG+ GEKL I + G + V W ++ S LTWY+T
Sbjct: 583 VSILGLGSGTLDLTEENKWGNKVGMEGEKLGIHAEEGLKKVKWQKF-SGKEPGLTWYQTY 641
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
FDAP AI + MGKG WVNG+ +GRYW+SFL+P G P+Q YHIPRSFLKP N
Sbjct: 642 FDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSPLGQPTQIEYHIPRSFLKPKKN 701
Query: 667 LLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRP 725
LLV+ EEE N P I ++ T+C H+ +++ P V W +N + +
Sbjct: 702 LLVIFEEEPNVKPELIDFVIINRDTVCSHIGENYTPSVRHWTRKNDQVQAITDDV---HL 758
Query: 726 KVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWT 785
++C +KIS++ FAS+GNPNG C N+ +G+C++ S+ +VEK CLGK C +PV
Sbjct: 759 TASLKCSGTKKISEVEFASFGNPNGTCGNFTLGTCNAPVSKKVVEKYCLGKAECVIPVNK 818
Query: 786 EKFY---GDPCPGIPKALLVDAQC 806
F D CP + K L V +C
Sbjct: 819 STFQQDKKDSCPKVEKKLAVQVKC 842
>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
Length = 848
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/804 (48%), Positives = 529/804 (65%), Gaps = 32/804 (3%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLIING+R++L+SGSIHYPRSTP+MWP +I +AK+GGL+ +QT VFWN+HEP+
Sbjct: 44 VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSGR DLV+FIK ++ GLYV LR+GPFI+ EW +GGLP+WL +VPGI FR+DNEP
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNEP 163
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK H +RY ++++MMK +L+ASQGGPIIL QIENEY V+ ++ E G Y++WA+KL
Sbjct: 164 FKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ G+PWVMCKQ+DAPDP+INACNGR CG+TF GPN +KP++WTENWT+ ++V+GD
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGDP 283
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
RS EDIAY VA F +K G++VNYYMYHGGTNFGRT++ YV T YYD APLDE+GL
Sbjct: 284 PAQRSVEDIAYSVARFFSK-NGTHVNYYMYHGGTNFGRTSAHYVTTRYYDDAPLDEFGLE 342
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
R+PK+GHLK LH+A+ LC K +L G S E ++ G+ CAAFL N +
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTEA 402
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS---VEQWEEYKEAIPTYD------ 438
+ F Y +P SISILPDCKTV +NT ++ S + + K+A +D
Sbjct: 403 AEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKNFDFKVFTE 462
Query: 439 --ETSLRANFLL--EQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ ++ + + E TKD SDY WY FK D +D + L+++SLGH LH
Sbjct: 463 SVPSKIKGDSFIPVELYGLTKDESDYGWYTTSFKIDDNDLSKKKGGKPNLRIASLGHALH 522
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
++NGE++G+ HG H +KSF +K V L G N++++L V+ G PDSG+Y+E R G R+
Sbjct: 523 VWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDSGSYMEHRYTGPRS 582
Query: 549 VSI--QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
VSI G+ L WG +VG+ GE+L I + G + V W + S +TWY+T
Sbjct: 583 VSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKKVKWEK-ASGKEPGMTWYQTY 641
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
FDAP AI + MGKG WVNG+ +GRYW+SFL+P G P+Q YHIPRSFLKP N
Sbjct: 642 FDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSPLGQPTQIEYHIPRSFLKPKKN 701
Query: 667 LLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRP 725
LLV+ EEE N P I V+ T+C ++ +++ P V W +N + +
Sbjct: 702 LLVIFEEEPNVKPELIDFVIVNRDTVCSYIGENYTPSVRHWTRKNDQVQAITDDV---HL 758
Query: 726 KVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWT 785
++C +KIS + FAS+GNPNG C N+ +GSC++ S+ +VEK CLGK C +PV
Sbjct: 759 TANLKCSGTKKISAVEFASFGNPNGTCGNFTLGSCNAPVSKKVVEKYCLGKAECVIPVNK 818
Query: 786 EKF---YGDPCPGIPKALLVDAQC 806
F D CP + K L V +C
Sbjct: 819 STFEQDKKDSCPKVEKKLAVQVKC 842
>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 832
Score = 794 bits (2050), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/808 (48%), Positives = 531/808 (65%), Gaps = 32/808 (3%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G ++TYDG SLIING+R++L+SGSIHYPRSTP+MWP +I +AK+GGL+ +QT VFWN+H
Sbjct: 24 GALSITYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVH 83
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP+ G+F+FSGR DLV+FIK ++ GLYV LR+GPFI+ EW +GGLP+WL +VPGI FR+
Sbjct: 84 EPEQGKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRT 143
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
DNEPFK H +RY ++++MMK +L+ASQGGPIIL QIENEY V+ ++ E G Y++WA
Sbjct: 144 DNEPFKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWA 203
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
+KL + G+PWVMCKQ+DAPDP+INACNGR CG+TF GPN +KP++WTENWT+ ++V
Sbjct: 204 SKLVHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRV 263
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDE 325
+GD RS EDIAY VA F +K G++VNYYMYHGGTNFGRT++ YV T YYD APLDE
Sbjct: 264 FGDPPAQRSVEDIAYSVARFFSK-NGTHVNYYMYHGGTNFGRTSAHYVTTRYYDDAPLDE 322
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNK 383
+GL R+PK+GHLK LH+A+ LC K +L G S E ++ G+ CAAFL N
Sbjct: 323 FGLEREPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANN 382
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS---VEQWEEYKEAIPTYD-- 438
+ + F Y +P SISILPDCKTV +NT ++ S + + K+A +D
Sbjct: 383 NTEAAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRNFMKSKKANKNFDFK 442
Query: 439 ------ETSLRANFLL--EQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLG 484
+ ++ + + E TKD SDY WY FK D +D + L+++SLG
Sbjct: 443 VFTESVPSKIKGDSFIPVELYGLTKDESDYGWYTTSFKIDDNDLSKKKGGKPNLRIASLG 502
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LH ++NGE++G+ HG H +KSF +K V L G N++++L V+ G PDSG+Y+E R
Sbjct: 503 HALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDSGSYMEHRYT 562
Query: 545 GLRNVSI--QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G R+VSI G+ L WG +VG+ GE+L I + G + V W + S +TW
Sbjct: 563 GPRSVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKKVKWEK-ASGKEPGMTW 621
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
Y+T FDAP AI + MGKG WVNG+ +GRYW+SFL+P G P+Q YHIPRSFLK
Sbjct: 622 YQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSPLGQPTQIEYHIPRSFLK 681
Query: 663 PTGNLLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIP 721
P NLLV+ EEE N P I V+ T+C ++ +++ P V W +N + +
Sbjct: 682 PKKNLLVIFEEEPNVKPELIDFVIVNRDTVCSYIGENYTPSVRHWTRKNDQVQAITDDV- 740
Query: 722 GRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTV 781
++C +KIS + FAS+GNPNG C N+ +GSC++ S+ +VEK CLGK C +
Sbjct: 741 --HLTANLKCSGTKKISAVEFASFGNPNGTCGNFTLGSCNAPVSKKVVEKYCLGKAECVI 798
Query: 782 PVWTEKF---YGDPCPGIPKALLVDAQC 806
PV F D CP + K L V +C
Sbjct: 799 PVNKSTFEQDKKDSCPKVEKKLAVQVKC 826
>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
Length = 887
Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/811 (48%), Positives = 533/811 (65%), Gaps = 48/811 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLIING R++LFSGS+HYPRSTP MWP +I KA+ GGL+ +QT VFWN+HEP+
Sbjct: 41 VTYDGTSLIINGKRELLFSGSVHYPRSTPHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++DF GR DLV+FIK + +GLYV LR+GPFI+ EW +GGLP+WL +VP + FR++NEP
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK H +RY I+ MMK +L+ASQGGPIIL QIENEY V+ ++ E G Y++WAA L
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ G+PWVMCKQ+DAP +INACNGR CG+TF GPN DKP++WTENWT+ ++V+GD
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R+ EDIA+ VA + +K GS+VNYYMYHGGTNFGRT++ +V T YYD APLDE+GL
Sbjct: 281 PTQRTVEDIAFSVARYFSK-NGSHVNYYMYHGGTNFGRTSAHFVTTRYYDDAPLDEFGLE 339
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
+ PK+GHLK +H A++LC K + G L + E ++ G+ CAAFL N + R+
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW---------------EEYKE 432
T+ F Y LP SISILPDCKTV +NTA++ + W E + E
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSE 459
Query: 433 AIPTYDETSLRANFLL--EQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLG 484
IP+ L + L+ E TKD +DY WY K D D +++L+V+SLG
Sbjct: 460 NIPSL----LDGDSLIPGELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLG 515
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H L ++NGE+ G AHG+H KSF K V+ G N +S+L V+ GLPDSG+Y+E R A
Sbjct: 516 HALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFA 575
Query: 545 GLRNVSIQGAKE-LKDFS-SFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G R +SI G K +D + + WG+ GL GEK +++T+ GS+ V W + G +PLTW
Sbjct: 576 GPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGK--RKPLTW 633
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YKT F+ P G + VAI + +MGKG WVNG +GRYW+SFL+P G P+Q+ YHIPRSF+K
Sbjct: 634 YKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSPLGEPTQTEYHIPRSFMK 693
Query: 663 --PTGNLLVLLEEENGYPPGI---SIDTVSVT--TLCGHVSDSHLPPVISWRSQNQRTLK 715
N+LV+LEEE PG+ SID V V T+C +V + + V SW+ + + +
Sbjct: 694 GEKKKNMLVILEEE----PGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVS 749
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLG 775
K + R K +RCP +++ ++ FAS+G+P G C N+ +G C +S S+ +VEK CLG
Sbjct: 750 RSKDM---RLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLG 806
Query: 776 KRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+ C++ V E F CP I K L V +C
Sbjct: 807 RNYCSIVVARETFGDKGCPEIVKTLAVQVKC 837
>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
Length = 844
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/805 (49%), Positives = 530/805 (65%), Gaps = 34/805 (4%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLII+G R++L+SGSIHYPRSTP+MWP +I +AK+GGL+ +QT VFWN+HEPQ
Sbjct: 40 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 99
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSGR DLV+FIK ++ G+YV LR+GPFI+ EW +GGLP+WL +VPGI FR+DN+P
Sbjct: 100 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKP 159
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK H +RY MI++ MK RL+ASQGGPIIL QIENEY V+ ++ + G Y++WA+KL
Sbjct: 160 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASKLV 219
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
++ G+PWVMCKQ+DAPDP+INACNGR CG+TF GPN +KP++WTENWT+ ++V+GD
Sbjct: 220 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVFGDP 279
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
RS EDIAY VA F +K GS+VNYYMYHGGTNFGRT++ YV T YYD APLDEYGL
Sbjct: 280 PTQRSVEDIAYSVARFFSK-NGSHVNYYMYHGGTNFGRTSAHYVTTRYYDDAPLDEYGLE 338
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
R+PK+GHLK LHSA+ LC KP+L G + K E ++ G+ CAAFL N +
Sbjct: 339 REPKYGHLKHLHSALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 398
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEY---KEAIPTYD------ 438
T+ F Y + P SISILPDCKTV +NTA++ S + K+A +D
Sbjct: 399 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKKFDFKVFTE 458
Query: 439 --ETSLRANFLL--EQMNTTKDASDYLWYNFRFK----HDPSDS--ESVLKVSSLGHVLH 488
+ L N + E TKD +DY WY FK H P+ ++ ++++SLGH LH
Sbjct: 459 TLPSKLEGNSYIPVELYGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFVRIASLGHALH 518
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
++NGE++GS HG H +KSF +K V L G N++ +L V+ G PDSG+Y+E R G R
Sbjct: 519 IWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLIMLGVLTGFPDSGSYMEHRYTGPRG 578
Query: 549 VSIQG--AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
VSI G + L S WG ++G+ GEKL I T+ G + V W ++ + LTWY+
Sbjct: 579 VSILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKF-TGKAPGLTWYQAY 637
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
FDAP + AI + MGKG WVNG+ +GRYW SFL+P G P+Q YHIPRSFLKP N
Sbjct: 638 FDAPESLNAAAIRMNGMGKGLIWVNGEGVGRYWQSFLSPLGQPTQIEYHIPRSFLKPKKN 697
Query: 667 LLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISW-RSQNQRTLKTHKRIPGRR 724
LLV+ EEE N P + V+ T+C +V +++ P V W R Q+Q T
Sbjct: 698 LLVIFEEEPNVKPELMDFVIVNRDTVCSYVGENYTPSVRHWTRKQDQVQAITD----NVS 753
Query: 725 PKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVW 784
++C +KI+ + FAS+GNP G C N+ +G+C++ S+ ++EK CLGK C +PV
Sbjct: 754 LTATLKCSGTKKIAAVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVN 813
Query: 785 TEKFY---GDPCPGIPKALLVDAQC 806
F D C + K L V +C
Sbjct: 814 KSTFQQDKKDSCKNVAKTLAVQVKC 838
>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 887
Score = 790 bits (2040), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/811 (48%), Positives = 532/811 (65%), Gaps = 48/811 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLIING R++ FSGS+HYPRSTP MWP +I KA+ GGL+ +QT VFWN+HEP+
Sbjct: 41 VTYDGTSLIINGKRELFFSGSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++DF GR DLV+FIK + +GLYV LR+GPFI+ EW +GGLP+WL +VP + FR++NEP
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK H +RY I+ MMK +L+ASQGGPIIL QIENEY V+ ++ E G Y++WAA L
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ G+PWVMCKQ+DAP +INACNGR CG+TF GPN DKP++WTENWT+ ++V+GD
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R+AEDIA+ VA + +K GS+VNYYMYHGGTNFGRT++ +V T YYD APLDE+GL
Sbjct: 281 PTQRTAEDIAFSVARYFSK-NGSHVNYYMYHGGTNFGRTSAHFVTTRYYDDAPLDEFGLE 339
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
+ PK+GHLK +H A++LC K + G L + E ++ G+ CAAFL N + R+
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW---------------EEYKE 432
T+ F Y LP SISILPDCKTV +NTA++ + W E + E
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSE 459
Query: 433 AIPTYDETSLRANFLL--EQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLG 484
IP+ L + L+ E TKD +DY WY K D D +++L+V+SLG
Sbjct: 460 NIPSL----LDGDSLIPGELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLG 515
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H L ++NGE+ G AHG+H KSF K V+ G N +S+L V+ GLPDSG+Y+E R A
Sbjct: 516 HALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFA 575
Query: 545 GLRNVSIQGAKE-LKDFS-SFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G R +SI G K +D + + WG+ GL GEK +++T+ GS+ V W + G +PLTW
Sbjct: 576 GPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGE--RKPLTW 633
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YKT F+ P G + VAI + MGKG WVNG +GRYW+SFL+P G P+Q+ YHIPRSF+K
Sbjct: 634 YKTYFETPEGVNAVAIRMKGMGKGLIWVNGIGVGRYWMSFLSPLGEPTQTEYHIPRSFMK 693
Query: 663 --PTGNLLVLLEEENGYPPGI---SIDTVSVT--TLCGHVSDSHLPPVISWRSQNQRTLK 715
N+LV+LEEE PG+ SID V V T+C +V + + V SW+ + + +
Sbjct: 694 GEKKKNMLVILEEE----PGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVS 749
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLG 775
K + R K +RCP +++ ++ FAS+G+P G C N+ +G C +S S+ +VEK CLG
Sbjct: 750 RSKDM---RLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLG 806
Query: 776 KRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+ C++ V E F CP I K L V +C
Sbjct: 807 RNYCSIVVARETFGDKGCPEIVKTLAVQVKC 837
>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 846
Score = 788 bits (2034), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/807 (47%), Positives = 529/807 (65%), Gaps = 32/807 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSLII+G R+I FSGSIHYPRS P MWP LIAKAKEGGL+ ++T +FWN+HE
Sbjct: 38 GTVVSYDRRSLIIDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYIFWNIHE 97
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ GQFDF GR D+VRF K +Q +Y +R+GPFI+ EW +GGLP+WL ++P IVFR++
Sbjct: 98 PEKGQFDFEGRYDIVRFFKLIQEHNMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTN 157
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K HM+ + +I+ +K A L+ASQGGPIIL+QIENEY +E +F G Y++WAA
Sbjct: 158 NEPYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKNDGTKYIKWAA 217
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+A+ G+PW+MCKQ AP VI CNGR CG+T+ GP + P +WTENWT+ Y+V+
Sbjct: 218 NMAISTNVGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPMNKSMPLLWTENWTAQYRVF 277
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + G+ NYYMYHGGTNFGRT++A+V+ YYD+APLDE+
Sbjct: 278 GDPPSQRSAEDIAFAVARFFS-VGGTMTNYYMYHGGTNFGRTSAAFVMPKYYDEAPLDEF 336
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PKWGHL++LH A+KLC K +L G + K EA +F+ + C AFL N +
Sbjct: 337 GLYKEPKWGHLRDLHLALKLCKKALLWGKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHN 396
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEE 429
+++ T+ F Y +P SISIL DCKTV F T +++ W+
Sbjct: 397 TKDDVTLTFRGQSYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHFADQTTQNNVWQM 456
Query: 430 Y-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSS 482
+ +E +P Y ++ +R + N TKD +DY+WY FK + D ++VL+V+S
Sbjct: 457 FDEEKVPKYKQSKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNS 516
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH AF+N +FVG HG +K+FTLEK + L G N+V++L+ +G+ DSGAYLE R
Sbjct: 517 HGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHR 576
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+AG+ V I+G D ++ WG+ VGL+GE+ QI+TD G V W + +PLT
Sbjct: 577 LAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWK--PAVNDRPLT 634
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
WYK FD P+G DP+ +++ +MGKG +VNGQ IGRYW+S+ G PSQ YHIPRSFL
Sbjct: 635 WYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISYKHALGRPSQQLYHIPRSFL 694
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIP 721
+ N+LVL EEE G P I I TV +C +S+ + + SW ++ + T +
Sbjct: 695 RQKDNVLVLFEEEFGRPDAIMILTVKRDNICTFISERNPAHIKSWERKDSQITVTAADL- 753
Query: 722 GRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTV 781
+P+ + C + I +++FASYGNP G C NY IGSCH+ ++ +VEKACLGKR CT+
Sbjct: 754 --KPRATLTCSPKKLIQQVVFASYGNPMGICGNYTIGSCHTPRAKELVEKACLGKRICTL 811
Query: 782 PVWTEKFYGD-PCPGIPKALLVDAQCT 807
PV + + GD CPG L V A+C+
Sbjct: 812 PVSADVYGGDVNCPGTTATLAVQAKCS 838
>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
Length = 844
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/807 (46%), Positives = 531/807 (65%), Gaps = 32/807 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G ++YD RSL+++G R+I FSGSIHYPRS P MWP LIAKAKEGGL+ ++T VFWN+HE
Sbjct: 35 GTVISYDRRSLMVDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHE 94
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ GQF+F GR D+V+F K +Q ++ +R+GPFI+ EW +GGLP+WL ++P IVFR++
Sbjct: 95 PEKGQFNFEGRYDMVKFFKLIQEHDMFAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTN 154
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K HM+ + +++ +K A L+ASQGGPIIL+QIENEY +E +F E+G Y+ WAA
Sbjct: 155 NEPYKMHMETFVKIVIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKEEGTKYIHWAA 214
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ G+PW+MCKQ AP VI CNGR CG+T+ GP + P +WTENWT+ Y+V+
Sbjct: 215 QMAIGTNIGIPWIMCKQTKAPGDVIPTCNGRNCGDTWPGPMNKTMPLLWTENWTAQYRVF 274
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + G+ NYYMYHGGTNFGRTA+A+V+ YYD+APLDE+
Sbjct: 275 GDPPSQRSAEDIAFAVARFFS-VGGTMTNYYMYHGGTNFGRTAAAFVMPKYYDEAPLDEF 333
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PKWGHL++LH A+KLC K +L G + K EA +F+ + C AFL N +
Sbjct: 334 GLYKEPKWGHLRDLHLALKLCKKALLWGKPSTEKLGKQLEARVFEIPEQKVCVAFLSNHN 393
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQ---WEE 429
+++ T+ F Y +P SISIL DCKTV F T + D Q W+
Sbjct: 394 TKDDVTLTFRGQPYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHFADQTNQNNVWQM 453
Query: 430 Y-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSS 482
+ +E +P Y + +R + N TKD +DY+WY FK +P D ++V++V+S
Sbjct: 454 FDEEKVPKYKQAKIRTRKAADLYNLTKDKTDYVWYTSSFKLEPDDMPIRRDIKTVVEVNS 513
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH AF+N +F G HG +K+FTLEK + L G N+V++L+ +G+ DSGAYLE R
Sbjct: 514 HGHASVAFVNNKFAGCGHGTKMNKAFTLEKPMELKKGVNHVAVLASSMGMMDSGAYLEHR 573
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+AG+ V I G D ++ WG+ VGL+GE+ +I+T+ G V W + +PLT
Sbjct: 574 LAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGEQKEIYTEKGMASVTWK--PAVNDKPLT 631
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
WYK FD P+G DP+ +++ +MGKG +VNGQ IGRYW+S+ G PSQ YHIPRSFL
Sbjct: 632 WYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIGRYWMSYKHALGRPSQQLYHIPRSFL 691
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIP 721
+P N+LVL EEE G P I I TV +C ++S+ + + SW ++ + T +
Sbjct: 692 RPKDNVLVLFEEEFGRPDAIMILTVKRDNICTYISERNPAHIKSWERKDSQITATADDLK 751
Query: 722 GRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTV 781
R + CP + I +++FASYGNP G C NY IGSCH+ ++ +VEK+CLGKR+CT+
Sbjct: 752 AR---ATLTCPPKKLIQQVVFASYGNPVGICGNYTIGSCHTPRAKEVVEKSCLGKRTCTL 808
Query: 782 PVWTEKFYGD-PCPGIPKALLVDAQCT 807
PV + + GD CPG L V A+C+
Sbjct: 809 PVSADVYGGDVNCPGTTATLAVQAKCS 835
>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
Length = 850
Score = 786 bits (2029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/808 (47%), Positives = 530/808 (65%), Gaps = 32/808 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+ +GHR+I SGSIHYPRS P MWP LIAKAKEGGL+ ++T VFWN+HE
Sbjct: 40 GTVVSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHE 99
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ G+F+F G+ D+VRF + +Q +Y +R+GPFI+ EW +GGLP+WL ++P IVFR++
Sbjct: 100 PEKGEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTN 159
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K HM+ + +I+ +K A L+ASQGGPIIL+QIENEY +E +F ++G Y+ WAA
Sbjct: 160 NEPYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAA 219
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+A+ G+PW+MCKQ AP VI CNGR CG+T+ GP + P +WTENWT+ Y+V+
Sbjct: 220 KMAISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVF 279
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + G+ NYYMYHGGTNFGRT++A+V+ YYD+APLDE+
Sbjct: 280 GDPPSQRSAEDIAFAVARFFS-VGGTLANYYMYHGGTNFGRTSAAFVMPKYYDEAPLDEF 338
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PKWGHL++LH A+KLC K +L G + K EA +F+ + C AFL N +
Sbjct: 339 GLYKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHN 398
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEE 429
+++AT+ F Y +P SIS+L DC+TV F T +++ WE
Sbjct: 399 TKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQTAQNNVWEM 458
Query: 430 YK-EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSS 482
+ E +P Y + +R + N TKD +DY+WY FK + SD ++VL+V+S
Sbjct: 459 FDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKTVLEVNS 518
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH AF+N +FVG HG +K+FTLEK + L G N+V++L+ +G+ DSGAY+E R
Sbjct: 519 HGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSGAYMEHR 578
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+AG+ V I G D ++ WG+ VGL+GE+ QI+TD G V W + +PLT
Sbjct: 579 LAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWK--PAMNDRPLT 636
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
WYK FD P+G DPV +++ +MGKG +VNGQ IGRYW+S+ G PSQ YH+PRSFL
Sbjct: 637 WYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISYKHALGRPSQQLYHVPRSFL 696
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW-RSQNQRTLKTHKRI 720
+ N+LVL EEE G P I I TV +C +S+ + ++SW R +Q T K +
Sbjct: 697 RQKDNMLVLFEEEFGRPDAIMILTVKRDNICTFISERNPAHIMSWERKDSQITAKANA-- 754
Query: 721 PGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
R + + CP + I +++FASYGNP G C NY +GSCH+ ++ +VEKACLGKR CT
Sbjct: 755 DDLRARAALACPPKKLIQQVVFASYGNPAGICGNYTVGSCHTPRAKEVVEKACLGKRVCT 814
Query: 781 VPVWTEKFYGDP-CPGIPKALLVDAQCT 807
+PV + + GD C G L V A+C+
Sbjct: 815 LPVAADVYGGDANCSGTTATLAVQAKCS 842
>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
Length = 842
Score = 785 bits (2028), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/826 (50%), Positives = 529/826 (64%), Gaps = 54/826 (6%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD +++I+NG R+IL SGSIHYPRSTP+MWP LI KAKEGG+DV+QT VFWN H
Sbjct: 27 GLASVSYDHKAIIVNGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGH 86
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP+ G++ F R DLV+FIK V GLYV LR+GP+ EW +GG P WL VPGI FR+
Sbjct: 87 EPEQGKYYFEERYDLVKFIKLVHQAGLYVNLRVGPYACAEWNFGGFPVWLKYVPGISFRT 146
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
DNEPFK M+++ T IVNMMKA RLY SQGGPIILSQIENEYG +E F E+G Y WA
Sbjct: 147 DNEPFKAAMQKFTTKIVNMMKAERLYESQGGPIILSQIENEYGPLEVRFGEQGKSYAEWA 206
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
AK+A+DL TGVPW+MCKQDDAPDPVIN CNG C + PN KP IWTE WT+++
Sbjct: 207 AKMALDLGTGVPWLMCKQDDAPDPVINTCNGFYCDYFY--PNKAYKPKIWTEAWTAWFTE 264
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLD 324
+G R ED+A+ VA FI + GS++NYYMYHGGTNFGRTA +V T Y APLD
Sbjct: 265 FGSPVPYRPVEDLAFGVANFI-QTGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPLD 323
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNK 383
E+GLLRQPKWGHLK+LH A+KLC ++SG Q+A +F+ +S CAAFL N
Sbjct: 324 EFGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTALGNYQKAHVFRSTSGACAAFLANN 383
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYK 431
D + ATV F N Y LPP SISILPDCK +NTA++ + W+ Y
Sbjct: 384 DPNSFATVAFGNKHYNLPPWSISILPDCKHTVYNTARVGAQSALMKMTPANEGYSWQSYN 443
Query: 432 EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
+ YD+ + LLEQ+NTT+D SDYLWY K DPS+ + L VSS G
Sbjct: 444 DQTAFYDDNAFTVVGLLEQLNTTRDVSDYLWYMTDVKIDPSEGFLRSGNWPWLTVSSAGD 503
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH F+NG+ G+ +G + T K V+L G N +SLLS+ VGLP+ G + E G
Sbjct: 504 ALHVFVNGQLAGTVYGSLKKQKITFSKAVNLRAGVNKISLLSIAVGLPNIGPHFETWNTG 563
Query: 546 -LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L VS+ G E K D + W Y+VGL GE L + + GS V W GS + QPLT
Sbjct: 564 VLGPVSLSGLDEGKRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWVE-GSLVAQRQPLT 622
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------- 642
WYKT F+AP G++P+A+++ SMGKG+ W+NGQSIGRYW +
Sbjct: 623 WYKTTFNAPAGNEPLALDMNSMGKGQVWINGQSIGRYWPGYKASGTCDACNYAGPFNEKK 682
Query: 643 -LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
L+ G SQ WYH+PRS+L PTGNLLV+ EE G P GIS+ + ++C +++ P
Sbjct: 683 CLSNCGDASQRWYHVPRSWLHPTGNLLVVFEEWGGDPNGISLVKRELASVCADINEWQ-P 741
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+++W Q Q + K K + RPK + C SG+KI+ I FAS+G P G C +++ GSCH
Sbjct: 742 QLVNW--QLQASGKVDKPL---RPKAHLSCTSGQKITSIKFASFGTPQGVCGSFSEGSCH 796
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ +S EK C+G+ SCTVPV E F GDPCP + K L V+A C+
Sbjct: 797 AHHSYDAFEKYCIGQESCTVPVTPEIFGGDPCPSVMKKLSVEAVCS 842
>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 841
Score = 785 bits (2027), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/808 (48%), Positives = 530/808 (65%), Gaps = 34/808 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYD RSL+I+G R+I FSGSIHYPRS WP LIA+AKEGGL+V+++ VFWN+HE
Sbjct: 33 GTVITYDRRSLMIDGRREIFFSGSIHYPRSPFHEWPDLIARAKEGGLNVIESYVFWNIHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ G ++F GR D+++F K +Q ++ +RIGPF++ EW +GGLP+WL +VP IVFR+D
Sbjct: 93 PEMGVYNFEGRYDMIKFFKLIQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIVFRTD 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K M+++ T++VN +K A+L+ASQGGPIIL+QIENEY +E +F E G Y+ WAA
Sbjct: 153 NEPYKKLMQKFVTLVVNKLKDAKLFASQGGPIILAQIENEYQHMEAAFKENGTRYIDWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+A+ TGVPW+MCKQ AP VI CNGR CG+T+ GP +KP +WTENWT+ Y+V+
Sbjct: 213 KMAISTSTGVPWIMCKQTKAPAEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVF 272
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + GS VNYYMYHGGTNFGRT +++V+ YYD+APLDE+
Sbjct: 273 GDPPSQRSAEDIAFAVARFFS-VGGSMVNYYMYHGGTNFGRTGASFVMPRYYDEAPLDEF 331
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PKWGHL++LH A++LC K +L G + KL EA +F+ + C AFL N +
Sbjct: 332 GMYKEPKWGHLRDLHHALRLCKKALLRGNPSTQPLGKLYEARLFEIPEQKVCVAFLSNHN 391
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEE 429
+ + TV F Y +P S+SIL DCKTV F+T +++ WE
Sbjct: 392 TKEDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNAQHNQRTFHLTDQTLQNNVWEM 451
Query: 430 YKEA--IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS------DSESVLKVS 481
Y E +PTY T+ R+ LE N TKD +DYLWY FK + D + VL+ S
Sbjct: 452 YTEGDKVPTYKFTTDRSEKPLEAYNMTKDKTDYLWYTTSFKLEAEDLPFRQDIKPVLEAS 511
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S GH + AF+NG+ VG+AHG +K+F+LEK + + G N+VS+LS +GL DSGAYLE
Sbjct: 512 SHGHAMVAFVNGKLVGAAHGTKMNKAFSLEKPIEVRAGINHVSILSSTLGLQDSGAYLEH 571
Query: 542 RVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
R AG+ +V+IQG D SS WG+ VGL GE+ Q D G V W + PL
Sbjct: 572 RQAGVHSVTIQGLNTGTLDLSSNGWGHIVGLDGERKQAHMDKGGE-VQWK--PAVFDLPL 628
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWY+ FD P+G DPV I+L MGKG +VNG+ +GRYW S+ G PSQ YH+PR F
Sbjct: 629 TWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLGRYWSSYKHALGRPSQYLYHVPRCF 688
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
LKPTGN+L + EEE G P I I TV +C +S+ + V SW ++ + +
Sbjct: 689 LKPTGNVLTIFEEEGGRPDAIMILTVKRDNICSFISEKNPGHVRSWERKDSQLTVVADDL 748
Query: 721 PGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
+P+ + CP + I +++FASYGNP G C NY +G+CH+ ++ +VEKAC+GK+SC
Sbjct: 749 ---KPRAVLTCPEKKTIQQVVFASYGNPLGICGNYTVGNCHTPKAKEVVEKACVGKKSCV 805
Query: 781 VPVWTEKFYGD-PCPGIPKALLVDAQCT 807
+ V E + GD CPG L V A+C+
Sbjct: 806 LAVSHEVYGGDLNCPGTTATLAVQAKCS 833
>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
Precursor
gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
Length = 845
Score = 785 bits (2026), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/804 (48%), Positives = 526/804 (65%), Gaps = 32/804 (3%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLII+G R++L+SGSIHYPRSTP+MWP +I +AK+GGL+ +QT VFWN+HEPQ
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSGR DLV+FIK +Q G+YV LR+GPFI+ EW +GGLP+WL +VPGI FR+DN+
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK H +RY MI++ MK RL+ASQGGPIIL QIENEY V+ ++ + G Y++WA+ L
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
++ G+PWVMCKQ+DAPDP+INACNGR CG+TF GPN +KP++WTENWT+ ++V+GD
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
RS EDIAY VA F +K G++VNYYMYHGGTNFGRT++ YV T YYD APLDEYGL
Sbjct: 281 PTQRSVEDIAYSVARFFSK-NGTHVNYYMYHGGTNFGRTSAHYVTTRYYDDAPLDEYGLE 339
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
++PK+GHLK LH+A+ LC KP+L G + K E ++ G+ CAAFL N +
Sbjct: 340 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 399
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEY---KEAIPTYD------ 438
T+ F Y + P SISILPDCKTV +NTA++ S + K+A +D
Sbjct: 400 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKKFDFKVFTE 459
Query: 439 --ETSLRANFLL--EQMNTTKDASDYLWYNFRFK----HDPSDS--ESVLKVSSLGHVLH 488
+ L N + E TKD +DY WY FK H P+ ++ ++++SLGH LH
Sbjct: 460 TLPSKLEGNSYIPVELYGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFVRIASLGHALH 519
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
A++NGE++GS HG H +KSF +K V L G N++ +L V+ G PDSG+Y+E R G R
Sbjct: 520 AWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGSYMEHRYTGPRG 579
Query: 549 VSIQG--AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
+SI G + L S WG ++G+ GEKL I T+ G + V W ++ + LTWY+T
Sbjct: 580 ISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKF-TGKAPGLTWYQTY 638
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
FDAP I + MGKG WVNG+ +GRYW SFL+P G P+Q YHIPRSFLKP N
Sbjct: 639 FDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSPLGQPTQIEYHIPRSFLKPKKN 698
Query: 667 LLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRP 725
LLV+ EEE N P + V+ T+C +V +++ P V W + + +
Sbjct: 699 LLVIFEEEPNVKPELMDFAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQAITDNV---SL 755
Query: 726 KVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWT 785
++C +KI+ + FAS+GNP G C N+ +G+C++ S+ ++EK CLGK C +PV
Sbjct: 756 TATLKCSGTKKIAAVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNK 815
Query: 786 EKFY---GDPCPGIPKALLVDAQC 806
F D C + K L V +C
Sbjct: 816 STFQQDKKDSCKNVVKMLAVQVKC 839
>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
Length = 840
Score = 783 bits (2023), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/821 (49%), Positives = 516/821 (62%), Gaps = 53/821 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R++ ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 30 VSYDHRAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 89
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+FIK VQA GLYV LRIGP+I EW +GG P WL VPGI FR+DN P
Sbjct: 90 GNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDNGP 149
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK+ +L+ SQGGPIILSQIENE+G VE G Y +WAA +A
Sbjct: 150 FKAAMQKFTEKIVSMMKSEKLFESQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAADMA 209
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCKQDDAPDPVIN CNG C E F PN KP +WTENWT +Y +G
Sbjct: 210 VKLGTGVPWVMCKQDDAPDPVINTCNGFYC-ENFK-PNKDYKPKLWTENWTGWYTEFGGA 267
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R AED+A+ VA FI + GS++NYYMYHGGTNFGRT++ + YD APLDEYGL
Sbjct: 268 VPYRPAEDLAFSVARFI-QNGGSFMNYYMYHGGTNFGRTSAGLFIATSYDYDAPLDEYGL 326
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R PKWGHL++LH A+KLC ++S + QEA +FQ S CAAFL N D + +
Sbjct: 327 TRDPKWGHLRDLHKAIKLCEPALVSVDPTVKSLGSNQEAHVFQSKSSCAAFLANYDTKYS 386
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEY-KEAIP 435
V F N Y+LPP SISILPDCKT FNTA+L + W+ Y +EA
Sbjct: 387 VKVTFGNGQYDLPPWSISILPDCKTAVFNTARLGAQSSQMKMTPVGGALSWQSYIEEAAT 446
Query: 436 TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHA 489
Y + + L EQ+N T+DASDYLWY D + VL + S GH LH
Sbjct: 447 GYTDDTTTLEGLWEQINVTRDASDYLWYMTNVNIDSDEGFLKNGDSPVLTIFSAGHSLHV 506
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
FING+ G+ +G + T + V L G N +SLLSV VGLP+ G + E+ AG L
Sbjct: 507 FINGQLAGTVYGSLENPKLTFSQNVKLTAGINKISLLSVAVGLPNVGVHFEKWNAGILGP 566
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWYKTV 606
V+++G E +D S + W Y++GL GE L + T GS V W S+ QPLTWYK
Sbjct: 567 VTLKGLNEGTRDLSGWKWSYKIGLKGEALSLHTVTGSSSVEWVEGSLSAKKQPLTWYKAT 626
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TPQ 646
FDAP G+DPVA+++ SMGKG+ WVNGQSIGR+W ++ +
Sbjct: 627 FDAPEGNDPVALDMSSMGKGQIWVNGQSIGRHWPAYTARGSCSACNYAGTYDDKKCRSNC 686
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW 706
G PSQ WYH+PRS+L P+GNLLV+ EE G P GIS+ + ++C + + P + +W
Sbjct: 687 GEPSQRWYHVPRSWLNPSGNLLVVFEEWGGEPSGISLVKRTTGSVCADIFEGQ-PALKNW 745
Query: 707 RSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSR 766
+ + R+ +PK + CP G+KISKI FASYG+P G C ++ GSCH+ S
Sbjct: 746 Q------MIALGRLDHLQPKAHLWCPHGQKISKIKFASYGSPQGTCGSFKAGSCHAHKSY 799
Query: 767 AIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
EK C+GK+SC+V V E F GDPCP K L V+A CT
Sbjct: 800 DAFEKKCIGKQSCSVTVAAEVFGGDPCPDSSKKLSVEAVCT 840
>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
Length = 823
Score = 783 bits (2023), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/809 (47%), Positives = 529/809 (65%), Gaps = 33/809 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +T+D RSL+++G R + FSGSIHYPRS P MWP LIA+AKEGGL+V+++ VFWN HE
Sbjct: 12 GTAITFDRRSLMVDGRRDLFFSGSIHYPRSPPHMWPDLIARAKEGGLNVIESYVFWNGHE 71
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ G ++F GR D+++F K VQ ++ +RIGPF++ EW +GGLP+WL +VP I+FR++
Sbjct: 72 PEMGVYNFEGRYDMIKFFKLVQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIIFRTN 131
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK HM+++ TMIVN +K A+L+ASQGGPIIL+QIENEY +E +F E G Y+ WAA
Sbjct: 132 NEPFKKHMQKFVTMIVNKLKDAKLFASQGGPIILAQIENEYQHLEAAFKENGTTYIHWAA 191
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+A DL GVPW+MCKQ AP VI CNGR CG+T+ GP +KP +WTENWT+ Y+V+
Sbjct: 192 KMASDLNIGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPTDKNKPLLWTENWTAQYRVF 251
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + G+ VNYYMYHGGTNFGRT +++V+ YYD+APLDE+
Sbjct: 252 GDPPSQRSAEDIAFAVARFYS-VGGTMVNYYMYHGGTNFGRTGASFVMPRYYDEAPLDEF 310
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PKWGHL++LH A++LC K +L G + KL EA +F+ + C AFL N +
Sbjct: 311 GLYKEPKWGHLRDLHHALRLCKKAILWGNPSNQPLGKLYEARLFEIPEQKICVAFLSNHN 370
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEE 429
+ + TV F Y +P S+SIL DCKTV F+T ++S WE
Sbjct: 371 TKEDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNSQHNQRTFHFSDQTVQGNVWEM 430
Query: 430 YKEA--IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVS 481
Y E+ +PTY T++R LE N TKD +DY+WY FK + D VL+VS
Sbjct: 431 YTESDKVPTYKFTNIRTQKPLEAYNLTKDKTDYVWYTTSFKLEAEDLPFRKDIWPVLEVS 490
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S GH + AF+NG++VG+ HG +K+FT+EK + + G N+VS+LS +G+ DSG YLE
Sbjct: 491 SHGHAMVAFVNGKYVGAGHGTKINKAFTMEKPIEVRTGINHVSILSTTLGMQDSGVYLEH 550
Query: 542 RVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
R AG+ V+IQG D +S WG+ VGL GE+ T+ G V W + +PL
Sbjct: 551 RQAGIDGVTIQGLNTGTLDLTSNGWGHLVGLEGERRNAHTEKGGDGVQW--VPAVFDRPL 608
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWY+ FD PTG DPV I++ MGKG +VNG+ +GRYW S+ G PSQ YH+PR F
Sbjct: 609 TWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGLGRYWSSYKHALGRPSQYLYHVPRCF 668
Query: 661 LKPTGNLLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKR 719
LKPTGN++ + EEE G P GI I TV +C +S+ + V SW ++
Sbjct: 669 LKPTGNVMTIFEEEGGGQPDGIMILTVKRDNICSFISEKNPAHVKSWERKDSHLKSVAD- 727
Query: 720 IPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSC 779
+P+ + CP + I +++FASYGNP G C NY +G+CH+ ++ IVEKAC+GK+SC
Sbjct: 728 -ADLKPQAVLSCPEKKLIQQVVFASYGNPLGICGNYTVGNCHAPKAKEIVEKACVGKKSC 786
Query: 780 TVPVWTEKFYGD-PCPGIPKALLVDAQCT 807
+ V E + D CPG L V A+C+
Sbjct: 787 VLQVSHEVYGADLNCPGSTGTLAVQAKCS 815
>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
Flags: Precursor
gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
sativa Japonica Group]
gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
Length = 848
Score = 782 bits (2019), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/812 (47%), Positives = 531/812 (65%), Gaps = 35/812 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYD RSLII+GHR+I FSGSIHYPRS P WP LI+KAKEGGL+V+++ VFWN HE
Sbjct: 30 GTVITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHE 89
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ G ++F GR DL++F K +Q + +Y +RIGPF++ EW +GGLP+WL ++P I+FR++
Sbjct: 90 PEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTN 149
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK +MK++ T+IVN +K A+L+ASQGGPIIL+QIENEY +E +F E G Y+ WAA
Sbjct: 150 NEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAA 209
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+A+ TGVPW+MCKQ AP VI CNGR CG+T+ GP KP +WTENWT+ Y+V+
Sbjct: 210 KMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVF 269
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + G+ NYYMYHGGTNFGR +A+V+ YYD+APLDE+
Sbjct: 270 GDPPSQRSAEDIAFSVARFFS-VGGTMANYYMYHGGTNFGRNGAAFVMPRYYDEAPLDEF 328
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PKWGHL++LH A++ C K +L G KL EA +F+ + C AFL N +
Sbjct: 329 GLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHN 388
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEE 429
+ + TV F Y + SISIL DCKTV F+T ++S WE
Sbjct: 389 TKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM 448
Query: 430 Y-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSS 482
Y +E IP Y +TS+R LEQ N TKD +DYLWY F+ + D + VL+VSS
Sbjct: 449 YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPVLEVSS 508
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH + AF+N FVG HG +K+FT+EK + L G N+V++LS +GL DSG+YLE R
Sbjct: 509 HGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHR 568
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+AG+ V+I+G D ++ WG+ VGL GE+ ++ ++ G V W +QPLT
Sbjct: 569 MAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWK--PGKDNQPLT 626
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
WY+ FD P+G+DPV I+L MGKG +VNG+ +GRYWVS+ G PSQ YH+PRS L
Sbjct: 627 WYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHALGKPSQYLYHVPRSLL 686
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWR-----SQNQRTLKT 716
+P GN L+ EEE G P I I TV +C +++ + P + W SQ +
Sbjct: 687 RPKGNTLMFFEEEGGKPDAIMILTVKRDNICTFMTEKN-PAHVRWSWESKDSQPKAVAGA 745
Query: 717 HKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGK 776
G +P + CP+ + I ++FASYGNP G C NY +GSCH+ ++ +VEKAC+G+
Sbjct: 746 GAGAGGLKPTAVLSCPTKKTIQSVVFASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGR 805
Query: 777 RSCTVPVWTEKFYGD-PCPGIPKALLVDAQCT 807
++C++ V +E + GD CPG L V A+C+
Sbjct: 806 KTCSLVVSSEVYGGDVHCPGTTGTLAVQAKCS 837
>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
Length = 848
Score = 779 bits (2012), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/812 (47%), Positives = 530/812 (65%), Gaps = 35/812 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYD RSLII+GHR+I FSGSIHYPRS P WP LI+KAKEGGL+V+++ VFWN HE
Sbjct: 30 GTVITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHE 89
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ G ++F GR DL++F K +Q + +Y +RIGPF++ EW +GGLP+WL ++P I+FR++
Sbjct: 90 PEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTN 149
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK +MK++ T+IVN +K A+L+ASQGGPIIL+QIENEY +E +F E G Y+ WAA
Sbjct: 150 NEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAA 209
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+A+ TGVPW+MCKQ AP VI CNGR CG+T+ GP KP +WTENWT+ Y+V+
Sbjct: 210 KMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVF 269
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + G+ NYYMYHGGTNFGR +A+V+ YYD+AP DE+
Sbjct: 270 GDPPSQRSAEDIAFSVARFFS-VGGTMANYYMYHGGTNFGRNGAAFVMPRYYDEAPFDEF 328
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PKWGHL++LH A++ C K +L G KL EA +F+ + C AFL N +
Sbjct: 329 GLYKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHN 388
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEE 429
+ + TV F Y + SISIL DCKTV F+T ++S WE
Sbjct: 389 TKEDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM 448
Query: 430 Y-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSS 482
Y +E IP Y +TS+R LEQ N TKD +DYLWY F+ + D + VL+VSS
Sbjct: 449 YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPVLEVSS 508
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH + AF+N FVG HG +K+FT+EK + L G N+V++LS +GL DSG+YLE R
Sbjct: 509 HGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHR 568
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+AG+ V+I+G D ++ WG+ VGL GE+ ++ ++ G V W +QPLT
Sbjct: 569 MAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWK--PGKDNQPLT 626
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
WY+ FD P+G+DPV I+L MGKG +VNG+ +GRYWVS+ G PSQ YH+PRS L
Sbjct: 627 WYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHALGKPSQYLYHVPRSLL 686
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWR-----SQNQRTLKT 716
+P GN L+ EEE G P I I TV +C +++ + P + W SQ +
Sbjct: 687 RPKGNTLMFFEEEGGKPDAIMILTVKRDNICTFMTEKN-PAHVRWSWESKDSQPKAVAGA 745
Query: 717 HKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGK 776
G +P + CP+ + I ++FASYGNP G C NY +GSCH+ ++ +VEKAC+G+
Sbjct: 746 GAGAGGFKPTAVLSCPTKKTIQSVVFASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGR 805
Query: 777 RSCTVPVWTEKFYGD-PCPGIPKALLVDAQCT 807
++C++ V +E + GD CPG L V A+C+
Sbjct: 806 KTCSLVVSSEVYGGDVHCPGTTGTLAVQAKCS 837
>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 845
Score = 778 bits (2010), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/827 (49%), Positives = 526/827 (63%), Gaps = 60/827 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R++LFSGSIHYPRSTP+MW LI KAKEGGLDVV+T VFWN+HEP
Sbjct: 27 DVTYDRKAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 87 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRADNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MK YA IVN+MK+ L+ SQGGPIILSQIENEYG G Y WAA +
Sbjct: 147 PFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWAANM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN CNG C F PN P KPAIWTE W+ ++ +G
Sbjct: 207 AVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFF--PNKPYKPAIWTEAWSGWFSEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI + GS+VNYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 265 PLHQRPVQDLAFAVAQFIQR-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH AVK+C K ++S + LQ+A+++ + CAAFL N D +
Sbjct: 324 LIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSETGGCAAFLSNNDWK 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE Y E
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSKMEMLPTNSEMLSWETYSED 443
Query: 434 IPTYDE-TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
I D+ +S+R+ LLEQ+N T+D SDYLWY D +ES L V + G
Sbjct: 444 ISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSV--DIGSTESFLHGGELPTLIVETTG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING+ GSA G ++ F + V+L G+N ++LLSV VGLP+ G + E
Sbjct: 502 HAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRIALLSVAVGLPNIGGHFETWST 561
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPL 600
G L V+IQG K D S W YQVGL GE + + + G V W + + QPL
Sbjct: 562 GVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVSTNGISAVDWMQGSLIAQKQQPL 621
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+K F+ P G +P+A+++ SMGKG+ W+NGQSIGRYW ++ T
Sbjct: 622 TWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRYWTAYATGDCNGCQYSGVFRPPK 681
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LKPT NLLVL EE G P IS+ SVT +C +V++ H P
Sbjct: 682 CQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGGDPTRISLVKRSVTNVCSNVAEYH-P 740
Query: 702 PVISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
+ +W+ +N +T + H PKV+I C G+ IS I FAS+G P G C ++ G+C
Sbjct: 741 NIKNWQIENYGKTEEFH------LPKVRIHCAPGQSISSIKFASFGTPLGTCGSFKQGTC 794
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
H+ +S A+VEK CLG+++C V + F DPCP + K L V+A CT
Sbjct: 795 HAPDSHAVVEKKCLGRQTCAVTISNSNFGEDPCPNVLKRLSVEAHCT 841
>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
Length = 838
Score = 776 bits (2003), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 411/826 (49%), Positives = 528/826 (63%), Gaps = 54/826 (6%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD R++I+NG R+IL SGS+HYPRSTP+MWP +I KAKEGG+DV+QT VFWN H
Sbjct: 23 GTASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGH 82
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EPQ G++ F GR DLV+FIK V GLYV LR+GP+ EW +GG P WL VPGI FR+
Sbjct: 83 EPQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRT 142
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
DN PFK M+++ IVNMMKA RLY +QGGPIILSQIENEYG +E G Y +WA
Sbjct: 143 DNGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWA 202
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
AK+AV L TGVPWVMCKQDDAPDP+INACNG C + PN KP IWTE WT+++
Sbjct: 203 AKMAVGLDTGVPWVMCKQDDAPDPIINACNGFYC--DYFSPNKAYKPKIWTEAWTAWFTG 260
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLD 324
+G+ R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLD
Sbjct: 261 FGNPVPYRPAEDLAFSVAKFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLD 319
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNK 383
EYGLLRQPKWGHLK+LH A+KLC ++SG QEA +F+ + CAAFL N
Sbjct: 320 EYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANY 379
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------WEEYK 431
D+ + ATV F+N Y LPP SISILPDCK FNTA++ + W+ +
Sbjct: 380 DQHSFATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSAQMKMTPVSRGLPWQSFN 439
Query: 432 EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
E +Y+++S LLEQ+NTT+D SDYLWY+ K D + L + S GH
Sbjct: 440 EETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWLTIMSAGH 499
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH F+NG+ G+A+G T K V+L G N +SLLS+ VGLP+ G + E AG
Sbjct: 500 ALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPHFETWNAG 559
Query: 546 -LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L VS+ G E K D + W Y+VGL GE L + + GS V W GS + QPLT
Sbjct: 560 VLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVE-GSLVAQRQPLT 618
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------- 642
WYK+ F+AP G+DP+A++L +MGKG+ W+NGQS+GRYW +
Sbjct: 619 WYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKASGNCGACNYAGWFNEKK 678
Query: 643 -LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
L+ G SQ WYH+PRS+L PTGNLLVL EE G P GIS+ V ++C +++ P
Sbjct: 679 CLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVCADINEWQ-P 737
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+++W Q Q + K K + RPK + C SG+KI+ I FAS+G P G C ++ GSCH
Sbjct: 738 QLVNW--QMQASGKVDKPL---RPKAHLSCASGQKITSIKFASFGTPQGVCGSFREGSCH 792
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ +S E+ C+G+ SC+VPV E F GDPCP + K L V+ C+
Sbjct: 793 AFHSYDAFERYCIGQNSCSVPVTPEIFGGDPCPHVMKKLSVEVICS 838
>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
Length = 845
Score = 775 bits (2002), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/827 (49%), Positives = 524/827 (63%), Gaps = 60/827 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +++ING R++LFSGSIHYPRSTP+MW LI KAKEGGLDVV+T VFWN+HEP
Sbjct: 27 DVTYDREAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 87 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRADNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MK YA IVN+MK+ L+ SQGGPIILSQIENEYG G Y WAA +
Sbjct: 147 PFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWAANM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN CNG C F PN P KPA WTE W+ ++ +G
Sbjct: 207 AVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFF--PNKPYKPATWTEAWSGWFSEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI + GS+VNYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 265 PLHQRPVQDLAFAVAQFIQR-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH AVK+C K ++S + LQ+A+++ + CAAFL N D +
Sbjct: 324 LIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSETGGCAAFLSNNDWK 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE Y E
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSKMEMLPTNSEMLSWETYSED 443
Query: 434 IPTYDE-TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
I D+ +S+R+ LLEQ+N T+D SDYLWY D +ES L V + G
Sbjct: 444 ISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSV--DIGSTESFLHGGELPTLIVETTG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING+ GSA G ++ F + V+L G+N ++LLSV VGLP+ G + E
Sbjct: 502 HAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRIALLSVAVGLPNIGGHFETWST 561
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPL 600
G L V+IQG K D S W YQVGL GE + + + G V W + + QPL
Sbjct: 562 GVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVSTNGISAVDWMQGSLIAQKQQPL 621
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+K F+ P G +P+A+++ SMGKG+ W+NGQSIGRYW ++ T
Sbjct: 622 TWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRYWTAYATGDCNGCQYSGVFRPPK 681
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LKPT NLLVL EE G P IS+ SVT +C +V++ H P
Sbjct: 682 CQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGGDPTRISLVKRSVTNVCSNVAEYH-P 740
Query: 702 PVISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
+ +W+ +N +T + H PKV+I C G+ IS I FAS+G P G C ++ G+C
Sbjct: 741 NIKNWQIENYGKTEEFH------LPKVRIHCAPGQSISSIKFASFGTPLGTCGSFKQGTC 794
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
H+ +S A+VEK CLG+++C V + F DPCP + K L V+A CT
Sbjct: 795 HAPDSHAVVEKKCLGRQTCAVTISNSNFGEDPCPNVLKRLSVEAHCT 841
>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
Length = 838
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/826 (49%), Positives = 527/826 (63%), Gaps = 54/826 (6%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD R++I+NG R+IL SGS+HYPRSTP+MWP +I KAKEGG+DV+QT VFWN H
Sbjct: 23 GTASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGH 82
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EPQ G++ F GR DLV+FIK V GLYV LR+GP+ EW +GG P WL VPGI FR+
Sbjct: 83 EPQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRT 142
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
DN PFK M+++ IVNMMKA RLY +QGGPIILSQIENEYG +E G Y +WA
Sbjct: 143 DNGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWA 202
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
AK+AV L TGVPWVMCKQDDAPDP+INACNG C + PN KP IWTE WT+++
Sbjct: 203 AKMAVGLDTGVPWVMCKQDDAPDPIINACNGFYC--DYFSPNKAYKPKIWTEAWTAWFTG 260
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLD 324
+G+ R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLD
Sbjct: 261 FGNPVPYRPAEDLAFSVAKFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLD 319
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNK 383
EYGLLRQPKWGHLK+LH A+KLC ++SG QEA +F+ + CAAFL N
Sbjct: 320 EYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANY 379
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------WEEYK 431
D+ + ATV F+N Y LPP SISILPDCK FNTA++ + W+ +
Sbjct: 380 DQHSFATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQSAQMKMTPVSRGLPWQSFN 439
Query: 432 EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
E +Y+++S LLEQ+NTT+D SDYLWY+ K D + L + S GH
Sbjct: 440 EETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWLTIMSAGH 499
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH F+NG+ G+A+G T K V+L G N +SLLS+ VGLP+ G + E AG
Sbjct: 500 ALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPHFETWNAG 559
Query: 546 -LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L VS+ G E K D + W Y+VGL GE L + + GS V W GS + QPLT
Sbjct: 560 VLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVE-GSLVAQRQPLT 618
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------- 642
WYK+ F+AP G+DP+A++L +MGKG+ W+NGQS+GRYW +
Sbjct: 619 WYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKASGNCGACNYAGWFNEKK 678
Query: 643 -LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
L+ G SQ WYH+PRS+L PTGNLLVL EE G P GIS+ V ++C +++ P
Sbjct: 679 CLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVCADINEWQ-P 737
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+++W Q Q + K K + RPK + C G+KI+ I FAS+G P G C ++ GSCH
Sbjct: 738 QLVNW--QMQASGKVDKPL---RPKAHLSCAPGQKITSIKFASFGTPQGVCGSFREGSCH 792
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ +S E+ C+G+ SC+VPV E F GDPCP + K L V+ C+
Sbjct: 793 AFHSYDAFERYCIGQNSCSVPVTPEIFGGDPCPHVMKKLSVEVICS 838
>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
Length = 766
Score = 772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/772 (49%), Positives = 511/772 (66%), Gaps = 31/772 (4%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MW ++ KA+ GGL+V+QT VFWN+HEP GQF+F G DLV+FIK + + +YV LR+G
Sbjct: 1 MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
PFI+ EW +GGLP+WL + P I+FRS N FK +MK+Y MIV+MMK +L+ASQGGPI+
Sbjct: 61 PFIQAEWNHGGLPYWLREKPNIIFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPIV 120
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQC 239
L+QIENEY V+ ++ E G YV+WAA +AV L GVPW+MCKQ DAPDPVIN CNGR C
Sbjct: 121 LAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRHC 180
Query: 240 GETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMY 299
G+TF GPN P KPA+WTENWT+ Y+V+GD R+AEDIA+ VA F +K GS VNYYMY
Sbjct: 181 GDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSK-NGSLVNYYMY 239
Query: 300 HGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
HGGTNFGRT++ + T YYD+APLDE+GL R+PKWGHL+++H A+ LC KP+L G
Sbjct: 240 HGGTNFGRTSAVFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGIQ 299
Query: 360 NFSKLQEAFIFQ--GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFN 417
K EA ++ G++ CAAFL N D ++ T+ F + LPP SISILPDCKTV FN
Sbjct: 300 VIGKGLEARFYEKPGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCKTVVFN 359
Query: 418 TAKLDSVE--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY 463
T + S +W+ E+IPT ++ + LE + KD +DY WY
Sbjct: 360 TETIVSQHNARNFIPSKNANKLKWKMSPESIPTVEQVPVNNKIPLELYSLLKDTTDYGWY 419
Query: 464 NFRFKHDPSDSES------VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
+ D D VL+++SLGH + F+NGE++G+AHG H +K+F + V
Sbjct: 420 TTSIELDKEDVSKRPDILPVLRIASLGHAMLVFVNGEYIGTAHGSHEEKNFVFQGSVPFK 479
Query: 518 NGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKL 576
G NN++LL ++VGLPDSGAY+E R AG R+++I G D S WG+QV L GEK+
Sbjct: 480 AGVNNIALLGILVGLPDSGAYMEHRFAGPRSITILGLNTGTLDISKNGWGHQVALQGEKV 539
Query: 577 QIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
++FT GS V WS LTWYKT FDAP G+DPVAI + MGKG+ WVNG+SIG
Sbjct: 540 KVFTQGGSHRVDWSEI-KEEKSALTWYKTYFDAPEGNDPVAIRMNGMGKGQIWVNGKSIG 598
Query: 637 RYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
RYW+S+L+P +QS YHIPRSF+KP+ NLLV+LEEEN P + I V+ T+C ++
Sbjct: 599 RYWMSYLSPLKLSTQSEYHIPRSFIKPSENLLVILEEENVTPEKVEILLVNRDTICSFIT 658
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYA 756
H P V SW ++++ + + +RCP +KI+ I FAS+G+P+G C N+
Sbjct: 659 QYHPPNVKSWERKDKQFRAV---VDDVKTGAHLRCPHDKKITNIEFASFGDPSGVCGNFE 715
Query: 757 IGSCH-SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G CH SS+++ +VE+ CLGK +C+VP+ + + C K L + A+C+
Sbjct: 716 HGKCHSSSDTKKLVEQHCLGKENCSVPMDAFDNFKNECDS--KTLAIQAKCS 765
>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
Length = 861
Score = 772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/841 (48%), Positives = 532/841 (63%), Gaps = 71/841 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD RSL+I+G R++L SGSIHYPRSTP+MWP +I KAK+GGLDV+++ VFWN+HEP+
Sbjct: 30 NVTYDHRSLLIDGQRRVLISGSIHYPRSTPEMWPDIIQKAKDGGLDVIESYVFWNMHEPK 89
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
++ F R DLV+F+K VQ GL V LRIGP+ EW YGG P WLH +PGI FR+DNE
Sbjct: 90 QNEYYFEDRFDLVKFVKIVQQAGLLVHLRIGPYACAEWNYGGFPVWLHLIPGIHFRTDNE 149
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ IV+MMK +L+ASQGGPIIL+QIENEYG ++ + G YV+WAA +
Sbjct: 150 PFKNEMQRFTAKIVDMMKQEKLFASQGGPIILAQIENEYGNIDGPYGAAGKSYVKWAASM 209
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMC+Q DAPDP+IN CNG C + F PNSP+KP +WTENW+ ++ +G
Sbjct: 210 AVGLNTGVPWVMCQQADAPDPIINTCNGFYC-DAFT-PNSPNKPKMWTENWSGWFLSFGG 267
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNFGRT ++ T Y AP+DEYG
Sbjct: 268 RLPFRPTEDLAFSVARFFQR-GGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYG 326
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKR 386
++RQPKWGHLKELH A+KLC +++ + EA ++ GS CAAFL N + +
Sbjct: 327 IVRQPKWGHLKELHKAIKLCEAALVNAESNYTSLGSGLEAHVYSPGSGTCAAFLANSNTQ 386
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----------------------- 423
++ATV F+ Y LP S+SILPDCK V FNTAK+ S
Sbjct: 387 SDATVKFNGNSYHLPAWSVSILPDCKNVVFNTAKIGSQTTSVQMNPANLILAGSNSMKGT 446
Query: 424 ----VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------ 473
W E I + LLEQ+NTT D+SDYLWY + D ++
Sbjct: 447 DSANAASWSWLHEQIGIGGSNTFSKPGLLEQINTTVDSSDYLWYTTSIQVDDNEPFLHNG 506
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
++ VL V SLGH LH FINGEF G G S L+ + L +G NN+ LLS+ VGL
Sbjct: 507 TQPVLHVQSLGHALHVFINGEFAGRGAGSSSSSKIALQTPITLKSGKNNIDLLSITVGLQ 566
Query: 534 DSGAYLERRVAGLRN-VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
+ G++ + AG+ V +QG K+ + D S+ W YQ+GL GE+L I++ W
Sbjct: 567 NYGSFFDTWGAGITGPVILQGFKDGEHDLSTQQWTYQIGLTGEQLGIYSGDTKASAQWVA 626
Query: 592 YGSS--THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--- 646
GS T QP+ WYKT FDAP+G+DPVA+NL+ MGKG AWVNGQSIGRYW S++ Q
Sbjct: 627 -GSDLPTKQPMIWYKTNFDAPSGNDPVALNLLGMGKGVAWVNGQSIGRYWPSYIASQSGC 685
Query: 647 -------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVS 687
G PSQ YH+PRS+++PTGN+LVL EE G P IS T S
Sbjct: 686 TDSCDYRGAYSSTKCQTNCGQPSQKLYHVPRSWIQPTGNVLVLFEELGGDPTQISFMTRS 745
Query: 688 VTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISK-ILFASYG 746
V +LC VS++HLPPV SW+S L+ +K + ++Q+ CPS R + K I FAS+G
Sbjct: 746 VGSLCAQVSETHLPPVDSWKSSATSGLEVNK----PKAELQLHCPSSRHLIKSIKFASFG 801
Query: 747 NPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G+C ++ G C+++++ +IVE+AC+G+ SC+V V EKF GDPC G K L V+A C
Sbjct: 802 TSKGSCGSFTYGHCNTNSTMSIVEEACIGRESCSVEVSIEKF-GDPCKGTVKNLAVEASC 860
Query: 807 T 807
+
Sbjct: 861 S 861
>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 771 bits (1991), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/825 (48%), Positives = 523/825 (63%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +++IING RKIL SGSIHYPRSTP MW L+ KAK+GGLDV+QT VFWN+HEP
Sbjct: 29 SVTYDRKAIIINGQRKILISGSIHYPRSTPDMWEGLMQKAKDGGLDVIQTYVFWNVHEPS 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRF+K VQ GLY+ LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 89 PGNYNFEGRYDLVRFVKTVQKAGLYMHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK+ L+ SQGGPIILSQIENEYG + G Y+ WAAK+
Sbjct: 149 PFKMAMQGFTEKIVQMMKSESLFESQGGPIILSQIENEYGSESKALGAPGHAYMTWAAKM 208
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+TGVPWVMCK+DDAPDPVIN CNG C + F PN P KP +WTE W+ ++ +G
Sbjct: 209 AVGLRTGVPWVMCKEDDAPDPVINTCNGFYC-DAFT-PNKPYKPTMWTEAWSGWFTEFGG 266
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+A+ VA FI K GS++NYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 267 TVHERPVEDLAFAVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 325
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKR 386
L+RQPK+GHLKELH A+KLC ++S + + Q++ +F G+ CAAFL N +
Sbjct: 326 LIRQPKYGHLKELHRAIKLCEPALISADPIVTSLGPYQQSHVFSSGTGGCAAFLSNYNPN 385
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE Y E
Sbjct: 386 SVARVMFNNMHYSLPPWSISILPDCRNVVFNTAKVGVQTSQMHMSAGETKLLSWEMYDED 445
Query: 434 IPTY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVSSLGHV 486
I + D + + A LLEQ+N T+D SDYLWY PS+S VL V S GH
Sbjct: 446 IASLGDNSMITAVGLLEQLNVTRDTSDYLWYMTSVDISPSESSLRGGRPPVLTVQSAGHA 505
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
LH +ING+ GSAHG ++ FT V++ G N ++LLS+ V LP+ G + E G
Sbjct: 506 LHVYINGQLSGSAHGSRENRRFTFTGDVNMRAGINRIALLSIAVELPNVGLHYESTNTGV 565
Query: 546 LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPLTW 602
L V + G + K D + W YQVGL GE + + G V W + + + QPLTW
Sbjct: 566 LGPVVLHGLDQGKRDLTWQKWSYQVGLKGEAMNLVAPSGISYVEWMQASFATQKLQPLTW 625
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ-- 646
YK F+AP G +P+A++L SMGKG+ W+NG+SIGRYW ++ P+
Sbjct: 626 YKAYFNAPGGDEPLALDLGSMGKGQVWINGESIGRYWTAAANGDCNHCSYAGTYRAPKCQ 685
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q WYH+PRS+L+PT NLLV+ EE G GIS+ SV+++C VS+ H P +
Sbjct: 686 TGCGQPTQRWYHVPRSWLQPTKNLLVIFEEIGGDASGISLVKRSVSSVCADVSEWH-PTI 744
Query: 704 ISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
+W ++ R+ + H RPKV +RC G+ IS I FAS+G P G C ++ G CHS
Sbjct: 745 KNWHIESYGRSEELH------RPKVHLRCAMGQSISAIKFASFGTPLGTCGSFQQGPCHS 798
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS AI+EK C+G++ C V + F GDPCP + K + V+A CT
Sbjct: 799 PNSHAILEKKCIGQQRCAVTISMNNFGGDPCPNVMKRVAVEAICT 843
>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
Length = 836
Score = 769 bits (1986), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/839 (47%), Positives = 526/839 (62%), Gaps = 60/839 (7%)
Query: 13 LLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGG 72
++ +GG + G VTYD ++L+ING R+IL SGSIHYPRST +MWP L KAK+GG
Sbjct: 14 VMLAVGGVECG------VTYDHKALVINGERRILISGSIHYPRSTAEMWPDLFRKAKDGG 67
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
LDV+QT VFWN+HEP PG ++F GR DLV+F+K Q GLYV LRIGP++ EW +GG P
Sbjct: 68 LDVIQTYVFWNMHEPSPGNYNFEGRFDLVKFVKLAQEAGLYVHLRIGPYVCAEWNFGGFP 127
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WL VPGI FR+DNEPFK M+ + +V++MK+ L+ SQGGPIIL+Q+ENEY E
Sbjct: 128 VWLKYVPGISFRTDNEPFKNAMEGFTKKVVDLMKSEGLFESQGGPIILAQVENEYKPEEM 187
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKP 252
+ G Y+ WAA++AV + TGVPWVMCKQDDAPDPVIN CNG C + F PN P KP
Sbjct: 188 EYGLAGAQYMNWAAQMAVGMDTGVPWVMCKQDDAPDPVINTCNGFYC-DNFV-PNKPYKP 245
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA- 311
+WTE W+ +Y +G + R ED+A+ VA F K GS+VNYYMYHGGTNFGRTA
Sbjct: 246 TMWTEAWSGWYTEFGGASPHRPVEDLAFAVARFFVK-GGSFVNYYMYHGGTNFGRTAGGP 304
Query: 312 YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ 371
++ T Y AP+DEYGL+RQPKWGHLKELH A+KLC ++SG V + Q+A+++
Sbjct: 305 FIATSYDYDAPIDEYGLIRQPKWGHLKELHKAIKLCEPALVSGDPVVTSLGHFQQAYVYS 364
Query: 372 -GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---- 426
G+ CAAF+VN D + V F+ Y++ P S+SILPDC+ V FNTAK+D
Sbjct: 365 AGAGNCAAFIVNYDSNSVGRVIFNGQRYKIAPWSVSILPDCRNVVFNTAKVDVQTSQMKM 424
Query: 427 -------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------ 473
WE E I ++++ S+ A LLEQ+N T+D +DYLWY + D +
Sbjct: 425 TPVGGFGWESIDENIASFEDNSISAVGLLEQINITRDNTDYLWYITSVEVDEDEPFIKNG 484
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
VL V S G LH FIN + GS +G+ + V L GTN +SLLS+ VGL
Sbjct: 485 GLPVLTVQSAGDALHVFINDDLAGSQYGRKENPKVRFSSGVRLNVGTNKISLLSMTVGLQ 544
Query: 534 DSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
+ G + E AG L +++ G K+ +D SS W YQ+GL GE + + T G V W +
Sbjct: 545 NIGPHFEMANAGVLGPITLSGFKDGTRDLSSQRWSYQIGLKGETMNLHTS-GDNTVEWMK 603
Query: 592 -YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------- 643
QPL WYK FDAP G DP+ ++L SMGKG+AWVNGQSIGRYW S+L
Sbjct: 604 GVAVPQSQPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQSIGRYWPSYLAEGVCSD 663
Query: 644 --------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
T G SQ WYH+PRS+L+P+GN LVL EE G P G+S+ T SV
Sbjct: 664 GCSYEGTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLFEEIGGNPSGVSLVTRSVD 723
Query: 690 TLCGHVSDSHLPPVISWRSQN-QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNP 748
++C HVS+SH + WR ++ + K H PKV ++C G++IS I FAS+G P
Sbjct: 724 SVCAHVSESHSQSINFWRLESTDQVQKLHI------PKVHLQCSKGQRISAIKFASFGTP 777
Query: 749 NGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G C ++ G CHS NS A ++K C+G R C++ V + F GDPCPG+ K + ++A C+
Sbjct: 778 QGLCGSFQQGDCHSPNSVATIQKKCMGLRKCSLSVSEKIFGGDPCPGVRKGVAIEAVCS 836
>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 841
Score = 769 bits (1985), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/822 (48%), Positives = 521/822 (63%), Gaps = 54/822 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++++G R+ILFSGSIHYPRSTP+MW LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRFIK VQ G++V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK+ L+ASQGGPIILSQIENEYG F G Y+ WAAK+A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK+DDAPDPVINACNG C +TF+ PN P KP +WTE W+ ++ +G
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYC-DTFS-PNKPYKPTMWTEAWSGWFTEFGGT 264
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R R ED+A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD APLDEYGL
Sbjct: 265 IRQRPVEDLAFGVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGL 323
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+GHLKELH AVKLC +P++S +QEA +F+ SS CAAFL N + +
Sbjct: 324 AREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFLANYNSNSY 383
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIP 435
A V F+N Y LPP SISILPDCK V FNTA + S WE+Y E +
Sbjct: 384 AKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWADGASSMMWEKYDEEVD 443
Query: 436 TYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ L + LLEQ+N T+D SDYLWY + DPS+ + L V S GH LH
Sbjct: 444 SLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQSAGHALH 503
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
FING+ GSA+G D+ + +L GTN V+LLSV GLP+ G + E G+
Sbjct: 504 VFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYETWNTGVVG 563
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYK 604
V I G E +D + +W YQVGL GE++ + + GS V W + + QPL WY+
Sbjct: 564 PVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQNQQPLAWYR 623
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ---- 646
FD P+G +P+A+++ SMGKG+ W+NGQSIGRYW S+ P+
Sbjct: 624 AYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAEGDCKGCHYTGSYRAPKCQAG 683
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G P+Q WYH+PRS+L+PT NLLV+ EE G I++ +V+ +C VS+ H P + +
Sbjct: 684 CGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKN 742
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
W+ ++ + H KV ++C G+ IS I FAS+G P G C + G CHS NS
Sbjct: 743 WQIESYGEPEFHT------AKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINS 796
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+++EK C+G + C V + F GDPCP + K + V+A C+
Sbjct: 797 NSVLEKKCIGLQRCVVAISPSNFGGDPCPEVMKRVAVEAVCS 838
>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/825 (48%), Positives = 518/825 (62%), Gaps = 59/825 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++LIING R+ILFSGSIHYPRSTPQMW LI KAK+GGLD + T VFWNLHEP
Sbjct: 26 SVTYDRKALIINGQRRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDAIDTYVFWNLHEPS 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+++F GR DLVRFIK +Q GLYV LRIGP+I EW +GG P WL VPG+ FR+DNE
Sbjct: 86 PGKYNFEGRYDLVRFIKLIQKAGLYVHLRIGPYICAEWNFGGFPVWLKFVPGVSFRTDNE 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ IV MMK +L+ SQGGPII+SQIENEYG +F G Y+ WAAK+
Sbjct: 146 PFKMAMQRFTQKIVQMMKNEKLFESQGGPIIISQIENEYGHESRAFGAPGYAYLTWAAKM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV + TGVPWVMCK+DDAPDPVIN CNG C + PN P+KP +WTE W+ ++ +
Sbjct: 206 AVAMDTGVPWVMCKEDDAPDPVINTCNGFYC--DYFSPNKPNKPTLWTEAWSGWFTEFAG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R ED+++ V FI K GS+VNYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 264 PIQQRPVEDLSFAVTRFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH A+KLC + +LS + +A +F S CAAFL N +
Sbjct: 323 LIRQPKYGHLKELHKAIKLCERALLSADPAETSLGTYAKAQVFYSESGGCAAFLSNYNPT 382
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+++ Y L P SISILPDCK V FNTA + + WE + E
Sbjct: 383 SAARVTFNSMHYNLAPWSISILPDCKNVVFNTATVGVQTSQMQMLPTNSELLSWETFNED 442
Query: 434 IPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
I + D+ S + LLEQ+N T+D SDYLWY+ R D S SES L V S G
Sbjct: 443 ISSADDDSTITVVGLLEQLNVTRDTSDYLWYSTRI--DISSSESFLHGGQHPTLIVQSTG 500
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING GSA G D+ FT V+L G+N +S+LS+ VGLP++G + E
Sbjct: 501 HAMHVFINGHLSGSAFGTREDRRFTFTGDVNLQTGSNIISVLSIAVGLPNNGPHFETWST 560
Query: 545 G-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPL 600
G L V + G E KD S W YQVGL GE + + + + W + + QPL
Sbjct: 561 GVLGPVVLHGLDEGKKDLSWQKWSYQVGLKGEAMNLVSPNVISNIDWMKGSLFAQKQQPL 620
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TWYK FDAP G +P+A+++ SMGKG+ W+NGQSIGRYW ++
Sbjct: 621 TWYKAYFDAPDGDEPLALDMGSMGKGQVWINGQSIGRYWTAYAKGNCSGCSYSGTFRTTK 680
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LKPT NLLVL EE G IS SVTT+C VS+ H P
Sbjct: 681 CQFGCGQPTQRWYHVPRSWLKPTQNLLVLFEELGGDASKISFMKRSVTTVCAEVSEHH-P 739
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+ +W ++Q + +PKV + C SG+ IS I FAS+G P+G C N+ G+CH
Sbjct: 740 NIKNWHIESQERPEEMS-----KPKVHLHCASGQSISAIKFASFGTPSGTCGNFQKGTCH 794
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+ S+A++EK C+G++ C+V V + F +PCP + K L V+A C
Sbjct: 795 APTSQAVLEKKCIGQQKCSVAVSSSNF-ANPCPNMFKKLSVEAVC 838
>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 768 bits (1982), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/825 (49%), Positives = 521/825 (63%), Gaps = 58/825 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++L+ING R+ILFSGSIHYPRSTP MW LI KAKEGG+DVV+T VFWN+HEP
Sbjct: 26 SVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGIDVVETYVFWNVHEPS 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 86 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK+ RL+ SQGGPIILSQIENEYG G YV WAAK+
Sbjct: 146 PFKRAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGAAGQNYVNWAAKM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV++ TGVPWVMCK+DDAPDPVIN CNG C + F PN P KP IWTE W+ ++ +G
Sbjct: 206 AVEMGTGVPWVMCKEDDAPDPVINTCNGFYC-DKFT-PNRPYKPMIWTEAWSGWFTEFGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R +D+A+ A FI + GS+VNYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 264 PIHKRPVQDLAFAAARFIIR-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH A+K+C + ++S + + + Q+A ++ S +CAAFL N D +
Sbjct: 323 LIRQPKYGHLKELHRAIKMCERALVSTDPIVTSLGEFQQAHVYTTESGDCAAFLSNYDSK 382
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
++A V F+N+ Y LPP S+SILPDC+ V FNTAK+ + WE + E
Sbjct: 383 SSARVMFNNMHYSLPPWSVSILPDCRNVVFNTAKVGVQTSQMQMLPTNTQLFSWESFDED 442
Query: 434 IPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
I + DE+S + A LLEQ+N TKDASDYLWY D SES L+ V S G
Sbjct: 443 IYSVDESSAITAPGLLEQINVTKDASDYLWYITSV--DIGSSESFLRGGELPTLIVQSTG 500
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING+ GSA G + FT V+L+ G N ++LLSV +GLP+ G + E
Sbjct: 501 HAVHVFINGQLSGSAFGTREYRRFTYTGKVNLLAGINRIALLSVAIGLPNVGEHFESWST 560
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPL 600
G L V++ G + K D S W YQVGL GE + + + G V W S +QPL
Sbjct: 561 GILGPVALHGLDKGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVVQRNQPL 620
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+KT FDAP G +P+A+++ MGKG+ W+NGQSIGRYW +F T
Sbjct: 621 TWHKTYFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAFATGNCNDCNYAGSFRPPK 680
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LK T NLLV+ EE G P IS+ SV+++C VS+ H P
Sbjct: 681 CQLGCGQPTQRWYHVPRSWLKTTQNLLVIFEELGGNPSKISLVKRSVSSVCADVSEYH-P 739
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+ +W ++ K R PKV + C G+ IS I FAS+G P G C NY G+CH
Sbjct: 740 NIKNWHIESY-----GKSEEFRPPKVHLHCSPGQTISSIKFASFGTPLGTCGNYEQGACH 794
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
S S I+EK C+GK CTV V F DPCP + K L V+A C
Sbjct: 795 SPASYVILEKRCIGKPRCTVTVSNSNFGQDPCPKVLKRLSVEAVC 839
>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 848
Score = 767 bits (1980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/826 (49%), Positives = 526/826 (63%), Gaps = 58/826 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+ILFSGSIHYPRSTP MW LI KAKEGGLDVV+T VFWN+HEP
Sbjct: 26 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGLDVVETYVFWNVHEPS 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 86 PGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK+ RL+ SQGGPIILSQIENEYG + G YV WAAK+
Sbjct: 146 PFKTAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGDAGQNYVNWAAKM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV++ TGVPWVMCK+DDAPDPVIN CNG C + F PN P KP IWTE W+ ++ +G
Sbjct: 206 AVEMGTGVPWVMCKEDDAPDPVINTCNGFYC-DKFT-PNRPYKPMIWTEAWSGWFTEFGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R +D+A+ VA FI + GS+VNYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 264 PIHKRPVQDLAFAVARFIIR-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH A+K+C + ++S + + + Q+A ++ S +CAAFL N D +
Sbjct: 323 LIRQPKYGHLKELHRAIKMCERALVSTDPIITSLGESQQAHVYTTESGDCAAFLSNYDSK 382
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
++A V F+N+ Y LPP S+SILPDC+ V FNTAK+ + WE + E
Sbjct: 383 SSARVMFNNMHYNLPPWSVSILPDCRNVVFNTAKVGVQTSQMQMLPTNTQLFSWESFDED 442
Query: 434 IPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
+ + D++S + A LLEQ+N TKDASDYLWY D SES L+ V S G
Sbjct: 443 VYSVDDSSAIMAPGLLEQINVTKDASDYLWYITSV--DIGSSESFLRGGELPTLIVQSRG 500
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING+ GSA+G + F V+L G N ++LLSV +GLP+ G + E
Sbjct: 501 HAVHVFINGQLSGSAYGTREYRRFMYTGKVNLRAGINRIALLSVAIGLPNVGEHFESWST 560
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPL 600
G L V++ G + K D S W YQVGL GE + + + G V W S +QPL
Sbjct: 561 GILGPVALHGLDQGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVVQRNQPL 620
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+KT FDAP G +P+A+++ MGKG+ W+NGQSIGRYW +F T
Sbjct: 621 TWHKTHFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTTFATGNCNDCNYAGSFRPPK 680
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LKPT NLLV+ EE G P IS+ SV+++C VS+ H P
Sbjct: 681 CQLGCGQPTQRWYHVPRSWLKPTQNLLVIFEELGGNPSKISLVKRSVSSVCADVSEYH-P 739
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+ +W ++ K+ + P PKV + C G+ IS I FAS+G P G C NY G+CH
Sbjct: 740 NIKNWHIESYG--KSEEFHP---PKVHLHCSPGQTISSIKFASFGTPLGTCGNYEQGACH 794
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S S AI+EK C+GK CTV V F DPCP + K L V+A C
Sbjct: 795 SPASYAILEKRCIGKPRCTVTVSNSNFGQDPCPKVLKRLSVEAVCA 840
>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
Length = 843
Score = 766 bits (1979), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/824 (48%), Positives = 522/824 (63%), Gaps = 56/824 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++++G R+ILFSGSIHYPRSTP+MW LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRFIK VQ G++V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK+ L+ASQGGPIILSQIENEYG F G Y+ WAAK+A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK+DDAPDPVINACNG C +TF+ PN P KP +WTE W+ ++ +G
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYC-DTFS-PNKPYKPTMWTEAWSGWFTEFGGT 264
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R R ED+A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD APLDEYGL
Sbjct: 265 IRQRPVEDLAFGVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGL 323
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+GHLKELH AVKLC +P++S +QEA +F+ SS CAAFL N + +
Sbjct: 324 AREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFLANYNSNSY 383
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIP 435
A V F+N Y LPP SISILPDCK V FNTA + S WE+Y E +
Sbjct: 384 AKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWADGASSMMWEKYDEEVD 443
Query: 436 TYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ L + LLEQ+N T+D SDYLWY R + DPS+ + L V S GH LH
Sbjct: 444 SLAAAPLLTSTGLLEQLNVTRDTSDYLWYITRVEVDPSEKFLQGGTPLSLTVQSAGHALH 503
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
FING+ GSA+G D+ + +L GTN V+LLSV GLP+ G + E G+
Sbjct: 504 VFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYETWNTGVVG 563
Query: 549 -VSIQGAKE-LKDFSSFSWGY--QVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTW 602
V I G E +D + +W Y QVGL GE++ + + GS V W + + QPL W
Sbjct: 564 PVVIHGLDEGSRDLTWQTWSYQFQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQNQQPLAW 623
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ-- 646
Y+ FD P+G +P+A+++ SMGKG+ W+NGQSIGRYW S+ P+
Sbjct: 624 YRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAEGDCKGCHYTGSYRAPKCQ 683
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q WYH+PRS+L+PT NLLV+ EE G I++ +V+ +C VS+ H P +
Sbjct: 684 AGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNI 742
Query: 704 ISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
+W+ ++ + H KV ++C G+ IS I FAS+G P G C + G CHS
Sbjct: 743 KNWQIESYGEPEFHT------AKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSI 796
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS +++EK C+G + C V + F GDPCP + K + V+A C+
Sbjct: 797 NSNSVLEKKCIGLQRCVVAISPSNFGGDPCPEVMKRVAVEAVCS 840
>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
Length = 833
Score = 766 bits (1979), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/825 (48%), Positives = 513/825 (62%), Gaps = 60/825 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R+L+I+G R++L SGSIHYPRSTPQMWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 21 NVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEPV 80
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+DF GR+DLV+F+K V GLYV LRIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 81 KGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 140
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ IV++MK +LYASQGGPIILSQIENEYG ++ + G Y+ WAAK+
Sbjct: 141 PFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWAAKM 200
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ ++ +G
Sbjct: 201 ATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQ--FTPNSNTKPKMWTENWSGWFLSFGG 258
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNF R T ++ T Y AP+DEYG
Sbjct: 259 AVPHRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRSTGGPFIATSYDYDAPIDEYG 317
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
++RQ KWGHLK++H A+KLC + +++ + + EA +++ S CAAFL N D +N
Sbjct: 318 IIRQQKWGHLKDVHKAIKLCEEALIATDPKISSLGQNLEAAVYKTGSVCAAFLANVDTKN 377
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV------------------EQWEE 429
+ TV FS Y LP S+SILPDCK V NTAK++S +W
Sbjct: 378 DKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISNFVTEDISSLETSSSKWSW 437
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFK-HDPSDSESVLKVSSLGHVLH 488
E + + L LLEQ+NTT D SDYLWY+ D S++VL + SLGH LH
Sbjct: 438 INEPVGISKDDILSKTGLLEQINTTADRSDYLWYSLSLDLADDPGSQTVLHIESLGHALH 497
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
AFING+ G+ G ++ + L++G N + LLS+ VGL + GA+ + AG+
Sbjct: 498 AFINGKLAGNQAGNSDKSKLNVDIPIALVSGKNKIDLLSLTVGLQNYGAFFDTVGAGITG 557
Query: 549 -VSIQGAK---ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
V ++G K D SS W YQ+GL GE L + + S Y +QPL WYK
Sbjct: 558 PVILKGLKNGNNTLDLSSRKWTYQIGLKGEDLGLSSGSSGGWNSQSTY--PKNQPLVWYK 615
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------------ 646
T FDAP+GS+PVAI+ MGKGEAWVNGQSIGRYW +++
Sbjct: 616 TNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKC 675
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
G PSQ+ YH+PRSFLKP GN LVL EE G P IS T + ++C HVSDSH P
Sbjct: 676 RKNCGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQ 735
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFASYGNPNGNCENYAIGSCH 761
+ W + K P + + CP+ + IS I FASYG P G C N+ G C
Sbjct: 736 IDLWNQDTESGGKVG-------PALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCS 788
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
S+ + +IV+KAC+G RSC+V V T+ F GDPC G+PK+L V+A C
Sbjct: 789 SNKALSIVKKACIGSRSCSVGVSTDTF-GDPCRGVPKSLAVEATC 832
>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 828
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 413/838 (49%), Positives = 529/838 (63%), Gaps = 57/838 (6%)
Query: 17 IGGSDGGGGGGN---NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGL 73
+ G G GG G NV+YD R+++ING R+IL SGSIHYPRS+P+MWP LI KAKEGGL
Sbjct: 1 MAGDIGDGGHGFQAWNVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGL 60
Query: 74 DVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPF 133
DV+QT VFWN HEP G++ F GR DLVRFIK V+ GLYV LRIGP++ EW +GG P
Sbjct: 61 DVIQTYVFWNGHEPSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPV 120
Query: 134 WLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHS 193
WL V GI FR++NEPFK+HM+R+ IV+MMK+ L+ SQGGPIILSQIENEYG +E+
Sbjct: 121 WLKYVQGINFRTNNEPFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYE 180
Query: 194 FLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPA 253
G Y WAAK+AV L TGVPWVMCKQDDAPDP+IN CNG C + PN KP
Sbjct: 181 IGAPGRAYTEWAAKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYC--DYFSPNKAYKPK 238
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-Y 312
+WTE WT ++ +G R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA +
Sbjct: 239 MWTEAWTGWFTEFGGAVPHRPAEDLAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPF 297
Query: 313 VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG 372
+ T Y APLDE+GLLRQPKWGHLK+LH A+KLC ++SG + +EA +F
Sbjct: 298 IATSYDYDAPLDEFGLLRQPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHS 357
Query: 373 SS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ----- 426
S CAAFL N + R+ A V F N+ Y LPP SISILPDCK +NTA+L +
Sbjct: 358 KSGACAAFLANYNPRSYAKVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMT 417
Query: 427 -------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD------PSD 473
W+ Y E +YD++S A LLEQ+NTT+D SDYLWY+ K S
Sbjct: 418 PVSGRFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSG 477
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
VL V S GH LH FING G+A+G + T + V L G N ++LLS+ VGLP
Sbjct: 478 RYPVLTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLP 537
Query: 534 DSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
+ G + E AG L VS+ G E +D S W Y+VGL GE L + + GS V W
Sbjct: 538 NVGPHFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVE 597
Query: 592 YGS--STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------- 642
GS + QPLTWYKT F+AP G+ P+A+++ SMGKG+ W+NGQ++GRYW ++
Sbjct: 598 -GSLMARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKATGGCG 656
Query: 643 -------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
L+ G PSQ WYH+P S+L PTGNLLV+ EE G P GIS+ +
Sbjct: 657 DCNYAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIE 716
Query: 690 TLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPN 749
++C + + P ++++ + Q + K +K + RPK + C G+KIS I FAS+G P
Sbjct: 717 SVCADIYEWQ-PTLMNY--EMQASGKVNKPL---RPKAHLWCAPGQKISSIKFASFGTPE 770
Query: 750 GNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G C +Y GSCH+ S E++C+G SC+V V E F GDPCP + K L V+A C+
Sbjct: 771 GVCGSYREGSCHAHKSYDAFERSCIGMNSCSVTVAPEIFGGDPCPSVMKKLSVEAICS 828
>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 672
Score = 766 bits (1977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 368/638 (57%), Positives = 455/638 (71%), Gaps = 20/638 (3%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G G VTYDGR+L++NG R++LFSG +HY RSTP+MWP+LIA AK+GGLDV+QT VFWN+
Sbjct: 35 GEGGEVTYDGRALVVNGTRRMLFSGEMHYTRSTPEMWPKLIANAKKGGLDVIQTYVFWNV 94
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEP GQ++F GR DLV+FI+E+Q QGLYV LRIGPFIE EW YGG PFWLHDVP I FR
Sbjct: 95 HEPVQGQYNFQGRYDLVKFIREIQTQGLYVSLRIGPFIEAEWKYGGFPFWLHDVPNITFR 154
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
+DNEPFK HM+R+ T IVNMMK LY QGGPII+SQIENEY MVE +F GP YVRW
Sbjct: 155 TDNEPFKQHMQRFVTQIVNMMKHEGLYYPQGGPIIISQIENEYQMVEPAFGSGGPRYVRW 214
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA++AV LQTGVPW+MCKQ+DAPDP+IN CNG CGETF GPNSP KPA+WTENWT+ Y
Sbjct: 215 AAEMAVGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPTKPALWTENWTTRYP 274
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLD 324
+YG++ ++RS EDIA+ VALFIA+ KGS+V+YYMYHGGTNFGR AS+YV T YYD APLD
Sbjct: 275 IYGNDTKLRSTEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFASSYVTTSYYDGAPLD 334
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKD 384
EYGL+ +P WGHL+ELH+AVKL + +L G + + QEA IF+ +C AFLVN D
Sbjct: 335 EYGLIWRPTWGHLRELHAAVKLSSEALLFGRYSNFSLGPEQEAHIFETELKCVAFLVNFD 394
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------LDSVEQWEE 429
K TV F N+ ++L P SIS+L +C+TV F TA+ L+ + W+
Sbjct: 395 KHQTPTVVFRNIYFQLAPKSISVLSECRTVVFETARVNAQYGSRTAEVVESLNDIHTWKA 454
Query: 430 YKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES--VLKVSSLGHV 486
+KE IP + N L E ++ TKD +DYLWY +++ PSD +L V S HV
Sbjct: 455 FKEPIPEDISKAVYTGNQLFEHLSMTKDETDYLWYIVSYEYIPSDDGQLVLLNVESRAHV 514
Query: 487 LHAFINGEFVGSAHGKHSDK-SFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LHAF+N E+ GS HG H + L + L G N +SLLSVMVG PDSGA++ERR G
Sbjct: 515 LHAFVNTEYAGSVHGSHDGPGNIILNTNISLNEGQNTISLLSVMVGSPDSGAHMERRSFG 574
Query: 546 LRNVSI-QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
+ VSI QG + L ++ W YQVGL GE +I+T S W+ + T+ P TWYK
Sbjct: 575 IHKVSIQQGQQPLHLLNNELWAYQVGLYGEANRIYTQEESSSAEWTEINNLTYHPFTWYK 634
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF 642
T F P G+D VA+NL SMGKGE WVNG+S+GRYWVSF
Sbjct: 635 TTFATPVGNDVVALNLTSMGKGEVWVNGESLGRYWVSF 672
>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
Length = 841
Score = 765 bits (1975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/830 (49%), Positives = 522/830 (62%), Gaps = 56/830 (6%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G +V+YD ++++ING R+IL SGSIHYPRS+P+MWP LI KAKEGGLDV+QT VFWN
Sbjct: 22 GSAKASVSYDSKAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWN 81
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP PG++ F DLV+FIK +Q GLYV LRIGP++ EW +GG P WL +PGI F
Sbjct: 82 GHEPSPGKYYFEDNYDLVKFIKLIQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIQF 141
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+DN PFK M+R+ T IVNMMKA RL+ SQGGPIILSQIENEYG +E+ G Y
Sbjct: 142 RTDNGPFKAQMQRFTTKIVNMMKAERLFQSQGGPIILSQIENEYGPMEYELGAPGKVYTD 201
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WAA +A+ L TGVPWVMCKQDDAPDP+INACNG C + PN KP +WTE WT +Y
Sbjct: 202 WAAHMALGLGTGVPWVMCKQDDAPDPIINACNGFYC--DYFSPNKAYKPKMWTEAWTGWY 259
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAP 322
+G R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y AP
Sbjct: 260 TEFGGAVPSRPAEDLAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAP 318
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLV 381
LDEYGLLRQPKWGHLK+LH A+KLC ++S QEA +F+ S CAAFL
Sbjct: 319 LDEYGLLRQPKWGHLKDLHRAIKLCEPALVSADPTVTPLGTYQEAHVFKSKSGACAAFLA 378
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK--------------LDSVEQW 427
N + R+ A V F N+ Y LPP SISILPDCK +NTA+ L W
Sbjct: 379 NYNPRSFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSAQMKMPRVPLHGAFSW 438
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVS 481
+ Y + TY +TS LLEQ+NTT+D+SDYLWY K DP S VL +
Sbjct: 439 QAYNDETATYADTSFTTAGLLEQINTTRDSSDYLWYLTDVKIDPNEEFLRSGKYPVLTIL 498
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S GH L FING+ G+++G T + V+L G N ++LLS+ VGLP+ G + E
Sbjct: 499 SAGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLRAGINQIALLSIAVGLPNVGPHFET 558
Query: 542 RVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STH 597
AG L V + G E +D S W Y+VGL GE L + + GS V W + GS +
Sbjct: 559 WNAGVLGPVILNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWIQ-GSLVTRR 617
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------- 642
QPLTWYKT F+AP G+ P+A+++ SMGKG+ W+NG+SIGRYW ++
Sbjct: 618 QPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRSIGRYWPAYKASGSCGACNYAGSY 677
Query: 643 -----LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
L+ G SQ WYH+PR++L PTGNLLV+LEE G P GI + + ++C + +
Sbjct: 678 HEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEEWGGDPNGIFLVRREIDSICADIYE 737
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
P ++SW Q Q + K K + RPK + C G+KIS I FAS+G P G C ++
Sbjct: 738 WQ-PNLMSW--QMQASGKVKKPV---RPKAHLSCGPGQKISSIKFASFGTPEGGCGSFRE 791
Query: 758 GSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
GSCH+ NS +++C+G+ SC+V V E F GDPCP + K L V+A C+
Sbjct: 792 GSCHAHNSYDAFQRSCIGQNSCSVTVAPENFGGDPCPNVMKKLSVEAICS 841
>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
Length = 843
Score = 764 bits (1974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/825 (48%), Positives = 523/825 (63%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD ++++ING R+IL SGSIHYPRSTP+MWP LI +AK+GGLDV+QT VFWN HEP
Sbjct: 29 SVSYDSKAIVINGQRRILISGSIHYPRSTPEMWPDLIQRAKDGGLDVIQTYVFWNGHEPS 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F DLV+FIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 89 PGKYYFEDNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIQFRTDNG 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ T IVNMMKA RL+ S GGPIILSQIENEYG +E+ G Y WAA++
Sbjct: 149 PFKDQMQRFTTKIVNMMKAERLFESHGGPIILSQIENEYGPMEYEIGAPGKAYTDWAAQM 208
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDPVINACNG C + PN KP +WTE WT ++ +G
Sbjct: 209 AVGLGTGVPWVMCKQDDAPDPVINACNGFYC--DYFSPNKAYKPKMWTEAWTGWFTEFGG 266
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA F+ K G+++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 267 AVPYRPAEDLAFSVAKFLQK-GGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 325
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LLRQPKWGHLK+LH A+KLC ++S QEA +F+ +S CAAFL N +++
Sbjct: 326 LLRQPKWGHLKDLHRAIKLCEPALVSSDPTVTPLGTYQEAHVFKSNSGACAAFLANYNRK 385
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKE 432
+ A V F N+ Y LPP SISILPDCK +NTA++ + W+ Y +
Sbjct: 386 SFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARIGAQTARMKMPRVPIHGGFSWQAYND 445
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHV 486
TY +TS LLEQ+N T+DA+DYLWY K DPS+ + VL V S GH
Sbjct: 446 ETATYSDTSFTTAGLLEQINITRDATDYLWYMTDVKIDPSEDFLRSGNYPVLTVLSAGHA 505
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
L FING+ G+A+G T ++ V+L G N ++LLS+ VGLP+ G + E AG
Sbjct: 506 LRVFINGQLAGTAYGSLETPKLTFKQGVNLRAGINQIALLSIAVGLPNVGPHFETWNAGI 565
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTW 602
L V + G E +D S W Y++GL GE L + + GS V W+ GS + QPLTW
Sbjct: 566 LGPVILNGLNEGRRDLSWQKWSYKIGLKGEALSLHSLTGSSSVEWTE-GSFVAQRQPLTW 624
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF-------------------- 642
YKT F+ P G+ P+A+++ SMGKG+ W+N +SIGRYW ++
Sbjct: 625 YKTTFNRPAGNSPLALDMGSMGKGQVWINDRSIGRYWPAYKASGTCGECNYAGTFSEKKC 684
Query: 643 LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
L+ G SQ WYH+PRS+L PTGNLLV+LEE G P GI + V ++C + + P
Sbjct: 685 LSNCGEASQRWYHVPRSWLNPTGNLLVVLEEWGGDPNGIFLVRREVDSVCADIYEWQ-PN 743
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
++SW Q Q + + +K + RPK + C G+KIS I FAS+G P G C ++ G CH+
Sbjct: 744 LMSW--QMQVSGRVNKPL---RPKAHLSCGPGQKISSIKFASFGTPEGVCGSFREGGCHA 798
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S E++C+G+ SC+V V E F GDPCP + K L V+A C+
Sbjct: 799 HKSYNAFERSCIGQNSCSVTVSPENFGGDPCPNVMKKLSVEAICS 843
>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 854
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/823 (48%), Positives = 523/823 (63%), Gaps = 56/823 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++ING R+IL SGSIHYPRSTP+MW LI KAK+GGLDVV+T VFWN+HEP P
Sbjct: 28 VTYDRKAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPTP 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNEP
Sbjct: 88 GNYNFEGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV +MK+ L+ SQGGPIILSQIENEYG F G Y+ WAA++A
Sbjct: 148 FKRAMQGFTQKIVGLMKSESLFESQGGPIILSQIENEYGAQSKLFGAAGHNYITWAAEMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK++DAPDPVIN CNG C ++F+ PN P KP IWTE W+ ++ +G
Sbjct: 208 VGLDTGVPWVMCKEEDAPDPVINTCNGFYC-DSFS-PNRPYKPTIWTETWSGWFTEFGGP 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R +D+AY VA FI K GS+VNYYMYHGGTNFGRTA +T YD APLDEYGL
Sbjct: 266 IHQRPVQDLAYAVATFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGL 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
+RQPK+GHLKELH A+K+C + ++S + + Q+A+++ S +C+AFL N D ++
Sbjct: 325 IRQPKYGHLKELHKAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSKS 384
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAI 434
A V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE Y E +
Sbjct: 385 AARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQMLPTNIPMLSWESYDEDL 444
Query: 435 PTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVL 487
+ D++S + A LLEQ+N T+D++DYLWY D S+S L V S GH +
Sbjct: 445 TSMDDSSTMTAPGLLEQINVTRDSTDYLWYITSVDIDSSESFLHGGELPTLIVQSTGHAV 504
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
H FING+ GSA G + FT V+L GTN ++LLSV VGLP+ G + E G L
Sbjct: 505 HIFINGQLTGSAFGTRESRRFTYTGKVNLRAGTNKIALLSVAVGLPNVGGHFEAWNTGIL 564
Query: 547 RNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW---SRYGSSTHQPLTW 602
V++ G + K D S W YQVGL GE + + + V W S QPLTW
Sbjct: 565 GPVALHGLNQGKWDLSWQKWTYQVGLKGEAMNLVSQNAFSSVEWISGSLIAQKKQQPLTW 624
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
+KT+F+ P GS+P+A+++ MGKG+ W+NGQSIGRYW +F
Sbjct: 625 HKTIFNEPEGSEPLALDMEGMGKGQIWINGQSIGRYWTAFANGNCNGCSYAGGFRPTKCQ 684
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q +YH+PRS+LKPT NLLVL EE G P IS+ +V+++C V++ H P +
Sbjct: 685 SGCGKPTQRYYHVPRSWLKPTQNLLVLFEELGGDPSRISLVKRAVSSVCSEVAEYH-PTI 743
Query: 704 ISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
+W ++ ++ PKV +RC G+ IS I FAS+G P G C +Y G+CH++
Sbjct: 744 KNWHIESYGKVEDF-----HSPKVHLRCNPGQAISSIKFASFGTPLGTCGSYQEGTCHAT 798
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
S ++V+K C+GK+ C V + F GDPCP + K L V+A C
Sbjct: 799 TSYSVVQKKCIGKQRCAVTISNSNF-GDPCPKVLKRLSVEAVC 840
>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
Length = 839
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/820 (48%), Positives = 515/820 (62%), Gaps = 52/820 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++I+G R+ILFSGSIHYPRSTP+MW L KAK+GGLDV+QT VFWN HEP P
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPEMWEGLFQKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLV+FIK Q GL+V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 87 GNYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK+ L+ASQGGPIILSQIENEYG SF G Y WAAK+A
Sbjct: 147 FKTAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEGKSFGAAGKSYSNWAAKMA 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCKQDDAPDPVINACNG C + F+ PN P KP +WTE WT ++ +G
Sbjct: 207 VGLDTGVPWVMCKQDDAPDPVINACNGFYC-DAFS-PNKPYKPTMWTEAWTGWFTEFGGT 264
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R R ED+++ VA F+ K GS++NYYMYHGGTNFGRTA +T YD APLDEYGL
Sbjct: 265 IRKRPVEDLSFAVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGL 323
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+GHLKELH AVKLC ++S +QEA +F+ S CAAFL N + ++
Sbjct: 324 AREPKYGHLKELHRAVKLCEPALVSVDPAVTTLGSMQEAHVFRSPSSCAAFLANYNSNSH 383
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEAIP 435
A V F+N Y LPP SISILPDCKTV FNTA + +S WE Y E +
Sbjct: 384 ANVVFNNEHYSLPPWSISILPDCKTVVFNTATVGVQTSQMQMWADGESSMMWERYDEEVG 443
Query: 436 TYDETSLRANF-LLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ L LLEQ+N T+D+SDYLWY PS+ L V S GH LH
Sbjct: 444 SLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDVSPSEKFLQGGEPLSLTVQSAGHALH 503
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
FING+ GSA G K F+ + +L GTN ++LLS+ GLP+ G + E G+
Sbjct: 504 IFINGQLQGSASGTREAKKFSYKGNANLRAGTNKIALLSIACGLPNVGVHYETWNTGIVG 563
Query: 549 VSIQGAKEL--KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
+ ++ +D + +W YQVGL GE++ + + G+ V W + PL+WY+
Sbjct: 564 PVVLHGLDVGSRDLTWQTWSYQVGLKGEQMNLNSLEGASSVEWMQGSLLAQAPLSWYRAY 623
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------G 647
FD PTG +P+A+++ SMGKG+ W+NGQSIGRY S+ + G
Sbjct: 624 FDTPTGDEPLALDMGSMGKGQIWINGQSIGRYSTSYASGDCKACSYAGSYRAPKCQAGCG 683
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWR 707
P+Q WYH+P+S+L+P+ NLLV+ EE G IS+ SV+++C VS+ H + +W+
Sbjct: 684 QPTQRWYHVPKSWLQPSRNLLVVFEELGGDSSKISLVKRSVSSVCADVSEYHT-NIKNWQ 742
Query: 708 SQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRA 767
+N ++ H RPKV +RC G+ IS I FAS+G P G C N+ G CHS+ S A
Sbjct: 743 IENAGEVEFH------RPKVHLRCAPGQTISAIKFASFGTPLGTCGNFQQGDCHSTKSHA 796
Query: 768 IVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
++EK C+G++ C V + + F GDPCP K + V+A C+
Sbjct: 797 VLEKNCIGQQRCAVTISPDNFGGDPCPKEMKKVAVEAVCS 836
>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
Length = 856
Score = 764 bits (1973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/848 (47%), Positives = 531/848 (62%), Gaps = 64/848 (7%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
L C GLL+ +G G VTYD ++L+ING R+ILFSGSIHYPRSTP MW LI
Sbjct: 15 LWCCLGLLILGVGFVQCG------VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLI 68
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GG+DV++T VFWNLHEP PG++DF GR DLVRF+K + GLY LRIGP++ E
Sbjct: 69 QKAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKAIHKAGLYAHLRIGPYVCAE 128
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK MK + IV +MK+ L+ SQGGPIILSQIEN
Sbjct: 129 WNFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIEN 188
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
EYG +G Y+ WAAK+A+ +TGVPWVMCK+DDAPDPVI+ CNG C ++FA
Sbjct: 189 EYGRQGQILGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVISTCNGFYC-DSFA- 246
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN P KP IWTE W+ ++ +G R +D+A+ VA FI K GS+VNYYMYHGGTNF
Sbjct: 247 PNKPYKPTIWTEAWSGWFTEFGGPMHHRPVQDLAFAVARFIQK-GGSFVNYYMYHGGTNF 305
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA +T YD AP+DEYGL+RQPK+GHLKELH A+K+C K ++S V +
Sbjct: 306 GRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSTDPVVTSLGNK 365
Query: 365 QEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-- 421
Q+A ++ S +C+AFL N D + A V F+N+ Y LPP SISILPDC+ FNTAK+
Sbjct: 366 QQAHVYSSESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGV 425
Query: 422 --DSVE---------QWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKH 469
+E QW+ Y E + + D++S LLEQ+N T+D SDYLWY
Sbjct: 426 QTSQMEMLPTSTGSFQWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSV-- 483
Query: 470 DPSDSESVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTN 521
D ++ES L + S GH +H F+NG+ GSA G ++ FT + ++L +GTN
Sbjct: 484 DIGETESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYKGKINLHSGTN 543
Query: 522 NVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIF 579
++LLSV VGLP+ G + E G L V++ G + K D S W YQVGL GE + +
Sbjct: 544 RIALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLA 603
Query: 580 TDYGSRIVPWSRYGSSTH--QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGR 637
+ W + QPLTW+KT FDAP G++P+A+++ MGKG+ WVNG+SIGR
Sbjct: 604 YPTNTPSFGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGR 663
Query: 638 YWVSFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYP 678
YW +F T G P+Q WYH+PRS+LKP+ NLLV+ EE G P
Sbjct: 664 YWTAFATGDCGHCSYTGTYKPNKCNSGCGQPTQKWYHVPRSWLKPSQNLLVIFEELGGNP 723
Query: 679 PGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKIS 738
+S+ SV+ +C VS+ H P + +W+ ++ +T RRPKV ++C G+ IS
Sbjct: 724 STVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTF-----RRPKVHLKCSPGQAIS 777
Query: 739 KILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPK 798
I FAS+G P G C +Y G CH++ S AI+E+ C+GK C V + F DPCP + K
Sbjct: 778 AIKFASFGTPLGTCGSYQQGDCHAATSYAILERKCVGKARCAVTISNSNFGKDPCPNVLK 837
Query: 799 ALLVDAQC 806
L V+A C
Sbjct: 838 RLTVEAVC 845
>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 849
Score = 763 bits (1970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/827 (49%), Positives = 518/827 (62%), Gaps = 60/827 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+ILFSGSIHYPRSTP MW LI KAKEGGLDV++T VFWN+HEP
Sbjct: 31 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVHEPS 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G ++F GR DLVRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 91 RGNYNFEGRYDLVRFVKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK+ RLY SQGGPIILSQIENEYG G YV WAAK+
Sbjct: 151 PFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGSAGQNYVNWAAKM 210
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV+ TGVPWVMCK+DDAPDPVIN CNG C + PN P KP+IWTE W+ ++ +G
Sbjct: 211 AVETGTGVPWVMCKEDDAPDPVINTCNGFYC--DYFTPNKPYKPSIWTEAWSGWFSEFGG 268
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI K GS+VNYYMYHGGTNFGRTA +T YD APLDEYG
Sbjct: 269 PNHERPVQDLAFGVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 327
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH A+K+C + ++S + Q+A ++ S +CAAFL N D +
Sbjct: 328 LIRQPKYGHLKELHKAIKMCERALVSTDPAVTSLGNFQQAHVYSAKSGDCAAFLSNFDTK 387
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
++ V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE + E
Sbjct: 388 SSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQMLPTNTRMFSWESFDED 447
Query: 434 IPTYDETS---LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSS 482
I + D+ S + LLEQ+N T+D SDYLWY D SES L+ V S
Sbjct: 448 ISSLDDGSSITTTTSGLLEQINVTRDTSDYLWYITSV--DIGSSESFLRGGKLPTLIVQS 505
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH +H FING+ GSA+G D+ FT V+L GTN ++LLSV VGLP+ G + E
Sbjct: 506 TGHAVHVFINGQLSGSAYGTREDRRFTYTGTVNLRAGTNRIALLSVAVGLPNVGGHFETW 565
Query: 543 VAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQ 598
G L V ++G + K D S W YQVGL GE + + + G V W S S +Q
Sbjct: 566 NTGILGPVVLRGFDQGKLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQSALVSDKNQ 625
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLT 644
PLTW+KT FDAP G +P+A+++ MGKG+ W+NG SIGRYW +F
Sbjct: 626 PLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYWTALAAGNCNGCSYAGTFRP 685
Query: 645 PQ-----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
P+ G P+Q WYH+PRS+LKP NLLV+ EE G P IS+ SV+++C VS+ H
Sbjct: 686 PKCQVGCGQPTQRWYHVPRSWLKPDHNLLVVFEELGGDPSKISLVKRSVSSVCADVSEYH 745
Query: 700 LPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGS 759
P + +W + K+ + P PKV + C G+ IS I FAS+G P G C NY G
Sbjct: 746 -PNIRNWHIDSYG--KSEEFHP---PKVHLHCSPGQTISSIKFASFGTPLGTCGNYEKGV 799
Query: 760 CHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
CHSS S A +EK C+GK CTV V F DPCP + K L V+A C
Sbjct: 800 CHSSTSHATLEKKCIGKPRCTVTVSNSNFGQDPCPNVLKRLSVEAVC 846
>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 856
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/847 (47%), Positives = 531/847 (62%), Gaps = 65/847 (7%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
CL G L+ +G G VTYD ++L+ING R+ILFSGSIHYPRSTP MW LI
Sbjct: 17 FCL-GFLILGVGFVQCG------VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQ 69
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KAK+GG+DV++T VFWNLHEP PG++DF GR DLVRF+K + GLY LRIGP++ EW
Sbjct: 70 KAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEW 129
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
+GG P WL VPGI FR+DNEPFK MK + IV +MK+ L+ SQGGPIILSQIENE
Sbjct: 130 NFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENE 189
Query: 187 YGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP 246
YG +G Y+ WAAK+A+ +TGVPWVMCK+DDAPDPVIN CNG C ++FA P
Sbjct: 190 YGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC-DSFA-P 247
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
N P KP IWTE W+ ++ +G R +D+A+ VA FI K GS+VNYYMYHGGTNFG
Sbjct: 248 NKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQK-GGSFVNYYMYHGGTNFG 306
Query: 307 RTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
RTA +T YD AP+DEYGL+RQPK+GHLKELH A+K+C K ++S V + Q
Sbjct: 307 RTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQ 366
Query: 366 EAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--- 421
+A ++ S +C+AFL N D + A V F+N+ Y LPP SISILPDC+ FNTAK+
Sbjct: 367 QAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQ 426
Query: 422 -DSVE---------QWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+E QWE Y E + + D++S + LLEQ+N T+D SDYLWY D
Sbjct: 427 TSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSV--D 484
Query: 471 PSDSESVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNN 522
DSES L + S GH +H F+NG+ GSA G ++ FT + ++L +GTN
Sbjct: 485 IGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNR 544
Query: 523 VSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFT 580
++LLSV VGLP+ G + E G L V++ G + K D S W YQVGL GE + +
Sbjct: 545 IALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAF 604
Query: 581 DYGSRIVPWSRYGSSTH--QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY 638
+ + W + QPLTW+KT FDAP G++P+A+++ MGKG+ WVNG+SIGRY
Sbjct: 605 PTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRY 664
Query: 639 WVSFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
W +F T G P+Q WYH+PR++LKP+ NLLV+ EE G P
Sbjct: 665 WTAFATGDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPS 724
Query: 680 GISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISK 739
+S+ SV+ +C VS+ H P + +W+ ++ +T RPKV ++C G+ I+
Sbjct: 725 TVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTF-----HRPKVHLKCSPGQAIAS 778
Query: 740 ILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKA 799
I FAS+G P G C +Y G CH++ S AI+E+ C+GK C V + F DPCP + K
Sbjct: 779 IKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISNSNFGKDPCPNVLKR 838
Query: 800 LLVDAQC 806
L V+A C
Sbjct: 839 LTVEAVC 845
>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
Length = 841
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/823 (49%), Positives = 523/823 (63%), Gaps = 54/823 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD R+++ING R+IL SGSIHYPRS+P+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 29 SVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G++ F GR DLVRFIK V+ GLYV LRIGP++ EW +GG P WL V GI FR++NE
Sbjct: 89 QGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTNNE 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK+HM+R+ IV+MMK+ L+ SQGGPIILSQIENEYG +E+ G Y WAAK+
Sbjct: 149 PFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAKM 208
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDP+IN CNG C + PN KP +WTE WT ++ +G
Sbjct: 209 AVGLGTGVPWVMCKQDDAPDPIINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWFTEFGG 266
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDE+G
Sbjct: 267 AVPHRPAEDLAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFG 325
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LLRQPKWGHLK+LH A+KLC ++SG + +EA +F S CAAFL N + R
Sbjct: 326 LLRQPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPR 385
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------WEEYKEAI 434
+ A V F N+ Y LPP SISILPDCK +NTA+L + W+ Y E
Sbjct: 386 SYAKVSFRNMHYNLPPWSISILPDCKNTVYNTARLGAQSATMKMTPVSGRFGWQSYNEET 445
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD------PSDSESVLKVSSLGHVLH 488
+YD++S A LLEQ+NTT+D SDYLWY+ K S VL V S GH LH
Sbjct: 446 ASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLTVLSAGHALH 505
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING G+A+G + T + V L G N ++LLS+ VGLP+ G + E AG L
Sbjct: 506 VFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHFETWNAGVLG 565
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
VS+ G E +D S W Y+VGL GE L + + GS V W GS + QPLTWYK
Sbjct: 566 PVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVE-GSLMARGQPLTWYK 624
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LT 644
T F+AP G+ P+A+++ SMGKG+ W+NGQ++GRYW ++ L+
Sbjct: 625 TTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKATGGCGDCNYAGTYSEKKCLS 684
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVI 704
G PSQ WYH+P S+L PTGNLLV+ EE G P GIS+ + ++C + + P ++
Sbjct: 685 NCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIESVCADIYEWQ-PTLM 743
Query: 705 SWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
++ + Q + K +K + RPK + C G+KIS I FAS+G P G C +Y GSCH+
Sbjct: 744 NY--EMQASGKVNKPL---RPKAHLWCAPGQKISSIKFASFGTPEGVCGSYREGSCHAHK 798
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S E++C+G SC+V V E F GDPCP + K L V+A C+
Sbjct: 799 SYDAFERSCIGMNSCSVTVAPEIFGGDPCPSVMKKLSVEAICS 841
>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 763 bits (1969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/822 (48%), Positives = 516/822 (62%), Gaps = 53/822 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++ING R+ILFSGSIHYPRSTP+MW LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLV+FIK Q GL+V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK+ L+ASQGGPIILSQIENEYG E F G Y WAAK+A
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCKQ+DAPDPVINACNG C + F PN+P KP +WTE WT ++ +G
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYC-DAFT-PNTPSKPTMWTEAWTGWFTEFGGT 269
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R R ED+++ VA F+ K GS++NYYMYHGGTNFGRTA +T YD APLDEYGL
Sbjct: 270 IRKRPVEDLSFAVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGL 328
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+GHLKELH A+KLC + ++S + +QEA +++ S CAAFL N + ++
Sbjct: 329 AREPKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSGCAAFLANYNSNSH 388
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIP 435
A + F N Y LPP SISILPDCKTV +NTA + S WE Y E +
Sbjct: 389 AKIVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTSQMQMWSDGASSMMWERYDEEVG 448
Query: 436 TYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVSSLGHVLH 488
+ L LLEQ+N T+D SDYLWY PS+ L V S GH LH
Sbjct: 449 SLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTVQSAGHALH 508
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
F+NG+ GSA G DK + + V L GTN +SLLSV GLP+ G + E G+
Sbjct: 509 IFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYETWNTGVNG 568
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYK 604
V + G E +D + +W YQVGL GE++ + + G+ V W + + PL WY+
Sbjct: 569 PVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQNQMPLAWYR 628
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------------ 646
FD P+G +P+A+++ SMGKG+ W+NGQSIGRY +++ T
Sbjct: 629 AYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAYATGDCKDCSYTGSFRAIKCQAG 688
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G P+Q WYH+P+S+L+PT NLLV+ EE G IS+ SV+ +C VS+ H P + +
Sbjct: 689 CGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVCADVSEFH-PSIKN 747
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
W+++N K RR KV +RC G+ IS I FAS+G P G C ++ G CHS+ S
Sbjct: 748 WQTENSGEAKPEL----RRSKVHLRCAPGQSISAIKFASFGTPLGTCGSFEQGQCHSTKS 803
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ ++E C+GK+ C V + + F GDPCP + K + V+A C+
Sbjct: 804 QTVLEN-CIGKQRCAVTISPDNFGGDPCPNVMKRVAVEAVCS 844
>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 762 bits (1968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/847 (47%), Positives = 531/847 (62%), Gaps = 65/847 (7%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
CL G L+ +G G VTYD ++L+ING R+ILFSGSIHYPRSTP MW LI
Sbjct: 14 FCL-GFLILGVGFVQCG------VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQ 66
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KAK+GG+DV++T VFWNLHEP PG++DF GR DLVRF+K + GLY LRIGP++ EW
Sbjct: 67 KAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEW 126
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
+GG P WL VPGI FR+DNEPFK MK + IV +MK+ L+ SQGGPIILSQIENE
Sbjct: 127 NFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENE 186
Query: 187 YGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP 246
YG +G Y+ WAAK+A+ +TGVPWVMCK+DDAPDPVIN CNG C ++FA P
Sbjct: 187 YGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC-DSFA-P 244
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
N P KP IWTE W+ ++ +G R +D+A+ VA FI K GS+VNYYMYHGGTNFG
Sbjct: 245 NKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQK-GGSFVNYYMYHGGTNFG 303
Query: 307 RTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
RTA +T YD AP+DEYGL+RQPK+GHLKELH A+K+C K ++S V + Q
Sbjct: 304 RTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQ 363
Query: 366 EAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--- 421
+A ++ S +C+AFL N D + A V F+N+ Y LPP SISILPDC+ FNTAK+
Sbjct: 364 QAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQ 423
Query: 422 -DSVE---------QWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+E QWE Y E + + D++S + LLEQ+N T+D SDYLWY D
Sbjct: 424 TSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSV--D 481
Query: 471 PSDSESVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNN 522
DSES L + S GH +H F+NG+ GSA G ++ FT + ++L +GTN
Sbjct: 482 IGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNR 541
Query: 523 VSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFT 580
++LLSV VGLP+ G + E G L V++ G + K D S W YQVGL GE + +
Sbjct: 542 IALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAF 601
Query: 581 DYGSRIVPWSRYGSSTH--QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY 638
+ + W + QPLTW+KT FDAP G++P+A+++ MGKG+ WVNG+SIGRY
Sbjct: 602 PTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRY 661
Query: 639 WVSFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
W +F T G P+Q WYH+PR++LKP+ NLLV+ EE G P
Sbjct: 662 WTAFATGDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPS 721
Query: 680 GISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISK 739
+S+ SV+ +C VS+ H P + +W+ ++ +T RPKV ++C G+ I+
Sbjct: 722 TVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTF-----HRPKVHLKCSPGQAIAS 775
Query: 740 ILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKA 799
I FAS+G P G C +Y G CH++ S AI+E+ C+GK C V + F DPCP + K
Sbjct: 776 IKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISNSNFGKDPCPNVLKR 835
Query: 800 LLVDAQC 806
L V+A C
Sbjct: 836 LTVEAVC 842
>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
Length = 849
Score = 762 bits (1968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/822 (48%), Positives = 511/822 (62%), Gaps = 55/822 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 38 SVSYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 97
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F GR DLV+FIK V+ GLYV LRIGP+ EW +GG P WL +PGI FR+DNE
Sbjct: 98 PGEYYFEGRYDLVKFIKLVKEAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTDNE 157
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M + IV+MMK L+ +QGGPIILSQIENEYG VE G Y +WAA +
Sbjct: 158 PFKTAMAGFTKKIVDMMKEEELFETQGGPIILSQIENEYGPVEWEIGAPGQAYTKWAANM 217
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDP+IN CN C + PN KP +WTE WTS++ +G
Sbjct: 218 AVGLGTGVPWVMCKQDDAPDPIINTCNDHYC--DWFSPNKNYKPTMWTEAWTSWFTAFGG 275
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ +A FI + GS++NYYMYHGGTNFGRTA +V T Y AP+DEYG
Sbjct: 276 PVPYRPAEDMAFAIAKFIQR-GGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEYG 334
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPKWGHLK+LH A+K+C ++SG + + QE+ +F+ S +CAAFL N D++
Sbjct: 335 LIRQPKWGHLKDLHKAIKMCEAALVSGDPIVTSLGSSQESHVFKSESGDCAAFLANYDEK 394
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-------------QWEEYKEA 433
+ A V F + Y LPP SISILPDC FNTA++ + WE Y E
Sbjct: 395 SFAKVAFQGMHYNLPPWSISILPDCVNTVFNTARVGAQTSSMTMTSVNPDGFSWETYNEE 454
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVL 487
+YD+ S+ LLEQ+N T+D +DYLWY DP++ VL V S GH L
Sbjct: 455 TASYDDASITMEGLLEQINVTRDVTDYLWYTTDITIDPNEGFLKNGEYPVLTVMSAGHAL 514
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
H FINGE G+ +G + T V L+ G N +S+LS+ VGLP+ GA+ E G L
Sbjct: 515 HIFINGELSGTVYGSVDNPKLTYTGSVKLLAGNNKISVLSIAVGLPNIGAHFETWNTGVL 574
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKT 605
V + G E +D S +W Y++GL GE LQ+ + GS V WS + QPLTWYKT
Sbjct: 575 GPVVLNGLNEGRRDLSWQNWSYKIGLKGEALQLHSLTGSSSVEWSSL-IAQKQPLTWYKT 633
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTP 645
F+AP G+ P A+++ MGKG+ W+NGQSIGRYW ++ L
Sbjct: 634 TFNAPEGNGPFALDMSMMGKGQIWINGQSIGRYWPAYKAYGNCGECSYTGRYNEKKCLAN 693
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G SQ WYH+P S+L PT NLLV+ EE G P GIS+ + + C +S+ H P +
Sbjct: 694 CGEASQRWYHVPSSWLYPTANLLVVFEEWGGDPTGISLVRRTTGSACAFISEWH-PTLRK 752
Query: 706 WRSQNQRTLKTHKRIPG-RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
W +K + R RRPK + C G+KIS I FAS+G P G C N+ GSCH+
Sbjct: 753 WH------IKDYGRAERPRRPKAHLSCADGQKISSIKFASFGTPQGVCGNFTEGSCHAHK 806
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
S I EK C+G++ C+V + + F GDPCP + K L V+A C
Sbjct: 807 SYDIFEKNCVGQQWCSVTISPDVFGGDPCPNVMKNLAVEAIC 848
>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
lyrata]
Length = 853
Score = 762 bits (1967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/847 (47%), Positives = 532/847 (62%), Gaps = 65/847 (7%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
CL GLL+ +G G VTYD ++L+ING R+ILFSGSIHYPRSTP MW LI
Sbjct: 14 FCL-GLLILGVGFVQCG------VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQ 66
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KAK+GG+DV++T VFWNLHEP PG++DF GR DLVRF+K + GLY LRIGP++ EW
Sbjct: 67 KAKDGGIDVIETYVFWNLHEPTPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEW 126
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
+GG P WL VPGI FR+DNEPFK MK + IV +MK+ L+ SQGGPIILSQIENE
Sbjct: 127 NFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENE 186
Query: 187 YGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP 246
YG +G Y+ WAAK+A+ +TGVPWVMCK+DDAPDPVIN CNG C ++FA P
Sbjct: 187 YGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC-DSFA-P 244
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
N P KP IWTE W+ ++ +G R +D+A+ VA FI K GS+VNYYMYHGGTNFG
Sbjct: 245 NKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQK-GGSFVNYYMYHGGTNFG 303
Query: 307 RTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
RTA +T YD AP+DEYGL+R+PK+GHLKELH A+K+C K ++S V + Q
Sbjct: 304 RTAGGPFVTTSYDYDAPIDEYGLIREPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQ 363
Query: 366 EAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--- 421
+A ++ S +C+AFL N D + A V F+N+ Y LPP SISILPDC+ FNTAK+
Sbjct: 364 QAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQ 423
Query: 422 -DSVE---------QWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+E QW+ Y E + + D++S LLEQ+N T+D SDYLWY D
Sbjct: 424 TSQMEMLPTDTKNFQWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSV--D 481
Query: 471 PSDSESVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNN 522
D+ES L + S GH +H F+NG+ GSA G ++ FT + ++L +GTN
Sbjct: 482 IGDTESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNR 541
Query: 523 VSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFT 580
++LLSV VGLP+ G + E G L V++ G + K D S W YQVGL GE + +
Sbjct: 542 IALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLAF 601
Query: 581 DYGSRIVPWSRYGSSTH--QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY 638
+R + W + QPLTW+KT FDAP G++P+A+++ MGKG+ WVNG+SIGRY
Sbjct: 602 PTNTRSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRY 661
Query: 639 WVSFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
W +F T G P+Q +YH+PRS+LKP+ NLLV+ EE G P
Sbjct: 662 WTAFATGDCSQCSYTGTYKPNKCQTGCGQPTQRYYHVPRSWLKPSQNLLVIFEELGGNPS 721
Query: 680 GISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISK 739
+S+ SV+ +C VS+ H P + +W+ ++ +T RPKV ++C G+ I+
Sbjct: 722 SVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTF-----HRPKVHLKCSPGQAIAS 775
Query: 740 ILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKA 799
I FAS+G P G C +Y G CH++ S AI+E+ C+GK C V + F DPCP + K
Sbjct: 776 IKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARCAVTISNTNFGKDPCPNVLKR 835
Query: 800 LLVDAQC 806
L V+A C
Sbjct: 836 LTVEAVC 842
>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
Length = 851
Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/832 (48%), Positives = 521/832 (62%), Gaps = 64/832 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++++G R+ILFSGSIHYPRSTP+MW LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRFIK VQ G++V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQ----------IENEYGMVEHSFLEKGP 199
FK M+ + IV MMK+ L+ASQGGPIILSQ IENEYG F G
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 200 PYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENW 259
Y+ WAAK+AV L TGVPWVMCK+DDAPDPVINACNG C +TF+ PN P KP +WTE W
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC-DTFS-PNKPYKPTMWTEAW 264
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD 319
+ ++ +G R R ED+A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD
Sbjct: 265 SGWFTEFGGTIRQRPVEDLAFGVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYD 323
Query: 320 -QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAA 378
APLDEYGL R+PK+GHLKELH AVKLC +P++S +QEA +F+ SS CAA
Sbjct: 324 YDAPLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAA 383
Query: 379 FLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVE 425
FL N + + A V F+N Y LPP SISILPDCK V FNTA + S
Sbjct: 384 FLANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWADGASSM 443
Query: 426 QWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVL 478
WE+Y E + + L + LLEQ+N T+D SDYLWY + DPS+ + L
Sbjct: 444 MWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSL 503
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
V S GH LH FING+ GSA+G D+ + +L GTN V+LLSV GLP+ G +
Sbjct: 504 TVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVH 563
Query: 539 LERRVAGLRN-VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--S 594
E G+ V I G E +D + +W YQVGL GE++ + + GS V W + +
Sbjct: 564 YETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVA 623
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-------------- 640
QPL WY+ FD P+G +P+A+++ SMGKG+ W+NGQSIGRYW
Sbjct: 624 QNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAEGDCKGCHYTG 683
Query: 641 SFLTPQ-----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
S+ P+ G P+Q WYH+PRS+L+PT NLLV+ EE G I++ +V+ +C V
Sbjct: 684 SYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADV 743
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
S+ H P + +W+ ++ + H KV ++C G+ IS I FAS+G P G C +
Sbjct: 744 SEYH-PNIKNWQIESYGEPEFHT------AKVHLKCAPGQTISAIKFASFGTPLGTCGTF 796
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G CHS NS +++EK C+G + C V + F GDPCP + K + V+A C+
Sbjct: 797 QQGECHSINSNSVLEKKCIGLQRCVVAISPSNFGGDPCPEVMKRVAVEAVCS 848
>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
Length = 832
Score = 761 bits (1966), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/825 (48%), Positives = 518/825 (62%), Gaps = 66/825 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +S+IING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 26 SVTYDHKSVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F GR DLVRF+K V+ GLY LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 86 PGQYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNG 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M ++ IV+MMKA LY +QGGPIILSQIENEYG VE+ G Y WAAK+
Sbjct: 146 PFKAAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENEYGPVEYYDGAAGKSYTNWAAKM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDPVIN CNG C + PN +KP +WTE WT ++ +G
Sbjct: 206 AVGLNTGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNKDNKPKMWTEAWTGWFTGFGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ YD AP+DEYG
Sbjct: 264 AVPQRPAEDMAFAVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
LLRQPKWGHL++LH A+KLC ++SG + + QE+++++ S CAAFL N + R
Sbjct: 323 LLRQPKWGHLRDLHKAIKLCEPALVSGEPTITSLGQNQESYVYRSKSSCAAFLANFNSRY 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPT 436
ATV F+ + Y LPP S+SILPDCKT FNTA++ + W+ Y E
Sbjct: 383 YATVTFNGMHYNLPPWSVSILPDCKTTVFNTARVGAQTTTMKMQYLGGFSWKAYTEDTDA 442
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHVLH 488
++ + + L+EQ++TT D SDYLWY D + +E LK V S GH +H
Sbjct: 443 LNDNTFTKDGLVEQLSTTWDRSDYLWYTTYV--DIAKNEEFLKTGKYPYLTVMSAGHAVH 500
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING+ G+A+G + T L G+N +S+LSV VGLP+ G + E G L
Sbjct: 501 VFINGQLSGTAYGSLDNPKLTYSGSAKLWAGSNKISILSVSVGLPNVGNHFETWNTGVLG 560
Query: 548 NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V++ G E K D S W YQ+GL GE L + + GS V W +S QPLTWYKT
Sbjct: 561 PVTLTGLNEGKRDLSLQKWTYQIGLHGETLSLHSLTGSSNVEWGE--ASQKQPLTWYKTF 618
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQ 646
F+AP G++P+A+++ +MGKG+ W+NGQSIGRYW ++ L+
Sbjct: 619 FNAPPGNEPLALDMNTMGKGQIWINGQSIGRYWPAYKASGSCGSCDYRGTYNEKKCLSNC 678
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW 706
G SQ WYH+PRS+L PTGN LV+LEE G P GIS+ SV ++C V + P + +W
Sbjct: 679 GEASQRWYHVPRSWLIPTGNFLVVLEEWGGDPTGISMVKRSVASVCAEVEELQ-PTMDNW 737
Query: 707 RSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSR 766
R++ RPKV + C G+K+SKI FAS+G P G C +++ GSCH+ S
Sbjct: 738 RTKAY-----------GRPKVHLSCDPGQKMSKIKFASFGTPQGTCGSFSEGSCHAHKSY 786
Query: 767 AIVEKA-----CLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
E+ C+G+ C+V V E F GDPCPG K L V+A C
Sbjct: 787 DAFEQEGLMQNCVGQEFCSVNVAPEVFGGDPCPGTMKKLAVEAIC 831
>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
Length = 845
Score = 761 bits (1964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/826 (48%), Positives = 523/826 (63%), Gaps = 60/826 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+IL SGSIHYPRSTP MW +I KAK+GGLDVV+T VFWN+HEP
Sbjct: 27 SVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFI+ VQ GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 87 PGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV +MK+ RL+ SQGGPIILSQIENEYG+ + G Y+ WAA +
Sbjct: 147 PFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWAANM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN CNG C + F+ PN P KP IWTE W+ ++ +G
Sbjct: 207 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYC-DAFS-PNKPYKPTIWTEAWSGWFNEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI K GS+VNYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 265 PLHQRPVQDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH ++KLC + ++S + + Q+A ++ + +CAAFL N D +
Sbjct: 324 LVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSDAGDCAAFLSNYDTK 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEA 433
++A V F+N+ Y LPP SISILPDC+ FNTAK+ + WE Y E
Sbjct: 384 SSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEMLPTNAEMLSWESYDED 443
Query: 434 IPTYDETSLRANF-LLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
I + D++S LLEQ+N T+DASDYLWY R D SES L+ + + G
Sbjct: 444 ISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRI--DIGSSESFLRGGELPTLILQTTG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING+ GSA G + FT + V+L GTN ++LLSV VGLP+ G + E
Sbjct: 502 HAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIALLSVAVGLPNVGGHFETWNT 561
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPL 600
G L V++ G + K D S W Y+VGL GE + + + G V W + + QPL
Sbjct: 562 GILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLVSPNGISSVDWMQGSLAAQRQQPL 621
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+K F+AP G +P+A+++ MGKG+ W+NGQSIGRYW ++
Sbjct: 622 TWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGRYWTAYANGNCQGCSYSGTYRPPK 681
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LKPT NLLV+ EE G P IS+ S+T++C V + H P
Sbjct: 682 CQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGGDPSRISLVRRSMTSVCADVFEYH-P 740
Query: 702 PVISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
+ +W ++ +T + HK PKV +RC G+ IS I FASYG P G C ++ G C
Sbjct: 741 NIKNWHIESYGKTEELHK------PKVHLRCGPGQSISSIKFASYGTPLGTCGSFEQGPC 794
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
H+ +S AIVEK C+G++ C V + F DPCP + K L V+A C
Sbjct: 795 HAPDSYAIVEKRCIGRQRCAVTISNTNFAQDPCPNVLKRLSVEAVC 840
>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
Length = 851
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/832 (48%), Positives = 520/832 (62%), Gaps = 64/832 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++++G R+ILFSGSIHYPRSTP+MW LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRFIK VQ G++V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQ----------IENEYGMVEHSFLEKGP 199
FK M+ + IV MMK+ L+ASQGGPIILSQ IENEYG F G
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206
Query: 200 PYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENW 259
Y+ WAAK+AV L TGVPWVMCK+DDAPDPVINACNG C +TF+ PN P KP +WTE W
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYC-DTFS-PNKPYKPTMWTEAW 264
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD 319
+ ++ +G R R ED+A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD
Sbjct: 265 SGWFTEFGGTIRQRPVEDLAFGVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYD 323
Query: 320 -QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAA 378
APLDEYGL R+PK+GHLKELH AVKLC +P++S +QEA +F+ SS CAA
Sbjct: 324 YDAPLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAA 383
Query: 379 FLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVE 425
FL N + + A V F+N Y LPP SISILPDCK V FNTA + S
Sbjct: 384 FLANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWADGASSM 443
Query: 426 QWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVL 478
WE+Y E + + L + LLEQ+N T+D SDYLWY + DPS+ + L
Sbjct: 444 MWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSL 503
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
V S GH LH FING+ GSA+G D+ + +L GTN V+LLSV GLP+ G +
Sbjct: 504 TVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVH 563
Query: 539 LERRVAGLRN-VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--S 594
E G+ V I G E +D + +W YQVGL GE++ + + GS V W + +
Sbjct: 564 YETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVA 623
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-------------- 640
QPL WY+ FD P+G +P+A+++ SMGKG+ W+NGQSIGRYW
Sbjct: 624 QNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAEGDCKGCHYTG 683
Query: 641 SFLTPQ-----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
S+ P+ G P+Q WYH+PRS+L+PT NLLV+ EE G I++ +V+ +C V
Sbjct: 684 SYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADV 743
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
S+ H P + +W+ ++ + H KV ++C G+ IS I FAS+G P G C +
Sbjct: 744 SEYH-PNIKNWQIESYGEPEFHT------AKVHLKCAPGQTISAIKFASFGTPLGTCGTF 796
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G CHS NS +++E+ C+G C V + F GDPCP + K + V+A C+
Sbjct: 797 QQGECHSINSNSVLERKCIGLERCVVAISPSNFGGDPCPEVMKRVAVEAVCS 848
>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
Length = 898
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/826 (48%), Positives = 523/826 (63%), Gaps = 60/826 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+IL SGSIHYPRSTP MW +I KAK+GGLDVV+T VFWN+HEP
Sbjct: 80 SVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVHEPS 139
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFI+ VQ GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 140 PGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 199
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV +MK+ RL+ SQGGPIILSQIENEYG+ + G Y+ WAA +
Sbjct: 200 PFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWAANM 259
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN CNG C + F+ PN P KP IWTE W+ ++ +G
Sbjct: 260 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYC-DAFS-PNKPYKPTIWTEAWSGWFNEFGG 317
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI K GS+VNYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 318 PLHQRPVQDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 376
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH ++KLC + ++S + + Q+A ++ + +CAAFL N D +
Sbjct: 377 LVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSDAGDCAAFLSNYDTK 436
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEA 433
++A V F+N+ Y LPP SISILPDC+ FNTAK+ + WE Y E
Sbjct: 437 SSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKVGVQTAHMEMLPTNAEMLSWESYDED 496
Query: 434 IPTYDETSLRANF-LLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
I + D++S LLEQ+N T+DASDYLWY R D SES L+ + + G
Sbjct: 497 ISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRI--DIGSSESFLRGGELPTLILQTTG 554
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING+ GSA G + FT + V+L GTN ++LLSV VGLP+ G + E
Sbjct: 555 HAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIALLSVAVGLPNVGGHFETWNT 614
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPL 600
G L V++ G + K D S W Y+VGL GE + + + G V W + + QPL
Sbjct: 615 GILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLVSPNGISSVDWMQGSLAAQRQQPL 674
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+K F+AP G +P+A+++ MGKG+ W+NGQSIGRYW ++
Sbjct: 675 TWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGRYWTAYANGNCQGCSYSGTYRPPK 734
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LKPT NLLV+ EE G P IS+ S+T++C V + H P
Sbjct: 735 CQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGGDPSRISLVRRSMTSVCADVFEYH-P 793
Query: 702 PVISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
+ +W ++ +T + HK PKV +RC G+ IS I FASYG P G C ++ G C
Sbjct: 794 NIKNWHIESYGKTEELHK------PKVHLRCGPGQSISSIKFASYGTPLGTCGSFEQGPC 847
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
H+ +S AIVEK C+G++ C V + F DPCP + K L V+A C
Sbjct: 848 HAPDSYAIVEKRCIGRQRCAVTISNTNFAQDPCPNVLKRLSVEAVC 893
>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
Length = 853
Score = 760 bits (1962), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/848 (47%), Positives = 528/848 (62%), Gaps = 70/848 (8%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
L+C G L VTYD R+++ING R+IL SGSIHYPRSTP+MW LI
Sbjct: 15 LVCFLGFQLVQC-----------TVTYDRRAIVINGQRRILISGSIHYPRSTPEMWEDLI 63
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GGLDVV+T VFWN+HEP PG ++F GR DLVRF+K +Q GLY LRIGP++ E
Sbjct: 64 QKAKDGGLDVVETYVFWNVHEPSPGNYNFKGRYDLVRFLKTIQKAGLYAHLRIGPYVCAE 123
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK M+ + IV +MK+ +L+ SQGGPIILSQIEN
Sbjct: 124 WNFGGFPVWLKYVPGISFRTDNEPFKRAMQGFTEKIVGLMKSEKLFESQGGPIILSQIEN 183
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
EYG F G Y+ WAA +AV L TGVPWVMCK++DAPDPVIN CNG C ++FA
Sbjct: 184 EYGAQSKLFGAAGHNYMTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC-DSFA- 241
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN P KP IWTE W+ ++ +G R +D+AY VA FI K GS+VNYYMYHGGTNF
Sbjct: 242 PNKPYKPTIWTEAWSGWFSEFGGPIHQRPVQDLAYAVARFIQK-GGSFVNYYMYHGGTNF 300
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA +T YD APLDEYGL+RQPK+GHLKELH A+K+C + ++S + +
Sbjct: 301 GRTAGGPFITTSYDYDAPLDEYGLIRQPKYGHLKELHRAIKMCERALVSADPIITSLGNF 360
Query: 365 QEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD- 422
Q+A+++ S +C+AFL N D ++ A V F+N+ Y LPP SISILPDC+ V FNTAK+
Sbjct: 361 QQAYVYTSESGDCSAFLSNHDSKSAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGV 420
Query: 423 ------------SVEQWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKH 469
+ WE Y E I + D++S + A LLEQ+N T+D++DYLWY +
Sbjct: 421 QTSQMGMLPTNIQMLSWESYDEDITSLDDSSTITAPGLLEQINVTRDSTDYLWY--KTSV 478
Query: 470 DPSDSESVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTN 521
D SES L+ V S GH +H FING+ GS+ G + FT V+L GTN
Sbjct: 479 DIGSSESFLRGGELPTLIVQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVNLHAGTN 538
Query: 522 NVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIF 579
++LLSV VGLP+ G + E G L V++ G + K D S W YQVGL GE + +
Sbjct: 539 RIALLSVAVGLPNVGGHFEAWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLV 598
Query: 580 TDYGSRIVPWSR--YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGR 637
+ V W R + QPLTW+KT+F+AP G +P+A+++ MGKG+ W+NGQSIGR
Sbjct: 599 SPNSISSVDWMRGSLAAQKQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWINGQSIGR 658
Query: 638 YWVSFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYP 678
YW +F G P+Q YH+PRS+LKP NLLV+ EE G P
Sbjct: 659 YWTAFANGNCNGCSYAGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVIFEEFGGDP 718
Query: 679 PGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKIS 738
IS+ SV+++C V++ H P + +W ++ + PKV +RC G+ IS
Sbjct: 719 SRISLVKRSVSSVCAEVAEYH-PTIKNWHIESYGKAEDF-----HSPKVHLRCNPGQAIS 772
Query: 739 KILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPK 798
I FAS+G P G C +Y G+CH++ S ++++K C+GK+ C V + F GDPCP + K
Sbjct: 773 SIKFASFGTPLGTCGSYQEGTCHAATSYSVLQKKCIGKQRCAVTISNSNF-GDPCPKVLK 831
Query: 799 ALLVDAQC 806
L V+A C
Sbjct: 832 RLSVEAVC 839
>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 847
Score = 759 bits (1961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/822 (48%), Positives = 515/822 (62%), Gaps = 53/822 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++ING R+ILFSGSIHYPRSTP+MW LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 32 VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLV+FIK Q GL+V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 92 GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK+ L+ASQGGPIILSQIENEYG E F G Y WAAK+A
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCKQ+DAPDPVINACNG C + F PN+P KP +WTE WT ++ +G
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYC-DAFT-PNTPSKPTMWTEAWTGWFTEFGGT 269
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R R ED+++ VA F+ K GS++NYYMYHGGTNFGRTA +T YD APLDEYGL
Sbjct: 270 IRKRPVEDLSFAVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGL 328
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+GHLKELH A+KLC + ++S + +QEA +++ S CAAFL N + ++
Sbjct: 329 AREPKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVYRSPSGCAAFLANYNSNSH 388
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIP 435
A + F N Y LPP SISILPDCKTV +NTA + S WE Y E +
Sbjct: 389 AKIVFDNEHYSLPPWSISILPDCKTVVYNTATVGVQTSQMQMWSDGASSMMWERYDEEVG 448
Query: 436 TYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVSSLGHVLH 488
+ L LLEQ+N T+D SDYLWY PS+ L V S GH LH
Sbjct: 449 SLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTVQSAGHALH 508
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
F+NG+ GSA G DK + + V L GTN +SLLSV GLP+ G + E G+
Sbjct: 509 IFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYETWNTGVNG 568
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYK 604
V + G E +D + +W YQVGL GE++ + + G+ V W + + PL WY+
Sbjct: 569 PVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQNQMPLAWYR 628
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------------ 646
FD P+G +P+A+++ SMGKG+ W+NGQSIGRY +++ T
Sbjct: 629 AYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAYATGDCKDCSYTGSFRAIKCQAG 688
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G P+Q WYH+P+ +L+PT NLLV+ EE G IS+ SV+ +C VS+ H P + +
Sbjct: 689 CGQPTQRWYHVPKPWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVCADVSEFH-PSIKN 747
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
W+++N K RR KV +RC G+ IS I FAS+G P G C ++ G CHS+ S
Sbjct: 748 WQTENSGEAKPEL----RRSKVHLRCAPGQSISAIKFASFGTPLGTCGSFEQGQCHSTKS 803
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ ++E C+GK+ C V + + F GDPCP + K + V+A C+
Sbjct: 804 QTVLEN-CIGKQRCAVTISPDNFGGDPCPNVMKRVAVEAVCS 844
>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
Length = 838
Score = 759 bits (1960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/823 (48%), Positives = 517/823 (62%), Gaps = 55/823 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING R+IL SGSIHYPRSTP+MWP LI KAK+GG+DV+QT VFWN HEP
Sbjct: 27 SVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGVDVIQTYVFWNGHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG + F R DLV+FIK VQ GLY+ LRIGP+I EW +GG P WL VPGI FR+DN
Sbjct: 87 PGNYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDNG 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV MMK+ +L+ +QGGPIILSQIENEYG VE G Y +WAA +
Sbjct: 147 PFKAAMQKFTEKIVGMMKSEKLFENQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAADM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW+MCKQ+DAPDP+I+ CNG C E F PN KP IWTE WT +Y +G
Sbjct: 207 AVKLGTGVPWIMCKQEDAPDPMIDTCNGFYC-ENFK-PNKDYKPKIWTEAWTGWYTEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA FI + GSY+NYYMYHGGTNFGRTA ++ T Y APLDE+G
Sbjct: 265 AVPHRPAEDMAFSVARFI-QNGGSYINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L R+PKWGHL++LH A+KLC ++S + QEA +F+ S CAAFL N D +
Sbjct: 324 LPREPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVFKSKSVCAAFLANYDTKY 383
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIP 435
+ V F N YELPP S+SILPDCKT +NTA+L S W+ Y E
Sbjct: 384 SVKVTFGNGQYELPPWSVSILPDCKTAVYNTARLGSQSSQMKMVPASSSFSWQSYNEETA 443
Query: 436 TY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLH 488
+ D+ + N L EQ+N T+DA+DYLWY K D S +L + S GH LH
Sbjct: 444 SADDDDTTTMNGLWEQINVTRDATDYLWYLTDVKIDADEGFLKSGQNPLLTIFSAGHALH 503
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING+ G+A+G S+ T + + L G N +SLLSV VGLP+ G + E AG L
Sbjct: 504 VFINGQLAGTAYGGLSNPKLTFSQNIKLTEGINKISLLSVAVGLPNVGLHFETWNAGVLG 563
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
++++G E +D S W Y++GL GE L + T GS V W GS + Q LTWYK
Sbjct: 564 PITLKGLNEGTRDLSGQKWSYKIGLKGESLSLHTASGSESVEWVE-GSLLAQKQALTWYK 622
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------T 644
T FDAP G+DP+A+++ SMGKG+ W+NGQ+IGR+W ++ T
Sbjct: 623 TAFDAPQGNDPLALDMSSMGKGQMWINGQNIGRHWPGYIAHGSCGDCNYAGTFDDKKCRT 682
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVI 704
G PSQ WYH+PRS+LKP+GNLL + EE G P GIS + ++C + + P +
Sbjct: 683 NCGEPSQRWYHVPRSWLKPSGNLLAVFEEWGGDPTGISFVKRTTASVCADIFEGQ-PALK 741
Query: 705 SWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
+W++ ++ +PK + CP+G+KIS+I FAS+G P G C ++ GSCH+
Sbjct: 742 NWQA------IASGKVISPQPKAHLWCPTGQKISQIKFASFGMPQGTCGSFREGSCHAHK 795
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S E+ C+GK+SC+V V E F GDPCP K L V+A C+
Sbjct: 796 SYDAFERNCVGKQSCSVTVAPEVFGGDPCPDSAKKLSVEAVCS 838
>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
Length = 841
Score = 759 bits (1959), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/848 (47%), Positives = 530/848 (62%), Gaps = 67/848 (7%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
LC FG+L +V+YD +++IINGHR+IL SGSIHYPRST +MWP LI
Sbjct: 15 FLCFFGVLSVQA-----------SVSYDSKAIIINGHRRILISGSIHYPRSTSEMWPDLI 63
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAKEGGLDV++T VFWN HEP+PG++ F G DLVRF+K V GLYV LRIGP++ E
Sbjct: 64 QKAKEGGLDVIETYVFWNGHEPEPGKYYFEGNYDLVRFVKLVHQAGLYVHLRIGPYVCAE 123
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL +PGI FR+DN PFKF M+R+ IVNMMKA RLY SQGGPIILSQIEN
Sbjct: 124 WNFGGFPVWLKYIPGISFRTDNAPFKFQMERFTRKIVNMMKAERLYESQGGPIILSQIEN 183
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
EYG +E+ G Y +WAA++A+ L TGVPWVMCKQDDAPDP+IN CNG C +
Sbjct: 184 EYGPMEYELGAPGKAYSKWAAQMALGLGTGVPWVMCKQDDAPDPIINTCNGFYC--DYFS 241
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTE WT ++ +G R AED+A+ VA FI K G+ +NYYMYHGGTNF
Sbjct: 242 PNKAYKPKMWTEAWTGWFTQFGGAVPHRPAEDMAFAVARFIQK-GGALINYYMYHGGTNF 300
Query: 306 GRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA ++ T Y AP+DEYGLLRQPKWGHLK+L+ A+KLC ++SG +
Sbjct: 301 GRTAGGPFIATSYDYDAPIDEYGLLRQPKWGHLKDLNRAIKLCEPALVSGDPIVTRLGNY 360
Query: 365 QEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS 423
QEA +F+ S CAAFL N + R+ ATV F N+ Y +PP SISILPDCK FNTA++ +
Sbjct: 361 QEAHVFKSKSGACAAFLSNYNPRSYATVAFGNMHYNIPPWSISILPDCKNTVFNTARVGA 420
Query: 424 VE--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKH 469
W+ Y E +Y+E + LLEQ+NTT+DA+DYLWY
Sbjct: 421 QTAIMKMSPVPMHESFSWQAYNEEPASYNEKAFTTVGLLEQINTTRDATDYLWYTTDVHI 480
Query: 470 DP------SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNV 523
D S VL V S GH +H F+NG+ G+A+G T + V+L G N +
Sbjct: 481 DANEGFLRSGKYPVLTVLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLRAGNNKI 540
Query: 524 SLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTD 581
+LLS+ VGLP+ G + E AG L V++ G E +D + W Y++GL GE + + +
Sbjct: 541 ALLSIAVGLPNVGPHFEMWNAGILGPVNLNGLDEGRRDLTWQKWTYKIGLDGEAMSLHSL 600
Query: 582 YGSRIVPWSRYGS--STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW 639
GS V W + GS + QPLTW+KT F+AP G+ P+A+++ SMGKG+ W+NGQS+GRYW
Sbjct: 601 SGSSSVEWIQ-GSLVAQKQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQSLGRYW 659
Query: 640 VSFLTPQ--------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
++ + G SQ WYH+PRS+L PTGNLLV+ EE G P
Sbjct: 660 PAYKSTGSCGSCDYTGTYNEKKCSSNCGEASQRWYHVPRSWLNPTGNLLVVFEEWGGDPN 719
Query: 680 GISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISK 739
GI + V ++C ++++ P +++W Q Q + K +K + RPK + C G+KIS
Sbjct: 720 GIHLVRRDVDSVCVNINEWQ-PTLMNW--QMQSSGKVNKPL---RPKAHLSCGPGQKISS 773
Query: 740 ILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKA 799
+ FAS+G P G C ++ GSCH+ +S ++ C+G+ CTV V E F GDPCP + K
Sbjct: 774 VKFASFGTPEGECGSFREGSCHAHHSYDAFQRTCVGQNFCTVTVAPEMFGGDPCPNVMKK 833
Query: 800 LLVDAQCT 807
L V+ C+
Sbjct: 834 LSVEVICS 841
>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
[Brachypodium distachyon]
Length = 852
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/842 (46%), Positives = 520/842 (61%), Gaps = 68/842 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
G NVTYD R+L+I+G R++L SGSIHYPRSTP MWP L+ KAK+GGLDVV+T VFW
Sbjct: 22 GASSATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFW 81
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
++HE Q+DF GR+DLVRF+K GLYV LRIGP++ EW YGG P WLH +PGI
Sbjct: 82 DIHETATXQYDFEGRKDLVRFVKAAADTGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIK 141
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FR+DNEPFK M+R+ +V MK A LYASQGGPIILSQIENEYG ++ ++ G Y+
Sbjct: 142 FRTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKSYI 201
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + F PNS KP +WTENW+ +
Sbjct: 202 RWAAGMAVALDTGVPWVMCQQADAPDPLINTCNGFYC-DQFT-PNSNSKPKLWTENWSGW 259
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QA 321
+ +G R ED+A+ VA F + G+ NYYMYHGGTNFGR++ ++ YD A
Sbjct: 260 FLSFGGAVPYRPTEDLAFAVARFYQR-GGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDA 318
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
P+DEYGL+RQPKWGHLK++H A+K C +++ M+ + EA +++ S CAAFL
Sbjct: 319 PIDEYGLVRQPKWGHLKDVHKAIKQCEPALIATDPSYMSMGQNAEAHVYKAGSVCAAFLA 378
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------------------ 423
N D +++ TV F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 379 NMDTQSDKTVTFNGNAYKLPAWSVSILPDCKNVVLNTAQINSQTTTSEMRSLGSSTKASD 438
Query: 424 ---------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDP 471
+ W E + E +L L+EQ+NTT DASD+LWY+ +P
Sbjct: 439 GSSIETELALSGWSYAIEPVGITTENALTKPGLMEQINTTADASDFLWYSTSVVVKGGEP 498
Query: 472 --SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVM 529
+ S+S L V+SLGHVL A+ING+F GSA G + +L+ + L+ G N + LLS
Sbjct: 499 YLNGSQSNLLVNSLGHVLQAYINGKFAGSAKGSATSSLISLQTPITLVPGKNKIDLLSGT 558
Query: 530 VGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVP 588
VGL + GA+ + AG+ V + G K + D SS W YQVGL GE L ++ +
Sbjct: 559 VGLSNYGAFFDLVGAGITGPVKLSGPKGVLDLSSTDWTYQVGLRGEGLHLYNPSEASPEW 618
Query: 589 WSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-- 646
S T+QPL WYK+ F P G DPVAI+ MGKGEAWVNGQSIGRYW + L PQ
Sbjct: 619 VSDKAYPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSG 678
Query: 647 --------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTV 686
G PSQ+ YH+PRSFL+P N +VL E+ G P IS T
Sbjct: 679 CVNSCNYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFTTK 738
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASY 745
++C HVS+ H + SW S Q+ ++ P +++ CP +G+ IS I FAS+
Sbjct: 739 QTASVCAHVSEDHPDQIDSWISPQQKVQRSG-------PALRLECPKAGQVISSIKFASF 791
Query: 746 GNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQ 805
G P+G C NY G C S + A+ ++AC+G SC+VPV T+ F GDPC G+ K+L+V+A
Sbjct: 792 GTPSGTCGNYNHGECSSPQALAVAQEACIGVSSCSVPVSTKNF-GDPCTGVTKSLVVEAA 850
Query: 806 CT 807
C+
Sbjct: 851 CS 852
>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 853
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/845 (47%), Positives = 530/845 (62%), Gaps = 64/845 (7%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
CL G L+ +G G VTYD ++L+ING R+ILFSGSIHYPRSTP MW LI
Sbjct: 17 FCL-GFLILGVGFVQCG------VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQ 69
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KAK+GG+DV++T VFWNLHEP PG++DF GR DLVRF+K + GLY LRIGP++ EW
Sbjct: 70 KAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEW 129
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
+GG P WL VPGI FR+DNEPFK MK + IV +MK+ L+ SQGGPIILSQIENE
Sbjct: 130 NFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENE 189
Query: 187 YGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP 246
YG +G Y+ WAAK+A+ +TGVPWVMCK+DDAPDPVIN CNG C ++FA P
Sbjct: 190 YGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC-DSFA-P 247
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
N P KP IWTE W+ ++ +G R +D+A+ VA FI K GS+VNYYMYHGGTNFG
Sbjct: 248 NKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQK-GGSFVNYYMYHGGTNFG 306
Query: 307 RTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
RTA +T YD AP+DEYGL+RQPK+GHLKELH A+K+C K ++S V + Q
Sbjct: 307 RTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQ 366
Query: 366 EAFIF---------QGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAF 416
+ +I+ S +C+AFL N D + A V F+N+ Y LPP SISILPDC+ F
Sbjct: 367 QVWIYYERFAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVF 426
Query: 417 NTAKLDSVEQWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE 475
NTAK+ + QWE Y E + + D++S + LLEQ+N T+D SDYLWY D DSE
Sbjct: 427 NTAKVSNF-QWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSV--DIGDSE 483
Query: 476 SVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLS 527
S L + S GH +H F+NG+ GSA G ++ FT + ++L +GTN ++LLS
Sbjct: 484 SFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLS 543
Query: 528 VMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSR 585
V VGLP+ G + E G L V++ G + K D S W YQVGL GE + + +
Sbjct: 544 VAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTP 603
Query: 586 IVPWSRYGSSTH--QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
+ W + QPLTW+KT FDAP G++P+A+++ MGKG+ WVNG+SIGRYW +F
Sbjct: 604 SIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFA 663
Query: 644 TPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
T G P+Q WYH+PR++LKP+ NLLV+ EE G P +S+
Sbjct: 664 TGDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLV 723
Query: 685 TVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFAS 744
SV+ +C VS+ H P + +W+ ++ +T RPKV ++C G+ I+ I FAS
Sbjct: 724 KRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTF-----HRPKVHLKCSPGQAIASIKFAS 777
Query: 745 YGNPNGNCENYAIGSCHSSNSRAIVEK---ACLGKRSCTVPVWTEKFYGDPCPGIPKALL 801
+G P G C +Y G CH++ S AI+E+ C+GK C V + F DPCP + K L
Sbjct: 778 FGTPLGTCGSYQQGECHAATSYAILERYMQKCVGKARCAVTISNSNFGKDPCPNVLKRLT 837
Query: 802 VDAQC 806
V+A C
Sbjct: 838 VEAVC 842
>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
Length = 855
Score = 758 bits (1957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/847 (47%), Positives = 531/847 (62%), Gaps = 66/847 (7%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
CL G L+ +G G VTYD ++L+ING R+ILFSGSIHYPRSTP MW LI
Sbjct: 17 FCL-GFLILGVGFVQCG------VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQ 69
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KAK+GG+DV++T VFWNLHEP PG++DF GR DLVRF+K + GLY LRIGP++ EW
Sbjct: 70 KAKDGGIDVIETYVFWNLHEPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEW 129
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
+GG P WL VPGI FR+DNEPFK MK + IV +MK+ L+ SQGGPIILSQIENE
Sbjct: 130 NFGGFPVWLKYVPGISFRTDNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENE 189
Query: 187 YGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP 246
YG +G Y+ WAAK+A+ +TGVPWVMCK+DDAPDPVIN CNG C ++FA P
Sbjct: 190 YGRQGQLLGAEGHNYMTWAAKMAIATETGVPWVMCKEDDAPDPVINTCNGFYC-DSFA-P 247
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
N P KP IWTE W+ ++ +G R +D+A+ VA FI K GS+VNYYMYHGGTNFG
Sbjct: 248 NKPYKPLIWTEAWSGWFTEFGGPMHHRPVQDLAFGVARFIQK-GGSFVNYYMYHGGTNFG 306
Query: 307 RTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
RTA +T YD AP+DEYGL+RQPK+GHLKELH A+K+C K ++S V + Q
Sbjct: 307 RTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQ 366
Query: 366 EAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--- 421
+A ++ S +C+AFL N D + A V F+N+ Y LPP SISILPDC+ FNTAK+
Sbjct: 367 QAHVYSAESGDCSAFLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQ 426
Query: 422 -DSVE---------QWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+E QWE Y E + + D++S + LLEQ+N T+D SDYLWY D
Sbjct: 427 TSQMEMLPTDTKNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSV--D 484
Query: 471 PSDSESVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNN 522
DSES L + S GH +H F+NG+ GSA G ++ FT + ++L +GTN
Sbjct: 485 IGDSESFLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNR 544
Query: 523 VSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFT 580
++LLSV VGLP+ G + E G L V++ G + K D S W YQVGL GE + +
Sbjct: 545 IALLSVAVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAF 604
Query: 581 DYGSRIVPWSRYGSSTH--QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY 638
+ + W + QPLTW+KT FDAP G++P+A+++ MGKG+ WVNG+SIGRY
Sbjct: 605 PTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRY 664
Query: 639 WVSFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
W +F T G P+Q WYH+PR++LKP+ NLLV+ EE G P
Sbjct: 665 WTAFATGDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPS 724
Query: 680 GISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISK 739
+S+ SV+ +C VS+ H P + +W+ ++ +T RPKV ++C G+ I+
Sbjct: 725 TVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTF-----HRPKVHLKCSPGQAIAS 778
Query: 740 ILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKA 799
I FAS+G P G C +Y G CH++ S AI+E+ C+GK C V + F DPCP + K
Sbjct: 779 IKFASFGTPLGTCGSYQQGECHAATSYAILER-CVGKARCAVTISNSNFGKDPCPNVLKR 837
Query: 800 LLVDAQC 806
L V+A C
Sbjct: 838 LTVEAVC 844
>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
Length = 854
Score = 758 bits (1956), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/825 (47%), Positives = 525/825 (63%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+IL SGSIHYPRSTP MW LI KAK+GGLDV+ T +FWN+HEP
Sbjct: 28 SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR++NE
Sbjct: 88 PGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRTNNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV+MMK+ L+ASQGGPIILSQIENEYG G Y+ WAAK+
Sbjct: 148 PFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWAAKM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK+DDAPDPVINACNG C + F+ PN P KP IWTE W+ ++ +G
Sbjct: 208 AVGLDTGVPWVMCKEDDAPDPVINACNGFYC-DAFS-PNKPYKPRIWTEAWSGWFTEFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI + GS+VNYYMYHGGTNFGR+A +T YD AP+DEYG
Sbjct: 266 TIHRRPVQDLAFGVARFI-QNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKR 386
L+RQPK+GHLKELH A+KLC ++S ++ Q+A +F G CAAFL N + +
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
++A V F+N+ Y+LP SISILPDC+TV FNTA++ + WE Y E
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFPTNSKLHSWETYGED 444
Query: 434 IPTYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHV 486
I + + ++ A LLEQ+N T+D++DYLWY D S+S L V S GH
Sbjct: 445 ISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTPTLTVQSKGHA 504
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
+H FING++ GSA+G ++ FT +L GTN ++LLS+ VGLP+ G + E G
Sbjct: 505 VHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVGLHFETWKTGI 564
Query: 546 LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLTW 602
L V + G + K D S W YQVGL GE + + + G V W R + QPL W
Sbjct: 565 LGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSLAAQGQQPLKW 624
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
YK F+AP G +P+A+++ SMGKG+ W+NGQSIGRYW+++
Sbjct: 625 YKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAYAKGDCNVCSYSGTYRPPKCQ 684
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q WYH+PRS+LKPT NLL++ EE G I++ ++ ++C ++ H P +
Sbjct: 685 HGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKSVCADANEHH-PTL 743
Query: 704 ISWRSQN-QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
+W +++ + + H+ V ++C G+ IS I+FAS+G P+G C ++ G+CH+
Sbjct: 744 ENWHTESPSESEELHQ------ASVHLQCAPGQSISTIMFASFGTPSGTCGSFQKGTCHA 797
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS+AI+EK C+G+ C+VP+ F DPCP + K L V+A C+
Sbjct: 798 PNSQAILEKNCIGQEKCSVPISNSYFGADPCPNVLKRLSVEAACS 842
>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
Length = 854
Score = 757 bits (1955), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/825 (47%), Positives = 525/825 (63%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+IL SGSIHYPRSTP MW LI KAK+GGLDV+ T +FWN+HEP
Sbjct: 28 SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR++NE
Sbjct: 88 PGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRTNNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV+MMK+ L+ASQGGPIILSQIENEYG G Y+ WAAK+
Sbjct: 148 PFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWAAKM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK+DDAPDPVINACNG C + F+ PN P KP IWTE W+ ++ +G
Sbjct: 208 AVGLDTGVPWVMCKEDDAPDPVINACNGFYC-DAFS-PNKPYKPRIWTEAWSGWFTEFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI + GS+VNYYMYHGGTNFGR+A +T YD AP+DEYG
Sbjct: 266 TIHRRPVQDLAFGVARFI-QNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKR 386
L+RQPK+GHLKELH A+KLC ++S ++ Q+A +F G CAAFL N + +
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
++A V F+N+ Y+LP SISILPDC+TV FNTA++ + WE Y E
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFPTNSKLHSWETYGED 444
Query: 434 IPTYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHV 486
I + + ++ A LLEQ+N T+D++DYLWY D S+S L V S GH
Sbjct: 445 ISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTPTLTVQSKGHA 504
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
+H FING++ GSA+G ++ FT +L GTN ++LLS+ VGLP+ G + E G
Sbjct: 505 VHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVGLHFETWKTGI 564
Query: 546 LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLTW 602
L V + G + K D S W YQVGL GE + + + G V W R + QPL W
Sbjct: 565 LGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSLAAQGQQPLKW 624
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
YK F+AP G +P+A+++ SMGKG+ W+NGQSIGRYW+++
Sbjct: 625 YKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAYAKGDCNVCSYSGTYRPPKCQ 684
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q WYH+PRS+LKPT NLL++ EE G I++ ++ ++C ++ H P +
Sbjct: 685 HGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKSVCADANEHH-PTL 743
Query: 704 ISWRSQN-QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
+W +++ + + H+ V ++C G+ IS I+FAS+G P+G C ++ G+CH+
Sbjct: 744 ENWHTESPSESEELHZ------ASVHLQCAPGQSISTIMFASFGTPSGTCGSFQKGTCHA 797
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS+AI+EK C+G+ C+VP+ F DPCP + K L V+A C+
Sbjct: 798 PNSQAILEKNCIGQEKCSVPISNSYFGADPCPNVLKRLSVEAACS 842
>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
Length = 854
Score = 757 bits (1955), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/825 (47%), Positives = 525/825 (63%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+IL SGSIHYPRSTP MW LI KAK+GGLDV+ T +FWN+HEP
Sbjct: 28 SVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR++NE
Sbjct: 88 PGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRTNNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV+MMK+ L+ASQGGPIILSQIENEYG G Y+ WAAK+
Sbjct: 148 PFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWAAKM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK+DDAPDPVINACNG C + F+ PN P KP IWTE W+ ++ +G
Sbjct: 208 AVGLDTGVPWVMCKEDDAPDPVINACNGFYC-DAFS-PNKPYKPRIWTEAWSGWFTEFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI + GS+VNYYMYHGGTNFGR+A +T YD AP+DEYG
Sbjct: 266 TIHRRPVQDLAFGVARFI-QNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKR 386
L+RQPK+GHLKELH A+KLC ++S ++ Q+A +F G CAAFL N + +
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
++A V F+N+ Y+LP SISILPDC+TV FNTA++ + WE Y E
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARVGVQTSHMRMFPTNSKLHSWETYGED 444
Query: 434 IPTYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHV 486
I + + ++ A LLEQ+N T+D++DYLWY D S+S L V S GH
Sbjct: 445 ISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTPTLTVQSKGHA 504
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
+H FING++ GSA+G ++ FT +L GTN ++LLS+ VGLP+ G + E G
Sbjct: 505 VHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVGLHFETWKTGI 564
Query: 546 LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLTW 602
L V + G + K D S W YQVGL GE + + + G V W R + QPL W
Sbjct: 565 LGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSLAAQGQQPLKW 624
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
YK F+AP G +P+A+++ SMGKG+ W+NGQSIGRYW+++
Sbjct: 625 YKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAYAKGDCNVCSYSGTYRPPKCQ 684
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q WYH+PRS+LKPT NLL++ EE G I++ ++ ++C ++ H P +
Sbjct: 685 HGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKSVCADANEHH-PTL 743
Query: 704 ISWRSQN-QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
+W +++ + + H+ V ++C G+ IS I+FAS+G P+G C ++ G+CH+
Sbjct: 744 ENWHTESPSESEELHE------ASVHLQCAPGQSISTIMFASFGTPSGTCGSFQKGTCHA 797
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS+AI+EK C+G+ C+VP+ F DPCP + K L V+A C+
Sbjct: 798 PNSQAILEKNCIGQEKCSVPISNSYFGADPCPNVLKRLSVEAACS 842
>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 840
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 414/853 (48%), Positives = 530/853 (62%), Gaps = 68/853 (7%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M LL +F L+ G +V+YD +++ ING R+IL SGSIHYPRSTP+M
Sbjct: 10 MWNVALLLVFSLI----------GSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEM 59
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI KAK+GGLDV+QT VFWN HEP PG++ F G DLV+FIK VQ GLYV LRIGP
Sbjct: 60 WPDLIQKAKDGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGP 119
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
++ EW +GG P WL +PGI FR+DNEPFK M+++ T IV++MKA RLY SQGGPII+
Sbjct: 120 YVCAEWNFGGFPVWLKYIPGISFRTDNEPFKHQMQKFTTKIVDLMKAERLYESQGGPIIM 179
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG +E+ G Y +WAA++A+ L TGVPWVMCKQDD PDP+IN CNG C
Sbjct: 180 SQIENEYGPMEYEIGAAGKAYTKWAAEMAMGLGTGVPWVMCKQDDTPDPLINTCNGFYC- 238
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ PN KP +WTE WT ++ +G R AED+A+ VA FI K GS++NYYMYH
Sbjct: 239 -DYFSPNKAYKPKMWTEAWTGWFTEFGGPVPHRPAEDLAFSVARFIQK-GGSFINYYMYH 296
Query: 301 GGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA ++ T Y APLDEYGLLRQPKWGHLK+LH A+KLC ++SG
Sbjct: 297 GGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVT 356
Query: 360 NFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
QEA +F+ S CAAFL N + ++ ATV F N+ Y LPP SISILPDCK +NT
Sbjct: 357 KIGNYQEAHVFKSKSGACAAFLANYNPKSYATVAFGNMHYNLPPWSISILPDCKNTVYNT 416
Query: 419 AKLDSVE--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
A++ S W + E T D++S LLEQ+NTT+D SDYLWY+
Sbjct: 417 ARVGSQSAQMKMTRVPIHGGFSWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYS 476
Query: 465 FRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLIN 518
DP++ + VL V S GH LH FING+ G+A+G T + V L
Sbjct: 477 TDVVLDPNEGFLRNGKDPVLTVFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRA 536
Query: 519 GTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKL 576
G N +SLLSV VGLP+ G + E AG L +S+ G E +D S W Y+VGL GE L
Sbjct: 537 GVNKISLLSVAVGLPNVGPHFETWNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGEIL 596
Query: 577 QIFTDYGSRIVPWSRYGS--STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQS 634
+ + GS V W + GS S QPLTWYKT FDAP G+ P+A+++ SMGKG+ W+NGQ+
Sbjct: 597 SLHSLSGSSSVEWIQ-GSLVSQRQPLTWYKTTFDAPAGTAPLALDMDSMGKGQVWLNGQN 655
Query: 635 IGRYWVSFLTPQ--------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
+GRYW ++ G SQ WYH+P+S+LKPTGNLLV+ EE
Sbjct: 656 LGRYWPAYKASGTCDYCDYAGTYNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEEL 715
Query: 675 NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSG 734
G P GI + + ++C + + P +IS++ Q T + P RPKV + C G
Sbjct: 716 GGDPNGIFLVRRDIDSVCADIYEWQ-PNLISYQMQ------TSGKAP-VRPKVHLSCSPG 767
Query: 735 RKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCP 794
+KIS I FAS+G P G+C N+ GSCH+ S E+ C+G+ CTV V E F GDPCP
Sbjct: 768 QKISSIKFASFGTPAGSCGNFHEGSCHAHKSYDAFERNCVGQNWCTVTVSPENFGGDPCP 827
Query: 795 GIPKALLVDAQCT 807
+ K L V+A C+
Sbjct: 828 NVLKKLSVEAICS 840
>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
Length = 845
Score = 757 bits (1954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/822 (49%), Positives = 520/822 (63%), Gaps = 56/822 (6%)
Query: 32 YDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQ 91
YD +++ ING R+IL SGSIHYPRS+P+MWP LI KAKEGGLDV+QT VFWN HEP PG+
Sbjct: 34 YDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGK 93
Query: 92 FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFK 151
+ F G DLV+FIK V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DN PFK
Sbjct: 94 YYFEGNYDLVKFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGINFRTDNGPFK 153
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVD 211
M+R+ T IVNMMKA RL+ SQGGPIILSQIENEYG +E+ G Y +WAAK+AV
Sbjct: 154 AQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGQAYSKWAAKMAVG 213
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEAR 271
L TGVPWVMCKQDDAPDPVIN CNG C + PN P KP +WTE WT ++ +G
Sbjct: 214 LGTGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNKPYKPKMWTEAWTGWFTEFGGAVP 271
Query: 272 IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLR 330
R AED+A+ VA FI K G+++NYYMYHGGTNFGRTA ++ T Y APLDEYGLLR
Sbjct: 272 YRPAEDLAFSVARFIQK-GGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLR 330
Query: 331 QPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNA 389
QPKWGHLK+LH A+KLC ++SG M QEA +F+ S CAAFL N ++R+ A
Sbjct: 331 QPKWGHLKDLHRAIKLCEPALVSGAPSVMPLGNYQEAHVFKSKSGACAAFLANYNQRSFA 390
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKEAIP 435
V F N+ Y LPP SISILPDCK +NTA++ + W+ Y E
Sbjct: 391 KVSFGNMHYNLPPWSISILPDCKNTVYNTARIGAQSARMKMSPIPMRGGFSWQAYSEEAS 450
Query: 436 TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLHA 489
T + + LLEQ+NTT+D SDYLWY+ + D S VL V S GH LH
Sbjct: 451 TEGDNTFMMVGLLEQINTTRDVSDYLWYSTDVRIDSNEGFLRSGKYPVLTVLSAGHALHV 510
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+A+G T + V + G N + LLS+ VGLP+ G + E AG L
Sbjct: 511 FVNGQLSGTAYGSLESPKLTFSQGVKMRAGINRIYLLSIAVGLPNVGPHFETWNAGVLGP 570
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V++ G E +D S W Y++GL GE L + + GS V W++ GS S QPL WYKT
Sbjct: 571 VTLNGLNEGRRDLSWQKWTYKIGLHGEALSLHSLSGSSSVEWAQ-GSFVSRKQPLMWYKT 629
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTP 645
F+AP G+ P+A+++ SMGKG+ W+NGQS+GRYW ++ LT
Sbjct: 630 TFNAPAGNSPLALDMGSMGKGQVWINGQSVGRYWPAYKASGNCGVCNYAGTFNEKKCLTN 689
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G SQ WYH+PRS+L GNLLV+ EE G P GIS+ V ++C + + P +++
Sbjct: 690 CGEASQRWYHVPRSWLNTAGNLLVVFEEWGGDPNGISLVRREVDSVCADIYEWQ-PTLMN 748
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
+ Q+ + K +K + RPKV ++C +G+KIS I FAS+G P G C +Y GSCH+ +S
Sbjct: 749 YMMQS--SGKVNKPL---RPKVHLQCGAGQKISLIKFASFGTPEGVCGSYRQGSCHAFHS 803
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ C+G+ C+V V E F GDPCP + K L V+A C+
Sbjct: 804 YDAFNRLCVGQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVCS 845
>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
Length = 842
Score = 756 bits (1953), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/840 (48%), Positives = 525/840 (62%), Gaps = 82/840 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWN HEP
Sbjct: 24 NVTYDHRALLIDGKRRVLISGSIHYPRSTPEMWPGLIQKSKDGGLDVIETYVFWNGHEPV 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q++F GR DLV+F+K V GLYV +RIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 84 RNQYNFEGRYDLVKFVKLVAEAGLYVHIRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ IV+MMK +LYASQGGPIILSQIENEYG ++ +F Y+ WAA +
Sbjct: 144 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAFGPAAKTYINWAAGM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPWVMC+Q DAPDPVIN CNG C + PNS +KP +WTENW+ ++Q +G
Sbjct: 204 AISLDTGVPWVMCQQADAPDPVINTCNGFYCDQ--FTPNSKNKPKMWTENWSGWFQSFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+A+ VA F ++ G++ NYYMYHGGTNFGRT ++ YD APLDEYG
Sbjct: 262 AVPYRPVEDLAFAVARFY-QLSGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
LLRQPKWGHLK++H A+KLC + +++ + + EA +++ S CAAFL N
Sbjct: 321 LLRQPKWGHLKDVHKAIKLCEEALIATDPTTTSLGSNLEATVYKTGSLCAAFLANI-ATT 379
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV----------------------E 425
+ TV F+ Y LP S+SILPDCK VA NTAK++SV
Sbjct: 380 DKTVTFNGNSYNLPAWSVSILPDCKNVALNTAKINSVTIVPSFARQSLVGDVDSSKAIGS 439
Query: 426 QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNF--RFKHD----PSDSESVLK 479
W E + + + LLEQ+NTT D SDYLWY+ K D S++VL
Sbjct: 440 GWSWINEPVGISKNDAFVKSGLLEQINTTADKSDYLWYSLSTNIKGDEPFLEDGSQTVLH 499
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V SLGH LHAFING+ GS GK S+ T++ + L G N + LLS+ VGL + GA+
Sbjct: 500 VESLGHALHAFINGKLAGSGTGKSSNAKVTVDIPITLTPGKNTIDLLSLTVGLQNYGAFY 559
Query: 540 ERRVAGLRNVSIQGAKELK-------DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
E AG I G +LK D SS W YQ+GL GE I + S V S+
Sbjct: 560 ELTGAG-----ITGPVKLKAQNGNTVDLSSQQWTYQIGLKGEDSGISSGSSSEWV--SQP 612
Query: 593 GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------ 646
+QPL WYKT FDAP G+DPVAI+ MGKGEAWVNGQSIGRYW + ++P
Sbjct: 613 TLPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTNVSPSSGCADS 672
Query: 647 ----------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
G PSQ++YHIPRS++K +GN+LVLLEE G P I+ T V +
Sbjct: 673 CNYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSSGNILVLLEEIGGDPTQIAFATRQVGS 732
Query: 691 LCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCPSGRK-ISKILFASYGN 747
LC HVS+SH PV W + ++ G+R P + ++CP K IS I FAS+G
Sbjct: 733 LCSHVSESHPQPVDMWNTDSEG---------GKRSGPVLSLQCPHPDKVISSIKFASFGT 783
Query: 748 PNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
P+G+C +Y+ G C S+++ +IV+KAC+G +SC V V F GDPC G+ K+L V+A CT
Sbjct: 784 PHGSCGSYSHGKCSSTSALSIVQKACVGSKSCNVGVSINTF-GDPCRGVKKSLAVEASCT 842
>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
Length = 853
Score = 755 bits (1950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/828 (49%), Positives = 517/828 (62%), Gaps = 60/828 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+ILFSGSIHYPRSTP MW LI KAKEGGLDV++T +FWN+HEP
Sbjct: 31 SVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYIFWNVHEPS 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G ++F GR DLVRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 91 RGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK+ RLY SQGGPIILSQIENEYG G YV WAAK+
Sbjct: 151 PFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGPAGQNYVNWAAKM 210
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV+ TGVPWVMCK+DDAPDPVIN CNG C + PN P KP+IWTE W+ ++ +G
Sbjct: 211 AVETGTGVPWVMCKEDDAPDPVINTCNGFYC--DYFTPNKPYKPSIWTEAWSGWFSEFGG 268
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI K GS+VNYYMYHGGTNFGRTA +T YD APLDEYG
Sbjct: 269 PNHERPVQDLAFGVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 327
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH A+K+C + ++S + Q+A ++ S +CAAFL N D +
Sbjct: 328 LIRQPKYGHLKELHKAIKMCERALVSADPAVTSMGNFQQAHVYTTKSGDCAAFLSNFDTK 387
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
++ V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE + E
Sbjct: 388 SSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMQMLPTNTHMFSWESFDED 447
Query: 434 IPTYDETS---LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSS 482
I + D+ S + + LLEQ+N T+D SDYLWY D SES L+ V S
Sbjct: 448 ISSLDDGSAITITTSGLLEQINVTRDTSDYLWYITSV--DIGSSESFLRGGKLPTLIVQS 505
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH +H FING+ GSA+G D+ F V+L GTN ++LLSV VGLP+ G + E
Sbjct: 506 TGHAVHVFINGQLSGSAYGTREDRRFRYTGTVNLRAGTNRIALLSVAVGLPNVGGHFETW 565
Query: 543 VAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQ 598
G L V ++G + K D S W YQVGL GE + + + G V W S S +Q
Sbjct: 566 NTGILGPVVLRGLNQGKLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQSALVSEKNQ 625
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLT 644
PLTW+KT FDAP G +P+A+++ MGKG+ W+NG SIGRYW +F
Sbjct: 626 PLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYWTAPAAGICNGCSYAGTFRP 685
Query: 645 PQ-----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
P+ G P+Q WYH+PRS+LKP NLLV+ EE G P IS+ SV+++C VS+ H
Sbjct: 686 PKCQVGCGQPTQRWYHVPRSWLKPNHNLLVVFEELGGDPSKISLVKRSVSSICADVSEYH 745
Query: 700 LPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGS 759
P + +W + K+ + P PKV + C + IS I FAS+G P G C NY G
Sbjct: 746 -PNIRNWHIDSYG--KSEEFHP---PKVHLHCSPSQAISSIKFASFGTPLGTCGNYEKGV 799
Query: 760 CHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
CHS S A +EK C+GK CTV V F DPCP + K L V+A C+
Sbjct: 800 CHSPTSYATLEKKCIGKPRCTVTVSNSNFGQDPCPNVLKRLSVEAVCS 847
>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 852
Score = 754 bits (1948), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/837 (47%), Positives = 532/837 (63%), Gaps = 73/837 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G RK+L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFW+ HEP+
Sbjct: 31 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
+++F GR DLV+F+K GLYV LRIGP++ EW YGG P WLH VPGI FR+DNE
Sbjct: 91 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ T IV++MK +LYASQGGPIILSQIENEYG ++ ++ Y++W+A +
Sbjct: 151 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 210
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW MC+Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +GD
Sbjct: 211 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R ED+A+ VA F + G++ NYYMYHGGTNF RT+ +++ YD AP+DEYG
Sbjct: 269 PSPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 327
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSG--VLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
LLRQPKWGHL++LH A+KLC +++ + S+ S L+ A S CAAFL N D
Sbjct: 328 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLG-SNLEAAVYKTESGSCAAFLANVDT 386
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV--------------------- 424
+++ATV F+ Y LP S+SILPDCK VAFNTAK++S
Sbjct: 387 KSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAEL 446
Query: 425 -EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHDPS----DSESV 477
QW KE I + LLEQ+NTT D SDYLWY+ R K D + S++V
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L + SLG V++AFING+ GS HGK + +L+ ++L+ GTN + LLSV VGL + GA
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHGK---QKISLDIPINLVTGTNTIDLLSVTVGLANYGA 563
Query: 538 YLERRVAGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
+ + AG+ V+++ AK D +S W YQVGL GE + T S V S+
Sbjct: 564 FFDLMGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWV--SKSPL 621
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------- 646
T QPL WYKT FDAP+GS+PVAI+ GKG AWVNGQSIGRYW + +
Sbjct: 622 PTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCD 681
Query: 647 --------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV-TTL 691
G PSQ+ YH+PRS+LKP+GN+LVL EE G P IS T + L
Sbjct: 682 YRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNL 741
Query: 692 CGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNG 750
C VS SH PPV +W S ++ + + RP + ++CP S + I I FAS+G P G
Sbjct: 742 CLTVSQSHPPPVDTWTSDSKISNRNRT-----RPVLSLKCPISTQVIFSIKFASFGTPKG 796
Query: 751 NCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C ++ G C+SS S ++V+KAC+G RSC V V T + +G+PC G+ K+L V+A C+
Sbjct: 797 TCGSFTQGHCNSSRSLSLVQKACIGLRSCNVEVST-RVFGEPCRGVVKSLAVEASCS 852
>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
Full=Protein AR782; Flags: Precursor
gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 852
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/837 (47%), Positives = 532/837 (63%), Gaps = 73/837 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G RK+L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFW+ HEP+
Sbjct: 31 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
+++F GR DLV+F+K GLYV LRIGP++ EW YGG P WLH VPGI FR+DNE
Sbjct: 91 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ T IV++MK +LYASQGGPIILSQIENEYG ++ ++ Y++W+A +
Sbjct: 151 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 210
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW MC+Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +GD
Sbjct: 211 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLGFGD 268
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R ED+A+ VA F + G++ NYYMYHGGTNF RT+ +++ YD AP+DEYG
Sbjct: 269 PSPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 327
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSG--VLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
LLRQPKWGHL++LH A+KLC +++ + S+ S L+ A S CAAFL N D
Sbjct: 328 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLG-SNLEAAVYKTESGSCAAFLANVDT 386
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV--------------------- 424
+++ATV F+ Y LP S+SILPDCK VAFNTAK++S
Sbjct: 387 KSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAEL 446
Query: 425 -EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHDPS----DSESV 477
QW KE I + LLEQ+NTT D SDYLWY+ R K D + S++V
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L + SLG V++AFING+ GS HGK + +L+ ++L+ GTN + LLSV VGL + GA
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHGK---QKISLDIPINLVTGTNTIDLLSVTVGLANYGA 563
Query: 538 YLERRVAGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
+ + AG+ V+++ AK D +S W YQVGL GE + T S V S+
Sbjct: 564 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWV--SKSPL 621
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------- 646
T QPL WYKT FDAP+GS+PVAI+ GKG AWVNGQSIGRYW + +
Sbjct: 622 PTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCD 681
Query: 647 --------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV-TTL 691
G PSQ+ YH+PRS+LKP+GN+LVL EE G P IS T + L
Sbjct: 682 YRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNL 741
Query: 692 CGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNG 750
C VS SH PPV +W S ++ + + RP + ++CP S + I I FAS+G P G
Sbjct: 742 CLTVSQSHPPPVDTWTSDSKISNRNRT-----RPVLSLKCPISTQVIFSIKFASFGTPKG 796
Query: 751 NCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C ++ G C+SS S ++V+KAC+G RSC V V T + +G+PC G+ K+L V+A C+
Sbjct: 797 TCGSFTQGHCNSSRSLSLVQKACIGLRSCNVEVST-RVFGEPCRGVVKSLAVEASCS 852
>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/826 (48%), Positives = 517/826 (62%), Gaps = 60/826 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R++LFSGSIHYPRSTP+MW LI KAKEGGLDVV+T VFWN+HEP
Sbjct: 28 SVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFIK +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 88 PGNYNFEGRYDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV +MK+ L+ SQGGPIILSQIENEYG+ F G Y+ WAAK+
Sbjct: 148 PFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWAAKM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN CNG C + F+ PN P KP +WTE W+ ++ +G
Sbjct: 208 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYC-DAFS-PNRPYKPTMWTEAWSGWFNEFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VALFI K GS++NYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 266 PIHQRPVQDLAFAVALFIQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH AVK+C K ++S + + Q+A+++ S CAAFL N D
Sbjct: 325 LIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTD 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE Y E
Sbjct: 385 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQLEMLPTNSPMLLWESYNED 444
Query: 434 IPTYDE-TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
+ D+ T++ A+ LLEQ+N TKD SDYLWY D +ES L V S G
Sbjct: 445 VSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSV--DIGSTESFLHGGELPTLIVQSTG 502
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING GSA G ++ FT V+ G N ++LLSV VGLP+ G + E
Sbjct: 503 HAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVGGHFETWNT 562
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPL 600
G L V++ G + K D S W Y+VGL GE + + + G V W + QPL
Sbjct: 563 GILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSLAAQAPQPL 622
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+K+ FDAP G +P+AI++ MGKG+ W+NG SIGRYW ++ T
Sbjct: 623 TWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAYATGNCDKCNYAGTFRPPK 682
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PR++LKP NLLV+ EE G P IS+ SVT +C VS+ H P
Sbjct: 683 CQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVTGVCADVSEYH-P 741
Query: 702 PVISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
+ +W ++ ++ H RPKV ++C +G I+ I FAS+G P G C +Y G+C
Sbjct: 742 TLKNWHIESYGKSEDLH------RPKVHLKCSAGYSITSIKFASFGTPLGTCGSYQQGTC 795
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
H+ S I+EK C+GK+ C V + F DPCP + K L V+ C
Sbjct: 796 HAPMSYDILEKRCIGKQRCAVTISNTNFGQDPCPNVLKRLSVEVVC 841
>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 841
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/830 (49%), Positives = 524/830 (63%), Gaps = 58/830 (6%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G +V+YD +++ ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN
Sbjct: 24 GSAKASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWN 83
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP PG++ F G DLV+FIK VQ GLYV LRIGP++ EW +GG P WL +PGI F
Sbjct: 84 GHEPSPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISF 143
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+DNEPFK M+++ T IV++MKA RLY SQGGPII+SQIENEYG +E+ G Y +
Sbjct: 144 RTDNEPFKVQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENEYGPMEYEIGAAGKAYTK 203
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WAA++A++L TGVPW+MCKQDD PDP+IN CNG C + PN KP +WTE WT ++
Sbjct: 204 WAAEMAMELGTGVPWIMCKQDDTPDPLINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWF 261
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAP 322
+G R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y AP
Sbjct: 262 TEFGGPVPHRPAEDLAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAP 320
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLV 381
LDEYGLLRQPKWGHLK+LH A+KLC ++SG QEA +F+ S CAAFL
Sbjct: 321 LDEYGLLRQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAHVFKSMSGACAAFLA 380
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QW 427
N + ++ ATV F N+ Y LPP SISILP+CK +NTA++ S W
Sbjct: 381 NYNPKSYATVAFGNMHYNLPPWSISILPNCKNTVYNTARVGSQSAQMKMTRVPIHGGLSW 440
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVS 481
+ E T D++S LLEQ+NTT+D SDYLWY+ DP++ + VL V
Sbjct: 441 LSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYSTDVVLDPNEGFLRNGKDPVLTVF 500
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S GH LH FING+ G+A+G T + V L G N +SLLSV VGLP+ G + E
Sbjct: 501 SAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRTGVNKISLLSVAVGLPNVGPHFET 560
Query: 542 RVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STH 597
AG L +S+ G E +D S W Y+VGL GE L + + GS V W + GS S
Sbjct: 561 WNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGETLSLHSLGGSSSVEWIQ-GSLVSQR 619
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------- 646
QPLTWYKT FDAP G+ P+A+++ SMGKG+ W+NGQ++GRYW ++
Sbjct: 620 QPLTWYKTTFDAPDGTAPLALDMNSMGKGQVWLNGQNLGRYWPAYKASGTCDYCDYAGTY 679
Query: 647 ---------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
G SQ WYH+P+S+LKPTGNLLV+ EE G GIS+ + ++C + +
Sbjct: 680 NENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEELGGDLNGISLVRRDIDSVCADIYE 739
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
P +IS++ Q T + P RPKV + C G+KIS I FAS+G P G+C N+
Sbjct: 740 WQ-PNLISYQMQ------TSGKAP-VRPKVHLSCSPGQKISSIKFASFGTPVGSCGNFHE 791
Query: 758 GSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
GSCH+ S E+ C+G+ CTV V E F GDPCP + K L V+A C+
Sbjct: 792 GSCHAHMSYDAFERNCVGQNLCTVAVSPENFGGDPCPNVLKKLSVEAICS 841
>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
Length = 846
Score = 754 bits (1947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/837 (47%), Positives = 532/837 (63%), Gaps = 73/837 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G RK+L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFW+ HEP+
Sbjct: 25 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
+++F GR DLV+F+K GLYV LRIGP++ EW YGG P WLH VPGI FR+DNE
Sbjct: 85 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ T IV++MK +LYASQGGPIILSQIENEYG ++ ++ Y++W+A +
Sbjct: 145 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW MC+Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +GD
Sbjct: 205 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R ED+A+ VA F + G++ NYYMYHGGTNF RT+ +++ YD AP+DEYG
Sbjct: 263 PSPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSG--VLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
LLRQPKWGHL++LH A+KLC +++ + S+ S L+ A S CAAFL N D
Sbjct: 322 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLG-SNLEAAVYKTESGSCAAFLANVDT 380
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV--------------------- 424
+++ATV F+ Y LP S+SILPDCK VAFNTAK++S
Sbjct: 381 KSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPDGGSSAEL 440
Query: 425 -EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHDPS----DSESV 477
QW KE I + LLEQ+NTT D SDYLWY+ R K D + S++V
Sbjct: 441 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 500
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L + SLG V++AFING+ GS HGK + +L+ ++L+ GTN + LLSV VGL + GA
Sbjct: 501 LHIESLGQVVYAFINGKLAGSGHGK---QKISLDIPINLVTGTNTIDLLSVTVGLANYGA 557
Query: 538 YLERRVAGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
+ + AG+ V+++ AK D +S W YQVGL GE + T S V S+
Sbjct: 558 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWV--SKSPL 615
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------- 646
T QPL WYKT FDAP+GS+PVAI+ GKG AWVNGQSIGRYW + +
Sbjct: 616 PTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCD 675
Query: 647 --------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV-TTL 691
G PSQ+ YH+PRS+LKP+GN+LVL EE G P IS T + L
Sbjct: 676 YRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNL 735
Query: 692 CGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNG 750
C VS SH PPV +W S ++ + + RP + ++CP S + I I FAS+G P G
Sbjct: 736 CLTVSQSHPPPVDTWTSDSKISNRNRT-----RPVLSLKCPISTQVIFSIKFASFGTPKG 790
Query: 751 NCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C ++ G C+SS S ++V+KAC+G RSC V V T + +G+PC G+ K+L V+A C+
Sbjct: 791 TCGSFTQGHCNSSRSLSLVQKACIGLRSCNVEVST-RVFGEPCRGVVKSLAVEASCS 846
>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
Length = 847
Score = 754 bits (1946), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/825 (47%), Positives = 519/825 (62%), Gaps = 59/825 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+ILFSGSIHYPRSTP MW LI KAK+GG+DV++T VFWN+HEP
Sbjct: 28 SVTYDRKAIMINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNVHEPT 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG + F GR D+VRF+K +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 88 PGNYHFEGRYDIVRFMKTIQRAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV +MKA L+ SQGGPIILSQIENEYG+ F G Y+ WAA +
Sbjct: 148 PFKRAMQGFTEKIVGLMKAENLFESQGGPIILSQIENEYGVQSKLFGAAGYNYMTWAANM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ TGVPWVMCK+DDAPDPVIN CNG C ++FA PN P KP IWTE W+ ++ +G
Sbjct: 208 AIQTGTGVPWVMCKEDDAPDPVINTCNGFYC-DSFA-PNKPYKPTIWTEAWSGWFSEFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI K GS++NYYM+HGGTNFGR+A +T YD AP+DEYG
Sbjct: 266 TIHQRPVQDLAFAVAKFIQK-GGSFINYYMFHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH ++K+C + ++S + Q+ ++ S +CAAFL N D +
Sbjct: 325 LIRQPKYGHLKELHRSIKMCERALVSVDPIVTQLGTYQQVHVYSTESGDCAAFLANYDTK 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQWEEYKEAI 434
+ A V F+N+ Y LPP SISILPDC+ V FNTAK+ + + WE Y E I
Sbjct: 385 SAARVLFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQMEMLPTNGIFSWESYDEDI 444
Query: 435 PTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGH 485
+ D++S LLEQ+N T+DASDYLWY D SES L + S GH
Sbjct: 445 SSLDDSSTFTTAGLLEQINVTRDASDYLWYMTSV--DIGSSESFLHGGELPTLIIQSTGH 502
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
+H FING+ GSA G ++ FT V+L GTN ++LLSV VGLP+ G + E G
Sbjct: 503 AVHIFINGQLSGSAFGTRENRRFTYTGKVNLRPGTNRIALLSVAVGLPNVGGHYESWNTG 562
Query: 546 LRN-VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPLT 601
+ V++ G + K D S W YQVGL GE + + + V W S + QPLT
Sbjct: 563 ILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLLSPDSVTSVEWMQSSLAAQRPQPLT 622
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
W+K F+AP G +P+A+++ MGKG+ W+NGQSIGRYW ++ +
Sbjct: 623 WHKAYFNAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAYASGNCNGCSYAGTFRPTKC 682
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
G P+Q WYH+PRS+LKPT NLLV+ EE G P IS+ S+ ++C VS+ H P
Sbjct: 683 QLGCGQPTQRWYHVPRSWLKPTNNLLVVFEELGGDPSRISLVKRSLASVCAEVSEFH-PT 741
Query: 703 VISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+ +W+ ++ R + H PKV +RC G+ I+ I FAS+G P G C +Y G+CH
Sbjct: 742 IKNWQIESYGRAEEFHS------PKVHLRCSGGQSITSIKFASFGTPLGTCGSYQQGACH 795
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+S S AI+EK C+GK+ C V + F DPCP + K L V+A C
Sbjct: 796 ASTSYAILEKKCIGKQRCAVTISNSNFGQDPCPNVMKKLSVEAVC 840
>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 839
Score = 753 bits (1945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/830 (48%), Positives = 532/830 (64%), Gaps = 66/830 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G RK+L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFW+ HEP+
Sbjct: 25 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHEPE 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
+++F GR DLV+F+K GLYV LRIGP++ EW YGG P WLH VPGI FR+DNE
Sbjct: 85 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ T IV++MK +LYASQGGPIILSQIENEYG ++ ++ Y++W+A +
Sbjct: 145 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSASM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW MC+Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +GD
Sbjct: 205 ALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLGFGD 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R ED+A+ VA F + G++ NYYMYHGGTNF RT+ +++ YD AP+DEYG
Sbjct: 263 PSPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSG--VLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
LLRQPKWGHL++LH A+KLC +++ + S+ S L+ A S CAAFL N D
Sbjct: 322 LLRQPKWGHLRDLHKAIKLCEDALIATDPTITSLG-SNLEAAVYKTESGSCAAFLANVDT 380
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVE---QWEEY 430
+++ATV F+ Y LP S+SILPDCK VAFNTAK+ S E QW
Sbjct: 381 KSDATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKVKFNSISKTPDGGSSAELGSQWSYI 440
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHDPS----DSESVLKVSSLG 484
KE I + LLEQ+NTT D SDYLWY+ R K D + S++VL + SLG
Sbjct: 441 KEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAVLHIESLG 500
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
V++AFING+ GS HGK + +L+ ++L+ GTN + LLSV VGL + GA+ + A
Sbjct: 501 QVVYAFINGKLAGSGHGK---QKISLDIPINLVTGTNTIDLLSVTVGLANYGAFFDLVGA 557
Query: 545 GLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
G+ V+++ AK D +S W YQVGL GE + T S V S+ T QPL
Sbjct: 558 GITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWV--SKSPLPTKQPLI 615
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
WYKT FDAP+GS+PVAI+ GKG AWVNGQSIGRYW + +
Sbjct: 616 WYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRA 675
Query: 647 -------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV-TTLCGHVSDS 698
G PSQ+ YH+PRS+LKP+GN+LVL EE G P IS T + LC VS S
Sbjct: 676 NKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQS 735
Query: 699 HLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGNCENYAI 757
H PPV +W S ++ + + RP + ++CP S + I I FAS+G P G C ++
Sbjct: 736 HPPPVDTWTSDSKISNRNRT-----RPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQ 790
Query: 758 GSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G C+SS S ++V+KAC+G RSC V V T + +G+PC G+ K+L V+A C+
Sbjct: 791 GHCNSSRSLSLVQKACIGLRSCNVEVST-RVFGEPCRGVVKSLAVEASCS 839
>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
Length = 842
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/822 (47%), Positives = 514/822 (62%), Gaps = 55/822 (6%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD ++++I+G R+ILFSGSIHYPRSTP MW LI KAK+GGLDV+QT VFWN HEP PG
Sbjct: 28 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 87
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
+ F R DLVRFIK VQ GL+V LRIGP+I GEW +GG P WL VPGI FR+DNEPF
Sbjct: 88 NYYFEERYDLVRFIKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 147
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + IV MMK+ +L+ASQGGPIILSQIENEYG G Y+ WAAK+A+
Sbjct: 148 KTAMQGFTEKIVGMMKSEKLFASQGGPIILSQIENEYGPEGKELGAAGQAYINWAAKMAI 207
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L TGVPWVMCK++DAPDPVINACNG C + F+ PN P KP +WTE W+ ++ +G
Sbjct: 208 GLGTGVPWVMCKEEDAPDPVINACNGFYC-DAFS-PNKPYKPTMWTEAWSGWFTEFGGTI 265
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLL 329
R R ED+A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD AP+DEYGL+
Sbjct: 266 RQRPVEDLAFAVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 324
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK HLKELH AVKLC + ++S +QEA +F+ S CAAFL N + + A
Sbjct: 325 REPKHSHLKELHRAVKLCEQALVSVDPAITTLGTMQEAHVFRSPSGCAAFLANYNSNSYA 384
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIPT 436
V F+N Y LPP SISILPDCK V FN+A + S WE Y E + +
Sbjct: 385 KVVFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTSQMQMWGDGASSMMWERYDEEVDS 444
Query: 437 YDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS-------ESVLKVSSLGHVLH 488
L LLEQ+N T+D+SDYLWY PS++ L V S GH LH
Sbjct: 445 LAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPLSLSVLSAGHALH 504
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
F+NGE GSA+G D+ +L GTN ++LLSV GLP+ G + E G+
Sbjct: 505 VFVNGELQGSAYGTREDRRIKYNGNANLRAGTNKIALLSVACGLPNVGVHYETWNTGVGG 564
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYK 604
V + G E +D + +W YQVGL GE++ + + GS V W + + QPL+WY+
Sbjct: 565 PVGLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSTSVEWMQGSLIAQNQQPLSWYR 624
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ---- 646
F+ P+G +P+A+++ SMGKG+ W+NGQSIGRYW +F P+
Sbjct: 625 AYFETPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYADGDCKECSYTGTFRAPKCQAG 684
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G P+Q WYH+PRS+L+PT NLLV+ EE G I++ SV+++C VS+ H P + +
Sbjct: 685 CGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALVKRSVSSVCADVSEDH-PNIKN 743
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
W+ ++ + H R KV +RC G+ IS I FAS+G P G C N+ G CHS+NS
Sbjct: 744 WQIESYGEREYH------RAKVHLRCSPGQSISAIKFASFGTPMGTCGNFQQGDCHSANS 797
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
++EK C+G + C V + E F GDPCP + K + V+A C+
Sbjct: 798 HTVLEKKCIGLQRCAVAISPESFGGDPCPRVTKRVAVEAVCS 839
>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
like [Medicago truncatula]
gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
Length = 841
Score = 753 bits (1944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/825 (49%), Positives = 516/825 (62%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++ ING +IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 27 SVSYDSKAITINGQSRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLV+FIK VQ GLYV LRIGP++ EW +GG P WL +PGI FR+DNE
Sbjct: 87 PGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFKF M+++ IV+MMKA RL+ SQGGPII+SQIENEYG +E+ G Y +WAA +
Sbjct: 147 PFKFQMQKFTEKIVDMMKADRLFESQGGPIIMSQIENEYGPMEYEIGAPGKSYTKWAADM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW+MCKQDDAPDPVIN CNG C + PN KP +WTE WT ++ +G
Sbjct: 207 AVGLGTGVPWIMCKQDDAPDPVINTCNGFYC--DYFSPNKDYKPKMWTEAWTGWFTEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 265 PVPHRPAEDMAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LL+QPKWGHLK+LH A+KL ++SG QEA +F+ S CAAFL N + +
Sbjct: 324 LLQQPKWGHLKDLHRAIKLSEPALISGDPTVTRIGNYQEAHVFKSKSGACAAFLGNYNPK 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKE 432
ATV F N+ Y LPP SISILPDCK +NTA++ S W+ + E
Sbjct: 384 AFATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQSAQMKMTRVPIHGGLSWQVFTE 443
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHV 486
+ D++S LLEQ+NTT+D +DYLWY+ DP S + VL V S GH
Sbjct: 444 QTASTDDSSFTMTGLLEQLNTTRDLTDYLWYSTDVVIDPNEGFLRSGKDPVLTVLSAGHA 503
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
LH FIN + G+ +G T + V LI G N +SLLSV VGLP+ G + E AG
Sbjct: 504 LHVFINSQLSGTIYGSLEFPKLTFSQNVKLIPGVNKISLLSVAVGLPNVGPHFETWNAGV 563
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTW 602
L +++ G E +D S W Y+VGL GE L + + GS V W + GS S QPLTW
Sbjct: 564 LGPITLNGLDEGRRDLSWQKWSYKVGLHGEALSLHSLGGSSSVEWVQ-GSLVSRMQPLTW 622
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
YKT FDAP G P A+++ SMGKG+ W+NGQ++GRYW ++
Sbjct: 623 YKTTFDAPDGIAPFALDMGSMGKGQVWLNGQNLGRYWPAYKASGTCDNCDYAGTYNENKC 682
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
G SQ WYH+P S+L PTGNLLV+ EE G P GI + + ++C + + P
Sbjct: 683 RSNCGEASQRWYHVPHSWLIPTGNLLVVFEELGGDPNGIFLVRRDIDSVCADIYEWQ-PN 741
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
+IS+ Q Q + KT+K + RPK + C G+KIS I FAS+G P G+C N+ GSCH+
Sbjct: 742 LISY--QMQTSGKTNKPV---RPKAHLSCGPGQKISSIKFASFGTPVGSCGNFHEGSCHA 796
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S EK C+G+ SC V V E F GDPCP + K L V+A CT
Sbjct: 797 HKSYNTFEKNCVGQNSCKVTVSPENFGGDPCPNVLKKLSVEAICT 841
>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
lyrata]
Length = 847
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/854 (47%), Positives = 519/854 (60%), Gaps = 67/854 (7%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M L L G L+ ++ GS V+YD R++ ING R+IL SGSIHYPRSTP+M
Sbjct: 14 MAAVSALFLLGFLVCSVSGS---------VSYDSRAITINGKRRILISGSIHYPRSTPEM 64
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI KAKEGGLDV+QT VFWN HEP PG++ F G DLVRF+K VQ GLY+ LRIGP
Sbjct: 65 WPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVRFVKLVQQSGLYLHLRIGP 124
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
++ EW +GG P WL +PGI FR+DN PFK M+R+ T IVNMMKA RL+ SQGGPIIL
Sbjct: 125 YVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIIL 184
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG +E+ G Y WAAK+AV L TGVPWVMCKQDDAPDP+INACNG C
Sbjct: 185 SQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYC- 243
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ PN KP +WTE WT ++ +G R AED+A+ VA FI K GS++NYYMYH
Sbjct: 244 -DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQK-GGSFINYYMYH 301
Query: 301 GGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA ++ T Y APLDEYGL RQPKWGHLK+LH A+KLC ++SG M
Sbjct: 302 GGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRM 361
Query: 360 NFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
QEA +++ S C+AFL N + ++ A V F + Y LPP SISILPDCK +NT
Sbjct: 362 PLGNYQEAHVYKAKSGACSAFLANYNPKSYAKVSFGSNHYNLPPWSISILPDCKNTVYNT 421
Query: 419 AKLDSVE--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
A++ + W+ Y E TY + S L+EQ+NTT+D SDYLWY
Sbjct: 422 ARVGAQTSRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYM 481
Query: 465 FRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLIN 518
K D ++ L V S GH +H FING+ GSA+G T K V+L
Sbjct: 482 TDVKIDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRA 541
Query: 519 GTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQG-AKELKDFSSFSWGYQVGLLGEKL 576
G N +++LS+ VGLP+ G + E AG L VS+ G + +D S W Y+VGL GE L
Sbjct: 542 GFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLSGGRRDLSWQKWTYKVGLKGESL 601
Query: 577 QIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
+ + GS V W+ + QPLTWYKT F AP G P+A+++ SMGKG+ W+NGQS+
Sbjct: 602 SLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSL 661
Query: 636 GRYWVSF--------------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
GR+W ++ L G SQ WYH+PRS+LKP+GNLLV+ EE
Sbjct: 662 GRHWPAYKAVGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWG 721
Query: 676 GYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ--NQRTLKTHKRIPGRRPKVQIRCPS 733
G P GIS+ V ++C + + W+S N + + K PKV ++C
Sbjct: 722 GDPNGISLVRREVDSVCADIYE--------WQSTLVNYQLHASGKVNKPLHPKVHLQCGP 773
Query: 734 GRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPC 793
G+KI+ + FAS+G P G C +Y GSCH +S K C+G+ C+V V E F GDPC
Sbjct: 774 GQKITTVKFASFGTPEGTCGSYRQGSCHDHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPC 833
Query: 794 PGIPKALLVDAQCT 807
P + K L V+A C
Sbjct: 834 PNVMKKLAVEAVCA 847
>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
Length = 845
Score = 753 bits (1943), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/823 (47%), Positives = 517/823 (62%), Gaps = 56/823 (6%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD ++++I+G R+ILFSGSIHYPRSTP MW LI KAK+GGLDV+QT VFWN HEP PG
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 89
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
+ F R DLVRF+K VQ GL+V LRIGP+I GEW +GG P WL VPGI FR+DNEPF
Sbjct: 90 NYYFEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + IV MMK+ L+ASQGGPIILSQIENEYG F G Y+ WAAK+AV
Sbjct: 150 KTAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAV 209
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L TGVPWVMCK++DAPDPVINACNG C + F+ PN P KP +WTE W+ ++ +G
Sbjct: 210 GLDTGVPWVMCKEEDAPDPVINACNGFYC-DAFS-PNKPYKPTMWTEAWSGWFTEFGGTI 267
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLL 329
R R ED+A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD AP+DEYGL+
Sbjct: 268 RQRPVEDLAFAVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLI 326
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK HLKELH AVKLC + ++S +QEA +F+ S CAAFL N + ++A
Sbjct: 327 REPKHSHLKELHRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSGCAAFLANYNSNSHA 386
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIPT 436
V F+N Y LPP SISILPDCK V FN+A + + WE Y E + +
Sbjct: 387 KVVFNNEQYSLPPWSISILPDCKNVVFNSATVGVQTSQMQMWGDGATSMMWERYDEEVDS 446
Query: 437 YDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS-------ESVLKVSSLGHVLH 488
L LLEQ+N T+D+SDYLWY PS++ L V S GH LH
Sbjct: 447 LAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALH 506
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
F+NG+ GS++G D+ V+L GTN ++LLSV GLP+ G + E G+
Sbjct: 507 VFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGG 566
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYK 604
V + G E +D + +W YQVGL GE++ + + GS V W + + QPL WYK
Sbjct: 567 PVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYK 626
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ---- 646
F+ P+G +P+A+++ SMGKG+ W+NGQSIGRYW +F P+
Sbjct: 627 AYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAYADGDCKGCSYTGTFRAPKCQAG 686
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEE-ENGYPPGISIDTVSVTTLCGHVSDSHLPPVI 704
G P+Q WYH+PRS+L+P+ NLLV+LEE G I++ SV+++C VS+ H P +
Sbjct: 687 CGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADVSEDH-PNIK 745
Query: 705 SWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
W+ ++++ RR KV +RC G+ IS I FAS+G P G C N+ G CHS++
Sbjct: 746 KWQ------IESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSAS 799
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S A++EK C+G + C V + + F GDPCP + K + V+A C+
Sbjct: 800 SHAVLEKRCIGLQRCVVAISPDNFGGDPCPSVTKRVAVEAVCS 842
>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
Length = 847
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/854 (47%), Positives = 518/854 (60%), Gaps = 67/854 (7%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M L L G L+ ++ GS V+YD R++ ING R+IL SGSIHYPRSTP+M
Sbjct: 14 MAAVSALFLLGFLVCSVSGS---------VSYDSRAITINGKRRILISGSIHYPRSTPEM 64
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI KAKEGGLDV+QT VFWN HEP PG++ F G DLV+F+K VQ GLY+ LRIGP
Sbjct: 65 WPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGP 124
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
++ EW +GG P WL +PGI FR+DN PFK M+R+ T IVNMMKA RL+ SQGGPIIL
Sbjct: 125 YVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIIL 184
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG +E+ G Y WAAK+AV L TGVPWVMCKQDDAPDP+INACNG C
Sbjct: 185 SQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYC- 243
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ PN KP +WTE WT ++ +G R AED+A+ VA FI K GS++NYYMYH
Sbjct: 244 -DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQK-GGSFINYYMYH 301
Query: 301 GGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA ++ T Y APLDEYGL RQPKWGHLK+LH A+KLC ++SG M
Sbjct: 302 GGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRM 361
Query: 360 NFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
QEA +++ S C+AFL N + ++ A V F N Y LPP SISILPDCK +NT
Sbjct: 362 PLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNT 421
Query: 419 AKLDSVE--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
A++ + W+ Y E TY + S L+EQ+NTT+D SDYLWY
Sbjct: 422 ARVGAQTSRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYM 481
Query: 465 FRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLIN 518
K D ++ L V S GH +H FING+ GSA+G T K V+L
Sbjct: 482 TDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRA 541
Query: 519 GTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAK-ELKDFSSFSWGYQVGLLGEKL 576
G N +++LS+ VGLP+ G + E AG L VS+ G +D S W Y+VGL GE L
Sbjct: 542 GFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESL 601
Query: 577 QIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
+ + GS V W+ + QPLTWYKT F AP G P+A+++ SMGKG+ W+NGQS+
Sbjct: 602 SLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSL 661
Query: 636 GRYWVSF--------------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
GR+W ++ L G SQ WYH+PRS+LKP+GNLLV+ EE
Sbjct: 662 GRHWPAYKAVGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWG 721
Query: 676 GYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ--NQRTLKTHKRIPGRRPKVQIRCPS 733
G P GI++ V ++C + + W+S N + + K PK ++C
Sbjct: 722 GDPNGITLVRREVDSVCADIYE--------WQSTLVNYQLHASGKVNKPLHPKAHLQCGP 773
Query: 734 GRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPC 793
G+KI+ + FAS+G P G C +Y GSCH+ +S K C+G+ C+V V E F GDPC
Sbjct: 774 GQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPC 833
Query: 794 PGIPKALLVDAQCT 807
P + K L V+A C
Sbjct: 834 PNVMKKLAVEAVCA 847
>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 830
Score = 751 bits (1939), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 398/820 (48%), Positives = 515/820 (62%), Gaps = 57/820 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNL+EP
Sbjct: 25 NVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLNEPV 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+DF GR+DLV+F+K V A GLYV LRIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 85 RGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ IV+M+K LYASQGGP+ILSQIENEYG ++ ++ G Y++WAA +
Sbjct: 145 PFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNIDSAYGAAGKSYIKWAATM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ ++ +G
Sbjct: 205 ATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQ--FTPNSNTKPKMWTENWSGWFLPFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNF RT+ ++ T Y AP+DEYG
Sbjct: 263 AVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
++RQPKWGHLKE+H A+KLC + +++ + EA +++ S CAAFL N D ++
Sbjct: 322 IIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTGSVCAAFLANVDTKS 381
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAK------------LDSVEQWEEYKEAIP 435
+ TV FS Y LP S+SILPDCK V NTAK L S W E +
Sbjct: 382 DVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKVCLTNFISMFMWLPSSTGWSWISEPVG 441
Query: 436 TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD-PSDSESVLKVSSLGHVLHAFINGE 494
S LLEQ+NTT D SDYLWY+ + + S++VL + SLGH LHAFING+
Sbjct: 442 ISKADSFPQTGLLEQINTTADKSDYLWYSLSIDYKGDAGSQTVLHIESLGHALHAFINGK 501
Query: 495 FVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQG 553
GS G FT++ V L+ G N + LLS+ VGL + GA+ + AG+ V ++G
Sbjct: 502 LAGSQTGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGAGITGPVILKG 561
Query: 554 AK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPT 611
D S W YQVGL GE L + + + S + +QPL WYKT F AP+
Sbjct: 562 LANGNTLDLSYQKWTYQVGLKGEDLGLSSGSSGQWNSQSTF--PKNQPLIWYKTTFAAPS 619
Query: 612 GSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTP 649
GSDPVAI+ MGKGEAWVNGQSIGRYW +++ G P
Sbjct: 620 GSDPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSASKCRRNCGKP 679
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ 709
SQ+ YH+PRS+LKP+GN+LVL EE+ G P IS T +LC HVSDSH PPV W S
Sbjct: 680 SQTLYHVPRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSHPPPVDLWNSD 739
Query: 710 NQRTLKTHKRIPGRR--PKVQIRCPSGRK-ISKILFASYGNPNGNCENYAIGSCHSSNSR 766
+ GR+ P + + CP + IS I FASYG P G C N+ G C S+ +
Sbjct: 740 TES---------GRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKAL 790
Query: 767 AIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+IV+KAC+G SC+V V +E F G+PC G+ K+L V+A C
Sbjct: 791 SIVQKACIGSSSCSVGVSSETF-GNPCRGVAKSLAVEATC 829
>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 848
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/825 (49%), Positives = 508/825 (61%), Gaps = 57/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYDG++LIING RKILFSGSIHYPRS P MW LI KAK GGLDVV T VFWNLHEP
Sbjct: 29 NVTYDGKALIINGQRKILFSGSIHYPRSVPDMWESLIEKAKMGGLDVVDTYVFWNLHEPS 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG +DF GR DLV+FIK V+ GLYV LRIGP+I GEW +GG P WL VPGI FR+DNE
Sbjct: 89 PGIYDFEGRNDLVKFIKLVEKAGLYVHLRIGPYICGEWNFGGFPAWLKFVPGISFRTDNE 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M ++ IV MMK RL+ SQGGPIILSQIENEY + F E G Y+ WAAK+
Sbjct: 149 PFKLAMAKFTKKIVQMMKDERLFQSQGGPIILSQIENEYETEDKVFGEAGFAYMNWAAKM 208
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV + TGVPWVMCKQDDAPDP+IN CNG C + PN P KP WTE WT+++ +G
Sbjct: 209 AVQMDTGVPWVMCKQDDAPDPMINTCNGFYC--DYFSPNKPYKPNFWTEAWTAWFNNFGG 266
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+A+ VA FI K GS VNYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 267 PNHKRPVEDLAFGVARFIQK-GGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 325
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLK LH AVKLC K +L+G + Q+A +F SS +CAAFL N
Sbjct: 326 LIRQPKFGHLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSN 385
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------SVE--QWEEYKEA 433
N A V F+ Y LPP SISILPDCK+V +NTA++ VE WE Y E
Sbjct: 386 NTARVTFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSFLPTKVESFSWETYNEN 445
Query: 434 IPTYDE-TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVSSLGHV 486
I + +E +S+ + LLEQ+ TKD SDYLWY DP++S L +S GH
Sbjct: 446 ISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGHG 505
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
+H FING+ GS+ G H + FT ++L G N VSLLS+ GLP++G + E R G
Sbjct: 506 MHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMGV 565
Query: 546 LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLTW 602
L V+I G + K D S W Y+VGL GE + + + + V W++ QPLTW
Sbjct: 566 LGPVAIHGLDKGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQENAQPLTW 625
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-------------SFLTPQ--- 646
YK FDAP G +P+A+++ SM KG+ W+NGQ++GRYW P+
Sbjct: 626 YKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWTITANGNCTDCSYSGTYRPRKCQ 685
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q WYH+PRS+L PT NL+V+ EE G P IS+ SVT++C S PV
Sbjct: 686 FGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICTEASQYR--PV 743
Query: 704 IS--WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
I QN L + K+ + C +G+ IS I FAS+G P+G C ++ G+CH
Sbjct: 744 IKNVHMHQNNGELNEQNVL-----KINLHCAAGQFISAIKFASFGTPSGACGSHKQGTCH 798
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
S S +++K C+G++ C + T F DPCP + K L + C
Sbjct: 799 SPKSDYVLQKLCVGRQRCLATIPTSIFGEDPCPNLRKKLSAEVVC 843
>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 854
Score = 750 bits (1937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/826 (47%), Positives = 515/826 (62%), Gaps = 60/826 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R++LFSGSIHYPRSTP+MW LI KAKEGGLDVV+T VFWN+HEP
Sbjct: 28 SVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DL RFIK +Q GLY LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 88 PGNYNFEGRYDLARFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV +MK+ L+ SQGGPIILSQIENEYG+ F G Y+ WAAK+
Sbjct: 148 PFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWAAKM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN CNG C + F+ PN P KP +WTE W+ ++ +G
Sbjct: 208 AVGLGTGVPWVMCKEEDAPDPVINTCNGFYC-DAFS-PNRPYKPTMWTEAWSGWFNEFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI K GS++NYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 266 PIHQRPVQDLAFAVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH AVK+C K ++S + + Q+A+++ S CAAFL N D
Sbjct: 325 LIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTD 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+N+ Y LPP SISILPDC+ V FNTAK+ + WE Y E
Sbjct: 385 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQLEMLPTNSPMLLWESYNED 444
Query: 434 IPTYDE-TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
+ D+ T++ A+ LLEQ+N TKD SDYLWY D +ES L V S G
Sbjct: 445 VSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSV--DIGSTESFLHGGELPTLIVQSTG 502
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING GSA G ++ FT V+ G N ++LLSV VGLP+ G + E
Sbjct: 503 HAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVGGHFETWNT 562
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPL 600
G L V++ G + K D S W Y+VGL GE + + + G V W + QPL
Sbjct: 563 GILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSLAAQAPQPL 622
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TW+K+ FDAP G +P+AI++ MGKG+ W+NG SIGRYW ++ T
Sbjct: 623 TWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAYATGNCDKCNYAGTFRPPK 682
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PR++LKP NLLV+ EE G P IS+ SVT +C VS+ H P
Sbjct: 683 CQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVTGVCADVSEYH-P 741
Query: 702 PVISWRSQNQ-RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
+ +W ++ ++ H RPKV ++C +G I+ I FAS+G P G C +Y G+C
Sbjct: 742 TLKNWHIESYGKSEDLH------RPKVHLKCSAGYSITSIKFASFGTPLGTCGSYQQGTC 795
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
H+ S I+EK C+GK+ C V + F DPCP + K L V+ C
Sbjct: 796 HAPMSYDILEKRCIGKQRCAVTISNTNFGQDPCPNVLKRLSVEVVC 841
>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
Length = 847
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/854 (47%), Positives = 518/854 (60%), Gaps = 67/854 (7%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M L L G L+ ++ GS V+YD R++ ING R+IL SGSIHYPRSTP+M
Sbjct: 14 MAAVSALFLLGFLVCSVSGS---------VSYDSRAITINGKRRILISGSIHYPRSTPEM 64
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI KAKEGGLDV+QT VFWN HEP PG++ F G DLV+F+K VQ GLY+ LRIGP
Sbjct: 65 WPDLIRKAKEGGLDVIQTYVFWNGHEPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGP 124
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
++ EW +GG P WL +PGI FR+DN PFK M+R+ T IVNMMKA RL+ SQGGPIIL
Sbjct: 125 YVCAEWNFGGFPVWLKYIPGISFRTDNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIIL 184
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG +E+ G Y WAAK+AV L TGVPWVMCKQDDAPDP+INACNG C
Sbjct: 185 SQIENEYGPMEYELGAPGRSYTNWAAKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYC- 243
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ PN KP +WTE WT ++ +G R AED+A+ VA FI K GS++NYYMYH
Sbjct: 244 -DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQK-GGSFINYYMYH 301
Query: 301 GGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA ++ T Y APLDEYGL RQPKWGHLK+LH A+KLC ++SG M
Sbjct: 302 GGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRM 361
Query: 360 NFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
QEA +++ S C+AFL N + ++ A V F N Y LPP SISILPDCK +NT
Sbjct: 362 PLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNT 421
Query: 419 AKLDSVE--------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN 464
A++ + W+ Y E TY + S L+EQ+NTT+D SDYLWY
Sbjct: 422 ARVGAQTSRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYM 481
Query: 465 FRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLIN 518
K D ++ L V S GH +H FING+ GSA+G T K V+L
Sbjct: 482 TDVKVDANEGFLRNGDLPTLTVLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNLRA 541
Query: 519 GTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAK-ELKDFSSFSWGYQVGLLGEKL 576
G N +++LS+ VGLP+ G + E AG L VS+ G +D S W Y+VGL GE L
Sbjct: 542 GFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESL 601
Query: 577 QIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
+ + GS V W+ + QPLTWYKT F AP G P+A+++ SMGKG+ W+NGQS+
Sbjct: 602 SLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSL 661
Query: 636 GRYWVSF--------------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
GR+W ++ L G SQ WYH+PRS+LKP+GNLLV+ EE
Sbjct: 662 GRHWPAYKAVGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWG 721
Query: 676 GYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ--NQRTLKTHKRIPGRRPKVQIRCPS 733
G P GI++ V ++C + + W+S N + + K PK ++C
Sbjct: 722 GDPNGITLVRREVDSVCADIYE--------WQSTLVNYQLHASGKVNKPLHPKAHLQCGP 773
Query: 734 GRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPC 793
G+KI+ + FAS+G P G C +Y GSCH+ +S K C+G+ C+V V E F GDPC
Sbjct: 774 GQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPC 833
Query: 794 PGIPKALLVDAQCT 807
P + K L V+A C
Sbjct: 834 PNVMKKLAVEAVCA 847
>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
Length = 846
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/836 (47%), Positives = 529/836 (63%), Gaps = 71/836 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G RK+L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFW+ HEP+
Sbjct: 25 NVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIKKSKDGGLDVIETYVFWSGHEPE 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
+++F GR DLV+F+K V+ GLYV LRIGP++ EW YGG P WLH VPGI FR+DNE
Sbjct: 85 KNKYNFEGRYDLVKFVKLVEEAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ T IV++MK +LYASQGGPIILSQIENEYG ++ ++ Y++W+A +
Sbjct: 145 PFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKIYIKWSASM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW MC+Q DAPDP+IN CNG C + PNS KP +WTENW+ ++ +GD
Sbjct: 205 ALSLDTGVPWNMCQQADAPDPMINTCNGFYCDQFT--PNSNSKPKMWTENWSGWFLGFGD 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R ED+A+ VA F + G++ NYYMYHGGTNF RT+ +++ YD AP+DEYG
Sbjct: 263 PSPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPML-SGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
LLRQPKWGHL++LH A+KLC ++ + +S S L+ A S CAAFL N +
Sbjct: 322 LLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGTK 381
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV---------------------- 424
++ATV F+ Y LP S+SILPDCK VAFNTAK++S
Sbjct: 382 SDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFARQSLKPDGGSSAELG 441
Query: 425 EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF--KHDPS----DSESVL 478
+W KE I + LLEQ+NTT D SDYLWY+ R K D + S++VL
Sbjct: 442 SEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAVL 501
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
+ SLG V++AFING+ GS HGK + +L+ ++L G N V LLSV VGL + GA+
Sbjct: 502 HIESLGQVVYAFINGKLAGSGHGK---QKISLDIPINLAAGKNTVDLLSVTVGLANYGAF 558
Query: 539 LERRVAGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS 595
+ AG+ V+++ AK D +S W YQVGL GE + T S V S+
Sbjct: 559 FDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATVDSSEWV--SKSPLP 616
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------- 646
T QPL WYKT FDAP+GS+PVAI+ GKG AWVNGQSIGRYW + +
Sbjct: 617 TKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTDSCDY 676
Query: 647 -------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV-TTLC 692
G PSQ+ YH+PRS+LKP+GN LVL EE G P IS T + LC
Sbjct: 677 RGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQTGSNLC 736
Query: 693 GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGN 751
VS SH PPV +W S ++ + + RP + ++CP S + IS I FAS+G P G
Sbjct: 737 LMVSQSHPPPVDTWTSDSKISNRNRT-----RPVLSLKCPVSTQVISSIKFASFGTPQGT 791
Query: 752 CENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C ++ G C+SS S ++V+KAC+G RSC V V T + +G+PC G+ K+L V+A C+
Sbjct: 792 CGSFTHGHCNSSRSLSVVQKACIGSRSCNVEVST-RVFGEPCRGVIKSLAVEASCS 846
>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/830 (47%), Positives = 516/830 (62%), Gaps = 67/830 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNL+EP
Sbjct: 25 NVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLNEPV 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+DF GR+DLV+F+K V A GLYV LRIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 85 RGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ IV+M+K LYASQGGP+ILSQIENEYG ++ ++ G Y++WAA +
Sbjct: 145 PFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNIDSAYGAAGKSYIKWAATM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ ++ +G
Sbjct: 205 ATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQ--FTPNSNTKPKMWTENWSGWFLPFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNF RT+ ++ T Y AP+DEYG
Sbjct: 263 AVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
++RQPKWGHLKE+H A+KLC + +++ + EA +++ S CAAFL N D ++
Sbjct: 322 IIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTGSVCAAFLANVDTKS 381
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ--------------------- 426
+ TV FS Y LP S+SILPDCK V NTAK++S
Sbjct: 382 DVTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISSFTTESLKEDIGSSEASST 441
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD-PSDSESVLKVSSLG 484
W E + S LLEQ+NTT D SDYLWY+ + + S++VL + SLG
Sbjct: 442 GWSWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSIDYKGDAGSQTVLHIESLG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LHAFING+ GS G FT++ V L+ G N + LLS+ VGL + GA+ + A
Sbjct: 502 HALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGA 561
Query: 545 GLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
G+ V ++G D S W YQVGL GE L + + + S + +QPL
Sbjct: 562 GITGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGLSSGSSGQWNSQSTF--PKNQPLI 619
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
WYKT F AP+GSDPVAI+ MGKGEAWVNGQSIGRYW +++
Sbjct: 620 WYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSA 679
Query: 647 -------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
G PSQ+ YH+PRS+LKP+GN+LVL EE+ G P IS T +LC HVSDSH
Sbjct: 680 SKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSH 739
Query: 700 LPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCPSGRK-ISKILFASYGNPNGNCENYA 756
PPV W S + GR+ P + + CP + IS I FASYG P G C N+
Sbjct: 740 PPPVDLWNSDTES---------GRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFY 790
Query: 757 IGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C S+ + +IV+KAC+G SC+V V +E F G+PC G+ K+L V+A C
Sbjct: 791 HGRCSSNKALSIVQKACIGSSSCSVGVSSETF-GNPCRGVAKSLAVEATC 839
>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 846
Score = 750 bits (1936), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/825 (48%), Positives = 517/825 (62%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++ ING R+IL SGSIHYPRS+P+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 32 SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 91
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLV+F+K + GLYV LRIGP+I EW +GG P WL +PGI FR+DN
Sbjct: 92 PGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRTDNG 151
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T IVNMMKA RL+ +QGGPIILSQIENEYG +E+ G Y +WAA++
Sbjct: 152 PFKAQMQKFTTKIVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWAAEM 211
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+TGVPWVMCKQDDAPDP+IN CNG C + PN KP +WTE WT ++ +G
Sbjct: 212 AVGLRTGVPWVMCKQDDAPDPIINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWFTQFGG 269
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 270 PVPHRPAEDMAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 328
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKR 386
LLRQPKWGHLK+LH A+KLC ++SG + QEA +F + CAAFL N +R
Sbjct: 329 LLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQR 388
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKE 432
+ A V F N+ Y LPP SISILPDCK +NTA++ + W+ Y E
Sbjct: 389 SFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTPVPMHGGFSWQAYNE 448
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHV 486
+++ LLEQ+NTT+D SDYLWY DPS+ VL V S GH
Sbjct: 449 EPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLGVLSAGHA 508
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
LH FING+ G+A+G T + V L G N +SLLS+ VGLP+ G + E AG
Sbjct: 509 LHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHFETWNAGI 568
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTW 602
L V++ G E +D S W Y++GL GE L + + GS V W+ GS + QPL+W
Sbjct: 569 LGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAE-GSLVAQRQPLSW 627
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------- 643
YKT F+AP G+ P+A+++ SMGKG+ W+NGQ +GR+W ++
Sbjct: 628 YKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKASGTCGDCSYIGTYNEKKC 687
Query: 644 -TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
T G SQ WYH+P+S+LKPTGNLLV+ EE G P GIS+ V ++C + + P
Sbjct: 688 STNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDVDSVCADIYEWQ-PT 746
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
++++ Q Q + K +K + RPK + C G+KI I FAS+G P G C +Y GSCH+
Sbjct: 747 LMNY--QMQASGKVNKPL---RPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYRQGSCHA 801
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+S C+G+ SC+V V E F GDPC + K L V+A C+
Sbjct: 802 FHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCLNVMKKLAVEAICS 846
>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
Length = 833
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/826 (48%), Positives = 519/826 (62%), Gaps = 55/826 (6%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +VTYD RS IING RKIL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN H
Sbjct: 19 GSASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 78
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP G++ F GR DLVRFIK VQA GLYV LRIGP+I EW +GG P WL VPGI FR+
Sbjct: 79 EPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 138
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
DN PFK M+ + IV+MMK+ +L+ QGGPII+SQIENEYG VE+ G Y +WA
Sbjct: 139 DNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWA 198
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
A++AV L TGVPWVMCKQ+DAPDPVI+ACNG C F PN KP ++TE WT +Y
Sbjct: 199 AEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFF--PNKDYKPKMFTEAWTGWYTE 256
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLD 324
+G R AED+AY VA FI + +GS++NYYMYHGGTNFGRTA ++ YD AP+D
Sbjct: 257 FGGAIPNRPAEDLAYSVARFI-QNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPID 315
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNK 383
EYGL +PKWGHL++LH A+KLC ++S EA +++ S CAAFL N
Sbjct: 316 EYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANY 375
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------SVEQWEEYKE 432
D +++A V F N Y+LPP S+SILPDCK V FNTA++ S W+ Y E
Sbjct: 376 DPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQSSQMKMNPVSTFSWQSYNE 435
Query: 433 AIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
+ Y E + + LLEQ+N T+D +DYLWY P + VL V S GH
Sbjct: 436 ETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYPVLTVMSAGH 495
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH FING+ G+ +G+ S+ T V L GTN +SLLSV +GLP+ G + E AG
Sbjct: 496 ALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGLHFETWNAG 555
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V+++G E D SS+ W Y++GL GE L + GS W GS + QPLT
Sbjct: 556 VLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVE-GSLLAQKQPLT 614
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------ 643
WYKT F+AP G+DP+A+++ SMGKG+ W+NG+SIGR+W ++
Sbjct: 615 WYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWPAYTAHGNCNGCNYAGIFNDKK 674
Query: 644 --TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
T G PSQ WYH+PRS+LKP+GN L++ EE G P GI++ ++ +C + +
Sbjct: 675 CQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVCADIFEGQ-- 732
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
P + +N + + + K + + K + C G KISKI FAS+G P G C ++ GSCH
Sbjct: 733 PSL----KNSQIIGSSK-VNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGSCH 787
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ S +++ C+GK+SC+V V E F GDPCPG K L V+A C+
Sbjct: 788 AHKSYDALQRNCIGKQSCSVSVAPEVFGGDPCPGSMKKLSVEALCS 833
>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 839
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/823 (48%), Positives = 517/823 (62%), Gaps = 59/823 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +++++NG R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 30 SVTYDHKAIVVNGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 89
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLV+FIK VQ GLYV LRIGP+I EW +GG P WL VPGI FR+DNE
Sbjct: 90 PGKYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDNE 149
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV++MK +L+ +QGGPII+SQIENEYG VE G Y +W +++
Sbjct: 150 PFKAAMQKFTEKIVSIMKEEKLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWFSQM 209
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW+MCKQ D PDP+I+ CNG C E F PN KP +WTENWT +Y +G
Sbjct: 210 AVGLDTGVPWIMCKQQDTPDPLIDTCNGYYC-ENFT-PNKKYKPKMWTENWTGWYTEFGG 267
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA F+ + GS+VNYYMYHGGTNF RT+S + YD P+DEYG
Sbjct: 268 AVPRRPAEDMAFSVARFV-QNGGSFVNYYMYHGGTNFDRTSSGLFIATSYDYDGPIDEYG 326
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF-SKLQEAFIFQGSSECAAFLVNKDKR 386
LL +PKWGHL++LH A+KLC +P L V ++ + E +F+ S CAAFL N D +
Sbjct: 327 LLNEPKWGHLRDLHKAIKLC-EPALVSVDPTVTWPGNNLEVHVFKTSGACAAFLANYDTK 385
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQWEEYKEAI 434
++A+V F N Y+LPP SISILPDCKT FNTA+L +S W+ Y E
Sbjct: 386 SSASVKFGNGQYDLPPWSISILPDCKTAVFNTARLGAQSSLMKMTAVNSAFDWQSYNEEP 445
Query: 435 PTYDE-TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVL 487
+ +E SL A L EQ+N T+D++DYLWY D ++ VL V S GHVL
Sbjct: 446 ASSNEDDSLTAYALWEQINVTRDSTDYLWYMTDVNIDANEGFIKNGQSPVLTVMSAGHVL 505
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
H IN + G+ +G T V L G N +SLLS+ VGLP+ G + E AG L
Sbjct: 506 HVLINDQLSGTVYGGLDSHKLTFSDSVKLRVGNNKISLLSIAVGLPNVGPHFETWNAGVL 565
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWY 603
V+++G E +D S W Y++GL GE L + T GS V W + GS + QPL WY
Sbjct: 566 GPVTLKGLNEGTRDLSKQKWSYKIGLKGEALNLNTVSGSSSVEWVQ-GSLLAKQQPLAWY 624
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL-------------------- 643
KT F P G+DP+A+++ISMGKG+AW+NG+SIGR+W ++
Sbjct: 625 KTTFSTPAGNDPLALDMISMGKGQAWINGRSIGRHWPGYIARGNCGDCYYAGTYTDKKCR 684
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
T G PSQ WYHIPRS+L P+GN LV+ EE G P GI++ + ++C + P
Sbjct: 685 TNCGEPSQRWYHIPRSWLNPSGNYLVVFEEWGGDPTGITLVKRTTASVCADIYQGQ--PT 742
Query: 704 ISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
+ +N++ L + K + RPK + CP G+ IS+I FASYG P G C N+ GSCH+
Sbjct: 743 L----KNRQMLDSGKVV---RPKAHLWCPPGKNISQIKFASYGLPQGTCGNFREGSCHAH 795
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
S +K C+GK+SC V V E F GDPCPGI K L ++A C
Sbjct: 796 KSYDAPQKNCIGKQSCLVTVAPEVFGGDPCPGIAKKLSLEALC 838
>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/817 (48%), Positives = 513/817 (62%), Gaps = 58/817 (7%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD +++++NG R+IL SGSIHYPRSTP+MWP LI KAK+GGLDVVQT VFWN HEP PG
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSPG 86
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEPF
Sbjct: 87 QYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 146
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+++ T IV MMK+ L+ QGGPIILSQIENE+G +E E Y WAA +AV
Sbjct: 147 KAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 206
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L T VPW+MCK+DDAPDP+IN CNG C + PN P KP +WTE WT++Y +G
Sbjct: 207 ALNTSVPWIMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTAWYTGFGIPV 264
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+AY VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DEYGLL
Sbjct: 265 PHRPVEDLAYGVAKFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLL 323
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNN 388
R+PKWGHLK+LH A+KLC +++G + + Q++ +F+ S+ CAAFL NKDK +
Sbjct: 324 REPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLENKDKVSY 383
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-VEQ----------WEEYKEAIPTY 437
A V F+ + Y+LPP SISILPDCKT FNTA++ S + Q W+ Y E I ++
Sbjct: 384 ARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAGGFAWQSYNEEINSF 443
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYN--FRFKHDP---SDSESV-LKVSSLGHVLHAFI 491
E L LLEQ+N T+D +DYLWY D S+ E++ L V S GH LH FI
Sbjct: 444 GEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGENLKLTVMSAGHALHIFI 503
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVS 550
NG+ G+ +G D T V L G+N +S LS+ VGLP+ G + E AG L V+
Sbjct: 504 NGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVT 563
Query: 551 IQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
+ G E +D + W YQVGL GE + + + GS V W QPLTWYK F+A
Sbjct: 564 LDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGE--PVQKQPLTWYKAFFNA 621
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQGTP 649
P G +P+A+++ SMGKG+ W+NGQ IGRYW + T G
Sbjct: 622 PDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGNCGTCDYRGEYDETKCQTNCGDS 681
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ 709
SQ WYH+PRS+L PTGNLLV+ EE G P GIS+ S+ ++C VS+ P + +W ++
Sbjct: 682 SQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWHTK 740
Query: 710 NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIV 769
+ + KV ++C +G+KI++I FAS+G P G+C +Y G CH+ S I
Sbjct: 741 DY-----------EKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIF 789
Query: 770 EKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
K C+G+ C V V E F GDPCPG K +V+A C
Sbjct: 790 WKNCVGQERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 826
>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
Length = 836
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/826 (48%), Positives = 519/826 (62%), Gaps = 55/826 (6%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +VTYD RS IING RKIL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN H
Sbjct: 22 GSASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 81
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP G++ F GR DLVRFIK VQA GLYV LRIGP+I EW +GG P WL VPGI FR+
Sbjct: 82 EPSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 141
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
DN PFK M+ + IV+MMK+ +L+ QGGPII+SQIENEYG VE+ G Y +WA
Sbjct: 142 DNGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWA 201
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
A++AV L TGVPWVMCKQ+DAPDPVI+ACNG C F PN KP ++TE WT +Y
Sbjct: 202 AEMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFF--PNKDYKPKMFTEAWTGWYTE 259
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLD 324
+G R AED+AY VA FI + +GS++NYYMYHGGTNFGRTA ++ YD AP+D
Sbjct: 260 FGGAIPNRPAEDLAYSVARFI-QNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPID 318
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNK 383
EYGL +PKWGHL++LH A+KLC ++S EA +++ S CAAFL N
Sbjct: 319 EYGLPSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANY 378
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------SVEQWEEYKE 432
D +++A V F N Y+LPP S+SILPDCK V FNTA++ S W+ Y E
Sbjct: 379 DPKSSAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQSSQMKMNPVSTFSWQSYNE 438
Query: 433 AIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
+ Y E + + LLEQ+N T+D +DYLWY P + VL V S GH
Sbjct: 439 ETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYPVLTVMSAGH 498
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH FING+ G+ +G+ S+ T V L GTN +SLLSV +GLP+ G + E AG
Sbjct: 499 ALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGLHFETWNAG 558
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V+++G E D SS+ W Y++GL GE L + GS W GS + QPLT
Sbjct: 559 VLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVE-GSLLAQKQPLT 617
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------ 643
WYKT F+AP G+DP+A+++ SMGKG+ W+NG+SIGR+W ++
Sbjct: 618 WYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWPAYTAHGNCNGCNYAGIFNDKK 677
Query: 644 --TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
T G PSQ WYH+PRS+LKP+GN L++ EE G P GI++ ++ +C + +
Sbjct: 678 CQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVCADIFEGQ-- 735
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
P + +N + + + K + + K + C G KISKI FAS+G P G C ++ GSCH
Sbjct: 736 PSL----KNSQIIGSSK-VNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGSCH 790
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ S +++ C+GK+SC+V V E F GDPCPG K L V+A C+
Sbjct: 791 AHKSYDALQRNCIGKQSCSVSVAPEVFGGDPCPGSMKKLSVEALCS 836
>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
Length = 866
Score = 749 bits (1933), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 403/852 (47%), Positives = 524/852 (61%), Gaps = 87/852 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R+L+I+G R++L SGSIHYPRSTPQMWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 21 NVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEPV 80
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+DF GR+DLV+F+K V GLYV LRIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 81 KGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDNE 140
Query: 149 PFKF--HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
PFK MKR+ IV++MK +LYASQGGPIILSQIENEYG ++ ++ G Y+ WAA
Sbjct: 141 PFKVEAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWAA 200
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+A L TGVPWVMC+Q+DAPD +IN CNG C + PNS KP +WTENW+++Y ++
Sbjct: 201 KMATSLDTGVPWVMCQQEDAPDSIINTCNGFYCDQ--FTPNSNTKPKMWTENWSAWYLLF 258
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYM---------------------YHGGTNF 305
G R ED+A+ VA F + G++ NYYM YHGGTNF
Sbjct: 259 GGGFPHRPVEDLAFAVARFFQR-GGTFQNYYMVLQPEMFFTSSIYYMVLFLRPYHGGTNF 317
Query: 306 GR-TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
R T ++ T Y AP+DEYG++RQPKWGHLK+LH AVKLC + +++ +
Sbjct: 318 DRSTGGPFIATSYDFDAPIDEYGIIRQPKWGHLKDLHKAVKLCEEALIATEPKITSLGPN 377
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
EA +++ S CAAFL N D +++ TV FS Y LP S+SILPDCK V NTAK++S
Sbjct: 378 LEAAVYKTGSVCAAFLANVDTKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSA 437
Query: 425 EQWEEY-----KEAIPTYDETSLRANF-----------------LLEQMNTTKDASDYLW 462
+ KE I + + +S + ++ LLEQ+N T D SDYLW
Sbjct: 438 SAISNFVTKSSKEDISSLETSSSKWSWINEPVGISKDDIFSKTGLLEQINITADRSDYLW 497
Query: 463 YNFRFK-HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTN 521
Y+ D S++VL + SLGH LHAF+NG+ GS G ++ + +I G N
Sbjct: 498 YSLSVDLKDDLGSQTVLHIESLGHALHAFVNGKLAGSHTGNKDKPKLNVDIPIKVIYGNN 557
Query: 522 NVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAK---ELKDFSSFSWGYQVGLLGEKLQ 577
+ LLS+ VGL + GA+ +R AG+ V+++G K D SS W YQVGL GE L
Sbjct: 558 QIDLLSLTVGLQNYGAFFDRWGAGITGPVTLKGLKNGNNTLDLSSQKWTYQVGLKGEDLG 617
Query: 578 IFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGR 637
+ + GS S+ +QPL WYKT FDAP+GS+PVAI+ MGKGEAWVNGQSIGR
Sbjct: 618 LSS--GSSEGWNSQSTFPKNQPLIWYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGR 675
Query: 638 YWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
YW +++ G PSQ+ YH+PRSFLKP GN LVL EE
Sbjct: 676 YWPTYVASNADCTDSCNYRGPFTQTKCHMNCGKPSQTLYHVPRSFLKPNGNTLVLFEENG 735
Query: 676 GYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGR 735
G P I+ T + +LC HVSDSH P + W NQ T K P + + CP+
Sbjct: 736 GDPTQIAFATKQLESLCAHVSDSHPPQIDLW---NQDTTSWGK----VGPALLLNCPNHN 788
Query: 736 K-ISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCP 794
+ I I FASYG P G C N+ G C S+ + +IV+KAC+G RSC++ V T+ F GDPC
Sbjct: 789 QVIFSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSIGVSTDTF-GDPCR 847
Query: 795 GIPKALLVDAQC 806
G+PK+L V+A C
Sbjct: 848 GVPKSLAVEATC 859
>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
Length = 839
Score = 748 bits (1932), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/825 (48%), Positives = 517/825 (62%), Gaps = 56/825 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++ ING R+IL SGSIHYPRS+P+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 25 SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLV+F+K + GLYV LRIGP+I EW +GG P WL +PGI FR+DN
Sbjct: 85 PGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRTDNG 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T +VNMMKA RL+ +QGGPIILSQIENEYG +E+ G Y +WAA++
Sbjct: 145 PFKAQMQKFTTKVVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWAAEM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+TGVPWVMCKQDDAPDP+IN CNG C + PN KP +WTE WT ++ +G
Sbjct: 205 AVGLRTGVPWVMCKQDDAPDPIINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWFTQFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 263 PVPHRPAEDMAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKR 386
LLRQPKWGHLK+LH A+KLC ++SG + QEA +F + CAAFL N +R
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQR 381
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKE 432
+ A V F N+ Y LPP SISILPDCK +NTA++ + W+ Y E
Sbjct: 382 SFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMTPVPMHGGFSWQAYNE 441
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHV 486
+++ LLEQ+NTT+D SDYLWY DPS+ VL V S GH
Sbjct: 442 EPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLGVLSAGHA 501
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
LH FING+ G+A+G T + V L G N +SLLS+ VGLP+ G + E AG
Sbjct: 502 LHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHFETWNAGI 561
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTW 602
L V++ G E +D S W Y++GL GE L + + GS V W+ GS + QPL+W
Sbjct: 562 LGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAE-GSLVAQRQPLSW 620
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------- 643
YKT F+AP G+ P+A+++ SMGKG+ W+NGQ +GR+W ++
Sbjct: 621 YKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKASGTCGDCSYIGTYNEKKC 680
Query: 644 -TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
T G SQ WYH+P+S+LKPTGNLLV+ EE G P GIS+ V ++C + + P
Sbjct: 681 STNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDVDSVCADIYEWQ-PT 739
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
++++ Q Q + K +K + RPK + C G+KI I FAS+G P G C +Y GSCH+
Sbjct: 740 LMNY--QMQASGKVNKPL---RPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYRQGSCHA 794
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+S C+G+ SC+V V E F GDPC + K L V+A C+
Sbjct: 795 FHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCLNVMKKLAVEAICS 839
>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
Length = 835
Score = 748 bits (1931), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/823 (47%), Positives = 521/823 (63%), Gaps = 54/823 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++I+NG RKIL SGSIHYPRSTP+MWP LI KAKEGG+DV+QT VFWN HEP+
Sbjct: 23 SVSYDHKAIIVNGQRKILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEPE 82
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G++ F R DLV+FIK VQ GLYV LRIGP+ EW +GG P WL VPGI FR++NE
Sbjct: 83 EGKYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFGGFPVWLKYVPGISFRTNNE 142
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T IV+MMKA +LY +QGGPIILSQIENEYG +E E G Y WAAK+
Sbjct: 143 PFKAAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENEYGPMEWELGEPGKVYSEWAAKM 202
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AVDL TGVPW+MCKQDD PDP+IN CNG C + PN +KP +WTE WT+++ +G
Sbjct: 203 AVDLGTGVPWIMCKQDDVPDPIINTCNGFYC--DYFTPNKANKPKMWTEAWTAWFTEFGG 260
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA FI + GS++NYYMYHGGTNFGRT+ ++ T Y APLDE+G
Sbjct: 261 PVPYRPAEDMAFAVARFI-QTGGSFINYYMYHGGTNFGRTSGGPFIATSYDYDAPLDEFG 319
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LRQPKWGHLK+LH A+KLC ++S + QEA +F+ S CAAFL N ++
Sbjct: 320 SLRQPKWGHLKDLHRAIKLCEPALVSVDPTVTSLGNYQEARVFKSESGACAAFLANYNQH 379
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAI 434
+ A V F N+ Y LPP SISILPDCK +NTA++ + WE + E
Sbjct: 380 SFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSAQMKMTPVSRGFSWESFNEDA 439
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
++++ + LLEQ+N T+D SDYLWY + DP++ + L V S GH LH
Sbjct: 440 ASHEDDTFTVVGLLEQINITRDVSDYLWYMTDIEIDPTEGFLNSGNWPWLTVFSAGHALH 499
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
F+NG+ G+ +G + T ++L G N +SLLS+ VGLP+ G + E AG L
Sbjct: 500 VFVNGQLAGTVYGSLENPKLTFSNGINLRAGVNKISLLSIAVGLPNVGPHFETWNAGVLG 559
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
VS+ G E +D + W Y+VGL GE L + + GS V W GS + QPL+WYK
Sbjct: 560 PVSLNGLNEGTRDLTWQKWFYKVGLKGEALSLHSLSGSPSVEWVE-GSLVAQKQPLSWYK 618
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LT 644
T F+AP G++P+A+++ +MGKG+ W+NGQS+GR+W ++ LT
Sbjct: 619 TTFNAPDGNEPLALDMNTMGKGQVWINGQSLGRHWPAYKSSGSCSVCNYTGWFDEKKCLT 678
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVI 704
G SQ WYH+PRS+L PTGNLLV+ EE G P GI++ + ++C + + P ++
Sbjct: 679 NCGEGSQRWYHVPRSWLYPTGNLLVVFEEWGGDPYGITLVKREIGSVCADIYEWQ-PQLL 737
Query: 705 SWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
+W QR + P RPK ++C G+KIS I FAS+G P G C N+ GSCH+
Sbjct: 738 NW----QRLVSGKFDRP-LRPKAHLKCAPGQKISSIKFASFGTPEGVCGNFQQGSCHAPR 792
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S +K C+GK SC+V V E F GDPC + K L V+A C+
Sbjct: 793 SYDAFKKNCVGKESCSVQVTPENFGGDPCRNVLKKLSVEAICS 835
>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 845
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/829 (48%), Positives = 513/829 (61%), Gaps = 54/829 (6%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G +V+YD +++ ING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN
Sbjct: 26 GHASASVSYDHKAITINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWN 85
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP PG++ F G DLVRFIK VQ GLYV LRIGP++ EW +GG P WL +PGI F
Sbjct: 86 GHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISF 145
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+DN PFKF M+++ IV+MMKA RL+ SQGGPIILSQIENEYG +E+ G Y +
Sbjct: 146 RTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTQ 205
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WAA +AV L TGVPW+MCKQ+DAPDP+IN CNG C + PN KP +WTE WT ++
Sbjct: 206 WAAHMAVGLGTGVPWIMCKQEDAPDPIINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWF 263
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAP 322
+G R AED+A+ +A FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP
Sbjct: 264 TEFGGAVPHRPAEDLAFSIARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAP 322
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLV 381
LDEYGL RQPKWGHLK+LH A+KLC ++SG +EA +F+ S CAAFL
Sbjct: 323 LDEYGLPRQPKWGHLKDLHRAIKLCEPALVSGDPTVQQLGNYEEAHVFRSKSGACAAFLA 382
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QW 427
N + ++ ATV F N Y LPP SISILP+CK +NTA++ S W
Sbjct: 383 NYNPQSYATVAFGNQRYNLPPWSISILPNCKHTVYNTARVGSQSTTMKMTRVPIHGGLSW 442
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVS 481
+ + E T D++S LLEQ+N T+D SDYLWY+ + ++ VL V
Sbjct: 443 KAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTVL 502
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S GH LH FIN + G+A+G T + V L G N +SLLSV VGLP+ G + ER
Sbjct: 503 SAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFER 562
Query: 542 RVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR-YGSSTHQ 598
AG L +++ G E +D + W Y+VGL GE L + + GS V W + + S Q
Sbjct: 563 WNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSRRQ 622
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------ 646
PLTWYKT FDAP G P+A+++ SMGKG+ W+NGQS+GRYW ++
Sbjct: 623 PLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKASGSCGYCNYAGTYN 682
Query: 647 --------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDS 698
G SQ WYH+P S+LKPTGNLLV+ EE G P GI + + ++C + +
Sbjct: 683 EKKCGSNCGQASQRWYHVPHSWLKPTGNLLVVFEELGGDPNGIFLVRRDIDSVCADIYEW 742
Query: 699 HLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIG 758
P ++S+ Q +++ RPK + C G+KIS I FAS+G P G+C NY G
Sbjct: 743 Q-PNLVSYDMQASGKVRSPV-----RPKAHLSCGPGQKISSIKFASFGTPVGSCGNYREG 796
Query: 759 SCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
SCH+ S +K C+G+ CTV V E F GDPCP + K L V+A CT
Sbjct: 797 SCHAHKSYDAFQKNCVGQSWCTVTVSPEIFGGDPCPSVMKKLSVEAICT 845
>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
Length = 852
Score = 747 bits (1928), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/835 (46%), Positives = 516/835 (61%), Gaps = 70/835 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+++G R++L SGSIHYPRSTP MWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 32 NVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLHEPV 91
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q+DF GR+DL+ F+K V+ GL+V +RIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 92 RNQYDFEGRKDLINFVKLVEKAGLFVHIRIGPYVCAEWNYGGFPLWLHFIPGIEFRTDNE 151
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAA 206
PFK MKR+ IV+M+K LYASQGGP+ILSQIENEYG +E + + PYV WAA
Sbjct: 152 PFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIESRYGPRAKPYVNWAA 211
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+A L TGVPWVMC+Q DAP VIN CNG C + NS P +WTENWT ++ +
Sbjct: 212 SMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQ--FKQNSDKTPKMWTENWTGWFLSF 269
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDE 325
G R EDIA+ VA F + G++ NYYMYHGGTNFGRT+ ++ T Y APLDE
Sbjct: 270 GGPVPYRPVEDIAFAVARFFQR-GGTFQNYYMYHGGTNFGRTSGGPFIATSYDYDAPLDE 328
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
YGL+ QPKWGHLK+LH A+KLC M++ + E +++ S+CAAFL N
Sbjct: 329 YGLINQPKWGHLKDLHKAIKLCEAAMVATEPNITSLGSNIEVSVYKTDSQCAAFLANTAT 388
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------------- 426
+++A V F+ Y LPP S+SILPDCK VAF+TAK++S
Sbjct: 389 QSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTISTFVTRSSEADASGGSLS 448
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNF--RFKHDP----SDSESVLK 479
W E + +E + LLEQ+NTT D SDYLWY+ K+D S +VL
Sbjct: 449 GWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDEPFLQDGSATVLH 508
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V +LGHVLHA+ING+ GS G +FT+E V L+ G N + LLS VGL + GA+
Sbjct: 509 VKTLGHVLHAYINGKLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYGAFF 568
Query: 540 ERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGSS 595
+ + AG+ V ++G K D SS W YQVGL GE L + ++ GS + W S+
Sbjct: 569 DLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGL-SNGGSTL--WKSQTALP 625
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------- 646
T+QPL WYK FDAP G P++++ MGKGEAWVNGQSIGR+W +++ P
Sbjct: 626 TNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDPCNY 685
Query: 647 -------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCG 693
G PSQ YH+PRS+LK +GN+LVL EE G P +S T + ++C
Sbjct: 686 RGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQSVCS 745
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS-GRKISKILFASYGNPNGNC 752
+SD+H P+ W S++ K+ P + + CP + IS I FAS+G P G C
Sbjct: 746 RISDAHPLPIDMWASEDDARKKSG-------PTLSLECPHPNQVISSIKFASFGTPQGTC 798
Query: 753 ENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
++ G C SSN+ +IV+KAC+G +SC++ V F GDPC G+ K+L V+A CT
Sbjct: 799 GSFIHGRCSSSNALSIVKKACIGSKSCSLGVSINAF-GDPCKGVAKSLAVEASCT 852
>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
Length = 843
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 404/825 (48%), Positives = 514/825 (62%), Gaps = 57/825 (6%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD +++IING R+ILFSGSIHYPRSTP MW LI KAKEGGLDV++T VFWN+HEP
Sbjct: 24 SDVTYDRKAIIINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVHEP 83
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
PG ++F GR DLVRFI+ V GLY LRIGP++ EW +GG P WL VPGI FR DN
Sbjct: 84 SPGNYNFEGRNDLVRFIQTVHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRQDN 143
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK M+ + IV MMK+ RLY SQGGPIILSQIENEYG G Y+ WAAK
Sbjct: 144 EPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKMLGPVGYNYMSWAAK 203
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV++ TGVPW+MCK+DDAPDPVIN CNG C + F PN P KP +WTE W+ ++ +G
Sbjct: 204 MAVEMGTGVPWIMCKEDDAPDPVINTCNGFYC-DKFT-PNKPYKPTMWTEAWSGWFSEFG 261
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R +D+A+ VA FI K GS+VNYYMYHGGTNFGRTA +T YD APLDEY
Sbjct: 262 GPIHKRPVQDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEY 320
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDK 385
GL+RQPK+GHLKELH A+K+C K ++S V + Q+A+++ S +C+AFL N D
Sbjct: 321 GLIRQPKYGHLKELHKAIKMCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDS 380
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKE 432
+++A V F+N+ Y LPP S+SILPDC+ FNTAK+ WE ++E
Sbjct: 381 KSSARVMFNNMHYNLPPWSVSILPDCRNAVFNTAKVGVQTSQMQMLPTNSERFSWESFEE 440
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
+ T++ A+ LLEQ+N T+D SDYLWY D SES L V S G
Sbjct: 441 DTSSSSATTITASGLLEQINVTRDTSDYLWYITSV--DVGSSESFLHGGKLPSLIVQSTG 498
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING GSA+G D+ F V+L GTN ++LLSV VGLP+ G + E
Sbjct: 499 HAVHVFINGRLSGSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNT 558
Query: 545 G-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPL 600
G L V I G + K D S W YQVGL GE + + + G V W S +QPL
Sbjct: 559 GILGPVVIHGLDKGKLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPL 618
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ 646
TW+KT FDAP G +P+A+++ MGKG+ W+NG SIGRYW SF P+
Sbjct: 619 TWHKTFFDAPEGEEPLALDMDGMGKGQIWINGISIGRYWTAIATGSCNDCNYAGSFRPPK 678
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+LK NLLV+ EE G P IS+ SV+++C VS+ H P
Sbjct: 679 CQLGCGQPTQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVCADVSEYH-P 737
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+ +W + + R PKV + C G+ IS I FAS+G P G C +Y G+CH
Sbjct: 738 NLKNWHIDSYGKSENF-----RPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQGACH 792
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
SS+S I+E+ C+GK C V V F DPCP + K L V+A C
Sbjct: 793 SSSSYDILEQKCIGKPRCIVTVSNSNFGRDPCPNVLKRLSVEAVC 837
>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
Length = 846
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/823 (47%), Positives = 511/823 (62%), Gaps = 55/823 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++IING R+IL SGSIHYPRSTP+MW LI KAK+GGLDV+ T VFW++HE P
Sbjct: 28 VTYDKKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWDVHETSP 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRFIK VQ GLY LRIGP++ EW +GG P WL VPGI FR+DNEP
Sbjct: 88 GNYNFDGRYDLVRFIKTVQKVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK L+ASQGGPIILSQIENEYG + G Y+ WAAK+A
Sbjct: 148 FKAAMQGFTQKIVQMMKNENLFASQGGPIILSQIENEYGPESRALGAAGRSYINWAAKMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK+DDAPDP+IN CNG C + FA PN P KP +WTE W+ ++ +G
Sbjct: 208 VGLDTGVPWVMCKEDDAPDPMINTCNGFYC-DAFA-PNKPYKPTLWTEAWSGWFTEFGGP 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R ED+A+ VA FI K GSY NYYMYHGGTNFGR+A +T YD AP+DEYGL
Sbjct: 266 IHQRPVEDLAFAVARFIQK-GGSYFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGL 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
+R+PK+GHLK LH A+KLC ++S + Q+A +F CAAFL N + ++
Sbjct: 325 IREPKYGHLKALHKAIKLCEHALVSSDPSITSLGTYQQAHVFSSGRSCAAFLANYNAKSA 384
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-------------VEQWEEYKEAIP 435
A V F+N+ Y+LPP SISILPDC+ V FNTA++ + + WE Y E I
Sbjct: 385 ARVMFNNMHYDLPPWSISILPDCRNVVFNTARVGAQTLRMQMLPTGSELFSWETYDEEIS 444
Query: 436 TY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLH 488
+ D + + A LLEQ+N T+D SDYLWY PS++ + L V S GH LH
Sbjct: 445 SLTDSSRITALGLLEQINVTRDTSDYLWYLTSVDISPSEAFLRNGQKPSLTVQSAGHGLH 504
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
FING+F GSA G ++ T V+L GTN ++LLS+ VGLP+ G + E G++
Sbjct: 505 VFINGQFSGSAFGTRENRQLTFTGPVNLRAGTNRIALLSIAVGLPNVGLHYETWKTGVQG 564
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPLTWYK 604
V + G + KD + W YQVGL GE + + + G V W SS Q L W+K
Sbjct: 565 PVLLNGLNQGKKDLTWQKWSYQVGLKGEAMNLVSPNGVSSVDWIEGSLASSQGQALKWHK 624
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------------ 646
FDAP G++P+A+++ SMGKG+ W+NGQSIGRYW+++
Sbjct: 625 AYFDAPRGNEPLALDMRSMGKGQVWINGQSIGRYWMAYAKGDCNSCSYIWTFRPSKCQLG 684
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G P+Q WYH+PRS+LKPT NLLV+ EE G IS+ S+ +C + H P +
Sbjct: 685 CGEPTQRWYHVPRSWLKPTKNLLVVFEELGGDASKISLVKRSIEGVCADAYEHH-PATKN 743
Query: 706 WRS-QNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
+ + N + K H+ K+ +RC G+ I+ I FAS+G P+G C ++ G+CH+ N
Sbjct: 744 YNTGGNDESSKLHQ------AKIHLRCAPGQFIAAIKFASFGTPSGTCGSFQQGTCHAPN 797
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ +++EK C+G+ SC V + F DPCP + K L V+A C+
Sbjct: 798 THSVIEKKCIGQESCMVTISNSNFGADPCPNVLKKLSVEAVCS 840
>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 843
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/829 (48%), Positives = 513/829 (61%), Gaps = 54/829 (6%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G +V+YD +++IING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN
Sbjct: 24 GQASASVSYDHKAIIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWN 83
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP PG++ F G DLVRFIK VQ GLYV LRIGP++ EW +GG P WL +PGI F
Sbjct: 84 GHEPSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISF 143
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+DN PFKF M+++ IV+MMKA RL+ SQGGPIILSQIENEYG +E+ G Y +
Sbjct: 144 RTDNGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRSYTQ 203
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WAA +AV L TGVPW+MCKQDDAPDP+IN CNG C + PN KP +WTE WT ++
Sbjct: 204 WAAHMAVGLGTGVPWIMCKQDDAPDPIINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWF 261
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAP 322
+G R AED+A+ +A FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP
Sbjct: 262 TEFGGAVPHRPAEDLAFSIARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAP 320
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLV 381
LDEYGL RQPKWGHLK+LH A+KLC ++SG +EA +F+ S CAAFL
Sbjct: 321 LDEYGLARQPKWGHLKDLHRAIKLCEPALVSGDSTVQRLGNYEEAHVFRSKSGACAAFLA 380
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QW 427
N + ++ ATV F N Y LPP SISILP+CK +NTA++ S W
Sbjct: 381 NYNPQSYATVAFGNQHYNLPPWSISILPNCKHTVYNTARVGSQSTTMKMTRVPIHGGLSW 440
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVS 481
+ + E T D++S LLEQ+N T+D SDYLWY+ + ++ VL V
Sbjct: 441 KAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTVL 500
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S GH LH FIN + G+A+G T + V L G N +SLLSV VGLP+ G + ER
Sbjct: 501 SAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFER 560
Query: 542 RVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR-YGSSTHQ 598
AG L +++ G E +D + W Y+VGL GE L + + GS V W + + S Q
Sbjct: 561 WNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSRRQ 620
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------ 646
PLTWYKT FDAP G P+A+++ SMGKG+ W+NGQS+GRYW ++
Sbjct: 621 PLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKASGSCGYCNYAGTYN 680
Query: 647 --------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDS 698
G SQ WYH+P S+LKP+GNLLV+ EE G P GI + + ++C + +
Sbjct: 681 EKKCGSNCGEASQRWYHVPHSWLKPSGNLLVVFEELGGDPNGIFLVRRDIDSVCADIYEW 740
Query: 699 HLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIG 758
P ++S+ Q +++ RPK + C G+KIS I FAS+G P G+C +Y G
Sbjct: 741 Q-PNLVSYEMQASGKVRSPV-----RPKAHLSCGPGQKISSIKFASFGTPVGSCGSYREG 794
Query: 759 SCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
SCH+ S K C+G+ CTV V E F GDPCP + K L V+A CT
Sbjct: 795 SCHAHKSYDAFLKNCVGQSWCTVTVSPEIFGGDPCPRVMKKLSVEAICT 843
>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
Length = 839
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/824 (49%), Positives = 514/824 (62%), Gaps = 54/824 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++ ING RKIL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 25 SVSYDYKAITINGQRKILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLV+FI+ VQ GLYV LRIGP+ EW +GG P WL +PGI FR+DN
Sbjct: 85 PGKYYFEGNYDLVKFIRLVQQAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTDNG 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFKF M+++ T IVN+MKA RLY SQGGPIILSQIENEYG +E+ G Y +WAA +
Sbjct: 145 PFKFQMQKFTTKIVNIMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYAQWAAHM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPWVMCKQDDAPDPVIN CNG C + PN KP +WTE WT ++ +G
Sbjct: 205 AIGLGTGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWFTGFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 263 TVPHRPAEDLAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LLRQPKWGHLK+LH A+KLC ++S QEA +F+ S CAAFL N +
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSADPTVTRLGNYQEAHVFKSKSGACAAFLANYNPH 381
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKE 432
+ +TV F N Y LPP SISILP+CK +NTA+L S W+ + E
Sbjct: 382 SYSTVAFGNQHYNLPPWSISILPNCKHTVYNTARLGSQSAQMKMTRVPIHGGLSWKAFNE 441
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHV 486
T D++S LLEQ+N T+D SDYLWY+ +P + VL V S GH
Sbjct: 442 ETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINPDEGYFRNGKNPVLTVLSAGHA 501
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
LH FING+ G+ +G T + V+L G N +SLLSV VGLP+ G + E AG
Sbjct: 502 LHVFINGQLSGTVYGSLDFPKLTFSESVNLRAGVNKISLLSVAVGLPNVGPHFETWNAGV 561
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR-YGSSTHQPLTWY 603
L +++ G E +D + W Y+VGL GE L + + GS V W + Y S QPLTWY
Sbjct: 562 LGPITLNGLNEGRRDLTWQKWSYKVGLKGEDLSLHSLSGSSSVDWLQGYLVSRRQPLTWY 621
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL-------------------- 643
KT FDAP G P+A+++ SMGKG+ W+NGQS+GRYW ++
Sbjct: 622 KTTFDAPAGVAPLALDMNSMGKGQVWLNGQSLGRYWPAYKATGSCDYCNYAGTYNEKKCG 681
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
T G SQ WYH+P S+LKPTGNLLV+ EE G P G+ + + ++C + + P +
Sbjct: 682 TNCGEASQRWYHVPHSWLKPTGNLLVMFEELGGDPNGVFLVRRDIDSVCADIYEWQ-PNL 740
Query: 704 ISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
+S+ Q Q + K + + PK + C G+KIS I FAS+G P G+C NY GSCH+
Sbjct: 741 VSY--QMQASGKVSRPV---SPKAHLSCGPGQKISSIKFASFGTPVGSCGNYREGSCHAH 795
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S ++ C+G+ SCTV V E F GDPCP + K L V+A CT
Sbjct: 796 KSYDAFQRNCVGQSSCTVTVSPEIFGGDPCPNVMKKLSVEAICT 839
>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
Length = 840
Score = 745 bits (1923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 399/831 (48%), Positives = 518/831 (62%), Gaps = 69/831 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 25 NVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+DF GR+DLV+F+K V A GLYV LRIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 85 RGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPVWLHFIPGIKFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ IV+M+K +LYASQGGP+ILSQIENEYG ++ ++ G Y++WAA +
Sbjct: 145 PFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNIDTAYGAAGKSYIKWAATM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC Q DAPDP+IN NG G+ F PNS KP +WTENW+ ++ V+G
Sbjct: 205 ATSLDTGVPWVMCLQADAPDPIINTWNGFY-GDEFT-PNSNTKPKMWTENWSGWFLVFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNF R + ++ T Y AP+DEYG
Sbjct: 263 AVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRASGGPFIATSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
++RQPKWGHLKE+H A+KLC + +++ + EA +++ S CAAFL N ++
Sbjct: 322 IIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTGSVCAAFLANVGTKS 381
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ--------------------- 426
+ TV FS Y LP S+SILPDCK+V NTAK++S
Sbjct: 382 DVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTESSKEDIGSSEASST 441
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHDPSDSESVLKVSSL 483
W E + S LLEQ+NTT D SDYLWY+ +K D S S++VL + SL
Sbjct: 442 GWSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSIDYKADAS-SQTVLHIESL 500
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH LHAFING+ GS G FT++ V L+ G N + LLS+ VGL + GA+ +
Sbjct: 501 GHALHAFINGKLAGSQPGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWG 560
Query: 544 AGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
G+ V ++G D SS W YQVGL GE L + + + S + +QPL
Sbjct: 561 VGITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGLSSGSSGQWNLQSTF--PKNQPL 618
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGT------------ 648
TWYKT F AP+GSDPVAI+ MGKGEAWVNGQ IGRYW +++ +
Sbjct: 619 TWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDSCNYRGPYS 678
Query: 649 ----------PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDS 698
PSQ+ YH+PRS+LKP+GN+LVL EE G P IS T +LC HVSDS
Sbjct: 679 ASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTESLCAHVSDS 738
Query: 699 HLPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCPSGRK-ISKILFASYGNPNGNCENY 755
H PPV W S+ + GR+ P + + CP + IS I FASYG P G C N+
Sbjct: 739 HPPPVDLWNSETES---------GRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNF 789
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C S+ + +IV+KAC+G SC+V V ++ F GDPC G+ K+L V+A C
Sbjct: 790 YHGRCSSNKALSIVQKACIGSSSCSVGVSSDTF-GDPCRGMAKSLAVEATC 839
>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
Length = 852
Score = 745 bits (1923), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/835 (46%), Positives = 514/835 (61%), Gaps = 70/835 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+++G R++L SGSIHYPRSTP MWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 32 NVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLHEPV 91
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q+DF GR+DL+ F+K V+ GL+V +RIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 92 RNQYDFEGRKDLINFVKLVERAGLFVHIRIGPYVCAEWNYGGFPLWLHFIPGIEFRTDNE 151
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAA 206
PFK MKR+ IV+M+K LYASQGGP+ILSQIENEYG +E + + PYV WAA
Sbjct: 152 PFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIESRYGPRAKPYVNWAA 211
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+A L TGVPWVMC+Q DAP VIN CNG C + NS P +WTENWT ++ +
Sbjct: 212 SMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQ--FKQNSDKTPKMWTENWTGWFLSF 269
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDE 325
G R EDIA+ VA F + G++ NYYMYHGGTNFGRT+ ++ T Y APLDE
Sbjct: 270 GGPVPYRPVEDIAFAVARFFQR-GGTFQNYYMYHGGTNFGRTSGGPFIATSYDYDAPLDE 328
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
YGL+ QPKWGHLK+LH A+KLC M++ + E +++ S+CAAFL N
Sbjct: 329 YGLINQPKWGHLKDLHKAIKLCEAAMVATEPNVTSLGSNIEVSVYKTDSQCAAFLANTAT 388
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------------- 426
+++A V F+ Y LPP S+SILPDCK VAF+TAK++S
Sbjct: 389 QSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTISTFVTRSSEADASGGSLS 448
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNF--RFKHDP----SDSESVLK 479
W E + +E + LLEQ+NTT D SDYLWY+ K+D S +VL
Sbjct: 449 GWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDEPFLQDGSATVLH 508
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V +LGHVLHA+ING GS G +FT+E V L+ G N + LLS VGL + GA+
Sbjct: 509 VKTLGHVLHAYINGRLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYGAFF 568
Query: 540 ERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGSS 595
+ + AG+ V ++G K D SS W YQVGL GE L + ++ GS + W S+
Sbjct: 569 DLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGL-SNGGSTL--WKSQTALP 625
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------- 646
T+QPL WYK FDAP G P++++ MGKGEAWVNGQSIGR+W +++ P
Sbjct: 626 TNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDPCNY 685
Query: 647 -------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCG 693
G PSQ YH+PRS+LK +GN+LVL EE G P +S T + ++C
Sbjct: 686 RGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQSVCS 745
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS-GRKISKILFASYGNPNGNC 752
SD+H P+ W S++ K+ P + + CP + IS I FAS+G P G C
Sbjct: 746 RTSDAHPLPIDMWASEDDARKKSG-------PTLSLECPHPNQVISSIKFASFGTPQGTC 798
Query: 753 ENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
++ G C SSN+ +IV+KAC+G +SC++ V F GDPC G+ K+L V+A CT
Sbjct: 799 GSFIHGRCSSSNALSIVKKACIGSKSCSLGVSINAF-GDPCKGVAKSLAVEASCT 852
>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
Length = 853
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/826 (47%), Positives = 514/826 (62%), Gaps = 60/826 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++II+G R+IL SGSIHYPRSTP MW L+ KAK+GGLDV+ T VFWN+HEP P
Sbjct: 28 VTYDKKAIIIDGQRRILISGSIHYPRSTPDMWEDLVQKAKDGGLDVIDTYVFWNVHEPSP 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 88 GNYNFEGRFDLVRFIKTVQKGGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK RL+ SQGGPII SQIENEYG +F G Y+ WAA++A
Sbjct: 148 FKAAMQGFTQKIVQMMKDERLFQSQGGPIIFSQIENEYGPESRAFGAAGHSYINWAAQMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L+TGVPWVMCK+DDAPDPVIN CNG C + F+ PN P KP +WTE W+ ++ +G
Sbjct: 208 VGLKTGVPWVMCKEDDAPDPVINTCNGFYC-DAFS-PNKPYKPTMWTEAWSGWFTEFGGA 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R +D+A+ VA FI K GS+VNYYMYHGGTNFGR+A +T YD AP+DEYGL
Sbjct: 266 FHHRPVQDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGL 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKRN 387
+R+PK+GHLKELH A+KLC ++S Q+A +F G C+AFL N ++
Sbjct: 325 IREPKYGHLKELHRAIKLCEHELVSSDPTITLLGTYQQAHVFSSGKRSCSAFLANYHTQS 384
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEAI 434
A V F+N+ Y LPP SISILPDC+ V FNTAK+ WE Y E I
Sbjct: 385 AARVMFNNMHYVLPPWSISILPDCRNVVFNTAKVGVQTSHVQMLPTGSRFFSWESYDEDI 444
Query: 435 PTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVSSLGHVL 487
+ +S + A L+EQ+N T+D +DYLWY +PS+S L V S GH L
Sbjct: 445 SSLGASSRMTALGLMEQINVTRDTTDYLWYITSVNINPSESFLRGGQWPTLTVESAGHAL 504
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
H FING+F GSA G ++ FT V+L GTN ++LLS+ VGLP+ G + E G L
Sbjct: 505 HVFINGQFSGSAFGTRENREFTFTGPVNLRAGTNRIALLSIAVGLPNVGVHYETWKTGIL 564
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST-HQPLTWYK 604
V + G + KD + W YQVGL GE + + + + V W + +T QPL WYK
Sbjct: 565 GPVMLHGLNQGNKDLTWQQWSYQVGLKGEAMNLVSPNRASSVDWIQGSLATRQQPLKWYK 624
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVS--------------FLTPQ---- 646
FDAP G++P+A+++ SMGKG+ W+NGQSIGRYW+S F P+
Sbjct: 625 AYFDAPGGNEPLALDMRSMGKGQVWINGQSIGRYWLSYAKGDCSSCGYSGTFRPPKCQLG 684
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G P+Q WYH+PRS+LKP NLLV+ EE G IS+ S T++C + H P + +
Sbjct: 685 CGQPTQRWYHVPRSWLKPKQNLLVIFEELGGDASKISLVKRSTTSVCADAFEHH-PTIEN 743
Query: 706 WRS----QNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
+ + +++R L + KV +RC G+ IS I FAS+G P G C ++ G+CH
Sbjct: 744 YNTESNGESERNL--------HQAKVHLRCAPGQSISAINFASFGTPTGTCGSFQEGTCH 795
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ NS ++VEK C+G+ SC V + F DPCP K L V+A C+
Sbjct: 796 APNSHSVVEKKCIGRESCMVAISNSNFGADPCPSKLKKLSVEAVCS 841
>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
Length = 846
Score = 744 bits (1922), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/854 (46%), Positives = 520/854 (60%), Gaps = 69/854 (8%)
Query: 10 FGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAK 69
F +L ++ G+ + VTYD R+L+I+G R++L SGSIHYPRSTP MWP LI K+K
Sbjct: 6 FVFVLVSLLGAIATTSFASTVTYDHRALVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSK 65
Query: 70 EGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYG 129
+GGLDV++T VFWNLHEP Q+DF GR DLV+F+K V GLYV LRIGP++ EW YG
Sbjct: 66 DGGLDVIETYVFWNLHEPVRRQYDFKGRNDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYG 125
Query: 130 GLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM 189
G P WLH +PGI FR+DN PFK M+ + IV+MMK LYASQGGPIILSQIENEYG
Sbjct: 126 GFPLWLHFIPGIQFRTDNGPFKEEMQIFTAKIVDMMKKENLYASQGGPIILSQIENEYGN 185
Query: 190 VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSP 249
++ ++ Y++WAA +A L TGVPWVMC+Q DAPDP+IN CNG C + PNS
Sbjct: 186 IDSAYGSAAKSYIQWAASMATSLDTGVPWVMCQQADAPDPMINTCNGFYCDQ--FTPNSV 243
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
KP +WTENWT ++ +G R EDIA+ VA F ++ G++ NYYMYHGGTNFGRT
Sbjct: 244 KKPKMWTENWTGWFLSFGGAVPYRPVEDIAFAVARFF-QLGGTFQNYYMYHGGTNFGRTT 302
Query: 310 SA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAF 368
++ T Y AP+DEYGLLRQPKWGHLK+LH A+KLC +++ + EA
Sbjct: 303 GGPFIATSYDYDAPIDEYGLLRQPKWGHLKDLHKAIKLCEAALIATDPTITSLGTNLEAS 362
Query: 369 IFQ-GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV--- 424
+++ G+ CAAFL N ++ATV FS Y LP S+SILPDCK VA NTA+++S+
Sbjct: 363 VYKTGTGSCAAFLANVRTNSDATVNFSGNSYHLPAWSVSILPDCKNVALNTAQINSMAVM 422
Query: 425 -------------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNF 465
W E + + LLEQ+N T D SDYLWY+
Sbjct: 423 PRFMQQSLKNDIDSSDGFQSGWSWVDEPVGISKNNAFTKLGLLEQINITADKSDYLWYSL 482
Query: 466 RFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLING 519
+ + S++VL V SLGH LHAFING+ GS G + T++ V LI+G
Sbjct: 483 STEIQGDEPFLEDGSQTVLHVESLGHALHAFINGKLAGSGTGNSGNAKVTVDIPVTLIHG 542
Query: 520 TNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKL 576
N + LLS+ VGL + GA+ +++ AG+ + ++G D SS W YQVGL GE+L
Sbjct: 543 KNTIDLLSLTVGLQNYGAFYDKQGAGITGPIKLKGLANGTTVDLSSQQWTYQVGLQGEEL 602
Query: 577 QIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
+ + S+ V S QPL WYKT FDAP G+DPVA++ + MGKGEAWVNGQSIG
Sbjct: 603 GLPSGSSSKWVAGSTL--PKKQPLIWYKTTFDAPAGNDPVALDFMGMGKGEAWVNGQSIG 660
Query: 637 RYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
RYW ++++ G PSQ YH+PRS+L+P+GN LVL EE
Sbjct: 661 RYWPAYVSSNGGCTSSCNYRGPYSSNKCLKNCGKPSQQLYHVPRSWLQPSGNTLVLFEEI 720
Query: 675 NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-S 733
G P IS T V +LC VS+ H PV W S L T ++ P + + CP
Sbjct: 721 GGDPTQISFATKQVESLCSRVSEYHPLPVDMWGSD----LTTGRK---SSPMLSLECPFP 773
Query: 734 GRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPC 793
+ IS I FAS+G P G C +++ C S + +IV++AC+G +SC++ V + F GDPC
Sbjct: 774 NQVISSIKFASFGTPRGTCGSFSHSKCSSRTALSIVQEACIGSKSCSIGVSIDTF-GDPC 832
Query: 794 PGIPKALLVDAQCT 807
GI K+L V+A CT
Sbjct: 833 SGIAKSLAVEASCT 846
>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 842
Score = 741 bits (1914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/835 (46%), Positives = 517/835 (61%), Gaps = 73/835 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNLHE
Sbjct: 22 VTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEAVR 81
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+DF GR+DLV+F+K V GLYV LRIGP++ EW YGG P WLH +PGI R+DNEP
Sbjct: 82 GQYDFGGRKDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIQLRTDNEP 141
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+R+ IV+MMK +LYASQGGPIILSQIENEYG ++ ++ Y++WAA +A
Sbjct: 142 FKAEMQRFTAKIVDMMKKEKLYASQGGPIILSQIENEYGNIDRAYGAAAQTYIKWAADMA 201
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK-PAIWTENWTSFYQVYGD 268
V L TGVPWVMC+QDDAP VI+ CNG C + P P+K P +WTENW+ ++ +G
Sbjct: 202 VSLDTGVPWVMCQQDDAPPSVISTCNGFYCDQW--TPRLPEKRPKMWTENWSGWFLSFGG 259
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNFGR T ++ T Y AP+DEYG
Sbjct: 260 AVPQRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYG 318
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
LLRQPKWGHLK++H A+KLC + M++ +F EA +++ S CAAFL N D ++
Sbjct: 319 LLRQPKWGHLKDVHKAIKLCEEAMVATDPKYSSFGPNVEATVYKTGSACAAFLANSDTKS 378
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ--------------------- 426
+ATV F+ Y LP S+SILPDCK V NTAK++S
Sbjct: 379 DATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSAAMIPSFMHHSVLDDIDSSEALGS 438
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLK 479
W E + + + LLEQ+NTT D SDYLWY+ SD S+++L
Sbjct: 439 GWSWINEPVGISKKDAFTRVGLLEQINTTADKSDYLWYSLSIDVTSSDTFLQDGSQTILH 498
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V SLGH LHAFING+ G ++ +++ V +G N + LLS+ +GL + GA+
Sbjct: 499 VESLGHALHAFINGKPAGRGIITANNGKISVDIPVTFASGKNTIDLLSLTIGLQNYGAFF 558
Query: 540 ERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
++ AG+ V ++G K D SS W YQ+GL GE + S+ + S+
Sbjct: 559 DKSGAGITGPVQLKGLKNGTTTDLSSQRWTYQIGLQGEDSGFSSGSSSQWI--SQPTLPK 616
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------- 646
QPLTWYK F+AP GS+PVA++ MGKGEAWVNGQSIGRYW + P
Sbjct: 617 KQPLTWYKATFNAPDGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNNAPTSGCPDSCNFR 676
Query: 647 ------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGH 694
G PSQ YH+PRS+LKP+GN LVL EE G P IS T + +LC H
Sbjct: 677 GPYDSNKCRKNCGKPSQELYHVPRSWLKPSGNTLVLFEEIGGDPTQISFATRQIESLCSH 736
Query: 695 VSDSHLPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCP-SGRKISKILFASYGNPNGN 751
VS+SH PV +W S ++ GR+ P + + CP + IS I FASYG P G
Sbjct: 737 VSESHPSPVDTWSSDSKA---------GRKLGPVLSLECPFPNQVISSIKFASYGKPQGT 787
Query: 752 CENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C +++ G C S+++ +IV+KAC+G +SC++ V + K +GDPC G+ K+L V+A C
Sbjct: 788 CGSFSHGQCKSTSALSIVQKACVGSKSCSIEV-SVKTFGDPCKGVAKSLAVEASC 841
>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
Length = 831
Score = 741 bits (1914), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/820 (48%), Positives = 508/820 (61%), Gaps = 59/820 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++++NG R+IL SGSIHYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP P
Sbjct: 29 VTYDRKAVVVNGQRRILLSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEP
Sbjct: 89 GQYHFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPIWLKYVPGISFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ T IV MMK+ RL+ QGGPIILSQIENE+G +E E Y WAA +A
Sbjct: 149 FKAEMQKFTTKIVQMMKSERLFEWQGGPIILSQIENEFGPLEWDQGEPAKDYASWAANMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ L TGVPW+MCK+DDAPDP+IN CNG C + PN P KP +WTE WT++Y +G
Sbjct: 209 MALNTGVPWIMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTAWYTGFGIP 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+AY VA FI K GS+VNYYMYHGGTNF RTA ++ T Y APLDEYGL
Sbjct: 267 VPHRPVEDLAYGVAKFIQK-GGSFVNYYMYHGGTNFERTAGGPFIATSYDYDAPLDEYGL 325
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LR+PKWGHLKELH A+KLC +++ + + Q+A +F+ S+ CAAFL NK K +
Sbjct: 326 LREPKWGHLKELHRAIKLCEPALVAADPILSSLGNAQKASVFRSSTGACAAFLENKHKLS 385
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-VEQ----------WEEYKEAIPT 436
A V F+ + Y+LPP SISILPDCKT FNTA++ S + Q W+ Y E I +
Sbjct: 386 YARVSFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAGGLTWQSYNEEINS 445
Query: 437 YDE-TSLRANFLLEQMNTTKDASDYLWYNFRF------KHDPSDSESVLKVSSLGHVLHA 489
+ E S LLEQ+N T+D +DYLWY + S L V S GH LH
Sbjct: 446 FSELESFTTVGLLEQINMTRDNTDYLWYTTYVDVAKDEQFLTSGKNPKLTVMSAGHALHV 505
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
FING+ G+ +G + T V L +G+N +S LS+ VGLP+ G + E AG L
Sbjct: 506 FINGQLSGTVYGSVENPKLTYTGKVKLWSGSNTISCLSIAVGLPNVGEHFETWNAGILGP 565
Query: 549 VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVF 607
V++ G E K D + W YQVGL GE + + + GS V W QPLTWYK F
Sbjct: 566 VTLDGLNEGKRDLTWQKWTYQVGLKGEAMSLHSLSGSSSVEWGE--PVQKQPLTWYKAFF 623
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQG 647
+AP G +P+A+++ SMGKG+ W+NGQ IGRYW + T G
Sbjct: 624 NAPDGDEPLALDMNSMGKGQIWINGQGIGRYWPGYKASGTCGHCDYRGEYNETKCQTNCG 683
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWR 707
PSQ WYH+PR +L PTGNLLV+ EE G P GIS+ + ++C VS+ P + +WR
Sbjct: 684 DPSQRWYHVPRPWLNPTGNLLVIFEEWGGDPTGISMVKRTTGSVCADVSEWQ-PSIKNWR 742
Query: 708 SQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRA 767
+++ + H ++C GRKI++I FAS+G P G+C NY+ G CH+ S
Sbjct: 743 TKDYEKAEVH-----------LQCDHGRKITEIKFASFGTPQGSCGNYSEGGCHAHRSYD 791
Query: 768 IVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
I +K C+ + C V V E F GDPCPG K +V+ C+
Sbjct: 792 IFKKNCINQEWCGVSVVPEAFGGDPCPGTMKRAVVEVTCS 831
>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
Length = 840
Score = 741 bits (1913), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/832 (46%), Positives = 515/832 (61%), Gaps = 77/832 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 30 VSYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPVR 89
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++F GR DLV F+K V GLYV LRIGP++ EW YGG P WLH +PGI R+DNEP
Sbjct: 90 GQYNFEGRNDLVGFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEP 149
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+K M R+ IV MMK +LYASQGGPIILSQIENEYG ++ ++ Y+ WAA +A
Sbjct: 150 YKAEMHRFTAKIVEMMKNEKLYASQGGPIILSQIENEYGNIDKAYGPAAKTYINWAANMA 209
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMC+Q DAP VIN CNG C + F+ PNS P IWTENW+ ++ +G
Sbjct: 210 VSLDTGVPWVMCQQADAPSSVINTCNGFYC-DQFS-PNSNSTPKIWTENWSGWFLSFGGA 267
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA F + G++ NYYMYHGGTNFGR++ ++ T Y APLDEYGL
Sbjct: 268 VPQRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRSSGGPFIATSYDYDAPLDEYGL 326
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
LRQPKWGHLK++H A+KLC M++ + + EA +++ S C+AFL N D +++
Sbjct: 327 LRQPKWGHLKDVHKAIKLCEPAMVATDPTISSLGQNIEAAVYKTGSVCSAFLANVDTKSD 386
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANF-- 446
ATV F+ Y+LP S+SILPDCK V NTAK+++ +P++ S+ A+
Sbjct: 387 ATVTFNGNSYQLPAWSVSILPDCKNVVINTAKINTATM-------VPSFTRQSISADVEP 439
Query: 447 ---------------------------LLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
LLEQ+NTT D SDYLWY+ ++ L
Sbjct: 440 TEAVGSGWSWINEPVGISKGDAFTRVGLLEQINTTADKSDYLWYSTSIDVK-GGYKADLH 498
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V SLGH LHAF+NG+ GS G + ++E V +G N + LLS+ VGL + GA+
Sbjct: 499 VQSLGHALHAFVNGKLAGSGTGNSGNAKVSVEIPVEFASGKNTIDLLSLTVGLQNYGAFF 558
Query: 540 ERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
+ AG+ V ++G+ D SS W YQ+GL GE + + I S+
Sbjct: 559 DLVGAGITGPVQLKGSANGTTIDLSSQQWTYQIGLKGEDEDLPSGSSQWI---SQPTLPK 615
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------- 646
+QPLTWYKT FDAP GS+PVA++ MGKGEAWVNGQSIGRYW + + P+
Sbjct: 616 NQPLTWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNVAPKTGCTDCNYRG 675
Query: 647 -----------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
G PSQ YH+PRS++K +GN LVL EE G P +S T V +LC HV
Sbjct: 676 AYSADKCRKNCGMPSQKLYHVPRSWMKSSGNTLVLFEEVGGDPTQLSFATRQVESLCSHV 735
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGNCEN 754
S+SH PV W S ++ K+ RP++ + CP + IS I FASYG P+G C +
Sbjct: 736 SESHPSPVDMWSSDSKAGSKS-------RPRLSLECPFPNQVISSIKFASYGRPSGTCGS 788
Query: 755 YAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
++ GSC SS + +IV+KAC+G +SC++ V T F GDPC G+ K+L V+A C
Sbjct: 789 FSHGSCRSSRALSIVQKACVGSKSCSIEVSTHTF-GDPCKGLAKSLAVEASC 839
>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
Length = 848
Score = 741 bits (1912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 400/839 (47%), Positives = 519/839 (61%), Gaps = 77/839 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 25 NVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+DF GR+DLV+F+K V A GLYV LRIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 85 RGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPVWLHFIPGIKFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ IV+M+K +LYASQGGP+ILSQIENEYG ++ ++ G Y++WAA +
Sbjct: 145 PFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNIDTAYGAAGKSYIKWAATM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC Q DAPDP+IN NG G+ F PNS KP +WTENW+ ++ V+G
Sbjct: 205 ATSLDTGVPWVMCLQADAPDPIINTWNGFY-GDEFT-PNSNTKPKMWTENWSGWFLVFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNF R + ++ T Y AP+DEYG
Sbjct: 263 AVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRASGGPFIATSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
++RQPKWGHLKE+H A+KLC + +++ + EA +++ S CAAFL N ++
Sbjct: 322 IIRQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVYKTGSVCAAFLANVGTKS 381
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ--------------------- 426
+ TV FS Y LP S+SILPDCK+V NTAK++S
Sbjct: 382 DVTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTESSKEDIGSSEASST 441
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHDPSDSESVLKVSSL 483
W E + S LLEQ+NTT D SDYLWY+ +K D S S++VL + SL
Sbjct: 442 GWSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSIDYKADAS-SQTVLHIESL 500
Query: 484 GHVLHAFINGEFVGSAHGKHSD--------KSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
GH LHAFING+ G KHS FT++ V L+ G N + LLS+ VGL +
Sbjct: 501 GHALHAFINGKLAGKYKLKHSQLIICNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNY 560
Query: 536 GAYLERRVAGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
GA+ + G+ V ++G D SS W YQVGL GE L + + + S +
Sbjct: 561 GAFFDTWGVGITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGLSSGSSGQWNLQSTF 620
Query: 593 GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGT---- 648
+QPLTWYKT F AP+GSDPVAI+ MGKGEAWVNGQ IGRYW +++ +
Sbjct: 621 --PKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDS 678
Query: 649 ------------------PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
PSQ+ YH+PRS+LKP+GN+LVL EE G P IS T +
Sbjct: 679 CNYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTES 738
Query: 691 LCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCPSGRK-ISKILFASYGN 747
LC HVSDSH PPV W S+ + GR+ P + + CP + IS I FASYG
Sbjct: 739 LCAHVSDSHPPPVDLWNSETES---------GRKVGPVLSLTCPHDNQVISSIKFASYGT 789
Query: 748 PNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
P G C N+ G C S+ + +IV+KAC+G SC+V V ++ F GDPC G+ K+L V+A C
Sbjct: 790 PLGTCGNFYHGRCSSNKALSIVQKACIGSSSCSVGVSSDTF-GDPCRGMAKSLAVEATC 847
>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
Length = 839
Score = 740 bits (1911), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/829 (47%), Positives = 513/829 (61%), Gaps = 70/829 (8%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQ------------MWPRLIAKAKEGGLDVVQT 78
TYD +++++NG R+IL SGSIHYPRSTP+ MWP LI KAK+GGLDVVQT
Sbjct: 27 TYDRKAVVVNGQRRILISGSIHYPRSTPEARRTRFPFLLLTMWPDLIEKAKDGGLDVVQT 86
Query: 79 LVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDV 138
VFWN HEP PGQ+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL V
Sbjct: 87 YVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYV 146
Query: 139 PGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKG 198
PGI FR+DNEPFK M+++ T IV MMK+ L+ QGGPIILSQIENE+G +E E
Sbjct: 147 PGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPA 206
Query: 199 PPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTEN 258
Y WAA +AV L T VPW+MCK+DDAPDP+IN CNG C + PN P KP +WTE
Sbjct: 207 KAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEA 264
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGY 317
WT++Y +G R ED+AY VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y
Sbjct: 265 WTAWYTGFGIPVPHRPVEDLAYGVAKFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSY 323
Query: 318 YDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-EC 376
AP+DEYGLLR+PKWGHLK+LH A+KLC +++G + + Q++ +F+ S+ C
Sbjct: 324 DYDAPIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGAC 383
Query: 377 AAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-VEQ--------- 426
AAFL NKDK + A V F+ + Y+LPP SISILPDCKT FNTA++ S + Q
Sbjct: 384 AAFLENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAGGF 443
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYN--FRFKHDP---SDSESV-LK 479
W+ Y E I ++ E L LLEQ+N T+D +DYLWY D S+ E++ L
Sbjct: 444 AWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGENLKLT 503
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V S GH LH FING+ G+ +G D T V L G+N +S LS+ VGLP+ G +
Sbjct: 504 VMSAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHF 563
Query: 540 ERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH 597
E AG L V++ G E +D + W YQVGL GE + + + GS V W
Sbjct: 564 ETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGE--PVQK 621
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------- 642
QPLTWYK F+AP G +P+A+++ SMGKG+ W+NGQ IGRYW +
Sbjct: 622 QPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGNCGTCDYRGEY 681
Query: 643 -----LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
T G SQ WYH+PRS+L PTGNLLV+ EE G P GIS+ S+ ++C VS+
Sbjct: 682 DETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSE 741
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
P + +W +++ + KV ++C +G+KI++I FAS+G P G+C +Y
Sbjct: 742 WQ-PSMKNWHTKDY-----------EKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTE 789
Query: 758 GSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G CH+ S I K C+G+ C V V E F GDPCPG K +V+A C
Sbjct: 790 GGCHAHKSYDIFWKNCVGQERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 838
>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 853
Score = 740 bits (1910), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/842 (46%), Positives = 515/842 (61%), Gaps = 68/842 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
G NVTYD R+L+I+G R++L SGSIHYPRSTP MWP L+ KAK+GGLDVV+T VFW
Sbjct: 23 GTSAATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFW 82
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
++HEP GQ+DF GR DLVRF+K GLYV LRIGP++ EW YGG P WLH +PGI
Sbjct: 83 DVHEPVRGQYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIK 142
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
R+DNEPFK M+R+ +V MK A LYASQGGPIILSQIENEYG + S+ G Y+
Sbjct: 143 LRTDNEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYI 202
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
RWAA +AV L TGVPWVMC+Q DAP+P+IN CNG C + P+ P +P +WTENW+ +
Sbjct: 203 RWAAGMAVALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFT--PSLPSRPKLWTENWSGW 260
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QA 321
+ +G R ED+A+ VA F + G+ NYYMYHGGTNFGR++ ++ YD A
Sbjct: 261 FLSFGGAVPYRPTEDLAFAVARFYQR-GGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDA 319
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
P+DEYGL+RQPKWGHL+++H A+K+C +++ M+ + EA +++ S CAAFL
Sbjct: 320 PIDEYGLVRQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGSLCAAFLA 379
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------------------ 423
N D +++ TV F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 380 NIDDQSDKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASD 439
Query: 424 ---------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDP 471
W E + E +L L+EQ+NTT DASD+LWY+ +P
Sbjct: 440 GSSVEAELAASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEP 499
Query: 472 --SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVM 529
+ S+S L V+SLGHVL FING+ GS+ G S +L V L+ G N + LLS
Sbjct: 500 YLNGSQSNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSAT 559
Query: 530 VGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVP 588
VGL + GA+ + AG+ V + G K D SS W YQ+GL GE L ++ +
Sbjct: 560 VGLTNYGAFFDLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEASPEW 619
Query: 589 WSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-- 646
S T+ PLTWYK+ F AP G DPVAI+ MGKGEAWVNGQSIGRYW + + PQ
Sbjct: 620 VSDNSYPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSG 679
Query: 647 --------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTV 686
G PSQ YH+PRSFL+P N +VL E+ G P IS T
Sbjct: 680 CVNSCNYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTK 739
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASY 745
++C HVS+ H + SW S Q+ ++ P +++ CP G+ IS I FAS+
Sbjct: 740 QTESVCAHVSEDHPDQIDSWVSSQQKLQRSG-------PALRLECPKEGQVISSIKFASF 792
Query: 746 GNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQ 805
G P+G C +Y+ G C SS + A+ ++AC+G SC+VPV + K +GDPC G+ K+L+V+A
Sbjct: 793 GTPSGTCGSYSHGECSSSQALAVAQEACVGVSSCSVPV-SAKNFGDPCRGVTKSLVVEAA 851
Query: 806 CT 807
C+
Sbjct: 852 CS 853
>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
Length = 836
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/822 (47%), Positives = 512/822 (62%), Gaps = 55/822 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING ++IL SGSIHYPRSTP+MWP LI K+K+GGLDV+QT VFWN HEP
Sbjct: 27 SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLV+FIK V GLYV LRIGP++ EW +GG P WL VPGIVFR+DNE
Sbjct: 87 PGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV+MMKA +L+ SQGGPIILSQIENE+G VE G Y +WAA++
Sbjct: 147 PFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F PN KP +WTE WT +Y +G
Sbjct: 207 AVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFT-PNKNYKPKMWTEVWTGWYTEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ +A FI K GS+VNYYMYHGGTNFGRTA + YD APLDEYG
Sbjct: 265 AVPTRPAEDLAFSIARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L R+PKWGHL++LH A+K ++S + QEA +F+ S CAAFL N D ++
Sbjct: 324 LPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVFKSKSGCAAFLANYDTKS 383
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------WEEYKEAIP 435
+A V F N YELPP ISILPDCKT +NTA+L S W+ + E
Sbjct: 384 SAKVSFGNGQYELPPWPISILPDCKTAVYNTARLGSQSSQMKMTPVKSALPWQSFVEESA 443
Query: 436 TYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ DE+ + + L EQ+N T+D +DYLWY P + +L + S GH LH
Sbjct: 444 SSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEGFIKRGESPLLTIYSAGHALH 503
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING+ G+ +G + T + V +G N ++LLS+ VGLP+ G + E AG L
Sbjct: 504 VFINGQLSGTVYGALENPKLTFSQNVKPRSGINKLALLSISVGLPNVGLHFETWNAGVLG 563
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKT 605
V+++G D S + W Y++GL GE L + T GS V W+ S + QPLTWYK
Sbjct: 564 PVTLKGLNSGTWDMSRWKWTYKIGLKGEALGLHTVSGSSSVEWAEGPSMAQKQPLTWYKA 623
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TP 645
F+AP G+ P+A+++ SMGKG+ W+NGQSIGR+W ++ T
Sbjct: 624 TFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTARGNCGNCYYAGTYDDKKCRTH 683
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G PSQ WYH+PRS+L P+GNLLV+ EE G P IS+ +++C + + P ++
Sbjct: 684 CGEPSQRWYHVPRSWLTPSGNLLVVFEEWGGDPTKISLVERRTSSVCADIFEGQ--PTLT 741
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
N + L + K RPK + CP G+ IS I FASYG P G C ++ GSCH+ S
Sbjct: 742 ----NSQKLASGKL---NRPKAHLWCPPGQVISDIKFASYGLPQGTCGSFQEGSCHAHKS 794
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
++ C+GK+SC+V V E F GDPCPG K L V+A C+
Sbjct: 795 YDAPKRNCIGKQSCSVAVAPEVFGGDPCPGSTKKLSVEAVCS 836
>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 836
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/833 (46%), Positives = 512/833 (61%), Gaps = 73/833 (8%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G NVTYD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNLHE
Sbjct: 23 GANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHE 82
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P GQ++F GR DLV+F+K V A GLYV LRIGP+ EW YGG P WLH +PGI FR+D
Sbjct: 83 PVRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTD 142
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+PF+ MK++ IV++MK LYASQGGPIILSQIENEYG +E + Y++WAA
Sbjct: 143 NKPFEAEMKQFTAKIVDLMKQENLYASQGGPIILSQIENEYGNIEADYGPAAKSYIKWAA 202
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+A L TGVPWVMC+Q +APDP+INACNG C + PNS KP IWTE +T ++ +
Sbjct: 203 SMATSLGTGVPWVMCQQQNAPDPIINACNGFYCDQF--KPNSNTKPKIWTEGYTGWFLAF 260
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDE 325
GD R ED+A+ VA F + G++ NYYMYHGGTNFGR + + YD AP+DE
Sbjct: 261 GDAVPHRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRASGGPFVASSYDYDAPIDE 319
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
YG +RQPKWGHLK++H A+KLC + +++ + EA +++ CAAFL N
Sbjct: 320 YGFIRQPKWGHLKDVHKAIKLCEEALIATDPTITSLGPNIEAAVYKTGVVCAAFLANI-A 378
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------------DSV 424
++ATV F+ Y LP S+SILPDCK V NTAK+ DS
Sbjct: 379 TSDATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKITSASMISSFTTESLKDVGSLDDSG 438
Query: 425 EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLG 484
+W E I S LLEQ+NTT D SDYLWY+ D + +++ L + SLG
Sbjct: 439 SRWSWISEPIGISKADSFSTFGLLEQINTTADRSDYLWYSLSIDLD-AGAQTFLHIKSLG 497
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LHAFING+ GS G H + ++ + L++G N + LLS+ VGL + GA+ + A
Sbjct: 498 HALHAFINGKLAGSGTGNHEKANVEVDIPITLVSGKNTIDLLSLTVGLQNYGAFFDTWGA 557
Query: 545 GLRNVSIQGAKELK-----DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQ 598
G+ I K LK D SS W YQVGL E L + + + W+ + T+Q
Sbjct: 558 GITGPVI--LKCLKNGSNVDLSSKQWTYQVGLKNEDLGLSSGCSGQ---WNSQSTLPTNQ 612
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------ 646
PLTWYKT F AP+G++PVAI+ MGKGEAWVNGQSIGRYW ++ +P+
Sbjct: 613 PLTWYKTNFVAPSGNNPVAIDFTGMGKGEAWVNGQSIGRYWPTYASPKGGCTDSCNYRGA 672
Query: 647 ----------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
G PSQ+ YH+PRS+L+P N LVL EE G P IS T + ++C HVS
Sbjct: 673 YDASKCLKNCGKPSQTLYHVPRSWLRPDRNTLVLFEESGGNPKQISFATKQIGSVCSHVS 732
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCP-SGRKISKILFASYGNPNGNCE 753
+SH PPV SW S + GR+ P V + CP + +S I FAS+G P G C
Sbjct: 733 ESHPPPVDSWNSNTES---------GRKVVPVVSLECPYPNQVVSSIKFASFGTPLGTCG 783
Query: 754 NYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
N+ G C S+ + +IV+KAC+G SC + + F GDPC G+ K+L V+A C
Sbjct: 784 NFKHGLCSSNKALSIVQKACIGSSSCRIELSVNTF-GDPCKGVAKSLAVEASC 835
>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 846
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/826 (49%), Positives = 509/826 (61%), Gaps = 64/826 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ VTYD ++++ING R+ILFSGSIHYPRSTP+MW LI KAK GGLDVV+T VFWN+HEP
Sbjct: 25 STVTYDRKAILINGQRRILFSGSIHYPRSTPEMWEDLILKAKNGGLDVVETYVFWNVHEP 84
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
PG ++F GR DLVRFIK +Q GLY LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 85 YPGIYNFEGRFDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
E FK M+ + IV +MK+ L+ SQGGPIIL+QIENEYG F E G Y+ WAA
Sbjct: 145 EAFKNAMQGFTEKIVALMKSENLFESQGGPIILAQIENEYGTESKLFGEAGYNYMTWAAN 204
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV LQTGVPWVMCK+ DAPDPVIN CNG C +TF+ PN P KP +WTE WT ++ +G
Sbjct: 205 MAVGLQTGVPWVMCKEADAPDPVINTCNGFYC-DTFS-PNKPYKPTMWTEAWTGWFSEFG 262
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R +D+A+ VA FI + GS VNYYMYHGGTNFGRTA +T YD AP+DEY
Sbjct: 263 GPLHQRPVQDLAFAVARFIQR-GGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 321
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDK 385
GLLRQPK+GHLKELH A+K+C ++S + + Q+A ++ S CAAFL N D
Sbjct: 322 GLLRQPKYGHLKELHRAIKMCEPALVSADPIVTSLGDYQQAHVYSSESGGCAAFLSNYDT 381
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKE 432
++ A V F+N Y LPP SISILPDCK FNTAK+ + WE Y E
Sbjct: 382 KSFARVLFNNRHYNLPPWSISILPDCKNAVFNTAKVGVQTAQMGMLPAESTTLSWESYFE 441
Query: 433 AIPTYDETSLRAN-FLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSL 483
I D+ S+ + LLEQ+N T+D SDYLWY D S SE L V S
Sbjct: 442 DISALDDRSMMTSPGLLEQINVTRDTSDYLWYITSV--DISSSEPFLHGGELPTLLVQST 499
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH +H FING+ GS G + FT V+L GTN + LLSV VGLP+ G + E
Sbjct: 500 GHAVHVFINGQLSGSVSGSRKSRRFTYSGKVNLHAGTNKIGLLSVAVGLPNVGGHFETWN 559
Query: 544 AG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQP 599
G L V + G ++ K D SS W Y+VGL GE + + + G V W + + T QP
Sbjct: 560 TGILGPVVLYGLRQGKWDLSSQKWTYKVGLKGEAMNLISPSGFSPVEWMQASLAAQTPQP 619
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW--------------VSFLTP 645
LTW+K FDAP G +P+A+++ MGKG+ W+NGQSIGRYW +F P
Sbjct: 620 LTWHKAYFDAPEGEEPLALDMEGMGKGQIWINGQSIGRYWTAYARGNCSRCNYATAFRPP 679
Query: 646 Q-----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHL 700
+ G P+Q WYH+PRS+L+P NLLV+ EE G P ISI VT++C VS+ H
Sbjct: 680 KCQLGCGQPTQRWYHVPRSWLRPEQNLLVVFEEVGGNPSRISIVKRLVTSVCADVSEFH- 738
Query: 701 PPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
P +W T K I PKV + C G+ IS I FAS+G P G C +Y G+C
Sbjct: 739 PTFKNWH-------ITAKFI---TPKVHLSCDPGQYISSIKFASFGTPLGTCGSYQQGTC 788
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
H+ +S I+EK C+GK+ C V V F DPCP + K L V+A C
Sbjct: 789 HAPSSSGILEKKCVGKQRCAVTVSNSNF-EDPCPNMMKRLSVEAVC 833
>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
Length = 851
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/835 (47%), Positives = 526/835 (62%), Gaps = 73/835 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD R+L+I+G RKIL SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWN HEP+
Sbjct: 32 SVTYDHRALVIDGKRKILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNGHEPE 91
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
+++F GR DLV+F+K GLYV LRIGP+ EW YGG P WLH VPGI FR+DNE
Sbjct: 92 KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYACAEWNYGGFPVWLHFVPGIKFRTDNE 151
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ IV++MK +LYASQGGPIILSQIENEYG ++ S+ G Y++W+A +
Sbjct: 152 PFKAEMQRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSSYGAAGKSYMKWSASM 211
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW MC+Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +G+
Sbjct: 212 ALSLDTGVPWNMCQQGDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLGFGE 269
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R ED+A+ VA F + G++ NYYMYHGGTNF RT+ +++ YD AP+DEYG
Sbjct: 270 PSPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFERTSGGPLISTSYDYDAPIDEYG 328
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LLRQPKWGHL++LH A+KLC +++ + EA +++ S+ CAAFL N +
Sbjct: 329 LLRQPKWGHLRDLHKAIKLCEDALIATDPKITSLGSNLEAAVYKTSTGSCAAFLANIGTK 388
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV---------------------- 424
++ATV F+ Y LP S+SILPDCK VAFNTAK++S
Sbjct: 389 SDATVTFNGKSYRLPAWSVSILPDCKNVAFNTAKINSATESTAFARQSLKPNADSSAELG 448
Query: 425 EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF--KHDPS----DSESVL 478
QW KE + + LLEQ+NTT D SDYLWY+ R K D + S++VL
Sbjct: 449 SQWSYIKEPVGISKADAFVKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAVL 508
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
V S+G +++AFING+ GS +GK + +L+ ++L+ G N + LLSV VGL + G +
Sbjct: 509 HVQSIGQLVYAFINGKLAGSGNGK---QKISLDIPINLVTGKNTIDLLSVTVGLANYGPF 565
Query: 539 LERRVAGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS 595
+ AG+ VS++ AK D SS W YQVGL GE + + S V S
Sbjct: 566 FDLTGAGITGPVSLKSAKTGSSTDLSSQQWTYQVGLKGEDKGLGSGDSSEWV--SNSPLP 623
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------- 646
T QPL WYKT FDAP+GSDPVAI+ GKG AWVNGQSIGRYW + +
Sbjct: 624 TSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGIAWVNGQSIGRYWPTSIARTDGCVGSCDY 683
Query: 647 -------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV-TTLC 692
G PSQ+ YH+PRS++KP+GN LVLLEE G P IS T + LC
Sbjct: 684 RGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSGNTLVLLEEMGGDPTKISFATKQTGSNLC 743
Query: 693 GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGN 751
VS SH PV +W S ++ + +T P + ++CP S + IS I FAS+G P G
Sbjct: 744 LTVSQSHPAPVDTWISDSKFSNRTS-------PVLSLKCPVSTQVISSIRFASFGTPTGT 796
Query: 752 CENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C +++ G C S+ S ++V+KAC+G RSC V V T + +G+PC G+ K+L V+A C
Sbjct: 797 CGSFSYGHCSSARSLSVVQKACVGSRSCKVEVST-RVFGEPCRGVVKSLAVEASC 850
>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 988
Score = 739 bits (1907), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/781 (47%), Positives = 506/781 (64%), Gaps = 48/781 (6%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MWP +I KA+ GGL+ +QT VFWN+HEP+ G++DF GR DLV+FIK + +GLYV LR+G
Sbjct: 1 MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
PFI+ EW +GGLP+WL +VP + FR++NEPFK H +RY I+ MMK +L+ASQGGPII
Sbjct: 61 PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 120
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQC 239
L QIENEY V+ ++ E G Y++WAA L + G+PWVMCKQ+DAP +INACNGR C
Sbjct: 121 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 180
Query: 240 GETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMY 299
G+TF GPN DKP++WTENWT+ ++V+GD R+ EDIA+ VA + +K GS+VNYYMY
Sbjct: 181 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSK-NGSHVNYYMY 239
Query: 300 HGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
HGGTNFGRT++ +V T YYD APLDE+GL + PK+GHLK +H A++LC K + G L +
Sbjct: 240 HGGTNFGRTSAHFVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ 299
Query: 360 NFSKLQEAFIFQ--GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFN 417
E ++ G+ CAAFL N + R+ T+ F Y LP SISILPDCKTV +N
Sbjct: 300 TLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYN 359
Query: 418 TAKLDSVEQW---------------EEYKEAIPTYDETSLRANFLL--EQMNTTKDASDY 460
TA++ + W E + E IP+ L + L+ E TKD +DY
Sbjct: 360 TAQIVAQHSWRDFVKSEKTSKGLKFEMFSENIPSL----LDGDSLIPGELYYLTKDKTDY 415
Query: 461 LWYNFRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV 514
WY K D D +++L+V+SLGH L ++NGE+ G AHG+H KSF K V
Sbjct: 416 AWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPV 475
Query: 515 HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFS-SFSWGYQVGLL 572
+ G N +S+L V+ GLPDSG+Y+E R AG R +SI G K +D + + WG+ GL
Sbjct: 476 NFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLE 535
Query: 573 GEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNG 632
GEK +++T+ GS+ V W + G +PLTWYKT F+ P G + VAI + +MGKG WVNG
Sbjct: 536 GEKKEVYTEEGSKKVKWEKDGK--RKPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNG 593
Query: 633 QSIGRYWVSFLTPQGTPSQSWYHIPRSFLK--PTGNLLVLLEEENGYPPGI---SIDTVS 687
+GRYW+SFL+P G P+Q+ YHIPRSF+K N+LV+LEEE PG+ SID V
Sbjct: 594 IGVGRYWMSFLSPLGEPTQTEYHIPRSFMKGEKKKNMLVILEEE----PGVKLESIDFVL 649
Query: 688 VT--TLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASY 745
V T+C +V + + V SW+ + + + K + R K +RCP +++ ++ FAS+
Sbjct: 650 VNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDM---RLKAVMRCPPEKQMVEVQFASF 706
Query: 746 GNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQ 805
G+P G C N+ +G C +S S+ +VEK CLG+ C++ V E F CP I K L V +
Sbjct: 707 GDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVARETFGDKGCPEIVKTLAVQVK 766
Query: 806 C 806
C
Sbjct: 767 C 767
>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
Precursor
Length = 858
Score = 738 bits (1906), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/844 (46%), Positives = 522/844 (61%), Gaps = 70/844 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
G NVTYD R+++I+G R++L SGSIHYPRSTP MWP LI K+K+GGLDV++T VFW
Sbjct: 26 GASRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFW 85
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
++HE GQ+DF GR+DLVRF+K V GLYV LRIGP++ EW YGG P WLH VPGI
Sbjct: 86 DIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK 145
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FR+DNE FK M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G Y+
Sbjct: 146 FRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYM 205
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ +
Sbjct: 206 RWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFT--PNSKSKPKMWTENWSGW 263
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQA 321
+ +G R AED+A+ VA F + G++ NYYMYHGGTNFGR T ++ T Y A
Sbjct: 264 FLSFGGAVPYRPAEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDA 322
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS--SECAAF 379
P+DEYG++RQPKWGHL+++H A+KLC +++ + + EA ++Q + S CAAF
Sbjct: 323 PIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAF 382
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS---------------- 423
L N D +++ TV F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 383 LANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQD 442
Query: 424 -----------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF--KHD 470
W E + E +L L+EQ+NTT DASD+LWY+ K D
Sbjct: 443 TDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGD 502
Query: 471 P---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLS 527
+ S+S L V+SLGHVL +ING+ GSA G S +L+ V L+ G N + LLS
Sbjct: 503 EPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLS 562
Query: 528 VMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
VGL + GA+ + AG+ V + G + SS W YQ+GL GE L ++ +
Sbjct: 563 TTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEASP 622
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
S T+QPL WYKT F AP G DPVAI+ MGKGEAWVNGQSIGRYW + L PQ
Sbjct: 623 EWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQ 682
Query: 647 ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
G PSQ+ YH+PRSFL+P N LVL E+ G P IS
Sbjct: 683 SGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFT 742
Query: 685 TVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFA 743
T +++C HVS+ H + SW S Q+T +T + P +++ CP G+ IS I FA
Sbjct: 743 TRQTSSICAHVSEMHPAQIDSWISP-QQTSQT------QGPALRLECPREGQVISNIKFA 795
Query: 744 SYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVD 803
S+G P+G C NY G C SS + A+V++AC+G +C+VPV + F GDPC G+ K+L+V+
Sbjct: 796 SFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNF-GDPCSGVTKSLVVE 854
Query: 804 AQCT 807
A C+
Sbjct: 855 AACS 858
>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 819
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/786 (49%), Positives = 500/786 (63%), Gaps = 54/786 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++++G R+ILFSGSIHYPRSTP+MW LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 27 VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLVRFIK VQ G++V LRIGP+I GEW +GG P WL VPGI FR+DNEP
Sbjct: 87 GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK+ L+ASQGGPIILSQIENEYG F G Y+ WAAK+A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK+DDAPDPVINACNG C +TF+ PN P KP +WTE W+ ++ +G
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYC-DTFS-PNKPYKPTMWTEAWSGWFTEFGGT 264
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R R ED+A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD APLDEYGL
Sbjct: 265 IRQRPVEDLAFGVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGL 323
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+GHLKELH AVKLC +P++S +QEA +F+ SS CAAFL N + +
Sbjct: 324 AREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVFRSSSGCAAFLANYNSNSY 383
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIP 435
A V F+N Y LPP SISILPDCK V FNTA + S WE+Y E +
Sbjct: 384 AKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTNQMQMWADGASSMMWEKYDEEVD 443
Query: 436 TYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ L + LLEQ+N T+D SDYLWY + DPS+ + L V S GH LH
Sbjct: 444 SLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTVQSAGHALH 503
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
FING+ GSA+G D+ + +L GTN V+LLSV GLP+ G + E G+
Sbjct: 504 VFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYETWNTGVVG 563
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYK 604
V I G E +D + +W YQVGL GE++ + + GS V W + + QPL WY+
Sbjct: 564 PVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQNQQPLAWYR 623
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ---- 646
FD P+G +P+A+++ SMGKG+ W+NGQSIGRYW S+ P+
Sbjct: 624 AYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAYAEGDCKGCHYTGSYRAPKCQAG 683
Query: 647 -GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G P+Q WYH+PRS+L+PT NLLV+ EE G I++ +V+ +C VS+ H P + +
Sbjct: 684 CGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCADVSEYH-PNIKN 742
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
W+ ++ + H KV ++C G+ IS I FAS+G P G C + G CHS NS
Sbjct: 743 WQIESYGEPEFHT------AKVHLKCAPGQTISAIKFASFGTPLGTCGTFQQGECHSINS 796
Query: 766 RAIVEK 771
+++EK
Sbjct: 797 NSVLEK 802
>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 956
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/844 (46%), Positives = 522/844 (61%), Gaps = 70/844 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
G NVTYD R+++I+G R++L SGSIHYPRSTP MWP LI K+K+GGLDV++T VFW
Sbjct: 124 GASRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFW 183
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
++HE GQ+DF GR+DLVRF+K V GLYV LRIGP++ EW YGG P WLH VPGI
Sbjct: 184 DIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIK 243
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FR+DNE FK M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G Y+
Sbjct: 244 FRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYM 303
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ +
Sbjct: 304 RWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFT--PNSKSKPKMWTENWSGW 361
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQA 321
+ +G R AED+A+ VA F + G++ NYYMYHGGTNFGR T ++ T Y A
Sbjct: 362 FLSFGGAVPYRPAEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDA 420
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS--SECAAF 379
P+DEYG++RQPKWGHL+++H A+KLC +++ + + EA ++Q + S CAAF
Sbjct: 421 PIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAF 480
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS---------------- 423
L N D +++ TV F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 481 LANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQD 540
Query: 424 -----------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF--KHD 470
W E + E +L L+EQ+NTT DASD+LWY+ K D
Sbjct: 541 TDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGD 600
Query: 471 P---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLS 527
+ S+S L V+SLGHVL +ING+ GSA G S +L+ V L+ G N + LLS
Sbjct: 601 EPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLS 660
Query: 528 VMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
VGL + GA+ + AG+ V + G + SS W YQ+GL GE L ++ +
Sbjct: 661 TTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEASP 720
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
S T+QPL WYKT F AP G DPVAI+ MGKGEAWVNGQSIGRYW + L PQ
Sbjct: 721 EWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQ 780
Query: 647 ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
G PSQ+ YH+PRSFL+P N LVL E+ G P IS
Sbjct: 781 SGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFT 840
Query: 685 TVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFA 743
T +++C HVS+ H + SW S Q+T +T + P +++ CP G+ IS I FA
Sbjct: 841 TRQTSSICAHVSEMHPAQIDSWISP-QQTSQT------QGPALRLECPREGQVISNIKFA 893
Query: 744 SYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVD 803
S+G P+G C NY G C SS + A+V++AC+G +C+VPV + F GDPC G+ K+L+V+
Sbjct: 894 SFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNF-GDPCSGVTKSLVVE 952
Query: 804 AQCT 807
A C+
Sbjct: 953 AACS 956
>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
Length = 870
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/850 (45%), Positives = 503/850 (59%), Gaps = 65/850 (7%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGL 73
L + S+ G ++VTYD RSLIING RK+L S SIHYPRS P MWP L+ AKEGG+
Sbjct: 30 LAAVDASNVTTIGTDSVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGV 89
Query: 74 DVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPF 133
DV++T VFWN HEP PG + F GR DLV+F K +Q G+Y+ LRIGPF+ EW +GGLP
Sbjct: 90 DVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPV 149
Query: 134 WLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHS 193
WLH VPG FR+D+EPFK+HM+++ T VN+MK RL+ASQGGPIILSQ+ENEYG E++
Sbjct: 150 WLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENA 209
Query: 194 FLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPA 253
+ E G Y WAAK+A+ TGVPW+MC+Q DAPDPVI+ CN C + P SP+KP
Sbjct: 210 YGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQF--KPISPNKPK 267
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV 313
IWTENW +++ +G R AED+AY VA F K GS NYYMYHGGTNFGRTA
Sbjct: 268 IWTENWPGWFKTFGARDPHRPAEDVAYSVARFFQK-GGSVQNYYMYHGGTNFGRTAGGPF 326
Query: 314 LTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG 372
+T YD AP+DEYGL R PKWGHLKELH +K C +L+ ++ LQEA +++
Sbjct: 327 ITTSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYED 386
Query: 373 SS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------ 425
+S CAAFL N D +N+ V F ++ Y LP S+SILPDCK VAFNTAK+
Sbjct: 387 ASGACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMA 446
Query: 426 ------------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR- 466
QWE +KE + N ++ +NTTKDA+DYLWY
Sbjct: 447 PIDLHPTASSPKRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSI 506
Query: 467 FKHDPSD-----SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTN 521
F H D ++L V S GH +H FIN + SA G + F + L G N
Sbjct: 507 FVHAEEDFLRNRGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKN 566
Query: 522 NVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFT 580
+SLLS+ VGL +GA+ E AG +V + G K D ++ +W Y++GL GE L+I
Sbjct: 567 EISLLSMTVGLQTAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQK 626
Query: 581 DYGSRIVPWSRYGSS-THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW 639
Y + W+ QPLTWYK V DAP G++PVA+++I MGKG AW+NGQ IGRYW
Sbjct: 627 SYNLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYW 686
Query: 640 V----------------------SFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGY 677
+T G P+Q WYH+PRS+ KP+GN+L++ EE G
Sbjct: 687 PRRTSKYENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGD 746
Query: 678 PPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKI 737
P I V+ CGH+S H P + ++ K RP + ++CP+ I
Sbjct: 747 PSQIRFSMRKVSGACGHLSVDH--PSFDVENLQGSEIENDKN----RPTLSLKCPTNTNI 800
Query: 738 SKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIP 797
S + FAS+GNPNG C +Y +G CH NS A+VEK CL + C + + + F CP
Sbjct: 801 SSVKFASFGNPNGTCGSYMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQLCPSTV 860
Query: 798 KALLVDAQCT 807
K L V+ C+
Sbjct: 861 KKLAVEVNCS 870
>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
Length = 870
Score = 738 bits (1904), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/850 (45%), Positives = 504/850 (59%), Gaps = 65/850 (7%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGL 73
L + S+ G ++VTYD RSLIING RK+L S SIHYPRS P MWP L+ AKEGG+
Sbjct: 30 LAAVDASNVTTIGTDSVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGV 89
Query: 74 DVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPF 133
DV++T VFWN HEP PG + F GR DLV+F K +Q G+Y+ LRIGPF+ EW +GGLP
Sbjct: 90 DVIETYVFWNGHEPSPGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPV 149
Query: 134 WLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHS 193
WLH VPG FR+D+EPFK+HM+++ T VN+MK RL+ASQGGPIILSQ+ENEYG E++
Sbjct: 150 WLHYVPGTTFRTDSEPFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENA 209
Query: 194 FLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPA 253
+ E G Y WAAK+A+ TGVPW+MC+Q DAPDPVI+ CN C + P SP+KP
Sbjct: 210 YGEGGKRYALWAAKMALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQF--KPISPNKPK 267
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV 313
IWTENW +++ +G R AED+AY VA F K GS NYYMYHGGTNFGRTA
Sbjct: 268 IWTENWPGWFKTFGARDPHRPAEDVAYSVARFFQK-GGSVQNYYMYHGGTNFGRTAGGPF 326
Query: 314 LTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG 372
+T YD AP+DEYGL R PKWGHLKELH +K C +L+ ++ LQEA +++
Sbjct: 327 ITTSYDYDAPIDEYGLPRFPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYED 386
Query: 373 SS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------ 425
+S CAAFL N D +N+ V F ++ Y LP S+SILPDCK VAFNTAK+
Sbjct: 387 ASGACAAFLANMDDKNDKVVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMA 446
Query: 426 ------------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR- 466
QWE +KE + N ++ +NTTKDA+DYLWY
Sbjct: 447 PIDLHPTASSPKRDIKSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSI 506
Query: 467 FKHDPSD-----SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTN 521
F H D ++L V S GH +H FIN + SA G + F + L G N
Sbjct: 507 FVHAEEDFLRNRGTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKN 566
Query: 522 NVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFT 580
++LLS+ VGL +GA+ E AG +V + G K D ++ +W Y++GL GE L+I
Sbjct: 567 EIALLSMTVGLQTAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQK 626
Query: 581 DYGSRIVPWSRYGSS-THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW 639
Y + W+ QPLTWYK V DAP G++PVA+++I MGKG AW+NGQ IGRYW
Sbjct: 627 SYNLKSKIWAPTSQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYW 686
Query: 640 V----------------------SFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGY 677
+T G P+Q WYH+PRS+ KP+GN+L++ EE G
Sbjct: 687 PRRTSKYENCVTQCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGD 746
Query: 678 PPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKI 737
P I V+ CGH+S H P + +++ K RP + ++CP+ I
Sbjct: 747 PSQIRFSMRKVSGACGHLSVDH--PSFDVENLQGSEIESDKN----RPTLSLKCPTNTNI 800
Query: 738 SKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIP 797
S + FAS+GNPNG C +Y +G CH NS A+VEK CL + C + + + F CP
Sbjct: 801 SSVKFASFGNPNGTCGSYMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQLCPSTV 860
Query: 798 KALLVDAQCT 807
K L V+ C+
Sbjct: 861 KKLAVEVNCS 870
>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
Length = 844
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/832 (47%), Positives = 519/832 (62%), Gaps = 68/832 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G RK+L SGS+HYPRSTP+MWP +I K+K+GGLDV++T VFWNLHEP
Sbjct: 26 NVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNLHEPV 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q+DF GR+DLV+FIK V A GLYV +RIGP++ EW YGG P WLH VPG+ FR+DNE
Sbjct: 86 RNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFRTDNE 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ IV+++K +LYASQGGPIILSQIENEYG V+ SF YV+WAA +
Sbjct: 146 PFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQWAATM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +G
Sbjct: 206 ATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSFGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + GS NYYMYHGGTNFGRT+ ++ T Y AP+DEYG
Sbjct: 264 ALPYRPVEDLAFAVARFY-QTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L+RQPKWGHL+++H A+K+C + ++S + EA +++ S+C+AFL N D ++
Sbjct: 323 LVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGSQCSAFLANVDTQS 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ--------------------- 426
+ TV F+ Y LP S+SILPDCK V NTAK++SV
Sbjct: 383 DKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAFDS 442
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHD----PSDSESVLK 479
W E I S L EQ+NTT D SDYLWY+ K D + S +VL
Sbjct: 443 GWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNTVLH 502
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V SLGHVLH FIN + GS G +L+ + L+ G N + LLS+ VGL + GA+
Sbjct: 503 VDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYGAFF 562
Query: 540 ERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
E R AG+ V ++ K D SS W YQ+GL GE L + + S+ + S+
Sbjct: 563 ELRGAGVTGPVKLENQKNNITVDLSSGQWTYQIGLEGEDLGLPSGSTSQWL--SQPNLPK 620
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------- 646
++PLTWYKT FDAP GSDP+A++ GKGEAW+NG SIGRYW S++
Sbjct: 621 NKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIASGQCTSYCDYKG 680
Query: 647 -----------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
G PSQ+ YH+P+S+LKPTGN LVL EE P ++ + + +LC HV
Sbjct: 681 AYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLGSLCSHV 740
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS-GRKISKILFASYGNPNGNCEN 754
S+SH PPV W S +++ KT P + + CPS + IS I FAS+G P G C +
Sbjct: 741 SESHPPPVEMWSSDSKQQ-KTG-------PVLSLECPSPSQVISSIKFASFGTPRGTCGS 792
Query: 755 YAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
++ G C + N+ +IV+KAC+G +SC++ V + K +GDPC G K+L V+A C
Sbjct: 793 FSHGQCSTRNALSIVQKACIGSKSCSIDV-SIKAFGDPCRGKTKSLAVEAYC 843
>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
sativus]
Length = 844
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/832 (47%), Positives = 519/832 (62%), Gaps = 68/832 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G RK+L SGS+HYPRSTP+MWP +I K+K+GGLDV++T VFWNLHEP
Sbjct: 26 NVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNLHEPV 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q+DF GR+DLV+FIK V A GLYV +RIGP++ EW YGG P WLH VPG+ FR+DNE
Sbjct: 86 RNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFRTDNE 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ IV+++K +LYASQGGPIILSQIENEYG V+ SF YV+WAA +
Sbjct: 146 PFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQWAATM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +G
Sbjct: 206 ATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQ--FTPNSNNKPKMWTENWSGWFLSFGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + GS NYYMYHGGTNFGRT+ ++ T Y AP+DEYG
Sbjct: 264 ALPYRPVEDLAFAVARFY-QTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L+RQPKWGHL+++H A+K+C + ++S + EA +++ S+C+AFL N D ++
Sbjct: 323 LVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVYKSGSQCSAFLANVDTQS 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ--------------------- 426
+ TV F+ Y LP S+SILPDCK V NTAK++SV
Sbjct: 383 DKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAFDS 442
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHD----PSDSESVLK 479
W E I S L EQ+NTT D SDYLWY+ K D + S +VL
Sbjct: 443 GWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNTVLH 502
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V SLGHVLH FIN + GS G +L+ + L+ G N + LLS+ VGL + GA+
Sbjct: 503 VDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYGAFF 562
Query: 540 ERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
E R AG+ V ++ K D SS W YQ+GL GE L + + S+ + S+
Sbjct: 563 ELRGAGVTGPVKLENXKNNITVDLSSGQWTYQIGLEGEDLGLPSGSTSQWL--SQPNLPK 620
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------- 646
++PLTWYKT FDAP GSDP+A++ GKGEAW+NG SIGRYW S++
Sbjct: 621 NKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIASGQCTSYCDYKG 680
Query: 647 -----------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
G PSQ+ YH+P+S+LKPTGN LVL EE P ++ + + +LC HV
Sbjct: 681 AYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLGSLCSHV 740
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS-GRKISKILFASYGNPNGNCEN 754
S+SH PPV W S +++ KT P + + CPS + IS I FAS+G P G C +
Sbjct: 741 SESHPPPVEMWSSDSKQQ-KTG-------PVLSLECPSPSQVISSIKFASFGTPRGTCGS 792
Query: 755 YAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
++ G C + N+ +IV+KAC+G +SC++ V + K +GDPC G K+L V+A C
Sbjct: 793 FSHGQCSTRNALSIVQKACIGSKSCSIDV-SIKAFGDPCRGKTKSLAVEAYC 843
>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
Length = 861
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/847 (46%), Positives = 521/847 (61%), Gaps = 73/847 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
G NVTYD R+++I+G R++L SGSIHYPRSTP MWP LI K+K+GGLDV++T VFW
Sbjct: 26 GASRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFW 85
Query: 83 NLHEP---QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVP 139
++HEP Q Q+DF GR+DLVRF+K V GLYV LRIGP++ EW YGG P WLH VP
Sbjct: 86 DIHEPVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVP 145
Query: 140 GIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGP 199
GI FR+DNE FK M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G
Sbjct: 146 GIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGK 205
Query: 200 PYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENW 259
Y+RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW
Sbjct: 206 AYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFT--PNSKSKPKMWTENW 263
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYY 318
+ ++ +G R AED+A+ VA F + G++ NYYMYHGGTNFGR T ++ T Y
Sbjct: 264 SGWFLSFGGAVPYRPAEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRSTGGPFIATSYD 322
Query: 319 DQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS--SEC 376
AP+DEYG++RQPKWGHL+++H A+KLC +++ + + EA ++Q + S C
Sbjct: 323 YDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSIC 382
Query: 377 AAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------------- 423
AAFL N D +++ V F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 383 AAFLANVDAQSDKAVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSS 442
Query: 424 --------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF-- 467
W E + E +L L+EQ+NTT DASD+LWY+
Sbjct: 443 IQDTDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV 502
Query: 468 KHDP---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
K D + S+S L V+SLGHVL +ING+ GSA G S +L+ V L+ G N +
Sbjct: 503 KGDEPYLNGSQSNLLVNSLGHVLQVYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKID 562
Query: 525 LLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LLS VGL + GA+ + AG+ V + G + SS W YQ+GL GE L ++
Sbjct: 563 LLSTTVGLSNYGAFFDLIGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSE 622
Query: 584 SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
+ S T+QPL WYKT F AP G DPVAI+ MGKGEAWVNGQSIGRYW + L
Sbjct: 623 ASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNL 682
Query: 644 TPQ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGI 681
PQ G PSQ+ YH+PRSFL+P N LVL E+ G P I
Sbjct: 683 APQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMI 742
Query: 682 SIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKI 740
S T +++C HVS+ H + SW S Q + + PG P +++ CP G+ IS I
Sbjct: 743 SFTTRQTSSICAHVSEMHPAQIDSWISPQQTS-----QTPG--PALRLECPREGQVISNI 795
Query: 741 LFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKAL 800
FAS+G P+G C NY G C SS + A+V++AC+G +C+VPV + F GDPC G+ K+L
Sbjct: 796 KFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNF-GDPCSGVTKSL 854
Query: 801 LVDAQCT 807
+V+A C+
Sbjct: 855 VVEAACS 861
>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 838
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/828 (47%), Positives = 512/828 (61%), Gaps = 66/828 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G R++L SGSIHYPRSTP+MWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 26 NVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR DLV+F+K V A GLYV LRIGP+ EW YGG P WLH +PGI FR+DN+
Sbjct: 86 QGQYNFEGRADLVKFVKAVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTDNK 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF+ MKR+ IV+MMK LYASQGGPIILSQ+ENEYG ++ ++ Y++WAA +
Sbjct: 146 PFEAEMKRFTVKIVDMMKQESLYASQGGPIILSQVENEYGNIDAAYGPAAKSYIKWAASM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPWVMC+Q DAPDP+IN CNG C + F PNS KP +WTENW+ ++ +G
Sbjct: 206 ATSLDTGVPWVMCQQADAPDPIINTCNGFYC-DQFT-PNSNAKPKMWTENWSGWFLSFGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+A+ VA F + G++ NYYMYHGGTNFGRT ++ YD AP+D+YG
Sbjct: 264 AVPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDQYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
++RQPKWGHLK++H A+KLC + +++ + EA +++ S CAAFL N +
Sbjct: 323 IIRQPKWGHLKDVHKAIKLCEEALIATDPTITSPGPNIEAAVYKTGSICAAFLANI-ATS 381
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-----WEEYKEAIPTYDET-- 440
+ATV F+ Y LP S+SILPDCK V NTAK++S E +KE + + D++
Sbjct: 382 DATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSASMISSFTTESFKEEVGSLDDSGS 441
Query: 441 ---------------SLRANFLLEQMNTTKDASDYLWYNFRFK-HDPSDSESVLKVSSLG 484
S LLEQ+NTT D SDYLWY+ S S++VL + SLG
Sbjct: 442 GWSWISEPIGISKSDSFSKFGLLEQINTTADKSDYLWYSISIDVEGDSGSQTVLHIESLG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LHAFING+ GS G ++ V L+ G N++ LLS+ VGL + GA+ + A
Sbjct: 502 HALHAFINGKIAGSGTGNSGKAKVNVDIPVTLVAGKNSIDLLSLTVGLQNYGAFFDTWGA 561
Query: 545 GLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
G+ V ++G K D SS W YQVGL E L GS S+ T+Q L
Sbjct: 562 GITGPVILKGLKNGSTVDLSSQQWTYQVGLKYEDLG--PSNGSSGQWNSQSTLPTNQSLI 619
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
WYKT F AP+GS+PVAI+ MGKGEAWVNGQSIGRYW ++++P
Sbjct: 620 WYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSPNGGCTDSCNYRGAYSS 679
Query: 647 -------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
G PSQ+ YHIPRS+L+P N LVL EE G P IS T + ++C HVS+SH
Sbjct: 680 SKCLKNCGKPSQTLYHIPRSWLQPDSNTLVLFEESGGDPTQISFATKQIGSMCSHVSESH 739
Query: 700 LPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGNCENYAIG 758
PPV W S R + P + + CP + IS I FAS+G P G C N+ G
Sbjct: 740 PPPVDLWNSDKGRKVG---------PVLSLECPYPNQLISSIKFASFGTPYGTCGNFKHG 790
Query: 759 SCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C S+ + +IV+KAC+G SC + + F GDPC G+ K+L V+A C
Sbjct: 791 RCRSNKALSIVQKACIGSSSCRIGISINTF-GDPCKGVTKSLAVEASC 837
>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 736 bits (1900), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/821 (47%), Positives = 509/821 (61%), Gaps = 59/821 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++ING R+IL SGSIHYPRSTP+MW L+ KAK+GGLDVV T VFWN+HEP P
Sbjct: 29 VTYDKKAILINGQRRILISGSIHYPRSTPEMWDDLMQKAKDGGLDVVDTYVFWNVHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G +DF GR DLVRFIK Q GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 89 GNYDFEGRYDLVRFIKTAQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK+ +L+ASQGGPIILSQIENEYG + G Y+ WAAK+A
Sbjct: 149 FKMAMQGFTQKIVQMMKSEKLFASQGGPIILSQIENEYGPQSKALGAAGHAYMNWAAKMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK+DDAPDPVIN+CNG C + PN P KP +WTE W+ ++ +G
Sbjct: 209 VGLNTGVPWVMCKEDDAPDPVINSCNGFYC--DYFSPNKPYKPTLWTEAWSGWFTEFGGP 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R +D+A+ VA F+ K GS NYYMYHGGTNFGRTA +T YD APLDEYG+
Sbjct: 267 VYGRPVQDLAFAVARFVQK-GGSLFNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGM 325
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKRN 387
LRQPK+GHLK LH A+KLC ++S + ++A +F G CAAFL N +
Sbjct: 326 LRQPKYGHLKNLHRAIKLCEHALVSSDPTVTSLGAYEQAHVFSSGPGRCAAFLANYHTNS 385
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------------SVEQWEEYKEAIP 435
ATV F+N+ Y LP SISILPDCK V FNTA++ S WE Y E
Sbjct: 386 AATVVFNNMRYALPAWSISILPDCKRVVFNTAQVGVHIAQTQMLPTISKLSWETYNEDTY 445
Query: 436 TYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLH 488
+ +S + LLEQ+N T+D SDYLWY S++ + L V S GH +H
Sbjct: 446 SLGGSSRMTVAGLLEQINVTRDTSDYLWYMTSVGISSSEAFLRGGQKPTLSVRSAGHAVH 505
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING+F GSA+G +FT ++L G N ++LLS+ VGLP+ G + E+ G L
Sbjct: 506 VFINGQFSGSAYGSREHPAFTYTGPINLRAGMNKIALLSIAVGLPNVGLHFEKWQTGILG 565
Query: 548 NVSIQGAK-ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
+SI G KD + W YQVGL GE + + + + V W + GS +PLTWYK
Sbjct: 566 PISISGLNGGKKDLTWQKWSYQVGLKGEAMNLVSPTEATSVDWIK-GSLLQGQRPLTWYK 624
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF-------LTPQGT--------- 648
F+AP G++P+A++L SMGKG+AW+NGQSIGRYW+++ T GT
Sbjct: 625 ASFNAPRGNEPLALDLRSMGKGQAWINGQSIGRYWMAYAKGGCSRCTYAGTYRPPTCENG 684
Query: 649 ---PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
P+Q WYH+PRS+LKPT N+LVL EE G IS+ SVT LCG + H
Sbjct: 685 CGQPTQRWYHVPRSWLKPTNNVLVLFEELGGDASKISLMRRSVTGLCGEAVEYH------ 738
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
+ +++++ + + ++C G+ IS I FAS+G P+G C +Y G+CH+ +S
Sbjct: 739 -AKNDSYIIESNEEL----DSLHLQCNPGQVISAIKFASFGTPSGTCGSYQKGTCHAPDS 793
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
AI+EK C+G +SC+V + F DPCP K LLV+ C
Sbjct: 794 HAIIEKKCIGLKSCSVSTTRDNFGVDPCPNELKQLLVEVDC 834
>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
Length = 818
Score = 736 bits (1899), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/829 (45%), Positives = 505/829 (60%), Gaps = 70/829 (8%)
Query: 38 IINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGR 97
+I+G R++L SGSIHYPRSTP+MWP LI K+K GGLD+++T VFW+LHEP GQ+DF GR
Sbjct: 1 VIDGTRRVLISGSIHYPRSTPEMWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGR 60
Query: 98 RDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRY 157
+DLVRFIK V GLYV LRIGP+ EW YGG P WLH +PGI FR+DN+PFK M+R+
Sbjct: 61 KDLVRFIKTVGEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIKFRTDNKPFKDEMQRF 120
Query: 158 ATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVP 217
T IV++MK LYASQGGPIILSQIENEYG ++ ++ Y+ WAA +A L TGVP
Sbjct: 121 TTKIVDLMKQENLYASQGGPIILSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGVP 180
Query: 218 WVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAED 277
WVMC+Q DAPDP+IN CNG C + PNS +KP IWTENW+ ++ +G R ED
Sbjct: 181 WVMCQQTDAPDPIINTCNGFYCDQ--FSPNSNNKPKIWTENWSGWFLSFGGPVPQRPVED 238
Query: 278 IAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGH 336
+A+ VA F + G++ NYYMY G NFG T+ ++ T Y AP+DEYG+ RQPKWGH
Sbjct: 239 LAFAVARFFQR-GGTFQNYYMYTWGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGH 297
Query: 337 LKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKRNNATVYFSN 395
LKELH A+KLC +++ ++ EA +++ S CAAFL N +++ATV F+
Sbjct: 298 LKELHKAIKLCEPALVATDHHTLRLGPNLEAHVYKTASGVCAAFLANIGTQSDATVTFNG 357
Query: 396 LMYELPPLSISILPDCKTVAFNTAKLDS---------------------------VEQWE 428
Y LP S+SILPDC+TV FNTA+++S W
Sbjct: 358 KSYSLPAWSVSILPDCRTVVFNTAQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSDWS 417
Query: 429 EYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSS 482
E + ++R LLEQ+NTT D SDYLWY+ D + ++S L S
Sbjct: 418 FVIEPVGISKSNAIRKTGLLEQINTTADVSDYLWYSISIAIDGDEPFLSNGTQSNLHAES 477
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
LGHVLHAF+NG+ GS G + EK++ L G N++ LLS VGL + GA+ +
Sbjct: 478 LGHVLHAFVNGKLAGSGIGNSGNAKIIFEKLIMLTPGNNSIDLLSATVGLQNYGAFFDLM 537
Query: 543 VAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
AG+ V ++G D SS +W YQ+GL GE L + + G S +QPL
Sbjct: 538 GAGITGPVKLKGQNGTLDLSSNAWTYQIGLKGEDLSLHENSGDVSQWISESTLPKNQPLI 597
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
WYKT F+AP G+DPVAI+ MGKGEAWVNGQSIGRYW ++ +PQ
Sbjct: 598 WYKTTFNAPDGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTYSSPQNGCSTACNYRGPYSA 657
Query: 647 -------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
G PSQ YH+PRSF++ N LVL EE G P IS+ T +T+LC HVS+SH
Sbjct: 658 SKCIKNCGKPSQILYHVPRSFIQSESNTLVLFEEMGGDPTQISLATKQMTSLCAHVSESH 717
Query: 700 LPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGNCENYAIG 758
PV +W S Q+ K+ P +Q+ CP + IS I FAS+G P+G C ++
Sbjct: 718 PAPVDTWLSLQQKGKKS-------GPTIQLECPYPNQVISSIKFASFGTPSGMCGSFNHS 770
Query: 759 SCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C S++ A+V+KAC+G + C+V + + K GDPC G+ K+L V+A C+
Sbjct: 771 QCSSASVLAVVQKACVGSKRCSVGI-SSKTLGDPCRGVIKSLAVEAACS 818
>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 848
Score = 734 bits (1896), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/824 (46%), Positives = 506/824 (61%), Gaps = 54/824 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD ++L+I+G R++LFSGSIHYPRSTP+MW LI KAK+GGLD + T VFWNLHEP
Sbjct: 30 NVVYDRKALVIDGQRRLLFSGSIHYPRSTPEMWEGLIQKAKDGGLDAIDTYVFWNLHEPS 89
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFIK V GLYV LRIGP+I EW +GG P WL VPGI FR+DNE
Sbjct: 90 PGNYNFEGRNDLVRFIKTVHKAGLYVHLRIGPYICSEWNFGGFPVWLKFVPGISFRTDNE 149
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ +V +MK +L+ SQGGPIILSQIENEY +F G Y+ WAAK+
Sbjct: 150 PFKSAMQKFTQKVVQLMKNEKLFESQGGPIILSQIENEYEPESKAFGASGYAYMTWAAKM 209
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV + TGVPWVMCK+DDAPDPVIN CNG C + PN P KP +WTE W+ ++ +G
Sbjct: 210 AVGMGTGVPWVMCKEDDAPDPVINTCNGFYC--DYFSPNKPYKPTMWTEAWSGWFTEFGG 267
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+ + VA FI K GS++NYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 268 PIYQRPVEDLTFAVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 326
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+R+PK+GHLKELH AVKLC +L+ ++A +F S A FL N + +
Sbjct: 327 LIRRPKYGHLKELHKAVKLCELALLNADPTVTTLGSYEQAHVFSSKSGSGAVFLSNFNTK 386
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ V F+N+ + LPP SISILPDCK VAFNTA++ + W + E
Sbjct: 387 SATKVTFNNMNFHLPPWSISILPDCKNVAFNTARVGVQTSQTQLLRTNSELHSWGIFNED 446
Query: 434 IPTY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHV 486
+ + +T++ LL+Q+N T+D+SDYLWY DPS+S L V S G
Sbjct: 447 VSSVAGDTTITVTGLLDQLNITRDSSDYLWYTTSVDIDPSESFLGGGQHPSLTVQSAGDA 506
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
+H FIN + GSA G + FT V+L G N +SLLS+ VGL ++G + E R G
Sbjct: 507 MHVFINDQLSGSASGTREHRRFTFTGNVNLHAGLNKISLLSIAVGLANNGPHFETRNTGV 566
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPLTW 602
L V++ G +D S W YQVGL GE + + V W + QPLTW
Sbjct: 567 LGPVALHGLDHGTRDLSWQKWSYQVGLKGEATNLDSPNSISAVDWMTGSLVAQKQQPLTW 626
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-------SFLTPQGT------- 648
YK FD P G +P+A+++ SMGKG+ W+NGQSIGRYW S T GT
Sbjct: 627 YKAYFDEPNGDEPLALDMGSMGKGQVWINGQSIGRYWTIYADSDCSACTYSGTFRPKKCQ 686
Query: 649 -----PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
P+Q WYH+PRS+LKP+ NLLV+ EE G +++ SVT++C VS++H P +
Sbjct: 687 FGCQHPTQQWYHVPRSWLKPSKNLLVVFEEIGGDVSKVALVKKSVTSVCAEVSENH-PRI 745
Query: 704 ISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
+W +++ + ++P++ + C G IS I F+S+G P+G+C + G+CH+
Sbjct: 746 TNWHTESHGQTEVQ-----QKPEISLHCTDGHSISAIKFSSFGTPSGSCGKFQHGTCHAP 800
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS A+++K CLGK+ C+V + F DPCP K L V+A C+
Sbjct: 801 NSNAVLQKECLGKQKCSVTISNTNFGADPCPSKLKKLSVEAVCS 844
>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
Length = 861
Score = 734 bits (1895), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/847 (46%), Positives = 522/847 (61%), Gaps = 73/847 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
G NVTYD R+++I+G R++L SGSIHYPRSTP MWP LI K+K+GGLDV++T VFW
Sbjct: 26 GASRAANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFW 85
Query: 83 NLHEP---QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVP 139
++HE Q Q+DF GR+DLVRF+K V GLYV LRIGP++ EW YGG P WLH VP
Sbjct: 86 DIHEAVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVP 145
Query: 140 GIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGP 199
GI FR+DNE FK M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G
Sbjct: 146 GIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGK 205
Query: 200 PYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENW 259
Y+RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW
Sbjct: 206 AYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFT--PNSKSKPKMWTENW 263
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYY 318
+ ++ +G R AED+A+ VA F + G++ NYYMYHGGTNFGR T ++ T Y
Sbjct: 264 SGWFLSFGGAVPYRPAEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRSTGGPFIATSYD 322
Query: 319 DQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS--SEC 376
AP+DEYG++RQPKWGHL+++H A+KLC +++ + + EA ++Q + S C
Sbjct: 323 YDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSIC 382
Query: 377 AAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------------- 423
AAFL N D +++ TV F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 383 AAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSS 442
Query: 424 --------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF-- 467
W E + E +L L+EQ+NTT DASD+LWY+
Sbjct: 443 IQDTDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV 502
Query: 468 KHDP---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
K D + S+S L V+SLGHVL +ING+ GSA G S +L+ V L+ G N +
Sbjct: 503 KGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKID 562
Query: 525 LLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LLS VGL + GA+ + AG+ V + G + SS W YQ+GL GE L ++
Sbjct: 563 LLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSE 622
Query: 584 SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
+ S T+QPL WYKT F AP G DPVAI+ MGKGEAWVNGQSIGRYW + L
Sbjct: 623 ASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNL 682
Query: 644 TPQ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGI 681
PQ G PSQ+ YH+PRSFL+P N LVL E+ G P I
Sbjct: 683 APQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMI 742
Query: 682 SIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKI 740
S T +++C HVS+ H + SW S Q+T +T + P +++ CP G+ IS I
Sbjct: 743 SFTTRQTSSICAHVSEMHPAQIDSWISP-QQTSQT------QGPALRLECPREGQVISNI 795
Query: 741 LFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKAL 800
FAS+G P+G C NY G C SS + A+V++AC+G +C+VPV + F GDPC G+ K+L
Sbjct: 796 KFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNF-GDPCSGVTKSL 854
Query: 801 LVDAQCT 807
+V+A C+
Sbjct: 855 VVEAACS 861
>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
Length = 844
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/834 (45%), Positives = 506/834 (60%), Gaps = 66/834 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD RSLII+GHRK+L S SIHYPRS P MWP LI AKEGG+DV++T VFWN HE
Sbjct: 21 NVTYDRRSLIIDGHRKLLISASIHYPRSVPAMWPSLIQNAKEGGVDVIETYVFWNGHELS 80
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
P + F GR DLV+FI V GLY+ LRIGPF+ EW +GG+P WLH +P VFR+DN
Sbjct: 81 PDNYHFDGRFDLVKFINIVHNAGLYLILRIGPFVAAEWNFGGVPVWLHYIPNTVFRTDNA 140
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
FKF+M+++ T IV++MK +L+ASQGGPIILSQ+ENEYG +E + E G PY WAA++
Sbjct: 141 SFKFYMQKFTTYIVSLMKKEKLFASQGGPIILSQVENEYGDIERVYGEGGKPYAMWAAQM 200
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV GVPW+MC+Q DAPDPVIN CN C + PNSP+KP +WTENW +++ +G
Sbjct: 201 AVSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQF--TPNSPNKPKMWTENWPGWFKTFGA 258
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R EDIA+ VA F K GS NYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 259 RDPHRPPEDIAFSVARFFQK-GGSLQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 317
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLKELH A+KL + +L+ ++ EA ++ SS CAAF+ N D++
Sbjct: 318 LPRLPKWGHLKELHRAIKLTERVLLNSEPTYVSLGPSLEADVYTDSSGACAAFIANIDEK 377
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE----------------- 425
++ TV F N+ Y LP S+SILPDCK V FNTA + S VE
Sbjct: 378 DDKTVQFRNISYHLPAWSVSILPDCKNVVFNTAMIRSQTAMVEMVPEELQPSADATNKDL 437
Query: 426 ---QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESV 477
+WE + E + + N L++ +NTTKD +DYLWY + ++ S+ V
Sbjct: 438 KALKWEVFVEQPGIWGKADFVKNVLVDHLNTTKDTTDYLWYTTSIFVNENEKFLKGSQPV 497
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L V S GH LHAFIN + SA G SD +F ++ + L G N ++LLS+ VGL ++G
Sbjct: 498 LVVESKGHALHAFINKKLQVSATGNGSDITFKFKQAISLKAGKNEIALLSMTVGLQNAGP 557
Query: 538 YLERRVAGLRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGSS 595
+ E AGL V I+G D SS++W Y++GL GE L I+ G + V W S
Sbjct: 558 FYEWVGAGLSKVVIEGFNNGPVDLSSYAWSYKIGLQGEHLGIYKPDGIKNVKWLSSREPP 617
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVS-------------- 641
QPLTWYK + D P+G++PV ++++ MGKG AW+NG+ IGRYW +
Sbjct: 618 KQQPLTWYKVILDPPSGNEPVGLDMVHMGKGLAWLNGEEIGRYWPTKSSIHDVCVQKCDY 677
Query: 642 --------FLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCG 693
LT G P+Q WYH+PRS+ KP+GN+LV+ EE+ G P I + V +C
Sbjct: 678 RGKFRPDKCLTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTQIRLSKRKVLGICA 737
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCE 753
H+ + H P + SW K+ + V ++CP +I+KI FAS+G P G+C
Sbjct: 738 HLGEGH-PSIESWSEAENVERKS-------KATVDLKCPDNGRIAKIKFASFGTPQGSCG 789
Query: 754 NYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+Y+IG CH NS ++VEK CL + C + + E F CP K L V+A C+
Sbjct: 790 SYSIGDCHDPNSISLVEKVCLNRNECRIELGEEGFNKGLCPTASKKLAVEAMCS 843
>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
Length = 836
Score = 734 bits (1894), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/822 (47%), Positives = 511/822 (62%), Gaps = 55/822 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING ++IL SGSIHYPRSTP+MWP LI K+K+GGLDV+QT VFWN HEP
Sbjct: 27 SVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLV+FIK V GLYV LRIGP++ EW +GG P WL VPGIVFR+DNE
Sbjct: 87 PGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV+MMKA +L+ SQGGPIILSQIENE+G VE G Y +WAA++
Sbjct: 147 PFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F PN KP +WTE WT +Y +G
Sbjct: 207 AVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFT-PNKNYKPKMWTEVWTGWYTEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ +A FI K GS+VNYYMYHGGTNFGRTA + YD APLDEYG
Sbjct: 265 AVPTRPAEDLAFSIARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L R+PKWGHL++LH A+K ++S + QEA +F+ S CAAFL N D ++
Sbjct: 324 LPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSGCAAFLANYDTKS 383
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------WEEYKEAIP 435
+A V F N YELPP SISILPDC+T +NTA+L S W+ + E
Sbjct: 384 SAKVSFGNGQYELPPWSISILPDCRTAVYNTARLGSQSSQMKMTPVKSALPWQSFIEESA 443
Query: 436 TYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLH 488
+ DE+ + + L EQ+N T+D +DY WY P + +L + S GH LH
Sbjct: 444 SSDESDTTTLDGLWEQINVTRDTTDYSWYMTDITISPDEGFIKRGESPLLTIYSAGHALH 503
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING+ G+ +G + T + V L +G N ++LLS+ VGLP+ G + E AG L
Sbjct: 504 VFINGQLSGTVYGALENPKLTFSQNVKLRSGINKLALLSISVGLPNVGLHFETWNAGVLG 563
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKT 605
V+++G D S + W Y+VGL GE L + T GS V W+ S + QPLTWY+
Sbjct: 564 PVTLKGLNSGTWDMSRWKWTYKVGLKGEALGLHTVSGSSSVEWAEGPSMAQKQPLTWYRA 623
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TP 645
F+AP G+ P+A+++ SMGKG+ W+NGQSIGR+W ++ T
Sbjct: 624 TFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTARGNCGNCYYAGTYDDKKCRTH 683
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS 705
G PSQ WYH+PRS+L +GNLLV+ EE G P IS+ +++C + + P ++
Sbjct: 684 CGEPSQRWYHVPRSWLTTSGNLLVVFEEWGGDPTKISLVERRTSSVCADIFEGQ--PTLT 741
Query: 706 WRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNS 765
N + L + K RPK + CP G+ IS I FASYG G C ++ GSCH+ S
Sbjct: 742 ----NSQKLASGKL---NRPKAHLWCPPGQVISDIKFASYGLSQGTCGSFQEGSCHAHKS 794
Query: 766 RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
++ C+GK+SC+V V E F GDPCPG K L V+A C+
Sbjct: 795 YDAPKRNCIGKQSCSVTVAPEVFGGDPCPGSTKKLSVEAVCS 836
>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
Length = 843
Score = 733 bits (1892), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/831 (46%), Positives = 508/831 (61%), Gaps = 61/831 (7%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
+NVTYD RSLII+G R+++ S SIHYPRS P+MWP+L+A+AK+GG D ++T VFWN H
Sbjct: 25 AASNVTYDHRSLIISGRRRLIISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGH 84
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
E PGQ+ F R DLVRF+K V+ GL + LRIGPF+ EW +GG+P WLH VPG VFR+
Sbjct: 85 EIAPGQYYFEDRFDLVRFVKVVKDAGLLLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRT 144
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG-MVEHSFLEKGPPYVRW 204
DNEPFK HMK + T IVNMMK +L+ASQGG IIL+QIENEYG E ++ G PY W
Sbjct: 145 DNEPFKSHMKSFTTYIVNMMKKEQLFASQGGNIILAQIENEYGDYYEQAYAPGGKPYAMW 204
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA +AV TGVPW+MC++ DAPDPVIN+CNG C + F PNSP KP +WTENW ++Q
Sbjct: 205 AASMAVAQNTGVPWIMCQESDAPDPVINSCNGFYC-DGFQ-PNSPTKPKLWTENWPGWFQ 262
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPL 323
+G+ R ED+A+ VA F K GS NYY+YHGGTNFGRT +T YD AP+
Sbjct: 263 TFGESNPHRPPEDVAFAVARFFEK-GGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPI 321
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVN 382
DEYGL R PKW HL++LH +++LC +L G ++ QEA I+ S C AFL N
Sbjct: 322 DEYGLRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLAN 381
Query: 383 KDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV----------------EQ 426
D N+ V F N Y+LP S+SILPDC+ V FNTAK+ S E+
Sbjct: 382 IDSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVAMVPESLQASKPER 441
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES---VLKVSSL 483
W ++E + + N ++ +NTTKD++DYLWY F D S S+ VL + S
Sbjct: 442 WNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDESYSKGSHVVLNIDSK 501
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH +HAF+N EF+GSA+G S SF+++ ++L G N ++LLS+ VGL ++G E
Sbjct: 502 GHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRTGKNELALLSMTVGLQNAGFSYEWIG 561
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFT---DYGSRIVPWSRYGSSTHQP 599
AG NV+I G + + SS +W Y++GL GE +F R +P S +QP
Sbjct: 562 AGFTNVNISGVRNGTINLSSNNWAYKIGLEGEYYSLFKPDQRNNQRWIPQSE--PPKNQP 619
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV------SFLTPQ------- 646
LTWYK D P G DPV I++ SMGKG W+NG +IGRYW TP
Sbjct: 620 LTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIGRYWPRTSSIDDRCTPSCDYRGEF 679
Query: 647 ---------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
G P+Q WYHIPRS+ P+GN+LV+ EE+ G P I+ +VT++C VS+
Sbjct: 680 NPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEKGGDPTKITFSRRAVTSVCSFVSE 739
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRP-KVQIRCPSGRKISKILFASYGNPNGNCENYA 756
H P + + + G P K Q+ CP G+ IS + FAS G P+G C +Y
Sbjct: 740 -HFPSI------DLESWDGSATNEGTSPAKAQLSCPIGKNISSLKFASLGTPSGTCRSYQ 792
Query: 757 IGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
GSCH NS ++VEKACL SCTV + E F D CPG+ K L ++A C+
Sbjct: 793 KGSCHHPNSLSVVEKACLNTNSCTVSLSDESFGKDLCPGVTKTLAIEADCS 843
>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 826
Score = 733 bits (1891), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 396/819 (48%), Positives = 511/819 (62%), Gaps = 59/819 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R++ ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 25 NVWYDSRAITINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLVRFIK VQ GLY+ LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 85 PGKYYFEGNYDLVRFIKLVQQGGLYLHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ + IVNMMKA +L+ QGGPIILSQIENE+G +E+ Y WAAK+
Sbjct: 145 PFKAEMEKFTSHIVNMMKAEKLFHWQGGPIILSQIENEFGPLEYDQGAPAKAYAAWAAKM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AVDL+TGVPWVMCK+DDAPDPVIN NG + PN KP +WTENWT ++ YG
Sbjct: 205 AVDLETGVPWVMCKEDDAPDPVINTWNGFYADGFY--PNKRYKPMMWTENWTGWFTGYGV 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F+ K GSYVNYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 263 PVPHRPVEDLAFSVAKFVQK-GGSYVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
+LRQPK+GHL +LH A+KLC ++SG V + QE+ +F+ +S CAAFL N D +
Sbjct: 322 MLRQPKYGHLTDLHKAIKLCEPALVSGYPVVTSLGNNQESNVFRSNSGACAAFLANYDTK 381
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIP 435
ATV F+ + Y LPP SISILPDCKT FNTA++ + W Y E
Sbjct: 382 YYATVTFNGMRYNLPPWSISILPDCKTTVFNTARVGAQTTQMQMTTVGGFSWVSYNEDPN 441
Query: 436 TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHA 489
+ D+ S L+EQ++ T+D++DYLWY D ++ VL S GH LH
Sbjct: 442 SIDDGSFTKLGLVEQISMTRDSTDYLWYTTYVNIDQNEQFLKNGQYPVLTAQSAGHSLHV 501
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN- 548
FING+ +G+A+G D T V L G+N +S LS+ VGLP+ G + E GL
Sbjct: 502 FINGQLIGTAYGSVEDPRLTYTGNVKLFAGSNKISFLSIAVGLPNVGEHFETWNTGLLGP 561
Query: 549 VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVF 607
V++ G E K D + W Y++GL GE L + T GS V W +S QPL WYK F
Sbjct: 562 VTLNGLNEGKRDLTWQKWTYKIGLKGEALSLHTLSGSSNVEWGD--ASRKQPLAWYKGFF 619
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTP------------------ 649
+AP GS+P+A+++ +MGKG+ W+NGQSIGRYW ++ P
Sbjct: 620 NAPGGSEPLALDMSTMGKGQVWINGQSIGRYWPAYKARGSCPKCDYEGTYEETKCQSNCG 679
Query: 650 --SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWR 707
SQ WYH+PRS+L PTGNL+V+ EE G P GIS+ S+ + C +VS P + +W
Sbjct: 680 DSSQRWYHVPRSWLNPTGNLIVVFEEWGGEPTGISLVKRSMRSACAYVSQGQ-PSMNNWH 738
Query: 708 SQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRA 767
++ + KV + C G K+++I FASYG P G CE+Y+ G CH+ S
Sbjct: 739 TKYAES------------KVHLSCDPGLKMTQIKFASYGTPQGACESYSEGRCHAHKSYD 786
Query: 768 IVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
I +K C+G++ C+V V E F GDPCPGI K++ V A C
Sbjct: 787 IFQKNCIGQQVCSVTVVPEVFGGDPCPGIMKSVAVQASC 825
>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 852
Score = 732 bits (1890), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/821 (46%), Positives = 508/821 (61%), Gaps = 53/821 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++++ING R++L SGSIHYPRSTP+MW LI KAK+GGLDV+ T VFWN HEP P
Sbjct: 30 VTYDKKAILINGQRRLLISGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYVFWNGHEPSP 89
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F GR DLVRFIK VQ GL++ LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 90 GNYYFEGRYDLVRFIKTVQKAGLFLHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 149
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK +L+ASQGGPIILSQIENEYG + G Y+ WAAK+A
Sbjct: 150 FKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPERKALGAPGQNYINWAAKMA 209
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK+DDAPDP+INACNG C + F PN P KP +WTE W+ ++ +G
Sbjct: 210 VGLDTGVPWVMCKEDDAPDPMINACNGFYC-DGFT-PNKPYKPTMWTEAWSGWFLEFGGT 267
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R +D+A+ VA FI + GSYVNYYMYHGGTNFGRTA +T YD AP+DEYGL
Sbjct: 268 IHHRPVQDLAFAVARFIQR-GGSYVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGL 326
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKRN 387
+RQPK+GHLKELH A+KLC +LS + +A++F G CAAFL N
Sbjct: 327 IRQPKYGHLKELHKAIKLCEHSLLSSEPTVTSLGTYHQAYVFNSGPRRCAAFLSNFHSV- 385
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEAI 434
A V F+N Y+LPP S+SILPDC+ +NTAK+ + W+ Y E I
Sbjct: 386 EARVTFNNKHYDLPPWSVSILPDCRNEVYNTAKVGVQTSHVQMIPTNSRLFSWQTYDEDI 445
Query: 435 PT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSSLGHVLHA 489
+ ++ +S+ A LLEQ+N T+D SDYLWY SD + L V S GH LH
Sbjct: 446 SSVHERSSIPAIGLLEQINVTRDTSDYLWYMTNVDISSSDLSGGKKPTLTVQSAGHALHV 505
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN- 548
F+NG+F GSA G + FT V+L G N ++LLS+ VGLP+ G + E G++
Sbjct: 506 FVNGQFSGSAFGTREQRQFTFADPVNLHAGINRIALLSIAVGLPNVGLHYESWKTGIQGP 565
Query: 549 VSIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLTWYKT 605
V + G KD + W +VGL GE + + + G+ V W R + T Q L WYK
Sbjct: 566 VFLDGLGNGKKDLTLHKWFNKVGLKGEAMNLVSPNGASSVGWIRRSLATQTKQTLKWYKA 625
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------------- 646
F+AP G++P+A+++ MGKG+ W+NGQSIGRYW+++
Sbjct: 626 YFNAPGGNEPLALDMRRMGKGQVWINGQSIGRYWMAYAKGDCSSCSYIGTFRPTKCQLHC 685
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW 706
G P+Q WYH+PRS+LKPT NL+V+ EE G P I++ SV +CG + ++H P ++
Sbjct: 686 GRPTQRWYHVPRSWLKPTQNLVVVFEELGGDPSKITLVRRSVAGVCGDLHENH-PNAENF 744
Query: 707 RSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSR 766
KT + +V + C G+ IS I FAS+G P+G C ++ G+CH++NS
Sbjct: 745 DVDGNEDSKTL-----HQAQVHLHCAPGQSISSIKFASFGTPSGTCGSFQQGTCHATNSH 799
Query: 767 AIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
A+VEK C+G+ SC+V V F DPCP + K L V+A C+
Sbjct: 800 AVVEKNCIGRESCSVAVSNSTFETDPCPNVLKRLSVEAVCS 840
>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
Length = 1052
Score = 732 bits (1890), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/813 (46%), Positives = 516/813 (63%), Gaps = 50/813 (6%)
Query: 30 VTYDG--RSLIINGHRK----ILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
VTYDG R+ I + +K + F S MWP +I KA+ GGL+ +QT VFWN
Sbjct: 33 VTYDGSERNFIDHKWKKRASFLWFCSLPSKHTSRKHMWPSIIDKARIGGLNTIQTYVFWN 92
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
+HEP+ G++DF GR DLV+FIK + +GLYV LR+GPFI+ EW +GGLP+WL +VP + F
Sbjct: 93 VHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYF 152
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R++NEPFK H +RY I+ MMK +L+ASQGGPIIL QIENEY V+ ++ E G Y++
Sbjct: 153 RTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIK 212
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WAA L + G+PWVMCKQ+DAP +INACNGR CG+TF GPN DKP++WTENWT+ +
Sbjct: 213 WAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQF 272
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPL 323
+V+GD R+ EDIA+ VA + +K GS+VNYYMYHGGTNFGRT++ +V T YYD APL
Sbjct: 273 RVFGDPPTQRTVEDIAFSVARYFSK-NGSHVNYYMYHGGTNFGRTSAHFVTTRYYDDAPL 331
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLV 381
DE+GL + PK+GHLK +H A++LC K + G L + E ++ G+ CAAFL
Sbjct: 332 DEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLS 391
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW-------------- 427
N + R+ T+ F Y LP SISILPDCKTV +NTA++ + W
Sbjct: 392 NNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFVKSEKTSKGLK 451
Query: 428 -EEYKEAIPTYDETSLRANFLL--EQMNTTKDASDYLWYNFRFKHDPSDS--ESVLKVSS 482
E + E IP+ L + L+ E TKD +DY P +++L+V+S
Sbjct: 452 FEMFSENIPSL----LDGDSLIPGELYYLTKDKTDYACVKIDEDDFPDQKGLKTILRVAS 507
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
LGH L ++NGE+ G AHG+H KSF K V+ G N +S+L V+ GLPDSG+Y+E R
Sbjct: 508 LGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHR 567
Query: 543 VAGLRNVSIQGAKE-LKDFS-SFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
AG R +SI G K +D + + WG+ GL GEK +++T+ GS+ V W + G +PL
Sbjct: 568 FAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGK--RKPL 625
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWYKT F+ P G + VAI + +MGKG WVNG +GRYW+SFL+P G P+Q+ YHIPRSF
Sbjct: 626 TWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSPLGEPTQTEYHIPRSF 685
Query: 661 LK--PTGNLLVLLEEENGYPPGI---SIDTVSVT--TLCGHVSDSHLPPVISWRSQNQRT 713
+K N+LV+LEEE PG+ SID V V T+C +V + + V SW+ + +
Sbjct: 686 MKGEKKKNMLVILEEE----PGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKI 741
Query: 714 LKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKAC 773
+ K + R K +RCP +++ ++ FAS+G+P G C N+ +G C +S S+ +VEK C
Sbjct: 742 VSRSKDM---RLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKEC 798
Query: 774 LGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
LG+ C++ V E F CP I K L V +C
Sbjct: 799 LGRNYCSIVVARETFGDKGCPEIVKTLAVQVKC 831
>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
Length = 715
Score = 731 bits (1887), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/687 (50%), Positives = 465/687 (67%), Gaps = 23/687 (3%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGRS+I+NG R++LFSGSIHYPR P+MWP +I KAKEGGL+++QT VFWN+HEP
Sbjct: 28 VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPDIIRKAKEGGLNLIQTYVFWNIHEPVQ 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQF+F G D+V+FIK + QGLYV LRIGP+IE EW GG P+WL +VP I FRS NEP
Sbjct: 88 GQFNFEGNYDVVKFIKTIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F HMK+Y+ M++++MK +L+A QGGPII++QIENEY V+ ++ + G YV WAA +A
Sbjct: 148 FIHHMKKYSEMVIDLMKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYVEWAANMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L GVPW+MCKQ DAP VIN CNGR C +TF GPN P+KP++WTENWT+ Y+ +GD
Sbjct: 208 TGLYNGVPWIMCKQKDAPAQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R+AEDIA+ VA F AK G+ NYYMY+GGTN+GRT S++V T YYD+APLDE+GL
Sbjct: 268 PSQRAAEDIAFSVARFFAK-NGTLTNYYMYYGGTNYGRTGSSFVTTRYYDEAPLDEFGLY 326
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKRNN 388
R+PKW HL++LH A++L + +L G ++ E +++ ++CAAFL N
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPSVQKINQHLEITVYEKPGTDCAAFLTNNHTTLP 386
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKEAI 434
AT+ F Y LP S+SILPDCK ++ NT + S +WE Y+E +
Sbjct: 387 ATIKFRGREYYLPEKSVSILPDCKLLSTNTQTIVSQHNSRNFLPSEKAKNLKWEMYQEKV 446
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVSSLGHVLH 488
PT + SL+ LE + TKD SDY WY+ D D VL+++S+GH L
Sbjct: 447 PTISDLSLKNREPLELYSLTKDTSDYAWYSTSINFDRHDLPMRPDILPVLQIASMGHALS 506
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
AF+NGEFVG HG + +KSF +K V L GTN +S+L+ VG P+SGAY+E+R AG R
Sbjct: 507 AFVNGEFVGFGHGNNIEKSFVFQKPVILKPGTNTISILAETVGFPNSGAYMEKRFAGPRG 566
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVF 607
+++QG D + +WG++VG+ GEK Q+FT+ G++ V W+ T +TWYKT F
Sbjct: 567 ITVQGLMAGTLDITQNNWGHEVGVFGEKEQLFTEEGAKKVKWTPVNGPTKGAVTWYKTYF 626
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNL 667
DAP G++PVA+ + M KG WVNG S+GRYW SFL+P G P+Q YHIPR+FLKPT NL
Sbjct: 627 DAPEGNNPVALKMDKMQKGMMWVNGNSLGRYWSSFLSPLGQPTQFEYHIPRAFLKPTNNL 686
Query: 668 LVLLEEENGYPPGISIDTVSVTTLCGH 694
LV+ EE G+P I + V+ T H
Sbjct: 687 LVIFEETGGHPETIEVQIVNRDTNLQH 713
>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
Length = 844
Score = 731 bits (1886), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/832 (46%), Positives = 512/832 (61%), Gaps = 62/832 (7%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +NVTYD RSLII+G R+++ S SIHYPRS P+MWP+L+A+AK+GG D ++T VFWN H
Sbjct: 25 GASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGH 84
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
E PGQ+ F R DLVRF+K V+ GL + LRIGP++ EW YGG+P WLH VPG VFR+
Sbjct: 85 EIAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRT 144
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG-MVEHSFLEKGPPYVRW 204
+NEPFK HMK + T IV+MMK +L+ASQGG IIL+QIENEYG E ++ G PY W
Sbjct: 145 NNEPFKNHMKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMW 204
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA +A+ TGVPW+MC++ DAPDPVIN+CNG C + F PNSP KP IWTENW ++Q
Sbjct: 205 AASMALAQNTGVPWIMCQESDAPDPVINSCNGFYC-DGFQ-PNSPTKPKIWTENWPGWFQ 262
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPL 323
+G+ R ED+A+ VA F K GS NYY+YHGGTNFGRT +T YD AP+
Sbjct: 263 TFGESNPHRPPEDVAFAVARFFEK-GGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPI 321
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVN 382
DEYGL R PKW HL+ELH +++LC +L G ++ QEA I+ S C AFL N
Sbjct: 322 DEYGLRRFPKWAHLRELHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLAN 381
Query: 383 KDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV----------------EQ 426
D N+ V F N Y+LP S+SILPDC+ V FNTAK+ S E+
Sbjct: 382 IDSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVPESLQASKPER 441
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS----DSESVLKVSS 482
W ++E + + N ++ +NTTKD++DYLWY F D S S +VL + S
Sbjct: 442 WSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGSYSSKGSHAVLNIDS 501
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH +HAF+N +GSA+G S F+++ ++L G N ++LLS+ VGL ++G E
Sbjct: 502 NGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLRTGKNELALLSMTVGLQNAGFAYEWI 561
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFT---DYGSRIVPWSRYGSSTHQ 598
AG NV+I G + + D SS +W Y++GL GE +F R +P S +Q
Sbjct: 562 GAGFTNVNISGVRTGIIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSE--PPKNQ 619
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-----------------S 641
PLTWYK D P G DPV I++ SMGKG AW+NG +IGRYW +
Sbjct: 620 PLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCNYRGT 679
Query: 642 FL-----TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
F+ T G P+Q WYHIPRS+ P+GN+LV+ EE+ G P I+ +VT++C VS
Sbjct: 680 FIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVTSVCSFVS 739
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRP-KVQIRCPSGRKISKILFASYGNPNGNCENY 755
+ H P I S ++ + G P K Q+ CP G+ IS + FAS GNP+G C +Y
Sbjct: 740 E-HFPS-IDLESWDESAMNE-----GTPPAKAQLSCPEGKSISSVKFASLGNPSGTCRSY 792
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+G CH NS ++VEKACL SCTV + E F D C G+ K L ++A C+
Sbjct: 793 QMGRCHHPNSLSVVEKACLNTNSCTVSLTDESFGKDLCHGVTKTLAIEADCS 844
>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
Length = 836
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 391/827 (47%), Positives = 519/827 (62%), Gaps = 58/827 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++ ING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 20 SVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 79
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLVRFIK V+ GLYV LRIGP++ EW +GG P WL +PGI FR++N
Sbjct: 80 PGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNNG 139
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK +M+R+ IV+MMKA L+ SQGGPIILSQIENEYG +E+ G Y +WAA++
Sbjct: 140 PFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQM 199
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDP+IN+CNG C + PN KP +WTE WT ++ +G
Sbjct: 200 AVGLGTGVPWVMCKQDDAPDPIINSCNGFYC--DYFSPNKAYKPKMWTEAWTGWFTEFGG 257
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 258 AVPYRPVEDLAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 316
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDKR 386
L+RQPKWGHLK+LH A+KLC ++SG M + QEA +F+ CAAFL N + R
Sbjct: 317 LVRQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPR 376
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKE 432
+ A V F N+ Y LPP SISILPDCK +NTA++ + W+ Y E
Sbjct: 377 SFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVPVPIHGAFSWQAYNE 436
Query: 433 AIPTYD-ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
P+ + E S L+EQ+NTT+D SDYLWY+ K DP + L V S GH
Sbjct: 437 EAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPTLTVLSAGH 496
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH F+N + G+A+G T K V+L G N +S+LS+ VGLP+ G + E AG
Sbjct: 497 ALHVFVNDQLSGTAYGSLEFPKITFSKGVNLRAGINKISILSIAVGLPNVGPHFETWNAG 556
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V++ G E +D S W Y+VG+ GE + + + GS V W+ GS + QPLT
Sbjct: 557 VLGPVTLNGLNEGRRDLSWQKWSYKVGVEGEAMSLHSLSGSSSVEWTA-GSFVARRQPLT 615
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------- 642
W+KT F+AP G+ P+A+++ SMGKG+ W+NG+SIGR+W ++
Sbjct: 616 WFKTTFNAPAGNSPLALDMNSMGKGQIWINGKSIGRHWPAYKASGSCGWCDYAGTFNEKK 675
Query: 643 -LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
L+ G SQ WYH+PRS+ PTGNLLV+ EE G P GIS+ V ++C + + P
Sbjct: 676 CLSNCGEASQRWYHVPRSWPNPTGNLLVVFEEWGGDPNGISLVRREVDSVCADIYEWQ-P 734
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
++++ Q Q + K +K + RPK ++C G+KIS + FAS+G P G C +Y GSCH
Sbjct: 735 TLMNY--QMQASGKVNKPL---RPKAHLQCGPGQKISSVKFASFGTPEGACGSYREGSCH 789
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGD-PCPGIPKALLVDAQCT 807
+ +S E+ C+G+ C+V V G+ P P + K L V+ C+
Sbjct: 790 AHHSYDAFERLCVGQNWCSVTVVPRNVSGEIPAPSVMKKLAVEVVCS 836
>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 830
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/817 (47%), Positives = 503/817 (61%), Gaps = 58/817 (7%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD +++++NG R+IL SGSIHYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + T IV+MMK+ L+ QGGPIILSQIENE+G +E E Y WAA +AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L T VPWVMCK+DDAPDP+IN CNG C + PN P KP +WTE WTS+Y +G
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTSWYTGFGIPV 267
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+AY VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DEYGLL
Sbjct: 268 PHRPVEDLAYGVAKFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLL 326
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE-CAAFLVNKDKRNN 388
R+PKWGHLKELH A+KLC +++G + + Q+A +F+ S++ C AFL NKDK +
Sbjct: 327 REPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSY 386
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-VEQ----------WEEYKEAIPTY 437
A V F+ + Y+LPP SISILPDCKT +NTA + S + Q W+ Y E I +
Sbjct: 387 ARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQISQMKMEWAGGFTWQSYNEDINSL 446
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYN--FRFKHD----PSDSESVLKVSSLGHVLHAFI 491
+ S LLEQ+N T+D +DYLWY D + +L V S GH LH F+
Sbjct: 447 GDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVMSAGHALHIFV 506
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVS 550
NG+ G+ +G D T V L +G+N +S LS+ VGLP+ G + E AG L V+
Sbjct: 507 NGQLTGTVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFETWNAGILGPVT 566
Query: 551 IQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
+ G E +D + W Y+VGL GE L + + GS V W QPL+WYK F+A
Sbjct: 567 LDGLNEGRRDLTWQKWTYKVGLKGEALSLHSLSGSSSVEWGE--PVQKQPLSWYKAFFNA 624
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQGTP 649
P G +P+A+++ SMGKG+ W+NGQ IGRYW + T G
Sbjct: 625 PDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGTCGICDYRGEYDEKKCQTNCGDS 684
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ 709
SQ WYH+PRS+L PTGNLLV+ EE G P GIS+ ++C VS+ P + +WR++
Sbjct: 685 SQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRIAGSICADVSEWQ-PSMANWRTK 743
Query: 710 NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIV 769
K H ++C GRK++ I FAS+G P G+C +Y+ G CH+ S I
Sbjct: 744 GYEKAKVH-----------LQCDHGRKMTHIKFASFGTPQGSCGSYSEGGCHAHKSYDIF 792
Query: 770 EKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
K+C+G+ C V V + F GDPCPG K +V+A C
Sbjct: 793 WKSCIGQERCGVSVVPDAFGGDPCPGTMKRAVVEAIC 829
>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
Length = 844
Score = 729 bits (1882), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/833 (46%), Positives = 511/833 (61%), Gaps = 64/833 (7%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +NVTYD RSLII+G R+++ S SIHYPRS P+MWP+L+A+AK+GG D ++T VFWN H
Sbjct: 25 GASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGH 84
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
E PGQ+ F R DLVRF+K V+ GL + LRIGP++ EW YGG+P WLH VPG VFR+
Sbjct: 85 EIAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRT 144
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG-MVEHSFLEKGPPYVRW 204
+NEPFK H+K + T IV+MMK +L+ASQGG IIL+QIENEYG E ++ G PY W
Sbjct: 145 NNEPFKNHVKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMW 204
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA +A+ TGVPW+MC++ DAPDPVIN+CNG C + F PNSP KP IWTENW ++Q
Sbjct: 205 AASMALAQNTGVPWIMCQESDAPDPVINSCNGFYC-DGFQ-PNSPTKPKIWTENWPGWFQ 262
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPL 323
+G+ R ED+A+ VA F K GS NYY+YHGGTNFGRT +T YD AP+
Sbjct: 263 TFGESNPHRPPEDVAFAVARFFEK-GGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPI 321
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVN 382
DEYGL R PKW HL++LH +++LC +L G ++ QEA I+ S C AFL N
Sbjct: 322 DEYGLRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLAN 381
Query: 383 KDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV----------------EQ 426
D N+ V F N Y+LP S+SILPDC+ V FNTAK+ S E+
Sbjct: 382 IDSANDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQTSMVTMVPESLQASKPER 441
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS----DSESVLKVSS 482
W ++E + + N ++ +NTTKD++DYLWY F D S S +VL + S
Sbjct: 442 WSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGSYSSKGSHAVLNIDS 501
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH +HAF+N +GSA+G S F+++ ++L G N ++LLS+ VGL ++G E
Sbjct: 502 NGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLRTGKNELALLSMTVGLQNAGFAYEWI 561
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFT---DYGSRIVPWSRYGSSTHQ 598
AG NV+I G + D SS +W Y++GL GE +F R +P S +Q
Sbjct: 562 GAGFTNVNISGVRTGTIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSE--PPKNQ 619
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-----------------S 641
PLTWYK D P G DPV I++ SMGKG AW+NG +IGRYW +
Sbjct: 620 PLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCNYRGT 679
Query: 642 FL-----TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
F+ T G P+Q WYHIPRS+ P+GN+LV+ EE+ G P I+ +VT++C VS
Sbjct: 680 FIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVTSVCSFVS 739
Query: 697 DSHLPPVI--SWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCEN 754
+ H P + SW + T P K Q+ CP G+ IS + FAS GNP+G C +
Sbjct: 740 E-HFPSIDLESW----DESAMTEGTPPA---KAQLFCPEGKSISSVKFASLGNPSGTCRS 791
Query: 755 YAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
Y +G CH NS ++VEKACL SCTV + E F D CPG+ K L ++A C+
Sbjct: 792 YQMGRCHHPNSLSVVEKACLNTNSCTVSLTDESFGKDLCPGVTKTLAIEADCS 844
>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 843
Score = 729 bits (1881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/832 (45%), Positives = 509/832 (61%), Gaps = 62/832 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YDGRSL+I+G RK+L S SIHYPRS P MWP L+ AKEGG+DV++T VFWN HE
Sbjct: 21 NVSYDGRSLLIDGQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGHELS 80
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG + F GR DLV+F K VQ G+Y+ LRIGPF+ EW +GG+P WLH VPG VFR+ N+
Sbjct: 81 PGNYYFGGRFDLVKFAKTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTYNQ 140
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF +HM+++ T IVN+MK +L+ASQGGPIILSQIENEYG E+ + E G Y WAAK+
Sbjct: 141 PFMYHMQKFTTYIVNLMKQEKLFASQGGPIILSQIENEYGYYENFYKEDGKKYALWAAKM 200
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV TGVPW+MC+Q DAPDPVI+ CN C + P SP++P IWTENW +++ +G
Sbjct: 201 AVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQ--FTPTSPNRPKIWTENWPGWFKTFGG 258
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA F K GS NYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 259 RDPHRPAEDVAFSVARFFQK-GGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYG 317
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLKELH A+KLC +L+G V+++ EA ++ SS CAAF+ N D +
Sbjct: 318 LPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFISNVDDK 377
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-----------DSVEQ--------- 426
N+ TV F N Y LP S+SILPDCK V FNTAK+ +S++Q
Sbjct: 378 NDKTVEFRNASYHLPAWSVSILPDCKNVVFNTAKVTSQTNVVAMIPESLQQSDKGVNSLK 437
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKV 480
W+ KE + + + ++ +NTTKD +DYLW+ ++ S+ VL +
Sbjct: 438 WDIVKEKPGIWGKADFVKSGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKGSKPVLLI 497
Query: 481 SSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLE 540
S GH LHAF+N E+ G+ G + F+ + + L G N ++LL + VGL +G + +
Sbjct: 498 ESTGHALHAFVNQEYQGTGTGNGTHSPFSFKNPISLRAGKNEIALLCLTVGLQTAGPFYD 557
Query: 541 RRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQ 598
AGL +V I+G K D SS++W Y++G+ GE L+++ G V W+ Q
Sbjct: 558 FIGAGLTSVKIKGLKNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNKVNWTSTSEPQKMQ 617
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV------------------ 640
PLTWYK + DAP G +PV ++++ MGKG AW+NG+ IGRYW
Sbjct: 618 PLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSEDCVKECDYRG 677
Query: 641 -----SFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
T G P+Q WYH+PRS+ KP+GN+LVL EE+ G P I V+ C V
Sbjct: 678 KFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSGACALV 737
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
++ + P + SQ + ++ +K + P + CPS +IS + FAS+G P+G+C +Y
Sbjct: 738 AEDY--PSVGLLSQGEDKIQNNKNV----PFAHLTCPSNTRISAVKFASFGTPSGSCGSY 791
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G CH NS IVEKACL K C + + E F + CPG+ + L V+A C+
Sbjct: 792 LKGDCHDPNSSTIVEKACLNKNDCVIKLTEENFKTNLCPGLSRKLAVEAVCS 843
>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
Length = 839
Score = 728 bits (1879), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/831 (45%), Positives = 507/831 (61%), Gaps = 66/831 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+NVTYD R+L+I+G R++L SGSIHYPRSTPQMWP LI K+K+GG+DV++T VFWNLHEP
Sbjct: 24 SNVTYDHRALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEP 83
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
GQ++F GR DLV F+K V A GLYV LRIGP++ EW YGG P WLH + GI FR++N
Sbjct: 84 VRGQYNFEGRGDLVGFVKAVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNN 143
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK MKR+ IV+MMK LYASQGGPIILSQIENEYG ++ Y+ WAA
Sbjct: 144 EPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAAS 203
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A L TGVPW+MC+Q +APDP+IN CN C + PNS +KP +WTENW+ ++ +G
Sbjct: 204 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQ--FTPNSDNKPKMWTENWSGWFLAFG 261
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R ED+A+ VA F + G++ NYYMYHGGTNFGRT ++ YD AP+DEY
Sbjct: 262 GAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEY 320
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
G +RQPKWGHLK+LH A+KLC + +++ + E +++ + C+AFL N
Sbjct: 321 GDIRQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVYKTGAVCSAFLANIG-M 379
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYK--------------- 431
++ATV F+ Y LP S+SILPDCK V NTAK+++ +
Sbjct: 380 SDATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLKEKVDSLDSSS 439
Query: 432 -------EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKH-DPSDSESVLKVSSL 483
E + + + LLEQ+NTT D SDYLWY+ + D + + VL + SL
Sbjct: 440 SGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVYEDNAGDQPVLHIESL 499
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH LHAF+NG+ GS G + ++ + L+ G N + LLS+ VGL + GA+ +
Sbjct: 500 GHALHAFVNGKLAGSKAGSSGNAKVNVDIPITLVTGKNTIDLLSLTVGLQNYGAFYDTVG 559
Query: 544 AGLRN-VSIQGAKELK--DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGSSTHQP 599
AG+ V ++G K D +S W YQVGL GE + + + + W S+ +QP
Sbjct: 560 AGITGPVILKGLKNGSSVDLTSQQWTYQVGLQGEFVGLSS---GNVGQWNSQSNLPANQP 616
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
LTWYKT F AP+GS+PVAI+ MGKGEAWVNGQSIGRYW ++++P
Sbjct: 617 LTWYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTY 676
Query: 647 ---------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
G PSQ+ YH+PR++LKP N VL EE G P IS T + ++C HV++
Sbjct: 677 SASKCLKNCGKPSQTLYHVPRAWLKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTE 736
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGNCENYA 756
SH PPV +W S + K P + + CP + IS I FAS+G P G C NY
Sbjct: 737 SHPPPVDTWNSNAESERKVG-------PVLSLECPYPNQAISSIKFASFGTPRGTCGNYN 789
Query: 757 IGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
GSC S+ + +IV+KAC+G SC + V F G+PC G+ K+L V+A CT
Sbjct: 790 HGSCSSNRALSIVQKACIGSSSCNIGVSINTF-GNPCRGVTKSLAVEAACT 839
>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
Length = 897
Score = 728 bits (1879), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/875 (44%), Positives = 517/875 (59%), Gaps = 108/875 (12%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQ------------------------------- 59
TYD ++++I+G R+ILFSGSIHYPRSTP
Sbjct: 30 TYDKKAVLIDGQRRILFSGSIHYPRSTPDVISCILQNLSFFFSPLLPRGGGEFMAVVSCV 89
Query: 60 ---------------------MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
MW LI KAK+GGLDV+QT VFWN HEP PG + F R
Sbjct: 90 LDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERY 149
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DLVRF+K VQ GL+V LRIGP+I GEW +GG P WL VPGI FR+DNEPFK M+ +
Sbjct: 150 DLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFT 209
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
IV MMK+ L+ASQGGPIILSQIENEYG F G Y+ WAAK+AV L TGVPW
Sbjct: 210 EKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPW 269
Query: 219 VMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
VMCK++DAPDPVINACNG C + F+ PN P KP +WTE W+ ++ +G R R ED+
Sbjct: 270 VMCKEEDAPDPVINACNGFYC-DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDL 327
Query: 279 AYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHL 337
A+ VA F+ K GS++NYYMYHGGTNFGRTA +T YD AP+DEYGL+R+PK HL
Sbjct: 328 AFAVARFVQK-GGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHL 386
Query: 338 KELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLM 397
KELH AVKLC + ++S +QEA +F+ S CAAFL N + ++A V F+N
Sbjct: 387 KELHRAVKLCEQALVSVDPTITTLGTMQEAHVFRSPSGCAAFLANYNSNSHAKVVFNNEQ 446
Query: 398 YELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEAIPTYDETSLRA 444
Y LPP SISILPDCK V FN+A + + WE Y E + + L
Sbjct: 447 YSLPPWSISILPDCKNVVFNSATVGVQTSQMQMWGDGATSMMWERYDEEVDSLAAAPLLT 506
Query: 445 NF-LLEQMNTTKDASDYLWYNFRFKHDPSDS-------ESVLKVSSLGHVLHAFINGEFV 496
LLEQ+N T+D+SDYLWY PS++ L V S GH LH F+NG+
Sbjct: 507 TTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQ 566
Query: 497 GSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAK 555
GS++G D+ V+L GTN ++LLSV GLP+ G + E G+ V + G
Sbjct: 567 GSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLN 626
Query: 556 E-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYKTVFDAPTG 612
E +D + +W YQVGL GE++ + + GS V W + + QPL WYK F+ P+G
Sbjct: 627 EGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSG 686
Query: 613 SDPVAINLISMGKGEAWVNGQSIGRYWV--------------SFLTPQ-----GTPSQSW 653
+P+A+++ SMGKG+ W+NGQSIGRYW +F P+ G P+Q W
Sbjct: 687 DEPLALDMGSMGKGQVWINGQSIGRYWTAYADGDCKGCSYTGTFRAPKCQAGCGQPTQRW 746
Query: 654 YHIPRSFLKPTGNLLVLLEE-ENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQR 712
YH+PRS+L+P+ NLLV+LEE G I++ SV+++C VS+ H P + W+
Sbjct: 747 YHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADVSEDH-PNIKKWQ----- 800
Query: 713 TLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKA 772
++++ RR KV +RC G+ IS I FAS+G P G C N+ G CHS++S A++EK
Sbjct: 801 -IESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKR 859
Query: 773 CLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C+G + C V + + F GDPCP + K + V+A C+
Sbjct: 860 CIGLQRCVVAISPDNFGGDPCPSVTKRVAVEAVCS 894
>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/835 (46%), Positives = 512/835 (61%), Gaps = 70/835 (8%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G NVTYD R+L+I+G R++L SGSIHYPRST +MW LI K+K+GGLDV++T VFWN HE
Sbjct: 29 GVNVTYDHRALLIDGKRRVLVSGSIHYPRSTVEMWADLIQKSKDGGLDVIETYVFWNAHE 88
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P Q++F GR DLV+FIK V GLY LRIGP++ EW YGG P WLH VPGI FR+D
Sbjct: 89 PVQNQYNFEGRYDLVKFIKLVGEAGLYAHLRIGPYVCAEWNYGGFPLWLHFVPGIKFRTD 148
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK M+R+ IV+MMK +LYASQGGPIILSQIENEYG ++ S+ Y+ WAA
Sbjct: 149 NEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSSYGPAAKSYINWAA 208
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS +KP +WTENW+ ++ +
Sbjct: 209 SMAVSLDTGVPWVMCQQADAPDPIINTCNGFYCDQF--TPNSKNKPKMWTENWSGWFLSF 266
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQAPLDE 325
G R ED+A+ VA F ++ G++ NYYMYHGGTNFGR T ++ T Y APLDE
Sbjct: 267 GGAVPYRPVEDLAFAVARFY-QLGGTFQNYYMYHGGTNFGRSTGGPFISTSYDYDAPLDE 325
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKD 384
YGL RQPKWGHLK+LH ++KLC + +++ V+ + + EA +++ G+ C+AFL N
Sbjct: 326 YGLTRQPKWGHLKDLHKSIKLCEEALVATDPVTSSLGQNLEATVYKTGTGLCSAFLANFG 385
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV-------------------- 424
++ TV F+ Y LP S+SILPDCK VA NTAK++S+
Sbjct: 386 T-SDKTVNFNGNSYNLPGWSVSILPDCKNVALNTAKINSMTVIPNFVHQSLIGDADSADT 444
Query: 425 --EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDP---SDSES 476
W E + + LLEQ+NTT D SDYLWY+ ++P S++
Sbjct: 445 LGSSWSWIYEPVGISKNDAFVKPGLLEQINTTADKSDYLWYSLSTVIKDNEPFLEDGSQT 504
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
VL V SLGH LHAF+NG+ GS G + +E V L+ G N + LLS+ GL + G
Sbjct: 505 VLHVESLGHALHAFVNGKLAGSGTGNAGNAKVAVEIPVTLLPGKNTIDLLSLTAGLQNYG 564
Query: 537 AYLERRVAGLRN-VSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
A+ E AG+ V ++G K D SS W YQ+GL GE+L + + + ++
Sbjct: 565 AFFELEGAGITGPVKLEGLKNGTTVDLSSLQWTYQIGLKGEELGLSSGNSQWV---TQPA 621
Query: 594 SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------- 646
T QPL WYKT F+AP G+DP+AI+ MGKGEAWVNGQSIGRYW + ++P
Sbjct: 622 LPTKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGEAWVNGQSIGRYWPTKVSPTSGCSNCN 681
Query: 647 --------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLC 692
PSQ+ YH+PRS+++ +GN LVL EE G P I+ T +LC
Sbjct: 682 YRGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGNTLVLFEEIGGDPTQIAFATKQSASLC 741
Query: 693 GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGN 751
HVS+SH PV W S ++ K P + + CP + IS I FAS+G P G
Sbjct: 742 SHVSESHPLPVDMWSSNSEAERKAG-------PVLSLECPFPNQVISSIKFASFGTPRGT 794
Query: 752 CENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C +++ G C S+ + +IV+KAC+G +SC++ F GDPC G+ K+L V+A C
Sbjct: 795 CGSFSHGQCKSTRALSIVQKACIGSKSCSIGASASTF-GDPCRGVAKSLAVEASC 848
>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
Length = 898
Score = 726 bits (1875), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 375/832 (45%), Positives = 509/832 (61%), Gaps = 62/832 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YDGRSLII+ RK+L S SIHYPRS P MWP L+ AKEGG+DV++T VFWN HE
Sbjct: 76 NVSYDGRSLIIDAQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGHELS 135
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG + F GR DLV+F + VQ G+Y+ LRIGPF+ EW +GG+P WLH VPG VFR+ N+
Sbjct: 136 PGNYYFGGRFDLVKFAQTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTYNQ 195
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF +HM+++ T IVN+MK +L+ASQGGPIIL+QIENEYG E+ + E G Y WAAK+
Sbjct: 196 PFMYHMQKFTTYIVNLMKQEKLFASQGGPIILAQIENEYGYYENFYKEDGKKYALWAAKM 255
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV TGVPW+MC+Q DAPDPVI+ CN C + P SP++P IWTENW +++ +G
Sbjct: 256 AVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQ--FTPTSPNRPKIWTENWPGWFKTFGG 313
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA F K GS NYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 314 RDPHRPAEDVAFSVARFFQK-GGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYG 372
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLKELH A+KLC +L+G V+++ EA ++ SS CAAF+ N D +
Sbjct: 373 LPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFISNVDDK 432
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-----------DSVEQ--------- 426
N+ TV F N + LP S+SILPDCK V FNTAK+ +S++Q
Sbjct: 433 NDKTVEFRNASFHLPAWSVSILPDCKNVVFNTAKVTSQTSVVAMVPESLQQSDKVVNSFK 492
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKV 480
W+ KE + + N ++ +NTTKD +DYLW+ ++ ++ VL +
Sbjct: 493 WDIVKEKPGIWGKADFVKNGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKGNKPVLLI 552
Query: 481 SSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLE 540
S GH LHAF+N E+ G+ G + FT + + L G N ++LL + VGL +G + +
Sbjct: 553 ESTGHALHAFVNQEYEGTGSGNGTHAPFTFKNPISLRAGKNEIALLCLTVGLQTAGPFYD 612
Query: 541 RRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH-Q 598
AGL +V I+G D SS++W Y++G+ GE L+++ G V W+ Q
Sbjct: 613 FVGAGLTSVKIKGLNNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNNVNWTSTSEPPKMQ 672
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV------------------ 640
PLTWYK + DAP G +PV ++++ MGKG AW+NG+ IGRYW
Sbjct: 673 PLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSEDCVKECDYRG 732
Query: 641 -----SFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
T G P+Q WYH+PRS+ KP+GN+LVL EE+ G P I V+ C V
Sbjct: 733 KFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSGACALV 792
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
++ + P ++ SQ + ++++K IP R + CP +IS + FAS+G+P+G C +Y
Sbjct: 793 AEDY--PSVALVSQGEDKIQSNKNIPFAR----LACPGNTRISAVKFASFGSPSGTCGSY 846
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G CH NS IVEKACL K C + + E F + CPG+ + L V+A C+
Sbjct: 847 LKGDCHDPNSSTIVEKACLNKNDCVIKLTEENFKSNLCPGLSRKLAVEAVCS 898
>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
Length = 822
Score = 726 bits (1873), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/818 (47%), Positives = 507/818 (61%), Gaps = 60/818 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+TYD +++++NG R+IL SGSIHYPRSTP+MWP LI KAK+GGLDVVQT VFWN HEP P
Sbjct: 23 LTYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSP 82
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEP
Sbjct: 83 GQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 142
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ T IV MMK+ L+ QGGPIILSQIENE+G +E E Y WAA +A
Sbjct: 143 FKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 202
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPW+MCK+DDAPDP+IN CNG C + PN P KP +WTE WT++Y +G
Sbjct: 203 VALNTGVPWIMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTAWYTGFGIP 260
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+AY VA FI K GS+VNYYM+HGGTNFGRTA ++ T Y AP+DEYGL
Sbjct: 261 VPHRPVEDLAYGVAKFIQK-GGSFVNYYMFHGGTNFGRTAGGPFIATSYDYDAPIDEYGL 319
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LR+PKWGHLK+LH A+KLC +++G + + Q++ +F+ S+ CAAFL NKDK +
Sbjct: 320 LREPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLDNKDKVS 379
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-VEQ----------WEEYKEAIPT 436
A V F+ + Y+LPP SISILPDCKT FNTA++ S + Q W+ Y E I +
Sbjct: 380 YARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQISQMKMEWAGGFAWQSYNEEINS 439
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVL------KVSSLGHVLHAF 490
+ E LLEQ+N T+D +DYLWY D + + L K++ + ++
Sbjct: 440 FGEDPFTTVGLLEQINVTRDNTDYLWYTTYV--DVAQDDQFLSNGENPKLTVMCFLILNI 497
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
+ G+ +G D T V L G+N +S LS+ VGLP+ G + E AG L V
Sbjct: 498 LFNLLAGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPV 557
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
++ G E +D + W YQVGL GE + + + GS V W QPLTWYK F+
Sbjct: 558 TLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGE--PVQKQPLTWYKAFFN 615
Query: 609 APTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQGT 648
AP G +P+A+++ SMGKG+ W+NGQ IGRYW + T G
Sbjct: 616 APDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGNCGTCDYRGEYDETKCQTNCGD 675
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRS 708
SQ WYH+PRS+L PTGNLLV+ EE G P GIS+ S+ ++C VS+ P + +W +
Sbjct: 676 SSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWHT 734
Query: 709 QNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAI 768
++ K H ++C +G+KI++I FAS+G P G+C +Y+ G CH+ S I
Sbjct: 735 KDYEKAKVH-----------LQCDNGQKITEIKFASFGTPQGSCGSYSEGGCHAHKSYDI 783
Query: 769 VEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
K C+G+ C V V E F GDPCPG K +V+A C
Sbjct: 784 FWKNCVGQERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 821
>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 837
Score = 725 bits (1871), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/805 (45%), Positives = 500/805 (62%), Gaps = 32/805 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+ VTYDGRSL+I+G R + FSG+IHYPRS P++WP+LI +AKEGGL+ ++T +FWN HE
Sbjct: 33 GSVVTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG+++F GR DL++++K +Q +Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+P+K M+++ IV +K A L+ASQGGPIIL+QIENEYG ++ G Y+ WAA
Sbjct: 153 NDPYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ QTGVPW+MCKQ AP VI CNGR CG+T+ +KP +WTENWT ++ Y
Sbjct: 213 QMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWT-LRDKNKPMLWTENWTQQFRAY 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ +RSAEDIAY V F AK GS VNYYMYHGGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 272 GDQVAMRSAEDIAYAVLRFFAK-GGSLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEY 330
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ ++ K L G S EA IF+ E C +FL N +
Sbjct: 331 GMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNN 390
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------DSVEQWEE 429
+ TV F + +P S+SIL CK V +NT ++ QWE
Sbjct: 391 TGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSYHTSEVTSKNNQWEM 450
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y E IP Y +T +R LEQ N TKDASDYLWY +FR + D +D VL+V S
Sbjct: 451 YSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVKSS 510
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F N FVG A G K F EK V L G N+V LLS +G+ DSG L
Sbjct: 511 AHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAEVK 570
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
+G++ IQG D WG++ L GE +I+++ G V W + + TW
Sbjct: 571 SGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWKP--AENGRAATW 628
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK FD P G DPV +++ SM KG +VNG+ +GRYWVS+ T GTPSQ+ YHIPR FLK
Sbjct: 629 YKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSYRTLAGTPSQALYHIPRPFLK 688
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
NLLV+ EEE G P GI + TV+ +C +S+ + + +W + + +K
Sbjct: 689 SKDNLLVVFEEEMGKPDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDK-IKLIAEDHS 747
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
RR + CP + I +++FAS+GNP G C N+ +G+CH+ N++ IVEK CLGK SC +P
Sbjct: 748 RRG--TLMCPPEKTIQEVVFASFGNPEGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLP 805
Query: 783 VWTEKFYGD-PCPGIPKALLVDAQC 806
V + D C L V +C
Sbjct: 806 VDHTVYGADINCQSTTATLGVQVRC 830
>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
Length = 847
Score = 724 bits (1869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/835 (44%), Positives = 503/835 (60%), Gaps = 65/835 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD RSLII+G RK+L S SIHYPRS P MWP L+ AKEGG+DV++T VFWN HE
Sbjct: 22 NVTYDRRSLIIDGQRKLLISASIHYPRSVPGMWPGLVKTAKEGGIDVIETYVFWNGHELS 81
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
P + F GR DL++F+K VQ +Y+ LR+GPF+ EW +GG+P WLH VPG VFR+++E
Sbjct: 82 PDNYYFGGRYDLLKFVKIVQQARMYLILRVGPFVAAEWNFGGVPVWLHYVPGTVFRTNSE 141
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK+HM+++ T+IVN+MK +L+ASQGGPIIL+Q+ENEYG E + + G PY WAA +
Sbjct: 142 PFKYHMQKFMTLIVNIMKKEKLFASQGGPIILAQVENEYGDTERIYGDGGKPYAMWAANM 201
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ GVPW+MC+Q DAPDPVIN CN C + PNSP+KP +WTENW +++ +G
Sbjct: 202 ALSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQ--FTPNSPNKPKMWTENWPGWFKTFGA 259
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R EDIA+ VA F K GS NYYMYHGGTNFGRT+ +T YD AP+DEYG
Sbjct: 260 PDPHRPHEDIAFSVARFFQK-GGSLQNYYMYHGGTNFGRTSGGPFITTSYDYNAPIDEYG 318
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLKELH A+K C +L G ++++ QE ++ SS CAAF+ N D++
Sbjct: 319 LARLPKWGHLKELHRAIKSCEHVLLYGEPINLSLGPSQEVDVYTDSSGGCAAFISNVDEK 378
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE----------------- 425
+ + F N+ Y +P S+SILPDCK V FNTAK+ S VE
Sbjct: 379 EDKIIVFQNVSYHVPAWSVSILPDCKNVVFNTAKVGSQTSQVEMVPEELQPSLVPSNKDL 438
Query: 426 ---QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SES 476
QWE + E + E N ++ +NTTKD +DYLWY S+ S+
Sbjct: 439 KGLQWETFVEKAGIWGEADFVKNGFVDHINTTKDTTDYLWYTVSLTVGESENFLKEISQP 498
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
VL V S GH LHAF+N + GSA G S F E + L G N+++LLS+ VGL ++G
Sbjct: 499 VLLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAGKNDIALLSMTVGLQNAG 558
Query: 537 AYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGS 594
+ E AGL +V I+G + D S+++W Y++GL GE L I+ G V W S
Sbjct: 559 PFYEWVGAGLTSVKIKGLNNGIMDLSTYTWTYKIGLQGEHLLIYKPEGLNSVKWLSTPEP 618
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-------------- 640
QPLTWYK V D P+G++P+ ++++ MGKG AW+NG+ IGRYW
Sbjct: 619 PKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKGLAWLNGEEIGRYWPRKSSIHDKCVQECD 678
Query: 641 ---SFL-----TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLC 692
F+ T G P+Q WYH+PRS+ KP+GN+LV+ EE+ G P I T +C
Sbjct: 679 YRGKFMPNKCSTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTKIRFSRRKTTGVC 738
Query: 693 GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC 752
VS+ H P S ++ + +K + + ++CP IS + FASYG P G C
Sbjct: 739 ALVSEDH--PTYELESWHKDANENNK----NKATIHLKCPENTHISSVKFASYGTPTGKC 792
Query: 753 ENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+Y+ G CH NS ++VEK C+ K C + + + F D CP K L V+A C+
Sbjct: 793 GSYSQGDCHDPNSASVVEKLCIRKNDCAIELAEKNFSKDLCPSTTKKLAVEAVCS 847
>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
Length = 916
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/824 (45%), Positives = 495/824 (60%), Gaps = 55/824 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGRSLII+G R++L S SIHYPRS P MWP+L+A+AK+GG D ++T VFWN HE P
Sbjct: 102 VTYDGRSLIISGRRRLLISTSIHYPRSVPAMWPKLVAEAKDGGADCIETYVFWNGHETAP 161
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++ F R DLVRF K V+ GLY+ LRIGPF+ EW +GG+P WLH +PG VFR++NEP
Sbjct: 162 GEYYFEDRFDLVRFAKVVKDAGLYLMLRIGPFVAAEWNFGGVPVWLHYIPGAVFRTNNEP 221
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK HMK + T IV+MMK R +ASQGG IIL+QIENEYG E ++ G Y WAA +A
Sbjct: 222 FKSHMKSFTTKIVDMMKRERFFASQGGHIILAQIENEYGDTEQAYGADGKAYAMWAASMA 281
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ TGVPW+MC+Q DAP+ VIN CN C + NSP KP IWTENW ++Q +G+
Sbjct: 282 LAQNTGVPWIMCQQYDAPEHVINTCNSFYCDQFKT--NSPTKPKIWTENWPGWFQTFGES 339
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R ED+A+ VA F K GS NYY+YHGGTNFGRT +T YD AP+DEYGL
Sbjct: 340 NPHRPPEDVAFSVARFFQK-GGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGL 398
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKRN 387
R PKW HL++LH ++KLC +L G L S++ QEA ++ S C AFL N D N
Sbjct: 399 TRLPKWAHLRDLHKSIKLCEHSLLYGNLTSLSLGTKQEADVYTDHSGGCVAFLANIDPEN 458
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV----------------EQWEEYK 431
+ V F + Y+LP S+SILPDCK FNTAK+ S ++W ++
Sbjct: 459 DTVVTFRSRQYDLPAWSVSILPDCKNAVFNTAKVQSQTLMVDMVPETLQSTKPDRWSIFR 518
Query: 432 EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS----DSESVLKVSSLGHVL 487
E +D+ N ++ +NTTKD++DYLW+ F D S + +L + S GH +
Sbjct: 519 EKTGIWDKNDFIRNGFVDHINTTKDSTDYLWHTTSFNVDRSYPTNGNRELLSIDSKGHAV 578
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
HAF+N E +GSA+G S SF + + L G N ++LLS+ VGL ++G + E AGL
Sbjct: 579 HAFLNNELIGSAYGNGSKSSFNVHMPIKLKPGKNEIALLSMTVGLQNAGPHYEWVGAGLT 638
Query: 548 NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH-QPLTWYKT 605
+V+I G K D SS +W Y++GL GE +F WS QPLTWYK
Sbjct: 639 SVNISGMKNGSIDLSSNNWAYKIGLEGEHYGLFKPDQGNNQRWSPQSEPPKGQPLTWYKV 698
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV------SFLTPQ------------- 646
D P G DPV I++ SMGKG AW+NG +IGRYW TP
Sbjct: 699 NVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSSDDRCTPSCNYRGPFNPSKCR 758
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPV 703
G P+Q WYH+PRS+ P+GN LV+ EE+ G P I+ T +C VS+++ P
Sbjct: 759 TGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATKVCSFVSENY--PS 816
Query: 704 ISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS 763
I S ++ K KVQ+ CP G+ IS + FAS+G+P+G C +Y G CH
Sbjct: 817 IDLESWDKSISDDGKDT----AKVQLSCPKGKNISSVKFASFGDPSGTCRSYQQGRCHHP 872
Query: 764 NSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+S ++VEKACL SCTV + E F D CPG+ K L ++A C+
Sbjct: 873 SSLSVVEKACLNINSCTVSLSDEGFGKDLCPGVAKTLAIEADCS 916
>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
Length = 851
Score = 721 bits (1860), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/825 (45%), Positives = 499/825 (60%), Gaps = 54/825 (6%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD RSLII+G R++L S SIHYPRS P+MWP+L+A+AK+GG D V+T VFWN HEP
Sbjct: 36 SSVTYDQRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 95
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
GQ+ F R DLVRF K V+ GLY+ LRIGPF+ EW +GG+P WLH PG VFR++N
Sbjct: 96 AQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNN 155
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK HMKR+ T IV+MMK + +ASQGG IIL+Q+ENEYG +E ++ PY WAA
Sbjct: 156 EPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAAS 215
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A+ TGVPW+MC+Q DAPDPVIN CN C + PNSP KP WTENW ++Q +G
Sbjct: 216 MALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQ--FKPNSPTKPKFWTENWPGWFQTFG 273
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
+ R ED+A+ VA F K GS NYY+YHGGTNFGRT +T YD AP+DEY
Sbjct: 274 ESNPHRPPEDVAFSVARFFGK-GGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEY 332
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDK 385
GL R PKW HL++LH ++KL +L G ++ QEA ++ S C AFL N D
Sbjct: 333 GLRRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDS 392
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----------------VEQWEE 429
+ V F + Y+LP S+SILPDCK VAFNTAK+ S V+ W
Sbjct: 393 EKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVDGWSI 452
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD---SESVLKVSSLGHV 486
++E + L N ++ +NTTKD++DYLWY F D S VL + S GH
Sbjct: 453 FREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIESKGHA 512
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+ AF+N E +GSA+G S +F++E V+L G N +SLLS+ VGL + G E AG+
Sbjct: 513 VQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAGAGI 572
Query: 547 RNVSIQGAK-ELKDFSSFSWGYQVGLLGEKLQIF-TDYGSRIVPWSRYGSSTHQPLTWYK 604
+V I G + + D SS W Y++GL GE +F D G I + +QP+TWYK
Sbjct: 573 TSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQPMTWYK 632
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------------LTPQ-- 646
D P G DPV +++ SMGKG AW+NG +IGRYW +P
Sbjct: 633 VNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTFSPNKC 692
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
G P+Q WYH+PRS+ P+GN LV+ EE+ G P I+ +V ++C VS+ + P
Sbjct: 693 RRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSEHY--P 750
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
I S ++ T + KVQ+ CP G+ IS + FAS+GNP+G C +Y GSCH
Sbjct: 751 SIDLESWDRNTQNDGRDA----AKVQLSCPKGKSISSVKFASFGNPSGTCRSYQQGSCHH 806
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS ++VEKACL CT+ + E F D CPG+ K L ++A C+
Sbjct: 807 PNSISVVEKACLNMNGCTLSLSDEGFGEDLCPGVTKTLAIEADCS 851
>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 845
Score = 720 bits (1858), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/826 (45%), Positives = 501/826 (60%), Gaps = 59/826 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD RSL+I+G R++L S SIHYPRS P MWP+L+A+AKEGG D ++T VFWN HE P
Sbjct: 31 VTYDHRSLVISGRRRLLISASIHYPRSVPAMWPKLVAEAKEGGADCIETYVFWNGHETAP 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++ F R DLV+F + V+ GL++ LRIGPF+ EW +GG+P WLH +PG VFR++NEP
Sbjct: 91 GKYYFEDRFDLVQFARVVKDAGLFLMLRIGPFVAAEWNFGGVPAWLHYIPGTVFRTNNEP 150
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK HMK + T IV+MMK R +ASQGG IIL+QIENEYG + ++ G Y WA +A
Sbjct: 151 FKSHMKSFTTKIVDMMKEQRFFASQGGHIILAQIENEYGYYQQAYGAGGKAYAMWAGSMA 210
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
TGVPW+MC+Q D PD VIN CN C + PNSP +P IWTENW ++Q +G+
Sbjct: 211 QAQNTGVPWIMCQQYDVPDRVINTCNSFYCDQ--FKPNSPTQPKIWTENWPGWFQTFGES 268
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R ED+A+ VA F K GS NYY+YHGGTNF RTA +T YD AP+DEYGL
Sbjct: 269 NPHRPPEDVAFSVARFFGK-GGSVQNYYVYHGGTNFDRTAGGPFITTSYDYDAPIDEYGL 327
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKRN 387
R PKW HLKELH ++KLC +L G ++ QEA ++ S C AFL N D
Sbjct: 328 RRLPKWAHLKELHQSIKLCEHSLLFGNSTLLSLGPQQEADVYTDHSGGCVAFLANIDSEK 387
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV----------------EQWEEYK 431
+ V F N Y+LP S+SILPDCK V FNTAK+ S +QW +
Sbjct: 388 DRVVTFRNRQYDLPAWSVSILPDCKNVVFNTAKVRSQTLMVDMVPGTLQASKPDQWSIFT 447
Query: 432 EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD----PSDSESVLKVSSLGHVL 487
E I +D+ N ++ +NTTKD++DYLW+ F D S + VL + S GH +
Sbjct: 448 ERIGVWDKNDFVRNEFVDHINTTKDSTDYLWHTTSFDVDRNYPSSGNHPVLNIDSKGHAV 507
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
HAF+N +GSA+G S+ SF+ ++L G N +++LS+ VGL +G Y E AGL
Sbjct: 508 HAFLNNMLIGSAYGNGSESSFSAHMPINLKAGKNEIAILSMTVGLKSAGPYYEWVGAGLT 567
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFT-DYGS--RIVPWSRYGSSTHQPLTWY 603
+V+I G K D SS +W Y+VGL GE +F D G+ R P S+ HQPLTWY
Sbjct: 568 SVNISGMKNGTTDLSSNNWAYKVGLEGEHYGLFKHDQGNNQRWRPQSQ--PPKHQPLTWY 625
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------------LTPQ- 646
K D P G DPV +++ SMGKG W+NG +IGRYW +P
Sbjct: 626 KVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAIGRYWPRTSPTNDRCTTSCDYRGKFSPNK 685
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+ P+GN LV+ EE+ G P I+ T++C VS+++
Sbjct: 686 CRVGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATSVCSFVSENY-- 743
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
P I S + +++ R+ KVQ+ CP G+ IS + FAS+G+P+G C +Y GSCH
Sbjct: 744 PSIDLESWD-KSISDDGRVAA---KVQLSCPKGKNISSVKFASFGDPSGTCRSYQQGSCH 799
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+S ++VEKAC+ SCTV + E F DPCPG+ K L ++A C+
Sbjct: 800 HPDSVSVVEKACMNMNSCTVSLSDEGFGEDPCPGVTKTLAIEADCS 845
>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
Precursor
gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
Length = 851
Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/825 (45%), Positives = 498/825 (60%), Gaps = 54/825 (6%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD RSLII+G R++L S SIHYPRS P+MWP+L+A+AK+GG D V+T VFWN HEP
Sbjct: 36 SSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 95
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
GQ+ F R DLVRF K V+ GLY+ LRIGPF+ EW +GG+P WLH PG VFR++N
Sbjct: 96 AQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNN 155
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK HMKR+ T IV+MMK + +ASQGG IIL+Q+ENEYG +E ++ PY WAA
Sbjct: 156 EPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAAS 215
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A+ TGVPW+MC+Q DAPDPVIN CN C + PNSP KP WTENW ++Q +G
Sbjct: 216 MALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQ--FKPNSPTKPKFWTENWPGWFQTFG 273
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
+ R ED+A+ VA F K GS NYY+YHGGTNFGRT +T YD AP+DEY
Sbjct: 274 ESNPHRPPEDVAFSVARFFGK-GGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEY 332
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDK 385
GL R PKW HL++LH ++KL +L G ++ QEA ++ S C AFL N D
Sbjct: 333 GLRRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDS 392
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----------------VEQWEE 429
+ V F + Y+LP S+SILPDCK VAFNTAK+ S V+ W
Sbjct: 393 EKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVDGWSI 452
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD---SESVLKVSSLGHV 486
++E + L N ++ +NTTKD++DYLWY F D S VL + S GH
Sbjct: 453 FREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIESKGHA 512
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+ AF+N E +GSA+G S +F++E V+L G N +SLLS+ VGL + G E AG+
Sbjct: 513 VQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAGAGI 572
Query: 547 RNVSIQGAK-ELKDFSSFSWGYQVGLLGEKLQIF-TDYGSRIVPWSRYGSSTHQPLTWYK 604
+V I G + + D SS W Y++GL GE +F D G I + +QP+TWYK
Sbjct: 573 TSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQPMTWYK 632
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------------LTPQ-- 646
D P G DPV +++ SMGKG AW+NG +IGRYW +P
Sbjct: 633 VNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTFSPNKC 692
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
G P+Q WYH+PRS+ P+GN LV+ EE+ G P I+ +V ++C VS+ + P
Sbjct: 693 RRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSEHY--P 750
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
I S ++ T + KVQ+ CP G+ IS + F S+GNP+G C +Y GSCH
Sbjct: 751 SIDLESWDRNTQNDGRDA----AKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCHH 806
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS ++VEKACL CTV + E F D CPG+ K L ++A C+
Sbjct: 807 PNSISVVEKACLNMNGCTVSLSDEGFGEDLCPGVTKTLAIEADCS 851
>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
Length = 827
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/821 (46%), Positives = 495/821 (60%), Gaps = 59/821 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
V YD +++ IN R+IL SGSIHYPRSTP+MWP LI KAKEGG++V+QT VFWN HEP
Sbjct: 24 TVWYDHKAITINNQRRILISGSIHYPRSTPEMWPGLIQKAKEGGIEVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R DLV+FIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 84 PGQYYFQDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPMWLKYVPGIEFRTDNG 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T+IVNMMK +L+ +QGGPIILSQIENEYG VE + G Y +WAA +
Sbjct: 144 PFKAAMQKFVTLIVNMMKEQKLFQTQGGPIILSQIENEYGPVEWTIGAPGKAYTKWAAAM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L TGVPW+MCKQ+DAPDP I+ CNG C E + PN+ +KP +WTENWT +Y +G
Sbjct: 204 ATGLNTGVPWIMCKQEDAPDPTIDTCNGFYC-EGYK-PNNYNKPKVWTENWTGWYTEWGA 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
R ED A+ VA FIA GS+VNYYMYHGGTNF RTA ++ T Y APLDEYGL
Sbjct: 262 SVPYRPPEDTAFSVARFIAA-SGSFVNYYMYHGGTNFDRTAGLFMATSYDYDAPLDEYGL 320
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
PKWGHL++LH A+K + ++S ++ K QEA +FQ CAAFL N D + +
Sbjct: 321 THDPKWGHLRDLHRAIKQSERALVSADPTVISLGKNQEAHVFQSKMGCAAFLANYDTQYS 380
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPT 436
A V F N Y LP SIS+LPDCKTV +NTAK+ + W+ + + +P
Sbjct: 381 ARVNFWNKPYSLPRWSISVLPDCKTVVYNTAKISAQSTQKWMMPVASGFSWQSHIDEVPV 440
Query: 437 -YDETSLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKVSSLGHVLHA 489
Y + L EQ T D +DYLWY N S L V+S GHVLH
Sbjct: 441 GYSAGTFTKVGLWEQKYLTGDKTDYLWYMTDVTINSNEGFLRSGKNPFLTVASAGHVLHV 500
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRN 548
FING GSA+G + T + V L+ G N ++LLS VGL + G + + V L
Sbjct: 501 FINGHLAGSAYGSLENPKLTFSQNVKLVGGVNKIALLSATVGLANVGVHYDTWNVGVLGP 560
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKTV 606
V++QG + D + + W Y++GL GE L++F+ G V W++ + PLTWYKT
Sbjct: 561 VTLQGLNQGTLDMTKWKWSYKIGLKGEDLKLFS--GGANVGWAQGAQLAKKTPLTWYKTF 618
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------- 646
+AP G+DPVA+ + SMGKG+ ++NG+SIGR+W ++
Sbjct: 619 INAPPGNDPVALYMGSMGKGQMYINGRSIGRHWPAYTAKGNCKDCDYAGYYDDQKCRSGC 678
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW 706
G P Q WYH+PRS+LKPTGNLLV+ EE G P GIS+ V ++C + D P + SW
Sbjct: 679 GQPPQQWYHVPRSWLKPTGNLLVVFEEMGGDPTGISLVKRVVGSVCADIDDDQ-PEMKSW 737
Query: 707 RSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSR 766
+ IP PK + CP G+K SKI+FASYG P G C Y G CH+ S
Sbjct: 738 T----------ENIP-VTPKAHLWCPPGQKFSKIVFASYGWPQGRCGAYRQGKCHALKSW 786
Query: 767 AIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+K C+GK +C + V F GDPCPG K L V QC+
Sbjct: 787 DPFQKYCIGKGACDIDVAPATFGGDPCPGSAKRLSVQLQCS 827
>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 919
Score = 718 bits (1854), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/825 (45%), Positives = 498/825 (60%), Gaps = 54/825 (6%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD RSLII+G R++L S SIHYPRS P+MWP+L+A+AK+GG D V+T VFWN HEP
Sbjct: 104 SSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 163
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
GQ+ F R DLVRF K V+ GLY+ LRIGPF+ EW +GG+P WLH PG VFR++N
Sbjct: 164 AQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTNN 223
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK HMKR+ T IV+MMK + +ASQGG IIL+Q+ENEYG +E ++ PY WAA
Sbjct: 224 EPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAAS 283
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A+ TGVPW+MC+Q DAPDPVIN CN C + PNSP KP WTENW ++Q +G
Sbjct: 284 MALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQ--FKPNSPTKPKFWTENWPGWFQTFG 341
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
+ R ED+A+ VA F K GS NYY+YHGGTNFGRT +T YD AP+DEY
Sbjct: 342 ESNPHRPPEDVAFSVARFFGK-GGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEY 400
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDK 385
GL R PKW HL++LH ++KL +L G ++ QEA ++ S C AFL N D
Sbjct: 401 GLRRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDS 460
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----------------VEQWEE 429
+ V F + Y+LP S+SILPDCK VAFNTAK+ S V+ W
Sbjct: 461 EKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDMVPANLESSKVDGWSI 520
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD---SESVLKVSSLGHV 486
++E + L N ++ +NTTKD++DYLWY F D S VL + S GH
Sbjct: 521 FREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIESKGHA 580
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+ AF+N E +GSA+G S +F++E V+L G N +SLLS+ VGL + G E AG+
Sbjct: 581 VQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAGAGI 640
Query: 547 RNVSIQGAK-ELKDFSSFSWGYQVGLLGEKLQIF-TDYGSRIVPWSRYGSSTHQPLTWYK 604
+V I G + + D SS W Y++GL GE +F D G I + +QP+TWYK
Sbjct: 641 TSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQPMTWYK 700
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------------LTPQ-- 646
D P G DPV +++ SMGKG AW+NG +IGRYW +P
Sbjct: 701 VNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTFSPNKC 760
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
G P+Q WYH+PRS+ P+GN LV+ EE+ G P I+ +V ++C VS+ + P
Sbjct: 761 RRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSEHY--P 818
Query: 703 VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHS 762
I S ++ T + KVQ+ CP G+ IS + F S+GNP+G C +Y GSCH
Sbjct: 819 SIDLESWDRNTQNDGRDA----AKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCHH 874
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS ++VEKACL CTV + E F D CPG+ K L ++A C+
Sbjct: 875 PNSISVVEKACLNMNGCTVSLSDEGFGEDLCPGVTKTLAIEADCS 919
>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
Length = 827
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/818 (45%), Positives = 498/818 (60%), Gaps = 49/818 (5%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP- 87
NV+YD RSLIING RK+L S +IHYPRS P MWP L+ AKEGG+DV++T VFWN+H+P
Sbjct: 20 NVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVHQPT 79
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
P ++ F GR DLV+FI VQ G+Y+ LRIGPF+ EW +GG+P WLH V G VFR+DN
Sbjct: 80 SPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFRTDN 139
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ--IENEYGMVEHSFLEKGPPYVRWA 205
FK++M+ + T IV +MK +L+ASQGGPIILSQ +ENEYG E ++ E G Y WA
Sbjct: 140 YNFKYYMEEFTTYIVKLMKKEKLFASQGGPIILSQAKVENEYGYYEGAYGEGGKRYAAWA 199
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
A++AV TGVPW+MC+Q DAP VIN CN C + P PDKP IWTENW ++Q
Sbjct: 200 AQMAVSQNTGVPWIMCQQFDAPPSVINTCNSFYCDQF--KPIFPDKPKIWTENWPGWFQT 257
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLD 324
+G R AED+A+ VA F K GS NYYMYHGGTNFGRTA +T YD +AP+D
Sbjct: 258 FGAPNPHRPAEDVAFSVARFFQK-GGSVQNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 316
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNK 383
EYGL R PKWGHLKELH A+KLC +L+ V+++ QEA ++ S C AFL N
Sbjct: 317 EYGLPRLPKWGHLKELHKAIKLCEHVLLNSKPVNLSLGPSQEADVYADASGGCVAFLANI 376
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---DSVEQWEEYKEAIPTYDET 440
D +N+ TV F N+ Y+LP S+SILPDCK V +NTAK +WE + E + E
Sbjct: 377 DDKNDKTVDFQNVSYKLPAWSVSILPDCKNVVYNTAKQKDGSKALKWEVFVEKAGIWGEP 436
Query: 441 SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHAFINGE 494
N ++ +NTTKD +DYLWY ++ VL + S+GH LHAF+N E
Sbjct: 437 DFMKNGFVDHINTTKDTTDYLWYTTSIVVGENEEFLKEGRHPVLLIESMGHALHAFVNQE 496
Query: 495 FVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGA 554
GSA G S F + + L G N ++LLS+ VGLP++G++ E AGL +V I+G
Sbjct: 497 LQGSASGNGSHSPFKFKNPISLKAGNNEIALLSMTVGLPNAGSFYEWVGAGLTSVRIEGF 556
Query: 555 KE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGSSTHQPLTWYKTVFDAPTG 612
D S F+W Y++GL GEKL I+ G V W + QPLTWYK V D P G
Sbjct: 557 NNGTVDLSHFNWIYKIGLQGEKLGIYKPEGVNSVSWVATSEPPKKQPLTWYKVVLDPPAG 616
Query: 613 SDPVAINLISMGKGEAWVNGQSIGRYWV----------------------SFLTPQGTPS 650
++PV ++++ MGKG AW+NG+ IGRYW T G P+
Sbjct: 617 NEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSSVHEKCVTECDYRGKFMPDKCFTGCGQPT 676
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQN 710
Q WYH+PRS+ KP+GNLLV+ EE+ G P I+ ++++C +++ + S +
Sbjct: 677 QRWYHVPRSWFKPSGNLLVIFEEKGGDPEKITFSRRKMSSICALIAEDY-------PSAD 729
Query: 711 QRTLK-THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIV 769
+++L+ + + V + CP IS + FAS+G P G C +Y+ G CH NS ++V
Sbjct: 730 RKSLQEAGSKNSNSKASVHLGCPQNAVISAVKFASFGTPTGKCGSYSEGECHDPNSISVV 789
Query: 770 EKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
EKACL K CT+ + E F CP + L V+A C+
Sbjct: 790 EKACLNKTECTIELTEENFNKGLCPDFTRRLAVEAVCS 827
>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
Length = 860
Score = 717 bits (1851), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/842 (45%), Positives = 519/842 (61%), Gaps = 68/842 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
GG NVTYD R+L+I+G R++L SGSIHYPRSTP MWP +I KAK+GGLDV++T VFW
Sbjct: 30 GGARATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGIIQKAKDGGLDVIETYVFW 89
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
++HEP GQ+DF GR+DL F+K V GLYV LRIGP++ EW YGG P WLH +PGI
Sbjct: 90 DIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIK 149
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FR+DNEPFK M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G Y+
Sbjct: 150 FRTDNEPFKTEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYM 209
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
RWAA +A+ L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ +
Sbjct: 210 RWAAGMAISLDTGVPWVMCQQTDAPDPLINTCNGFYCDQFT--PNSAAKPKMWTENWSGW 267
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQA 321
+ +G R ED+A+ VA F + G++ NYYMYHGGTN R++ ++ T Y A
Sbjct: 268 FLSFGGAVPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDA 326
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
P+DEYGL+R+PKWGHL+++H A+KLC +++ + + EA +++ S CAAFL
Sbjct: 327 PIDEYGLVREPKWGHLRDVHKAIKLCEPALIATDPSYTSLGQNAEAAVYKTGSVCAAFLA 386
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------------------ 423
N D +++ TV F+ MY LP S+SILPDCK V NTA+++S
Sbjct: 387 NIDGQSDKTVTFNGRMYRLPAWSVSILPDCKNVVLNTAQINSQVTSSEMRYLESSNMASD 446
Query: 424 ---------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP- 471
V W E + + +L L+EQ+NTT DASD+LWY + K D
Sbjct: 447 GSFITPELAVSGWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEP 506
Query: 472 --SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVM 529
+ S+S L V+SLGHVL +ING+ GSA G S + +K + L+ G N + LLS
Sbjct: 507 YLNGSQSNLVVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSAT 566
Query: 530 VGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVP 588
VGL + GA+ + AG+ V + G D SS W YQ+GL GE L ++ +
Sbjct: 567 VGLSNYGAFFDLVGAGITGPVKLSGTNGALDLSSAEWTYQIGLRGEDLHLYDPSEASPEW 626
Query: 589 WSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-- 646
S +QPL WYKT F P G DPVAI+ MGKGEAWVNGQSIGRYW + L PQ
Sbjct: 627 VSANAYPINQPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSG 686
Query: 647 --------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTV 686
G PSQ+ YH+PRSFL+P N +VL E+ G P IS
Sbjct: 687 CVNSCNYRGSYNSNKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFVIR 746
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASY 745
++C VS+ H + SW S +Q+T++ + P++++ CP G+ IS I FAS+
Sbjct: 747 QTGSVCAQVSEEHPAQIDSWNS-SQQTMQRYG------PELRLECPKDGQVISSIKFASF 799
Query: 746 GNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQ 805
G P+G C +Y+ G C S+ + ++V++AC+G SC+VPV + ++G+PC G+ K+L V+A
Sbjct: 800 GTPSGTCGSYSHGECSSTQALSVVQEACIGVSSCSVPV-SSNYFGNPCTGVTKSLAVEAA 858
Query: 806 CT 807
C+
Sbjct: 859 CS 860
>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
gi|219886857|gb|ACL53803.1| unknown [Zea mays]
gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
Length = 852
Score = 713 bits (1840), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 386/844 (45%), Positives = 513/844 (60%), Gaps = 73/844 (8%)
Query: 23 GGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFW 82
GG NVTYD R+L+I+G R++L SGSIHYPRSTP MWP LI KAK+GGLDV++T VFW
Sbjct: 23 GGARAANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFW 82
Query: 83 NLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV 142
++HEP GQ+DF GR+DL F+K V GLYV LRIGP++ EW YGG P WLH +PGI
Sbjct: 83 DIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIK 142
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FR+DNEPFK M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G Y+
Sbjct: 143 FRTDNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYM 202
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ +
Sbjct: 203 RWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFT--PNSAAKPKMWTENWSGW 260
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQA 321
+ +G R ED+A+ VA F + G++ NYYMYHGGTN R++ ++ T Y A
Sbjct: 261 FLSFGGAVPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDA 319
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
P+DEYGL+RQPKWGHL+++H A+KLC +++ + EA +++ S CAAFL
Sbjct: 320 PIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGSVCAAFLA 379
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------------------ 423
N D +++ TV F+ MY LP S+SILPDCK V NTA+++S
Sbjct: 380 NIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASD 439
Query: 424 ---------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP- 471
V W E + + +L L+EQ+NTT DASD+LWY + K D
Sbjct: 440 GSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDEP 499
Query: 472 --SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVM 529
+ S+S L V+SLGHVL +ING+ GSA G S + +K + L+ G N + LLS
Sbjct: 500 YLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSAT 559
Query: 530 VGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVP 588
VGL + GA+ + AG+ V + G D SS W YQ+GL GE L ++ +
Sbjct: 560 VGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEASPEW 619
Query: 589 WSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-- 646
S + PL WYKT F P G DPVAI+ MGKGEAWVNGQSIGRYW + L PQ
Sbjct: 620 VSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSG 679
Query: 647 --------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTV 686
G PSQ+ YH+PRSFL+P N LVL E G P IS
Sbjct: 680 CVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFVMR 739
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCP-SGRKISKILFA 743
++C VS++H + SW SQ P +R P +++ CP G+ IS + FA
Sbjct: 740 QTGSVCAQVSEAHPAQIDSWSSQQ----------PMQRYGPALRLECPKEGQVISSVKFA 789
Query: 744 SYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVD 803
S+G P+G C +Y+ G C S+ + +IV++AC+G SC+VPV + ++G+PC G+ K+L V+
Sbjct: 790 SFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPV-SSNYFGNPCTGVTKSLAVE 848
Query: 804 AQCT 807
A C+
Sbjct: 849 AACS 852
>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
Length = 785
Score = 712 bits (1839), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/803 (47%), Positives = 491/803 (61%), Gaps = 62/803 (7%)
Query: 47 FSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKE 106
SGS+HYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP GQ+ F GR DLV FIK
Sbjct: 1 MSGSVHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKL 60
Query: 107 VQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMK 166
V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEPFK M+++ T IV+MMK
Sbjct: 61 VKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVDMMK 120
Query: 167 AARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDA 226
+ L+ QGGPIILSQIENE+G +E E Y WAA +AV L T VPWVMCK+DDA
Sbjct: 121 SEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDA 180
Query: 227 PDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFI 286
PDP+IN CNG C + PN P KP +WTE WTS+Y +G R ED+AY VA FI
Sbjct: 181 PDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFI 238
Query: 287 AKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DEYGLLR+PKWGHLKELH A+K
Sbjct: 239 QK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIK 297
Query: 346 LCLKPMLSGVLVSMNFSKLQEAFIFQGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLS 404
LC +++G + + Q+A +F+ S++ C AFL NKDK + A V F+ + Y LPP S
Sbjct: 298 LCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYNLPPWS 357
Query: 405 ISILPDCKTVAFNTAKLDS-VEQ----------WEEYKEAIPTYDETSLRANFLLEQMNT 453
ISILPDCKT +NTA++ S + Q W+ Y E I + + S LLEQ+N
Sbjct: 358 ISILPDCKTTVYNTARVGSQISQMKMEWAGGFTWQSYNEDINSLGDESFVTVGLLEQINV 417
Query: 454 TKDASDYLWYNFRFKHDPSDSES--------VLKVSSLGHVLHAFINGEFVGSAHGKHSD 505
T+D +DYLWY D + E VL V S GH LH F+NG+ G+ +G D
Sbjct: 418 TRDNTDYLWYTTYV--DVAQDEQFLSNGKNPVLTVMSAGHALHIFVNGQLTGTVYGSVDD 475
Query: 506 KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSF 563
T V L G+N +S LS+ VGLP+ G + E AG L V++ G E +D +
Sbjct: 476 PKLTYRGNVKLWPGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQ 535
Query: 564 SWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISM 623
W Y+VGL GE L + + GS V W QPLTWYK F+AP G +P+A+++ SM
Sbjct: 536 KWTYKVGLKGEDLSLHSLSGSSSVEWGE--PMQKQPLTWYKAFFNAPDGDEPLALDMSSM 593
Query: 624 GKGEAWVNGQSIGRYWVSF--------------------LTPQGTPSQSWYHIPRSFLKP 663
GKG+ W+NGQ IGRYW + T G SQ WYH+PRS+L P
Sbjct: 594 GKGQIWINGQGIGRYWPGYKASGTCGICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNP 653
Query: 664 TGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGR 723
TGNLLV+ EE G P GIS+ + ++C VS+ P + +WR+++ K H
Sbjct: 654 TGNLLVIFEEWGGDPTGISMVKRTTGSICADVSEWQ-PSMTNWRTKDYEKAKIH------ 706
Query: 724 RPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPV 783
++C GRK++ I FAS+G P G+C +Y+ G CH+ S I K C+G+ C V V
Sbjct: 707 -----LQCDHGRKMTDIKFASFGTPQGSCGSYSEGGCHAHKSYDIFWKNCIGQERCGVSV 761
Query: 784 WTEKFYGDPCPGIPKALLVDAQC 806
F GDPCPG K +V+A C
Sbjct: 762 VPNVFGGDPCPGTMKRAVVEAIC 784
>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 887
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 390/852 (45%), Positives = 509/852 (59%), Gaps = 84/852 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LII G R++L S IHYPR+TP+MW LIAK+KEGG DVVQT VFWN HEP
Sbjct: 37 NVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKEGGADVVQTYVFWNGHEPV 96
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR DLV+F+K + + GLY+ LRIGP++ EW +GG P WL D+PGI FR+DNE
Sbjct: 97 KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNE 156
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T IV++M+ A+L+ QGGPII+ QIENEYG VE S+ +KG YV+WAA +
Sbjct: 157 PFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMCKQ DAP+ +I+ACNG C + F PNS KP +WTE+W +Y +G
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGFK-PNSRTKPVLWTEDWDGWYTKWGG 274
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA F + GS+ NYYMY GGTNFGRT+ + +T Y APLDEYG
Sbjct: 275 SLPHRPAEDLAFAVARFYQR-GGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYG 333
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE-----CAAF 379
L +PKWGHLK+LH+A+KLC +++ + + KL QEA I+ G E CAAF
Sbjct: 334 LRSEPKWGHLKDLHAAIKLCEPALVAA--DAPQYRKLGSKQEAHIYHGDGETGGKVCAAF 391
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------------ 421
L N D+ +A V F+ Y LPP S+SILPDC+ VAFNTAK+
Sbjct: 392 LANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGS 451
Query: 422 ----------DSV----EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF 467
D+V + W KE I + E + LLE +N TKD SDYLW+ R
Sbjct: 452 MSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRI 511
Query: 468 KHDPSD--------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLING 519
D S + + S+ VL F+N + GS G H K+ + V I G
Sbjct: 512 SVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVG-HWVKAV---QPVRFIQG 567
Query: 520 TNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQ 577
N++ LL+ VGL + GA+LE+ AG R + G K D S SW YQVGL GE +
Sbjct: 568 NNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSKSSWTYQVGLKGEADK 627
Query: 578 IFTDYGSRIVPWSRYGSSTHQPL-TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
I+T + WS + + WYKT FD P G+DPV +NL SMG+G+AWVNGQ IG
Sbjct: 628 IYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLESMGRGQAWVNGQHIG 687
Query: 637 RYWVSF---------------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
RYW T G P+Q+ YH+PRS+LKP+ NLLVL EE
Sbjct: 688 RYWNIISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETG 747
Query: 676 GYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGR 735
G P IS+ TV+ LCG VS+SH PP+ W + + + I P+V + C G
Sbjct: 748 GNPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDY--INGTMSINSVAPEVHLHCEDGH 805
Query: 736 KISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPG 795
IS I FASYG P G+C+ ++IG CH+SNS +IV +AC G+ SC + V F DPC G
Sbjct: 806 VISSIEFASYGTPRGSCDGFSIGKCHASNSLSIVSEACKGRNSCFIEVSNTAFISDPCSG 865
Query: 796 IPKALLVDAQCT 807
K L V ++C+
Sbjct: 866 TLKTLAVMSRCS 877
>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
Length = 838
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/805 (44%), Positives = 495/805 (61%), Gaps = 32/805 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+I+G R + FSG+IHYPRS P+MW +L+ AK GGL+ ++T VFWN HE
Sbjct: 33 GTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG++ F GR DL+RF+ ++ +Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK M+++ IV +K A ++A QGGPIILSQIENEYG ++ +G Y+ WAA
Sbjct: 153 NEPFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ GVPWVMCKQ AP VI CNGR CG+T+ + +KP +WTENWT+ ++ +
Sbjct: 213 EMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDK-NKPRLWTENWTAQFRTF 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ RSAEDIAY V F AK G+ VNYYMYHGGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 272 GDQLAQRSAEDIAYAVLRFFAK-GGTLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEY 330
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ +K K L G EA ++ + C +FL N +
Sbjct: 331 GMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNN 390
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQ---WEE 429
+ TV F + +P S+SIL DCKTV +NT ++ D + WE
Sbjct: 391 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNVWEM 450
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y EAIP + +T +R LEQ N TKD SDYLWY +FR + D D V+++ S
Sbjct: 451 YSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIKST 510
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F N FVG+ G +KSF EK + L G N++++LS +G+ DSG L
Sbjct: 511 AHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVEVK 570
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G+++ +QG D WG++ L GE +I+T+ G W + P+TW
Sbjct: 571 GGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKP--AENDLPITW 628
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK FD P G DP+ +++ SM KG +VNG+ IGRYW SF+T G PSQS YHIPR+FLK
Sbjct: 629 YKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLAGHPSQSVYHIPRAFLK 688
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
P GNLL++ EEE G P GI I TV +C +S+ + + +W S + +
Sbjct: 689 PKGNLLIIFEEELGKPGGILIQTVRRDDICVFISEHNPAQIKTWESDGGQIKLIAEDTST 748
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
R + CP R I +++FAS+GNP G C N+ G+CH+ +++AIVEK CLGK SC +P
Sbjct: 749 RG---TLNCPPKRTIQEVVFASFGNPEGACGNFTAGTCHTPDAKAIVEKECLGKESCVLP 805
Query: 783 VWTEKFYGD-PCPGIPKALLVDAQC 806
V + D CP L V +C
Sbjct: 806 VVNTVYGADINCPATTATLAVQVRC 830
>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
Length = 882
Score = 710 bits (1833), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/856 (44%), Positives = 509/856 (59%), Gaps = 88/856 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+L+I+G R++L S IHYPR+TP+MWP LIAK+KEGG DV+QT VFWN HEP
Sbjct: 28 NVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNGHEPV 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q++F GR D+V+F+K V + GLY+ LRIGP++ EW +GG P WL D+PGI FR+DN
Sbjct: 88 RRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ IV++M+ L++ QGGPII+ QIENEYG VE SF ++G YV+WAA++
Sbjct: 148 PFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKWAARM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A++L GVPWVMC+Q DAPD +INACNG C + PNS +KP +WTE+W ++ +G
Sbjct: 208 ALELDAGVPWVMCQQADAPDIIINACNGFYCDAFW--PNSANKPKLWTEDWNGWFASWGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R EDIA+ VA F + GS+ NYYMY GGTNFGR++ + +T Y AP+DEYG
Sbjct: 266 RTPKRPVEDIAFAVARFFQR-GGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVS--MNFSKLQEAFIFQ-----------GSS 374
LL QPKWGHLKELH+A+KLC +P L V + +QEA +++ S
Sbjct: 325 LLSQPKWGHLKELHAAIKLC-EPALVAVDSPQYIKLGPMQEAHVYRVKESLYSTQSGNGS 383
Query: 375 ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------------ 422
C+AFL N D+ A+V F +Y+LPP S+SILPDC+T FNTAK+
Sbjct: 384 SCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNTAKVGAQTSIKTVEFDL 443
Query: 423 ------SVEQ--------------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLW 462
SV Q W KE I + E + +LE +N TKD SDYLW
Sbjct: 444 PLVRNISVTQPLMVQNKISYVPKTWMTLKEPISVWSENNFTIQGVLEHLNVTKDHSDYLW 503
Query: 463 YNFRFKHDPSD--------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV 514
R D L + S+ +LH F+NG+ +GS G + + +
Sbjct: 504 RITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVIGHW----VKVVQPI 559
Query: 515 HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLL 572
L+ G N++ LLS VGL + GA+LE+ AG + V + G K + D S +SW YQVGL
Sbjct: 560 QLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLR 619
Query: 573 GEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVN 631
GE +I+ S W+ ++ TWYKT FDAP G +PVA++L SMGKG+AWVN
Sbjct: 620 GEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVN 679
Query: 632 GQSIGRYWVSFL--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLL 671
G IGRYW T G P+Q WYHIPRS+L+ + NLLVL
Sbjct: 680 GHHIGRYWTRVAPKDGCGKCDYRGHYHTSKCATNCGNPTQIWYHIPRSWLQASNNLLVLF 739
Query: 672 EEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRC 731
EE G P IS+ + S T+C VS+SH P + +W + + ++ P++ ++C
Sbjct: 740 EETGGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPSDFIDQNSKNKM---TPEMHLQC 796
Query: 732 PSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGD 791
G IS I FASYG P G+C+ ++ G CH+ NS A+V KAC GK SC + + F GD
Sbjct: 797 DDGHTISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFGGD 856
Query: 792 PCPGIPKALLVDAQCT 807
PC GI K L V+A+C
Sbjct: 857 PCRGIVKTLAVEAKCA 872
>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
Length = 830
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 388/828 (46%), Positives = 509/828 (61%), Gaps = 70/828 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++ ING R+IL SGSIHYPRS+P+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 24 SVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLV+F+K V+ GLYV LRIGP+I EW +G H F++
Sbjct: 84 PGKYYFEGNYDLVKFVKLVKEAGLYVNLRIGPYICAEWNFG------HQ-----FQNGQW 132
Query: 149 PFK---FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
PF+ M+++ T IVNMMKA RL+ SQGGPIILSQIENEYG +E+ G Y +WA
Sbjct: 133 PFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGSPGQAYTKWA 192
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
A++AV L+TGVPWVMCKQDDAPDP+IN CNG C + PN KP +WTE WT ++
Sbjct: 193 AQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYC--DYFSPNKAYKPKMWTEAWTGWFTQ 250
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLD 324
+G R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLD
Sbjct: 251 FGGPVPHRPAEDMAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLD 309
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNK 383
EYGLLRQPKWGHLK+LH A+KLC ++SG + QEA +F + CAAFL N
Sbjct: 310 EYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANY 369
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEE 429
+R+ A V F N+ Y LPP SISILPDCK +NTA++ + W+
Sbjct: 370 HQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQSATIKMTPVPMHGGLSWQT 429
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSL 483
Y E + + + LLEQ+NTT+D SDYLWY DPS+ VL V S
Sbjct: 430 YNEEPSSSGDNTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLKSGKYPVLTVLSA 489
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH LH FING+ G+A+G T + V L G N +SLLS+ VGLP+ G + E
Sbjct: 490 GHALHVFINGQLSGTAYGSLDFPKLTFSQGVSLRAGVNKISLLSIAVGLPNVGPHFETWN 549
Query: 544 AG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQP 599
AG L V++ G E + D S W Y++GL GE L + + GS V W+ GS + QP
Sbjct: 550 AGILGPVTLNGLNEGRMDLSWQKWSYKIGLHGEALSLHSISGSSSVEWAE-GSLVAQKQP 608
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL---------------- 643
L+WYKT F+AP G+ P+A+++ SMGKG+ W+NGQ +GR+W ++
Sbjct: 609 LSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKASGTCGECTYIGTYNE 668
Query: 644 ----TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
T G SQ WYH+P+S+LKPTGNLLV+ EE G P G+S+ V ++C + +
Sbjct: 669 NKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGVSLVRREVDSVCADIYEWQ 728
Query: 700 LPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGS 759
P ++++ Q Q + K +K + RPK + C G+KI I FAS+G P G C +Y GS
Sbjct: 729 -PTLMNY--QMQASGKVNKPL---RPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYNQGS 782
Query: 760 CHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
CH+ +S C+G+ SC+V V E F GDPCP + K L +A C+
Sbjct: 783 CHAFHSYDAFNNLCVGQNSCSVTVAPEMFGGDPCPSVMKKLAAEAICS 830
>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
gi|223950023|gb|ACN29095.1| unknown [Zea mays]
Length = 815
Score = 707 bits (1824), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/794 (46%), Positives = 491/794 (61%), Gaps = 56/794 (7%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MW LI KAK+GGLDV+QT VFWN HEP PG + F R DLVRF+K VQ GL+V LRIG
Sbjct: 29 MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 88
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
P+I GEW +GG P WL VPGI FR+DNEPFK M+ + IV MMK+ L+ASQGGPII
Sbjct: 89 PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 148
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQC 239
LSQIENEYG F G Y+ WAAK+AV L TGVPWVMCK++DAPDPVINACNG C
Sbjct: 149 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 208
Query: 240 GETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMY 299
+ F+ PN P KP +WTE W+ ++ +G R R ED+A+ VA F+ K GS++NYYMY
Sbjct: 209 -DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQK-GGSFINYYMY 265
Query: 300 HGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVS 358
HGGTNFGRTA +T YD AP+DEYGL+R+PK HLKELH AVKLC + ++S
Sbjct: 266 HGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTI 325
Query: 359 MNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
+QEA +F+ S CAAFL N + ++A V F+N Y LPP SISILPDCK V FN+
Sbjct: 326 TTLGTMQEAHVFRSPSGCAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNS 385
Query: 419 AKLD-------------SVEQWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYN 464
A + + WE Y E + + L LLEQ+N T+D+SDYLWY
Sbjct: 386 ATVGVQTSQMQMWGDGATSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYI 445
Query: 465 FRFKHDPSDS-------ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
PS++ L V S GH LH F+NG+ GS++G D+ V+L
Sbjct: 446 TSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLR 505
Query: 518 NGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKE-LKDFSSFSWGYQVGLLGEK 575
GTN ++LLSV GLP+ G + E G+ V + G E +D + +W YQVGL GE+
Sbjct: 506 AGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQVGLKGEQ 565
Query: 576 LQIFTDYGSRIVPWSRYG--SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQ 633
+ + + GS V W + + QPL WYK F+ P+G +P+A+++ SMGKG+ W+NGQ
Sbjct: 566 MNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQ 625
Query: 634 SIGRYWV--------------SFLTPQ-----GTPSQSWYHIPRSFLKPTGNLLVLLEE- 673
SIGRYW +F P+ G P+Q WYH+PRS+L+P+ NLLV+LEE
Sbjct: 626 SIGRYWTAYADGDCKGCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEEL 685
Query: 674 ENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS 733
G I++ SV+++C VS+ H P + W+ ++++ RR KV +RC
Sbjct: 686 GGGDSSKIALAKRSVSSVCADVSEDH-PNIKKWQ------IESYGEREHRRAKVHLRCAH 738
Query: 734 GRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPC 793
G+ IS I FAS+G P G C N+ G CHS++S A++EK C+G + C V + + F GDPC
Sbjct: 739 GQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKRCIGLQRCVVAISPDNFGGDPC 798
Query: 794 PGIPKALLVDAQCT 807
P + K + V+A C+
Sbjct: 799 PSVTKRVAVEAVCS 812
>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
Length = 841
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/854 (44%), Positives = 516/854 (60%), Gaps = 61/854 (7%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
MG L L L ++ S G V+YD R+L+I+G R++L SGSIHYPR+TP++
Sbjct: 1 MGSKNSLVLILLFVSIFACSYLERGWSGKVSYDHRALVIDGKRRVLQSGSIHYPRTTPEV 60
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP +I K+KEGGLDV++T VFWN HEP GQ+ F GR DLVRF+K +Q GL V LRIGP
Sbjct: 61 WPDIIRKSKEGGLDVIETYVFWNYHEPVKGQYYFEGRFDLVRFVKTIQEAGLLVHLRIGP 120
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
+ EW YGG P WLH +PGI FR+ NE FK MK + T IVNMMK L+ASQGGPIIL
Sbjct: 121 YACAEWNYGGFPLWLHFIPGIQFRTTNELFKEEMKLFLTKIVNMMKEENLFASQGGPIIL 180
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
+Q+ENEYG VE ++ G YV+WAA+ AV L T VPWVMC Q DAPDP+IN CNG C
Sbjct: 181 AQVENEYGNVEWAYGAAGELYVKWAAETAVSLNTSVPWVMCAQVDAPDPIINTCNGFYC- 239
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ F+ PNSP KP +WTEN++ ++ +G R ED+A+ VA F + G++ NYYMY
Sbjct: 240 DRFS-PNSPSKPKMWTENYSGWFLSFGYAIPYRPVEDLAFAVARFF-ETGGTFQNYYMYF 297
Query: 301 GGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA ++ YD AP+DEYG +RQPKWGHL++LH A+K C + ++S +
Sbjct: 298 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRDLHKAIKQCEEHLISSDPIHQ 357
Query: 360 NFSKLQEAFI-FQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
EA I ++ S++CAAFL N D ++A V F+ +Y LP S+SILPDCK V FNT
Sbjct: 358 QLGNNLEAHIYYKSSNDCAAFLANYDSSSDANVTFNGNIYFLPAWSVSILPDCKNVIFNT 417
Query: 419 AKL-------------DSVEQ-------WEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
AK+ SV + W YKE + + S A LLEQ+NTTKD S
Sbjct: 418 AKVLILNLGDDFFAHSTSVNEIPLEQIVWSWYKEEVGIWGNNSFTAPGLLEQINTTKDIS 477
Query: 459 DYLWYNFRFKHDPSD-SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
D+LWY+ + + +L + SLGH F+N VG +G H D SF+L + + LI
Sbjct: 478 DFLWYSTSISVNADQVKDIILNIESLGHAALVFVNKVLVGK-YGNHDDASFSLTEKISLI 536
Query: 518 NGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELK-DFSSFSWGYQVGLLGEKL 576
G N + LLS+M+G+ + G + + + AG+ V + G ++K D SS W YQVGL GE
Sbjct: 537 EGNNTLDLLSMMIGVQNYGPWFDVQGAGIYAVLLVGQSKVKIDLSSEKWTYQVGLEGEYF 596
Query: 577 QIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
+ + W++ S ++ L WYK F AP G P+A+NL MGKG+AWVNGQSI
Sbjct: 597 GLDKVSLANSSLWTQGASPPINKSLIWYKGTFVAPEGKGPLALNLAGMGKGQAWVNGQSI 656
Query: 636 GRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEE 673
GRYW ++L+P G P+Q+ YHIPR+++ P NLLVL EE
Sbjct: 657 GRYWPAYLSPSTGCNDSCDYRGAYDSFKCLKKCGQPAQTLYHIPRTWVHPGENLLVLHEE 716
Query: 674 ENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS 733
G P IS+ T + +C VS+ PP SW+S ++ + P+V++ C
Sbjct: 717 LGGDPSKISVLTRTGHEICSIVSEDDPPPADSWKSSSE--------FKSQNPEVRLTCEQ 768
Query: 734 GRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPC 793
G I I FAS+G P G C + GSCH ++ IV+KAC+G+ C++ + GDPC
Sbjct: 769 GWHIKSINFASFGTPAGICGTFNPGSCH-ADMLDIVQKACIGQEGCSISISAANL-GDPC 826
Query: 794 PGIPKALLVDAQCT 807
PG+ K V+A+C+
Sbjct: 827 PGVLKRFAVEARCS 840
>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
Length = 835
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/805 (44%), Positives = 494/805 (61%), Gaps = 32/805 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+I+G R + FSG+IHYPRS P+MWP+L+ +AK+GGL+ ++T VFWN HE
Sbjct: 30 GTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWPKLLDRAKDGGLNTIETYVFWNAHE 89
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG+++F GR DL++F+K +Q +Y +RIGPFI+ EW +GGLP+WL ++P I+FR++
Sbjct: 90 PEPGKYNFEGRCDLIKFLKLIQDNDMYAVIRIGPFIQAEWNHGGLPYWLREIPHIIFRAN 149
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K M+++ IV +K A ++ASQGGPIIL+QIENEYG ++ + G Y+ WAA
Sbjct: 150 NEPYKKEMEKFVRFIVQKLKDADMFASQGGPIILAQIENEYGNIKKDHITDGDKYLEWAA 209
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ G+PW+MCKQ AP VI CNGR CG+T+ +KP +WTENWT+ ++ +
Sbjct: 210 EMALSTNIGIPWIMCKQTTAPGVVIPTCNGRHCGDTWT-LRDKNKPRLWTENWTAQFRAF 268
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+A +RSAEDIAY V F AK G+ VNYYMY+GGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 269 GDQAAVRSAEDIAYSVLRFFAK-GGTLVNYYMYYGGTNFGRTGASYVLTGYYDEAPIDEY 327
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PK+GHL++LH +K K L G EA ++ E C AF+ N +
Sbjct: 328 GLNKEPKFGHLRDLHKLIKSYHKAFLVGKQSFELLGHGYEAHNYELPEENLCLAFISNNN 387
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQ--WEE 429
+ TV F Y +P S+SIL DC V +NT ++ +S + WE
Sbjct: 388 TGEDGTVMFRGKKYYIPSRSVSILADCNHVVYNTKRVFVQHSERSFHTADESTKNNVWEM 447
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y E IP Y TS+R LEQ N TKD SDYLWY +FR + D D V++V S
Sbjct: 448 YSEPIPRYKVTSVRTKEPLEQYNLTKDKSDYLWYTTSFRLEADDLPFRRDIRPVVQVKSS 507
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F+N F GS G DK F EK + L G N+++LLS +G+ DSG L
Sbjct: 508 AHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLRIGINHLALLSSSMGMKDSGGELVEVK 567
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G+++ IQG D WG+++ L GE +I+T+ G V W + +TW
Sbjct: 568 GGIQDCMIQGLNTGTLDLQGNGWGHKINLDGEDKEIYTEKGMGTVKWKP--AENGHAVTW 625
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
Y+ FD P G DPV +++ SM KG +VNG+ +GRYW S+ T G PSQS YHIPR FLK
Sbjct: 626 YRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWTSYKTIAGLPSQSLYHIPRPFLK 685
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
NLLV+ EEE G P GI I TV +C +S+ + V +W + + +
Sbjct: 686 SKKNLLVVFEEEIGKPEGILIQTVRRDDICFLMSEHNPAQVKTWDADGGQIKLIAEDHSS 745
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
R + CP + I +++FAS+GNP G C N+ G+CH+ N++ V K CLGK+SC +P
Sbjct: 746 RGI---LTCPHKKTIEEVVFASFGNPEGACGNFTAGTCHTPNAKEFVAKECLGKKSCVLP 802
Query: 783 VWTEKFYGD-PCPGIPKALLVDAQC 806
+ + D CP L V +C
Sbjct: 803 LIHTLYGADINCPTTTATLAVQVRC 827
>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
Length = 891
Score = 706 bits (1822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/859 (44%), Positives = 511/859 (59%), Gaps = 94/859 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+LII+G R+IL S IHYPR+TP+MWP LIAK+KEGG DVVQT VFW HEP
Sbjct: 35 NVTYDHRALIIDGRRRILNSAGIHYPRATPEMWPDLIAKSKEGGADVVQTYVFWGGHEPV 94
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+ F GR DLV+F+K V GLY+ LRIGP++ EW +GG P WL DVPG+VFR+DN
Sbjct: 95 KGQYYFEGRYDLVKFVKLVGESGLYLHLRIGPYVCAEWNFGGFPVWLRDVPGVVFRTDNA 154
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T IV++M+ L + QGGPII+ QIENEYG +EHSF + G Y++WAA +
Sbjct: 155 PFKEEMQKFVTKIVDLMREEMLLSWQGGPIIMFQIENEYGNIEHSFGQGGKEYMKWAAGM 214
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMCKQ DAP+ +I+ACNG C + F PNSP KP WTE+W +Y +G
Sbjct: 215 ALALDAGVPWVMCKQTDAPENIIDACNGYYC-DGFK-PNSPKKPIFWTEDWDGWYTTWGG 272
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + GS+ NYYMY GGTNFGRT+ + +T Y AP+DEYG
Sbjct: 273 RLPHRPVEDLAFAVARFFQR-GGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 331
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGS----------- 373
LL +PKWGHLK+LH+A+KLC +++ S + KL QEA ++ GS
Sbjct: 332 LLSEPKWGHLKDLHAAIKLCEPALVAAD--SAQYIKLGPKQEAHVYGGSLSIQGMNFSQY 389
Query: 374 ---SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK------LDSV 424
S+C+AFL N D+R ATV F + LPP S+SILPDC+ FNTAK + +V
Sbjct: 390 GSQSKCSAFLANIDERQAATVRFLGQSFTLPPWSVSILPDCRNTVFNTAKVAAQTHIKTV 449
Query: 425 E-------------------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASD 459
E W KE I + E + +LE +N TKD SD
Sbjct: 450 EFVLPLSNSSLLPQFIVQNEDSPQSTSWLIAKEPITLWSEENFTVKGILEHLNVTKDESD 509
Query: 460 YLWYNFRFKHDPSD--------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLE 511
YLWY R D + + S+ VL FING+ GS G H K+
Sbjct: 510 YLWYFTRIYVSDDDIAFWEKNKVSPAVSIDSMRDVLRVFINGQLTGSVVG-HWVKAV--- 565
Query: 512 KMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQV 569
+ V G N + LLS VGL + GA+LER AG + + + G K D S+ SW YQV
Sbjct: 566 QPVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFKGQIKLTGFKNGDIDLSNLSWTYQV 625
Query: 570 GLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
GL GE L++++ + WS +T TWYKT FDAP+G DPVA++L SMGKG+A
Sbjct: 626 GLKGEFLKVYSTGDNEKFEWSELAVDATPSTFTWYKTFFDAPSGVDPVALDLGSMGKGQA 685
Query: 629 WVNGQSIGRYWVSFLTPQ---------------------GTPSQSWYHIPRSFLKPTGNL 667
WVNG IGRYW + ++P+ G P+Q+WYH+PR++L+ + NL
Sbjct: 686 WVNGHHIGRYW-TVVSPKDGCGSCDYRGAYSSGKCRTNCGNPTQTWYHVPRAWLEASNNL 744
Query: 668 LVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKV 727
LV+ EE G P IS+ S +C VS+SH PP+ W + + P++
Sbjct: 745 LVVFEETGGNPFEISVKLRSAKVICAQVSESHYPPLRKWSRADLTGGNISRN--DMTPEM 802
Query: 728 QIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEK 787
++C G +S I FASYG PNG+C+ ++ G+CH+SNS ++V +AC GK C + + +
Sbjct: 803 HLKCQDGHIMSSIEFASYGTPNGSCQKFSRGNCHASNSSSVVTEACQGKNKCDIAI-SNA 861
Query: 788 FYGDPCPGIPKALLVDAQC 806
+GDPC G+ K L V+A+C
Sbjct: 862 VFGDPCRGVIKTLAVEARC 880
>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
Length = 911
Score = 706 bits (1821), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/801 (44%), Positives = 493/801 (61%), Gaps = 32/801 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+I+G R + FSG+IHYPRS P+MW +L+ AK GGL+ ++T VFWN HE
Sbjct: 33 GTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG++ F GR DL+RF+ ++ +Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK M+++ IV +K A ++A QGGPIILSQIENEYG ++ +G Y+ WAA
Sbjct: 153 NEPFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ GVPWVMCKQ AP VI CNGR CG+T+ + +KP +WTENWT+ ++ +
Sbjct: 213 EMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDK-NKPRLWTENWTAQFRTF 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ RSAEDIAY V F AK G+ VNYYMYHGGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 272 GDQLAQRSAEDIAYAVLRFFAK-GGTLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEY 330
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ +K K L G EA ++ + C +FL N +
Sbjct: 331 GMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNN 390
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQ---WEE 429
+ TV F + +P S+SIL DCKTV +NT ++ D + WE
Sbjct: 391 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNVWEM 450
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y EAIP + +T +R LEQ N TKD SDYLWY +FR + D D V+++ S
Sbjct: 451 YSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIKST 510
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F N FVG+ G +KSF EK + L G N++++LS +G+ DSG L
Sbjct: 511 AHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVEVK 570
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G+++ +QG D WG++ L GE +I+T+ G W + P+TW
Sbjct: 571 GGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWK--PAENDLPITW 628
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK FD P G DP+ +++ SM KG +VNG+ IGRYW SF+T G PSQS YHIPR+FLK
Sbjct: 629 YKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLAGHPSQSVYHIPRAFLK 688
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
P GNLL++ EEE G P GI I TV +C +S+ + + +W S + +
Sbjct: 689 PKGNLLIIFEEELGKPGGILIQTVRRDDICVFISEHNPAQIKTWESDGGQIKLIAEDTST 748
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
R + CP R I +++FAS+GNP G C N+ G+CH+ +++AIVEK CLGK SC +P
Sbjct: 749 RG---TLNCPPKRTIQEVVFASFGNPEGACGNFTAGTCHTPDAKAIVEKECLGKESCVLP 805
Query: 783 VWTEKFYGD-PCPGIPKALLV 802
V + D CP L V
Sbjct: 806 VVNTVYGADINCPATTATLAV 826
>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 929
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/857 (44%), Positives = 507/857 (59%), Gaps = 91/857 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+LIING R++L S IHYPR+TP+MWP L+ K+KEGG DVVQ+ VFWN HEP+
Sbjct: 34 NVTYDQRALIINGQRRMLISAGIHYPRATPEMWPSLVQKSKEGGADVVQSYVFWNGHEPK 93
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR DLV+FIK VQ GLY LRIGP++ EW +GG P+WL D+PGIVFR+DNE
Sbjct: 94 QGQYNFEGRYDLVKFIKVVQQAGLYFHLRIGPYVCAEWNFGGFPYWLKDIPGIVFRTDNE 153
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + + IVN+MK +L+A QGGPII++QIENEYG +E +F + G Y WAA+L
Sbjct: 154 PFKVAMEGFVSKIVNLMKENQLFAWQGGPIIMAQIENEYGNIEWAFGDGGKRYAMWAAEL 213
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMC+QDDAP +IN CNG C A N+ KPA WTE+W ++Q +G
Sbjct: 214 ALGLDAGVPWVMCQQDDAPGNIINTCNGYYCDGFKA--NTATKPAFWTEDWNGWFQYWGQ 271
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED A+ +A F + GS+ NYYMY GGTNF RTA +T YD APLDEYG
Sbjct: 272 SVPHRPVEDNAFAIARFFQR-GGSFQNYYMYFGGTNFARTAGGPFMTTSYDYDAPLDEYG 330
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSG---VLVSMNFSKLQEAFIFQGSSECAAFLVNKD 384
L+RQPKWGHL++LH+A+KLC +P L+ V +S EA ++ G +CAAFL N D
Sbjct: 331 LIRQPKWGHLRDLHAAIKLC-EPALTAVDEVPLSTWLGPNVEAHVYSGRGQCAAFLANID 389
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------------- 425
ATV F Y LPP S+SILPDCK V FNTA++ +
Sbjct: 390 SWKIATVQFKGKAYVLPPWSVSILPDCKNVVFNTAQVGAQTTLTRMTIVRSKLEGEVVMP 449
Query: 426 -----------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFK 468
+WE E + +L +N LLEQ+N TKD++DYLWY+ K
Sbjct: 450 SNMLRKHAPESIVGSGLKWEASVEPVGIRGAATLVSNRLLEQLNITKDSTDYLWYSISIK 509
Query: 469 --------HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGT 520
+ S+++L + S+ +H F+N + VGSA G + + V L G
Sbjct: 510 VSVEAVTALSKTKSQAILVLGSMRDAVHIFVNRQLVGSAMG----SDVQVVQPVPLKEGK 565
Query: 521 NNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGA--KELKDFSSFSWGYQVGLLGEKLQI 578
N++ LLS+ VGL + GAYLE AG+R ++ + D S+ W YQVG+ GE+ ++
Sbjct: 566 NDIDLLSMTVGLQNYGAYLETWGAGIRGSALLRGLPSGVLDLSTERWSYQVGIQGEEKRL 625
Query: 579 FTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGR 637
F + + W S LTWYKT FDAP G+DPVA++L SMGKG+AWVNG +GR
Sbjct: 626 FETGTADGIQWDSSSSFPNASALTWYKTTFDAPKGTDPVALDLGSMGKGQAWVNGHHMGR 685
Query: 638 YWVSFLTPQ---------------------GTPSQSW-----YHIPRSFLKPTGNLLVLL 671
YW S L Q G PSQ W YHIPR++L+ + NLLVL
Sbjct: 686 YWPSVLASQSGCSTCDYRGAYDADKCRTNCGKPSQRWQYVDMYHIPRAWLQLSNNLLVLF 745
Query: 672 EEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRC 731
EE G +S+ T S +C HV +S PPV+ W + + + + R + + C
Sbjct: 746 EEIGGDVSKVSLVTRSAPAVCTHVHESQPPPVLFWPANS-----SMDAMSSRSGEAVLEC 800
Query: 732 PSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKF-YG 790
+G+ I I FAS+GNP G+C N+ G+CH+ S + KAC+G C++PV + F
Sbjct: 801 IAGQHIRHIKFASFGNPKGSCGNFQRGTCHAMKSLEVARKACMGMHRCSIPVQWQTFGEF 860
Query: 791 DPCPGIPKALLVDAQCT 807
DPCP + K+L V C+
Sbjct: 861 DPCPDVSKSLAVQVFCS 877
>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
Length = 839
Score = 704 bits (1816), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/806 (43%), Positives = 498/806 (61%), Gaps = 34/806 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G VTYD SL+I+G R++ FSG+IHYPRS QMWP+L+ AKEGGL+ ++T VFWN HE
Sbjct: 35 GTTVTYDKYSLMIDGRRELFFSGAIHYPRSPTQMWPKLLKTAKEGGLNTIETYVFWNAHE 94
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG+F+F GR D+++F+K +Q+ G+Y +RIGPFI+GEW +G LP+WL ++P I+FR++
Sbjct: 95 PEPGKFNFEGRNDMIKFLKLIQSFGMYAIVRIGPFIQGEWNHGALPYWLREIPHIIFRAN 154
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K M+++ IV M+K L+ASQGG +IL+QIENEYG ++ + +G Y+ WAA
Sbjct: 155 NEPYKREMEKFVRFIVQMLKDENLFASQGGNVILAQIENEYGNIKKDHITEGDKYLEWAA 214
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ GVPW+MCKQ AP VI CNGR CG+T+ + +KP +WTENWT+ ++ +
Sbjct: 215 EMAISTNIGVPWIMCKQSTAPGVVIPTCNGRHCGDTWIMKDE-NKPHLWTENWTAQFRAF 273
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G++ RSAEDIAY V F AK G+ VNYYMY+GGTNFGRT ++YVLTGYYD+ P+DEY
Sbjct: 274 GNDLAQRSAEDIAYSVLRFFAK-GGTLVNYYMYYGGTNFGRTGASYVLTGYYDEGPIDEY 332
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ + PK+GHL++LH+ +K + L G + EA F+ E C AF+ N +
Sbjct: 333 GMPKAPKYGHLRDLHNVIKSYSRAFLEGKQSFELLGQGYEARNFEIPEEKLCLAFISNNN 392
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------DSVEQ------WEE 429
+ TV F Y +P S+SIL DCK V +NT ++ E+ WE
Sbjct: 393 TGEDGTVIFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHKAEKATKNNVWEM 452
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
+ E IP Y +T++R LEQ N TKD SDYLWY +FR + D D V+ V S
Sbjct: 453 FSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWYTTSFRLEADDLPIRGDIRPVIAVKST 512
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F+N F G+ HG +K FT E + L G N+++LLS +G+ DSG L
Sbjct: 513 AHAMVGFVNDAFAGNGHGSKKEKFFTFETPISLRLGVNHLALLSSSMGMKDSGGELVELK 572
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G+++ +IQG D WG++ L GE +I+T+ G V W + + Q +TW
Sbjct: 573 GGIQDCTIQGLNTGTLDLQINGWGHKAKLEGEVKEIYTEKGMGAVKW--VPAVSGQAVTW 630
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK FD P G DPV +++ SM KG +VNG+ +GRYW S+ TP SQ+ YHIPR+FLK
Sbjct: 631 YKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGRYWTSYKTPGKVASQAVYHIPRTFLK 690
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
NLLV+ EEE G P GI I TV +C +S+ + + W + +
Sbjct: 691 SKNNLLVVFEEELGKPEGILIQTVRRDDICVFISEHNPAQIKPWDEHGGQIKLIAE---D 747
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
+ + CP + I +++FAS+GNP G+C N+ +G+CH+ N++ IVEK CLGK+ C +P
Sbjct: 748 HNTRGFLNCPPKKIIQEVVFASFGNPVGSCANFTVGTCHTPNAKEIVEKECLGKKGCVLP 807
Query: 783 VWTEKFYGDP--CPGIPKALLVDAQC 806
V FYG CP L V +C
Sbjct: 808 V-LHTFYGADINCPTTTATLAVQVRC 832
>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
Length = 849
Score = 702 bits (1811), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/827 (44%), Positives = 507/827 (61%), Gaps = 63/827 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++L+I+G R++L SGSIHYPR+TP++WP +I K+KEGGLDV++T VFWN HEP
Sbjct: 36 VTYDHKALVIDGKRRVLQSGSIHYPRTTPEVWPEIIRKSKEGGLDVIETYVFWNYHEPVR 95
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F GR DLVRF+K VQ GL+V LRIGP+ EW YGG P WLH +PG+ FR+ N+
Sbjct: 96 GQYYFEGRFDLVRFVKTVQEAGLFVHLRIGPYACAEWNYGGFPLWLHFIPGVQFRTSNDI 155
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK MK + T IV++MK L+ASQGGPIIL+Q+ENEYG V+ ++ G YV+WAA+ A
Sbjct: 156 FKNAMKSFLTKIVDLMKDDNLFASQGGPIILAQVENEYGNVQWAYGVGGELYVKWAAETA 215
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ L T VPWVMC Q+DAPDPVIN CNG C + PNSP KP +WTEN++ ++ +G
Sbjct: 216 ISLNTTVPWVMCVQEDAPDPVINTCNGFYCDQF--TPNSPSKPKMWTENYSGWFLAFGYA 273
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R ED+A+ VA F + GS+ NYYMY GGTNFGRTA ++ YD AP+DEYG
Sbjct: 274 VPYRPVEDLAFAVARFF-EYGGSFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 332
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF-SKLQEAFIFQGSSECAAFLVNKDKRN 387
+RQPKWGHL++LHSA+K C + ++S V +KL+ ++ S++CAAFL N D +
Sbjct: 333 IRQPKWGHLRDLHSAIKQCEEYLVSSDPVHQQLGNKLEAHVYYKHSNDCAAFLANYDSGS 392
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------------------LDSVEQ 426
+A V F+ Y LP S+SIL DCK V FNTAK L +
Sbjct: 393 DANVTFNGNTYFLPAWSVSILADCKNVIFNTAKVVTQRHIGDALFSRSTTVDGNLVAASP 452
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR-FKHDPSDSESVLKVSSLGH 485
W YKE + + S LLEQ+NTTKD SD+LWY+ + D E +L + SLGH
Sbjct: 453 WSWYKEEVGIWGNNSFTKPGLLEQINTTKDTSDFLWYSTSLYVEAGQDKEHLLNIESLGH 512
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
F+N FV +G H D SF+L + + L G N + +LS+++G+ + G + + + AG
Sbjct: 513 AALVFVNKRFVAFGYGNHDDASFSLTREISLEEGNNTLDVLSMLIGVQNYGPWFDVQGAG 572
Query: 546 LRNVS-IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS--THQPLTW 602
+ +V + K KD SS W YQVGL GE L + + WS+ G+S ++ L W
Sbjct: 573 IHSVFLVDLHKSKKDLSSGKWTYQVGLEGEYLGLDNVSLANSSLWSQ-GTSLPVNKSLIW 631
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
YK AP G+ P+A+NL SMGKG+AW+NGQSIGRYW ++L+P
Sbjct: 632 YKATIIAPEGNGPLALNLASMGKGQAWINGQSIGRYWSAYLSPSAGCTDNCDYRGAYNSF 691
Query: 647 ------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHL 700
G P+Q+ YHIPR+++ P NLLVL EE G P IS+ T + +C VS+
Sbjct: 692 KCQKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSQISLLTRTGQDICSIVSEDDP 751
Query: 701 PPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
PP SW K + + P+V++ C G I+ I FAS+G P G C + G+C
Sbjct: 752 PPADSW--------KPNLEFMSQSPEVRLTCEHGWHIAAINFASFGTPEGKCGTFTPGNC 803
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
H ++ IV+KAC+G C++P+ K GDPCPG+ K +V+A C+
Sbjct: 804 H-ADMLTIVQKACIGHERCSIPISAAKL-GDPCPGVVKRFVVEALCS 848
>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
Length = 889
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/860 (44%), Positives = 518/860 (60%), Gaps = 93/860 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LII+G R++L S IHYPR+TP+MWP LIAK+KEGG D++QT FWN HEP
Sbjct: 30 NVSYDHRALIIDGKRRMLISSGIHYPRATPEMWPDLIAKSKEGGADLIQTYAFWNGHEPI 89
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR D+V+FIK + GLY LRIGP++ EW +GG P WL D+PGI FR+DN
Sbjct: 90 RGQYNFEGRYDIVKFIKLAGSAGLYFHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 149
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+K M+R+ IV++M+ L++ QGGPIIL QIENEYG +E + ++G YV+WAA +
Sbjct: 150 PYKDEMQRFVKKIVDLMRQEMLFSWQGGPIILLQIENEYGNIERLYGQRGKDYVKWAADM 209
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMC+Q DAP+ +I+ACN C + F PNS KPA+WTE+W +Y +G
Sbjct: 210 AIGLGAGVPWVMCRQTDAPENIIDACNAFYC-DGFK-PNSYRKPALWTEDWNGWYTSWGG 267
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED A+ VA F + GSY NYYM+ GGTNFGRT+ + +T Y AP+DEYG
Sbjct: 268 RVPHRPVEDNAFAVARFFQR-GGSYHNYYMFFGGTNFGRTSGGPFYVTSYDYDAPIDEYG 326
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE--------- 375
LL QPKWGHLK+LHSA+KLC +P L V + + +L QEA +++ SS
Sbjct: 327 LLSQPKWGHLKDLHSAIKLC-EPALVAVDDAPQYIRLGPMQEAHVYRHSSYVEDQSSSTL 385
Query: 376 -----CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------V 424
C+AFL N D+ N+A V F +Y LPP S+SILPDCK VAFNTAK+ S V
Sbjct: 386 GNGTLCSAFLANIDEHNSANVKFLGQVYSLPPWSVSILPDCKNVAFNTAKVASQISVKTV 445
Query: 425 E--------------------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
E W KE I + + A +LE +N TKD S
Sbjct: 446 EFSSPFIENTTEPGYLLLHDGVHHISTNWMILKEPIGEWGGNNFTAEGILEHLNVTKDTS 505
Query: 459 DYLWYNFRFK--------HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
DYLWY R + S+ L + S+ V+ F+NG+ GS H + +
Sbjct: 506 DYLWYIMRLHISDEDISFWEASEVSPKLIIDSMRDVVRIFVNGQLAGS----HVGRWVRV 561
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQ 568
E+ V L+ G N +++LS VGL + GA+LE+ AG + + + G K + D ++ W YQ
Sbjct: 562 EQPVDLVQGYNELAILSETVGLQNYGAFLEKDGAGFKGQIKLTGLKSGEYDLTNSLWVYQ 621
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRY-GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGE 627
VGL GE ++IF+ W S TWYKT FDAP G DPV++ L SMGKG+
Sbjct: 622 VGLRGEFMKIFSLEEHESADWVDLPNDSVPSAFTWYKTFFDAPQGKDPVSLYLGSMGKGQ 681
Query: 628 AWVNGQSIGRYWVSFLTPQ---------------------GTPSQSWYHIPRSFLKPTGN 666
AWVNG SIGRYW S + P G P+QSWYHIPRS+L+P+ N
Sbjct: 682 AWVNGHSIGRYW-SLVAPVDGCQSCDYRGAYHESKCATNCGKPTQSWYHIPRSWLQPSKN 740
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
LLV+ EE G P IS+ S +++C VS+SH PP+ W ++ K I P+
Sbjct: 741 LLVIFEETGGNPLEISVKLHSTSSICTKVSESHYPPLHLWSHKDIVNGKV--SISNAVPE 798
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
+ ++C +G++IS I+FAS+G P G+C+ ++ G CH+ NS ++V +AC G+ +C++ V +
Sbjct: 799 IHLQCDNGQRISSIMFASFGTPQGSCQRFSQGDCHAPNSFSVVSEACQGRNNCSIGVSNK 858
Query: 787 KFYGDPCPGIPKALLVDAQC 806
F GDPC G+ K L V+A+C
Sbjct: 859 VFGGDPCRGVVKTLAVEAKC 878
>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
Length = 897
Score = 700 bits (1806), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/863 (43%), Positives = 506/863 (58%), Gaps = 98/863 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LII+GHR++L SG IHYPR+TPQMWP LIAK+KEGG+DV+QT VFWN HEP
Sbjct: 39 NVSYDHRALIIDGHRRMLISGGIHYPRATPQMWPDLIAKSKEGGVDVIQTYVFWNGHEPV 98
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+ F G+ DLV+F+K V GLY+ LRIGP++ EW +GG P WL D+PGIVFR+DN
Sbjct: 99 KGQYIFEGQYDLVKFVKLVGVSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIVFRTDNS 158
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF M+++ IV++M+ L++ QGGPII+ QIENEYG +EHSF G YV+WAA++
Sbjct: 159 PFMEEMQQFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNIEHSFGPGGKEYVKWAARM 218
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMC+Q DAP +I+ACN C PNS KP +WTE+W +Y +G
Sbjct: 219 ALGLGAGVPWVMCRQTDAPGSIIDACNEYYCDGY--KPNSNKKPILWTEDWDGWYTTWGG 276
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + GS+ NYYMY GGTNF RTA + +T Y AP+DEYG
Sbjct: 277 SLPHRPVEDLAFAVARFFQR-GGSFQNYYMYFGGTNFARTAGGPFYITSYDYDAPIDEYG 335
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGS----------- 373
LL +PKWGHLK+LH+A+KLC +++ S + KL QEA +++ +
Sbjct: 336 LLSEPKWGHLKDLHAAIKLCEPALVAA--DSAQYIKLGSKQEAHVYRANVHAEGQNLTQH 393
Query: 374 ---SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK------LDSV 424
S+C+AFL N D+ TV F Y LPP S+S+LPDC+ FNTAK + S+
Sbjct: 394 GSQSKCSAFLANIDEHKAVTVRFLGQSYTLPPWSVSVLPDCRNAVFNTAKVAAQTSIKSM 453
Query: 425 E--------------------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
E W KE I + + +LE +N TKD S
Sbjct: 454 ELALPQFSGISAPKQLMAQNEGSYMSSSWMTVKEPISVWSGNNFTVEGILEHLNVTKDHS 513
Query: 459 DYLWYNFRFKHDPSD--------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
DYLWY R D +K+ S+ VL FING+ GS G+ +
Sbjct: 514 DYLWYFTRIYVSDDDIAFWEENNVHPAIKIDSMRDVLRVFINGQLTGSVIGRW----IKV 569
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQ 568
+ V G N + LLS VGL + GA+LER AG R + + G ++ D S+ W YQ
Sbjct: 570 VQPVQFQKGYNELVLLSQTVGLQNYGAFLERDGAGFRGHTKLTGFRDGDIDLSNLEWTYQ 629
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGE 627
VGL GE +I+T + W+ TWYKT FDAP+G+DPVA++L SMGKG+
Sbjct: 630 VGLQGENQKIYTTENNEKAEWTDLTLDDIPSTFTWYKTYFDAPSGADPVALDLGSMGKGQ 689
Query: 628 AWVNGQSIGRYWVSFLTPQ---------------------GTPSQSWYHIPRSFLKPTGN 666
AWVN IGRYW + + P+ G P+Q WYHIPRS+L+P+ N
Sbjct: 690 AWVNDHHIGRYW-TLVAPEEGCQKCDYRGAYNSEKCRTNCGKPTQIWYHIPRSWLQPSNN 748
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGR--R 724
LLV+ EE G P ISI S + +C VS++H PP+ W T + + G+
Sbjct: 749 LLVIFEETGGNPFEISIKLRSASVVCAQVSETHYPPLQRWI----HTDFIYGNVSGKDMT 804
Query: 725 PKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVW 784
P++Q+RC G IS I FASYG P G+C+ ++ G+CH+ NS ++V KAC G+ +C + +
Sbjct: 805 PEIQLRCQDGYVISSIEFASYGTPQGSCQKFSRGNCHAPNSLSVVSKACQGRDTCNIAIS 864
Query: 785 TEKFYGDPCPGIPKALLVDAQCT 807
F GDPC GI K L V+A+C+
Sbjct: 865 NAVFGGDPCRGIVKTLAVEAKCS 887
>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 916
Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/861 (43%), Positives = 515/861 (59%), Gaps = 100/861 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+++I+G R++L S IHYPR+TP+MWP +I AK+GG DVVQT VFWN HEP+
Sbjct: 31 NVTYDQRAVLIDGERRMLISAGIHYPRATPEMWPSIIQHAKDGGADVVQTYVFWNGHEPE 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR DLV+FIK V+ GLY LRIGP++ EW +GG P+WL ++PGIVFR+DNE
Sbjct: 91 QGQYNFEGRYDLVKFIKLVKQAGLYFHLRIGPYVCAEWNFGGFPYWLKEIPGIVFRTDNE 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + + IVN+MK L++ QGGPII++QIENEYG +E F + G YV+WAA +
Sbjct: 151 PFKVAMQGFTSKIVNLMKENELFSWQGGPIIMAQIENEYGDIESQFGDGGKRYVQWAADM 210
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L T VPW+MCKQ+DAP +IN CNG C PN+ KP +WTE+W ++Q +G
Sbjct: 211 ALSLDTRVPWIMCKQEDAPANIINTCNGFYCDGW--KPNTALKPILWTEDWNGWFQNWGQ 268
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
A R ED A+ VA F + GS+ NYYMY GGTNF RTA +T YD AP+DEYG
Sbjct: 269 AAPHRPVEDNAFAVARFFQR-GGSFQNYYMYFGGTNFARTAGGPFMTTTYDYDAPIDEYG 327
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLV---SMNFSKLQEAFIFQGSSECAAFLVNKD 384
L+RQPKWGHLK+LH+A+KLC +P L+ V S QEA + + CAAFL N D
Sbjct: 328 LIRQPKWGHLKDLHAAIKLC-EPALTAVDTVPQSTWIGSNQEAHEYSANGHCAAFLANID 386
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL----------------------- 421
N+ TV F Y LP S+SILPDCK VAFNTA++
Sbjct: 387 SENSVTVQFQGESYVLPAWSVSILPDCKNVAFNTAQIGAQTTVTRMRIAPSNSRGDIFLP 446
Query: 422 ------DSVE--------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF 467
D + +W+ E + +N LLEQ+N TKD SDYLWY+
Sbjct: 447 SNTLVHDHISDGGVFANLKWQASAEPFGIRGSGTTVSNSLLEQLNITKDTSDYLWYSTSI 506
Query: 468 -------KHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGT 520
D S +E+ L + ++ +H F+NG+ GSA G + + + + L +G
Sbjct: 507 TITSEGVTSDVSGTEANLVLGTMRDAVHIFVNGKLAGSAMGWN----IQVVQPITLKDGK 562
Query: 521 NNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQI 578
N++ LLS+ +GL + GAYLE AG+R +VS+ G S+ W YQVGL GE+L++
Sbjct: 563 NSIDLLSMTLGLQNYGAYLETWGAGIRGSVSVTGLPYGNLSLSTAEWSYQVGLRGEELKL 622
Query: 579 FTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY 638
F + + W + LTWYKT FDAP G+DPVA++L SMGKG+AW+NG +GRY
Sbjct: 623 FHNGTADGFSWDSSSFTNASYLTWYKTTFDAPGGTDPVALDLGSMGKGQAWINGHHLGRY 682
Query: 639 WVSFLTPQ---------------------GTPSQSW-------YHIPRSFLKPTGNLLVL 670
++ + PQ G PSQ W YHIPR++L+ TGNLLVL
Sbjct: 683 FL-MVAPQSGCETCDYRGAYNTNKCRTNCGEPSQRWQVIHFQMYHIPRAWLQATGNLLVL 741
Query: 671 LEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG--RRPKVQ 728
EE G +S+ T S +C H+++S PP+ +WR H+ I ++
Sbjct: 742 FEEIGGDISKVSVVTRSAHAVCAHINESQPPPIRTWRP--------HRSIDAFNNPAEML 793
Query: 729 IRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKF 788
+ C +G+ I+KI FAS+GNP G+C ++ G+CH++ S V K C+GK+ C +PV KF
Sbjct: 794 LECAAGQHITKIKFASFGNPRGSCGHFQHGTCHANKSMEAVRKVCIGKQQCYIPV-QRKF 852
Query: 789 YG--DPCPGIPKALLVDAQCT 807
+G DPCPG+ K+L V C+
Sbjct: 853 FGSIDPCPGVSKSLAVQVHCS 873
>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 851
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/834 (44%), Positives = 497/834 (59%), Gaps = 65/834 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD RSLII+G RK+L S +IHYPRS P+MWP+L+ AKEGG+DV++T VFWN HEP
Sbjct: 28 NVSYDSRSLIIDGQRKLLISAAIHYPRSVPEMWPKLVQTAKEGGVDVIETYVFWNGHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG + F GR DLV+F+K V+ G+++ LRIGPF+ EW +GG+P WLH VPG VFR++N+
Sbjct: 88 PGNYYFGGRYDLVKFVKIVEQAGMHLILRIGPFVAAEWYFGGIPVWLHYVPGTVFRTENK 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK+HM+++ T IV++MK + +ASQGGPIIL+Q+ENEYG E + E G Y WAA +
Sbjct: 148 PFKYHMQKFTTFIVDLMKQEKFFASQGGPIILAQVENEYGYYEKDYGEGGKQYAMWAASM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV GVPW+MC+Q DAP+ VIN CN C + P +KP IWTENW +++ +G
Sbjct: 208 AVSQNIGVPWIMCQQFDAPESVINTCNSFYCDQF--TPIYQNKPKIWTENWPGWFKTFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AEDIA+ VA F K GS NYYMYHGGTNFGRT+ +T YD +AP+DEYG
Sbjct: 266 WNPHRPAEDIAFSVARFFQK-GGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLK+LH A+KLC ML+ +++ EA +F SS CAAF+ N D +
Sbjct: 325 LPRLPKWGHLKQLHRAIKLCEHIMLNSQPTNVSLGPSLEADVFTNSSGACAAFIANMDDK 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE----------------- 425
N+ TV F N+ Y LP S+SILPDCK V FNTAK+ S VE
Sbjct: 385 NDKTVEFRNMSYHLPAWSVSILPDCKNVVFNTAKVGSQSSVVEMLPESLQLSVGSADKSL 444
Query: 426 ---QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SES 476
+W+ + E + E + L++ +NTTK +DYLWY ++ S
Sbjct: 445 KDLKWDVFVEKAGIWGEADFVKSGLVDHINTTKFTTDYLWYTTSILVGENEEFLKKGSSP 504
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
VL + S GH +HAF+N E SA G + F L+ + L G N+++LLS+ VGL ++G
Sbjct: 505 VLLIESKGHAVHAFVNQELQASAAGNGTHFPFKLKAPISLKEGKNDIALLSMTVGLQNAG 564
Query: 537 AYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGS 594
++ E AGL +V IQG D S+++W Y++GL GE + + G V W S
Sbjct: 565 SFYEWVGAGLTSVKIQGFNNGTIDLSAYNWTYKIGLEGEHQGLDKEEGFGNVNWISASEP 624
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV-------------- 640
QPLTWYK + D P G DPV +++I MGKG AW+NG+ IGRYW
Sbjct: 625 PKEQPLTWYKVIVDPPPGDDPVGLDMIHMGKGLAWLNGEEIGRYWPRKGPLHGCVKECNY 684
Query: 641 -------SFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCG 693
T G P+Q WYH+PRS+ K +GN+LV+ EE+ G P I +T +C
Sbjct: 685 RGKFDPDKCNTGCGEPTQRWYHVPRSWFKQSGNVLVIFEEKGGDPSKIEFSRRKITGVCA 744
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCE 753
V++++ P I S N + ++K + + + CP IS + FAS+GNP G C
Sbjct: 745 LVAENY--PSIDLESWNDGS-GSNKTV----ATIHLGCPEDTHISSVKFASFGNPTGACR 797
Query: 754 NYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+Y G CH NS ++VEK CL K C + + E F C PK L V+ QC
Sbjct: 798 SYTQGDCHDPNSISVVEKVCLNKNRCDIELTGENFNKGSCLSEPKKLAVEVQCN 851
>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
Length = 803
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/805 (44%), Positives = 488/805 (60%), Gaps = 66/805 (8%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+ VTYD RSL+I+G R + FSG+IHYPRS P++WP+L+ +AKEGGL+ ++T +FWN HE
Sbjct: 33 GSVVTYDARSLLIDGKRDLFFSGAIHYPRSPPEVWPKLLDRAKEGGLNTIETYIFWNAHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG+++F GR DLV+F+K +Q G+Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYNFEGRLDLVKFLKMIQEHGMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+P+K M+++ +V +K A L+ASQGGP+IL+QIENEYG ++ +G Y+ WAA
Sbjct: 153 NDPYKKEMEKWTRFVVQKLKDAELFASQGGPVILTQIENEYGNIKKDHKIEGDKYLEWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ QTGVPW+MCKQ AP VI CNGR CG+T+ +KP +WTENWT ++ Y
Sbjct: 213 QMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWT-LRDKNKPMLWTENWTQQFRAY 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ +RSAEDIAY V F AK GS VNYYMYHGGTNFGRT+++YVLTGYYD+APLDEY
Sbjct: 272 GDQLAMRSAEDIAYAVLRFFAK-GGSMVNYYMYHGGTNFGRTSASYVLTGYYDEAPLDEY 330
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ ++ K LSG S EA IF+ E C +FL N +
Sbjct: 331 GMYKEPKFGHLRDLHNVIRSYQKAFLSGKHSSEILGHGYEAQIFELPEENLCLSFLSNNN 390
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------DSVEQWEE 429
+ TV F + + +P S+SIL CK V +NT ++ QWE
Sbjct: 391 TGEDGTVIFRGVKHYVPSRSVSILAGCKDVVYNTKRVFVQHSERSYHTSEVTSKNNQWEM 450
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y E +P Y +T +R LEQ N TKDASDYLWY +FR + D D VL+V S
Sbjct: 451 YSEMVPKYKDTKIRTKEPLEQYNQTKDASDYLWYTTSFRLESDDLPFRGDIRPVLQVKSS 510
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F N FVGSA G K F EK V L G N+V LLS +G+ DSG L
Sbjct: 511 AHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLKAGVNHVVLLSSTMGMKDSGGELAEVK 570
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G++ IQG D WG++ RY
Sbjct: 571 GGIQECLIQGLNTGTLDLQVNGWGHK----------------------RY---------- 598
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
FD P G DP+ +++ SM KG +VNG+ IGRYWVSF T GTPSQ+ YHIPR FLK
Sbjct: 599 ----FDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYWVSFRTLAGTPSQAVYHIPRPFLK 654
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
P NLLV+ EEE G P GI + TV+ +C +S+ + + +W + +K
Sbjct: 655 PKDNLLVVFEEEMGKPDGILVQTVTRDDICLLISEHNPGQIKTWDTDG---VKIKLIAED 711
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
+ + CP + I +++FAS+GNP+G C N+ +G+CH+ N++ IVEK CLGK SC +P
Sbjct: 712 HSVRGTLMCPPEKIIQEVVFASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLP 771
Query: 783 VWTEKFYGD-PCPGIPKALLVDAQC 806
V + D C L V +C
Sbjct: 772 VDHTVYGADINCQSTTGTLGVQVRC 796
>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
lyrata]
Length = 887
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 389/878 (44%), Positives = 514/878 (58%), Gaps = 84/878 (9%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
Q Q+L L LL G NV+YD R+LII R++L S IHYPR+TP+MW
Sbjct: 11 QWQILSLIIALLVYFPIVSGSFFKPFNVSYDHRALIIADKRRMLVSAGIHYPRATPEMWS 70
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFI 122
LI K+KEGG DV+QT VFW+ HEP GQ++F GR DLV+F+K + + GLY+ LRIGP++
Sbjct: 71 DLIEKSKEGGADVIQTYVFWSGHEPVKGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYV 130
Query: 123 EGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
EW +GG P WL D+PGI FR+DNEPFK M+++ T IV++M+ A+L+ QGGPII+ Q
Sbjct: 131 CAEWNFGGFPVWLRDIPGIQFRTDNEPFKKEMQKFVTKIVDLMRDAKLFCWQGGPIIMLQ 190
Query: 183 IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET 242
IENEYG VE S+ +KG YV+WAA +A+ L GVPWVMCKQ DAP+ +I+ACNG C +
Sbjct: 191 IENEYGDVEKSYGQKGKDYVKWAASMALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DG 249
Query: 243 FAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
F PNS KP +WTE+W +Y +G R AED+A+ VA F + GS+ NYYMY GG
Sbjct: 250 FK-PNSQMKPILWTEDWDGWYTKWGGSLPHRPAEDLAFAVARFYQR-GGSFQNYYMYFGG 307
Query: 303 TNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
TNFGRT+ + +T Y APLDEYGL +PKWGHLK+LH+A+KLC +++ + +
Sbjct: 308 TNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAA--DAPQY 365
Query: 362 SKL---QEAFIFQGSSE-----CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKT 413
KL QEA I++G E CAAFL N D+ +A V F+ Y LPP S+SILPDC+
Sbjct: 366 RKLGSNQEAHIYRGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRH 425
Query: 414 VAFNTAKL----------------------------DSV----EQWEEYKEAIPTYDETS 441
VAFNTAK+ D+V + W KE I + E +
Sbjct: 426 VAFNTAKVGAQTSVKTVESARPSLGSKSILQKVVRQDNVSYISKSWMALKEPIGIWGENN 485
Query: 442 LRANFLLEQMNTTKDASDYLWYNFRFKHDPSD--------SESVLKVSSLGHVLHAFING 493
LLE +N TKD SDYLW+ R D + + + S+ VL F+N
Sbjct: 486 FTFQGLLEHLNVTKDRSDYLWHKTRITVSEDDISFWKKNGANPTVSIDSMRDVLRVFVNK 545
Query: 494 EFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQ 552
+ GS G H K+ + V + G N++ LL+ VGL + GA+LE+ AG R +
Sbjct: 546 QLSGSVVG-HWVKAV---QPVRFMQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLT 601
Query: 553 GAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL-TWYKTVFDAP 610
G K D + SW YQVGL GE +I+T + WS + + WYKT FD P
Sbjct: 602 GFKNGDMDLAKSSWTYQVGLKGEAEKIYTVEHNEKAEWSTLETDASPSIFMWYKTYFDTP 661
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVSF---------------------LTPQGTP 649
G+DPV ++L SMGKG+AWVNG IGRYW T G P
Sbjct: 662 AGTDPVVLDLESMGKGQAWVNGHHIGRYWNIISQKDGCERTCDYRGAYYSDKCTTNCGKP 721
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ 709
+Q+ YH+PRS+LKP+ NLLVL EE G P IS+ TV+ LCG V +SH PP+ W +
Sbjct: 722 TQTRYHVPRSWLKPSSNLLVLFEETGGNPFNISVKTVTAGILCGQVLESHYPPLRKWSTP 781
Query: 710 NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIV 769
+ + I P+V + C G IS I FASYG P G+C+ ++IG CH+SNS +IV
Sbjct: 782 DY--INGTMSINSVAPEVYLHCEDGHVISSIEFASYGTPRGSCDRFSIGKCHASNSLSIV 839
Query: 770 EKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+AC G+ SC + V F DPC G K L V A+C+
Sbjct: 840 SEACKGRTSCFIEVSNTAFRSDPCSGTLKTLAVMARCS 877
>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
Length = 802
Score = 697 bits (1798), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/818 (46%), Positives = 487/818 (59%), Gaps = 74/818 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD RSLI+NG R+IL SGS+HYPR+TP+MWP +I KAKEGGLDV++T VFW+ HEP
Sbjct: 19 NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F GR DLV+F+K VQ GL V LRIGP++ EW GG P WL D+P IVFR+DNE
Sbjct: 79 PGQYYFEGRYDLVKFVKLVQQAGLLVNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK +M+ + T IVNMMK L+ASQGGPIIL+Q+ENEYG V+ + E G Y+ WAA++
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A TGVPW+MC Q P+ +I+ CNG C P KP +WTE++T ++ YG
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDG--WNPTLYKKPTMWTESYTGWFTYYGW 256
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R EDIA+ VA F + GS+ NYYMY GGTNFGRT+ YV + Y APLDEYG
Sbjct: 257 PLPHRPVEDIAFAVARFFER-GGSFHNYYMYFGGTNFGRTSGGPYVASSYDYDAPLDEYG 315
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
+ PKWGHLK+LH +KL + +LS QEA ++ + C AFL N D N
Sbjct: 316 MQHLPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNGCVAFLANVDSMN 375
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIP 435
+ V F N+ Y LP S+SI+ DCKTVAFN+AK+ S W + E +
Sbjct: 376 DTVVEFRNVSYSLPAWSVSIVLDCKTVAFNSAKVKSQSAVVSMNPSKSSLSWTSFDEPVG 435
Query: 436 TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEF 495
+S +A LLEQM TTKD SDYLWY R+ + L + S+ V+H F+NG+F
Sbjct: 436 I-SGSSFKAKQLLEQMETTKDTSDYLWYTTRYA--TGTGSTWLSIESMRDVVHIFVNGQF 492
Query: 496 VGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAK 555
S H S ++E + L G+N ++LLS VGL + GA++E AGL I
Sbjct: 493 QSSWHTSKSVLYNSVEAPIKLAPGSNTIALLSATVGLQNFGAFIETWSAGLSGSLILKGL 552
Query: 556 ELKD--FSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGS 613
D S W YQVGL GE L++FT GSR V WS ST +PLTWY T FDAP G
Sbjct: 553 PGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWS--AVSTKKPLTWYMTEFDAPPGD 610
Query: 614 DPVAINLISMGKGEAWVNGQSIGRYWVSF----------------------LTPQGTPSQ 651
DPVA++L SMGKG+AWVNGQSIGRYW ++ LT G SQ
Sbjct: 611 DPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGSYDQNKCLTGCGQSSQ 670
Query: 652 SWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQ 711
WYH+PRS++KP GNLLVL EE G P I T S +C V +SH V W
Sbjct: 671 RWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICARVYESHPASVKLW----- 725
Query: 712 RTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFASYGNPNGNCENYAIGSCHSSNSRAIVE 770
CP ++ IS+I FAS GNP G+C ++ GSCH+++ VE
Sbjct: 726 -------------------CPGEKQVISQIRFASLGNPEGSCGSFKEGSCHTNDLSNTVE 766
Query: 771 KACLGKRSCTVPVWTEKFYGDPCPGI-PKALLVDAQCT 807
KAC+G+RSC++ F CPG+ K L V+A C+
Sbjct: 767 KACVGQRSCSL---APDFTTSACPGVREKFLAVEALCS 801
>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
Length = 1036
Score = 696 bits (1796), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/750 (46%), Positives = 481/750 (64%), Gaps = 48/750 (6%)
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+DF GR DLV+FIK + +GLYV LR+GPFI+ EW +GGLP+WL +VP + FR++NEPF
Sbjct: 80 QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPF 139
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K H +RY I+ MMK +L+ASQGGPIIL QIENEY V+ ++ E G Y++WAA L
Sbjct: 140 KEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVE 199
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
+ G+PWVMCKQ+DAP +INACNGR CG+TF GPN DKP++WTENWT+ ++V+GD
Sbjct: 200 SMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPP 259
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLR 330
R+ EDIA+ VA + +K GS+VNYYMYHGGTNFGRT++ +V T YYD APLDE+GL +
Sbjct: 260 TQRTVEDIAFSVARYFSK-NGSHVNYYMYHGGTNFGRTSAHFVTTRYYDDAPLDEFGLEK 318
Query: 331 QPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRNN 388
PK+GHLK +H A++LC K + G L + E ++ G+ CAAFL N + R+
Sbjct: 319 APKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDT 378
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW---------------EEYKEA 433
T+ F Y LP SISILPDCKTV +NTA++ + W E + E
Sbjct: 379 NTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRDFVKSEKTSKGLKFEMFSEN 438
Query: 434 IPTYDETSLRANFLL--EQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
IP+ L + L+ E TKD +DY WY K D D +++L+V+SLGH
Sbjct: 439 IPSL----LDGDSLIPGELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGH 494
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
L ++NGE+ G AHG+H KSF K V+ G N +S+L V+ GLPDSG+Y+E R AG
Sbjct: 495 ALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAG 554
Query: 546 LRNVSIQGAKE-LKDFS-SFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWY 603
R +SI G K +D + + WG+ GL GEK +++T+ GS+ V W + G +PLTWY
Sbjct: 555 PRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGK--RKPLTWY 612
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK- 662
KT F+ P G + VAI + +MGKG WVNG +GRYW+SFL+P G P+Q+ YHIPRSF+K
Sbjct: 613 KTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSPLGEPTQTEYHIPRSFMKG 672
Query: 663 -PTGNLLVLLEEENGYPPGI---SIDTVSVT--TLCGHVSDSHLPPVISWRSQNQRTLKT 716
N+LV+LEEE PG+ SID V V T+C +V + + V SW+ + + +
Sbjct: 673 EKKKNMLVILEEE----PGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSR 728
Query: 717 HKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGK 776
K + R K +RCP +++ ++ FAS+G+P G C N+ +G C +S S+ +VEK CLG+
Sbjct: 729 SKDM---RLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGR 785
Query: 777 RSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C++ V E F CP I K L V +C
Sbjct: 786 NYCSIVVARETFGDKGCPEIVKTLAVQVKC 815
>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 822
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/805 (44%), Positives = 487/805 (60%), Gaps = 47/805 (5%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+ VTYDGRSL+I+G R + FSG+IHYPRS P++WP+LI +AKEGGL+ ++T +FWN HE
Sbjct: 33 GSVVTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG+++F GR DL++++K +Q +Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+P+K M+++ IV +K A L+ASQGGPIIL+QIENEYG ++ G Y+ WAA
Sbjct: 153 NDPYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ QTGVPW+MCKQ AP VI CNGR CG+T+ +KP +WTENWT ++ Y
Sbjct: 213 QMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWT-LRDKNKPMLWTENWTQQFRAY 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ +RSAEDIAY V F AK GS VNYYMYHGGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 272 GDQVAMRSAEDIAYAVLRFFAK-GGSLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEY 330
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ ++ K L G S EA IF+ E C +FL N +
Sbjct: 331 GMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNN 390
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------DSVEQWEE 429
+ TV F + +P S+SIL CK V +NT ++ QWE
Sbjct: 391 TGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSYHTSEVTSKNNQWEM 450
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y E IP Y +T +R LEQ N TKDASDYLWY +FR + D +D VL+V S
Sbjct: 451 YSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVKSS 510
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F N FVG A G K F EK V L G N+V LLS +G+ DSG L
Sbjct: 511 AHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAEVK 570
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
+G++ IQG D WG++ L GE +I+++ G V W + + TW
Sbjct: 571 SGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWKP--AENGRAATW 628
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK FD P G DPV +++ SM KG +VNG+ +GRYWVS+ T GTPSQ+ YHIPR FLK
Sbjct: 629 YKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSYRTLAGTPSQALYHIPRPFLK 688
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
NLLV+ EEE G P GI + TV+ +C +S+ + + +W + + +K
Sbjct: 689 SKDNLLVVFEEEMGKPDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDK-IKLIAEDHS 747
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
RR + CP + I +++FAS+GNP G C N+ CLGK SC +P
Sbjct: 748 RRG--TLMCPPEKTIQEVVFASFGNPEGMCGNFT---------------ECLGKPSCMLP 790
Query: 783 VWTEKFYGD-PCPGIPKALLVDAQC 806
V + D C L V +C
Sbjct: 791 VDHTVYGADINCQSTTATLGVQVRC 815
>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
Length = 833
Score = 693 bits (1788), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/802 (43%), Positives = 493/802 (61%), Gaps = 30/802 (3%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+I+G R + FSG+IHYPRS P MW +L+ AK+GGL+ ++T VFWN HE
Sbjct: 32 GTVVSYDERSLLIDGKRDLFFSGAIHYPRSPPDMWHKLLKTAKDGGLNTIETYVFWNAHE 91
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG+++F GR DL++F+K +Q+ +Y +RIGPFI+ EW +GGLP+WL ++P I+FR++
Sbjct: 92 PEPGKYNFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRAN 151
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K M+++ IV +K A ++ASQGGP+IL+QIENEYG ++ + +G Y+ WAA
Sbjct: 152 NEPYKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAA 211
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ TGVPW+MCKQ AP VI CNGR CG+T+ + +KP +WTENWT+ ++ +
Sbjct: 212 QMAISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDK-NKPRLWTENWTAQFRAF 270
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYM-YHGGTNFGRTASAYVLTGYYDQAPLDE 325
GD+ +RSAEDIAY V F AK G+ VNYYM Y+GGTNFGRT ++YVLTGYYD+ P+DE
Sbjct: 271 GDQLALRSAEDIAYSVLRFFAK-GGTLVNYYMQYYGGTNFGRTGASYVLTGYYDEGPVDE 329
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNK 383
+ + PK+GHL++LH+ +K + L G + EA F+ E C AF+ N
Sbjct: 330 C-MPKAPKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNN 388
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA---------------KLDSVEQWE 428
+ + TV F Y +P S+SIL DCK V +NT KL WE
Sbjct: 389 NTGEDGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQKLAKSNAWE 448
Query: 429 EYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP--SDSESVLKVSSLGHV 486
Y E IP Y TS+R +EQ N TKD SDYL + P D V++V S H
Sbjct: 449 MYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLCFRLEADDLPFRGDIRPVVQVKSTSHA 508
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
L F+N F G+ G +K F E ++L G N+++LLS +G+ DSG L G+
Sbjct: 509 LMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEVKGGI 568
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKT 605
++ +IQG D WG++V L GE +I+T+ G V W ++T + +TWYK
Sbjct: 569 QDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKW--VPATTGRAVTWYKR 626
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG 665
FD P G DPV +++ SMGKG +VNG+ +GRYW S+ T G PSQ+ YHIPR FLKP
Sbjct: 627 YFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVGGVPSQAMYHIPRPFLKPKN 686
Query: 666 NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRP 725
NLLV+ EEE G P GI I TV +C +S+ + + +W + + R
Sbjct: 687 NLLVIFEEELGKPEGILIQTVRRDDICVFISEHNPAQIKTWDKDGGQIKLIAEDHSTRG- 745
Query: 726 KVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWT 785
++CP + I +++FAS+GNP G+C N+ G+CH+ N++ IV K CLGK+SC +PV
Sbjct: 746 --ILKCPPKKTIQEVVFASFGNPEGSCANFTAGTCHTPNAKDIVAKECLGKKSCVLPVLH 803
Query: 786 EKFYGD-PCPGIPKALLVDAQC 806
+ D CP L V +C
Sbjct: 804 TVYGADINCPTTTATLAVQVRC 825
>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
Length = 892
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/861 (42%), Positives = 516/861 (59%), Gaps = 97/861 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+LII G R++L S IHYPR+TP+MWP LIA++KEGG DV++T FWN HEP
Sbjct: 36 NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR D+V+F K V + GL++ +RIGP+ EW +GG P WL D+PGI FR+DN
Sbjct: 96 RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+RY IV++M + L++ QGGPIIL QIENEYG VE +F KG Y++WAA++
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESTFGPKGKLYMKWAAEM 215
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L GVPWVMC+Q DAP+ +I+ CN C + F PNS KP IWTENW ++ +G+
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYC-DGFT-PNSEKKPKIWTENWNGWFADWGE 273
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +EDIA+ +A F + GS NYYMY GGTNFGRTA YD APLDEYG
Sbjct: 274 RLPYRPSEDIAFAIARFFQR-GGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYG 332
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE--------- 375
LLRQPKWGHLK+LH+A+KLC +++ S + KL QEA +++G+S
Sbjct: 333 LLRQPKWGHLKDLHAAIKLCEPALVAA--DSPQYIKLGPKQEAHVYRGTSNNIGQYMSLN 390
Query: 376 ---CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL----------- 421
CAAF+ N D+ +ATV F + LPP S+SILPDC+ AFNTAK+
Sbjct: 391 EGICAAFIANIDEHESATVKFYGQEFTLPPWSVSILPDCRNTAFNTAKVGAQTSIKTVGS 450
Query: 422 DSV---------------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDY 460
DSV + W KE + + + + + +LE +N TKD SDY
Sbjct: 451 DSVSVGNNSLFLQVITKSKLESFSQSWMTLKEPLGVWGDKNFTSKGILEHLNVTKDQSDY 510
Query: 461 LWYNFRFK--------HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEK 512
LWY R + +D + + S+ + F+NG+ GS GK + +
Sbjct: 511 LWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----IKVVQ 566
Query: 513 MVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVG 570
V L+ G N++ LLS VGL + GA+LE+ AG + + + G K + ++ W YQVG
Sbjct: 567 PVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWTYQVG 626
Query: 571 LLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAW 629
L GE L+++ + W+ + + +T +WYKT FDAP G+DPVA++ SMGKG+AW
Sbjct: 627 LRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGKGQAW 686
Query: 630 VNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTGNL 667
VNG +GRYW + + P G +Q+WYHIPRS+LK N+
Sbjct: 687 VNGHHVGRYW-TLVAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKTLNNV 745
Query: 668 LVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW-RSQNQRTLKTHKRIPGRRPK 726
LV+ EE + P ISI T S T+C VS+ H PP+ W S+ R L + + P+
Sbjct: 746 LVIFEEIDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRKLS----LMDKTPE 801
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
+ ++C G IS I FASYG+PNG+C+ ++ G CH++NS ++V +AC+G+ SC++ + +
Sbjct: 802 MHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVSQACIGRTSCSIGI-SN 860
Query: 787 KFYGDPCPGIPKALLVDAQCT 807
+GDPC + K+L V A+C+
Sbjct: 861 GVFGDPCRHVVKSLAVQAKCS 881
>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 796
Score = 691 bits (1783), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/807 (46%), Positives = 494/807 (61%), Gaps = 70/807 (8%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MWP LI K+K+GGLDV++T VFW++HE GQ+DF GR+DLVRF+K V GLYV LRIG
Sbjct: 1 MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
P++ EW YGG P WLH VPGI FR+DNE FK M+R+ +V+ MK A LYASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 120
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQC 239
LSQIENEYG ++ ++ G Y+RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C
Sbjct: 121 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 180
Query: 240 GETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMY 299
+ PNS KP +WTENW+ ++ +G R AED+A+ VA F + G++ NYYMY
Sbjct: 181 DQFT--PNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQR-GGTFQNYYMY 237
Query: 300 HGGTNFGR-TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVS 358
HGGTNFGR T ++ T Y AP+DEYG++RQPKWGHL+++H A+KLC +++
Sbjct: 238 HGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSY 297
Query: 359 MNFSKLQEAFIFQGS--SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAF 416
+ + EA ++Q + S CAAFL N D +++ TV F+ Y+LP S+SILPDCK V
Sbjct: 298 SSLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVL 357
Query: 417 NTAKLDS---------------------------VEQWEEYKEAIPTYDETSLRANFLLE 449
NTA+++S W E + E +L L+E
Sbjct: 358 NTAQINSQVTTSEMRSLGSSIQDTDDSLITPELATAGWSYAIEPVGITKENALTKPGLME 417
Query: 450 QMNTTKDASDYLWYNFRF--KHDP---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHS 504
Q+NTT DASD+LWY+ K D + S+S L V+SLGHVL +ING+ GSA G S
Sbjct: 418 QINTTADASDFLWYSTSIVVKGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSAS 477
Query: 505 DKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSF 563
+L+ V L+ G N + LLS VGL + GA+ + AG+ V + G + SS
Sbjct: 478 SSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSST 537
Query: 564 SWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISM 623
W YQ+GL GE L ++ + S T+QPL WYKT F AP G DPVAI+ M
Sbjct: 538 DWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGM 597
Query: 624 GKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFL 661
GKGEAWVNGQSIGRYW + L PQ G PSQ+ YH+PRSFL
Sbjct: 598 GKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFL 657
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIP 721
+P N LVL E+ G P IS T +++C HVS+ H + SW S Q+T +T
Sbjct: 658 QPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISP-QQTSQT----- 711
Query: 722 GRRPKVQIRCP-SGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
+ P +++ CP G+ IS I FAS+G P+G C NY G C SS + A+V++AC+G +C+
Sbjct: 712 -QGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCS 770
Query: 781 VPVWTEKFYGDPCPGIPKALLVDAQCT 807
VPV + F GDPC G+ K+L+V+A C+
Sbjct: 771 VPVSSNNF-GDPCSGVTKSLVVEAACS 796
>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
Length = 894
Score = 689 bits (1778), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/862 (43%), Positives = 510/862 (59%), Gaps = 97/862 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LII+G R++L S IHYPR+TP+MWP LIAK+KEGG+DV+QT FW+ HEP
Sbjct: 35 NVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPV 94
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR D+V+F V A GLY+ LRIGP++ EW +GG P WL D+PGI FR++N
Sbjct: 95 RGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 154
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
FK M+R+ +V++M+ L + QGGPII+ QIENEYG +E F +KG Y++WAA++
Sbjct: 155 LFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMLQIENEYGNIEGQFGQKGKEYIKWAAEM 214
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMCKQ DAP +I+ACNG C PNS +KP +WTE+W +Y +G
Sbjct: 215 ALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGY--KPNSYNKPTMWTEDWDGWYASWGG 272
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + GS+ NYYMY GGTNFGRT+ + +T Y AP+DEYG
Sbjct: 273 RLPHRPVEDLAFAVARFYQR-GGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 331
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE--------- 375
LL +PKWGHLK+LH+A+KLC +++ S N+ KL QEA +++ +S
Sbjct: 332 LLSEPKWGHLKDLHAAIKLCEPALVAAD--SPNYIKLGPKQEAHVYRMNSHTEGLNITSY 389
Query: 376 -----CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------- 423
C+AFL N D+ A+V F Y LPP S+SILPDC+ V +NTAK+ +
Sbjct: 390 GSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTV 449
Query: 424 -------------------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
+ W KE + + E + +LE +N TKD S
Sbjct: 450 EFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNFTVQGILEHLNVTKDQS 509
Query: 459 DYLWYNFRFKHDPSDSE--------SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
DYLW+ R D + + + S+ VL F+NG+ GS G +
Sbjct: 510 DYLWHITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQLTGSVIGHW----VKV 565
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQ 568
E+ V + G N++ LL+ VGL + GA+LE+ AG R + + G K DFS W YQ
Sbjct: 566 EQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTGFKNGDIDFSKLLWTYQ 625
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT--WYKTVFDAPTGSDPVAINLISMGKG 626
VGL GE L+I+T + W+ S P T WYKT FD+P G+DPVA++L SMGKG
Sbjct: 626 VGLKGEFLKIYTIEENEKASWAEL-SPDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKG 684
Query: 627 EAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPT 664
+AWVNG IGRYW + + P+ G P+Q+ YH+PRS+L+ +
Sbjct: 685 QAWVNGHHIGRYW-TLVAPEDGCPEICDYRGAYDSDKCSFNCGKPTQTLYHVPRSWLQSS 743
Query: 665 GNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRR 724
NLLV+LEE G P ISI S LC VS+SH PPV W N ++ +
Sbjct: 744 SNLLVILEETGGNPFDISIKLRSAGVLCAQVSESHYPPVQKWF--NPDSVDEKITVNDLT 801
Query: 725 PKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVW 784
P++ ++C G IS I FASYG P G+C+ +++G+CH++NS +IV K+CLGK SC+V +
Sbjct: 802 PEMHLQCQDGFTISSIEFASYGTPQGSCQKFSMGNCHATNSSSIVSKSCLGKNSCSVEIS 861
Query: 785 TEKFYGDPCPGIPKALLVDAQC 806
F GDPC G+ K L V+A+C
Sbjct: 862 NISFGGDPCRGVVKTLAVEARC 883
>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
Length = 805
Score = 689 bits (1777), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/821 (46%), Positives = 488/821 (59%), Gaps = 75/821 (9%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
NV+YD RSLI+NG R+IL SGS+HYPR+TP+MWP +I KAKEGGLDV++T VFW+ HEP
Sbjct: 18 QNVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEP 77
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
PGQ+ F GR DLV+F+K VQ GL + LRIGP++ EW GG P WL D+P IVFR+DN
Sbjct: 78 SPGQYYFEGRYDLVKFVKLVQQAGLLMNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDN 137
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK +M+ + T IVNMMK L+ASQGGPIIL+Q+ENEYG V+ + E G Y+ WAA+
Sbjct: 138 EPFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAE 197
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A TGVPW+MC Q P+ +I+ CNG C P KP +WTE++T ++ YG
Sbjct: 198 MAQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDG--WNPILYKKPTMWTESYTGWFTYYG 255
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYM--YHGGTNFGRTASA-YVLTGYYDQAPLD 324
R EDIA+ VA F + GS+ NYYM Y GGTNFGRT+ YV + Y APLD
Sbjct: 256 WPIPHRPVEDIAFAVARFFER-GGSFHNYYMVWYFGGTNFGRTSGGPYVASSYDYDAPLD 314
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKD 384
EYG+ PKWGHLK+LH +KL + +LS QEA ++ + C AFL N D
Sbjct: 315 EYGMQHLPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVYSYGNGCVAFLANVD 374
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKE 432
N+ V F N+ Y LP S+SIL DCKTVAFN+AK+ S W + E
Sbjct: 375 SMNDTVVEFRNVSYSLPAWSVSILLDCKTVAFNSAKVKSQSAVVSMSPSKSTLSWTSFDE 434
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFIN 492
+ +S +A LLEQM TTKD SDYLWY + + S + L + S+ V+H F+N
Sbjct: 435 PVGI-SGSSFKAKQLLEQMETTKDTSDYLWYTTSVEATGTGS-TWLSIESMRDVVHIFVN 492
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ 552
G+F S H S ++E + L G+N ++LLS VGL + GA++E AGL I
Sbjct: 493 GQFQSSWHTSKSVLYNSVEAPITLAPGSNTIALLSATVGLQNFGAFIETWSAGLSGSLIL 552
Query: 553 GAKELKD--FSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAP 610
D S W YQVGL GE L++FT GSR V WS ST +PLTWY T FDAP
Sbjct: 553 KGLPGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWS--AVSTEKPLTWYMTEFDAP 610
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------------------LTPQGT 648
G DPVA++L SMGKG+AWVNGQSIGRYW ++ LT G
Sbjct: 611 PGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGSYDQNKCLTGCGQ 670
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRS 708
SQ WYH+PRS++KP GNLLVL EE G P I T S +C V +SH V W
Sbjct: 671 SSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICARVYESHPASVKLW-- 728
Query: 709 QNQRTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFASYGNPNGNCENYAIGSCHSSNSRA 767
CP ++ IS+I FAS GNP G+C ++ GSCH+++
Sbjct: 729 ----------------------CPGEKQVISQIRFASLGNPEGSCGSFKEGSCHTNDLSN 766
Query: 768 IVEKACLGKRSCTVPVWTEKFYGDPCPGI-PKALLVDAQCT 807
VEKAC+G+RSC++ F CPG+ K L V+A C+
Sbjct: 767 TVEKACVGQRSCSL---APDFTISACPGVREKFLAVEALCS 804
>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
Length = 874
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/874 (43%), Positives = 515/874 (58%), Gaps = 118/874 (13%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG N++YD R++II G R+IL SG +HYPR++PQMWP LI AKEGGLD++ T VFW+
Sbjct: 17 GGSATNISYDHRAIIIGGQRRILISGCLHYPRASPQMWPALIRNAKEGGLDMIDTYVFWD 76
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP PG ++F GR DL+RF+K V GLYV LRIGP++ EW +GG P WL +PGI F
Sbjct: 77 GHEPSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQF 136
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+ N F+ M+ + IV+M+K+ +L+ASQGGP++ SQIENEYG V+ S+ G Y+
Sbjct: 137 RTHNRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGTNGKTYML 196
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WAA++A DL+TGVPW+MCKQ DAPD +IN CNG C PNS DKPA+WTENW+ +Y
Sbjct: 197 WAARMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGW--KPNSRDKPAMWTENWSGWY 254
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM------------------YHGGTNF 305
Q++G+ A R+ ED+A+ VA F + G NYYM Y GGTNF
Sbjct: 255 QLWGEAAPYRTVEDVAFAVARFFQR-GGVAQNYYMVRMLHDLEQHLLMPERCQYFGGTNF 313
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRT+ +T YD APLDE+G+LRQPKWGHLKELH+A+KLC + S + ++
Sbjct: 314 GRTSGGPFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPLYYTLGRM 373
Query: 365 QE---AFIF-QGSSE---------CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDC 411
QE A ++ GS E CAAFL N D ++A+V F +Y LPP S+SILPDC
Sbjct: 374 QEMVQAHVYSDGSLEANFSNLATPCAAFLANIDT-SSASVKFGGNVYNLPPWSVSILPDC 432
Query: 412 KTVAFNTAKLDS---------------------------VEQ--WEEYKEAIPTYDETSL 442
+ V FNTA++ + VEQ WE ++E + +
Sbjct: 433 RNVVFNTAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKI 492
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFK---HDPSDSESVLKVSSLGHVLHAFINGEFVGSA 499
A+ LLEQ++TT D++DYLWY+ RF+ + + VL ++S+ ++H F+NGEF GS
Sbjct: 493 LAHALLEQISTTNDSTDYLWYSTRFEISDQELKGGDPVLVITSMRDMVHIFVNGEFAGST 552
Query: 500 HGKHSDKSFT-LEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQG-AKE 556
S + +++ +HL G N++++LS VGL + GA+LE AG+ +V IQG +
Sbjct: 553 STLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSVWIQGLSTG 612
Query: 557 LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDP 615
++ +S W +QVGL GE + WS S QPL WYK F+ P G DP
Sbjct: 613 TRNLTSALWLHQVGLNGEH---------DAITWSSTTSLPFFQPLVWYKANFNIPDGDDP 663
Query: 616 VAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSW 653
VAI+L SMGKG+AWVNG S+GR+W + P G PSQ W
Sbjct: 664 VAIHLGSMGKGQAWVNGHSLGRFWPAITAPSTGCSDRCDYRGTYYSSKCLSGCGLPSQEW 723
Query: 654 YHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRT 713
YH+PR +L N LVLLEE G G+S + V +C VS+ LPPV + S
Sbjct: 724 YHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQFSS----- 778
Query: 714 LKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKAC 773
P++ + C G+ IS I FAS+GNP G C + GSCH+ S IVEKAC
Sbjct: 779 ----------LPELGLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKAC 828
Query: 774 LGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+G++SC+ ++ + F DPCPG K L V+A CT
Sbjct: 829 IGRQSCSFEIFWKNFGTDPCPGKAKTLAVEAACT 862
>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
Length = 831
Score = 687 bits (1772), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/814 (44%), Positives = 492/814 (60%), Gaps = 79/814 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLII+G R++L+SGSIHYPRSTP+MWP +I +AK+GGL+ +QT VFWN+HEPQ
Sbjct: 54 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 113
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSGR DLV+FIK +Q G+YV LR+GPFI+ EW +G + + H
Sbjct: 114 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGYITRYDHK------------ 161
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
N+ A R +IENEY V+ ++ + G Y++WA+ L
Sbjct: 162 -------------NIAGAYR------------KIENEYSAVQRAYKQDGLNYIKWASNLV 196
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
++ G+PWVMCKQ+DAPDP+INACNGR CG+TF GPN +KP++WTENWT+ ++V+GD
Sbjct: 197 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 256
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
RS EDIAY VA F +K G++VNYYMYHGGTNFGRT++ YV T YYD APLDEYGL
Sbjct: 257 PTQRSVEDIAYSVARFFSK-NGTHVNYYMYHGGTNFGRTSAHYVTTRYYDDAPLDEYGLE 315
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRN 387
++PK+GHLK LH+A+ LC KP+L G + K E ++ G+ CAAFL N +
Sbjct: 316 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 375
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEY---KEAIPTYD------ 438
T+ F Y + P SISILPDCKTV +NTA++ S + K+A +D
Sbjct: 376 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRNFMKSKKANKKFDFKVFTE 435
Query: 439 --ETSLRANFLL--EQMNTTKDASDYLWYNFRFK----HDPSDS--ESVLKVSSLGHVLH 488
+ L N + E TKD +DY WY FK H P+ ++ ++++SLGH LH
Sbjct: 436 TLPSKLEGNSYIPVELYGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFVRIASLGHALH 495
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
A++NGE++GS HG H +KSF +K V L G N++ +L V+ G PDSG+Y+E R G R
Sbjct: 496 AWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGSYMEHRYTGPRG 555
Query: 549 VSIQG--AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWY--- 603
+SI G + L S WG ++G+ GEKL I T+ G + V W ++ + LTWY
Sbjct: 556 ISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKF-TGKAPGLTWYQKF 614
Query: 604 -------KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
+T FDAP I + MGKG WVNG+ +GRYW SFL+P G P+Q YHI
Sbjct: 615 SKECETLQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSPLGQPTQIEYHI 674
Query: 657 PRSFLKPTGNLLVLLEEE-NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLK 715
PRSFLKP NLLV+ EEE N P + V+ T+C +V +++ P V W + +
Sbjct: 675 PRSFLKPKKNLLVIFEEEPNVKPELMDFAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQA 734
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLG 775
+ ++C +KI+ + FAS+GNP G C N+ +G+C++ S+ ++EK CLG
Sbjct: 735 ITDNVS---LTATLKCSGTKKIAAVEFASFGNPIGVCGNFTLGTCNAPVSKQVIEKHCLG 791
Query: 776 KRSCTVPVWTEKFY---GDPCPGIPKALLVDAQC 806
K C +PV F D C + K L V +C
Sbjct: 792 KAECVIPVNKSTFQQDKKDSCKNVVKMLAVQVKC 825
>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
Length = 919
Score = 686 bits (1771), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/859 (43%), Positives = 513/859 (59%), Gaps = 94/859 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+++I G R++L S +HYPR+TP+MWP LIAK KEGG DV++T VFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+ F R DLV+F K V A+GL++ LRIGP+ EW +GG P WL D+PGI FR+DNE
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + T IV +MK +LY+ QGGPIIL QIENEYG ++ ++ + G Y++WAA++
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TG+PWVMC+Q DAP+ +I+ CN C + F PNS +KP IWTE+W +Y +G
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYC-DGFK-PNSYNKPTIWTEDWDGWYADWGG 300
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED A+ VA F + GS NYYMY GGTNF RTA + YD AP+DEYG
Sbjct: 301 ALPHRPAEDSAFAVARFYQR-GGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYG 359
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQ-----------GS 373
+LRQPKWGHLK+LH+A+KLC +P L V S + KL QEA ++ G+
Sbjct: 360 ILRQPKWGHLKDLHTAIKLC-EPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGN 418
Query: 374 SE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------SVEQ 426
++ C+AFL N D+ A+V+ Y LPP S+SILPDC+ VAFNTA++ +VE
Sbjct: 419 AQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVES 478
Query: 427 --------------------------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDY 460
W KE I T+ + +LE +N TKD SDY
Sbjct: 479 GSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDY 538
Query: 461 LWYNFRFKHDPSD-----SESV---LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEK 512
LWY R +D S+ V L + + V F+NG+ GS G +L++
Sbjct: 539 LWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQ 594
Query: 513 MVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVG 570
+ L+ G N ++LLS +VGL + GA+LE+ AG R V++ G + D ++ W YQVG
Sbjct: 595 PIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVG 654
Query: 571 LLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWV 630
L GE I+ WSR + QP TWYKT+F P G+DPVAI+L SMGKG+AWV
Sbjct: 655 LKGEFSMIYAPEKQGCAGWSRMQKDSVQPFTWYKTMFSTPKGTDPVAIDLGSMGKGQAWV 714
Query: 631 NGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTGNLL 668
NG IGRYW S + P+ G P+Q+WYHIPR +LK + NLL
Sbjct: 715 NGHLIGRYW-SLVAPESGCSSSCYYPGAYNERKCQSNCGMPTQNWYHIPREWLKESDNLL 773
Query: 669 VLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQ 728
VL EE G P IS++ T+C +S+++ PP+ +W + + P+++
Sbjct: 774 VLFEETGGDPSLISLEAHYAKTVCSRISENYYPPLSAWSHLS----SGRASVNAATPELR 829
Query: 729 IRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKF 788
++C G IS+I FASYG P+G C N++ G+CH+S++ +V +AC+G C + V +
Sbjct: 830 LQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASSTLDLVTEACVGNTKCAISV-SNDV 888
Query: 789 YGDPCPGIPKALLVDAQCT 807
+GDPC G+ K L V+A+C+
Sbjct: 889 FGDPCRGVLKDLAVEAKCS 907
>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
Length = 874
Score = 683 bits (1763), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 379/874 (43%), Positives = 512/874 (58%), Gaps = 118/874 (13%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G N++YD R++II G R+IL SG IHYPR++PQMWP LI AKEGGLD++ T VFW+
Sbjct: 17 GASATNISYDHRAIIIGGQRRILISGCIHYPRASPQMWPALIRNAKEGGLDMIDTYVFWD 76
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP PG ++F GR DL+RF+K V GLYV LRIGP++ EW +GG P WL +PGI F
Sbjct: 77 GHEPSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQF 136
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+ N F+ M+ + IV+M+K+ +L+ASQGGP++ SQIENEYG V+ S+ G Y+
Sbjct: 137 RTHNRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGINGKTYML 196
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
WAA++A DL+TGVPW+MCKQ DAPD +IN CNG C PNS DKPA+WTENW+ +Y
Sbjct: 197 WAARMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGW--KPNSRDKPAMWTENWSGWY 254
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM------------------YHGGTNF 305
Q +G+ A R+ ED+A+ VA F + G NYYM Y GGTNF
Sbjct: 255 QSWGEAAPYRTVEDVAFAVARFFQR-GGVAQNYYMVRTLHDLEQRLLMPERCQYFGGTNF 313
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRT+ +T YD APLDE+G+LRQPKWGHLKELH+A+KLC + S V ++
Sbjct: 314 GRTSGGPFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPVYYTLGRM 373
Query: 365 QE---AFIF-QGSSE---------CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDC 411
QE A ++ GS E CAAFL N D ++A+V F +Y LPP S+SILPDC
Sbjct: 374 QEMVQAHVYSDGSLEANFSNLATPCAAFLANIDT-SSASVKFGGKVYNLPPWSVSILPDC 432
Query: 412 KTVAFNTAKLDS---------------------------VEQ--WEEYKEAIPTYDETSL 442
+ V FNTA++ + VEQ WE ++E + +
Sbjct: 433 RNVVFNTAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKI 492
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFK---HDPSDSESVLKVSSLGHVLHAFINGEFVGSA 499
A+ LLEQ++TT D++DY+WY+ RF+ + + VL ++S+ ++H F+NGEF GS
Sbjct: 493 LAHALLEQISTTNDSTDYMWYSTRFEILDQELKGGDPVLVITSMRDMVHIFVNGEFAGST 552
Query: 500 HGKHSDKSFT-LEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQG-AKE 556
S + +++ +HL G N++++LS VGL + GA+LE AG+ ++ IQG +
Sbjct: 553 STLKSGGLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSIWIQGLSTG 612
Query: 557 LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDP 615
++ +S W +QVGL GE + WS S QPL WYK F+ P G DP
Sbjct: 613 TRNLTSALWLHQVGLNGEH---------DAITWSSTTSLPFFQPLVWYKANFNIPDGDDP 663
Query: 616 VAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSW 653
VAI+L SMGKG+AWVNG S+GR+W P G PSQ W
Sbjct: 664 VAIHLGSMGKGQAWVNGHSLGRFWPVITAPSTGCSDRCDYRGTYYSSKCLSSCGLPSQEW 723
Query: 654 YHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRT 713
YH+PR +L N LVLLEE G G+S + V +C VS+ LPPV + S
Sbjct: 724 YHVPREWLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQFSS----- 778
Query: 714 LKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKAC 773
P++ + C G+ IS I FAS+GNP G C + GSCH+ S IVEKAC
Sbjct: 779 ----------LPELGLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKAC 828
Query: 774 LGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+G++SC+ ++ + F DPCPG K L V+A CT
Sbjct: 829 IGRQSCSFEIFWKNFGTDPCPGKAKTLAVEAACT 862
>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 903
Score = 683 bits (1762), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 370/862 (42%), Positives = 508/862 (58%), Gaps = 96/862 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LII+G R++L S IHYPR+TP+MWP LIAK+KEGG+DV+QT FW+ HEP
Sbjct: 35 NVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPV 94
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR D+V+F V A GLY+ LRIGP++ EW +GG P WL D+PGI FR++N
Sbjct: 95 RGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 154
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
FK M+R+ +V++M+ L + QGGPII+ QIENEYG +E F +KG Y++WAA++
Sbjct: 155 LFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMMQIENEYGNIEGQFGQKGKEYIKWAAEM 214
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMCKQ DAP +I+ACNG C PNS +KP +WTE+W +Y +G
Sbjct: 215 ALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGY--KPNSYNKPTLWTEDWDGWYASWGG 272
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA F + GS+ NYYMY GGTNFGRT+ + +T Y AP+DEYG
Sbjct: 273 RLPHRPVEDLAFAVARFYQR-GGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYG 331
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE--------- 375
LL +PKWGHLK+LH+A+KLC +++ S N+ KL QEA +++ +S
Sbjct: 332 LLSEPKWGHLKDLHAAIKLCEPALVAAD--SPNYIKLGPKQEAHVYRVNSHTEGLNITSY 389
Query: 376 -----CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------- 423
C+AFL N D+ A+V F Y LPP S+SILPDC+ V +NTAK+ +
Sbjct: 390 GSQISCSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTV 449
Query: 424 -------------------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
+ W KE + + E + +LE +N TKD S
Sbjct: 450 EFDLPLYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNFTVQGILEHLNVTKDQS 509
Query: 459 DYLWYNFRFKHDPSDSE--------SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
DYLW+ R D + + + S+ VL F+NG+ + H K +
Sbjct: 510 DYLWHITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQLTEGSVIGHWVK---V 566
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQ 568
E+ V + G N++ LL+ VGL + GA+LE+ AG R + + G K D S W YQ
Sbjct: 567 EQPVKFLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTGFKNGDIDLSKLLWTYQ 626
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT--WYKTVFDAPTGSDPVAINLISMGKG 626
VGL GE +I+T + W+ S P T WYKT FD+P G+DPVA++L SMGKG
Sbjct: 627 VGLKGEFFKIYTIEENEKAGWAEL-SPDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKG 685
Query: 627 EAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPT 664
+AWVNG IGRYW + + P+ G P+Q+ YH+PRS+L+ +
Sbjct: 686 QAWVNGHHIGRYW-TLVAPEDGCPEICDYRGAYNSDKCSFNCGKPTQTLYHVPRSWLQSS 744
Query: 665 GNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRR 724
NLLV+LEE G P ISI S LC VS+SH PPV W N ++ +
Sbjct: 745 SNLLVILEETGGNPFDISIKLRSAGVLCAQVSESHYPPVQKWF--NPDSVDEKITVNDLT 802
Query: 725 PKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVW 784
P++ ++C G IS I FASYG P G+C+ +++G+CH++NS +IV K+CLGK SC+V +
Sbjct: 803 PEMHLQCQDGFTISSIEFASYGTPQGSCQKFSMGNCHATNSSSIVSKSCLGKNSCSVEIS 862
Query: 785 TEKFYGDPCPGIPKALLVDAQC 806
F GDPC GI K L V+A+C
Sbjct: 863 NNSFGGDPCRGIVKTLAVEARC 884
>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
Length = 890
Score = 683 bits (1762), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/862 (42%), Positives = 502/862 (58%), Gaps = 97/862 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LII+G R++L S +HYPR++P+MWP +I K+KEGG DV+Q+ VFWN HEP
Sbjct: 32 NVSYDHRALIIDGKRRMLISAGVHYPRASPEMWPDIIEKSKEGGADVIQSYVFWNGHEPT 91
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR DLV+FI+ V + GLY+ LRIGP++ EW +GG P WL DVPGI FR+DN
Sbjct: 92 KGQYNFDGRYDLVKFIRLVGSSGLYLHLRIGPYVCAEWNFGGFPLWLRDVPGIEFRTDNA 151
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ IV++++ +L+ QGGP+I+ Q+ENEYG +E S+ ++G Y++W +
Sbjct: 152 PFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVENEYGNIESSYGKRGQEYIKWVGNM 211
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L VPWVMC+Q DAP +IN+CNG C A NSP KP WTENW ++ +G+
Sbjct: 212 ALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKA--NSPSKPIFWTENWNGWFTSWGE 269
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
+ R ED+A+ VA F + +GS+ NYYMY GGTNFGRTA + +T Y +P+DEYG
Sbjct: 270 RSPHRPVEDLAFSVARFFQR-EGSFQNYYMYFGGTNFGRTAGGPFYITSYDYDSPIDEYG 328
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE--------- 375
L+R+PKWGHLK+LH+A+KLC ++S S + KL QEA ++ S+
Sbjct: 329 LIREPKWGHLKDLHTALKLCEPALVSAD--SPQYIKLGPKQEAHVYHMKSQTDDLTLSKL 386
Query: 376 -----CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK---------- 420
C+AFL N D+R V F+ Y LPP S+SILPDC+ V FNTAK
Sbjct: 387 GTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDCQNVVFNTAKVAAQTSIKIL 446
Query: 421 -------------LDSVEQ---------WEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
L + +Q W KE I + + + +LE +N TKD S
Sbjct: 447 ELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSDQNFTVKGILEHLNVTKDRS 506
Query: 459 DYLWYNFRFKHDPSDSE--------SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
DYLWY R D + + S+ V F+NG+ GSA G+
Sbjct: 507 DYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFVNGKLTGSAIGQW----VKF 562
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQ 568
+ V + G N++ LLS +GL +SGA++E+ AG+R + + G K D S W YQ
Sbjct: 563 VQPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLTGFKNGDIDLSKSLWTYQ 622
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGE 627
VGL GE L ++ + W+ + TWYK F +P G+DPVAINL SMGKG+
Sbjct: 623 VGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFSSPDGTDPVAINLGSMGKGQ 682
Query: 628 AWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTG 665
AWVNG IGRYW S ++P+ G P+QSWYHIPRS+LK +
Sbjct: 683 AWVNGHHIGRYW-SVVSPKDGCPRKCDYRGAYNSGKCATNCGRPTQSWYHIPRSWLKESS 741
Query: 666 NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGR-R 724
NLLVL EE G P I + S +CG VS+SH P S R + + + + R
Sbjct: 742 NLLVLFEETGGNPLEIVVKLYSTGVICGQVSESHYP---SLRKLSNDYISDGETLSNRAN 798
Query: 725 PKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVW 784
P++ + C G IS + FASYG P G+C ++ G CH++NS ++V +ACLGK SCTV +
Sbjct: 799 PEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLSVVSQACLGKNSCTVEIS 858
Query: 785 TEKFYGDPCPGIPKALLVDAQC 806
F GDPC I K L V+A+C
Sbjct: 859 NSAFGGDPCHSIVKTLAVEARC 880
>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
Length = 923
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/857 (43%), Positives = 511/857 (59%), Gaps = 91/857 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+LI+ G R++L S +HYPR+TP+MWP LIAKAKEGG+DV++T +FWN HEP
Sbjct: 68 NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKAKEGGVDVIETYIFWNGHEPA 127
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+ F GR D+VRF K V A+GL++ LRIGP+ EW +GG P WL D+PGI FR+DNE
Sbjct: 128 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 187
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+K M+ + T IV++MK +LY+ QGGPIIL QIENEYG ++ + + G Y++WAA++
Sbjct: 188 PYKAEMQNFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 247
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPWVMC+Q DAP+ +++ CN C + F PNS +KP IWTE+W +Y +G+
Sbjct: 248 ALALDTGVPWVMCRQTDAPEQILDTCNAFYC-DGFK-PNSYNKPTIWTEDWDGWYADWGE 305
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R A+D A+ VA F + GS+ NYYMY GGTNF RTA + YD AP+DEYG
Sbjct: 306 ALPHRPAQDSAFAVARFYQR-GGSFQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYG 364
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIF-----------QGS 373
+LRQPKWGHLK+LH+A+KLC +P L+ V S + KL QEA ++ G+
Sbjct: 365 ILRQPKWGHLKDLHAAIKLC-EPALTAVDGSPRYIKLGPMQEAHVYSSENVHTNGSISGN 423
Query: 374 SE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------VEQ 426
++ C+AFL N D+ A+V+ Y LPP S+SILPDC+TVAFNTA++ + VE
Sbjct: 424 AQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVES 483
Query: 427 ------------------------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLW 462
W KE + + E A +LE +N TKD SDYL
Sbjct: 484 GSPSYSSRHKPRILSLGGPYLSSTWWASKEPVGIWSEDIFAAQGILEHLNVTKDISDYLS 543
Query: 463 YNFRFKHDPSD-----SESV---LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV 514
Y R D SE + L + + V+ F+NG+ GS G +L + +
Sbjct: 544 YTTRVNISDEDVLYWNSEGLLPSLTIDQIRDVVRIFVNGKLAGSQVGHW----VSLNQPL 599
Query: 515 HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLL 572
L+ G N ++LLS +VGL + GA+LE+ AG R V + G D ++ W YQ+GL
Sbjct: 600 QLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLK 659
Query: 573 GEKLQIFTDYGSRIVPWSRY-GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVN 631
GE +I++ WS T P TW+KT FDAP G+ PVAI+L SMGKG+AWVN
Sbjct: 660 GEFSRIYSPEKQGSAGWSSMQNDDTLSPFTWFKTTFDAPEGNGPVAIDLGSMGKGQAWVN 719
Query: 632 GQSIGRYWVSFLTPQGTPS---------------------QSWYHIPRSFLKPTGNLLVL 670
G IGRYW G PS QSWYHIPR +L+ + NLLVL
Sbjct: 720 GHLIGRYWSLVAPESGCPSSCNYAGNYGDSKCRSNCGIATQSWYHIPREWLQESDNLLVL 779
Query: 671 LEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIR 730
EE G P IS++ T+C +S+++ PP+ +W R + P+++++
Sbjct: 780 FEETGGDPSQISLEVHYTKTICSKISETYYPPLSAW----SRAANGRPSVNTVAPELRLQ 835
Query: 731 CPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYG 790
C G ISKI FASYG P G+C+N+++G+CH+S + +V +AC GK C + V T +G
Sbjct: 836 CDEGHVISKITFASYGTPTGDCQNFSVGNCHASTTLDLVAEACEGKNRCAISV-TNDVFG 894
Query: 791 DPCPGIPKALLVDAQCT 807
DPC + K L V A+C+
Sbjct: 895 DPCRKVVKDLAVVAECS 911
>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 918
Score = 681 bits (1757), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 381/878 (43%), Positives = 518/878 (58%), Gaps = 95/878 (10%)
Query: 11 GLLLTTIGGSDGGGGGGN-NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAK 69
G+L +GG DGG NVTYD R+LI+ G R++L S +HYPR+TP+MWP LIAK K
Sbjct: 43 GVLRQVVGGDDGGTFFEPFNVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKCK 102
Query: 70 EGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYG 129
EGG+D ++T VFWN HEP GQ+ F GR D+VRF K V A+GL++ LRIGP+ EW +G
Sbjct: 103 EGGVDAIETYVFWNGHEPAKGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFG 162
Query: 130 GLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM 189
G P WL DVPGI FR+DNEP+K M+ + T IV++MK +LY+ QGGPIIL QIENEYG
Sbjct: 163 GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 222
Query: 190 VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSP 249
++ + + G Y+ WAA++A+ L TGVPWVMC+Q DAP+ ++N CN C + F PNS
Sbjct: 223 IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYC-DGFK-PNSY 280
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
+KP IWTE+W +Y +G+ R A+D A+ VA F + GS NYYMY GGTNF RTA
Sbjct: 281 NKPTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQR-GGSLQNYYMYFGGTNFERTA 339
Query: 310 SAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---Q 365
+ YD AP+DEYG+LRQPKWGHLK+LH+A+KLC + L+ V S ++ KL Q
Sbjct: 340 GGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLC-ESALTAVDGSPHYVKLGPMQ 398
Query: 366 EAFIF-----------QGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKT 413
EA ++ G+S+ C+AFL N D+ A+V+ Y LPP S+SILPDC+T
Sbjct: 399 EAHVYSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCET 458
Query: 414 VAFNTAKLDS------VEQ-------------------------WEEYKEAIPTYDETSL 442
VAFNTA++ + VE W +KE + + E
Sbjct: 459 VAFNTARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIF 518
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFKHDPSDS--------ESVLKVSSLGHVLHAFINGE 494
A +LE +N TKD SDYL Y R D L + + V F+NG+
Sbjct: 519 TAQGILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGK 578
Query: 495 FVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQG 553
GS G +L + + L+ G N ++LLS +VGL + GA+LE+ AG R V + G
Sbjct: 579 LAGSKVGHW----VSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTG 634
Query: 554 AKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY-GSSTHQPLTWYKTVFDAPT 611
D ++ W YQ+GL GE +I++ WS T P TW+KT+FDAP
Sbjct: 635 LSNGDIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPE 694
Query: 612 GSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPS--------------------- 650
G+ PV I+L SMGKG+AWVNG IGRYW G PS
Sbjct: 695 GNGPVTIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCPSSCNYAGTYSDSKCRSNCGIAT 754
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW-RSQ 709
QSWYHIPR +L+ +GNLLVL EE G P IS++ T+C +S+++ PP+ +W R+
Sbjct: 755 QSWYHIPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAA 814
Query: 710 NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIV 769
N R + P+++++C G ISKI FASYG P G C+N+++G+CH+S + +V
Sbjct: 815 NGR-----PSVNTVAPELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLV 869
Query: 770 EKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+AC GK C + V T + +GDPC + K L V+A+C+
Sbjct: 870 VEACEGKNRCAISV-TNEVFGDPCRKVVKDLAVEAECS 906
>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
Length = 908
Score = 680 bits (1754), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/879 (42%), Positives = 518/879 (58%), Gaps = 96/879 (10%)
Query: 11 GLLLTTIG-GSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAK 69
G L +G G+DG NV+YD R++ + G R++L S +HYPR+TP+MWP +IAK K
Sbjct: 32 GQLREVVGKGTDGLFFEPFNVSYDHRAVRVGGERRMLVSAGVHYPRATPEMWPSIIAKCK 91
Query: 70 EGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYG 129
EGG DV++T +FWN HEP GQ+ F R DLVRFIK V A+GL++ LRIGP+ EW +G
Sbjct: 92 EGGADVIETYIFWNGHEPAKGQYYFEERFDLVRFIKLVAAEGLFLFLRIGPYACAEWNFG 151
Query: 130 GLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM 189
G P WL D+PGI FR+DNEP+K M+ + T IV+MMK +LY+ QGGPIIL QIENEYG
Sbjct: 152 GFPVWLRDIPGIEFRTDNEPYKAEMQTFVTKIVDMMKDEKLYSWQGGPIILQQIENEYGN 211
Query: 190 VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSP 249
++ + + G Y++WAA++A+ L TG+PWVMC+Q DAP+ +++ CN C + F PNS
Sbjct: 212 IQGKYGQAGKRYMQWAAQMALGLDTGIPWVMCRQTDAPEQILDTCNAFYC-DGFK-PNSY 269
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
+KP IWTE+W +Y +G R AED A+ VA F + GS NYYMY GGTNF RTA
Sbjct: 270 NKPTIWTEDWDGWYADWGGPLPHRPAEDSAFAVARFYQR-GGSLQNYYMYFGGTNFARTA 328
Query: 310 SAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---Q 365
+ YD AP++EYG+LRQPKWGHLK+LH+A+KLC +P L V S + KL Q
Sbjct: 329 GGPLQITSYDYDAPINEYGMLRQPKWGHLKDLHTAIKLC-EPALIAVDGSPQYVKLGSMQ 387
Query: 366 EAFIF-------QGSSE-----CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKT 413
EA I+ GS+ C+AFL N D+ +V+ Y LPP S+SILPDC+
Sbjct: 388 EAHIYSSAKVHTNGSTAGNAQICSAFLANIDEHKYVSVWIFGKSYNLPPWSVSILPDCEN 447
Query: 414 VAFNTAKLDS--------------------------------VEQWEEYKEAIPTYDETS 441
VAFNTA++ + W KE I T+ + S
Sbjct: 448 VAFNTARVGAQTSVFTFESGSPSHSSRREPSVLLPGVRGSYLSSTWWTSKETIGTWGDGS 507
Query: 442 LRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESVLK---VSSLGHVLHAFING 493
+LE +N TKD SDYLWY D S+ VL + + V F+NG
Sbjct: 508 FATQGILEHLNVTKDISDYLWYTTSVNISDEDVAFWSSKGVLPSLIIDQIRDVARVFVNG 567
Query: 494 EFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQ 552
+ GS G +L++ + + G N ++LLS +VGL + GA+LE+ AG + V +
Sbjct: 568 KLAGSQVGHW----VSLKQPIQFVRGLNELTLLSEIVGLQNYGAFLEKDGAGFKGQVKLT 623
Query: 553 G-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ-PLTWYKTVFDAP 610
G + D ++ +W YQVGL GE I+T WS + Q P TWYKT+ DAP
Sbjct: 624 GLSNGDTDLTNSAWTYQVGLKGEFSMIYTPEKQECAEWSAMQTDNIQSPFTWYKTMVDAP 683
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------------GT 648
G+DPVAI+L SMGKG+AWVNG+ IGRYW S + P+ G
Sbjct: 684 EGTDPVAIDLGSMGKGQAWVNGRLIGRYW-SLVAPESGCPSSCNYPGAYSETKCQSNCGM 742
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRS 708
P+QSWYHIPR +L+ + NLLVL EE G P IS++ T+C +S+++ PP+ +W
Sbjct: 743 PTQSWYHIPREWLQESNNLLVLFEETGGDPSKISLEVHYTKTICSRISENYYPPLSAWSW 802
Query: 709 QNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAI 768
+ + + P++ +RC G +IS+I FASYG P+G C+N++ G CH++++
Sbjct: 803 LDTGRVS----VDSVAPELLLRCDDGYEISRITFASYGTPSGGCQNFSKGKCHAASTLDF 858
Query: 769 VEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
V +AC+GK C + V + +GDPC G+ K L V+A+C+
Sbjct: 859 VTEACVGKNKCAISV-SNDVFGDPCRGVLKDLAVEAECS 896
>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
Length = 859
Score = 679 bits (1752), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/819 (45%), Positives = 490/819 (59%), Gaps = 84/819 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LII G R++L S IHYPR+TP+MW LIAK+KEGG DVVQT VFWN HEP
Sbjct: 37 NVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKEGGADVVQTYVFWNGHEPV 96
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR DLV+F+K + + GLY+ LRIGP++ EW +GG P WL D+PGI FR+DNE
Sbjct: 97 KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNE 156
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T IV++M+ A+L+ QGGPII+ QIENEYG VE S+ +KG YV+WAA +
Sbjct: 157 PFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMCKQ DAP+ +I+ACNG C + F PNS KP +WTE+W +Y +G
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYC-DGFK-PNSRTKPVLWTEDWDGWYTKWGG 274
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AED+A+ VA F + GS+ NYYMY GGTNFGRT+ + +T Y APLDEYG
Sbjct: 275 SLPHRPAEDLAFAVARFYQR-GGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYG 333
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE-----CAAF 379
L +PKWGHLK+LH+A+KLC +++ + + KL QEA I+ G E CAAF
Sbjct: 334 LRSEPKWGHLKDLHAAIKLCEPALVAA--DAPQYRKLGSKQEAHIYHGDGETGGKVCAAF 391
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------------ 421
L N D+ +A V F+ Y LPP S+SILPDC+ VAFNTAK+
Sbjct: 392 LANIDEHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGS 451
Query: 422 ----------DSV----EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF 467
D+V + W KE I + E + LLE +N TKD SDYLW+ R
Sbjct: 452 MSILQKVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRI 511
Query: 468 KHDPSD--------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLING 519
D S + + S+ VL F+N + GS G H K+ + V I G
Sbjct: 512 SVSEDDISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVG-HWVKAV---QPVRFIQG 567
Query: 520 TNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQ 577
N++ LL+ VGL + GA+LE+ AG R + G K D S SW YQVGL GE +
Sbjct: 568 NNDLLLLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSKSSWTYQVGLKGEADK 627
Query: 578 IFTDYGSRIVPWSRYGSSTHQPL-TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
I+T + WS + + WYKT FD P G+DPV +NL SMG+G+AWVNGQ IG
Sbjct: 628 IYTVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLESMGRGQAWVNGQHIG 687
Query: 637 RYWVSF---------------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
RYW T G P+Q+ YH+PRS+LKP+ NLLVL EE
Sbjct: 688 RYWNIISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETG 747
Query: 676 GYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGR 735
G P IS+ TV+ LCG VS+SH PP+ W + + + I P+V + C G
Sbjct: 748 GNPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDY--INGTMSINSVAPEVHLHCEDGH 805
Query: 736 KISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACL 774
IS I FASYG P G+C+ ++IG CH+SNS +IV + L
Sbjct: 806 VISSIEFASYGTPRGSCDGFSIGKCHASNSLSIVSEVKL 844
>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
Length = 912
Score = 679 bits (1752), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/889 (42%), Positives = 518/889 (58%), Gaps = 103/889 (11%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
+C+F + + G++ NVTYD R+LII+GHR++L S IHYPR+TP+MWP LIA
Sbjct: 28 VCVF-VASIIVAGAEAAWFKPFNVTYDHRALIIDGHRRMLISAGIHYPRATPEMWPDLIA 86
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
KAKEGG+DV++T VFWN H+P GQ++F GR DLV+F K V + GLY LRIGP+ EW
Sbjct: 87 KAKEGGVDVIETYVFWNGHQPVKGQYNFEGRYDLVKFAKLVASNGLYFFLRIGPYACAEW 146
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ---- 182
+GG P WL D+PGI FR++N PFK MKR+ + +VN+M+ L++ QGGPIIL Q
Sbjct: 147 NFGGFPVWLRDIPGIEFRTNNAPFKEEMKRFVSKVVNLMREEMLFSWQGGPIILLQVRRE 206
Query: 183 --IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
IENEYG +E S+ +G YV+WAA +A+ L GVPWVMCKQ DAP +I+ CN C
Sbjct: 207 YGIENEYGNLESSYGNEGKEYVKWAASMALSLGAGVPWVMCKQPDAPYDIIDTCNAYYC- 265
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ F PNS +KP WTENW +Y +G+ R ED+A+ VA F + GS NYYMY
Sbjct: 266 DGFK-PNSRNKPIFWTENWDGWYTQWGERLPHRPVEDLAFAVARFFQR-GGSLQNYYMYF 323
Query: 301 GGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA + YD AP+DEYGLL +PKWGHLK+LH+A+KLC +++ S
Sbjct: 324 GGTNFGRTAGGPLQITSYDYDAPIDEYGLLNEPKWGHLKDLHAALKLCEPALVAA--DSP 381
Query: 360 NFSKL---QEAFIFQG--------------SSECAAFLVNKDKRNNATVYFSNLMYELPP 402
+ KL QEA ++Q S++C+AFL N D+R ATV F Y LPP
Sbjct: 382 TYIKLGSKQEAHVYQENVHREGLNLSISQISNKCSAFLANIDERKAATVTFRGQTYTLPP 441
Query: 403 LSISILPDCKTVAFNTAKLDS--------------------------------VEQWEEY 430
S+SILPDC++ FNTAK+ + + W
Sbjct: 442 WSVSILPDCRSAIFNTAKVGAQTSVKLVGSNLPLTSNLLLSQQSIDHNGISHISKSWMTT 501
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD--------SESVLKVSS 482
KE I + +S A + E +N TKD SDYLWY+ R D + L + S
Sbjct: 502 KEPINIWINSSFTAEGIWEHLNVTKDQSDYLWYSTRIYVSDGDILFWKENAAHPKLAIDS 561
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
+ +L F+NG+ +G+ G TL+ G N+++LL+ VGL + GA++E+
Sbjct: 562 VRDILRVFVNGQLIGNVVGHWVKAVQTLQ----FQPGYNDLTLLTQTVGLQNYGAFIEKD 617
Query: 543 VAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
AG+R + I G + D S W YQVGL GE L+ + + +
Sbjct: 618 GAGIRGTIKITGFENGHIDLSKPLWTYQVGLQGEFLKFYNEESENAGWVELTPDAIPSTF 677
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------ 642
TWYKT FD P G+DPVA++L SMGKG+AWVNG IGRYW
Sbjct: 678 TWYKTYFDVPGGNDPVALDLESMGKGQAWVNGHHIGRYWTRVSPKTGCQVCDYRGAYDSD 737
Query: 643 --LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHL 700
T G P+Q+ YH+PRS+LK + N LV+LEE G P GIS+ S + +C VS S+
Sbjct: 738 KCTTNCGKPTQTLYHVPRSWLKASNNFLVILEETGGNPLGISVKLHSASIVCAQVSQSYY 797
Query: 701 PP---VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
PP +++ Q+ + ++ I P++ +RC G IS I FAS+G P G+C++++
Sbjct: 798 PPMQKLLNASLLGQQEVSSNDMI----PEMNLRCRDGNIISSITFASFGTPGGSCQSFSR 853
Query: 758 GSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G+CH+ +S++IV KACLGKRSC++ + ++ F GDPC + K L V+A+C
Sbjct: 854 GNCHAPSSKSIVSKACLGKRSCSIKISSDVFGGDPCQDVVKTLSVEARC 902
>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
Length = 909
Score = 679 bits (1751), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/861 (42%), Positives = 504/861 (58%), Gaps = 94/861 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+LI+NG R+ L S IHYPR+TP+MWP LIAK+KEGG DV++T VFWN HEP
Sbjct: 46 NVSYDHRALILNGKRRFLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNGHEPV 105
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR DLV+F++ + GLY LRIGP+ EW +GG P WL D+PGI FR++N
Sbjct: 106 RGQYNFEGRYDLVKFVRLAASHGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 165
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ + +VN+M+ RL++ QGGPIIL QIENEYG +E+S+ + G Y++WAAK+
Sbjct: 166 PFKEEMKRFVSKVVNLMREERLFSWQGGPIILLQIENEYGNIENSYGKGGKEYMKWAAKM 225
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L GVPWVMC+Q DAP +I+ CN C + F PNS +KP +WTENW +Y +G+
Sbjct: 226 ALSLGAGVPWVMCRQQDAPYDIIDTCNAYYC-DGFK-PNSHNKPTMWTENWDGWYTQWGE 283
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+A+ VA F + GS+ NYYMY GGTNFGRTA + YD AP+DEYG
Sbjct: 284 RLPHRPVEDLAFAVARFFQR-GGSFQNYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYG 342
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQG------------ 372
LLR+PKWGHLK+LH+A+KLC +P L S + KL QEA ++Q
Sbjct: 343 LLREPKWGHLKDLHAALKLC-EPALVAT-DSPTYIKLGPKQEAHVYQANVHLEGLNLSMF 400
Query: 373 --SSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------- 423
SS C+AFL N D+ ATV F Y +PP S+S+LPDC+ FNTAK+ +
Sbjct: 401 ESSSICSAFLANIDEWKEATVTFRGQRYTIPPWSVSVLPDCRNTVFNTAKVRAQTSVKLV 460
Query: 424 -------------------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
+ W KE + + ++S + E +N TKD S
Sbjct: 461 ESYLPTVSNIFPAQQLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQS 520
Query: 459 DYLWYNFRFKHDPS--------DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
DYLWY+ R S D L + + +L FING+ +G+ G TL
Sbjct: 521 DYLWYSTRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLIGNVVGHWIKVVQTL 580
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQ 568
+ + G N+++LL+ VGL + GA+LE+ AG+R + I G + D S W YQ
Sbjct: 581 Q----FLPGYNDLTLLTQTVGLQNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQ 636
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
VGL GE L+ +++ + TWYKT FD P G DPVA++ SMGKG+A
Sbjct: 637 VGLQGEFLKFYSEENENSEWVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQA 696
Query: 629 WVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTGN 666
WVNGQ IGRYW ++P+ G P+Q+ YH+PRS+LK T N
Sbjct: 697 WVNGQHIGRYWTR-VSPKSGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNN 755
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
LLV+LEE G P IS+ S +C VS+S+ PP+ + N + P+
Sbjct: 756 LLVILEETGGNPFEISVKLHSSRIICAQVSESNYPPLQ--KLVNADLIGEEVSANNMIPE 813
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
+ + C G IS + FAS+G P G+C+N++ G+CH+ +S +IV +AC GKRSC++ +
Sbjct: 814 LHLHCQQGHTISSVAFASFGTPGGSCQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDS 873
Query: 787 KFYGDPCPGIPKALLVDAQCT 807
F DPCPG+ K L V+A+CT
Sbjct: 874 AFGVDPCPGVVKTLSVEARCT 894
>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 782
Score = 678 bits (1750), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/719 (50%), Positives = 457/719 (63%), Gaps = 51/719 (7%)
Query: 12 LLLTTIGGS----DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAK 67
+ L ++ G+ G +VTYD +++IING R+IL SGSIHYPRSTPQMWP LI K
Sbjct: 62 VFLDSVSGTHHSFSGLASASRSVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQK 121
Query: 68 AKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWG 127
AK+GGLD+++T VFWN HEP PG++ F R DLVRFIK VQ GLYV LRIGP++ EW
Sbjct: 122 AKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWN 181
Query: 128 YGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEY 187
YGG P WL VPGI FR+DN PFK M+++ IV+MMK +L+ +QGGPIILSQIENEY
Sbjct: 182 YGGFPLWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEY 241
Query: 188 GMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN 247
G VE G Y +WAA++AV L+TGVPWVMCKQ+DAPDP+I+ CNG C E F PN
Sbjct: 242 GPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYC-ENFK-PN 299
Query: 248 SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR 307
KP IWTENW+ +Y +G R ED+A+ VA FI + GS VNYYMYHGGTNFGR
Sbjct: 300 QIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFI-QNGGSLVNYYMYHGGTNFGR 358
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
T+ +V T Y AP+DEYGLLR+PKWGHL++LH A+KLC ++S S K QEA
Sbjct: 359 TSGLFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEA 418
Query: 368 FIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD---- 422
+F+ SS CAAFL N D V F N Y+LPP SISILPDCKTV FNT L
Sbjct: 419 RVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDCKTVTFNTGSLQIGVK 478
Query: 423 ---------SVEQWEEYKEA-IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS 472
S W YKE Y + + + L+EQ++ T D +DYLWY + D +
Sbjct: 479 SYEAKMTPISSFWWLSYKEEPASAYAQDTTTKDGLVEQVSVTWDTTDYLWYILSIRIDST 538
Query: 473 D------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLL 526
+ +L V+S GH+LH FING+ GS +G D T K V+L G N +S+L
Sbjct: 539 EGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSML 598
Query: 527 SVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGS 584
SV VGLP+ G + + AG L V+++G E +D S + W Y+VGL GE L +++ GS
Sbjct: 599 SVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGS 658
Query: 585 RIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY------ 638
V W + GS QPLTWYKT F+ P G++P+A+++ SM KG+ WVNG+SIGRY
Sbjct: 659 NSVQWMK-GSFQKQPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIA 717
Query: 639 --------WVSFLTPQ------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
+ F T + G PSQ WYHIPR +L P GNLL++LEE G P GIS+
Sbjct: 718 RGKCNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISL 776
>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 836
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 371/832 (44%), Positives = 500/832 (60%), Gaps = 77/832 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R+L ++G R++L SGSIHYPRSTP MWP LIAKAKEGGLDV+QT VFWN HEP
Sbjct: 28 VSYDHRALKLDGQRRMLVSGSIHYPRSTPLMWPGLIAKAKEGGLDVIQTYVFWNGHEPTR 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++++GR +L +FI+ V G+YV LRIGP++ EW GG P WL +PGI FR+DNEP
Sbjct: 88 GVYNYAGRYNLPKFIRLVYEAGMYVNLRIGPYVCAEWNSGGFPAWLRFIPGIEFRTDNEP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK +R+ +V +K +L+A QGGPII++QIENEYG ++ S+ E G Y+ W A +A
Sbjct: 148 FKNETQRFVNHLVRKLKREKLFAWQGGPIIMAQIENEYGNIDASYGEAGQRYLNWIANMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V T VPW+MC+Q +AP VIN CNG C PNS DKPA WTENWT ++Q +G
Sbjct: 208 VATNTSVPWIMCQQPEAPQLVINTCNGFYCDG--WRPNSEDKPAFWTENWTGWFQSWGGG 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
A R +DIA+ VA F K GS++NYYMYHGGTNF RT V T Y AP+DEY +
Sbjct: 266 APTRPVQDIAFSVARFFEK-GGSFMNYYMYHGGTNFERTGVESVTTSYDYDAPIDEYD-V 323
Query: 330 RQPKWGHLKELHSAVKLCLKPM--LSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
RQPKWGHLK+LH+A+KLC + + V ++ QEA ++Q SS CAAFL + D
Sbjct: 324 RQPKWGHLKDLHAALKLCEPALVEVDTVPTGISLGPNQEAHVYQSSSGTCAAFLASWDT- 382
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------------VEQWEEYKEAI 434
N++ V F Y+LP S+SILPDCK+V FNTAK+ + V W Y E +
Sbjct: 383 NDSLVTFQGQPYDLPAWSVSILPDCKSVVFNTAKVGAQSVIMTMQGAVPVTNWVSYHEPL 442
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESVLKVSSLGHVLHA 489
+ N LLEQ+ TTKD +DYLWY + SD +++ L +SSL H
Sbjct: 443 GPWGSV-FSTNGLLEQIATTKDTTDYLWYMTNVQVAESDVRNISAQATLVMSSLRDAAHT 501
Query: 490 FINGEFVGSAHGK--HSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
F+NG + G++H + H+ + +L G+NN+++LS+ +GL G +LE AG
Sbjct: 502 FVNGFYTGTSHQQFMHARQPISLRP------GSNNITVLSMTMGLQGYGPFLENEKAG-- 553
Query: 548 NVSIQGAKELKDFSS-------FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQP- 599
IQ ++D S +W YQVGL GE Q+F GS W+ + Q
Sbjct: 554 ---IQYGVRIEDLPSGTIELGGSTWTYQVGLQGESKQLFEVNGSLTAEWNTISEVSDQNF 610
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------------- 642
L W KT FD P G+ +A++L SMGKG WVNG ++GRYW SF
Sbjct: 611 LFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGVNLGRYWSSFTAQRDGCDASCDYRGSY 670
Query: 643 -----LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
LT PSQ+WYHIPR +L P N +VL EE+ G P ISI T +C H+S
Sbjct: 671 TQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLFEEKGGNPKDISIATRMPQQICSHISQ 730
Query: 698 SHLPP--VISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
SH P + SW ++ T T R P + + C G++IS+I FASYG P+G+CE +
Sbjct: 731 SHPFPFSLTSWTKRDNLT-STLLRAP-----LTLECAEGQQISRICFASYGTPSGDCEGF 784
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ SCH++ S ++ KAC+G++ C+VP+ + F DPCPG+ K+L A+C+
Sbjct: 785 VLSSCHANTSYDVLTKACVGRQKCSVPIVSSIFGDDPCPGLSKSLAATAECS 836
>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
Length = 730
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/730 (49%), Positives = 463/730 (63%), Gaps = 64/730 (8%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
+G LCLF +T +VTYD ++++ING R+IL SGSIHYPRSTPQM
Sbjct: 14 IGLVLFLCLFVFSVTA------------SVTYDHKAIVINGQRRILISGSIHYPRSTPQM 61
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI KAK+GG+DV+QT VFWN HEP PG + F R DLV+F+K VQ GLYV LRIGP
Sbjct: 62 WPDLIQKAKDGGVDVIQTYVFWNGHEPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGP 121
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
++ EW +GG P WL VPG+ FR+DNEPFK M+++ IV+MMKA L+ SQGGPII+
Sbjct: 122 YVCAEWNFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTAKIVSMMKAENLFESQGGPIIM 181
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG VE G Y +W +++A+ L TGVPW+MCKQ+DAPDP+I+ CNG C
Sbjct: 182 SQIENEYGPVEWEIGAPGKAYTKWFSQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYC- 240
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
E F PN KP +WTENW+ +Y +G R A+D+A+ VA FI + +GSYVNYYMYH
Sbjct: 241 ENFT-PNKNYKPKMWTENWSGWYTDFGSAVPYRPAQDVAFSVARFI-QNRGSYVNYYMYH 298
Query: 301 GGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRT++ + YD AP+DEYGLL +PKWGHL+ LH A+K C +P+L V ++
Sbjct: 299 GGTNFGRTSAGLFIATSYDYDAPIDEYGLLSEPKWGHLRNLHKAIKQC-EPILVSVDPTV 357
Query: 360 NF-SKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFN 417
++ K E +++ S+ CAAFL N D + A V F N Y+LPP SISILPDCKT FN
Sbjct: 358 SWPGKNLEVHVYKTSTGACAAFLANYDTTSPAKVTFGNGQYDLPPWSISILPDCKTAVFN 417
Query: 418 TAKLDSVE-------------QWEEYKEAIPTYD-ETSLRANFLLEQMNTTKDASDYLWY 463
TAK+ +V W+ Y EA + + S AN LLEQ+ T+D+SDYLWY
Sbjct: 418 TAKVGTVPSFHRKMTPVSSAFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWY 477
Query: 464 NFRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
P++ VL S GHVLH F+NG+F G+A+G + T V L
Sbjct: 478 MTDVNISPNEGFIKNGQYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLR 537
Query: 518 NGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEK 575
G N +SLLSV VGL + G + E V L V+++G E +D S W Y++GL GE
Sbjct: 538 VGNNKISLLSVAVGLSNVGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGET 597
Query: 576 LQIFTDYGSRIVPWSRYGSS--THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQ 633
L + T GS V W++ GSS QPLTWYK FDAP G+DP+A+++ SMGKGE WVNG+
Sbjct: 598 LNLHTLIGSSSVQWTK-GSSLVKKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGE 656
Query: 634 SIGRYWVSFL--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEE 673
SIGR+W +++ T G P+Q WYHIPRS++ P GN LV+LEE
Sbjct: 657 SIGRHWPAYIARGSCGGCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEE 716
Query: 674 ENGYPPGISI 683
G P GIS+
Sbjct: 717 WGGDPSGISL 726
>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 716
Score = 674 bits (1738), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/720 (49%), Positives = 457/720 (63%), Gaps = 51/720 (7%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+ + LF LLT +G + G VTYD +++IIN R+IL SGSIHYPRSTPQMWP L
Sbjct: 3 KTVLLFLSLLTWVGSTIGA------VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDL 56
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I KAK+GGLD+++T VFWN HEP G++ F R DLV FIK VQ GLYV LRIGP++
Sbjct: 57 IQKAKDGGLDIIETYVFWNGHEPSEGKYYFEERYDLVGFIKLVQKAGLYVHLRIGPYVCA 116
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW YGG P WL VPGI FR+DNEPFK M+++ T IV+MMK +LY +QGGPIILSQIE
Sbjct: 117 EWNYGGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIE 176
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG VE G Y +W A++AVDL+TGVPWVMCKQ+DAPDP+I+ CNG C E F
Sbjct: 177 NEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYC-ENFK 235
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
PN KP IWTENW+ +Y +G R ED+A+ VA FI + GS VNYY+YHGGTN
Sbjct: 236 -PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFI-QNNGSLVNYYVYHGGTN 293
Query: 305 FGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
FGRT+ ++ T Y AP+DEYGL+R+PKWGHL++LH A+K C ++S K
Sbjct: 294 FGRTSGLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKSCEPALVSADPTITWLGKN 353
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-- 422
QEA +F+ SS CAAFL N D + V F N Y+LPP SISILPDC TV FNTA++
Sbjct: 354 QEARVFKSSSACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCXTVTFNTAQVGVK 413
Query: 423 ---------SVEQWEEYKE--AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP 471
S W YKE A +T+ +A L+EQ++ T D +DYLWY D
Sbjct: 414 SYQAKMMPISSFGWLSYKEEPASAYAKDTTTKAG-LVEQVSITWDTTDYLWYMQDISIDS 472
Query: 472 SD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSL 525
++ +L V+S GH+LH FING+ GS +G D + T K V L G N +S+
Sbjct: 473 TEGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPAITFSKNVDLKQGVNKLSM 532
Query: 526 LSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LSV VGLP+ G + + AG L V+++G E +D S + W Y+VGL GE L +++D G
Sbjct: 533 LSVTVGLPNVGLHFDTWNAGVLGPVTLEGLNEGTRDMSKYKWSYKVGLSGESLNLYSDKG 592
Query: 584 SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
S V W++ + QPLTWYKT F P G++P+ +++ SM KG+ W+NGQSIGRY+ ++
Sbjct: 593 SNSVQWTKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWINGQSIGRYFPGYI 652
Query: 644 TPQ--------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYHIPR +L P+ NLLV+ EE G P GIS+
Sbjct: 653 ANGKCDKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEEIGGSPDGISL 712
>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
Length = 767
Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 361/832 (43%), Positives = 487/832 (58%), Gaps = 100/832 (12%)
Query: 2 GQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
GQ + + LL++ + G G VTYDGRSLI+NG R++LFSGSIHYPRSTP
Sbjct: 5 GQALIAAVLSLLVS-YAAAHGIAKGAKTVTYDGRSLIVNGRRELLFSGSIHYPRSTP--- 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
+F+F G DLV+FIK + GLY LRIGPF
Sbjct: 61 -----------------------------EFNFEGNYDLVKFIKLIGDYGLYATLRIGPF 91
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILS 181
IE EW +GG P+WL +VP I+FRS NEPFK+HM++Y+ MI+ MMK A+L+A QGGPIIL+
Sbjct: 92 IEAEWNHGGFPYWLREVPDIIFRSYNEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILA 151
Query: 182 QIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGE 241
QIENEY ++ ++ E G YV+WA K+AV L GVPW+MCKQ DAPDPVIN CNGR CG+
Sbjct: 152 QIENEYNSIQLAYKELGVQYVQWAGKMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGD 211
Query: 242 TFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHG 301
TF GPN P+KP++WTENWT+ Y+V+GD R+AED+A+ VA FI+K G+ NYYMYHG
Sbjct: 212 TFTGPNRPNKPSLWTENWTAQYRVFGDPPSQRAAEDLAFSVARFISK-NGTLANYYMYHG 270
Query: 302 GTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
GTNFGRT S++V T YYD+APLDEYGL R+PKWGHLK+LHSA++LC K + +G
Sbjct: 271 GTNFGRTGSSFVTTRYYDEAPLDEYGLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKL 330
Query: 362 SKLQEAFIFQ--GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
K +E ++ G+ CAAFL N R AT+ F Y LPP SISILPDCKTV +NT
Sbjct: 331 GKDKEVRFYEKPGTHICAAFLTNNHSREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQ 390
Query: 420 KLDSVE---------------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY- 463
++ + +WE +E IP + + +E KD SDY W+
Sbjct: 391 RVVAQHNARNFVKSKIANKNLKWEMSQEPIPVMTDMKILTKSPMELYXFLKDRSDYAWFV 450
Query: 464 ------NFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
N+ D VL++S+LGH + AF+NG F+GSAHG + +K+F K V
Sbjct: 451 TSIELSNYDLPMK-KDIIPVLQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKF- 508
Query: 518 NGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKL 576
G N + +V DSG G+ +V I G D ++ WG QVG+ GE +
Sbjct: 509 QGRNKLHCPAVY----DSG------TTGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHV 558
Query: 577 QIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
+ +T GS V W+ +TWYKT FD P G+DPV + + SM KG NG
Sbjct: 559 KAYTQGGSHRVQWTA-AKGKGPAMTWYKTYFDMPEGNDPVILRMTSMAKG----NGLE-- 611
Query: 637 RYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
YH+PR++LKP+ NLLV+ EE G P I + V+ T+C V+
Sbjct: 612 -----------------YHVPRAWLKPSDNLLVIFEETGGNPEEIEXELVNRDTICSIVT 654
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYA 756
+ H P V SW+ + + + +PK ++CP+ + I K+ FAS+GNP G C ++
Sbjct: 655 EYHPPHVKSWQRHDSKIRAVVDEV---KPKGHLKCPNYKVIVKVDFASFGNPLGACGDFE 711
Query: 757 IGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGD--PCPGIPKALLVDAQC 806
+G+C + NS+ +VE+ C GK +C +P+ F G+ C I K L V +C
Sbjct: 712 MGNCTAPNSKKVVEQHCXGKTTCEIPMEAGIFXGNSGACSDITKTLAVQVRC 763
>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
Length = 803
Score = 671 bits (1730), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/812 (43%), Positives = 483/812 (59%), Gaps = 65/812 (8%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G N+TYD RSLII+G RK+L S +IHYPRS P MWP L+ AKEGG+DV++T VFWN HE
Sbjct: 26 GGNITYDSRSLIIDGQRKLLISAAIHYPRSVPGMWPELVQTAKEGGVDVIETYVFWNGHE 85
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P P + F R DLV+F+K VQ G+Y+ LRIGPF+ EW +GG+P WLH VPG VFR+D
Sbjct: 86 PSPSNYYFEKRYDLVKFVKIVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTD 145
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N FK+HM+++ T IVN+MK +L+ASQGGPIIL+Q+ENEYG E ++ E G Y WAA
Sbjct: 146 NYNFKYHMQKFMTYIVNLMKKEKLFASQGGPIILAQVENEYGFYESAYGEGGKRYAMWAA 205
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++AV GVPW+MC+Q DAP+ VIN CN C + P PDKP IWTENW ++Q +
Sbjct: 206 QMAVSQNIGVPWIMCQQFDAPNSVINTCNSFYCDQF--KPIFPDKPKIWTENWPGWFQTF 263
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDE 325
G R AEDIA+ VA F K GS NYYMYHGGTNFGRT+ +T YD +AP+DE
Sbjct: 264 GAPNPHRPAEDIAFSVARFFQK-GGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDE 322
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKD 384
YGL R PKW HLKELH A+KLC +L+ V V+++ QEA ++ + S CAAFL N D
Sbjct: 323 YGLARLPKWAHLKELHKAIKLCELTLLNSVPVNLSLGPSQEADVYAEESGACAAFLANMD 382
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE--------------- 425
++N+ TV F N+ Y LP S+SILPDCK V FNTAK++S VE
Sbjct: 383 EKNDKTVVFRNMSYHLPAWSVSILPDCKNVVFNTAKVNSQTSIVEMVPDDLRSSDKGTKA 442
Query: 426 -QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVL 478
+WE + E + + L N ++ +NTTKD +DYLWY ++ VL
Sbjct: 443 LKWETFVENAGIWGTSDLVKNGFVDHINTTKDTTDYLWYTTSIFVGENEEFLKKGGRPVL 502
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
+ S GH LHAF+N E G+A G + F +K V L+ G N+++LLS+ VGL ++G++
Sbjct: 503 LIESKGHALHAFVNQELQGTASGNGTHSPFKFKKPVSLVAGKNDIALLSMTVGLQNAGSF 562
Query: 539 LERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGSST 596
E AGL +V ++G D S+F+W Y++GL GEKL ++ V W +
Sbjct: 563 YEWVGAGLTSVKMKGFNNGTIDLSTFNWTYKIGLQGEKLGMYNGIAVETVNWVATSKPPK 622
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
QPLTWYK A + W+ + + + YH+
Sbjct: 623 DQPLTWYKRQIHARQMLN------------------------WMWRINSEMILVWTRYHV 658
Query: 657 PRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRS-QNQRTLK 715
PRS+ KP+GN+LV+ EE+ G P I+ ++ +C V++ + P+ + S +N +
Sbjct: 659 PRSWFKPSGNILVIFEEKGGDPTKITFSRRKISGVCALVAEDY--PMANLESLENAGSGS 716
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLG 775
++ + V ++CP IS I FAS+G+P G C +Y+ G CH S ++VEK CL
Sbjct: 717 SN-----YKASVHLKCPKSSIISAIKFASFGSPAGACGSYSEGECHDPKSISVVEKVCLN 771
Query: 776 KRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
K C V V E F CPG K L V+A C+
Sbjct: 772 KNQCVVEVTEENFSKGLCPGKMKKLAVEAVCS 803
>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 725
Score = 670 bits (1728), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/706 (50%), Positives = 457/706 (64%), Gaps = 50/706 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +++IING R+IL SGSIHYPRS PQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 25 SVTYDHKAIIINGRRRILISGSIHYPRSIPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ++F R DLVRF+K V GLYV LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 85 PGQYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNG 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV +MK +LY SQGGPIILSQIENEYG VE G Y +WAA++
Sbjct: 145 PFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPWVMCKQDDAPDPVI+ CNG C E F PN KP +WTE WT ++ +G
Sbjct: 205 ALGLNTGVPWVMCKQDDAPDPVIDTCNGFYC-ENFK-PNKVYKPKMWTEAWTGWFTEFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
A R ED+AY VA FI + GS++NYYMYHGGTNFGRTA ++ T Y AP+DEYG
Sbjct: 263 PAPYRPVEDMAYSVARFI-QNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF-SKLQEAFIFQG-SSECAAFLVNKDK 385
LLR+PKW HL++LH A+KLC +P L V ++++ QEA +F+ S CAAFL N D
Sbjct: 322 LLREPKWSHLRDLHKAIKLC-EPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDA 380
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------SVEQWEEYKEAI 434
++ATV F N Y+LPP S+SILPDCK+V FNTAK+ S W Y E
Sbjct: 381 SSSATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMTPVSSFSWLSYNEET 440
Query: 435 PT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVL 487
+ Y E + L+EQ++ T+D++DYLWY + DP S +L V S GH L
Sbjct: 441 ASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLLTVFSAGHAL 500
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
H FING+ G+ +G + T K V+L G N +S+LSV VGLP+ G + E G L
Sbjct: 501 HVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHYETWNTGVL 560
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWY 603
V+++G E +D S + W Y++GL GE L + + GS V W GS + QPLTWY
Sbjct: 561 GPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVT-GSLVAQKQPLTWY 619
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------- 646
KT FD+P G++P+A+++ SMGKG+ W+NGQSIGR+W ++
Sbjct: 620 KTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTAKGSCGKCNYGGIFNEKKCH 679
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G PSQ WYH+PR++LK +GN+LV+ EE G P GIS+ S++
Sbjct: 680 SXCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRSIS 725
>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 1225
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/699 (49%), Positives = 451/699 (64%), Gaps = 48/699 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++L+I+G R+IL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 25 SVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R +LVRF+K VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 85 PGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNG 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV+MMK +LY SQGGPIILSQIENEYG VE G Y +WAA++
Sbjct: 145 PFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPWVMCKQ+DAPDP+I+ CNG C E F PN KP +WTE WT ++ +G
Sbjct: 205 ALGLDTGVPWVMCKQEDAPDPMIDTCNGFYC-ENFE-PNKAYKPKMWTEAWTGWFTEFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+AY VA FI + +GS +NYYMYHGGTNFGRTA ++ T Y AP+DEYG
Sbjct: 263 PVPYRPVEDLAYAVARFI-QNRGSLINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKR 386
L+RQPKWGHL++LH A+KLC ++S + QEA ++ S ECAAFL N D
Sbjct: 322 LIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTRSGECAAFLANYDPS 381
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEY------------KEAI 434
+ V F N Y+LPP S+SILPDCKTV FNTAK+++ W + +E
Sbjct: 382 TSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNAPSYWPKMTPISSFSWHSYNEETA 441
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLH 488
Y + + L+EQ++ T+DA+DYLWY + D S +L + S GH LH
Sbjct: 442 SAYADDTTTMAGLVEQISITRDATDYLWYMTDIRIDSNEGFLKSGQWPLLTIFSAGHALH 501
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING+ G+ +G + T K V+L G N +S+LSV VGLP+ G + E AG L
Sbjct: 502 VFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVHFETWNAGILG 561
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
V+++G E +D S + W Y+VGL GE L + T GS V W GS S QPLTWYK
Sbjct: 562 PVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMT-GSLVSQKQPLTWYK 620
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT-------------------- 644
T F+AP G++P+A+++ SMGKG+ W+NG+SIGR+W ++
Sbjct: 621 TTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHWPAYTARGSCGKCYYGGIFTEKKCHF 680
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PR++LKP+GN+LV+ EE G P GIS+
Sbjct: 681 SCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISL 719
Score = 394 bits (1012), Expect = e-106, Method: Compositional matrix adjust.
Identities = 227/501 (45%), Positives = 294/501 (58%), Gaps = 52/501 (10%)
Query: 231 INACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMK 290
I+ CNG C E F PN KP IWTENW+ +Y +G R ED+A+ VA FI +
Sbjct: 723 IDTCNGFYC-ENFK-PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFI-QNG 779
Query: 291 GSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKP 350
GS VNYYMYHGGTNFGRT+ +V T Y AP+DEYGLLR+PKWGHL++LH A+KLC
Sbjct: 780 GSLVNYYMYHGGTNFGRTSGLFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPA 839
Query: 351 MLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILP 409
++S S K QEA +F+ SS CAAFL N D V F N Y+LPP SISILP
Sbjct: 840 LVSADPTSTWLGKDQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILP 899
Query: 410 DCKTVAFNTAKLD------------------SVEQWEEYKEA-IPTYDETSLRANFLLEQ 450
DCKTV FNTA++ S W YKE Y + + + L+EQ
Sbjct: 900 DCKTVTFNTARVRRDPKLFIPNLLMAKMTPISSFWWLSYKEEPASAYAKDTTTKDGLVEQ 959
Query: 451 MNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHS 504
++ T D +DYLWY + D ++ +L V+S GH+LH FING+ GS +G
Sbjct: 960 VSVTWDTTDYLWYMTDIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSGSVYGSLE 1019
Query: 505 DKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSS 562
D T K V+L G N +S+LSV VGLP+ G + + AG L V+++G E +D S
Sbjct: 1020 DPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSK 1079
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLIS 622
+ W Y+VGL GE L +++ GS V W + GS QPLTWYKT F+ P G++P+A+++ S
Sbjct: 1080 YKWSYKVGLRGEILNLYSVKGSNSVQWMK-GSFQKQPLTWYKTTFNTPAGNEPLALDMSS 1138
Query: 623 MGKGEAWVNGQSIGRY--------------WVSFLTPQ------GTPSQSWYHIPRSFLK 662
M KG+ WVNG+SIGRY + F T + G PSQ WYHIPR +L
Sbjct: 1139 MSKGQIWVNGRSIGRYFPGYIASGKCNKCSYTGFFTEKKCLWNCGGPSQKWYHIPRDWLS 1198
Query: 663 PTGNLLVLLEEENGYPPGISI 683
P GNLL++LEE G P GIS+
Sbjct: 1199 PNGNLLIILEEIGGNPQGISL 1219
>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 723
Score = 668 bits (1724), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/699 (49%), Positives = 451/699 (64%), Gaps = 48/699 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++L+I+G R+IL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 25 SVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R +LVRF+K VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 85 PGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNG 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV+MMK +LY SQGGPIILSQIENEYG VE G Y +WAA++
Sbjct: 145 PFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPWVMCKQ+DAPDP+I+ CNG C E F PN KP +WTE WT ++ +G
Sbjct: 205 ALGLDTGVPWVMCKQEDAPDPMIDTCNGFYC-ENFE-PNKAYKPKMWTEAWTGWFTEFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+AY VA FI + +GS +NYYMYHGGTNFGRTA ++ T Y AP+DEYG
Sbjct: 263 PVPYRPVEDLAYAVARFI-QNRGSLINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKR 386
L+RQPKWGHL++LH A+KLC ++S + QEA ++ S ECAAFL N D
Sbjct: 322 LIRQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTRSGECAAFLANYDPS 381
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEY------------KEAI 434
+ V F N Y+LPP S+SILPDCKTV FNTAK+++ W + +E
Sbjct: 382 TSVRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKVNAPSYWPKMTPISSFSWHSYNEETA 441
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLH 488
Y + + L+EQ++ T+DA+DYLWY + D S +L + S GH LH
Sbjct: 442 SAYADDTTTMAGLVEQISITRDATDYLWYMTDIRIDSNEGFLKSGQWPLLTIFSAGHALH 501
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
FING+ G+ +G + T K V+L G N +S+LSV VGLP+ G + E AG L
Sbjct: 502 VFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVHFETWNAGILG 561
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
V+++G E +D S + W Y+VGL GE L + T GS V W GS S QPLTWYK
Sbjct: 562 PVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMT-GSLVSQKQPLTWYK 620
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT-------------------- 644
T F+AP G++P+A+++ SMGKG+ W+NG+SIGR+W ++
Sbjct: 621 TTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHWPAYTARGSCGKCYYGGIFTEKKCHF 680
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PR++LKP+GN+LV+ EE G P GIS+
Sbjct: 681 SCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISL 719
>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 830
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/845 (43%), Positives = 496/845 (58%), Gaps = 95/845 (11%)
Query: 22 GGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVF 81
GG NVTYD R+L+I+G R++L SGSIHYPRSTP MWP LI KAK+GGLDV++T VF
Sbjct: 22 AGGARAANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVF 81
Query: 82 WNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGI 141
W++HEP GQ+DF GR+DL F+K V GLYV LRIGP++ EW YGG P WLH +PGI
Sbjct: 82 WDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI 141
Query: 142 VFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPY 201
FR+DNEPFK M+R+ ++IENEYG ++ ++ G Y
Sbjct: 142 KFRTDNEPFKAEMQRFT----------------------AKIENEYGNIDSAYGAPGKAY 179
Query: 202 VRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTS 261
+RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+
Sbjct: 180 MRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFT--PNSAAKPKMWTENWSG 237
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQ 320
++ +G R ED+A+ VA F + G++ NYYMYHGGTN R++ ++ T Y
Sbjct: 238 WFLSFGGAVPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNLDRSSGGPFIATSYDYD 296
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFL 380
AP+DEYGL+RQPKWGHL+++H A+KLC +++ + EA +++ S CAAFL
Sbjct: 297 APIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGSVCAAFL 356
Query: 381 VNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----------------- 423
N D +++ TV F+ MY LP S+SILPDCK V NTA+++S
Sbjct: 357 ANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVAS 416
Query: 424 ----------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP 471
V W E + + +L L+EQ+NTT DASD+LWY + K D
Sbjct: 417 DGSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDE 476
Query: 472 ---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSV 528
+ S+S L V+SLGHVL +ING+ GSA G S + +K + L+ G N + LLS
Sbjct: 477 PYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSA 536
Query: 529 MVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIV 587
VGL + GA+ + AG+ V + G D SS W YQ+GL GE L ++ +
Sbjct: 537 TVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEASPE 596
Query: 588 PWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ- 646
S + PL WYKT F P G DPVAI+ MGKGEAWVNGQSIGRYW + L PQ
Sbjct: 597 WVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQS 656
Query: 647 ---------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
G PSQ+ YH+PRSFL+P N LVL E G P IS
Sbjct: 657 GCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFVM 716
Query: 686 VSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRR--PKVQIRCP-SGRKISKILF 742
++C VS++H + SW SQ P +R P +++ CP G+ IS + F
Sbjct: 717 RQTGSVCAQVSEAHPAQIDSWSSQQ----------PMQRYGPALRLECPKEGQVISSVKF 766
Query: 743 ASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLV 802
AS+G P+G C +Y+ G C S+ + +IV++AC+G SC+VPV + ++G+PC G+ K+L V
Sbjct: 767 ASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPV-SSNYFGNPCTGVTKSLAV 825
Query: 803 DAQCT 807
+A C+
Sbjct: 826 EAACS 830
>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
Length = 728
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/700 (50%), Positives = 443/700 (63%), Gaps = 49/700 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYDG+++ ING R+ILFSGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 28 SVTYDGKAIKINGQRRILFSGSIHYPRSTPEMWPGLIQKAKEGGLDVIQTYVFWNGHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F GR DLVRFIK Q GLYV LRIG ++ EW +GG P WL VPGI FR+DN
Sbjct: 88 PGQYYFEGRYDLVRFIKLAQQAGLYVHLRIGLYVCAEWNFGGFPVWLKYVPGIAFRTDNG 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IVN+MK+ +L+ SQGGPII+SQIENEYG VE G Y +WAA++
Sbjct: 148 PFKAAMQKFTEKIVNLMKSEKLFESQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAAEM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW+MCKQ+DAPDP+I+ CNG C E F PN KP +WTE WT +Y +G
Sbjct: 208 AVGLDTGVPWIMCKQEDAPDPIIDTCNGFYC-EGFT-PNKNYKPKMWTEAWTGWYTEFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+AY VA FI + GS+VNYYMYHGGTNFGRTA+ +V T Y AP+DEYG
Sbjct: 266 PIHNRPVEDLAYSVARFI-QNNGSFVNYYMYHGGTNFGRTAAGLFVATSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L R+PKWGHL++LH A+KLC ++S K E +F+ S CAAFL N D +
Sbjct: 325 LPREPKWGHLRDLHKAIKLCEPSLVSAYPTVTWPGKNLEVHVFKSKSSCAAFLANYDPSS 384
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-------------QWEEY-KEA 433
A V F N+ Y+LPP SISILPDCK FNTA++ S W+ Y +E
Sbjct: 385 PAKVTFQNMQYDLPPWSISILPDCKNAVFNTARVSSKSSQMKMTPVSGGAFSWQSYIEET 444
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVL 487
+ D ++ N L EQ++ T+D SDYLWY P++ VL V S GH L
Sbjct: 445 VSADDSDTIAKNGLWEQISITRDGSDYLWYLTDVNIHPNEGFLKNGQSPVLTVMSAGHAL 504
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
H FING+ G+ +G + T V L G N +SLLS VGLP+ G + E G L
Sbjct: 505 HVFINGQLAGTVYGSLENPKLTFSNNVKLRAGINKISLLSAAVGLPNVGLHFETWNTGVL 564
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWY 603
V+++G E +D + W Y+VGL GE L + T GS V W + GS + QPLTWY
Sbjct: 565 GPVTLKGLNEGTRDLTKQKWSYKVGLKGEDLSLHTLSGSSSVEWVQ-GSLLAQKQPLTWY 623
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------L 643
K F+AP G+DP+A+++ +MGKG+ W+NG+SIGR+W + L
Sbjct: 624 KATFNAPEGNDPLALDMNTMGKGQIWINGESIGRHWPEYKASGNCGGCSYAGIYTEKKCL 683
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
+ G SQ WYH+PRS+LKP+GN LV+ EE G P GIS
Sbjct: 684 SNCGEASQRWYHVPRSWLKPSGNFLVVFEELGGDPTGISF 723
>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/702 (50%), Positives = 452/702 (64%), Gaps = 53/702 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+++ING RKIL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 24 NVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIEKAKDGGLDVIETYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+++F GR DLV+FIK VQ GLYV LRIGP+I EW +GGLP WL V G+ FR+DN+
Sbjct: 84 PGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRTDNQ 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV+MMK+ +L+ QGGPII++QIENEYG VE G Y +WAA++
Sbjct: 144 PFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+T VPW+MCKQ+DAPDPVI+ CNG C E F PN P KP +WTE WT ++ +G
Sbjct: 204 AVGLKTDVPWIMCKQEDAPDPVIDTCNGFYC-EGFR-PNKPYKPKMWTEVWTGWFTKFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AEDIA+ VA F+ + GSY NYYMYHGGTNFGRT+S + YD AP+DEYG
Sbjct: 262 PIPQRPAEDIAFSVARFV-QNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LL +PK+GHL+ELH A+K C ++S + QEA +++ S CAAFL N D +
Sbjct: 321 LLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSGACAAFLSNYDAK 380
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAI 434
+ V F NL Y+LPP SISILPDCKTV +NTAK+ S W+ Y E
Sbjct: 381 YSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSSIKMTPAGGGLSWQSYNEDT 440
Query: 435 PTYDET-SLRANFLLEQMNTTKDASDYLWY--------NFRFKHDPSDSESVLKVSSLGH 485
PT D++ +LRAN L EQ N T+D+SDYLWY N F S + L V S GH
Sbjct: 441 PTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDVNIASNEGFLK--SGKDPYLTVMSAGH 498
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
VLH F+NG+ G+ +G + T V L G N +SLLSV VGLP+ G + + AG
Sbjct: 499 VLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKISLLSVSVGLPNVGVHYDTWNAG 558
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V++ G E +D + W Y+VGL GE L + T GS V W + GS + QPLT
Sbjct: 559 VLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTLSGSSSVEWVQ-GSLVARTQPLT 617
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------ 643
WYK F AP G++P+A+++ SMGKG+ W+NG+ +GR+W +
Sbjct: 618 WYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWPGYAAQGDCSKCSYAGTFNEKK 677
Query: 644 --TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G PSQ WYH+PRS+LK +GNLLV+ EE G P GIS+
Sbjct: 678 CQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPTGISL 719
>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
Length = 766
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/731 (48%), Positives = 456/731 (62%), Gaps = 60/731 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFW+ HEP
Sbjct: 36 SVTYDRKAIVINGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWDGHEPS 95
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F GR DLV+FIK V+ GLYV LRIGP+I EW GG P WL +PGI FR+DNE
Sbjct: 96 PGKYYFEGRYDLVKFIKLVKQAGLYVNLRIGPYICAEWNLGGFPVWLKYIPGISFRTDNE 155
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK +M + IV MMKA L+ QGGPII+SQIENEYG VE G Y RWAA +
Sbjct: 156 PFKRYMAGFTKKIVEMMKAESLFEPQGGPIIMSQIENEYGPVEWEIGAIGKVYTRWAASM 215
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV+L TGVPW+MCKQD+ PDP+IN CNG C + PN KP +WTE WT ++ +G
Sbjct: 216 AVNLNTGVPWIMCKQDEVPDPIINTCNGFYC--DWFKPNKDYKPIMWTELWTGWFTAFGG 273
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+AY V FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 274 PVPYRPVEDVAYAVVKFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 332
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKR 386
L R+PKWGHL++LH A+K+C ++S QEA +F+ S C+AFL NKD+
Sbjct: 333 LKREPKWGHLRDLHRAIKMCEPALVSNDPTVTKIGDSQEAHVFKFESGACSAFLENKDET 392
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-------------QWEEYKEA 433
N V F + YELPP SISILPDC V +NT ++ + W Y E
Sbjct: 393 NFVKVTFQGMQYELPPWSISILPDCVNVVYNTGRVGTQTSMMTMLSASNNEFSWASYNED 452
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES----------VLKVSSL 483
+Y+E S+ L EQ++ TKD++DYL R+ D + ++ VL V+S
Sbjct: 453 TASYNEESMTIEGLSEQISITKDSTDYL----RYTTDVTIGQNEGFLKNGEYPVLTVNSA 508
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH L F+NG+ G+A+G +D T V L G N +SLLS VGLP+ G + E
Sbjct: 509 GHALQVFVNGQLSGTAYGSVNDPRLTFSGKVKLWAGNNKISLLSSAVGLPNVGTHFETWN 568
Query: 544 AG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH--QP 599
G L V++ G E K D S W Y+VG++GE LQ+ + GS V W GSST QP
Sbjct: 569 YGVLGPVTLNGLNEGKRDLSLQKWSYKVGVIGEALQLHSPTGSSSVEW---GSSTSKIQP 625
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
TWYKT F+AP G+DP+A+++ +MGKG+ W+NGQSIGRYW ++
Sbjct: 626 FTWYKTTFNAPGGNDPLALDMNTMGKGQIWINGQSIGRYWPAYKANGKCSACHYTGWYDE 685
Query: 647 -------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
G SQ WYHIPRS+L PTGNLLV+ EE G P GI++ ++ + C ++++ H
Sbjct: 686 KKCGFNCGEASQRWYHIPRSWLNPTGNLLVVFEEWGGDPTGITLVRRTIGSACAYINEWH 745
Query: 700 LPPVISWRSQN 710
P V +W+ +N
Sbjct: 746 -PTVKNWKIEN 755
>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
Length = 724
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/700 (50%), Positives = 452/700 (64%), Gaps = 49/700 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+++ING RKIL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 24 NVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+++F GR DLV+FIK VQ GLYV LRIGP+I EW +GGLP WL V G+ FR+DN+
Sbjct: 84 PGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRTDNQ 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV+MMK+ +L+ QGGPII++QIENEYG VE G Y +WAA++
Sbjct: 144 PFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+T VPW+MCKQ+DAPDPVI+ CNG C E F PN P KP +WTE WT ++ +G
Sbjct: 204 AVGLKTDVPWIMCKQEDAPDPVIDTCNGFYC-EGFR-PNKPYKPKMWTEVWTGWFTKFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AEDIA+ VA F+ + GSY NYYMYHGGTNFGRT+S + YD AP+DEYG
Sbjct: 262 PIPQRPAEDIAFSVARFV-QNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LL +PK+GHL+ELH A+K C ++S + QEA +++ S CAAFL N D +
Sbjct: 321 LLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSGACAAFLSNYDAK 380
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAI 434
+ V F NL Y+LPP SISILPDCKTV +NTAK+ S W+ Y E
Sbjct: 381 YSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSSIKMTPAGGGLSWQSYNEDT 440
Query: 435 PTYDET-SLRANFLLEQMNTTKDASDYLWY--NFRFKHD----PSDSESVLKVSSLGHVL 487
PT D++ +LRAN L EQ N T+D+SDYLWY + + S + L V S GHVL
Sbjct: 441 PTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDINIASNEGFLKSGKDPYLTVMSAGHVL 500
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
H F+NG+ G+ +G + T V L G N +SLLSV VGLP+ G + + AG L
Sbjct: 501 HVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKISLLSVSVGLPNVGVHYDTWNAGVL 560
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWY 603
V++ G E +D + W Y+VGL GE L + T GS V W + GS + QPLTWY
Sbjct: 561 GPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTLSGSSSVEWVQ-GSLVARTQPLTWY 619
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL-------------------- 643
K F AP G++P+A+++ SMGKG+ W+NG+ +GR+W +
Sbjct: 620 KATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWPGYAAQGDCSKCSYAGTFNEKKCQ 679
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G PSQ WYH+PRS+LK +GNLLV+ EE G P GIS+
Sbjct: 680 TNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPTGISL 719
>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
Length = 723
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/700 (50%), Positives = 452/700 (64%), Gaps = 49/700 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++I+G R+IL SGSIHYPRSTP+MWP L KAKEGGLDV+QT VFWN HEP
Sbjct: 24 SVTYDHKTIVIDGQRRILISGSIHYPRSTPEMWPALFQKAKEGGLDVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLV+FIK Q GLYV LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 84 PGKYYFEDRFDLVKFIKLAQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T IV+MMKA L+ +QGGPII+SQIENEYG VE + G Y WAA++
Sbjct: 144 PFKAAMQKFTTKIVSMMKAENLFQNQGGPIIMSQIENEYGPVEWNIGAPGKAYTNWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW MCKQ+DAPDPVI+ CNG C E F PN KP +WTENW+ +Y +G+
Sbjct: 204 AVGLDTGVPWDMCKQEDAPDPVIDTCNGYYC-ENFT-PNKNYKPKMWTENWSGWYTDFGN 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+AY VA FI + +GS+VNYYMYHGGTNFGRT+S + YD AP+DEYG
Sbjct: 262 AICYRPVEDLAYSVARFI-QNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKR 386
L +PKW HL++LH A+K C ++S + EA ++ G+S CAAFL N D +
Sbjct: 321 LTNEPKWSHLRDLHKAIKQCEPALVSVDPTITSLGNKLEAHVYSTGTSVCAAFLANYDTK 380
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQWEEY-KEA 433
+ ATV F N Y+LPP S+SILPDCKT FNTAK+ +S W+ Y +E
Sbjct: 381 SAATVTFGNGKYDLPPWSVSILPDCKTDVFNTAKVGAQSSQKTMISTNSTFDWQSYIEEP 440
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVL 487
+ ++ S+ A L EQ+N T+D+SDYLWY P++ +L V S GHVL
Sbjct: 441 AFSSEDDSITAEALWEQINVTRDSSDYLWYLTDVNISPNEDFIKNGQYPILNVMSAGHVL 500
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGL 546
H F+NG+ G+ +G + T V+L G N +SLLSV VGLP+ G + E V L
Sbjct: 501 HVFVNGQLSGTVYGVLDNPKLTFSNSVNLTVGNNKISLLSVAVGLPNVGLHFETWNVGVL 560
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWY 603
V+++G E +D S W Y+VGL GE L + T G V W++ GS + QPLTWY
Sbjct: 561 GPVTLKGLNEGTRDLSWQKWSYKVGLKGESLSLHTITGGSSVDWTQ-GSLLAKKQPLTWY 619
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL-------------------- 643
K F+AP G+DP+ +++ SMGKGE WVN QSIGR+W ++
Sbjct: 620 KATFNAPAGNDPLGLDMSSMGKGEIWVNDQSIGRHWPGYIAHGSCGDCDYAGTFTNTKCR 679
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G P+Q+WYHIPRS+L PTGN+LV+LEE G P GIS+
Sbjct: 680 TNCGNPTQTWYHIPRSWLNPTGNVLVVLEEWGGDPSGISL 719
>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
Length = 726
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/723 (49%), Positives = 458/723 (63%), Gaps = 61/723 (8%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
LC F +T +VTYD ++++ING R+IL SGSIHYPRSTPQMWP LI
Sbjct: 16 FLCFFVCYVTA------------SVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLI 63
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GG+DV++T VFWN HEP G++ F R DLV+FIK VQ GLYV LRIGP++ E
Sbjct: 64 QKAKDGGVDVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAE 123
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPG+ FR+DNEPFK M+++ T IV++MK+ L+ SQGGPIILSQIEN
Sbjct: 124 WNFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIEN 183
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
EYG VE G Y +W +++AV L TGVPWVMCKQ+DAPDP+I+ CNG C E F+
Sbjct: 184 EYGPVEWEIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYC-ENFS- 241
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTENWT +Y +G R AED+A+ VA F+ + +GSYVNYYMYHGGTNF
Sbjct: 242 PNKNYKPKMWTENWTGWYTDFGTAVPYRPAEDLAFSVARFV-QNRGSYVNYYMYHGGTNF 300
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRT+S + YD AP+DEYGL+ +PKWGHL++LH A+K C ++S K
Sbjct: 301 GRTSSGLFIATSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKN 360
Query: 365 QEAFIFQGS-SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-- 421
E +++ S CAAFL N D + A V F N Y+LPP SISILPDCKT FNTAK+
Sbjct: 361 LEVHLYKTSFGACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRA 420
Query: 422 ----------DSVEQWEEYKEAIPTYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+S W+ Y E E+ S AN LLEQ++ T D SDYLWY
Sbjct: 421 PRVHRSMTPANSAFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNIS 480
Query: 471 PSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
P++ VL S GHVLH FING+F G+A+G + T V L G N +S
Sbjct: 481 PNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKIS 540
Query: 525 LLSVMVGLPDSGAYLER-RVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDY 582
LLSV VGL + G + E+ V L V+++G E +D S W Y++GL GE L + T
Sbjct: 541 LLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGESLNLHTTS 600
Query: 583 GSRIVPWSRYGS--STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV 640
GS V W++ GS S QPLTWYKT F+AP G+DP+A+++ SMGKGE WVNGQSIGR+W
Sbjct: 601 GSSSVKWTQ-GSFLSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNGQSIGRHWP 659
Query: 641 SFL--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPG 680
+++ T G P+Q WYHIPRS+L P+GN+LV+LEE G P G
Sbjct: 660 AYIARGNCGSCNYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVLEEWGGDPTG 719
Query: 681 ISI 683
IS+
Sbjct: 720 ISL 722
>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
Length = 724
Score = 664 bits (1714), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/702 (50%), Positives = 453/702 (64%), Gaps = 53/702 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD R++IING RKIL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 24 SVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPG+ FR++N+
Sbjct: 84 PGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRTNNQ 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IVNMMK+ L+ SQGGPII++QIENEYG VE G Y +WAA++
Sbjct: 144 PFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+TGVPW+MCKQ+DAPDPVI+ CNG C E F PN P KP +WTE WT +Y +G
Sbjct: 204 AVGLKTGVPWIMCKQEDAPDPVIDTCNGFYC-EGFR-PNKPYKPKMWTEVWTGWYTKFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AEDIA+ VA F+ + GS+ NYYMYHGGTNFGRT+S + YD APLDEYG
Sbjct: 262 PIPQRPAEDIAFSVARFV-QNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LL +PK+GHL++LH A+KL ++S + QEA +++ S CAAFL N D R
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAI 434
+ V F N Y LPP SISILPDCKT +NTA+++S W+ Y E
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTPAGGGLSWQSYNEET 440
Query: 435 PTYDET-SLRANFLLEQMNTTKDASDYLWY--------NFRFKHDPSDSESVLKVSSLGH 485
PT D++ +L AN L EQ N T+D+SDYLWY N F + D L V S GH
Sbjct: 441 PTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKD--PYLTVMSAGH 498
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
VLH F+NG+ G+ +G + T V L G N +SLLSV VGLP+ G + + AG
Sbjct: 499 VLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVGVHYDTWNAG 558
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V++ G E ++ + W Y+VGL GE L + + GS V W R GS + QPLT
Sbjct: 559 VLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVR-GSLMAQKQPLT 617
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------ 643
WYK F+AP G+DP+A+++ SMGKG+ W+NG+ +GR+W ++
Sbjct: 618 WYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIAQGDCSKCSYAGTFNEKK 677
Query: 644 --TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G PSQ WYH+PRS+LKP+GNLLV+ EE G P GIS+
Sbjct: 678 CQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISL 719
>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
Length = 731
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/708 (50%), Positives = 438/708 (61%), Gaps = 50/708 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD ++LIING RK+LFSGSIHYPRSTP+MW LI KAK+GGLDV+ T VFWNLHEP
Sbjct: 27 NVTYDKKALIINGQRKVLFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYVFWNLHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFIK V GLYV LRIGP+I EW +GG P WL VPGI FR+DNE
Sbjct: 87 PGNYNFDGRYDLVRFIKLVHEAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGISFRTDNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV MMK L+ SQGGPIILSQIENEY +F G Y+ WAA +
Sbjct: 147 PFKSAMQKFTQKIVQMMKDENLFESQGGPIILSQIENEYEPESKAFGSPGHAYMTWAAHM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ + TGVPWVMCK+ DAPDPVIN CNG C + PN P KP +WTE WT ++ +G
Sbjct: 207 AISMDTGVPWVMCKEFDAPDPVINTCNGFYC--DYFSPNKPYKPTMWTEAWTGWFTDFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA FI K GS VNYYMYHGGTNFGRT+ +T YD AP+DEYG
Sbjct: 265 PNHQRPAEDLAFAVARFIQK-GGSLVNYYMYHGGTNFGRTSGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLKELH A+KLC K +L+ + ++A +F S CAAFL N + +
Sbjct: 324 LIRQPKYGHLKELHKAIKLCEKALLAADSTVTSLGSYEQAHVFSSDSGGCAAFLSNYNTK 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
A V F+N+ Y LPP SISILPDCK V FNTA + + WE + E
Sbjct: 384 QAARVKFNNIQYSLPPWSISILPDCKNVVFNTAHVGVQTSQVHMLPTDSELLSWETFNED 443
Query: 434 IPTYDETSL-RANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHV 486
I + D+ + LLEQ+N T+D SDYLWY S+S VL V S GH
Sbjct: 444 ISSVDDDKMITVAGLLEQLNITRDTSDYLWYTTSVHISSSESFLRGGRLPVLTVQSAGHA 503
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
LH FINGE GSAHG + FT + + G N +SLLSV VGLP++G E G+
Sbjct: 504 LHVFINGELSGSAHGTREQRRFTFTEDMKFHAGKNRISLLSVAVGLPNNGPRFETWNTGI 563
Query: 547 RN-VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS---STHQPLT 601
V++ G E +D + W Y+VGL GE + + + +V W + GS QPLT
Sbjct: 564 LGPVTLHGLDEGQRDLTWQKWSYKVGLKGEDMNLRSRKSVSLVDWIQ-GSLMVGKQQPLT 622
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
WYK F++P G DP+A+++ SMGKG+ W+NG SIGRYW +
Sbjct: 623 WYKAYFNSPKGDDPLALDMGSMGKGQVWINGHSIGRYWTLYAEGNCSGCSYSATFRPARC 682
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
G P+Q WYH+PRS+LK T NLLVL EE G IS+ VT+
Sbjct: 683 QLGCGQPTQKWYHVPRSWLKSTRNLLVLFEEIGGDASRISLVKRLVTS 730
>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
Length = 892
Score = 663 bits (1711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/862 (41%), Positives = 506/862 (58%), Gaps = 99/862 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+LII G R++L S IHYPR+TP+MWP LIA++KEGG DV++T FWN HEP
Sbjct: 36 NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ++F GR D+V+F K V + GL++ +RIGP+ EW +GG P WL D+PGI FR+DN
Sbjct: 96 RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+RY IV++M + L++ QGGPIIL QIENEYG VE SF KG Y++WAA++
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESSFGPKGKLYMKWAAEM 215
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L GVPWVMC+Q DAP+ +I+ CN C + F PNS KP IWTENW ++ +G+
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYC-DGFT-PNSEKKPKIWTENWNGWFADWGE 273
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +EDIA+ +A F + GS NYYMY GGTNFGRTA YD APLDEYG
Sbjct: 274 RLPYRPSEDIAFAIARFFQR-GGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYG 332
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSE--------- 375
LLRQPKWGHLK+LH+A+KLC +++ S + KL QEA +++G+S
Sbjct: 333 LLRQPKWGHLKDLHAAIKLCEPALVAA--DSPQYIKLGPKQEAHVYRGTSNNIGQYMSLN 390
Query: 376 ---CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA-----KLDS---- 423
CAAF+ N D+ +ATV F + LPP S+ + + +T KL S
Sbjct: 391 EGICAAFIANIDEHESATVKFYGQEFTLPPWSV-VFCQIAEIQLSTQLRWGHKLQSKQWA 449
Query: 424 ------------------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASD 459
+ W KE + + + + + +LE +N TKD SD
Sbjct: 450 QILFQLGIILCFYKLSLKASSESFSQSWMTLKEPLGVWGDKNFTSKGILEHLNVTKDQSD 509
Query: 460 YLWYNFRFK--------HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLE 511
YLWY R + +D + + S+ + F+NG+ GS GK +
Sbjct: 510 YLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----IKVV 565
Query: 512 KMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQV 569
+ V L+ G N++ LLS VGL + GA+LE+ AG + + + G K + ++ W YQV
Sbjct: 566 QPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWTYQV 625
Query: 570 GLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
GL GE L+++ + W+ + + +T +WYKT FDAP G+DPVA++ SMGKG+A
Sbjct: 626 GLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGKGQA 685
Query: 629 WVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTGN 666
WVNG +GRYW + + P G +Q+WYHIPRS+LK N
Sbjct: 686 WVNGHHVGRYW-TLVAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKTLNN 744
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW-RSQNQRTLKTHKRIPGRRP 725
+LV+ EE + P ISI T S T+C VS+ H PP+ W S+ R L + + P
Sbjct: 745 VLVIFEETDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRKLS----LMDKTP 800
Query: 726 KVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWT 785
++ ++C G IS I FASYG+PNG+C+ ++ G CH++NS ++V +AC+G+ SC++ + +
Sbjct: 801 EMHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVSQACIGRTSCSIGI-S 859
Query: 786 EKFYGDPCPGIPKALLVDAQCT 807
+GDPC + K+L V A+C+
Sbjct: 860 NGVFGDPCRHVVKSLAVQAKCS 881
>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
Length = 724
Score = 662 bits (1709), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/702 (50%), Positives = 452/702 (64%), Gaps = 53/702 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD R++IING RKIL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN H P
Sbjct: 24 SVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHGPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPG+ FR++N+
Sbjct: 84 PGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRTNNQ 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IVNMMK+ L+ SQGGPII++QIENEYG VE G Y +WAA++
Sbjct: 144 PFKVAMRGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+TGVPW+MCKQ+DAPDPVI+ CNG C E F PN P KP +WTE WT +Y +G
Sbjct: 204 AVGLKTGVPWIMCKQEDAPDPVIDTCNGFYC-EGFR-PNKPYKPKMWTEVWTGWYTKFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AEDIA+ VA F+ + GS+ NYYMYHGGTNFGRT+S + YD APLDEYG
Sbjct: 262 PIPQRPAEDIAFSVARFV-QNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LL +PK+GHL++LH A+KL ++S + QEA +++ S CAAFL N D R
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAI 434
+ V F N Y LPP SISILPDCKT +NTA+++S W+ Y E
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTPAGGGLSWQSYNEET 440
Query: 435 PTYDET-SLRANFLLEQMNTTKDASDYLWY--------NFRFKHDPSDSESVLKVSSLGH 485
PT D++ +L AN L EQ N T+D+SDYLWY N F + D L V S GH
Sbjct: 441 PTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKD--PYLTVMSAGH 498
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
VLH F+NG+ G+ +G + T V L G N +SLLSV VGLP+ G + + AG
Sbjct: 499 VLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVGVHYDTWNAG 558
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V++ G E ++ + W Y+VGL GE L + + GS V W R GS + QPLT
Sbjct: 559 VLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVR-GSLVAQKQPLT 617
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------ 643
WYK F+AP G+DP+A+++ SMGKG+ W+NG+ +GR+W ++
Sbjct: 618 WYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIAQGDCSKCSYAGTFNEKK 677
Query: 644 --TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G PSQ WYH+PRS+LKP+GNLLV+ EE G P GIS+
Sbjct: 678 CQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISL 719
>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 662 bits (1707), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/832 (42%), Positives = 489/832 (58%), Gaps = 87/832 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YDGR++ I+G RKILFSGSIHYPRST +MWP LI K+KEGGLDV++T VFWN+HEP
Sbjct: 26 DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHEPH 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+DFSG DLVRFIK +Q QGLY LRIGP++ EW YGG P WLH++P I FR++N
Sbjct: 86 PGQYDFSGNLDLVRFIKTIQNQGLYAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTNNA 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
F+ MK++ T+IV+MM+ +L+ASQGGPIIL+QIENEYG + S+ + G YV+W A+L
Sbjct: 146 IFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCAQL 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A Q GVPW+MC+Q DAPDP+IN CNG C + PNS +KP +WTE+WT ++ +G
Sbjct: 206 AQSYQIGVPWIMCQQSDAPDPLINTCNGFYCDQWH--PNSNNKPKMWTEDWTGWFMHWGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R+AED+A+ V F + G++ NYYMYHGGTNFGRT+ Y+ T Y APL+EYG
Sbjct: 264 PTPHRTAEDVAFAVGRFF-QYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L QPKWGHLK LH +K + G ++++ A IF + + FL N
Sbjct: 323 DLNQPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIFSYAGQSVCFLGNAHPSM 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE---------------QW----- 427
+A + F N Y +P S+SILPDC T +NTAK+++ QW
Sbjct: 383 DANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSIMTINNENSYALDWQWMPETH 442
Query: 428 -EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDPSDSESV-LKVSS 482
E+ K+ ++ A LL+Q D SDYLWY + DP S + ++V++
Sbjct: 443 LEQMKDG-KVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPILSHDLKIRVNT 500
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GHVLH F+NG +GS + + +FT E + L G N +SL+S VGLP+ GAY +
Sbjct: 501 KGHVLHVFVNGAHIGSQYATYGKYTFTFEADIKLKLGKNEISLVSGTVGLPNYGAYFDNI 560
Query: 543 VAGLRNVSI----QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
G+ V + G++ KD S+ W Y+VG+ GE +++++ S W G H+
Sbjct: 561 HVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPSRS-TEEWFTNGLQAHK 619
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------- 643
WYKT F P G+D V ++L +GKG+AWVNG +IGRYWVS+L
Sbjct: 620 IFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSSTCDYRGT 679
Query: 644 -------TPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
T G P+Q WYH+P SFL+ N LV+ EE+ G P + I TV++ C
Sbjct: 680 YRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVTIAKACAKA 739
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
+ H ++++ C + IS+I FAS+G P G C ++
Sbjct: 740 YEGH--------------------------ELELACKENQVISEIKFASFGVPEGECGSF 773
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPK-ALLVDAQC 806
G C SS++ +IV++ CLGK+ C++ V EK G +P+ L +DA C
Sbjct: 774 KKGHCESSDTLSIVKRLCLGKQQCSIQV-NEKMLGPTGCRVPENRLAIDALC 824
>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
Length = 739
Score = 661 bits (1705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/708 (49%), Positives = 443/708 (62%), Gaps = 51/708 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +++IING R+IL SGSIHYPRSTP+MW LI KAK GGLD + T VFWN+HEP
Sbjct: 27 SVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIRKAKGGGLDAIDTYVFWNVHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 87 PGIYNFEGRYDLVRFIKTVQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK +L+ SQGGPIILSQIENEYG G Y WAAK+
Sbjct: 147 PFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGSESKQLGGAGYAYTNWAAKM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDPVINACNG C + PN P KP +WTE+W+ ++ +G
Sbjct: 207 AVGLNTGVPWVMCKQDDAPDPVINACNGFYC--DYFSPNKPYKPTLWTESWSGWFTEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R +D+A+ VA FI K GSY+NYYMYHGGTNFGR+A +T YD AP+DEYG
Sbjct: 265 PIYQRPVQDLAFAVARFIQK-GGSYINYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+R+PK+GHL +LH A+K C + ++S + ++A +F + CAAFL N
Sbjct: 324 LIREPKYGHLMDLHKAIKQCERALVSSDPTVTSLGAYEQAHVFSSKNGACAAFLANYHSN 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+N Y+LPP SISILPDCKT FNTA++ + WE Y E
Sbjct: 384 SAARVTFNNRKYDLPPWSISILPDCKTDVFNTARVRFQTTKIQMLPSNSKLFSWETYDED 443
Query: 434 IPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
+ + E+S + A+ LLEQ+N T+D SDYLWY D S SES L+ V S G
Sbjct: 444 VSSLSESSKITASGLLEQLNATRDTSDYLWYITSV--DISSSESFLRGGNKPSISVHSAG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H FING+F+GSA G D+S T V+L GTN ++LLSV VGLP+ G + E A
Sbjct: 502 HAVHVFINGQFLGSAFGTSEDRSCTFNGPVNLRAGTNKIALLSVAVGLPNVGFHFETWKA 561
Query: 545 GLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLT 601
G+ V + G KD + W YQ+GL GE + + + G V W R + L
Sbjct: 562 GITGVLLYGLDHGQKDLTWQKWSYQIGLKGEAMNLVSPNGVSSVDWVRDSLDVRSQSQLK 621
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
W+K F+AP G +P+A++L SMGKG+ W+NGQSIGRYW+ +
Sbjct: 622 WHKAYFNAPDGVEPLALDLSSMGKGQVWINGQSIGRYWMVYAKGACNSCNYAGTYRPAKC 681
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
G P+Q WYH+PRS+LKPT NL+VLLEE G P IS+ + T
Sbjct: 682 QLGCGQPTQQWYHVPRSWLKPTNNLIVLLEELGGNPWKISLQKRIIHT 729
>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
Length = 724
Score = 660 bits (1704), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/702 (50%), Positives = 452/702 (64%), Gaps = 53/702 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD R++IING RKIL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 24 SVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPG+ FR++N+
Sbjct: 84 PGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRTNNQ 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IVNMMK+ L+ SQGGPII++QIENEYG VE G Y +WAA++
Sbjct: 144 PFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L+TGVPW+MCK++DAPDPVI+ CNG C E F PN P KP +WTE WT +Y +G
Sbjct: 204 AVGLKTGVPWIMCKREDAPDPVIDTCNGFYC-EGFR-PNKPYKPKMWTEVWTGWYTKFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AEDIA+ VA F+ + GS+ NYYMYHGGTNFGRT+S + YD APLDEYG
Sbjct: 262 PIPQRPAEDIAFSVARFV-QNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
LL +PK+GHL++LH A+KL ++S + QEA +++ S CAAFL N D R
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAI 434
+ V F N Y LPP SISILPDCKT +NTA+++S W+ Y E
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSSIKMTPAGGGLSWQSYNEET 440
Query: 435 PTYDET-SLRANFLLEQMNTTKDASDYLWY--------NFRFKHDPSDSESVLKVSSLGH 485
PT D++ +L AN L EQ N T+D+SDYLWY N F + D L V S GH
Sbjct: 441 PTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLRNGKD--PYLTVMSAGH 498
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
VLH F+NG+ G+ +G + T V L G N +SLLSV VGLP+ G + + AG
Sbjct: 499 VLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVGVHYDTWNAG 558
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V++ G E ++ + W Y+VGL GE L + + GS V W R GS + QPLT
Sbjct: 559 VLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVR-GSLVAQKQPLT 617
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------ 643
WYK F+AP G+DP+A+ + SMGKG+ W+NG+ +GR+W ++
Sbjct: 618 WYKATFNAPGGNDPLALGMASMGKGQIWINGEGVGRHWPGYIAQGDCSKCSYAGTFNEKK 677
Query: 644 --TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G PSQ W+H+PRS+LKP+GNLLV+ EE G P GIS+
Sbjct: 678 CQTNCGQPSQRWHHVPRSWLKPSGNLLVVFEEWGGNPTGISL 719
>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
Length = 721
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/698 (49%), Positives = 443/698 (63%), Gaps = 48/698 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD +++IING R+IL SGSIHYPRSTPQMWP LI AKEGGLDV+QT VFWN HEP P
Sbjct: 23 VSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGHEPSP 82
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+FIK V GLYV LRIGP+I GEW +GG P WL VPGI FR+DN P
Sbjct: 83 GNYYFEDRYDLVKFIKLVHQAGLYVHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGP 142
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IVNMMKA +L+ QGGPII+SQIENEYG +E G Y +WAA++A
Sbjct: 143 FKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWAAQMA 202
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPW+MCKQ+DAPDP+I+ CNG C E F PN+ KP ++TE WT +Y +G
Sbjct: 203 VGLGTGVPWIMCKQEDAPDPIIDTCNGFYC-ENFM-PNANYKPKMFTEAWTGWYTEFGGP 260
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R AED+AY VA FI + +GS++NYYMYHGGTNFGRTA ++ T Y APLDEYGL
Sbjct: 261 VPYRPAEDMAYSVARFI-QNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 319
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PKWGHL++LH +KLC ++S + QEA +F + CAAFL N D + +
Sbjct: 320 RREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFWTKTSCAAFLANYDLKYS 379
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAK------------LDSVEQWEEYKEAIPT 436
V F NL Y+LPP S+SILPDCKTV FNTAK ++S W+ Y E P+
Sbjct: 380 VRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKMIAVNSAFSWQSYNEETPS 439
Query: 437 YD-ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHA 489
+ + + L EQ++ T+DA+DYLWY P ++ + +L V S GH LH
Sbjct: 440 ANYDAVFTKDGLWEQISVTRDATDYLWYMTDVTIGPDEAFLKNGQDPILTVMSAGHALHV 499
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+ +G+ + V L G N VSLLS+ VGLP+ G + E AG L
Sbjct: 500 FVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLLSIAVGLPNVGLHFETWNAGVLGP 559
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G D S + W Y++GL GE L + T GS V W GS + QPL WYKT
Sbjct: 560 VTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSGSSSVEWVE-GSLLAQRQPLIWYKT 618
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TP 645
F+AP G+DP+A+++ SMGKG+ W+NGQSIGR+W + +
Sbjct: 619 TFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGYKARGSCGACNYAGIYDEKKCHSN 678
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G SQ WYH+PRS+L PT NLLV+ EE G P IS+
Sbjct: 679 CGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKISL 716
>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
Length = 721
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/698 (50%), Positives = 445/698 (63%), Gaps = 47/698 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++I+G R+IL SGSIHYPRSTPQMWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 24 SVTYDHKAIVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLVRF+K Q GLYV LRIGP+I EW +GG P WL VPGI FR+DNE
Sbjct: 84 PGKYYFEDRYDLVRFVKLAQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDNE 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV++MK RL+ SQGGPIILSQIENEYG VE G Y +WAA++
Sbjct: 144 PFKAAMQKFTAKIVSLMKEERLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQ+DAPDPVI+ CNG C E F PN KP +WTENWT +Y +G
Sbjct: 204 AVGLDTGVPWVMCKQEDAPDPVIDTCNGFYC-ENFK-PNKNTKPKMWTENWTGWYTDFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ IR AED+A+ VA FI + GS+VNYYMYHGGTNFGRT+ + YD APLDEYG
Sbjct: 262 ASPIRPAEDLAFSVARFI-QNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L +PKWGHL+ LH A+K ++S + EA +F CAAF+ N D ++
Sbjct: 321 LQNEPKWGHLRALHKAIKQSEPALVSTDPKVTSLGYNLEAHVFSTPGACAAFIANYDTKS 380
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAK-----------LDSVEQWEEY-KEAIP 435
+A F + Y+LPP SISILPDCKTV +NTA+ ++S W+ Y +E
Sbjct: 381 SAKATFGSGQYDLPPWSISILPDCKTVVYNTARVGNGWVKKMTPVNSGFAWQSYNEEPAS 440
Query: 436 TYDETSLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKVSSLGHVLHA 489
+ + S+ A L EQ+N T+D+SDYLWY N + VL V S GH+LH
Sbjct: 441 SSQDDSIAAEALWEQVNVTRDSSDYLWYMTDVYINGNEGFLKNGRSPVLTVMSAGHLLHV 500
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
FING+ G+ +G + T V+L G N +SLLSV VGLP+ G + E AG L
Sbjct: 501 FINGQLSGTVYGGLGNPKLTFSDNVNLRVGNNKLSLLSVAVGLPNVGVHFETWNAGVLGP 560
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G E +D S W Y+VGL GE L + T+ GS V W + GS + QPLTWYK
Sbjct: 561 VTLKGLNEGTRDLSRQKWSYKVGLKGEALNLHTESGSSSVEWIQ-GSLVAKKQPLTWYKA 619
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TP 645
F AP G+DP+A++L SMGKGE WVNG+SIGR+W ++ T
Sbjct: 620 TFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIAHGSCNACNYAGYYTDQKCRTN 679
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PRS+L GN LV+ EE G P GI++
Sbjct: 680 CGKPSQRWYHVPRSWLNSGGNSLVVFEEWGGDPNGIAL 717
>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
Length = 737
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/700 (49%), Positives = 443/700 (63%), Gaps = 50/700 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING ++IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 38 SVSYDHKAVIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPT 97
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G + F R DLVRFIK VQ GLYV LRIGP++ EW YGG P WL VPGI FR+DN
Sbjct: 98 QGNYYFQDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPVWLKYVPGIEFRTDNG 157
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M ++ IV+MMKA +L+ +QGGPIILSQIENE+G VE G Y +WAA++
Sbjct: 158 PFKAAMHKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEWDIGAPGKAYAKWAAQM 217
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDPVIN CNG C E F PN KP +WTE WT ++ +G
Sbjct: 218 AVGLNTGVPWVMCKQDDAPDPVINTCNGFYC-EKFV-PNQNYKPKMWTEAWTGWFTEFGS 275
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
R AED+ + VA FI + GS++NYYMYHGGTNFGRT+ +V T Y AP+DEYGL
Sbjct: 276 AVPTRPAEDLVFSVARFI-QSGGSFINYYMYHGGTNFGRTSGGFVATSYDYDAPIDEYGL 334
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKRN 387
L +PKWGHL+ LH A+KLC ++S + + QEA +F S +CAAFL N D
Sbjct: 335 LNEPKWGHLRGLHKAIKLCEPALVSVDPTVKSLGENQEAHVFNSISGKCAAFLANYDTTF 394
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------------SVEQWEEY-KEAI 434
+A V F N Y+LPP SIS+LPDCKT FNTA++ + W+ Y +E
Sbjct: 395 SAKVSFGNAQYDLPPWSISVLPDCKTAVFNTARVGVQSSQKKFVPVINAFSWQSYIEETA 454
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWY--------NFRFKHDPSDSESVLKVSSLGHV 486
+ D+ + + L EQ+ T DASDYLWY N F + D +L + S GH
Sbjct: 455 SSTDDNTFTKDGLWEQVYLTADASDYLWYMTDVNIGSNEGFLKNGQD--PLLTIWSAGHA 512
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG- 545
L FING+ G+ +G + T K V L G N +SLLS VGLP+ G + E+ AG
Sbjct: 513 LQVFINGQLSGTVYGSLENPKLTFSKNVKLRAGVNKISLLSTSVGLPNVGTHFEKWNAGV 572
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWY 603
L V+++G E +D S W Y++GL GE L + T GS V W++ S + QP+TWY
Sbjct: 573 LGPVTLKGLNEGTRDISKQKWTYKIGLKGEALSLHTVSGSSSVEWAQGASLAQKQPMTWY 632
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL-------------------- 643
KT F+ P G+DP+A+++ +MGKG W+NGQSIGR+W ++
Sbjct: 633 KTTFNVPPGNDPLALDMGAMGKGMVWINGQSIGRHWPGYIGNGNCGGCNYAGTYTEKKCR 692
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G PSQ WYH+PRS LKP+GNLLV+ EE G P IS+
Sbjct: 693 TYCGKPSQRWYHVPRSRLKPSGNLLVVFEEWGGEPHWISL 732
>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
Length = 719
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/696 (48%), Positives = 448/696 (64%), Gaps = 45/696 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++IING R+IL SGSIHYPRSTPQMWP LI AK+GGLD+++T VFWN HEP
Sbjct: 22 VTYDQKAIIINGKRRILVSGSIHYPRSTPQMWPSLIQNAKDGGLDIIETYVFWNGHEPTQ 81
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++ F R DLVRFIK VQ GLYV LRIGP++ EW YGG P WL VPGIVFR++NEP
Sbjct: 82 GKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPIWLKHVPGIVFRTENEP 141
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV MMK+ +LY SQGGPIILSQIENEYG VE G Y +WAA++A
Sbjct: 142 FKAAMQKFTEKIVGMMKSEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQMA 201
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ L TGVPWVMCKQ+DAPDPVI+ CNG C E F PN +KP IWTE W+ +Y +G
Sbjct: 202 LGLDTGVPWVMCKQEDAPDPVIDTCNGFYC-ENFK-PNRENKPKIWTEVWSGWYTAFGGA 259
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R AED+A+ VA F+ + GS NYYMYHGGTNFGR++ ++ Y AP+DEYGL
Sbjct: 260 VPYRPAEDLAFSVARFV-QNGGSLFNYYMYHGGTNFGRSSGLFIANSYDFDAPIDEYGLK 318
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNN 388
R+PKW HL++LH A+KLC ++S K EA +F+ SS CAAFL N D +
Sbjct: 319 REPKWEHLRDLHKAIKLCEPALVSADPNVTWLGKNLEARVFKSSSGACAAFLANYDISTS 378
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-----------WEEYKEAIPT- 436
+ V F N Y+LPP SISIL DCK+ FNTA++ + W YKE + +
Sbjct: 379 SKVSFWNTQYDLPPWSISILSDCKSAIFNTARIGAQSAPMKMMLVSSFWWLSYKEEVASG 438
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHAF 490
Y + + L+EQ+N T D++DYLWY + DP+++ +L +SS GHVLH F
Sbjct: 439 YATDTTTKDGLVEQVNFTWDSTDYLWYMTDIQIDPNEAFIKSGQWPLLNISSAGHVLHVF 498
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
+NG+ G+ +G + K V+L G N +S+LSV VGLP+ G + E AG L V
Sbjct: 499 VNGQLSGTVYGSLENPKVAFSKYVNLKAGVNKLSMLSVTVGLPNVGLHFESWNAGVLGPV 558
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR-YGSSTHQPLTWYKTVF 607
+++G E ++D S + W ++VGL GE + + T GS V W++ G QPLTWYKT F
Sbjct: 559 TLKGLNEGIRDMSGYKWSHKVGLKGENMNLHTIGGSNSVQWAKGSGLVQKQPLTWYKTNF 618
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQG 647
+ P G++P+A+++ SMGKG+ W+NG+SIGRYW ++ L+ G
Sbjct: 619 NTPAGNEPLALDMSSMGKGQIWINGRSIGRYWPAYAASGSCGKCSYAGIFTEKKCLSNCG 678
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
PSQ WYH+PR +L+ GN LV+ EE G P GIS+
Sbjct: 679 QPSQKWYHVPREWLESKGNFLVVFEELGGNPGGISL 714
>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
Length = 730
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/697 (50%), Positives = 442/697 (63%), Gaps = 46/697 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++ING R+IL SGSIHYPRSTPQMWP LI KAK+GGLDV++T VFWN HEP
Sbjct: 34 SVTYDHKAIMINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEPS 93
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLV FIK VQ GL+V LRIGPFI EW +GG P WL VPGI FR+DNE
Sbjct: 94 PGKYYFEDRFDLVGFIKLVQQAGLFVHLRIGPFICAEWNFGGFPVWLKYVPGIAFRTDNE 153
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IVN+MKA +L+ SQGGPIILSQIENEYG VE G Y +WAA++
Sbjct: 154 PFKEAMQKFTEKIVNIMKAEKLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAQM 213
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQ+DAPDP+I+ CNG C E F PN KP +WTENWT +Y +G
Sbjct: 214 AVGLDTGVPWVMCKQEDAPDPIIDTCNGFYC-ENFT-PNKNYKPKLWTENWTGWYTAFGG 271
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R AEDIA+ VA FI + +GS NYYMYHGGTNFGRT++ +V T Y AP+DEYG
Sbjct: 272 ATPYRPAEDIAFSVARFI-QNRGSLFNYYMYHGGTNFGRTSNGLFVATSYDYDAPIDEYG 330
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
LL +PKWGHL+ELH A+K C ++S K E +++ S CAAFL N +
Sbjct: 331 LLNEPKWGHLRELHRAIKQCESALVSVDPTVSWPGKNLEVHLYKTESACAAFLANYNTDY 390
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIP 435
+ V F N Y+LPP SISILPDCKT FNTAK++S W+ Y E
Sbjct: 391 STQVKFGNGQYDLPPWSISILPDCKTEVFNTAKVNSPRLHRKMTPVNSAFAWQSYNEEPA 450
Query: 436 TYDETSLRANFLL-EQMNTTKDASDYLWYNFRFKHDPSDSES----VLKVSSLGHVLHAF 490
+ E + L EQ+ T+D+SDYLWY P+D + VL S GHVL+ F
Sbjct: 451 SSSENDPVTGYALWEQVGVTRDSSDYLWYLTDVNIGPNDIKDGKWPVLTAMSAGHVLNVF 510
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
ING++ G+A+G D T + V+L G N +SLLSV VGL + G + E G L V
Sbjct: 511 INGQYAGTAYGSLDDPRLTFSQSVNLRVGNNKISLLSVSVGLANVGTHFETWNTGVLGPV 570
Query: 550 SIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKTV 606
++ G + D S W Y++GL GE L + T+ GS V W + GS + QPL WYKT
Sbjct: 571 TLTGLSSGTWDLSKQKWSYKIGLKGESLSLHTEAGSNSVEWVQ-GSLVAKKQPLAWYKTT 629
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW--------------------VSFLTPQ 646
F AP G+DP+A++L SMGKGE WVNGQSIGR+W L
Sbjct: 630 FSAPAGNDPLALDLGSMGKGEVWVNGQSIGRHWPGNKARGNCGNCNYAGTYTDTKCLANC 689
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PRS+L+ GN LV+LEE G P GI++
Sbjct: 690 GQPSQRWYHVPRSWLRSGGNYLVVLEEWGGDPNGIAL 726
>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
Length = 721
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/698 (49%), Positives = 442/698 (63%), Gaps = 48/698 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD +++IING R+IL SGSIHYPRSTPQMWP LI AKEGGLDV+QT VFWN HEP P
Sbjct: 23 VSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGHEPSP 82
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+FIK V GLYV LRI P+I GEW +GG P WL VPGI FR+DN P
Sbjct: 83 GNYYFEDRYDLVKFIKLVHQAGLYVHLRISPYICGEWNFGGFPVWLKYVPGIQFRTDNGP 142
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IVNMMKA +L+ QGGPII+SQIENEYG +E G Y +WAA++A
Sbjct: 143 FKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWAAQMA 202
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPW+MCKQ+DAPDP+I+ CNG C E F PN+ KP ++TE WT +Y +G
Sbjct: 203 VGLGTGVPWIMCKQEDAPDPIIDTCNGFYC-ENFM-PNANYKPKMFTEAWTGWYTEFGGP 260
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R AED+AY VA FI + +GS++NYYMYHGGTNFGRTA ++ T Y APLDEYGL
Sbjct: 261 VPYRPAEDMAYSVARFI-QNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 319
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PKWGHL++LH +KLC ++S + QEA +F + CAAFL N D + +
Sbjct: 320 RREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFWTKTSCAAFLANYDLKYS 379
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAK------------LDSVEQWEEYKEAIPT 436
V F NL Y+LPP S+SILPDCKTV FNTAK ++S W+ Y E P+
Sbjct: 380 VRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVSQGSLAKMIAVNSAFSWQSYNEETPS 439
Query: 437 YD-ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHA 489
+ + + L EQ++ T+DA+DYLWY P ++ + +L V S GH LH
Sbjct: 440 ANYDAVFTKDGLWEQISVTRDATDYLWYMTDVTIGPDEAFLKNGQDPILTVMSAGHALHV 499
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+ +G+ + V L G N VSLLS+ VGLP+ G + E AG L
Sbjct: 500 FVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLLSIAVGLPNVGLHFETWNAGVLGP 559
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G D S + W Y++GL GE L + T GS V W GS + QPL WYKT
Sbjct: 560 VTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSGSSSVEWVE-GSLLAQRQPLIWYKT 618
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TP 645
F+AP G+DP+A+++ SMGKG+ W+NGQSIGR+W + +
Sbjct: 619 TFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGYKARGSCGACNYAGIYDEKKCHSN 678
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G SQ WYH+PRS+L PT NLLV+ EE G P IS+
Sbjct: 679 CGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKISL 716
>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
Length = 807
Score = 657 bits (1696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/805 (42%), Positives = 473/805 (58%), Gaps = 63/805 (7%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+I+G R + FSG+IHYPRS P+MW +L+ AK GGL+ ++T VFWN HE
Sbjct: 33 GTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG++ F GR DL+RF+ ++ +Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK IENEYG ++ +G Y+ WAA
Sbjct: 153 NEPFK-------------------------------IENEYGNIKKDRKVEGDKYLEWAA 181
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ GVPWVMCKQ AP VI CNGR CG+T+ + +KP +WTENWT+ ++ +
Sbjct: 182 EMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDK-NKPRLWTENWTAQFRTF 240
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ RSAEDIAY V F AK G+ VNYYMYHGGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 241 GDQLAQRSAEDIAYAVLRFFAK-GGTLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEY 299
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ +K K L G EA ++ + C +FL N +
Sbjct: 300 GMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNN 359
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQ---WEE 429
+ TV F + +P S+SIL DCKTV +NT ++ D + WE
Sbjct: 360 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNVWEM 419
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y EAIP + +T +R LEQ N TKD SDYLWY +FR + D D V+++ S
Sbjct: 420 YSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIKST 479
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
H + F N FVG+ G +KSF EK + L G N++++LS +G+ DSG L
Sbjct: 480 AHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVEVK 539
Query: 544 AGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTW 602
G+++ +QG D G++ L GE +I+T+ G W + P+TW
Sbjct: 540 GGIQDCVVQGLNTGTLDLQGNGRGHKARLEGEDKEIYTEKGMAQFQWKP--AENDLPITW 597
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK FD P G DP+ +++ SM KG +VNG+ IGRYW SF+T G PSQS YHIPR+FLK
Sbjct: 598 YKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLAGHPSQSVYHIPRAFLK 657
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
P GNLL++ EEE G P GI I TV +C +S+ + + +W S + +
Sbjct: 658 PKGNLLIIFEEELGKPGGILIQTVRRDDICVFISEHNPAQIKTWESDGGQIKLIAEDTST 717
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
R + CP R I +++FAS+GNP G C N+ G+CH+ +++A+VEK CLGK SC +P
Sbjct: 718 RG---TLNCPPQRTIQEVVFASFGNPEGACGNFTAGTCHTPDAKAVVEKECLGKESCVLP 774
Query: 783 VWTEKFYGD-PCPGIPKALLVDAQC 806
V + D CP L V +C
Sbjct: 775 VVNTVYGADINCPATTATLAVQVRC 799
>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
Length = 729
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/695 (50%), Positives = 444/695 (63%), Gaps = 47/695 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD +SL+ING R+IL SGSIHYPRSTP+MW LI KAK GGLDV+ T VFW++HEP
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG +DF GR DLVRFIK VQ GLY LRIGP++ EW +GG+P WL VPG+ FR+DNE
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK+ +L+ SQGGPIILSQIENEYG S G YV WAA +
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGRAYVNWAASM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN+CNG C + PN P KP++WTE W+ ++ +G
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDF--SPNKPYKPSMWTETWSGWFTEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+++ VA FI K GSYVNYYMYHGGTNFGR+A +T YD AP+DEYG
Sbjct: 265 PIHQRPVEDLSFAVARFIQK-GGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKR 386
L+RQPK+ HLKELH A+K C ++S ++ L +A +F G+ CAAFL N + +
Sbjct: 324 LIRQPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQ 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------QWEEYKEAIPTYDET 440
+ ATV F+N Y+LPP SISILPDCK FNTAK+ + WE Y E + + E+
Sbjct: 384 SAATVTFNNRHYDLPPWSISILPDCKIDVFNTAKVKMLPVKPKLFSWESYDEDLSSLAES 443
Query: 441 S-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHVLHAFI 491
S + A LLEQ+N T+D SDYLWY D S SES L+ V S GH +H F+
Sbjct: 444 SRITAPGLLEQLNVTRDTSDYLWYITSV--DISSSESFLRGGQKPSINVQSAGHAVHVFV 501
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VS 550
NG+F GSA G +S T V L G N ++LLSV VGL + G + E AG+ V
Sbjct: 502 NGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHYETWEAGITGPVL 561
Query: 551 IQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH--QPLTWYKTVF 607
+ G + KD + W Y+VGL GE + + + G V W + +T L WYK F
Sbjct: 562 LHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQSRSQLKWYKAYF 621
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------GT 648
DAP G +P+A++L SMGKG+ W+NGQSIGRYW+++ G
Sbjct: 622 DAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAYAKGDCNSCTYSGTFRPVKCQLGCGQ 681
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
P+Q WYH+PRS+LKPT NL+V+ EE G P IS+
Sbjct: 682 PTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISL 716
>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
Length = 823
Score = 657 bits (1695), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/832 (41%), Positives = 484/832 (58%), Gaps = 88/832 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGR++II+G ++L SGSIHYPRST QMWP L+ K++EGGLD ++T VFW+ HEP
Sbjct: 25 VTYDGRAIIIDGKHRLLVSGSIHYPRSTAQMWPDLVKKSREGGLDAIETYVFWDSHEPAR 84
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
++DFSG DL+RF+K +Q +GLY LRIGP++ EW YGG P WLH++PG+ R+ N+
Sbjct: 85 REYDFSGNLDLIRFLKTIQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGVQMRTANDV 144
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F M+ + T+IVNM+K L+ASQGGP+IL+QIENEYG V S+ ++G Y+ W A +A
Sbjct: 145 FMNEMRNFTTLIVNMVKQENLFASQGGPVILAQIENEYGNVMSSYGDEGKAYIEWCANMA 204
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L GVPW+MC+Q DAP+P+IN CNG C + PN P P +WTENWT +++ +G +
Sbjct: 205 QSLHIGVPWLMCQQSDAPEPMINTCNGWYCDQ--FTPNRPTSPKMWTENWTGWFKSWGGK 262
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+AED+A+ VA F ++ G++ NYYMYHGGTNFGRTA Y+ T Y APLDEYG
Sbjct: 263 DPHRTAEDLAFSVARFY-QLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 321
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
L QPKWGHLKELH + + G + S++F I+ + FL N D RN+
Sbjct: 322 LNQPKWGHLKELHDVLHSMEDTLTRGNISSVDFGNSVSGTIYSTEKGSSCFLTNTDSRND 381
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAI-------------P 435
T+ F L YE+P S+SILPDC+ V +NTAK+ + K+ + P
Sbjct: 382 TTINFQGLDYEVPAWSVSILPDCQDVVYNTAKVSAQTSVMVKKKNVAEDEPAALTWSWRP 441
Query: 436 TYDETSL-------RANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSSLG 484
++ S+ N +L+Q + D SDYL+Y D L+++ G
Sbjct: 442 ETNDKSILFGKGEVSVNQILDQKDAANDLSDYLFYMTSVSLKEDDPIWGDNMTLRITGSG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
VLH F+NGEF+GS K+ + E+ + L G N ++LLS VG + GA + A
Sbjct: 502 QVLHVFVNGEFIGSQWAKYGVFDYVFEQQIKLNKGKNTITLLSATVGFANYGANFDLTQA 561
Query: 545 GLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQP 599
G+R V + G + +KD SS W Y+VGL G + +++ S+ W + T++
Sbjct: 562 GVRGPVELVGYHDDEIIIKDLSSHKWSYKVGLEGLRQNLYSSDSSK---WQQDNYPTNKM 618
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL---------------- 643
TWYK F AP G+DPV ++L+ +GKG AWVNG SIGRYW SF+
Sbjct: 619 FTWYKATFKAPLGTDPVVVDLLGLGKGLAWVNGNSIGRYWPSFIAEDGCSLDPCDYRGSY 678
Query: 644 ------TPQGTPSQSWYHIPRSFLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
T G P+Q WYH+PRSFL G N LVL EE G P ++ T ++ + C +
Sbjct: 679 DNNKCVTNCGKPTQRWYHVPRSFLNNEGDNTLVLFEEFGGDPSSVNFQTTAIGSACVNAE 738
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYA 756
+ + K+++ C GR IS I FAS+GNP G C +++
Sbjct: 739 E--------------------------KKKIELSC-QGRPISAIKFASFGNPLGTCGSFS 771
Query: 757 IGSCHSSN-SRAIVEKACLGKRSCTVPVWTEKFYGDPC-PGIPKALLVDAQC 806
G+C +SN + +IV+KAC+G+ SCT+ V + F C + K L V+A C
Sbjct: 772 KGTCEASNDALSIVQKACVGQESCTIDVSEDTFGSTTCGDDVIKTLSVEAIC 823
>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/698 (48%), Positives = 440/698 (63%), Gaps = 48/698 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++IING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLV+FIK VQ GLYV LRIGP++ EW +GG P WL VP +VFR+DNEP
Sbjct: 89 GQYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPDMVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV MMK +L+ +QGGPIILSQIENEYG +E G Y +W AK+A
Sbjct: 149 FKAAMQKFTEKIVGMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAKMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L TGVPW+MCKQDDAP+ +IN CNG C E F PNS KP +WTENWT ++ +G
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYC-ENFK-PNSDKKPKMWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R AEDIA VA FI + GS++NYYMYHGGTNF RTA ++ T Y APLDEYGL
Sbjct: 267 VPYRPAEDIALSVARFI-QNGGSFINYYMYHGGTNFDRTAGEFIATSYDYDAPLDEYGLP 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLK LH +KLC ++S + QEA +F+ S CAAFL N + + A
Sbjct: 326 REPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAQVFKSQSSCAAFLSNYNTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------------DSVEQWEEYKEAIP 435
V F Y+LPP S+SILPDCKT +NTAK+ +++ W Y E IP
Sbjct: 386 RVSFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKMVPTNTLFSWGSYNEEIP 445
Query: 436 TY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESVLKVSSLGHVLHA 489
+ D + + L+EQ++ T+D +DY WY P + + +L + S GH LH
Sbjct: 446 SANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPLLNIGSAGHALHV 505
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+A+G T + + L G N ++LLS+ GLP+ G + E G L
Sbjct: 506 FVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSIAAGLPNVGVHYETWNTGVLGP 565
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G D S + W Y++G GE L I T GS V W + GS +T QPLTWYK+
Sbjct: 566 VTLKGVNSGTWDMSQWKWSYKIGTKGEALSIHTVTGSSTVEWKQ-GSLVATKQPLTWYKS 624
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTP 645
FD P G++P+A+++ +MGKG+ W+NGQ+IGR+W ++ L+
Sbjct: 625 TFDTPAGNEPLALDMNTMGKGQTWINGQNIGRHWPAYTARGKCERCSYAGTFTENKCLSN 684
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G SQ WYH+PRS+LKPT NL+V+LEE G P GIS+
Sbjct: 685 CGEASQRWYHVPRSWLKPTNNLVVVLEEWGGEPNGISL 722
>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 721
Score = 657 bits (1694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/698 (50%), Positives = 441/698 (63%), Gaps = 47/698 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++++G R+IL SGSIHYPRSTPQMWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 24 SVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R DLV+F+K VQ GLYV LRIGP+I EW +GG P WL VPGI FR+DNE
Sbjct: 84 PGQYYFEDRFDLVKFVKLVQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDNE 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV++MK RL+ SQGGPII+SQIENEYG VE G Y +WAA++
Sbjct: 144 PFKAAMQKFTAKIVSLMKENRLFQSQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQ+DAPDPVI+ CNG C E F PN KP +WTENWT +Y +G
Sbjct: 204 AVGLDTGVPWVMCKQEDAPDPVIDTCNGYYC-ENFK-PNKNTKPKMWTENWTGWYTDFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA FI + GS+VNYYMYHGGTNFGRT+ + YD APLDEYG
Sbjct: 262 AVPRRPAEDLAFSVARFI-QNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L +PK+ HL+ LH A+K C +++ + EA +F CAAF+ N D ++
Sbjct: 321 LQNEPKYEHLRNLHKAIKQCEPALVATDPKVQSLGYNLEAHVFSTPGACAAFIANYDTKS 380
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAK-----------LDSVEQWEEYKEAIPT 436
A F N Y+LPP SISILPDCKTV +NTAK ++S W+ Y E +
Sbjct: 381 YAKATFGNGQYDLPPWSISILPDCKTVVYNTAKVGNSWLKKMTPVNSAFAWQSYNEEPAS 440
Query: 437 YDET-SLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKVSSLGHVLHA 489
+ S+ A L EQ+N T+D+SDYLWY N + VL S GHVLH
Sbjct: 441 SSQADSIAAYALWEQVNVTRDSSDYLWYMTDVYINANEGFLKNGQSPVLTAMSAGHVLHV 500
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
FIN + G+ G ++ T V L G N +SLLSV VGLP+ G + E AG L
Sbjct: 501 FINDQLAGTVWGGLANPKLTFSDNVKLRVGNNKLSLLSVAVGLPNVGVHFETWNAGVLGP 560
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G E +D SS W Y+VGL GE L + T+ GS V W R GS + QPLTWYKT
Sbjct: 561 VTLKGLNEGTRDLSSQKWSYKVGLKGESLSLHTESGSSSVEWIR-GSLVAKKQPLTWYKT 619
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TP 645
F AP G+DP+A++L SMGKGE WVNG+SIGR+W ++ T
Sbjct: 620 TFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIAHGSCNACNYAGFYTDTKCRTN 679
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PRS+L GN LV+ EE G P GI++
Sbjct: 680 CGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDPNGIAL 717
>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 835
Score = 656 bits (1693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/832 (42%), Positives = 487/832 (58%), Gaps = 87/832 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YDGR++ I+G RKILFSGSIHYPRST +MWP LI K+KEGGLDV++T VFWN+HEP
Sbjct: 26 DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHEPH 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+DFSG DLVRFIK +Q QGL+ LRIGP++ EW YGG P WLH++P I FR++N
Sbjct: 86 PGQYDFSGNLDLVRFIKTIQNQGLHAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTNNA 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
F+ MK++ T+IV+MM+ +L+ASQGGPIIL+QIENEYG + S+ + G YV+W A+L
Sbjct: 146 IFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCAQL 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A Q GVPW+MC+Q D PDP+IN CNG C + PNS +KP +WTE+WT ++ +G
Sbjct: 206 AQSYQIGVPWIMCQQSDTPDPLINTCNGFYCDQWH--PNSNNKPKMWTEDWTGWFMHWGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R+AED+A+ V F + G++ NYYMYHGGTNFGRT+ Y+ T Y APL+EYG
Sbjct: 264 PTPHRTAEDVAFAVGRFF-QYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L QPKWGHLK LH +K + G ++++ A IF + + FL N
Sbjct: 323 DLNQPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIFSYAGQSVCFLGNAHPSM 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE---------------QW----- 427
+A + F N Y +P S+SILPDC T +NTAK+++ QW
Sbjct: 383 DANINFQNTQYTIPAWSVSILPDCYTEVYNTAKVNAQTSIMTINNENSYALDWQWMPETH 442
Query: 428 -EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDPSDSESV-LKVSS 482
E+ K+ ++ A LL+Q D SDYLWY + DP S + ++V++
Sbjct: 443 LEQMKDG-KVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPILSHDLKIRVNT 500
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GHVLH F+NG +GS + + FT E + L G N +SL+S VGLP+ GAY +
Sbjct: 501 KGHVLHVFVNGAHIGSQYATYGKYPFTFEADIKLKLGKNEISLVSGTVGLPNYGAYFDNI 560
Query: 543 VAGLRNVSI----QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
G+ V + G++ KD S+ W Y+VG+ GE +++++ S W G H+
Sbjct: 561 HVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPSRSS-EEWFTNGLQAHK 619
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------- 643
WYKT F P G+D V ++L +GKG+AWVNG +IGRYWVS+L
Sbjct: 620 IFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSSTCDYRGT 679
Query: 644 -------TPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
T G P+Q WYH+P SFL+ N LV+ EE+ G P + I TV++ C
Sbjct: 680 YRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVTIAKACAKA 739
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
+ H ++++ C + IS+I FAS+G P G C ++
Sbjct: 740 YEGH--------------------------ELELACKENQVISEIRFASFGVPEGECGSF 773
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPK-ALLVDAQC 806
G C SS++ +IV++ CLGK+ C++ V EK G +P+ L +DA C
Sbjct: 774 KKGHCESSDTLSIVKRLCLGKQQCSIHV-NEKMLGPTGCRVPENRLAIDALC 824
>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
Length = 736
Score = 655 bits (1689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/702 (50%), Positives = 444/702 (63%), Gaps = 54/702 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD +SL+ING R+IL SGSIHYPRSTP+MW LI KAK GGLDV+ T VFW++HEP
Sbjct: 29 NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG +DF GR DLVRFIK VQ GLY LRIGP++ EW +GG+P WL VPG+ FR+DNE
Sbjct: 89 PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK+ +L+ SQGGPIILSQIENEYG S G YV WAA +
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYG--PESRGAAGRAYVNWAASM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK++DAPDPVIN+CNG C + PN P KP++WTE W+ ++ +G
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDF--SPNKPYKPSMWTETWSGWFTEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+++ VA FI K GSYVNYYMYHGGTNFGR+A +T YD AP+DEYG
Sbjct: 265 PIHQRPVEDLSFAVARFIQK-GGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKR 386
L+RQPK+ HLKELH A+K C ++S ++ L +A +F G+ CAAFL N + +
Sbjct: 324 LIRQPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQ 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEA 433
+ ATV F+N Y+LPP SISILPDCK FNTAK+ + WE Y E
Sbjct: 384 SAATVTFNNRHYDLPPWSISILPDCKIDVFNTAKVRVQPSQVKMLPVKPKLFSWESYDED 443
Query: 434 IPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLG 484
+ + E+S + A LLEQ+N T+D SDYLWY D S SES L+ V S G
Sbjct: 444 LSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSV--DISSSESFLRGGQKPSINVQSAG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H +H F+NG+F GSA G +S T V L G N ++LLSV VGL + G + E A
Sbjct: 502 HAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHYETWEA 561
Query: 545 GLRN-VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH--QPL 600
G+ V + G + KD + W Y+VGL GE + + + G V W + +T L
Sbjct: 562 GITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQSRSQL 621
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
WYK FDAP G +P+A++L SMGKG+ W+NGQSIGRYW+++
Sbjct: 622 KWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAYAKGDCNSCTYSGTFRPVK 681
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G P+Q WYH+PRS+LKPT NL+V+ EE G P IS+
Sbjct: 682 CQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISL 723
>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
Precursor
gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 728
Score = 654 bits (1688), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/698 (47%), Positives = 440/698 (63%), Gaps = 48/698 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++IING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLV+FIK VQ GLYV LRIGP++ EW +GG P WL VPG+VFR+DNEP
Sbjct: 89 GQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV MMK +L+ +QGGPIILSQIENEYG +E G Y +W A++A
Sbjct: 149 FKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAEMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L TGVPW+MCKQDDAP+ +IN CNG C E F PNS +KP +WTENWT ++ +G
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYC-ENFK-PNSDNKPKMWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R AEDIA VA FI + GS++NYYMYHGGTNF RTA ++ T Y APLDEYGL
Sbjct: 267 VPYRPAEDIALSVARFI-QNGGSFINYYMYHGGTNFDRTAGEFIATSYDYDAPLDEYGLP 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLK LH +KLC ++S + QEA +F+ S CAAFL N + + A
Sbjct: 326 REPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKSKSSCAAFLSNYNTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------------DSVEQWEEYKEAIP 435
V F Y+LPP S+SILPDCKT +NTAK+ ++ W Y E IP
Sbjct: 386 RVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKMVPTNTPFSWGSYNEEIP 445
Query: 436 TY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESVLKVSSLGHVLHA 489
+ D + + L+EQ++ T+D +DY WY P + + +L + S GH LH
Sbjct: 446 SANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPLLTIGSAGHALHV 505
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+A+G T + + L G N ++LLS GLP+ G + E G L
Sbjct: 506 FVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGVHYETWNTGVLGP 565
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V++ G D + + W Y++G GE L + T GS V W + GS + QPLTWYK+
Sbjct: 566 VTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLAGSSTVEW-KEGSLVAKKQPLTWYKS 624
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTP 645
FD+PTG++P+A+++ +MGKG+ W+NGQ+IGR+W ++ L+
Sbjct: 625 TFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWPAYTARGKCERCSYAGTFTEKKCLSN 684
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G SQ WYH+PRS+LKPT NL+++LEE G P GIS+
Sbjct: 685 CGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPNGISL 722
>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
Length = 745
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/700 (49%), Positives = 442/700 (63%), Gaps = 51/700 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++IING R+IL SGSIHYPRSTP+MW LI KAK+GGLDV+ T VFWN+HEP P
Sbjct: 29 VTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWNVHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F GR DLV+FIK VQ +GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 89 GNYNFEGRYDLVQFIKTVQKKGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV MMK +L+ SQGGPIILSQIENEYG + G Y WAAK+A
Sbjct: 149 FKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGASGHAYSNWAAKMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCK+DDAPDPVINACNG C + PN P KP +WTE+W+ ++ +G
Sbjct: 209 VGLGTGVPWVMCKEDDAPDPVINACNGFYCDDF--SPNKPYKPKLWTESWSGWFSEFGGS 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
R ED+A+ VA FI K GS+ NYYMYHGGTNFGR+A +T YD AP+DEYGL
Sbjct: 267 NPQRPVEDLAFAVARFIQK-GGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGL 325
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
LR+PK+GHLK+LH A+K C ++S + ++A +F + CAAFL N +
Sbjct: 326 LREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTTCAAFLANYHSNSA 385
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEAIP 435
A V F+N Y+LPP SISILPDC+T FNTA++ + WE Y E +
Sbjct: 386 ARVTFNNRHYDLPPWSISILPDCRTDVFNTARMRFQPSQIQMLPSNSKLLSWETYDEDVS 445
Query: 436 TYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHV 486
+ E+S + A+ LLEQ++ T+D SDYLWY D S SES L+ V S G
Sbjct: 446 SLAESSRITASRLLEQIDATRDTSDYLWYITSV--DISSSESFLRGRNKPSISVHSSGDA 503
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+H FING+F GSA G D+SFT + L GTN ++LLSV VGLP+ G + E +G+
Sbjct: 504 VHVFINGKFSGSAFGTREDRSFTFNGPIDLRAGTNKIALLSVAVGLPNGGIHFESWKSGI 563
Query: 547 RNVSIQGAKE--LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW-SRYGSSTHQP-LTW 602
+ + KD + W YQVGL GE + + + G V W S +S +QP L W
Sbjct: 564 TGPVLLHDLDHGQKDLTGQKWSYQVGLKGEAMNLVSPNGVSSVDWVSESLASQNQPQLKW 623
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
+K F+AP G +P+A+++ SMGKG+ W+NGQSIGRYW+ +
Sbjct: 624 HKAHFNAPNGVEPLALDMSSMGKGQVWINGQSIGRYWMVYAKGNCNSCNYAGTYRQAKCQ 683
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G P+Q WYH+PRS+LKP NL+V+ EE G P IS+
Sbjct: 684 VGCGQPTQRWYHVPRSWLKPKNNLMVVFEELGGNPWKISL 723
>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
Length = 706
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/650 (48%), Positives = 435/650 (66%), Gaps = 28/650 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+ +GHR+I SGSIHYPRS P MWP LIAKAKEGGL+ ++T VFWN+HE
Sbjct: 40 GTVVSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHE 99
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ G+F+F G+ D+VRF + +Q +Y +R+GPFI+ EW +GGLP+WL ++P IVFR++
Sbjct: 100 PEKGEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTN 159
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEP+K HM+ + +I+ +K A L+ASQGGPIIL+QIENEY +E +F ++G Y+ WAA
Sbjct: 160 NEPYKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAA 219
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
K+A+ G+PW+MCKQ AP VI CNGR CG+T+ GP + P +WTENWT+ Y+V+
Sbjct: 220 KMAISTNIGIPWIMCKQTKAPSDVIPTCNGRNCGDTWPGPTNKSMPLLWTENWTAQYRVF 279
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD RSAEDIA+ VA F + + G+ NYYMYHGGTNFGRT++A+V+ YYD+APLDE+
Sbjct: 280 GDPPSQRSAEDIAFAVARFFS-VGGTLANYYMYHGGTNFGRTSAAFVMPKYYDEAPLDEF 338
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
GL ++PKWGHL++LH A+KLC K +L G + K EA +F+ + C AFL N +
Sbjct: 339 GLYKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHN 398
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEE 429
+++AT+ F Y +P SIS+L DC+TV F T +++ WE
Sbjct: 399 TKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQTAQNNVWEM 458
Query: 430 YK-EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSS 482
+ E +P Y + +R + N TKD +DY+WY FK + SD ++VL+V+S
Sbjct: 459 FDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKTVLEVNS 518
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH AF+N +FVG HG +K+FTLEK + L G N+V++L+ +G+ DSGAY+E R
Sbjct: 519 HGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSGAYMEHR 578
Query: 543 VAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+AG+ V I G D ++ WG+ VGL+GE+ QI+TD G V W + +PLT
Sbjct: 579 LAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWK--PAMNDRPLT 636
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQ 651
WYK FD P+G DPV +++ +MGKG +VNGQ IGRYW+S+ G PSQ
Sbjct: 637 WYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISYKHALGRPSQ 686
>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 831
Score = 651 bits (1680), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/837 (42%), Positives = 502/837 (59%), Gaps = 93/837 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV++DGR++ I+G R++L SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN HEP
Sbjct: 29 NVSHDGRAIKIDGKRRVLISGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNAHEPS 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
+DFSG D++RF+K +Q GLY LRIGP++ EW YGG+P W+H++P + R+ N
Sbjct: 89 RRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRTANS 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
F M+ + T+IV+M+K +L+ASQGGPIIL+QIENEYG V + + G Y+ W A +
Sbjct: 149 VFMNEMQNFTTLIVDMLKKEKLFASQGGPIILTQIENEYGNVISQYGDAGKAYMNWCANM 208
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L+ GVPW+MC++ DAP P+IN CNG C + F PNS + P +WTENW +++ +G
Sbjct: 209 AESLKVGVPWIMCQESDAPQPMINTCNGWYC-DNFE-PNSFNSPKMWTENWIGWFKNWGG 266
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R+AED+A+ VA F + G++ NYYMYHGGTNFGRTA Y+ T Y APLDEYG
Sbjct: 267 RDPHRTAEDVAFAVARFF-QTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYG 325
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
+ QPKWGHLKELHSA+K + + SG + + + I+ + + FL N +
Sbjct: 326 NIAQPKWGHLKELHSALKAMEEALTSGNVSETDLGNSVKVTIYATNGSSSCFLSNTNTTA 385
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------------SVEQWEEY 430
+AT+ F Y +P S+SILPDC+ +NTAK+ ++ +W
Sbjct: 386 DATLTFRGNNYTVPAWSVSILPDCQHEEYNTAKVKEQTSVMTKENSKAEKEAAILKWVWR 445
Query: 431 KEAIPT--YDETSLRANFLLEQMNTTKDASDYLWY--NFRFKH-DPSDSESV-LKVSSLG 484
E I + ++++ A+ LL+Q + DASDYLWY KH DP SE++ L+++ G
Sbjct: 446 SENIDKALHGKSNVSAHRLLDQKDAANDASDYLWYMTKLHVKHDDPVWSENMTLRINGSG 505
Query: 485 HVLHAFINGEFVGS---AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
HV+HAF+NGE++ S +G H+DK E + L +GTN +SLLSV VGL + GA+ +
Sbjct: 506 HVIHAFVNGEYIDSHWATYGIHNDK---FEPKIKLKHGTNTISLLSVTVGLQNYGAFFDT 562
Query: 542 RVAGLRN----VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS- 595
AGL VS++G + +K+ SS W Y++GL G ++F+D S S++ S
Sbjct: 563 WHAGLVGPIELVSVKGEETIIKNLSSHKWSYKIGLHGWDHKLFSD-DSPFAAQSKWESEK 621
Query: 596 --THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------- 642
T++ LTWYKT F AP G+DPV ++L MGKG AWVNG++IGR W S+
Sbjct: 622 LPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGYAWVNGKNIGRIWPSYNAEEDGCSDEP 681
Query: 643 ------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
+T G P+Q WYH+PRS+LK N LVL E G P ++ TV V
Sbjct: 682 CDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDGANTLVLFAELGGNPSLVNFQTVVVGN 741
Query: 691 LCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNG 750
+C + ++ +TL ++ C GRKIS I FAS+G+P G
Sbjct: 742 VCANAYEN-------------KTL-------------ELSC-QGRKISAIKFASFGDPKG 774
Query: 751 NCENYAIGSCHS-SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C + GSC S SN+ IV+KAC+GK +C++ + + F C + K L V+A C
Sbjct: 775 VCGAFTNGSCESKSNALPIVQKACVGKEACSIDLSEKTFGATACGNLAKRLAVEAVC 831
>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 729
Score = 651 bits (1679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/699 (47%), Positives = 440/699 (62%), Gaps = 49/699 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++IING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLV+FIK VQ GLYV LRIGP++ EW +GG P WL VPG+VFR+DNEP
Sbjct: 89 GQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV MMK +L+ +QGGPIILSQIENEYG +E G Y +W A++A
Sbjct: 149 FKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAEMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L TGVPW+MCKQDDAP+ +IN CNG C E F PNS +KP +WTENWT ++ +G
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYC-ENFK-PNSDNKPKMWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R AEDIA VA FI + GS++NYYMYHGGTNF RTA ++ T Y APLDEYGL
Sbjct: 267 VPYRPAEDIALSVARFI-QNGGSFINYYMYHGGTNFDRTAGEFIATSYDYDAPLDEYGLP 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLK LH +KLC ++S + QEA +F+ S CAAFL N + + A
Sbjct: 326 REPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKSKSSCAAFLSNYNTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------------DSVEQWEEYKEAIP 435
V F Y+LPP S+SILPDCKT +NTAK+ ++ W Y E IP
Sbjct: 386 RVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVQVRTSSIHMKMVPTNTPFSWGSYNEEIP 445
Query: 436 TY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESVLKVSSLGHVLHA 489
+ D + + L+EQ++ T+D +DY WY P + + +L + S GH LH
Sbjct: 446 SANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPLLTIGSAGHALHV 505
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+A+G T + + L G N ++LLS GLP+ G + E G L
Sbjct: 506 FVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGVHYETWNTGVLGP 565
Query: 549 VSIQGAKE-LKDFSSFSWGY-QVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
V++ G D + + W Y Q+G GE L + T GS V W + GS + QPLTWYK
Sbjct: 566 VTLNGVNSGTWDMTKWKWSYKQIGTKGEALSVHTLAGSSTVEW-KEGSLVAKKQPLTWYK 624
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LT 644
+ FD+PTG++P+A+++ +MGKG+ W+NGQ+IGR+W ++ L+
Sbjct: 625 STFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWPAYTARGKCERCSYAGTFTEKKCLS 684
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G SQ WYH+PRS+LKPT NL+++LEE G P GIS+
Sbjct: 685 NCGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPNGISL 723
>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
Length = 721
Score = 650 bits (1678), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/698 (50%), Positives = 439/698 (62%), Gaps = 47/698 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++++G R+IL SGSIHYPRSTPQMWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 24 SVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R DLV+F+K Q GLYV LRIGP+I EW GG P WL VPGI FR+DNE
Sbjct: 84 PGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRTDNE 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV++MK RL+ SQGGPIILSQIENEYG VE G Y +WAA++
Sbjct: 144 PFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQ+DAPDPVI+ CNG C E F PN KP +WTENWT +Y +G
Sbjct: 204 AVGLDTGVPWVMCKQEDAPDPVIDTCNGFYC-ENFK-PNKNTKPKMWTENWTGWYTDFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA FI + GS+VNYYMYHGGTNFGRT+ + YD APLDEYG
Sbjct: 262 AVPRRPAEDLAFSVARFI-QNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L +PK+ HL+ LH A+K +++ + EA +F CAAF+ N D ++
Sbjct: 321 LENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVFSAPGACAAFIANYDTKS 380
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAK-----------LDSVEQWEEYKEAIPT 436
A F N Y+LPP SISILPDCKTV +NTAK ++S W+ Y E +
Sbjct: 381 YAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKVGYGWLKKMTPVNSAFAWQSYNEEPAS 440
Query: 437 YDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHA 489
+ S+ A L EQ+N T+D+SDYLWY + ++ +L V S GHVLH
Sbjct: 441 SSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSPLLTVMSAGHVLHV 500
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
FING+ G+ G + T V L G N +SLLSV VGLP+ G + E AG L
Sbjct: 501 FINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVGVHFETWNAGVLGP 560
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G E +D S W Y+VGL GE L + T+ GS V W + GS + QPLTWYKT
Sbjct: 561 VTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQ-GSLVAKKQPLTWYKT 619
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL--------------------TP 645
F AP G+DP+A++L SMGKGE WVNG+SIGR+W ++ T
Sbjct: 620 TFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIAHGSCNACNYAGYYTDTKCRTN 679
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PRS+L GN LV+ EE G P GI++
Sbjct: 680 CGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDPNGIAL 717
>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
Length = 732
Score = 650 bits (1677), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/694 (49%), Positives = 435/694 (62%), Gaps = 49/694 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD ++LIING ++ILFSGSIHYPRSTPQMW LI KAK+GGLDV+ T VFWNLHEP
Sbjct: 27 NVTYDKKALIINGQKRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDVIDTYVFWNLHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F GR DLV+FIK V GLYV LRIGP+I GEW +GG P WL +PG++FR+DNE
Sbjct: 87 PGNYNFEGRNDLVQFIKLVHKAGLYVHLRIGPYICGEWNFGGFPVWLKYIPGMIFRTDNE 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV MMK +LY SQGGPIILSQIENEY + +F G Y+ WAA +
Sbjct: 147 PFKLQMQKFTQKIVQMMKDEQLYESQGGPIILSQIENEYEPEDKAFGAAGHAYMTWAAHM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK+ DAPDPV+N CNG C + PN KP +WTE WT ++ +G
Sbjct: 207 AVSLNTGVPWVMCKEFDAPDPVVNTCNGFYC--DYFSPNKAYKPTMWTEAWTGWFTDFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+A+ VA FI K GS+VNYYMYHGGTNFGRTA +T YD AP+DEYG
Sbjct: 265 PIHQRPVEDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L+RQPK+GHLK+LH A+KLC + +LS V ++A +F +S +CAAFL N + +
Sbjct: 324 LIRQPKYGHLKDLHKAIKLCERALLSSDPVVTTLGSYEQAHVFSSNSGDCAAFLANYNPK 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-------------SVEQWEEYKEA 433
A V F+N+ Y LPP S+SILPDCK V FNTA++ WE E
Sbjct: 384 ATAKVTFNNMHYNLPPWSVSILPDCKNVVFNTAEVGVQPSKIQMLPTEARFLSWEALSED 443
Query: 434 IPTYDETSL-RANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHV 486
I + D+ + LLEQ+N T+DASDYLWY S++ +LKV S GH
Sbjct: 444 ISSVDDDKIGTVAGLLEQINVTRDASDYLWYTTGVHISSSETFLDGGQPPILKVISAGHG 503
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLE-KMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
+H F+NG+ GS +G ++ + ++ L G N +SLLSV VGLP++G E G
Sbjct: 504 IHVFVNGQLSGSVYGTRGNRRISFSGELKQLHAGRNRISLLSVAVGLPNNGPRFETWNTG 563
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V I G + +D + W Y+VGL GE L + + + W + + + QPLT
Sbjct: 564 VLGPVVIHGLDQGHRDLTWQKWSYKVGLKGEDLNLGSPNSIPSINWMQESAMVAERQPLT 623
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ--------------- 646
W++ FDAP G DP+A+++ SM KG+ W+NG SIGRYW +
Sbjct: 624 WHRAFFDAPRGDDPLALDMSSMVKGQVWINGNSIGRYWTVYADGNCTACSYSGTFRPSTC 683
Query: 647 ----GTPSQSWYHIPRSFLKPTGNLLVLLEEENG 676
G P+Q WYHIPRS LKPT NLLV+ EE G
Sbjct: 684 QFGCGQPTQKWYHIPRSLLKPTENLLVVFEEIGG 717
>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
lyrata]
Length = 732
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/708 (48%), Positives = 434/708 (61%), Gaps = 52/708 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD ++++INGHR+IL SGSIHYPRSTP+MW LI KAK+GGLDV+ T VFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
PG ++F GR DLVRFIK +Q GLYV LRIGP++ EW +GG P WL V GI FR+DN
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
PFK M+ + IV MMK R +ASQGGPIILSQIENE+ G YV WAAK
Sbjct: 149 GPFKAAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPELKGLGPAGHSYVNWAAK 208
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV L TGVPWVMCK+DDAPDP+IN+CNG C + PN P KP +WTE W+ ++ +G
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINSCNGFYC--DYFTPNKPYKPTMWTEAWSGWFTEFG 266
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R ED+A+ VA FI K GSY+NYYMYHGGTNFGRTA +T YD AP+DEY
Sbjct: 267 GTIPKRPVEDLAFGVARFIQK-GGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 325
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDK 385
GL+++PK+ HLK+LH A+K C ++S +EA +F G C AFL N
Sbjct: 326 GLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHM 385
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-------------VEQWEEYKE 432
A V F+N Y LP SISILPDC+ V FNTA + + + Y E
Sbjct: 386 NAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMMPSGSILYSVARYDE 445
Query: 433 AIPTY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSL 483
I TY D ++ A LLEQ+N T+D +DYLWY D SES L+ V S
Sbjct: 446 DIATYGDRGTITARGLLEQVNVTRDTTDYLWYTTSV--DIKASESFLRGGKWPTLTVDSA 503
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH +H F+NG F GSA G ++ F+ V+L G N ++LLSV VGLP+ G + E
Sbjct: 504 GHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANRIALLSVAVGLPNVGPHFETWA 563
Query: 544 AGLR-NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQP 599
G+ +V + G E KD S W YQ GL GE +++ + V W + QP
Sbjct: 564 TGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGEAMKLVSPTEDSSVDWIKGSLAKQNKQP 623
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
LTWYK FDAP G++P+A++L SMGKG+AW+NGQSIGRYW++F
Sbjct: 624 LTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFAKGNCGSCNYAGTYRQN 683
Query: 647 ------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
G P+Q WYH+PRS+LKP GNLLVL EE G +S+ SV
Sbjct: 684 KCQSGCGEPTQRWYHVPRSWLKPRGNLLVLFEELGGDISKVSVVKRSV 731
>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
Length = 835
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/858 (40%), Positives = 497/858 (57%), Gaps = 102/858 (11%)
Query: 4 CQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPR 63
C L L +L + + V+YDGR+LII+G R++L SGSIHYPRSTP+MWP
Sbjct: 25 CVLFVLLNVLASAV-----------EVSYDGRALIIDGKRRVLQSGSIHYPRSTPEMWPD 73
Query: 64 LIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIE 123
LI KAK GGLD ++T VFWN+HEP ++DFSG DL+RFI+ +QA+GLY LRIGP++
Sbjct: 74 LIRKAKAGGLDAIETYVFWNVHEPLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIGPYVC 133
Query: 124 GEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQI 183
EW YGG P WLH++PGI FR+ N+ F M+ + T+IV+M K +L+ASQGGPII++QI
Sbjct: 134 AEWTYGGFPMWLHNMPGIEFRTANKVFMNEMQNFTTLIVDMAKQEKLFASQGGPIIIAQI 193
Query: 184 ENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETF 243
ENEYG + + + G YV W A +A L GVPW+MC+Q DAP P+IN CNG C ++F
Sbjct: 194 ENEYGNIMAPYGDAGKVYVDWCAAMANSLDIGVPWIMCQQSDAPQPMINTCNGWYC-DSF 252
Query: 244 AGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGT 303
PN+P+ P +WTENWT +++ +G + R+AED++Y VA F + G++ NYYMYHGGT
Sbjct: 253 T-PNNPNSPKMWTENWTGWFKNWGGKDPHRTAEDLSYSVARFF-QTGGTFQNYYMYHGGT 310
Query: 304 NFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFS 362
NFGR A Y+ T Y APLDE+G L QPKWGHLK+LH+ +K + + G + +++
Sbjct: 311 NFGRVAGGPYITTSYDYDAPLDEFGNLNQPKWGHLKDLHTVLKSMEETLTEGNITTIDMG 370
Query: 363 KLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD 422
E ++ + F N + N+AT + Y +P S+SILPDCK +NTAK++
Sbjct: 371 NSVEVTVYATQKVSSCFFSNSNTTNDATFTYGGTEYTVPAWSVSILPDCKKEVYNTAKVN 430
Query: 423 SVE-----------------QWEEYKEAIPTYDETS------LRANFLLEQMNTTKDASD 459
+ +W E I D+T+ + AN L++Q TT D SD
Sbjct: 431 AQTSVMVKNKNEAEDQPASLKWSWRPEMI---DDTAVLGKGQVSANRLIDQ-KTTNDRSD 486
Query: 460 YLWYNFRFKHDPSD----SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVH 515
YLWY D L+V++ GH+LHA++NGE++GS + ++ E+ V
Sbjct: 487 YLWYMNSVDLSEDDLVWTDNMTLRVNATGHILHAYVNGEYLGSQWATNGIFNYVFEEKVK 546
Query: 516 LINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKE----LKDFSSFSWGYQVG 570
L G N ++LLS +G + GA+ + +G+ V I G K +KD SS W Y+VG
Sbjct: 547 LKPGKNLIALLSATIGFQNYGAFYDLVQSGISGPVEIVGRKGDETIIKDLSSHKWSYKVG 606
Query: 571 LLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWV 630
+ G ++++ W ++ LTWYKT F AP G+D V ++L +GKGEAWV
Sbjct: 607 MHGMAMKLYDP--ESPYKWEEGNVPLNRNLTWYKTTFKAPLGTDAVVVDLQGLGKGEAWV 664
Query: 631 NGQSIGRYWVSFLTPQ---------------------GTPSQSWYHIPRSFLKPTGNLLV 669
NGQS+GRYW S + G P+Q WYH+PRSFL N LV
Sbjct: 665 NGQSLGRYWPSSIAEDGCNATCDYRGPYTNTKCVRNCGNPTQRWYHVPRSFLTADENTLV 724
Query: 670 LLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQI 729
L EE G P ++ TV++ T CG+ ++++ +++
Sbjct: 725 LFEEFGGNPSLVNFQTVTIGTACGNAYENNV--------------------------LEL 758
Query: 730 RCPSGRKISKILFASYGNPNGNCENYAIGSCH-SSNSRAIVEKACLGKRSCTVPVWTEKF 788
C + R IS I FAS+G+P G+C +++ GSC + ++ I++KAC+GK SC++ V + F
Sbjct: 759 ACQN-RPISDIKFASFGDPQGSCGSFSKGSCEGNKDALDIIKKACVGKESCSLDVSEKAF 817
Query: 789 YGDPCPGIPKALLVDAQC 806
C IPK L V+A C
Sbjct: 818 GSTSCGSIPKRLAVEAVC 835
>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 832
Score = 648 bits (1671), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/836 (41%), Positives = 486/836 (58%), Gaps = 91/836 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R++ I+G RK+LFSGSIHYPRST +MWP LI KAKEGGLDV++T VFWN HEPQP
Sbjct: 22 VSYDSRAITIDGKRKVLFSGSIHYPRSTAEMWPSLINKAKEGGLDVIETYVFWNAHEPQP 81
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
Q+DFSG DLV+FIK +Q +GLY LRIGP++ EW YGG P WLH++P + FR++N
Sbjct: 82 RQYDFSGNLDLVKFIKTIQKEGLYAMLRIGPYVCAEWNYGGFPVWLHNMPNMEFRTNNTA 141
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ M+ + T+IV+ M+ L+ASQGGPIIL+QIENEYG + + E G YV+W A+LA
Sbjct: 142 YMNEMQTFTTLIVDKMRHENLFASQGGPIILAQIENEYGNIMSEYGENGKQYVQWCAQLA 201
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ GVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENWT +++ +G
Sbjct: 202 ESYKIGVPWVMCQQSDAPDPIINTCNGWYCDQ--FSPNSKSKPKMWTENWTGWFKNWGGP 259
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+A D+AY VA F + G++ NYYMYHGGTNFGRT+ Y+ T Y APLDEYG
Sbjct: 260 IPHRTARDVAYAVARFF-QYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 318
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
QPKWGHLK+LH +K + G ++ L A ++ S + A FL N + N+
Sbjct: 319 KNQPKWGHLKQLHELLKSMEDVLTQGTTNHTDYGNLLTATVYNYSGKSACFLGNANSSND 378
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDE--------- 439
AT+ F + Y +P S+SILP+C +NTAK+++ K+ +E
Sbjct: 379 ATIMFQSTQYIVPAWSVSILPNCVNEVYNTAKINAQTSIMVMKDNKSDNEEEPHSTLNWQ 438
Query: 440 -----------------TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE-SVLKVS 481
S +A LL+Q T D SDYLWY +D S ++VS
Sbjct: 439 WMHEPHVQMKDGQVLGSVSRKAAQLLDQKVVTNDTSDYLWYITSVDISENDPIWSKIRVS 498
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
+ GHVLH F+NG G +G++ SFT E + L GTN +SLLS VGLP+ GA+
Sbjct: 499 TNGHVLHVFVNGAQAGYQYGQNGKYSFTYEAKIKLKKGTNEISLLSGTVGLPNYGAHFSN 558
Query: 542 RVAGL----RNVSIQGAKEL-KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
G+ + V++Q E+ KD ++ +W Y+VGL GE ++++ ++ W+ G T
Sbjct: 559 VSVGVCGPVQLVALQNNTEVVKDITNNTWNYKVGLHGEIVKLYCPENNK--GWNTNGLPT 616
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------- 643
++ WYKT+F +P G+DPV ++L + KG+AWVNG +IGRYW +L
Sbjct: 617 NRVFVWYKTLFKSPKGTDPVVVDLKGLKKGQAWVNGNNIGRYWTRYLADDNGCTATCNYR 676
Query: 644 ---------TPQGTPSQSWYHIPRSFLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCG 693
T G P+Q WYH+PRSFL+ N LVL EE G+P + TV V +C
Sbjct: 677 GPYSSDKCITKCGRPTQRWYHVPRSFLRQDNQNTLVLFEEFGGHPNEVKFATVMVEKICA 736
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCE 753
+ + ++ +++ C + ISKI FAS+G P G C
Sbjct: 737 NSYEGNV--------------------------LELSCREEQVISKIKFASFGVPEGECG 770
Query: 754 NYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPK---ALLVDAQC 806
++ C S N+ +I+ K+CLGK+SC+V V +++ G +P+ L ++A C
Sbjct: 771 SFKKSQCESPNALSILSKSCLGKQSCSVQV-SQRMLGPTGCRMPQNQNKLAIEAVC 825
>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
Length = 825
Score = 647 bits (1670), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/831 (41%), Positives = 481/831 (57%), Gaps = 84/831 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+++DGR++ I+G R++L SGSIHYPRSTPQMWP LI K+KEGGLD ++T VFWN+HEP
Sbjct: 25 ISHDGRAITIDGKRRVLLSGSIHYPRSTPQMWPDLIKKSKEGGLDAIETYVFWNVHEPSR 84
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
Q+DF G DLVRFIK VQ +GLY LRIGP++ EW YGG P WLH++PGI R+ N
Sbjct: 85 RQYDFGGNLDLVRFIKAVQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGIELRTANSI 144
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F M+ + ++IV+MMK +L+ASQGGPII++Q+ENEYG V S+ G Y+ W A +A
Sbjct: 145 FMNEMQNFTSLIVDMMKQEQLFASQGGPIIIAQVENEYGNVMSSYGAAGKAYIDWCANMA 204
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L GVPW+MC+Q DAPDP+IN CNG C + P++P+ P +WTENWT +++ +G +
Sbjct: 205 ESLNIGVPWIMCQQSDAPDPMINTCNGWYCDQ--FTPSNPNSPKMWTENWTGWFKSWGGK 262
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+AED+A+ VA F + G++ NYYMYHGGTNFGRTA Y+ T Y APLDE+G
Sbjct: 263 DPHRTAEDVAFAVARFF-QTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEFGN 321
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
L QPKWGHLK+LH + + + SG + S+++ A I+ E + FL N ++ ++
Sbjct: 322 LNQPKWGHLKQLHDVLHSMEEILTSGTVSSVDYDNSVTATIYATDKESSCFLSNANETSD 381
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VEQWEEYKEAIPT-------- 436
AT+ F Y +P S+SILPDC V +NTAK+ + + + + E PT
Sbjct: 382 ATIEFKGTTYTIPAWSVSILPDCANVGYNTAKVKTQTSVMVKRDNKAEDEPTSLNWSWRP 441
Query: 437 --YDETSL------RANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSSLG 484
D+T L A +++Q DASDYLWY D + ++++ G
Sbjct: 442 ENVDKTVLLGQGHIHAKQIVDQKAVANDASDYLWYMTSVDLKKDDLIWSKDMSIRINGSG 501
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H+LHA++NGE++GS ++S ++ EK V L +G N ++LLS VGL + GA + A
Sbjct: 502 HILHAYVNGEYLGSQWSEYSVSNYVFEKSVKLKHGRNLITLLSATVGLANYGANYDLIQA 561
Query: 545 GLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQP 599
G+ V + G K +KD S+ W Y+VGLLG + +++ W T++
Sbjct: 562 GILGPVELVGRKGDETIIKDLSNNRWSYKVGLLGLEDKLYLSDSKHASKWQEQELPTNKM 621
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
LTWYKT F AP G+DPV ++L +GKG AW+NG SIGRYW SFL
Sbjct: 622 LTWYKTTFKAPLGTDPVVLDLQGLGKGMAWINGNSIGRYWPSFLAEDDGCSTDLCDYRGP 681
Query: 647 ----------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
G P+Q WYH+PRSFL+ N LVL EE G P ++ TV C
Sbjct: 682 YDNNKCVSNCGKPTQRWYHVPRSFLQDNENTLVLFEEFGGNPSQVNFQTVVTGVACVSGD 741
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYA 756
+ + V+I C +G+ IS + FAS+G+P G C +
Sbjct: 742 EGEV--------------------------VEISC-NGQSISAVQFASFGDPQGTCGSSV 774
Query: 757 IGSCH-SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
GSC + ++ IV+KAC+G SC++ V + F C L V+ C
Sbjct: 775 KGSCEGTEDALLIVQKACVGNESCSLEVSHKLFGSTSCDNGVNRLAVEVLC 825
>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
Length = 728
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/697 (47%), Positives = 440/697 (63%), Gaps = 46/697 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++LIING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP P
Sbjct: 29 VTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+F K V GLY+ LRIGP++ EW +GG P WL VPGIVFR+DNEP
Sbjct: 89 GNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+R+ IV+MMK +L+ +QGGPIILSQIENEYG +E G Y +W A++A
Sbjct: 149 FKIAMQRFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMEWEMGAAGKAYSKWTAEMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ L TGVPW+MCKQ+DAP P+I+ CNG C E F PNS +KP +WTENWT ++ +G
Sbjct: 209 LGLSTGVPWIMCKQEDAPYPIIDTCNGFYC-EGFK-PNSDNKPKLWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R EDIA+ VA FI + GS++NYYMY+GGTNF RTA ++ T Y APLDEYGLL
Sbjct: 267 IPNRPVEDIAFSVARFI-QNGGSFLNYYMYYGGTNFDRTAGVFIATSYDYDAPLDEYGLL 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLKELH +KLC ++S + QE +F+ + CAAFL N D + A
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEVHVFKSKTSCAAFLSNYDTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPTY 437
+ F Y+LPP S+SILPDCKT +NTAK+ + WE Y E P+
Sbjct: 386 RIMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMVPTSTKFSWESYNEGSPSS 445
Query: 438 -DETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDPS----DSESVLKVSSLGHVLHAF 490
D+ + + L+EQ++ T+D +DY WY + D S + +L + S GH LH F
Sbjct: 446 NDDGTFVKDGLVEQISMTRDKTDYFWYLTDITIGSDESFLKTGDDPLLTIFSAGHALHVF 505
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
+NG G+++G S+ T + + L G N ++LLS VGLP++G + E G L V
Sbjct: 506 VNGLLAGTSYGALSNSKLTFSQKIKLSVGINKLALLSTAVGLPNAGVHYETWNTGVLGPV 565
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST--HQPLTWYKTV 606
+++G D S + W Y++G+ GE + T GS V W GS +PLTWYK+
Sbjct: 566 TLKGVNSGTWDMSKWKWSYKIGIRGEAMSFHTIAGSSAVKWWIKGSFVVKKEPLTWYKSS 625
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQ 646
FD P G++P+A+++ +MGKG+ WVNG +IGR+W ++ L+
Sbjct: 626 FDTPKGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTARGNCGRCNYAGIYNEKKCLSHC 685
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PRS+LKP GNLLV+ EE G P GIS+
Sbjct: 686 GEPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISL 722
>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
Length = 787
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/697 (49%), Positives = 441/697 (63%), Gaps = 47/697 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD RSL+ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDVVQT VFWN HEP
Sbjct: 94 VSYDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPVK 153
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FS R DL+RF+K V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 154 GQYYFSDRYDLIRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 213
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+R+ IV+MMK+ RL+ QGGPII+SQ+ENE+G +E + PY WAAK+A
Sbjct: 214 FKAEMQRFVEKIVSMMKSERLFEWQGGPIIMSQVENEFGPMESAGGVGAKPYANWAAKMA 273
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V TGVPWVMCKQ+DAPDPVIN CNG C + PN +KPA+WTE WT ++ +G
Sbjct: 274 VATNTGVPWVMCKQEDAPDPVINTCNGFYC--DYFTPNKKNKPAMWTEAWTGWFTSFGGA 331
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA FI K GS+VNYYMYHGGTNFGRTA +V T Y AP+DE+GL
Sbjct: 332 VPHRPVEDMAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEFGL 390
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K ++SG + ++A++F+ + CAAFL N +
Sbjct: 391 LRQPKWGHLRDLHKAIKQAEPTLVSGDPTIQSLGNYEKAYVFKSKNGACAAFLSNYHMNS 450
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTA---------KLDSVEQ--WEEYKEAIPT 436
V F+ Y+LP SISILPDCKTV FNTA K+ V + W+ Y E +
Sbjct: 451 AVKVRFNGRHYDLPAWSISILPDCKTVVFNTATVKEPTLLPKMHPVVRFTWQSYSEDTNS 510
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES-----VLKVSSLGHVLHAFI 491
D+++ + L+EQ++ T D SDYLWY P + L V S GH + F+
Sbjct: 511 LDDSAFTKDGLVEQLSMTWDKSDYLWYTTFVNIGPGELSKNGQWPQLTVYSAGHSMQVFV 570
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVS 550
NG+ GS +G + T + V + G+N +S+LS VGLP+ G + ER V L V+
Sbjct: 571 NGKSYGSVYGGFENPKLTYDGHVKMWQGSNKISILSSAVGLPNVGDHFERWNVGVLGPVT 630
Query: 551 IQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
+ G E K D S W YQVGL GE L I T GS V W G + QPLTW+K +F+A
Sbjct: 631 LSGLSEGKRDLSHQKWTYQVGLKGESLGIHTVSGSSAVEWG--GPGSKQPLTWHKALFNA 688
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------------GT 648
P+GSDPVA+++ SMGKG+ WVNG +GRYW S+ P G
Sbjct: 689 PSGSDPVALDMGSMGKGQMWVNGHHVGRYW-SYKAPSRGCGGCSYAGTYREDKCRSSCGE 747
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ WYH+PRS+LKP GNLLV+LEE G G+++ T
Sbjct: 748 LSQRWYHVPRSWLKPGGNLLVVLEEYGGDVAGVTLAT 784
>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 826
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/841 (42%), Positives = 500/841 (59%), Gaps = 91/841 (10%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G V++DGR++II+G R++L SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 19 GSNAVEVSHDGRAIIIDGKRRVLLSGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWN 78
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP +DFSG D++RF+K +Q GLY LRIGP++ EW YGG+P W+H++P +
Sbjct: 79 AHEPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEI 138
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+ N + M+ + T+IV+M+K +L+ASQGGPIIL+QIENEYG V + + G Y+
Sbjct: 139 RTANSVYMNEMQNFTTLIVDMVKKEKLFASQGGPIILTQIENEYGNVISHYGDAGKAYMN 198
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
W A +A L GVPW+MC++ DAP +IN CNG C + F PN+P P +WTENW ++
Sbjct: 199 WCANMAESLNVGVPWIMCQESDAPQSMINTCNGFYC-DNFE-PNNPSSPKMWTENWVGWF 256
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAP 322
+ +G R+AED+A+ VA F + G++ NYYMYHGGTNF RTA Y+ T Y AP
Sbjct: 257 KNWGGRDPHRTAEDVAFAVARFF-QTGGTFQNYYMYHGGTNFDRTAGGPYITTSYDYDAP 315
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVN 382
LDEYG + QPKWGHLKELH+ +K + + SG + +F +A I+ + + FL +
Sbjct: 316 LDEYGNIAQPKWGHLKELHNVLKSMEETLTSGNVSETDFGNSVKATIYATNGSSSCFLSS 375
Query: 383 KDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------------SVE 425
+ +AT+ F Y +P S+SILPDC+ +NTAK++ +
Sbjct: 376 TNTTTDATLTFRGKNYTVPAWSVSILPDCEHEEYNTAKVNVQTSVMVKENSKAEEEATAL 435
Query: 426 QWEEYKEAIPT--YDETSLRANFLLEQMNTTKDASDYLWY--NFRFKH-DPSDSESV-LK 479
+W E I + ++++ AN LL+Q + DASDYLWY KH DP E++ L+
Sbjct: 436 KWVWRSENIDNALHGKSNVSANRLLDQKDAANDASDYLWYMTKLHVKHDDPVWGENMTLR 495
Query: 480 VSSLGHVLHAFINGEFVGS---AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
++S GHV+HAF+NGE +GS +G H+DK E + L +GTN +SLLSV VGL + G
Sbjct: 496 INSSGHVIHAFVNGEHIGSHWATYGIHNDK---FEPKIKLKHGTNTISLLSVTVGLQNYG 552
Query: 537 AYLERRVAGLRN----VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVP--W 589
A+ + AGL VS++G + +K+ SS W Y+VGL G ++F+D P W
Sbjct: 553 AFFDTWHAGLVEPIELVSVKGDETIIKNLSSNKWSYKVGLHGWDHKLFSDDSPFAAPNKW 612
Query: 590 SRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------- 642
T + LTWYKT F+AP G+DPV ++L MGKG AWVNGQ+IGR W S+
Sbjct: 613 ESEKLPTDRMLTWYKTTFNAPLGTDPVVVDLQGMGKGYAWVNGQNIGRIWPSYNAEEDGC 672
Query: 643 ----------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTV 686
+T G P+Q WYH+PRS+LK N LVL E G P ++ TV
Sbjct: 673 SDEPCDYRGEYTDSKCVTNCGKPTQRWYHVPRSYLKDGANNLVLFAELGGNPSQVNFQTV 732
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYG 746
V T+C + ++ +TL ++ C GRKIS I FAS+G
Sbjct: 733 VVGTVCANAYEN-------------KTL-------------ELSC-QGRKISAIKFASFG 765
Query: 747 NPNGNCENYAIGSCHS-SNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQ 805
+P G C + GSC S SN+ +IV+KAC+GK++C+ V + F C + K L V+A
Sbjct: 766 DPEGVCGAFTNGSCESKSNALSIVQKACVGKQACSFDVSEKTFGPTACGNVAKRLAVEAV 825
Query: 806 C 806
C
Sbjct: 826 C 826
>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 732
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/708 (48%), Positives = 433/708 (61%), Gaps = 52/708 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD ++++INGHR+IL SGSIHYPRSTP+MW LI KAK+GGLDV+ T VFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
PG ++F GR DLVRFIK +Q GLYV LRIGP++ EW +GG P WL V GI FR+DN
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
PFK M+ + IV MMK R +ASQGGPIILSQIENE+ G YV WAAK
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV L TGVPWVMCK+DDAPDP+IN CNG C + PN P KP +WTE W+ ++ +G
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYC--DYFTPNKPYKPTMWTEAWSGWFTEFG 266
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R ED+A+ VA FI K GSY+NYYMYHGGTNFGRTA +T YD AP+DEY
Sbjct: 267 GTVPKRPVEDLAFGVARFIQK-GGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 325
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDK 385
GL+++PK+ HLK+LH A+K C ++S +EA +F G C AFL N
Sbjct: 326 GLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHM 385
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-------------VEQWEEYKE 432
A V F+N Y LP SISILPDC+ V FNTA + + + Y E
Sbjct: 386 NAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMVPSGSILYSVARYDE 445
Query: 433 AIPTY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSL 483
I TY + ++ A LLEQ+N T+D +DYLWY D SES L+ V S
Sbjct: 446 DIATYGNPGTITARGLLEQVNVTRDTTDYLWYTTSV--DIKASESFLRGGKWPTLTVDSA 503
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH +H F+NG F GSA G ++ F+ V+L G N ++LLSV VGLP+ G + E
Sbjct: 504 GHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFETWA 563
Query: 544 AGL-RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQP 599
G+ +V++ G E KD S W YQ GL GE + + + V W + QP
Sbjct: 564 TGIVGSVALHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQNKQP 623
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
LTWYK FDAP G++P+A++L SMGKG+AW+NGQSIGRYW++F
Sbjct: 624 LTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFAKGDCGSCNYAGTYRQN 683
Query: 647 ------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
G P+Q WYH+PRS+LKP GNLLVL EE G +S+ SV
Sbjct: 684 KCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRSV 731
>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
Length = 731
Score = 646 bits (1667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/721 (47%), Positives = 450/721 (62%), Gaps = 55/721 (7%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
+L LF + + S V+YD +++IING ++IL SGSIHYPRSTP+MWP LI
Sbjct: 11 ILLLFSCIFSAASAS---------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 61
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GGLDV+QT VFWN HEP PG++ F R DLV+FIK VQ GL+V LRIGP++ E
Sbjct: 62 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK M+++ IV+MMKA +L+ SQGGPIILSQIEN
Sbjct: 122 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIEN 181
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
E+G VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFK- 239
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTE WT +Y +G R AED+A+ VA FI + GS++NYYMYHGGTNF
Sbjct: 240 PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFI-QSGGSFLNYYMYHGGTNF 298
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA + YD APLDEYGL R+PKWGHL++LH A+K C ++S
Sbjct: 299 GRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSN 358
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
QEA +F+ S+CAAFL N D + + V F Y+LPP SISILPDCKT +NTAK+ S
Sbjct: 359 QEAHVFKSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQ 418
Query: 425 EQ------------WEEY-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKH 469
W+ + +E + + + + L EQ+N T+D +DYLWY +
Sbjct: 419 SSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGS 478
Query: 470 DPS----DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSL 525
D + +L +SS GH L+ FING+ G+ +G + + + V+L +G N ++L
Sbjct: 479 DEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLAL 538
Query: 526 LSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LS+ VGLP+ G + E AG L ++++G D S + W Y+ GL GE L + T G
Sbjct: 539 LSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTG 598
Query: 584 SRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF 642
S V W S + QPLTWYK F+AP G P+A+++ SMGKG+ W+NGQS+GR+W +
Sbjct: 599 SSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGY 658
Query: 643 L--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGIS 682
+ T G PSQ WYHIPRS+L PTGNLLV+ EE G P GIS
Sbjct: 659 IARGSCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSGIS 718
Query: 683 I 683
+
Sbjct: 719 L 719
>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
Length = 731
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/697 (48%), Positives = 440/697 (63%), Gaps = 47/697 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG+++I+NG R+IL +GSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 31 VTYDGKAIIVNGQRRILIAGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+F+K VQ GLYV LRIGP+ EW +GG P WL VPG+ FR+DNEP
Sbjct: 91 GNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYACAEWNFGGFPVWLKYVPGMSFRTDNEP 150
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IVNMMK +L+ QGGPIILSQIENEYG +E G Y +WAA++A
Sbjct: 151 FKAAMQKFTEKIVNMMKQEQLFEPQGGPIILSQIENEYGPIEWELKAPGKAYAQWAAQMA 210
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPW+ CKQ+DAPDP+I+ CN C E F PN KP +WTE WT+++ +G+
Sbjct: 211 VGLNTGVPWIACKQEDAPDPLIDTCNAYYC-EKFT-PNKSYKPKMWTEAWTAWFTSWGNP 268
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R AED A+ V FI + GSY NYYMYHGGTNFGRTA +V T Y APLDEYGL
Sbjct: 269 VLYRPAEDQAFSVLKFI-QSGGSYANYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEYGL 327
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
PK+ HLK +H A+K K ++S + QEA ++ SS CAAFL N D +
Sbjct: 328 TNDPKYTHLKHMHKAIKQSEKALVSADATVTSLGTNQEAHVYSSSSGCAAFLANYDVSYS 387
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPT- 436
V F + Y+LP SISILPDCKT +NTAK+ + W+ Y + + +
Sbjct: 388 VKVNFGSGQYDLPAWSISILPDCKTEVYNTAKVLAPRVHKKMTPLGGFTWDSYIDEVASG 447
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHAF 490
+ + + L EQ+ TKD+SDYLWY K ++ + L V S GH L+ F
Sbjct: 448 FASDTTTEDGLWEQLYMTKDSSDYLWYMQDVKIGSDEAFLTNGKDPFLNVQSAGHFLNVF 507
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
+NG+ +GSA+G + + T + V L G N ++LLS VGL + G + E G L V
Sbjct: 508 VNGKLIGSAYGSNDNPKLTFSQSVKLNVGVNKIALLSASVGLANVGLHFENYNVGVLGPV 567
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKTV 606
++ G + D + + W Y+VG+ GEKLQ+ T GS V W + GS + QPLTWYK+
Sbjct: 568 TLTGLNQGTVDMTKWKWSYKVGVQGEKLQLNTVAGSSSVEWVK-GSMLAKKQPLTWYKST 626
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQ 646
F+AP G+DPVA+++ISMGKG+ W+NGQ IGRYW ++ LT
Sbjct: 627 FNAPEGNDPVALDMISMGKGQIWINGQGIGRYWPAYTAQGNCGGCSYGGYFTEKKCLTGC 686
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G P+Q WYH+PRS+LKPTGNLLV+ EE G P GIS+
Sbjct: 687 GQPTQRWYHVPRSWLKPTGNLLVVFEEWGGDPTGISM 723
>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
Precursor
gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
Length = 732
Score = 645 bits (1665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/708 (48%), Positives = 432/708 (61%), Gaps = 52/708 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD ++++INGHR+IL SGSIHYPRSTP+MW LI KAK+GGLDV+ T VFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
PG ++F GR DLVRFIK +Q GLYV LRIGP++ EW +GG P WL V GI FR+DN
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
PFK M+ + IV MMK R +ASQGGPIILSQIENE+ G YV WAAK
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV L TGVPWVMCK+DDAPDP+IN CNG C + PN P KP +WTE W+ ++ +G
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYC--DYFTPNKPYKPTMWTEAWSGWFTEFG 266
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R ED+A+ VA FI K GSY+NYYMYHGGTNFGRTA +T YD AP+DEY
Sbjct: 267 GTVPKRPVEDLAFGVARFIQK-GGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 325
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDK 385
GL+++PK+ HLK+LH A+K C ++S +EA +F G C AFL N
Sbjct: 326 GLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHM 385
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-------------VEQWEEYKE 432
A V F+N Y LP SISILPDC+ V FNTA + + + Y E
Sbjct: 386 NAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMVPSGSILYSVARYDE 445
Query: 433 AIPTY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSL 483
I TY + ++ A LLEQ+N T+D +DYLWY D SES L+ V S
Sbjct: 446 DIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSV--DIKASESFLRGGKWPTLTVDSA 503
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH +H F+NG F GSA G ++ F+ V+L G N ++LLSV VGLP+ G + E
Sbjct: 504 GHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFETWA 563
Query: 544 AGL-RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQP 599
G+ +V + G E KD S W YQ GL GE + + + V W + QP
Sbjct: 564 TGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQNKQP 623
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
LTWYK FDAP G++P+A++L SMGKG+AW+NGQSIGRYW++F
Sbjct: 624 LTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFAKGDCGSCNYAGTYRQN 683
Query: 647 ------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
G P+Q WYH+PRS+LKP GNLLVL EE G +S+ SV
Sbjct: 684 KCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRSV 731
>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
Length = 745
Score = 645 bits (1664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/707 (48%), Positives = 443/707 (62%), Gaps = 48/707 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +++IING R+IL SGSIHYPRSTP+MW LI KAK GGLDV+ T VFWN+HEP
Sbjct: 27 SVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKVGGLDVIDTYVFWNVHEPS 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
P ++F GR DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DN
Sbjct: 87 PSNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + IV MMK +L+ SQGGPIILSQIENEYG + G Y WAAK+
Sbjct: 147 PFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGAVGHAYSNWAAKM 206
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCK+DDAPDPVIN+CNG C + PN P KP +WTE+W+ ++ +G
Sbjct: 207 AVGLGTGVPWVMCKEDDAPDPVINSCNGFYCDDF--SPNKPYKPKLWTESWSGWFSEFGG 264
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R A+D+A+ VA FI K GS+ NYYMYHGGTNFGR+A +T YD AP+DEYG
Sbjct: 265 PVPQRPAQDLAFAVARFIQK-GGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF-QGSSECAAFLVNKDKR 386
LLR+PK+GHLK+LH A+K C ++S + ++A +F G+ CAAFL N
Sbjct: 324 LLREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTQTCAAFLANYHSN 383
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-------------DSVEQWEEYKEA 433
+ A V F+N Y+LPP SISILPDCKT FNTA++ + WE Y E
Sbjct: 384 SAARVTFNNRHYDLPPWSISILPDCKTDVFNTARVRFQNSKIQMLPSNSKLLSWETYDED 443
Query: 434 IPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHV 486
+ + E+S + A+ LLEQ+N T+D SDYLWY PS+S + + V S G
Sbjct: 444 VSSLAESSRITASGLLEQINATRDTSDYLWYITSVDISPSESFLRGGNKPSISVHSSGDA 503
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+H FING+F GSA G +S T ++L GTN ++LLSV VGLP+ G + E G+
Sbjct: 504 VHVFINGKFSGSAFGTREQRSCTFNGPINLHAGTNKIALLSVAVGLPNGGIHFESWKTGI 563
Query: 547 RN-VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQP-LTW 602
+ + G KD + W YQVGL GE + + + G V W R +S +QP L W
Sbjct: 564 TGPILLHGLDHGQKDLTWQKWSYQVGLKGEAMNLVSPNGVSSVDWVRESLASQNQPQLKW 623
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------------- 646
+K F+AP G++ +A+++ MGKG+ W+NGQSIGRYW+ +
Sbjct: 624 HKAYFNAPDGNEALALDMSGMGKGQVWINGQSIGRYWLVYAKGNCNSCNYAGTYRQAKCQ 683
Query: 647 ---GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
G P+Q WYH+PRS+LKPT NL+V+ EE G P IS+ ++ T
Sbjct: 684 LGCGQPTQRWYHVPRSWLKPTNNLMVVFEELGGNPWKISLVKRTIHT 730
>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 838
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/832 (41%), Positives = 481/832 (57%), Gaps = 80/832 (9%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+NV+YD ++IING R+++ SGS+HYPRST MWP LI KAK+GGLD ++T +FW+ HE
Sbjct: 34 GDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHE 93
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ ++DF+GR D ++F + VQ GLYV +RIGP++ EW YGG P WLH++PGI FR+D
Sbjct: 94 PQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQFRTD 153
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+ +K M+ + T IVNM K A L+ASQGGPIIL+QIENEYG V + G Y+ W A
Sbjct: 154 NQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCA 213
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A L G+PW+MC+Q+DAP P+IN CNG C F+ PN+P P ++TENW +++ +
Sbjct: 214 QMAESLNIGIPWIMCQQNDAPQPIINTCNGFYCDYDFS-PNNPKSPKMFTENWVGWFKKW 272
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDE 325
GD+ RS ED+A+ VA F + G + NYYMYHGGTNFGRTA +T YD APLDE
Sbjct: 273 GDKDPYRSPEDVAFAVARFF-QSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDE 331
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG--SSECAAFLVNK 383
YG L QPKWGHLK+LH+++K+ K + + S F S E FL N
Sbjct: 332 YGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQKISSFVTLTKFSNPTSGERFCFLSNT 391
Query: 384 DKRNNATVYFS---NLMYELPPLSISILPDCKTVAFNTAKLDS-------VEQWEEYKE- 432
D +N+AT+ +P S+SIL C FNTAK++S V+ +E +
Sbjct: 392 DNKNDATIDLQADGKYFVPVPAWSVSILDGCNKEVFNTAKINSQTSMFVKVQNKKENAQF 451
Query: 433 -----AIPTYD----ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS--ESVLKVS 481
P D + + +AN LLEQ TT D SDYLWY + + S L+V+
Sbjct: 452 SWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSLQNVTLQVN 511
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
+ GH+LHAF+N ++GS + + +SF EK + + GTN ++LLS VGL + A+ +
Sbjct: 512 TKGHMLHAFVNRRYIGS-QWRSNGQSFVFEKPILIKPGTNTITLLSATVGLKNYDAFYDT 570
Query: 542 RVAGLRN--VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STH 597
G+ + + G +K D SS W Y+VGL GE Q++ S+ WS S
Sbjct: 571 VPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTNWSTINQKSIG 630
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------- 646
+ +TWYKT F P+G D V +++ MGKG+AWVNGQSIGR+W SF+
Sbjct: 631 RRMTWYKTSFKTPSGIDRVTLDMQGMGKGQAWVNGQSIGRFWPSFIASNDSCSTTCDYRG 690
Query: 647 -----------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
G PSQ WYHIPRSFL N LVL EE G P +S+ T+++ T+CG+
Sbjct: 691 AYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITIGTICGNA 750
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENY 755
++ +++ C G IS+I FASYGNP G C ++
Sbjct: 751 NEGS--------------------------TLELSCQGGHIISEIQFASYGNPEGKCGSF 784
Query: 756 AIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
GS H NS +VEK C+G+ SC++ V + F + L + A C+
Sbjct: 785 KQGSWHVINSAILVEKLCIGRESCSIDVSAKSFGLGDVTNLSARLAIQALCS 836
>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 763
Score = 645 bits (1663), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/774 (44%), Positives = 461/774 (59%), Gaps = 68/774 (8%)
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+DF GR DLVRF+K GLYV LRIGP++ EW YGG P WLH +PGI R+DNEPF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+R+ +V MK A LYASQGGPIILSQIENEYG + S+ G Y+RWAA +AV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L TGVPWVMC+Q DAP+P+IN CNG C + P+ P +P +WTENW+ ++ +G
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFT--PSLPSRPKLWTENWSGWFLSFGGAV 178
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLL 329
R ED+A+ VA F + G+ NYYMYHGGTNFGR++ ++ YD AP+DEYGL+
Sbjct: 179 PYRPTEDLAFAVARFYQR-GGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLV 237
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
RQPKWGHL+++H A+K+C +++ M+ + EA +++ S CAAFL N D +++
Sbjct: 238 RQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGSLCAAFLANIDDQSDK 297
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-------------------------- 423
TV F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 298 TVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAEL 357
Query: 424 -VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDP--SDSESV 477
W E + E +L L+EQ+NTT DASD+LWY+ +P + S+S
Sbjct: 358 AASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQSN 417
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L V+SLGHVL FING+ GS+ G S +L V L+ G N + LLS VGL + GA
Sbjct: 418 LPVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGA 477
Query: 538 YLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
+ + AG+ V + G K D SS W YQ+GL GE L ++ + S T
Sbjct: 478 FFDLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEASPEWVSDNSYPT 537
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------- 646
+ PLTWYK+ F AP G DPVAI+ MGKGEAWVNGQSIGRYW + + PQ
Sbjct: 538 NNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSDCVNSCNYR 597
Query: 647 ------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGH 694
G PSQ YH+PRSFL+P N +VL E+ G P IS T ++C H
Sbjct: 598 GSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQTESVCAH 657
Query: 695 VSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGNCE 753
VS+ H + SW S Q+ ++ P +++ CP G+ IS I FAS+G P+G C
Sbjct: 658 VSEDHPDQIDSWVSSQQKLQRSG-------PALRLECPKEGQVISSIKFASFGTPSGTCG 710
Query: 754 NYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+Y+ G C SS + A+ ++AC+G SC+VPV + K +GDPC G+ K+L+V+A C+
Sbjct: 711 SYSHGECSSSQALAVAQEACVGVSSCSVPV-SAKNFGDPCRGVTKSLVVEAACS 763
>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
Length = 732
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/708 (48%), Positives = 431/708 (60%), Gaps = 52/708 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD ++++INGHR+IL SGSIHYPRSTP+MW LI KAK+GGLDV+ T VFWN HEP
Sbjct: 29 SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
PG ++F GR DLVRFIK +Q GLYV LRIGP++ EW +GG P WL V GI FR+DN
Sbjct: 89 SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
PFK M+ + IV MMK R +ASQGGPIILSQIENE+ G YV WAAK
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV L TGVPWVMCK+DDAPDP+IN CNG C + PN P KP +WTE W+ ++ +G
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYC--DYFTPNKPYKPTMWTEAWSGWFTEFG 266
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R ED+A+ VA FI K GSY+NYYMYHGGTNFGRTA +T YD AP+DEY
Sbjct: 267 GTVPKRPVEDLAFGVARFIQK-GGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEY 325
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDK 385
GL+++PK+ HLK+LH A+K C ++S +EA +F G C AFL N
Sbjct: 326 GLVQEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHM 385
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-------------VEQWEEYKE 432
A V F+N Y LP SISILPDC+ V FNTA + + + Y E
Sbjct: 386 NAPAKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSHVQMVPSGSILYSVARYDE 445
Query: 433 AIPTY-DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSL 483
I TY + ++ A LLEQ+N T+D +DYLWY D SES L+ V S
Sbjct: 446 DIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSV--DIKASESFLRGGKWPTLTVDSA 503
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH +H F+NG F GSA G ++ F+ V+L G N ++LLSV VGLP+ G + E
Sbjct: 504 GHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFETWA 563
Query: 544 AGLR-NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQP 599
G+ +V + G E KD S W YQ GL GE + + + V W + QP
Sbjct: 564 TGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQNKQP 623
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
LTWYK FD P G++P+A++L SMGKG+AW+NGQSIGRYW++F
Sbjct: 624 LTWYKAYFDVPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAFAKGDCGSCNYAGTYRQN 683
Query: 647 ------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
G P+Q WYH+PRS+LKP GNLLVL EE G +S+ SV
Sbjct: 684 KCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRSV 731
>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
Length = 827
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/835 (42%), Positives = 485/835 (58%), Gaps = 88/835 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV++DGR++II+G R++L SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN HEP
Sbjct: 24 NVSHDGRAIIIDGQRRVLLSGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNAHEPA 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV-FRSDN 147
Q+DFSG DL+RFIK +Q +GLY LRIGP++ EW YGG P WLH++PG+ FR+ N
Sbjct: 84 RRQYDFSGHLDLIRFIKTIQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGVQEFRTVN 143
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
E F M+ + T+IV+M+K +L+ASQGGPII++QIENEYG + ++ + G Y+ W AK
Sbjct: 144 EVFMNEMQNFTTLIVDMVKQEKLFASQGGPIIIAQIENEYGNMISNYGDAGKVYIDWCAK 203
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A L GVPW+MC++ DAP P+IN CNG C ++F PN P+ P +WTENWT +++ +G
Sbjct: 204 MAESLDIGVPWIMCQESDAPQPMINTCNGWYC-DSFT-PNDPNSPKMWTENWTGWFKSWG 261
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
+ R+AED+A+ VA F + G++ NYYMYHGGTNFGRT+ LT YD APLDE+
Sbjct: 262 GKDPHRTAEDLAFSVARFF-QTGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPLDEF 320
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
G L QPKWGHLKELH+ +K K + G + + +F A ++ + F N +
Sbjct: 321 GNLNQPKWGHLKELHTVLKAMEKTLTHGNVSTTDFGNSVTATVYATEEGSSCFFGNANTT 380
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------------SVEQWEE 429
+AT+ F Y +P S+SILPDCKT A+NTAK++ S +W
Sbjct: 381 GDATITFQGSDYVVPAWSVSILPDCKTEAYNTAKVNTQTSVIVKKPNQAENEPSSLKWVW 440
Query: 430 YKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSS 482
EAI + S A+FL++Q DASDYLWY P D L+V++
Sbjct: 441 RPEAIDEPVVQGKGSFSASFLIDQ-KVINDASDYLWYMTSVDLKPDDIIWSDNMTLRVNT 499
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
G VLHAF+NGE VGS K+ ++ V L G N +SLLSV VGL + G +
Sbjct: 500 TGIVLHAFVNGEHVGSQWTKYGVFKDVFQQQVKLNPGKNQISLLSVTVGLQNYGPMFDMV 559
Query: 543 VAGLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQIFTDYGS--RIVPWSRYGSS 595
AG+ V + G K +KD S W Y+VGL G + F S WS
Sbjct: 560 QAGITGPVELIGQKGDETVIKDLSCHKWTYEVGLTGLEDNKFYSKASTNETCGWSAENVP 619
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------ 643
++ +TWYKT F AP G+DPV ++L MGKG AWVNG ++GRYW S+L
Sbjct: 620 SNSKMTWYKTTFKAPLGNDPVVLDLQGMGKGFAWVNGYNLGRYWPSYLAEADGCSSDPCD 679
Query: 644 -----------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLC 692
T G PSQ WYH+PRSFL+ N LVL EE G P ++ T+ V ++C
Sbjct: 680 YRGQYDNNKCVTNCGQPSQRWYHVPRSFLQDGENTLVLFEEFGGNPWQVNFQTLVVGSVC 739
Query: 693 GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC 752
G+ + + +++ C +GR IS I FAS+G+P G C
Sbjct: 740 GNAHE--------------------------KKTLELSC-NGRPISAIKFASFGDPQGTC 772
Query: 753 ENYAIGSCHSSNS-RAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
++ G+C + ++++ C+GK +C++ + +K C + K L V+A C
Sbjct: 773 GSFQAGTCQTEQDILPVLQQECVGKETCSIDISEDKLGKTNCGSVVKKLAVEAVC 827
>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
Length = 724
Score = 643 bits (1659), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/721 (47%), Positives = 450/721 (62%), Gaps = 55/721 (7%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
+L LF + + S V+YD +++IING ++IL SGSIHYPRSTP+MWP LI
Sbjct: 4 ILLLFSCIFSAASAS---------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 54
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GGLDV+QT VFWN HEP PG++ F R DLV+FIK VQ GL+V LRIGP++ E
Sbjct: 55 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 114
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK M+++ IV+MMKA +L+ SQGGPIILSQIEN
Sbjct: 115 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIEN 174
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
E+G VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F
Sbjct: 175 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFK- 232
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTE WT +Y +G R AED+A+ VA FI + GS++NYYMYHGGTNF
Sbjct: 233 PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFI-QSGGSFLNYYMYHGGTNF 291
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA + YD APLDEYGL R+PKWGHL++LH A+K C ++S
Sbjct: 292 GRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSN 351
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
QEA +F+ S+CAAFL N D + + V F Y+LPP SISILPDCKT +NTAK+ S
Sbjct: 352 QEAHVFKSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQ 411
Query: 425 EQ------------WEEY-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKH 469
W+ + +E + + + + L EQ+N T+D +DYLWY +
Sbjct: 412 SSQVQMTPVHSGFPWQSFIEETTSSDETDTTYMDGLYEQINITRDTTDYLWYMTDITIGS 471
Query: 470 DPS----DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSL 525
D + +L +SS GH L+ FING+ G+ +G + + + V+L +G N ++L
Sbjct: 472 DEAFLKNGKSPLLTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLAL 531
Query: 526 LSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LS+ VGLP+ G + E AG L ++++G D S + W Y+ GL GE L + T G
Sbjct: 532 LSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTG 591
Query: 584 SRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF 642
S V W S + QPLTW+K F+AP G P+A+++ SMGKG+ W+NGQS+GR+W +
Sbjct: 592 SSSVEWVEGPSMAKKQPLTWHKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGY 651
Query: 643 L--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGIS 682
+ T G PSQ WYHIPRS+L PTGNLLV+ EE G P GIS
Sbjct: 652 IARGSCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSGIS 711
Query: 683 I 683
+
Sbjct: 712 L 712
>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
Length = 725
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/701 (49%), Positives = 441/701 (62%), Gaps = 52/701 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V YD +++IING R+IL SGSIHYPRSTP+MWP LI KAK GGLDV+QT VFWN HEP
Sbjct: 25 SVGYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKAGGLDVIQTYVFWNGHEPS 84
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLV+FIK VQ GL+V LRIGP++ EW +GG P WL VPGI FR+DNE
Sbjct: 85 PGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIWLKYVPGIAFRTDNE 144
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IVNMMKA +L+ ++GGPIILSQIENEYG VE G Y +WAA++
Sbjct: 145 PFKAAMQKFTEKIVNMMKAEKLFQTEGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAQM 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F PN KP +WTE WT +Y +G
Sbjct: 205 AVGLNTGVPWIMCKQEDAPDPVIDTCNGYYC-ENFK-PNKVYKPKMWTEVWTGWYTEFGG 262
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+A+ VA FI + GS+ NYYMYHGGTNFGRTA + YD APLDEYG
Sbjct: 263 AIPTRPVEDLAFSVARFI-QSGGSFFNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYG 321
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQGSSECAAFLVNKD 384
LL+QPKWGHLK+LH A+K C + V V + +KL QEA +F S CAAFL N D
Sbjct: 322 LLQQPKWGHLKDLHKAIKSCEYAL---VAVDPSVTKLGNNQEAHVFNTKSGCAAFLANYD 378
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------------SVEQWEEYKE 432
+ V F Y+LPP SISILPDCKT FNTAK+ S W+ + E
Sbjct: 379 TKYPVRVSFGQGQYDLPPWSISILPDCKTAVFNTAKVTWKTSQVQMKPVYSRLPWQSFIE 438
Query: 433 AIPTYDET-SLRANFLLEQMNTTKDASDYLWY--NFRFKHDPSDSES----VLKVSSLGH 485
T DE+ + + L EQ+ T+DA+DYLWY + D + + +L + S H
Sbjct: 439 ETTTSDESGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEAFLNNGKFPLLTIFSACH 498
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH FING+ G+ +G + T + V L G N ++LLS+ VGLP+ G + E AG
Sbjct: 499 ALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFETWNAG 558
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTW 602
L +S++G D S + W Y++G+ GE L + T GS V W+ S + QPLTW
Sbjct: 559 VLGPISLKGLNTGTWDMSRWKWTYKIGMKGEALGLHTVTGSSSVDWAEGPSMAKKQPLTW 618
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------- 643
YK F+AP G P+A+++ SMGKG+ W+NGQS+GR+W ++
Sbjct: 619 YKATFNAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPGYIAQGSCGTCNYAGTFYDKKC 678
Query: 644 -TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G PSQ WYHIPRS+L PTGNLLV+ EE G P +S+
Sbjct: 679 RTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPQWMSL 719
>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
Precursor
gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
Length = 727
Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/696 (47%), Positives = 439/696 (63%), Gaps = 45/696 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++LIING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP P
Sbjct: 29 VTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+F K V GLY+ LRIGP++ EW +GG P WL VPG+VFR+DNEP
Sbjct: 89 GNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK +L+ +QGGPIILSQIENEYG ++ G Y +W A++A
Sbjct: 149 FKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAEMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ L TGVPW+MCKQ+DAP P+I+ CNG C E F PNS +KP +WTENWT ++ +G
Sbjct: 209 LGLSTGVPWIMCKQEDAPYPIIDTCNGFYC-EGFK-PNSDNKPKLWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R EDIA+ VA FI + GS++NYYMY+GGTNF RTA ++ T Y AP+DEYGLL
Sbjct: 267 IPNRPVEDIAFSVARFI-QNGGSFMNYYMYYGGTNFDRTAGVFIATSYDYDAPIDEYGLL 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLKELH +KLC ++S + QE +F+ + CAAFL N D + A
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVFKSKTSCAAFLSNYDTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPTY 437
V F Y+LPP S+SILPDCKT +NTAK+ + WE Y E P+
Sbjct: 386 RVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIPTSTKFSWESYNEGSPSS 445
Query: 438 DET-SLRANFLLEQMNTTKDASDYLWY--NFRFKHDPS----DSESVLKVSSLGHVLHAF 490
+E + + L+EQ++ T+D +DY WY + D S +L + S GH LH F
Sbjct: 446 NEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVF 505
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
+NG G+++G S+ T + + L G N ++LLS VGLP++G + E G L V
Sbjct: 506 VNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPV 565
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST-HQPLTWYKTVF 607
+++G D S + W Y++GL GE + + T GS V W G QPLTWYK+ F
Sbjct: 566 TLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSF 625
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQG 647
D P G++P+A+++ +MGKG+ WVNG +IGR+W ++ L+ G
Sbjct: 626 DTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTARGNCGRCNYAGIYNEKKCLSHCG 685
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
PSQ WYH+PRS+LKP GNLLV+ EE G P GIS+
Sbjct: 686 EPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISL 721
>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
Length = 724
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/698 (48%), Positives = 442/698 (63%), Gaps = 48/698 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV++T VFWN HEP
Sbjct: 28 SVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R DLV+FIK V GLYV LRIGP++ EW +GG P WL VPG+ FR+DNE
Sbjct: 88 PGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MK++ IV MMKA +L+ +QGGPIIL+QIENEYG VE G Y +W A++
Sbjct: 148 PFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW+MCKQ+DAP P+I+ CNG C E F PNS +KP +WTENWT +Y +G
Sbjct: 208 ALGLSTGVPWIMCKQEDAPGPIIDTCNGYYC-EDFK-PNSINKPKMWTENWTGWYTDFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
R EDIAY VA FI K GS VNYYMYHGGTNF RTA ++ + Y APLDEYGL
Sbjct: 266 AVPYRPVEDIAYSVARFIQK-GGSLVNYYMYHGGTNFDRTAGEFMASSYDYDAPLDEYGL 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+ HLK LH A+KL +LS + QEA++F S CAAFL NKD+ +
Sbjct: 325 PREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFWSKSSCAAFLSNKDENSA 384
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPT 436
A V F Y+LPP S+SILPDCKT +NTAK+++ W + EA PT
Sbjct: 385 ARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNAPSVHRNMVPTGTKFSWGSFNEATPT 444
Query: 437 YDETSLRA-NFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHA 489
+E A N L+EQ++ T D SDY WY ++ +L V S GH LH
Sbjct: 445 ANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHV 504
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+A+G T + + L G N ++LLSV VGLP+ G + E+ G L
Sbjct: 505 FVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGP 564
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G D S + W Y++G+ GE L + T+ S V W++ GS + QPLTWYK+
Sbjct: 565 VTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQ-GSFVAKKQPLTWYKS 623
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTP 645
F P G++P+A+++ +MGKG+ W+NG++IGR+W ++ L+
Sbjct: 624 TFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKAQGSCGRCNYAGTFDAKKCLSN 683
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G SQ WYH+PRS+LK + NL+V+ EE G P GIS+
Sbjct: 684 CGEASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISL 720
>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
Length = 724
Score = 642 bits (1655), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/698 (48%), Positives = 442/698 (63%), Gaps = 48/698 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV++T VFWN HEP
Sbjct: 28 SVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R DLV+FIK V GLYV LRIGP++ EW +GG P WL VPG+ FR+DNE
Sbjct: 88 PGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MK++ IV MMKA +L+ +QGGPIIL+QIENEYG VE G Y +W A++
Sbjct: 148 PFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TGVPW+MCKQ+DAP P+I+ CNG C E F PNS +KP +WTENWT +Y +G
Sbjct: 208 ALGLSTGVPWIMCKQEDAPGPIIDTCNGYYC-EDFK-PNSINKPKMWTENWTGWYTDFGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
R EDIAY VA FI K GS +NYYMYHGGTNF RTA ++ + Y APLDEYGL
Sbjct: 266 AVPYRPVEDIAYSVARFIQK-GGSLINYYMYHGGTNFDRTAGEFMASSYDYDAPLDEYGL 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
R+PK+ HLK LH A+KL +LS + QEA++F S CAAFL NKD+ +
Sbjct: 325 PREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFWSKSSCAAFLSNKDENSA 384
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPT 436
A V F Y+LPP S+SILPDCKT +NTAK+++ W + EA PT
Sbjct: 385 ARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKVNAPSVHRNMVPTGTKFSWGSFNEATPT 444
Query: 437 YDETSLRA-NFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHA 489
+E A N L+EQ++ T D SDY WY ++ +L V S GH LH
Sbjct: 445 ANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHV 504
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
F+NG+ G+A+G T + + L G N ++LLSV VGLP+ G + E+ G L
Sbjct: 505 FVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGP 564
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G D S + W Y++G+ GE L + T+ S V W++ GS + QPLTWYK+
Sbjct: 565 VTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQ-GSFVAKKQPLTWYKS 623
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTP 645
F P G++P+A+++ +MGKG+ W+NG++IGR+W ++ L+
Sbjct: 624 TFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKAQGSCGRCNYAGTFDAKKCLSN 683
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G SQ WYH+PRS+LK + NL+V+ EE G P GIS+
Sbjct: 684 CGEASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISL 720
>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 813
Score = 641 bits (1653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/830 (41%), Positives = 480/830 (57%), Gaps = 78/830 (9%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+NV+YD ++IING R+++ SGS+HYPRST MWP LI KAK+GGLD ++T +FW+ HE
Sbjct: 9 GDNVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHE 68
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ ++DF+GR D ++F + VQ GLYV +RIGP++ EW YGG P WLH++PGI FR+D
Sbjct: 69 PQRRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQFRTD 128
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+ +K M+ + T IVNM K A L+ASQGGPIIL+QIENEYG V + G Y+ W A
Sbjct: 129 NQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCA 188
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A L G+PW+MC+Q DAP P+IN CNG C F+ PN+P P ++TENW +++ +
Sbjct: 189 QMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDYDFS-PNNPKSPKMFTENWVGWFKKW 247
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDE 325
GD+ RS ED+A+ VA F + G + NYYMYHGGTNFGRTA +T YD APLDE
Sbjct: 248 GDKDPYRSPEDVAFAVARFF-QSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDE 306
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG--SSECAAFLVNK 383
YG L QPKWGHLK+LH+++K+ K + + F S E FL N
Sbjct: 307 YGNLNQPKWGHLKQLHASIKMGEKILTNSTRSDQKLXSFVTLTKFSNPTSGERFCFLSNT 366
Query: 384 DKRNNATVYF-SNLMYELPPLSISILPDCKTVAFNTAKLDS-------VEQWEEYKE--- 432
D +N+AT+ ++ Y +P S+SIL C FNTAK++S V+ +E +
Sbjct: 367 DNKNDATIDLQADGKYFVPAWSVSILDGCNKEVFNTAKINSQTSMFVKVQNKKENAQFSW 426
Query: 433 ---AIPTYD----ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS--ESVLKVSSL 483
P D + + +AN LLEQ TT D SDYLWY + + S L+V++
Sbjct: 427 VWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSLQNVTLQVNTK 486
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH+LHAF+N ++GS + + +SF K + + GTN ++LLS VGL + A+ +
Sbjct: 487 GHMLHAFVNRRYIGS-QWRSNGQSFVFXKPILIKPGTNTITLLSATVGLKNYDAFYDTVP 545
Query: 544 AGLRN--VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQP 599
G+ + + G +K D SS W Y+VGL GE Q++ S+ WS S +
Sbjct: 546 TGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTNWSTINQKSIGRR 605
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
+T YKT F P+G DPV +++ MGKG+AWVNGQSIGR+W SF+
Sbjct: 606 MTLYKTNFKTPSGIDPVTLDMQGMGKGQAWVNGQSIGRFWPSFIAGNDSCSTTCDYRGAY 665
Query: 647 ---------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
G PSQ WYHIPRSFL N LVL EE G P +S+ T+++ T+CG+ ++
Sbjct: 666 NPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITIGTICGNANE 725
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
+++ C G IS+I FASYGNP G C ++
Sbjct: 726 GS--------------------------TLELSCQGGHIISEIQFASYGNPEGKCGSFKQ 759
Query: 758 GSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
GS H NS +VEK C+G SC++ V + F I L + A C+
Sbjct: 760 GSWHVINSAILVEKLCIGMESCSIDVSAKSFGLGDVTNISARLAIQALCS 809
>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 803
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 348/830 (41%), Positives = 478/830 (57%), Gaps = 79/830 (9%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+NV+YD ++IING R+++FSGSIHYPRST MWP LI KAK+GGLD ++T +FW+ HE
Sbjct: 2 GDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHE 61
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ ++DFSG + ++F + VQ GLY+ +RIGP++ EW YGG P WLH++PGI R+D
Sbjct: 62 PQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTD 121
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+ +K M + T IVNM K A L+ASQGGPIIL+QIENEYG V + G Y+ W A
Sbjct: 122 NQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCA 181
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A L GVPW+MC+Q DAP P+IN CNG C ++F+ PN+P P ++TENW +++ +
Sbjct: 182 QMAESLNIGVPWIMCQQSDAPQPIINTCNGFYC-DSFS-PNNPKSPKMFTENWVGWFKKW 239
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDE 325
GD+ RSAED+A+ VA F + G + NYYMYHGGTNFGRT+ +T YD APLDE
Sbjct: 240 GDKDPYRSAEDVAFSVARFF-QSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDE 298
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG--SSECAAFLVNK 383
YG L QPKWGHLK+LHS++KL K + +G + F F + E FL N
Sbjct: 299 YGNLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTLTKFSNPTTKERFCFLSNT 358
Query: 384 DKRNNATVYF-SNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------W 427
D N+AT+ ++ Y +P S+SI+ CK FNTAK++S W
Sbjct: 359 DDTNDATIDLQADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTSMFVKVQNEKENVKLSW 418
Query: 428 EEYKEAIPT--YDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDPSDSESVLKVSSL 483
EA+ + + + N LLEQ TT D+SDYLWY N S L+V++
Sbjct: 419 VWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSIHNVTLQVNTK 478
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GHVLHAF+N ++GS G + +SF EK + L GTN ++LLS VGL + A+ +
Sbjct: 479 GHVLHAFVNTRYIGSQWGNNG-QSFVFEKPILLKAGTNIITLLSATVGLKNYDAFYDTLP 537
Query: 544 AGLRN---VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQP 599
G+ I + SS W Y+VGL GE Q++ S+ W+ +S +
Sbjct: 538 TGIDGGPIYLIGDGNVTTNLSSNLWSYKVGLNGEIKQLYNPVFSQETSWNTLNKNSIGRR 597
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
+TWYKT F P+G DPV +++ MGKGEAW+NGQSIGR+W SF+
Sbjct: 598 MTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWPSFIAGNDNCSETCDYRGAY 657
Query: 647 ---------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
G PSQ WYHIPRSFL N LVL EE G P +S+ T+++ T+CG+ ++
Sbjct: 658 DPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQTITIGTICGNANE 717
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
+++ C IS+I FASYGNP G C ++
Sbjct: 718 GS--------------------------TLELSCQGEYIISEIQFASYGNPKGKCGSFKQ 751
Query: 758 GSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
GS +NS ++EK C +SC+V V + F + L+V A C+
Sbjct: 752 GSWDVTNSALLLEKTCKDMKSCSVDVSAKLFGLGDAVNLSARLVVQALCS 801
>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
Length = 828
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 358/861 (41%), Positives = 499/861 (57%), Gaps = 90/861 (10%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M Q LL LF +L+T+ G ++ V++D R++ I+G R+IL SGSIHYPRST M
Sbjct: 3 MKQFNLLSLFLILITSFGSANS-----TIVSHDERAITIDGQRRILLSGSIHYPRSTSDM 57
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI+KAK+GGLD ++T VFWN HEP Q+DFSG DLVRFIK +Q+ GLY LRIGP
Sbjct: 58 WPDLISKAKDGGLDTIETYVFWNAHEPSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIGP 117
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
++ EW YGG P WLH++P + FR+ N F M+ + T IVNMMK L+ASQGGPIIL
Sbjct: 118 YVCAEWNYGGFPVWLHNMPDMKFRTINPGFMNEMQNFTTKIVNMMKEESLFASQGGPIIL 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
+QIENEYG V S+ +G Y+ W A +A L GVPW+MC+Q AP P+I CNG C
Sbjct: 178 AQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWIMCQQPHAPQPMIETCNGFYCD 237
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ P++P P +WTENWT +++ +G + R+AED+A+ VA F + G++ NYYMYH
Sbjct: 238 Q--YKPSNPSSPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFF-QTGGTFQNYYMYH 294
Query: 301 GGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGR A Y+ T Y APLDEYG L QPKWGHLK+LH+ +K KP+ G + ++
Sbjct: 295 GGTNFGRVAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHTLLKSMEKPLTYGNISTI 354
Query: 360 NFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
+ A ++ + + + F+ N + +A V F Y +P S+S+LPDC A+NTA
Sbjct: 355 DLGNSVTATVYSTNEKSSCFIGNVNATADALVNFKGKDYNVPAWSVSVLPDCDKEAYNTA 414
Query: 420 KL---------DSVEQWEEYK---EAIPTYDETSLR------ANFLLEQMNTTKDASDYL 461
++ DS ++ E+ K T +T L+ A L++Q + T DASDYL
Sbjct: 415 RVNTQTSIITEDSCDEPEKLKWTWRPEFTTQKTILKGSGDLIAKGLVDQKDVTNDASDYL 474
Query: 462 WYNFRF---KHDPSDSESV-LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
WY R K DP S ++ L+V S HVLHA++NG++VG+ + + + EK V+L+
Sbjct: 475 WYMTRVHLDKKDPIWSRNMSLRVHSNAHVLHAYVNGKYVGNQIVRDNKFDYRFEKKVNLV 534
Query: 518 NGTNNVSLLSVMVGLPDSGAYLERRVAGLRN----VSIQGAKEL-KDFSSFSWGYQVGLL 572
+GTN+++LLSV VGL + G + E G+ V +G + + KD S W Y++GL
Sbjct: 535 HGTNHLALLSVSVGLQNYGPFFESGPTGINGPVKLVGYKGDETIEKDLSKHQWDYKIGLN 594
Query: 573 GEKLQIFT--DYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWV 630
G ++F+ G WS + L+WYK F AP G DPV ++L +GKGE W+
Sbjct: 595 GFNHKLFSMKSAGHHHRKWSTEKLPADRMLSWYKANFKAPLGKDPVIVDLNGLGKGEVWI 654
Query: 631 NGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTG-NL 667
NGQSIGRYW SF + G P+Q WYH+PRSFL G N
Sbjct: 655 NGQSIGRYWPSFNSSDEGCTEECDYRGEYGSDKCAFMCGKPTQRWYHVPRSFLNDKGHNT 714
Query: 668 LVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKV 727
+ L EE G P + TV +C K H+ KV
Sbjct: 715 ITLFEEMGGDPSMVKFKTVVTGRVCA---------------------KAHE-----HNKV 748
Query: 728 QIRCPSGRKISKILFASYGNPNGNCENYAIGSCH-SSNSRAIVEKACLGKRSCTVPVWTE 786
++ C + R IS + FAS+GNP+G C ++A GSC + ++ +V K C+GK +CT+ V +
Sbjct: 749 ELSC-NNRPISAVKFASFGNPSGQCGSFAAGSCEGAKDAVKVVAKECVGKLNCTMNVSSH 807
Query: 787 KFYGD-PCPGIPKALLVDAQC 806
KF + C PK L V+ +C
Sbjct: 808 KFGSNLDCGDSPKRLFVEVEC 828
>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; AltName:
Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
Length = 731
Score = 639 bits (1649), Expect = e-180, Method: Compositional matrix adjust.
Identities = 341/721 (47%), Positives = 448/721 (62%), Gaps = 55/721 (7%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
+L LF + + S V+YD +++IING ++IL SGSIHYPRSTP+MWP LI
Sbjct: 11 ILLLFSCIFSAASAS---------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 61
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GGLDV+QT VFWN HEP PG + F R DLV+FIK VQ +GL+V LRIGP++ E
Sbjct: 62 QKAKDGGLDVIQTYVFWNGHEPSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIGPYVCAE 121
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK M+++ IV+MMKA +L+ +QGGPIILSQIEN
Sbjct: 122 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIEN 181
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
E+G VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFK- 239
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTE WT +Y +G R AED+A+ VA FI + GS++NYYMYHGGTNF
Sbjct: 240 PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFI-QSGGSFLNYYMYHGGTNF 298
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA + YD APLDEYGL R+PKWGHL++LH A+K C ++S
Sbjct: 299 GRTAGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSN 358
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
QEA +F+ S+CAAFL N D + + V F Y+LPP SISILPDCKT +NTAK+ S
Sbjct: 359 QEAHVFKSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQ 418
Query: 425 EQ------------WEEY-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKH 469
W+ + +E + + + + L EQ+N T+D +DYLWY +
Sbjct: 419 SSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGS 478
Query: 470 DPS----DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSL 525
D + +L + S GH L+ FING+ G+ +G + + + V+L +G N ++L
Sbjct: 479 DEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLAL 538
Query: 526 LSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LS+ VGLP+ G + E AG L ++++G D S + W Y+ GL GE L + T G
Sbjct: 539 LSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTG 598
Query: 584 SRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF 642
S V W S + QPLTWYK F+AP G P+A+++ SMGKG+ W+NGQS+GR+W +
Sbjct: 599 SSSVEWVEGPSMAEKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGY 658
Query: 643 L--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGIS 682
+ T G PSQ WYHIPRS+L PTGNLLV+ EE G P IS
Sbjct: 659 IARGSCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSRIS 718
Query: 683 I 683
+
Sbjct: 719 L 719
>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
Length = 727
Score = 639 bits (1647), Expect = e-180, Method: Compositional matrix adjust.
Identities = 330/696 (47%), Positives = 438/696 (62%), Gaps = 45/696 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++LIING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP P
Sbjct: 29 VTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+F K V GLY+ LRIGP++ EW +GG P WL VPG+VFR+DNEP
Sbjct: 89 GNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK +L+ +QGGPIILSQIENEYG ++ G Y +W A++A
Sbjct: 149 FKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAEMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ L TGVPW+M KQ+DAP P+I+ CNG C E F PNS +KP +WTENWT ++ +G
Sbjct: 209 LGLSTGVPWIMSKQEDAPYPIIDTCNGFYC-EGFK-PNSDNKPKLWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R EDIA+ VA FI + GS++NYYMY+GGTNF RTA ++ T Y AP+DEYGLL
Sbjct: 267 IPNRPVEDIAFSVARFI-QNGGSFMNYYMYYGGTNFDRTAGVFIATSYDYDAPIDEYGLL 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLKELH +KLC ++S + QE +F+ + CAAFL N D + A
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVFKSKTSCAAFLSNYDTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPTY 437
V F Y+LPP S+SILPDCKT +NTAK+ + WE Y E P+
Sbjct: 386 RVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIPTSTKFSWESYNEGSPSS 445
Query: 438 DET-SLRANFLLEQMNTTKDASDYLWY--NFRFKHDPS----DSESVLKVSSLGHVLHAF 490
+E + + L+EQ++ T+D +DY WY + D S +L + S GH LH F
Sbjct: 446 NEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVF 505
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
+NG G+++G S+ T + + L G N ++LLS VGLP++G + E G L V
Sbjct: 506 VNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPV 565
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST-HQPLTWYKTVF 607
+++G D S + W Y++GL GE + + T GS V W G QPLTWYK+ F
Sbjct: 566 TLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVVKKQPLTWYKSSF 625
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQG 647
D P G++P+A+++ +MGKG+ WVNG +IGR+W ++ L+ G
Sbjct: 626 DTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTARGNCGRCNYAGIYNEKKCLSHCG 685
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
PSQ WYH+PRS+LKP GNLLV+ EE G P GIS+
Sbjct: 686 EPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISL 721
>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 731
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 340/721 (47%), Positives = 448/721 (62%), Gaps = 55/721 (7%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
+L LF + + S V+YD +++IING ++IL SGSIHYPRSTP+MWP LI
Sbjct: 11 ILLLFSCIFSAASAS---------VSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLI 61
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GGLDV+QT VFWN HEP PG++ F R DLV+FIK VQ GL+V LRIGP++ E
Sbjct: 62 QKAKDGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK M+++ IV+MMKA +L+ +QGGPIILSQIEN
Sbjct: 122 WNFGGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIEN 181
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
E+G VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFK- 239
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTE WT +Y +G R AED+A+ VA FI + GS++NYYMYHGGTNF
Sbjct: 240 PNKDYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFI-QSGGSFLNYYMYHGGTNF 298
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA + YD APLDEYGLLR+PKWGHL++LH A+K C ++S
Sbjct: 299 GRTAGGPFMATSYDYDAPLDEYGLLREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSN 358
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
QEA +F+ S+CAAFL N D + + V F Y+LPP SISILPDCKT ++TAK+ S
Sbjct: 359 QEAHVFKSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYSTAKVGSQ 418
Query: 425 EQ------------WEEY-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKH 469
W+ + +E + + + + L EQ+N T+D +DYLWY +
Sbjct: 419 SSQVQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGS 478
Query: 470 DPS----DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSL 525
D + +L + S GH L+ FING+ G+ +G + + + V+L +G N ++L
Sbjct: 479 DEAFLKNGKSPLLTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLAL 538
Query: 526 LSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LS+ VGLP+ G + E AG L ++++G D S + W Y+ GL GE L + T G
Sbjct: 539 LSISVGLPNVGTHFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTG 598
Query: 584 SRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF 642
S V W S + QPLTWYK F+AP G P+A+++ SMGKG+ W+NGQS+GR+W +
Sbjct: 599 SSSVEWVEGPSMAKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGY 658
Query: 643 L--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGIS 682
+ T G PSQ WYHIPRS+L P GNLLV+ EE G P IS
Sbjct: 659 IARGSCGDCSYAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPNGNLLVVFEEWGGDPSRIS 718
Query: 683 I 683
+
Sbjct: 719 L 719
>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
Length = 725
Score = 638 bits (1646), Expect = e-180, Method: Compositional matrix adjust.
Identities = 349/728 (47%), Positives = 448/728 (61%), Gaps = 69/728 (9%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
+L LF + + S G YD +++IING R+IL SGSIHYPRSTP MWP LI
Sbjct: 11 ILLLFSCIFSAASASVG---------YDHKAIIINGQRRILISGSIHYPRSTPGMWPDLI 61
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK GGLDV+QT VFWN HEP PG++ F R DLV+FIK VQ GL+V LRIGP++ E
Sbjct: 62 QKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK M+++ IVNMMKA +L+ +QGGPIILSQIEN
Sbjct: 122 WNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIEN 181
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
E+G VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYC-ENFK- 239
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTE WT +Y +G R AED+A+ VA FI + GS+ NYYMYHGGTNF
Sbjct: 240 PNKVYKPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFI-QSGGSFFNYYMYHGGTNF 298
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA + YD APLDEYGLL+QPKWGHL++LH A+K C + V V + +KL
Sbjct: 299 GRTAGGPFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHAL---VAVDPSVTKL 355
Query: 365 ---QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL 421
QEA +F S CAAFL N D + + V F + Y+LPP SISILPDCKT FNTAK+
Sbjct: 356 GNNQEAHVFNSKSGCAAFLANHDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKV 415
Query: 422 DSVEQWEEYK-EAIPTYDETSLRA----------------NFLLEQMNTTKDASDYLWY- 463
W+ + + P Y ++ + L EQ+ T+DA+DYLWY
Sbjct: 416 ----AWKASEVQMKPVYSRLPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWYM 471
Query: 464 -NFRFKHDPSDSES----VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLIN 518
+ D + ++ +L + S GH LH FING+ G+ +G + T + V L
Sbjct: 472 TDITIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRP 531
Query: 519 GTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKL 576
G N ++LLS+ VGLP+ G + E G L +S++G D S + W Y++G+ GE L
Sbjct: 532 GINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESL 591
Query: 577 QIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
+ T GS V W+ S + QPLTWYK FDAP G P+A+++ SMGKG+ W+NGQS+
Sbjct: 592 GLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSV 651
Query: 636 GRYWVSFL--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
GR+W ++ T G PSQ WYHIPRS+L PTGNLLV+ EE
Sbjct: 652 GRHWPGYIAQGSCGNCYYAGTFNDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWG 711
Query: 676 GYPPGISI 683
G P +S+
Sbjct: 712 GDPSWMSL 719
>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
Length = 848
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 354/856 (41%), Positives = 478/856 (55%), Gaps = 97/856 (11%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
CLF + TI V++DGR++ I+G R++L SGSIHYPRST +MWP LI
Sbjct: 35 FFCLFTFVSATI------------VSHDGRAITIDGKRRVLISGSIHYPRSTAEMWPDLI 82
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
K+KEGGLD ++T VFWN HEP Q+DFSG DLVRFIK +QA+GLY LRIGP++ E
Sbjct: 83 KKSKEGGLDAIETYVFWNSHEPSRRQYDFSGNLDLVRFIKTIQAEGLYAVLRIGPYVCAE 142
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W YGG P WLH++PG R+ N F M+ + ++IV+MMK L+ASQGGPIIL+Q+EN
Sbjct: 143 WNYGGFPMWLHNLPGCELRTANSVFMNEMQNFTSLIVDMMKDENLFASQGGPIILAQVEN 202
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
EYG V ++ G Y+ W + +A L GVPW+MC+Q DAP P+IN CNG C +
Sbjct: 203 EYGNVMSAYGAAGKTYIDWCSNMAESLDIGVPWIMCQQSDAPQPMINTCNGWYCDQ--FT 260
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN+ + P +WTENWT +++ +G + R+AED+A+ VA F + G++ NYYMYHGGTNF
Sbjct: 261 PNNANSPKMWTENWTGWFKSWGGKDPHRTAEDVAFAVARFF-QTGGTFQNYYMYHGGTNF 319
Query: 306 GRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA Y+ T Y APLDEYG L QPKWGHLK+LH + + G + ++++
Sbjct: 320 GRTAGGPYITTSYDYDAPLDEYGNLNQPKWGHLKQLHDILHSMEYTLTHGNISTIDYDNS 379
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS- 423
A I+ E A F N ++ ++AT+ F Y +P S+SILPDC+ V +NTAK+ +
Sbjct: 380 VTATIYATDKESACFFGNANETSDATIVFKGTEYNVPAWSVSILPDCENVGYNTAKVKTQ 439
Query: 424 ----VEQWEEYKEA--------IPTYDETS-------LRANFLLEQMNTTKDASDYLWYN 464
V+Q E ++ IP T+ A L++Q DASDYLWY
Sbjct: 440 TAIMVKQKNEAEDQPSSLKWSWIPENTHTTSLLGKGHAHARQLIDQKAAANDASDYLWYM 499
Query: 465 FRF---KHDPS-DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGT 520
K DP S+ L+V+ GHVLHA++NG+ +GS K+ S+ EK + L G
Sbjct: 500 TSLHIKKDDPVWSSDMSLRVNGSGHVLHAYVNGKHLGSQFAKYGVFSYVFEKSLKLRPGK 559
Query: 521 NNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQG----AKELKDFSSFSWGYQVGLLGEK 575
N +SLLS VGL + G + G+ V I G K +KD SS W Y VGL G
Sbjct: 560 NVISLLSATVGLQNYGPMFDLVQTGIPGPVEIIGHRGDEKVVKDLSSHKWSYSVGLNGFH 619
Query: 576 LQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
++++ W T++ + WYKT F AP G DPV ++L MGKG AWVNG +I
Sbjct: 620 NELYSSNSRHASRWVEQDLPTNKMMIWYKTTFKAPLGKDPVVLDLQGMGKGFAWVNGNNI 679
Query: 636 GRYWVSFLTPQ-----------------------GTPSQSWYHIPRSFLKPTGNLLVLLE 672
GRYW SFL + G P+Q WYH+PRSF N LVL E
Sbjct: 680 GRYWPSFLAEEDGCSTEVCDYRGAYDNNKCVTNCGKPTQRWYHVPRSFFNDYENTLVLFE 739
Query: 673 EENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP 732
E G P G++ TV+V G VS S G +++ C
Sbjct: 740 EFGGNPAGVNFQTVTV----GKVSGS----------------------AGEGETIELSC- 772
Query: 733 SGRKISKILFASYGNPNGNCENYAIGSCHSSNSR-AIVEKACLGKRSCTVPVWTEKFYGD 791
+G+ IS I FAS+G+P G Y G+C SN +IV+KAC+GK +C + + F
Sbjct: 773 NGKSISAIEFASFGDPQGTSGAYVKGTCEGSNDAFSIVQKACVGKETCKLEASKDVFGPT 832
Query: 792 PC-PGIPKALLVDAQC 806
C + L V A C
Sbjct: 833 SCGSDVVNTLAVQATC 848
>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
Length = 722
Score = 636 bits (1640), Expect = e-179, Method: Compositional matrix adjust.
Identities = 326/708 (46%), Positives = 453/708 (63%), Gaps = 47/708 (6%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G + V YD R LIING ++L S SIHYPR+ PQMW +LI+ AK GG+DV++T VFW+
Sbjct: 19 GLSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDG 78
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
H+P ++F GR DLV F+K V GLY LRIGP++ EW GG P WL DVPGI FR
Sbjct: 79 HQPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVPGIEFR 138
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
++N+PFK M+ + IV MMK +L+A QGGPIIL+QIENEYG ++ ++ G Y+ W
Sbjct: 139 TNNQPFKAEMQAFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMEW 198
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA +A L TGVPW+MC+Q DAPD +++ CNG C + +A PN+ KP +WTENW+ ++Q
Sbjct: 199 AANMAQGLGTGVPWIMCQQSDAPDYILDTCNGFYC-DAWA-PNNKKKPKMWTENWSGWFQ 256
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPL 323
+G+ + R ED+A+ VA F + GS+ NYYMY GGTNFGR++ YV T Y AP+
Sbjct: 257 KWGEASPHRPVEDVAFAVARFFQR-GGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPI 315
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLV 381
DE+G++RQPKWGHLK+LH+A+KLC + S ++ +LQEA ++ +S CAAFL
Sbjct: 316 DEFGVIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLA 375
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------------SVEQWEE 429
N D ++ATV F++ Y LP S+SILPDCKTV+ NTAK+ + WE
Sbjct: 376 NIDSSSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVHVQTAMPTMKPSITGLAWES 435
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDPSDSESVLKVSSLGHV 486
Y E + + ++ + A+ LLEQ+NTTKD SDYLWY + D + +++L + S+ V
Sbjct: 436 YPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAASGKALLSLESMRDV 495
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+H F+NG+ GSA K + +E+ + L +G N++++L VGL + G ++E AG+
Sbjct: 496 VHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFIETWGAGI 555
Query: 547 R-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
+V ++G + D ++ W +QVGL GE L IFT+ GS+ V WS Q L WYK
Sbjct: 556 NGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWSS-AVPQGQALVWYK 614
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------------ 646
FD+P+G+DPVA++L SMGKG+AW+NGQSIGR+W S P
Sbjct: 615 AHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPSLRAPDTAGCPQTCDYRGSYSSSK 674
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G PSQ WYH+PRS+L+ +GNL+VL EEE G P G+S T +V
Sbjct: 675 CRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEGGKPSGVSFVTRTVV 722
>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
Length = 718
Score = 635 bits (1637), Expect = e-179, Method: Compositional matrix adjust.
Identities = 343/701 (48%), Positives = 440/701 (62%), Gaps = 56/701 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD ++L+I+G R+IL SGSIHYPRSTP+MWP L KAK+GGLDV+QT VFWN HEP
Sbjct: 24 SVSYDHKALVIDGQRRILISGSIHYPRSTPEMWPDLFQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG + R D V+ K Q L V LR+ P + G P WL VPG+ FR+DNE
Sbjct: 84 PGNYTLKDRLDWVKLSKLAQQAVLNVHLRMVP------TFVGFPVWLKYVPGMAFRTDNE 137
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ T IV MMKA L+ +QGGPII+SQIENEYG VE G Y +WAA++
Sbjct: 138 PFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWAAQM 197
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPW MCKQ+DAPDPVI+ CNG C E F PN KP +WTENW+ +Y +G
Sbjct: 198 AVGLDTGVPWDMCKQEDAPDPVIDTCNGYYC-ENFT-PNENFKPKMWTENWSGWYTDFGG 255
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R ED+AY VA FI + +GS+VNYYMYHGGTNFGRT+S + YD AP+DEYG
Sbjct: 256 AISHRPTEDLAYSVATFI-QNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 314
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLS--GVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
L +PKW HLK LH A+K C ++S + + L+ + +S CAAFL N D
Sbjct: 315 LPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLANYDT 374
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTA---------KLDSVE---QWEEYKEA 433
++ ATV F N Y+LPP S+SILPDCKTV FNTA ++ VE W+ Y E
Sbjct: 375 KSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHSFHKRMTPVETTFDWQSYSEE 434
Query: 434 IPTY--DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGH 485
P Y D+ S+ AN L EQ+N T+D+SDYLWY PS+S L ++S GH
Sbjct: 435 -PAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQFPTLTINSAGH 493
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVA 544
VLH F+NG+ G+ +G + T + V+L G N +SLLSV VGLP+ G + E V
Sbjct: 494 VLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNVGLHFETWNVG 553
Query: 545 GLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTW 602
L V ++G E +D S W Y+VGL GE L + T GS + W++ S + QPLTW
Sbjct: 554 VLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQGSSLAKKQPLTW 613
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------------- 643
YKT FDAP+G+DPVA+++ SMGKGE W+N QSIGR+W +++
Sbjct: 614 YKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIAHGNCDECNYAGTFTNPKC 673
Query: 644 -TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G P+Q WYHIPRS+L +GN+LV+LEE G P GIS+
Sbjct: 674 RTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISL 714
>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
Length = 725
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 348/728 (47%), Positives = 447/728 (61%), Gaps = 69/728 (9%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
+L LF + + S G YD +++IING R+IL SGSIHYPRSTP MWP LI
Sbjct: 11 ILLLFSCIFSAASASVG---------YDHKAIIINGQRRILISGSIHYPRSTPGMWPDLI 61
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK GGLDV+QT VFWN HEP PG++ F R DLV+FIK VQ GL+V LRIGP++ E
Sbjct: 62 QKAKAGGLDVIQTYVFWNGHEPSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAE 121
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPGI FR+DNEPFK M+++ IVNMMKA +L+ +QGGPIILSQIEN
Sbjct: 122 WNFGGFPIWLKYVPGIAFRTDNEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIEN 181
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
E+G VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F
Sbjct: 182 EFGPVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYC-ENFK- 239
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTE WT +Y +G R AED+A+ VA FI + GS+ NYYMYHGGTNF
Sbjct: 240 PNKVYKPKMWTEVWTGWYTEFGGAIPTRPAEDLAFSVARFI-QSGGSFFNYYMYHGGTNF 298
Query: 306 GRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRTA + YD APLDEYGLL+QPKWGHL++LH A+K C + V V + +KL
Sbjct: 299 GRTAGGPFMATSYDYDAPLDEYGLLQQPKWGHLRDLHKAIKSCEHAL---VAVDPSVTKL 355
Query: 365 ---QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL 421
QEA +F S CAAFL N D + + V F + Y+LPP SISILPDCKT FNTAK+
Sbjct: 356 GNNQEAHVFNSKSGCAAFLANYDTKYSVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKV 415
Query: 422 DSVEQWEEYK-EAIPTYDETSLRA----------------NFLLEQMNTTKDASDYLWY- 463
W+ + + P Y ++ + L EQ+ T+DA+DYLWY
Sbjct: 416 ----AWKASEVQMKPVYSRLPWQSFIEETTTSDETGTTTLDGLYEQIYMTRDATDYLWYM 471
Query: 464 -NFRFKHDPSDSES----VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLIN 518
+ D + ++ +L + S GH LH FING+ G+ +G + T + V L
Sbjct: 472 TDITIGSDEAFLKNGKFPLLTIFSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRP 531
Query: 519 GTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKL 576
G N ++LLS+ VGLP+ G + E G L +S++G D S + W Y++G+ GE L
Sbjct: 532 GINKLALLSISVGLPNVGTHFETWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESL 591
Query: 577 QIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
+ T GS V W+ S + QPLTWYK FDAP G P+A+++ SMGKG+ W+NGQS+
Sbjct: 592 GLHTVTGSSSVDWAEGPSMAQKQPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSV 651
Query: 636 GRYWVSFL--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
GR+W ++ T G PSQ W HIPRS+L PTGNLLV+ EE
Sbjct: 652 GRHWPGYIAQGSCGNCYYAGTFNDKKCRTYCGKPSQRWCHIPRSWLTPTGNLLVVFEEWG 711
Query: 676 GYPPGISI 683
G P +S+
Sbjct: 712 GDPSWMSL 719
>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 726
Score = 634 bits (1635), Expect = e-179, Method: Compositional matrix adjust.
Identities = 339/702 (48%), Positives = 441/702 (62%), Gaps = 54/702 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++IING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV++T VFWN HEP
Sbjct: 28 SVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R DLV+FIK V GLYV LRIGP++ EW +GG P WL VPG+ FR+DNE
Sbjct: 88 PGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNE 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILS--QIENEYGMVEHSFLEKGPPYVRWAA 206
PFK MK++ IV MMKA +L+ +QGGPIIL+ QIENEYG VE G Y +W A
Sbjct: 148 PFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQGQIENEYGPVEWEIGAPGKAYTKWVA 207
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ L TGVPW+MCKQ+DAP P+I+ CNG C E F PNS +KP +WTENWT +Y +
Sbjct: 208 QMALGLSTGVPWIMCKQEDAPSPIIDTCNGYYC-EDFK-PNSSNKPKMWTENWTGWYTEF 265
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G R EDIAY VA FI K GS+VNYYMYHGGTNF RTA ++ + Y APLDEY
Sbjct: 266 GGAVPYRPVEDIAYSVARFIQK-GGSFVNYYMYHGGTNFDRTAGEFMASSYDYDAPLDEY 324
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
GL R+PK+ HLK LH +KL +LS + QEA++F S CAAFL NKD+
Sbjct: 325 GLPREPKYSHLKALHKVIKLSEPALLSADATVTSLGAKQEAYVFWSKSSCAAFLSNKDES 384
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAI 434
+ A V F Y LPP S+SILPDCKT +NTAK+++ W + EA
Sbjct: 385 SAARVMFRGFPYVLPPWSVSILPDCKTEFYNTAKVNAPSVHRNMVPTGARFSWGSFNEAT 444
Query: 435 PTYDETSLRA-NFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGH 485
PT +E A N L+EQ++ T D SDY WY E+ LK V S GH
Sbjct: 445 PTANEAGTFARNGLVEQISMTWDKSDYFWYLTDI--TIGSGETFLKTGDFPLFTVMSAGH 502
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LH F+NG+ G+A+G T + + L G N ++LLSV VGLP+ G + E+ G
Sbjct: 503 ALHVFVNGQLSGTAYGGLDHPKLTFTQKIKLHAGVNKLALLSVAVGLPNVGTHFEQWNKG 562
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLT 601
L V+++G D S + W Y++G+ GE L + TD S V W++ GS + QPLT
Sbjct: 563 VLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTDTESSGVRWTQ-GSFVAKKQPLT 621
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------- 642
WYK+ F P G++P+A+++ +MGKG+ W+NG++IGR+W ++
Sbjct: 622 WYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKAQGSCGRCNYAGTFNAKK 681
Query: 643 -LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
L+ G SQ WYH+PRS+LK + NL+V+ EE G P GIS+
Sbjct: 682 CLSNCGEASQRWYHVPRSWLK-SQNLIVVFEEWGGDPNGISL 722
>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
Length = 780
Score = 632 bits (1630), Expect = e-178, Method: Compositional matrix adjust.
Identities = 337/812 (41%), Positives = 483/812 (59%), Gaps = 89/812 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V++DGR++ I+GHR++L SGSIHYPRST +MWP LI K KEGGLD ++T VFWN HEP
Sbjct: 23 VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGGLDAIETYVFWNAHEPTR 82
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
Q+DFSG DL+RF+K +Q +G+Y LRIGP++ EW YGG P WLH++PG+ FR+ N
Sbjct: 83 RQYDFSGNLDLIRFLKTIQDEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 142
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F M+ + TMIV M+K +L+ASQGGPIIL+QIENEYG V S+ E G Y++W A +A
Sbjct: 143 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIKWCANMA 202
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L GVPW+MC+QDDAP P++N CNG C + F PN+P+ P +WTENWT +Y+ +G +
Sbjct: 203 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYC-DNFT-PNNPNTPKMWTENWTGWYKNWGGK 260
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+ ED+A+ VA F + G++ NYYMYHGGTNF RTA Y+ T Y APLDE+G
Sbjct: 261 DPHRTTEDVAFAVARFFQR-GGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGN 319
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
L QPK+GHLK+LH + K + G + +++F L A +++ + F+ N ++ ++
Sbjct: 320 LNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYKTEEGSSCFIGNVNETSD 379
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------------SVEQWEEYK 431
A + F Y++P S+SILPDCKT +NTAK++ S +W
Sbjct: 380 AKINFQGTFYDVPAWSVSILPDCKTETYNTAKINTQTSVMVKKANEAENEPSTLKWSWRP 439
Query: 432 EAIPTY-----DETSLRANFLLEQMNTTKDASDYLWY----NFRFKHDPSDSESV-LKVS 481
E I E+++R L +Q + D SDYLWY N + + DP +++ L+++
Sbjct: 440 ENIDNVLLKGKGESTMRQ--LFDQKVVSNDESDYLWYMTTVNIK-EQDPVWGKNMSLRIN 496
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S HVLHAF+NG+ +G+ ++ + E+ G N ++LLS+ VGLP+ GA+ E
Sbjct: 497 STAHVLHAFVNGQHIGNYRAENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAFFEN 556
Query: 542 RVAGLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
AG+ V I G +KD S+ W Y+ GL G + Q+F S+
Sbjct: 557 VPAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFENQLF---------------SS 601
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
P TW AP GS+PV ++L+ +GKG AW+NG +IGRYW +FL S YH+
Sbjct: 602 ESPSTW-----SAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLADIDGCSAE-YHV 655
Query: 657 PRSFLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLK 715
PRSFL G N LVL EE G P ++ T+ V +C +V + ++
Sbjct: 656 PRSFLNSDGDNTLVLFEEIGGNPSLVNFQTIGVGNVCANVYEKNV--------------- 700
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN-SRAIVEKACL 774
+++ C +G+ IS I FAS+GNP GNC ++ G+C +SN + AI+ + C+
Sbjct: 701 -----------LELSC-NGKPISSIKFASFGNPGGNCGSFEKGTCEASNDAAAILTQECV 748
Query: 775 GKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
GK C++ V +KF C G+ K L V+A C
Sbjct: 749 GKEKCSIDVSEKKFGAADCGGLAKRLAVEAIC 780
>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
Length = 830
Score = 631 bits (1628), Expect = e-178, Method: Compositional matrix adjust.
Identities = 344/835 (41%), Positives = 494/835 (59%), Gaps = 91/835 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V++DGR++ I+G R++L SGSIHYPRSTPQMWP LI KAKEGGLD ++T VFWN HEP
Sbjct: 27 VSHDGRAIKIDGKRRVLISGSIHYPRSTPQMWPDLIKKAKEGGLDAIETYVFWNAHEPIR 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
++DFSG DL+RF+K +Q +GL+ LRIGP++ EW YGG+P W++++PG+ R+ N+
Sbjct: 87 REYDFSGNNDLIRFLKTIQDEGLFAVLRIGPYVCAEWNYGGIPVWVYNLPGVEIRTANKV 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F M+ + T+IV+M++ +L+ASQGGPIILSQIENEYG V ++ ++G Y+ W A +A
Sbjct: 147 FMNEMQNFTTLIVDMVRKEKLFASQGGPIILSQIENEYGNVMSAYGDEGKAYINWCANMA 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
GVPW+MC+Q DAP P+IN CNG C + PN+P+ P +WTENW +++ +G +
Sbjct: 207 DSFNIGVPWIMCQQPDAPQPMINTCNGWYCHD--FEPNNPNSPKMWTENWVGWFKNWGGK 264
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+AEDIAY VA F + G++ NYYMYHGGTNFGRTA Y+ T Y APLDEYG
Sbjct: 265 DPHRTAEDIAYSVARFF-ETGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 323
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
+ QPKWGHLKELH +K + +G + ++ +A ++ + + FL N + +
Sbjct: 324 IAQPKWGHLKELHLVLKSMENSLTNGNVSKIDLGSYVKATVYATNDSSSCFLTNTNTTTD 383
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD--------SVEQWEEYKEAI------ 434
ATV F Y +P S+SILPDC+T +NTAK++ + E+ EA+
Sbjct: 384 ATVTFKGNTYNVPAWSVSILPDCQTEEYNTAKVNVQTSIMVKRENKAEDEPEALKWVWRA 443
Query: 435 -----PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSSLGH 485
++S+ N +++Q D+SDYLWY R + D + ++L+++ GH
Sbjct: 444 ENVHNSLIGKSSVSKNTIVDQKIAANDSSDYLWYMTRLDINQKDPVWTNNTILRINGTGH 503
Query: 486 VLHAFINGEFVGS---AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
V+HAF+NGE +GS +G H+D+ E + L +G N++SLLSV VGL + G ++
Sbjct: 504 VIHAFVNGEHIGSHWATYGIHNDQ---FETNIKLKHGRNDISLLSVTVGLQNYGKEYDKW 560
Query: 543 VAGLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQIFTD--YGSRIVPWSRYGSS 595
GL + + + G K +KD SS W Y+VGL G + + F+ + + W
Sbjct: 561 QDGLVSPIELIGTKGDETIIKDLSSHKWTYKVGLHGWENKFFSQDTFFASSSKWESNELP 620
Query: 596 THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------- 642
++ LTWYKT F AP SDP+ ++L MGKG AWVNG S+GRYW S+
Sbjct: 621 INKMLTWYKTTFKAPLESDPIVVDLQGMGKGYAWVNGHSLGRYWPSYNADEDGCSDDPCD 680
Query: 643 ----------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLC 692
++ G PSQ WYH+PR F++ N LVL EE G P I+ TV V + C
Sbjct: 681 YRGEYNDTKCVSNCGKPSQRWYHVPRDFIEDGVNTLVLFEEIGGNPSQINFQTVIVGSAC 740
Query: 693 GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC 752
+ ++ +TL ++ C GR IS I FAS+GNP G C
Sbjct: 741 ANAYEN-------------KTL-------------ELSC-HGRSISDIKFASFGNPQGTC 773
Query: 753 ENYAIGSCHSSN-SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+ GSC S+N + ++V+KAC+GK SC++ V + F C + K L V+A C
Sbjct: 774 GAFTKGSCESNNEALSLVQKACVGKESCSIDVSEKTFGATNCGNMVKRLAVEAVC 828
>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
Length = 781
Score = 631 bits (1627), Expect = e-178, Method: Compositional matrix adjust.
Identities = 341/736 (46%), Positives = 446/736 (60%), Gaps = 59/736 (8%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M C +LCL LT + GG G+NV+YDGRSLII+G RK+L S SIHYPRS P M
Sbjct: 1 MNLCFILCLVSTSLTF---TLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAM 57
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI AKEGG+DV++T VFWN HE PG + F GR DLV+F K VQ G+Y+ LRIGP
Sbjct: 58 WPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGP 117
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
F+ EW +GG+P WLH +PG VFR+ N+PF HM+++ T IVN+MK +L+ASQGGPIIL
Sbjct: 118 FVAAEWNFGGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIIL 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG E+ + E G Y WAAK+AV T VPW+MC+Q DAPDPVI+ CN C
Sbjct: 178 SQIENEYGYYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCD 237
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ P SP +P +WTENW +++ +G R ED+A+ VA F K GS NYYMYH
Sbjct: 238 Q--FTPTSPKRPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQK-GGSLNNYYMYH 294
Query: 301 GGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA +T YD AP+DEYGL R PKWGHLKELH A+KLC +L G V++
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNI 354
Query: 360 NFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
+ EA I+ SS CAAF+ N D +N+ V F N Y LP S+SILPDCK V FNT
Sbjct: 355 SLGPSVEADIYTDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNT 414
Query: 419 AKLDS--------------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
AK+ S +W+ +KE + + N ++ +NTTKD +
Sbjct: 415 AKVSSPTNIVAMIPEHLQQSDKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTT 474
Query: 459 DYLWYNFRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEK 512
DYLW+ D ++ S+ L + S GH LHAF+N ++ G+ G S +FT +
Sbjct: 475 DYLWHTTSILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKN 534
Query: 513 MVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELK-DFSSFSWGYQVGL 571
+ L G N +++LS+ VGL +G + + AG+ +V I G D SS +W Y++G+
Sbjct: 535 PISLRAGKNEIAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGV 594
Query: 572 LGEKLQIFTDYGSRIVPWSRYGSSTH-QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWV 630
LGE L I+ G V W+ Q LTWYK + DAP+G +PV ++++ MGKG AW+
Sbjct: 595 LGEHLSIYQGEGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWL 654
Query: 631 NGQSIGRYWVSFL-----------------------TPQGTPSQSWYHIPRSFLKPTGNL 667
NG+ IGRYW T G PSQ WYH+PRS+ KP+GN+
Sbjct: 655 NGEEIGRYWPRISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNV 714
Query: 668 LVLLEEENGYPPGISI 683
LV+ EE+ G P I+
Sbjct: 715 LVIFEEKGGDPTKITF 730
>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
Precursor
gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
Length = 779
Score = 631 bits (1627), Expect = e-178, Method: Compositional matrix adjust.
Identities = 336/812 (41%), Positives = 486/812 (59%), Gaps = 89/812 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V++DGR++ I+GHR++L SGSIHYPRST +MWP LI K KEG LD ++T VFWN HEP
Sbjct: 22 VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGSLDAIETYVFWNAHEPTR 81
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
Q+DFSG DL+RF+K +Q +G+Y LRIGP++ EW YGG P WLH++PG+ FR+ N
Sbjct: 82 RQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 141
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F M+ + TMIV M+K +L+ASQGGPIIL+QIENEYG V S+ E G Y++W A +A
Sbjct: 142 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIQWCANMA 201
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L GVPW+MC+QDDAP P++N CNG C + F+ PN+P+ P +WTENWT +Y+ +G +
Sbjct: 202 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYC-DNFS-PNNPNTPKMWTENWTGWYKNWGGK 259
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+ ED+A+ VA F K +G++ NYYMYHGGTNF RTA Y+ T Y APLDE+G
Sbjct: 260 DPHRTTEDVAFAVARFFQK-EGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGN 318
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
L QPK+GHLK+LH + K + G + +++F L A ++Q + F+ N ++ ++
Sbjct: 319 LNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYQTEEGSSCFIGNVNETSD 378
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------------SVEQWEEYK 431
A + F Y++P S+SILPDCKT +NTAK++ S +W
Sbjct: 379 AKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTSVMVKKANEAENEPSTLKWSWRP 438
Query: 432 EAIPTY-----DETSLRANFLLEQMNTTKDASDYLWY----NFRFKHDPSDSESV-LKVS 481
E I + E+++R L +Q + D SDYLWY N + + DP +++ L+++
Sbjct: 439 ENIDSVLLKGKGESTMRQ--LFDQKVVSNDESDYLWYMTTVNLK-EQDPVLGKNMSLRIN 495
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S HVLHAF+NG+ +G+ ++ + E+ G N ++LLS+ VGLP+ GA+ E
Sbjct: 496 STAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAFFEN 555
Query: 542 RVAGLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
AG+ V I G +KD S+ W Y+ GL G + Q+F S+
Sbjct: 556 FSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFENQLF---------------SS 600
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
P TW AP GS+PV ++L+ +GKG AW+NG +IGRYW +FL+ S YH+
Sbjct: 601 ESPSTW-----SAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLSDIDGCSAE-YHV 654
Query: 657 PRSFLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLK 715
PRSFL G N LVL EE G P ++ T+ V ++C +V + ++
Sbjct: 655 PRSFLNSEGDNTLVLFEEIGGNPSLVNFQTIGVGSVCANVYEKNV--------------- 699
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS-NSRAIVEKACL 774
+++ C +G+ IS I FAS+GNP G+C ++ G+C +S N+ AI+ + C+
Sbjct: 700 -----------LELSC-NGKPISAIKFASFGNPGGDCGSFEKGTCEASNNAAAILTQECV 747
Query: 775 GKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
GK C++ V +KF C + K L V+A C
Sbjct: 748 GKEKCSIDVSEDKFGAAECGALAKRLAVEAIC 779
>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
Precursor
gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
Length = 826
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 350/854 (40%), Positives = 483/854 (56%), Gaps = 87/854 (10%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+LL LF +L+T++ + V++D R++ ING R+IL SGSIHYPRST MWP L
Sbjct: 8 RLLSLFFILITSLSLAKS-----TIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDL 62
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I KAK+GGLD ++T VFWN HEP+ ++DFSG D+VRFIK +Q GLY LRIGP++
Sbjct: 63 INKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCA 122
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW YGG P WLH++P + FR+ N F M+ + T IV MMK +L+ASQGGPIIL+QIE
Sbjct: 123 EWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVKMMKEEKLFASQGGPIILAQIE 182
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG V S+ +G Y+ W A +A L GVPW+MC+Q +AP P++ CNG C +
Sbjct: 183 NEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQ--Y 240
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
P +P P +WTENWT +++ +G + R+AED+A+ VA F + G++ NYYMYHGGTN
Sbjct: 241 EPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFF-QTGGTFQNYYMYHGGTN 299
Query: 305 FGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
FGR A Y+ T Y APLDE+G L QPKWGHLK+LH+ +K K + G + ++
Sbjct: 300 FGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGN 359
Query: 364 LQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS 423
+A I+ + F+ N + +A V F Y +P S+S+LPDC A+NTAK+++
Sbjct: 360 SIKATIYTTKEGSSCFIGNVNATADALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNT 419
Query: 424 VEQWEEYKEAIPTYDETSLR----------------ANFLLEQMNTTKDASDYLWYNFRF 467
+ P E + R A L++Q + T DASDYLWY R
Sbjct: 420 QTSIMTEDSSKPERLEWTWRPESAQKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRL 479
Query: 468 KHDPSD----SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV-HLINGTNN 522
D D L+V S HVLHA++NG++VG+ K + E+ V HL++GTN+
Sbjct: 480 HLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNH 539
Query: 523 VSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQ 577
+SLLSV VGL + G + E G+ VS+ G K KD S W Y++GL G +
Sbjct: 540 ISLLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDK 599
Query: 578 IFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGR 637
+F+ W+ T + LTWYK F AP G +PV ++L +GKGEAW+NGQSIGR
Sbjct: 600 LFSIKSVGHQKWANEKLPTGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGR 659
Query: 638 YWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTG-NLLVLLEEE 674
YW SF + G P+Q WYH+PRSFL +G N + L EE
Sbjct: 660 YWPSFNSSDDGCKDECDYRGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEM 719
Query: 675 NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSG 734
G P ++ TV V T+C + + KV++ C
Sbjct: 720 GGNPSMVNFKTVVVGTVCARAHEHN--------------------------KVELSC-HN 752
Query: 735 RKISKILFASYGNPNGNCENYAIGSCHSSNSRA-IVEKACLGKRSCTVPVWTEKFYGD-P 792
R IS + FAS+GNP G+C ++A+G+C A V K C+GK +CTV V ++ F
Sbjct: 753 RPISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKECVGKLNCTVNVSSDTFGSTLD 812
Query: 793 CPGIPKALLVDAQC 806
C PK L V+ +C
Sbjct: 813 CGDSPKKLAVELEC 826
>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 347/838 (41%), Positives = 476/838 (56%), Gaps = 95/838 (11%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+NV+YD ++IING R+++FSGSIHYPRST MWP LI KAK+GGLD ++T +FW+ HE
Sbjct: 2 GDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRHE 61
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ ++DFSG + ++F + VQ GLY+ +RIGP++ EW YGG P WLH++PGI R+D
Sbjct: 62 PQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTD 121
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+ +K M + T IVNM K A L+ASQGGPIIL+QIENEYG V + G Y+ W A
Sbjct: 122 NQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWCA 181
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A GVPW+MC+Q DAP P+IN CNG C ++F+ PN+P P ++TENW +++ +
Sbjct: 182 QMAESFNIGVPWIMCQQSDAPQPIINTCNGFYC-DSFS-PNNPKSPKMFTENWVGWFKKW 239
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDE 325
GD+ RSAED+A+ VA F + G + NYYMYHGGTNFGRT+ +T YD APLDE
Sbjct: 240 GDKDPYRSAEDVAFSVARFF-QSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDE 298
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-----------SS 374
YG L QPKWGHLK+LHS++KL K + +G + F F +
Sbjct: 299 YGNLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTFKTFGSFVTLTKFSNPTTK 358
Query: 375 ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-------- 426
E FL N K + Y +P S+SI+ CK FNTAK++S
Sbjct: 359 ERFCFLSNTXKADGK--------YFVPAWSVSIIDGCKKEVFNTAKINSQTSIFVKVQNE 410
Query: 427 -------WEEYKEAIPT--YDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDPSDSE 475
W EA+ + + + N LLEQ TT D+SDYLWY N S
Sbjct: 411 KENVKLSWVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSIHN 470
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
L+V++ GHVLHAF+N ++GS G + +SF EK + L GTN ++LLS VGL +
Sbjct: 471 VTLQVNTKGHVLHAFVNTRYIGSQWGNNG-QSFVFEKPILLKAGTNIITLLSATVGLKNY 529
Query: 536 GAYLERRVAGLRN--VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
A+ + G+ + + G +K D SS W Y+VGL GE Q++ S+ W+
Sbjct: 530 DAFYDTLPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEIKQLYNPVFSQETSWNTL 589
Query: 593 G-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----- 646
+S + +TWYKT F P+G DPV +++ MGKGEAW+NGQSIGR+W SF+
Sbjct: 590 NKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWPSFIAGNDNCSE 649
Query: 647 -----------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G PSQ WYHIPRSFL N LVL EE G P +S+ T+++
Sbjct: 650 TCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQTITIG 709
Query: 690 TLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPN 749
T+CG+ ++ +++ C IS+I FASYGNP
Sbjct: 710 TICGNANEGS--------------------------TLELSCQGEYIISEIQFASYGNPK 743
Query: 750 GNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G C ++ GS +NS ++EK C G +SC+V V + F + L+V A C+
Sbjct: 744 GKCGSFKQGSWDVTNSALLLEKTCKGMKSCSVDVSAKLFGLGDAVNLSARLVVQALCS 801
>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
Length = 826
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 348/854 (40%), Positives = 484/854 (56%), Gaps = 87/854 (10%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+LL LF +L+T+ ++ V++D R++ ING R+IL SGSIHYPRST MWP L
Sbjct: 8 RLLSLFFILITSFSLANS-----TIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDL 62
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I KAK+GGLD ++T VFWN HEP+ ++DFSG D+VRFIK +Q GLY LRIGP++
Sbjct: 63 INKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCA 122
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW YGG P WLH++P + FR+ N F M+ + T IV MMK +L+ASQGGPIIL+QIE
Sbjct: 123 EWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTKIVEMMKEEKLFASQGGPIILAQIE 182
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG V S+ G Y+ W A +A L GVPW+MC+Q +AP P++ CNG C +
Sbjct: 183 NEYGNVISSYGAAGKAYIDWCANMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQ--Y 240
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
P +P P +WTENWT +++ +G + R+AED+A+ VA F + G++ NYYMYHGGTN
Sbjct: 241 EPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSVARFF-QTGGTFQNYYMYHGGTN 299
Query: 305 FGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
FGR A Y+ T Y AP+DE+G L QPKWGHLK+LH +K K + G + ++
Sbjct: 300 FGRVAGGPYITTSYDYHAPIDEFGNLNQPKWGHLKQLHRVLKSMEKSLTYGNISRIDLGN 359
Query: 364 LQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS 423
+A I+ + F+ N + NA V F Y +P S+S+LP+C A+NTAK+++
Sbjct: 360 SIKATIYTTKEGSSCFIGNVNATANALVNFKGKDYHVPAWSVSVLPECDKEAYNTAKVNT 419
Query: 424 VE-------------QWE---EYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF 467
+W E + + L A L++Q + T DASDYLWY R
Sbjct: 420 QTSIMTEDSSKPEKLEWTWRPESAQKMILKSSGDLIAKGLVDQKDVTNDASDYLWYMTRV 479
Query: 468 KHDPSD----SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV-HLINGTNN 522
D D L+V S HVLHA++NG++VG+ K + EK V HL++GTN+
Sbjct: 480 HLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYVGNQFVKDGKFDYRFEKKVNHLVHGTNH 539
Query: 523 VSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQ 577
+SLLSV VGL + GA+ E G+ VS+ G K KD S W Y++GL G +
Sbjct: 540 ISLLSVSVGLQNYGAFFESGPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNNK 599
Query: 578 IFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGR 637
+F+ + W+ T + LTWYK F AP G +PV ++ +GKGEAW+NGQSIGR
Sbjct: 600 LFSTKSVGHIKWANEMFPTSRMLTWYKAKFKAPLGKEPVIVDFNGLGKGEAWINGQSIGR 659
Query: 638 YWVSFLTPQ----------------------GTPSQSWYHIPRSFLKPTG-NLLVLLEEE 674
YW SF + G P+Q WYH+PRSFLK +G N + L EE
Sbjct: 660 YWPSFNSSDDGCKDECDYRGEYGSDKCAFMCGEPTQRWYHVPRSFLKASGHNTITLFEEM 719
Query: 675 NGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSG 734
G P ++ TV V T+C + + KV++ C +
Sbjct: 720 GGNPSMVNFKTVVVGTVCARAHEHN--------------------------KVELSCHN- 752
Query: 735 RKISKILFASYGNPNGNCENYAIGSCH-SSNSRAIVEKACLGKRSCTVPVWTEKFYGD-P 792
IS + FAS+GNP G+C +A+G+C ++ V K C+GK +CT+ V ++ F
Sbjct: 753 HPISAVKFASFGNPVGHCGTFAVGTCQGDKDAVKTVAKECVGKLNCTINVSSDTFGSTLD 812
Query: 793 CPGIPKALLVDAQC 806
C PK L V+ +C
Sbjct: 813 CGDSPKKLAVELEC 826
>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
Precursor
gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
Length = 715
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 336/695 (48%), Positives = 431/695 (62%), Gaps = 45/695 (6%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD RSL ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ FS R DLVRF+K V+ GLYV LRIGP++ EW YGG P WL VPGI FR+DN PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E YV WAAK+AV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
GVPW+MCKQDDAPDPVIN CNG C + F PNS +KP++WTE W+ ++ +G
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYC-DDFT-PNSKNKPSMWTEAWSGWFTAFGGTV 260
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+A+ VA FI K GS++NYYMYHGGTNF RTA ++ T Y AP+DEYGLL
Sbjct: 261 PQRPVEDLAFAVARFIQK-GGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLL 319
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNN 388
RQPKWGHL LH A+K +++G N ++A++F+ SS +CAAFL N
Sbjct: 320 RQPKWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAA 379
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-----------WEEYKEAIPTY 437
A V F+ Y+LP SIS+LPDC+T +NTA + + W+ Y EA +
Sbjct: 380 ARVAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAKMNPAGGFTWQSYGEATNSL 439
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLHAFI 491
DET+ + L+EQ++ T D SDYLWY D S L V S GH + F+
Sbjct: 440 DETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSAGHSVQVFV 499
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVS 550
NG++ G+A+G + T V + G+N +S+LS VGLP+ G + E + L V+
Sbjct: 500 NGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWNIGVLGPVT 559
Query: 551 IQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
+ G E K D S W YQ+GL GEKL + + GS V W G++ QP+TW++ F+A
Sbjct: 560 LSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWG--GAAGKQPVTWHRAYFNA 617
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------GTPS 650
P G PVA++L SMGKG+AWVNG IGRYW + G S
Sbjct: 618 PAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKASGNCGGCSYAGTYSEKKCQANCGDAS 677
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
Q WYH+PRS+L P+GNL+VLLEE G G+++ T
Sbjct: 678 QRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMT 712
>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
Length = 717
Score = 630 bits (1624), Expect = e-177, Method: Compositional matrix adjust.
Identities = 336/695 (48%), Positives = 431/695 (62%), Gaps = 45/695 (6%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD RSL ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP G
Sbjct: 25 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 84
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ FS R DLVRF+K V+ GLYV LRIGP++ EW YGG P WL VPGI FR+DN PF
Sbjct: 85 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 144
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E YV WAAK+AV
Sbjct: 145 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 204
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
GVPW+MCKQDDAPDPVIN CNG C + F PNS +KP++WTE W+ ++ +G
Sbjct: 205 ATNAGVPWIMCKQDDAPDPVINTCNGFYC-DDFT-PNSKNKPSMWTEAWSGWFTAFGGTV 262
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+A+ VA FI K GS++NYYMYHGGTNF RTA ++ T Y AP+DEYGLL
Sbjct: 263 PQRPVEDLAFAVARFIQK-GGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLL 321
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNN 388
RQPKWGHL LH A+K +++G N ++A++F+ SS +CAAFL N
Sbjct: 322 RQPKWGHLTNLHKAIKQAEPALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAA 381
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-----------WEEYKEAIPTY 437
A V F+ Y+LP SIS+LPDC+T +NTA + + W+ Y EA +
Sbjct: 382 ARVAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAKMNPAGGFTWQSYGEATNSL 441
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLHAFI 491
DET+ + L+EQ++ T D SDYLWY D S L V S GH + F+
Sbjct: 442 DETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSAGHSVQVFV 501
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVS 550
NG++ G+A+G + T V + G+N +S+LS VGLP+ G + E + L V+
Sbjct: 502 NGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWNIGVLGPVT 561
Query: 551 IQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
+ G E K D S W YQ+GL GEKL + + GS V W G++ QP+TW++ F+A
Sbjct: 562 LSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWG--GAAGKQPVTWHRAYFNA 619
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------GTPS 650
P G PVA++L SMGKG+AWVNG IGRYW + G S
Sbjct: 620 PAGGAPVALDLGSMGKGQAWVNGHLIGRYWSYKASGNCGGCSYAGTYSEKKCQANCGDAS 679
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
Q WYH+PRS+L P+GNL+VLLEE G G+++ T
Sbjct: 680 QRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMT 714
>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 831
Score = 630 bits (1624), Expect = e-177, Method: Compositional matrix adjust.
Identities = 352/826 (42%), Positives = 495/826 (59%), Gaps = 67/826 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
V+YD R+L ++G+R++L SGSIHYPRSTP MWP LIAKAK+GGLDV+QT VFW+ HEP
Sbjct: 24 TVSYDQRALKLDGNRRMLVSGSIHYPRSTPTMWPGLIAKAKKGGLDVIQTYVFWSGHEPT 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G ++F+GR DL +F++ V G+YV LRIGP++ EW +GG P WL +PGI FR+DNE
Sbjct: 84 QGVYNFAGRYDLPKFLRLVHEAGMYVNLRIGPYVCAEWNFGGFPGWLRFLPGIEFRTDNE 143
Query: 149 PFKFHMKR-YATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
FK H+ + + ++++ +R + Q +I +QIENEYG ++ + E G Y+ W A
Sbjct: 144 SFKVHLSHSFTSSLISVY--SRSFNIQ--LVICAQIENEYGSIDAVYGEAGQKYLNWIAN 199
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV VPW+MC Q DAP VI+ CNG C + F PNS KPA+WTENWT ++Q +G
Sbjct: 200 MAVATNISVPWIMCNQPDAPPSVIDTCNGFYC-DGFR-PNSEGKPALWTENWTGWFQSWG 257
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYG 327
+ A R +DIA+ VA F K GS+++YYMYHGGTNF R+A V T Y AP+DEYG
Sbjct: 258 EGAPTRPVQDIAFAVARFFQK-GGSFMHYYMYHGGTNFERSAMEGVTTNYDYDAPIDEYG 316
Query: 328 LLRQPKWGHLKELHSAVKLCLKPM--LSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKD 384
+RQPKWGHLK+LH+A+KLC + + V ++ QEA ++ S+ CAAFL +
Sbjct: 317 DVRQPKWGHLKDLHAALKLCELCLVGVDTVPSEISLGPYQEAHVYNSSTGACAAFLASWG 376
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------------SVEQWEEYKE 432
+++TV F Y+LP S+SILPDCK+V FNTAK+ V W Y+E
Sbjct: 377 T-DDSTVLFQGQSYDLPAWSVSILPDCKSVVFNTAKVGVQSMTMTMQSAIPVTNWVSYRE 435
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESVLKVSSLGHVL 487
+ + T N L+EQ+ TTKD +DYLWY + SD +++ L +S L
Sbjct: 436 PLEPWGST-FSTNELVEQIATTKDTTDYLWYTTNVEVAESDAPNGLAQATLVMSYLRDAA 494
Query: 488 HAFINGEFVG--SAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
H F+N G SAHG + +S +L G N+V +LS+ GL +G +LE+ AG
Sbjct: 495 HIFVNKWLTGTKSAHGSEASQSISLRP------GINSVKVLSMTTGLQGTGPFLEKEKAG 548
Query: 546 LR-NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ-PLTW 602
++ + ++G +W YQVGL GE ++F GS WS ++Q L+W
Sbjct: 549 IQFGIRVEGLPSGAIIMQRNTWTYQVGLQGENNRLFESNGSLSAVWSTSTDVSNQMSLSW 608
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVS--------------------- 641
+KT FD P + VA++L SMGKG+ WVNG ++GRYW S
Sbjct: 609 FKTTFDMPERNGTVALDLSSMGKGQVWVNGINLGRYWSSCIAHTDGCVDNCDYRGSHSES 668
Query: 642 -FLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHL 700
LT G PSQSWYH+PR +L NLLVL EE+ G P I+I +C +S+SH
Sbjct: 669 KCLTKCGQPSQSWYHVPREWLLSKQNLLVLFEEQEGNPEAITIAPRIPQHICSRMSESH- 727
Query: 701 PPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
P I S +R +T P P + + C G+ IS+I FASYG P+G+C ++ + SC
Sbjct: 728 PFPIPLSSSTKRGSQT--STPPIAP-LALECADGQHISRISFASYGTPSGDCGDFKLSSC 784
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
H+++S+ ++ KAC+G++ C VP+ + GDPCPG+ K+L A+C
Sbjct: 785 HANSSKDVLSKACVGRQKCLVPIVSSICGGDPCPGMIKSLAATAEC 830
>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
Length = 663
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 326/631 (51%), Positives = 419/631 (66%), Gaps = 27/631 (4%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD +++II+G R+IL SGSIHYPRSTPQMWP LI KAK+G +DV+QT VFWN HEP P
Sbjct: 34 VSYDHKAIIIDGQRRILISGSIHYPRSTPQMWPDLIQKAKDG-VDVIQTYVFWNGHEPSP 92
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++ F R DLVRFIK VQ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEP
Sbjct: 93 GKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIEFRTDNEP 152
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMKA +L+ +QGGPIILSQIENE+G VE G Y +WAA++A
Sbjct: 153 FKAAMQKFTEKIVSMMKAEKLFETQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAQMA 212
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V L TGVPWVMCKQDDAPDPVIN CNG C E F PN +KP +WTENWT ++ +G
Sbjct: 213 VGLDTGVPWVMCKQDDAPDPVINTCNGFYC-ENFV-PNQKNKPKMWTENWTGWFTAFGGP 270
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R AED+A+ VA FI + GS+VNYYMYHGGTNFGRTA ++ T Y APLDEYGL
Sbjct: 271 TPQRPAEDVAFSVARFI-QNGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 329
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKRN 387
LR+PKWGHL++LH A+KLC ++S + QE +F S CAAFL N D +
Sbjct: 330 LREPKWGHLRDLHKAIKLCESALVSTDPTVTSLGNNQEVHVFNPKSGSCAAFLANYDTTS 389
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---DSVEQ--------WEEY-KEAIP 435
+A V F + YELPP SISILPDCKT FNTA+L S++Q W+ Y +E+
Sbjct: 390 SAKVNFKIMQYELPPWSISILPDCKTAVFNTARLGAQSSLKQMTPVSTFSWQSYIEESAS 449
Query: 436 TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHA 489
+ D+ + + L EQ+N T+DASDYLWY D ++ + +L + S GH LH
Sbjct: 450 SSDDKTFTTDGLWEQLNVTRDASDYLWYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHV 509
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
FING+ G+ +G + T + V + G N +SLLS+ VGL + G + E+ G L
Sbjct: 510 FINGQLSGTVYGGVDNPKLTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGP 569
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKTV 606
V+++G E +D S W Y++GL GE L + T GS V W S + QPLTWYKT
Sbjct: 570 VTLRGLNEGTRDLSKQQWSYKIGLKGEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTT 629
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGR 637
F+AP G++P+A+++ +MGKG W+N QSIGR
Sbjct: 630 FNAPAGNEPLALDMSTMGKGLIWINSQSIGR 660
>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags:
Precursor
gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
Length = 741
Score = 629 bits (1621), Expect = e-177, Method: Compositional matrix adjust.
Identities = 330/714 (46%), Positives = 437/714 (61%), Gaps = 54/714 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD RSL I R+++ S +IHYPRS P MWP L+ AKEGG + +++ VFWN HEP
Sbjct: 31 NVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHEPS 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F GR ++V+FIK VQ G+++ LRIGPF+ EW YGG+P WLH VPG VFR+DNE
Sbjct: 91 PGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRADNE 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+K +M+ + T IVN++K +L+A QGGPIILSQ+ENEYG E + E G Y +W+A +
Sbjct: 151 PWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSASM 210
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV GVPW+MC+Q DAP VI+ CNG C + PN+PDKP IWTENW +++ +G
Sbjct: 211 AVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQ--FTPNTPDKPKIWTENWPGWFKTFGG 268
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+AY VA F K GS NYYMYHGGTNFGRT+ +T YD +AP+DEYG
Sbjct: 269 RDPHRPAEDVAYSVARFFGK-GGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYG 327
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLK+LH A+ L ++SG + EA ++ SS CAAFL N D +
Sbjct: 328 LPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDK 387
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE------------QWEEY 430
N+ V F N Y LP S+SILPDCKT FNTAK+ S VE +WE +
Sbjct: 388 NDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSSKVEMLPEDLKSSSGLKWEVF 447
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLG 484
E + N L++ +NTTKD +DYLWY ++ S VL + S G
Sbjct: 448 SEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFIESKG 507
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LH FIN E++G+A G + F L+K V L G NN+ LLS+ VGL ++G++ E A
Sbjct: 508 HTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYEWVGA 567
Query: 545 GLRNVSIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWS-RYGSSTHQPLTW 602
GL +VSI+G K + ++ W Y++G+ GE L++F S V W+ QPLTW
Sbjct: 568 GLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQPLTW 627
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF-------------------- 642
YK V + P+GS+PV +++ISMGKG AW+NG+ IGRYW
Sbjct: 628 YKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDYRGKF 687
Query: 643 -----LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTL 691
LT G PSQ WYH+PRS+ K +GN LV+ EE+ G P I + V+ +
Sbjct: 688 MPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLSKRKVSVV 741
>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
Length = 826
Score = 628 bits (1619), Expect = e-177, Method: Compositional matrix adjust.
Identities = 343/814 (42%), Positives = 470/814 (57%), Gaps = 92/814 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD RSLIING R+++FSG++HYPRST QMWP +I KAK+GGLD +++ VFW+ HEP
Sbjct: 28 VTYDARSLIINGERRVIFSGAVHYPRSTVQMWPDIIQKAKDGGLDAIESYVFWDRHEPVR 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
++DFSG D ++F + +Q GLY LRIGP++ EW +GG P WLH++PGI R+DN
Sbjct: 88 REYDFSGNLDFIKFFQIIQEAGLYAILRIGPYVCAEWNFGGFPLWLHNMPGIELRTDNPI 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+K M+ + T IVNM K A+L+ASQGGPIIL+QIENEYG + + E G Y++W A++A
Sbjct: 148 YKNEMQIFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNIMTDYGEAGKTYIKWCAQMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ GVPW+MC+Q DAP P+IN CNG C ++F PN+P P ++TENW ++Q +G+
Sbjct: 208 LAQNIGVPWIMCQQHDAPQPMINTCNGHYC-DSFQ-PNNPKSPKMFTENWIGWFQKWGER 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
RSAED A+ VA F + G NYYMYHGGTNFGRTA Y+ T Y APLDEYG
Sbjct: 266 VPHRSAEDSAFSVARFF-QNGGILNNYYMYHGGTNFGRTAGGPYMTTSYEYDAPLDEYGN 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF-SKLQEAFIFQGSSECAAFLVNKDKRN 387
L QPKWGHLK+LH+A+KL K + +G +F +++ + E FL N +
Sbjct: 325 LNQPKWGHLKQLHAAIKLGEKIITNGTRTDKDFGNEVTLTTYTHTNGERFCFLSNTNDSK 384
Query: 388 NATVYF-SNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ----------------WEEY 430
+A V + Y LP S++IL C FNTAK++S W
Sbjct: 385 DANVDLQQDGNYFLPAWSVTILDGCNKEVFNTAKVNSQTSIMVKKSDDASNKLTWAWIPE 444
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD--SESVLKVSSLGHVLH 488
K+ + + + + N LLEQ T D SDYLWY + + S + L+V++ GH L
Sbjct: 445 KKKDTMHGKGNFKVNQLLEQKELTFDVSDYLWYMTSVDINDTSIWSNATLRVNTRGHTLR 504
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
A++NG VG + +FT EK V L G N ++LLS VGLP+ GA ++ G+
Sbjct: 505 AYVNGRHVGYKFSQWGG-NFTYEKYVSLKKGLNVITLLSATVGLPNYGAKFDKIKTGIAG 563
Query: 549 VSIQ---GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS--THQP---- 599
+Q E D S+ W Y++GL GEK +++ P R G S T+ P
Sbjct: 564 GPVQLIGNNNETIDLSTNLWSYKIGLNGEKKRLYD-------PQPRIGVSWRTNSPYPIG 616
Query: 600 --LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------- 646
LTWYK F AP+G+DPV ++L+ +GKGEAWVNGQSIGRYW S++T
Sbjct: 617 RSLTWYKADFVAPSGNDPVVVDLLGLGKGEAWVNGQSIGRYWTSWITATNGCSDTCDYRG 676
Query: 647 ------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGH 694
G PSQ WYH+PRSFLK N LVL EE G P +S TV T+C
Sbjct: 677 KYVPAQKCNTNCGNPSQRWYHVPRSFLKNDKNTLVLFEEIGGNPQNVSFQTVITGTICAQ 736
Query: 695 VSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCEN 754
V + L +++ C G+ IS+I F+S+GNP GNC +
Sbjct: 737 VQEGAL--------------------------LELSCQGGKTISQIQFSSFGNPTGNCGS 770
Query: 755 YAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKF 788
+ G+ +++ +++VE AC+G+ SC V E F
Sbjct: 771 FKKGTWEATDGQSVVEAACVGRNSCGFMVTKEAF 804
>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 846
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 321/744 (43%), Positives = 450/744 (60%), Gaps = 38/744 (5%)
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q F GR DL++F+K +Q+ +Y +RIGPFI+ EW +GGLP+WL ++P I+FR++NEP+
Sbjct: 105 QVQFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEPY 164
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+++ IV +K A ++ASQGGP+IL+QIENEYG ++ + +G Y+ WAA++A+
Sbjct: 165 KKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMAI 224
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
TGVPW+MCKQ AP VI CNGR CG+T+ + +KP +WTENWT+ ++ +GD+
Sbjct: 225 STNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDK-NKPRLWTENWTAQFRAFGDQL 283
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLR 330
+RSAEDIAY V F AK G+ VNYYMY+GGTNFGRT ++YVLTGYYD+ P+DEYG+ +
Sbjct: 284 ALRSAEDIAYSVLRFFAK-GGTLVNYYMYYGGTNFGRTGASYVLTGYYDEGPVDEYGMPK 342
Query: 331 QPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKDKRNN 388
PK+GHL++LH+ +K + L G + EA F+ E C AF+ N + +
Sbjct: 343 APKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGED 402
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTA---------------KLDSVEQWEEYKEA 433
TV F Y +P S+SIL DCK V +NT KL WE Y E
Sbjct: 403 GTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQKLAKSNAWEMYSEP 462
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSLGHVL 487
IP Y TS+R +EQ N TKD SDYLWY +FR + D D V++V S H L
Sbjct: 463 IPRYKLTSIRNKEPMEQYNLTKDDSDYLWYTTSFRLEADDLPFRGDIRPVVQVKSTSHAL 522
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
F+N F G+ G +K F E ++L G N+++LLS +G+ DSG L G++
Sbjct: 523 MGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEVKGGIQ 582
Query: 548 NVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
+ +IQG D WG++V L GE +I+T+ G V W ++T + +TWYK
Sbjct: 583 DCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKW--VPATTGRAVTWYKRY 640
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
FD P G DPV +++ SMGKG +VNG+ +GRYW S+ T G PSQ+ YHIPR FLKP N
Sbjct: 641 FDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVGGVPSQAMYHIPRPFLKPKNN 700
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRS---QNQRTLKTHKRIPGR 723
LLV+ EEE G P GI I TV +C +S+ + + +W Q + + H
Sbjct: 701 LLVIFEEELGKPEGILIQTVRRDDICVFISEHNPAQIKTWDKDGGQIKVIAEDHS----- 755
Query: 724 RPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPV 783
+ ++CP + I +++FAS+GNP G+C N+ GSCH+ N++ IV K CLGK+SC +PV
Sbjct: 756 -TRGILKCPPKKTIQEVVFASFGNPEGSCANFTAGSCHTPNAKDIVAKECLGKKSCVLPV 814
Query: 784 WTEKFYGD-PCPGIPKALLVDAQC 806
+ D CP L V +C
Sbjct: 815 LHTVYGADINCPTTTATLAVQVRC 838
>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
Length = 725
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 331/697 (47%), Positives = 428/697 (61%), Gaps = 46/697 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R+++ING R+IL SGSIHYPRSTP+MWP L+ KAK+GGLDVVQT VFWN HEPQ
Sbjct: 31 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVVQTYVFWNGHEPQQ 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLVRF+K + GL+V LRIGP++ EW +GG P WL VPG+ FR+DN P
Sbjct: 91 GQYYFGDRYDLVRFVKLAKQAGLFVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNAP 150
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV+MMKA L+ QGGPIIL+Q+ENEYG +E PY WAAK+A
Sbjct: 151 FKAAMQAFVEKIVSMMKAEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKMA 210
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PNS KP +WTE WT ++ +G
Sbjct: 211 VATGAGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNSNSKPTMWTEAWTGWFTAFGGA 268
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA FI K GS+VNYYMYHGGTNF RT+ ++ T Y AP+DEYGL
Sbjct: 269 VPHRPVEDMAFAVARFIQK-GGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 327
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K ++SG ++A++++ SS CAAFL N
Sbjct: 328 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQTIGNYEKAYVYKSSSGACAAFLSNYHTNA 387
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPT 436
A V F+ Y+LP SIS+LPDC+T FNTA + S W+ Y EA +
Sbjct: 388 AARVVFNGRRYDLPAWSISVLPDCRTAVFNTATVSSPSAPARMTPAGGFSWQSYSEATNS 447
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKVSSLGHVLHAF 490
D+ + + L+EQ++ T D SDYLWY N + S L + S GH L F
Sbjct: 448 LDDRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTIYSAGHALQVF 507
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNV 549
+NG+ G+A+G + T V + G+N +S+LS VGLP+ G + E V L V
Sbjct: 508 VNGQSYGAAYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYEAWNVGVLGPV 567
Query: 550 SIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
++ G E K D S+ W YQ+GL GE L + + GS V W ++ QPLTW+K F+
Sbjct: 568 TLSGLNEGKRDLSNQKWTYQIGLHGESLGVHSVAGSSSVEWGS--AAGKQPLTWHKAYFN 625
Query: 609 APTGSDPVAINLISMGKGEAWVNGQSIGRYW--------------------VSFLTPQGT 648
AP+G+ PVA+++ SMGKG+AWVNG IGRYW T G
Sbjct: 626 APSGNAPVALDMSSMGKGQAWVNGHHIGRYWSYKATGGSCGGCSYAGTYSETKCQTGCGD 685
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ +YH+PRS+L P+GNLLV+LEE G G+ + T
Sbjct: 686 VSQRYYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLVT 722
>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
sativus]
Length = 712
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 338/719 (47%), Positives = 443/719 (61%), Gaps = 53/719 (7%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+ + LF LLT +G + G VTYD +++IIN R+IL SGSIHYPRSTPQMWP L
Sbjct: 3 KTVLLFLSLLTWVGSTIGA------VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDL 56
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I KAK+GGLD+++T VFWN HEP G+ + D + + + + +V L P
Sbjct: 57 IQKAKDGGLDIIETYVFWNGHEPSEGKVTW---EDFL-YEQILYINCFHVALFXFPPYFX 112
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
+ G P WL VPGI FR+DNEPFK M+++ T IV+MMK +LY +QGGPIILSQIE
Sbjct: 113 FQKFSGFPIWLKFVPGIAFRTDNEPFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIE 172
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG VE G Y +W A++AVDL+TGVPWVMCKQ+DAPDP+I+ CNG C E F
Sbjct: 173 NEYGPVEWQIGAPGKSYTKWFAQMAVDLKTGVPWVMCKQEDAPDPLIDTCNGFYC-ENFK 231
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
PN KP IWTENW+ +Y +G R ED+A+ VA FI + GS VNYY+YHGGTN
Sbjct: 232 -PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFI-QNNGSLVNYYVYHGGTN 289
Query: 305 FGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
FGRT+ ++ T Y AP+DEYGL+R+PKWGHL++LH A+KLC ++S S K
Sbjct: 290 FGRTSGLFIATSYDFDAPIDEYGLIREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKN 349
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-- 422
QEA +F+ SS CAAFL N D + V F N Y+LPP SISILPDCKTV FNTA++
Sbjct: 350 QEARVFKSSSACAAFLANYDTSASVKVNFWNNPYDLPPWSISILPDCKTVTFNTAQIGVK 409
Query: 423 ---------SVEQWEEYKEA-IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS 472
S W YKE Y + + + L+EQ++ T D +DYLWY D +
Sbjct: 410 SYEAKMMPISSFGWLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDYLWYMQDISIDST 469
Query: 473 D------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLL 526
+ +L V+S GH+LH FING+ GS +G D T K V+L G N +S+L
Sbjct: 470 EGFLKSGKWPLLSVNSAGHLLHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSML 529
Query: 527 SVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGS 584
SV VGLP+ G + + AG L V+++G E +D S + W Y+VGL GE L +++D GS
Sbjct: 530 SVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLSGESLNLYSDKGS 589
Query: 585 RIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
V W++ + QPLTWYKT F P G++P+ +++ SM KG+ WVNG+SIGRY+ ++
Sbjct: 590 NSVQWTKGSLTQKQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWVNGRSIGRYFPGYIA 649
Query: 645 PQ--------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYHIPR +L P+ NLLV+ EE G P GIS+
Sbjct: 650 NGKCDKCSYAGLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEEIGGSPDGISL 708
>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
Length = 722
Score = 627 bits (1616), Expect = e-176, Method: Compositional matrix adjust.
Identities = 337/697 (48%), Positives = 432/697 (61%), Gaps = 46/697 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V YD R++I+NG R+IL SGSIHYPRSTP+MWP L+ KAK+GGLDV+QT VFWN HEP
Sbjct: 26 SVGYDHRAIIVNGKRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVLQTYVFWNGHEPS 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F R DLV+FIK Q GLYV LRIGP+I EW +GG P WL VPGI FR+DN
Sbjct: 86 PGKYYFEDRYDLVKFIKLAQQHGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDNR 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF M+++ IV MMKA RL+ +QGGPIILSQIENEYG VE G Y +WAAK+
Sbjct: 146 PFMAAMEKFTQKIVYMMKAERLFQTQGGPIILSQIENEYGPVEWEIGAPGKSYTQWAAKM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQ+DAPDP+I+ CNG C E F PN KP +WTE WT +Y +G
Sbjct: 206 AVGLNTGVPWVMCKQEDAPDPIIDTCNGFYC-ENFT-PNKNYKPKMWTEIWTGWYTEFGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R A+D+A+ VA FI + GS+ NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 264 AVPTRPAQDLAFSVARFI-QNGGSFANYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L R+PK+ HLK +H A+K+ +L+ QEA ++Q S CAAFL N D +
Sbjct: 323 LPREPKYSHLKYMHKAIKMAEPALLATDAAVSKLGNNQEAHVYQSRSGCAAFLANYDTKY 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE----------QWEEYKEAIPT- 436
V F N Y LPP SISILPDCKT FNTA++ W+ Y E + T
Sbjct: 383 PVRVTFWNKQYNLPPWSISILPDCKTEVFNTARVGQSPPTKMTPVAHLSWQAYIEDVATS 442
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHAF 490
D+ + + L EQ++ T D +DYLWY P++ LKV S GH LH F
Sbjct: 443 ADDNAFTSVGLREQISLTWDNTDYLWYMTDITIGPNEQFLRTGKYPTLKVDSAGHALHVF 502
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
ING+ GSA+G + + V L G N ++LLSV VGL + G + E G L V
Sbjct: 503 INGQLSGSAYGTLAFPKLEFNQGVKLRAGINKLALLSVSVGLANVGLHFETWNTGVLGPV 562
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKTV 606
++ G D + + W Y++G+ GE + + T GS V W + GS + ++PLTWYK +
Sbjct: 563 TLAGVNSGTWDMTRWQWTYKIGMRGEDMSLHTVSGSSSVEWVQ-GSLLAQYRPLTWYKAI 621
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQ 646
+AP G+ P+A+++ SMGKG+ W+NGQSIGR+W ++ T
Sbjct: 622 LNAPPGNAPLALDMGSMGKGQMWINGQSIGRHWPAYKAHGSCGACYYAGTYTENKCRTNC 681
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYH+PRS+LK +GNLLV+ EE G P IS+
Sbjct: 682 GQPSQRWYHVPRSWLKSSGNLLVVFEEWGGDPTKISL 718
>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 741
Score = 626 bits (1615), Expect = e-176, Method: Compositional matrix adjust.
Identities = 329/714 (46%), Positives = 436/714 (61%), Gaps = 54/714 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD RSL I R+++ S +IHYPRS P MWP L+ AKEGG + +++ VFWN HEP
Sbjct: 31 NVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHEPS 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F GR ++V+FIK VQ G+++ LRIGPF+ EW YGG+P WLH VPG VFR+DNE
Sbjct: 91 PGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRADNE 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+K +M+ + T IVN++K +L+A QGGPIILSQ+ENEYG E + E G Y +W+A +
Sbjct: 151 PWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSASM 210
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV GVPW+MC+Q DAP VI+ CNG C + PN+PDKP IWTENW +++ +G
Sbjct: 211 AVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQ--FTPNTPDKPKIWTENWPGWFKTFGG 268
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+AY VA F K GS NYYMYHGGTNFGRT+ +T YD +AP+DEYG
Sbjct: 269 RDPHRPAEDVAYSVARFFGK-GGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYG 327
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLK+LH A+ L ++SG + EA ++ SS CAAFL N D +
Sbjct: 328 LPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDK 387
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE------------QWEEY 430
N+ V F N Y LP S+SILPDCKT FNTAK+ S VE +WE +
Sbjct: 388 NDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSSKVEMLPEDLKSSSGLKWEVF 447
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLG 484
E + N L++ +NTTKD +DYLWY ++ S VL + S G
Sbjct: 448 SEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFIESKG 507
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LH FIN E++G+A G + F L+K V L G N+ LLS+ VGL ++G++ E A
Sbjct: 508 HTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGETNIDLLSMTVGLANAGSFYEWVGA 567
Query: 545 GLRNVSIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWS-RYGSSTHQPLTW 602
GL +VSI+G K + ++ W Y++G+ GE L++F S V W+ QPLTW
Sbjct: 568 GLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQPLTW 627
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF-------------------- 642
YK V + P+GS+PV +++ISMGKG AW+NG+ IGRYW
Sbjct: 628 YKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDYRGKF 687
Query: 643 -----LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTL 691
LT G PSQ WYH+PRS+ K +GN LV+ EE+ G P I + V+ +
Sbjct: 688 MPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLSKRKVSVV 741
>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
Length = 740
Score = 626 bits (1614), Expect = e-176, Method: Compositional matrix adjust.
Identities = 334/686 (48%), Positives = 425/686 (61%), Gaps = 46/686 (6%)
Query: 32 YDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQ 91
YD RSL+ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP GQ
Sbjct: 47 YDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQ 106
Query: 92 FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFK 151
+ F+ R DLVRF+K V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DN PFK
Sbjct: 107 YHFADRYDLVRFVKLVRQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFK 166
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVD 211
M+++ IV+MMK+ L+ QGGPII++Q+ENE+G +E PY WAA++AV
Sbjct: 167 AAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGAKPYAHWAAQMAVG 226
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEAR 271
TGVPWVMCKQDDAPDPVIN CNG C + PN KP +WTE WT ++ +G
Sbjct: 227 TNTGVPWVMCKQDDAPDPVINTCNGFYC--DYFTPNRKYKPTMWTEAWTGWFTKFGGALP 284
Query: 272 IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLR 330
R ED+A+ VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DE+GLLR
Sbjct: 285 HRPVEDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLR 343
Query: 331 QPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNA 389
QPKWGHL++LH A+K ++SG + ++A+IF+ + CAAFL N +
Sbjct: 344 QPKWGHLRDLHRAIKQAEPALISGDPTIQSIGNYEKAYIFKSKNGACAAFLSNYHMKTAV 403
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTA---------KLDSVEQ--WEEYKEAIPTYD 438
+ F Y+LP SISILPDCKT FNTA K++ V W+ Y E + D
Sbjct: 404 KIRFDGRHYDLPAWSISILPDCKTAVFNTATVKEPTLLPKMNPVLHFAWQSYSEDTNSLD 463
Query: 439 ETSLRANFLLEQMNTTKDASDYLWYNFRF------KHDPSDSESVLKVSSLGHVLHAFIN 492
+++ N L+EQ++ T D SDYLWY + S L V S GH + F+N
Sbjct: 464 DSAFTRNGLVEQLSLTWDKSDYLWYTTHVSIGGNEQFLKSGQWPQLTVYSAGHSMQVFVN 523
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVSI 551
G GS +G + + T V + G+N +S+LS VGLP++G + E V L V++
Sbjct: 524 GRSYGSVYGGYDNPKLTFNGHVKMWQGSNKISILSSAVGLPNNGNHFELWNVGVLGPVTL 583
Query: 552 QGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAP 610
G E K D S W YQVGL GE L + T GS V W+ G QPLTW+K +F+AP
Sbjct: 584 SGLNEGKRDLSHQKWTYQVGLKGESLGLHTVTGSSAVEWA--GPGGKQPLTWHKALFNAP 641
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWV--------------------SFLTPQGTPS 650
GSDPVA+++ SMGKG+ WVNG GRYW L+ G S
Sbjct: 642 AGSDPVALDMGSMGKGQIWVNGHHAGRYWSYRAYSGSCRRCSYAGTYREDQCLSNCGDIS 701
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENG 676
Q WYH+PRS+LKP+GNLLV+LEE G
Sbjct: 702 QRWYHVPRSWLKPSGNLLVVLEEYGG 727
>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 826
Score = 625 bits (1611), Expect = e-176, Method: Compositional matrix adjust.
Identities = 336/807 (41%), Positives = 471/807 (58%), Gaps = 80/807 (9%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
GNNV+YD ++IING R+I+FSGSIHYPRST +MWP LI KAK+GGLD ++T +FW+ HE
Sbjct: 24 GNNVSYDSNAIIINGERRIIFSGSIHYPRSTEEMWPDLIQKAKDGGLDAIETYIFWDRHE 83
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P ++DFSG + +++ + +Q GLYV +RIGP++ EW YGG P WLH++PGI R++
Sbjct: 84 PHRRKYDFSGHLNFIKYFQLIQEAGLYVVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRTN 143
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+ +K M+ + T IVNM K A L+ASQGGPIIL+QIENEYG V + E G Y+ W A
Sbjct: 144 NQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGEAGKTYINWCA 203
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A L G+PW+MC+Q DAP P+IN CNG C + F PN+P+ P ++TENW +++ +
Sbjct: 204 QMAESLNIGIPWIMCQQSDAPQPIINTCNGFYC-DNFT-PNNPNSPKMFTENWVGWFKKW 261
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDE 325
GD+ R+AED+A+ VA F + G NYYMYHGGTNFGRT+ +T YD APLDE
Sbjct: 262 GDKDPHRTAEDVAFSVARFF-QSGGILNNYYMYHGGTNFGRTSGGPFITTSYDYDAPLDE 320
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF--SKLQEAFIFQGSSECAAFLVNK 383
YG L QPKWGHLK+LH+++KL K + + +F S F + E FL N
Sbjct: 321 YGNLNQPKWGHLKQLHASIKLGEKILTNSTRSDQDFGSSVTFTKFSNLETGEKFCFLSNA 380
Query: 384 DKRNNATV-YFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----VEQWEEYKEAIPTY 437
D+ N+A V + Y LP S+SIL C FNTAK+ S ++ E + A ++
Sbjct: 381 DENNDAIVDMLGDRKYFLPAWSVSILDGCNKEIFNTAKVSSQTSLFFKKQNEKENAKLSW 440
Query: 438 DETS------------LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS--ESVLKVSSL 483
+ S +AN LLEQ T D+SDYLWY + + S L+V++
Sbjct: 441 NWASEPMRDTLQGYGTFKANLLLEQKGATIDSSDYLWYMTNVNSNTTSSLQNLTLQVNTK 500
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GHVLHAFIN ++GS G + +SF EK + L GTN ++LLS VGL + A+ +
Sbjct: 501 GHVLHAFINRRYIGSQWGSNG-QSFVFEKPIQLKLGTNTITLLSATVGLKNYDAFYDTVP 559
Query: 544 AGLRN---VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STHQP 599
G+ I D SS W Y+VGL GE+ Q++ S WS S +
Sbjct: 560 TGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLNGERKQLYNPMFSNRTKWSTLNKKSIGRR 619
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------------- 646
+TW+K F P+G+DPV +++ MGKG+AWVNG+SIGR+W SF+
Sbjct: 620 MTWFKATFKTPSGTDPVVLDMQGMGKGQAWVNGRSIGRFWPSFIASNDSCSETCDYKGSY 679
Query: 647 ---------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
G SQ WYHIPRSF+ + N L+L EE G P +S+ T+++ T+CG+ ++
Sbjct: 680 NPNKCVRNCGNSSQRWYHIPRSFMNDSINTLILFEEIGGNPQMVSVQTITIGTICGNANE 739
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
+++ C G IS+I FASYG+P G C ++
Sbjct: 740 G--------------------------STLELSCQGGHVISEIQFASYGHPEGKCGSFQS 773
Query: 758 GSCHSSNSRA-IVEKACLGKRSCTVPV 783
G + S IVEKAC+G ++C++ +
Sbjct: 774 GLWDVTKSTTIIVEKACIGMKNCSIDI 800
>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
Length = 729
Score = 624 bits (1610), Expect = e-176, Method: Compositional matrix adjust.
Identities = 335/696 (48%), Positives = 430/696 (61%), Gaps = 47/696 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD RSL+ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FS R DLVRF+K V+ GLYV LRIGP++ EW +GG P WL VPG+ FR+DN P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK+ L+ QGGPII+SQ+ENE+G +E PY WAAK+A
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V TGVPWVMCKQDDAPDPVIN CNG C + PN KP++WTE WT ++ +G
Sbjct: 218 VGTNTGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNKNYKPSMWTEAWTGWFTSFGGG 275
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DE+GL
Sbjct: 276 VPHRPVEDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K ++S + ++A++F+ + CAAFL N
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTA---------KLDSVEQ--WEEYKEAIPT 436
V F+ Y LP SISILPDCKT FNTA K++ V + W+ Y E +
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKMNPVVRFAWQSYSEDTNS 454
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES----VLKVSSLGHVLHAFIN 492
+++ + L+EQ++ T D SDYLWY +D S L V S GH + F+N
Sbjct: 455 LSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVN 514
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN---- 548
G+ GS +G + + T V + G+N +S+LS VGLP+ G + E G+
Sbjct: 515 GKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTL 574
Query: 549 VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
S+ G KD S W YQVGL GE L + T GS V W G +QPLTW+K F+
Sbjct: 575 SSLNGGT--KDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWG--GPGGYQPLTWHKAFFN 630
Query: 609 APTGSDPVAINLISMGKGEAWVNGQSIGRYW----------VSFL---------TPQGTP 649
AP G+DPVA+++ SMGKG+ WVNG +GRYW S+ + G
Sbjct: 631 APAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKASGGCGGCSYAGTYHEDKCRSNCGDL 690
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ WYH+PRS+LKP GNLLV+LEE G G+S+ T
Sbjct: 691 SQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLAT 726
>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 846
Score = 624 bits (1610), Expect = e-176, Method: Compositional matrix adjust.
Identities = 347/865 (40%), Positives = 477/865 (55%), Gaps = 95/865 (10%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
+C L +F L L+ I + V+YD R+L I+G R+ILFSGSIHYPRSTP+MWP
Sbjct: 5 KCSLSAMFLLCLSLISIAINAL----EVSYDERALTIDGKRRILFSGSIHYPRSTPEMWP 60
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFI 122
LI KAKEGGLDV++T VFWN HEPQ Q+DFS DLVRFI+ +Q +GLY +RIGP+I
Sbjct: 61 YLIRKAKEGGLDVIETYVFWNAHEPQRRQYDFSENLDLVRFIRTIQKEGLYAMIRIGPYI 120
Query: 123 EGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
EW YGGLP WLH++P + FR+ N F MK + IV+MM+ L+A QGGPII++Q
Sbjct: 121 SSEWNYGGLPVWLHNIPNMEFRTHNRAFMEEMKTFTRKIVDMMQDETLFAVQGGPIIIAQ 180
Query: 183 IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET 242
IENEYG V H++ G Y++W A+LA +TGVPWVM +Q +AP +I++C+G C +
Sbjct: 181 IENEYGNVMHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQ- 239
Query: 243 FAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
PN KP IWTENWT Y+ +G + R AED+AY VA F + G++ NYYMYHGG
Sbjct: 240 -FQPNDNHKPKIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFF-QFGGTFQNYYMYHGG 297
Query: 303 TNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
TNF RTA YV T Y APLDEYG L QPKWGHL++LH+ +K + G ++
Sbjct: 298 TNFKRTAGGPYVTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQHTDY 357
Query: 362 SKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK- 420
+ A ++ + F+ N + +AT+ F N Y +P S+SILP+C + A+NTAK
Sbjct: 358 GNMVTATVYTYDGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKV 417
Query: 421 --------------LDSVEQWEEYKEAIPTYDE------TSLRANFLLEQMNTTKDASDY 460
L+ +W+ +E + L A LL+Q T D SDY
Sbjct: 418 NTQTTIMVKKDNEDLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDY 477
Query: 461 LWY----NFRFKHDPS-DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVH 515
LWY + + DPS E L+V + GHVLH F+NG+ VG+ H K+ F E +
Sbjct: 478 LWYITSIDIKGDDDPSWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIK 537
Query: 516 LINGTNNVSLLSVMVGLPDSGAYLE----------RRVAGLRNVSIQGAKELKDFSSFSW 565
L G N +SLLS VGLP+ G + + + VA + + + +KD S W
Sbjct: 538 LTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQW 597
Query: 566 GYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGK 625
Y+VGL GE ++ Y + + W T + L WYKT F +P G DPV ++L +GK
Sbjct: 598 SYKVGLHGEH-EMHYSYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGK 656
Query: 626 GEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKP 663
G AWVNG SIGRYW S+L + PSQ WYH+PRSFL+
Sbjct: 657 GHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRD 716
Query: 664 TG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
N LVL EE G P ++ TV+V +C + + +
Sbjct: 717 DDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGN----------------------- 753
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
+++ C + IS+I FAS+G P G C ++ G+C SS + + ++ C+GK C++
Sbjct: 754 ---TLELACNKNQVISEIKFASFGLPKGECGSFQKGNCESSEALSAIKAQCIGKDKCSIQ 810
Query: 783 VWTEKFYGDPCP-GIPKALLVDAQC 806
V C + L V+A C
Sbjct: 811 VSERALGPTRCRVAEDRRLAVEAVC 835
>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
Precursor
gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
Length = 729
Score = 624 bits (1609), Expect = e-176, Method: Compositional matrix adjust.
Identities = 335/696 (48%), Positives = 430/696 (61%), Gaps = 47/696 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD RSL+ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FS R DLVRF+K V+ GLYV LRIGP++ EW +GG P WL VPG+ FR+DN P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK+ L+ QGGPII+SQ+ENE+G +E PY WAAK+A
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V TGVPWVMCKQDDAPDPVIN CNG C + PN KP++WTE WT ++ +G
Sbjct: 218 VRTNTGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNKNYKPSMWTEAWTGWFTSFGGG 275
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DE+GL
Sbjct: 276 VPHRPVEDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K ++S + ++A++F+ + CAAFL N
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTA---------KLDSVEQ--WEEYKEAIPT 436
V F+ Y LP SISILPDCKT FNTA K++ V + W+ Y E +
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKMNPVVRFAWQSYSEDTNS 454
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES----VLKVSSLGHVLHAFIN 492
+++ + L+EQ++ T D SDYLWY +D S L V S GH + F+N
Sbjct: 455 LSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVN 514
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN---- 548
G+ GS +G + + T V + G+N +S+LS VGLP+ G + E G+
Sbjct: 515 GKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTL 574
Query: 549 VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
S+ G KD S W YQVGL GE L + T GS V W G +QPLTW+K F+
Sbjct: 575 SSLNGGT--KDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWG--GPGGYQPLTWHKAFFN 630
Query: 609 APTGSDPVAINLISMGKGEAWVNGQSIGRYW----------VSFL---------TPQGTP 649
AP G+DPVA+++ SMGKG+ WVNG +GRYW S+ + G
Sbjct: 631 APAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKASGGCGGCSYAGTYHEDKCRSNCGDL 690
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ WYH+PRS+LKP GNLLV+LEE G G+S+ T
Sbjct: 691 SQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLAT 726
>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
Length = 741
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 325/725 (44%), Positives = 453/725 (62%), Gaps = 64/725 (8%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G + V YD R LIING ++L S SIHYPR+ PQMW +LI+ AK GG+DV++T VFW+
Sbjct: 21 GLSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDG 80
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
H+P ++F GR DLV F+K V GLY LRIGP++ EW GG P WL DV GI FR
Sbjct: 81 HQPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVAGIEFR 140
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
++N+PFK M+ + IV MMK +L+A QGGPIIL+QIENEYG ++ ++ G Y+ W
Sbjct: 141 TNNQPFKAEMQTFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMVW 200
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA ++ L TGVPW+MC+Q DAPD +++ CNG C + +A PN+ KP +WTENW+ ++Q
Sbjct: 201 AANMSQGLGTGVPWIMCQQSDAPDYILDTCNGFYC-DAWA-PNNKKKPKMWTENWSGWFQ 258
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPL 323
+G+ + R ED+A+ VA F + GS+ NYYMY GGTNFGR++ YV T Y AP+
Sbjct: 259 KWGEASPHRPVEDVAFAVARFFQR-GGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPI 317
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLV 381
DE+G++RQPKWGHLK+LH+A+KLC + S ++ +LQEA ++ +S CAAFL
Sbjct: 318 DEFGVIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLA 377
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------------SVEQWEE 429
N D ++ATV F++ Y LP S+SILPDCKTV+ NTAK+D + WE
Sbjct: 378 NIDSSSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKVDVQTAMPTMKPSITGLAWES 437
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDPSDSESVLKVSSLGHV 486
Y E + + ++ + A+ LLEQ+NTTKD SDYLWY + D + +++L + S+ V
Sbjct: 438 YPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAASGKALLYLESMRDV 497
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+H F+NG+ GSA K + +E+ + L +G N++++L VGL + G ++E AG+
Sbjct: 498 VHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFIETWGAGI 557
Query: 547 R-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
+V ++G + D ++ W +QVGL GE L IFT+ GS+ V WS Q L WYK
Sbjct: 558 NGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWSS-AVPQGQALVWYK 616
Query: 605 TV-----------------FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ- 646
+ FD+P+G+DPVA++L SMGKG+AW+NGQSIGR+W S P
Sbjct: 617 VIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPSLRAPDT 676
Query: 647 ----------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
G PSQ WYH+PRS+L+ GNL+VL EEE G P G+S
Sbjct: 677 AGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDGGNLVVLFEEEGGKPSGVSFV 736
Query: 685 TVSVT 689
T +V
Sbjct: 737 TRTVV 741
>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
Length = 740
Score = 623 bits (1607), Expect = e-175, Method: Compositional matrix adjust.
Identities = 328/714 (45%), Positives = 437/714 (61%), Gaps = 54/714 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD RSL I R+++ S +IHYPRS P MWP L+ AKEGG + +++ VFWN HEP
Sbjct: 30 NVSYDHRSLSIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHEPS 89
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
P ++ F GR ++V+FIK VQ G+++ LRIGPF+ EW YGG+P WLH VPG VFR+DNE
Sbjct: 90 PRKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRADNE 149
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+K +M+ + T IVN++K +L+A QGGPIILSQ+ENEYG E + E G Y +W+A +
Sbjct: 150 PWKHYMESFTTYIVNLLKKEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSASM 209
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV GVPW+MC+Q DAP VI+ CNG C + PN+PDKP IWTENW +++ +G
Sbjct: 210 AVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQ--FTPNTPDKPKIWTENWPGWFKTFGG 267
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+AY VA F K GS NYYMYHGGTNFGRT+ +T YD +AP+DEYG
Sbjct: 268 RDPHRPAEDVAYSVARFFGK-GGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYG 326
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R PKWGHLK+LH A+ L +++G + EA ++ SS CAAFL N D +
Sbjct: 327 LPRLPKWGHLKDLHKAIMLSENLLINGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDK 386
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE------------QWEEY 430
N+ TV F N Y LP S+SILPDCK FNTAK+ S VE +WE +
Sbjct: 387 NDKTVMFRNTSYHLPAWSVSILPDCKNEVFNTAKVTSKFSKVEMLPEDLRSSSGLKWEVF 446
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLG 484
E + E N L++ +NTTKD +DYLWY ++ S VL + S G
Sbjct: 447 SEKPGIWGEADFVKNELVDHINTTKDTTDYLWYTTSITVSTNEEFLKKGSPPVLFIESKG 506
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LH FIN E++G+A G + F L+K V L G NN+ LLS+ VGL ++G++ E A
Sbjct: 507 HTLHVFINKEYLGTATGNGTHVPFKLKKSVALKAGENNIDLLSMTVGLSNAGSFYEWVGA 566
Query: 545 GLRNVSIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWS-RYGSSTHQPLTW 602
GL +VSI+G K + ++ W Y++G+ G L++F S V W+ QPLTW
Sbjct: 567 GLTSVSIKGFNKGTLNLTNSKWSYKLGVQGVHLELFKPGDSGAVKWTVTTKPPKKQPLTW 626
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF-------------------- 642
YK V D P+GS+PV ++++SMGKG AW+NG+ IGRYW
Sbjct: 627 YKVVIDPPSGSEPVGLDMMSMGKGMAWLNGEEIGRYWPRIARKSTPNDECVKECDYRGKF 686
Query: 643 -----LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTL 691
LT G PSQ WYH+PRS+ K +GN LV+ EE+ G P I++ V+ +
Sbjct: 687 MPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGDPMKITLSKRKVSVV 740
>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
Length = 829
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 344/837 (41%), Positives = 477/837 (56%), Gaps = 93/837 (11%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ +T D R ++ING RKIL SGS+HYPRSTP+MWP LI K+K+GGL+ + T VFW+LHEP
Sbjct: 28 DQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEP 87
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
Q Q+DF+G +DLVRFIK +QAQGLY LRIGP++ EW YGG P WLH+ P I R++N
Sbjct: 88 QRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNN 147
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
+ M+ + TMIV+MMK +L+ASQGGPII+SQIENEYG V ++ + G Y+ W A+
Sbjct: 148 TVYMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQ 207
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A L TGVPW+MC+QD+AP P+IN CNG C + PN+P+ P +WTENW+ +Y+ +G
Sbjct: 208 MAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQ--FTPNNPNSPKMWTENWSGWYKNWG 265
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEY 326
R+AED+A+ VA F ++ G++ NYYMYHGGTNFGRTA Y+ T Y APL+EY
Sbjct: 266 GSDPHRTAEDLAFSVARFY-QLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEY 324
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI--FQGSSECAAFLVNKD 384
G QPKWGHL++LH + K + G + ++++ L A I +QG S C F N +
Sbjct: 325 GNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSN 382
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------------QW 427
+ T+ + + Y +P S+SILPDC +NTAK++S QW
Sbjct: 383 ADRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGSEAENEPNSLQW 442
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS-DSESVLKVSSLGHV 486
E I A+ LL+Q +D SDYL+Y DP + L V++ GH+
Sbjct: 443 TWRGETIQYITPGRFTASELLDQKTVAEDTSDYLYY-MTTNDDPIWGKDLTLSVNTSGHI 501
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
LHAF+NGE +G + F + V L G N ++LLS VGL + G + G+
Sbjct: 502 LHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMVNQGI 561
Query: 547 RN-----VSIQGAKELKDFSSFS-WGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
S A +KD S+ + W Y+ GL GE +IF +R W ++
Sbjct: 562 HGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGEDKKIFLGR-ARYNQWKSDNLPVNRSF 620
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL----------------- 643
WYK FDAP G DPV ++L+ +GKGEAWVNG S+GRYW S++
Sbjct: 621 VWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYRGPYK 680
Query: 644 -----TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDS 698
T G PSQ WYH+PRSFL T N LVL EE G P ++ TV+V C + +
Sbjct: 681 AEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNACANAREG 740
Query: 699 HLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC------ 752
+ +++ C GR IS I FAS+G+P G C
Sbjct: 741 Y--------------------------TLELSC-QGRAISGIKFASFGDPQGTCGKPFAT 773
Query: 753 --ENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDP-CPGIPKALLVDAQC 806
+ + G+C +++S +I++K C+GK SC++ V +E+ G C K L V+A C
Sbjct: 774 GSQVFEKGTCEAADSLSIIQKLCVGKYSCSIDV-SEQILGPAGCTADTKRLAVEAIC 829
>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
Length = 923
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 346/865 (40%), Positives = 477/865 (55%), Gaps = 95/865 (10%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
+C L F L L+ I + V+YD R+L I+G R+ILFS SIHYPRSTP+MWP
Sbjct: 5 KCSLSASFLLCLSLISIAINAL----EVSYDERALTIDGKRRILFSASIHYPRSTPEMWP 60
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFI 122
LI KAKEGGLDV++T VFWN HEPQ Q++FS DLVRFI+ +Q +GLY +RIGP+I
Sbjct: 61 YLIRKAKEGGLDVIETYVFWNAHEPQRRQYEFSENLDLVRFIRTIQKEGLYAMIRIGPYI 120
Query: 123 EGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
EW YGGLP WLH++P + FR+ N F MK + T IV+MM+ L+A QGGPII++Q
Sbjct: 121 SSEWNYGGLPVWLHNIPNMEFRTHNRAFMEEMKTFTTKIVDMMQDETLFAVQGGPIIIAQ 180
Query: 183 IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET 242
IENEYG V H++ G Y++W A+LA +TGVPWVM +Q +AP +I++C+G C +
Sbjct: 181 IENEYGNVMHAYGNNGTQYLKWCAQLADSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQ- 239
Query: 243 FAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
PN KP IWTENWT Y+ +G + R AED+AY VA F + G++ NYYMYHGG
Sbjct: 240 -FQPNDNHKPKIWTENWTGGYKNWGTQNPHRPAEDVAYAVARFF-QFGGTFQNYYMYHGG 297
Query: 303 TNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
TNF RTA YV T Y APLDEYG L QPKWGHL++LH+ +K + G + ++
Sbjct: 298 TNFKRTAGGPYVTTSYDYDAPLDEYGNLNQPKWGHLRQLHNLLKSKENILTQGSSQNTDY 357
Query: 362 SKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK- 420
+ A ++ + F+ N + +AT+ F N Y +P S+SILP+C + A+NTAK
Sbjct: 358 GNMVTATVYTYDGKSTCFIGNAHQSKDATINFRNNEYTIPAWSVSILPNCSSEAYNTAKV 417
Query: 421 --------------LDSVEQWEEYKEAIPTYDE------TSLRANFLLEQMNTTKDASDY 460
L+ +W+ +E + L A LL+Q T D SDY
Sbjct: 418 NTQTTIMVKKDNEDLEYALRWQWRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDY 477
Query: 461 LWY----NFRFKHDPS-DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVH 515
LWY + + DPS E L+V + GHVLH F+NG+ VG+ H K+ F E +
Sbjct: 478 LWYITSIDIKGDDDPSWTKEFRLRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIK 537
Query: 516 LINGTNNVSLLSVMVGLPDSGAYLE----------RRVAGLRNVSIQGAKELKDFSSFSW 565
L G N +SLLS VGLP+ G + + + VA + + + +KD S W
Sbjct: 538 LTTGKNEISLLSTTVGLPNYGPFFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQW 597
Query: 566 GYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGK 625
Y+VGL GE ++ Y + + W T + L WYKT F +P G DPV ++L +GK
Sbjct: 598 SYKVGLHGEH-EMHYSYENSLKTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGK 656
Query: 626 GEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKP 663
G AWVNG SIGRYW S+L + PSQ WYH+PRSFL+
Sbjct: 657 GHAWVNGNSIGRYWSSYLADENGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRD 716
Query: 664 TG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
N LVL EE G P ++ TV+V +C + + +
Sbjct: 717 NDQNTLVLFEELGGQPYYVNFLTVTVGKVCANAYEGN----------------------- 753
Query: 723 RRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
+++ C + IS+I FAS+G P G C ++ G+C SS + + ++ C+GK C++
Sbjct: 754 ---TLELACNKNQVISEIKFASFGLPKGECGSFQKGNCESSEALSAIKAQCIGKDKCSIQ 810
Query: 783 VWTEKFYGDPCP-GIPKALLVDAQC 806
V C + L V+A C
Sbjct: 811 VSERTLGPTRCRVAEDRRLAVEAVC 835
>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
Length = 723
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 332/698 (47%), Positives = 423/698 (60%), Gaps = 47/698 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R+++ING R+IL SGSIHYPRSTP+MWP L+ KAK+GGLDVVQT VFWN HEP
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLVRF+K + GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E PY WAAK+A
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PNS KP +WTE WT ++ +G
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNSNSKPTMWTEAWTGWFTAFGGA 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA FI K GS+VNYYMYHGGTNF RT+ ++ T Y AP+DEYGL
Sbjct: 266 VPHRPVEDMAFAVARFIQK-GGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K ++SG + ++A++F+ S CAAFL N
Sbjct: 325 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSA 384
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPT 436
A V F+ Y+LP SIS+LPDCK FNTA + W+ Y EA +
Sbjct: 385 AARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSPAGGFSWQSYSEATNS 444
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKVSSLGHVLHAF 490
D + + L+EQ++ T D SDYLWY N + S L V S GH L F
Sbjct: 445 LDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHSLQVF 504
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNV 549
+NG+ G+ +G + T V + G+N +S+LS VGLP+ G + E V L V
Sbjct: 505 VNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVLGPV 564
Query: 550 SIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
++ G E K D S+ W YQ+GL GE L + + GS V W ++ QPLTW+K F
Sbjct: 565 TLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGS--AAGKQPLTWHKAYFS 622
Query: 609 APTGSDPVAINLISMGKGEAWVNGQSIGRYW---------------------VSFLTPQG 647
AP+G PVA+++ SMGKG+AWVNG+ IGRYW T G
Sbjct: 623 APSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSGGCGGCSYAGTYSETKCQTGCG 682
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ +YH+PRS+L P+GNLLVLLEE G PG+ + T
Sbjct: 683 DVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVT 720
>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
Length = 833
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 343/841 (40%), Positives = 476/841 (56%), Gaps = 95/841 (11%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
+ +T D R ++ING RKIL SGS+HYPRSTP+MWP LI K+K+GGL+ + T VFW+LHE
Sbjct: 27 ADQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHE 86
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ Q+DF+G +DLVRFIK +QAQGLY LRIGP++ EW YGG P WLH+ P I R++
Sbjct: 87 PQRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTN 146
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N + M+ + TMIV+MMK +L+ASQGGPII+SQIENEYG V ++ + G Y+ W A
Sbjct: 147 NTVYMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCA 206
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A L TGVPW+MC+QD+AP P+IN CNG C + PN+P+ P +WTENW+ +Y+ +
Sbjct: 207 QMAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQ--FTPNNPNSPKMWTENWSGWYKNW 264
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDE 325
G R+AED+A+ VA F ++ G++ NYYMYHGGTNFGRTA Y+ T Y APL+E
Sbjct: 265 GGSDPHRTAEDLAFSVARFY-QLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNE 323
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI--FQGSSECAAFLVNK 383
YG QPKWGHL++LH + K + G + ++++ L A I +QG S C F N
Sbjct: 324 YGNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNS 381
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------------Q 426
+ + T+ + + Y +P S+SILPDC +NTAK++S Q
Sbjct: 382 NADRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGSEAENEPNSLQ 441
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSS 482
W E I A+ LL+Q +D SDYL+Y D + L V++
Sbjct: 442 WTWRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTTVDISNDDPIWGKDLTLSVNT 501
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
GH+LHAF+NGE +G + F + V L G N ++LLS VGL + G +
Sbjct: 502 SGHILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMV 561
Query: 543 VAGLRN-----VSIQGAKELKDFSSFS-WGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
G+ S A +KD S+ + W Y+ GL GE +IF +R W
Sbjct: 562 NQGIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGEDKKIFLGR-ARYNQWKSDNLPV 620
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------------- 643
++ WYK FDAP G DPV ++L+ +GKGEAWVNG S+GRYW S++
Sbjct: 621 NRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYR 680
Query: 644 ---------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGH 694
T G PSQ WYH+PRSFL T N LVL EE G P ++ TV+V C +
Sbjct: 681 GPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNACAN 740
Query: 695 VSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC-- 752
+ + +++ C GR IS I FAS+G+P G C
Sbjct: 741 AREGY--------------------------TLELSC-QGRAISGIKFASFGDPQGTCGK 773
Query: 753 ------ENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDP-CPGIPKALLVDAQ 805
+ + G+C +++S +I++K C+GK SC++ V +E+ G C K L V+A
Sbjct: 774 PFATGSQVFEKGTCEAADSLSIIQKLCVGKYSCSIDV-SEQILGPAGCTADTKRLAVEAI 832
Query: 806 C 806
C
Sbjct: 833 C 833
>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
Length = 827
Score = 620 bits (1600), Expect = e-175, Method: Compositional matrix adjust.
Identities = 334/832 (40%), Positives = 471/832 (56%), Gaps = 85/832 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+Y R + I+G KI SGSIHYPRSTPQMWP LI K+KEGGLD ++T VFWN HEP
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGI-VFRSDNE 148
Q+DFS DLVRFIK +Q +GLY LRIGP++ EW YGG P WLH++PGI R+ N
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
F M+ + T+IV+MMK L+ASQGGPIIL+QIENEYG V S+ + G YV W A +
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A GVPW+MC+QDDAP+P IN CNG C + PN+ P +WTENWT +++ +G
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQ--FTPNNAKSPKMWTENWTGWFKSWGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
+R+ ED+A+ VA F ++ G++ NYYMYHGGTNF R A Y+ T Y APLDEYG
Sbjct: 264 RDPVRTPEDLAFSVARFF-QLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L QPK+GHLK+LH+A+K K ++SG + + + + + + F N ++
Sbjct: 323 NLNQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGKSCFFSNINETT 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------------VEQWEEY 430
+A V + + +P S+SILPDC+ +NTAK+++ V +W
Sbjct: 383 DALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVLEWMWR 442
Query: 431 KEAIPT---YDETSLRANFLLEQMNTTKDASDYLWY----NFRFKHDPSDSESVLKVSSL 483
E I + + AN L++Q + DASDYLWY N + K +E L+++
Sbjct: 443 PENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSNEMTLRINVS 502
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH++HAF+NGE +GS + ++ E+ V L G N +SLLS +GL + GA +
Sbjct: 503 GHIVHAFVNGEHIGSQWASYDVYNYIFEQEVKLKPGKNIISLLSATIGLKNYGAQYDLIQ 562
Query: 544 AGL----RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
+G+ + + G + +KD S+ W Y+VGL G + ++F+ W ++
Sbjct: 563 SGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQSGNLPVNR 622
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGT---------- 648
+TWYKT F P G+DPV ++L +GKG AWVNG SIGRYW SF+ G
Sbjct: 623 MMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEPCDYRGS 682
Query: 649 ------------PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
P+Q WYH+PRS+L N LVL EE G P ++ T+++ CGH
Sbjct: 683 YTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEKACGHAY 742
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYA 756
+ + +++ C G++I+ I FAS+G+P G+C N++
Sbjct: 743 E--------------------------KKSLELSC-QGKEITGIKFASFGDPTGSCGNFS 775
Query: 757 IGSCHSSN-SRAIVEKACLGKRSCTVPVWTEKFYGDPCP-GIPKALLVDAQC 806
GSC N + IVE C+GK SC + + + F C G+ K L V+A C
Sbjct: 776 KGSCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCALGVVKRLAVEAVC 827
>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis
sativus]
Length = 827
Score = 620 bits (1599), Expect = e-174, Method: Compositional matrix adjust.
Identities = 334/832 (40%), Positives = 471/832 (56%), Gaps = 85/832 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+Y R + I+G KI SGSIHYPRSTPQMWP LI K+KEGGLD ++T VFWN HEP
Sbjct: 26 VSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHEPVR 85
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGI-VFRSDNE 148
Q+DFS DLVRFIK +Q +GLY LRIGP++ EW YGG P WLH++PGI R+ N
Sbjct: 86 RQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRTTNP 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
F M+ + T+IV+MMK L+ASQGGPIIL+QIENEYG V S+ + G YV W A +
Sbjct: 146 VFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWCANM 205
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A GVPW+MC+QDDAP+P IN CNG C + PN+ P +WTENWT +++ +G
Sbjct: 206 ADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQ--FTPNNAKSPKMWTENWTGWFKSWGG 263
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
+R+ ED+A+ VA F ++ G++ NYYMYHGGTNF R A Y+ T Y APLDEYG
Sbjct: 264 RDPVRTPEDLAFSVARFF-QLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYG 322
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L QPK+GHLK+LH+A+K K ++SG + + + + + + F N ++
Sbjct: 323 NLNQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGKSCFFSNINETT 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------------VEQWEEY 430
+A V + + +P S+SILPDC+ +NTAK+++ V +W
Sbjct: 383 DALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVLEWMWR 442
Query: 431 KEAIPT---YDETSLRANFLLEQMNTTKDASDYLWY----NFRFKHDPSDSESVLKVSSL 483
E I + + AN L++Q + DASDYLWY N + K +E L+++
Sbjct: 443 PENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSNEMTLRINVS 502
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH++HAF+NGE +GS + ++ E+ V L G N +SLLS +GL + GA +
Sbjct: 503 GHIVHAFVNGEHIGSQWASYDVYNYIXEQEVKLKPGKNIISLLSATIGLKNYGAQYDLIQ 562
Query: 544 AGL----RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
+G+ + + G + +KD S+ W Y+VGL G + ++F+ W ++
Sbjct: 563 SGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQSGNLPVNR 622
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGT---------- 648
+TWYKT F P G+DPV ++L +GKG AWVNG SIGRYW SF+ G
Sbjct: 623 MMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEPCDYRGS 682
Query: 649 ------------PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
P+Q WYH+PRS+L N LVL EE G P ++ T+++ CGH
Sbjct: 683 YTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEKACGHAY 742
Query: 697 DSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYA 756
+ + +++ C G++I+ I FAS+G+P G+C N++
Sbjct: 743 E--------------------------KKSLELSC-QGKEITGIKFASFGDPTGSCGNFS 775
Query: 757 IGSCHSSN-SRAIVEKACLGKRSCTVPVWTEKFYGDPCP-GIPKALLVDAQC 806
GSC N + IVE C+GK SC + + + F C G+ K L V+A C
Sbjct: 776 KGSCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCALGVVKRLAVEAVC 827
>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
Length = 754
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 331/684 (48%), Positives = 424/684 (61%), Gaps = 47/684 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD RSL+ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 38 VSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQ 97
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FS R DLVRF+K V+ GLYV LRIGP++ EW +GG P WL VPG+ FR+DN P
Sbjct: 98 GQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTDNGP 157
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK+ L+ QGGPII+SQ+ENE+G +E PY WAAK+A
Sbjct: 158 FKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAAKMA 217
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V TGVPWVMCKQDDAPDPVIN CNG C + PN KP++WTE WT ++ +G
Sbjct: 218 VGTNTGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNKNYKPSMWTEAWTGWFTSFGGG 275
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DE+GL
Sbjct: 276 VPHRPVEDLAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K ++S + ++A++F+ + CAAFL N
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTA---------KLDSVEQ--WEEYKEAIPT 436
V F+ Y LP SISILPDCKT FNTA K++ V + W+ Y E +
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATVKEPTLMPKMNPVVRFAWQSYSEDTNS 454
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES----VLKVSSLGHVLHAFIN 492
+++ + L+EQ++ T D SDYLWY +D S L V S GH + F+N
Sbjct: 455 LSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVN 514
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN---- 548
G+ GS +G + + T V + G+N +S+LS VGLP+ G + E G+
Sbjct: 515 GKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTL 574
Query: 549 VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
S+ G KD S W YQVGL GE L + T GS V W G +QPLTW+K F+
Sbjct: 575 SSLNGGT--KDLSHQKWTYQVGLKGETLGLQTVTGSSAVEWG--GPGGYQPLTWHKAFFN 630
Query: 609 APTGSDPVAINLISMGKGEAWVNGQSIGRYW----------VSFL---------TPQGTP 649
AP G+DPVA+++ SMGKG+ WVNG +GRYW S+ + G
Sbjct: 631 APAGNDPVALDMGSMGKGQLWVNGHHVGRYWSYKASGGCGGCSYAGTYHEDKCRSNCGDL 690
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEE 673
SQ WYH+PRS+LKP GNLLV+LEE
Sbjct: 691 SQRWYHVPRSWLKPGGNLLVVLEE 714
>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
gi|194689400|gb|ACF78784.1| unknown [Zea mays]
gi|224030521|gb|ACN34336.1| unknown [Zea mays]
gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
Length = 722
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 329/697 (47%), Positives = 421/697 (60%), Gaps = 46/697 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R+++ING R+IL SGSIHYPRSTP+MWP L+ KAK+GGLDVVQT VFWN HEP
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLVRF+K + GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E PY WAAK+A
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PNS KP +WTE WT ++ +G
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNSNSKPTMWTEAWTGWFTAFGGA 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA FI K GS+VNYYMYHGGTNF RT+ ++ T Y AP+DEYGL
Sbjct: 266 VPHRPVEDMAFAVARFIQK-GGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 324
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K ++SG + ++A++F+ S CAAFL N
Sbjct: 325 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSA 384
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPT 436
A V F+ Y+LP SIS+LPDCK FNTA + W+ Y EA +
Sbjct: 385 AARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSPAGGFSWQSYSEATNS 444
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKVSSLGHVLHAF 490
D + + L+EQ++ T D SDYLWY N + S L + S GH L F
Sbjct: 445 LDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTIYSAGHSLQVF 504
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNV 549
+NG+ G+ +G + T V + G+N +S+LS VGLP+ G + E V L V
Sbjct: 505 VNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVLGPV 564
Query: 550 SIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
++ G E K D S W YQ+GL GE L + + GS V W ++ QPLTW+K F
Sbjct: 565 TLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGS--AAGKQPLTWHKAYFS 622
Query: 609 APTGSDPVAINLISMGKGEAWVNGQSIGRYW--------------------VSFLTPQGT 648
AP+G PVA+++ SMGKG+AWVNG+ IGRYW T G
Sbjct: 623 APSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSGCGGCSYAGTYSETKCQTGCGD 682
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ +YH+PRS+L P+GNLLV+LEE G G+ + T
Sbjct: 683 VSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVT 719
>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
distachyon]
Length = 719
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 326/698 (46%), Positives = 426/698 (61%), Gaps = 49/698 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD ++++ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 26 VSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQ 85
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLVRF+K + GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 86 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 145
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E PY WAAK+A
Sbjct: 146 FKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKMA 205
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PNS KP +WTE W+ ++ +G
Sbjct: 206 VATGAGVPWVMCKQDDAPDPVINTCNGFYC--DYFTPNSNGKPNMWTEAWSGWFTAFGGA 263
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA F+ K GS+VNYYMYHGGTNF RTA ++ T Y AP+DEYGL
Sbjct: 264 VPHRPVEDLAFAVARFVQK-GGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGL 322
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K M+SG + ++A++F+ S+ CAAFL N +
Sbjct: 323 LRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSS 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPT 436
A V ++ YELP SISILPDCKT +NTA + W+ Y E +
Sbjct: 383 PAKVVYNGRRYELPAWSISILPDCKTAVYNTATVKEPSAPAKMNPAGGFSWQSYSEDTNS 442
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHVLH 488
D+++ + L+EQ++ T D SD+LWY D SE LK ++S GH L
Sbjct: 443 LDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNID--SSEQFLKSGQWPQLTINSAGHTLQ 500
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLR 547
F+NG+ G+ +G + + K V + G+N +S+LS VGL + G + E V L
Sbjct: 501 VFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYENWNVGVLG 560
Query: 548 NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V++ G + K D S+ W YQ+GL GE L + + GS V W ++ QPLTW+K
Sbjct: 561 PVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGS--ANGAQPLTWHKAY 618
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW-------------------VSFLTPQG 647
F AP G PVA+++ SMGKG+ WVNG++ GRYW T G
Sbjct: 619 FSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWSYKASGSCGSCSYTGTYSETKCQTNCG 678
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ WYH+PRS+L P+GNLLV+LEE G G+ + T
Sbjct: 679 DISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMT 716
>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
Full=SR12 protein; Flags: Precursor
gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
Length = 731
Score = 617 bits (1590), Expect = e-173, Method: Compositional matrix adjust.
Identities = 334/701 (47%), Positives = 426/701 (60%), Gaps = 51/701 (7%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R++ IN R+IL SGSIHYPRSTP+MWP +I KAK+ LDV+QT VFWN HEP
Sbjct: 30 NVWYDYRAIKINDQRRILLSGSIHYPRSTPEMWPDIIEKAKDSQLDVIQTYVFWNGHEPS 89
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G++ F GR DLV+FIK + GL+V LRIGPF EW +GG P WL VPGI FR+DN
Sbjct: 90 EGKYYFEGRYDLVKFIKLIHQAGLFVHLRIGPFACAEWNFGGFPVWLKYVPGIEFRTDNG 149
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + T IV+MMKA +L+ QGGPIIL+QIENEYG VE G Y WAA++
Sbjct: 150 PFKEKMQVFTTKIVDMMKAEKLFHWQGGPIILNQIENEYGPVEWEIGAPGKAYTHWAAQM 209
Query: 209 AVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
A L GVPW+MCKQD D PD VI+ CNG C E F P KP +WTENWT +Y YG
Sbjct: 210 AQSLNAGVPWIMCKQDSDVPDNVIDTCNGFYC-EGFV-PKDKSKPKMWTENWTGWYTEYG 267
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYG 327
R AED+A+ VA FI + GS++NYYM+HGGTNF TA +V T Y APLDEYG
Sbjct: 268 KPVPYRPAEDVAFSVARFI-QNGGSFMNYYMFHGGTNFETTAGRFVSTSYDYDAPLDEYG 326
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKR 386
L R+PK+ HLK LH A+K+C ++S N QEA ++ +S CAAFL N D +
Sbjct: 327 LPREPKYTHLKNLHKAIKMCEPALVSSDAKVTNLGSNQEAHVYSSNSGSCAAFLANYDPK 386
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD--------------SVEQWEEYKE 432
+ V FS + +ELP SISILPDCK +NTA+++ S W+ Y +
Sbjct: 387 WSVKVTFSGMEFELPAWSISILPDCKKEVYNTARVNEPSPKLHSKMTPVISNLNWQSYSD 446
Query: 433 AIPTYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
+PT D + R L EQ+N T D SDYLWY D ++ E L V+S GH
Sbjct: 447 EVPTADSPGTFREKKLYEQINMTWDKSDYLWYMTDVVLDGNEGFLKKGDEPWLTVNSAGH 506
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
VLH F+NG+ G A+G + T + V + G N +SLLS +VGL + G + ER G
Sbjct: 507 VLHVFVNGQLQGHAYGSLAKPQLTFSQKVKMTAGVNRISLLSAVVGLANVGWHFERYNQG 566
Query: 546 -LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWY 603
L V++ G E +D + W Y++G GE+ Q++ GS V W + QPL WY
Sbjct: 567 VLGPVTLSGLNEGTRDLTWQYWSYKIGTKGEEQQVYNSGGSSHVQWGP--PAWKQPLVWY 624
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW---------------------VSF 642
KT FDAP G+DP+A++L SMGKG+AW+NGQSIGR+W
Sbjct: 625 KTTFDAPGGNDPLALDLGSMGKGQAWINGQSIGRHWSNNIAKGSCNDNCNYAGTYTETKC 684
Query: 643 LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
L+ G SQ WYH+PRS+L+P GNLLV+ EE G +S+
Sbjct: 685 LSDCGKSSQKWYHVPRSWLQPRGNLLVVFEEWGGDTKWVSL 725
>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
distachyon]
Length = 721
Score = 616 bits (1588), Expect = e-173, Method: Compositional matrix adjust.
Identities = 326/700 (46%), Positives = 427/700 (61%), Gaps = 51/700 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD ++++ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 26 VSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQ 85
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLVRF+K + GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 86 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 145
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E PY WAAK+A
Sbjct: 146 FKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAAKMA 205
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PNS KP +WTE W+ ++ +G
Sbjct: 206 VATGAGVPWVMCKQDDAPDPVINTCNGFYC--DYFTPNSNGKPNMWTEAWSGWFTAFGGA 263
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R ED+A+ VA F+ K GS+VNYYMYHGGTNF RTA ++ T Y AP+DEYGL
Sbjct: 264 VPHRPVEDLAFAVARFVQK-GGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGL 322
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRN 387
LRQPKWGHL++LH A+K M+SG + ++A++F+ S+ CAAFL N +
Sbjct: 323 LRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSS 382
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-------------WEEYKEAI 434
A V ++ YELP SISILPDCKT +NTA + + W+ Y E
Sbjct: 383 PAKVVYNGRRYELPAWSISILPDCKTAVYNTATVRQKWKEKKLWMNPAGGFSWQSYSEDT 442
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHV 486
+ D+++ + L+EQ++ T D SD+LWY D SE LK ++S GH
Sbjct: 443 NSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNID--SSEQFLKSGQWPQLTINSAGHT 500
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAG 545
L F+NG+ G+ +G + + K V + G+N +S+LS VGL + G + E V
Sbjct: 501 LQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTHYENWNVGV 560
Query: 546 LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
L V++ G + K D S+ W YQ+GL GE L + + GS V W ++ QPLTW+K
Sbjct: 561 LGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGS--ANGAQPLTWHK 618
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW-------------------VSFLTP 645
F AP G PVA+++ SMGKG+ WVNG++ GRYW T
Sbjct: 619 AYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWSYKASGSCGSCSYTGTYSETKCQTN 678
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
G SQ WYH+PRS+L P+GNLLV+LEE G G+ + T
Sbjct: 679 CGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMT 718
>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 788
Score = 614 bits (1584), Expect = e-173, Method: Compositional matrix adjust.
Identities = 339/818 (41%), Positives = 463/818 (56%), Gaps = 82/818 (10%)
Query: 41 GHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDL 100
G R+IL SGSIHYPRST MWP LI KAK+GGLD ++T VFWN HEP+ ++DFSG D+
Sbjct: 1 GKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDV 60
Query: 101 VRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATM 160
VRFIK +Q GLY LRIGP++ EW YGG P WLH++P + FR+ N F M+ + T
Sbjct: 61 VRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTK 120
Query: 161 IVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVM 220
IV MMK +L+ASQGGPIIL+QIENEYG V S+ +G Y+ W A +A L GVPW+M
Sbjct: 121 IVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLM 180
Query: 221 CKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAY 280
C+Q +AP P++ CNG C + P +P P +WTENWT +++ +G + R+AED+A+
Sbjct: 181 CQQPNAPQPMLETCNGFYCDQ--YEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAF 238
Query: 281 HVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKE 339
VA F + G++ NYYMYHGGTNFGR A Y+ T Y APLDE+G L QPKWGHLK+
Sbjct: 239 SVARFF-QTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQ 297
Query: 340 LHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYE 399
LH+ +K K + G + ++ +A I+ + F+ N + +A V F Y
Sbjct: 298 LHTVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEGSSCFIGNVNATADALVNFKGKDYH 357
Query: 400 LPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLR---------------- 443
+P S+S+LPDC A+NTAK+++ + P E + R
Sbjct: 358 VPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPESAQKMILKGSGDLI 417
Query: 444 ANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSSLGHVLHAFINGEFVGSA 499
A L++Q + T DASDYLWY R D D L+V S HVLHA++NG++VG+
Sbjct: 418 AKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYVGNQ 477
Query: 500 HGKHSDKSFTLEKMV-HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKE- 556
K + E+ V HL++GTN++SLLSV VGL + G + E G+ VS+ G K
Sbjct: 478 FVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVGYKGE 537
Query: 557 ---LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGS 613
KD S W Y++GL G ++F+ W+ T + LTWYK F AP G
Sbjct: 538 ETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWANEKLPTGRMLTWYKAKFKAPLGK 597
Query: 614 DPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQ 651
+PV ++L +GKGEAW+NGQSIGRYW SF + G P+Q
Sbjct: 598 EPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDYRGAYGSDKCAFMCGKPTQ 657
Query: 652 SWYHIPRSFLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQN 710
WYH+PRSFL +G N + L EE G P ++ TV V T+C + +
Sbjct: 658 RWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCARAHEHN----------- 706
Query: 711 QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRA-IV 769
KV++ C + R IS + FAS+GNP G+C ++A+G+C A V
Sbjct: 707 ---------------KVELSCHN-RPISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTV 750
Query: 770 EKACLGKRSCTVPVWTEKFYGD-PCPGIPKALLVDAQC 806
K C+GK +CTV V ++ F C PK L V+ +C
Sbjct: 751 AKECVGKLNCTVNVSSDTFGSTLDCGDSPKKLAVELEC 788
>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 716
Score = 613 bits (1580), Expect = e-172, Method: Compositional matrix adjust.
Identities = 329/697 (47%), Positives = 427/697 (61%), Gaps = 49/697 (7%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
+YD R+++ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP G
Sbjct: 24 SYDHRAVVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPARG 83
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ F+ R DLVRF+K + GLYV LRIGP++ EW +GG P WL VPGI FR+DN PF
Sbjct: 84 QYHFADRYDLVRFVKLARQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGPF 143
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+R+ IV+MMK+ L+ QGGPIIL+Q+ENEYG +E + PY WAA +AV
Sbjct: 144 KAEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESAMGAGAKPYANWAANMAV 203
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
GVPWVMCKQDDAPDPVIN CNG C + PNS KP +WTE WT ++ +G
Sbjct: 204 ATDAGVPWVMCKQDDAPDPVINTCNGFYC--DYFTPNSNSKPTMWTEAWTGWFTAFGGPV 261
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+A+ VA FI K GS+VNYYMYHGGTNF RTA ++ T Y AP+DEYGL+
Sbjct: 262 PHRPVEDMAFAVARFIQK-GGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLI 320
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNN 388
RQPKWGHL++LH A+K ++SG ++A++F+ S+ CAAFL N +
Sbjct: 321 RQPKWGHLRDLHKAIKQAEPALVSGDPTIQRIGNYEKAYVFKSSTGACAAFLSNYHTSSA 380
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPTY 437
A + ++ Y+LP SISILPDCKT FNTA + W+ Y E
Sbjct: 381 ARIVYNGRRYDLPAWSISILPDCKTAVFNTATVKEPTAPAKMNPAGGFAWQSYSEDTNAL 440
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHVLHA 489
D ++ + L+EQ++ T D SDYLWY D SE LK ++S GH +
Sbjct: 441 DSSAFTKDGLVEQLSMTWDKSDYLWYTTYVNID--SSEQFLKTGQWPQLTINSAGHSVQV 498
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRN 548
F+NG+ G A+G ++ T K V + G+N +S+LS +GLP+ G + E V L
Sbjct: 499 FVNGQSFGVAYGGYNSPKLTYSKPVKMWQGSNKISILSSAMGLPNQGTHYEAWNVGVLGP 558
Query: 549 VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVF 607
V++ G + K D S+ W YQ+GL GE L + + GS V +S QPLTW+K F
Sbjct: 559 VTLSGLNQGKRDLSNQKWTYQIGLKGESLGVNSISGSSSV--EWSSASGAQPLTWHKAYF 616
Query: 608 DAPTGSDPVAINLISMGKGEAWVNGQSIGRYW-------------------VSFLTPQGT 648
AP GS PVA+++ SMGKG+ WVNG + GRYW T G
Sbjct: 617 AAPAGSAPVALDMGSMGKGQIWVNGNNAGRYWSYRASGSCGGCSYAGTFSEAKCQTNCGD 676
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
SQ WYH+PRS+LKP+GNLLV+LEE G G+++ T
Sbjct: 677 ISQRWYHVPRSWLKPSGNLLVVLEEFGGDLSGVTLMT 713
>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
Length = 828
Score = 612 bits (1577), Expect = e-172, Method: Compositional matrix adjust.
Identities = 358/842 (42%), Positives = 462/842 (54%), Gaps = 97/842 (11%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G GG VTY+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 25 GVGGTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 84
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D+VRF KE+Q GLY LRIGP+I GEW YGGLP WL D+PG+ F
Sbjct: 85 GHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQF 144
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+IVN MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 145 RLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEY 204
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 205 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 262
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYD 319
+++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y
Sbjct: 263 GWFKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDY 321
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F
Sbjct: 322 DAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSACF 381
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE--- 425
+ N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 382 INNRNDNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKANMVEKEP 441
Query: 426 ---QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L
Sbjct: 442 ESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHK-GEASYTLF 500
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V++ GH L+AF+NG VG H + F LE L +G N +SLLS +GL + G
Sbjct: 501 VNTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLF 560
Query: 540 ERRVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
E+ AG+ I + D S+ SW Y+ GL GE QI D W +
Sbjct: 561 EKMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPG--CTWDNNNGTV 618
Query: 597 --HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------ 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 619 PINKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCD 678
Query: 643 --------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVS 687
LT G PSQ +YH+PRSFLK N L+L EE G P +S TV+
Sbjct: 679 YRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTLILFEEAGGDPSHVSFRTVA 738
Query: 688 VTTLC--GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFAS 744
++C V D+ + + C K IS I S
Sbjct: 739 AGSVCASAEVGDT----------------------------ITLSCGQHSKTISAINMTS 770
Query: 745 YGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDA 804
+G G C Y G C S + +ACLGK SCTV + T G C + L V A
Sbjct: 771 FGVARGQCGAYK-GGCESKAAYKAFTEACLGKESCTVQI-TNAVTGSGC--LSNVLTVQA 826
Query: 805 QC 806
C
Sbjct: 827 SC 828
>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
Length = 822
Score = 610 bits (1574), Expect = e-172, Method: Compositional matrix adjust.
Identities = 363/860 (42%), Positives = 487/860 (56%), Gaps = 117/860 (13%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G G VTYD R++ I+G RK++ SGSIHYPRSTP+MWP+LI KAKEGGL+ ++T VFWN
Sbjct: 2 GFGYEVTYDNRAIKIDGARKLILSGSIHYPRSTPEMWPQLIRKAKEGGLNTIETYVFWNA 61
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEP Q+DFSG DL+RFIK ++ +GLY LRIGP++ EW YGG P WLH++PGI R
Sbjct: 62 HEPHQRQYDFSGNLDLIRFIKTIRDEGLYAILRIGPYVCAEWNYGGFPVWLHNLPGIQIR 121
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
++NE +K M+ + T+IVNMMK +L+ASQGGPIILSQIENEYG V+ S+ ++G YV+W
Sbjct: 122 TNNEVYKNEMEIFTTLIVNMMKDGKLFASQGGPIILSQIENEYGNVQSSYGDEGKEYVKW 181
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
A LA + GVPW+MC+Q DAP P+I++CNG C + ++ N+ P IWTENWT ++Q
Sbjct: 182 CANLAESFKVGVPWIMCQQSDAPSPMIDSCNGFYCDQYYS--NNKSLPKIWTENWTGWFQ 239
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPL 323
+G + RSAED+A+ VA F ++ GS +NYYMYHGGTNFG T +T YD APL
Sbjct: 240 DWGQKNPHRSAEDVAFAVARFF-QLGGSVMNYYMYHGGTNFGTTGGGPYITASYDYDAPL 298
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI----FQGSSECAAF 379
DEYG LRQPKWGHL++LHS + + + G + N+ FI +QG C F
Sbjct: 299 DEYGNLRQPKWGHLRDLHSVLNSMEQTLTYGESKNSNYPDNNNIFITIFAYQGKRSC--F 356
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------------DSVE 425
+ D ++ T+ F Y LP S+SILPDC T +NTA + DS
Sbjct: 357 FSSIDYKDQ-TISFEGTDYFLPAWSVSILPDCFTEVYNTATVNVQTSIMENKANAADSFR 415
Query: 426 -----QWEEYKEAIP------TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS 474
QW+ E I + +L AN L++Q T SDYLW + H+ +DS
Sbjct: 416 EPNSLQWKWRPEKIRGLSLQGDFVGNTLVANELMDQKAVTNGTSDYLWIMTNYDHNMNDS 475
Query: 475 ------ESVLKVSSLGHVLHAFINGEFVG--SAHGKHSDKSFTLEKMVHLINGTNNVSLL 526
+ +L+V + GHV+HAF+NG+ VG SA + F E + L G N +SL+
Sbjct: 476 LWGAGKDIILQVHTNGHVVHAFVNGKHVGSQSASIESGRFDFVFESKIKLKRGINRISLV 535
Query: 527 SVMVGLPDSGAYLERRVAGLRN-VSIQGAKELK-------DFSSFSWGYQVGLLGEKLQI 578
SV VGL + GA + G+ ++I G +L D SS W Y+ GL GE
Sbjct: 536 SVSVGLQNYGANFDTAPTGINGPITIIGRSKLGNQPDVTVDISSNRWVYKTGLHGE---- 591
Query: 579 FTDYGSRIV-PWSRYGSST-----HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNG 632
D G + V P R T +QP WYKT F+AP G DPV ++L+ +GKG AWVNG
Sbjct: 592 --DQGFQAVRPRHRRQFYTKHVLINQPFVWYKTSFNAPLGQDPVVVDLLGLGKGTAWVNG 649
Query: 633 QSIGRYWVSFLTPQ-----------------------GTPSQSWYHIPRSFLKPTGNLLV 669
++IGR+W L P G P+Q +YHIPR +LKP N LV
Sbjct: 650 RNIGRFWPKALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRYYHIPRDWLKPEDNKLV 709
Query: 670 LLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQI 729
L EE G P +S+ TV+V +C H + H V++
Sbjct: 710 LFEELGGTPDFVSVQTVTVGKVCVHGYEGH--------------------------TVEL 743
Query: 730 RCPSGRKISKILFASYGNPNGNCENYAIGS---CHSSNSRAIVEKACLGKRSCTVPVWTE 786
C GRK SKI FAS+G P G C ++ + CH+ S IVEKAC+GK C++ + +
Sbjct: 744 SCQHGRKFSKITFASFGLPQGKCGSFTPSNNHDCHADVS-TIVEKACVGKERCSIDISEK 802
Query: 787 KFYGDPCPGIPKALLVDAQC 806
C L V+A C
Sbjct: 803 ALAPIHCDARIYRLAVEAVC 822
>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
Length = 806
Score = 610 bits (1574), Expect = e-172, Method: Compositional matrix adjust.
Identities = 340/837 (40%), Positives = 477/837 (56%), Gaps = 100/837 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +LIING R+++FSG+IHYPRST +MWP LI KAK+GGLD ++T +FW+ HEP
Sbjct: 10 VTYDSNALIINGERRLIFSGAIHYPRSTVEMWPDLIQKAKDGGLDAIETYIFWDRHEPVR 69
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
+++FSG D V+F + +Q GLY +RIGP+ EW +GG P WLH++PGI R++N
Sbjct: 70 REYNFSGNLDFVKFFQLIQKAGLYAIMRIGPYACAEWNFGGFPSWLHNMPGIELRTNNSV 129
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+K M+ + T IVN++K A+L+ASQGGPIIL+QIENEYG + ++ + G YV+WAA++A
Sbjct: 130 YKNEMQNFTTEIVNVVKEAKLFASQGGPIILAQIENEYGDIMWNYKDAGKAYVQWAAQMA 189
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ GVPW+MC+Q DAP P+IN CNG C F PN+P P I+TENW ++Q +G+
Sbjct: 190 LAQNIGVPWIMCQQQDAPQPIINTCNGYYC-HNFQ-PNNPKSPKIFTENWIGWFQKWGER 247
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
RSAED A+ VA F + G NYYMYHGGTNFGRTA Y+ T Y AP+DEYG
Sbjct: 248 VPHRSAEDSAFSVARFF-QNGGVLNNYYMYHGGTNFGRTAGGPYITTSYDYDAPIDEYGN 306
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG---------SSECAAF 379
L QPKWGHLK LH+A+KL G V N+S ++ + G S F
Sbjct: 307 LNQPKWGHLKNLHAAIKL-------GENVLTNYSARKDEDLGNGLTLTTYTNSSGARFCF 359
Query: 380 LVNKDKRN-NATVYFSNL-MYELPPLSISILPDCKTVAFNTAKLDS-------------- 423
L N + + A V N +Y +P S+SI+ C FNTAK++S
Sbjct: 360 LSNNNNTDLGARVDLKNDGVYIVPAWSVSIINGCNQEVFNTAKVNSQTSMMVKKSDNVSS 419
Query: 424 ---VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SES 476
+W+ + + SL+A LLEQ T DASDYLWY D +D S +
Sbjct: 420 TNLTWEWKVEPKRDTIHGNGSLKAQKLLEQKELTLDASDYLWY--MTSADINDTSIWSNA 477
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
L+V++ GH LH ++N +VG ++ ++ FT EK V L NGTN ++LLS VGL + G
Sbjct: 478 TLRVNTSGHSLHGYVNQRYVGYQFSQYGNQ-FTYEKQVSLKNGTNIITLLSATVGLANYG 536
Query: 537 AYLERRVAGLRN--VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
A+ + + G+ V + G + D S+ W Y++GL GE+ ++ + V W
Sbjct: 537 AWFDDKKTGISGGPVELIGKNNVTMDLSTNLWSYKIGLNGERRHLYDAQQNVSVAWHTNS 596
Query: 594 S--STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----- 646
S +PL WY+ F +P G++P+ ++L +GKG AWVNG SIGRYW S+++P
Sbjct: 597 SYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAWVNGHSIGRYWSSWISPSDGCSD 656
Query: 647 -----------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G+PSQ WYH+PRSFL N LVL EE G P + TV+
Sbjct: 657 TCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNTLVLFEEIGGNPQSVQFQTVTTG 716
Query: 690 TLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPN 749
T+C +V + + ++ C SG+ +S+I FASYGNP
Sbjct: 717 TICANVYEG--------------------------AQFELSCQSGQVMSQIQFASYGNPE 750
Query: 750 GNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C ++ G+ ++NS+++VE +C+GK +C V E F IP+ L V C
Sbjct: 751 GQCGSFKKGNFDAANSQSVVEASCVGKNNCGFNVTKEMFGVTNVSSIPR-LAVQVTC 806
>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
Length = 786
Score = 610 bits (1573), Expect = e-171, Method: Compositional matrix adjust.
Identities = 327/811 (40%), Positives = 476/811 (58%), Gaps = 103/811 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V++DGR++ I+GHR++L SGSIHYPRST +MWP LI K KEG LD ++T VFWN HEP
Sbjct: 45 VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGSLDAIETYVFWNAHEPTR 104
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
Q+DFSG DL+RF+K +Q +G+Y LRIGP++ EW YGG P WLH++PG+ FR+ N
Sbjct: 105 RQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 164
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F M+ + TMIV M+K +L+ASQGGPIIL+QIENEYG V S+ E G Y++W A +A
Sbjct: 165 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIQWCANMA 224
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L GVPW+MC+QDDAP P++N CNG C + F+ PN+P+ P +WTENWT +Y+ +G +
Sbjct: 225 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYC-DNFS-PNNPNTPKMWTENWTGWYKNWGGK 282
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+ ED+A+ VA F K +G++ NYYMYHGGTNF RTA Y+ T Y APLDE+G
Sbjct: 283 DPHRTTEDVAFAVARFFQK-EGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGN 341
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN 388
L QPK+GHLK+LH + K + G + +++F L A ++Q + F+ N ++ ++
Sbjct: 342 LNQPKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVYQTEEGSSCFIGNVNETSD 401
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------------SVEQWEEYK 431
A + F Y++P S+SILPDCKT +NTAK++ S +W
Sbjct: 402 AKINFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTSVMVKKANEAENEPSTLKWSWRP 461
Query: 432 EAIPTY-----DETSLRANFLLEQMNTTKDASDYLWY----NFRFKHDPSDSESV-LKVS 481
E I + E+++R L +Q + D SDYLWY N + + DP +++ L+++
Sbjct: 462 ENIDSVLLKGKGESTMRQ--LFDQKVVSNDESDYLWYMTTVNLK-EQDPVLGKNMSLRIN 518
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
S HVLHAF+NG+ +G+ ++ + E+ G N ++LLS+ VGLP+ GA+ E
Sbjct: 519 STAHVLHAFVNGQHIGNYRVENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAFFEN 578
Query: 542 RVAGLRN-VSIQGAKE----LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
AG+ V I G +KD S+ W Y+ GL G + Q+F S+
Sbjct: 579 FSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSGFENQLF---------------SS 623
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
P TW AP GS+PV ++L+ +GKG AW+NG +IGRYW +FL+
Sbjct: 624 ESPSTW-----SAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLSD----------- 667
Query: 657 PRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKT 716
N LVL EE G P ++ T+ V ++C +V + ++
Sbjct: 668 -----IDGDNTLVLFEEIGGNPSLVNFQTIGVGSVCANVYEKNV---------------- 706
Query: 717 HKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSS-NSRAIVEKACLG 775
+++ C +G+ IS I FAS+GNP G+C ++ G+C +S N+ AI+ + C+G
Sbjct: 707 ----------LELSC-NGKPISAIKFASFGNPGGDCGSFEKGTCEASNNAAAILTQECVG 755
Query: 776 KRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
K C++ V +KF C + K L V+A C
Sbjct: 756 KEKCSIDVSEDKFGAAECGALAKRLAVEAIC 786
>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
Precursor
gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 827
Score = 607 bits (1564), Expect = e-170, Method: Compositional matrix adjust.
Identities = 350/837 (41%), Positives = 457/837 (54%), Gaps = 92/837 (10%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T +FWN H
Sbjct: 27 GCTSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGH 86
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP Q++F G D+VRF KE+Q G+Y LRIGP+I GEW YGGLP WL D+PG+ FR
Sbjct: 87 EPHRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRL 146
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVR 203
NEPF+ M+ + T+IVN MK ++++A QGGPIIL+QIENEYG M + + + Y+
Sbjct: 147 HNEPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIH 206
Query: 204 WAAKLAVDLQTGVPWVMCKQ-DDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
W A +A GVPW+MC+Q DD P V+N CNG C + F PN P IWTENWT +
Sbjct: 207 WCADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWTGW 264
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQA 321
++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y A
Sbjct: 265 FKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDA 323
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
PLDEYG LRQPK+GHLKELHS +K K ++ G N+ + S A F+
Sbjct: 324 PLDEYGNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSACFIN 383
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-----------DSVEQ---- 426
N+ + V + LP S+SILPDCKTVAFN+AK+ ++ EQ
Sbjct: 384 NRFDDKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQES 443
Query: 427 --WEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVS 481
W E + T ++ + R N LLEQ+ T+ D SDYLWY H S L V+
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHKGEGSYK-LYVN 502
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
+ GH L+AF+NG+ +G H D F LE V L +G N +SLLS VGL + G E+
Sbjct: 503 TTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYGPSFEK 562
Query: 542 RVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
G+ I D S+ SW Y+ GL E QI D + ++
Sbjct: 563 MPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYKWNGNNGTIPINR 622
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF---------------- 642
P TWYK F+AP+G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 623 PFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHRCDYRGA 682
Query: 643 ----------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVSVTTL 691
LT G PSQ +YH+PRSFL N L+L EE G P G+++ TV +
Sbjct: 683 FQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVALRTVVPGAV 742
Query: 692 C--GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPN 749
C G D+ V + C G +S + AS+G
Sbjct: 743 CTSGEAGDA----------------------------VTLSCGGGHAVSSVDVASFGVGR 774
Query: 750 GNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C Y G C S + AC+GK SCTV + T F G C + L V A C
Sbjct: 775 GRCGGYE-GGCESKAAYEAFTAACVGKESCTVEI-TGAFAGAGC--LSGVLTVQATC 827
>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
Length = 828
Score = 606 bits (1563), Expect = e-170, Method: Compositional matrix adjust.
Identities = 357/840 (42%), Positives = 461/840 (54%), Gaps = 93/840 (11%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G GG VTY+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 25 GVGGTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 84
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D+VRF KE+Q GLY LRIGP+I GEW YGGLP WL D+PG+ F
Sbjct: 85 GHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQF 144
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+IVN MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 145 RLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEY 204
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 205 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 262
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYD 319
+++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y
Sbjct: 263 GWFKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDY 321
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F
Sbjct: 322 DAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTSACF 381
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE--- 425
+ N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 382 INNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEP 441
Query: 426 ---QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L
Sbjct: 442 ENLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDH-KGEASYTLF 500
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V++ GH L+AF+NG VG H + F LE V L +G N +SLLS +GL + G
Sbjct: 501 VNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLF 560
Query: 540 ERRVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDY-GSRIVPWSRYGSS 595
E+ AG+ I D S+ SW Y+ GL GE QI D G R W +
Sbjct: 561 EKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR---WDNNNGT 617
Query: 596 T--HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------- 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 618 VPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHC 677
Query: 643 ---------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTV 686
LT G PSQ +YH+PRSFLK N L+L EE G P + +V
Sbjct: 678 DYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSV 737
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYG 746
++C +S + TL + + IS I S+G
Sbjct: 738 VAGSVC-----------VSAEVGDAITLSCGQH--------------SKTISTIDVTSFG 772
Query: 747 NPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C Y G C S + +ACLGK SCTV + G C + L V A C
Sbjct: 773 VARGQCGAYE-GGCESKAAYKAFTEACLGKESCTVQI-INALTGSGC--LSGVLTVQASC 828
>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 785
Score = 606 bits (1562), Expect = e-170, Method: Compositional matrix adjust.
Identities = 338/749 (45%), Positives = 437/749 (58%), Gaps = 99/749 (13%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD RSL+ING R+IL SGSIHYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F+ R DLVRF+K V+ GLYV LR+GP++ EW +GG P WL VPGI FR+DN P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK+ L+ QGGPII++Q+ENE+G +E G PY WAA++A
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PN+ KP +WTE WT ++ +G
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYC--DYFTPNNKHKPTMWTEAWTGWFTKFGGA 277
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEY-- 326
A R ED+A+ VA F+ K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DE+
Sbjct: 278 APHRPVEDLAFAVARFVQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGM 336
Query: 327 -----------------------------------------------GLLRQPKWGHLKE 339
GLLRQPKWGHL+
Sbjct: 337 QWLLPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRN 396
Query: 340 LHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMY 398
+H A+K ++SG + ++A++F+ + CAAFL N ++ + F Y
Sbjct: 397 MHRAIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHY 456
Query: 399 ELPPLSISILPDCKTVAFNTA---------KLDSVEQ---WEEYKEAIPTYDETSLRANF 446
+LP SISILPDCKT FNTA K+ V W+ Y E + D+++ +
Sbjct: 457 DLPAWSISILPDCKTAVFNTATVKEPTLLPKMSPVMHRFAWQSYSEDTNSLDDSAFARDG 516
Query: 447 LLEQMNTTKDASDYLWY--------NFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGS 498
L+EQ++ T D SDYLWY N RF S L V S GH + F+NG GS
Sbjct: 517 LIEQLSLTWDKSDYLWYTTHVNIGSNERFLK--SGQWPQLSVYSAGHSMQVFVNGRSYGS 574
Query: 499 AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVSIQGAKEL 557
+G + + T V + G+N +S+LS VGLP++G + E V L V++ G E
Sbjct: 575 VYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTLSGLNEG 634
Query: 558 K-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPV 616
K D S W YQVGL GE L + T GS V W+ G T QPLTW+K +F+AP GSDPV
Sbjct: 635 KRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGGGT-QPLTWHKALFNAPAGSDPV 693
Query: 617 AINLISMGKGEAWVNGQSIGRYWV---------------SFLTPQ-----GTPSQSWYHI 656
A+++ SMGKG+ WVNG+ GRYW ++ Q G SQ WYH+
Sbjct: 694 ALDMGSMGKGQVWVNGRHAGRYWSYRAHSRGCGRCSYAGTYREDQCTSNCGDLSQRWYHV 753
Query: 657 PRSFLKPTGNLLVLLEEENGYPPGISIDT 685
PRS+LKP+GNLLV+LEE G G+S+ T
Sbjct: 754 PRSWLKPSGNLLVVLEEYGGDLAGVSLAT 782
>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
Length = 828
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 352/834 (42%), Positives = 464/834 (55%), Gaps = 93/834 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YDGRSLI++G R+I+ SGSIHYPRSTP+MWP LI KAKEGGL+ ++T VFWN HEP+
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
+F+F G D+VRF KE+Q G+Y LRIGP+I GEW YGGLP WL D+PGI FR N+P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAK 207
F+ M+ + T+IV MK A ++A QGGPIIL+QIENEYG M++ ++ Y+ W A
Sbjct: 151 FENEMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 208 LAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+A GVPW+MC+QD D P V+N CNG C E F+ N P +WTENWT +Y+ +
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFS--NRTSIPKMWTENWTGWYRDW 268
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDE 325
R EDIA+ VA+F +M+GS NYYMYHGGTNFGRTA Y+ T Y APLDE
Sbjct: 269 DQPEFRRPTEDIAFAVAMFF-QMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDE 327
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
YG LRQPK+GHLKELHS + K +L G + N+ + ++ A F+ N+
Sbjct: 328 YGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSACFINNRFD 387
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VEQWEEYKE-- 432
+ V + LP S+SILPDCKTVAFN+AK+ + VEQ E+ +
Sbjct: 388 DRDVNVTLDGTTHFLPAWSVSILPDCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWS 447
Query: 433 -------AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGH 485
T ++ + R N LLEQ+ TT D SDYLWY +H + VL V++ GH
Sbjct: 448 WMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHK-GEGSYVLYVNTTGH 506
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
L+AF+NG+ VG + + + +F L+ V L +G N +SLLS VGL + G E AG
Sbjct: 507 ELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGGSFELLPAG 566
Query: 546 LRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS--THQPL 600
+ I + D S+ SW Y+ GL GE +I+ D W + S+ ++P
Sbjct: 567 IVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDKPGN--KWRSHNSTIPINRPF 624
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------ 642
TWYKT F AP G D V ++L + KG AWVNG S+GRYW S+
Sbjct: 625 TWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCHHCDYRGVFK 684
Query: 643 --------LTPQGTPSQSWYHIPRSFL-KPTGNLLVLLEEENGYPPGISIDTVSVTTLCG 693
LT G PSQ YH+PRSFL K N L+L EE G P +++ TV ++C
Sbjct: 685 AEVEAQKCLTGCGEPSQQLYHVPRSFLHKGEPNTLILFEEAGGDPSEVAVRTVVEGSVCA 744
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS-GRKISKILFASYGNPNGNC 752
V + C + GR IS + AS+G G C
Sbjct: 745 SAELGD--------------------------TVTLSCGAHGRTISSVDVASFGVARGRC 778
Query: 753 ENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+Y G C S + AC+GK SCTV V T+ F C + L V A C
Sbjct: 779 GSYD-GGCDSKVAYDAFAAACVGKESCTVLV-TDAFANAGC--VSGVLTVQATC 828
>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
Length = 828
Score = 603 bits (1555), Expect = e-169, Method: Compositional matrix adjust.
Identities = 341/838 (40%), Positives = 471/838 (56%), Gaps = 95/838 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V YD +LIING R+++FSG+IHYPRST MWP L+ KAK+GGLD ++T +FW+ HE
Sbjct: 25 VKYDSNALIINGERRLIFSGAIHYPRSTVDMWPDLVQKAKDGGLDAIETYIFWDRHEQVR 84
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+++FSG D V+F K +Q GLY +RIGP+ EW YGG P WLH +PGI R+DN
Sbjct: 85 GRYNFSGNLDFVKFFKTIQEAGLYGIIRIGPYSCAEWNYGGFPVWLHQIPGIEMRTDNAA 144
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+K M+ + T I+N+ K A L+ASQGGPIIL+QIENEYG + +F E G Y++WAA++A
Sbjct: 145 YKNEMQIFVTKIINVAKEANLFASQGGPIILAQIENEYGDIMWNFKEPGKAYIKWAAQMA 204
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ GVPW MC+Q+DAP P+IN CNG C F PN+P P ++TENW ++Q +G+
Sbjct: 205 LAQNIGVPWFMCQQNDAPQPIINTCNGYYC-HNFK-PNNPKSPKMFTENWIGWFQKWGER 262
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
A R+AED AY VA F + G + NYYMYHGGTNFGRT+ Y++T Y AP++EYG
Sbjct: 263 APHRTAEDSAYAVARFF-QNGGVFNNYYMYHGGTNFGRTSGGPYIITSYDYDAPINEYGN 321
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAA----FLVNKD 384
L QPK+GHLK LH A+KL K + + S N L + FL N
Sbjct: 322 LNQPKYGHLKFLHEAIKLGEKVLTN--YTSRNDKDLGNGITLTTYTNSVGARFCFLSNDK 379
Query: 385 KRNNATVYFSNL-MYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDET--- 440
+ V N Y +P S++IL C FNTAK++S E K + ++
Sbjct: 380 DNTDGNVDLQNDGKYFVPAWSVTILDGCNKEVFNTAKVNSQTSIMEKKIDNSSTNKLTWA 439
Query: 441 --------------SLRANFLLEQMNTTKDASDYLWYNFRFK-HDPSD-SESVLKVSSLG 484
S++A+ LLEQ T DASDYLWY +D S+ S + L V + G
Sbjct: 440 WIMEPKKDTMNGRGSIKAHQLLEQKELTLDASDYLWYMTSVDINDTSNWSNANLHVETSG 499
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H LH ++N ++G H + + +FT EK V L NGTN ++LLS VGL + GA +
Sbjct: 500 HTLHGYVNKRYIGYGHSQFGN-NFTYEKQVSLKNGTNIITLLSATVGLANYGARFDEIKT 558
Query: 545 GLRN--VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
G+ + V + G + D S+ +W ++VGL GEK + + V W+ T +PLT
Sbjct: 559 GISDGPVKLVGQNSVTIDLSTGNWSFKVGLNGEKRRFYDLQPRSGVAWNTSSYPTGKPLT 618
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQG-------------- 647
WYKT F +P G +P+ ++L +GKG AWVNG+SIGRYW S++T
Sbjct: 619 WYKTQFKSPLGPNPIVVDLQGLGKGHAWVNGKSIGRYWTSWITSTAGCSDTCDYRGNYKK 678
Query: 648 --------TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSH 699
+PSQ WYH+PRSFL N L+L EE G P +S T + T+C +V +
Sbjct: 679 EKCNTGCASPSQRWYHVPRSFLNDDMNTLILFEEIGGNPQNVSFLTETTKTICANVYEG- 737
Query: 700 LPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGS 759
K+++ C +G+ I+ I FAS+GNP G C ++ GS
Sbjct: 738 -------------------------GKLELSCQNGQVITSINFASFGNPQGQCGSFKKGS 772
Query: 760 CHSSNSRAIVEKACLGKRSCTVPVWTEKFYG---DPCP--------GIPKALLVDAQC 806
S NS++++E +C+GK C V T +G DP GIP+ L V A C
Sbjct: 773 WESLNSQSMMETSCIGKTGCGFTV-TRDMFGVNLDPLSASKASVKDGIPR-LAVQATC 828
>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
Length = 824
Score = 603 bits (1554), Expect = e-169, Method: Compositional matrix adjust.
Identities = 344/829 (41%), Positives = 459/829 (55%), Gaps = 85/829 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V YD ++IING RKI+ SGSIHYPRST +MW LI KAKEGGLD ++T +FWN HE +
Sbjct: 30 VEYDSSAVIINGQRKIILSGSIHYPRSTVEMWSDLIQKAKEGGLDTIETYIFWNAHERRR 89
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
+++F+G D V+F ++VQ GLY LRIGP+ EW YGG P WLH++P I FR+DNE
Sbjct: 90 REYNFTGNLDFVKFFQKVQEAGLYGILRIGPYACAEWNYGGFPVWLHNIPEIKFRTDNEI 149
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + T IVNM K A+L+ASQGGPIIL+QIENEYG V + E G YV+W A++A
Sbjct: 150 FKNEMQTFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNVMGPYGEAGKSYVQWCAQMA 209
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPW+MC+Q DAP VIN CNG C +TF PNSP P +WTENWT +Y+ +G +
Sbjct: 210 VAQNIGVPWIMCQQSDAPSSVINTCNGFYC-DTFT-PNSPKSPKMWTENWTGWYKKWGQK 267
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+AED+A+ VA F + G NYYMY+GGTNFGRT+ ++ T Y APLDEYG
Sbjct: 268 DPHRTAEDLAFSVARFF-QYNGVLQNYYMYYGGTNFGRTSGGPFIATSYDYDAPLDEYGN 326
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK---LQEAFIFQGSSECAAFLVNKDK 385
L QPKWGHLK LH+A+KL K + + + + +S + E FL N
Sbjct: 327 LNQPKWGHLKNLHAALKLGEKILTNSTVKTTKYSDGWVELTTYTSNIDGERLCFLSNTKM 386
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------------QW 427
+ Y +P S+SIL DC +NTAK++ +W
Sbjct: 387 DGLDVDLQQDGKYFVPAWSVSILQDCNKETYNTAKVNVQTSLIVKKLHENDTPLKLSWEW 446
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV-LKVSSLGHV 486
P + + +A LLEQ T D SDYLWY ++ + S++V L+V G
Sbjct: 447 APEPTKAPLHGQGGFKATQLLEQKAATYDESDYLWYMTSVDNNGTASKNVTLRVKYSGQF 506
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLE---RRV 543
LHAF+NG+ +GS HG +FT EK L GTN +SLLS VGL + G + + +
Sbjct: 507 LHAFVNGKEIGSQHG----YTFTFEKPALLKPGTNIISLLSATVGLQNYGEFFDEGPEGI 562
Query: 544 AGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWY 603
AG I D SS W Y+VGL GE + F D S W + +TWY
Sbjct: 563 AGGPVELIDSGNTTTDLSSNEWSYKVGLNGEGGR-FYDPTSGRAKWVSGNLRVGRAMTWY 621
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------- 642
KT F AP+G++PV ++L MGKG AWVNG S+GR+W
Sbjct: 622 KTTFQAPSGTEPVVVDLQGMGKGHAWVNGNSLGRFWPILTADPNGCDGKCDYRGQYKEGK 681
Query: 643 -LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
L+ G P+Q WYH+PRSFL N L+L EE G P +S + T+CG+ +
Sbjct: 682 CLSNCGNPTQRWYHVPRSFLNNGSNTLILFEEIGGNPSDVSFQITATETICGNTYEG--- 738
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFASYGNPNG-NCENYAIGS 759
+++ C GR+ IS I +AS+G+P G +C ++ GS
Sbjct: 739 -----------------------TTLELSCNGGRRIISDIQYASFGDPQGSSCGSFQRGS 775
Query: 760 CHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIP-KALLVDAQCT 807
+S S + VEKAC+GK SC++ V F + G+ L+V A CT
Sbjct: 776 VEASRSFSAVEKACMGKESCSINVSKATFGVEDSFGVDNNRLVVQAVCT 824
>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
Length = 828
Score = 602 bits (1553), Expect = e-169, Method: Compositional matrix adjust.
Identities = 353/836 (42%), Positives = 467/836 (55%), Gaps = 97/836 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YDGRSLI++G R+I+ SGSIHYPRSTP+MWP LI KAKEGGL+ ++T VFWN HEP+
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
+F+F G D+VRF KE+Q G+Y LRIGP+I GEW YGGLP WL D+PGI FR N+P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAK 207
F+ M+ + T+IV MK A ++A QGGPIIL+QIENEYG M++ ++ Y+ W A
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 208 LAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+A GVPW+MC+QD D P V+N CNG C E F+ N P +WTENWT +Y+ +
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFS--NRTSIPKMWTENWTGWYRDW 268
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDE 325
R EDIA+ VA+F +M+GS NYYMYHGGTNFGRTA Y+ T Y APLDE
Sbjct: 269 DQPEFRRPTEDIAFAVAMFF-QMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDE 327
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
YG LRQPK+GHLKELHS + K +L G + N+ + ++ A F+ N+
Sbjct: 328 YGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSACFINNRFD 387
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VEQWEEYKE-- 432
+ V + LP S+SILP+CKTVAFN+AK+ + VEQ E+ +
Sbjct: 388 DRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWS 447
Query: 433 -------AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGH 485
T ++ + R N LLEQ+ TT D SDYLWY +H + VL V++ GH
Sbjct: 448 WMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHK-GEGSYVLYVNTTGH 506
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
L+AF+NG+ VG + + + +F L+ V L +G N +SLLS VGL + G E AG
Sbjct: 507 ELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGGSFELLPAG 566
Query: 546 LRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS--THQPL 600
+ I + D S+ SW Y+ GL GE +I+ D W + S+ ++P
Sbjct: 567 IVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDKPGN--KWRSHNSTIPINRPF 624
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------ 642
TWYKT F AP G D V ++L + KG AWVNG S+GRYW S+
Sbjct: 625 TWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCHHCDYRGVFK 684
Query: 643 --------LTPQGTPSQSWYHIPRSFL-KPTGNLLVLLEEENGYPPGISIDTVSVTTLC- 692
LT G PSQ YH+PRSFL K N L+L EE G P +++ TV ++C
Sbjct: 685 AEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVRTVVEGSVCA 744
Query: 693 -GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS-GRKISKILFASYGNPNG 750
V D+ V + C + GR IS + AS+G G
Sbjct: 745 SAEVGDT----------------------------VTLSCGAHGRTISSVDVASFGVARG 776
Query: 751 NCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C +Y G C S + AC+GK SCTV V T+ F C + L V A C
Sbjct: 777 RCGSYD-GGCESKVAYDAFAAACVGKESCTVLV-TDAFANAGC--VSGVLTVQATC 828
>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
Length = 824
Score = 602 bits (1552), Expect = e-169, Method: Compositional matrix adjust.
Identities = 353/840 (42%), Positives = 459/840 (54%), Gaps = 93/840 (11%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G GG V Y+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 21 GVGGTTVAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 80
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D++RF KE+Q GLY LRIGP+I GEW YGGLP WL D+P + F
Sbjct: 81 GHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQF 140
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+I+N MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 141 RMHNAPFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEY 200
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 201 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 258
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYD 319
+++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y
Sbjct: 259 GWFKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDY 317
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F
Sbjct: 318 DAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTSACF 377
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE--- 425
+ N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 378 INNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEP 437
Query: 426 ---QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L
Sbjct: 438 ENLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDH-KGEASYTLF 496
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V++ GH L+AF+NG VG H + F LE V L +G N +SLLS +GL + G
Sbjct: 497 VNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLF 556
Query: 540 ERRVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDY-GSRIVPWSRYGSS 595
E+ AG+ I D S+ SW Y+ GL GE QI D G R W +
Sbjct: 557 EKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR---WDNNNGT 613
Query: 596 T--HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------- 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 614 VPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHC 673
Query: 643 ---------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTV 686
LT G PSQ +YH+PRSFLK N L+L EE G P + +V
Sbjct: 674 DYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSV 733
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYG 746
++C +S + TL + + IS I S+G
Sbjct: 734 VAGSVC-----------VSAEVGDAITLSCGQH--------------SKTISTIDVTSFG 768
Query: 747 NPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C Y G C S + +ACLGK SCTV + G C + L V A C
Sbjct: 769 VARGQCGAYE-GGCESKAAYKAFTEACLGKESCTVQI-INALTGSGC--LSGVLTVQASC 824
>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
Length = 730
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 328/726 (45%), Positives = 437/726 (60%), Gaps = 59/726 (8%)
Query: 129 GGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG 188
GG P WL VPGI FR+DN PFK M+ + IV M+K+ L+ASQGGPIILSQIENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEYG 60
Query: 189 MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS 248
+ G Y+ WAAK+AV L TGVPWVMCK+DDAPDPVINACNG C + F+ PN
Sbjct: 61 PESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYC-DGFS-PNK 118
Query: 249 PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRT 308
P KP +WTE W+ ++ +G R +D+A+ VA FI K GSY NYYMYHGGTNFGRT
Sbjct: 119 PYKPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQK-GGSYFNYYMYHGGTNFGRT 177
Query: 309 ASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
A +T YD AP+DEYGL R+PK+ HLKELH A+KL ++S + ++A
Sbjct: 178 AGGPFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSAGPTITSLGTYEQA 237
Query: 368 FIFQ-GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL----- 421
+I+ G +CAAFL N + ++ A V F+N Y LPP SISILPDC+ VA+NTA +
Sbjct: 238 YIYNSGPRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALVGVQTS 297
Query: 422 --------DSVEQWEEYKEAIPTYDETS-LRANFLLEQMNTTKDASDYLWYNFRFKHDPS 472
S+ WE Y E I + DE + + A LLEQ+N T+D SDYLWY D S
Sbjct: 298 HVHMLPTGTSLLSWETYDEVISSLDERARMTAVGLLEQINVTRDTSDYLWYMTSV--DIS 355
Query: 473 DSESVLK--------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
SES L+ V S GH + FING+F GSA G + FT V+L G+N +S
Sbjct: 356 SSESFLRGGQKPTLNVQSAGHAVRVFINGQFSGSAFGTREHRQFTFTGPVNLRAGSNKIS 415
Query: 525 LLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDY 582
LLS+ VGLP+ G + E G L V + G K D + W YQVGL GE + + T
Sbjct: 416 LLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGKRDLTWQKWSYQVGLKGEAMNLVTPE 475
Query: 583 GSRIVPWSR--YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV 640
G+ W R + + QPLTWYK F+AP G++P+A++L SMGKG+ +NGQSIGRYW
Sbjct: 476 GASSADWVRGSLAARSVQPLTWYKAYFNAPNGNEPLALDLRSMGKGQVRINGQSIGRYWT 535
Query: 641 SFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGI 681
++ +P+Q WYH+PRS+LKP NLLV+ EE G I
Sbjct: 536 AYAKGDCEACSYTGHSGRQNVNLVVASPTQRWYHVPRSWLKPKQNLLVIFEELGGDASKI 595
Query: 682 SIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKIL 741
++ S+T +C + ++H P + + + +Q K + V ++C G+ IS I
Sbjct: 596 ALLRRSLTNVCANAFENH-PSMAKYSTSSQDGSKV------KEATVNLQCGPGQSISAIE 648
Query: 742 FASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALL 801
FAS+G P+G C ++ IG+CH+ NSR+I+EK C+G++SC+V + F DPCP + K L
Sbjct: 649 FASFGTPSGTCGSFHIGTCHAPNSRSIIEKKCVGQKSCSVTISNSIFGADPCPNVLKRLT 708
Query: 802 VDAQCT 807
V+A C+
Sbjct: 709 VEAVCS 714
>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 824
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 351/840 (41%), Positives = 459/840 (54%), Gaps = 93/840 (11%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G G V Y+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 21 GVGCTTVAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 80
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D++RF KE+Q GLY LRIGP+I GEW YGGLP WL D+P + F
Sbjct: 81 GHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQF 140
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+I+N MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 141 RMHNAPFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEY 200
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 201 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 258
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYD 319
+++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y
Sbjct: 259 GWFKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDY 317
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F
Sbjct: 318 DAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTSACF 377
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE--- 425
+ N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 378 INNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEP 437
Query: 426 ---QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L
Sbjct: 438 ESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDH-KGEASYTLF 496
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V++ GH L+AF+NG VG H + F LE V L +G N +SLLS +GL + G
Sbjct: 497 VNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLF 556
Query: 540 ERRVAGLRNVSIQGAKELK---DFSSFSWGYQVGLLGEKLQIFTDY-GSRIVPWSRYGSS 595
E+ AG+ ++ D S+ SW Y+ GL GE QI D G R W +
Sbjct: 557 EKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR---WDNNNGT 613
Query: 596 T--HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------- 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 614 VPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHC 673
Query: 643 ---------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTV 686
LT G PSQ +YH+PRSFLK N L+L EE G P + +V
Sbjct: 674 DYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSV 733
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYG 746
++C +S + TL + + IS I S+G
Sbjct: 734 VAGSVC-----------VSAEVGDAITLSCGQH--------------SKTISTIDVTSFG 768
Query: 747 NPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C Y G C S + +ACLGK SCTV + G C + L V A C
Sbjct: 769 VARGQCGAYE-GGCESKAAYKAFTEACLGKESCTVQI-INALTGSGC--LSGVLTVQASC 824
>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
Precursor
gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
Length = 828
Score = 599 bits (1544), Expect = e-168, Method: Compositional matrix adjust.
Identities = 351/840 (41%), Positives = 459/840 (54%), Gaps = 93/840 (11%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G G V Y+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 25 GVGCTTVAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 84
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D++RF KE+Q GLY LRIGP+I GEW YGGLP WL D+P + F
Sbjct: 85 GHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQF 144
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+I+N MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 145 RMHNAPFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEY 204
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 205 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 262
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYD 319
+++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y
Sbjct: 263 GWFKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDY 321
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F
Sbjct: 322 DAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTSACF 381
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE--- 425
+ N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 382 INNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEP 441
Query: 426 ---QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L
Sbjct: 442 ESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDH-KGEASYTLF 500
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V++ GH L+AF+NG VG H + F LE V L +G N +SLLS +GL + G
Sbjct: 501 VNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLF 560
Query: 540 ERRVAGLRNVSIQGAKELK---DFSSFSWGYQVGLLGEKLQIFTDY-GSRIVPWSRYGSS 595
E+ AG+ ++ D S+ SW Y+ GL GE QI D G R W +
Sbjct: 561 EKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR---WDNNNGT 617
Query: 596 T--HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------- 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 618 VPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHC 677
Query: 643 ---------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTV 686
LT G PSQ +YH+PRSFLK N L+L EE G P + +V
Sbjct: 678 DYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSV 737
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYG 746
++C +S + TL + + IS I S+G
Sbjct: 738 VAGSVC-----------VSAEVGDAITLSCGQH--------------SKTISTIDVTSFG 772
Query: 747 NPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C Y G C S + +ACLGK SCTV + G C + L V A C
Sbjct: 773 VARGQCGAYE-GGCESKAAYKAFTEACLGKESCTVQI-INALTGSGC--LSGVLTVQASC 828
>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
Length = 829
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 346/864 (40%), Positives = 470/864 (54%), Gaps = 98/864 (11%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M + L + L+ +G ++ V Y+ R+L+I+G R+I+ SGSIHYPRSTP+M
Sbjct: 6 MARASLALVLLLITAAVGAANC-----TTVAYNDRALVIDGQRRIVLSGSIHYPRSTPEM 60
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI KAKEGGLD ++T VFWN HEP+P Q++F+G D+VRF KE+Q G+Y LRIGP
Sbjct: 61 WPDLIKKAKEGGLDAIETYVFWNGHEPRPRQYNFAGNYDIVRFFKEIQNAGMYAILRIGP 120
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
+I GEW YGGLP WL D+PG+ FR N+PF+ M+ + T+IVN +K A ++A QGGPIIL
Sbjct: 121 YICGEWNYGGLPAWLRDIPGMQFRMHNQPFEHEMETFTTLIVNKLKDANMFAGQGGPIIL 180
Query: 181 SQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGR 237
SQIENEYG M + + Y+ W A +A GVPW+MC+QD D P VIN CNG
Sbjct: 181 SQIENEYGNIMANLTDAQSASEYIHWCAAMANKQNVGVPWIMCQQDADVPPNVINTCNGF 240
Query: 238 QCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYY 297
C + F P D P IWTENWT +++ + RSA+DIA+ VA+F K +GS NYY
Sbjct: 241 YCHDWF--PKRTDIPKIWTENWTGWFKAWDKPDFHRSAQDIAFAVAMFFQK-RGSLQNYY 297
Query: 298 MYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVL 356
MYHGGTNFGRTA Y+ T Y APLDEYG +R+PK+GHLK+LH+ +K K ++ G
Sbjct: 298 MYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNIREPKYGHLKDLHAVLKSMEKILVHGDF 357
Query: 357 VSMNFSK--LQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTV 414
+N+ + + GSS C F+ N+ +A + +P S+S+LPDCK V
Sbjct: 358 SDINYGRNVTVTKYTLDGSSVC--FISNQFDDRDANATIDGTTHVVPAWSVSVLPDCKAV 415
Query: 415 AFNTAKL-----------DSVEQ------WE---EYKEAIPTYDETSLRANFLLEQMNTT 454
A+NTAK+ ++VEQ W E+ + T ++ S R N LLEQ+ T+
Sbjct: 416 AYNTAKIKAQTSVMVKKPNTVEQEPENLKWSWMPEHLKPFMTDEKGSFRKNELLEQITTS 475
Query: 455 KDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV 514
D SDYLWY F+H +++ L V++ GH ++AF+NG+ G H + F LE V
Sbjct: 476 TDQSDYLWYRTSFEHK-GEAKYKLSVNTTGHQIYAFVNGKLAGRQHSPNGAFIFQLESPV 534
Query: 515 HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ---GAKELKDFSSFSWGYQVGL 571
L +G N +SLLS +GL + GA E AG+ ++ D S+ SW Y+ GL
Sbjct: 535 KLHDGKNYLSLLSATMGLKNYGALFELMPAGIVGGPVKLVDNNGSTIDLSNSSWSYKAGL 594
Query: 572 LGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVN 631
GE QI D ++ TWYK F AP G + V +L+ + KG AWVN
Sbjct: 595 AGEHRQIHLDKPGYKWHGDNGTIPINRAFTWYKATFQAPAGEEAVVADLMGLNKGVAWVN 654
Query: 632 GQSIGRYWVSF--------------------------LTPQGTPSQSWYHIPRSFLKP-T 664
G ++GRYW S+ LT P+Q +YH+PR FL+
Sbjct: 655 GNNLGRYWPSYVAAEMGGCHHCDYRGAFKAEGDGLKCLTGCNEPAQRFYHVPRVFLRAGE 714
Query: 665 GNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRR 724
N +VL EE G P + TV+V +C ++ +
Sbjct: 715 PNTVVLFEEAGGDPSRVGFHTVAVGPVCVEAAE-------------------------KG 749
Query: 725 PKVQIRC--PSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVP 782
V + C GR IS + ASYG G C Y G C S + +AC+GK SCTV
Sbjct: 750 DNVTLSCGQHKGRTISSVDLASYGVTRGQCGAYQ-GGCESKAAYEAFAEACVGKESCTVQ 808
Query: 783 VWTEKFYGDPCPGIPKALLVDAQC 806
T+ F G C L V A C
Sbjct: 809 -HTDAFSGAGCQS--GVLTVQATC 829
>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
Length = 824
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 352/840 (41%), Positives = 458/840 (54%), Gaps = 93/840 (11%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G G V Y+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 21 GVGCTTVAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 80
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D++RF KE+Q GLY LRIGP+I GEW YGGLP WL D+P + F
Sbjct: 81 GHEPHRRQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQF 140
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+I+N MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 141 RMHNAPFENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEY 200
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 201 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 258
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYD 319
+++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y
Sbjct: 259 GWFKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDY 317
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F
Sbjct: 318 DAPLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTSACF 377
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE--- 425
+ N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 378 INNRNDNKDLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTTIMVKKANMVEKEP 437
Query: 426 ---QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L
Sbjct: 438 ESLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDH-KGEASYTLF 496
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V++ GH L+AF+NG VG H + F LE V L +G N +SLLS +GL + G
Sbjct: 497 VNTTGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLF 556
Query: 540 ERRVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDY-GSRIVPWSRYGSS 595
E+ AG+ I D S+ SW Y+ GL GE QI D G R W +
Sbjct: 557 EKMPAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR---WDNNNGT 613
Query: 596 T--HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------- 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 614 VPINRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHC 673
Query: 643 ---------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTV 686
LT G PSQ +YH+PRSFLK N L+L EE G P + +V
Sbjct: 674 DYRGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSV 733
Query: 687 SVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYG 746
++C +S + TL + + IS I S+G
Sbjct: 734 VAGSVC-----------VSAEVGDAITLSCGQH--------------SKTISTIDVTSFG 768
Query: 747 NPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C Y G C S + +ACLGK SCTV + G G+ L V A C
Sbjct: 769 VARGQCGAYE-GGCESKAAYKAFTEACLGKESCTVQI-INALTGS--GGLSGVLTVQASC 824
>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
Length = 831
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 344/844 (40%), Positives = 461/844 (54%), Gaps = 99/844 (11%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G V+YD R+L+I+G R+I+ SGSIHYPRSTP+MWP LI KAK+GGL+ ++T VFWN
Sbjct: 27 GASCTEVSYDERALVIDGQRRIILSGSIHYPRSTPEMWPDLIQKAKDGGLNTIETYVFWN 86
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP+P Q++F G D++RF KEVQ G+Y LRIGP+I GEW YGGLP WL D+P + F
Sbjct: 87 GHEPRPRQYNFEGNYDIMRFFKEVQKAGMYAILRIGPYICGEWNYGGLPAWLRDIPDMQF 146
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSF--LEKGPPY 201
R NEPF+ M+ + T+IVN MK A ++A QGGPIIL+QIENEYG V+ + E Y
Sbjct: 147 RLHNEPFEREMETFTTLIVNKMKDANMFAGQGGPIILTQIENEYGNVQSNLPDQESATKY 206
Query: 202 VRWAAKLAVDLQTGVPWVMCKQ-DDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+Q +D P VI CNG C + P + P IWTENWT
Sbjct: 207 IHWCADMANKQNVGVPWIMCQQSNDVPPNVIETCNGFYCHD--FKPKGSNMPKIWTENWT 264
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYD 319
+++ + R AED+AY VA+F + +GS NYYMYHGGTNFGRT+ Y+ T Y
Sbjct: 265 GWFKAWDKPDYHRPAEDVAYAVAMFF-QNRGSVQNYYMYHGGTNFGRTSGGPYITTTYDY 323
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIF---QGSSEC 376
APLDEYG +RQPK+GHLK LH+ + K ++ G N +A + GSS C
Sbjct: 324 DAPLDEYGNIRQPKYGHLKALHTVLTSMEKHLVYGQQNETNLDDKVKATKYTLDDGSSAC 383
Query: 377 AAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAI-- 434
F+ N + V F Y++P S+S+LPDCKTVA+NTAK+ + KE+
Sbjct: 384 --FISNSHDNKDVNVTFEGSAYQVPAWSVSVLPDCKTVAYNTAKVKTQTSVMVKKESAAK 441
Query: 435 -------------PTYDET--SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
P++ ++ S ++N LLEQ+ T D SDYLWY P + + L
Sbjct: 442 GGLKWSWLPEFLRPSFTDSYGSFKSNELLEQIVTGADESDYLWYKTSLTRGPKE-QFTLY 500
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
V++ GH L+AF+NGE G H + F E V L G N +SLLS VGL + GA
Sbjct: 501 VNTTGHELYAFVNGELAGYKHAVNGPYLFQFEAPVTLKPGKNYISLLSATVGLKNYGASF 560
Query: 540 ERRVAGL-----RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDY-GSRIVPWSRYG 593
E AG+ + VS G D S+ +W Y+ GL GE+ QI D G R WS +
Sbjct: 561 ELMPAGIVGGPVKLVSAHG--NTIDLSNNTWTYKTGLFGEQKQIHLDKPGLR---WSPFA 615
Query: 594 SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------- 642
T++P TWYK F AP G++ V ++L+ + KG +VNG ++GRYW S+
Sbjct: 616 VPTNRPFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNGHNLGRYWPSYVAGDMDGCHRC 675
Query: 643 ---------------LTPQGTPSQSWYHIPRSFLKPTG---NLLVLLEEENGYPPGISID 684
LT G Q +YH+PRSFL N +VL EE G P ++
Sbjct: 676 DYRGEYVTWNNQEKCLTGCGEVGQRFYHVPRSFLNAAHGAPNTVVLFEEAGGDPAKVNFR 735
Query: 685 TVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFAS 744
TV+V +C + V + C GR IS + AS
Sbjct: 736 TVAVGPVCADAE--------------------------KGDAVTLACAHGRTISSVDTAS 769
Query: 745 YGNPNGNCENYAIGS-CHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVD 803
+G G C Y GS C S + + AC+GK+ CTV +T+ F C G L V
Sbjct: 770 FGVSGGQCGAYEGGSGCESKPALEAITAACVGKKWCTVS-YTDAFDSADCKG-SGVLTVQ 827
Query: 804 AQCT 807
A C+
Sbjct: 828 ATCS 831
>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
sativus]
Length = 803
Score = 590 bits (1520), Expect = e-165, Method: Compositional matrix adjust.
Identities = 324/815 (39%), Positives = 457/815 (56%), Gaps = 102/815 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGRSL ING RKI+ SG+IHYPRS+P MWP L+ KAK GGL+ ++T VFWN HEPQ
Sbjct: 16 VTYDGRSLKINGERKIIISGAIHYPRSSPGMWPMLMKKAKNGGLNAIETYVFWNAHEPQR 75
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+DFSG DLV+FIK VQ + LY LRIGP++ EW YGG P WLH++PGI FR++N+
Sbjct: 76 GQYDFSGNNDLVQFIKAVQKERLYAILRIGPYVCAEWNYGGFPVWLHNLPGIKFRTNNQV 135
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+K + + N+ K ++ + + IENE+G VE S+ ++G YV+W A+LA
Sbjct: 136 YKVTFX-FFFLTKNLKKINNMF-------LKNXIENEFGNVEGSYGQEGKEYVKWCAELA 187
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
PW+MC+Q DAP P++ C+ + PN+ + P +WTE+W +++ +G+
Sbjct: 188 QSYNLSEPWIMCQQGDAPQPIVCNCDQFK-------PNNKNSPKMWTESWAGWFKGWGER 240
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+AED+A+ VA F + GS NYYMYHGGTNFGR+A Y+ T Y APLDEYG
Sbjct: 241 DPYRTAEDLAFAVARFF-QYGGSLHNYYMYHGGTNFGRSAGGPYITTSYDYNAPLDEYGN 299
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVL--VSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
+ QPKWGHLK+LH ++ K + G + + S ++ ++G S C F N +
Sbjct: 300 MNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHSTTATSYTYKGKSSC--FFGNPE-N 356
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-------------------QW 427
++ + F Y +P S+++LPDCKT +NTAK+++ QW
Sbjct: 357 SDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTTIREMVPSLVGKHKKPLKWQW 416
Query: 428 EEYKEAIPTYD----ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLK 479
K T++ +++ AN L++Q T D+SDYLWY F + +D L+
Sbjct: 417 RNEKIEHLTHEGDISGSAITANSLIDQKMVTNDSSDYLWYLTGFHLNGNDPLFGKRVTLR 476
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV-HLINGTNNVSLLSVMVGLPDSGAY 538
V + GH+LHAF+N + +G+ G + SFTLEK V +L +G N ++LLS VGLP+ GAY
Sbjct: 477 VKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRNLRHGFNQIALLSATVGLPNYGAY 536
Query: 539 LERRVAGLRNVS--IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
E G+ I K ++D S+ W Y+VGL GEK + F PW
Sbjct: 537 YENVEVGIYGPVELIADGKTIRDLSTNEWIYKVGLDGEKYEFFDPDHKFRKPWLSNNLPL 596
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ---------- 646
+Q TWYKT F P G + V ++L+ MGKG+AWVNG+SIGRYW S+L +
Sbjct: 597 NQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVNGKSIGRYWPSYLATENGCSSSCDYR 656
Query: 647 ------------GTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVSVTTLCG 693
G P+Q WYHIPRS++ N L+L EE G P I I T V +C
Sbjct: 657 GAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTLILFEEFGGMPLNIEIKTTRVKKVCA 716
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCE 753
V K+++ C R + +I+F +GNP GNC
Sbjct: 717 KVDLG--------------------------SKLELTCHD-RTVKRIIFVGFGNPKGNCN 749
Query: 754 NYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKF 788
N+ GSCHSS + +++EK CL KR C++ V +K
Sbjct: 750 NFHKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKL 784
>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
Length = 809
Score = 589 bits (1519), Expect = e-165, Method: Compositional matrix adjust.
Identities = 337/771 (43%), Positives = 441/771 (57%), Gaps = 109/771 (14%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTP--------------------------QMWPR 63
VTYD ++++I+G R+ILFSGSIHYPRSTP +MW
Sbjct: 27 VTYDKKAVLIDGQRRILFSGSIHYPRSTPDVTAFYKISSPPTIPWRGLWLRIYGSEMWEG 86
Query: 64 LIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIE 123
LI KAK+GGLDV+QT VFWN HEP PG + G++ F E
Sbjct: 87 LIQKAKDGGLDVIQTYVFWNGHEPTPGN----------------DSDGIFFRFEQYYFEE 130
Query: 124 GEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ- 182
G P WL VPGI FR+DNEPFK M+ + IV MMK+ L+ASQGGPIILSQ
Sbjct: 131 S-----GFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPIILSQA 185
Query: 183 --------IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINAC 234
IENEYG F G Y+ WAAK+AV L TGVPWVMCK++DAPDPVINAC
Sbjct: 186 SIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGTGVPWVMCKEEDAPDPVINAC 245
Query: 235 NGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYV 294
NG C + F+ PN P KP +WTE W+ ++ +G R R ED+A+ VA F+ K GS++
Sbjct: 246 NGFYC-DAFS-PNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQK-GGSFI 302
Query: 295 NYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLS 353
NYYMYHGGTNFGRTA +T YD AP+DEYGL+R+PK HLKELH AVKLC + ++S
Sbjct: 303 NYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVS 362
Query: 354 GVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKT 413
+QEA +FQ S CAAFL N + + A V F+N Y LPP SISILPDCK
Sbjct: 363 VDPAITTLGTMQEARVFQSPSGCAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKN 422
Query: 414 VAFNTAKLD-------------SVEQWEEYKEAIPTYDETSLRANF-LLEQMNTTKDASD 459
V FN+A + S WE Y E + + L LLEQ+N T+D+SD
Sbjct: 423 VVFNSATVGVQTSQMQMWGDGASSMTWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSD 482
Query: 460 YLWYNFRFKHDPSDSESVLK---------VSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
YLWY D S SE+ L+ V S GH LH F+NG+ GSA+G D+
Sbjct: 483 YLWYITSV--DISSSENFLQGGGKPLSLSVQSAGHALHVFVNGQLQGSAYGTREDRRIKY 540
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKE-LKDFSSFSWGYQ 568
L GTN ++LLSV GLP+ G + E G+ V + G E +D + +W YQ
Sbjct: 541 NGNASLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLDEGSRDLTWQTWSYQ 600
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYG--SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKG 626
VGL GE++ + + GS V W + + QPL WY+ F+ P+G +P+A+++ SMGKG
Sbjct: 601 VGLKGEQMNLNSIEGSSSVEWMQGSLIAQNQQPLAWYRAYFETPSGDEPLALDMGSMGKG 660
Query: 627 EAWVNGQSIGRYWV--------------SFLTPQ-----GTPSQSWYHIPRSFLKPTGNL 667
+ W+NGQSIGRYW +F P+ G P+Q WYH+P+S+L+PT NL
Sbjct: 661 QIWINGQSIGRYWTAYADGDCKECSYTGTFRAPKCQSGCGQPTQRWYHVPKSWLQPTRNL 720
Query: 668 LVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHK 718
LV+ EE G I++ SV+++C VS+ H P + +W+ ++ + H+
Sbjct: 721 LVVFEELGGDSSKIALVKRSVSSVCADVSEDH-PNIKNWQIESYGEREYHR 770
>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
gi|223947135|gb|ACN27651.1| unknown [Zea mays]
gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
Length = 822
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 351/866 (40%), Positives = 472/866 (54%), Gaps = 103/866 (11%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M Q L L + +T + + VTY+ R+L+I+G R+I+ SGSIHYPRSTPQM
Sbjct: 1 MTALQFLLLALVAVTQVASA-------TTVTYNDRALVIDGQRRIILSGSIHYPRSTPQM 53
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI KAKEGGL+ ++T VFWN HEP+ Q++F G D++RF KE+Q G++ LRIGP
Sbjct: 54 WPDLINKAKEGGLNTIETYVFWNGHEPRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIGP 113
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
+I GEW YGGLP WL D+PG+ FR N PF+ M+ + T+IVN MK ++A QGGPIIL
Sbjct: 114 YICGEWNYGGLPAWLRDIPGMQFRLHNAPFEREMETFTTLIVNKMKDVNMFAGQGGPIIL 173
Query: 181 SQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGR 237
+QIENEYG M + + Y+ W A +A + GVPW+MC+QD D P VIN CNG
Sbjct: 174 AQIENEYGNIMGQLKNNQSASQYIHWCADMANKQEVGVPWIMCQQDNDVPHNVINTCNGF 233
Query: 238 QCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYY 297
C + F PN P IWTENWT +++ + RSAEDIA+ VA+F K +GS NYY
Sbjct: 234 YCHDWF--PNRTGIPKIWTENWTGWFKAWDKPDFHRSAEDIAFAVAMFFQK-RGSVHNYY 290
Query: 298 MYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVL 356
MYHGGTNFGRT+ Y+ T Y APLDEYG +RQPK+GHLK+LH ++ K ++ G
Sbjct: 291 MYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILVHGKY 350
Query: 357 VSMNFSK--LQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTV 414
++ K +++ GSS C F+ N+ + V + +P S+SILP+CKTV
Sbjct: 351 NDTSYGKNVTVTKYMYGGSSVC--FINNQFVDRDMKVTLGGETHLVPAWSVSILPNCKTV 408
Query: 415 AFNTAKL-----------DSVEQ------WEEYKEAIP---TYDETSLRANFLLEQMNTT 454
A+NTAK+ +SVE+ W E + T S R + LLEQ+ T+
Sbjct: 409 AYNTAKIKTQTSVMVKKANSVEKEPETMRWSWMPENLKPFMTDHRGSFRQSQLLEQIATS 468
Query: 455 KDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMV 514
D SDYLWY +H S + L V++ GH ++AF+NG VG H F L+ V
Sbjct: 469 TDQSDYLWYRTSLEHKGEGSYT-LYVNTSGHEMYAFVNGRLVGQNHSADGAFVFQLQSPV 527
Query: 515 HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN--VSIQGAKELK-DFSSFSWGYQVGL 571
L +G N VSLLS VGL + G E AG+ V + G D + SW Y+ GL
Sbjct: 528 KLHSGKNYVSLLSGTVGLKNYGPSFELVPAGIAGGPVKLVGTNGTAIDLTKSSWSYKSGL 587
Query: 572 LGEKLQIFTDYGSRIVPWSRYGSS--THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAW 629
GE QI D W + + ++P TWYKT F+AP G + V ++L+ + KG AW
Sbjct: 588 AGELRQIHLDKPG--YKWQSHNGTIPVNRPFTWYKTTFEAPAGEEAVVVDLLGLNKGVAW 645
Query: 630 VNGQSIGRYWVSF--------------------------LTPQGTPSQSWYHIPRSFLKP 663
VNG S+GRYW S+ LT G P+Q +YH+PRSFL+
Sbjct: 646 VNGNSLGRYWPSYTAAEMPGCHVCDYRGKFIAEGDGIRCLTGCGEPAQRFYHVPRSFLRA 705
Query: 664 -TGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPG 722
N L+L EE G P + TV+V +C V+ L
Sbjct: 706 GEPNTLILFEEAGGDPTRAAFHTVAVGPVC--VAAVELG--------------------- 742
Query: 723 RRPKVQIRCPS-GRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTV 781
V + C GR ++ + AS+G G+C Y G C S + AC+G+ SCTV
Sbjct: 743 --DDVTLSCGGHGRVVASVDVASFGVARGSCGAYK-GGCESKAALKAFTDACVGRESCTV 799
Query: 782 PVWTEKFYGDPCPGIPKALLVDAQCT 807
+T F G C AL V A C+
Sbjct: 800 K-YTAAFAGAGCQ--SGALTVQATCS 822
>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 830
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 349/843 (41%), Positives = 458/843 (54%), Gaps = 98/843 (11%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V YD R+L+I+G R++L SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN HE
Sbjct: 23 GTEVGYDDRALVIDGERRLLISGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNGHE 82
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ Q++F G D+VRF KEVQ G+Y LRIGP+I GEW YGGLP WL D+ G+ FR
Sbjct: 83 PRRRQYNFEGSYDIVRFFKEVQDAGMYAILRIGPYICGEWNYGGLPAWLRDISGMQFRMH 142
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRW 204
N PF+ M+ + T+IV+ +K A+++A QGGPIILSQIENEYG M + + E Y+ W
Sbjct: 143 NHPFEQEMETFTTLIVDKLKEAKMFAGQGGPIILSQIENEYGNIMGKLNNNESASEYIHW 202
Query: 205 AAKLAVDLQTGVPWVMCKQ-DDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
A +A GVPW+MC+Q DD P VIN NG C + F P D P IWTENWT ++
Sbjct: 203 CAAMANKQNVGVPWIMCQQDDDVPSNVINTWNGFYCHDWF--PKRTDIPKIWTENWTGWF 260
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAP 322
+ + RSAEDIA+ VA+F + +GS NYYMYHGGTNFGRT+ Y+ T Y AP
Sbjct: 261 KAWDKPDFHRSAEDIAFSVAMFF-QTRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAP 319
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM---NFSKLQEAFIFQGSSECAAF 379
LDEYG +RQPK+GHLK+LH+ +K K +L G N + + SS C F
Sbjct: 320 LDEYGNIRQPKYGHLKDLHNVLKSMEKILLHGDYKDTTMGNTNVTVTKYTLDNSSAC--F 377
Query: 380 LVNK--DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ----------- 426
+ NK DK N T+ + + +P S+SILPDCKTVA+N+AK+ +
Sbjct: 378 ISNKFDDKEVNVTL-DNGATHTVPAWSVSILPDCKTVAYNSAKIKTQTSVMVKRPGAETV 436
Query: 427 -----WE---EYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVL 478
W E + T ++ + R N LLEQ+ T+ D SDYLWY F+H +S L
Sbjct: 437 TDGLAWSWMPENLQPFMTDEKGNFRKNELLEQIATSGDQSDYLWYRTSFEH-KGESNYKL 495
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
V++ GH L+AF+NG+ VG + + +F +E V L +G N +SLLS +GL + GA
Sbjct: 496 HVNTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPVKLHSGKNYISLLSATIGLKNYGAL 555
Query: 539 LERRVAGL-----RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
E AG+ + V D S+ SW Y+ GL GE + D + WS
Sbjct: 556 FEMMPAGIVGGPVKLVDTVTNTTAYDLSNSSWSYKAGLAGEYRETHLDKANDRSQWSGGL 615
Query: 594 SST---HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF-------- 642
+ T H+P TWYK F+AP G +PV +L+ +GKG WVNG ++GRYW S+
Sbjct: 616 NGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKGVVWVNGNNLGRYWPSYVAADMDGC 675
Query: 643 ------------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISI 683
LT PSQ +YH+PRSF+K N +VL EE G P +S
Sbjct: 676 QRCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSFIKAGEPNTMVLFEEAGGDPTRVSF 735
Query: 684 DTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFA 743
TV+V +V + C GR IS + A
Sbjct: 736 HTVAVGA-------------------------ACAEAAEVGDEVALACSHGRTISSVDVA 770
Query: 744 SYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVD 803
S G G C Y G C S + A AC+GK SCTV + G C L V
Sbjct: 771 SLGVARGKCGAYQ-GGCESKAALAAFTAACVGKESCTVRHTEDFRAGSGCD--SGVLTVQ 827
Query: 804 AQC 806
A C
Sbjct: 828 ATC 830
>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
Length = 837
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 319/721 (44%), Positives = 417/721 (57%), Gaps = 58/721 (8%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T +FWN H
Sbjct: 27 GCTSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGH 86
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP Q++F G D+VRF KE+Q G+Y LRIGP+I GEW YGGLP WL D+PG+ FR
Sbjct: 87 EPHRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRL 146
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVR 203
NEPF+ M+ + T+IVN MK ++++A QGGPIIL+QIENEYG M + + + Y+
Sbjct: 147 HNEPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIH 206
Query: 204 WAAKLAVDLQTGVPWVMCKQ-DDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
W A +A GVPW+MC+Q DD P V+N CNG C + F PN P IWTENWT +
Sbjct: 207 WCADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWTGW 264
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQA 321
++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y A
Sbjct: 265 FKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDA 323
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
PLDEYG LRQPK+GHLKELHS +K K ++ G N+ + S A F+
Sbjct: 324 PLDEYGNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSACFIN 383
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-----------DSVEQ---- 426
N+ + V + LP S+SILPDCKTVAFN+AK+ ++ EQ
Sbjct: 384 NRFDDKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQES 443
Query: 427 --WEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVS 481
W E + T ++ + R N LLEQ+ T+ D SDYLWY H S L V+
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHKGEGSYK-LYVN 502
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
+ GH L+AF+NG+ +G H D F LE V L +G N +SLLS VGL + G E+
Sbjct: 503 TTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYGPSFEK 562
Query: 542 RVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
G+ I D S+ SW Y+ GL E QI D + ++
Sbjct: 563 MPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYKWNGNNGTIPINR 622
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF---------------- 642
P TWYK F+AP+G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 623 PFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHRCDYRGA 682
Query: 643 ----------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVSVTTL 691
LT G PSQ +YH+PRSFL N L+L EE G P G+++ TV +
Sbjct: 683 FQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVALRTVVPGPV 742
Query: 692 C 692
C
Sbjct: 743 C 743
>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
Precursor
gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 808
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 343/836 (41%), Positives = 455/836 (54%), Gaps = 117/836 (13%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YDGRSLI++G R+I+ SGSIHYPRSTP+MWP LI KAKEGGL+ ++T VFWN HEP+
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
+F+F G D+VRF KE+Q G+Y LRIGP+I GEW YGGLP WL D+PGI FR N+P
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAK 207
F+ M+ + T+IV MK A ++A QGGPIIL+QIENEYG M++ ++ Y+ W A
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHEYIHWCAD 210
Query: 208 LAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
+A GVPW+MC+QD D P V+N CNG C E F+ N P +WTENWT +Y+ +
Sbjct: 211 MANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFS--NRTSIPKMWTENWTGWYRDW 268
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDE 325
R EDIA+ VA+F +M+GS NYYMYHGGTNFGRTA Y+ T Y APLDE
Sbjct: 269 DQPEFRRPTEDIAFAVAMFF-QMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDE 327
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
YG LRQPK+GHLKELHS + K +L G + N+ + ++ A F+ N+
Sbjct: 328 YGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATSACFINNRFD 387
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VEQWEEYKE-- 432
+ V + LP S+SILP+CKTVAFN+AK+ + VEQ E+ +
Sbjct: 388 DRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTTVMVNKTSMVEQQTEHFKWS 447
Query: 433 -------AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGH 485
T ++ + R N LLEQ+ TT D SDYLWY +H + VL V++ GH
Sbjct: 448 WMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLEHK-GEGSYVLYVNTTGH 506
Query: 486 VLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
L+AF+NG+ VG + + + +F L+ P+ G E AG
Sbjct: 507 ELYAFVNGKLVGQQYSPNENFTFQLKS--------------------PNYGGSFELLPAG 546
Query: 546 LRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS--THQPL 600
+ I + D S+ SW Y+ GL GE +I+ D W + S+ ++P
Sbjct: 547 IVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDKPGN--KWRSHNSTIPINRPF 604
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------ 642
TWYKT F AP G D V ++L + KG AWVNG S+GRYW S+
Sbjct: 605 TWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCHHCDYRGVFK 664
Query: 643 --------LTPQGTPSQSWYHIPRSFL-KPTGNLLVLLEEENGYPPGISIDTVSVTTLC- 692
LT G PSQ YH+PRSFL K N L+L EE G P +++ TV ++C
Sbjct: 665 AEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVRTVVEGSVCA 724
Query: 693 -GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPS-GRKISKILFASYGNPNG 750
V D+ V + C + GR IS + AS+G G
Sbjct: 725 SAEVGDT----------------------------VTLSCGAHGRTISSVDVASFGVARG 756
Query: 751 NCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C +Y G C S + AC+GK SCTV V T+ F C + L V A C
Sbjct: 757 RCGSYD-GGCESKVAYDAFAAACVGKESCTVLV-TDAFANAGC--VSGVLTVQATC 808
>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
Length = 830
Score = 577 bits (1486), Expect = e-161, Method: Compositional matrix adjust.
Identities = 348/838 (41%), Positives = 465/838 (55%), Gaps = 100/838 (11%)
Query: 32 YDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQ 91
Y+ R+++I+G R+I+ SGSIHYPRSTPQMWP LI KAKEGGL+ ++T VFWN HEP+ Q
Sbjct: 30 YNDRAVVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHEPRRRQ 89
Query: 92 FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFK 151
++F G D+VRF KE+Q G++ LRIGP+I GEW YGGLP WL D+PG+ FR N+PF+
Sbjct: 90 YNFEGNYDIVRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNDPFE 149
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLA 209
M+ + T+IVN MK A ++A QGGPIIL+QIENEYG M + + Y+ W A +A
Sbjct: 150 REMETFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGKLENNQSASQYIHWCADMA 209
Query: 210 VDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
+ GVPW+MC+QD D P VIN CNG C + F PN P IWTENWT +++ +
Sbjct: 210 NKQKIGVPWIMCQQDNDVPHNVINTCNGFYCYDWF--PNRTGIPKIWTENWTGWFKAWDK 267
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y APLDEYG
Sbjct: 268 PDFHRSAEDIAFAVAMFFQK-RGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYG 326
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK--LQEAFIFQGSSECAAFLVNK-- 383
+RQPK+GHLK+LH+ +K K ++ G + K + + GSS C F+ N+
Sbjct: 327 NIRQPKYGHLKDLHNLLKSMEKILVHGEYKDTSHGKNVTVTKYTYGGSSVC--FISNQFD 384
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-----------DSVEQ------ 426
D+ N T+ ++L +P S+SILPDCKTVA+NTAK+ +SVE+
Sbjct: 385 DRDVNVTLAGTHL---VPAWSVSILPDCKTVAYNTAKIKTQTSVMVKKANSVEKEPEALR 441
Query: 427 WEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSL 483
W E + T D S R + LLEQ+ T+ D SDYLWY +H S + L V++
Sbjct: 442 WSWMPENLKPFMTDDHGSFRQSRLLEQIATSTDQSDYLWYRTSLEHKGEGSYT-LYVNTT 500
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH ++AF+NG+ VG + F L+ V L +G N VSLLS VGL + G E
Sbjct: 501 GHKIYAFVNGKLVGQNQSSNGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGPLFELVP 560
Query: 544 AGLRN--VSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS----T 596
AG+ V + GA + D + SW Y+ GL GE QI D W + S
Sbjct: 561 AGIAGGPVKLVGANDTAIDLTHSSWSYKSGLAGEHRQIHLDKPG--YKWRSHNGSGSIPV 618
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF-------------- 642
++P TWYKT F AP G + V ++L+ + KG AWVNG S+GRYW S+
Sbjct: 619 NRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAAWVNGNSLGRYWPSYTAAEMGGCHGACDY 678
Query: 643 -------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVSV 688
LT G PSQ +YH+PRSFL+ N LVL EE G P + TV+V
Sbjct: 679 RGKFKAEGDGIRCLTGCGEPSQRFYHVPRSFLRAGEPNTLVLFEEAGGDPARAAFHTVAV 738
Query: 689 TTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNP 748
+C ++ +S + ++ + AS+G
Sbjct: 739 GHVCVAAAEVGDDVTLSCGGGLGGGV----------------------VASVDVASFGVT 776
Query: 749 NGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G C +Y G C S + AC+G+ SCTV +T F G C L V A C
Sbjct: 777 RGGCGDYQ-GGCESKAALKAFRDACVGRESCTVK-YTPAFAGPGCQ--SGKLTVQATC 830
>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
Length = 811
Score = 573 bits (1478), Expect = e-160, Method: Compositional matrix adjust.
Identities = 342/843 (40%), Positives = 445/843 (52%), Gaps = 116/843 (13%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G GG VTY+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 25 GVGGTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 84
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D+VRF KE+Q GLY LRIGP+I GEW YGGLP WL D+PG+ F
Sbjct: 85 GHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQF 144
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+IVN MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 145 RLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEY 204
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 205 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 262
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ 320
+++ + RSAEDIA+ VA+F K G Y+ T Y
Sbjct: 263 GWFKAWDKPDFHRSAEDIAFAVAMFFQKRGGPYIT-------------------TSYDYD 303
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFL 380
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F+
Sbjct: 304 APLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSACFI 363
Query: 381 VNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE---- 425
N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 364 NNRNDNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKAKMVEKEPE 423
Query: 426 --QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKV 480
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L V
Sbjct: 424 SLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHK-GEASYTLFV 482
Query: 481 SSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLE 540
++ GH L+AF+NG VG H + F LE L +G N +SLLS +GL + G E
Sbjct: 483 NTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFE 542
Query: 541 RRVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST- 596
+ AG+ I + D S+ SW Y+ GL GE QI D W +
Sbjct: 543 KMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPG--CTWDNNNGTVP 600
Query: 597 -HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------- 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 601 INKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAARSMRRLPTTA 660
Query: 643 ---------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTV 686
LT G PSQ +YH+PRSFLK N ++L EE G P +S TV
Sbjct: 661 HYRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTV 720
Query: 687 SVTTLC--GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFA 743
+ ++C V D+ + + C K IS I
Sbjct: 721 AAGSVCASAEVGDT----------------------------ITLSCGQHSKTISAINVT 752
Query: 744 SYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVD 803
S+G G C Y G C S + +ACLGK SCTV + T G C + L V
Sbjct: 753 SFGVARGQCGAYK-GGCESKAAYKAFTEACLGKESCTVQI-TNAVTGSGC--LSNVLTVQ 808
Query: 804 AQC 806
A C
Sbjct: 809 ASC 811
>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
Flags: Precursor
gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
Length = 809
Score = 573 bits (1478), Expect = e-160, Method: Compositional matrix adjust.
Identities = 342/841 (40%), Positives = 445/841 (52%), Gaps = 114/841 (13%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G GG VTY+ RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T VFWN
Sbjct: 25 GVGGTTVTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWN 84
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
HEP Q++F G D+VRF KE+Q GLY LRIGP+I GEW YGGLP WL D+PG+ F
Sbjct: 85 GHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQF 144
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPY 201
R N PF+ M+ + T+IVN MK A ++A QGGPIIL+QIENEYG M + + + Y
Sbjct: 145 RLHNAPFENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEY 204
Query: 202 VRWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W A +A GVPW+MC+QD D P V+N CNG C + F PN P IWTENWT
Sbjct: 205 IHWCADMANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWT 262
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ 320
+++ + RSAEDIA+ VA+F K G Y+ T Y
Sbjct: 263 GWFKAWDKPDFHRSAEDIAFAVAMFFQKRGGPYIT-------------------TSYDYD 303
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFL 380
APLDEYG LRQPK+GHLK+LHS +K K ++ G V N+S + S A F+
Sbjct: 304 APLDEYGNLRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTSACFI 363
Query: 381 VNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-----------VE---- 425
N++ + V + LP S+SILPDCKTVAFN+AK+ + VE
Sbjct: 364 NNRNDNMDVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTTVMVNKAKMVEKEPE 423
Query: 426 --QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKV 480
+W +E + T ++ S R N LLEQ+ T+ D SDYLWY H ++ L V
Sbjct: 424 SLKWSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHK-GEASYTLFV 482
Query: 481 SSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLE 540
++ GH L+AF+NG VG H + F LE L +G N +SLLS +GL + G E
Sbjct: 483 NTTGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFE 542
Query: 541 RRVAGLRNVS---IQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST- 596
+ AG+ I + D S+ SW Y+ GL GE QI D W +
Sbjct: 543 KMPAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDKPG--CTWDNNNGTVP 600
Query: 597 -HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------- 642
++P TWYKT F AP G D V ++L+ + KG AWVNG ++GRYW S+
Sbjct: 601 INKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 660
Query: 643 -------------LTPQGTPSQSWYHIPRSFLKP-TGNLLVLLEEENGYPPGISIDTVSV 688
LT G PSQ +YH+PRSFLK N ++L EE G P +S TV+
Sbjct: 661 RGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTVAA 720
Query: 689 TTLC--GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFASY 745
++C V D+ + + C K IS I S+
Sbjct: 721 GSVCASAEVGDT----------------------------ITLSCGQHSKTISAINVTSF 752
Query: 746 GNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQ 805
G G C Y G C S + +ACLGK SCTV + T G C + L V A
Sbjct: 753 GVARGQCGAYK-GGCESKAAYKAFTEACLGKESCTVQI-TNAVTGSGC--LSNVLTVQAS 808
Query: 806 C 806
C
Sbjct: 809 C 809
>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
Length = 771
Score = 573 bits (1476), Expect = e-160, Method: Compositional matrix adjust.
Identities = 318/724 (43%), Positives = 417/724 (57%), Gaps = 90/724 (12%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
L S SIHYPRS P MWP LI AKEGG+DV++T VFWN HE PG + F GR DLV+F K
Sbjct: 1 LISASIHYPRSVP-MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAK 59
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGG---------------------------------LP 132
VQ G+Y+ LRIGPF+ EW +GG +P
Sbjct: 60 VVQDAGMYLILRIGPFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVP 119
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WLH +PG VFR+ N+PF HM+++ T IVN+MK +L+ASQGGPIILSQIENEYG E+
Sbjct: 120 VWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYEN 179
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKP 252
+ E G Y WAAK+AV T VPW+MC+Q DAPDPVI+ CN C + P SP +P
Sbjct: 180 YYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQ--FTPTSPKRP 237
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY 312
+WTENW +++ +G R ED+A+ VA F K GS NYYMYHGGTNFGRTA
Sbjct: 238 KMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQK-GGSLNNYYMYHGGTNFGRTAGGP 296
Query: 313 VLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ 371
+T YD AP+DEYGL R PKWGHLKELH A+KLC +L G V+++ EA I+
Sbjct: 297 FITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYT 356
Query: 372 GSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS------- 423
SS CAAF+ N D +N+ V F N Y LP S+SILPDCK V FNTAK+ S
Sbjct: 357 DSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAM 416
Query: 424 -------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+W+ +KE + + N ++ +NTTKD +DYLW+ D
Sbjct: 417 IPEHLQQSDKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILID 476
Query: 471 PSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
++ S+ L + S GH LHAF+N ++ G+ G S +FT + + L G N ++
Sbjct: 477 ANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIA 536
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYG 583
+LS+ VGL +G + + AG+ +V I G D SS +W Y++G+LGE L I+ G
Sbjct: 537 ILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGEG 596
Query: 584 SRIVPWSRYGSSTH-QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF 642
V W+ Q LTWYK + DAP+G +PV ++++ MGKG AW+NG+ IGRYW
Sbjct: 597 MNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWPRI 656
Query: 643 L-----------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
T G PSQ WYH+PRS+ KP+GN+LV+ EE+ G P
Sbjct: 657 SEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGDPT 716
Query: 680 GISI 683
I+
Sbjct: 717 KITF 720
>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
Length = 650
Score = 569 bits (1467), Expect = e-159, Method: Compositional matrix adjust.
Identities = 317/678 (46%), Positives = 399/678 (58%), Gaps = 78/678 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD ++++++G R+IL SGSIHYPRSTPQMWP LI KAK+GGLDV+QT VFWN HEP
Sbjct: 24 SVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+ F R DLV+F+K Q GLYV LRIGP+I EW GG P WL VPGI FR+DNE
Sbjct: 84 PGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRTDNE 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+++ IV++MK RL+ SQGGPIILSQIENEYG VE G Y +WAA++
Sbjct: 144 PFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWAAQM 203
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQ+DAPDPVI+ CNG C E F PN KP +WTENWT +Y +G
Sbjct: 204 AVGLDTGVPWVMCKQEDAPDPVIDTCNGFYC-ENFK-PNKNTKPKMWTENWTGWYTDFGG 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED+A+ VA FI + GS+VNYYMYHGGTNFGRT+ + YD APLDEYG
Sbjct: 262 AVPRRPAEDLAFSVARFI-QNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRN 387
L +PK+ HL+ LH A+K +++ + EA +F CAAF+ N D ++
Sbjct: 321 LENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVFSAPGACAAFIANYDTKS 380
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAK-----------LDSVEQWEEYKEAIPT 436
A F N Y+LPP SISILPDCKTV +NTAK ++S W+ Y E +
Sbjct: 381 YAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKVGYGWLKKMTPVNSAFAWQSYNEEPAS 440
Query: 437 YDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHA 489
+ S+ A L EQ+N T+D+SDYLWY + ++ +L V S GHVLH
Sbjct: 441 SSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSPLLTVMSAGHVLHV 500
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRN 548
FING+ G+ G + T V L G N +SLLSV VGLP+ G + E AG L
Sbjct: 501 FINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVGVHFETWNAGVLGP 560
Query: 549 VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKT 605
V+++G E +D S W Y+VGL GE L + T+ GS V W + GS + QPLTWY
Sbjct: 561 VTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQ-GSLVAKKQPLTWY-- 617
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG 665
H+PRS+L G
Sbjct: 618 -------------------------------------------------HVPRSWLSSGG 628
Query: 666 NLLVLLEEENGYPPGISI 683
N LV+ EE G P GI++
Sbjct: 629 NSLVVFEEWGGDPNGIAL 646
>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 643
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 308/643 (47%), Positives = 403/643 (62%), Gaps = 50/643 (7%)
Query: 92 FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFK 151
++F R DLVRF+K V GLYV LRIGP++ EW +GG P WL VPGI FR+DN PFK
Sbjct: 6 YNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFK 65
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVD 211
M+++ IV +MK +LY SQGGPIILSQIENEYG VE G Y +WAA++A+
Sbjct: 66 AAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQMALG 125
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEAR 271
L TGVPWVMCKQDDAPDPVI+ CNG C E F PN KP +WTE WT ++ +G A
Sbjct: 126 LDTGVPWVMCKQDDAPDPVIDTCNGFYC-ENFK-PNKVYKPKMWTEAWTGWFTEFGGPAP 183
Query: 272 IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLR 330
R ED+AY VA FI + GS++NYYMYHGGTNFGRTA ++ T Y AP+DEYGLLR
Sbjct: 184 YRPVEDMAYSVARFI-QNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 242
Query: 331 QPKWGHLKELHSAVKLCLKPMLSGVLVSMNF-SKLQEAFIFQG-SSECAAFLVNKDKRNN 388
+PKW HL++LH A+KLC +P L V ++++ QEA +F+ S CAAFL N D ++
Sbjct: 243 EPKWSHLRDLHKAIKLC-EPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASSS 301
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-----------SVEQWEEYKEAIPT- 436
ATV F N Y+LPP S+SILPDCK+V FNTAK+ S W Y E +
Sbjct: 302 ATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKVGAPTSQPKMTPVSSFSWLSYNEETASA 361
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLHAF 490
Y E + L+EQ++ T+D++DYLWY + DP S +L V S GH LH F
Sbjct: 362 YTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLLTVFSAGHALHVF 421
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
ING+ G+ +G + T K V+L G N +S+LSV VGLP+ G + E G L V
Sbjct: 422 INGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHYETWNTGVLGPV 481
Query: 550 SIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKTV 606
+++G E +D S + W Y++GL GE L + + GS V W GS + QPLTWYKT
Sbjct: 482 TLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVT-GSLVAQKQPLTWYKTT 540
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------- 646
FD+P G++P+A+++ SMGKG+ W+NGQSIGR+W ++
Sbjct: 541 FDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTAKGSCGKCNYGGIFNEKKCHSNC 600
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVT 689
G PSQ WYH+PR++LK +GN+LV+ EE G P GIS+ S++
Sbjct: 601 GEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRSIS 643
>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
Length = 636
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 291/597 (48%), Positives = 374/597 (62%), Gaps = 26/597 (4%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD +++IING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP P
Sbjct: 29 VTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLV+FIK VQ GLYV LRIGP++ EW +GG P WL VPG+VFR+DNEP
Sbjct: 89 GQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV MMK +L+ +QGGPIILSQIENEYG +E G Y +W A++A
Sbjct: 149 FKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWVAEMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
L TGVPW+MCKQDDAP+ +IN CNG C E F PNS +KP +WTENWT ++ +G
Sbjct: 209 QGLSTGVPWIMCKQDDAPNSIINTCNGFYC-ENFK-PNSDNKPKMWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R AEDIA VA FI + GS++NYYMYHGGTNF RTA ++ T Y APLDEYGL
Sbjct: 267 VPYRPAEDIALSVARFI-QNGGSFINYYMYHGGTNFDRTAGEFIATSYDYDAPLDEYGLP 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLK LH +KLC ++S + QEA +F+ S CAAFL N + + A
Sbjct: 326 REPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVFKSKSSCAAFLSNYNTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPTY 437
V F Y+LPP S+SILPDCKT +NTAK+ + W Y E IP+
Sbjct: 386 RVLFGGSTYDLPPWSVSILPDCKTEYYNTAKVRTSSIHMKMVPTNTPFSWGSYNEEIPSA 445
Query: 438 -DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESVLKVSSLGHVLHAFI 491
D + + L+EQ++ T+D +DY WY P + + +L + S GH LH F+
Sbjct: 446 NDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPLLTIGSAGHALHVFV 505
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVS 550
NG+ G+A+G T + + L G N ++LLS GLP+ G + E G L V+
Sbjct: 506 NGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGVHYETWNTGVLGPVT 565
Query: 551 IQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYK 604
+ G D + + W Y++G GE L + T GS V W + GS + QPLTWYK
Sbjct: 566 LNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLAGSSTVEW-KEGSLVAKKQPLTWYK 621
>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 851
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 326/842 (38%), Positives = 461/842 (54%), Gaps = 104/842 (12%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+++G R++L +G IHYPRSTP+MWP L A+AK GLDV+QT +FW++++P
Sbjct: 49 NVTYDSRALLLDGQRRLLIAGCIHYPRSTPEMWPELFARAKANGLDVIQTYLFWDVNQPT 108
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+F + R D VRFIK Q GL V RIGP++ EW YGG P WL + GIVFR +++
Sbjct: 109 PGEFVMTDRFDYVRFIKLAQQAGLMVNFRIGPYVCAEWNYGGFPAWLRQISGIVFRDNDK 168
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+ + Y T V ++K +L A+ GGP+IL QIENEYG +E S+ GP YV+W +L
Sbjct: 169 PWLDVVGPYITKTVQVLKDNKLLAADGGPVILLQIENEYGNIEDSY-AGGPAYVQWCGQL 227
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A L G W+MC+QDDAP I CNG C +P +WTENW ++Q +G
Sbjct: 228 AASLNAGAQWIMCQQDDAPANTIATCNGFYCDNYVP---HKGQPMMWTENWPGWFQTWGQ 284
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
+ R A+D+A+ A F AK G+Y++YYMYHGGTNFGRTA +T YD LDEYG
Sbjct: 285 PSPHRPAQDVAFAAARFYAK-GGTYMSYYMYHGGTNFGRTAGGPGITTSYDYDVALDEYG 343
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLS-GVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
+ +PK+ HL LH+ + ++S V ++ K EA +F SS C AFL N D
Sbjct: 344 MPSEPKYSHLGSLHAVLHANEHIIMSMNVPAPISLGKNLEAHVFNSSSGCVAFLSNIDSS 403
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTA--------------------------- 419
+A V F+ +ELP S+SIL +C +NTA
Sbjct: 404 VDAEVQFNGRTFELPAWSVSILHNCAFAIYNTAAVSAPLNARRMTPLVVHEDAVSDAADH 463
Query: 420 -----------KLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFK 468
++ + + Y E I E ++ EQ+NTT D +DYLWY +
Sbjct: 464 RRSLSKGEGQERVGAFSTFASYAETIGRRAEEAVYFTSPQEQINTTNDTTDYLWYTTTYN 523
Query: 469 HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSV 528
+ S+ VL +S++ V++ ++N +FV + S ++ K V L+ GTN + +LS
Sbjct: 524 SASATSQ-VLSISNVNDVVYVYVNRQFVTMSW------SGSVNKAVPLMAGTNVIDVLST 576
Query: 529 MVGLPDSGAYLERRVAGLRNVSIQGAKEL--KDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
GL + G +LE+ G IQG +L D + W +QVGLLGE+L IF +
Sbjct: 577 TFGLQNYGTFLEQVTRG-----IQGTVKLGSTDLTQNGWWHQVGLLGEELGIFLPQNASN 631
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSD-PVAINLISMGKGEAWVNGQSIGRYWVSFLTP 645
VPW+ ++T++ LTWY++ FD P S P+A+++ MGKG WVNG ++GRYW S +
Sbjct: 632 VPWAT-PATTNRGLTWYRSSFDLPQSSQAPLALDMTGMGKGFVWVNGHNLGRYWPSRIAD 690
Query: 646 -------------------QGT--PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
QG PSQ +YH+PR +L+PT NL+V+LEE G P IS+
Sbjct: 691 SMACDDCDYRGAYDDSRCRQGCNIPSQRYYHVPREWLQPTNNLIVMLEEIGGNPALISLV 750
Query: 685 TVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFAS 744
CG V + + P V + C + I ++ FAS
Sbjct: 751 EREEDISCGAVGEDY---------------------PADDLSVVLGCGLHQTIRRVEFAS 789
Query: 745 YGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDA 804
+G P G C +++GSC+++NS AIVE CLG+++C VPV F GDPCP K L V
Sbjct: 790 FGTPVGTCRQFSLGSCNAANSTAIVESLCLGRQACHVPVAINHF-GDPCPDTTKRLFVQV 848
Query: 805 QC 806
C
Sbjct: 849 SC 850
>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
Length = 749
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 310/776 (39%), Positives = 436/776 (56%), Gaps = 96/776 (12%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MWP L KAKEGG+D ++T +FW+ HEP Q+ FSG +D+V+F K Q GL+V LRIG
Sbjct: 1 MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
P++ EW YGG P WLH++PGI R+DNE +K M+ + T IV++ K A+L+A QGGPII
Sbjct: 61 PYVCAEWSYGGFPMWLHNIPGIELRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPII 120
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQC 239
L+QIENEYG V + + G YV W A++AV GVPW+MC+Q +AP P+IN CNG C
Sbjct: 121 LAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFYC 180
Query: 240 GETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMY 299
+ PN+P P +WTENW+ +++++G R+AED+A+ VA FI + G +YYMY
Sbjct: 181 DQ--FKPNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFI-QNGGVLNSYYMY 237
Query: 300 HGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVS 358
HGGTNFGRTA Y+ T Y APLDEYG L QPKWGHLK+LH A+K + + +G + S
Sbjct: 238 HGGTNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTS 297
Query: 359 MNF--SKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAF 416
NF Q + QG+ E FL N + + Y LP S++IL DC +
Sbjct: 298 KNFWGGVDQTTYTNQGTGERFCFLSNTNMEEANVDLGQDGKYSLPAWSVTILQDCNKEIY 357
Query: 417 NTAKLDS-----VEQWEEYKEAIP-------------TYDETSLRANFLLEQMNTTKDAS 458
NTAK+++ V++ E + + + RA LLEQ TT D +
Sbjct: 358 NTAKVNTQTSIMVKKLHEEDKPVQLSWTWAPEPMKGVLQGKGRFRATELLEQKETTVDTT 417
Query: 459 DYLWYNFRFKHDPSD----SESVLKVSSLGHVLHAFINGEFVGSAHGKHS---------D 505
DYLWY + + + L+V + GH LHA++N + +G+ K + D
Sbjct: 418 DYLWYMTSVNLNETTLKKWTNVTLRVGTRGHTLHAYVNKKEIGTQFSKQANAQQSVKGDD 477
Query: 506 KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ---GAKELKDFSS 562
SF EK V L +GTN +SLLS VGL + G Y +++ G+ +Q K D +S
Sbjct: 478 YSFLFEKPVTLTSGTNTISLLSATVGLANYGQYYDKKPVGIAEGPVQLVANGKPFMDLTS 537
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQP----LTWYKTVFDAPTGSDPVAI 618
+ W Y++GL GE + + D S S++ +S + P +TWYKT F +P+G++PV +
Sbjct: 538 YQWSYKIGLSGEAKR-YNDPNSPHA--SKFTASDNLPTGRAMTWYKTTFASPSGTEPVVV 594
Query: 619 NLISMGKGEAWVNGQSIGRYWVS----------------------FLTPQGTPSQSWYHI 656
+L+ MGKG AWVNG+S+GR+W + +T G PSQ WYHI
Sbjct: 595 DLLGMGKGHAWVNGKSLGRFWPTQIADAKGCPDTCDYRGSYNGDKCVTNCGNPSQRWYHI 654
Query: 657 PRSFLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLK 715
PRS+L G N L+L EE G P +S V+V T+CG+ +
Sbjct: 655 PRSYLNKDGQNTLILFEEVGGNPTNVSFQIVAVETICGNAYEGST--------------- 699
Query: 716 THKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEK 771
+++ C GR IS I FASYG+P G C + GS +++ S A+VEK
Sbjct: 700 -----------LELSCEGGRTISDIQFASYGDPEGTCGAFMKGSFYATRSAAVVEK 744
>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 702
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 314/713 (44%), Positives = 422/713 (59%), Gaps = 70/713 (9%)
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G Y+RWAA +AV L
Sbjct: 1 MQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLD 60
Query: 214 TGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ ++ +G R
Sbjct: 61 TGVPWVMCQQSDAPDPLINTCNGFYCDQFT--PNSKSKPKMWTENWSGWFLSFGGAVPYR 118
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQAPLDEYGLLRQP 332
AED+A+ VA F + G++ NYYMYHGGTNFGR T ++ T Y AP+DEYG++RQP
Sbjct: 119 PAEDLAFAVARFYQR-GGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQP 177
Query: 333 KWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS--SECAAFLVNKDKRNNAT 390
KWGHL+++H A+KLC +++ + + EA ++Q + S CAAFL N D +++ T
Sbjct: 178 KWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQSDKT 237
Query: 391 VYFSNLMYELPPLSISILPDCKTVAFNTAKLDS--------------------------- 423
V F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 238 VKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLITPELA 297
Query: 424 VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF--KHDP---SDSESVL 478
W E + E +L L+EQ+NTT DASD+LWY+ K D + S+S L
Sbjct: 298 TAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVKGDEPYLNGSQSNL 357
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
V+SLGHVL +ING+ GSA G S +L+ V L+ G N + LLS VGL + GA+
Sbjct: 358 LVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAF 417
Query: 539 LERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH 597
+ AG+ V + G + SS W YQ+GL GE L ++ + S T+
Sbjct: 418 FDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEASPEWVSDNAYPTN 477
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------- 646
QPL WYKT F AP G DPVAI+ MGKGEAWVNGQSIGRYW + L PQ
Sbjct: 478 QPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRG 537
Query: 647 -----------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHV 695
G PSQ+ YH+PRSFL+P N LVL E+ G P IS T +++C HV
Sbjct: 538 AYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHV 597
Query: 696 SDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNGNCEN 754
S+ H + SW S Q+T +T + P +++ CP G+ IS I FAS+G P+G C N
Sbjct: 598 SEMHPAQIDSWISP-QQTSQT------QGPALRLECPREGQVISNIKFASFGTPSGTCGN 650
Query: 755 YAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
Y G C SS + A+V++AC+G +C+VPV + F GDPC G+ K+L+V+A C+
Sbjct: 651 YNHGECSSSQALAVVQEACVGMTNCSVPVSSNNF-GDPCSGVTKSLVVEAACS 702
>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
Length = 812
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 315/787 (40%), Positives = 449/787 (57%), Gaps = 60/787 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V YD ++I+NG RK++ SG+IHYPRST QMWP LI KAK+G LD ++T +FW+LHEP
Sbjct: 26 VEYDSSAIILNGERKLIISGAIHYPRSTSQMWPDLIMKAKDGDLDAIETYIFWDLHEPVR 85
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
++DFSG D ++F+K Q QGLYV LRIGP++ EW YGG P WLH++PGI R+DN
Sbjct: 86 RKYDFSGNLDFIKFLKIAQEQGLYVVLRIGPYVCAEWNYGGFPMWLHNMPGIQLRTDNAV 145
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK MK + T IV M K A L+A QGGPIIL+QIENEYG V + E G Y++W A++A
Sbjct: 146 FKEEMKIFTTKIVTMCKEAGLFAPQGGPIILAQIENEYGDVISHYGEAGNSYIKWCAEMA 205
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ GVPW+MCKQ +AP +I+ CNG C +TF PN+P P I+TENW ++Q +G+
Sbjct: 206 LAQNIGVPWIMCKQKNAPATIIDTCNGYYC-DTFK-PNNPKSPKIFTENWVGWFQKWGER 263
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGL 328
R+AED A+ VA F + G+ NYY+YHGGTNFGRTA +++T Y APLDEYG
Sbjct: 264 RPHRTAEDSAFSVARFF-QNGGALQNYYLYHGGTNFGRTAGGPFIITTYDYDAPLDEYGN 322
Query: 329 LRQPKWGHLKELHSAVKLCLKPMLSGVLV--SMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
L +PK+GHLK LH+A+KL K + +G S S + +G+ + FL N
Sbjct: 323 LIEPKYGHLKRLHAAIKLGEKVLTNGTATWESHGDSLWMTTYTNKGTGQKFCFLSNSHTS 382
Query: 387 NNATVYFS-NLMYELPPLSISILPDCKTVAFNTAKLDS-----VEQWEEYKEAIPTYDET 440
+A V + Y +P S+S+L DC +NTAK ++ ++Q ++ P + T
Sbjct: 383 KDAEVDLQQDGKYYVPAWSMSLLQDCNKEVYNTAKTEAQTNIYMKQLDQKLGNSPEWSWT 442
Query: 441 S------------LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS--ESVLKVSSLGHV 486
S A+ LL+Q + T ASDYLWY + +++ ++ ++V++ GH+
Sbjct: 443 SDPMEDTFQGKGTFTASQLLDQKSVTVGASDYLWYMTEVVVNDTNTWGKAKVQVNTTGHI 502
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
L+ FING G+ HG S F E + L GTN +SLLSV VG + GA+ + + G+
Sbjct: 503 LYLFINGFLTGTQHGTVSQPGFIHEGNISLNQGTNIISLLSVTVGHANYGAFFDMQETGI 562
Query: 547 -----RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+ SI+ + D S +W Y+VG+ G + + + V W S P+T
Sbjct: 563 VGGPVKLFSIENPNNVLDLSKSTWSYKVGINGMTKKFYDPKTTIGVQWKTNNVSIGVPMT 622
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
WYKT F P G++PV ++LI + KGEAWVNGQSIGRYW + L S +
Sbjct: 623 WYKTTFKTPDGTNPVVLDLIGLQKGEAWVNGQSIGRYWPAMLAENKGCSDT--------- 673
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIP 721
Y + D + CG S S+ + + TL + +
Sbjct: 674 -------------CDYRGEYNAD--KCLSGCGEPSQRFYHVPRSFLNNDVNTLVLFEEMG 718
Query: 722 GRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTV 781
+G+ +S+I FASYG+P G+C ++ IG S S+ +VEKAC+GK+SC++
Sbjct: 719 FDATPF-----NGKTMSEIQFASYGDPEGSCGSFKIGEWESRYSKTVVEKACIGKQSCSI 773
Query: 782 PVWTEKF 788
V + F
Sbjct: 774 NVTSSTF 780
>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
Length = 1078
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 295/680 (43%), Positives = 406/680 (59%), Gaps = 62/680 (9%)
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDL 212
+MK++ T+IVN +K A+L+ASQGGPIIL+QIENEY +E +F E G Y+ WAAK+A+
Sbjct: 425 YMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMAIAT 484
Query: 213 QTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARI 272
TGVPW+MCKQ AP VI CNGR CG+T+ GP KP +WTENWT+ Y+V+GD
Sbjct: 485 NTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQYRVFGDPPSQ 544
Query: 273 RSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQP 332
RSAEDIA+ VA F + + G+ NYYMYHGGTNFGR +A+V+ YYD+APLDE+GL ++P
Sbjct: 545 RSAEDIAFSVARFFS-VGGTMANYYMYHGGTNFGRNGAAFVMPRYYDEAPLDEFGLYKEP 603
Query: 333 KWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKDKRNNAT 390
KWGHL++LH A++ C K +L G KL EA +F+ + C AFL N + + + T
Sbjct: 604 KWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKEDGT 663
Query: 391 VYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ---------------WEEY-KEAI 434
V F Y + SISIL DCKTV F+T ++S WE Y +E I
Sbjct: 664 VTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEMYSEEKI 723
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGE 494
P Y +TS+R LEQ N TKD +DYLWY F+ + D +V +
Sbjct: 724 PRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYRKEVKPV----------- 772
Query: 495 FVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGA 554
G+ G+ S +SFT+EK + L G N+V++LS +GL DSG+YLE R+AG+ V+I+G
Sbjct: 773 LEGAGTGRRSTRSFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHRMAGVYTVTIRGL 832
Query: 555 KE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGS 613
D ++ WG+ G +QPLTWY+ FD P+G+
Sbjct: 833 NTGTLDLTTNGWGHVPG------------------------KDNQPLTWYRRRFDPPSGT 868
Query: 614 DPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEE 673
DPV I+L MGKG +VNG+ +GRYWVS+ G PSQ YH+PRS L+P GN L+ EE
Sbjct: 869 DPVVIDLTPMGKGFLFVNGEGLGRYWVSYHHALGKPSQYLYHVPRSLLRPKGNTLMFFEE 928
Query: 674 ENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWR-----SQNQRTLKTHKRIPGRRPKVQ 728
E G P I I TV +C +++ + P + W SQ + G +P
Sbjct: 929 EGGKPDAIMILTVKRDNICTFMTEKN-PAHVRWSWESKDSQPKAVAGAGAGAGGLKPTAV 987
Query: 729 IRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKF 788
+ CP+ + I ++FASYGNP G C NY +GSCH+ ++ +VEKAC+G+++C++ V +E +
Sbjct: 988 LSCPTKKTIQSVVFASYGNPLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVY 1047
Query: 789 YGD-PCPGIPKALLVDAQCT 807
GD CPG L V A+C+
Sbjct: 1048 GGDVHCPGTTGTLAVQAKCS 1067
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 202/428 (47%), Positives = 265/428 (61%), Gaps = 70/428 (16%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYD RSLII+GHR+I FSGSIHYPRS P WP LI+KAKEGGL+V+++ VFWN HE
Sbjct: 30 GTVITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHE 89
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLH----DVPGIV 142
P+ G ++F GR DL++F K +Q + +Y +RIGPF++ EW +G F H ++P I+
Sbjct: 90 PEQGVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHG---FVCHIGSGEIPDII 146
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
FR++NEPFK +MK++ T+IVN +K A+L+ASQGGPIIL+QIENEY +E +F E G Y+
Sbjct: 147 FRTNNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYI 206
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
WAAK+A+ TGVPW+MCKQ AP VI CNGR CG+T+ GP KP +WTENWT+
Sbjct: 207 NWAAKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHCGDTWPGPADKKKPLLWTENWTAQ 266
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM------------------------ 298
Y+V+GD RSAEDIA+ VA F + + G+ NYYM
Sbjct: 267 YRVFGDPPSQRSAEDIAFSVARFFS-VGGTMANYYMVVLNSNSNLFLTKKRDEISDRTDT 325
Query: 299 ----------YHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCL 348
YHGGTNFGR +A+V+ YYD+APLDE+GL ++PKWGHL++LH A++ C
Sbjct: 326 GGFTCVNNQQYHGGTNFGRNGAAFVMPRYYDEAPLDEFGLYKEPKWGHLRDLHHALRHCK 385
Query: 349 KPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISIL 408
K +L G KL Y + SISIL
Sbjct: 386 KALLWGNPSVQPLGKLTRG----------------------------QKYFVARRSISIL 417
Query: 409 PDCKTVAF 416
DCKTV +
Sbjct: 418 ADCKTVKY 425
>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
Length = 641
Score = 553 bits (1425), Expect = e-154, Method: Compositional matrix adjust.
Identities = 290/621 (46%), Positives = 383/621 (61%), Gaps = 37/621 (5%)
Query: 22 GGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVF 81
GG NVTYD R+L+I+G R++L SGSIHYPRSTP MWP LI KAK+GGLDV++T VF
Sbjct: 22 AGGARAANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVF 81
Query: 82 WNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGI 141
W++HEP GQ+DF GR+DL F+K V GLYV LRIGP++ EW YGG P WLH +PGI
Sbjct: 82 WDIHEPVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGI 141
Query: 142 VFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPY 201
FR+DNEPFK M+R+ +V+ MK A LYASQGGPIILSQIENEYG ++ ++ G Y
Sbjct: 142 KFRTDNEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAY 201
Query: 202 VRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTS 261
+RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+
Sbjct: 202 MRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQF--TPNSAAKPKMWTENWSG 259
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQ 320
++ +G R ED+A+ VA F + G++ NYYMYHGGTN R++ ++ T Y
Sbjct: 260 WFLSFGGAVPYRPVEDLAFAVARFYQR-GGTFQNYYMYHGGTNLDRSSGGPFIATSYDYD 318
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFL 380
AP+DEYGL+RQPKWGHL+++H A+KLC +++ + EA +++ S CAAFL
Sbjct: 319 APIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVYKVGSVCAAFL 378
Query: 381 VNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----------------- 423
N D +++ TV F+ MY LP S+SILPDCK V NTA+++S
Sbjct: 379 ANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVAS 438
Query: 424 ----------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP 471
V W E + + +L L+EQ+NTT DASD+LWY + K D
Sbjct: 439 DGSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITVKGDE 498
Query: 472 ---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSV 528
+ S+S L V+SLGHVL +ING+ GSA G S + +K + L+ G N + LLS
Sbjct: 499 PYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSA 558
Query: 529 MVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIV 587
VGL + GA+ + AG+ V + G D SS W YQ+GL GE L ++ +
Sbjct: 559 TVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEASPE 618
Query: 588 PWSRYGSSTHQPLTWYKTVFD 608
S + PL WYK +
Sbjct: 619 WVSANAYPINHPLIWYKVSME 639
>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
Length = 773
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 316/825 (38%), Positives = 446/825 (54%), Gaps = 121/825 (14%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ +T D R ++ING RKIL SGS+HYPRSTP+MWP LI K+K+GGL+ + T VFW+LHEP
Sbjct: 24 DQITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEP 83
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
Q Q+DF+G +DLVRFIK +QAQGLY LRIGP++ EW YGG P WLH+ P I R++N
Sbjct: 84 QRRQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNN 143
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
+ IENEYG V ++ + G Y+ W A+
Sbjct: 144 TVY-------------------------------MIENEYGNVMRAYHDAGVQYINWCAQ 172
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A L TGVPW+MC+QD+AP P+IN CNG C + PN+P+ P +WTENW+ +Y+ +G
Sbjct: 173 MAAALDTGVPWIMCQQDNAPQPMINTCNGYYCDQ--FTPNNPNSPKMWTENWSGWYKNWG 230
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEY 326
R+AED+A+ VA F ++ G++ NYYMYHGGTNFGRTA Y+ T Y APL+EY
Sbjct: 231 GSDPHRTAEDLAFSVARFY-QLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEY 289
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI--FQGSSECAAFLVNKD 384
G QPKWGHL++LH + K + G + ++++ L A I +QG S C F N +
Sbjct: 290 GNKNQPKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSN 347
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRA 444
+ T+ + + Y +P S+SILPDC +NTAK++S K + + SL+
Sbjct: 348 ADRDVTINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYSTFVKKGSEAENEPNSLQW 407
Query: 445 NFLLEQMNTTKDAS------DYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGS 498
+ E + S D +W + L V++ GH+LHAF+NGE +G
Sbjct: 408 TWRGETIQYITPGSVDISNDDPIW----------GKDLTLSVNTSGHILHAFVNGEHIGY 457
Query: 499 AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-----VSIQG 553
+ F + + L G N ++LLSV VGL + G + G+ S
Sbjct: 458 QYALLGQFEFQFRRSITLQLGKNEITLLSVTVGLTNYGPDFDMVNQGIHGPVQIIASNGS 517
Query: 554 AKELKDFSSFS-WGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTG 612
A +KD S+ + W Y+ GL GE +IF +R W ++ WYK FDAP G
Sbjct: 518 ADIIKDLSNNNQWAYKAGLNGEDKKIFLGR-ARYNQWKSDNLPVNRSFVWYKATFDAPPG 576
Query: 613 SDPVAINLISMGKGEAWVNGQSIGRYWVSFL----------------------TPQGTPS 650
DPV ++L+ +GKGEAWVNG S+GRYW S++ T G PS
Sbjct: 577 EDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYRGPYKAEKCNTNCGNPS 636
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQN 710
Q WYH+PRSFL T N LVL EE G P ++ TV+V C + + +
Sbjct: 637 QRWYHVPRSFLASTDNRLVLFEEFXGNPSSVTFQTVTVGNACANAREGY----------- 685
Query: 711 QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC--------ENYAIGSCHS 762
+++ C GR IS I FAS+G+P G C + + G+C +
Sbjct: 686 ---------------TLELSC-QGRAISXIKFASFGDPQGTCGKPFATGSQVFEKGTCEA 729
Query: 763 SNSRAIVEKACLGKRSCTVPVWTEKFYGDP-CPGIPKALLVDAQC 806
++S +I++K C+GK SC++ V +E+ G C K L V+A C
Sbjct: 730 ADSLSIIQKLCVGKYSCSIDV-SEQILGPAGCTADTKRLAVEAIC 773
>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
Length = 683
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 300/680 (44%), Positives = 401/680 (58%), Gaps = 53/680 (7%)
Query: 171 YASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPV 230
+ASQGGPIILSQIENEYG + G Y+ WAAK+AV L TGVPWVMCK+DDAPDP+
Sbjct: 2 FASQGGPIILSQIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPM 61
Query: 231 INACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMK 290
INACNG C + F+ PN P KP +WTE W+ ++ +G R +D+A+ VA FI K
Sbjct: 62 INACNGFYC-DGFS-PNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQK-G 118
Query: 291 GSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLK 349
GSY+NYYMYHGGTNFGRTA +T YD P+DEYGL+RQPK+GHLKELH A+KLC
Sbjct: 119 GSYINYYMYHGGTNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEH 178
Query: 350 PMLSGVLVSMNFSKLQEAFIFQ-GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISIL 408
++S + Q+A++F G CAAFL N A + F+N+ Y+LP SISIL
Sbjct: 179 ALVSSDPTVTSLGAYQQAYVFNSGPRRCAAFLSNFHS-TGARMTFNNMHYDLPAWSISIL 237
Query: 409 PDCKTVAFNTAKL-------------DSVEQWEEYKEAIPT-YDETSLRANFLLEQMNTT 454
PDC+ V FNTAK+ + W+ Y E + + ++ +S+ A LLEQ+N T
Sbjct: 238 PDCRNVVFNTAKVGVQTSRVQMIPTNSRLFSWQTYDEDVSSLHERSSIAAGGLLEQINVT 297
Query: 455 KDASDYLWYNFRFKHDPSD----SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL 510
+D SDYLWY S+ + L V S GH LH F+NG+F GSA G + FT
Sbjct: 298 RDTSDYLWYMTNVDISSSELRGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTREHRQFTF 357
Query: 511 EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQG-AKELKDFSSFSWGYQ 568
K VHL G N ++LLS+ VGLP+ G + E G L V + G + KD + W +
Sbjct: 358 AKPVHLRAGINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGRKDLTMQKWFNK 417
Query: 569 VGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKG 626
VGL GE + + + G V W R + T Q L WYK F+AP G +P+A+++ SMGKG
Sbjct: 418 VGLKGEAMDLVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKG 477
Query: 627 EAWVNGQSIGRYWVSFLTPQ-------------------GTPSQSWYHIPRSFLKPTGNL 667
+ W+NGQSIG+YW+++ G P+Q WYH+PRS+LKPT NL
Sbjct: 478 QVWINGQSIGKYWMAYANGDCSLCSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTQNL 537
Query: 668 LVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKV 727
+V+ EE G P I++ SV +C + + H P + KT + +V
Sbjct: 538 VVVFEELGGDPSKITLVKRSVAGVCADLQEHH-PNAEKLDIDSHEESKTL-----HQAQV 591
Query: 728 QIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEK 787
++C G+ IS I FAS+G P G C ++ G+CH++NS AIVEK C+G+ SC V V
Sbjct: 592 HLQCVPGQSISSIKFASFGTPTGTCGSFQQGTCHATNSHAIVEKNCIGRESCLVTVSNSI 651
Query: 788 FYGDPCPGIPKALLVDAQCT 807
F DPCP + K L V+A C+
Sbjct: 652 FGTDPCPNVLKRLSVEAVCS 671
>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
Length = 763
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 310/766 (40%), Positives = 424/766 (55%), Gaps = 98/766 (12%)
Query: 126 WGYG-GLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
W Y G P WL DVPGI FR+DN PFK M+R+ IV++++ +L+ QGGP+I+ Q+E
Sbjct: 1 WDYCRGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVE 60
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG +E S+ ++G Y++W +A+ L VPWVMC+Q DAP +IN+CNG C A
Sbjct: 61 NEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKA 120
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
NSP KP WTENW ++ +G+ + R ED+A+ VA F + +GS+ NYYMY GGTN
Sbjct: 121 --NSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQR-EGSFQNYYMYFGGTN 177
Query: 305 FGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
FGRTA + +T Y +P+DEYGL+R+PKWGHLK+LH+A+KLC ++S S + K
Sbjct: 178 FGRTAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSAD--SPQYIK 235
Query: 364 L---QEAFIFQGSSE--------------CAAFLVNKDKRNNATVYFSNLMYELPPLSIS 406
L QEA ++ S+ C+AFL N D+R V F+ Y LPP S+S
Sbjct: 236 LGPKQEAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVS 295
Query: 407 ILPDCKTVAFNTAK-----------------------LDSVEQ---------WEEYKEAI 434
ILPDC+ V FNTAK L + +Q W KE I
Sbjct: 296 ILPDCQNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPI 355
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE--------SVLKVSSLGHV 486
+ + + +LE +N TKD SDYLWY R D + + S+ V
Sbjct: 356 GIWSDQNFTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDV 415
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
F+NG+ GSA G+ + V + G N++ LLS +GL +SGA++E+ AG+
Sbjct: 416 FRVFVNGKLTGSAIGQW----VKFVQPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGI 471
Query: 547 R-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWY 603
R + + G K D S W YQVGL GE L ++ + W+ + TWY
Sbjct: 472 RGRIKLTGFKNGDIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWY 531
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------- 646
K F +P G+DPVAINL SMGKG+AWVNG IGRYW S ++P+
Sbjct: 532 KAYFSSPDGTDPVAINLGSMGKGQAWVNGHHIGRYW-SVVSPKDGCPRKCDYRGAYNSGK 590
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+QSWYHIPRS+LK + NLLVL EE G P I + S +CG VS+SH P
Sbjct: 591 CATNCGRPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTGVICGQVSESHYP 650
Query: 702 PVISWRSQNQRTLKTHKRIPGR-RPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSC 760
S R + + + + R P++ + C G IS + FASYG P G+C ++ G C
Sbjct: 651 ---SLRKLSNDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRGPC 707
Query: 761 HSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
H++NS ++V +ACLGK SCTV + F GDPC I K L V+A+C
Sbjct: 708 HATNSLSVVSQACLGKNSCTVEISNSAFGGDPCHSIVKTLAVEARC 753
>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
Length = 628
Score = 546 bits (1408), Expect = e-152, Method: Compositional matrix adjust.
Identities = 298/632 (47%), Positives = 387/632 (61%), Gaps = 36/632 (5%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
M C +LCL LT + GG G+NV+YDGRSLII+G RK+L S SIHYPRS P M
Sbjct: 1 MNLCFILCLVSTSLTF---TLVYGGVGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAM 57
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
WP LI AKEGG+DV++T VFWN HE PG + F GR DLV+F K VQ G+Y+ LRIGP
Sbjct: 58 WPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGP 117
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
F+ EW +GG+P WLH +PG VFR+ N+PF HM+++ T IVN+MK +L+ASQGGPIIL
Sbjct: 118 FVAAEWNFGGVPVWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIIL 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
SQIENEYG E+ + E G Y WAAK+AV T VPW+MC+Q DAPDPVI+ CN C
Sbjct: 178 SQIENEYGYYENYYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCD 237
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ P SP +P +WTENW +++ +G R ED+A+ VA F K GS NYYMYH
Sbjct: 238 Q--FTPTSPKRPKMWTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQK-GGSLNNYYMYH 294
Query: 301 GGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTNFGRTA +T YD AP+DEYGL R PKWGHLKELH A+KLC +L G V++
Sbjct: 295 GGTNFGRTAGGPFITTSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNI 354
Query: 360 NFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNT 418
+ EA I+ SS CAAF+ N D +N+ V F N Y LP S+SILPDCK V FNT
Sbjct: 355 SLGPSVEADIYTDSSGACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNT 414
Query: 419 AKLDS--------------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
AK+ S +W+ +KE + + N ++ +NTTKD +
Sbjct: 415 AKVSSPTNIVAMIPEHLQQSDKGQKTLKWDVFKENPGIWGKADFVKNGFVDHINTTKDTT 474
Query: 459 DYLWYNFRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEK 512
DYLW+ D ++ S+ L + S GH LHAF+N ++ G+ G S +FT +
Sbjct: 475 DYLWHTTSILIDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKN 534
Query: 513 MVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELK-DFSSFSWGYQVGL 571
+ L G N +++LS+ VGL +G + + AG+ +V I G D SS +W Y++G+
Sbjct: 535 PISLRAGKNEIAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGV 594
Query: 572 LGEKLQIFTDYGSRIVPWSRYGSSTH-QPLTW 602
LGE L I+ G V W+ Q LTW
Sbjct: 595 LGEHLSIYQGEGMNSVKWTSTSEPPKGQALTW 626
>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
Length = 774
Score = 543 bits (1400), Expect = e-151, Method: Compositional matrix adjust.
Identities = 312/757 (41%), Positives = 429/757 (56%), Gaps = 92/757 (12%)
Query: 130 GLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM 189
G P WL DVPGI FR+DNEP+K M+ + T IV++MK +LY+ QGGPIIL QIENEYG
Sbjct: 19 GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 78
Query: 190 VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSP 249
++ + + G Y+ WAA++A+ L TGVPWVMC+Q DAP+ ++N CN C + F PNS
Sbjct: 79 IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYC-DGFK-PNSY 136
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
+KP IWTE+W +Y +G+ R A+D A+ VA F + GS NYYMY GGTNF RTA
Sbjct: 137 NKPTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQR-GGSLQNYYMYFGGTNFERTA 195
Query: 310 SAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---Q 365
+ YD AP+DEYG+LRQPKWGHLK+LH+A+KLC + L+ V S ++ KL Q
Sbjct: 196 GGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLC-ESALTAVDGSPHYVKLGPMQ 254
Query: 366 EAFIF-----------QGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKT 413
EA ++ G+S+ C+AFL N D+ A+V+ Y LPP S+SILPDC+T
Sbjct: 255 EAHVYSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCET 314
Query: 414 VAFNTAKLDS------VEQ-------------------------WEEYKEAIPTYDETSL 442
VAFNTA++ + VE W +KE + + E
Sbjct: 315 VAFNTARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIF 374
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFKHDPSDS--------ESVLKVSSLGHVLHAFINGE 494
A +LE +N TKD SDYL Y R D L + + V F+NG+
Sbjct: 375 TAQGILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGK 434
Query: 495 FVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQG 553
GS G +L + + L+ G N ++LLS +VGL + GA+LE+ AG R V + G
Sbjct: 435 LAGSKVGHW----VSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTG 490
Query: 554 AKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY-GSSTHQPLTWYKTVFDAPT 611
D ++ W YQ+GL GE +I++ WS T P TW+KT+FDAP
Sbjct: 491 LSNGDIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPE 550
Query: 612 GSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPS--------------------- 650
G+ PV I+L SMGKG+AWVNG IGRYW G PS
Sbjct: 551 GNGPVTIDLGSMGKGQAWVNGHLIGRYWSLVAPESGCPSSCNYAGTYSDSKCRSNCGIAT 610
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQN 710
QSWYHIPR +L+ +GNLLVL EE G P IS++ T+C +S+++ PP+ +W
Sbjct: 611 QSWYHIPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAW---- 666
Query: 711 QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVE 770
R + P+++++C G ISKI FASYG P G C+N+++G+CH+S + +V
Sbjct: 667 SRAANGRPSVNTVAPELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVV 726
Query: 771 KACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+AC GK C + V T + +GDPC + K L V+A+C+
Sbjct: 727 EACEGKNRCAISV-TNEVFGDPCRKVVKDLAVEAECS 762
>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
Length = 607
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 292/587 (49%), Positives = 371/587 (63%), Gaps = 38/587 (6%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
LC F +T +VTYD ++++ING R+IL SGSIHYPRSTPQMWP LI
Sbjct: 16 FLCFFVCYVTA------------SVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLI 63
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GG+DV++T VFWN HEP G++ F R DLV+FIK VQ GLYV LRIGP++ E
Sbjct: 64 QKAKDGGVDVIETYVFWNGHEPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAE 123
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W +GG P WL VPG+ FR+DNEPFK M+++ T IV++MK+ L+ SQGGPIILSQIEN
Sbjct: 124 WNFGGFPVWLKYVPGVAFRTDNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIEN 183
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
EYG VE G Y +W +++AV L TGVPWVMCKQ+DAPDP+I+ CNG C E F+
Sbjct: 184 EYGPVEWEIGAPGKSYTKWFSQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYC-ENFS- 241
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP +WTENWT +Y +G R AED+A+ VA F+ + +GSYVNYYMYHGGTNF
Sbjct: 242 PNKNYKPKMWTENWTGWYTDFGTAVPYRPAEDLAFSVARFV-QNRGSYVNYYMYHGGTNF 300
Query: 306 GRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
GRT+S ++ T Y AP+DEYGL+ +PKWGHL++LH A+K C ++S K
Sbjct: 301 GRTSSGLFIATSYDYDAPIDEYGLISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKN 360
Query: 365 QEAFIFQGS-SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-- 421
E +++ S CAAFL N D + A V F N Y+LPP SISILPDCKT FNTAK+
Sbjct: 361 LEVHLYKTSFGACAAFLANYDTGSWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRA 420
Query: 422 ----------DSVEQWEEYKEAIPTYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHD 470
+S W+ Y E E+ S AN LLEQ++ T D SDYLWY
Sbjct: 421 PRVHRSMTPANSAFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNIS 480
Query: 471 PSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
P++ VL S GHVLH FING+F G+A+G + T V L G N +S
Sbjct: 481 PNEGFIKNGQNPVLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKIS 540
Query: 525 LLSVMVGLPDSGAYLER-RVAGLRNVSIQGAKE-LKDFSSFSWGYQV 569
LLSV VGL + G + E+ V L V+++G E +D S W Y+V
Sbjct: 541 LLSVAVGLSNVGVHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKV 587
>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
Length = 450
Score = 540 bits (1390), Expect = e-150, Method: Compositional matrix adjust.
Identities = 271/486 (55%), Positives = 338/486 (69%), Gaps = 53/486 (10%)
Query: 183 IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET 242
IENEYG +E +F EKG YV WAAK+AVDLQTGVPW+MCKQ DAPDPVIN CNG +CGET
Sbjct: 1 IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKCGET 60
Query: 243 FAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
F GPNSP+KP++WTENWTSFYQVYG E IRSA+DIA+HVALFIAK GSYVNYYMYHGG
Sbjct: 61 FGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAK-NGSYVNYYMYHGG 119
Query: 303 TNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFS 362
TNFGRTA+AYV+TGYYDQAPLDEYGL+RQPKWGHLKELH+ +K C +L GV +++
Sbjct: 120 TNFGRTAAAYVITGYYDQAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTNLSVG 179
Query: 363 KLQEAFIFQGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL 421
+LQ+A++F+ C AFLVN D N ATV F N +EL P SISILPDC + FNTAK+
Sbjct: 180 QLQQAYMFEAQGGGCVAFLVNNDSVN-ATVGFRNKSFELLPKSISILPDCDNIIFNTAKV 238
Query: 422 DS------------VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKH 469
++ + WE+Y + IP Y +++++++ LLE MNTTKD SDYLWY F F+
Sbjct: 239 NAGSNRRITTSSKKLNTWEKYIDVIPNYSDSTIKSDTLLEHMNTTKDKSDYLWYTFSFQP 298
Query: 470 DPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDK-SFTLEKMVHLING--TNNVSLL 526
+ S ++ +L V SL HV +AF+N ++ GSAHG + K F +E + L + +NN+S+L
Sbjct: 299 NLSCTKPLLHVESLAHVAYAFVNNKYSGSAHGSKNGKVPFIMEVPIVLDDDGLSNNISIL 358
Query: 527 SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
SV+VGL VGLLGE LQ++ +
Sbjct: 359 SVLVGL-----------------------------------SVGLLGETLQLYGKEHLEM 383
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
V WS+ S QPLTW+K FD P G+DPV +NL +M KGEAWVNGQSIGRYW+SFLT +
Sbjct: 384 VKWSKADISIAQPLTWFKLEFDTPKGNDPVVLNLATMSKGEAWVNGQSIGRYWISFLTSK 443
Query: 647 GTPSQS 652
G PSQ+
Sbjct: 444 GHPSQT 449
>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
Length = 774
Score = 534 bits (1375), Expect = e-149, Method: Compositional matrix adjust.
Identities = 317/835 (37%), Positives = 432/835 (51%), Gaps = 151/835 (18%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++VTYD RSLII+G R++L S SIHYPRS P+MWP+L+A+AK+GG D V+T VFWN HEP
Sbjct: 36 SSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHEP 95
Query: 88 QPGQ--------------------FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWG 127
GQ + F R DLVRF K V+ GLY+ LRIGPF+ EW
Sbjct: 96 AQGQVRAASPKFVMDLACSIRDKPYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWT 155
Query: 128 YGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEY 187
+GG+P WLH PG VFR++NEPFK HMKR+ T IV+MMK + +ASQGG IIL+Q+ENEY
Sbjct: 156 FGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEY 215
Query: 188 GMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN 247
G +E ++ PY WAA +A+ TGVPW+MC+Q DAPDPVIN CN C + PN
Sbjct: 216 GDMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQF--KPN 273
Query: 248 SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR 307
SP KP WTENW ++Q +G+ R ED+A+ VA F K GS NY
Sbjct: 274 SPTKPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGK-GGSLQNY----------- 321
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
YV Y DQ+ G S V +++ S + +
Sbjct: 322 ----YVADVYTDQS-------------GGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVS 364
Query: 368 FIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW 427
+ +C N K + T LM ++ P ++ ++K+D W
Sbjct: 365 IL----PDCKNVAFNTAKVRSQT-----LMMDMVPANLE-----------SSKVDG---W 401
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD---SESVLKVSSLG 484
++E + L N ++ +NTTKD++DYLWY F D S VL + S G
Sbjct: 402 SIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIESKG 461
Query: 485 HVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA 544
H + AF+N E +GSA+G S +F++E V+L G N +SLLS+ VGL + G E A
Sbjct: 462 HAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEWAGA 521
Query: 545 GLRNVSIQGAK-ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWY 603
G+ +V I G + + D SS W Y+V +
Sbjct: 522 GITSVKISGMENRIIDLSSNKWEYKVNV-------------------------------- 549
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF----------------LTPQ- 646
D P G DPV +++ SMGKG AW+NG +IGRYW +P
Sbjct: 550 ----DVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTFSPNK 605
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q WYH+PRS+ P+GN LV+ EE+ G P I+ +V ++C VS+ +
Sbjct: 606 CRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSEHY-- 663
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
P I S ++ T + KVQ+ CP G+ IS + F S+GNP+G C +Y GSCH
Sbjct: 664 PSIDLESWDRNTQNDGRDA----AKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCH 719
Query: 762 SSNSRAIVEK---------ACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
NS ++VEK ACL CTV + E F D CPG+ K L ++A C+
Sbjct: 720 HPNSISVVEKGTLGWAHRRACLNMNGCTVSLSDEGFGEDLCPGVTKTLAIEADCS 774
>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
Length = 579
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 276/560 (49%), Positives = 351/560 (62%), Gaps = 24/560 (4%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD RSL ING R+IL SGSIHYPRSTP+MWP LI KAK+GGLDV+QT VFWN HEP G
Sbjct: 23 TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ FS R DLVRF+K V+ GLYV LRIGP++ EW YGG P WL VPGI FR+DN PF
Sbjct: 83 QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E YV WAAK+AV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
GVPW+MCKQDDAPDPVIN CNG C + PNS +KP++WTE W+ ++ +G
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDF--TPNSKNKPSMWTEAWSGWFTAFGGTV 260
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+A+ VA FI K GS++NYYMYHGGTNF RTA ++ T Y AP+DEYGLL
Sbjct: 261 PQRPVEDLAFAVARFIQK-GGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLL 319
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNN 388
RQPKWGHL LH A+K +++G N ++A++F+ SS +CAAFL N
Sbjct: 320 RQPKWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAA 379
Query: 389 ATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-----------WEEYKEAIPTY 437
A V F+ Y+LP SIS+LPDC+T +NTA + + W+ Y EA +
Sbjct: 380 ARVAFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAKMNPAGGFTWQSYGEATNSL 439
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SDSESVLKVSSLGHVLHAFI 491
DET+ + L+EQ++ T D SDYLWY D S L V S GH + F+
Sbjct: 440 DETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYSAGHSVQVFV 499
Query: 492 NGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVS 550
NG++ G+A+G + T V + G+N +S+LS VGLP+ G + E + L V+
Sbjct: 500 NGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETWNIGVLGPVT 559
Query: 551 IQGAKELK-DFSSFSWGYQV 569
+ G E K D S W YQV
Sbjct: 560 LSGLNEGKRDLSKQKWTYQV 579
>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
Length = 2260
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 263/506 (51%), Positives = 334/506 (66%), Gaps = 31/506 (6%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV YD R+L+I+G R++L SGSIHYPRSTPQMWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 21 NVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEPV 80
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+DF GR+DLV+F+K V GLYV LRIGP++ EW YGG P WLH +PGI FR+DNE
Sbjct: 81 KGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCSEWNYGGFPLWLHFIPGIKFRTDNE 140
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK MKR+ T IV++MK +LYASQGGPIILSQIENEYG ++ ++ G Y+ WAAK+
Sbjct: 141 PFKVEMKRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWAAKM 200
Query: 209 AVDLQTGVPWVMCKQDDAPDP-VINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
A L TGVPWVMC+Q DAPDP VIN CNG C + PNS KP +WTENW+++Y ++G
Sbjct: 201 ATSLDTGVPWVMCQQADAPDPIVINTCNGFYCDQ--FTPNSKTKPKLWTENWSAWYLLFG 258
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQAPLDEY 326
R ED+A+ VA F + G++ NYYMYHGGTNF R T ++ T Y AP+DEY
Sbjct: 259 GGFPHRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFDRSTGGPFIATSYDFDAPIDEY 317
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
G++RQPKWGHLK++H A+KLC + +++ EA +++ S CAAFL N D +
Sbjct: 318 GVIRQPKWGHLKDVHKAIKLCEEALIAAEPKITYLGPNLEAAVYKTGSVCAAFLANVDAK 377
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV---------------------- 424
++ TV FS Y LP S+SILPDCK V NTAK++S
Sbjct: 378 SDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASTISNFVTESLKEDISSSETSR 437
Query: 425 EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR--FKHDPSDSESVLKVSS 482
+W E + + L LLEQ+N T D SDYLWY+ K DP S++VL + S
Sbjct: 438 SKWSWINEPVGISKDDILSKTGLLEQINITADRSDYLWYSLSVDLKDDPG-SQTVLHIES 496
Query: 483 LGHVLHAFINGEFVG-SAHGKHSDKS 507
LGH LHAFING+ S G SD +
Sbjct: 497 LGHALHAFINGKLADKSDSGDKSDSA 522
Score = 243 bits (619), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 189/338 (55%), Gaps = 37/338 (10%)
Query: 496 VGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGA 554
+GS G + + +++G N + LLS+ VGL + GA+ + AG+ V ++G
Sbjct: 1932 LGSQTGNKEKPKLNEDIPITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGL 1991
Query: 555 K---ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPT 611
K + D SS W YQVGL GE L + + GS S+ QPL WYKT FDAP+
Sbjct: 1992 KNGNKTLDLSSRKWTYQVGLKGEDLGLSS--GSSGAWNSKTTFPKKQPLIWYKTNFDAPS 2049
Query: 612 GSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTP 649
GS+PV I+ MGKGEAWVNGQSIGRYW +++ G P
Sbjct: 2050 GSNPVVIDFTGMGKGEAWVNGQSIGRYWPTYVASNVDCTDSCNYRGPFTQTKCHMNCGKP 2109
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ 709
SQ+ YH+P+SFLKP GN LVL EE G P IS T + ++C HVSDSH P + W
Sbjct: 2110 SQTLYHVPQSFLKPNGNTLVLFEESGGDPTQISFATKQIGSVCAHVSDSHPPQIDLWNQD 2169
Query: 710 NQRTLKTHKRIPGRRPKVQIRCPSGRK-ISKILFASYGNPNGNCENYAIGSCHSSNSRAI 768
+ K P + + CP+ + IS I FASYG P G C N+ G C S+ + +I
Sbjct: 2170 TESGGKV-------GPALLLNCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKTLSI 2222
Query: 769 VEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
V+KAC+G RSC++ V T+ F GDPC G+PK+L V+A C
Sbjct: 2223 VKKACIGSRSCSIGVSTDTF-GDPCKGVPKSLAVEATC 2259
>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 592
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 258/529 (48%), Positives = 346/529 (65%), Gaps = 25/529 (4%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+ VTYDGRSL+I+G R + FSG+IHYPRS P++WP+LI +AKEGGL+ ++T +FWN HE
Sbjct: 33 GSVVTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG+++F GR DL++++K +Q +Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N+P+K M+++ IV +K A L+ASQGGPIIL+QIENEYG ++ G Y+ WAA
Sbjct: 153 NDPYKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ QTGVPW+MCKQ AP VI CNGR CG+T+ +KP +WTENWT ++ Y
Sbjct: 213 QMALSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWT-LRDKNKPMLWTENWTQQFRAY 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ +RSAEDIAY V F AK GS VNYYMYHGGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 272 GDQVAMRSAEDIAYAVLRFFAK-GGSLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEY 330
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ ++ K L G S EA IF+ E C +FL N +
Sbjct: 331 GMYKEPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNN 390
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------------DSVEQWEE 429
+ TV F + +P S+SIL CK V +NT ++ QWE
Sbjct: 391 TGEDGTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSYHTSEVTSKNNQWEM 450
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSL 483
Y E IP Y +T +R LEQ N TKDASDYLWY +FR + D +D VL+V S
Sbjct: 451 YSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVKSS 510
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGL 532
H + F N FVG A G K F EK V L G N+V LLS +G+
Sbjct: 511 AHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGM 559
>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
Length = 713
Score = 526 bits (1356), Expect = e-146, Method: Compositional matrix adjust.
Identities = 289/645 (44%), Positives = 379/645 (58%), Gaps = 61/645 (9%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD RSL+I+G R+I+ SGSIHYPRSTP+MWP LI KAKEGGLD ++T +FWN H
Sbjct: 27 GCTSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGH 86
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP Q++F G D+VRF KE+Q G+Y LRIGP+I GEW YGGLP WL D+PG+ FR
Sbjct: 87 EPHRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRL 146
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVR 203
NEPF+ M+ + T+IVN MK ++++A QGGPIIL+QIENEYG M + + + Y+
Sbjct: 147 HNEPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIH 206
Query: 204 WAAKLAVDLQTGVPWVMCKQDD-APDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
W A +A GVPW+MC+QDD P V+N CNG C + F PN P IWTENWT +
Sbjct: 207 WCADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWF--PNRTGIPKIWTENWTGW 264
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQA 321
++ + RSAEDIA+ VA+F K +GS NYYMYHGGTNFGRT+ Y+ T Y A
Sbjct: 265 FKAWDKPDFHRSAEDIAFAVAMFFQK-RGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDA 323
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
PLDEYG LRQPK+GHLKELHS +K K ++ G N+ + S A F+
Sbjct: 324 PLDEYGNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSSACFIN 383
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-----------DSVEQ---- 426
N+ + V + LP S+SILPDCKTVAFN+AK+ ++ EQ
Sbjct: 384 NRFDDKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTSVMVKKPNTAEQEQES 443
Query: 427 --WEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVS 481
W E + T ++ + R N LLEQ+ T+ D SDYLWY H S L V+
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLNHKGEGSYK-LYVN 502
Query: 482 SLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER 541
+ GH L+AF+NG+ +G H D F LE V L +G N +SLLS VGL + G E+
Sbjct: 503 TTGHELYAFVNGKLIGKNHSADGDFVFQLESPVKLHDGKNYISLLSATVGLKNYGPSFEK 562
Query: 542 RVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
G+ + G +L D S G + L WS
Sbjct: 563 MPTGI----VGGPVKLID----SNGTAIDLSNSS-------------WS----------- 590
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
YK F+AP+G DPV ++L+ + KG AWVNG ++GRYW S+ +
Sbjct: 591 -YKATFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAE 634
>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
Length = 620
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 283/620 (45%), Positives = 377/620 (60%), Gaps = 48/620 (7%)
Query: 107 VQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMK 166
V GLYV LRIGP++ EW +GG P WL VPG+ FR+DNEPFK MK++ IV MMK
Sbjct: 2 VHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMK 61
Query: 167 AARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDA 226
A +L+ +QGGPIIL+QIENEYG VE G Y +W A++A+ L TGVPW+MCKQ+DA
Sbjct: 62 AEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDA 121
Query: 227 PDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFI 286
P P+I+ CNG C E F PNS +KP +WTENWT +Y +G R EDIAY VA FI
Sbjct: 122 PGPIIDTCNGYYC-EDFK-PNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFI 179
Query: 287 AKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKL 346
K GS VNYYMYHGGTNF RTA ++ + Y APLDEYGL R+PK+ HLK LH A+KL
Sbjct: 180 QK-GGSLVNYYMYHGGTNFDRTAGEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKL 238
Query: 347 CLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSIS 406
+LS + QEA++F S CAAFL NKD+ + A V F Y+LPP S+S
Sbjct: 239 SEPALLSADATVTSLGAKQEAYVFWSKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVS 298
Query: 407 ILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPTYDETSLRA-NFLLEQMNT 453
ILPDCKT +NTAK+++ W + EA PT +E A N L+EQ++
Sbjct: 299 ILPDCKTEVYNTAKVNAPSVHRNMVPTGTKFSWGSFNEATPTANEAGTFARNGLVEQISM 358
Query: 454 TKDASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKS 507
T D SDY WY ++ +L V S GH LH F+NG+ G+A+G
Sbjct: 359 TWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSGTAYGGLDHPK 418
Query: 508 FTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSW 565
T + + L G N ++LLSV VGLP+ G + E+ G L V+++G D S + W
Sbjct: 419 LTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKW 478
Query: 566 GYQVGLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKTVFDAPTGSDPVAINLISM 623
Y++G+ GE L + T+ S V W++ GS + QPLTWYK+ F P G++P+A+++ +M
Sbjct: 479 SYKIGVKGEALSLHTNTESSGVRWTQ-GSFVAKKQPLTWYKSTFATPAGNEPLALDMNTM 537
Query: 624 GKGEAWVNGQSIGRYWVSF--------------------LTPQGTPSQSWYHIPRSFLKP 663
GKG+ W+NG++IGR+W ++ L+ G SQ WYH+PRS+LK
Sbjct: 538 GKGQVWINGRNIGRHWPAYKAQGSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLK- 596
Query: 664 TGNLLVLLEEENGYPPGISI 683
+ NL+V+ EE G P GIS+
Sbjct: 597 SQNLIVVFEELGGDPNGISL 616
>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
Length = 569
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 266/544 (48%), Positives = 350/544 (64%), Gaps = 23/544 (4%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD ++LIING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP P
Sbjct: 29 VTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEPSP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G + F R DLV+F K V GLY+ LRIGP++ EW +GG P WL VPG+VFR+DNEP
Sbjct: 89 GNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDNEP 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK +L+ +QGGPIILSQIENEYG ++ G Y +W A++A
Sbjct: 149 FKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAEMA 208
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ L TGVPW+MCKQ+DAP P+I+ CNG C E F PNS +KP +WTENWT ++ +G
Sbjct: 209 LGLSTGVPWIMCKQEDAPYPIIDTCNGFYC-EGFK-PNSDNKPKLWTENWTGWFTEFGGA 266
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLL 329
R EDIA+ VA FI + GS++NYYMY GGTNF RTA ++ T Y AP+DEYGLL
Sbjct: 267 IPNRPVEDIAFSVARFI-QNGGSFMNYYMYXGGTNFDRTAGVFIATSYDYDAPIDEYGLL 325
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
R+PK+ HLKELH +KLC ++S + QE +F+ + CAAFL N D + A
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVFKSKTSCAAFLSNYDTSSAA 385
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------------QWEEYKEAIPTY 437
V F Y+LPP S+SILPDCKT +NTAK+ + WE Y E P+
Sbjct: 386 RVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRAPTILMKMIPTSTKFSWESYNEGSPSS 445
Query: 438 DET-SLRANFLLEQMNTTKDASDYLWY--NFRFKHDPS----DSESVLKVSSLGHVLHAF 490
+E + + L+EQ++ T+D +DY WY + D S +L + S GH LH F
Sbjct: 446 NEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLLTIFSAGHALHVF 505
Query: 491 INGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNV 549
+NG G+++G S+ T + + L G N ++LLS VGLP++G + E G L V
Sbjct: 506 VNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVHYETWNTGILGPV 565
Query: 550 SIQG 553
+++G
Sbjct: 566 TLKG 569
>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
Length = 830
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 314/846 (37%), Positives = 435/846 (51%), Gaps = 110/846 (13%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+L+I+G R++L SGSIHYPRSTP MWP L A+AK G+DV+QT +FWN + P
Sbjct: 26 NVTYDSRALLIDGRRRLLVSGSIHYPRSTPDMWPELFARAKANGIDVIQTYLFWNTNVPT 85
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+F S R D VRF++ Q GLYV RIGPF+ EW YGGLP WL +P I+FR ++
Sbjct: 86 PGEFVMSDRFDYVRFVQLAQEAGLYVNFRIGPFVCAEWTYGGLPAWLRQIPDIMFRDYDQ 145
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+ Y T V ++K RL A QGGPIIL QIENEYG E + GP YV W +L
Sbjct: 146 PWLQVAGEYITKTVQILKDNRLLAGQGGPIILLQIENEYGGTESRY-AGGPQYVEWCGQL 204
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A +L W+MC Q DAP +I CN C + P +P++WTENW ++Q +GD
Sbjct: 205 AANLTDAAQWIMCSQPDAPANIIATCNAFYCDDFVP---HPGQPSMWTENWPGWFQKWGD 261
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R A+D+AY V + K GSY+NYYMYHGGTNF RTA +T YD A LDEYG
Sbjct: 262 PTPHRPAQDVAYAVTRYYIK-GGSYMNYYMYHGGTNFERTAGGPFITTNYDYDASLDEYG 320
Query: 328 LLRQPKWGHLKELHSAVK-----LCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVN 382
+ +PK+ HL +H+ + + P + + N EA I+ S C AFL N
Sbjct: 321 MPNEPKYSHLGSMHAVLHDNEAIMMAVPAPKPISLGTNL----EAHIYNSSVGCVAFLSN 376
Query: 383 KDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------DSVEQWEEYKEAI 434
+ + + V F+ YELP S+S+L C T +NTA D+ E +
Sbjct: 377 NNNKTDVEVQFNGRTYELPAWSVSVLHGCVTAIYNTAVCRAHQRAPHDAACCARESRRVC 436
Query: 435 -------------------------------PTYDETSLRANFLLEQMNTTKDASDYLWY 463
P T LEQ++ T D +DYLWY
Sbjct: 437 DRLPPLRPKARAPCQSGRIRHLCLVVLTSIGPQAPATKYWNKTPLEQIDQTLDHTDYLWY 496
Query: 464 NFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNV 523
+ + S + + L + + V + ++NG+FV + S + V L+ G N +
Sbjct: 497 STSYVSS-SATYAQLSLPQITDVAYVYVNGKFVTVSW------SGNVSATVSLVAGPNTI 549
Query: 524 SLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYG 583
+LS+ +GL + G L GL G+ L + W +Q G++GE+ IF
Sbjct: 550 DILSLTMGLDNGGDILSEYNCGLLGGVYLGSVNLTE---NGWWHQTGVVGERNAIFLPEN 606
Query: 584 SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSD-PVAINLISMGKGEAWVNGQSIGRYWVSF 642
+ V W+ + + LTWYK+ FD P S P+A++L MGKG WVNG ++GRYW +
Sbjct: 607 LKKVAWTT-PAVLNTGLTWYKSSFDVPRDSQAPLALDLTGMGKGYVWVNGHNLGRYWPTI 665
Query: 643 LTP---------QGT------------PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGI 681
L +GT PSQ+ YH+PR +L+ N+LVLLEE G P I
Sbjct: 666 LATNWPCDVCDYRGTYDAPHCKQGCNMPSQTHYHVPREWLQAENNVLVLLEEMGGNPSKI 725
Query: 682 SIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKIL 741
++ CG V + + P V + C + + I+ +
Sbjct: 726 ALVEREEYVSCGVVGEDY---------------------PADDLAVVLGCGTHQTIAGVD 764
Query: 742 FASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIP-KAL 800
FASYG P G+C +Y GSCH+SNS IV C GK++C++PV + +G+PCP + K L
Sbjct: 765 FASYGTPMGSCRSYQQGSCHASNSTEIVLSLCHGKQACSIPV-SAAMFGNPCPDVTNKRL 823
Query: 801 LVDAQC 806
V C
Sbjct: 824 AVQVAC 829
>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
Length = 514
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 262/490 (53%), Positives = 327/490 (66%), Gaps = 26/490 (5%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD +++ ING R+IL SGSIHYPRSTP+MWP LI KAKEGGLDV+QT VFWN HEP
Sbjct: 20 SVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 79
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ F G DLVRFIK V+ GLYV LRIGP++ EW +GG P WL +PGI FR++N
Sbjct: 80 PGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNNG 139
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK +M+R+ IV+MMKA L+ SQGGPIILSQIENEYG +E+ G Y +WAA++
Sbjct: 140 PFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQM 199
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
AV L TGVPWVMCKQDDAPDP+IN+CNG C + PN KP +WTE WT ++ +G
Sbjct: 200 AVGLGTGVPWVMCKQDDAPDPIINSCNGFYC--DYFSPNKAYKPKMWTEAWTGWFTEFGG 257
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYG
Sbjct: 258 AVPYRPVEDLAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 316
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDKR 386
L+RQPKWGHLK+LH A+KLC ++SG M + QEA +F+ CAAFL N + R
Sbjct: 317 LVRQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPR 376
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKE 432
+ A V F N+ Y LPP SISILPDCK +NTA++ + W+ Y E
Sbjct: 377 SFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQSARMKMVPVPIHGAFSWQAYNE 436
Query: 433 AIPTYD-ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGH 485
P+ + E S L+EQ+NTT+D SDYLWY+ K DP + L V S GH
Sbjct: 437 EAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPTLTVLSAGH 496
Query: 486 VLHAFINGEF 495
LH F+N +
Sbjct: 497 ALHVFVNDQL 506
>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
Length = 700
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 291/650 (44%), Positives = 376/650 (57%), Gaps = 79/650 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD RSL+ING R+IL SGSIHYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP
Sbjct: 40 VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F+ R DLVRF+K V+ GLYV LR+GP++ EW +GG P WL VPGI FR+DN P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+++ IV+MMK+ L+ QGGPII++Q+ENE+G +E G PY WAA++A
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PN+ KP +WTE WT ++ +G
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYC--DYFTPNNKHKPTMWTEAWTGWFTKFGGA 277
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEY-- 326
A R ED+A+ VA F+ K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DE+
Sbjct: 278 APHRPVEDLAFAVARFVQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGM 336
Query: 327 -----------------------------------------------GLLRQPKWGHLKE 339
GLLRQPKWGHL+
Sbjct: 337 QWLLPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRN 396
Query: 340 LHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMY 398
+H A+K ++SG + ++A++F+ + CAAFL N ++ + F Y
Sbjct: 397 MHRAIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHY 456
Query: 399 ELPPLSISILPDCKTVAFNTA---------KLDSVEQ---WEEYKEAIPTYDETSLRANF 446
+LP SISILPDCKT FNTA K+ V W+ Y E + D+++ +
Sbjct: 457 DLPAWSISILPDCKTAVFNTATVKEPTLLPKMSPVMHRFAWQSYSEDTNSLDDSAFARDG 516
Query: 447 LLEQMNTTKDASDYLWY--------NFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGS 498
L+EQ++ T D SDYLWY N RF S L V S GH + F+NG GS
Sbjct: 517 LIEQLSLTWDKSDYLWYTTHVNIGSNERFLK--SGQWPQLSVYSAGHSMQVFVNGRSYGS 574
Query: 499 AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVSIQGAKEL 557
+G + + T V + G+N +S+LS VGLP++G + E V L V++ G E
Sbjct: 575 VYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTLSGLNEG 634
Query: 558 K-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
K D S W YQVGL GE L + T GS V W+ G T QPLTW+K +
Sbjct: 635 KRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGGGT-QPLTWHKVL 683
>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 616
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 275/590 (46%), Positives = 360/590 (61%), Gaps = 37/590 (6%)
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+DF GR DLVRF+K GLYV LRIGP++ EW YGG P WLH +PGI R+DNEPF
Sbjct: 1 QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+R+ +V MK A LYASQGGPIILSQIENEYG + S+ G Y+RWAA +AV
Sbjct: 61 KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L TGVPWVMC+Q DAP+P+IN CNG C + P+ P +P +WTENW+ ++ +G
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFT--PSLPSRPKLWTENWSGWFLSFGGAV 178
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLL 329
R ED+A+ VA F + G+ NYYMYHGGTNFGR++ ++ YD AP+DEYGL+
Sbjct: 179 PYRPTEDLAFAVARFYQR-GGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLV 237
Query: 330 RQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
RQPKWGHL+++H A+K+C +++ M+ + EA +++ S CAAFL N D +++
Sbjct: 238 RQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVYKSGSLCAAFLANIDDQSDK 297
Query: 390 TVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-------------------------- 423
TV F+ Y+LP S+SILPDCK V NTA+++S
Sbjct: 298 TVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAEL 357
Query: 424 -VEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHDP--SDSESV 477
W E + E +L L+EQ+NTT DASD+LWY+ +P + S+S
Sbjct: 358 AASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQSN 417
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L V+SLGHVL FING+ GS+ G S +L V L+ G N + LLS VGL + GA
Sbjct: 418 LLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGA 477
Query: 538 YLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
+ + AG+ V + G K D SS W YQ+GL GE L ++ + S T
Sbjct: 478 FFDLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEASPEWVSDNSYPT 537
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
+ PLTWYK+ F AP G DPVAI+ MGKGEAWVNGQSIGRYW + + PQ
Sbjct: 538 NNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQ 587
>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
Length = 589
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 281/589 (47%), Positives = 365/589 (61%), Gaps = 50/589 (8%)
Query: 141 IVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP 200
+ FR+DNEPFK M+++ T IV MMKA L+ +QGGPII+SQIENEYG VE G
Sbjct: 1 MAFRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKA 60
Query: 201 YVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
Y +WAA++AV L TGVPW MCKQ+DAPDPVI+ CNG C E F PN KP +WTENW+
Sbjct: 61 YTKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYC-ENFT-PNENFKPKMWTENWS 118
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD- 319
+Y +G R ED+AY VA FI + +GS+VNYYMYHGGTNFGRT+S + YD
Sbjct: 119 GWYTDFGGAISHRPTEDLAYSVATFI-QNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDY 177
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLS--GVLVSMNFSKLQEAFIFQGSSECA 377
AP+DEYGL +PKW HLK LH A+K C ++S + + L+ + +S CA
Sbjct: 178 DAPIDEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICA 237
Query: 378 AFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK------------LDSVE 425
AFL N D ++ ATV F N Y+LPP S+SILPDCKTV FNTA +++
Sbjct: 238 AFLANYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATVNGHSFHKRMTPVETTF 297
Query: 426 QWEEYKEAIPTY--DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDS------ESV 477
W+ Y E P Y D+ S+ AN L EQ+N T+D+SDYLWY PS+S
Sbjct: 298 DWQSYSEE-PAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQFPT 356
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L ++S GHVLH F+NG+ G+ +G + T + V+L G N +SLLSV VGLP+ G
Sbjct: 357 LTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNVGL 416
Query: 538 YLER-RVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS- 594
+ E V L V ++G E +D S W Y+VGL GE L + T GS + W++ S
Sbjct: 417 HFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQGSSL 476
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL----------- 643
+ QPLTWYKT FDAP+G+DPVA+++ SMGKGE W+N QSIGR+W +++
Sbjct: 477 AKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIAHGNCDECNYA 536
Query: 644 ---------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G P+Q WYHIPRS+L +GN+LV+LEE G P GIS+
Sbjct: 537 GTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISL 585
>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
Length = 705
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 284/642 (44%), Positives = 383/642 (59%), Gaps = 66/642 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+++I G R++L S +HYPR+TP+MWP LIAK KEGG DV++T VFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKFKEGGADVIETYVFWNGHEPA 122
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQ+ F R DLV+F K V A+GL++ LRIGP+ EW +GG P WL D+PGI FR+DNE
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+ + T IV +MK +LY+ QGGPIIL QIENEYG ++ ++ + G Y++WAA++
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A+ L TG+PWVMC+Q DAP+ +I+ CN C + F PNS +KP IWTE+W +Y +G
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYC-DGFK-PNSYNKPTIWTEDWDGWYADWGG 300
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYG 327
R AED A+ VA F + GS NYYMY GGTNF RTA + YD AP+DEYG
Sbjct: 301 ALPHRPAEDSAFAVARFYQR-GGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYG 359
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQ-----------GS 373
+LRQPKWGHLK+LH+A+KLC +P L V+ S + KL QEA ++ G+
Sbjct: 360 ILRQPKWGHLKDLHTAIKLC-EPALIAVVGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGN 418
Query: 374 SE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD------SVEQ 426
++ C+AFL N D+ A+V+ Y LPP S+SILPDC+ VAFNTA++ +VE
Sbjct: 419 AQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVES 478
Query: 427 --------------------------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDY 460
W KE I T+ + +LE +N TKD SDY
Sbjct: 479 GSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDY 538
Query: 461 LWYNFRFKHDPSD-----SESV---LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEK 512
LWY R +D S+ V L + + V F+NG+ GS G +L++
Sbjct: 539 LWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQ 594
Query: 513 MVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVG 570
+ L+ G N ++LLS +VGL + GA+LE+ AG R V++ G + D ++ W YQVG
Sbjct: 595 PIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVG 654
Query: 571 LLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTG 612
L GE I+ WSR + QP TWYK + + G
Sbjct: 655 LKGEFSMIYAPEKQGCAGWSRMQKDSVQPFTWYKNICNQSVG 696
>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
vinifera]
Length = 722
Score = 506 bits (1304), Expect = e-140, Method: Compositional matrix adjust.
Identities = 312/805 (38%), Positives = 422/805 (52%), Gaps = 160/805 (19%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G V+YDGR LI+NG R++LFSGSIH +PR I + W
Sbjct: 52 GVKGVSYDGRPLIVNGKRELLFSGSIH--------YPRSIPE-------------MW--- 87
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
P I + +GGL + + F +
Sbjct: 88 ----------------------------------PDIIXKARHGGL----NVIHTYAFWN 109
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
+EP + HMKR+ MI++MM + ASQGGPIIL+ +++ +F E G V WA
Sbjct: 110 LHEPVQDHMKRFTRMIIDMMSKEKXIASQGGPIILALVDSAI-----AFKEMGTRCVHWA 164
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
+AV L+TG+P VMCKQ DAPDPVIN C GR CG+TF GPN P+K ++ + + Y+V
Sbjct: 165 GTMAVGLKTGIPXVMCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSV-SNHXLGMYRV 223
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDE 325
+GD R+AED+A+ + FI+K G+ NYYMY+ TNFGRT S++ T YYD+APLDE
Sbjct: 224 FGDPPSQRAAEDLAF--SXFISK-NGTLANYYMYYSVTNFGRTTSSFATTCYYDEAPLDE 280
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNK 383
YGL R+ KWGHL++LH+A++L K +L GV + + EA I++ GS+ CA FL+N
Sbjct: 281 YGLPRETKWGHLRDLHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNN 340
Query: 384 DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------QWEEYKEAIPTY 437
R T Y LP SIS LPDCKTV FNT + S QW ++A+PTY
Sbjct: 341 ITRTPTTTTLRGSKYYLPQHSISNLPDCKTVVFNTQTVVSQYSVNKNLQWXMSQDALPTY 400
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS------DSESVLKVSSLGHVLHAFI 491
+E + +E M TKD +DYLWY + + D V +VS+LGHV+HAF+
Sbjct: 401 EECPTKTKSPVELMTMTKDTTDYLWYTTNIELARTGLPFRKDVLRVPQVSNLGHVMHAFL 460
Query: 492 NGEFV-----GSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
NGE++ G+ HG + +KSF K + L G N ++ L VGLPDSG+Y+E R+AG+
Sbjct: 461 NGEYMEFYLTGTRHGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHRLAGV 520
Query: 547 RNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKT 605
NV+IQG D WG +K
Sbjct: 521 HNVAIQGLNTRTIDLPKNGWG------------------------------------HKA 544
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG 665
FDAP G PVA+ L +M KG AW+NG+SI YWVS+L+P G PSQS YH+PR+FLK +
Sbjct: 545 YFDAPEGDVPVALELSTMAKGMAWINGKSIDXYWVSYLSPLGKPSQSVYHVPRAFLKTSD 604
Query: 666 NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRP 725
NLLVL EE P GI I T++ T+C ++S+ H V SW+ +
Sbjct: 605 NLLVLFEETGRNPDGIEILTLNRDTICCYISEHHPTHVRSWKREAS-------------- 650
Query: 726 KVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWT 785
+QI +G+P G C + G+C + NS +VEK CLGK SC++PV
Sbjct: 651 DIQI---------------FGDPTGTCXEFIPGNCAAPNSXKVVEKHCLGKSSCSIPVEQ 695
Query: 786 EKFYGDPC----PGIPKALLVDAQC 806
E D GI KAL V C
Sbjct: 696 EIVSKDGISISGSGITKALAVQVLC 720
>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
Japonica Group]
Length = 713
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 284/650 (43%), Positives = 382/650 (58%), Gaps = 74/650 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+++I G R++L S +HYPR+TP+MWP LIAK KEGG DV++T VFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 89 PGQFDFSGRRDLVRFIK--------EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPG 140
GQ+ F R DLV+F K V A+GL++ LRIGP+ EW +GG P WL D+PG
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPG 182
Query: 141 IVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP 200
I FR+DNEPFK M+ + T IV +MK +LY+ QGGPIIL QIENEYG ++ ++ + G
Sbjct: 183 IEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKR 242
Query: 201 YVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
Y++WAA++A+ L TG+PWVMC+Q DAP+ +I+ CN C + F PNS +KP IWTE+W
Sbjct: 243 YMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYC-DGFK-PNSYNKPTIWTEDWD 300
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD- 319
+Y +G R AED A+ VA F + GS NYYMY GGTNF RTA + YD
Sbjct: 301 GWYADWGGALPHRPAEDSAFAVARFYQR-GGSLQNYYMYFGGTNFARTAGGPLQITSYDY 359
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---QEAFIFQ----- 371
AP+DEYG+LRQPKWGHLK+LH+A+KLC +P L V S + KL QEA ++
Sbjct: 360 DAPIDEYGILRQPKWGHLKDLHTAIKLC-EPALIAVDGSPQYIKLGSMQEAHVYSTGEVH 418
Query: 372 ------GSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLD-- 422
G+++ C+AFL N D+ A+V+ Y LPP S+SILPDC+ VAFNTA++
Sbjct: 419 TNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQ 478
Query: 423 ----SVEQ--------------------------WEEYKEAIPTYDETSLRANFLLEQMN 452
+VE W KE I T+ + +LE +N
Sbjct: 479 TSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLN 538
Query: 453 TTKDASDYLWYNFRFKHDPSD-----SESV---LKVSSLGHVLHAFINGEFVGSAHGKHS 504
TKD SDYLWY R +D S+ V L + + V F+NG+ GS G
Sbjct: 539 VTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW- 597
Query: 505 DKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSIQGAKELK-DFSS 562
+L++ + L+ G N ++LLS +VGL + GA+LE+ AG R V++ G + D ++
Sbjct: 598 ---VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTN 654
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTG 612
W YQVGL GE I+ WSR + QP TWYK + + G
Sbjct: 655 SLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQPFTWYKNICNQSVG 704
>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
Length = 338
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 242/370 (65%), Positives = 278/370 (75%), Gaps = 36/370 (9%)
Query: 1 MGQCQLLCLFGLLL----TTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRS 56
MG L C FGLL+ TT GG +GG V+YDGRSLII G RK+LFSGSIHYPRS
Sbjct: 1 MGAFWLSC-FGLLMVMWTTTRGGVEGG-----QVSYDGRSLIIEGQRKLLFSGSIHYPRS 54
Query: 57 TPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCL 116
TP MWP LI+KAK GGLDV++T VFWNLHEP+ GQ+DF GR ++VRFI+E+QA GLY +
Sbjct: 55 TPDMWPSLISKAKHGGLDVIETYVFWNLHEPRHGQYDFKGRHNIVRFIREIQAHGLYAFI 114
Query: 117 RIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGG 176
RIGPFIE EW YGGLPFWLHDVPGIV+RSDNEPFK+HM+ + T IVN+ K+ LYA QGG
Sbjct: 115 RIGPFIEAEWTYGGLPFWLHDVPGIVYRSDNEPFKYHMQNFTTKIVNLFKSEGLYAPQGG 174
Query: 177 PIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNG 236
PIIL QIENEY E +F EKGPPYV+WAA +AV LQTGVPWVMCKQDDAPDPVIN CNG
Sbjct: 175 PIILQQIENEYKNAERAFHEKGPPYVQWAAAMAVGLQTGVPWVMCKQDDAPDPVINTCNG 234
Query: 237 RQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNY 296
R CGETF GPNSP+KPAIWT+NWTS GS+VNY
Sbjct: 235 RTCGETFVGPNSPNKPAIWTDNWTSL--------------------------KNGSFVNY 268
Query: 297 YMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVL 356
YMYHGGTNFGRT SA+VLT YYD+AP+DEYGL+RQPKWGHLK+LHS +K C + +L GV+
Sbjct: 269 YMYHGGTNFGRTGSAFVLTSYYDEAPIDEYGLIRQPKWGHLKQLHSVIKSCSQTLLHGVI 328
Query: 357 VSMNFSKLQE 366
+ QE
Sbjct: 329 SVSPLGQQQE 338
>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 613
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 272/612 (44%), Positives = 371/612 (60%), Gaps = 32/612 (5%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MWP LI KAK+GGLD ++T +FW+ HEPQ ++DFSGR D ++F + +Q GLYV +RIG
Sbjct: 1 MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
P++ EW YGG P WLH++PGI R++N+ +K M+ + T IVNM K A L+ASQGGPII
Sbjct: 61 PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 120
Query: 180 LSQIENEYGMV-EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQ 238
L+QIENEYG V ++ + G Y+ W A++A L GVPW+MC+Q DAP P+IN CNG
Sbjct: 121 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 180
Query: 239 CGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM 298
C + F PN+P P ++TENW +++ +GD+ R+AED+A+ VA F + G + NYYM
Sbjct: 181 C-DNFT-PNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFF-QSGGVFNNYYM 237
Query: 299 YHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLV 357
YHGGTNFGRT+ +T YD APLDEYG L QPKWGHLK+LH+++KL K + +
Sbjct: 238 YHGGTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRS 297
Query: 358 SMNF--SKLQEAFIFQGSSECAAFLVNKDKRNNATVYF-SNLMYELPPLSISILPDCKTV 414
+ NF S F + E FL N D +N+AT+ + Y +P S+SIL C
Sbjct: 298 NQNFGSSVTLTKFSNPTTGERFCFLSNTDGKNDATIDLQEDGKYFVPAWSVSILDGCNKE 357
Query: 415 AFNTAKLDS-----VEQWEEYKEAI--------PTYD----ETSLRANFLLEQMNTTKDA 457
+NTAK++S V++ E + A P D AN LLEQ T D
Sbjct: 358 VYNTAKVNSQTSMFVKEQNEKENAQLSWAWAPEPMKDTLQGNGKFAANLLLEQKRVTVDF 417
Query: 458 SDYLWYNFRFKHDPSDS--ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVH 515
SDY WY + + + S L+V++ GHVLHAF+N ++GS G + +SF EK +
Sbjct: 418 SDYFWYMTKVDTNGTSSLQNVTLQVNTKGHVLHAFVNKRYIGSKWGSNG-QSFVFEKPIL 476
Query: 516 LINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN---VSIQGAKELKDFSSFSWGYQVGLL 572
L +G N ++LLS VGL + A+ + G+ I D SS W Y+VGL
Sbjct: 477 LKSGINTITLLSATVGLKNYDAFYDMVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLN 536
Query: 573 GEKLQIFTDYGSRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVN 631
GE QI+ S+ W S + +TWYKT F P G DPV +++ MGKG+AWVN
Sbjct: 537 GEMKQIYNPVFSQRTNWIPLNQKSIGRRMTWYKTSFKTPAGIDPVVLDMQGMGKGQAWVN 596
Query: 632 GQSIGRYWVSFL 643
GQSIGR+W SF+
Sbjct: 597 GQSIGRFWPSFI 608
>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
[Cucumis sativus]
Length = 635
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 279/638 (43%), Positives = 366/638 (57%), Gaps = 57/638 (8%)
Query: 216 VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSA 275
VPWVMCKQDDAPDP+IN CNG C + PN P KP WTE WT+++ +G R
Sbjct: 3 VPWVMCKQDDAPDPMINTCNGFYC--DYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPV 60
Query: 276 EDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKW 334
ED+A+ VA FI K GS VNYYMYHGGTNFGRTA +T YD AP+DEYGL+RQPK+
Sbjct: 61 EDLAFGVARFIQK-GGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKF 119
Query: 335 GHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYF 393
GHLK LH AVKLC K +L+G + Q+A +F SS +CAAFL N N A V F
Sbjct: 120 GHLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTF 179
Query: 394 SNLMYELPPLSISILPDCKTVAFNTAKLD-----------SVE--QWEEYKEAIPTYDE- 439
+ Y LPP SISILPDCK+V +NTA++ VE WE Y E I + +E
Sbjct: 180 NGRHYTLPPWSISILPDCKSVIYNTAQVQVQTNQLSFLPTKVESFSWETYNENISSIEED 239
Query: 440 TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE------SVLKVSSLGHVLHAFING 493
+S+ + LLEQ+ TKD SDYLWY DP++S L +S GH +H FING
Sbjct: 240 SSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGHGMHVFING 299
Query: 494 EFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQ 552
+ GS+ G H + FT ++L G N VSLLS+ GLP++G + E R G L V+I
Sbjct: 300 KLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMGVLGPVAIH 359
Query: 553 GAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--YGSSTHQPLTWYKTVFDA 609
G K D S W Y+VGL GE + + + + V W++ QPLTWYK FDA
Sbjct: 360 GLDXGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQENAQPLTWYKAYFDA 419
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWV-------------SFLTPQ------GTPS 650
P G +P+A+++ SM KG+ W+NGQ++GRYW P+ G P+
Sbjct: 420 PEGDEPLALDMGSMQKGQVWINGQNVGRYWTITANGNCTDCSYSGTYRPRKCQFGCGQPT 479
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVIS--WRS 708
Q WYH+PRS+L PT NL+V+ EE G P IS+ SVT++C S PVI
Sbjct: 480 QQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICTEASQYR--PVIKNVHMH 537
Query: 709 QNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAI 768
QN L + K+ + C +G+ IS I FAS+G P+G C ++ G+CHS S +
Sbjct: 538 QNNGELNEQNVL-----KINLHCAAGQFISAIKFASFGTPSGACGSHKQGTCHSPKSDYV 592
Query: 769 VEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
++K C+G++ C + T F DPCP + K L + C
Sbjct: 593 LQKLCVGRQRCLATIPTSIFGEDPCPNLRKKLSAEVVC 630
>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
Length = 677
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 285/687 (41%), Positives = 392/687 (57%), Gaps = 75/687 (10%)
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCG 240
++IENEYG ++ ++ G Y+RWAA +AV L TGVPWVMC+Q DAPDP+IN CNG C
Sbjct: 6 AKIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCD 65
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ PNS KP +WTENW+ ++ +G R ED+A+ VA F + G++ NYYMYH
Sbjct: 66 QFT--PNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQR-GGTFQNYYMYH 122
Query: 301 GGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSM 359
GGTN R++ ++ T Y AP+DEYGL+RQPKWGHL+++H A+KLC +++
Sbjct: 123 GGTNLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYT 182
Query: 360 NFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
+ EA +++ S CAAFL N D +++ TV F+ MY LP S+SILPDCK V NTA
Sbjct: 183 SLGPNVEAAVYKVGSVCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTA 242
Query: 420 KLDS---------------------------VEQWEEYKEAIPTYDETSLRANFLLEQMN 452
+++S V W E + + +L L+EQ+N
Sbjct: 243 QINSQTTGSEMRYLESSNVASDGSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQIN 302
Query: 453 TTKDASDYLWY--NFRFKHDP---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKS 507
TT DASD+LWY + K D + S+S L V+SLGHVL +ING+ GSA G S
Sbjct: 303 TTADASDFLWYSTSITVKGDEPYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSL 362
Query: 508 FTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWG 566
+ +K + L+ G N + LLS VGL + GA+ + AG+ V + G D SS W
Sbjct: 363 ISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWT 422
Query: 567 YQVGLLGEKLQIFTDYGSRIVPW-SRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGK 625
YQ+GL GE L ++ D W S + PL WYKT F P G DPVAI+ MGK
Sbjct: 423 YQIGLRGEDLHLY-DPSEASPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGK 481
Query: 626 GEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKP 663
GEAWVNGQSIGRYW + L PQ G PSQ+ YH+PRSFL+P
Sbjct: 482 GEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQP 541
Query: 664 TGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGR 723
N LVL E G P IS ++C VS++H + SW SQ P +
Sbjct: 542 GSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQ----------PMQ 591
Query: 724 R--PKVQIRCP-SGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
R P +++ CP G+ IS + FAS+G P+G C +Y+ G C S+ + +IV++AC+G SC+
Sbjct: 592 RYGPALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCS 651
Query: 781 VPVWTEKFYGDPCPGIPKALLVDAQCT 807
VPV + ++G+PC G+ K+L V+A C+
Sbjct: 652 VPV-SSNYFGNPCTGVTKSLAVEAACS 677
>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
Length = 625
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 274/636 (43%), Positives = 363/636 (57%), Gaps = 58/636 (9%)
Query: 219 VMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
V+CKQDDAPDP+INACNG C + PN KP +WTE WT ++ +G R AED+
Sbjct: 1 VLCKQDDAPDPIINACNGFYC--DYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDM 58
Query: 279 AYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHL 337
A+ VA FI K GS++NYYMYHGGTNFGRTA ++ T Y APLDEYGL RQPKWGHL
Sbjct: 59 AFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHL 117
Query: 338 KELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNL 396
K+LH A+KLC ++SG M QEA +++ S C+AFL N + ++ A V F N
Sbjct: 118 KDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNN 177
Query: 397 MYELPPLSISILPDCKTVAFNTAKLDSVE--------------QWEEYKEAIPTYDETSL 442
Y LPP SISILPDCK +NTA++ + W+ Y E TY + S
Sbjct: 178 HYNLPPWSISILPDCKNTVYNTARVGAQTSRMKMVRVPVHGGLSWQAYNEDPSTYIDESF 237
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHAFINGEFV 496
L+EQ+NTT+D SDYLWY K D ++ L V S GH +H FING+
Sbjct: 238 TMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFINGQLS 297
Query: 497 GSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAK 555
GSA+G T K V+L G N +++LS+ VGLP+ G + E AG L VS+ G
Sbjct: 298 GSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSLNGLN 357
Query: 556 -ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGS 613
+D S W Y+VGL GE L + + GS V W+ + QPLTWYKT F AP G
Sbjct: 358 GGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGD 417
Query: 614 DPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQGTPSQSW 653
P+A+++ SMGKG+ W+NGQS+GR+W ++ L G SQ W
Sbjct: 418 SPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGSCSECSYTGTFREDKCLRNCGEASQRW 477
Query: 654 YHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ--NQ 711
YH+PRS+LKP+GNLLV+ EE G P GI++ V ++C + + W+S N
Sbjct: 478 YHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYE--------WQSTLVNY 529
Query: 712 RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEK 771
+ + K PK ++C G+KI+ + FAS+G P G C +Y GSCH+ +S K
Sbjct: 530 QLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNK 589
Query: 772 ACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C+G+ C+V V E F GDPCP + K L V+A C
Sbjct: 590 LCVGQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVCA 625
>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
Length = 740
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 276/677 (40%), Positives = 370/677 (54%), Gaps = 101/677 (14%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NVTYD R+++I G R++L S +HYPR+TP+MWP LIAK KEGG DV++T VFWN HEP
Sbjct: 63 NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122
Query: 89 PGQFDFSGRRDLVRFIK-----------------------------------EVQAQGLY 113
GQ+ F R DLV+F K E Y
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLMWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYY 182
Query: 114 VCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYAS 173
R P + G P WL D+PGI FR+DNEPFK M+ + T IV +MK +LY+
Sbjct: 183 FEERFDPVKFEKHVIFGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSW 242
Query: 174 QGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINA 233
QGGPIIL QIENEYG ++ ++ + G Y++WAA++A+ L TG+PWVMC+Q DAP+ +I+
Sbjct: 243 QGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDT 302
Query: 234 CNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSY 293
CN C + F PNS +KP IWTE+W +Y +G R AED A+ VA F + GS
Sbjct: 303 CNAFYC-DGFK-PNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQR-GGSL 359
Query: 294 VNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPML 352
NYYMY GGTNF RTA + YD AP+DEYG+LRQPKWGHLK+LH+A+KLC +P L
Sbjct: 360 QNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLC-EPAL 418
Query: 353 SGVLVSMNFSKL---QEAFIFQ-----------GSSE-CAAFLVNKDKRNNATVYFSNLM 397
V S + KL QEA ++ G+++ C+AFL N D+ A+V+
Sbjct: 419 IAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKS 478
Query: 398 YELPPLSISILPDCKTVAFNTAKLD------SVEQ------------------------- 426
Y LPP S+SILPDC+ VAFNTA++ +VE
Sbjct: 479 YSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSS 538
Query: 427 -WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-----SESV--- 477
W KE I T+ + +LE +N TKD SDYLWY R +D S+ V
Sbjct: 539 TWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPS 598
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L + + V F+NG+ GS G +L++ + L+ G N ++LLS +VGL + GA
Sbjct: 599 LTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGA 654
Query: 538 YLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS 595
+LE+ AG R V++ G + D ++ W YQVGL GE I+ WSR
Sbjct: 655 FLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKD 714
Query: 596 THQPLTWYKTVFDAPTG 612
+ QP TWYK + + G
Sbjct: 715 SVQPFTWYKNICNQSVG 731
>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 578
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 248/576 (43%), Positives = 341/576 (59%), Gaps = 56/576 (9%)
Query: 278 IAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGH 336
+A+ VA FI K GS+VNYYMYHGGTNFGRTA +T YD AP+DEYGL+RQPK+GH
Sbjct: 1 LAFGVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGH 59
Query: 337 LKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSN 395
LKELH A+K+C K ++S V + Q+A ++ S +C+AFL N D + A V F+N
Sbjct: 60 LKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNN 119
Query: 396 LMYELPPLSISILPDCKTVAFNTAKL----DSVE---------QWEEYKEAIPTYDETS- 441
+ Y LPP SISILPDC+ FNTAK+ +E QWE Y E + + D++S
Sbjct: 120 VHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKNFQWESYLEDLSSLDDSST 179
Query: 442 LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHVLHAFING 493
+ LLEQ+N T+D SDYLWY D DSES L + S GH +H F+NG
Sbjct: 180 FTTHGLLEQINVTRDTSDYLWYMTSV--DIGDSESFLHGGELPTLIIQSTGHAVHIFVNG 237
Query: 494 EFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQ 552
+ GSA G ++ FT + ++L +GTN ++LLSV VGLP+ G + E G L V++
Sbjct: 238 QLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGILGPVALH 297
Query: 553 GAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH--QPLTWYKTVFDA 609
G + K D S W YQVGL GE + + + + W + QPLTW+KT FDA
Sbjct: 298 GLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDA 357
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------GTPS 650
P G++P+A+++ MGKG+ WVNG+SIGRYW +F T G P+
Sbjct: 358 PEGNEPLALDMEGMGKGQIWVNGESIGRYWTAFATGDCSHCSYTGTYKPNKCQTGCGQPT 417
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQN 710
Q WYH+PR++LKP+ NLLV+ EE G P +S+ SV+ +C VS+ H P + +W+ ++
Sbjct: 418 QRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIES 476
Query: 711 QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVE 770
+T RPKV ++C G+ I+ I FAS+G P G C +Y G CH++ S AI+E
Sbjct: 477 YGKGQTF-----HRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILE 531
Query: 771 KACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
+ C+GK C V + F DPCP + K L V+A C
Sbjct: 532 RKCVGKARCAVTISNSNFGKDPCPNVLKRLTVEAVC 567
>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
Length = 580
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 243/578 (42%), Positives = 329/578 (56%), Gaps = 31/578 (5%)
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV 313
+WTENWT ++ YGD+ +RSAEDIAY V F AK GS VNYYMYHGGTNFGRT ++YV
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAK-GGSLVNYYMYHGGTNFGRTGASYV 60
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS 373
LTGYYD+AP+DEYG+ ++PK+GHL++LH+ ++ K L G S EA IF+
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 374 SE--CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------- 421
E C +FL N + + TV F + +P S+SIL CK V +NT ++
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSFH 180
Query: 422 -----DSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP--- 471
QWE + E IP Y +T +R LEQ N TKD +DYLWY +FR + D
Sbjct: 181 TSDVTSKNNQWEMFSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRLESDDLPF 240
Query: 472 -SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMV 530
+D VL+V S H + F N FVG A G K F EK V L G N+V LLS +
Sbjct: 241 RNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHVVLLSSTM 300
Query: 531 GLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW 589
G+ DSG L G++ IQG D WG++ L GE +I+++ G V W
Sbjct: 301 GMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYSEKGLGKVQW 360
Query: 590 SRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTP 649
+ + TWYK FD P G DPV +++ SM KG +VNG+ +GRYWVS+ T GTP
Sbjct: 361 KP--AENDRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWVSYRTLAGTP 418
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ 709
SQ+ YHIPR FLK NLLV+ EEE G P GI + TV+ +C +S+ + + +W +
Sbjct: 419 SQAVYHIPRPFLKSKDNLLVIFEEEMGKPDGILVQTVTRDDICLFISEHNPGQIKTWDTD 478
Query: 710 NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIV 769
+ +K RR + CP + I +++FAS+GNP+G C N+ +G+CH+ N++ IV
Sbjct: 479 GDK-IKLIAEDHSRRG--TLTCPPEKTIQEVVFASFGNPDGMCGNFTVGTCHTPNAKQIV 535
Query: 770 EKACLGKRSCTVPVWTEKFYGD-PCPGIPKALLVDAQC 806
EK CLGK SC +PV + D C L V +C
Sbjct: 536 EKECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 573
>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
Length = 592
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 253/601 (42%), Positives = 339/601 (56%), Gaps = 56/601 (9%)
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-Y 312
+WTE WT ++ +G R AED+A+ VA FI K GS++NYYMYHGGTNFGRTA +
Sbjct: 1 MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQK-GGSFINYYMYHGGTNFGRTAGGPF 59
Query: 313 VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG 372
+ T Y APLDEYGL RQPKWGHLK+LH A+KLC ++SG M QEA +++
Sbjct: 60 IATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKS 119
Query: 373 SS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------ 425
S C+AFL N + ++ A V F N Y LPP SISILPDCK +NTA++ +
Sbjct: 120 KSGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQTSRMKMV 179
Query: 426 --------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD---- 473
W+ Y E TY + S L+EQ+NTT+D SDYLWY K D ++
Sbjct: 180 RVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLR 239
Query: 474 --SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVG 531
L V S GH +H FING+ GSA+G T K V+L G N +++LS+ VG
Sbjct: 240 NGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVG 299
Query: 532 LPDSGAYLERRVAG-LRNVSIQGAK-ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW 589
LP+ G + E AG L VS+ G +D S W Y+VGL GE L + + GS V W
Sbjct: 300 LPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEW 359
Query: 590 SRYG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------ 642
+ + QPLTWYKT F AP G P+A+++ SMGKG+ W+NGQS+GR+W ++
Sbjct: 360 AEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKAVGSC 419
Query: 643 --------------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
L G SQ WYH+PRS+LKP+GNLLV+ EE G P GI++ V
Sbjct: 420 SECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREV 479
Query: 689 TTLCGHVSDSHLPPVISWRSQ--NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYG 746
++C + + W+S N + + K PK ++C G+KI+ + FAS+G
Sbjct: 480 DSVCADIYE--------WQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFG 531
Query: 747 NPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
P G C +Y GSCH+ +S K C+G+ C+V V E F GDPCP + K L V+A C
Sbjct: 532 TPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVC 591
Query: 807 T 807
Sbjct: 592 A 592
>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
gi|224029591|gb|ACN33871.1| unknown [Zea mays]
Length = 580
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 243/578 (42%), Positives = 328/578 (56%), Gaps = 31/578 (5%)
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV 313
+WTENWT ++ YGD+ +RSAEDIAY V F AK GS VNYYMYHGGTNFGRT ++YV
Sbjct: 2 LWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAK-GGSLVNYYMYHGGTNFGRTGASYV 60
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS 373
LTGYYD+AP+DEYG+ ++PK+GHL++LH+ ++ K L G S EA IF+
Sbjct: 61 LTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFELP 120
Query: 374 SE--CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL---------- 421
E C +FL N + + TV F + +P S+SIL CK V +NT ++
Sbjct: 121 EEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSFH 180
Query: 422 -----DSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP--- 471
QWE E IP Y +T +R LEQ N TKD +DYLWY +FR + D
Sbjct: 181 TSDVTSKNNQWEMSSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRLESDDLPF 240
Query: 472 -SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMV 530
+D VL+V S H + F N FVG A G K F EK V L G N+V LLS +
Sbjct: 241 RNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHVVLLSSTM 300
Query: 531 GLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW 589
G+ DSG L G++ IQG D WG++ L GE +I+++ G V W
Sbjct: 301 GMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYSEKGLGKVQW 360
Query: 590 SRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTP 649
+ + TWYK FD P G DPV +++ SM KG +VNG+ +GRYWVS+ T GTP
Sbjct: 361 KP--AENDRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWVSYRTLAGTP 418
Query: 650 SQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ 709
SQ+ YHIPR FLK NLLV+ EEE G P GI + TV+ +C +S+ + + +W +
Sbjct: 419 SQAVYHIPRPFLKSKDNLLVIFEEEMGKPDGILVQTVTRDDICLFISEHNPGQIKTWDTD 478
Query: 710 NQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIV 769
+ +K RR + CP + I +++FAS+GNP+G C N+ +G+CH+ N++ IV
Sbjct: 479 GDK-IKLIAEDHSRRG--TLTCPPEKTIQEVVFASFGNPDGMCGNFTVGTCHTPNAKQIV 535
Query: 770 EKACLGKRSCTVPVWTEKFYGD-PCPGIPKALLVDAQC 806
EK CLGK SC +PV + D C L V +C
Sbjct: 536 EKECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 573
>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 493
Score = 434 bits (1116), Expect = e-118, Method: Compositional matrix adjust.
Identities = 221/459 (48%), Positives = 297/459 (64%), Gaps = 25/459 (5%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+NV+YD ++IING R+I+FSGSIHYPRST MWP LI KAK+GGLD ++T +FW+ HE
Sbjct: 19 GDNVSYDSNAIIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHE 78
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ ++DFSGR D ++F + +Q GLYV +RIGP++ EW YGG P WLH++PGI R++
Sbjct: 79 PQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWLHNMPGIQLRTN 138
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV-EHSFLEKGPPYVRWA 205
N+ +K M+ + T IVNM K A L+ASQGGPIIL+QIENEYG V ++ + G Y+ W
Sbjct: 139 NQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAYGDAGKAYINWC 198
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
A++A L GVPW+MC+Q DAP P+IN CNG C + F PN+P P ++TENW +++
Sbjct: 199 AQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYC-DNFT-PNNPKSPKMFTENWVGWFKK 256
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLD 324
+GD+ R+AED+A+ VA F + G + NYYMYHGGTNFGRT+ +T YD APLD
Sbjct: 257 WGDKDPYRTAEDVAFSVARFF-QSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLD 315
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF--SKLQEAFIFQGSSECAAFLVN 382
EYG L QPKWGHLK+LH+++KL K + +G + NF S F + E FL N
Sbjct: 316 EYGNLNQPKWGHLKQLHASIKLGEKILTNGTHTNQNFGSSVTLTKFFNPTTGERFCFLSN 375
Query: 383 KDKRNNATVYF-SNLMYELPPLSISILPDCKTVAFNTAKLDS-----VEQWEEYKEAI-- 434
D +N+AT+ ++ Y +P S+SIL C +NTAK++S V++ E + A
Sbjct: 376 TDGKNDATIDLQADGKYFVPAWSVSILDGCNKEVYNTAKVNSQTSMFVKEQNEKENAQLS 435
Query: 435 ------PTYDETS----LRANFLLEQMNTTKDASDYLWY 463
P D AN LEQ T D SDY WY
Sbjct: 436 WAWAPEPMKDTLQGNGKFAANLFLEQKRVTADFSDYFWY 474
>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
Length = 706
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 248/662 (37%), Positives = 354/662 (53%), Gaps = 89/662 (13%)
Query: 183 IENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET 242
IENE+G VE S+ ++G YV+W A+LA PW+MC+Q DAP P+IN CNG C +
Sbjct: 1 IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYCDQ- 59
Query: 243 FAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
PN+ + P +WTE+W +++ +G+ R+AED+A+ VA F + GS NYYMYHGG
Sbjct: 60 -FKPNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFF-QYGGSLHNYYMYHGG 117
Query: 303 TNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVL--VSM 359
TNFGR+A Y+ T Y APLDEYG + QPKWGHLK+LH ++ K + G + +
Sbjct: 118 TNFGRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDT 177
Query: 360 NFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
S ++ ++G S C F N + ++ + F Y +P S+++LPDCKT +NTA
Sbjct: 178 GHSTTATSYTYKGKSSC--FFGNPE-NSDREITFQERKYTVPGWSVTVLPDCKTEVYNTA 234
Query: 420 KLDSVE-------------------QWEEYKEAIPTYDE----TSLRANFLLEQMNTTKD 456
K+++ QW K T++ +++ AN L++Q T D
Sbjct: 235 KVNTQTTIREMVPSLVGKHKKPLKWQWRNEKIEHLTHEGDISGSAITANSLIDQKMVTND 294
Query: 457 ASDYLWYNFRFKHDPSD----SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEK 512
+SDYLWY F + +D L+V + GH+LHAF+N + +G+ G + SFTLEK
Sbjct: 295 SSDYLWYLTGFHLNGNDPLFGKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEK 354
Query: 513 MV-HLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVS--IQGAKELKDFSSFSWGYQV 569
V +L +G N ++LLS VGLP+ GAY E G+ I K ++D S+ W Y+V
Sbjct: 355 KVRNLRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVELIADGKTIRDLSTNEWIYKV 414
Query: 570 GLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAW 629
GL GEK + F PW +Q TWYKT F P G + V ++L+ MGKG+AW
Sbjct: 415 GLDGEKYEFFDPDHKFRKPWLSNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAW 474
Query: 630 VNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSFLKP-TGN 666
VNG+SIGRYW S+L + G P+Q WYHIPRS++ N
Sbjct: 475 VNGKSIGRYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKEN 534
Query: 667 LLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPK 726
L+L EE G P I I T V +C V K
Sbjct: 535 TLILFEEFGGMPLNIEIKTTRVKKVCAKVDLG--------------------------SK 568
Query: 727 VQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTE 786
+++ C R + +I+F +GNP GNC N+ GSCHSS + +++EK CL KR C++ V +
Sbjct: 569 LELTCHD-RTVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEKECLWKRKCSIEVTKD 627
Query: 787 KF 788
K
Sbjct: 628 KL 629
>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
Length = 568
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 245/585 (41%), Positives = 331/585 (56%), Gaps = 71/585 (12%)
Query: 273 RSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQ 331
R AEDIA+ VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DEYGLLR+
Sbjct: 3 RPAEDIAFAVARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 61
Query: 332 PKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNAT 390
PKWGHL++LH A+KLC ++SG + Q++ +F+ + CAAFL N D + A
Sbjct: 62 PKWGHLRDLHRAIKLCEPALVSGDPTVTSIGHYQQSHVFRSKAGACAAFLSNYDSGSYAR 121
Query: 391 VYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPTYDE 439
V F+ + Y++PP SISILPDCKT FNTA++ + WE Y E ++D+
Sbjct: 122 VVFNGIHYDIPPWSISILPDCKTTVFNTARIGAQTSQLKMEWAGKFSWESYNEDTNSFDD 181
Query: 440 TSLRANFLLEQMNTTKDASDYLWYN-----------FRFKHDPSDSESVLKVSSLGHVLH 488
S L+EQ++ T+D +DYLWY + H P VL V+S GH +H
Sbjct: 182 RSFTKVGLVEQISMTRDNTDYLWYTTYVNIGENEGFLKNGHYP-----VLTVNSAGHSMH 236
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LR 547
+ING+ G+ +G + T V L G+N +S+LSV VGLP+ G + E G L
Sbjct: 237 IYINGQLTGTIYGALENPKLTYTGSVKLWAGSNKISILSVAVGLPNIGGHFETWNTGVLG 296
Query: 548 NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V++ G E K D S W YQ+GL GE L + T GS V W G S Q LTWYKT
Sbjct: 297 PVTLSGLNEGKRDLSWQKWIYQIGLKGEALNLHTLSGSSSVEWG--GPSQKQSLTWYKTS 354
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------- 646
F+AP G+DP+A+++ SMGKG+ W+NGQS+GRYW ++
Sbjct: 355 FNAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPAYKASGSCGGCDYRGTYNEKKCQSNC 414
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW 706
G +Q WYH+PRS+L PTGNLLV+ EE G P GIS+ V ++C +++ W
Sbjct: 415 GESTQRWYHVPRSWLNPTGNLLVVFEEWGGDPSGISMVRRKVESVCAEIAE--------W 466
Query: 707 RSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSR 766
+ N + T R K + C G+K++ I FAS+G P G C ++ G+CH+ S
Sbjct: 467 QP-NMDNVHTGNY---GRSKAHLSCAPGQKMTNIKFASFGTPQGTCGAFSEGTCHAHKSY 522
Query: 767 AIVEKA-----CLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
EK C+G++SC V V E F GDPCPG K L V+A C
Sbjct: 523 DAFEKESLLQNCIGQQSCAVLVAPEVFGGDPCPGTMKKLAVEAIC 567
>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
Length = 446
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 201/407 (49%), Positives = 280/407 (68%), Gaps = 4/407 (0%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G V+YD RSL+I+G R + FSG+IHYPRS P+MW +L+ AK GGL+ ++T VFWN HE
Sbjct: 33 GTVVSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHE 92
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG++ F GR DL+RF+ ++ +Y +RIGPFI+ EW +GGLP+WL ++ I+FR++
Sbjct: 93 PEPGKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRAN 152
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK M+++ IV +K A ++A QGGPIILSQIENEYG ++ +G Y+ WAA
Sbjct: 153 NEPFKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAA 212
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++A+ GVPWVMCKQ AP VI CNGR CG+T+ + +KP +WTENWT+ ++ +
Sbjct: 213 EMAISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDK-NKPRLWTENWTAQFRTF 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
GD+ RSAEDIAY V F AK G+ VNYYMYHGGTNFGRT ++YVLTGYYD+AP+DEY
Sbjct: 272 GDQLAQRSAEDIAYAVLRFFAK-GGTLVNYYMYHGGTNFGRTGASYVLTGYYDEAPMDEY 330
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKD 384
G+ ++PK+GHL++LH+ +K K L G EA ++ + C +FL N +
Sbjct: 331 GMCKEPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNN 390
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYK 431
+ TV F + +P S+SIL DCKTV +NT ++ + ++ E K
Sbjct: 391 TGEDGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVCVLHKFTENK 437
>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
vinifera]
Length = 563
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 225/542 (41%), Positives = 313/542 (57%), Gaps = 38/542 (7%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MW L+ AKEGG+DV++T VF N HE P + F G DL++F+K VQ G+Y+ L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
PF+ EW +GG+P WLH VP +F+++++PFK+HM+++ T+IVN+MK +L+ASQGGPII
Sbjct: 61 PFVATEWNFGGVPIWLHYVPRTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 120
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQC 239
L+Q+ENEYG + + + G PYV WAA + + GVPW+MC+ + DP+IN CN C
Sbjct: 121 LTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFYC 180
Query: 240 GETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMY 299
+ PNSP K +WTENW +++ +G R EDIA+ VALF NYYMY
Sbjct: 181 DQ--FTPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFFFPKS---XNYYMY 235
Query: 300 HGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVS 358
HGGTNFG T+ ++ T Y AP+DEYGL R PK GHLKEL A+K C +L G ++
Sbjct: 236 HGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPIN 295
Query: 359 MNFSKLQEAFIFQGS-SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFN 417
+ QE ++ S AAF+ N D++ + + F N Y +P S+SILPDCK V FN
Sbjct: 296 LXLGPSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVVFN 355
Query: 418 TAKLDS----VEQ--------------------WEEYKEAIPTYDETSLRANFLLEQMNT 453
TAK+ S VE W+ + E + E N ++ +NT
Sbjct: 356 TAKVVSQISQVEMVLEDLQPSLVPSNKDLKGLXWKTFVEKAGIWGEADFVKNGFVDHINT 415
Query: 454 TKDASDYLWYNFRFKHDPSD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKS 507
TKD +D LWY S+ S+ +L V S GH LHAF+N + GSA G S
Sbjct: 416 TKDTTDXLWYTVSITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNGSHSP 475
Query: 508 FTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWG 566
F E + L G N + +LS+ VGL + + E A L +V I+G + D S++ W
Sbjct: 476 FKFECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLTSVKIKGLNNGIMDLSTYPWI 535
Query: 567 YQ 568
Y+
Sbjct: 536 YK 537
>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
Length = 420
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 211/410 (51%), Positives = 268/410 (65%), Gaps = 20/410 (4%)
Query: 298 MYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLV 357
MYHGGTNFGRT+S+Y +TGYYDQAPLDEYGLLRQPK+GHLKELH+A+K P+L G
Sbjct: 1 MYHGGTNFGRTSSSYFITGYYDQAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQT 60
Query: 358 SMNFSKLQEAFIFQGSSE-CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAF 416
++ +Q+A++F+ ++ C AFLVN D + + + F N Y L P SI IL +CK + +
Sbjct: 61 ILSLGPMQQAYVFEDANNGCVAFLVNNDAKA-SQIQFRNNAYSLSPKSIGILQNCKNLIY 119
Query: 417 NTAKLDSV---------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYL 461
TAK++ + W ++E IP + TSL+ N LLE N TKD +DYL
Sbjct: 120 ETAKVNVKMNTRVTTPVQVFNVPDNWNLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYL 179
Query: 462 WYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTN 521
WY FK D + + S GHV+H F+N GS HG + L+ V LING N
Sbjct: 180 WYTSSFKLDSPCTNPSIYTESSGHVVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQN 239
Query: 522 NVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ-GAKELKDFSSFSWGYQVGLLGEKLQIFT 580
N+S+LS MVGLPDSGAY+ERR GL V I G + D S WGY VGLLGEK++++
Sbjct: 240 NISILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQ 299
Query: 581 DYGSRIVPWS--RYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY 638
V WS + G ++PL WYKT FD P G PV +++ SMGKGE WVNG+SIGRY
Sbjct: 300 WKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRY 359
Query: 639 WVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
WVSFLTP G PSQS YHIPR+FLKP+GNLLV+ EEE G P GIS++T+SV
Sbjct: 360 WVSFLTPAGQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNTISV 409
>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
Length = 377
Score = 414 bits (1064), Expect = e-112, Method: Compositional matrix adjust.
Identities = 183/292 (62%), Positives = 236/292 (80%), Gaps = 1/292 (0%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDG SLII+G R++L+SGSIHYPRSTP+MWP +I +AK+GGL+ +QT VFWN+HEPQ
Sbjct: 41 VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 100
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSGR DLV+FIK +Q G+YV LR+GPFI+ EW +GGLP+WL +VPGI FR+DN+
Sbjct: 101 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKQ 160
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK H +RY MI++ MK RL+ASQGGPIIL QIENEY V+ ++ + G Y++WA+ L
Sbjct: 161 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASNLV 220
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
++ G+PWVMCKQ+DAPDP+INACNGR CG+TF GPN +KP++WTENWT+ ++V+GD
Sbjct: 221 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 280
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQA 321
RS EDIAY VA F +K G++VNYYMYHGGTNFGRT++ YV T YY+ A
Sbjct: 281 PTQRSVEDIAYSVARFFSK-NGTHVNYYMYHGGTNFGRTSAHYVTTRYYEDA 331
>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
Length = 532
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 231/535 (43%), Positives = 305/535 (57%), Gaps = 54/535 (10%)
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+AV GVPW+MC+Q DAP VI+ CNG C + PN+PDKP IWTENW +++ +G
Sbjct: 1 MAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQ--FTPNTPDKPKIWTENWPGWFKTFG 58
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R AED+AY VA F K GS NYYMYHGGTNFGRT+ +T YD +AP+DEY
Sbjct: 59 GRDPHRPAEDVAYSVARFFGK-GGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEY 117
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDK 385
GL R PKWGHLK+LH A+ L ++SG + EA ++ SS CAAFL N D
Sbjct: 118 GLPRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDD 177
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS----VE------------QWEE 429
+N+ V F N Y LP S+SILPDCKT FNTAK+ S VE +WE
Sbjct: 178 KNDKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKSSKVEMLPEDLKSSSGLKWEV 237
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSSL 483
+ E + N L++ +NTTKD +DYLWY ++ S VL + S
Sbjct: 238 FSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFIESK 297
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
GH LH FIN E++G+A G + F L+K V L G NN+ LLS+ VGL ++G++ E
Sbjct: 298 GHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYEWVG 357
Query: 544 AGLRNVSIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWS-RYGSSTHQPLT 601
AGL +VSI+G K + ++ W Y++G+ GE L++F S V W+ QPLT
Sbjct: 358 AGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQPLT 417
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------------- 642
WYK V + P+GS+PV +++ISMGKG AW+NG+ IGRYW
Sbjct: 418 WYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDYRGK 477
Query: 643 ------LTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTL 691
LT G PSQ WYH+PRS+ K +GN LV+ EE+ G P I + V+ +
Sbjct: 478 FMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKLSKRKVSVV 532
>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
Length = 621
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 242/651 (37%), Positives = 343/651 (52%), Gaps = 82/651 (12%)
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A L GVPW+MC+Q +AP P++ CNG C + P +P P +WTENWT +++ +G
Sbjct: 1 MANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQ--YEPTNPSTPKMWTENWTGWFKNWG 58
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEY 326
+ R+AED+A+ VA F + G++ NYYMYHGGTNFGR A Y+ T Y APLDE+
Sbjct: 59 GKHPYRTAEDLAFSVARFF-QTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEF 117
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
G L QPKWGHLK+LH+ +K K + G + ++ +A I+ + F+ N +
Sbjct: 118 GNLNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIYTTKEGSSCFIGNVNAT 177
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLR--- 443
+A V F Y +P S+S+LPDC A+NTAK+++ + P E + R
Sbjct: 178 ADALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPES 237
Query: 444 -------------ANFLLEQMNTTKDASDYLWYNFRFKHDPSD----SESVLKVSSLGHV 486
A L++Q + T DASDYLWY R D D L+V S HV
Sbjct: 238 AQKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHSNAHV 297
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMV-HLINGTNNVSLLSVMVGLPDSGAYLERRVAG 545
LHA++NG++VG+ K + E+ V HL++GTN++SLLSV VGL + G + E G
Sbjct: 298 LHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTG 357
Query: 546 LRN-VSIQGAKEL----KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
+ VS+ G K KD S W Y++GL G ++F+ W+ T + L
Sbjct: 358 INGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWANEKLPTGRML 417
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------- 646
TWYK F AP G +PV ++L +GKGEAW+NGQSIGRYW SF +
Sbjct: 418 TWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDKCDYRGAYG 477
Query: 647 --------GTPSQSWYHIPRSFLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSD 697
G P+Q WYH+PRSFL +G N + L EE G P ++ TV V T+C +
Sbjct: 478 SDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCARAHE 537
Query: 698 SHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAI 757
+ KV++ C + R IS + FAS+GNP G+C ++A+
Sbjct: 538 HN--------------------------KVELSCHN-RPISAVKFASFGNPLGHCGSFAV 570
Query: 758 GSCHSSNSRA-IVEKACLGKRSCTVPVWTEKFYGD-PCPGIPKALLVDAQC 806
G+C A V K C+GK +CTV V ++ F C PK L V+ +C
Sbjct: 571 GTCQGDKDAAKTVAKECVGKLNCTVNVSSDTFGSTLDCGDSPKKLAVELEC 621
>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 727
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 243/704 (34%), Positives = 374/704 (53%), Gaps = 79/704 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD RSLIING RK+L S SIHYPR+TP MW ++ K G+D+++T FWNLHEP
Sbjct: 42 NVSYDHRSLIINGERKLLLSASIHYPRATPSMWRPVLEATKAAGIDLIETYTFWNLHEPT 101
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F G ++ F+ GLYV +R GP++ EW YGG PFWL ++ GIVFR N+
Sbjct: 102 PGTYNFEGNANVTAFLDICAELGLYVTVRFGPYVCAEWNYGGFPFWLKEIDGIVFRDYNQ 161
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF M + T IVN ++ YAS GGPIIL+Q+ENEYG +E ++ G Y WAA+
Sbjct: 162 PFMDQMSNWMTYIVNYLRP--YYASNGGPIILAQVENEYGWLEAAYGASGTKYALWAAQF 219
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGE--TFAGPNSPDKPAIWTENWTSFYQVY 266
A L G+PW+MC QDD VIN CNG C + P++PA WTENW ++Q +
Sbjct: 220 ANSLDIGIPWIMCSQDDIAT-VINTCNGFYCHDWIDVHWTAYPNQPAFWTENWPGWFQNW 278
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-TASAYVLTGYYDQAPLDE 325
R +D+ Y VA +IA GS +NYYM+ GGT FGR T ++ T Y +DE
Sbjct: 279 EGGVPHRPVQDVLYSVARWIA-YGGSMMNYYMWFGGTTFGRWTGGPFITTSYDYDGAIDE 337
Query: 326 YGLLRQPKWGHLKELHSAVK------LCL---KPMLSGVLVSMNFSKLQEAFIFQGSSEC 376
YG +PK+ E H+ + L + KP+L G V ++ F + E
Sbjct: 338 YGYPYEPKYSQSLEFHTIIHAYEHIILSMNPPKPILLGENVEIS------HFYSVETGES 391
Query: 377 AAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA----------------K 420
+FL N TV ++ + +++ P S+ +L + ++ F+T+
Sbjct: 392 FSFLANFGATGVQTVQWNGITFKVQPWSVQLLYNNVSI-FDTSATPIGSPVPKQFTPIKS 450
Query: 421 LDSVEQW-EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
+++ QW E + Y ET +EQ++ T+D +DYLWY + + + ++ L
Sbjct: 451 FENIGQWSESFDLTFTNYSETP------MEQLSLTRDQTDYLWYVTKIEVNRVGAQ--LS 502
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYL 539
+ ++ ++H F++ +++ + G + TL + + G + + +L VGL + ++
Sbjct: 503 LPNISDMVHVFVDNQYIATGRGP---TNITLNSTIGV--GGHTLQVLHTKVGLVNYAEHM 557
Query: 540 ERRVAGL-RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
E VAG+ V++ D SS W + + GE LQ++ S V W+ + +
Sbjct: 558 EATVAGIFEPVTLDSV----DISSNGWSMKPFVQGETLQLYNPNHSGSVQWTN--VTGNP 611
Query: 599 PLTWYKTVFDAPTGSD-PVAINLISMGKGEAWVNGQSIGRYWVSF------------LTP 645
PLTWYK F+ S+ +A++++ M KG +VNG +IGRYW++ +P
Sbjct: 612 PLTWYKFNFNLELSSNMSLALDMLGMTKGMIFVNGYNIGRYWLALAYGCNPCTYQGGYSP 671
Query: 646 Q------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ +YH+P +L N +V+ EE G P I++
Sbjct: 672 SMCQLGCGEPSQQYYHVPTDWLMNGENEIVIFEEVYGNPEAITL 715
>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
Length = 1064
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 185/328 (56%), Positives = 242/328 (73%), Gaps = 5/328 (1%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
NV+YD R+L+I+G R++L S IHYPR+TP+MWP LIAK+KEGG DV+QT VFWN HEP
Sbjct: 28 NVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNGHEPV 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q++F GR D+V+F+K V + GLY+ LRIGP++ EW +GG P WL D+PGI FR+DN
Sbjct: 88 RRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PFK M+R+ IV++M+ L++ QGGPII+ QIENEYG VE SF ++G YV+WAA++
Sbjct: 148 PFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKWAARM 207
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
A++L GVPWVMC+Q DAPD +INACNG C + PNS +KP +WTE+W ++ +G
Sbjct: 208 ALELDAGVPWVMCQQADAPDIIINACNGFYCDAFW--PNSANKPKLWTEDWNGWFASWGG 265
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R EDIA+ VA F + GS+ NYYMY GGTNFGR++ + +T Y AP+DEYG
Sbjct: 266 RTPKRPVEDIAFAVARFFQR-GGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEYG 324
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGV 355
LL QPKWGHLKELH+A+KLC +P L V
Sbjct: 325 LLSQPKWGHLKELHAAIKLC-EPALVAV 351
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 190/477 (39%), Positives = 260/477 (54%), Gaps = 50/477 (10%)
Query: 374 SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV--------- 424
S C+AFL N D+ A+V F +Y+LPP S+SILPDC+T FNTAK+ +
Sbjct: 585 SSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNTAKVGAQTSIKTNKIS 644
Query: 425 ---EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD-------- 473
+ W KE I + E + +LE +N TKD SDYLW R D
Sbjct: 645 YVPKTWMTLKEPISVWSENNFTIQGVLEHLNVTKDHSDYLWRITRINVSAEDISFWEENQ 704
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
L + S+ +LH F+NG+ +GS G H K + + + L+ G N++ LLS VGL
Sbjct: 705 VSPTLSIDSMRDILHIFVNGQLIGSVIG-HWVK---VVQPIQLLQGYNDLVLLSQTVGLQ 760
Query: 534 DSGAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
+ GA+LE+ AG + V + G K + D S +SW YQVGL GE +I+ S W+
Sbjct: 761 NYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTD 820
Query: 592 YG-SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------- 643
++ TWYKT FDAP G +PVA++L SMGKG+AWVNG IGRYW
Sbjct: 821 LTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWTRVAPKDGCGK 880
Query: 644 -------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTT 690
T G P+Q WYHIPRS+L+ + NLLVL EE G P IS+ + S T
Sbjct: 881 CDYRGHYHTSKCATNCGNPTQIWYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQT 940
Query: 691 LCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNG 750
+C VS+SH P + +W + + ++ P++ ++C G IS I FASYG P G
Sbjct: 941 ICAEVSESHYPSLQNWSPSDFIDQNSKNKM---TPEMHLQCDDGHTISSIEFASYGTPQG 997
Query: 751 NCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+C+ ++ G CH+ NS A+V KAC GK SC + + F GDPC GI K L V+A+C
Sbjct: 998 SCQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFGGDPCRGIVKTLAVEAKCA 1054
>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
Length = 486
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 198/342 (57%), Positives = 239/342 (69%), Gaps = 15/342 (4%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
LCL + +TIG +VTYD +++IING R+IL SGSIHYPRSTPQMWP LI
Sbjct: 8 FLCLLTWVCSTIG----------SVTYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLI 57
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK+GGLD+++T VFWN HEP PG++ F R DLVRFIK VQ GLYV LRIGP++ E
Sbjct: 58 QKAKDGGLDIIETYVFWNGHEPSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAE 117
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W YGG P WL VPGI FR+DN PFK M+++ IV+MMK +L+ +QGGPIILSQIEN
Sbjct: 118 WNYGGFPIWLKFVPGIAFRTDNAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIEN 177
Query: 186 EYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG 245
EYG VE G Y +WAA++AV L+TGVPWVMCKQ+DAPDP+I+ CNG C E F
Sbjct: 178 EYGPVEWEIGAPGKSYTKWAAQMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYC-ENFK- 235
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
PN KP IWTENW+ +Y +G R ED+A+ VA FI GS VNYYMYHGGTNF
Sbjct: 236 PNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQN-GGSLVNYYMYHGGTNF 294
Query: 306 GRTASAYVLTGYYDQAPLDEYGLLRQPKWG--HLKELHSAVK 345
GRT+ +V T Y AP+DEYGLLR+P G LK L+ +
Sbjct: 295 GRTSGLFVTTSYDFDAPIDEYGLLREPILGPVTLKGLNEGTR 336
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 72/159 (45%), Positives = 97/159 (61%), Gaps = 22/159 (13%)
Query: 546 LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
L V+++G E +D S + W Y+VGL GE L +++ GS V W + GS QPLTWYK
Sbjct: 323 LGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMK-GSFQKQPLTWYK 381
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY--------------WVSFLTPQ---- 646
T F+ P G++P+A+++ SM KG+ WVNG+SIGRY + F T +
Sbjct: 382 TTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIARGKCNKCSYTGFFTEKKCLW 441
Query: 647 --GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
G PSQ WYHIPR +L P GNLL++LEE G P GIS+
Sbjct: 442 NCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISL 480
>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
Length = 585
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 237/583 (40%), Positives = 311/583 (53%), Gaps = 81/583 (13%)
Query: 298 MYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVL 356
MY GGTNFGRT+ + +T Y APLDEYGL +PKWGHLK+LH+A+KLC +++
Sbjct: 1 MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAAD- 59
Query: 357 VSMNFSKL---QEAFIFQGSSE-----CAAFLVNKDKRNNATVYFSNLMYELPPLSISIL 408
+ + KL QEA I+ G E CAAFL N D+ +A V F+ Y LPP S+SIL
Sbjct: 60 -APQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSIL 118
Query: 409 PDCKTVAFNTAKL----------------------------DSV----EQWEEYKEAIPT 436
PDC+ VAFNTAK+ D+V + W KE I
Sbjct: 119 PDCRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGI 178
Query: 437 YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD--------SESVLKVSSLGHVLH 488
+ E + LLE +N TKD SDYLW+ R D S + + S+ VL
Sbjct: 179 WGENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLR 238
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR- 547
F+N + GS G H K+ + V I G N++ LL+ VGL + GA+LE+ AG R
Sbjct: 239 VFVNKQLAGSIVG-HWVKAV---QPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRG 294
Query: 548 NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL-TWYKT 605
+ G K D S SW YQVGL GE +I+T + WS + + WYKT
Sbjct: 295 KAKLTGFKNGDLDLSKSSWTYQVGLKGEADKIYTVEHNEKAEWSTLETDASPSIFMWYKT 354
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWV---------------------SFLT 644
FD P G+DPV +NL SMG+G+AWVNGQ IGRYW T
Sbjct: 355 YFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRYWNIISQKDGCDRTCDYRGAYNSDKCTT 414
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVI 704
G P+Q+ YH+PRS+LKP+ NLLVL EE G P IS+ TV+ LCG VS+SH PP+
Sbjct: 415 NCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPPLR 474
Query: 705 SWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
W + + + I P+V + C G IS I FASYG P G+C+ ++IG CH+SN
Sbjct: 475 KWSTPDY--INGTMSINSVAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASN 532
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S +IV +AC G+ SC + V F DPC G K L V ++C+
Sbjct: 533 SLSIVSEACKGRNSCFIEVSNTAFISDPCSGTLKTLAVMSRCS 575
>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
Length = 338
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 178/323 (55%), Positives = 237/323 (73%), Gaps = 5/323 (1%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+NV+YD +LIING R+I+FSGSIHYPRST MWP LI KAK+GGLD ++T +FW+ HE
Sbjct: 19 GDNVSYDSNALIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHE 78
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
PQ ++DFSGR D ++F + +Q GLYV +RIGP++ EW YGG P WLH++PGI R++
Sbjct: 79 PQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWLHNMPGIQLRTN 138
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV-EHSFLEKGPPYVRWA 205
N+ +K M+ + T IVNM K A L+ASQGGPIIL+QIENEYG V ++ + G Y+ W
Sbjct: 139 NQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAYGDAGKAYINWC 198
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
A++A L GVPW+MC+Q DAP P+IN CNG C + F PN+P P ++TENW +++
Sbjct: 199 AQMAESLNIGVPWIMCQQSDAPQPMINTCNGFYC-DNFT-PNNPKSPKMFTENWVGWFKK 256
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLD 324
+GD+ R+AED+A+ VA F + G + NYYMYHGGTNFGRT+ +T YD APLD
Sbjct: 257 WGDKDPYRTAEDVAFSVARFF-QSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLD 315
Query: 325 EYGLLRQPKWGHLKELHSAVKLC 347
EYG L QPKWGHLK+LH+++ +C
Sbjct: 316 EYGNLNQPKWGHLKQLHASIXIC 338
>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
Length = 827
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 243/713 (34%), Positives = 362/713 (50%), Gaps = 78/713 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R++IING RK+L+S SIHYPRST MWP ++ + K G++ ++T +FWNLH+P P
Sbjct: 32 VSYDNRAIIINGERKLLYSASIHYPRSTRTMWPDILKRTKAAGINTIETYIFWNLHQPTP 91
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
+DF G D+ F+ + +G +V +R GP++ EW GGLP WL VPGIV+R+ NEP
Sbjct: 92 DTYDFEGSSDVKHFLDLCKEEGFHVIVRFGPYVCAEWNNGGLPSWLKAVPGIVYRTHNEP 151
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEK-GPPYVRWAAKL 208
F MK++ IV+ + + YA GGPII++QIENEYG +E+ + E+ GP YV WA KL
Sbjct: 152 FMREMKKWMDYIVHYL--SDYYAPNGGPIIMAQIENEYGWLEYEYREQGGPEYVDWAVKL 209
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGE--TFAGPNSPDKPAIWTENWTSFYQVY 266
A TG+PW+MC+Q+ D VIN CNG C + + PD+PA +TE WT + Q +
Sbjct: 210 AKSYNTGIPWIMCQQNTRSD-VINTCNGFYCHDWLQYHQRTFPDQPAFFTELWTGWPQYF 268
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
+ R D+ Y A F ++ G VNYYM+HGGT FGR S ++ T Y APLDEY
Sbjct: 269 EEGFPTRPTVDVLYSAARFYSR-GGGMVNYYMWHGGTTFGRFTSPFLTTSYDYDAPLDEY 327
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF---SKLQEAFIFQGSSECAAFLVNK 383
G ++PK+ L +LH ++ +L V + E ++ +E FLVN
Sbjct: 328 GFPQEPKYSMLTKLHVTLEKYSSVILHDPNVPPPYVFPDNTVEMIEYKKDAESVVFLVNW 387
Query: 384 D----------------KRNNATVYFSNLM----YELP-----------PLSISILPDCK 412
D + + +Y++N + +E+P P++ + L
Sbjct: 388 DDTFAKQVDMNGKNVKINQWSVQIYYNNELVFDTFEIPANLTRPNPPFKPIAKTSLDATA 447
Query: 413 TVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS 472
T ++ V W E + TY+ +S Q+ T D SDY+WY D +
Sbjct: 448 AATSRTGLVNLVSSWNE-PFSFLTYNASSQTPT---AQLKLTGDNSDYIWYETEI--DLT 501
Query: 473 DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGL 532
++ +L + + F++G+F+ G F + V G + + +L +G+
Sbjct: 502 KTDEILYLYKSYDFSYVFVDGQFLYWHRGSPIQAYFNGKFPV----GKHTLQILCAAMGV 557
Query: 533 PDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
P GA++E+ GL G+K + D W + L GE L + V WS
Sbjct: 558 PSYGAHIEQHERGLTGDIFLGSKNITD---NGWKMRPFLSGELLGLHA--SPSTVKWSPV 612
Query: 593 GSSTH-QPLTWYKTVFDAPTGSD--PVAINLISMGKGEAWVNGQSIGRYWVS-------- 641
T +TWYK P+ D A++L SM KG +VNG SIGRYWV+
Sbjct: 613 SKGTAGSGVTWYKFNVKTPSFEDGPAFALDLKSMWKGLVFVNGNSIGRYWVAKGWCEEKC 672
Query: 642 ----------FLTPQGTPSQSWYHIPRSFLKPTG-NLLVLLEEENGYPPGISI 683
G SQ +YH+P+ FLK + N +++ EE G P I +
Sbjct: 673 NQTGLYDNYGCRENCGESSQRYYHVPKDFLKESSDNEVIIFEELQGDPYSIEL 725
>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
Length = 775
Score = 380 bits (976), Expect = e-102, Method: Compositional matrix adjust.
Identities = 241/630 (38%), Positives = 337/630 (53%), Gaps = 85/630 (13%)
Query: 231 INACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMK 290
IN CNG C +TF PN+P P ++TENW+ +Y+++G + R+AED+A+ VA F+ +
Sbjct: 164 INTCNGYYC-DTFK-PNNPKSPKMFTENWSGWYKLWGGKTSYRTAEDMAFSVARFV-QAG 220
Query: 291 GSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLK 349
G + NYYMY+GGTNFGRTA +T YD +PLDEYG L QPKWGHLK+LH+++KL K
Sbjct: 221 GVFNNYYMYYGGTNFGRTAGGPYITASYDYDSPLDEYGNLNQPKWGHLKQLHASIKLGEK 280
Query: 350 PMLSGVLVSMNFSKLQE--AFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISI 407
+ +G + NF + A+ + E FL N + + + Y +P S+SI
Sbjct: 281 IITNGTVTIKNFQAGVDLTAYTNNATRERFCFLSNINIADAHIDLQQDGNYTIPAWSVSI 340
Query: 408 LPDCKTVAFNTAKLD---SVEQWEEYKEAIPT---------------YDETSLRANFLLE 449
L +C FNTAK++ S+ + Y+ PT + R + LL+
Sbjct: 341 LQNCSKEIFNTAKVNTQTSLMVKKLYENDKPTNLSWVWAPEPMKDTLLGKGRFRTSQLLD 400
Query: 450 QMNTTKDASDYLWYNFRFKHDPSD---SESVLKVSSLGHVLHAFINGEF-VGSAHGKHSD 505
Q TT DASDYLWY F + + + L+V+S GHVLHA++N + VGS +
Sbjct: 401 QKETTVDASDYLWYMTSFDMNKNTLQWTNVTLRVTSRGHVLHAYVNKKLIVGSQLVIQGE 460
Query: 506 KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ---GAKELKDFSS 562
FT EK V L G N +SLLS VGL + G++ ++ G+ + +Q K + D SS
Sbjct: 461 --FTFEKPVTLKPGNNVISLLSATVGLANYGSFFDKTPVGIVDGPVQLMANGKPVMDLSS 518
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRY-GSSTHQPLTWYKTVFDAPTGSDPVAINLI 621
W Y++GL GE + F D SR WS G ST +P+TWYKT F +P+G+DPV ++L
Sbjct: 519 NLWSYKIGLNGEAKR-FYDPTSRHNKWSAANGVSTARPMTWYKTTFSSPSGTDPVVVDLQ 577
Query: 622 SMGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRS 659
MGKG AW NG+S+GRYW S + G P+Q WYH+PRS
Sbjct: 578 GMGKGHAWANGKSLGRYWPSQIANANGCSGTCDYRGPYNAGKCTRNCGIPTQRWYHVPRS 637
Query: 660 FLKPTG-NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHK 718
FL G N L+L EE G P GIS V+ T+CG+ +
Sbjct: 638 FLNSNGKNTLILFEEVGGDPSGISFQIVTTETICGNAYEGS------------------- 678
Query: 719 RIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRS 778
+++ C GR IS+I FASYGNP G C ++ GS + NS +V+K C+GK S
Sbjct: 679 -------TLELSCQGGRTISEIQFASYGNPQGTCSSFKKGSFDAMNSVQMVQKECVGKDS 731
Query: 779 CTVPVWTEKFYGDPCPGIP-KALLVDAQCT 807
C++ E F + GI K L V A C+
Sbjct: 732 CSIIASDETFMVNEPQGISNKRLAVQAHCS 761
Score = 178 bits (452), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 75/122 (61%), Positives = 94/122 (77%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V YD +LIING RKI+FSG+IHYPRSTP+MWP LI KAK+GGLD ++T VFW+ HEP
Sbjct: 25 VEYDSNALIINGERKIIFSGAIHYPRSTPEMWPELINKAKDGGLDAIETYVFWDRHEPVR 84
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
Q+DFSG D+V+F + +Q GLYV LRIGP++ EW YGG P WLH+ PG+ R+DNE
Sbjct: 85 RQYDFSGNLDIVKFFRVIQEAGLYVILRIGPYVCAEWNYGGFPMWLHNTPGVELRTDNEI 144
Query: 150 FK 151
+K
Sbjct: 145 YK 146
>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
lyrata]
Length = 534
Score = 377 bits (969), Expect = e-101, Method: Compositional matrix adjust.
Identities = 228/537 (42%), Positives = 305/537 (56%), Gaps = 67/537 (12%)
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPML-SGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
GLLRQPKWGHL++LH A+KLC ++ + +S S L+ A S CAAFL N
Sbjct: 9 GLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGT 68
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV--------------------- 424
+++ATV F+ Y LP S+SILPDCK VAFNTAK++S
Sbjct: 69 KSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKINSATEPTAFARQSLKPDGGSSAEL 128
Query: 425 -EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF--KHDPS----DSESV 477
+W KE I + LLEQ+NTT D SDYLWY+ R K D + S++V
Sbjct: 129 GSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAV 188
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L + SLG V++AFING+ GS HGK + +L+ ++L+ G N V LLSV VGL + GA
Sbjct: 189 LHIESLGQVVYAFINGKLAGSGHGK---QKISLDIPINLVAGKNTVDLLSVTVGLANYGA 245
Query: 538 YLERRVAGLRN-VSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
+ + AG+ V+++ AK D +S W YQVGL GE + S V S+
Sbjct: 246 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLGAVDSSEWV--SKSPL 303
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------- 646
T QPL WYKT FDAP+GS+PVAI+ KG AWVNGQSIGRYW + +
Sbjct: 304 PTKQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIGRYWPTSIAGNGGCTDSCD 363
Query: 647 --------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV-TTL 691
G PSQ+ YH+PRS+LKP+GN LVL EE G P IS T + L
Sbjct: 364 YRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQTGSNL 423
Query: 692 CGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCP-SGRKISKILFASYGNPNG 750
C VS SH PPV +W S ++ + + RP + ++CP S + IS I FAS+G P G
Sbjct: 424 CLTVSQSHPPPVDTWTSDSKISNRNR-----TRPVLSLQCPVSTQVISSIKFASFGTPKG 478
Query: 751 NCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
C ++ GSC+SS S ++V+KAC+G RSC + V T + +G+PC G+ K+L V+A C+
Sbjct: 479 TCGSFTSGSCNSSRSLSLVQKACIGSRSCNIEVST-RVFGEPCRGVVKSLAVEASCS 534
>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
Length = 759
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 243/712 (34%), Positives = 367/712 (51%), Gaps = 90/712 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V YD RSL ING RK++ SGSIHYPRSTP MWP LI K+K+ G+++++T VFWNLH+P
Sbjct: 46 VEYDQRSLKINGERKLMISGSIHYPRSTPSMWPSLIKKSKDAGINMIETYVFWNLHQPNN 105
Query: 90 GQ-FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
Q ++F G ++ F+ Q +GLYV LRIGP++ EW YGG+P WL ++PGIVFR N+
Sbjct: 106 SQEYNFEGNANITHFLDLCQQEGLYVHLRIGPYVCAEWNYGGIPSWLRNIPGIVFRDYNQ 165
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
P+ M + T IVN +K +AS GGPIIL+Q+ENEYG +E+ + + G Y WA
Sbjct: 166 PWMTEMASWMTFIVNYLKP--YFASNGGPIILAQVENEYGWLENEYGDSGKLYAEWAISF 223
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGE--TFAGPNSPDKPAIWTENWTSFYQVY 266
A L G+PW MC+Q+D D IN CNG C + + P++PA +TENW + Q Y
Sbjct: 224 AKSLNIGIPWTMCQQNDI-DDAINTCNGFYCHDWIQYHFQVYPNQPAFFTENWAGWIQYY 282
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
+ R ED+ Y VA + ++ GS +NYYM+HGGT F R +S ++ Y A LDEY
Sbjct: 283 SEGVPHRPTEDLLYSVARWFSR-GGSLMNYYMWHGGTTFARYSSTFLTNSYDYDAALDEY 341
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVS--MNFSK---------LQEAFIFQGSSE 375
G +PK+ L +LHS + +LS V+ +N S +Q G+ E
Sbjct: 342 GYEAEPKYSALAQLHSVLSQYSYILLSSGEVARPVNISNITTCNTIEIIQYNTTINGTLE 401
Query: 376 CAAFLVNKDKRNNATVY--FSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEA 433
F+ N ++A V ++ + P S+ IL + +TV + +E+ ++
Sbjct: 402 TITFVTNFGVSSSAPVQLNWNGQTITVNPWSVLILYNNQTVIDTSYVKQQYSAQKEFYQS 461
Query: 434 -------IPTYDE--------TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVL 478
+ ++ E + AN EQ++ T D +DYL
Sbjct: 462 KRVKNVLVSSWTEPIGVGNYSNVVTANLPSEQLDLTLDQTDYL----------------- 504
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
+ +++ +I+GE+ + G S F L+ + GT+ +S+LS+ +GL G++
Sbjct: 505 --CNADDMIYIYIDGEYQSWSRG--SPAHFVLDTKFGI--GTHKLSILSLTMGLISYGSH 558
Query: 539 LERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS-STH 597
E GL G +D ++ W + L+GE I ++ + WS S +
Sbjct: 559 FESYKRGLNGTVTLGT---QDITNNGWSMRPYLVGEMQGIQSN--PHLTSWSINNELSIN 613
Query: 598 QPLTWYK---TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF------------ 642
QPLTWYK + + A+++I M KG VNG SIGRYW++
Sbjct: 614 QPLTWYKLNLIIQSEIQDTSSFALDMIGMNKGFIIVNGNSIGRYWLTLGWGCGSGCNYTG 673
Query: 643 --------LTPQGTPSQSWYHIPRS--FLKPTG-NLLVLLEEENGYPPGISI 683
T G PS+ +YH+P +L+P N +++ EE +G P I +
Sbjct: 674 DGYQGYLCRTGCGEPSERYYHVPNDYLYLEPNQLNEIIVFEELSGDPNSIQL 725
>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
Length = 500
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 216/504 (42%), Positives = 286/504 (56%), Gaps = 47/504 (9%)
Query: 222 KQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYH 281
KQDDAPDPVIN CNG C + PN KP++WTE WT ++ +G R ED+A+
Sbjct: 1 KQDDAPDPVINTCNGFYC--DYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFA 58
Query: 282 VALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DE+GLLRQPKWGHL++L
Sbjct: 59 VARFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDL 117
Query: 341 HSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG-SSECAAFLVNKDKRNNATVYFSNLMYE 399
H A+K ++S + ++A++F+ + CAAFL N V F+ Y
Sbjct: 118 HRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYN 177
Query: 400 LPPLSISILPDCKTVAFNTA---------KLDSVEQ--WEEYKEAIPTYDETSLRANFLL 448
LP SISILPDCKT FNTA K++ V + W+ Y E + +++ + L+
Sbjct: 178 LPAWSISILPDCKTAVFNTATVKEPTLMPKMNPVVRFAWQSYSEDTNSLSDSAFTKDGLV 237
Query: 449 EQMNTTKDASDYLWYNFRFKHDPSDSES----VLKVSSLGHVLHAFINGEFVGSAHGKHS 504
EQ++ T D SDYLWY +D S L V S GH + F+NG+ GS +G +
Sbjct: 238 EQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVNGKSYGSVYGGYD 297
Query: 505 DKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN----VSIQGAKELKDF 560
+ T V + G+N +S+LS VGLP+ G + E G+ S+ G KD
Sbjct: 298 NPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSLNGGT--KDL 355
Query: 561 SSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINL 620
S W YQVGL GE L + T GS V W G +QPLTW+K F+AP G+DPVA+++
Sbjct: 356 SHQKWTYQVGLKGETLGLHTVTGSSAVEWG--GPGGYQPLTWHKAFFNAPAGNDPVALDM 413
Query: 621 ISMGKGEAWVNGQSIGRYW----------VSFL---------TPQGTPSQSWYHIPRSFL 661
SMGKG+ WVNG +GRYW S+ + G SQ WYH+PRS+L
Sbjct: 414 GSMGKGQLWVNGHHVGRYWSYKASGGCGGCSYAGTYHEDKCRSNCGDLSQRWYHVPRSWL 473
Query: 662 KPTGNLLVLLEEENGYPPGISIDT 685
KP GNLLV+LEE G G+S+ T
Sbjct: 474 KPGGNLLVVLEEYGGDLAGVSLAT 497
>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 655
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 210/520 (40%), Positives = 292/520 (56%), Gaps = 54/520 (10%)
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE-CAAFLVNKDK 385
GLLR+PKWGHLKELH A+KLC +++G + + Q+A +F+ S++ C AFL NKDK
Sbjct: 149 GLLREPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDK 208
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDS-VEQ----------WEEYKEAI 434
+ A V F+ + Y+LPP SISILPDCKT +NTA + S + Q W+ Y E I
Sbjct: 209 VSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQISQMKMEWAGGFTWQSYNEDI 268
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYN--FRFKHD----PSDSESVLKVSSLGHVLH 488
+ + S LLEQ+N T+D +DYLWY D + +L V S GH LH
Sbjct: 269 NSLGDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVMSAGHALH 328
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
F+NG+ G+ +G D T V L +G+N +S LS+ VGLP+ G + E AG+
Sbjct: 329 IFVNGQLTGTVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFETWNAGILG 388
Query: 549 -VSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTV 606
V++ G E +D + W Y+VGL GE L + + GS V W QPL+WYK
Sbjct: 389 PVTLDGLNEGRRDLTWQKWTYKVGLKGEALSLHSLSGSSSVEWGE--PVQKQPLSWYKAF 446
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQ 646
F+AP G +P+A+++ SMGKG+ W+NGQ IGRYW + T
Sbjct: 447 FNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKASGTCGICDYRGEYDEKKCQTNC 506
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW 706
G SQ WYH+PRS+L PTGNLLV+ EE G P GIS+ ++C VS+ P + +W
Sbjct: 507 GDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRIAGSICADVSEWQ-PSMANW 565
Query: 707 RSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSR 766
R++ + KV ++C GRK++ I FAS+G P G+C +Y+ G CH+ S
Sbjct: 566 RTKGYE-----------KAKVHLQCDHGRKMTHIKFASFGTPQGSCGSYSEGGCHAHKSY 614
Query: 767 AIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
I K+C+G+ C V V + F GDPCPG K +V+A C
Sbjct: 615 DIFWKSCIGQERCGVSVVPDAFGGDPCPGTMKRAVVEAIC 654
>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
Length = 744
Score = 369 bits (946), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 243/753 (32%), Positives = 358/753 (47%), Gaps = 112/753 (14%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
V++D R+L+++G R ++ SG++HYPRSTP MWPR++ ++ GL+ V+T +FWNLHE +
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G DFSGR DLVRF + QA+GL V LRIGP+I E YGGLP WL DVP I R+DNE
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
FK R+ ++ +++ L A GGP+IL+QIENEY + ++ E G Y+RW+ +L
Sbjct: 122 AFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 209 AVDLQTGVPWVMC-----KQDDAPDPVINACNGRQCGETFAG--------PNSPDKPAIW 255
A L G+PWV C + D V +A + + F P++PA+W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLT 315
TENW +YQ +G R E++AY A F A GS VNY+++HGGTNFGR + T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAA-GGSGVNYFLWHGGTNFGRDGMYLLTT 298
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE 375
Y PLDEYG L K HL L+ A+ C +L+ + FQ SS
Sbjct: 299 AYEFGGPLDEYG-LPTTKARHLARLNKALAACADKILASERPRAITGERNGLLKFQYSSG 357
Query: 376 CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW--EEYKEA 433
+ + + ++Y+ S + P +T + + + W E A
Sbjct: 358 LTFWCDDVARTVRIVGKNGEVLYD---SSARVAPVRRTWKASGVRF-APWGWRAEPLPAA 413
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF-------------------------- 467
P ++++ A LEQ+ TKD +DY WY
Sbjct: 414 WPAEAQSAVTARKPLEQLLLTKDETDYCWYETAIVVEGSGDVLVAGRDGSPAGLERGALA 473
Query: 468 ---------------KHDPSDSESVLKVSSLGHVLHAFINGEFVGSA-------HGKHSD 505
P+++ + L+++ + ++H FI+G FV + GK
Sbjct: 474 RVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVFIDGTFVATTPTPLRERRGKMDA 533
Query: 506 KSFTLE-----KMVHLINGTNNVSLLSVMVGLPD-------SGAYLERRVAGLRNVSIQG 553
FT K + + G + +SLL +GL LE++ GL
Sbjct: 534 GLFTQTFELDLKALRITPGKHRLSLLCCALGLIKGDWMIGYENMALEKK--GLWAPVFWN 591
Query: 554 AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW----SRYGSSTHQPLTWYKTVFDA 609
K+L+ W +Q GLLGE+ ++ W + G +PL W++T F
Sbjct: 592 GKKLEG----EWRHQPGLLGERCGFADPAAGSLLAWKTAKAATGRGARRPLRWWRTTFTR 647
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT-----------------PQGTPSQS 652
P G P A++L MGKG AW+NG IGRYW+ T P P+Q
Sbjct: 648 PKGHGPWALDLGGMGKGMAWINGHCIGRYWLLADTDPMGPWMAWMKGSLTAAPSSGPTQR 707
Query: 653 WYHIPRSFLKPTG--NLLVLLEEENGYPPGISI 683
+YH+P +L+ G + LVL EE G P + +
Sbjct: 708 YYHVPDDWLRTDGGPDTLVLFEELGGDPATVRL 740
>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
Length = 735
Score = 368 bits (944), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 239/720 (33%), Positives = 357/720 (49%), Gaps = 85/720 (11%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G N+TYD RSLIING RK+L SGS+HYPR++ W ++ +K G+D+++T +FWN+
Sbjct: 37 NNGLNITYDHRSLIINGERKLLVSGSVHYPRASVSKWNEILKSSKLAGVDIIETYIFWNV 96
Query: 85 HEPQ-PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
H+P P +F ++ F+ + L+V LRIGP++ EW YGG P WL ++ GIVF
Sbjct: 97 HQPNTPNEFYLEDNANITLFLDLCKENELFVNLRIGPYVCAEWNYGGFPIWLKNIEGIVF 156
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R N+PF M + TM+V+ K +A GGPII++QIENEYG +E+ + G Y
Sbjct: 157 RDYNQPFMDAMSTWVTMVVD--KLQDYFAPNGGPIIIAQIENEYGWLENEYGASGREYAL 214
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
WA A L G+PW+MC Q+D D IN CNG C + + PD+PA WTENW
Sbjct: 215 WAINFAKSLNIGIPWIMCAQEDI-DSAINTCNGFYCHDWIDRHWNAFPDQPAFWTENWVG 273
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQ 320
+++ +G R +D+ + A FIA GS NYYM+ GGTNFGR+ +++T Y
Sbjct: 274 WFENWGQAVPKRPVQDMLFSSARFIA-YGGSLFNYYMWFGGTNFGRSVGGPWIITSYEYD 332
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN------FSKLQEAFIFQGSS 374
APLDE+G +PK+ + H + +++ M+ S + EA +
Sbjct: 333 APLDEFGFPNEPKYSMSTQFHFVIH-----KYESIIMGMDPPTPVPLSNISEAHPY---G 384
Query: 375 ECAAFLVNKDKRNNATVYFSNLMYELPPLSI------SILPDCKTVAFNTAKLDSVEQWE 428
E FL N + + + Y L P S+ S++ D V K + +Q++
Sbjct: 385 EDLVFLTNFGLVIDY-IQWQGTNYTLQPWSVVIVYSGSVVFDTSYVPDEYIKPSTRDQFK 443
Query: 429 EYKEAIPTYD---------------ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD 473
+ AI YD + + LEQ+N T D +DYLWY + +
Sbjct: 444 DVPNAI-NYDSILSFSEWGQSDIINDCIINNESPLEQINLTNDTTDYLWYTTNITLNET- 501
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
+ L + ++ H F+NG + G +G TLE IN + +L++ +GL
Sbjct: 502 --TTLTIENMYDFCHVFLNGAYQG--NGWSPVAYITLEPTNGNINY--QLQILTMTMGLE 555
Query: 534 DSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
+ A++E GL G + ++ W + G+LGEKLQI+ +Y S V W Y
Sbjct: 556 NYAAHMESYSRGLLGSISLGQTNI---TNNQWSMKPGILGEKLQIYNEYSSSKVNWQPYN 612
Query: 594 SSTHQPLTWYK-----TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY---------- 638
S Q +TWY+ + S+ +N+ SM KG +VNG +IGRY
Sbjct: 613 PSATQSMTWYQFNISLDGLSSDPSSNAYVLNMTSMNKGFVYVNGFNIGRYFLMEATQSNC 672
Query: 639 -----WVSFLTPQGT------PSQSWYHIPRSFLKPTGN----LLVLLEEENGYPPGISI 683
++ TP PSQS YHIP +L + ++L EE NG P I +
Sbjct: 673 TLKQDYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFLQQDKQYATVILFEEVNGDPTKIQL 732
>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
gi|194699714|gb|ACF83941.1| unknown [Zea mays]
gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 346
Score = 367 bits (942), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 184/313 (58%), Positives = 219/313 (69%), Gaps = 4/313 (1%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD +++++NG R+IL SGSIHYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + T IV+MMK+ L+ QGGPIILSQIENE+G +E E Y WAA +AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L T VPWVMCK+DDAPDP+IN CNG C + PN P KP +WTE WTS+Y +G
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTSWYTGFGIPV 267
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+AY VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DEYG L
Sbjct: 268 PHRPVEDLAYGVAKFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGEL 326
Query: 330 RQPKWGHLKELHS 342
+G L+S
Sbjct: 327 NTFYFGKRHALYS 339
>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
Length = 346
Score = 365 bits (936), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 183/313 (58%), Positives = 218/313 (69%), Gaps = 4/313 (1%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD +++++NG R+IL SGSIHYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL VPGI R+DNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISLRTDNEPF 149
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K M+ + T IV+MMK+ L+ QGGPIILSQIENE+G +E E Y WAA +AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L T VPWVMCK+DDAPDP+IN CNG C + PN P KP +WTE WTS+Y +G
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTSWYTGFGIPV 267
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+AY VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DEYG L
Sbjct: 268 PHRPVEDLAYGVAKFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGEL 326
Query: 330 RQPKWGHLKELHS 342
+G L+S
Sbjct: 327 NTFYFGKRHALYS 339
>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
Length = 735
Score = 365 bits (936), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 242/716 (33%), Positives = 378/716 (52%), Gaps = 90/716 (12%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V+YD R++ ING+R +LFSG IHYPRSTP MWP L++KAKE GL+ +QT VFWN+HE +
Sbjct: 33 HVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNMHEQK 92
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G +DFSGR +L F++E GL+V LR+GP++ EW YG LP WL+++P I FRS N+
Sbjct: 93 RGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSND 152
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
+K MKR+ + I+ + A GGPIIL+QIENEYG + + YV W L
Sbjct: 153 AWKSEMKRFLSDIIVYVDG--FLAKNGGPIILAQIENEYGGNDRA-------YVDWCGSL 203
Query: 209 AVD--LQTGVPWVMCKQDDAPDPVINACNGRQCGE----TFAGPNSPDKPAIWTENWTSF 262
+ T +PW+MC A + I CNG C + P++P ++TENW +
Sbjct: 204 VSNDFASTQIPWIMCN-GLAANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GW 261
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAP 322
+Q +G+ IR+ ED+AY VA + A G+Y YYM+HGG ++GRT + + T Y D
Sbjct: 262 FQGWGEGLGIRTPEDLAYSVAEWFAN-GGAYHAYYMWHGGNHYGRTGGSGLTTAYSDDVI 320
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL---------------QEA 367
L G +PK+ HL L L + VL+S + ++L Q
Sbjct: 321 LRADGTPNEPKFTHLNRLQR-----LLASQAQVLLSQDSARLPIPYWDGKQWSVGTQQMV 375
Query: 368 FIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ- 426
+ + S + F++N+ + V F+ + S+ I + + + +N+A + + +
Sbjct: 376 YSYPPSIQ---FVINQAAF-SLFVLFNKQNISIAGQSVQIYDNNEHLLWNSADVSGIFRN 431
Query: 427 -------------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD 473
W+ Y E + D + A+ LEQ+N T D + YLWY
Sbjct: 432 NTFLVPIVVGPLDWQVYSEPFLS-DLPVIVASTPLEQLNLTNDETIYLWYRRNVSLSQPS 490
Query: 474 SESVLKVSS-LGHVLHAFINGEFVG----SAHGKHS-DKSFTLEKMVHLINGTNNVSLLS 527
++++++V + + L F++ +FVG +H + + + + TL L N +LS
Sbjct: 491 AQTIVQVQTRRANSLIFFMDRQFVGYFDDHSHAQGTINVNITLNLSQFLPNQQYLFEILS 550
Query: 528 VMVGLPD----SGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYG 583
V +G+ + G++ + + G NVS+ G + D +S W +Q GL GE QI+T+ G
Sbjct: 551 VSLGIDNFNIGPGSFEYKGIVG--NVSLGGQSLVGDEASI-WEHQKGLFGEAYQIYTEQG 607
Query: 584 SRIVPWS-RYGSSTHQPLTWYKTVFD------APTGSDPVAINLISMGKGEAWVNGQSIG 636
S+ V W+ R+ ++ ++ +TW++T FD ++PV ++ + +G A+VNG IG
Sbjct: 608 SKTVEWNPRWTTAINKSVTWFQTRFDLNHLVREDLNANPVLLDAFGLNRGHAFVNGNDIG 667
Query: 637 RYWVSFLTPQG-------------TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
YW+ T Q PSQ +YHIP +LKPT NLL + EE P
Sbjct: 668 LYWLIEGTCQNKLCCCLQNQTNCQQPSQRYYHIPSDWLKPTNNLLTVFEEIGASSP 723
>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 707
Score = 361 bits (927), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 217/606 (35%), Positives = 332/606 (54%), Gaps = 52/606 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYDGRSL+ING RK+ SGS+HYPRSTP +W +++A +K G++++ T VFW+LHEPQ
Sbjct: 108 VTYDGRSLLINGERKLFVSGSVHYPRSTPTIWKKVLALSKNSGINMIDTYVFWDLHEPQR 167
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F G +L F+ Q GL+V LRIGP+I EW YGGLP WL D+PGI R N
Sbjct: 168 GVYNFEGNANLKHFLDLCQQNGLFVNLRIGPYICAEWNYGGLPIWLKDIPGIKMRDFNTQ 227
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ ++R+ IV+ + +A QGGPI+L+QIENEY V+ + E G + W A LA
Sbjct: 228 YMEEVERWMKFIVDYLHG--YFAPQGGPIVLAQIENEYNWVQWRYQESGRKFAHWCADLA 285
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGE--TFAGPNSPDKPAIWTENWTSFYQVYG 267
L G+PW+MC+QDD P VIN CNG C E F N D+P ++TENW+ ++ +
Sbjct: 286 NRLDIGIPWIMCQQDDIPT-VINTCNGYYCHEWINFHWNNFKDQPPLFTENWSGWFNNWV 344
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYG 327
+ R R D+ Y A + A G+ +NYYM+HGGTNFGR + + Y APL+EYG
Sbjct: 345 NAVRHRPVADLLYSAARWFAS-GGALMNYYMWHGGTNFGRKSGPMIALSYDYDAPLNEYG 403
Query: 328 LLRQPKWGHLKELHSAVKLCLKPML------SGVLVSMNFSKLQEAFIFQGSSECAAFLV 381
R PK+ ++ + + L L+ +L + + ++ N S + ++ + A+F++
Sbjct: 404 NPRNPKYSQTRDFNKLI-LSLEDILLSQYPPTPIFLANNISVIH----YRNGNNSASFII 458
Query: 382 NKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAK-----LDSVEQWEE------- 429
N ++ N+ V F Y S+ IL + +V F++++ D+V + E
Sbjct: 459 NSNENGNSKVMFEGRSYFSYAYSVQILKNYVSV-FDSSQNPRNYTDTVVESEPNIPFANS 517
Query: 430 -YKEAIPTYD-ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVL 487
+ + +D E SL N L+EQ+N TKD +DY+WY HD D E +LKV + ++
Sbjct: 518 IISKHVERFDFEESLYDNRLMEQLNLTKDETDYIWYTTMINHD-QDGE-ILKVINKTDIV 575
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR 547
H F++ +VG+ + + G + + LL +G+ ++E AG+
Sbjct: 576 HVFVDSYYVGTIMSDSL-------AITGVPLGPSTLQLLHTKMGIQHYELHMENTKAGIL 628
Query: 548 NVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTD-YGSRIVPWSRYGSSTHQ-----PLT 601
G E+ ++ WG + + EK + TD S+ V WS ++ PLT
Sbjct: 629 GPVYYGDIEI---TNQMWGSKPFVSSEK--VITDPIQSKFVRWSPLDRKPNEVFYSVPLT 683
Query: 602 WYKTVF 607
WYK +F
Sbjct: 684 WYKFIF 689
>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
Length = 735
Score = 360 bits (925), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 239/712 (33%), Positives = 370/712 (51%), Gaps = 84/712 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R++ ING+R +LFSG IHYPRSTP MWP L++KAKE GL+ +QT VFWN+HE +
Sbjct: 34 VSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNIHEQKR 93
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G +DFSGR +L F++E GL+V LR+GP++ EW YG LP WL+++P I FRS N+
Sbjct: 94 GTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSNDA 153
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+K MKR+ + I+ + A GGPIIL+QIENEYG + + YV W L
Sbjct: 154 WKSEMKRFLSDIIVYVDG--FLAKNGGPIILAQIENEYGGNDRA-------YVDWCGSLV 204
Query: 210 VD--LQTGVPWVMCKQDDAPDPVINACNGRQCGE----TFAGPNSPDKPAIWTENWTSFY 263
+ T +PW+MC A + I CNG C + P++P ++TENW ++
Sbjct: 205 SNDFASTQIPWIMCN-GLAANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GWF 262
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPL 323
Q +G+ IR+ ED+AY VA + A G+Y YYM+HGG ++GRT + + T Y D L
Sbjct: 263 QGWGEGLGIRTPEDLAYSVAEWFAN-GGAYHAYYMWHGGNHYGRTGGSGLTTAYSDDVIL 321
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAF------------IFQ 371
G +PK+ HL L L + VL+S + ++L + +
Sbjct: 322 RADGTPNEPKFTHLNRLQR-----LLASQAQVLLSQDSNRLSIPYWNGKQWTVGTQQMVY 376
Query: 372 GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ----- 426
F++N+ + V F+ + S+ I + + +N+A + + +
Sbjct: 377 SYPPSVQFVINQAAF-SLFVLFNKQNISIAGQSVQIYDYNEHLLWNSADVSGISRNNTFL 435
Query: 427 ---------WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV 477
W+ Y E T D + A+ LEQ+N T D + YLWY +++
Sbjct: 436 VPIVVGPLDWQVYSEPF-TSDLPVIVASTPLEQLNLTNDETIYLWYRRNVSLSQPSVQTI 494
Query: 478 LKVSS-LGHVLHAFINGEFVG----SAHGKHS-DKSFTLEKMVHLINGTNNVSLLSVMVG 531
++V + + L F++ +FVG +H + + + + TL L N +LSV +G
Sbjct: 495 VQVQTRRANSLLFFMDRQFVGYFDDHSHTQGTINVNITLNLSQFLPNQQYIFEILSVSLG 554
Query: 532 LPD----SGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIV 587
+ + G++ + + G NVS+ G + D +S W +Q GL GE QI+T+ GS+ V
Sbjct: 555 IDNFNIGPGSFEYKGIVG--NVSLGGQSLVGDEASI-WEHQKGLFGEAHQIYTEQGSKTV 611
Query: 588 PWS-RYGSSTHQPLTWYKTVFD------APTGSDPVAINLISMGKGEAWVNGQSIGRYWV 640
W+ ++ + ++P+TW++T FD ++P+ ++ +G A+VNG IG YW+
Sbjct: 612 EWNPKWTTVINKPVTWFQTRFDLNHLAREDLNANPILLDAFGFNRGHAFVNGNDIGLYWL 671
Query: 641 SFLTPQGT-------------PSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
T Q PSQ +YHI +LKPT NLL + EE P
Sbjct: 672 IEGTCQNNLCCCLQNQTNCQQPSQRYYHISSDWLKPTNNLLTVFEEIGASSP 723
>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
Length = 342
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 183/313 (58%), Positives = 217/313 (69%), Gaps = 8/313 (2%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
TYD +++++NG R+IL SGSIHYPRS P+MWP LI KAK+GGLDVVQT VFWN HEP
Sbjct: 30 TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
Q+ F GR DLV FIK V+ GLYV LRIGP++ EW +GG P WL VPGI FR+DNEPF
Sbjct: 90 QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
K + T IV+MMK+ L+ QGGPIILSQIENE+G +E E Y WAA +AV
Sbjct: 150 ----KNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 205
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA 270
L T VPWVMCK+DDAPDP+IN CNG C + PN P KP +WTE WTS+Y +G
Sbjct: 206 ALNTSVPWVMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTSWYTGFGIPV 263
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLL 329
R ED+AY VA FI K GS+VNYYMYHGGTNFGRTA ++ T Y AP+DEYG L
Sbjct: 264 PHRPVEDLAYGVAKFIQK-GGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGEL 322
Query: 330 RQPKWGHLKELHS 342
+G L+S
Sbjct: 323 NTFYFGKRHALYS 335
>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 326
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 175/299 (58%), Positives = 211/299 (70%), Gaps = 4/299 (1%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R+++ING R+IL SGSIHYPRSTP+MWP L+ KAK+GGLDVVQT VFWN HEP
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ F R DLVRF+K + GLYV LRIGP++ EW +GG P WL VPGI FR+DN P
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 147
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
FK M+ + IV+MMK+ L+ QGGPIIL+Q+ENEYG +E PY WAAK+A
Sbjct: 148 FKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAAKMA 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
V GVPWVMCKQDDAPDPVIN CNG C + PNS KP +WTE WT ++ +G
Sbjct: 208 VATGAGVPWVMCKQDDAPDPVINTCNGFYC--DYFSPNSNSKPTMWTEAWTGWFTAFGGA 265
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYG 327
R ED+A+ VA FI K GS+VNYYMYHGGTNF RT+ ++ T Y AP+DEYG
Sbjct: 266 VPHRPVEDMAFAVARFIQK-GGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYG 323
>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
gi|194695440|gb|ACF81804.1| unknown [Zea mays]
Length = 467
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 190/457 (41%), Positives = 270/457 (59%), Gaps = 29/457 (6%)
Query: 376 CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ--------- 426
C AFL N + +++AT+ F Y +P SIS+L DC+TV F T +++
Sbjct: 7 CVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQ 66
Query: 427 ------WEEYK-EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP------SD 473
WE + E +P Y + +R + N TKD +DY+WY FK + SD
Sbjct: 67 TAQNNVWEMFDGENVPKYKQAKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSD 126
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
++VL+V+S GH AF+N +FVG HG +K+FTLEK + L G N+V++L+ +G+
Sbjct: 127 IKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMT 186
Query: 534 DSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
DSGAY+E R+AG+ V I G D ++ WG+ VGL+GE+ QI+TD G V W
Sbjct: 187 DSGAYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWK-- 244
Query: 593 GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQS 652
+ +PLTWYK FD P+G DPV +++ +MGKG +VNGQ IGRYW+S+ G PSQ
Sbjct: 245 PAMNDRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISYKHALGRPSQQ 304
Query: 653 WYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISW-RSQNQ 711
YH+PRSFL+ N+LVL EEE G P I I TV +C +S+ + ++SW R +Q
Sbjct: 305 LYHVPRSFLRQKDNMLVLFEEEFGRPDAIMILTVKRDNICTFISERNPAHIMSWERKDSQ 364
Query: 712 RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEK 771
T K + R + + CP + I +++FASYGNP G C NY +GSCH+ ++ +VEK
Sbjct: 365 ITAKAN--ADDLRARAALACPPKKLIQQVVFASYGNPAGICGNYTVGSCHTPRAKEVVEK 422
Query: 772 ACLGKRSCTVPVWTEKFYGDP-CPGIPKALLVDAQCT 807
ACLGKR CT+PV + + GD C G L V A+C+
Sbjct: 423 ACLGKRVCTLPVAADVYGGDANCSGTTATLAVQAKCS 459
>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
Length = 743
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 241/757 (31%), Positives = 356/757 (47%), Gaps = 121/757 (15%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
V++D R+L+++G R ++ SG++HYPRSTP MWPR++ ++ GL+ V+T +FWNLHE +
Sbjct: 2 TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G DFSGR DLVRF + QA+GL V LRIGP+I E YGGLP WL DVP I R+DNE
Sbjct: 62 RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
FK R+ ++ +++ L A GGP+IL+QIENEY + ++ E G Y+RW+ +L
Sbjct: 122 AFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179
Query: 209 AVDLQTGVPWVMC-----KQDDAPDPVINACNGRQCGETFAG--------PNSPDKPAIW 255
A L G+PWV C + D V +A + + F P++PA+W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLT 315
TENW +YQ +G R E++AY A F A GS VNY+++HGGTNFGR + T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAA-GGSGVNYFLWHGGTNFGRDGMYLLTT 298
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQE---AFIFQG 372
Y PLDEYGL K H A +G L++ + E +
Sbjct: 299 AYEFGGPLDEYGLPTT------KARHLARLNAALAACAGELLASERPGVVEKSSGVVEYH 352
Query: 373 SSECAAFLVNKDKRNNATVYFS-NLMYELPPLSISILPDCKTVAFNTAKLDSVEQW--EE 429
F+ + R V S ++Y+ S+ + P + + + + W E
Sbjct: 353 YDSGLVFVCDDTARAVRIVKKSGEVLYD---SSVRVAPVRRAWKSSGVRF-APWGWRAEP 408
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---------------------- 467
A P ++++ A LEQ+ TKD +DY WY
Sbjct: 409 LPAAWPAEAQSAVTARKPLEQLLPTKDETDYCWYETAIVVEGSGDVLVAGRDGSPAGLER 468
Query: 468 -------------------KHDPSDSESVLKVSSLGHVLHAFINGEFVGSA-------HG 501
P+++ + L+++ + ++H FI+G FV + G
Sbjct: 469 GALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVFIDGTFVATTPTPLRERRG 528
Query: 502 KHSDKSFTLE-----KMVHLINGTNNVSLLSVMVGLPD-------SGAYLERRVAGLRNV 549
K FT K + + G + +SLL +GL LE++ GL
Sbjct: 529 KMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIKGDWMIGYENMALEKK--GLWAP 586
Query: 550 SIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW----SRYGSSTHQPLTWYKT 605
K+L+ W +Q GLLGE+ ++ W + G +PL W++T
Sbjct: 587 VFWNGKKLEG----EWRHQPGLLGERCGFADPAAGSLLAWKTAKAATGRGARRPLNWWRT 642
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT-----------------PQGT 648
F P G P A++L MGKG W+NG IGRYW+ T P G
Sbjct: 643 TFTRPKGHGPWALDLGGMGKGFCWINGHCIGRYWLLPDTDPMGPWMAWMKGSLTAAPSGG 702
Query: 649 PSQSWYHIPRSFLKPTG--NLLVLLEEENGYPPGISI 683
P+Q +YH+P +L+ G + LVL EE G P + +
Sbjct: 703 PTQRYYHVPDDWLRTDGGPDTLVLFEELGGDPATVRL 739
>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
Flags: Precursor
gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
AX4]
Length = 761
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 229/736 (31%), Positives = 367/736 (49%), Gaps = 103/736 (13%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ- 88
VTYDGRSLIING RK+LFSGSIHYPR++ +MWP ++ ++K+ G+D++ T +FWN+H+P
Sbjct: 40 VTYDGRSLIINGERKLLFSGSIHYPRTSEEMWPIILKQSKDAGIDIIDTYIFWNIHQPNS 99
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
P ++ F G ++ +F+ + LYV LRIGP++ EW YGG P WL ++P IV+R N+
Sbjct: 100 PSEYYFDGNANITKFLDLCKEFDLYVNLRIGPYVCAEWTYGGFPIWLKEIPNIVYRDYNQ 159
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
+ M + +V + +A GGPIIL+Q+ENEYG +E + G Y +W+
Sbjct: 160 QWMNEMSIWMEFVVKYLD--NYFAPNGGPIILAQVENEYGWLEQEYGINGTEYAKWSIDF 217
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQVY 266
A L G+PW+MC+Q+D + IN CNG C + + P++P+ WTENW +++ +
Sbjct: 218 AKSLNIGIPWIMCQQNDI-ESAINTCNGYYCHDWISSHWEQFPNQPSFWTENWIGWFENW 276
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDE 325
G R +DI Y A FIA GS +NYYM+ GGTNFGRT+ +++T Y APLDE
Sbjct: 277 GQAKPKRPVQDILYSNARFIA-YGGSLINYYMWFGGTNFGRTSGGPWIITSYDYDAPLDE 335
Query: 326 YGLLRQPKWGHLKELHSAVK------LCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
+G +PK+ + H + L +P S +S F ++ + I +F
Sbjct: 336 FGQPNEPKFSLSSKFHQVLHAIESDLLNNQPPKSPTFLSQ-FIEVHQYGI------NLSF 388
Query: 380 LVNKDKRNN-ATVYFSNLMYELPPLSISILPDCKTVAFNTAKL--------DSVEQWEEY 430
+ N + + N Y + P S+ I+ + + + F+T+ + +++ ++
Sbjct: 389 ITNYGTSTTPKIIQWMNQTYTIQPWSVLIIYNNE-ILFDTSFIPPNTLFNNNTINNFKPI 447
Query: 431 KEAI------------------PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP- 471
+ I D S+ + +EQ+ TKD SDY WY+
Sbjct: 448 NQNIIQSIFQISDFNLNSGGGGGDGDGNSVNSVSPIEQLLITKDTSDYCWYSTNVTTTSL 507
Query: 472 ---SDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTN-NVSLLS 527
L ++ +H FI+ E+ GSA S ++ + N T + +LS
Sbjct: 508 SYNEKGNIFLTITEFYDYVHIFIDNEYQGSAFS----PSLCQLQLNPINNSTTFQLQILS 563
Query: 528 VMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIV 587
+ +GL + +++E G+ + G++ L ++ W + GL+GE ++IF + +
Sbjct: 564 MTIGLENYASHMENYTRGILGSILIGSQNL---TNNQWLMKSGLIGENIKIFNN--DNTI 618
Query: 588 PWSRYGSS-----THQPLTWYK---TVFDAP--TGSDPVAINLISMGKGEAWVNGQSIGR 637
W SS +PLTWYK ++ P S A+++ SM KG WVNG SIGR
Sbjct: 619 NWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVYALDMSSMNKGMIWVNGYSIGR 678
Query: 638 YWV-------------------------SFLTPQGTPSQSWYHIPRSFLKPTG-----NL 667
YW+ ++ PSQS Y +P +L
Sbjct: 679 YWLIEATQSICNQSAIENYSYIGEYDPSNYRIDCNKPSQSIYSVPIDWLFNNNYNNQYAT 738
Query: 668 LVLLEEENGYPPGISI 683
++++EE NG P I +
Sbjct: 739 IIIIEELNGNPNEIQL 754
>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
Length = 825
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 242/724 (33%), Positives = 367/724 (50%), Gaps = 96/724 (13%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+Y R I+G R +L GSIHYPRS+ W L+ AK GL+ ++ VFWNLH
Sbjct: 83 AGYSVSYSARGFEIDGRRTLLLGGSIHYPRSSEGEWETLLRAAKRDGLNHIEMYVFWNLH 142
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
E + G F+F+G + RF + GL++ +R GP++ EW GGLP WL+ +PG+ RS
Sbjct: 143 EQERGVFNFAGNANATRFYELAAEVGLFLHVRFGPYVCAEWSNGGLPLWLNWIPGMKVRS 202
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
N P+++ M+R+ T +V + + A GGPII++QIENE+ M P YV W
Sbjct: 203 SNAPWQWEMERFVTYMVELSRP--FLAKNGGPIIMAQIENEFAM-------HDPEYVEWC 253
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN---SPDKPAIWTENWTSF 262
L L T +PWVMC + A + ++ +CNG C + FA + P P +WTE+ +
Sbjct: 254 GDLVKRLDTSIPWVMCYANAAENTIL-SCNGNDCVD-FAVKHVKERPSDPLVWTED-EGW 310
Query: 263 YQVYGDEAR------IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTG 316
+Q + + + R+AED+AY VA + A + G+ NYYMYHGG NFGR ASA V T
Sbjct: 311 FQTWAKDKKNPLPNDQRTAEDMAYAVARWFA-VGGAAHNYYMYHGGNNFGRAASAGVTTK 369
Query: 317 YYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL------------ 364
Y D L GL +PK HL++LH A+ C ++ ++ +L
Sbjct: 370 YADGVNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDRQLLHPHELAPTHGETAEASS 429
Query: 365 --QEAFIF--QGSSECAAFLVNK-DKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTA 419
Q AFI+ + AFL N+ DK+ TV F + YEL P S+ I+ D + FNTA
Sbjct: 430 LQQRAFIYGAEDGPNQVAFLENQADKK--VTVVFRDNKYELAPTSMMIIKD-GALLFNTA 486
Query: 420 KLD-----------------SVEQWEEYKEAIPTYDETSLR--ANFLLEQMNTTKDASDY 460
+ + QWE + E + R A +EQ+ T D SDY
Sbjct: 487 DVRKSFPGTVHRAYTPIVQAATLQWETWSELNVSSLTPRRRVVAERPVEQLRLTADRSDY 546
Query: 461 LWYNFRFKHDPSDS-------ESVLKVSSL-GHVLHAFINGEFVGSAH----GKHSDKSF 508
L Y F DP+D+ S +KV+S + AF++G +G + G + K F
Sbjct: 547 LTYETTFTVDPADTPIDIDSDASTVKVTSCEASSIIAFVDGWLIGERNLAYPGGNCSKEF 606
Query: 509 TLEKMVHL-INGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGY 567
++ + +++ L+SV +G+ G+ + + G V G K L W
Sbjct: 607 RFSLPTNIDVTRQHSLKLVSVSLGIYSLGSNHTKGLTGKVRV---GRKNLA--KGHQWEM 661
Query: 568 QVGLLGEKLQIFTDYGSRIVPWS---RYGSSTHQPLTWYKT-----VFDAPTGSDPVA-- 617
L+GE+L+I+ VPW+ R +S Q ++WY T F+ P +DPV+
Sbjct: 662 YPTLVGEQLEIYRPEWLSSVPWTPVPRVVASGRQLMSWYWTSFSYPAFELPAEADPVSEP 721
Query: 618 ----INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL-KPTGNLLVLLE 672
++ I + +G A++NG +GRYW+ + +G Q +YH+PR +L K N+LV+ +
Sbjct: 722 FSILLDCIGLTRGRAYINGHDLGRYWL--VNDEGEFVQRYYHVPRDWLVKDQANVLVVFD 779
Query: 673 EENG 676
E G
Sbjct: 780 ELGG 783
>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
Length = 347
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 177/349 (50%), Positives = 225/349 (64%), Gaps = 17/349 (4%)
Query: 129 GGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG 188
GG P WL VPGI FR+DNEPFK M+++ IV+MMKA +L+ +QGGPIILSQIENE+G
Sbjct: 1 GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 60
Query: 189 MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS 248
VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F PN
Sbjct: 61 PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFK-PNK 118
Query: 249 PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRT 308
KP +WTE WT +Y +G R AED+A+ VA FI + GS++NYYMYHGGTNFGRT
Sbjct: 119 DYKPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFI-QGGGSFLNYYMYHGGTNFGRT 177
Query: 309 ASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
A + YD APLDEYGL R+PKWGHL++LH A+K C ++S QEA
Sbjct: 178 AGGPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEA 237
Query: 368 FIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ- 426
+F+ S+CAAFL N D + + V F Y+LPP SISILPDCKT +NTAK+ S
Sbjct: 238 HVFKSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ 297
Query: 427 -----------WEEY-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWY 463
W+ + +E + + + + L EQ+N T+D +DYLWY
Sbjct: 298 VQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWY 346
>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
Length = 504
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 206/506 (40%), Positives = 278/506 (54%), Gaps = 53/506 (10%)
Query: 346 LCLKPMLSGVLVSMNFSKLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLS 404
+C K ++S V + Q+A+++ S +C+AFL N D +++A V F+N+ Y LPP S
Sbjct: 1 MCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWS 60
Query: 405 ISILPDCKTVAFNTAKLDSVEQ-------------WEEYKEAIPTYDETSLRANFLLEQM 451
+SILPDC+ FNTAK+ WE ++E + T++ A+ LLEQ+
Sbjct: 61 VSILPDCRNAVFNTAKVGVQTSQMQMLPTNSERFSWESFEEDTSSSSATTITASGLLEQI 120
Query: 452 NTTKDASDYLWYNFRFKHDPSDSESVLK--------VSSLGHVLHAFINGEFVGSAHGKH 503
N T+D SDYLWY D SES L V S GH +H FING GSA+G
Sbjct: 121 NVTRDTSDYLWYITSV--DVGSSESFLHGGKLPSLIVQSTGHAVHVFINGRLSGSAYGTR 178
Query: 504 SDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKELK-DFS 561
D+ F V+L GTN ++LLSV VGLP+ G + E G L V I G + K D S
Sbjct: 179 EDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHGLDKGKLDLS 238
Query: 562 SFSWGYQVGLLGEKLQIFTDYGSRIVPW--SRYGSSTHQPLTWYKTVFDAPTGSDPVAIN 619
W YQVGL GE + + + G V W S +QPLTW+KT FDAP G +P+A++
Sbjct: 239 WQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPEGEEPLALD 298
Query: 620 LISMGKGEAWVNGQSIGRYWV--------------SFLTPQ-----GTPSQSWYHIPRSF 660
+ MGKG+ W+NG SIGRYW SF P+ G P+Q WYH+PRS+
Sbjct: 299 MDGMGKGQIWINGISIGRYWTAIATGSCNDCNYAGSFRPPKCQLGCGQPTQRWYHVPRSW 358
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
LK NLLV+ EE G P IS+ SV+++C VS+ H P + +W + +
Sbjct: 359 LKQNHNLLVVFEELGGDPSKISLAKRSVSSVCADVSEYH-PNLKNWHIDSYGKSENF--- 414
Query: 721 PGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
R PKV + C G+ IS I FAS+G P G C +Y G+CHSS+S I+E+ C+GK C
Sbjct: 415 --RPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQGACHSSSSYDILEQKCIGKPRCI 472
Query: 781 VPVWTEKFYGDPCPGIPKALLVDAQC 806
V V F DPCP + K L V+A C
Sbjct: 473 VTVSNSNFGRDPCPNVLKRLSVEAVC 498
>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
Length = 473
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 193/473 (40%), Positives = 260/473 (54%), Gaps = 44/473 (9%)
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-Y 312
+WTE WT ++ +G R ED+A+ VA FI K GS+VNYYMYHGGTNF RT+ +
Sbjct: 1 MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQK-GGSFVNYYMYHGGTNFDRTSGGPF 59
Query: 313 VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQG 372
+ T Y AP+DEYGLLRQPKWGHL++LH A+K ++SG + ++A++F+
Sbjct: 60 IATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKS 119
Query: 373 S-SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------ 425
S CAAFL N A V F+ Y+LP SIS+LPDCK FNTA +
Sbjct: 120 SGGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMS 179
Query: 426 -----QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDS 474
W+ Y EA + D + + L+EQ++ T D SDYLWY N + S
Sbjct: 180 PAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQ 239
Query: 475 ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPD 534
L + S GH L F+NG+ G+ +G + T V + G+N +S+LS VGLP+
Sbjct: 240 WPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPN 299
Query: 535 SGAYLER-RVAGLRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
G + E V L V++ G E K D S W YQ+GL GE L + + GS V W
Sbjct: 300 QGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGS- 358
Query: 593 GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW------------- 639
++ QPLTW+K F AP+G PVA+++ SMGKG+AWVNG+ IGRYW
Sbjct: 359 -AAGKQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSGCGGCS 417
Query: 640 -------VSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
T G SQ +YH+PRS+L P+GNLLV+LEE G G+ + T
Sbjct: 418 YAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVT 470
>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
Length = 268
Score = 332 bits (850), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 151/249 (60%), Positives = 185/249 (74%), Gaps = 2/249 (0%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
NV YD R+L+I+G R++L SGSIHYPRSTPQMWP LI K+K+GGLDV++T VFWNLHEP
Sbjct: 20 TNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEP 79
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
GQ+DF GR+DLV+F+K V GLYV LRIGP++ EW YGG P WLH +PGI FR+DN
Sbjct: 80 VKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 139
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK MKR+ IV++MK +LYASQGGPIILSQIENEYG ++ + G Y+ WAAK
Sbjct: 140 EPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWAAK 199
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A L TGVPWVMC+Q DAPDP+IN CNG C + PNS KP +WTENW+ ++ +G
Sbjct: 200 MATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQF--TPNSNTKPKMWTENWSGWFLSFG 257
Query: 268 DEARIRSAE 276
R E
Sbjct: 258 GAVPHRPVE 266
>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
Length = 811
Score = 330 bits (846), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 241/708 (34%), Positives = 344/708 (48%), Gaps = 84/708 (11%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V Y R +I+G IL GSIHY RSTP W L+AKAKE GL++VQ +FWN H
Sbjct: 95 NGYDVKYTKRGFVIDGKASILLGGSIHYARSTPDTWDSLLAKAKEDGLNLVQLYIFWNFH 154
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
EP+ G F F+ R +L F + V A GL+V LR GP++ EW GGLP WL +PG+ RS
Sbjct: 155 EPRRGSFYFADRGNLTHFFERVVAHGLFVHLRFGPYVCAEWNRGGLPLWLDRIPGMKVRS 214
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
++E ++ M R +++N+ + ++ GGPII++QIENEY P YV W
Sbjct: 215 NSESWRQEMNRIILIMINLARP--YFSVNGGPIIMAQIENEYN-------GHDPTYVAWL 265
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS---PDKPAIWTEN---- 258
++L L G+PW MC A + I+ CN C + FA N+ P +P +WTEN
Sbjct: 266 SQLVRKLGIGIPWTMCNGASAVN-TISTCNDNDCFQ-FAEKNAKVFPSQPLVWTENEAWY 323
Query: 259 --WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTG 316
W + + RS E +AY VA + A + G+ NYYMYHGG NFGRTASA V T
Sbjct: 324 EKWATKNIAQDGQNDQRSPEQVAYVVARWFA-VGGAMHNYYMYHGGNNFGRTASAGVTTM 382
Query: 317 YYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK----------LQE 366
Y D A L GL +PK HL++LH + C K +LS +N +K Q
Sbjct: 383 YADGAILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNER-QLNHAKPLGPEGKNAYTQR 441
Query: 367 AFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV-- 424
A+I+ S FL N + A + Y LPP +I IL D V +NT+ +
Sbjct: 442 AYIYGNCS----FLENTHAIHRACFRYQLKEYCLPPQTIVIL-DHNNVLYNTSDVSGTLG 496
Query: 425 ------------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFR 466
+ W E+ + P + + LEQ+ T+D +DYL Y
Sbjct: 497 SRSTRSFSPLIRFRKSDWKIWSEW-DVNPHNVRDQIVNDSPLEQLLVTQDTTDYLMYQNE 555
Query: 467 FK---HDPSDSE---SVLK-VSSLGHVLHAFINGEFVGSAH----GKHSDKSFTLEKMVH 515
+ + P+ ++ S+LK +S + FINGEF+G H G F +
Sbjct: 556 VRWGSNGPTKNKMKSSILKFISCDANSFLVFINGEFIGEQHLAYPGDDCSNIFRFDLGPL 615
Query: 516 LINGTN-NVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGE 574
G N +S+LS+ +G+ G E+ G+ + + L W GL+GE
Sbjct: 616 GKYGANLTLSILSISLGIHSLG---EKHQKGIVSDVQIDERSLVYGPHERWVMFSGLIGE 672
Query: 575 KLQIFTDYGSRIVPWSRYGSSTHQPLT--WYKTVF-----DAPTGSDPVAINLISMGKGE 627
L+++ S VPW T + T WY T F D T + V ++ M +G
Sbjct: 673 LLKLYDPMWSNSVPWRNLNVQTDRKRTSKWYMTKFVLKQLDWDTETS-VLLDCKGMNRGR 731
Query: 628 AWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG--NLLVLLEE 673
++NG +GRYW+ G Q +Y IP ++L N LV+ EE
Sbjct: 732 IYLNGHDLGRYWL-IRRSDGAYVQRYYTIPVAWLHAANKSNYLVIFEE 778
>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
Length = 721
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 217/705 (30%), Positives = 353/705 (50%), Gaps = 63/705 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
VTYD RS ++G R I +GS+HYPR+TP+MW ++ +A E GL+++Q FWNLHEP
Sbjct: 35 VTYDERSFFLDGKRSIFLAGSVHYPRATPEMWDTILDQAVEDGLNLIQIYTFWNLHEPVK 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+++ G D+ F+++ +GL+V +RIGP++ EW GG+P W++ + G+ R++N+
Sbjct: 95 GQYNWEGIADIRLFLQKCADRGLFVNMRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDV 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+K M + ++ + + +A +GGPII SQIENE Y+ W + A
Sbjct: 155 WKKEMGDWMKVLTDYTR--DFFADRGGPIIFSQIENE-------LWGGAREYIDWCGEFA 205
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETF-----AGPNSPDKPAIWTENWTSFYQ 264
L+ VPW+MC D + INACNG C +G D+P WTEN ++Q
Sbjct: 206 ESLELNVPWMMC-NGDTSEKTINACNGNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQ 263
Query: 265 VYGDEA---------RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLT 315
++G + RSAED ++V F+ + GSY NYYM+ GG ++G+ A +
Sbjct: 264 IHGAASAERDDYEGWDARSAEDYTFNVLKFMDR-GGSYHNYYMWFGGNHYGKWAGNGMTN 322
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ----EAFIFQ 371
Y + + L +PK H ++H + + +L+ N L AF ++
Sbjct: 323 WYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYR 382
Query: 372 GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE------ 425
+F+ N +K + V + +++YELP S+ +L + V F T + V
Sbjct: 383 YGDRLVSFVEN-NKGSADKVIYRDIVYELPAWSMIVLDEYDNVLFETNNVKPVNKHRVYH 441
Query: 426 -----QWEEYKEAIPTYDETSLR------ANFLLEQMNTTKDASDYLWYNFRFKHDPSDS 474
++E + E + T + + R AN EQ+N T+D +++L+Y + P D
Sbjct: 442 CEEKLEFEYWNEPVSTLSQEAPRVVVSPKAN---EQLNMTRDLTEFLYYETEVEF-PQDE 497
Query: 475 ESVLKVSSLGHVLHAFINGEFVGS-AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
++ + + A+++ FVGS H D T+ + G + + LLS +G+
Sbjct: 498 CTLSIGGTDANAFVAYVDDHFVGSDDEHTHHDGWHTMNINMKSGKGKHKLVLLSESLGVS 557
Query: 534 DS-GAYLERRVAGLRNVSIQGAKEL--KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWS 590
+ + L+ A R I G +L D + W + GL+GE Q+FTD G + V W
Sbjct: 558 NGMDSNLDPSWASSRLKGICGWIKLCGNDIFNQEWKHYPGLVGEAKQVFTDEGMKTVTW- 616
Query: 591 RYGSSTHQPLTWYKTVFDAPTGSD---PVAINLISMGKGEAWVNGQSIGRYWVSFLTPQG 647
+ L WY++ F P G V + M +G+A+VNG +IGRYW+ G
Sbjct: 617 KSDVENADNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRGQAYVNGHNIGRYWM-IKDGNG 675
Query: 648 TPSQSWYHIPRSFLKPTG--NLLVLLEEENGYPPGISIDTVSVTT 690
+Q +YHIP+ +LK G N+LVL E P ++I T +
Sbjct: 676 EYTQGYYHIPKDWLKGEGEENVLVLGETLGASDPSVTICTTEYVS 720
>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
Length = 446
Score = 322 bits (824), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 173/442 (39%), Positives = 242/442 (54%), Gaps = 28/442 (6%)
Query: 388 NATVYFSNLMYELPPLSISILPDCKTVAFNTAKL------------DSVEQ---WEEYKE 432
+ TV F + +P S+SIL DCKTV +NT ++ D + WE Y E
Sbjct: 2 DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNVWEMYSE 61
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWY--NFRFKHDP----SDSESVLKVSSLGHV 486
AIP + +T +R LEQ N TKD SDYLWY +FR + D D V+++ S H
Sbjct: 62 AIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIKSTAHA 121
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
+ F N FVG+ G +KSF EK + L G N++++LS +G+ DSG L G+
Sbjct: 122 MIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVEVKGGI 181
Query: 547 RNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKT 605
++ +QG D WG++ L GE +I+T+ G W + P+TWYK
Sbjct: 182 QDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKP--AENDLPITWYKR 239
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG 665
FD P G DP+ +++ SM KG +VNG+ IGRYW SF+T G PSQS YHIPR+FLKP G
Sbjct: 240 YFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLAGHPSQSVYHIPRAFLKPKG 299
Query: 666 NLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRP 725
NLL++ EEE G P GI I TV +C +S+ + + +W S + + R
Sbjct: 300 NLLIIFEEELGKPGGILIQTVRRDDICVFISEHNPAQIKTWESDGGQIKLIAEDTSTRG- 358
Query: 726 KVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWT 785
+ CP R I +++FAS+GNP G C N+ G+CH+ +++AIVEK CLGK SC +PV
Sbjct: 359 --TLNCPPKRTIQEVVFASFGNPEGACGNFTAGTCHTPDAKAIVEKECLGKESCVLPVVN 416
Query: 786 EKFYGD-PCPGIPKALLVDAQC 806
+ D CP L V +C
Sbjct: 417 TVYGADINCPATTATLAVQVRC 438
>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 249
Score = 321 bits (822), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 143/208 (68%), Positives = 172/208 (82%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G VTYDGR+LI++G R++LFSG +HYPRSTP+MWP LIAKAK+GGLDV+QT VFWN
Sbjct: 33 AGRGEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNA 92
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEP GQF+F GR DLV+FI+E+ AQGLYV LRIGPF+E EW YGGLPFWL +P I FR
Sbjct: 93 HEPVQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYGGLPFWLRGIPNITFR 152
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
SDNEPFK HM+++ T IVN+MK RL+ QGGPII+SQIENEY +VE +F KG YV W
Sbjct: 153 SDNEPFKRHMQKFVTKIVNLMKDERLFYPQGGPIIISQIENEYKLVEAAFHSKGSSYVHW 212
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVIN 232
AA +AV+LQTGVPW+MCKQDDAPDP+++
Sbjct: 213 AAAMAVNLQTGVPWMMCKQDDAPDPIVS 240
>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
Length = 244
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 142/204 (69%), Positives = 168/204 (82%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +TYDGR+L+++G R++ FSG +HY RSTP+MWP+LIAKAK GGLDV+QT VFWN+HE
Sbjct: 26 GREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVHE 85
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P GQ++F GR DLV+FI+E+QAQGLYV LRIGPF+E EW YGG PFWLHDVP I FRSD
Sbjct: 86 PIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRSD 145
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
NEPFK HM+ + T IV MMK LY QGGPII+SQIENEY M+E +F GP YVRWAA
Sbjct: 146 NEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWAA 205
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPV 230
+AV LQTGVPW+MCKQ+DAPDPV
Sbjct: 206 AMAVGLQTGVPWMMCKQNDAPDPV 229
>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
Length = 402
Score = 317 bits (812), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 166/383 (43%), Positives = 237/383 (61%), Gaps = 27/383 (7%)
Query: 294 VNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLS 353
NYYMYHGGTNFGRT++A+V+ YYD+APLDE+GL ++PKWGHL++LH A+KLC K +L
Sbjct: 2 TNYYMYHGGTNFGRTSAAFVMPKYYDEAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61
Query: 354 GVLVSMNFSKLQEAFIFQGSSE--CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDC 411
G + K EA +F+ + C AFL N + +++ T+ F Y +P SISIL DC
Sbjct: 62 GKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADC 121
Query: 412 KTVAFNTAKLDSVEQ---------------WEEY-KEAIPTYDETSLRANFLLEQMNTTK 455
KTV F T +++ W+ + +E +P Y ++ +R + N TK
Sbjct: 122 KTVVFGTQHVNAQHNQRTFHFADQTTQNNVWQMFDEEKVPKYKQSKIRLRKAGDLYNLTK 181
Query: 456 DASDYLWYNFRFKHDPSDS------ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFT 509
D +DY+WY FK + D ++VL+V+S GH AF+N +FVG HG +K+FT
Sbjct: 182 DKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFT 241
Query: 510 LEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQ 568
LEK + L G N+V++L+ +G+ DSGAYLE R+AG+ V I+G D ++ WG+
Sbjct: 242 LEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHI 301
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
VGL+GE+ QI+TD G V W + +PLTWYK FD P+G DP+ +++ +MGKG
Sbjct: 302 VGLVGEQKQIYTDKGMGSVTWK--PAVNDRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLM 359
Query: 629 WVNGQSIGRYWVSFLTPQGTPSQ 651
+VNGQ IGRYW+S+ G PSQ
Sbjct: 360 FVNGQGIGRYWISYKHALGRPSQ 382
>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
Length = 450
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 187/450 (41%), Positives = 249/450 (55%), Gaps = 45/450 (10%)
Query: 278 IAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGH 336
+A+ VA FI K GS+VNYYMYHGGTNF RT+ ++ T Y AP+DEYGLLRQPKWGH
Sbjct: 1 MAFAVARFIQK-GGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGH 59
Query: 337 LKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS-SECAAFLVNKDKRNNATVYFSN 395
L++LH A+K ++SG + ++A++F+ S CAAFL N A V F+
Sbjct: 60 LRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNG 119
Query: 396 LMYELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPTYDETSLRA 444
Y+LP SIS+LPDCK FNTA + W+ Y EA + D +
Sbjct: 120 RRYDLPAWSISVLPDCKAAVFNTATVSEPSAPARMSPAGGFSWQSYSEATNSLDGRAFTK 179
Query: 445 NFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGS 498
+ L+EQ++ T D SDYLWY N + S L V S GH L F+NG+ G+
Sbjct: 180 DGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHSLQVFVNGQSYGA 239
Query: 499 AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER-RVAGLRNVSIQGAKEL 557
+G + T V + G+N +S+LS VGLP+ G + E V L V++ G E
Sbjct: 240 VYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEG 299
Query: 558 K-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPV 616
K D S+ W YQ+GL GE L + + GS V W ++ QPLTW+K F AP+G PV
Sbjct: 300 KRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGS--AAGKQPLTWHKAYFSAPSGDAPV 357
Query: 617 AINLISMGKGEAWVNGQSIGRYW---------------------VSFLTPQGTPSQSWYH 655
A+++ SMGKG+AWVNG+ IGRYW T G SQ +YH
Sbjct: 358 ALDMGSMGKGQAWVNGRHIGRYWSYKASSSGGCGGCSYAGTYSETKCQTGCGDVSQRYYH 417
Query: 656 IPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
+PRS+L P+GNLLVLLEE G PG+ + T
Sbjct: 418 VPRSWLNPSGNLLVLLEEFGGDLPGVKLVT 447
>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
Length = 285
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/288 (54%), Positives = 197/288 (68%), Gaps = 4/288 (1%)
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW +GG P WL VPGI FR+DN PFK M+++ IVNMMKA +L+ Q GPII+SQIE
Sbjct: 1 EWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQIE 60
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG +E G Y +WAA++AV L TGVPW+MCKQ+DAPDP+I+ CNG C E F
Sbjct: 61 NEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYC-ENFM 119
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
PN+ KP ++TE WT +Y +G R AED+AY VA FI + +GS++NYYMYHGGTN
Sbjct: 120 -PNANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFI-QNRGSFINYYMYHGGTN 177
Query: 305 FGRTASA-YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
FGRTA ++ T Y APLDEYGL R+PKWGHL++LH +KLC ++S +
Sbjct: 178 FGRTAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGS 237
Query: 364 LQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDC 411
QEA +F + CAAFL N D + + V F NL Y+LPP S+SILPDC
Sbjct: 238 NQEAHVFWTKTSCAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285
>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
Length = 383
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 170/385 (44%), Positives = 230/385 (59%), Gaps = 29/385 (7%)
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAA 378
PLDE+GL R+PKWGHLK++H A+ LC + + G ++ Q+A ++Q G+S CAA
Sbjct: 4 GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAA 63
Query: 379 FLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ------------ 426
L N + R V F LP SIS+LPDCKTV FNT + +
Sbjct: 64 LLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNSRNFVRSEIANK 123
Query: 427 ---WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRF---KHD---PSDSESV 477
WE Y+E P + + E + TKD +DY WY + D + V
Sbjct: 124 NFNWEMYREVPPV--GLGFKFDVPRELFHLTKDTTDYAWYTTSLLLGRRDLPMKKNVRPV 181
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
L+V+SLGH +HA++NGE+ GSAHG +KSF ++ L G N+++LL +VGLPDSGA
Sbjct: 182 LRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRELSSLKEGENHIALLGYLVGLPDSGA 241
Query: 538 YLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
Y+E+R AG R+++I G D S WG+QVG GEK ++FT+ GS+ V W++
Sbjct: 242 YMEKRFAGPRSITILGLNTGTLDISQNGWGHQVGTDGEKKKLFTEEGSKSVQWTK--PDQ 299
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
PLTWYK FDAP G +PVAI + MGKG WVNG+SIGRYW ++L+P P+QS YHI
Sbjct: 300 GGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSPLKKPTQSEYHI 359
Query: 657 PRSFLKPTGNLLVLLEEENGYPPGI 681
PR++LKP NL+VLLEEE G P +
Sbjct: 360 PRAYLKPK-NLIVLLEEEGGNPKDV 383
>gi|297734971|emb|CBI17333.3| unnamed protein product [Vitis vinifera]
Length = 447
Score = 309 bits (792), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 181/432 (41%), Positives = 248/432 (57%), Gaps = 50/432 (11%)
Query: 220 MCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIA 279
MCKQ DAPDPVIN C GR CG+TF GPN P+K ++ TE Y + ++ + I
Sbjct: 1 MCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSVSTE--------YLETPHLKGQQKIL 52
Query: 280 YHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKE 339
+ +LFI+K G+ NYYMY+ TNFGRT S++ T YYD+APLDEYGL R+ KWGHL++
Sbjct: 53 H--SLFISK-NGTLANYYMYYSVTNFGRTTSSFATTCYYDEAPLDEYGLPRETKWGHLRD 109
Query: 340 LHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ--GSSECAAFLVNKDKRNNATVYFSNLM 397
LH+A++L K +L GV + + EA I++ GS+ CA FL+N R T
Sbjct: 110 LHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGSK 169
Query: 398 YELPPLSISILPDCKTVAFNT------------AKLDSVEQWEEYKEAIPTYDETSLRAN 445
Y LP SIS LPDCKTV FNT + DS+ + +A+PTY+E +
Sbjct: 170 YYLPQHSISNLPDCKTVVFNTQTVASNYLIFPFSMFDSLNEPNMKTDALPTYEECPTKTK 229
Query: 446 FLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFV------GSA 499
+E M TKD +DYLWY + D V +VS+LGHV+HAF+NGE+V G+
Sbjct: 230 SPVELMTMTKDTTDYLWYTTK-----KDVLRVPQVSNLGHVMHAFLNGEYVMEFYLTGTR 284
Query: 500 HGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELK- 558
HG + +KSF K + L G N ++ L VGLPDSG+Y+E R+AG+ NV+IQG
Sbjct: 285 HGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHRLAGVHNVAIQGLNTRTI 344
Query: 559 DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKT-----VFDAPTGS 613
D WG++VGL G+KL +FT S+ S H P + KT V TG
Sbjct: 345 DLPKNGWGHKVGLNGDKLHLFTQPPSQ--------SVYHVPRAFLKTSDNLLVLFEETGR 396
Query: 614 DPVAINLISMGK 625
+P I ++++ +
Sbjct: 397 NPDGIEILTLNR 408
Score = 63.9 bits (154), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/81 (40%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 649 PSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRS 708
PSQS YH+PR+FLK + NLLVL EE P GI I T++ T+C ++S+ H V SW+
Sbjct: 369 PSQSVYHVPRAFLKTSDNLLVLFEETGRNPDGIEILTLNRDTICCYISEHHPTHVRSWKR 428
Query: 709 QNQRTLKTHKRIPGRRPKVQI 729
+ + G +PK ++
Sbjct: 429 EAS---DIQMFVDGVKPKAKL 446
>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
Length = 286
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 156/289 (53%), Positives = 192/289 (66%), Gaps = 5/289 (1%)
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW +GG P WL VPGI FR+DNEPFK M+ + IV MMK +L+ SQGGPIILSQIE
Sbjct: 1 EWNFGGFPVWLKFVPGISFRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQIE 60
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEY F G Y+ WAA++A L TGVPWVMCK+ DAPDPVIN CNG C +
Sbjct: 61 NEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKF-- 118
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
PN P KP +WTE WT ++ +G R ED+A+ VA FI + GS+VNYYMYHGGTN
Sbjct: 119 SPNKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFI-QAGGSFVNYYMYHGGTN 177
Query: 305 FGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
FGRTA +T YD AP+DEYGL+R+PK+ HLKELH AVKLC +L M+
Sbjct: 178 FGRTAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYADPYVMSLGN 237
Query: 364 LQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDC 411
++A +F +S CAAFL N + +++A V F+ + LPP SISILPDC
Sbjct: 238 YEQAHVFSSTSGGCAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286
>gi|293334807|ref|NP_001170541.1| uncharacterized protein LOC100384558 [Zea mays]
gi|238005922|gb|ACR33996.1| unknown [Zea mays]
Length = 345
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 214/337 (63%), Gaps = 7/337 (2%)
Query: 473 DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGL 532
D ++VL+V+S GH AF+N +FVG HG +K+FTLEK + L G N+V++L+ +G+
Sbjct: 6 DIKTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGM 65
Query: 533 PDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
DSGAYLE R+AG+ V I+G D ++ WG+ VGL+GE+ QI+TD G V W
Sbjct: 66 MDSGAYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKP 125
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQ 651
+ +PLTWYK FD P+G DP+ +++ +MGKG +VNGQ IGRYW+S+ G PSQ
Sbjct: 126 --AVNDRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISYKHALGRPSQ 183
Query: 652 SWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQ 711
YHIPRSFL+ N+LVL EEE G P I I TV +C +S+ + + SW ++
Sbjct: 184 QLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVKRDNICTFISERNPAHIKSWERKDS 243
Query: 712 RTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEK 771
+ T + +P+ + C + I +++FASYGNP G C NY IGSCH+ ++ +VEK
Sbjct: 244 QITVTAADL---KPRATLTCSPKKLIQQVVFASYGNPMGICGNYTIGSCHTPRAKELVEK 300
Query: 772 ACLGKRSCTVPVWTEKFYGD-PCPGIPKALLVDAQCT 807
ACLGKR CT+PV + + GD CPG L V A+C+
Sbjct: 301 ACLGKRICTLPVSADVYGGDVNCPGTTATLAVQAKCS 337
>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
Length = 263
Score = 293 bits (749), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 147/266 (55%), Positives = 181/266 (68%), Gaps = 4/266 (1%)
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
+DNEPFK M+++ IV+MMKA +L+ SQGGPIILSQIENE+G VE G Y +W
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F PN KP +WTE WT +Y
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFT-PNKNYKPKMWTEVWTGWYT 118
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPL 323
+G R AED+A+ +A I K GS+VNYYMYHGGTNFGRTA + YD APL
Sbjct: 119 EFGGAVPTRPAEDLAFSIARLIQK-GGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPL 177
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNK 383
DEYGL R+PKWGHL++LH A+K ++S + QEA +F+ S CAAFL N
Sbjct: 178 DEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVFKSKSGCAAFLANY 237
Query: 384 DKRNNATVYFSNLMYELPPLSISILP 409
D +++A V F N YELPP SISILP
Sbjct: 238 DTKSSAKVSFGNGQYELPPWSISILP 263
>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
Length = 263
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 148/266 (55%), Positives = 179/266 (67%), Gaps = 4/266 (1%)
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
+DNEPFK M+++ IV+MMKA +L+ SQGGPIILSQIENE+G VE G Y +W
Sbjct: 1 TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
AA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F PN KP +WTE WT +Y
Sbjct: 61 AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC-ENFT-PNKNYKPKMWTEVWTGWYT 118
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPL 323
+G R AED+A+ +A FI K GS VNYYMYHGGTNFGRTA + YD APL
Sbjct: 119 EFGGAVPTRPAEDLAFSIARFIQK-GGSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPL 177
Query: 324 DEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNK 383
DEYGL R+PKWGHL+ LH A+K ++S + QEA F+ S CAAFL N
Sbjct: 178 DEYGLPREPKWGHLRNLHKAIKSSESALVSAEPSVTSLGNSQEAHAFKSKSGCAAFLANY 237
Query: 384 DKRNNATVYFSNLMYELPPLSISILP 409
D +++A V F N YELPP SISILP
Sbjct: 238 DTKSSAKVSFGNGQYELPPWSISILP 263
>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
Length = 425
Score = 289 bits (739), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 164/425 (38%), Positives = 237/425 (55%), Gaps = 53/425 (12%)
Query: 318 YDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSS-EC 376
YD AP+DEYGL R PKWGHLK+LH A+KLC +L G V+++ EA ++ SS C
Sbjct: 1 YD-APVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVNVSLGPSVEADVYTDSSGAC 59
Query: 377 AAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVE----------- 425
AAF+ N D +N+ TV F N Y +P S+SILPDCK V +NTAK+ +
Sbjct: 60 AAFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTNKIAMIPEKLQ 119
Query: 426 ---------QWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD--- 473
+W+ +KE + + N ++ +NTTKD +DYLW+ D ++
Sbjct: 120 QSDKGQKTFKWDVWKENPGIWGKPDFVINGFVDHINTTKDTTDYLWHTTSISIDENEELL 179
Query: 474 ---SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMV 530
S+ VL + S GH LHAF+N ++ G+A+G S +FT + + L G N ++LLS+ V
Sbjct: 180 KKGSKPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKNPISLKAGKNEIALLSLTV 239
Query: 531 GLPDSGAYLERRVAGLRNVSIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPW 589
GL +G + + AG+ +V I+G + D SS +W Y++G+ GE L+I+ G V W
Sbjct: 240 GLQTAGPFYDFVGAGVTSVKIKGLNNKTIDLSSNAWTYKIGVQGEHLKIYQGNGLNSVSW 299
Query: 590 SRYGSSTH-QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW--VSFLTPQ 646
+ Q LTWYK + DAP G +PV ++++ MGKG AW+NG+ IGRYW +S +
Sbjct: 300 TSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAWLNGEGIGRYWPRISEFKKE 359
Query: 647 ---------------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
G PSQ WYH+PRS+ KP+GN+LV EE+ G P I+
Sbjct: 360 DCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVFFEEKGGDPTKITFVR 419
Query: 686 VSVTT 690
V+T
Sbjct: 420 RKVST 424
>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
max]
Length = 482
Score = 288 bits (737), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 141/324 (43%), Positives = 195/324 (60%), Gaps = 9/324 (2%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD S IIN + I+FSG +HYP ST +WP + + K GGLD +++ +FW+ HEP
Sbjct: 9 VSYDAHSHIINEEKHIIFSGVVHYPXSTVDLWPAIFKRXKYGGLDAIESYIFWDRHEPVR 68
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
++D SG D + F+K +Q LY LRIGP++ W +GG WLH++P I R DN
Sbjct: 69 REYDCSGNLDFIDFLKLIQEAELYFILRIGPYVCEXWNFGGFSLWLHNMPEIELRIDNPI 128
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
K M+ + T IVNM K A+L+A GGPIIL+ IENEYG + + E PY++W A++A
Sbjct: 129 XKNEMQIFTTKIVNMAKEAKLFAPXGGPIILTPIENEYGNIMTDYREARKPYIKWCAQMA 188
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE 269
+ GVPW+MC DAP P+IN CNG C ++F PN+P ++ +Q +G+
Sbjct: 189 LTQNIGVPWIMCXXRDAPQPMINTCNGHYC-DSFX-PNNPKSSKMFRX-----FQKWGER 241
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEYGL 328
+SAE+ + VA F + G NYYMYHGGTNFG +T Y+ APLDEYG
Sbjct: 242 VPHKSAEESTFSVARFF-QSGGILNNYYMYHGGTNFGHMVGGPYMTASYEYDAPLDEYGN 300
Query: 329 LRQPKWGHLKELHSAVKLCLKPML 352
L +PKW H K+LH + + L
Sbjct: 301 LNKPKWEHFKQLHKELTFDVSDFL 324
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 38/53 (71%)
Query: 731 CPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPV 783
C G+ IS+I FAS+GNP GNC ++ G+ +++S+++VE AC+G+ SC V
Sbjct: 425 CQIGKTISQIQFASFGNPEGNCGSFKGGTWEATDSQSVVEVACIGRNSCGFTV 477
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 50/94 (53%), Gaps = 5/94 (5%)
Query: 552 QGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAP 610
Q KEL D S F W Y + + ++ + R+ S G + ++ F+AP
Sbjct: 311 QLHKELTFDVSDFLW-YMTSIDIPDISLWNNSTLRV---STMGHTLRAYVSGRADDFEAP 366
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
G DP+ ++L GK +AWVNG+SIG YW S++T
Sbjct: 367 FGIDPMVMDLQDSGKRQAWVNGKSIGCYWSSWIT 400
>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
lyrata]
Length = 448
Score = 283 bits (723), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 199/335 (59%), Gaps = 40/335 (11%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGN-----------NVTYDGRSLIINGHRKILFSGSI 51
+ + L L+++ + G GGG VTYDG SLIING R++LFS S+
Sbjct: 4 RTRYLIAILLVVSLCSKASHGHGGGEVDDDNDEKKKKGVTYDGTSLIINGKRELLFSVSV 63
Query: 52 HYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQG 111
HYPRSTP MWP +I KA+ GGL+ +QT VFWN+HEP+ ++DF GR DLV FIK +Q +G
Sbjct: 64 HYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEHRKYDFKGRFDLVTFIKLIQEKG 123
Query: 112 LYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLY 171
LYV LR+GPFI+ EW +GGLP+WL +VP + FR+DNEPFK H +RY I+ MMK +L
Sbjct: 124 LYVTLRLGPFIQAEWNHGGLPYWLREVPEVYFRTDNEPFKEHTERYVRKILGMMKEEKLL 183
Query: 172 ASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVI 231
ASQ L ENE V+ ++ E G Y++WAA L ++ G+PWVMCKQ++A D +I
Sbjct: 184 ASQRRSHHLG-TENECNAVQLAYKENGERYIKWAANLVESMKLGIPWVMCKQNNASDNLI 242
Query: 232 NACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKG 291
NACNGR C ++ G I +EDIA+ VA + +K G
Sbjct: 243 NACNGRHC-----------------------FEFLGILQLIEQSEDIAFSVARYFSK-NG 278
Query: 292 SYVNYYM----YHGGTNFGRTASAYVLTGYYDQAP 322
S+VNYYM YH +F + + ++ P
Sbjct: 279 SHVNYYMMVDRYHIPRSFMKEEKKKNMLVILEEEP 313
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 69/121 (57%), Gaps = 6/121 (4%)
Query: 654 YHIPRSFLK--PTGNLLVLLEEENGYP-PGISIDTVSVTTLCGHVSDSHLPPVISWRSQN 710
YHIPRSF+K N+LV+LEEE G I V+ T+C +V + + V SW+ +
Sbjct: 290 YHIPRSFMKEEKKKNMLVILEEEPGVKLEAIDFVLVNRDTICSYVGEDYPVSVKSWKRER 349
Query: 711 QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVE 770
+ K + R K ++CP +++ + FAS+G+P G C N+ +G C +S S+ +VE
Sbjct: 350 PKIASRSKDM---RLKAVMKCPPEKQMVAVEFASFGDPTGTCGNFTMGKCSASKSKEVVE 406
Query: 771 K 771
K
Sbjct: 407 K 407
>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
Length = 317
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 199/320 (62%), Gaps = 26/320 (8%)
Query: 510 LEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQ 568
E + LI GTN+++LLSVMVGLP+SG + ER++AG+ V+++G K+ +D S W YQ
Sbjct: 2 FELPISLIPGTNDIALLSVMVGLPNSGGHFERKIAGISTVTLRGFKDGTRDLSQELWTYQ 61
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
+GLLGE I++D G V W+ S+ + PLTWYK V D P G +PV ++L SMGKG+A
Sbjct: 62 IGLLGEMSTIYSDVGFISVNWTS-SSTPNPPLTWYKAVIDVPDGDEPVILDLSSMGKGQA 120
Query: 629 WVNGQSIGRYWVSFLTPQG---------------------TPSQSWYHIPRSFLKPTGNL 667
W+NG+ IGRYW+SFL P G PSQ+ YH+PRS+L+PTGNL
Sbjct: 121 WINGEHIGRYWISFLAPLGDCSKCDYRGNYSLHKCATNCGQPSQTLYHVPRSWLRPTGNL 180
Query: 668 LVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKV 727
LVL EE G P +S+ T S+ ++C H ++H P + SW+ + + + P +
Sbjct: 181 LVLFEETGGDPSKVSLLTRSIDSVCAHAFETHPPSIQSWQKTKVNSEVLRENV---EPSL 237
Query: 728 QIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEK 787
Q+ C GR+IS I FAS+GNP G C N+ G+CHS S VEKACLG+ C++ ++
Sbjct: 238 QLDCSVGRRISSIKFASFGNPKGVCGNFMKGTCHSVESEKAVEKACLGQHGCSITNSPKE 297
Query: 788 FYGDPCPGIPKALLVDAQCT 807
F GD C G K+L V+A C+
Sbjct: 298 FGGDACVGTVKSLAVEATCS 317
>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
Length = 418
Score = 279 bits (713), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 162/422 (38%), Positives = 227/422 (53%), Gaps = 42/422 (9%)
Query: 47 FSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKE 106
F GS+HYPR P+MWP + KAK QF+F G DL++FIK
Sbjct: 9 FYGSVHYPRCPPEMWPDIFKKAK---------------------QFNFEGNYDLIKFIKM 47
Query: 107 VQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMK 166
+ G+ +C++ +E LP WL ++P I+FRSDN+PF +HM+++ MI+ M+
Sbjct: 48 I---GIMICMQ---HLELVHSLKELPIWLREIPNIIFRSDNQPFMYHMEQFTKMIIKKMR 101
Query: 167 AARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDA 226
+ + + QIENE+ V+ ++ E G YV+W +AV L TGVPW+MCKQ +A
Sbjct: 102 DEKFFPRK-------QIENEHTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNA 154
Query: 227 PDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFI 286
PV+N CNGR CG+TF+GPN I ++ Y+ +GD R+AEDIA VA F
Sbjct: 155 LGPVMNTCNGRYCGDTFSGPNKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFF 212
Query: 287 AKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKL 346
+K KG+ NYYMY+GGTNFGRT+S++V T YYD+AP+ EYGL R+PKWGH ++LH A+KL
Sbjct: 213 SK-KGTMANYYMYYGGTNFGRTSSSFVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKL 271
Query: 347 CLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNK----DKRNNATVYFSNLMYELPP 402
C K +L G K E Q S + +NN V + +L
Sbjct: 272 CQKALLWGTQPVQMLGKDLEVGQKQFGSYVSMLYHTPRAILQPKNNFLVVLEEMGGKLDG 331
Query: 403 LSI-SILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYL 461
+ I ++ D +VE W YK I T +T A L+ N T D+
Sbjct: 332 IEILTVNRDTICSIAGEHYPPNVETWSRYKGVIRTNVDTPKPAANLVCLDNKTITQVDFA 391
Query: 462 WY 463
Y
Sbjct: 392 SY 393
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 71/117 (60%), Gaps = 3/117 (2%)
Query: 654 YHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRT 713
YH PR+ L+P N LV+LEE G GI I TV+ T+C ++ H PP + S+ +
Sbjct: 305 YHTPRAILQPKNNFLVVLEEMGGKLDGIEILTVNRDTICS-IAGEHYPPNVETWSRYKGV 363
Query: 714 LKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVE 770
++T+ P +P + C + I+++ FASYG+P GNC ++ +G C++ NS+ IVE
Sbjct: 364 IRTNVDTP--KPAANLVCLDNKTITQVDFASYGDPVGNCGHFILGKCNAPNSQKIVE 418
>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
Length = 219
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 133/221 (60%), Positives = 158/221 (71%), Gaps = 2/221 (0%)
Query: 59 QMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRI 118
+MWP LI +AK+GGLDV+QT VFWN HEP PG++ F DLV+FIK VQ GLYV LRI
Sbjct: 1 EMWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRI 60
Query: 119 GPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPI 178
GP++ EW +GG P WL +PGI FR+DN PFK M+R+ T IVNMMKA RL+ S GGPI
Sbjct: 61 GPYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPI 120
Query: 179 ILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQ 238
ILSQIENEYG +E+ G Y WAA++AV L TGVPWVMCKQDDAPDPVINACNG
Sbjct: 121 ILSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFY 180
Query: 239 CGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIA 279
C + PN KP +WTE WT ++ +G R AED+A
Sbjct: 181 C--DYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219
>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
Length = 1171
Score = 276 bits (706), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 166/467 (35%), Positives = 234/467 (50%), Gaps = 44/467 (9%)
Query: 44 KILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRF 103
+ILF SIHYPR P W +LI AKE G++ ++T VFWN HE + G +DFSGR DL F
Sbjct: 476 RILFPASIHYPRCQPSDWQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGF 535
Query: 104 IKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVN 163
I+ + GLY LRIGP+I E +GG P WL D+ GI FR+ NEPF+ R+ +V
Sbjct: 536 IRTIAKAGLYALLRIGPYICAETHFGGFPHWLRDIDGIEFRTQNEPFQRESSRWVRFLVE 595
Query: 164 MMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
+ + + SQGGPI++ Q ENEY ++ ++ E G Y++W ++LA DLQ VP MCK
Sbjct: 596 KLNSNNCFYSQGGPIVMVQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCKG 655
Query: 224 D-DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHV 282
+ IN G Q E P++PAIWTE WT +Y V+G IR +D+ Y V
Sbjct: 656 SIENVLETINDFYGHQEMENHHR-EYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYAV 714
Query: 283 ALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
F A+ G +NYYM+HGGTN+ + A T Y AP+DEYG + K+ L+ +H
Sbjct: 715 LRFFAQ-GGKGINYYMFHGGTNYDQLAMYLQTTSYDYDAPIDEYG-RKTKKYFGLQYIHR 772
Query: 343 AVK-----LCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLM 397
++ L LK L + FI++ F N + V +
Sbjct: 773 QLEQHFASLALK--LEAPIAHSYEDNYVWIFIWEEQGSNCIFFCNDHPTSTKQVQWKEQE 830
Query: 398 YELPPLSISILPDCKTVAFNTAKLDSVEQ-----------------WEEYKEAIPTYD-- 438
Y L PLS+ ++ D + + +L E+ W+ YKE IPT D
Sbjct: 831 YCLAPLSVQMVVDHHRLILKSDQLFVDEELIQKELKPISVTTEEWTWQYYKENIPTTDIT 890
Query: 439 --------------ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP 471
T + +E + T A+DY WY ++ DP
Sbjct: 891 SSASQSSSISSLSSNTEIETQVPVEMLRYTGTATDYAWYIAHYQIDP 937
>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
Length = 706
Score = 271 bits (694), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 187/587 (31%), Positives = 293/587 (49%), Gaps = 45/587 (7%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G +VTY R I+G + +L GSIHYPRS+P W +L+ +AK GL+ ++ VFWNLHE
Sbjct: 82 GYSVTYSPRGFEIDGKQTLLLGGSIHYPRSSPGEWEQLLREAKRDGLNHIEMYVFWNLHE 141
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
+ G F+F+G ++ RF + GL++ +R GP++ EW GGLP WL+ +PG+ RS
Sbjct: 142 QERGVFNFAGNANITRFYELAAEVGLFLHVRFGPYVCAEWNNGGLPLWLNWIPGMEVRSS 201
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
N P++ M+R+ +V + + A GGPII++QIENE F P Y+ W
Sbjct: 202 NAPWQREMERFIRYMVELSRP--FLAKNGGPIIMAQIENE-------FAWHDPEYIAWCG 252
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN---SPDKPAIWTENWTSFY 263
L L T +PWVMC + A + ++ +CN C + FA + P P +WTE+ ++
Sbjct: 253 NLVKQLDTSIPWVMCYANAAENTIL-SCNDDDCVD-FAVKHVKERPSDPLVWTED-EGWF 309
Query: 264 QVYGDEAR------IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY 317
Q + + + RS ED+AY VA + A + G+ NYYMYHGG N+GR ASA V T Y
Sbjct: 310 QTWQKDKKNPLPNDQRSPEDVAYAVARWFA-VGGAAHNYYMYHGGNNYGRAASAGVTTMY 368
Query: 318 YDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ--EAFIFQGSSE 375
D L GL +PK HL++LH A+ C +L +N +L + + SS+
Sbjct: 369 ADGVNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRNDRQVLNPRELPLVDEQTVKASSQ 428
Query: 376 CAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKE--A 433
AF+ + N ++++ + S P + + S W+ + E
Sbjct: 429 QRAFVYGPEAEPNQD---GAILFDTADVRKS-FPGRQHRTYTPLVKASALAWKAWSELNV 484
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFK----HDPSDSESVLKVSSL-GHVLH 488
T + A+ +EQ+ T D SDYL Y F D D +KV+S +
Sbjct: 485 SSTTPRRRVVADQPIEQLRLTADQSDYLTYETTFTPKQLSDVDDDMWTVKVTSCEASSII 544
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHL-----INGTNNVSLLSVMVGLPDSGAYLERRV 543
A ++G +G + + + + E HL + +++ L+SV +G+ G+ + V
Sbjct: 545 ALVDGWLIGERNLAYPGGNCSKEFSFHLPASIEVGRQHDLKLVSVSLGIYSLGSNHSKGV 604
Query: 544 AGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWS 590
G + G K+L W L+GE+L+I+ VPW+
Sbjct: 605 TGSVRI---GHKDLA--RGQRWEMYPSLIGEQLEIYRSQWIDAVPWT 646
>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
Length = 281
Score = 268 bits (685), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 141/289 (48%), Positives = 179/289 (61%), Gaps = 10/289 (3%)
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW +GG P WL VPGI FR+DN PFK M ++ IV MMK+ L+ SQGGPIILSQIE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG VE+ Y+ WAA++AV L T VPWVMCKQDDAPDPVINACNG C +
Sbjct: 61 NEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYC--DYF 118
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
PN P KP +WTE WT ++ + + A V ++ + + GTN
Sbjct: 119 SPNKPYKPTMWTEAWTGWFTGFRGPVLTDCEDCFAVQV------IRRWILVTTIVPWGTN 172
Query: 305 FGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
FGRTA ++ YD AP+DEYGLLRQPKWGHL++LH A+K+C ++SG
Sbjct: 173 FGRTAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGN 232
Query: 364 LQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDC 411
QEA +++ S CAAFL N + + A+V F+ + Y +P SISILPDC
Sbjct: 233 YQEAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281
>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
Length = 752
Score = 263 bits (673), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 217/752 (28%), Positives = 336/752 (44%), Gaps = 131/752 (17%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
++D R++ +NG R +L GS+ YP+ W + AKE GL+ + VFWN+HE + G
Sbjct: 8 SFDSRAITLNGKRTLLLGGSLQYPKIHHTQWNNTLKLAKECGLNFLDIYVFWNVHEKKRG 67
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
F F+ D+ RF++ GL V LR+GP+I E YGG P WL ++PGI FR+ N+PF
Sbjct: 68 IFTFTEEADIFRFLQMAHQHGLLVMLRLGPYICAETSYGGFPCWLREIPGIQFRTYNDPF 127
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAV 210
+KR+ I ++K RL+ QGGPI+L Q+ENEY +V L KG Y+ W +L
Sbjct: 128 MREVKRWLFYITTLLKEKRLFFPQGGPIVLVQLENEYDLVSKIQLSKGEQYLNWYNELYR 187
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQ------------CGETFAG-----------PN 247
+L VP +MC+ +P+ V C+ + C ETF
Sbjct: 188 ELAFDVPLIMCR--SSPEEVGEFCSCSKEPELSTIASVETCIETFNSFYGHKKIADLRRR 245
Query: 248 SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR 307
P +P +WTE W +Y ++ R RS ED+ Y FIA+ G+ +YYM+HGGT+F
Sbjct: 246 KPHQPILWTEFWIGWYDIWTSAPRKRSTEDVIYAALRFIAQ-GGAGFSYYMFHGGTHFNN 304
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGH--LKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
A T YY +P+DEYG +P + LK ++ + S L+S + ++
Sbjct: 305 LAMYSQTTSYYFDSPIDEYG---RPSFLFYMLKRINHILH-----QFSSHLLSQDHPQVL 356
Query: 366 E------AFIFQ--GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFN 417
AFI+Q S + +FL N D A + F M ++ PLS+++ + + + +
Sbjct: 357 HLLPQVVAFIWQEHSSQQSLSFLCN-DSEQIAYIMFQQSMMKMNPLSVAVFLENELLFDS 415
Query: 418 TAKLDSVEQWEEYK-------EAIPTYD--------ETSLRANFLLEQMNTTKDASDYLW 462
++ D + ++K + T+ +S + L + ++ T+D +DY+W
Sbjct: 416 SSGYDWQIPFRDFKPLERAYFRELKTFQLDIPIPPLSSSCDFSQLPDMLSVTQDETDYMW 475
Query: 463 Y----NFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKM----- 513
Y E VL + ++H FIN +++GS+ K D+ F K
Sbjct: 476 YISSATLPVSSKEFTCEKVLLQIEMADLIHLFINQQYMGSSWIKIDDERFANGKNGFRFS 535
Query: 514 -----------VHLINGTNNVSLLSVMVGLPD------SGAYLERRVAGL---------- 546
V N VS+L +GL GA +E+ GL
Sbjct: 536 IEFENSVYPQPVFSSNSKLYVSILVCSLGLIKGEFQLWKGATMEKEKKGLFKQPIIHFVV 595
Query: 547 RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPL----T 601
++ ++ F+S SW L I D+ S V Y + +PL T
Sbjct: 596 KHSELETETIPLSFTS-SWAMM------PLSIMKDHQSAFV--KEYNIKNVDKPLSLGPT 646
Query: 602 WYK-TVFDAPTGSDP----VAINLISMGKGEAWVNGQSIGRYW-VSFLTPQGTPS----- 650
+YK TV D + I+ SM KG N GRY+ + L + PS
Sbjct: 647 YYKQTVIINKAMIDALKWGLVIDFSSMTKGIFRWNSFCCGRYYSIQVLGKERDPSLRNSP 706
Query: 651 ----------QSWYHIPRSFLKPTGNLLVLLE 672
Q +YHIP+ L+ L V E
Sbjct: 707 VQEDHLFKSTQRYYHIPKGVLQERNELEVFEE 738
>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
Length = 652
Score = 262 bits (669), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 171/523 (32%), Positives = 274/523 (52%), Gaps = 55/523 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
VT+D R+++I+G R IL+ GS HYP+ + WP+ + AK+ GL+ ++ +FWN+HE +
Sbjct: 5 QVTFDKRAVVIDGKRTILYCGSYHYPKIHYEHWPQALELAKDCGLNCLEVYIFWNVHEKK 64
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G + F ++ RF++ Q +GL V LR+GP+I E YGG P+WL ++PGI FR+ NE
Sbjct: 65 KGVYHFEREGNIFRFLQLAQERGLKVILRMGPYICAETSYGGFPYWLREIPGIEFRTYNE 124
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF MKR+ T I M+K +LY +GGPIIL QIENEY +V + G Y+ W +L
Sbjct: 125 PFMKEMKRWLTDINRMLKENKLYHQKGGPIILVQIENEYDIVSSIYGAAGQKYLHWCYEL 184
Query: 209 AVDLQTGVPWVMCKQD--------DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ W+ K D IN G + ++ P +P +WTE W
Sbjct: 185 YK--EGASEWLTSKDSEYFRVASIDKSIETINDFYGHRRIDSLKALK-PHQPLLWTEFWI 241
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ 320
+Y ++ R R +D+ Y A FIA+ GS +NYYM+HGGT+FG A TGY
Sbjct: 242 GWYNIWRGAQRQRPVDDVIYAAARFIAQ-GGSGMNYYMFHGGTHFGNLAMYGQTTGYDFD 300
Query: 321 APLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAF---------IFQ 371
AP+D YG + K+ LK+L+ CL L +L+S + ++Q+ +
Sbjct: 301 APVDSYGRPTE-KFERLKQLNH----CLSN-LEYILLSQDEPEVQKLTPNVNVYRWKDIE 354
Query: 372 GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTV------AFNTA-----K 420
EC+ V D+R+ + V + L PLS+ I + + V ++N + +
Sbjct: 355 SGDECS--FVCNDQRSQSYVIVAERAVCLKPLSVKIYLNHEEVFDSSQNSYNVSQKSYHR 412
Query: 421 LDSV-EQWEEYKEAIPT---YDETSLRANF--LLEQMNTTKDASDYLWYN--------FR 466
LD V +W+ + IP+ D+ +F + + ++ T+D +DY+WY F+
Sbjct: 413 LDYVCNEWKTMQIPIPSKEKKDKEHFEFSFPHIPDMLHITQDETDYMWYTGVGTIYCPFK 472
Query: 467 FKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFT 509
++ P + +++ + +V H F+N ++VGS D+ FT
Sbjct: 473 GENTPHCLKIHMELEAADYV-HVFLNRKYVGSCRSPCYDERFT 514
>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
Length = 282
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/290 (48%), Positives = 178/290 (61%), Gaps = 11/290 (3%)
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW +GG P WL VPGI FR+DN PFK M ++ IV MMK+ L+ SQGGPIILSQIE
Sbjct: 1 EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG VE+ Y+ WAA++AV L TGVPWVMCKQDDAPDPVINA NG C +
Sbjct: 61 NEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYC--DYF 118
Query: 245 GPNSPDKPAIWTE-NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGT 303
PNS + +W +R+ + + +I + NYYMYHGGT
Sbjct: 119 SPNSLKTFFGGLKLDWLVPVSGSSSSQTVRTGFCVQVYTEGWIFR------NYYMYHGGT 172
Query: 304 NFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFS 362
NFGRTA ++ YD AP+DEY LLRQPKWGHL++LH A+K+C ++SG
Sbjct: 173 NFGRTAGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLG 232
Query: 363 KLQEAFIFQGSS-ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDC 411
QEA +++ S CAAFL N + + A+V F+ + Y +P SISILPDC
Sbjct: 233 NYQEAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282
>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
Length = 376
Score = 258 bits (659), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 141/354 (39%), Positives = 197/354 (55%), Gaps = 29/354 (8%)
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
L V S GH LH F+NG+F GSA G + FT K VHL G N ++LLS+ VGLP+ G
Sbjct: 17 TLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRAGINKIALLSIAVGLPNVG 76
Query: 537 AYLERRVAGLRN-VSIQG-AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR--Y 592
+ E G+ V + G + KD + W +VGL GE + + + G V W R
Sbjct: 77 LHYESWKTGILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAMDLVSPNGGSSVDWIRGSL 136
Query: 593 GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ------ 646
+ T Q L WYK F+AP G +P+A+++ SMGKG+ W+NGQSIGRYW+++
Sbjct: 137 ATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSIGRYWMAYANGDCSLCSY 196
Query: 647 -------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCG 693
G P+Q WYH+PRS+LKPT NL+V+ EE G P I++ SV +C
Sbjct: 197 IGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKNLMVMFEELGGDPSKITLVKRSVAGVCA 256
Query: 694 HVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCE 753
+ + H P + + KT + +V ++C G+ IS I FAS+G P G C
Sbjct: 257 DLQEHH-PNAEKFDIDSHEESKTL-----HQAQVHLQCVPGQSISSIKFASFGTPTGTCG 310
Query: 754 NYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
++ G+CH++NS AIVEK C+G+ SC V V F DPCP + K L V+A C+
Sbjct: 311 SFQQGTCHATNSHAIVEKNCIGRESCLVTVSNSIFGTDPCPNVLKRLSVEAVCS 364
>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
Length = 362
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 202/351 (57%), Gaps = 42/351 (11%)
Query: 365 QEAFIFQ-GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL-- 421
QE +F S CAAFL N D ++A V F N+ YELPP SISILPDCKT FNTA+L
Sbjct: 9 QEVHVFNPKSGSCAAFLANYDTTSSAKVNFQNMQYELPPWSISILPDCKTAVFNTARLGA 68
Query: 422 -DSVEQ--------WEEY-KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDP 471
S++Q W+ Y +E+ + D+ + + L EQ+N T+DASDYLWY D
Sbjct: 69 QSSLKQMTPVSTFSWQSYIEESASSSDDKTFTTDGLWEQLNVTRDASDYLWYMTNINIDS 128
Query: 472 SD------SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSL 525
++ + +L + S GH LH FING+ G+ +G + T + V + G N +SL
Sbjct: 129 NEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNVKMRVGVNQLSL 188
Query: 526 LSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYG 583
LS+ VGL + G + E+ G L V+++G E +D S W Y++GL GE L + T G
Sbjct: 189 LSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTRDLSKQQWSYKIGLKGEDLSLHTVSG 248
Query: 584 SRIVPWSRYGS-STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSF 642
S V W S + QPLTWYKT F+AP G++P+A+++ +MGKG W+N QSIGR+W +
Sbjct: 249 SSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWINSQSIGRHWPGY 308
Query: 643 L--------------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEE 673
+ T G PSQ WYH+PRS+L PTGNLLV+L+
Sbjct: 309 IAHGSCGECNYAGTYTDKKCHTNCGQPSQRWYHVPRSWLNPTGNLLVVLKR 359
>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
Length = 203
Score = 252 bits (644), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 124/206 (60%), Positives = 147/206 (71%), Gaps = 3/206 (1%)
Query: 54 PRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLY 113
PRSTP+MWP LI AKEGGLDV+QT VFWN HEP PG + F R D V+FIK V GLY
Sbjct: 1 PRSTPEMWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLY 60
Query: 114 VCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYAS 173
V LRIGP+I GEW +GG P WL VPGI FR+DN PFK M+++ IVNMMKA +L+
Sbjct: 61 VHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEP 120
Query: 174 QGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINA 233
QGGP I+SQIE EYG + G Y +WAA++AV L TGVPW+MCKQ+DAPDP+I+
Sbjct: 121 QGGP-IMSQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDT 179
Query: 234 CNGRQCGETFAGPNSPDKPAIWTENW 259
CNG C E F PN+ KP +WTE W
Sbjct: 180 CNGFYC-ENFM-PNANYKPKMWTEAW 203
>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
PN500]
Length = 611
Score = 251 bits (641), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 174/572 (30%), Positives = 282/572 (49%), Gaps = 59/572 (10%)
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
M+ + I ++ R +A+ GGPII+SQ+ENEYG V+ + E G Y +W+A+LA L
Sbjct: 1 MESWMRFITKYLE--RHFAANGGPIIMSQVENEYGWVQERYGESGTKYAQWSARLAQSLN 58
Query: 214 TGVPWVMCKQDDAPDPVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQVYGDEAR 271
GVPW+MC+QDD D VIN CNG C + G P++PA +TENW ++Q +
Sbjct: 59 VGVPWIMCQQDDI-DSVINTCNGFYCHDWIEGHWARYPNQPAFFTENWPGWFQQWKQSTP 117
Query: 272 IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQ 331
R ED+ Y V + A+ GS +NYYM+HGGTNFGRT+S V+ Y A LDEYG +
Sbjct: 118 HRPVEDVLYAVGNWFAR-GGSLMNYYMWHGGTNFGRTSSPMVVNSYDYDAALDEYGNPSE 176
Query: 332 PKWGHLKELHSAVKLCLKPMLSGVLV--SMNFSKLQEAFIFQGSSECAAFLVNKDKRNNA 389
PK+ H + ++ ++ L+ + S + + E +FL+N +
Sbjct: 177 PKYSHAAKFNNLLQKYSHIFLNAPEIPRSEYLGGSSSIYHYTFGGESLSFLINNHESALN 236
Query: 390 TVYFSNLMYELPPLSISIL------------PDCKTVAFNTAKLDSVEQWE-----EYKE 432
+ ++ + + P S+ +L P+ +A + + V + ++ E
Sbjct: 237 DIVWNGQNHIIKPWSVHLLYNNHTVFDSAATPEVSKLAMTSKRFSPVNSFNNAYISQWVE 296
Query: 433 AIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFIN 492
I D T ++ LEQ++ T D +DYLWY +E + +++ VLHA+I+
Sbjct: 297 EIDMTDST--WSSKPLEQLSLTHDKTDYLWYVTEINLQVRGAE--VFTTNVSDVLHAYID 352
Query: 493 GEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLR-NVSI 551
G++ + S F ++ + L G + + +L+ +G+ +E+ GL N+ +
Sbjct: 353 GKYQSTI---WSANPFNIKSDIPL--GWHKLQILNSKLGVQHYTVDMEKVTGGLLGNIWV 407
Query: 552 QGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVF-DAP 610
G D ++ W + + GE+L I+ V WS + S QPLTWYK F
Sbjct: 408 GGT----DITNNGWSMKPYVNGERLAIYNPNNIFKVDWSSF-SGVQQPLTWYKINFLHEL 462
Query: 611 TGSDPVAINLISMGKGEAWVNGQSIGRYWVS------------------FLTPQGTPSQS 652
+ + ++N+ M KG W+NG+ + RYW++ T G PSQ
Sbjct: 463 SPNKHYSLNMSGMNKGMIWLNGKHVARYWITKGWGCNGCSYQGGYTDQLCSTNCGEPSQI 522
Query: 653 WYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
YH+P+ +L NLLV+ EE G P I ++
Sbjct: 523 NYHLPQDWLIEGANLLVIFEEVGGNPKSIKLE 554
>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
Length = 208
Score = 249 bits (636), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 114/185 (61%), Positives = 140/185 (75%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+NVTYD ++L+I+G R++L SGSIHYPRSTPQMWP LI K+K+GG+DV++T VFWNLHEP
Sbjct: 24 SNVTYDHKALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEP 83
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
GQ++F GR DLV F+K V A GLYV LRIGP++ EW YGG P WLH + GI FR++N
Sbjct: 84 VRGQYNFEGRGDLVGFVKVVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNN 143
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
EPFK MKR+ IV+MMK LYASQGGPIILSQIENEYG ++ Y+ WAA
Sbjct: 144 EPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAAS 203
Query: 208 LAVDL 212
+A L
Sbjct: 204 MATSL 208
>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
Length = 777
Score = 246 bits (627), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 212/746 (28%), Positives = 334/746 (44%), Gaps = 110/746 (14%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+TYD RSL ING SG++HY RS P WP++ + GL+ V+T VFW HE
Sbjct: 8 REITYDSRSLRINGKPFFCLSGAVHYVRSHPSAWPQIFRCMRRDGLNTVETYVFWGDHEF 67
Query: 88 QPGQF-------DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDV-- 138
+P + DFSG RDLVRF++ + GL LR+GP++ E YGG P+WL V
Sbjct: 68 EPPEMPDAEPRADFSGPRDLVRFLRCAKLHGLNAILRLGPYVCAEVNYGGFPWWLRQVCE 127
Query: 139 ----PGIVFRSDNEPFKFHMKRYATMIVN-MMKAARLYASQGGPIILSQIENEYGMVEHS 193
+ FR+ + + ++R+ +V+ ++K AR++A QGGP+IL+QIENEY M+ S
Sbjct: 128 KGSSKPVRFRTWDPAYCAQVERWLKYLVDHVLKPARVFAPQGGPVILAQIENEYAMIAES 187
Query: 194 FLEKGPPYVRWAAKLAVDLQTGVPWVMC-----KQDDAPDPVINACNGRQCGETFAGPNS 248
+ G Y+ W A LA L GVP VMC ++ INA + E+
Sbjct: 188 YGPDGQQYLDWIASLANQLALGVPLVMCYGASQRESGRVIETINAFYAHEHVESLRRAQG 247
Query: 249 PD-KPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR 307
+ +P +WTE WT +Y V+G R A D+AY V F+A G+ +NYYMY GGTN+ R
Sbjct: 248 ANPQPLLWTECWTGWYDVWGAPHHRRDAADLAYAVLRFLAA-GGAGINYYMYFGGTNWRR 306
Query: 308 TASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQE 366
+ Y+ YD APL+EY ++ K HL+ LH ++ +P LS ++ S+L E
Sbjct: 307 ENTMYLQATSYDYDAPLNEY-VMETTKSRHLRRLHESI----QPFLSDRDGVLDMSRL-E 360
Query: 367 AFIFQGSS-----ECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKL 421
+F+G E + + D R+ +V +++ + + + + + + N A
Sbjct: 361 LKVFEGERRAILYERSTVSGDADHRSEESV---RCVFDSADIRVHLALELREIIVNAASR 417
Query: 422 DSVE--QWEEYKEAIP---TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSES 476
D+ + +W E P +TS + + ++ T SDY WY R
Sbjct: 418 DTGQDLRWRMLPEPPPLRAALSDTSATLATIPDLVDATAGTSDYAWYILRCPTAQGSGLL 477
Query: 477 VLKVSSLGHVLH--AFING--------EFVGSAHGKHSDKSF-----TLEK-----MVHL 516
L+V+ G V A G E+ + + F + E V
Sbjct: 478 QLEVADFGRVWRRKAVDQGDDAERQPLEWAAAGPEPPVEDRFPNAWNSTEYGYGIVEVGA 537
Query: 517 INGTNNVSLLSVMVGL--------PDSGAYLERRVAGLRNVSIQGAKELKD---FSSFSW 565
I+ +L +G+ P G ER+ GL S + D +
Sbjct: 538 IDCHEEYVVLVSSLGMVKGDWQLPPGYGMARERK--GLLRASYRSDVTFADDEWRDALVV 595
Query: 566 GYQVGLLGEKLQIFTDYGSRIVPW-------SRYGSSTHQPLTWYKTVFDAP----TGSD 614
G+ GL GE+++ + + P+ + G P WY+ P ++
Sbjct: 596 GFAAGLRGERIRSVIEGDADAYPYLWTPQKAALSGRRFSWP-RWYRASLAIPPPNADETE 654
Query: 615 PVAINLISMGKGEAWV--NGQSIGRYW-VSFLTPQ------------------GTPSQSW 653
+ ++L G + W+ NG+ GR+W V P+ G P+Q +
Sbjct: 655 GIILDLYESGVEKGWIYMNGEPCGRHWRVHGTMPKNGFLRQGDQEAPIEQVGHGQPTQRY 714
Query: 654 YHIPRSFLKPTG---NLLVLLEEENG 676
++IP L G L++ E NG
Sbjct: 715 FYIPPWHLHAKGRPSTLVIFDEHANG 740
>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
gi|217314871|gb|ACK36970.1| lectin [Glycine max]
Length = 447
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 147/415 (35%), Positives = 217/415 (52%), Gaps = 47/415 (11%)
Query: 425 EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD--------SES 476
+ W KE + + ++S + E +N TKD SDYLWY+ R SD
Sbjct: 33 KSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSDSDILFWEENDVHP 92
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
L + + +L FING+ + D+ F + ++ + G N+ + S+ + G
Sbjct: 93 KLTIDGVRDILRVFINGQLIVK------DEQF--KAVISVSIGKNDCTAGSI----NNYG 140
Query: 537 AYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
A+LE+ AG+R + I G + D S W YQVGL GE L+ +++
Sbjct: 141 AFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQGEFLKFYSEENENSEWVELTPD 200
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------- 646
+ TWYKT FD P G DPVA++ SMGKG+AWVNGQ IGRYW ++P+
Sbjct: 201 AIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYWTR-VSPKSGCQQVCD 259
Query: 647 --------------GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLC 692
G P+Q+ YH+PRS+LK T NLLV+LEE G P IS+ S +C
Sbjct: 260 YRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVILEETGGNPFEISVKLHSSRIIC 319
Query: 693 GHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNC 752
VS+S+ PP+ + N + P++ + C G IS + FAS+G P G+C
Sbjct: 320 AQVSESNYPPL--QKLVNADLIGEEVSANNMIPELHLHCQQGHTISSVAFASFGTPGGSC 377
Query: 753 ENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+N++ G+CH+ +S +IV +AC GKRSC++ + F DPCPG+ K L V+A+CT
Sbjct: 378 QNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAFGVDPCPGVVKTLSVEARCT 432
>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
Length = 770
Score = 242 bits (617), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 158/466 (33%), Positives = 233/466 (50%), Gaps = 46/466 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD R+ I+G R +L GSIHYPR W ++ + GL+ VQ VFWN HEP+
Sbjct: 50 SVTYDSRAFKIDGVRTLLLGGSIHYPRVAVDEWEPMLEEMGRDGLNHVQLYVFWNYHEPR 109
Query: 89 P-----------GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHD 137
P ++DFSGR DL+ FI+ + L+V LRIGP++ EW +GGLP WL D
Sbjct: 110 PPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAKKDLFVSLRIGPYVCAEWAFGGLPLWLRD 169
Query: 138 VPGIVFRS--------------------DNEPFKFHMKRYATMIVNMMKAARLYASQGGP 177
V G+ FRS +P++ +M + I M+K A L A+QGGP
Sbjct: 170 VEGMCFRSICGYNGSPGKCKPWEGGKFRSCDPWRKYMADFVMEIGRMVKEANLMAAQGGP 229
Query: 178 IILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGR 237
+IL Q+ENEYG HS + G Y+ W +L+ L VPWVMC A + +N CNG
Sbjct: 230 VILGQLENEYG--HHS--DAGRAYIDWVGELSFGLGLDVPWVMCNGISA-NGTLNVCNGD 284
Query: 238 QCGETFAGPNS---PDKPAIWTENWTSFYQVYGDEA--RIRSAEDIAYHVALFIAKMKGS 292
C + + + PD+P WTEN ++ +G RSAE++AY +A ++A + GS
Sbjct: 285 DCADEYKTDHDKRWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVA-VGGS 342
Query: 293 YVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV-KLCLKPM 351
+ NYYM++GG + + +A + Y D GL +PK HL+ LH + KL + M
Sbjct: 343 HHNYYMWYGGNHLAQWGAASLTNAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNGELM 402
Query: 352 LSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNN-ATVYFSNLMYELPPLSISIL-P 409
S+ +L+ + AFL + V+++ Y + + ++ P
Sbjct: 403 QVEDRHSVMPVQLENGVEVYEWTAGLAFLHRPACSGSPVEVHYAKATYSIACREVLVVDP 462
Query: 410 DCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTK 455
TV F TA ++ + A T D S+R LL M T +
Sbjct: 463 SSSTVLFATASVEPPPELVRRVVATLTADRWSMRKEELLHGMATVE 508
>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
Length = 601
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 177/619 (28%), Positives = 291/619 (47%), Gaps = 63/619 (10%)
Query: 116 LRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQG 175
+RIGP++ EW GG+P W++ + G+ R++N+ +K M + ++ + + +A +G
Sbjct: 1 MRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTR--DFFADRG 58
Query: 176 GPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACN 235
GPII SQIENE Y+ W + A L+ VPW+MC D + INACN
Sbjct: 59 GPIIFSQIENE-------LWGGAREYIDWCGEFAESLELNVPWMMC-NGDTSEKTINACN 110
Query: 236 GRQCGETF-----AGPNSPDKPAIWTENWTSFYQVYGDEA---------RIRSAEDIAYH 281
G C +G D+P WTEN ++Q++G + RSAED ++
Sbjct: 111 GNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFN 169
Query: 282 VALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELH 341
V F+ + GSY NYYM+ GG ++G+ A + Y + + L +PK H ++H
Sbjct: 170 VLKFMDR-GGSYHNYYMWFGGNHYGKWAGNGMTNWYTNGVMIHSDTLPNEPKHSHTAKMH 228
Query: 342 SAVKLCLKPMLSGVLVSMNFSKLQ----EAFIFQGSSECAAFLVNKDKRNNATVYFSNLM 397
+ + +L+ N L AF ++ +F+ N K + V + +++
Sbjct: 229 RMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYRYGDRLVSFVENS-KGSADKVIYRDIV 287
Query: 398 YELPPLSISILPDCKTVAFNTAKLDSVE-----------QWEEYKEAIPTYDETSLR--- 443
YELP S+ +L + V F T + V ++E + E + T + + R
Sbjct: 288 YELPAWSMIVLDEYDNVLFETNNVKPVNKHRVYHCEEKLEFEYWNEPVSTLSQEAPRVVV 347
Query: 444 ---ANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGS-A 499
AN EQ+N T+D +++L+Y + P D ++ + + A+++ FVGS
Sbjct: 348 SPKAN---EQLNMTRDLTEFLYYETEVEF-PQDECTLSIGGTDANAFVAYVDDHFVGSDD 403
Query: 500 HGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS-GAYLERRVAGLRNVSIQGAKEL- 557
H D T+ + G + + LLS +G+ + + L+ A R I G +L
Sbjct: 404 EHTHHDGWHTMNINMKSGKGKHKLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWIKLC 463
Query: 558 -KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSD-- 614
D + W + GL+GE Q+FTD G + V W + L WY++ F P G
Sbjct: 464 GNDIFNQEWKHYPGLVGEAKQVFTDEGMKTVTW-KSDVENADNLAWYRSTFKTPQGLKRG 522
Query: 615 -PVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG--NLLVLL 671
V + M +G+A+ NG +IGRYW+ G +Q +YHIP+ +LK G N+LVL
Sbjct: 523 IEVLLRPEGMNRGQAYANGHNIGRYWM-IKDGNGEYTQGFYHIPKDWLKGEGEENVLVLG 581
Query: 672 EEENGYPPGISIDTVSVTT 690
E P ++I T +
Sbjct: 582 ETLGASDPSVTICTTEYVS 600
>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
Length = 317
Score = 223 bits (569), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 121/282 (42%), Positives = 163/282 (57%), Gaps = 14/282 (4%)
Query: 536 GAYLERRVAGLR-NVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
GA+LE+ AG + V + G K + D S +SW YQVGL GE +I+ S W+
Sbjct: 28 GAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLT 87
Query: 594 -SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPS-- 650
++ TWYKT FDAP G +PVA++L SMGKG+AWVNG IGRYW G
Sbjct: 88 PDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWTRVAPKDGCGKCD 147
Query: 651 ------QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVI 704
S YHIPRS+L+ + NLLVL EE G P IS+ + S T+C VS+SH P +
Sbjct: 148 YRGHYHTSKYHIPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTICAEVSESHYPSLQ 207
Query: 705 SWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSN 764
+W + + ++ P++ ++C G IS I FASYG P G+C+ ++ G CH+ N
Sbjct: 208 NWSPSDFIDQNSKNKM---TPEMHLQCDDGHTISSIEFASYGTPQGSCQMFSQGQCHAPN 264
Query: 765 SRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
S A+V KAC GK SC + + F GDPC GI K L V+A+C
Sbjct: 265 SLALVSKACQGKGSCVIRILNSAFGGDPCRGIVKTLAVEAKC 306
>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
Length = 283
Score = 222 bits (565), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 116/286 (40%), Positives = 164/286 (57%), Gaps = 27/286 (9%)
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A L TGVPW+MC+Q +APDP+IN CN C + PNS +KP +WTENW+ ++ +G
Sbjct: 1 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQ--FTPNSDNKPKMWTENWSGWFLAFG 58
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLDEY 326
R ED+A+ VA F + G++ NYYMYHGGTNFGRT ++ YD AP+DEY
Sbjct: 59 GAVPYRPVEDLAFAVARFFQR-GGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEY 117
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
G +RQPKWGHLK+LH A+KLC + +++ + E +++ + C+AFL N
Sbjct: 118 GDIRQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVYKTGAVCSAFLANI-GM 176
Query: 387 NNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYK--------------- 431
++ATV F+ Y LP S+SILPDCK V NTAK+++ +
Sbjct: 177 SDATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLKEKVDSLDSSS 236
Query: 432 -------EAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHD 470
E + + + LLEQ+NTT D SDYLWY+ ++
Sbjct: 237 SGWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVYE 282
>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
Length = 307
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 168/294 (57%), Gaps = 32/294 (10%)
Query: 421 LDSVEQWEEYKEAIPTYD-ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------ 473
+ S W+ Y EA + + S AN LLEQ+ T+D+SDYLWY P++
Sbjct: 11 VSSAFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNG 70
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
VL S GHVLH F+NG+F G+A+G + T V L G N +SLLSV VGL
Sbjct: 71 QYPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLS 130
Query: 534 DSGAYLER-RVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
+ G + E V L V+++G E +D S W Y++GL GE L + T GS V W++
Sbjct: 131 NVGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTK 190
Query: 592 YGSS--THQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL------ 643
GSS QPLTWYK FDAP G+DP+A+++ SMGKGE WVNG+SIGR+W +++
Sbjct: 191 -GSSLVEKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAYIARGSCG 249
Query: 644 --------------TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
T G P+Q WYHIPRS++ P GN LV+LEE G P GIS+
Sbjct: 250 GCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISL 303
>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 154
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 99/154 (64%), Positives = 120/154 (77%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+VTYD +++IING R+IL SGSIHYPRSTPQMWP LI KAK+GGLD+++T VFWN HEP
Sbjct: 1 SVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPS 60
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
P ++ F R DLVRFIK VQ GLYV LRIGP++ EW YGG P WL VPGI FR+DN
Sbjct: 61 PDKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTDNA 120
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
PFK M+++ IV+MMK +L+ +QGGPIILSQ
Sbjct: 121 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 154
>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
Length = 480
Score = 213 bits (541), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 125/332 (37%), Positives = 176/332 (53%), Gaps = 39/332 (11%)
Query: 497 GSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAK 555
G+ +G D T V L G+N +S LS+ VGLP+ G + E AG L V++ G
Sbjct: 165 GTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLN 224
Query: 556 E-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSD 614
E +D + W YQVGL GE + + GS V W + F+AP G +
Sbjct: 225 EGRRDLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNASN-----MAFFNAPDGDE 279
Query: 615 PVAINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQGTPSQSWY 654
P+A+++ SMGKG+ W+NGQ IGRYW + T G SQ WY
Sbjct: 280 PLALDMSSMGKGQIWINGQGIGRYWPGYKASGNCGTCDYRGEYDETKCQTNCGDSSQRWY 339
Query: 655 HIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTL 714
H+PRS+L PTGNLLV+ EE G P GIS+ S+ ++C VS+ P + +W +++
Sbjct: 340 HVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWHTKDYE-- 396
Query: 715 KTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACL 774
+ KV ++C +G+KI++I FAS+G P G+C +Y G CH+ S I K C+
Sbjct: 397 ---------KAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIFWKNCV 447
Query: 775 GKRSCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
G+ C V V E F GDPCPG K +V+A C
Sbjct: 448 GQERCGVSVVPEIFGGDPCPGTMKRAVVEAIC 479
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/146 (54%), Positives = 98/146 (67%), Gaps = 3/146 (2%)
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
M+++ T IV MMK+ L+ QGGPIILSQIENE+G +E E Y WAA +AV L
Sbjct: 1 MQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALN 60
Query: 214 TGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
T VPW+MCK+DDAPDP+IN CNG C + PN P KP +WTE WT++Y +G R
Sbjct: 61 TSVPWIMCKEDDAPDPIINTCNGFYC--DWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHR 118
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMY 299
ED+AY VA FI K GS+VNYYM+
Sbjct: 119 PVEDLAYGVAKFIQK-GGSFVNYYMF 143
>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
Length = 315
Score = 206 bits (524), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 98/171 (57%), Positives = 122/171 (71%)
Query: 12 LLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEG 71
+LL I G VTYD R+L+I+G R++L SGSIHYPRS P++WP +I K+KEG
Sbjct: 142 VLLVLIAVCVFEGCYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEG 201
Query: 72 GLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGL 131
GLDV++T VFWN HEP G++ F GR DLVRF+K VQ GL V LRIGP+ EW YGG
Sbjct: 202 GLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGF 261
Query: 132 PFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
P WLH +PGI FR+ N+ FK MKR+ IV++MK A L+A QGGPIIL+Q
Sbjct: 262 PVWLHFIPGIQFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 312
>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
Length = 275
Score = 205 bits (521), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 108/269 (40%), Positives = 154/269 (57%), Gaps = 27/269 (10%)
Query: 559 DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH--QPLTWYKTVFDAPTGSDPV 616
D S W YQVGL GE + + + + W + QPLTW+KT FDAP G++P+
Sbjct: 2 DLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPL 61
Query: 617 AINLISMGKGEAWVNGQSIGRYWVSFLTPQ-------------------GTPSQSWYHIP 657
A+++ MGKG+ WVNG+SIGRYW +F T G P+Q WYH+P
Sbjct: 62 ALDMEGMGKGQIWVNGESIGRYWTAFATGDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVP 121
Query: 658 RSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTH 717
R++LKP+ NLLV+ EE G P +S+ SV+ +C VS+ H P + +W+ ++ +T
Sbjct: 122 RAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTF 180
Query: 718 KRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKR 777
RPKV ++C G+ I+ I FAS+G P G C +Y G CH++ S AI+E+ C+GK
Sbjct: 181 -----HRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKA 235
Query: 778 SCTVPVWTEKFYGDPCPGIPKALLVDAQC 806
C V + F DPCP + K L V+A C
Sbjct: 236 RCAVTISNSNFGKDPCPNVLKRLTVEAVC 264
>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
Length = 177
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 98/171 (57%), Positives = 122/171 (71%)
Query: 12 LLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEG 71
+LL I G VTYD R+L+I+G R++L SGSIHYPRS P++WP +I K+KEG
Sbjct: 7 VLLVLIAVCVFEGCYCKTVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEG 66
Query: 72 GLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGL 131
GLDV++T VFWN HEP G++ F GR DLVRF+K VQ GL V LRIGP+ EW YGG
Sbjct: 67 GLDVIETYVFWNNHEPVRGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGF 126
Query: 132 PFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
P WLH +PGI FR+ N+ FK MKR+ IV++MK A L+A QGGPIIL+Q
Sbjct: 127 PVWLHFIPGIQFRTTNDLFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 177
>gi|414888317|tpg|DAA64331.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
Length = 284
Score = 202 bits (515), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 109/277 (39%), Positives = 155/277 (55%), Gaps = 7/277 (2%)
Query: 532 LPDSGAYLERRVAGLRNVSIQGAKE-LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWS 590
L DSG L +G++ IQG D WG++ L GE +I+++ G V W
Sbjct: 6 LQDSGGELAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWK 65
Query: 591 RYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPS 650
+ + TWYK FD P G DPV +++ SM KG +VNG+ +GRYWVS+ T GTPS
Sbjct: 66 P--AENGRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSYRTLAGTPS 123
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQN 710
Q+ YHIPR FLK NLLV+ EEE G P GI + TV+ +C +S+ + + +W +
Sbjct: 124 QALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRDDICLFISEHNPGQIKTWDTDG 183
Query: 711 QRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVE 770
+ +K RR + CP + I +++FAS+GNP G C N+ +G+CH+ N++ IVE
Sbjct: 184 DK-IKLIAEDHSRRGT--LMCPPEKTIQEVVFASFGNPEGMCGNFTVGTCHTPNAKQIVE 240
Query: 771 KACLGKRSCTVPVWTEKFYGD-PCPGIPKALLVDAQC 806
K CLGK SC +PV + D C L V +C
Sbjct: 241 KECLGKPSCMLPVDHTVYGADINCQSTTATLGVQVRC 277
>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
Length = 172
Score = 202 bits (514), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 101/175 (57%), Positives = 122/175 (69%), Gaps = 3/175 (1%)
Query: 129 GGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG 188
GG P WL VPGI FR+DNEPFK M+ + IVN+MK+ L+ SQGGPIILSQIENEYG
Sbjct: 1 GGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYG 60
Query: 189 MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS 248
+ G YV WAA +AV L TGVPWVMCK++DAPDPVIN CNG C ++F+ PN
Sbjct: 61 PQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYC-DSFS-PNR 118
Query: 249 PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGT 303
P KP IWTE W+ ++ +G R +D+A+ VA FI K GS+ NYYMYHGGT
Sbjct: 119 PYKPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQK-GGSFFNYYMYHGGT 172
>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
Length = 267
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 106/263 (40%), Positives = 150/263 (57%), Gaps = 20/263 (7%)
Query: 298 MYHGGTNFGR-TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVL 356
MYHGGTNF R T ++ T Y AP+DEYG++RQ KWGHLK+++ A+KLC + +++
Sbjct: 1 MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTDP 60
Query: 357 VSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAF 416
+ + EA +++ S CAAFL N D +N+ TV FS Y LP S+S+LPDCK V
Sbjct: 61 KISSLGQNLEAAVYKTGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPDCKNVVL 120
Query: 417 NTAKLDSV------------------EQWEEYKEAIPTYDETSLRANFLLEQMNTTKDAS 458
NTAK++S +W E + + L LLEQ+NTT D S
Sbjct: 121 NTAKINSASAISNFVTEDISSLETSSSKWSWINEPVGISKDDILSKTGLLEQINTTADRS 180
Query: 459 DYLWYNFRFK-HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLI 517
DYLWY+ D S++VL + SLGH LHAFING+ G+ G ++ + L+
Sbjct: 181 DYLWYSLSLDLADDPGSQTVLHIESLGHTLHAFINGKLAGNQAGNSDKSKLNVDIPIALV 240
Query: 518 NGTNNVSLLSVMVGLPDSGAYLE 540
+G N + LLS+ VGL + GA+ +
Sbjct: 241 SGKNKIDLLSLTVGLQNYGAFFD 263
>gi|62529271|gb|AAX84941.1| beta-galactosidase [Prunus persica]
Length = 287
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 157/284 (55%), Gaps = 21/284 (7%)
Query: 312 YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQ 371
++ T Y APLDEYGL R+PKWGHL++LH A+K ++S + QEA +F+
Sbjct: 3 FMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVFK 62
Query: 372 GSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ----- 426
S CAAFL N D +++A V F N YELPP SISILPDCKT +NTA+L S
Sbjct: 63 SKSGCAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCKTAVYNTARLGSQSSQMKMT 122
Query: 427 -------WEEYKEAIPTYDET-SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD----- 473
W+ + E + DE+ + + L EQ+N T+D +DYLWY P +
Sbjct: 123 PVKSALPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEGFIKR 182
Query: 474 -SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGL 532
+L + S GH LH FING+ G+ +G + T + V L +G N ++LLS+ VGL
Sbjct: 183 GESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGINKLALLSISVGL 242
Query: 533 PDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQVGLLGE 574
P+ G + E AG L V+++G D S + W Y+ GL GE
Sbjct: 243 PNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKTGLKGE 286
>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
Length = 173
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 99/177 (55%), Positives = 122/177 (68%), Gaps = 6/177 (3%)
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
+WG+ L VPGI FR+DN PFK M+++ IVNMMK+ +L+ QGGPII+SQIE
Sbjct: 1 DWGFSCL---AQYVPGIAFRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIE 57
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA 244
NEYG VE G Y +WAA++AV L TGVPW+MCKQ+DAPDPVI+ CNG C E F
Sbjct: 58 NEYGPVEWEIGAPGKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYC-EGFR 116
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHG 301
PN KP +WTENWT +Y +G A R ED+A+ VA FI + GS+VNYYMYHG
Sbjct: 117 -PNKNYKPKMWTENWTGWYTKFGGPAPYRPVEDLAFSVARFI-QNNGSFVNYYMYHG 171
>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
Length = 288
Score = 193 bits (491), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 155/281 (55%), Gaps = 26/281 (9%)
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD-QAPLD 324
+GD R ED+A+ VA F + G++ NYYM+HGGTNFGRT ++ YD P+D
Sbjct: 9 FGDVVPHRPVEDLAFAVARFYQR-GGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTPID 67
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKD 384
EYG++RQPKW HLK +H A+KLC K +L+ EA ++ + AAFL N
Sbjct: 68 EYGIIRQPKWDHLKNVHKAIKLCEKALLATGPTITYLGPNIEAAVYNIGAVSAAFLANIA 127
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQ-----WEEYKEAIPTYDE 439
K +A V F+ Y LP +S LPDCK+V NTAK++S E KE + + D+
Sbjct: 128 K-TDAKVSFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMISSFTTESLKEEVGSLDD 186
Query: 440 T-----------------SLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSS 482
+ S +LLEQ+NTT D SDYLWY+ D + +E+VL + S
Sbjct: 187 SGSGWSWISEPIGISKAHSFSKFWLLEQINTTADRSDYLWYSSSIDLDAA-TETVLHIES 245
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNV 523
LGH LHAF+NG+ GS G H S ++ + L+ G N +
Sbjct: 246 LGHALHAFVNGKLAGSGTGNHEKVSVKVDIPITLVYGKNTI 286
>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
Length = 270
Score = 189 bits (481), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 108/273 (39%), Positives = 151/273 (55%), Gaps = 31/273 (11%)
Query: 558 KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG-SSTHQPLTWYKTVFDAPTGSDPV 616
+D S W Y+VGL GE L + + GS V W+ + QPLTWYKT F AP G P+
Sbjct: 6 RDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPL 65
Query: 617 AINLISMGKGEAWVNGQSIGRYWVSF--------------------LTPQGTPSQSWYHI 656
A+++ SMGKG+ W+NGQS+GR+W ++ L G SQ WYH+
Sbjct: 66 AVDMGSMGKGQIWINGQSLGRHWPAYKAVGSCSECSYTGTFREDKCLRNCGEASQRWYHV 125
Query: 657 PRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQ--NQRTL 714
PRS+LKP+GNLLV+ EE G P GI++ V ++C + + W+S N +
Sbjct: 126 PRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYE--------WQSTLVNYQLH 177
Query: 715 KTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACL 774
+ K PK ++C G+KI+ + FAS+G P G C +Y GSCH+ +S K C+
Sbjct: 178 ASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCV 237
Query: 775 GKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
G+ C+V V E F GDPCP + K L V+A C
Sbjct: 238 GQNWCSVTVAPEMFGGDPCPNVMKKLAVEAVCA 270
>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
Length = 296
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/287 (38%), Positives = 152/287 (52%), Gaps = 30/287 (10%)
Query: 427 WEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY------NFRFKHDPSDSESVLKV 480
W+ Y EA + D + + L+EQ++ T D SDYLWY N + S L +
Sbjct: 9 WQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTI 68
Query: 481 SSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLE 540
S GH L F+NG+ G+ +G + T V + G+N +S+LS VGLP+ G + E
Sbjct: 69 YSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYE 128
Query: 541 R-RVAGLRNVSIQGAKELK-DFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQ 598
V L V++ G E K D S W YQ+GL GE L + + GS V W ++ Q
Sbjct: 129 TWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGS--AAGKQ 186
Query: 599 PLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW------------------- 639
PLTW+K F AP+G PVA+++ SMGKG+AWVNG+ IGRYW
Sbjct: 187 PLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWSYKASSSGCGGCSYAGTYS 246
Query: 640 -VSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
T G SQ +YH+PRS+L P+GNLLV+LEE G G+ + T
Sbjct: 247 ETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVT 293
>gi|219117911|ref|XP_002179741.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408794|gb|EEC48727.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 951
Score = 182 bits (461), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 193/775 (24%), Positives = 319/775 (41%), Gaps = 132/775 (17%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD R++ IN R +L SGS+H R+T W + +A GL+++ +FW H
Sbjct: 146 GNLSVSYDERAIRINDKRVLLLSGSMHPVRATRGTWEHALDEAVYNGLNMITVYIFWGAH 205
Query: 86 EP---QPGQFDFSGRR--------DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFW 134
+ +P + G +L ++ +GL++ +RIGP+ GE+ YGG+P W
Sbjct: 206 QSFRDEPLNWSLDGSSIGPKESQWELADALRSAANRGLFIHVRIGPYACGEYTYGGIPEW 265
Query: 135 L-HDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE------- 186
L + R N P+ M+ + + + + L+A QGGPI+++QIENE
Sbjct: 266 LPLQSSTMRMRRLNRPWLDAMEGFVAATITYLSSFNLWAHQGGPILIAQIENELGSGVDG 325
Query: 187 -----------------------------YGMVEHSFLEKG----------PPYVRWAAK 207
YG + + +G Y W
Sbjct: 326 SAAANYVVLERDEFNDDKHEDSHLLQLDRYGHILENASSRGMDSELRNATVQDYADWCGN 385
Query: 208 LAVDLQTGVPWVMCKQDDAPDPV--INACNGRQCGETF--AGPNSPDKPAIWTENWTSFY 263
L L V W MC A + + N NG E + +G D+PAIWTE+ F
Sbjct: 386 LVARLAPNVIWTMCNGLSAENTISTFNGNNGIDWLEKYGDSGRIQVDQPAIWTEDEGGF- 444
Query: 264 QVYGDEARI-------RSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTG 316
Q++GD+ R++ +A + A+ G+++NYYM+ GG N GR+++A ++
Sbjct: 445 QLWGDQPSKPSDYFWGRTSRAMATDALQWFAR-GGTHLNYYMWWGGYNRGRSSAAGIMNA 503
Query: 317 YYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPML---SGVLVSMNFSKL--------- 364
Y A L G R PK+ H LH + +L + +L + + +
Sbjct: 504 YATDAFLCSSGQRRHPKYDHFLALHLVIADIAAILLHAPTSLLKNASVEIMDGDDWIVGD 563
Query: 365 -QEAFIFQ----GSSECAAFLVNKDK-----RNNATVYFSNLMYELPPLSISILPDC--- 411
Q F++Q S+ FL N R +L++ + P S I+ D
Sbjct: 564 NQRQFLYQVLDTHDSKQVIFLENDANTTEMARLTGAKADDSLVFVMKPYSSQIVIDGIVA 623
Query: 412 --------------KTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDA 457
+T+ + A L + W E T D+ + + LEQ N A
Sbjct: 624 FDSSTISTKAMSFRRTLHYEPAVLLHLTSWSEPIAGADT-DQNAHVSTEPLEQTNLNSKA 682
Query: 458 ---SDYLWYNFRFKHDPSDSESVLKV-SSLGHVLHAFINGEFVGSAHG-KHSDKSFTLE- 511
SDY WY K D S+ L + + L FI+G F+G A+ +H++ L
Sbjct: 683 SISSDYAWYGTDVKIDVVLSQVKLYIGTEKATALAVFIDGAFIGEANNHQHAEGPTVLSI 742
Query: 512 KMVHLINGTNNVSLLSVMVGLPDS----GAYLERRVAGLRNVSIQGAKELKDFSSFSWGY 567
++ L GT+ +++L +G + GA + G+ + G+ L + S G
Sbjct: 743 EIESLAAGTHRLAILCESLGYHNLIGRWGAITTAKPKGITGNVLIGSPLLSENISLVDGR 802
Query: 568 QV-----GLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLIS 622
Q+ GL E+ + + PL W +F +P V +
Sbjct: 803 QMWWSLPGLSVERKAARHGLRRESFEDAAQAEAGLHPL-WSSVLFTSPQFDSTVHSLFLD 861
Query: 623 M--GKGEAWVNGQSIGRYW-VSFLTPQGTPSQSWYHIPRSFLKPTGNL--LVLLE 672
+ G+G W+NG+ +GRYW ++ SQ +Y +P FL G L L+L +
Sbjct: 862 LTSGRGHLWLNGKDLGRYWNITRGNSWNDYSQRYYFLPADFLHLDGQLNELILFD 916
>gi|222616996|gb|EEE53128.1| hypothetical protein OsJ_35926 [Oryza sativa Japonica Group]
Length = 314
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 135/226 (59%), Gaps = 28/226 (12%)
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------- 646
+T+F P G+DPVAI+L SMGKG+AWVNG IGRYW S + P+
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSSSCYYPGAYNERK 141
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q+WYHIPR +LK + NLLVL EE G P IS++ T+C +S+++ P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
P+ +W + + P+++++C G IS+I FASYG P+G C N++ G+CH
Sbjct: 202 PLSAWSHLS----SGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCH 257
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+S++ +V +AC+G C + V + F GDPC G+ K L V+A+C+
Sbjct: 258 ASSTLDLVTEACVGNTKCAISVSNDVF-GDPCRGVLKDLAVEAKCS 302
>gi|77554857|gb|ABA97653.1| Galactose binding lectin domain containing protein, expressed
[Oryza sativa Japonica Group]
Length = 317
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 135/226 (59%), Gaps = 28/226 (12%)
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------- 646
+T+F P G+DPVAI+L SMGKG+AWVNG IGRYW S + P+
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSSSCYYPGAYNERK 141
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q+WYHIPR +LK + NLLVL EE G P IS++ T+C +S+++ P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
P+ +W + + P+++++C G IS+I FASYG P+G C N++ G+CH
Sbjct: 202 PLSAWSHLS----SGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCH 257
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+S++ +V +AC+G C + V + F GDPC G+ K L V+A+C+
Sbjct: 258 ASSTLDLVTEACVGNTKCAISVSNDVF-GDPCRGVLKDLAVEAKCS 302
>gi|125536445|gb|EAY82933.1| hypothetical protein OsI_38150 [Oryza sativa Indica Group]
Length = 314
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/226 (39%), Positives = 134/226 (59%), Gaps = 28/226 (12%)
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ----------------- 646
+T+F P G+DPVAI+L SMGKG+AWVNG IGRYW S + P+
Sbjct: 83 ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYW-SLVAPESGCSSSCYYPGAYNERK 141
Query: 647 -----GTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLP 701
G P+Q+WYHIPR +LK + NLLVL EE G P IS++ +C +S+++ P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKAVCSRISENYYP 201
Query: 702 PVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCH 761
P+ +W + + P+++++C G IS+I FASYG P+G C N++ G+CH
Sbjct: 202 PLSAWSHLS----SGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCH 257
Query: 762 SSNSRAIVEKACLGKRSCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+S++ +V +AC+G C + V + F GDPC G+ K L V+A+C+
Sbjct: 258 ASSTLDLVTEACVGNTKCAISVSNDVF-GDPCRGVLKDLAVEAKCS 302
>gi|297841097|ref|XP_002888430.1| hypothetical protein ARALYDRAFT_338750 [Arabidopsis lyrata subsp.
lyrata]
gi|297334271|gb|EFH64689.1| hypothetical protein ARALYDRAFT_338750 [Arabidopsis lyrata subsp.
lyrata]
Length = 470
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/260 (40%), Positives = 146/260 (56%), Gaps = 42/260 (16%)
Query: 426 QWEEYKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVL 478
++E + E IP+ D SL L E TKD +DY WY K + D +++L
Sbjct: 208 KFEMFSEDIPSILDGDSL---ILGELYYLTKDKTDYAWYTTSIKIEDDDIPDQKGQKTIL 264
Query: 479 KVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY 538
+V+ LGH L ++NGE+ ++L N +S+L V+ GLPDSG+Y
Sbjct: 265 RVAGLGHTLIVYVNGEYA-----------------INLRTRDNCISILGVLTGLPDSGSY 307
Query: 539 LERRVAGLRNVSIQGAKE-LKDF-SSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
+E AG R VSI G K +D + WG+ V +T+ GS+ V W +YG
Sbjct: 308 MEHTYAGPRGVSIIGLKSGTRDLIENNEWGHLV---------YTEEGSKKVKWEKYGE-- 356
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
H+PLTWYKT F+ P G + VAI + MGKG WVNG +GRYW+SF++P G P Q+ YHI
Sbjct: 357 HKPLTWYKTYFETPEGENAVAIRMKGMGKGLIWVNGIGVGRYWMSFVSPLGEPIQTEYHI 416
Query: 657 PRSFLK--PTGNLLVLLEEE 674
PRSF+K ++LV+LEEE
Sbjct: 417 PRSFMKEEKKKSMLVILEEE 436
>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
Length = 586
Score = 172 bits (436), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 109/292 (37%), Positives = 153/292 (52%), Gaps = 26/292 (8%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ +TYD S +++G L SG++HY R+ P+ W + K K G + V+T V WNLHEP
Sbjct: 2 SQLTYDD-SFLLDGKEIRLLSGAMHYFRTVPEYWEDRLLKLKACGFNTVETYVAWNLHEP 60
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
+ GQF F G D+VRFIK + GL+V +R GPFI EW +GG P+WL VP I R N
Sbjct: 61 EEGQFVFEGIADIVRFIKTAEKVGLHVIVRPGPFICAEWEFGGFPYWLLTVPNIKLRCFN 120
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
+P+ + Y ++ ++ L +S GGPII QIENEYG + +K Y+R K
Sbjct: 121 QPYLEKVDAYFDVLFERLRP--LLSSNGGPIIALQIENEYGSFGND--QKYLQYLRDGIK 176
Query: 208 LAVDLQTGVPWVMCKQDDAPDP----------VINACN-GRQCGETFAGPN--SPDKPAI 254
V + + D P+P + N G + FA P+ P +
Sbjct: 177 KRVGNE------LLFTSDGPEPSMLSGGMIEGIFETVNFGSRAESAFAQLKQYQPNAPLM 230
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+E RSAE + + I K GS VN+YM HGGTNFG
Sbjct: 231 CMEFWHGWFDHWGEEHHTRSAESVVETLEE-ILKQNGS-VNFYMAHGGTNFG 280
>gi|125526285|gb|EAY74399.1| hypothetical protein OsI_02287 [Oryza sativa Indica Group]
Length = 255
Score = 172 bits (436), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 115/202 (56%), Gaps = 48/202 (23%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G +V+YD RSL+I+G R+I+ SGSIHYPRSTP+
Sbjct: 26 GCTSVSYDDRSLVIDGQRRIILSGSIHYPRSTPE-------------------------- 59
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
E+Q G+Y LRIGP+I GEW YGGLP WL D+PG+ FR
Sbjct: 60 --------------------EIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRL 99
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVR 203
NEPF+ M+ + T+IVN MK ++++A QGGPIIL+QIENEYG M + + + Y+
Sbjct: 100 HNEPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIH 159
Query: 204 WAAKLAVDLQTGVPWVMCKQDD 225
W A +A GVPW+MC+QDD
Sbjct: 160 WCADMANKQNVGVPWIMCQQDD 181
>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
Length = 144
Score = 172 bits (435), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 76/124 (61%), Positives = 94/124 (75%), Gaps = 1/124 (0%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP- 87
NV+YD RSLIING RK+L S +IHYPRS P MWP L+ AKEGG+DV++T VFWN+H+P
Sbjct: 20 NVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVHQPT 79
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
P ++ F GR DLV+FI VQ G+Y+ LRIGPF+ EW +GG+P WLH V G VFR+DN
Sbjct: 80 SPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFRTDN 139
Query: 148 EPFK 151
FK
Sbjct: 140 YNFK 143
>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
Length = 172
Score = 171 bits (434), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 88/162 (54%), Positives = 106/162 (65%), Gaps = 3/162 (1%)
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
+A+ L TGVPW+MCKQ+DAP P+I+ CNG C E F PNS +KP +WTENWT +Y +G
Sbjct: 1 MALGLSTGVPWIMCKQEDAPGPIIDTCNGYYC-EDFK-PNSINKPKMWTENWTGWYTDFG 58
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYG 327
R EDIAY VA FI K GS VNYYMYHGGTNF RTA ++ + Y APLDEYG
Sbjct: 59 GAVPYRPVEDIAYSVARFIQK-GGSLVNYYMYHGGTNFDRTAGEFMASSYDYDAPLDEYG 117
Query: 328 LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI 369
L R+PK+ HLK LH A+KL +LS + QE I
Sbjct: 118 LPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEVTI 159
>gi|297840773|ref|XP_002888268.1| hypothetical protein ARALYDRAFT_338522 [Arabidopsis lyrata subsp.
lyrata]
gi|297334109|gb|EFH64527.1| hypothetical protein ARALYDRAFT_338522 [Arabidopsis lyrata subsp.
lyrata]
Length = 246
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 104/256 (40%), Positives = 143/256 (55%), Gaps = 42/256 (16%)
Query: 430 YKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSS 482
+ E IP+ D SL L E TKD +DY WY K + D +++L+V+
Sbjct: 2 FSEDIPSILDGDSL---ILGELYYLTKDKTDYAWYTTSIKIEDDDIPDQKGQKTILRVAG 58
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
LGH L ++NGE+ ++L N +S+L V+ GLPDSG+Y+E
Sbjct: 59 LGHALIVYVNGEYA-----------------INLRTRDNCISILGVLTGLPDSGSYMEHT 101
Query: 543 VAGLRNVSIQGAKE-LKDF-SSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
AG R VSI G K +D + WG+ V +T+ GS+ V W +YG H+PL
Sbjct: 102 YAGPRGVSIIGLKSGTRDLIENNEWGHLV---------YTEEGSKKVKWEKYGE--HKPL 150
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWYKT F+ P G + VAI + MGKG WVNG +GRYW+SF++P G P Q+ YHIPRSF
Sbjct: 151 TWYKTYFETPEGENAVAIRMKGMGKGLIWVNGIGVGRYWMSFVSPLGEPIQTEYHIPRSF 210
Query: 661 LK--PTGNLLVLLEEE 674
+K ++LV+LEEE
Sbjct: 211 MKEEKKKSMLVILEEE 226
>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
Length = 857
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/339 (32%), Positives = 173/339 (51%), Gaps = 21/339 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ +D S II+G RK + S ++HY R W +I KA+ GG + ++T + WN HE
Sbjct: 2 IQFDSNSWIIDGKRKFIISAAVHYFRLPRAEWAAVIRKARLGGCNAIETYIAWNYHETAE 61
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
Q+DFSG +DL F +G+YV +R GP+I EW +GGLP++L++ GI +R N
Sbjct: 62 EQWDFSGDKDLAAFFAICHDEGMYVIVRPGPYICAEWDFGGLPYYLNNTDGIEYRCSNAA 121
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
++ ++RY I+ +++ +L GG II+ QIENEY H+F +K ++R+ +L
Sbjct: 122 YEQAVRRYFERIMPIIRRYQL--GSGGSIIMVQIENEY----HAFGKKDLAHIRFLEELT 175
Query: 210 VDLQTGVPWVMC-KQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
VP V C + N +G + +P E W + + +G
Sbjct: 176 RGFGITVPLVSCYGAGRNTVEMRNFWSGAERAAAVLRERQSGQPLGIMEFWIGWVEHWGG 235
Query: 269 E-ARIRSAEDIAYHVALFIAKMKGSYV--NYYMYHGGTNF----GRTASAY--VLTGYYD 319
E + + AE + H +K +V NYYMY GG+NF GRT A+ +T YD
Sbjct: 236 EPQKHKPAEAVLSHC---FEALKSGFVFFNYYMYFGGSNFGSWGGRTIGAHKIFMTQSYD 292
Query: 320 -QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLV 357
APLDE+G K+ L LH+ + + +G L+
Sbjct: 293 YDAPLDEFG-FETEKYRLLAVLHTFIAWLENDLTAGSLL 330
Score = 48.5 bits (114), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 76/176 (43%), Gaps = 35/176 (19%)
Query: 516 LINGTNNVSLLSVMVG-LPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGE 574
L +GTN + L + G + +L LRN +++ E++DF L +
Sbjct: 710 LTSGTNELYLDVLQKGTIQKLSLFLAAESDRLRNWNVRPIAEVQDF----------LSAK 759
Query: 575 KLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVA---INLISMGKGEAWVN 631
L ++TD G +I P ++YKT PV + L S+ KG + N
Sbjct: 760 NLPMYTDTG-KIFP------------SFYKTRVRLSPAKTPVLAAYLKLGSLQKGNIYFN 806
Query: 632 GQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVS 687
G IGR+W Q Y IP S L+ T N LV+ +E P G+S+ V+
Sbjct: 807 GFDIGRFW-------NIGPQIKYKIPVSLLQET-NELVIFDEYGANPNGVSLCIVT 854
>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 110/163 (67%), Gaps = 11/163 (6%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MW L+ AKEGG+DV++T VF N HE P + F G DL++F+K VQ G+Y+ L IG
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPII 179
PF+ EW +G +F+++++PFK+HM+++ T+IVN+MK +L+ASQGGPII
Sbjct: 61 PFVATEWNFG-----------TIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 109
Query: 180 LSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK 222
L+Q +NEYG + + + G PYV WAA + + GVPW+MC+
Sbjct: 110 LTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQ 152
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 30/41 (73%), Gaps = 1/41 (2%)
Query: 294 VNYYMYHGGTNFGRTASA-YVLTGYYDQAPLDEYGLLRQPK 333
VNYYMYHGGTNFG T+ ++ T Y AP+DEYGL R PK
Sbjct: 237 VNYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK 277
>gi|217070908|gb|ACJ83814.1| unknown [Medicago truncatula]
Length = 200
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 94/207 (45%), Positives = 119/207 (57%), Gaps = 31/207 (14%)
Query: 623 MGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSF 660
MGKGEAWVNGQSIGRYW +++ G PSQ+ YH+PRSF
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSF 60
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
LKP GN LVL EE G P IS T + ++C HVSDSH P + W + K
Sbjct: 61 LKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGGKVG--- 117
Query: 721 PGRRPKVQIRCPSGRK-ISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSC 779
P + + CP+ + IS I FASYG P G C N+ G C S+ + +IV+KAC+G RSC
Sbjct: 118 ----PALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSC 173
Query: 780 TVPVWTEKFYGDPCPGIPKALLVDAQC 806
+V V T+ F GDPC G+PK+L V+A C
Sbjct: 174 SVGVSTDTF-GDPCRGVPKSLAVEATC 199
>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
25986]
gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
Length = 598
Score = 169 bits (428), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 180/676 (26%), Positives = 284/676 (42%), Gaps = 139/676 (20%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P W + K G + V+T V WNLHEP+PG FDFSG DL F+
Sbjct: 19 ILSGAIHYMRVHPSDWHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSGSIDLAAFLD 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
E + GLY +R PFI EW +GG+P WL + RS + F H+ +Y ++ ++
Sbjct: 79 EAASLGLYAIVRPSPFICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQYYDHLMPIL 138
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ ++ +GG II+ Q+ENEYG S+ E Y+R +L V+ VP +C D
Sbjct: 139 VSRQI--DKGGNIIMMQVENEYG----SYCED-KDYLRAIRRLMVERGVSVP--LCTSDG 189
Query: 226 -----------APDPVINACN-GRQCGETFAGPNSPDK------PAIWTENWTSFYQVYG 267
D V+ N G E F ++ K P + E W ++ YG
Sbjct: 190 PWRGCLRAGTLIDDDVLCTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWDGWFNRYG 249
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--------RTASAYVLTGYYD 319
+ R ED+A V + ++ GS +N YM+HGGTNFG T + +T Y
Sbjct: 250 ENVIRRDPEDLASCVRE-VLELGGS-LNLYMFHGGTNFGFMNGCSARHTHDLHQVTSYDY 307
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
APLDE G+ E + A++ + + + S +K +AF
Sbjct: 308 DAPLDEQ--------GNPTEKYFAIQRTVHELYPDIAQSKPLTK--KAF----------- 346
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDE 439
+P +S+S + FN LD + + E + +P
Sbjct: 347 -------------------SMPDISVSE----RVSLFNV--LDILSEPIEAQYPMP---- 377
Query: 440 TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSA 499
+E+M + Y Y + D +D E + + + F+NG+ V +
Sbjct: 378 --------MEEMGQSYG---YTLYTTTVERDRADEERIRVIDARDRA-QMFVNGDKVATQ 425
Query: 500 HGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKD 559
+ +H + + +H + LP L+ + V+ G K L D
Sbjct: 426 YQEH------IGEDIHCV--------------LPCEHNRLDVLTEDMGRVNY-GHKLLAD 464
Query: 560 FSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH-------QPLTWYKTVFDAPTG 612
+ G + G+ + L T + R +P + + QP ++Y+ FD
Sbjct: 465 --TQHKGIRTGVCVD-LHFVTGWEMRCLPLDNIDNLDYSAGWVEGQP-SFYRAKFDISEP 520
Query: 613 SDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
+D I+ GKG A+VNG ++GR+W P + Y +P L P N LV+ E
Sbjct: 521 ADTF-IDTTGFGKGVAFVNGTNVGRFW------DKGPIMTLY-VPHGLLHPGTNELVMFE 572
Query: 673 EENGYPPGISIDTVSV 688
E Y IS+ + V
Sbjct: 573 TEGVYDAKISLRSEPV 588
>gi|414879450|tpg|DAA56581.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
Length = 154
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 74/104 (71%), Positives = 88/104 (84%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G VTYDGR+LI++G R++LFSG +HYPRSTP+MWP LIAKAK+GGLDV+QT VFWN H
Sbjct: 34 GRGEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAH 93
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYG 129
EP GQF+F GR DLV+FI+E+ AQGLYV LRIGPF+E EW YG
Sbjct: 94 EPVQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYG 137
>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
Length = 469
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/348 (34%), Positives = 173/348 (49%), Gaps = 61/348 (17%)
Query: 298 MYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVL 356
MYHG TNF RTA +T YD APLDE+G L QPK+GHLK+LH K + G +
Sbjct: 23 MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLTYGNI 82
Query: 357 VSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAF 416
+ +F L ++Q + F+ N NA + F Y++P +SILPDCKT ++
Sbjct: 83 STADFGNLVMTTVYQTEEGSSCFIGNV----NAKINFQGTSYDVPAWYVSILPDCKTESY 138
Query: 417 NTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWY----NFRFKHDPS 472
NTAK + TSLR N + D SD+LWY N + + DP+
Sbjct: 139 NTAKRMKLR--------------TSLRFK------NVSNDESDFLWYMTTVNLK-EQDPA 177
Query: 473 DSESV-LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVG 531
+++ L+++S HVLH F+NG+ G+ ++ + E+ G N ++LLSV V
Sbjct: 178 WGKNMSLRINSTAHVLHGFVNGQHTGNYRVENGKFHYVFEQDAKFNPGVNVITLLSVTVD 237
Query: 532 LPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
LP+ GA+ E AG I G + +G G++ + +
Sbjct: 238 LPNYGAFFENVPAG-----ITGPV-----------FIIGRNGDETVV------------K 269
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYW 639
Y STH T T+F AP GS+PV ++L+ GKG+A +N GRYW
Sbjct: 270 Y-LSTHNGAT-KLTIFKAPLGSEPVVVDLLGFGKGKASINENYTGRYW 315
>gi|242077941|ref|XP_002443739.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
gi|241940089|gb|EES13234.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
Length = 111
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 75/108 (69%), Positives = 88/108 (81%)
Query: 59 QMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRI 118
QMWP+LIAKAKEGGLDV+QT VFWN+HEP GQ++F GR D VRFIKE+Q QGLYV LRI
Sbjct: 1 QMWPKLIAKAKEGGLDVIQTYVFWNVHEPVQGQYNFEGRYDFVRFIKEIQGQGLYVNLRI 60
Query: 119 GPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMK 166
GPFIE EW YGG PFWLHDVP I FRSDNEPFK ++ +V++++
Sbjct: 61 GPFIESEWKYGGFPFWLHDVPNITFRSDNEPFKPSVRNMLGELVSLLE 108
>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
Length = 288
Score = 166 bits (420), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 83/171 (48%), Positives = 111/171 (64%), Gaps = 4/171 (2%)
Query: 178 IILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGR 237
++L + G +E+ + + G Y +WAAK A+ L GVPWVMC+Q DAP +I+ CN
Sbjct: 32 LVLGTVSLGVGAIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTCNAY 91
Query: 238 QCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYY 297
C + F PNS +KP +WTENW +Y +G+ R ED+A+ VA F + GS+ NYY
Sbjct: 92 YC-DGFK-PNSHNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQR-GGSFQNYY 148
Query: 298 MYHGGTNFGRTASAYVLTGYYDQ-APLDEYGLLRQPKWGHLKELHSAVKLC 347
MY G TNFGRTA + YD A +DEYG LR+PKWGHLK+LH+A+KLC
Sbjct: 149 MYFGRTNFGRTAGGPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLC 199
>gi|388493008|gb|AFK34570.1| unknown [Lotus japonicus]
Length = 189
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/191 (42%), Positives = 118/191 (61%), Gaps = 6/191 (3%)
Query: 620 LISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
+ MGKG WVNG+SIGR+WVSFL+P G P+Q+ YHIPR++L P NLLV+LEE+ G P
Sbjct: 1 MTGMGKGMIWVNGRSIGRHWVSFLSPLGLPTQAEYHIPRAYLNPKDNLLVILEEDQGTPE 60
Query: 680 GISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISK 739
I I V+ T+C + +S P V SW S + + R+ + + C SG+KI
Sbjct: 61 KIEIMNVNRDTVCSIIEESDPPNVNSWVSSHG---QFRPRVSNVATQASLSCGSGKKIVA 117
Query: 740 ILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPVWTEKFY---GDPCPGI 796
+ FAS+GNP+G+C +G C+++ ++ IVE+ CLGK SC V + F D CPG+
Sbjct: 118 VEFASFGNPSGSCGKLVLGDCNAAATQQIVEQQCLGKGSCNVDLNRATFIKNGKDACPGL 177
Query: 797 PKALLVDAQCT 807
K L + +C+
Sbjct: 178 VKKLAIQVKCS 188
>gi|388518087|gb|AFK47105.1| unknown [Lotus japonicus]
Length = 220
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 23/206 (11%)
Query: 623 MGKGEAWVNGQSIGRYWVSF---------------------LTPQGTPSQSWYHIPRSFL 661
MGKG+AWVNG IGRYW T G P+Q+ YH+PRS+L
Sbjct: 1 MGKGQAWVNGHHIGRYWTRVSPKSGCEQVCDYRGAYNSDKCTTNCGKPTQTLYHVPRSWL 60
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIP 721
K + NLLV+ EE G P IS+ S +C VS+SH P+ + N +
Sbjct: 61 KASDNLLVIFEETGGNPFRISVKLHSARIVCAKVSESHYQPL--HKLMNADLIGHEVSAN 118
Query: 722 GRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCTV 781
P++ +RC GR IS I FASYGNP G+C++++ G+CH+ +S AIV KAC GKRSC++
Sbjct: 119 SMIPELHLRCQDGRIISSITFASYGNPEGSCQSFSRGNCHAPSSMAIVSKACQGKRSCSI 178
Query: 782 PVWTEKFYGDPCPGIPKALLVDAQCT 807
+ F GDPC G+ K L V+A+CT
Sbjct: 179 KISDTIFGGDPCQGVMKTLSVEARCT 204
>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
Length = 582
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/309 (33%), Positives = 152/309 (49%), Gaps = 26/309 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG++HY R P +W I KA+ GL+ ++T V WN H P+ G FD
Sbjct: 10 DFLLDGEPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTD 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RF+++V A GLY +R GP+I EW GGLP WL PG+ R F ++
Sbjct: 70 GMLDLGRFLEQVAAAGLYAIVRPGPYICAEWDNGGLPAWLFQEPGVGVRRYEPRFLAAVE 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+Y ++++++ L QGGP++L Q+ENEYG + P Y+ A +
Sbjct: 130 QYLEQVLDLVRP--LQVDQGGPVLLLQVENEYGAFGND-----PEYLEAVAGMIRKAGIT 182
Query: 216 VPWVMCKQDDAP-------DPVINACN-GRQCGETFAG--PNSPDKPAIWTENWTSFYQV 265
VP V Q D V+ + G + E A + P P + E W ++
Sbjct: 183 VPLVTVDQPTGEMLAAGGLDGVLRTGSFGSRSAERLATLREHQPTGPLMCMEFWDGWFDH 242
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYY 318
+G S ED A + +A G+ VN YM+HGGTNFG T+ A +T Y
Sbjct: 243 WGGPHHTTSVEDAARELDALLA--AGASVNIYMFHGGTNFGLTSGADDKGVFRPTVTSYD 300
Query: 319 DQAPLDEYG 327
APLDE G
Sbjct: 301 YDAPLDEAG 309
>gi|449532986|ref|XP_004173458.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
Length = 213
Score = 163 bits (413), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/213 (43%), Positives = 126/213 (59%), Gaps = 23/213 (10%)
Query: 498 SAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE 556
S +G D T K V+L G N +S+LSV VGLP+ G + + AG L V+++G E
Sbjct: 1 SVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNE 60
Query: 557 -LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDP 615
+D S + W Y+VGL GE L +++ GS V W + GS QPLTWYKT F+ P G++P
Sbjct: 61 GTRDMSKYKWSYKVGLKGEILNLYSVKGSNSVQWMK-GSFQKQPLTWYKTTFNTPAGNEP 119
Query: 616 VAINLISMGKGEAWVNGQSIGRY--------------WVSFLTPQ------GTPSQSWYH 655
+A+++ SM KG+ WVNG+SIGRY + F T + G PSQ WYH
Sbjct: 120 LALDMSSMSKGQIWVNGRSIGRYFPGYIASGKCNKCSYTGFFTEKKCLWNCGGPSQKWYH 179
Query: 656 IPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
IPR +L P GNLL++LEE G P GIS+ +V
Sbjct: 180 IPRDWLSPNGNLLIILEEIGGNPQGISLVKRTV 212
>gi|297788786|ref|XP_002862437.1| hypothetical protein ARALYDRAFT_359611 [Arabidopsis lyrata subsp.
lyrata]
gi|297307951|gb|EFH38695.1| hypothetical protein ARALYDRAFT_359611 [Arabidopsis lyrata subsp.
lyrata]
Length = 256
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 103/256 (40%), Positives = 141/256 (55%), Gaps = 46/256 (17%)
Query: 430 YKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSS 482
+ E IP+ D SL L E TKD +DY WY K + D +++L+V+
Sbjct: 2 FSEDIPSILDGDSL---ILGELYYLTKDKTDYAWYTTSIKIEDDDIPDQKGQKTILRVAG 58
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
LGH L ++NGE+ ++L N +S+L V+ GLPDSG+Y+E
Sbjct: 59 LGHALIVYVNGEYA-----------------INLRTRDNCISILGVLTGLPDSGSYMEHT 101
Query: 543 VAGLRNVSIQGAKE-LKDF-SSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
AG R VSI G K +D + WG+ V +T+ GS+ V W +YG H+PL
Sbjct: 102 YAGPRGVSIIGLKSGTRDLIENNEWGHLV---------YTEEGSKKVKWEKYGE--HKPL 150
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWYKT P G + VAI + MGKG WVNG +GRYW+SF++P G P Q+ YHIPRSF
Sbjct: 151 TWYKT----PEGENAVAIRMKGMGKGLIWVNGIGVGRYWMSFVSPLGEPIQTEYHIPRSF 206
Query: 661 LK--PTGNLLVLLEEE 674
+K ++LV+LEEE
Sbjct: 207 MKEEKKKSMLVILEEE 222
>gi|217075719|gb|ACJ86219.1| unknown [Medicago truncatula]
Length = 200
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 117/208 (56%), Gaps = 31/208 (14%)
Query: 623 MGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSF 660
MGKGEAWVNGQSIGRYW ++++P G PSQ+ YH+PR++
Sbjct: 1 MGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAW 60
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
LKP N VL EE G P IS T + ++C HV++SH PPV +W S + K
Sbjct: 61 LKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTESHPPPVDTWNSNAESERKVG--- 117
Query: 721 PGRRPKVQIRCP-SGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSC 779
P + + CP + IS I FAS+G P C NY GSC S+ + +IV+KAC+G SC
Sbjct: 118 ----PVLSLECPYPNQAISSIKFASFGTPRRTCGNYNHGSCSSNRALSIVQKACIGSSSC 173
Query: 780 TVPVWTEKFYGDPCPGIPKALLVDAQCT 807
+ V F G+PC G+ K+L V+A CT
Sbjct: 174 NIGVSINTF-GNPCRGVTKSLAVEAACT 200
>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
Length = 615
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 111/339 (32%), Positives = 163/339 (48%), Gaps = 38/339 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+T+ + + G + SGS+HY R P+ W + + GL+ V T V WN HE +
Sbjct: 24 TLTHTHGAFLRRGRPHRVLSGSLHYFRVHPEQWADRLDRLAALGLNTVDTYVPWNFHERR 83
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+ F G RDL RF++ Q GL V +R GP+I EW GGLP WL PG+ R+ ++
Sbjct: 84 PGEARFDGWRDLARFVRLAQRAGLDVMVRPGPYICAEWDNGGLPAWLTGTPGMRLRAGHQ 143
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
P+ + R+ +V + A L A GGP++ QIENEYG +H+ YVRW
Sbjct: 144 PYLDAVARWFDALVP--RVAELQAVHGGPVVAVQIENEYGSYGDDHA-------YVRWVR 194
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVI---NACNGRQCGETFAG----------PNSPDKPA 253
VD G+ ++ D P P++ G TF P +P
Sbjct: 195 DALVD--RGITELLYTA-DGPTPLMLDGGTVPGELAAATFGSRAAEAAALLRSRRPGEPF 251
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY- 312
+ E W ++ +G++ +RS + A V + G V+ YM HGGTNFG A A
Sbjct: 252 LCAEFWNGWFDHWGEKHHVRSRDGAAQEVEEILD--AGGSVSLYMAHGGTNFGLWAGANH 309
Query: 313 -------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
+T Y AP+ E+G L PK+ L+E +A+
Sbjct: 310 DGGVLRPTVTSYDSDAPVSEHGAL-TPKFHALRERFAAL 347
>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
Length = 655
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 117/365 (32%), Positives = 173/365 (47%), Gaps = 48/365 (13%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ +NG + +L SG++HY R P+ W + K K GL+ V+T V WN HE G FDFS
Sbjct: 10 AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RFI+ Q GLYV LR GP+I EW +GGLP WL P + R+ P+ +
Sbjct: 70 GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVRWA 205
Y I+ ++ ++ S+GGPII Q+ENEYG +++ F++ G + +
Sbjct: 130 AYLAKILPLVNDLQM--SKGGPIIAVQLENEYGSYGDDLDYKLFLKNQFIKYGIEELLFT 187
Query: 206 AKLAVDLQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
+ +Q G +P V+ + G E P P + E W+ ++
Sbjct: 188 SDNGTGIQNGPIPGVLATTNFQEQE-----QGYLMFEYLRNIKQPGLPMMVMEFWSGWFD 242
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMK-----GSYVNYYMYHGGTNFGRTASAYV------ 313
+G++ + H A FI K GS VN+YM+HGGTNFG A A
Sbjct: 243 HWGEQHNL-------CHHAEFIDVFKWILLEGSSVNFYMFHGGTNFGFMAGANEDFGATN 295
Query: 314 ----------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
T Y P+ E G L + K+ ++ + S +K L P SG LV +F
Sbjct: 296 EGGGEPYAADTTSYDYDCPVSESGQLNE-KFYEIRNILSEMKTLLPPG-SGGLVKKHFFS 353
Query: 364 LQEAF 368
+ + F
Sbjct: 354 IIKFF 358
>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
Length = 606
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 159/335 (47%), Gaps = 31/335 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+T+ G +L+ G + SGS+HY R P W +A+ GL+ V T V WN HE
Sbjct: 16 TLTHAGGTLLRAGRPHRILSGSLHYFRVHPGQWADRLARLAALGLNTVDTYVPWNFHERT 75
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG F G RDL RF++ Q GL V +R GP+I EW GGLP WL PG+ R+ +
Sbjct: 76 PGDVRFDGWRDLDRFVRLAQETGLDVIVRPGPYICAEWDNGGLPAWLTGTPGMRPRTSHP 135
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF + R+ ++ + A L A +GGP++ QIENEYG S+ + G YVRW
Sbjct: 136 PFLAAVARWFDQLIPRIAA--LQAGRGGPVVAVQIENEYG----SYGDDG-DYVRWVRDA 188
Query: 209 AVDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAG----------PNSPDKPAIWT 256
GV ++ D + +++ A G TF P++P
Sbjct: 189 LT--ARGVTELLYTADGPTELMLDAGAVEGELAAATFGSRPEQAARLLRSRRPEEPFFCA 246
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY---- 312
E W ++ +G++ +R A A V + G ++ YM HGGTNFG A A
Sbjct: 247 EFWNGWFDHWGEQHHVRPARSAADDVGRILG--AGGSLSLYMAHGGTNFGLWAGANHDGD 304
Query: 313 ----VLTGYYDQAPLDEYGLLRQPKWGHLKELHSA 343
+T Y AP+ E+G L + + EL +A
Sbjct: 305 RLQPTVTSYDSDAPVAEHGALTEKFFALRDELTAA 339
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/58 (41%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+ L GKG WVNG +GRYW + PQ T ++P FL P N L +LE E
Sbjct: 532 VALPGFGKGFCWVNGHLLGRYW--HIGPQTT-----LYLPAPFLHPGDNTLTVLELER 582
>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon
queenslandica]
Length = 689
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 115/332 (34%), Positives = 165/332 (49%), Gaps = 27/332 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
++ D S I G + + SGSIHY R P W + K K GL+ V T V WNLHEP P
Sbjct: 71 LSLDEDSFYIRGKKTHILSGSIHYFRVVPDYWTDRLKKLKAMGLNTVDTYVSWNLHEPMP 130
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+FDFSG ++ FIK + L V +R GP+I EW GGLP WL P + RS+ +P
Sbjct: 131 GEFDFSGLLNIHEFIKIAHSLELNVIVRPGPYICSEWDNGGLPAWLLHDPNMKIRSNYKP 190
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL- 208
++ +KR+ T + ++ L +S GGPII Q+ENEY G ++++ A L
Sbjct: 191 YQDAVKRFFTKLFEILTP--LQSSYGGPIIAFQVENEYAAYGPRN-ATGRHHMQYLANLM 247
Query: 209 ----AVDL---QTGVPWVMCKQDDAPDPVINACNGRQCGETFAGP---NSPDKPAIWTEN 258
AV+L G + D AP+ + N + P+KP + E
Sbjct: 248 RSLGAVELFITSDGQNDIKASSDMAPNNALLTVNFQNDPSEALNKLLLVQPNKPPLVMEY 307
Query: 259 WTSFYQVYGDE--ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
WT ++ +G R S + ++ I +M GS+ N YM+HGGTNFG A +
Sbjct: 308 WTGWFDHWGRRHLERTLSPSQLIVNIGT-ILQMGGSF-NLYMFHGGTNFGFMNGANIEGG 365
Query: 314 -----LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y APL E G + + K+ L+EL
Sbjct: 366 EYRPDVTSYDYDAPLSEAGDITK-KYTLLREL 396
>gi|166092020|gb|ABY82047.1| beta-galactosidase [Hymenaea courbaril var. stilbocarpa]
Length = 138
Score = 159 bits (401), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 83/138 (60%), Positives = 91/138 (65%), Gaps = 3/138 (2%)
Query: 182 QIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGE 241
QIENEYG VE G Y WAAK+AV L TGVPWVMCKQDDAPDPVI+ CNG C E
Sbjct: 1 QIENEYGPVEWEIRAPGKAYTAWAAKMAVGLNTGVPWVMCKQDDAPDPVIDTCNGYYC-E 59
Query: 242 TFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHG 301
F PN KP +WTENW+ +Y YG R EDIAY V FI + GS+VNYYMYHG
Sbjct: 60 NFT-PNKNYKPKMWTENWSGWYTEYGGAVPKRPVEDIAYSVTRFI-QNGGSFVNYYMYHG 117
Query: 302 GTNFGRTASAYVLTGYYD 319
GTNFGRT S + YD
Sbjct: 118 GTNFGRTYSGLFIATSYD 135
>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
18170]
gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
Length = 784
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 156/326 (47%), Gaps = 38/326 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W I + K G++ + VFWN HE +PG+FDF+
Sbjct: 39 TFLLNGEPFVVKAAELHYPRIPRAYWEHRIKQCKALGMNTICLYVFWNFHEEKPGEFDFT 98
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G++DL F + Q +YV LR GP++ EW GGLP+WL I R D+ F +
Sbjct: 99 GQKDLAEFCRLCQKNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREDDPYFLERVA 158
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHS--FLEKGPPYVR---------- 203
+ + N + A L +GGPII+ Q+ENEYG S ++ K VR
Sbjct: 159 IFEKEVAN--QVAGLTIQKGGPIIMVQVENEYGSYGESKEYVAKIRDIVRGNFGDVTLFQ 216
Query: 204 --WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENW 259
WA+ ++ + W M N G E FA PD P + +E W
Sbjct: 217 CDWASNFQLNALDDLVWTM-----------NFGTGANIDEQFAPLKKVRPDSPLMCSEFW 265
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R+A+D+ + ++ KG + YM HGGTN+G A A
Sbjct: 266 SGWFDKWGANHETRAADDMIAGIDEMLS--KGISFSLYMTHGGTNWGHWAGANSPGFAPD 323
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y AP+ E G + PK+ L+E
Sbjct: 324 VTSYDYDAPISESGKI-TPKYEKLRE 348
>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
Length = 589
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 148/287 (51%), Gaps = 26/287 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
I++G + SG+IHY R P+ W + K G + V+T + WNLHEP+ G+FDF
Sbjct: 9 EFIVDGKPIKILSGAIHYFRIVPKHWEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQ 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +D+V FIK+ Q L V +R P+I EW +GGLP WL + RSD + +K
Sbjct: 69 GIKDVVSFIKKAQEMELMVIVRPSPYICAEWEFGGLPAWLLTYDNLHLRSDCPRYLEKVK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y +++ M+ + L ++QGGPII+ Q+ENE+G ++ Y++ K+ +DL
Sbjct: 129 NYYEVLLPMLTS--LQSTQGGPIIMMQVENEFGSFSNN-----KTYLKKLKKIMLDLGVE 181
Query: 216 VP-------WVMCKQDDA---PDPVINACNGRQCGET------FAGPNSPDKPAIWTENW 259
VP W + + D ++ A G E F + P + E W
Sbjct: 182 VPLFTSDGSWQQALESGSLIDDDVLVTANFGSHSHENLDVLEQFMANHQKKWPLMSMEFW 241
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
++ +G+E R A+D+A V + +GS +N YM+HGGTNFG
Sbjct: 242 DGWFNRWGEEIITRDAQDLANCVKELLT--RGS-INLYMFHGGTNFG 285
>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
Length = 867
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 122/403 (30%), Positives = 196/403 (48%), Gaps = 33/403 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+TYD +S I+ R + S +IHY R W ++ KAK GG + ++T + WN HE +
Sbjct: 2 ITYDKKSWKIHNKRIFILSAAIHYFRLPKAEWDDVLEKAKAGGCNTIETYIPWNFHEMKE 61
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++DFSG +DL F++ +GLYV R GP+I EW +GG P+WL I +RS
Sbjct: 62 GEWDFSGDKDLAHFLQLCANKGLYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPS 121
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE---YGMVEHSFLEKGPPYVRWAA 206
F ++ +Y +++++ +L ++ G +I+ QIENE YG + ++E Y+R
Sbjct: 122 FLHYVDQYFDQVISIIDEYQL--TKNGSVIMVQIENEFQAYGKPDKKYME----YLR-DG 174
Query: 207 KLAVDLQTGVPWVMC-KQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
+A ++ VP+V C D N +G D+P E W +++
Sbjct: 175 MIARGIE--VPFVTCYGAVDGAVEFRNFWSGANRAAEILDERFADQPKGVMEFWIGWFEH 232
Query: 266 Y-GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF----GRTASAYVL--TGYY 318
+ G++A ++ E + + + + +NYYMY GGTNF GRT S V T Y
Sbjct: 233 WGGNKANQKTPEQLERECYQLL-RNGFTTINYYMYFGGTNFDHWGGRTVSEQVFCTTTYD 291
Query: 319 DQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN----FSKLQEAFIFQGSS 374
+DEY L K+ LK H VK L+P+ + + + S L+ I
Sbjct: 292 YDVAIDEY-LQPTRKYEVLKRYHLFVKW-LEPLFTNAEQANSDVKLSSDLKSGRIVSPHG 349
Query: 375 ECAAFLVNKDKRNNATVYFSNLMYELPPLSI---SILPDCKTV 414
E N+++R + V N EL P +I ++LP + V
Sbjct: 350 EVLFIENNRNERIQSHVKHGN---ELVPFTIEANAVLPIVRNV 389
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 74/164 (45%), Gaps = 13/164 (7%)
Query: 527 SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
S + G+ D A L ++ + ++ +Q ++ F + + + + G K + F +
Sbjct: 695 SAVYGVADISAAL-KQGKNVLDLDVQNITSIRRFDLYLFNEKEQISGWKTKAFAQQ-HEV 752
Query: 587 VPWSRYGSSTHQPLT--WYKTVFD-APTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
W +S Q + W+K+ F P V + L + KG WVNGQ +GRYW +
Sbjct: 753 REWKIVNNSDQQTINPRWHKSRFTWNPDNGSIVKVRLNQLSKGCFWVNGQCLGRYWN--I 810
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVS 687
PQ Y IP S LK N +V+ +EE P + I + S
Sbjct: 811 GPQED-----YKIPASLLKEQ-NEIVIFDEEGVVPDHVVIHSYS 848
>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
Length = 638
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 180/707 (25%), Positives = 290/707 (41%), Gaps = 145/707 (20%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G N T DG+ + I SG+IHY R + W + K K GL+ ++T V WNLHE
Sbjct: 15 GENFTLDGKPVQI-------LSGAIHYFRVPREYWRDRMLKLKACGLNTLETYVCWNLHE 67
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+ G+FDF+G D+ +++E GL+V R GP+I EW YGGLP WL P + R+
Sbjct: 68 PEKGKFDFTGMLDIAAYLREAANLGLWVIFRPGPYICAEWDYGGLPSWLLRDPNMQVRTT 127
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLE 196
+P+ ++R+ ++ ++K + +GGPII Q+ENEYG V+ + +
Sbjct: 128 YQPYMEAVERFFDALLPIVKPFQY--KEGGPIIAMQVENEYGSYARDDKYLTAVKQAIQK 185
Query: 197 KGPPYVRWAAK--LAVDLQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPA 253
+G + + L+ G +P V+ + N +Q G P++P
Sbjct: 186 RGIEELLLTSDGGQIERLERGCIPGVLMTAN------FNFNPKKQLGAL--KKLQPNRPQ 237
Query: 254 IWTENWTSFYQVYGDE---ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTAS 310
+ E W+ ++ +G + + E + + F S VN+YM+HGGTNFG
Sbjct: 238 MVMEFWSGWFDHWGRDHHKLHVEKFEQLLGDILRF-----PSSVNFYMFHGGTNFGFMNG 292
Query: 311 AYVLTGY------YD-QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSK 363
A + GY YD APL E G PK+ +EL + + G + S
Sbjct: 293 ANYINGYKPDVTSYDYDAPLSEAG-DPTPKYYKTRELLKTLA------MKGAVPSE---- 341
Query: 364 LQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPL----SISILPDCKTVAFNTA 419
+ E+PP S P K +AF
Sbjct: 342 ---------------------------------LPEVPPATEKSSYGPFPVEKYIAFE-- 366
Query: 420 KLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLK 479
D+++ E P ET + +L N + Y+ Y + P+ LK
Sbjct: 367 --DALKVLGE-----PIKSETVMSME-MLPINNDNGQSYGYILYRHKLSETPATDSVTLK 418
Query: 480 VSSLGHVLHAFINGEFVGSAHGKHSDKSFT-------LEKMV---------HLINGTNNV 523
F+NGE G + + + + + L+ +V ++G
Sbjct: 419 CDVRDRA-QIFVNGEESGMLNWRVGEIAMSGLKENDILDILVENQGRVNFAQTMDGVKKF 477
Query: 524 SLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYG 583
L SV G+ A L++R GL + LK + F L++ ++
Sbjct: 478 VLESV-AGVNRGDALLDQR-KGLVGEVLLNTTPLKTWEIFP-----------LELKPEFQ 524
Query: 584 SRIVP---WSRYGSSTHQPLTWYKTV-FDAPTGSDPVAINLIS-MGKGEAWVNGQSIGRY 638
+R+V W +T P + V F+ P +++ GKG A +NG ++GRY
Sbjct: 525 TRLVESPDWQEPTDATEVPFPAFHLVNFNIPEEPKDTFLDMKKGWGKGVAILNGFNLGRY 584
Query: 639 WVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDT 685
W + PQ T ++P FLK N L+L E+ + + DT
Sbjct: 585 W--HIGPQET-----LYVPAPFLKKGDNQLLLFEQHIPFKEVVFTDT 624
>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
Length = 586
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 110/316 (34%), Positives = 160/316 (50%), Gaps = 33/316 (10%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G +NG + SG++HY R P++W + K K GL+ V+T V WNLHEP GQF
Sbjct: 12 GDQFHLNGQPFRVLSGALHYFRVLPELWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFR 71
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
+ G DL FI+ ++ GLYV +R GPFI EW +GGLP WL P + R +P+
Sbjct: 72 YEGGLDLAAFIRLAESLGLYVIVRPGPFICAEWEFGGLPAWLLADPYMEVRCCYQPYLEA 131
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
++R+ ++ + ++ +GGPI+ Q+ENEYG L Y+ W +L +D
Sbjct: 132 VRRFYDDLLPRLLPLQI--QRGGPILAMQVENEYGSYGSDQL-----YLTWLRRLMLD-- 182
Query: 214 TGVPWVMCKQDDAPDPVI----------NACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
GV ++ D A D ++ +A G + E FA PD P + E W
Sbjct: 183 GGVETLLFTSDGATDHMLKHGTLAQVWKSANFGSRAEEEFAKLREYQPDGPLMCMEFWNG 242
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--RTASAYVLTGYYD 319
++ +G+ R A D A + +A G++VN YM+HGGTNFG A+ +LT Y
Sbjct: 243 WFDHWGEPHHTRDAADAADALERIMA--CGAHVNVYMFHGGTNFGFMNGANTDLLTRDYQ 300
Query: 320 --------QAPLDEYG 327
APLDE G
Sbjct: 301 PTVNSYDYDAPLDETG 316
>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
Length = 621
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 166/679 (24%), Positives = 279/679 (41%), Gaps = 157/679 (23%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
++NG ++SG IHYPR W + K GL+ V T VFWN HE PG+++FSG
Sbjct: 38 FLLNGKPFTIYSGEIHYPRVPSAYWKHRLEMMKAMGLNTVTTYVFWNYHEEAPGKWNFSG 97
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
+DL +FIK Q GLYV +R GP++ EW +GG P+WL + R DN+ F +
Sbjct: 98 EKDLQKFIKTAQETGLYVIIRPGPYVCAEWEFGGYPWWLQKNKELEIRRDNKAFSEECWK 157
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVRWAA 206
Y + + + ++ + GGP+I+ Q ENE+G + EH + +
Sbjct: 158 YISQLAKQITPMQI--TNGGPVIMVQAENEFGSYVAQRKDIPLEEHRKYSHKIKEMLLKS 215
Query: 207 KLAVDLQT--------------GVPWVMCKQD-DAPDPVINACNGRQ----CGETFAGPN 247
++V L T +P + D D IN NG + E + G
Sbjct: 216 GISVPLFTSDGSSLFKGGSVEGALPTANGESDIDVLKKSINEYNGGKGPYMIAEYYPG-- 273
Query: 248 SPDKPAIWTENWTS-FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++W F +V S E++ L+I G NYYM HGGTNFG
Sbjct: 274 -------WLDHWAEPFVKV--------STEEVVKQTNLYIE--NGVSFNYYMIHGGTNFG 316
Query: 307 RTASAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLV 357
T+ A LT Y AP+ E G PK+ L+++
Sbjct: 317 FTSGANYDKDHDIQPDLTSYDYDAPISEAGWA-TPKYNALRKI----------------- 358
Query: 358 SMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELP-PLSISILPDCKTVAF 416
F K+ + N + ++P P+ + +P+ +
Sbjct: 359 ---FQKIHK----------------------------NKLPDVPKPIKVITIPEIEFSKV 387
Query: 417 NTAKLDSVEQWEEYKEAIP-TYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSE 475
++ LD ++ + K +P T+++ ++ ++L +R K D +D +
Sbjct: 388 SSL-LDLTDRMKPVKSDMPLTFEDLNIGNGYIL----------------YRKKFD-TDQK 429
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
+L+V L + +ING++ G + + +E I + + +L +G +
Sbjct: 430 GLLEVKGLRDYANVYINGKWKGELNRVNKKYDLDIE-----IKSGDRLEILVENMGRINY 484
Query: 536 GAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGS 594
GA + + G+ + V I G + S +W +L F + +
Sbjct: 485 GAEIVHNLKGIISPVKINGTE-----VSGNW----EMLPLPFDTFPKHH-----FKNKNI 530
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWY 654
H P+ TG +++ + GKG ++NG++ GRYW S + PQ T
Sbjct: 531 EDHSPVIQEAEFTLNETGD--TFLDMRNFGKGIVFINGRNAGRYW-STVGPQQT-----L 582
Query: 655 HIPRSFLKPTGNLLVLLEE 673
+IP +LK N + + E+
Sbjct: 583 YIPGVWLKKGRNKIQIFEQ 601
>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 584
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 157/318 (49%), Gaps = 24/318 (7%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T G+ L++N + +G+IHY R P+ W + K K G + V+T V WN HEP+
Sbjct: 4 LTIQGKQLMLNDRPFRIIAGAIHYFRVVPEYWRDRLLKLKACGFNTVETYVPWNFHEPEE 63
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F F G DL +FI GLY +R P+I EW +GGLP WL PG+ R +P
Sbjct: 64 GRFVFEGMADLEKFIALAGELGLYAIVRPSPYICAEWEFGGLPAWLLKDPGMRLRCSYKP 123
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWA-A 206
F Y ++ + +++GGP+I QIENEYG + ++L Y++ A
Sbjct: 124 FLDKADAYYDELIPRLTP--FLSTKGGPLIAMQIENEYGSYGNDKTYLN----YLKEALV 177
Query: 207 KLAVDL---QTGVPWVMCKQDDAPDPVINACN-GRQCGETFAGPNS--PDKPAIWTENWT 260
K VD+ + P Q + V N G + E FA PD+P + E W
Sbjct: 178 KRGVDVLLFTSDGPEDFMLQGGMVEGVWETVNFGSRSAEAFAKLQEYQPDQPLMCMEFWN 237
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------V 313
++ +G+ R A D+A + +A G+ VN+YM+HGGTNFG + A
Sbjct: 238 GWFDHWGETHHTRGAADVALVLDEMLA--AGASVNFYMFHGGTNFGFFSGANYTDRLLPT 295
Query: 314 LTGYYDQAPLDEYGLLRQ 331
+T Y +PL E G L +
Sbjct: 296 VTSYDYDSPLSESGELTE 313
>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
Length = 584
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 155/324 (47%), Gaps = 26/324 (8%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+N + SGSIHY R P W + K + G + V+T V WN+HEPQ G+FDFS
Sbjct: 12 LNDQPMKIISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNL 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL RFI+ Q GLYV LR P+I EW +GGLP+WL P + R D PF + RY
Sbjct: 72 DLRRFIQLAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYF 131
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
T + + + + L +Q GPI++ Q+ENEYG + S+L K +R +
Sbjct: 132 TQLFS--QVSDLQITQEGPILMMQVENEYGSYGNDKSYLRKSAELMRHNGIDVSLFTSDG 189
Query: 217 PWVMCKQD----DAPDPVINACNGRQCGETFAGPNS---PDKPAIWTENWTSFYQVYGDE 269
PW+ ++ D P IN G E F +P + E W ++ +GD+
Sbjct: 190 PWLDMLENGSIKDIALPTINC--GSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDD 247
Query: 270 A-RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLD---- 324
S D A + ++ VN YM+HGGTNFG A YY++ D
Sbjct: 248 KHHTTSVTDAANELR---DCLEAGSVNIYMFHGGTNFGFMNGA----NYYEKLSPDVTSY 300
Query: 325 EYGLLRQPKWGHLKELHSAVKLCL 348
+Y L +WG + + A + +
Sbjct: 301 DYDALLS-EWGDVTPKYEAFQQVI 323
>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
Length = 618
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 109/369 (29%), Positives = 169/369 (45%), Gaps = 33/369 (8%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
GN DG +++G ++SG +HYPR + W + K GL+ V T VFWN HE
Sbjct: 25 GNFEIKDGH-FLLSGKPFTIYSGEMHYPRVPSEYWKHRLQMMKSMGLNTVTTYVFWNYHE 83
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
+PG+++FSG +DL +FIK Q GLYV +R GP++ EW +GG P+WL + R+D
Sbjct: 84 EEPGKWNFSGEKDLKKFIKTAQEAGLYVIIRPGPYVCAEWEFGGYPWWLQKDKNLEIRTD 143
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYV 202
N+ F + Y + + L + GGP+I+ Q ENE+G + LE+ Y
Sbjct: 144 NKAFLKQCENYINELAKQI--IPLQINNGGPVIMVQAENEFGSYVAQRKDISLEQHKKYS 201
Query: 203 RWAAKLAVDLQTGVPWVMCK-----QDDAPDPVINACNGRQCGETFAGP----NSPDKPA 253
V VP+ ++ + + + NG + N+ P
Sbjct: 202 HKIKDFLVKSGITVPFFTSDGSWLFKEGSIEGALPTANGEGDVDNLRKKINEFNNGKGPY 261
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV 313
+ E + + + + S ED+ L+I G NYYM HGGTNFG T+ A
Sbjct: 262 MVAEYYPGWLDHWAEPFVKVSTEDVVKQTELYIK--NGISFNYYMIHGGTNFGFTSGANY 319
Query: 314 ---------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKL-----CLKPMLSGVLVSM 359
LT Y AP++E G + PK+ L+++ + KPM + +
Sbjct: 320 DKNHDIQPDLTSYDYDAPINEAGWV-TPKFNALRDIFQKINRQRLPEVPKPMKVITIPEI 378
Query: 360 NFSKLQEAF 368
F+K+ F
Sbjct: 379 KFTKINSLF 387
>gi|297835700|ref|XP_002885732.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297331572|gb|EFH61991.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 336
Score = 156 bits (394), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 97/256 (37%), Positives = 134/256 (52%), Gaps = 52/256 (20%)
Query: 430 YKEAIPT-YDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD------SESVLKVSS 482
+ E IP+ D SL L E TKD +DY WY K + D +++L+V+
Sbjct: 2 FSEDIPSILDGDSL---ILGELYYLTKDKTDYAWYTTSIKIEDDDIPDQKGQKTILRVAG 58
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERR 542
LGH L ++NGE+ +AHG H + DSG+Y+E
Sbjct: 59 LGHALIVYVNGEYASNAHGSHE---------------------------MKDSGSYMEHT 91
Query: 543 VAGLRNVSIQGAKE-LKDF-SSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
AG R VSI G K +D + WG+ V + + GS+ V W +YG H+PL
Sbjct: 92 YAGPRGVSIIGLKSGTRDLIENNEWGHLV---------YIEEGSKKVKWEKYGE--HKPL 140
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
TWYKT F+ P G + VAI + MGKG WV+G +GRYW+SF++P G P Q+ YHIPRSF
Sbjct: 141 TWYKTYFETPEGENAVAIRMKGMGKGLIWVHGIGVGRYWMSFVSPLGEPIQTEYHIPRSF 200
Query: 661 LK--PTGNLLVLLEEE 674
+K ++ V+LEEE
Sbjct: 201 MKEEKKKSMFVILEEE 216
>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
30_1]
Length = 584
Score = 155 bits (392), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 108/319 (33%), Positives = 154/319 (48%), Gaps = 25/319 (7%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+N + SGSIHY R P W + K + G + V+T V WN+HEPQ G+FDFS
Sbjct: 12 LNDQPMKIISGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNL 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL RFI+ Q GLYV LR P+I EW +GGLP+WL P + R D PF + RY
Sbjct: 72 DLRRFIQLAQEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYF 131
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
T + + + + L +Q GPI++ Q+ENEYG + S+L K +R +
Sbjct: 132 TQLFS--QVSDLQITQEGPILMMQVENEYGSYGNDKSYLRKSAELMRHNGIDVPLFTSDG 189
Query: 217 PWVMCKQD----DAPDPVINACNGRQCGETFAGPNS---PDKPAIWTENWTSFYQVYGDE 269
PW+ ++ D P IN G E F +P + E W ++ +GD+
Sbjct: 190 PWLDMLENGSIKDIALPTINC--GSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDD 247
Query: 270 A-RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-------LTGYYDQA 321
S D A + ++ VN YM+HGGTNFG A +T Y A
Sbjct: 248 KHHTTSVTDAANELR---DCLEAGSVNIYMFHGGTNFGFMNGANYYEKLLPDVTSYDYDA 304
Query: 322 PLDEYGLLRQPKWGHLKEL 340
L E+G + PK+ +++
Sbjct: 305 LLSEWGDV-TPKYEAFQQV 322
>gi|420262409|ref|ZP_14765050.1| beta-galactosidase [Enterococcus sp. C1]
gi|394770166|gb|EJF49970.1| beta-galactosidase [Enterococcus sp. C1]
Length = 585
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 167/639 (26%), Positives = 268/639 (41%), Gaps = 101/639 (15%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K + G + V+T V WNLHE Q G + F G DL RFI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGILDLRRFIQ 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
Q GLYV LR P+I EW +GGLP+WL P + R D PF + RY + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
+ ++ +QGGPI++ Q+ENEYG + +L K +R + + PW +
Sbjct: 139 RDLQI--TQGGPILMMQVENEYGSYANDKEYLRKMVAAMRQQGVETPLVTSDGPWHDMLE 196
Query: 224 D----DAPDPVIN-ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE-ARIRSAED 277
+ D P IN N ++ E + +P + E W ++ +GD+ S D
Sbjct: 197 NGSIKDLALPTINCGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDHHHTTSTAD 256
Query: 278 IAYHVALFIAKMKGSYVNYYMYHGGTNFG-RTASAYVLTGYYDQAPLDEYGLLRQPKWGH 336
+ +A +GS VN YM+HGGTNFG S Y D D LL + WG
Sbjct: 257 AVKELQDCLA--EGS-VNIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTE--WGE 311
Query: 337 LKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNL 396
+ A K + +++++ E
Sbjct: 312 PTAKYQAFKKVIA----------DYAEIPEF----------------------------- 332
Query: 397 MYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKD 456
PLS+ + + A+ T SV++ I T + +R N+ L M
Sbjct: 333 -----PLSMKL----ERKAYGTF---SVKERVSLFSTIDTISQPIIR-NYPL-SMEACNQ 378
Query: 457 ASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHL 516
A+ Y++Y R P+ + ++ + H FIN E + + + +S++ + L
Sbjct: 379 ATGYIYY--RSLIGPARKIADYRLINTMDRAHTFINQELLRIDYDREIGQSYSFD----L 432
Query: 517 INGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEK 575
N + +L +G + + + G+++ V I GA F +++ L
Sbjct: 433 SESENELGILVENMGRVNYSVKMNHQHKGIKDGVIINGA--------FQSNWEIYPLPMD 484
Query: 576 LQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
D+ + W + S + ++ VFD + I L GKG VNG I
Sbjct: 485 NLHAIDFQGK---WQKGQPS----FSRFECVFDECADT---FIELPGWGKGFVQVNGHMI 534
Query: 636 GRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
GR+W + P Q Y +P FLK N +++ E +
Sbjct: 535 GRFW------EKGPQQRLY-VPAPFLKTGMNEIIVFESD 566
>gi|325569852|ref|ZP_08145846.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325156975|gb|EGC69143.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 585
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 167/639 (26%), Positives = 268/639 (41%), Gaps = 101/639 (15%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K + G + V+T V WNLHE Q G + F G DL RFI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGILDLRRFIQ 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
Q GLYV LR P+I EW +GGLP+WL P + R D PF + RY + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
+ ++ +QGGPI++ Q+ENEYG + +L K +R + + PW +
Sbjct: 139 RDLQI--TQGGPILMMQVENEYGSYANDKEYLRKMVAAMRQQGVETPLVTSDGPWHDMLE 196
Query: 224 D----DAPDPVIN-ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDE-ARIRSAED 277
+ D P IN N ++ E + +P + E W ++ +GD+ S D
Sbjct: 197 NGTIKDLALPTINCGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDHHHTTSTAD 256
Query: 278 IAYHVALFIAKMKGSYVNYYMYHGGTNFG-RTASAYVLTGYYDQAPLDEYGLLRQPKWGH 336
+ +A +GS VN YM+HGGTNFG S Y D D LL + WG
Sbjct: 257 AVKELQDCLA--EGS-VNIYMFHGGTNFGFMNGSNYYERLAPDVTSYDYDALLTE--WGE 311
Query: 337 LKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNL 396
+ A K + +++++ E
Sbjct: 312 PTAKYQAFKKVIA----------DYAEIPEF----------------------------- 332
Query: 397 MYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKD 456
PLS+ + + A+ T SV++ I T + +R N+ L M
Sbjct: 333 -----PLSMKL----ERKAYGTF---SVKERVSLFSTIDTISQPIIR-NYPL-SMEACNQ 378
Query: 457 ASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHL 516
A+ Y++Y R P+ + ++ + H FIN E + + + +S++ + L
Sbjct: 379 ATGYIYY--RSLIGPARKIADYRLINTMDRAHTFINQELLRIDYDREIGQSYSFD----L 432
Query: 517 INGTNNVSLLSVMVGLPDSGAYLERRVAGLRN-VSIQGAKELKDFSSFSWGYQVGLLGEK 575
N + +L +G + + + G+++ V I GA F +++ L
Sbjct: 433 SESENELGILVENMGRVNYSVKMNHQHKGIKDGVIINGA--------FQSNWEIYPLPMD 484
Query: 576 LQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSI 635
D+ + W + S + ++ VFD + I L GKG VNG I
Sbjct: 485 NLHAIDFQGK---WQKGQPS----FSRFECVFDECADT---FIELPGWGKGFVQVNGHMI 534
Query: 636 GRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
GR+W + P Q Y +P FLK N +++ E +
Sbjct: 535 GRFW------EKGPQQRLY-VPAPFLKTGMNEIIVFESD 566
>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
Length = 786
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 166/348 (47%), Gaps = 21/348 (6%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
L LF LL T+ S G G ++ ++NG ++ + +HYPR W I
Sbjct: 8 LAILFALL--TVFTSFGAPKRGGIFVAGDKTFLLNGKPFVIKAAELHYPRIPRPYWEHRI 65
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
K G++ + VFWN+HE Q G+F+F+G D+ F + Q GLYV +R GP++ E
Sbjct: 66 RMCKALGMNTICLYVFWNIHEQQEGKFNFTGNNDVAAFCRLAQKHGLYVIVRPGPYVCAE 125
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W GGLP+WL I R + F +K + + N + A L +GGPII+ Q+EN
Sbjct: 126 WEMGGLPWWLLKKKDIRLRERDPYFMERVKVFEQQVGNQL--APLTIDKGGPIIMVQVEN 183
Query: 186 EYGM--VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVI---NACNGRQCG 240
EYG V+ ++ + VR + V L W + + D +I N G
Sbjct: 184 EYGSYGVDKEYVSQIRDIVRSSGFDKVAL-FQCDWASNFEKNGLDDLIWTMNFGTGANID 242
Query: 241 ETFA--GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM 298
E F G P P + +E W+ ++ +G R A+++ + + KG + YM
Sbjct: 243 EQFKRLGELRPQSPKMCSEFWSGWFDKWGARHETRPAKNMVAGIDEMLT--KGISFSLYM 300
Query: 299 YHGGTNFGRTASAYV------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
HGGT+FG A A +T Y AP++EYGL PK+ L+ +
Sbjct: 301 THGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGLA-TPKYYELRAM 347
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 50/240 (20%), Positives = 100/240 (41%), Gaps = 33/240 (13%)
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
S L ++ + F++ + +G ++K+ + I +S+L +G +
Sbjct: 420 STLTINDPHDYVQVFLDNQLIGRIDRVKNEKTLPMPA----IRKGQRLSILVEAMGRINF 475
Query: 536 GAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSS 595
G ++ NV++ G + + W ++ + L I DY + V W+ +
Sbjct: 476 GRAIKDHKGITDNVTLSGETD-----NLQWEARITDW-KMLPIPDDYAT--VRWAVDALT 527
Query: 596 THQPLTWYKTV-----------FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
+ + W KT+ F+ D +N+ + GKG+ ++NG +IGR+W +
Sbjct: 528 RMKEIVWSKTIPQDKIGYYRGYFNLKKVGD-TFLNMEAFGKGQVYINGYAIGRFWN--IG 584
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLEE--ENGYPPGISIDTVSVTTLCGHVSDSHLPP 702
PQ T ++P +LK N +++L+ G P + D + L S+ H P
Sbjct: 585 PQQT-----LYVPGCWLKKGQNEVIVLDMVGPKGNPVLFAQDKPELDKLNLEKSNKHNNP 639
>gi|357455525|ref|XP_003598043.1| Beta-galactosidase [Medicago truncatula]
gi|355487091|gb|AES68294.1| Beta-galactosidase [Medicago truncatula]
Length = 309
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 99/283 (34%), Positives = 145/283 (51%), Gaps = 20/283 (7%)
Query: 418 TAKLDSVEQWEEYKEAIPTYD----ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD 473
T L + +WE E P D + + A+ LL Q N T ASDYLWY + +
Sbjct: 19 TCSLGNTLKWEWASE--PMQDTLLGKGTFTASKLLNQKNVTAGASDYLWYMTEVVVNDTK 76
Query: 474 --SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVG 531
++ L V + G +L+++ING + G G S F E+ V L G N +SLLSV +G
Sbjct: 77 IWGKARLHVDTKGPILYSYINGFWWGVEGGSPSKPGFVYEEDVSLKQGANIISLLSVTLG 136
Query: 532 LPDSGAYLERRVAGL-----RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
+ Y++ + G+ + +S + + D S +W Y+VG+ G + + + +
Sbjct: 137 KSNCSGYIDMKETGIVGGPAKLISTEYPNNVLDLSKSTWSYKVGMNGVARKFYDPKSTNV 196
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
VPW S P+TWYKT F P GS+ V ++LI + +G+AWVNGQSIGRYW+
Sbjct: 197 VPWQTRNVSIEGPMTWYKTTFKTPEGSNLVVLDLIGLQRGKAWVNGQSIGRYWIG----- 251
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEE--ENGYPPGISIDTVS 687
S +Y +PR FL N LVL EE P +S+D VS
Sbjct: 252 ENSSFRFYAVPRPFLNKDVNTLVLFEELGLGEGPFNVSVDIVS 294
>gi|351700626|gb|EHB03545.1| Beta-galactosidase-1-like protein 2 [Heterocephalus glaber]
Length = 654
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 170/660 (25%), Positives = 267/660 (40%), Gaps = 92/660 (13%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP++ E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 LAAEVGLWVILRPGPYVCAEIDLGGLPSWLLQDPGMKLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L GGPII Q+ENEYG + P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKHGGPIIAVQVENEYGSY-----NRDPAYMPYVKKALED--RGIIELLLTSDN 233
Query: 226 APDPVINACNG------------RQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
+G Q TF ++P + E WT ++ +G I
Sbjct: 234 KDGLQKGVVHGVLATINLQSQQELQLLTTFLLSVQGNQPKMVMEYWTGWFDSWGSPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPK 333
+ ++ V+ + GS +N YM+HGGTNFG A Y ++ + YG +
Sbjct: 294 DSSEVLETVSAIVN--AGSSINLYMFHGGTNFGFINGAMHFNEY--KSDVTSYG---KQF 346
Query: 334 WGH--LKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATV 391
WG L++LH + + + + KL++ F GS A
Sbjct: 347 WGQGRLRQLHGCLADYDAVLTEAGDYTAKYGKLRDFF---GSRSGAP-------LPPPPD 396
Query: 392 YFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQM 451
+ YE P++ S + W+ K Y E +++ +
Sbjct: 397 LLPKMAYE--PIAPSFY---------------LSLWDALK-----YMEKPIKSEKPINME 434
Query: 452 NTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVL---HAFINGEFVGSAHGKHSDKSF 508
N + + + + S VL GHV F+N +G K
Sbjct: 435 NLPVNDGNGQAFGYTLYETTIASSGVLH----GHVRDQGQVFVNTVSIGFLDYK------ 484
Query: 509 TLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQ 568
T + ++ LI G + +L G + G ++ + GL LK+F +S
Sbjct: 485 TTKIVIPLIQGYTVLRILVENRGRVNYGNNIDDQRKGLIGNLYLNNSPLKNFRIYS---- 540
Query: 569 VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
L K F +G+ WS + P + + P+ SD + L KG
Sbjct: 541 ---LDMKKSFFQRFGTD--KWSTLPEAPTFPAFFLGVLSVVPSPSDTF-LKLEGWEKGVV 594
Query: 629 WVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
++NGQ++GRYW + PQ T ++P ++L P N +++ EE P S DT +
Sbjct: 595 FINGQNLGRYWN--IGPQET-----LYLPGAWLNPGDNQVIIFEEAMAGPMVQSTDTAHL 647
>gi|1669595|dbj|BAA13685.1| AR782 [Arabidopsis thaliana]
Length = 206
Score = 153 bits (386), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 121/208 (58%), Gaps = 30/208 (14%)
Query: 624 GKGEAWVNGQSIGRYWVS----------------------FLTPQGTPSQSWYHIPRSFL 661
GKG AWVNGQSIGRYW + L G PSQ+ YH+PRS+L
Sbjct: 5 GKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPRSWL 64
Query: 662 KPTGNLLVLLEEENGYPPGISIDTVSV-TTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
KP+GN+LVL EE G P IS T + LC VS SH PPV +W S ++ + + R
Sbjct: 65 KPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTR- 123
Query: 721 PGRRPKVQIRCP-SGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSC 779
P + ++CP S + I I FAS+G P G C ++ G C+SS S ++V+KAC+G RSC
Sbjct: 124 ----PVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSC 179
Query: 780 TVPVWTEKFYGDPCPGIPKALLVDAQCT 807
V V T + +G+PC G+ K+L V+A C+
Sbjct: 180 NVEVST-RVFGEPCRGVVKSLAVEASCS 206
>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
Length = 595
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 157/338 (46%), Gaps = 34/338 (10%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ ++Y +L+ NG L +GS+HY R P W + + GL+ V T V WN HE
Sbjct: 4 STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
G F G RDL RFI+ Q +GL V +R GP+I EW GGLP WL PG+ R+ +
Sbjct: 64 TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
P+ + R+ +V + A L A +GGP++ QIENEYG YVR
Sbjct: 124 GPYLEAVDRWFDALVP--RIAELQAGRGGPVVAVQIENEYGSYGDDRA-----YVRHIRD 176
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVIN---ACNGRQCGETFAG----------PNSPDKPAI 254
V G+ ++ D P P++ A G TF P +P
Sbjct: 177 ALV--ARGITELLYTA-DGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFF 233
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-- 312
E W ++ +GD+ +R A A + + +G V+ YM HGGTNFG A A
Sbjct: 234 CAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILD--EGGSVSLYMAHGGTNFGLWAGANHE 291
Query: 313 ------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
+T Y AP+ E G L PK+ L++ +A+
Sbjct: 292 GGTIRPTVTSYDSDAPIAENGAL-TPKFFALRDRLTAL 328
>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
Length = 595
Score = 152 bits (384), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 110/338 (32%), Positives = 157/338 (46%), Gaps = 34/338 (10%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ ++Y +L+ NG L +GS+HY R P W + + GL+ V T V WN HE
Sbjct: 4 STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
G F G RDL RFI+ Q +GL V +R GP+I EW GGLP WL PG+ R+ +
Sbjct: 64 TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
P+ + R+ +V + A L A +GGP++ QIENEYG YVR
Sbjct: 124 GPYLEAVDRWFDALVP--RIAELQAGRGGPVVAVQIENEYGSYGDDRA-----YVRHIRD 176
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVIN---ACNGRQCGETFAG----------PNSPDKPAI 254
V G+ ++ D P P++ A G TF P +P
Sbjct: 177 ALV--ARGITELLYTA-DGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFF 233
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-- 312
E W ++ +GD+ +R A A + + +G V+ YM HGGTNFG A A
Sbjct: 234 CAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILD--EGGSVSLYMAHGGTNFGLWAGANHE 291
Query: 313 ------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
+T Y AP+ E G L PK+ L++ +A+
Sbjct: 292 GGTIRPTVTSYDSDAPIAENGAL-TPKFFALRDRLTAL 328
Score = 40.0 bits (92), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 23/58 (39%), Positives = 30/58 (51%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+ L GKG WVN +GRYW + PQ T ++P L+P GN L +LE E
Sbjct: 521 VALPGFGKGFLWVNDTLLGRYWE--IGPQST-----LYLPGPLLRPGGNTLTVLELER 571
>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
Length = 591
Score = 152 bits (383), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 105/312 (33%), Positives = 149/312 (47%), Gaps = 23/312 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ TYDG L L+SG+IHY R P+ W + K K G + V+T V WNLHEP
Sbjct: 9 DRFTYDGEELR-------LYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEP 61
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
Q G+F F G DL RFI+ GL+V +R P+I EW +GGLP WL PG+ R +
Sbjct: 62 QEGRFVFEGMADLERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCAD 121
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEK-GPPYVRW 204
+ + Y ++ + L + GGP+IL Q+ENEYG + ++LE VR
Sbjct: 122 PLYLSKVDAYYDELIP--RLVPLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRR 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSF 262
+ + G M + P + G + E+FA P P + E W +
Sbjct: 180 GIDVPLFTSDGPTDAMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGW 239
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLT 315
+ + +E R A D A + G+ VN+YM+HGGTNFG A +T
Sbjct: 240 FDHWMEEHHQRDAADAARVFGEMLE--AGASVNFYMFHGGTNFGFYNGANHIKTYEPTIT 297
Query: 316 GYYDQAPLDEYG 327
Y +PL E+G
Sbjct: 298 SYDYDSPLTEWG 309
>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
Length = 779
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 171/354 (48%), Gaps = 32/354 (9%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
LL L L++ +G S G + ++NG ++ + IHYPR + W I
Sbjct: 5 LLYLLILVVAVLGSSCSQSSEGT-FEVGKNTFLLNGEPFVVKAAEIHYPRIPKEYWEHRI 63
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
K G++ + VFWN HEP+ G++DF+G++D+ F + Q G+YV +R GP++ E
Sbjct: 64 KMCKALGMNTICLYVFWNFHEPEEGRYDFAGQKDIAAFCRLAQENGMYVIVRPGPYVCAE 123
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIILSQIE 184
W GGLP+WL I R + ++M+R + + K A L S+GG II+ Q+E
Sbjct: 124 WEMGGLPWWLLKKKDIKLREQD---PYYMERVKLFLNEVGKQLADLQISKGGNIIMVQVE 180
Query: 185 NEYGM--VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK-----QDDAPDPV---INAC 234
NEYG ++ ++ + V+ A TGVP C +++A D + IN
Sbjct: 181 NEYGAFGIDKPYISEIRDMVKQAGF------TGVPLFQCDWNSNFENNALDDLLWTINFG 234
Query: 235 NGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGS 292
G E F PD P + +E W+ ++ +G + RSAE++ + + +
Sbjct: 235 TGANIDEQFKRLKELRPDTPLMCSEFWSGWFDHWGAKHETRSAEELVKGMKEMLD--RNI 292
Query: 293 YVNYYMYHGGTNFGRTASAY------VLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+ YM HGGT+FG A T Y AP++E G + PK+ ++ L
Sbjct: 293 SFSLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAPINESGKV-TPKYLEVRNL 345
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 55/100 (55%), Gaps = 13/100 (13%)
Query: 577 QIFT---DYG-SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNG 632
Q++T DY +R + + ++ +QP +Y++ F+ D +N+++ KG WVNG
Sbjct: 502 QVYTIPVDYSFARDKQYKQQENAENQP-AYYRSTFNLNELGDTF-LNMMNWSKGMVWVNG 559
Query: 633 QSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
+IGRYW + P Q+ Y +P +LK N +++L+
Sbjct: 560 HAIGRYW------EIGPQQTLY-VPGCWLKKGENEIIILD 592
>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
6_1_58FAA_CT1]
Length = 823
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 164/671 (24%), Positives = 263/671 (39%), Gaps = 139/671 (20%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG I+ + +HYPR W + I K G++ + VFWNLHEP+PG+FDF+
Sbjct: 74 TFLLNGKPFIIRAAELHYPRIPKPYWEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFT 133
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ DL F + Q +YV LR GP++ EW GGLP+WL I R + F +
Sbjct: 134 GQNDLAAFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREADPYFIERVN 193
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG--------------MVEHSFLEKGPPY 201
+ + + L GGPII+ Q+ENEYG +V +F +
Sbjct: 194 IFEQEVAR--QVGGLTIQNGGPIIMVQVENEYGSYGESKEYVSLIRDIVRTNFGDVTLFQ 251
Query: 202 VRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENW 259
WA+ + + W IN G + FAG PD P + +E W
Sbjct: 252 CDWASNFTKNALPDLLW-----------TINFGTGANIDQQFAGLKKLRPDSPLMCSEFW 300
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A D+ + ++ KG + YM HGGTN+G A A
Sbjct: 301 SGWFDKWGANHETRPASDMIAGIDEMLS--KGISFSLYMTHGGTNWGHWAGANSPGFAPD 358
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS 373
+T Y AP+ E G W K L + + + ++ S++ AF F
Sbjct: 359 VTSYDYDAPISESGQTTPKYWALRKTLGKYMNGEKQTKVPDMIKSVSIP----AFQF--- 411
Query: 374 SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEA 433
+E A F+NL P+S K ++ EEY +
Sbjct: 412 TEVAPL-------------FANL-----PIS--------------KKDKNIRTMEEYDQG 439
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFING 493
T ++ T +A DY F+NG
Sbjct: 440 FGTILYRTILPEITSSAQLTVNEAHDY--------------------------AQIFVNG 473
Query: 494 EFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNV---- 549
+++G ++ +K TL GT + +L +G + G ++ NV
Sbjct: 474 KYIGKLDRRNGEKQLTLPACP---KGT-QLDILVEAMGRINFGRAIKDYKGITENVELSI 529
Query: 550 SIQGAK---ELKDFSSF----SWGYQVGLLGEKLQIFTD-YGSRIVPWSRYGSSTHQPLT 601
+I G +LK++ F S+ + + ++ D YG RI
Sbjct: 530 NIDGYPFICDLKNWEVFNIEDSYEFYKKMKFHPIRSLKDKYGQRIP-------------G 576
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
Y+ F D +N + GKG +VNG ++GR W + P Q+ Y +P +L
Sbjct: 577 CYRATFQVKKPGD-TFLNFETWGKGLVYVNGYALGRIW------EIGPQQTLY-VPGCWL 628
Query: 662 KPTGNLLVLLE 672
K N +++ +
Sbjct: 629 KKGENEILVFD 639
>gi|115361550|gb|ABI95864.1| beta-galactosidase [Planococcus sp. L4]
Length = 552
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 104/304 (34%), Positives = 147/304 (48%), Gaps = 12/304 (3%)
Query: 51 IHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQ 110
+HY R+ P+ W + K K GL+ V+T + WN HEP+ GQF FSG D+ FI+
Sbjct: 1 MHYFRTVPEQWEDRLQKLKALGLNTVETYIPWNFHEPKKGQFHFSGMADIEGFIELAHRL 60
Query: 111 GLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARL 170
GLYV LR P+I EW GGLP WL +V RS + F H++ Y + + K +
Sbjct: 61 GLYVILRPAPYICAEWEMGGLPSWLMKDKNLVLRSSDPAFLGHVEDYFAEL--LPKFTKH 118
Query: 171 YASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPD 228
GGP+I QIENEYG + ++L+ L L T Q PD
Sbjct: 119 LYQNGGPVIAMQIENEYGAYGNDSAYLDFFKAQYEHHG-LNTFLFTSDGPDFITQGSMPD 177
Query: 229 PVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFI 286
G + E+F ++ PD P + E W ++ + E +RS +D+A ++F
Sbjct: 178 VTTTLNFGSRVDESFQALDAFKPDSPKMVAEFWIGWFDYWSGEHTVRSGDDVA---SVFK 234
Query: 287 AKM-KGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
M K VN+YM+HGGTNFG A YY +Y L + G + E + AVK
Sbjct: 235 EIMEKNISVNFYMFHGGTNFGFMNGANHYDIYYPTITSYDYDSLLT-EGGAITEKYKAVK 293
Query: 346 LCLK 349
L+
Sbjct: 294 EVLR 297
>gi|332264040|ref|XP_003281056.1| PREDICTED: beta-galactosidase-1-like protein 3 [Nomascus
leucogenys]
Length = 655
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 114/338 (33%), Positives = 173/338 (51%), Gaps = 30/338 (8%)
Query: 25 GGGNNVTYDGR-SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G G T G+ + GH+ ++F GSIHY R + W + K K G + V T V WN
Sbjct: 67 GLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWN 126
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
LHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP WL P ++
Sbjct: 127 LHEPERGKFDFSGNMDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPQLLL 186
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+ N+ F +++Y ++ + L QGGP+I Q+ENEYG SF K Y+
Sbjct: 187 RTTNKGFIEAVEKYFDHLIP--RVIPLQYRQGGPVIAVQVENEYG----SF-NKDKTYMP 239
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAP-------DPVINACNGRQCGE-TFAGPN--SPDKPA 253
+ K L+ G+ ++ D V+ A N ++ + TF+ + DKP
Sbjct: 240 YLHKAL--LRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQNTFSQLHKVQRDKPL 297
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY- 312
+ E W ++ +GD+ ++ A+++ + V+ FI K + S+ N YM+HGGTNFG A
Sbjct: 298 LIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGGTNFGFMNGATY 355
Query: 313 ------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
++T Y A L E G + K+ L++L +V
Sbjct: 356 FGKHTGIVTSYDYDAVLTEAGDYTE-KYFKLQKLFESV 392
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 79/184 (42%), Gaps = 21/184 (11%)
Query: 499 AHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVA--GLRNVSIQGAKE 556
AH + F E M+ ++N NN L + G D YL V G N S Q E
Sbjct: 469 AHAHDMAQVFLDETMIGILN-ENNKDLHILNSGYQDC-RYLRILVENQGRVNFSWQIQNE 526
Query: 557 LKDFS-------SFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
K + S G+ V L K+ F G R W S P + T+
Sbjct: 527 QKGITGSVSINNSSLEGFTVYSLEMKMSFFE--GLRSATWKPVPDSHQGPAFYRGTLKAG 584
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLV 669
P+ D ++L++ G ++NG+++GRYW + PQ T ++P ++L P N ++
Sbjct: 585 PSPKD-TFLSLLNWNYGFVFINGRNLGRYWN--IGPQKT-----LYLPGAWLHPEDNEVI 636
Query: 670 LLEE 673
L E+
Sbjct: 637 LFEK 640
>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
Length = 789
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 157/673 (23%), Positives = 267/673 (39%), Gaps = 134/673 (19%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++N ++ + +HYPR W I K G++ + VFWN+HE + G+FDFS
Sbjct: 38 TFLLNNRPFVVKAAELHYPRIPRAYWDHRIKMCKALGMNTICLYVFWNIHEQREGEFDFS 97
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+ F + Q G+Y+ +R GP++ EW GGLP+WL I R + F ++
Sbjct: 98 GNSDVAAFCRLTQKNGMYIIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVE 157
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH---------SFLEK-------GP 199
+ + + A L GGPII+ Q+ENEYG L K GP
Sbjct: 158 IFEQKVAEQL--APLTIQNGGPIIMVQVENEYGSYGEDKKYVGQIRDVLRKYWYTNGRGP 215
Query: 200 PYVR--WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA--GPNSPDKPAIW 255
+ WA+ + + W M N G F G PD P +
Sbjct: 216 ALFQCDWASNFEKNGLEDLIWTM-----------NFGTGANIDAQFMRLGELRPDAPKMC 264
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E W+ ++ +G R A+D+ + ++ KG + YM HGGT+FG A A
Sbjct: 265 SEFWSGWFDKWGARHETRPAKDMVAGIDEMLS--KGISFSLYMTHGGTSFGHWAGANSPG 322
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI 369
+T Y AP++EYG + PK+ L+ K+ E +
Sbjct: 323 FAPDVTSYDYDAPINEYGQV-TPKFWELR------------------------KMMEKY- 356
Query: 370 FQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEE 429
N KR A + ++++ + +A K V+ +EE
Sbjct: 357 ------------NDGKRMPAVPKAPMPLVSFSKVTLTQAKTMRQLATRQVKSRDVKTFEE 404
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHA 489
+ T+ + T DA DY
Sbjct: 405 MDMGWGSAFYTTTLPEISQPSLLTLNDAHDY--------------------------AQI 438
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNV 549
FIN E++G ++K+ M+ + + +++L +G + G ++ RNV
Sbjct: 439 FINSEYIGKIDRVRNEKTL----MLPAVKVGSQLTILVEAMGRINFGRAIKDFKGITRNV 494
Query: 550 SIQGAK-------ELKDFSSFSWGYQVGLLGEKLQIFTD---YGSRIVPWSRYGSSTHQP 599
+I +LKD++ + + +L++ + + + + SRY +
Sbjct: 495 TISTQSGGHELTYDLKDWTIDLVPDEADTILSRLKLPHNDIAFATDVKNGSRYPGA---- 550
Query: 600 LTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRS 659
Y F+ D IN+ + GKG+ +VNG ++GR+W + PQ T + P +
Sbjct: 551 ---YVGTFNLRKVGDTF-INMENFGKGQVYVNGHALGRFWR--IGPQQT-----LYCPGA 599
Query: 660 FLKPTGNLLVLLE 672
+LK N +V+L+
Sbjct: 600 WLKKGKNEIVVLD 612
>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
Length = 612
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 156/323 (48%), Gaps = 23/323 (7%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
+GR ++G + SG++HY R PQ W I K K GL+ V+T V WNLHE G F
Sbjct: 45 NGRHFTMDGKPFTILSGAMHYFRIPPQYWEDRIVKLKAMGLNTVETYVSWNLHEEIQGDF 104
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF-K 151
+F D+V FIK Q LYV +R GP+I EW GGLP WL P I RS + F K
Sbjct: 105 NFKDGLDIVEFIKTAQKHDLYVIMRPGPYICAEWDLGGLPSWLLHNPNIYLRSLDPIFMK 164
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHS--FLEK-GPPYVRWAAKL 208
++ + +I ++ S GGPII QIENEY ++S ++ K V K
Sbjct: 165 ATLRFFDELIPRLIDYQ---YSNGGPIIAWQIENEYLSYDNSSAYMRKLQQEMVIRGVKE 221
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET--FAGPN--SPDKPAIWTENWTSFYQ 264
+ G+ W M + P + Q ET G P+ P + TE W+ ++
Sbjct: 222 LLFTSDGI-WQMQIEKKYSLPGVLKTVNFQRNETNILKGLRKLQPNMPLMVTEFWSGWFD 280
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD----- 319
+G++ + + E A I KM+ S +NYYM HGGTNFG A G Y
Sbjct: 281 HWGEDKHVLTVEKAAERTK-NILKMESS-INYYMLHGGTNFGFMNGANAENGKYKPTITS 338
Query: 320 ---QAPLDEYGLLRQPKWGHLKE 339
AP+ E G + PK+ L+E
Sbjct: 339 YDYDAPISESGDI-TPKYRELRE 360
>gi|297727459|ref|NP_001176093.1| Os10g0340600 [Oryza sativa Japonica Group]
gi|255679317|dbj|BAH94821.1| Os10g0340600 [Oryza sativa Japonica Group]
Length = 143
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 67/108 (62%), Positives = 84/108 (77%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YDGRSLI++G R+I+ SGSIHYPRSTP+MWP LI KAKEGGL+ ++T VFWN HEP+
Sbjct: 31 VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHD 137
+F+F G D+VRF KE+Q G+Y LRIGP+I GEW YG +P D
Sbjct: 91 REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGYMPMLYLD 138
>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
Length = 591
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/314 (32%), Positives = 151/314 (48%), Gaps = 34/314 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
++G L SG+IHY R P+ W + K K G + V+T + WNLHEP+PGQF F
Sbjct: 10 QFCLDGESIRLVSGAIHYFRVVPEYWRDRLLKLKACGFNTVETYIPWNLHEPKPGQFRFD 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+VRF++ GL+V +R P+I EW +GGLP WL PG+ R + P+ +
Sbjct: 70 GLADVVRFVEIAGEVGLHVIVRPSPYICAEWEFGGLPAWLLADPGMRVRCMHRPYLDRVD 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVRWA 205
Y V + L + GGPII QIENEYG ++ + L++G
Sbjct: 130 AYYD--VLLPLLKPLLCTNGGPIIAMQIENEYGSYGNDRAYLVYLKDAMLQRG------- 180
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA--GPNSPDKPAIWTENWTSFY 263
+ + G M + P + G + E F PD P + E W ++
Sbjct: 181 MDVLLFTSDGPEHFMLQGGMIPGVLETVNFGSRAEEAFEMLRKYQPDGPIMCMEYWNGWF 240
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMK-GSYVNYYMYHGGTNFGRTASAY---------V 313
+G++ R A+D+A +F ++ G+ VN+YM+HGGTNFG + A
Sbjct: 241 DHWGEQHHTRDAKDVA---DVFDDMLRLGASVNFYMFHGGTNFGYMSGANCPQRDHYEPT 297
Query: 314 LTGYYDQAPLDEYG 327
+T Y PL+E G
Sbjct: 298 ITSYDYDVPLNESG 311
>gi|397498227|ref|XP_003819886.1| PREDICTED: beta-galactosidase-1-like protein 3 [Pan paniscus]
Length = 653
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 116/349 (33%), Positives = 176/349 (50%), Gaps = 30/349 (8%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGR-SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGG 72
LT + + G G T G+ + GH+ ++F GSIHY R + W + K K G
Sbjct: 56 LTPLELKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACG 115
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
+ V T V WNLHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP
Sbjct: 116 FNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLP 175
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WL P ++ R+ N+ F +++Y ++ + L QGGP+I Q+ENEYG
Sbjct: 176 SWLLQDPRLLLRTTNKSFIEAVEKYFDHLIP--RVIPLQYRQGGPVIAVQVENEYG---- 229
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAP-------DPVINACNGRQC-GETFA 244
SF K Y+ + K L+ G+ ++ D V+ A N ++ +TF
Sbjct: 230 SF-NKDKTYMPYLHKAL--LRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQDTFN 286
Query: 245 GPN--SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
+ DKP + E W ++ +GD+ ++ A+++ + V+ FI K + S+ N YM+HGG
Sbjct: 287 QLHKIQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGG 344
Query: 303 TNFGRTASAY-------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
TNFG A ++T Y A L E G + K+ L++L +V
Sbjct: 345 TNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSV 392
>gi|332838248|ref|XP_001156615.2| PREDICTED: galactosidase, beta 1-like 3 [Pan troglodytes]
Length = 653
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 116/349 (33%), Positives = 176/349 (50%), Gaps = 30/349 (8%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGR-SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGG 72
LT + + G G T G+ + GH+ ++F GSIHY R + W + K K G
Sbjct: 56 LTPLELKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACG 115
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
+ V T V WNLHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP
Sbjct: 116 FNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLP 175
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WL P ++ R+ N+ F +++Y ++ + L QGGP+I Q+ENEYG
Sbjct: 176 SWLLQDPRLLLRTTNKSFIEAVEKYFDHLIP--RVIPLQYRQGGPVIAVQVENEYG---- 229
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAP-------DPVINACNGRQC-GETFA 244
SF K Y+ + K L+ G+ ++ D V+ A N ++ +TF
Sbjct: 230 SF-NKDKTYMPYLHKAL--LRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQDTFN 286
Query: 245 GPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
+ DKP + E W ++ +GD+ ++ A+++ + V+ FI K + S+ N YM+HGG
Sbjct: 287 QLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGG 344
Query: 303 TNFGRTASAY-------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
TNFG A ++T Y A L E G + K+ L++L +V
Sbjct: 345 TNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSV 392
>gi|255550369|ref|XP_002516235.1| beta-galactosidase, putative [Ricinus communis]
gi|223544721|gb|EEF46237.1| beta-galactosidase, putative [Ricinus communis]
Length = 451
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 146/515 (28%), Positives = 214/515 (41%), Gaps = 114/515 (22%)
Query: 284 LFIAKMKGSYVNYY---------MYHGGTNFGRTASAYVLTGYYD-QAPLDEYGLLRQPK 333
+ A+++ Y N++ MYHGGTNF R + ++ YD APLDEYG L QPK
Sbjct: 15 IVFAQIENDYGNFWPNNPKSRNQMYHGGTNFRRMSGGPMIVTSYDYDAPLDEYGNLNQPK 74
Query: 334 WGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYF 393
WGHL++LH + L L G+ + ++ +I + E FL N +A +
Sbjct: 75 WGHLRDLHVRILLHLS-QSRGLGFATVYALNLTTYINNATGERFCFLSNTKTNEDANI-- 131
Query: 394 SNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNT 453
+L I +P W Y Y + NF +Q
Sbjct: 132 -----DLQQDGIFFVP----------------AWIYY------YSSRVQQGNF--QQCKA 162
Query: 454 TKDASDYLWYNFR----FKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFT 509
T D +DYL Y R F D S + + + +F G++ +
Sbjct: 163 TSDETDYLRYITRYFDFFTVSVKDVHS--RCQQCNNTEEHDLACDFFGTSPACSCQSAAR 220
Query: 510 LEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQV 569
L+++ H S+ ++ G + G + + G I GA +L SS W Y++
Sbjct: 221 LQQVFH--------SIYNLTSGKQNYGEFFDEGPEG-----IAGAADL---SSNQWAYKI 264
Query: 570 GLLGEKLQIFT-DYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEA 628
GL GE +++ + G R V + + +TWYKT F P+G+DP+ +NL MGKG A
Sbjct: 265 GLGGEAKRLYDPNSGHRDVFRTSAILPVGRAMTWYKTTFHVPSGTDPLVLNLQGMGKGHA 324
Query: 629 WVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSV 688
WVNG S+GR+W P QS PTG + Y D
Sbjct: 325 WVNGHSLGRFW---------PMQS--------ADPTG-YSGSCDYRGKY------DKDKC 360
Query: 689 TTLCGHVSDSHLPPVISWRSQNQRTLKTHKRIPGRRPKVQIRCPSGRKISKILFASYGNP 748
T CG+ P W K I P +I IS I FAS+GNP
Sbjct: 361 LTNCGN-------PTQRW-----------KHIATFMPNGRI-------ISVIQFASFGNP 395
Query: 749 NGNCENYAIGSCHSSNSRAIVEKACLGKRSCTVPV 783
G C + G ++ + VEKAC+GK SC++ V
Sbjct: 396 EGTCGSLQKGDFEAAYTAFAVEKACVGKESCSLGV 430
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 164 MMKAARLYASQGGPIILSQIENEYG 188
M K A+L+AS GGPI+ +QIEN+YG
Sbjct: 1 MAKEAKLFASSGGPIVFAQIENDYG 25
>gi|445062232|ref|ZP_21374649.1| beta-galactosidase [Brachyspira hampsonii 30599]
gi|444506390|gb|ELV06735.1| beta-galactosidase [Brachyspira hampsonii 30599]
Length = 592
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 167/671 (24%), Positives = 265/671 (39%), Gaps = 119/671 (17%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
I+NG + SG+IHY R + W + K G + V+T + WN+HE G FDF
Sbjct: 8 EEFILNGKPIKILSGAIHYFRFVREYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGFFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG +D+ FIK Q L V LR P+I EW +GGLP WL I R++ + F +
Sbjct: 68 SGNKDIASFIKTAQKLDLLVILRPTPYICAEWEFGGLPAWLLRYDNIKVRTNTQLFLSKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y + + ++ ++ GP+I+ QIENEYG + Y+R L +
Sbjct: 128 DAYYKELFKHIDDLQI--TRNGPVIMMQIENEYGSFGND-----KEYLRALKNLMIKHGA 180
Query: 215 GVPWVMCKQDDAPDPVINACNGRQCG------------------ETFAGPNSPDKPAIWT 256
VP + D A D V+ A G E F KP +
Sbjct: 181 EVP--LFTSDGAWDAVLEAGTLIDDGILATVNFGSKAKESFDDTEKFFARKGIKKPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTG 316
E W ++ ++ D R A+D V + +GS +N YM+ GGTNFG V TG
Sbjct: 239 EFWDGWFNLWKDPIIKRDADDFIMEVKEILK--RGS-INLYMFIGGTNFGFYNGTSV-TG 294
Query: 317 YYDQAPLDEYGL-LRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSE 375
Y D + Y +WG E F KLQ+
Sbjct: 295 YTDFPQITSYDYDAVLTEWGEPTE--------------------KFYKLQK--------- 325
Query: 376 CAAFLVNKDKRNNATVYFSNLMYELPPLSISILP-DCKTVAFNTAKLDSVEQWEEYKEAI 434
L+ EL P + P D K + F+ AKL + + I
Sbjct: 326 --------------------LINELFPEIKTFEPRDHKRLDFSEAKLKNKTSLFSVIDKI 365
Query: 435 PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGE 494
++ + K S Y + +R K ++ ++ +H ++NGE
Sbjct: 366 SKCQKSDF-------PITMEKAGSGYGYMLYRTKVKGFNNNMNVRAVGASDRVHFYLNGE 418
Query: 495 FVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER--RVAGLRNVSIQ 552
+ G K+ D+ +M H +G N + LL VG + G L+ +V G+R + +
Sbjct: 419 YKGV---KYQDELIEPIEM-HFNDGDNILELLVENVGRVNYGYKLQECSQVKGIR-IGVM 473
Query: 553 GAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTG 612
F G++ L D+ + + + P ++Y+ F+
Sbjct: 474 AD------IHFETGFEQYALSLDNIEDVDFSADWIE--------NTP-SFYRYEFEVKEA 518
Query: 613 SDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
+D ++ +GKG A++NG ++GRYW + +IP LK N +++ E
Sbjct: 519 ADTF-LDCSKLGKGVAFINGFNLGRYW-------SEGPACYLYIPAPLLKIGVNEIIVFE 570
Query: 673 EENGYPPGISI 683
EN I++
Sbjct: 571 TENMLADSIAL 581
>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
Length = 603
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 149/319 (46%), Gaps = 28/319 (8%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G + DGRSL I SG++HY R P W I KA+ GL+ V+T V WN+H
Sbjct: 7 GPEDFLLDGRSLQI-------VSGALHYFRVHPDQWADRIRKARLLGLNTVETYVAWNVH 59
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
P+ G FD SGRRDL RF+ V A+GL+ +R GP+I EW GGLP WL P + R
Sbjct: 60 SPERGVFDTSGRRDLARFLDLVAAEGLHAIVRPGPYICAEWTGGGLPAWLFADPEVGVRR 119
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
F + Y ++ ++ A ++GGP+++ Q+ENEYG + Y+R
Sbjct: 120 AEPRFLEAIGEYYAALLPIV--AERQVTRGGPVLMVQVENEYGAYGDDPPVERERYLRAL 177
Query: 206 AKLAVDLQTGVPWVMCKQDD--------APDPVINACNGRQCGETFA--GPNSPDKPAIW 255
A + VP Q + P+ + A G + E A + P P +
Sbjct: 178 ADMIRAQGIDVPLFTSDQANDHHLSRGSLPELLTTANFGSRATERLAILRKHQPTGPLMC 237
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY--- 312
E W ++ G E A + +A G+ VN YM HGGTNFG T+ A
Sbjct: 238 MEFWDGWFDSAGLHHHTTPPEANARDLDDLLA--AGASVNLYMLHGGTNFGLTSGANDKG 295
Query: 313 ----VLTGYYDQAPLDEYG 327
+ T Y APL E+G
Sbjct: 296 VYRPITTSYDYDAPLSEHG 314
>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
Length = 940
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/352 (31%), Positives = 169/352 (48%), Gaps = 31/352 (8%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
V YD S II+G R + S ++HY R W ++ K+KE G + ++T V WN HE
Sbjct: 4 TRVQYDRNSWIIDGRRVFILSAAVHYFRLPRAEWAEVLDKSKEAGCNCIETYVPWNWHEE 63
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
+ GQ+DFSG +DL F+ +GLYV +R GP+I EW GGLP+WL P + +R +
Sbjct: 64 EEGQWDFSGDKDLGAFLDLCAERGLYVIVRPGPYICAEWDMGGLPYWLERKPDMQYRKFH 123
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
F ++ Y +V ++ L S G +I+ Q+ENE+ + + Y+ +
Sbjct: 124 REFLHYVDLYWDRLVPVVLPRLL--SNSGTVIMVQVENEF----QALGKPDKAYMEYLRD 177
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACN--------GRQCGETFAGPNSPDKPAIWTENW 259
++ VP V C A D + N R E FA D+P E W
Sbjct: 178 GLIERGIDVPLVTCY--GAVDGAVEFRNFWSHAEEHARTLEERFA-----DQPKGVLEFW 230
Query: 260 TSFYQVYGD-EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF----GRT--ASAY 312
+++ +G A ++A + I + + +NYYM+ GGTNF GRT +
Sbjct: 231 IGWFEQWGGPRANQKTASQVERKTYELI-REGFTAINYYMFFGGTNFGHWGGRTIGEHTF 289
Query: 313 VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
+ T Y A LDEY L K+ LK +H V+ ++P+L+ S F L
Sbjct: 290 MTTSYDYDAALDEY-LRPTAKYKALKLVHDFVRW-MEPLLTETTGSTAFIPL 339
Score = 43.5 bits (101), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 47/106 (44%), Gaps = 29/106 (27%)
Query: 602 WYKTVFDAP--TGSD-------------------PVAINLISMGKGEAWVNGQSIGRYWV 640
W+K FD P +G D + I L + KG WVNG +GRYW
Sbjct: 839 WFKAAFDWPEHSGDDSLKRTDSVHAEQAGEPDGAKLKITLDGLSKGILWVNGFCLGRYW- 897
Query: 641 SFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTV 686
Q P +S Y IP S LK N ++ +EE +P G+ ++ V
Sbjct: 898 -----QIGPQES-YKIPVSLLKKR-NEVLFYDEEGCHPGGVRLELV 936
>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
Length = 591
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 149/312 (47%), Gaps = 23/312 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ TYDG + L+SG+IHY R P+ W + K K G + V+T V WNLHEP
Sbjct: 9 DRFTYDGEEIR-------LYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEP 61
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
Q G+F F G DL RFI+ GL+V +R P+I EW +GGLP WL PG+ R +
Sbjct: 62 QEGRFVFEGMADLERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCAD 121
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEK-GPPYVRW 204
+ + Y ++ + L + GGP+IL Q+ENEYG + ++LE VR
Sbjct: 122 PLYLSKVDAYYDELIP--RLVPLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRR 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSF 262
+ + G M + P + G + E+FA P P + E W +
Sbjct: 180 GIDVPLFTSDGPTDSMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGW 239
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLT 315
+ + +E R A D A + G+ VN+YM+HGGTNFG A +T
Sbjct: 240 FDHWMEEHHQRDAADAARVFGEMLE--AGASVNFYMFHGGTNFGFYNGANHIKTYEPTIT 297
Query: 316 GYYDQAPLDEYG 327
Y +PL E+G
Sbjct: 298 SYDYDSPLTEWG 309
>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
Length = 591
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 149/312 (47%), Gaps = 23/312 (7%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ TYDG + L+SG+IHY R P+ W + K K G + V+T V WNLHEP
Sbjct: 9 DRFTYDGEEIR-------LYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEP 61
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
Q G+F F G DL RFI+ GL+V +R P+I EW +GGLP WL PG+ R +
Sbjct: 62 QEGRFVFEGMADLERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCAD 121
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEK-GPPYVRW 204
+ + Y ++ + L + GGP+IL Q+ENEYG + ++LE VR
Sbjct: 122 PLYLSKVDAYYDELIP--RLVPLLCTSGGPVILVQVENEYGSYGSDKAYLEHLRDGLVRR 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSF 262
+ + G M + P + G + E+FA P P + E W +
Sbjct: 180 GIDVPLFTSDGPTDSMLQGGSLPGVLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGW 239
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLT 315
+ + +E R A D A + G+ VN+YM+HGGTNFG A +T
Sbjct: 240 FDHWMEEHHQRDAADAARVFGEMLE--AGASVNFYMFHGGTNFGFHNGANHIKTYEPTIT 297
Query: 316 GYYDQAPLDEYG 327
Y +PL E+G
Sbjct: 298 SYDYDSPLTEWG 309
>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
Length = 664
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/332 (33%), Positives = 158/332 (47%), Gaps = 32/332 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++F GSIHY R + W + K K G + + T V WNLHEPQ G+FDFSG
Sbjct: 93 LGGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSGNL 152
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL F+ GL+V LR GP+I E GGLP WL P ++ R+ + F + +Y
Sbjct: 153 DLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPKMILRTTYKGFVEAVNKYF 212
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGP--PYVRWAAKLAVDLQTGV 216
+++ + L + GPII Q+ENEYG SF E PY++ A L+ G+
Sbjct: 213 DHLIS--RVVPLQYRKRGPIIAVQVENEYG----SFAEDKDYMPYIQKAL-----LERGI 261
Query: 217 PWVMCKQDDAPDPVINACNGRQCG---ETFAGPN-------SPDKPAIWTENWTSFYQVY 266
++ DDA + G TF + +KP + E W ++ +
Sbjct: 262 VELLMTSDDAKHMLKGYIEGVLATINMNTFQINDFKQLSQVQRNKPIMVMEFWVGWFDTW 321
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G + I++AED+ V+ FI N YM+HGGTNFG A V+T Y
Sbjct: 322 GGKHMIKNAEDVEDTVSKFITSEIS--FNVYMFHGGTNFGFMNGATYFGKHRGVVTSYDY 379
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPM 351
A L E G + + K S V + L P+
Sbjct: 380 DAVLTEAGDYTEKYFKLRKLFGSVVAVHLPPL 411
>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
CL03T12C18]
Length = 782
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/327 (30%), Positives = 163/327 (49%), Gaps = 31/327 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG ++ + IHYPR + W I K G++ + VFWN HEP+ G++DF
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G++D+ F + Q G+YV +R GP++ EW GGLP+WL I R + ++M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQD---PYYM 149
Query: 155 KRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVD 211
+R + + K A L S+GG II+ Q+ENEYG ++ ++ + V+ A
Sbjct: 150 ERVKLFMNEVGKQLADLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF---- 205
Query: 212 LQTGVPWVMCK-----QDDAPDPV---INACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
TGVP C +++A D + IN G + F PD P + +E W+
Sbjct: 206 --TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSG 263
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------VLT 315
++ +G + RSAED+ + + + + YM HGGT+FG A T
Sbjct: 264 WFDHWGAKHETRSAEDLVKGMKEMLD--RNISFSLYMTHGGTSFGHWGGANFPNFSPTCT 321
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHS 342
Y AP++E G + PK+ ++ L S
Sbjct: 322 SYDYDAPINESGKV-TPKYFEVRNLLS 347
>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
Length = 586
Score = 150 bits (379), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 147/304 (48%), Gaps = 16/304 (5%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG++HY R P +W I KA+ GL+ ++T V WN H PQ G+F
Sbjct: 7 DFLLDGKPFRILSGALHYFRVHPDLWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTD 66
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RF++ V+A+G+ +R GP+I EW GGLP WL P + R D + +
Sbjct: 67 GALDLERFLRLVEAEGMLAIVRPGPYICAEWDNGGLPGWLFRDPAVGVRRDEPLYMEAVS 126
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQ 213
Y +++++ A +GGP++L Q+ENEYG +H +LEK R
Sbjct: 127 EYLGTVLDLV--APFQVDRGGPVVLVQVENEYGAYGSDHVYLEKLMALTRSHGITVPLTS 184
Query: 214 TGVPWVMCKQDDAPDPVINACN-GRQCGETFAG--PNSPDKPAIWTENWTSFYQVYGDEA 270
P D + D + + G + E A + P P + E W ++ +G
Sbjct: 185 IDQPSGTMLADGSIDGLHRTGSFGSRSAERLATLREHQPTGPLMCAEFWDGWFDHWGAHH 244
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQAPL 323
SA+D A + +A G+ VN YM+HGGTNFG T+ A T Y APL
Sbjct: 245 HTTSAQDAARELDELLAA--GASVNIYMFHGGTNFGFTSGANDKGVYQPTTTSYDYDAPL 302
Query: 324 DEYG 327
E G
Sbjct: 303 AEDG 306
>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
Length = 586
Score = 149 bits (377), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 146/311 (46%), Gaps = 30/311 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG++HY R P W I KA+ GL+ ++T V WN H P+PG FD
Sbjct: 10 DFLLDGEPFRILSGALHYFRVHPDQWADRIEKARLMGLNTIETYVPWNAHSPRPGVFDTD 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RF++ V+ G+Y +R GPFI EW GGLP WL PG+ R F ++
Sbjct: 70 GILDLPRFLRLVKDAGMYAIVRPGPFICAEWDNGGLPPWLFREPGVGIRRHEPRFLDEVE 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQ 213
+Y ++ +++ ++ GGP++L Q+ENEYG + +L+ +R A
Sbjct: 130 KYLHQVLALVRPHQV--DLGGPVLLVQVENEYGAYGDDRDYLQAVADMIRGAG------- 180
Query: 214 TGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS----------PDKPAIWTENWTSFY 263
VP V Q +G +F ++ P P + E W ++
Sbjct: 181 IDVPLVTVDQPVDAMLAAGGLDGVLRTSSFGSDSANRLRTLRDHQPTGPLMCMEFWDGWF 240
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTG 316
+G E A + +A G+ VN YM+HGGTNFG T+ A +T
Sbjct: 241 DHWGGRHHTTPVEQAAEELDALLA--AGASVNVYMFHGGTNFGLTSGANDKGIYRPTVTS 298
Query: 317 YYDQAPLDEYG 327
Y APLDE G
Sbjct: 299 YDYDAPLDEAG 309
>gi|332030018|gb|EGI69843.1| Beta-galactosidase [Acromyrmex echinatior]
Length = 594
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 112/331 (33%), Positives = 159/331 (48%), Gaps = 50/331 (15%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V Y+ +++G SGS HY R+ Q W + K + GL+ + T V W+LHEP+
Sbjct: 1 DVDYENNQFLLDGKPFQYVSGSFHYFRTPRQYWRDRLRKMRAAGLNAISTYVEWSLHEPE 60
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFW-LHDVPGIVFRSDN 147
PGQF+++G DLV F+ Q + L+V LR GP+I E GGLP+W L +VP I R+ +
Sbjct: 61 PGQFNWTGDADLVNFLNIAQEEDLFVLLRPGPYICAERDMGGLPYWLLREVPNINLRTKD 120
Query: 148 EPFKFHMKRYATMIVN--MMKAARLYASQGGPIILSQIENEYG-----------MVEHSF 194
F RYAT+ +N + K L GGPII+ QIENEYG M++ F
Sbjct: 121 ADF----VRYATLYLNEILSKIRPLLRGNGGPIIMVQIENEYGSYYACDIEYMDMLKEVF 176
Query: 195 LEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA--------GP 246
++K V A L + C ++ +F GP
Sbjct: 177 VKK----VGNKALLYTTDGAAASLLRCGFISGAYATVDFGTASNVTNSFLSMRLYQPRGP 232
Query: 247 --NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
NS P W +W +Q EA ++S E++ +AL G+ VN+YM++GGTN
Sbjct: 233 LVNSEFYPG-WLTHWGEPFQRTKTEAIVKSLEEM---LAL------GASVNFYMFYGGTN 282
Query: 305 FGRTASAY--------VLTGYYDQAPLDEYG 327
FG T+ A LT Y APL E G
Sbjct: 283 FGFTSGANGGAGVYNPQLTSYDYDAPLTEAG 313
>gi|291557570|emb|CBL34687.1| Beta-galactosidase [Eubacterium siraeum V10Sc8a]
Length = 579
Score = 149 bits (377), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 167/667 (25%), Positives = 273/667 (40%), Gaps = 118/667 (17%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
++G + SGSIHY R+ P+ W + K G + V+T + WN HE + G F+++G
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWNGMH 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
D+ RFI+ GLY+ +R P+I EW +GGLP WL + R +P+ + Y
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDSYY 131
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
+++ M K A GG II+ QIENEYG + S+LE +R + +
Sbjct: 132 SVL--MPKLAPYQIDNGGNIIMMQIENEYGYYGNDTSYLEFLRDTMRKYGITVPFVTSDG 189
Query: 217 PW----VMCKQDDAPDPVINACNGR--QCGET--FAGPNSPDKPAIWTENWTSFYQVYGD 268
PW D P N + Q GE F G DKP + E W ++ V+G+
Sbjct: 190 PWSEFVFKSGMVDGALPTGNFGSSAEWQFGEMRRFIG---EDKPLMCMEFWNGWFDVWGE 246
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG------RTASAYVLTGYYDQAP 322
E I + E A + + +K +N+YM+ GGTNFG ++T Y AP
Sbjct: 247 EHNITAPEKAAQELDIL---LKNGSMNFYMFEGGTNFGFMSGKNNEKKTGIVTSYDYDAP 303
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVN 382
L E G + + K+ KE+ S + L+ + + + K++ C A
Sbjct: 304 LTEDGRITE-KYEKCKEVISRYTDINEVPLTTQIRRLEYGKIR----------CTA---- 348
Query: 383 KDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSL 442
KT F+T LDS+ + K P
Sbjct: 349 -----------------------------KTDLFST--LDSIS--DPIKSVYP------- 368
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGK 502
E+++ S Y + +R +++ S ++ + + F NG++ +A +
Sbjct: 369 ---LSFEELD-----SYYGYVLYRLHIRENETVSTVRCENTADRVQGFRNGKYAFTAFAE 420
Query: 503 HSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSS 562
D+ F L + + LL +G + G LE + G + G + D
Sbjct: 421 TIDEQFELAEK----SAGGTTDLLVENIGRVNFGTGLECQHKG-----VLGGIRINDHRQ 471
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLIS 622
+ + L E DY G + P +YK F+ +D ++
Sbjct: 472 YGFEMFTLPLDENQLGRIDYNR--------GYNDGVP-AFYKFEFEISEVADTF-LDTDG 521
Query: 623 MGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGIS 682
GKG A++NG ++GR+W Q +IP LK N +V+ E E G S
Sbjct: 522 FGKGVAFINGFNLGRFW-------NIGPQKKLYIPAPLLKKGKNEIVIFETE-----GNS 569
Query: 683 IDTVSVT 689
D+++++
Sbjct: 570 ADSITLS 576
>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 782
Score = 149 bits (376), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 163/327 (49%), Gaps = 31/327 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG+ ++ + IHYPR + W I K G++ + VFWN HEP+ G++DF
Sbjct: 33 KTFLLNGNPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G++D+ F + Q G+YV +R GP++ EW GGLP+WL I R + ++M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQD---PYYM 149
Query: 155 KRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVD 211
+R + + K L S+GG II+ Q+ENEYG ++ ++ + V+ A
Sbjct: 150 ERVKLFMNEVGKQLTDLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF---- 205
Query: 212 LQTGVPWVMCK-----QDDAPDPV---INACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
TGVP C +++A D + IN G + F PD P + +E W+
Sbjct: 206 --TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSG 263
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------VLT 315
++ +G + RSAED+ + + + + YM HGGT+FG A T
Sbjct: 264 WFDHWGAKHETRSAEDLVKGMKEMLD--RNISFSLYMTHGGTSFGHWGGANFPNFSPTCT 321
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHS 342
Y AP++E G + PK+ ++ L S
Sbjct: 322 SYDYDAPINESGKV-TPKYFEVRNLLS 347
>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
Length = 781
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 128/452 (28%), Positives = 209/452 (46%), Gaps = 51/452 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG ++ + IHYPR + W I +K G++ + VFWN HEP+ G++DF
Sbjct: 33 KTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMSKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G++D+ F + Q G+YV +R GP++ EW GGLP+WL I R + ++M
Sbjct: 93 TGQKDIAAFCRMAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKEDIKLREQD---PYYM 149
Query: 155 KRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGMVEHSF-LEKGPPYVRWAAKLAVDL 212
+R + + K A L S+GG II+ Q+ENEYG SF ++K PY+ +
Sbjct: 150 ERVKLFMNEVGKQLADLQISKGGNIIMVQVENEYG----SFGIDK--PYIAAIRDMVKQA 203
Query: 213 Q-TGVPWVMCK-----QDDAPDPV---INACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
TGVP C +++A D + +N G + F P+ P + +E W+
Sbjct: 204 GFTGVPLFQCDWNSNFENNALDDLLWTVNFGTGANIDQQFERLKELRPNTPLMCSEFWSG 263
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------VLT 315
++ +G + RSAE++ + + + + YM HGGT+FG A T
Sbjct: 264 WFDHWGAKHETRSAEELVKGMKEMLD--RNISFSLYMTHGGTSFGHWGGANFPNFSPTCT 321
Query: 316 GYYDQAPLDEYG-----------LLRQ--PKWGHLKELHSAVKLCLKPMLSGVLVSMNFS 362
Y AP++E G LL+Q P+ L + ++ P V++ F
Sbjct: 322 SYDYDAPINESGKVTPKFLEVRDLLKQYLPEGEELAPIPDSIPTIAVPEFKLDEVAVLFD 381
Query: 363 KLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNL--MYELPPLSISILPDCKTVAFNTAK 420
L E I + AF D+ + +Y + L E L I+ D V N K
Sbjct: 382 NLPEPKISKDIKSMEAF----DQGWGSILYRTTLPASKEEQTLIITEAHDWAQVFLNGKK 437
Query: 421 LDSVEQWE-EYKEAIPTYDETSLRANFLLEQM 451
L ++ + + E +P E S R + L+E M
Sbjct: 438 LATLSRLKGEGTVILPPMKEES-RLDILVEAM 468
Score = 39.3 bits (90), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 19/55 (34%), Positives = 33/55 (60%), Gaps = 7/55 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
+N+++ KG W+NG ++GRYW + P Q+ Y +P +LK N +V+L+
Sbjct: 546 LNMMNWSKGMVWINGHAVGRYW------EIGPQQTLY-VPGCWLKEGDNEVVILD 593
>gi|345800024|ref|XP_546385.3| PREDICTED: galactosidase, beta 1-like 3 [Canis lupus familiaris]
Length = 808
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 156/327 (47%), Gaps = 33/327 (10%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
+ GH+ +F GSIHY R W + K K G + V T V WNLHEP+ G+FDFSG
Sbjct: 235 FTLGGHKFQVFGGSIHYFRVPRAYWGDRLRKLKACGFNTVTTYVPWNLHEPERGKFDFSG 294
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
D+ F+ GL+V LR GP+I E GGLP WL P +V R+ F + +
Sbjct: 295 NLDMEAFVLLAAEMGLWVILRPGPYICSEIDLGGLPSWLLQDPKMVLRTTYSGFVKAVDK 354
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGP--PYVRWAAKLAVDLQT 214
Y +++ + L +GGPII Q+ENEYG SF E PY++ A L+
Sbjct: 355 YFDHLIS--RVVPLQYRRGGPIIAVQVENEYG----SFAEDRGYMPYLQKAL-----LER 403
Query: 215 GVPWVMCKQDDAPD----------PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
G+ ++ DDA + IN + ++ +KP + E W ++
Sbjct: 404 GIVELLVTSDDAENLLKGHIKGVLATINMNSFQESDFKLLSYVQSNKPIMVMEFWVGWFD 463
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGY 317
+G E ++++ +D+ V FIA N YM+HGGTNFG A V+T Y
Sbjct: 464 TWGSEHKVKNPKDVEETVTKFIASEIS--FNVYMFHGGTNFGFMNGATDFGIHRGVVTSY 521
Query: 318 YDQAPLDEYGLLRQPKWGHLKELHSAV 344
A L E G + K+ L+ L +V
Sbjct: 522 DYDAVLTEAGDYTE-KYFKLRRLFGSV 547
>gi|119588243|gb|EAW67839.1| hCG1729998, isoform CRA_d [Homo sapiens]
Length = 653
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 175/349 (50%), Gaps = 30/349 (8%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGR-SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGG 72
LT + + G G T G+ + GH+ ++F GSIHY R + W + K K G
Sbjct: 56 LTPLELKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACG 115
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
+ V T V WNLHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP
Sbjct: 116 FNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLP 175
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WL P ++ R+ N+ F +++Y ++ + L Q GP+I Q+ENEYG
Sbjct: 176 SWLLQDPRLLLRTTNKSFIEAVEKYFDHLIP--RVIPLQYRQAGPVIAVQVENEYG---- 229
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAP-------DPVINACNGRQCGE-TFA 244
SF K Y+ + K L+ G+ ++ D V+ A N ++ + TF
Sbjct: 230 SF-NKDKTYMPYLHKAL--LRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQDTFN 286
Query: 245 GPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
+ DKP + E W ++ +GD+ ++ A+++ + V+ FI K + S+ N YM+HGG
Sbjct: 287 QLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGG 344
Query: 303 TNFGRTASAY-------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
TNFG A ++T Y A L E G + K+ L++L +V
Sbjct: 345 TNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSV 392
>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
Length = 587
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 168/657 (25%), Positives = 265/657 (40%), Gaps = 136/657 (20%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ +G +HY R+ W + K K G + V+T V WN+HE + G + F+G D+ FI+
Sbjct: 20 IIAGGMHYFRTMKDSWKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIE 79
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
Q+ L+V +R P+I EW +GGLP WL PG+ R+ +PF H+K Y ++ ++
Sbjct: 80 LAQSLELFVIVRPSPYICAEWEFGGLPAWLLKDPGMKVRTVYKPFMKHVKEYFEVLFKIL 139
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK--- 222
A L Q GPIIL QIENEYG + Y+ K+ D T VP V
Sbjct: 140 --APLQIDQDGPIILMQIENEYG-----YYGNDKEYLSTLLKIMRDFGTTVPVVTSDGPW 192
Query: 223 ---------QDDAPDPVINACNG-RQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA-R 271
D P +N G ++ E F +KP + E W ++ +GD+
Sbjct: 193 GEALDAGSLLADVSLPTMNFGTGAKEHIENFK-EKYVNKPVMCMEFWVGWFDAWGDDRHH 251
Query: 272 IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVL-------TGYYDQAPLD 324
R A D A + + +GS VN YM+HGGTNFG A L T Y A L
Sbjct: 252 TRDASDAANELRDILN--EGS-VNIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILT 308
Query: 325 EYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKD 384
E G L + K+ K++ S F++++E
Sbjct: 309 ECGDLTE-KYYEFKKVISE-----------------FTEIKE------------------ 332
Query: 385 KRNNATVYFSNLMYELPPLSISILPDCKTVAF-NTAKLDSVEQWEEYKEAIPTYDETSLR 443
+ +LP +A+ A LD V + + + ++
Sbjct: 333 --------------------VELLPQTHKIAYGRVAVLDKVSLFNTLETL-----SSPVK 367
Query: 444 ANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSL--GHVLHAFINGEFVGSAH- 500
N+ L M Y+ Y + D D+ V K+ L F+N + + +
Sbjct: 368 HNYPL-SMEELNQNYGYILY----RSDLGDARRVEKMYLLEANDRAQIFVNNNHIATQYD 422
Query: 501 ---GKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKEL 557
G+H EK +N + +L +G + G L + GL+ G +
Sbjct: 423 QEIGQHLSVDLEQEK-------SNRIDILIENMGRANFGPKLNAQRKGLK-----GGLVI 470
Query: 558 KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVA 617
+ ++W E + D ++ V +SR H P +YK + D
Sbjct: 471 DNHGHYNW--------EHYNLELDDINK-VDFSR-EYEDHLP-AFYKFELEIECMGDTF- 518
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
I++ GKG ++N +IGRYW Q+ ++P S LK N +++ E E
Sbjct: 519 IDMTGFGKGVVFINNVNIGRYW-------EVGPQTKLYVPESLLKKGKNTIIVFETE 568
>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
Length = 782
Score = 149 bits (375), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 162/327 (49%), Gaps = 31/327 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG ++ + IHYPR + W I K G++ + VFWN HEP+ G++DF
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G++D+ F + Q G+YV +R GP++ EW GGLP+WL I R + ++M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQD---PYYM 149
Query: 155 KRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVD 211
+R + + K L S+GG II+ Q+ENEYG ++ ++ + V+ A
Sbjct: 150 ERVKLFMNEVGKQLTDLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF---- 205
Query: 212 LQTGVPWVMCK-----QDDAPDPV---INACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
TGVP C +++A D + IN G + F PD P + +E W+
Sbjct: 206 --TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSG 263
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------VLT 315
++ +G + RSAED+ + + + + YM HGGT+FG A T
Sbjct: 264 WFDHWGAKHETRSAEDLVKGMKEMLD--RNISFSLYMTHGGTSFGHWGGANFPNFSPTCT 321
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHS 342
Y AP++E G + PK+ ++ L S
Sbjct: 322 SYDYDAPINESGKV-TPKYFEVRNLLS 347
>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
Length = 588
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 148/307 (48%), Gaps = 30/307 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
++G + SG +HY R P +W + KA+ GL+ V+T V WNLH+P+P +F G
Sbjct: 18 LDGEPFRILSGGLHYFRVHPGLWRDRLHKARLMGLNTVETYVPWNLHQPRPDEFRMDGGL 77
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL RF+ A+GL+V LR GP+I EW GGLP WL P + RS + F + Y
Sbjct: 78 DLPRFLDLAAAEGLHVLLRPGPYICAEWEGGGLPSWLLADPAMRLRSRDPNFLAAVDDYF 137
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGM-------VEH---SFLEKGPPYVRWAAKL 208
++ + RL AS+GGP++ Q+ENEYG +EH S G +
Sbjct: 138 RRLLPPLH-DRL-ASRGGPVLAVQVENEYGAYGDDTAYLEHLADSLRRHGVDVPLFTCDQ 195
Query: 209 AVDLQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
DL+ G + V+ + P + R P P + TE W ++ +G
Sbjct: 196 PADLERGALAGVLATANFGSRPAAHLATLRTA--------RPSAPLLCTEFWIGWFDRWG 247
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQ 320
+R AE + + +A G+ VN+YM+HGGTNFG A +T Y
Sbjct: 248 GNHVVRDAEQASQELDELLA--TGASVNFYMFHGGTNFGFMNGANDKHTYRPTVTSYDYD 305
Query: 321 APLDEYG 327
APLDE G
Sbjct: 306 APLDEAG 312
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 8/80 (10%)
Query: 593 GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQS 652
G +T +Y+ F+A +D ++L KG AWVNG ++GRYW P +S
Sbjct: 494 GPATPTGPAFYRGTFEADRAADAF-LHLDGWTKGSAWVNGFALGRYWSR------GPQRS 546
Query: 653 WYHIPRSFLKPTGNLLVLLE 672
Y +P L+ N +V+LE
Sbjct: 547 LY-VPGPVLRRGANEVVVLE 565
>gi|354585216|ref|ZP_09004105.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
gi|353188942|gb|EHB54457.1| glycoside hydrolase family 35 [Paenibacillus lactis 154]
Length = 619
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 165/340 (48%), Gaps = 42/340 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T+ +++G + SG++HY R P+ W + K K G + V+T + WN+HEP
Sbjct: 4 LTWKNGQYLLDGQPYRIISGAVHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPTE 63
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSG D+ FI+ GL+V +R PFI EW +GGLP WL I R +
Sbjct: 64 GEFNFSGMADVGSFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPL 123
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAK 207
+ + Y ++ M L +S GGPI+ Q+ENEYG +H++LE Y+R
Sbjct: 124 YLSKVDHYYDELIPRM--VPLLSSNGGPILAVQVENEYGSYGNDHAYLE----YLR---- 173
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACN----------GRQCGETFAGPNS--PDKPAIW 255
A ++ GV ++ D D ++ + G + E+F D+P +
Sbjct: 174 -AGLVRRGVDVLLFTSDGPTDEMLLGGSIDHVHATVNFGSRVEESFGKYREYRTDEPLMV 232
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLT 315
E W ++ + ++ +R A D+A + + KGS +N YM+HGGTNFG + A +
Sbjct: 233 MEFWNGWFDHWMEDHHVRDAADVAGVLDEMLE--KGSSINMYMFHGGTNFGFYSGANHIK 290
Query: 316 GY------YD-QAPLDEYGLLRQPKWGHLKELHSAVKLCL 348
Y YD APL E WG E + AV+ L
Sbjct: 291 TYEPTTTSYDYDAPLTE--------WGDKTEKYEAVRTVL 322
>gi|281337336|gb|EFB12920.1| hypothetical protein PANDA_005061 [Ailuropoda melanoleuca]
Length = 655
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 94/280 (33%), Positives = 139/280 (49%), Gaps = 25/280 (8%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++F GSIHY R + W + K K G + + T V WNLHEP+ G+FDFS
Sbjct: 78 LGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENL 137
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL F+ GL+V LR GP+I E GGLP WL P ++ R+ + F + +Y
Sbjct: 138 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYF 197
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGV 216
+++ + L +GGPII Q+ENEYG V+ ++ PYVR A L+ G+
Sbjct: 198 DHLIS--RVVPLQYHKGGPIIAVQVENEYGSFAVDKDYM----PYVRKAL-----LERGI 246
Query: 217 PWVMCKQDDAPD----------PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++ DDA + IN + +KP + E W ++ +
Sbjct: 247 VELLVTSDDAENLQKGYLEGVLATINMNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDTW 306
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
G + + +AED+ V+ FI N YM+HGGTNFG
Sbjct: 307 GGKHMVNNAEDVEETVSKFITSEIS--FNVYMFHGGTNFG 344
>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
Length = 574
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/308 (32%), Positives = 151/308 (49%), Gaps = 24/308 (7%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG++HY R P+ W I AK GL+ ++T V WN HEP G++D +
Sbjct: 10 DFLLDGRPHQVISGTLHYFRIHPEHWADRIRTAKAMGLNTIETYVAWNAHEPVRGEWDAT 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RF+ + A+GL+ +R GP+I EW GGLP WL PGI R F +
Sbjct: 70 GWNDLGRFLDLIAAEGLHAIVRPGPYICAEWHNGGLPVWLTSTPGIGIRRSEPQFVEAVS 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWA--AKLAVD 211
Y + ++ ++ +GG ++L QIENEYG + +L + VR A + V
Sbjct: 130 EYLRRVYEIVAPRQI--DRGGNVVLVQIENEYGAYGSDKEYLRE---LVRVTKDAGITVP 184
Query: 212 LQT---GVPWVMCKQDDAPDPVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQVY 266
L T +PW M + P+ + G + E A + P P + +E W ++ +
Sbjct: 185 LTTVDQPMPW-MLEAGSLPELHLTGSFGSRSAERLATLREHQPTGPLMCSEFWDGWFDWW 243
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G A+ + + +A G+ VN YM HGGTNFG T A ++T Y
Sbjct: 244 GSIHHTTDPAASAHDLDVLLA--AGASVNIYMVHGGTNFGTTNGANDKGRFDPIVTSYDY 301
Query: 320 QAPLDEYG 327
AP+DE G
Sbjct: 302 DAPIDESG 309
>gi|223982755|ref|ZP_03632983.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
gi|223965255|gb|EEF69539.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
12042]
Length = 592
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 151/315 (47%), Gaps = 33/315 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G L SG++HY R P+ W + K K G + V+T + WN HEP+ GQFDFS
Sbjct: 9 DFMLDGQPVKLISGALHYFRIVPEYWQDRLEKLKNMGCNCVETYIPWNYHEPKKGQFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
GR+D+ RF+++ QA GL+V LR P+I EW +GGLP WL + RS +P+ +
Sbjct: 69 GRKDVARFVRKAQALGLWVILRPTPYICAEWEFGGLPAWLLADDSMRVRSTYQPYLDAVD 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y + +++ L+ + GGP+++ QIENEYG + Y++ +L
Sbjct: 129 AYYAELFKVIRP--LFFTHGGPVLMCQIENEYGSFGND-----KQYLKAIKRLMEKHGCD 181
Query: 216 VP-------W-------VMCKQDDAPDPVINACNGRQCG--ETFAGPNSPDKPAIWTENW 259
VP W + + P + Q G F N P + E W
Sbjct: 182 VPMFTSDGGWREVLDAGTLLNEGVLPTANFGSRTDEQIGALRQFMNDNDIHGPLMCMEFW 241
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN--FGRTASAY----- 312
++ +G + R A++ A + A ++ VN YM+HGGTN F S +
Sbjct: 242 IGWFNNWGSPLKTRDAKEAADELD---AMLRQGSVNIYMFHGGTNPEFYNGCSYHNGMDP 298
Query: 313 VLTGYYDQAPLDEYG 327
+T Y APL E+G
Sbjct: 299 QITSYDYAAPLTEWG 313
>gi|296216696|ref|XP_002807336.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Callithrix jacchus]
Length = 652
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 174/355 (49%), Gaps = 39/355 (10%)
Query: 19 GSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQT 78
G++ G G + T + GH+ ++F GSIHY R + W + K K G + V T
Sbjct: 68 GTESTGQGNPHFT-------LEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTT 120
Query: 79 LVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDV 138
V WNLHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP WL
Sbjct: 121 YVPWNLHEPERGRFDFSGNLDLEAFVLMASEIGLWVILRPGPYICSEIDLGGLPSWLLQD 180
Query: 139 PGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKG 198
P ++ R+ N+ F +++Y ++ + L QGGP+I Q+ENEYG +K
Sbjct: 181 PQLLLRTTNKGFIEAVEKYFDHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKD--KKY 236
Query: 199 PPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNG--------RQCGETFAGPN--S 248
PY+ A L+ G+ ++ D + + G + TF+ +
Sbjct: 237 MPYLHKAM-----LRRGIVELLLTSDGEKNVLSGHTKGVLATINLQKLHRNTFSQLHKVQ 291
Query: 249 PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRT 308
DKP + E W ++ + D+ + A++I + V+ FI K + S+ N YM+HGGTNFG
Sbjct: 292 RDKPLLNMEYWVGWFDRWXDKHHVTDAKEIEHTVSEFI-KYEISF-NVYMFHGGTNFGFL 349
Query: 309 ASAY-------VLTGYYDQAPLDEYGLLRQPKWGHLKEL---HSAVKLCLKPMLS 353
A V+T Y A L E G + K+ L++L SA+ L P L+
Sbjct: 350 NGATYFGKHAGVVTSYDYDAVLTEAGDYTE-KYFKLQKLFGSFSAIPLPRVPKLT 403
>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
R26]
Length = 628
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 117/363 (32%), Positives = 173/363 (47%), Gaps = 40/363 (11%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
++NG + SG +HYPR + W + K GL+ V T VFWN HE PG++++SG
Sbjct: 36 FLLNGKLFSIHSGEMHYPRIPQEYWKHRLQMMKAMGLNAVTTYVFWNYHEENPGKWNWSG 95
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
+DL +FIK Q GLYV +R GP++ EW +GG P+WL ++ G+ R DN F ++
Sbjct: 96 EKDLKKFIKTAQEVGLYVIIRPGPYVCAEWEFGGYPWWLQNIKGLKIREDNNLFLAETQK 155
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFL--EKGPP---YVRWAAKLAVD 211
Y T + N +K ++ + GGP+I+ Q ENE+G SF+ K P + + AK+
Sbjct: 156 YITQLYNQVKDLQI--TNGGPVIMVQAENEFG----SFVAQRKDIPLASHRTYNAKIVKQ 209
Query: 212 LQ-TGVPWVMCKQDDA----PDPVINA---CNGRQCGETFAGP----NSPDKPAIWTENW 259
L+ G M D + V+ A NG E N+ P + E +
Sbjct: 210 LKDAGFSVPMFTSDGSWLFEGGSVVGALPTANGEDNIENLKKIVNQYNNNQGPYMVAEFY 269
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ + ++ A +A ++ K S+ NYYM HGGTNFG T A
Sbjct: 270 PGWLAHWAEKFPRVDAGTVARQTDKYL-KNDVSF-NYYMVHGGTNFGFTNGANYDKNHDI 327
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL---HSAVKLCLKPMLSGV--LVSMNFSKLQ 365
LT Y AP+ E G R PK+ L+ + H+ KL P V + + SKL
Sbjct: 328 QPDLTSYDYDAPITEAG-WRTPKYDSLRAVISKHTKAKLPEVPAPIKVIDIKDIKLSKLY 386
Query: 366 EAF 368
F
Sbjct: 387 NFF 389
>gi|301763006|ref|XP_002916929.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Ailuropoda
melanoleuca]
Length = 1209
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 156/325 (48%), Gaps = 33/325 (10%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++F GSIHY R + W + K K G + + T V WNLHEP+ G+FDFS
Sbjct: 499 LGGHKFLIFGGSIHYFRVPREYWRDRLMKLKACGFNTLTTYVPWNLHEPERGKFDFSENL 558
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL F+ GL+V LR GP+I E GGLP WL P ++ R+ + F + +Y
Sbjct: 559 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPEMILRTTYKGFVEAVDKYF 618
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGV 216
+++ + L +GGPII Q+ENEYG V+ ++ PYVR A L+ G+
Sbjct: 619 DHLIS--RVVPLQYHKGGPIIAVQVENEYGSFAVDKDYM----PYVRKAL-----LERGI 667
Query: 217 PWVMCKQDDAPD----------PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
++ DDA + IN + +KP + E W ++ +
Sbjct: 668 VELLVTSDDAENLQKGYLEGVLATINMNTFEKSAFEQLSQLQRNKPIMVMEYWVGWFDTW 727
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G + + +AED+ V+ FI N YM+HGGTNFG A V+T Y
Sbjct: 728 GGKHMVNNAEDVEETVSKFITSEIS--FNVYMFHGGTNFGFMNGATYFGIHRAVVTSYDY 785
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAV 344
A L E G + K+ L+ L +V
Sbjct: 786 DALLTEAGDYTK-KYFKLQRLFRSV 809
Score = 65.9 bits (159), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 76/177 (42%), Gaps = 27/177 (15%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
+G S ++G ++ +G+IHY R + W + K K G + V T
Sbjct: 52 EGSSFTLDGSPFLIIAGTIHYFRVPREYWRDRLMKLKACGFNTVTTA------------- 98
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
F+ GL+V L GP+I + GGLP WL P + R+ F
Sbjct: 99 ----------FVAMASDVGLWVILCPGPYIGSDLDLGGLPSWLLRDPKMKLRTTYRGFTK 148
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ Y I+ K +L +GGPII Q+ENEYG ++ PY++ A ++
Sbjct: 149 AVNLYFDKIIP--KIVQLQYGKGGPIIALQVENEYGSYHQD--KRYMPYIKKLAPVS 201
>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
domestica]
Length = 673
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 105/325 (32%), Positives = 156/325 (48%), Gaps = 27/325 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
++ G R +F GSIHY R + W + K K GL+ + T + WNLHEP+ G+F+FS
Sbjct: 89 EFLLEGSRFRIFGGSIHYFRVPREYWKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFS 148
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+ F++ GL+V LR GP+I EW GGLP WL + R+ F +
Sbjct: 149 GNLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYVGFIKAVD 208
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + L +QGGPII Q+ENEYG +K P Y+ + K+A+ L+ G
Sbjct: 209 LYFNQLIP--RVVPLQYTQGGPIIAVQVENEYGSY-----DKDPNYMPY-IKMAL-LKRG 259
Query: 216 VPWVMCKQDDAPD----------PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQV 265
+ ++ D+ IN N + +KP + TE WT ++
Sbjct: 260 IVELLMTSDNKDGLSGGYVEGVLATINLKNVDSIIFNYLQSFQDNKPTMVTEFWTGWFDT 319
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYY-DQAPLD 324
+G I A+D+ V+ I G+ +N YM+HGGTNFG A T Y D D
Sbjct: 320 WGGPHHIVDADDVMVSVSSIIQ--MGASLNLYMFHGGTNFGFMNGAQHFTDYQADVTSYD 377
Query: 325 EYGLLRQ-----PKWGHLKELHSAV 344
+L + PK+ L+E S +
Sbjct: 378 YDAILTEAGDYTPKFFKLREYFSTL 402
>gi|346320352|gb|EGX89953.1| beta-calactosidase, putative [Cordyceps militaris CM01]
Length = 633
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 153/322 (47%), Gaps = 33/322 (10%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
+Y+ ++NG + G + R P+ W + A+ GL+ + + ++WNLHEP+PG
Sbjct: 30 SYNRTDFLLNGQPFQIIGGQMDPQRILPEYWTHRLKMARAMGLNTIFSYLYWNLHEPRPG 89
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
+DFSGR D+ RF + Q +GL V LR GP+I GE +GG P WL VPG+ R +N PF
Sbjct: 90 AWDFSGRNDVARFFRLAQQEGLRVVLRPGPYICGERDWGGFPAWLSQVPGMAVRQNNRPF 149
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKL 208
K Y + + +L +QGGPI+++Q+ENEYG + ++L +R +
Sbjct: 150 LDAAKSYIDRLGKEL--GQLQITQGGPILMAQLENEYGSFGTDKTYLAALAAMLRENFDV 207
Query: 209 AVDLQTG----------VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAI-WTE 257
+ G + V+ D A + T GP + I W +
Sbjct: 208 FLYTNDGGGQSYLEGGQLHGVLAVIDGDSQSGFAARDKYVTDPTSLGPQLNGEYYISWID 267
Query: 258 NWTSFY---QVYGDEARIRSAEDIAYHVALFIAKMKGSY-VNYYMYHGGTNFGRTAS--- 310
W S Y Q+ G +A D+A VA + G Y + YM+HGGTNFG
Sbjct: 268 QWGSDYPHQQIAGSQA------DVAKAVADLDWTLAGGYSFSIYMFHGGTNFGFENGGIR 321
Query: 311 -----AYVLTGYYDQAPLDEYG 327
A + T Y APLDE G
Sbjct: 322 DDGPLAAMTTSYDYGAPLDESG 343
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 40/79 (50%), Gaps = 13/79 (16%)
Query: 601 TWYKTVFDAPTGS--DPVAINLISMGKG---EAWVNGQSIGRYWVSFLTPQGTPSQSWYH 655
+Y FD P G+ DP +++ KG WVNG ++GRYW P QS Y
Sbjct: 537 VFYTGSFDMPAGAAADPSGDTFLAVPKGIKGVLWVNGVNMGRYWTV------GPQQSLY- 589
Query: 656 IPRSFLKPTGNLLVLLEEE 674
+P S LK N +VLLE E
Sbjct: 590 VPGSILKAR-NKVVLLELE 607
>gi|255015104|ref|ZP_05287230.1| beta-glycosidase [Bacteroides sp. 2_1_7]
gi|410104527|ref|ZP_11299440.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
gi|409234336|gb|EKN27166.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
Length = 768
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 166/671 (24%), Positives = 269/671 (40%), Gaps = 133/671 (19%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + SG +HYPR Q W + + GL+ V T VFWNLHE +PG++DF G +
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
+L +I+ +GL V LR GP++ EW +GG P+WL ++PG+ R DN F K Y
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---YVRWAAKLA---VDL 212
+ + L S+GGPII+ Q ENE+G + K P + R+ AK+ D
Sbjct: 159 DKLYE--QVGDLQVSKGGPIIMVQAENEFG--SYVAQRKDIPLEEHRRYNAKIKRQLADA 214
Query: 213 QTGVPWV------MCKQDDAPDPV------INACNGRQCGETFAGPNSPDKPAI----WT 256
VP + + P + N N ++ + G P A W
Sbjct: 215 GFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
+W + D R E + F N+YM HGGTNFG T+ A
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKK 325
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
LT Y AP+ E G + PK+ ++ +
Sbjct: 326 HDIQPDLTSYDYDAPISEAGWV-TPKFDSIR------------------------NVIRK 360
Query: 368 FIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW 427
++ E A + + E+P +S++ + D +A
Sbjct: 361 YVTYDVPEAPAPIP---------------LIEIPSISLTKVADVLALA------------ 393
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVL 487
KE P T L EQ+N Y+ Y+ F + L++ L
Sbjct: 394 ---KEGEPVASPTPL----TFEQLN---QGYGYVLYSTHFNQ---PLKGRLEIPGLRDYA 440
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
+++GE VG ++ F M I + +L +G + G + R G +
Sbjct: 441 TIYVDGERVGEL-----NRCFNQYAMEIDIPFNATLDILVENMGRINYGEEIVRNTKGII 495
Query: 547 RNVSIQGAKELKDFSSFS--WGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
+V I G+ E+ D+ + L+ ++ ++ + + + ++P+ +
Sbjct: 496 SSVKINGS-EISDWKMYKLPMDRMPALVSDEPYVYKNGSPEV------AALGNKPVLYEG 548
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPT 664
T + TG I++ GKG ++NG +IGRYW + P Q+ Y IP +L
Sbjct: 549 TFHLSDTGD--TFIDMEDWGKGIIFINGVNIGRYWYA------GPQQTLY-IPGVWLNKG 599
Query: 665 GNLLVLLEEEN 675
N +V+ E+ N
Sbjct: 600 ENKIVIYEQLN 610
>gi|290956543|ref|YP_003487725.1| glycosyl hydrolase family 42 [Streptomyces scabiei 87.22]
gi|260646069|emb|CBG69162.1| putative glycosyl hydrolase (family 42) [Streptomyces scabiei
87.22]
Length = 591
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 157/321 (48%), Gaps = 36/321 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T ++NG + SG++HY R P +W + KA+ GL+ V+T V WNLH+P P
Sbjct: 6 LTTSSDGFLLNGEPFRIVSGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQPDP 65
Query: 90 GQ-FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G DL R++ +A+GL+V LR GP+I EW GGLP WL PGI RS +
Sbjct: 66 DSPLVLDGLLDLPRYLSLARAEGLHVLLRPGPYICAEWDGGGLPSWLTSDPGIRLRSSDP 125
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F + Y + + + A+ GGP+I Q+ENEYG + ++L+ +V A
Sbjct: 126 RFTDALDGY--LDILLPPLLPYMAANGGPVIAVQVENEYGAYGDDTAYLK----HVHQAL 179
Query: 207 KLAVDLQTGVPWVMCKQDDA-----------PDPVINACNGRQCGETFAG--PNSPDKPA 253
+ GV ++ D A P + A G + E+ A + P+ P
Sbjct: 180 R-----ARGVEELLFTCDQAGSGHHLAAGSLPGVLSTATFGGKIEESLAALRAHMPEGPL 234
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY- 312
+ +E W ++ +G+E +R AE A + +A G+ VN YM+HGGTNFG T A
Sbjct: 235 MCSEFWIGWFDHWGEEHHVRDAESAAADLDKLLA--AGASVNIYMFHGGTNFGFTNGANH 292
Query: 313 ------VLTGYYDQAPLDEYG 327
++T Y A L E G
Sbjct: 293 DQCYAPIVTSYDYDAALTESG 313
Score = 41.2 bits (95), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 39/72 (54%), Gaps = 8/72 (11%)
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
+++ F+ T +D ++L KG+AW+NG +GRYW P ++ Y +P
Sbjct: 506 AFHRGTFEIDTPAD-TFLSLPGWTKGQAWINGFHLGRYW------NRGPQRTLY-VPGPV 557
Query: 661 LKPTGNLLVLLE 672
L+P N LVLLE
Sbjct: 558 LRPGANELVLLE 569
>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
Length = 797
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/411 (27%), Positives = 184/411 (44%), Gaps = 35/411 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ + VFWN+HE + GQFDF+
Sbjct: 38 TFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFT 97
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ F + Q G+YV +R GP++ EW GGLP+WL I R + F ++
Sbjct: 98 GQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVE 157
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR--WA------ 205
+ + + A L +GGPII+ Q+ENEYG + +++ + +R W+
Sbjct: 158 LFEQKVAEQL--APLTIRRGGPIIMVQVENEYGSYGEDKAYVSQIRDVLRRYWSLSPTGE 215
Query: 206 --AKLAVDLQTGVPWVMCKQDDAPDPVI---NACNGRQCGETFA--GPNSPDKPAIWTEN 258
+ A L W + D ++ N G + F G PD P + +E
Sbjct: 216 GRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPDAPKMCSEF 275
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W+ ++ +G R A D+ + ++ KG + YM HGGT+FG A A
Sbjct: 276 WSGWFDKWGARHETRPARDMVAGIDEMLS--KGISFSLYMTHGGTSFGHWAGANSPGFAP 333
Query: 314 -LTGYYDQAPLDEYGLLRQPKW---GHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI 369
+T Y AP++EYG W +++ + KL P + LVS LQ A
Sbjct: 334 DVTSYDYDAPINEYGQATPKFWELRKTMEKYNDGRKLPAVPKAAAPLVSFPKVTLQPALT 393
Query: 370 FQ--GSSECAAFLVNKDKRNN---ATVYFSNLMYELPPLSISILPDCKTVA 415
+ + + V + + ++S + E+P S+ L D A
Sbjct: 394 LRHFATRTVKSLDVKSFEEMGMGWGSAFYSTTLPEVPQPSLLTLNDAHDFA 444
>gi|359496328|ref|XP_003635211.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
gi|296080974|emb|CBI18606.3| unnamed protein product [Vitis vinifera]
Length = 198
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/207 (39%), Positives = 118/207 (57%), Gaps = 32/207 (15%)
Query: 623 MGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSF 660
MGKG+AWVNGQSIGRYW ++L P G P+Q+ YHIPR++
Sbjct: 1 MGKGQAWVNGQSIGRYWPAYLAPSTGCTTNCDYRGAYDASKCLRNCGQPAQTLYHIPRTW 60
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
+ NLLVL EE G P IS+ T + +C HVS++ PP SW + +
Sbjct: 61 VHSGKNLLVLHEELGGDPSKISLLTRTGQEVCAHVSEADPPPADSW--------QPNLEF 112
Query: 721 PGRRPKVQIRCPSGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKRSCT 780
+ +V++ C G IS I FAS+G P G+C + G+CH +N ++V++AC+G+ C
Sbjct: 113 MSQSSQVRLTCEQGWHISMINFASFGTPRGHCGTFNPGNCH-ANVLSVVQQACIGQEGCA 171
Query: 781 VPVWTEKFYGDPCPGIPKALLVDAQCT 807
+PV T + GDPCPG+ K+L ++A C+
Sbjct: 172 IPVSTARL-GDPCPGVLKSLAIEALCS 197
>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
Length = 859
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/411 (27%), Positives = 184/411 (44%), Gaps = 35/411 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ + VFWN+HE + GQFDF+
Sbjct: 100 TFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFT 159
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ F + Q G+YV +R GP++ EW GGLP+WL I R + F ++
Sbjct: 160 GQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVE 219
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR--WA------ 205
+ + + A L +GGPII+ Q+ENEYG + +++ + +R W+
Sbjct: 220 LFEQKVAEQL--APLTIRRGGPIIMVQVENEYGSYGEDKAYVSQIRDVLRRYWSLSPTGE 277
Query: 206 --AKLAVDLQTGVPWVMCKQDDAPDPVI---NACNGRQCGETFA--GPNSPDKPAIWTEN 258
+ A L W + D ++ N G + F G PD P + +E
Sbjct: 278 GRGEAASPLMFQCDWSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPDAPKMCSEF 337
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W+ ++ +G R A D+ + ++ KG + YM HGGT+FG A A
Sbjct: 338 WSGWFDKWGARHETRPARDMVAGIDEMLS--KGISFSLYMTHGGTSFGHWAGANSPGFAP 395
Query: 314 -LTGYYDQAPLDEYGLLRQPKW---GHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI 369
+T Y AP++EYG W +++ + KL P + LVS LQ A
Sbjct: 396 DVTSYDYDAPINEYGQATPKFWELRKTMEKYNDGRKLPAVPKAAAPLVSFPKVTLQPALT 455
Query: 370 FQ--GSSECAAFLVNKDKRNN---ATVYFSNLMYELPPLSISILPDCKTVA 415
+ + + V + + ++S + E+P S+ L D A
Sbjct: 456 LRHFATRTVKSLDVKSFEEMGMGWGSAFYSTTLPEVPQPSLLTLNDAHDFA 506
>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
Length = 589
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/288 (32%), Positives = 148/288 (51%), Gaps = 26/288 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G + SG+IHY R P W + K G + V+T V WNLHE + GQFDF
Sbjct: 8 EEFLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G +DLV F+K+ + GL V LR GP+I EW GGLP WL + + R D+E F +
Sbjct: 68 TGGKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y +++ ++ L ++GGP+I+ Q+ENEYG + L Y+R K+ D
Sbjct: 128 ENYFKVLLPLI--VPLQVTKGGPVIMVQVENEYGSFSNDKL-----YLRALKKMIEDAGI 180
Query: 215 GVP-------W---VMCKQDDAPDPVINACNGRQCGETFAGPNS----PDK--PAIWTEN 258
VP W +M + ++ A G + E F S DK P + E
Sbjct: 181 DVPLFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEF 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ + ++ +R A+++ + + +GS +N YM+HGGTNFG
Sbjct: 241 WCGWFNRWNEDIILRDADEVMTCMKELLQ--RGS-LNLYMFHGGTNFG 285
>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
Length = 589
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/288 (32%), Positives = 148/288 (51%), Gaps = 26/288 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G + SG+IHY R P W + K G + V+T V WNLHE + GQFDF
Sbjct: 8 EEFLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G +DLV F+K+ + GL V LR GP+I EW GGLP WL + + R D+E F +
Sbjct: 68 TGGKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y +++ ++ L ++GGP+I+ Q+ENEYG + L Y+R K+ D
Sbjct: 128 ENYFKVLLPLI--VPLQVTKGGPVIMVQVENEYGSFSNDKL-----YLRALKKMIEDAGI 180
Query: 215 GVP-------W---VMCKQDDAPDPVINACNGRQCGETFAGPNS----PDK--PAIWTEN 258
VP W +M + ++ A G + E F S DK P + E
Sbjct: 181 DVPLFTSDGAWEQALMSGTLIEEEVLVTANFGSRGNENFDVLQSFMEKHDKKWPLMCMEF 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ + ++ +R A+++ + + +GS +N YM+HGGTNFG
Sbjct: 241 WCGWFNRWNEDIILRDADEVMTCMKELLQ--RGS-LNLYMFHGGTNFG 285
>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
Length = 1630
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 169/357 (47%), Gaps = 42/357 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP- 87
++ DGRSL++NG R +L SGSIHYPRSTP MWP+L A+A+ GL+ +++ FWN H
Sbjct: 1037 SIARDGRSLLVNGSRVLLLSGSIHYPRSTPAMWPKLFAEARANGLNAIESYAFWNKHSAT 1096
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP------------FWL 135
+ G +D+ D+ F+ L+V R GP++ EW GG+P W+
Sbjct: 1097 RYGAYDYGFNGDVDLFLSLAAEHDLFVLWRFGPYVCAEWPAGGIPARAPRRAVFASNAWI 1156
Query: 136 HDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFL 195
HDVPG+ R++N + R+ + + + S+ G ++IENEYG +
Sbjct: 1157 HDVPGMKTRTNNTAWLNETGRW---MRDHFAVIEPHLSRNG--ASNRIENEYGGSKSDAA 1211
Query: 196 EKGPPYVRWAAKLAVDLQTGVPWVMCKQDD--APDPVI--NAC---NGRQCGETFAGPNS 248
A AV + + W+MC APD + N C G P
Sbjct: 1212 AVAYVDALDALADAVAPE--LVWMMCGFVSLVAPDALHTGNGCPHDQGPASAHVVVPPAP 1269
Query: 249 PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRT 308
PA +TE+ +Y +G + R D+AY VA ++A G+ N+YM+HGG ++G
Sbjct: 1270 GADPAWYTED-ELWYDAWGLPSLARPPADVAYGVASYVA-TGGAMHNFYMWHGGNHYGNW 1327
Query: 309 ASAYVLTG-------------YYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPML 352
++A G Y + APL G +P + HL +H + + +L
Sbjct: 1328 STATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLFSHLAAVHGTLDAYAEVLL 1384
>gi|384939972|gb|AFI33591.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
gi|387541294|gb|AFJ71274.1| beta-galactosidase-1-like protein 3 [Macaca mulatta]
Length = 653
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 166/332 (50%), Gaps = 29/332 (8%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGR-SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGG 72
LT + + G G T G+ + GHR ++ GSIHY R + W + K + G
Sbjct: 56 LTPLELKNRSVGLGTASTGRGKPHFTLEGHRFLICGGSIHYFRVPREYWRDRLLKLRACG 115
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
+ V T V WNLHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP
Sbjct: 116 FNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLP 175
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WL P ++ R+ N+ F +++Y ++ + L QGGP+I Q+ENEYG
Sbjct: 176 SWLLQDPRLLLRTTNKGFTEAVEKYFDHLIP--RVIPLQYRQGGPVIAVQVENEYG---- 229
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDP-------VINACNGRQCGE-TFA 244
SF K Y+ + K L+ G+ ++ D + V+ A N ++ TF
Sbjct: 230 SF-NKDKTYMPYLHKAL--LRRGIVELLLTSDGEKNVLSGHTKGVLAAINLQKVQRNTFN 286
Query: 245 GPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
+ DKP + E W ++ +GD+ ++ A+++ + V+ FI K + S+ N YM+HGG
Sbjct: 287 QLHKVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGG 344
Query: 303 TNFGRTASAY-------VLTGYYDQAPLDEYG 327
TNFG A ++T Y A L E G
Sbjct: 345 TNFGFMNGATNFGKHTGIVTSYDYDAVLTEAG 376
>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
3_8_47FAA]
Length = 782
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 98/327 (29%), Positives = 162/327 (49%), Gaps = 31/327 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG ++ + IHYPR + W I K G++ + VFWN HEP+ G++DF
Sbjct: 33 KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G++D+ F + Q G+YV +R GP++ EW GGLP+WL I R + ++M
Sbjct: 93 TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQD---PYYM 149
Query: 155 KRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVD 211
+R + + K L ++GG II+ Q+ENEYG ++ ++ + V+ A
Sbjct: 150 ERVKLFMNEVGKQLTDLQINKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGF---- 205
Query: 212 LQTGVPWVMCK-----QDDAPDPV---INACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
TGVP C +++A D + IN G + F PD P + +E W+
Sbjct: 206 --TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSG 263
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------VLT 315
++ +G + RSAED+ + + + + YM HGGT+FG A T
Sbjct: 264 WFDHWGAKHETRSAEDLVKGMKEMLD--RNISFSLYMTHGGTSFGHWGGANFPNFSPTCT 321
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHS 342
Y AP++E G + PK+ ++ L S
Sbjct: 322 SYDYDAPINESGKV-TPKYFEVRNLLS 347
>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
Length = 592
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 171/661 (25%), Positives = 254/661 (38%), Gaps = 133/661 (20%)
Query: 42 HRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLV 101
HR + SG+IHY R P +W + + GL+ V+T V WN HE G+ DF+G RDL
Sbjct: 24 HR--VLSGAIHYFRIHPDLWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLA 81
Query: 102 RFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMI 161
RFI GL V +R GP+I EW +GGLP WL PGI R+ + F + + +
Sbjct: 82 RFISLAGDLGLDVIVRPGPYICAEWDFGGLPAWLMTEPGIALRTSDPAFLAAVDDWFDAV 141
Query: 162 VNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWV 219
V +++ L + GGP++ Q+ENEYG + ++LE + R L G+ V
Sbjct: 142 VPVIRP--LLTTAGGPVVAVQVENEYGSYGDDAAYLE----HCRKGL-----LDRGID-V 189
Query: 220 MCKQDDAPDP----------VINACN-GRQCGETFAGPN--SPDKPAIWTENWTSFYQVY 266
+ D P P V+ N G + E FA P P + E W ++ +
Sbjct: 190 LLFTSDGPGPDWLDNGTIPGVLATVNFGSRTDEAFAELRKVQPAGPDMVMEYWNGWFDHW 249
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--------LTGYY 318
G+ +R +D A L G VN+YM HGGTNFG + A V +T Y
Sbjct: 250 GEPHHVRDVDDAAG--VLDDVLRAGGSVNFYMAHGGTNFGLWSGANVEDGKLQPTVTSYD 307
Query: 319 DQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAA 378
A + E G L PK+ +E+ S + P
Sbjct: 308 YDAAVGEAGEL-TPKFHAFREVISRYAVTALP---------------------------- 338
Query: 379 FLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEE-YKEAIPTY 437
ELPPL + P V A LD+++ ++E +P
Sbjct: 339 --------------------ELPPLPARLAPQTAEVDGWVALLDTMDLFDEPVSGPVPQS 378
Query: 438 DETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVG 497
E D+ ++R L++ L +G +G
Sbjct: 379 MEAL---------------GQDHGLVHYRGNALVPTDGRTLELDGLADRATVLADGVLLG 423
Query: 498 SAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKEL 557
+D S +L G ++ G + GA + R GLR V I + +
Sbjct: 424 RV--DRNDVSQSLPLTPRPDGGRTTFDVIVENQGRINFGAAIGER-KGLRGVRI-AHRNV 479
Query: 558 KDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR---YGSSTHQPLTWYKTVFDAPTGSD 614
+ S + + L L D+G V R + +T + DAP
Sbjct: 480 HGWESSA----IRLDDPALTSRLDFGDAAVDAQRGPVFARATFE--------IDAPADG- 526
Query: 615 PVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
+ L GKG W+NG +GRYW G Q + P + N +V+LE E
Sbjct: 527 --FLALPGWGKGFLWLNGTLLGRYW-------GIGPQVTLYAPAPLWRTGSNDIVILEME 577
Query: 675 N 675
Sbjct: 578 Q 578
>gi|355567243|gb|EHH23622.1| hypothetical protein EGK_07120 [Macaca mulatta]
Length = 653
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 166/332 (50%), Gaps = 29/332 (8%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGR-SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGG 72
LT + + G G T G+ + GHR ++ GSIHY R + W + K + G
Sbjct: 56 LTPLELKNRSVGLGTASTGRGKPHFTLEGHRFLICGGSIHYFRVPREYWRDRLLKLRACG 115
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
+ V T V WNLHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP
Sbjct: 116 FNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLP 175
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WL P ++ R+ N+ F +++Y ++ + L QGGP+I Q+ENEYG
Sbjct: 176 SWLLQDPRLLLRTTNKGFTEAVEKYFDHLIP--RVIPLQYRQGGPVIAVQVENEYG---- 229
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDP-------VINACNGRQCGE-TFA 244
SF K Y+ + K L+ G+ ++ D + V+ A N ++ TF
Sbjct: 230 SF-NKDKTYMPYLHKAL--LRRGIVELLLTSDGEKNVLSGHTKGVLAAINLQKVQRNTFN 286
Query: 245 GPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
+ DKP + E W ++ +GD+ ++ A+++ + V+ FI K + S+ N YM+HGG
Sbjct: 287 QLHKVQRDKPLLVMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGG 344
Query: 303 TNFGRTASAY-------VLTGYYDQAPLDEYG 327
TNFG A ++T Y A L E G
Sbjct: 345 TNFGFMNGATNFGKHTGIVTSYDYDAVLTEAG 376
>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
Length = 598
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 104/346 (30%), Positives = 158/346 (45%), Gaps = 42/346 (12%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G+ +++G + SG++HY R P+ W + K G + V+T V WN+HEP+ G F+
Sbjct: 7 GKDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGIFN 66
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F G DLV++++ Q GL V LR P+I EW +GGLP WL I RS+ F
Sbjct: 67 FEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLNK 126
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
++ + +++ M+ L GGPII+ Q+ENEYG + YVR KL DL
Sbjct: 127 VENFYKVLLPMVTP--LQVENGGPIIMMQVENEYGSFGND-----KEYVRNIKKLMRDLG 179
Query: 214 TGVP-------WVMCKQDDA---PDPVINACNGRQCG------ETFAGPNSPDKPAIWTE 257
VP W + + D ++ G + E+F N + P + E
Sbjct: 180 VTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--------RTA 309
W ++ +G E R ++A V +K + +N+YM+ GGTNFG
Sbjct: 240 FWDGWFNRWGMEIIRRDGSELAEEVKEL---LKRASINFYMFQGGTNFGFMNGCSSRENV 296
Query: 310 SAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGV 355
+T Y A L E WG + AV+ +K + S V
Sbjct: 297 DLPQITSYDYDALLTE--------WGEPTSKYYAVQRAIKEVCSDV 334
>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
Length = 867
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 161/332 (48%), Gaps = 17/332 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+TYD +S I+ R + S +IHY R W ++ KAK GG + ++T + WN HE
Sbjct: 2 ITYDKKSWKIHNERVFILSAAIHYFRLPRAEWNEVLDKAKAGGCNTIETYIPWNFHEMNE 61
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G++DFSG +DL F + + LYV R GP+I EW +GG P+WL I +RS
Sbjct: 62 GEWDFSGDKDLAHFFQLCADKELYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPA 121
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F ++ +Y ++ ++ +L ++ G +I+ Q+ENE+ ++ + PY+ +
Sbjct: 122 FLHYVDQYFDRVIPIIDEYQL--TKNGTVIMVQVENEF----QAYGKPDKPYMEYIRDGM 175
Query: 210 VDLQTGVPWVMC-KQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVY-G 267
VP V C + N + + PD+P E W +++ + G
Sbjct: 176 KARGIDVPLVTCYGAVEGAVEFRNFWSHSKHAAAILDERFPDQPKGVMEFWIGWFEQWGG 235
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF----GRTASAYVL--TGYYDQA 321
++A ++ E + ++ + +NYYMY GGTNF GRT L T Y
Sbjct: 236 NKADQKTPEQLERECYQLLSN-GFTAINYYMYFGGTNFDHWGGRTVGEQTLCTTTYDYDV 294
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPMLS 353
+DEY L K+ LK HS VK L+P+ +
Sbjct: 295 AIDEY-LQPTRKYEVLKRYHSFVKW-LEPLFT 324
Score = 46.2 bits (108), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 28/163 (17%)
Query: 527 SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
S + G+ D L ++ + ++ +Q ++ F + L +K QIF D+ ++
Sbjct: 695 SAVYGVADISGAL-KQGENVLDLDVQNISSIRRFDLY-------LFHDKEQIF-DWKTKS 745
Query: 587 VP-------WSRYGSSTHQPL--TWYKTVFD-APTGSDPVAINLISMGKGEAWVNGQSIG 636
W Q + WYK+ F P V + L + KG WVNG+ +G
Sbjct: 746 FAELHEEKDWKTANCGDQQTIYPRWYKSHFTWNPDNGSIVKVRLNHLSKGCFWVNGECLG 805
Query: 637 RYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPP 679
RYW + PQ Y IP S LK +++ EE GY P
Sbjct: 806 RYWN--IGPQED-----YKIPVSLLKDQNEIVIFDEE--GYAP 839
>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
Length = 585
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/269 (35%), Positives = 137/269 (50%), Gaps = 13/269 (4%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K + G + V+T V WNLHE Q G + F G DL RFI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQ 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
Q GLYV LR P+I EW +GGLP+WL P + R D PF + RY + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
+ ++ +QGGPII+ Q+ENEYG + +L K +R + + PW +
Sbjct: 139 RDLQI--TQGGPIIMMQVENEYGSYANDKEYLRKMVAAMRQHGVETPLVTSDGPWHDMLE 196
Query: 224 D----DAPDPVIN-ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA-RIRSAED 277
+ D P IN N ++ E + +P + E W ++ +GD+ S +D
Sbjct: 197 NGSIKDLALPTINCGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAWGDDQHHTTSTQD 256
Query: 278 IAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ +A GS VN YM+HGGTNFG
Sbjct: 257 AVKELQDCLA--LGS-VNIYMFHGGTNFG 282
>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
Length = 585
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 95/269 (35%), Positives = 137/269 (50%), Gaps = 13/269 (4%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K + G + V+T V WNLHE Q G + F G DL RFI+
Sbjct: 19 VISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGILDLRRFIQ 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
Q GLYV LR P+I EW +GGLP+WL P + R D PF + RY + +
Sbjct: 79 TAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYFAHLFPQV 138
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
+ ++ +QGGPII+ Q+ENEYG + +L K +R + + PW +
Sbjct: 139 RDLQI--TQGGPIIMMQVENEYGSYANDKEYLRKMVAAMRQHGVETPLVTSDGPWHDMLE 196
Query: 224 D----DAPDPVIN-ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEA-RIRSAED 277
+ D P IN N ++ E + +P + E W ++ +GD+ S +D
Sbjct: 197 NGSIKDLALPTINCGSNIKENFEKLRKFHGEKRPLMVMEFWIGWFDAWGDDQHHTTSIQD 256
Query: 278 IAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ +A GS VN YM+HGGTNFG
Sbjct: 257 AVKELQDCLA--LGS-VNIYMFHGGTNFG 282
>gi|297194972|ref|ZP_06912370.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|297152570|gb|EFH31854.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 599
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/328 (32%), Positives = 156/328 (47%), Gaps = 33/328 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+ T +++G L SG++HY R W +A + GL+ V+T V WNLHEP+
Sbjct: 10 DFTVGDTDFLLDGRPVRLLSGALHYFRVHEGQWGHRLAMLRAMGLNCVETYVPWNLHEPE 69
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ G L RF+ V A G++ +R GP+I EW GGLPFWL G R+++
Sbjct: 70 PGRYADDG--ALGRFLDAVHAAGMWAIVRPGPYICAEWENGGLPFWLTGRVGRRVRTEDP 127
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
+ H++R+ T ++ + + ++GGP+++ Q+ENEYG S+ G Y+R +L
Sbjct: 128 EYLGHVERWFTRLLPQVVEREI--TRGGPVVMVQVENEYG----SYGSDG-GYLRQLVEL 180
Query: 209 AVDLQTGVPWV--------MCKQDDAPDPVINACNGRQCGETFAG--PNSPDKPAIWTEN 258
GVP M P + G GE FA + P P + E
Sbjct: 181 LRSCGVGVPLFTSDGPEDHMLSGGSVPGVLATVNFGSGAGEAFAALRRHRPTGPLMCMEF 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------ 312
W +++ +G E R AED A AL G+ VN YM HGGT+FG A A
Sbjct: 241 WCGWFEHWGAEPARRDAEDAAR--ALREILEAGASVNVYMAHGGTSFGGWAGANRSGELH 298
Query: 313 ------VLTGYYDQAPLDEYGLLRQPKW 334
+T Y AP+DE G + W
Sbjct: 299 DGVLEPTVTSYDYDAPVDEAGRPTEKFW 326
>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
Length = 571
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 156/326 (47%), Gaps = 23/326 (7%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG +T DG + ++G + SG+IHY R Q W + + GL+ + + WN
Sbjct: 2 GGEKVGLTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWN 61
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
LHE + G FDF G DLV F GL V R GP+I EW +GGLP WL P +
Sbjct: 62 LHEKERGNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHI 121
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
RS+ ++ + Y + ++ ++ A L S GGPII Q+ENEYG +++K ++
Sbjct: 122 RSNYCGYQAAVSSYFSKLLPLL--APLQHSNGGPIIAFQVENEYG----DYVDKDNEHLP 175
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
W A L +++ + + D + A + T S P+KP + TE W
Sbjct: 176 WLADL---MKSHGLFELFFISDGGHTIRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAG 232
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVL-TGYYD- 319
++ +G R D+ I K +G+ VN+YM+HGGTNFG A L GYY
Sbjct: 233 WFDYWG-HGRNLLNNDVFEKTLKEILK-RGASVNFYMFHGGTNFGFMNGAIELEKGYYTA 290
Query: 320 -------QAPLDEYGLLRQPKWGHLK 338
P+DE G R KW +K
Sbjct: 291 DVTSYDYDCPVDESG-NRTEKWEIIK 315
Score = 40.8 bits (94), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 50/108 (46%), Gaps = 9/108 (8%)
Query: 584 SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
S I W+ Y + +KT I + KG +VNG+++GRYWV+
Sbjct: 472 SSITAWTNYLQTAAVLPALFKTTVKILDYPKDTFILMHGWSKGVIFVNGRNLGRYWVT-K 530
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTL 691
PQ T ++P S+L N ++ LEEE G+SI+ VS L
Sbjct: 531 GPQKT-----LYLPASWLIKGENEIIWLEEEQ---LGMSIELVSSPDL 570
>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
4CC1]
Length = 591
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 143/289 (49%), Gaps = 26/289 (8%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G+ +++G + SG++HY R P+ W + K G + V+T V WN+HEP+ G F+
Sbjct: 7 GKDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGVFN 66
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F G DLV++++ Q GL V LR P+I EW +GGLP WL I RS+ F
Sbjct: 67 FEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYRDIRVRSNTNLFLNK 126
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
++ + +++ ++ + L GGPII+ Q+ENEYG + YVR KL DL
Sbjct: 127 VENFYKVLLPLVTS--LQVENGGPIIMMQVENEYGSFGND-----KEYVRSIKKLMRDLG 179
Query: 214 TGVP-------WVMCKQDDA---PDPVINACNGRQCG------ETFAGPNSPDKPAIWTE 257
VP W + + D ++ G + E+F N + P + E
Sbjct: 180 VTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNALESFIKENKKEWPLMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G E R + ++A V +K + +N+YM+ GGTNFG
Sbjct: 240 FWDGWFNRWGMEIIRRDSSELAEEVKEL---LKRASINFYMFQGGTNFG 285
>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
Length = 385
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 161/342 (47%), Gaps = 32/342 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ YD + +GH SGSIHY R W + K K GL+ +QT V WN HEPQ
Sbjct: 27 IDYDCNCFVKDGHPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLNAIQTYVPWNYHEPQM 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G +DFSG RDL F++ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 87 GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 146
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ ++++ +++ MK LY GGPII+ Q+ENEYG S+ Y+R K+
Sbjct: 147 YLTAVEKWMGVLLPKMK-PHLY-HNGGPIIMVQVENEYG----SYFACDYDYLRSLLKI- 199
Query: 210 VDLQTGVPWVMCKQDDAPD------------PVINACNGRQCGETFAGPNS--PDKPAIW 255
G V+ D A ++ G F S P P +
Sbjct: 200 FRQHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPTGPLVN 259
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G + +E IA + +A +G+ VN YM+ GGTNF A +
Sbjct: 260 SEFYTGWLDHWGHRHIVVPSETIAKTLNEILA--RGANVNLYMFIGGTNFAYWNGANMPY 317
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKL---CLK 349
T Y APL E G L + K+ L+E+ V + CL+
Sbjct: 318 MSQPTSYDYDAPLSEAGDLTE-KYFALREVIGMVSIPSTCLE 358
>gi|167856235|ref|ZP_02478970.1| beta-galactosidase [Haemophilus parasuis 29755]
gi|167852655|gb|EDS23934.1| beta-galactosidase [Haemophilus parasuis 29755]
Length = 596
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 158/319 (49%), Gaps = 38/319 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ ++NG + SG++HY R P+ W + + K G + V+T V WNLH+PQP QF+F
Sbjct: 8 KDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFNF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
S R DLV+F++ + GLYV LR P+I EW +GGLP WL ++P I R ++ F +
Sbjct: 68 SKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAEI 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
RY + + + A +QGG I++ QIENEYG + Y+R A LA+ L
Sbjct: 128 DRYFQEL--LPRIAPYQITQGGNILMMQIENEYGSFGND-----KNYLR--AILALMLIH 178
Query: 215 GVPWVMCKQDDA-----------PDPVINACN-GRQCGET------FAGPNSPDKPAIWT 256
GV + D A D ++ N G + E + + P +
Sbjct: 179 GVNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDKHGKSYPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--RTASAYV- 313
E W ++ + + R A+D+A + + + +N+YM+ GGTNFG SA +
Sbjct: 239 EFWDGWFNRWKEPVIRRDAQDLADCTKELLER---ASINFYMFQGGTNFGFWNGCSARLD 295
Query: 314 -----LTGYYDQAPLDEYG 327
+T Y AP+ E+G
Sbjct: 296 TDLPQVTSYDYDAPVHEWG 314
>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 632
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 158/337 (46%), Gaps = 44/337 (13%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
YDG+++ I SG +HY R Q W + K GL+ V T VFWNLHEP+PG
Sbjct: 35 VYDGKAIRI-------ISGEMHYARIPHQYWRHRMKMLKAMGLNAVATYVFWNLHEPEPG 87
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
++DFSG R+L +I+ +GL V LR GP++ EW +GG P+WL +V G+ R DNE F
Sbjct: 88 KWDFSGDRNLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGMELRRDNEQF 147
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAA 206
+ K Y + + +L +QGGPII+ Q ENE+G + LE+ Y
Sbjct: 148 LKYTKLYLERLYK--EVGKLQITQGGPIIMVQGENEFGSYVSQRKDITLEEHRAYNAKII 205
Query: 207 KLAVDLQTGVPWVMCKQDDA----------PDPVINACNG----RQCGETFAGPNSPDKP 252
K ++ VP M D + P N N ++ + G P
Sbjct: 206 KQLKEVGFDVP--MFTSDGSWLFEGGYVPGALPTANGENNIENLKKVVNQYNGGQGPYMV 263
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY 312
A + W + + + + A IA ++A G NYYM HGGTNFG T+ A
Sbjct: 264 AEFYPGWLAHWCEPHPQVK---ASTIARQTEKYLA--NGVSFNYYMVHGGTNFGFTSGAN 318
Query: 313 V---------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 319 YDKKHDIQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 354
>gi|398787680|ref|ZP_10550020.1| beta-galactosidase [Streptomyces auratus AGR0001]
gi|396992782|gb|EJJ03876.1| beta-galactosidase [Streptomyces auratus AGR0001]
Length = 603
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 103/344 (29%), Positives = 161/344 (46%), Gaps = 21/344 (6%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
+G + G T + ++G GG +T G+ +++G + SG+ HY R+ PQ
Sbjct: 2 VGAASVGVTVGNSRTVLAQAEGPGG----LTIRGKEFLLDGKPFRILSGAFHYFRTHPQD 57
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
W + + + GL+ V+T V WN H+P + DF+G RD+V F++ GL V +R GP
Sbjct: 58 WRDRLMRMRAMGLNTVETYVAWNFHQPDEKEADFTGWRDVVAFVRTADEVGLKVIVRPGP 117
Query: 121 FIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIIL 180
+I EW +GGLP WL R + F+ + + + + + L A++GGPII
Sbjct: 118 YICAEWDFGGLPAWLLKDKDAPLRRSDPAFERAVDAWFAEL--LPRFVDLQATRGGPIIA 175
Query: 181 SQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDL-QTGVPWVMCKQDDAPDPVINACNGR 237
Q+ENEYG +H++LE +R + G K PD + G
Sbjct: 176 MQVENEYGSYGDDHAYLEHLRDTMRAQGIDGLLFCSNGATQEALKAGSLPDLLSTVNFGG 235
Query: 238 QCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVN 295
FA + PDKP TE W ++ +G+ R A V + G+ +N
Sbjct: 236 DPTGPFAELRAFQPDKPLFCTEFWDGWFDHWGERHRTTDPAQTAADVEKMLE--AGASIN 293
Query: 296 YYMYHGGTNFGRTASAYV--------LTGYYDQAPLDEYGLLRQ 331
+YM GGTNFG +A A + +T Y +P+ E G L +
Sbjct: 294 FYMAVGGTNFGWSAGANLSGSGYQPTVTSYDYDSPISESGELTE 337
>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
22836]
Length = 630
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 108/340 (31%), Positives = 161/340 (47%), Gaps = 46/340 (13%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+ YDG+ + I SG +HYPR Q W + K GL+ V T VFWN+HEP+
Sbjct: 34 DFVYDGKPVRI-------ISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNIHEPE 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++DF+G ++L +IK +GL V LR GP++ EW +GG P+WL +V G+ R DNE
Sbjct: 87 PGKWDFTGDKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGLELRRDNE 146
Query: 149 PFKFHMKRYATMIVNMM--KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---YVR 203
F +Y + +N + + L ++GGPI++ Q ENE+G + K P + R
Sbjct: 147 QF----LKYTQLYINRLYKEVGNLQITKGGPIVMVQAENEFG--SYVSQRKDIPLEEHRR 200
Query: 204 WAAKLAVDLQTG---VP-------WVMCKQDDAPDPVINACNGRQCGETFAGP----NSP 249
+ AK+ L+ VP W+ + A + NG E N
Sbjct: 201 YNAKIVQQLKDAGFDVPSFTSDGSWLF--EGGAVPGALPTANGESNIENLKKAVDKYNGG 258
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
P + E + + + + SA IA ++ +NYYM HGGTNFG T+
Sbjct: 259 QGPYMVAEFYPGWLAHWLEPHPQISATSIARQTEKYL--QNNVSINYYMVHGGTNFGFTS 316
Query: 310 SAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
A LT Y AP+ E G + PK+ L+ +
Sbjct: 317 GANYDKKHDIQPDLTSYDYDAPISEAGWVT-PKYDSLRNV 355
>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
Length = 778
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/352 (29%), Positives = 157/352 (44%), Gaps = 39/352 (11%)
Query: 10 FGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAK 69
LL TT+ G T ++ ++NG ++ + +HYPR W I K
Sbjct: 1 MALLATTMLTPASTAQKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCK 60
Query: 70 EGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYG 129
G++ V VFWN+HE Q G+FDF+G D+ F + Q GLYV +R GP++ EW G
Sbjct: 61 ALGMNTVCLYVFWNIHEQQEGKFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMG 120
Query: 130 GLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG- 188
GLP+WL I R + F +K + + + A L GGPII+ Q+ENEYG
Sbjct: 121 GLPWWLLKKKDIRLREPDPYFMERVKLFERKVGEQL--ASLTIQNGGPIIMVQVENEYGS 178
Query: 189 -------------MVEHSFLEKGPPY-VRWAAKLAVDLQTGVPWVMCKQDDAPDPVINAC 234
+V S +K + WA+ + + W M N
Sbjct: 179 YGKNKAYVSAIRDIVRRSGFDKVTLFQCDWASNFEKNGLDDLVWTM-----------NFG 227
Query: 235 NGRQCGETFA--GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGS 292
G + F G P+ P + +E W+ ++ +G R A+ + + ++ KG
Sbjct: 228 TGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLS--KGI 285
Query: 293 YVNYYMYHGGTNFGRTASAYV------LTGYYDQAPLDEYGLLRQPKWGHLK 338
+ YM HGGT+FG A A +T Y AP++EYG PK+ L+
Sbjct: 286 SFSLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKYWELR 336
>gi|297194215|ref|ZP_06911613.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
gi|197722531|gb|EDY66439.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
Length = 590
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 26/325 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
V+ +G SL +G L SG++HY R P+ WP + + GLD V+T V WNLHEP+
Sbjct: 3 RVSTEGFSL--DGRPLRLLSGALHYFRVLPEQWPHRLRMLRAMGLDTVETYVPWNLHEPR 60
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGI-VFRSDN 147
PG++DF G DL RF+ + GL+ +R P+I EW GGLP+WL P + R +
Sbjct: 61 PGEYDFDGIADLDRFLHATREAGLHAIVRPSPYICAEWENGGLPWWLLADPEVGALRCQD 120
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWA 205
+ H+ R+ ++ ++ A ++ S+GG +++ Q+ENEYG + +LE +R A
Sbjct: 121 PAYLAHVDRWFDRLIPVVAAHQV--SRGGNVLMVQVENEYGSYGTDTGYLEHLAAGLR-A 177
Query: 206 AKLAVDLQT--GVPWVMCKQDDAPDPVINACNGRQCGETFA--GPNSPDKPAIWTENWTS 261
+ V L T G P + G + E A PD PA+ E W
Sbjct: 178 RGIDVPLFTSDGPDDFFLTGGALPGHLATVNFGSRPKEALADLARLRPDDPAMCMEFWCG 237
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY--------- 312
++ +G + +R D A + +A G+ VN YM HGGTNF A A
Sbjct: 238 WFDHWGTDHVVRDPADAAGVLEELLA--AGASVNVYMAHGGTNFSTWAGANTEDPAAGTG 295
Query: 313 ---VLTGYYDQAPLDEYGLLRQPKW 334
+T Y AP+DE G + W
Sbjct: 296 YRPTVTSYDYDAPVDERGAATEKFW 320
>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
Length = 579
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 149/311 (47%), Gaps = 30/311 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + +G++HY R P +W I KA+ GL+ ++T WNLHEP G +DF+
Sbjct: 10 DFLLDGRPHRVLAGALHYFRVHPDLWADRIEKARLMGLNTIETYTPWNLHEPVEGAYDFT 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RF++ V G++ +R GP+I EW GGLP WL+ P + R + +
Sbjct: 70 GMLDLERFLRLVADAGMHAIVRPGPYICAEWDNGGLPAWLYRDPEVGVRRSEPRYLGAVS 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y + +++ L +GGP++L QIENEYG Y+R L +
Sbjct: 130 AYLRRVYDVVTP--LQIDRGGPVVLVQIENEYGAYGSDKF-----YLRHLVDLTRECGIT 182
Query: 216 VPWVMCKQDDAPDPVINACN----------GRQCGETFAG--PNSPDKPAIWTENWTSFY 263
VP + D D +++ + G + E A + P P + +E W ++
Sbjct: 183 VP--LTTVDQPTDEMLSQGSLDCLHRTGSFGSRATERLATLRRHQPTGPLMCSEFWNGWF 240
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTG 316
+GD SAED A + +A VN YM+HGGTNFG T+ A +T
Sbjct: 241 DHWGDRHHTTSAEDSAAELDALLAAGAS--VNIYMFHGGTNFGLTSGANDKGVYQPTITS 298
Query: 317 YYDQAPLDEYG 327
Y APLDE G
Sbjct: 299 YDYDAPLDEAG 309
>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
4AJ1]
Length = 591
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 91/289 (31%), Positives = 141/289 (48%), Gaps = 26/289 (8%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G+ +++G + SG++HY R P+ W + K G + V+T V WN+HEP+ G F+
Sbjct: 7 GKDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNIHEPKEGVFN 66
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F G DLV++++ Q GL V LR P+I EW +GGLP WL I RS+ F
Sbjct: 67 FEGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLDK 126
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
++ + +++ M+ L GGPII+ Q+ENEYG + YVR K+ DL
Sbjct: 127 VENFYKVLLPMVTP--LQVENGGPIIMMQVENEYGSFGND-----KEYVRSIKKIMRDLD 179
Query: 214 TGVP-------WVMCKQDDA---PDPVINACNGRQCG------ETFAGPNSPDKPAIWTE 257
VP W + + D ++ G + E+F N + P + E
Sbjct: 180 VTVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G E R ++A V +K + +N+YM+ GGTNFG
Sbjct: 240 FWDGWFNRWGMEIIRRDGSELAEEVKEL---LKRASINFYMFQGGTNFG 285
>gi|150008152|ref|YP_001302895.1| beta-glycosidase [Parabacteroides distasonis ATCC 8503]
gi|149936576|gb|ABR43273.1| glycoside hydrolase family 35, candidate beta-glycosidase
[Parabacteroides distasonis ATCC 8503]
Length = 768
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 166/671 (24%), Positives = 268/671 (39%), Gaps = 133/671 (19%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + SG +HYPR Q W + + GL+ V T VFWNLHE +PG++DF G +
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
+L +I+ +GL V LR GP++ EW +GG P+WL ++PG+ R DN F K Y
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---YVRWAAKLA---VDL 212
+ + L S+GGPII+ Q ENE+G + K P + R+ AK+ D
Sbjct: 159 DKLYE--QVGDLQVSKGGPIIMVQAENEFG--SYVAQRKDIPLEEHRRYNAKIKRQLADA 214
Query: 213 QTGVPWV------MCKQDDAPDPV------INACNGRQCGETFAGPNSPDKPAI----WT 256
VP + + P + N N ++ + G P A W
Sbjct: 215 GFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
+W + D R E + F N+YM HGGTNFG T+ A
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKK 325
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
LT Y AP+ E G + PK+ ++ +
Sbjct: 326 HDIQPDLTSYDYDAPISEAGWV-TPKFDSIR------------------------NVIRK 360
Query: 368 FIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQW 427
++ E A + + E+P +S++ + D +A
Sbjct: 361 YVTYDVPEAPAPIP---------------LIEIPSISLTKVADVLALA------------ 393
Query: 428 EEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVL 487
KE P T L EQ+N Y+ Y+ F + L++ L
Sbjct: 394 ---KEGEPVASPTPL----TFEQLN---QGYGYVLYSTHFNQ---PLKGRLEIPGLRDYA 440
Query: 488 HAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-L 546
+++GE VG ++ F M I + +L +G + G + R G +
Sbjct: 441 TIYVDGERVGEL-----NRCFNQYAMEIDIPFNATLDILVENMGRINYGEEIVRNTKGII 495
Query: 547 RNVSIQGAKELKDFSSFS--WGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
+V I G+ E+ D+ + L+ + ++ + + + ++P+ +
Sbjct: 496 SSVKINGS-EISDWKMYKLPMDRMPALVSGEPYVYKNGSPEV------AALGNKPVLYEG 548
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPT 664
T + TG I++ GKG ++NG +IGRYW + P Q+ Y IP +L
Sbjct: 549 TFHLSDTGD--TFIDMEDWGKGIIFINGVNIGRYWYA------GPQQTLY-IPGVWLNKG 599
Query: 665 GNLLVLLEEEN 675
N +V+ E+ N
Sbjct: 600 ENKIVIYEQLN 610
>gi|390336578|ref|XP_792349.2| PREDICTED: beta-galactosidase-like [Strongylocentrotus purpuratus]
Length = 671
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 101/301 (33%), Positives = 145/301 (48%), Gaps = 43/301 (14%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ YD + + +G SGS HY R W + K K GL+ VQT V WN HE +P
Sbjct: 31 IDYDSNTFLKDGQPFRYVSGSFHYSRVPAFYWQDRLDKMKMAGLNAVQTYVIWNFHELKP 90
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+F G D++ F+K+ GL V LR GP+I GEW GGLP WL ++PGIV RS N+
Sbjct: 91 GEFNFDGDHDILSFLKKANDTGLAVILRPGPYICGEWDLGGLPAWLLNIPGIVLRSSNDL 150
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAA 206
+ H+ + + ++ LY + GGPII+ Q+ENEYG +H + + Y + A
Sbjct: 151 YMAHVTEWMNFFLPKLR-PYLYVN-GGPIIMVQVENEYGSYQTCDHQYQRQ--LYHLFRA 206
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETF---AGPNS-----------PDKP 252
L D+ V+ D D ++ + T AG NS P P
Sbjct: 207 NLGPDV------VLFTTDGPGDHLLQCGTLQDMYATIDFGAGSNSTGMFQEMRKFEPKGP 260
Query: 253 AI-------WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ W ++W +Q A S + + +AL G+ VN YM+ GGTNF
Sbjct: 261 LVNSEYYTGWLDHWEHPHQTVKTAAVCTSLDQM---LAL------GANVNMYMFEGGTNF 311
Query: 306 G 306
G
Sbjct: 312 G 312
>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
Length = 787
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 158/357 (44%), Gaps = 39/357 (10%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+ LL+T + G T ++ ++NG ++ + +HYPR W
Sbjct: 5 HFIATVALLVTAMLPPVSAARKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHR 64
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I K G++ V VFWN+HE Q G+FDF+G D+ F + Q GLYV +R GP++
Sbjct: 65 IKMCKALGMNTVCLYVFWNIHEQQEGRFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCA 124
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW GGLP+WL I R + F +K + + + A L GGPII+ Q+E
Sbjct: 125 EWEMGGLPWWLLKKKDIRLREPDPYFMERVKLFERKVGEQL--ASLTIQNGGPIIMVQVE 182
Query: 185 NEYG--------------MVEHSFLEKGPPY-VRWAAKLAVDLQTGVPWVMCKQDDAPDP 229
NEYG +V S +K + WA+ + + W M
Sbjct: 183 NEYGSYGENKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTM--------- 233
Query: 230 VINACNGRQCGETFA--GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIA 287
N G + F G P+ P + +E W+ ++ +G R A+ + + ++
Sbjct: 234 --NFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKAMVEGIDEMLS 291
Query: 288 KMKGSYVNYYMYHGGTNFGRTASAYV------LTGYYDQAPLDEYGLLRQPKWGHLK 338
KG + YM HGGT+FG A A +T Y AP++EYG PK+ L+
Sbjct: 292 --KGISFSLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKYWELR 345
>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
Length = 601
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 151/310 (48%), Gaps = 20/310 (6%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G ++N + SG++HY R P+ W + K K G + V+T V WN+HEP+ G+FD
Sbjct: 8 GSQFLLNDKPLRIISGALHYFRVVPEYWRDRLLKMKACGCNTVETYVAWNVHEPEEGKFD 67
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F G D++ F++ GL+V +R P+I EW +GGLP WL + R + KF
Sbjct: 68 FGGIADVIAFVELAGELGLHVIVRPSPYICAEWEFGGLPAWLLKDSEMQLRCSDP--KFL 125
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR---WAAKLAV 210
K A V + K L + GGPII Q+ENEYG + G Y+R A + V
Sbjct: 126 AKVDAYYDVLLPKFVPLLCTNGGPIIAMQVENEYGSYGNDKAYLG--YLRDGMIARGIDV 183
Query: 211 DLQT--GVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVY 266
L T G M + PD + G + E+FA PD+P + E W ++ +
Sbjct: 184 LLFTSDGPTDEMLQGGTLPDVLATVNFGSRPEESFAKFREYRPDEPLMCMEFWNGWFDHW 243
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
+E R ED A + + G+ VN+YM+HGGTNFG + A +T Y
Sbjct: 244 MEEHHTRDGEDAARVLDDMLG--AGASVNFYMFHGGTNFGFYSGANHIKTYEPTVTSYDY 301
Query: 320 QAPLDEYGLL 329
APL E G L
Sbjct: 302 DAPLTERGDL 311
>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
Length = 1104
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 152/659 (23%), Positives = 253/659 (38%), Gaps = 115/659 (17%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ + VFWN HEPQPG FDF+
Sbjct: 355 TFLLNGKPFVVKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHEPQPGVFDFT 414
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ DL F + + +YV LR GP++ EW GGLP+WL I R + F +
Sbjct: 415 GQNDLAEFCRLCRQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVG 474
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + + A + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 475 IFEKAVAE--QVADMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVTLFQ 532
Query: 204 --WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENW 259
WA+ + + W M N G + FA PD P + +E W
Sbjct: 533 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 581
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A D+ + ++ KG + YM HGGTN+G A A
Sbjct: 582 SGWFDKWGANHETRPAADMIAGIDEMLS--KGISFSLYMTHGGTNWGHWAGANSPGFAPD 639
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGS 373
+T Y AP+ E G W K L + + + ++ + Q
Sbjct: 640 VTSYDYDAPISESGQTTPKYWELRKTLSKYMDGEKQAKVPALIKPIRIPAFQ-------- 691
Query: 374 SECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEA 433
E+ PL LP K K ++ EEY +
Sbjct: 692 -----------------------FTEMAPL-FDNLPAAK-------KDRNIRTMEEYNQG 720
Query: 434 IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFING 493
S+ L +M T+ S+L V+ F+NG
Sbjct: 721 F-----GSILYRTTLPEMKTS---------------------SLLTVNDAHDYAQIFLNG 754
Query: 494 EFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQG 553
+++G ++ +K + +L +G + G ++ R+V +
Sbjct: 755 KYIGKLDRRNGEKQLAFPACPK----GARLDILVEAMGRINFGRAIKDFKGITRSVELTV 810
Query: 554 AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGS 613
+ F+ ++V L + + + R + + S P Y+ F S
Sbjct: 811 DIDGHPFTCDLKDWEVYNLEDTYDFYKNMKFRPIGSLKDESGQRIPGC-YRATFKVNKPS 869
Query: 614 DPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
D +N + GKG +VNG ++GR W + PQ T +IP +LK N +++ +
Sbjct: 870 DTF-LNFETWGKGLVYVNGHAMGRIWE--IGPQQT-----LYIPGCWLKKGENEVMVFD 920
>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
Length = 584
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 146/318 (45%), Gaps = 26/318 (8%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
R ++G + SG+IHY R P W I KA+ GL+ ++T V WN H P +F
Sbjct: 9 RDFTLDGEPFQIISGAIHYFRVHPDSWRDRIRKARLMGLNTIETYVAWNFHAPSRDEFHT 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G RDL RF+ +Q +GL +R GP+I EW GGLP WL P IV RS + + +
Sbjct: 69 DGARDLGRFLDIIQEEGLRAIVRPGPYICAEWDNGGLPTWLTATPDIVVRSSDPTYLTEV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+RY + +++ ++ + GGPIIL Q+ENEYG + Y+ + +L
Sbjct: 129 ERYLEHLAPIVEPRQI--NHGGPIILMQVENEYGAYGND-----RAYLTHLTNVYRNLGF 181
Query: 215 GVPWVMCKQ--DDA------PDPVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQ 264
VP Q DD PD G + E A + P + +E W ++
Sbjct: 182 VVPLTTVDQPMDDMLAHGTLPDLHTTGSFGSRIDERLATLREHQTTGPLMCSEFWIGWFD 241
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGY 317
+G D A + + G+ VN YM+HGGTNFG T A ++T Y
Sbjct: 242 HWGAHHHTTDVADAANALDRLLG--AGASVNIYMFHGGTNFGFTNGANDKGVYQPLVTSY 299
Query: 318 YDQAPLDEYGLLRQPKWG 335
APL E G + W
Sbjct: 300 DYDAPLAEDGYPTEKYWA 317
>gi|426371159|ref|XP_004052521.1| PREDICTED: beta-galactosidase-1-like protein 3 [Gorilla gorilla
gorilla]
Length = 653
Score = 146 bits (368), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 167/323 (51%), Gaps = 29/323 (8%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++F GSIH R + W + K K G + V T V WNLHEP+ G+FDFSG
Sbjct: 82 LEGHKFLIFGGSIHCFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 141
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL F+ GL+V LR GP+I E GGLP WL P ++ R+ N+ F +++Y
Sbjct: 142 DLEAFVLMGAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEKYF 201
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
++ + L QGGP+I Q+ENEYG SF +K Y+ + K L+ G+
Sbjct: 202 DHLIP--RVIPLQYRQGGPVIAVQVENEYG----SF-KKDKTYMLYLHKAL--LRRGIVE 252
Query: 219 VMCKQDDAP-------DPVINACNGRQC-GETFAGPN--SPDKPAIWTENWTSFYQVYGD 268
++ D V+ A N ++ +TF + DKP + E W ++ +GD
Sbjct: 253 LLLTSDGEKHVLSGHTKGVLAAINLQKLHQDTFNQLHKVQRDKPLLIMEYWVGWFDRWGD 312
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQA 321
+ ++ A+++ + V+ FI K + S+ N YM+HGGTNFG A ++T Y A
Sbjct: 313 KHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDA 370
Query: 322 PLDEYGLLRQPKWGHLKELHSAV 344
L E G + K+ L++L +V
Sbjct: 371 VLTEAGDYTE-KYLKLQKLFQSV 392
>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
Length = 587
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 141/309 (45%), Gaps = 26/309 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
S +NG + SG++HY R P W + KA+ GL+ V+T V WNLH+P+PG
Sbjct: 10 SFELNGEPFRIISGALHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLD 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RF++ A+GL V LR GP+I EW GGLP WL + RS + F +
Sbjct: 70 GLLDLPRFLRLAHAEGLKVLLRPGPYICAEWDGGGLPHWLMSESDVQLRSSDPKFTAIID 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
RY +++ + A GGP+I Q+ENEYG + Y+++ +
Sbjct: 130 RYLDLLLPPLLPH--MAESGGPVIAVQVENEYGAYGNDA-----EYLKYLVEAFRSRGIE 182
Query: 216 VPWVMCKQDDAPDPVINACNGRQCGETFAG----------PNSPDKPAIWTENWTSFYQV 265
C Q + + G TF G + P+ P + E W ++
Sbjct: 183 ELLFTCDQVNPEHQQAGSIPGVLSTGTFGGKIETALATLRAHQPEGPLMCAEFWIGWFDH 242
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYY 318
+G R D+A + +A G+ VN YM+HGGTNFG T A +T Y
Sbjct: 243 WGGPHHTRDTADVAADLDKLLA--AGASVNIYMFHGGTNFGLTNGANHHHTYAPTITSYD 300
Query: 319 DQAPLDEYG 327
APL E G
Sbjct: 301 YDAPLTENG 309
Score = 42.4 bits (98), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 7/55 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
++L KG+AWVNG S+GRYW P Q+ Y +P L+P N L++LE
Sbjct: 518 LSLPGWTKGQAWVNGFSLGRYW------NRGPQQTLY-VPGPVLRPGANTLIVLE 565
>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
Length = 823
Score = 145 bits (367), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 98/326 (30%), Positives = 155/326 (47%), Gaps = 19/326 (5%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G + T + ++NG ++ + +HYPR W + I K G++ V VFWN+HE
Sbjct: 66 GGDFTVGKNTFLLNGQPFVVKAAELHYPRIPRPYWEQRIKMCKSLGMNTVCLYVFWNIHE 125
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
Q G+FDF+G D+ F + Q G+YV +R GP++ EW GGLP+WL I R D
Sbjct: 126 QQEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRED 185
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRW 204
+ F +K + + + A L GGPII+ Q+ENEYG V ++ + V+
Sbjct: 186 DPYFMARVKAFEAEVGRQL--APLTIQNGGPIIMVQVENEYGSYGVNKKYVSQIRDIVKA 243
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVI---NACNGRQCGETFAGPNS--PDKPAIWTENW 259
+ V L W +++ D ++ N G F PD P + +E W
Sbjct: 244 SGFDKVTL-FQCDWASNFENNGLDDLVWTMNFGTGSNIDAQFKRLKQLRPDAPLMCSEFW 302
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A+ + + ++ K + YM HGGT+FG A A
Sbjct: 303 SGWFDKWGARHETRPAKAMVEGIDEMLS--KNISFSLYMTHGGTSFGHWAGANSPGFAPD 360
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y AP++EYG PK+ L++
Sbjct: 361 VTSYDYDAPINEYGHA-TPKFWELRK 385
>gi|261406481|ref|YP_003242722.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261282944|gb|ACX64915.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 619
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 103/340 (30%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T++ +++G + SG+IHY R P+ W + K K G + V+T + WN+HEPQ
Sbjct: 4 LTWENGQYLLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQE 63
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSG D+ FI+ GL+V +R PFI EW +GGLP WL I R +
Sbjct: 64 GEFNFSGMADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPL 123
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAK 207
+ + Y ++ + L ++ GGPI+ Q+ENEYG +H++LE Y+R
Sbjct: 124 YLSKVDHYYDELIPQL--VPLLSTHGGPILAVQVENEYGSYGNDHAYLE----YLREGL- 176
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVI----------NACNGRQCGETFAGPNS--PDKPAIW 255
++ GV ++ D D ++ G + E+F ++P +
Sbjct: 177 ----VRRGVDVLLFTSDGPTDEMLLGGTLSDVHATVNFGSRVEESFRKYREYRAEEPLMV 232
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLT 315
E W ++ + ++ +R A D+A + + GS +N YM+HGGTNFG + A +
Sbjct: 233 MEFWNGWFDHWMEDHHVRDAADVAGVLDEMLE--MGSSMNMYMFHGGTNFGFYSGANHIQ 290
Query: 316 GY------YD-QAPLDEYGLLRQPKWGHLKELHSAVKLCL 348
Y YD APL E WG E + AV+ L
Sbjct: 291 AYEPTTTSYDYDAPLTE--------WGDKTEKYEAVRRVL 322
>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
Length = 588
Score = 145 bits (367), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 140/288 (48%), Gaps = 25/288 (8%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ ++NG ++SG++HY R P W + K K GL+ V+T + WN+HEPQ GQF F
Sbjct: 10 KEFLLNGQPFKIYSGAVHYFRIAPSEWRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVF 69
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
R D+ +F+K Q+ GLYV LR P+I EW +GGLP WL P +V RS+ F +
Sbjct: 70 EDRYDIGKFVKLAQSIGLYVILRPSPYICAEWEFGGLPAWLLRYPDMVVRSNTPRFMEKV 129
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y + ++ L + GGP+++ Q+ENEYG + Y+R L
Sbjct: 130 ANYYEALFKVL--VPLQITHGGPVLMMQVENEYGSFGND-----KAYLRHVKSLMETNGV 182
Query: 215 GVP-------WVMCKQDDA---PDPVINACNGRQCGETFAG------PNSPDKPAIWTEN 258
VP W + + D + A G + E A + + P + E
Sbjct: 183 DVPLFTADGSWQQALKAGSLIEDDVFVTANFGSKSRENLAELRQFMLMHHKNWPLMCMEF 242
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ + +E RSA+ +A + K + S+ N YM+ GGTNFG
Sbjct: 243 WDGWFNRWQEEIVTRSADSFQTDLAELV-KEQASF-NLYMFRGGTNFG 288
>gi|218260271|ref|ZP_03475643.1| hypothetical protein PRABACTJOHN_01305, partial [Parabacteroides
johnsonii DSM 18315]
gi|218224641|gb|EEC97291.1| hypothetical protein PRABACTJOHN_01305 [Parabacteroides johnsonii
DSM 18315]
Length = 539
Score = 145 bits (367), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 164/356 (46%), Gaps = 29/356 (8%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
Q + L LLL G + G + ++ +++G ++ + IHY R + W
Sbjct: 5 QNTAIWLTALLLFAFSGCNQKPAGEHTFAIGNKTFLLDGKPFVIKAAEIHYTRIPAEYWE 64
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFI 122
I K G++ + FWN+HE +PG+FDFSG+ D+ F + Q +Y+ LR GP++
Sbjct: 65 HRIQLCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKYDMYIMLRPGPYV 124
Query: 123 EGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
EW GGLP+WL I R+++ F K + I + A L ++GG II+ Q
Sbjct: 125 CSEWEMGGLPWWLLKKDDIKLRTNDPYFLERTKLFMNEIGKQL--ADLQITKGGNIIMVQ 182
Query: 183 IENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK-----QDDAPDPV---IN 232
+ENEYG + ++ V+ A T VP C Q++A D + IN
Sbjct: 183 VENEYGSYATDKEYIANIRDIVKGAGF------TDVPLFQCDWSSNFQNNALDDLVWTIN 236
Query: 233 ACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMK 290
G E F P+ P + +E W+ ++ +G + R AE + + + +
Sbjct: 237 FGTGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLD--R 294
Query: 291 GSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
G + YM HGGT FG A + + + Y AP+ E G PK+ L+EL
Sbjct: 295 GISFSLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAG-WTTPKYFKLREL 349
>gi|167750408|ref|ZP_02422535.1| hypothetical protein EUBSIR_01382 [Eubacterium siraeum DSM 15702]
gi|167656559|gb|EDS00689.1| glycosyl hydrolase family 35 [Eubacterium siraeum DSM 15702]
Length = 579
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 165/668 (24%), Positives = 274/668 (41%), Gaps = 120/668 (17%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
++G + SGSIHY R+ P+ W + K G + V+T + WN HE + G F+++G
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWNGMH 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
D+ RFI+ GLY+ +R P+I EW +GGLP WL + R +P+ + Y
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDSYY 131
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
+++ M K A GG II+ QIENEYG + S+LE +R + +
Sbjct: 132 SVL--MPKLAPYQIDNGGNIIMMQIENEYGYYGNDTSYLEFLRDTMRKYGITVPFVTSDG 189
Query: 217 PW----VMCKQDDAPDPVINACNGR--QCGET--FAGPNSPDKPAIWTENWTSFYQVYGD 268
PW D P N + Q GE F G DKP + E W ++ V+G+
Sbjct: 190 PWSEFVFKSGMVDGALPTGNFGSSAEWQFGEMRRFIG---EDKPLMCMEFWNGWFDVWGE 246
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG------RTASAYVLTGYYDQAP 322
E I + E A + + +K +N+YM+ GGTNFG ++T Y AP
Sbjct: 247 EHNITAPEKAAQELDIL---LKNGSMNFYMFEGGTNFGFMSGKNNEKKTGIVTSYDYDAP 303
Query: 323 LDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVN 382
L E G + + K+ KE+ S ++ V ++ +L+ G C A
Sbjct: 304 LTEDGRITE-KYEKCKEVISRY-----TDINEVPLTTQIRRLE-----YGEIRCTA---- 348
Query: 383 KDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIP-TYDETS 441
KT F+T LDS+ + K P +++E
Sbjct: 349 -----------------------------KTDLFST--LDSIS--DPVKSVYPLSFEELD 375
Query: 442 LRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHG 501
++L +++ ++ ++ S ++ + + F NG++ +A
Sbjct: 376 SYYGYVLYRLHIREN----------------ETVSTVRCENAADRVQGFRNGKYAFTAFA 419
Query: 502 KHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFS 561
+ D+ F L + + LL +G + G LE + G + G + D
Sbjct: 420 ETIDEQFELAEK----SAGGTTDLLVENIGRVNFGTGLECQHKG-----VLGGIRINDHR 470
Query: 562 SFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLI 621
+ + L E DY G + P +YK F+ +D ++
Sbjct: 471 QYGFEMFTLPLDENQLDRIDYNR--------GYNDGVP-AFYKFEFEISETADTF-LDTD 520
Query: 622 SMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGI 681
KG A++NG ++GR+W Q +IP LK N +V+ E E G
Sbjct: 521 GFRKGVAFINGFNLGRFW-------NIGPQKKLYIPAPLLKKGKNEIVIFETE-----GN 568
Query: 682 SIDTVSVT 689
S D+++++
Sbjct: 569 SADSITLS 576
>gi|164519026|ref|NP_001073876.2| beta-galactosidase-1-like protein 3 [Homo sapiens]
gi|269849685|sp|Q8NCI6.3|GLBL3_HUMAN RecName: Full=Beta-galactosidase-1-like protein 3
Length = 653
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 174/349 (49%), Gaps = 30/349 (8%)
Query: 14 LTTIGGSDGGGGGGNNVTYDGR-SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGG 72
LT + + G G T G+ + GH+ ++F GSIHY R + W + K K G
Sbjct: 56 LTPLELKNRSVGLGTESTGRGKPHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACG 115
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
+ V T V WNLHEP+ G+FDFSG DL F+ GL+V LR G +I E GGLP
Sbjct: 116 FNTVTTYVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGRYICSEMDLGGLP 175
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH 192
WL P ++ R+ N+ F +++Y ++ + L Q GP+I Q+ENEYG
Sbjct: 176 SWLLQDPRLLLRTTNKSFIEAVEKYFDHLIP--RVIPLQYRQAGPVIAVQVENEYG---- 229
Query: 193 SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAP-------DPVINACNGRQCGE-TFA 244
SF K Y+ + K L+ G+ ++ D V+ A N ++ + TF
Sbjct: 230 SF-NKDKTYMPYLHKAL--LRRGIVELLLTSDGEKHVLSGHTKGVLAAINLQKLHQDTFN 286
Query: 245 GPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGG 302
+ DKP + E W ++ +GD+ ++ A+++ + V+ FI K + S+ N YM+HGG
Sbjct: 287 QLHKVQRDKPLLIMEYWVGWFDRWGDKHHVKDAKEVEHAVSEFI-KYEISF-NVYMFHGG 344
Query: 303 TNFGRTASAY-------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
TNFG A ++T Y A L E G + K+ L++L +V
Sbjct: 345 TNFGFMNGATYFGKHSGIVTSYDYDAVLTEAGDYTE-KYLKLQKLFQSV 392
>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
12058]
Length = 1106
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 98/327 (29%), Positives = 150/327 (45%), Gaps = 39/327 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
S ++NG ++ + +HYPR W + I K G++ V VFWN HEPQPG +DF+
Sbjct: 356 SFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFT 415
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
+ DL F + Q +YV LR GP++ EW GGLP+WL I R + F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVN 475
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVR---------- 203
+ + +K L + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGADKGYVSQIRDIVRTHFGNDIALF 533
Query: 204 ---WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTEN 258
WA+ ++ + W M N G + FA P+ P + +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKKLRPNSPLMCSEF 582
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W+ ++ +G R AED+ + ++ +G + YM HGGTN+G A A
Sbjct: 583 WSGWFDKWGANHETRPAEDMIKGIDDMLS--RGISFSLYMTHGGTNWGHWAGANSPGFAP 640
Query: 314 -LTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y AP+ E G PK+ L+E
Sbjct: 641 DVTSYDYDAPISESGQT-TPKYWKLRE 666
>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
Length = 791
Score = 145 bits (366), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 157/357 (43%), Gaps = 39/357 (10%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+ LL+T + G T ++ ++NG ++ + +HYPR W
Sbjct: 9 HFIATVALLVTAMLSPVSAARKGGTFTVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHR 68
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I K G++ V VFWN+HE Q G+FDF+ D+ F + Q GLYV +R GP++
Sbjct: 69 IKMCKALGMNTVCLYVFWNIHEQQEGKFDFTDNNDVAEFCRLAQRNGLYVIVRPGPYVCA 128
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
EW GGLP+WL I R + F +K + + + A L GGPII+ Q+E
Sbjct: 129 EWEMGGLPWWLLKKKDIRLREPDPYFMERVKLFERKVGEQL--ASLTIQNGGPIIMVQVE 186
Query: 185 NEYG--------------MVEHSFLEKGPPY-VRWAAKLAVDLQTGVPWVMCKQDDAPDP 229
NEYG +V S +K + WA+ + + W M
Sbjct: 187 NEYGSYGENKAYVSAIRDIVRQSGFDKVTLFQCDWASNFEKNGLDDLVWTM--------- 237
Query: 230 VINACNGRQCGETFA--GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIA 287
N G + F G P+ P + +E W+ ++ +G R A+ + + ++
Sbjct: 238 --NFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWFDKWGARHETRPAKTMVEGIDEMLS 295
Query: 288 KMKGSYVNYYMYHGGTNFGRTASAYV------LTGYYDQAPLDEYGLLRQPKWGHLK 338
KG + YM HGGT+FG A A +T Y AP++EYG PK+ L+
Sbjct: 296 --KGISFSLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYGQA-TPKYWELR 349
>gi|257899628|ref|ZP_05679281.1| glycosyl hydrolase [Enterococcus faecium Com15]
gi|257837540|gb|EEV62614.1| glycosyl hydrolase [Enterococcus faecium Com15]
Length = 595
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +D+V+F+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKDIVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|297204198|ref|ZP_06921595.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
gi|197714112|gb|EDY58146.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
Length = 588
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 148/315 (46%), Gaps = 26/315 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T +++G + SG++HY R P W + KA+ GL+ ++T + WNLHEP+P
Sbjct: 7 LTTSSDGFLLHGEPFRIISGAMHYFRIHPDQWTDRLRKARLMGLNTIETYLPWNLHEPEP 66
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G G DL R+++ Q +GL+V LR GPFI EW GGLP WL P I RS +
Sbjct: 67 GTLVLDGFLDLPRWLRLAQDEGLHVLLRPGPFICAEWDDGGLPAWLLADPDIRLRSSDPR 126
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F Y ++ ++ A+ GGP+I Q+ENEYG Y++ +
Sbjct: 127 FTGAFDGYLDQLLPALRP--FMAAHGGPVIAVQVENEYGAYGDDTA-----YLKHVHQAL 179
Query: 210 VDLQTGVPWVMCKQDDA--------PDPVINACNGRQCGETFAG--PNSPDKPAIWTENW 259
D C Q A P + A G + E A + P+ P + +E W
Sbjct: 180 RDRGVEELLYTCDQASAEHLAAGTLPGTLATATFGSRVEENLAALRTHQPEGPLMCSEFW 239
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------- 312
++ +G +RSA D A + ++ G+ VN YM+HGGTNFG T A
Sbjct: 240 VGWFDHWGGPHHVRSAADAAADLDRLLS--AGASVNIYMFHGGTNFGFTNGANHKHAYEP 297
Query: 313 VLTGYYDQAPLDEYG 327
+T Y APL E G
Sbjct: 298 TVTSYDYDAPLTESG 312
Score = 43.1 bits (100), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 44/86 (51%), Gaps = 8/86 (9%)
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
VP+ ++T +++ F+ + +D ++L KG+AWVNG +GRYW
Sbjct: 489 VPFGPSTATTDAVPAFHRGTFEVDSPAD-TFLSLPGWTKGQAWVNGFHLGRYW------N 541
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLE 672
P + Y +P L+P N LVLLE
Sbjct: 542 RGPQHTLY-VPAPVLRPGANELVLLE 566
>gi|431919325|gb|ELK17922.1| Beta-galactosidase-1-like protein 3 [Pteropus alecto]
Length = 1113
Score = 145 bits (366), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 108/338 (31%), Positives = 160/338 (47%), Gaps = 43/338 (12%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ +F GSIHY R + W + K K G + V T V WNLHEPQ G FDFS
Sbjct: 631 LGGHKFRIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPQRGAFDFSENL 690
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL F+ GL+V LR GP+I E GGLP WL + R+ ++ F + +Y
Sbjct: 691 DLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDSNVRLRTTDQGFVEAVDKYF 750
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVRWAAKL 208
++ + L QGGPII Q+ENEYG ++ + L++G + +
Sbjct: 751 DHLI--ARVVPLQYRQGGPIIAVQVENEYGSFDKDKYYMPYIQQALLKRGIVELLLTSDA 808
Query: 209 AVDLQTG-VPWVMCK------QDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTS 261
++ G + V+ Q+DA +P+ N +KP + E W
Sbjct: 809 KTEVLKGYIKGVLAAINIEKFQNDAFEPLYNI--------------QKNKPILVMEYWVG 854
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VL 314
++ +GDE ++ A+D+ V+ FI K + S+ N YM+HGGTNFG A +
Sbjct: 855 WFDKWGDEHNVKDAQDVENTVSEFI-KFEISF-NVYMFHGGTNFGFINGATNFGKHKSIA 912
Query: 315 TGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPML 352
T Y A L E G + K+ L++L +V P L
Sbjct: 913 TSYDYDAVLTEAGDYTE-KYFKLRKLFGSVLALPLPHL 949
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 133/295 (45%), Gaps = 21/295 (7%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
+G + ++G ++ +G+IHY R + W + K K G + V V W+ HEPQ +F
Sbjct: 52 EGSNFTLDGFPFLIIAGTIHYFRVPREYWKDRLLKLKACGFNTVTMHVPWSHHEPQRHKF 111
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
F+G DL FI +GL+V L GP+I + GGLP WL P + R+ + F
Sbjct: 112 YFTGDLDLRAFISIASNEGLWVILCPGPYIGSDLDLGGLPSWLLQDPKMKLRTTYKGFTK 171
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDL 212
+ +Y ++ + A GPII Q+ENEYG L+K Y+ + K V
Sbjct: 172 AVNQYFDQLIP--RIAPFQYENYGPIIAVQVENEYGSYH---LDKR--YMSYVKKALV-- 222
Query: 213 QTGVPWVMCKQDDAPD-------PVINACNGRQC-GETFAGPNSPD--KPAIWTENWTSF 262
+ G+ ++ DD + VI + + ET+ S P + TS
Sbjct: 223 KRGIKAMLMTADDGQEIIRGYLNKVIATVHMKNIKKETYKNLFSIQGLSPILMMVYTTSS 282
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY 317
+G + + +V ++ S+ N+YM+HGGTNFG A L Y
Sbjct: 283 SDSWGHSHHTLDSHVLMKNVHEMF-NLRFSF-NFYMFHGGTNFGFIGGASSLNSY 335
>gi|257888197|ref|ZP_05667850.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|431040248|ref|ZP_19492755.1| beta-galactosidase [Enterococcus faecium E1590]
gi|431763679|ref|ZP_19552228.1| beta-galactosidase [Enterococcus faecium E3548]
gi|257824251|gb|EEV51183.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
gi|430562100|gb|ELB01353.1| beta-galactosidase [Enterococcus faecium E1590]
gi|430622052|gb|ELB58793.1| beta-galactosidase [Enterococcus faecium E3548]
Length = 595
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +++VRF+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|425056292|ref|ZP_18459750.1| putative beta-galactosidase [Enterococcus faecium 505]
gi|403032128|gb|EJY43702.1| putative beta-galactosidase [Enterococcus faecium 505]
Length = 595
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +D+V+F+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKDVVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|311281324|ref|YP_003943555.1| glycoside hydrolase [Enterobacter cloacae SCF1]
gi|308750519|gb|ADO50271.1| glycoside hydrolase family 35 [Enterobacter cloacae SCF1]
Length = 591
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 91/284 (32%), Positives = 140/284 (49%), Gaps = 22/284 (7%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++L+ +G L SG+IHY R PQ W + K G + V+T + WN+H+P P +F F
Sbjct: 8 KNLLQDGKPVQLISGAIHYFRLVPQYWEHSLNNLKALGANCVETYLPWNIHQPDPERFCF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G D+ RFI Q +GL+V LR P+I EW +GGLP WL P + RS F +
Sbjct: 68 TGMADVERFIALAQRKGLFVILRPSPYICAEWEFGGLPAWLLRDPSMRVRSSQPAFLQAV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+RY + + + A +GGP+++ Q+ENEYG + Y+R A +
Sbjct: 128 ERYYAEL--LPRLAPWQYDRGGPVVMMQLENEYGSFGND-----KAYLRTLAAMMRRYGV 180
Query: 215 GVP-------WVMCKQDDA--PDPVINACN-GRQCGETF--AGPNSPDKPAIWTENWTSF 262
VP W Q + D V+ N G + E+ P++P + E W +
Sbjct: 181 SVPLFTSDGAWQEALQAGSLCEDNVLATANFGSRSAESLDNLAAFQPERPLMCLEFWNGW 240
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ YGD R A+D+ + + + + +N YM+ GGTNFG
Sbjct: 241 FNRYGDAIIRRDADDVGQEIRTLLTR---ASINIYMFQGGTNFG 281
>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
adhaerens]
Length = 543
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 155/314 (49%), Gaps = 34/314 (10%)
Query: 48 SGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEV 107
SG+IHY R P+ W + K K GL+ V+T V WNLHEP PGQFD++G ++ +FI
Sbjct: 15 SGAIHYFRVVPEYWRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFDYTGILNVRKFILLA 74
Query: 108 QAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA 167
Q G YV LR GP+I EW +GG+P WL + RS +PFK + R+ + +K+
Sbjct: 75 QELGFYVILRPGPYICAEWEFGGMPSWLLSDKNMQVRSTYKPFKDAVNRFFDGFIPEIKS 134
Query: 168 ARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAP 227
L AS+GGPII Q+ENEYG + E+ ++R A + G+ ++ D++
Sbjct: 135 --LQASKGGPIIAVQVENEYG--SYGSDEEYMQFIRDAL-----INRGIVELLVTSDNSE 185
Query: 228 DPVINACNGRQCGETFAG---------PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
G F G D P+I E W+ ++ +G++ I
Sbjct: 186 GIKHGGAPGVLKTYNFQGHAKSHLSILERLQDAPSIVMEFWSGWFDHWGEKN--HQVHTI 243
Query: 279 AYHVALF--IAKMKGSYVNYYMYHGGTNFGRTASAYVL----------TGYYDQAPLDEY 326
A+ F I S+ N+Y++HGGTNFG A + T Y APL E
Sbjct: 244 AHVTNTFKDILDCDASF-NFYVFHGGTNFGFMNGANFIDFFSYYLPTVTSYDYDAPLSEA 302
Query: 327 GLLRQPKWGHLKEL 340
G + + K+ L+++
Sbjct: 303 GDITE-KYMELRKI 315
>gi|227552575|ref|ZP_03982624.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257896912|ref|ZP_05676565.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|293379016|ref|ZP_06625170.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|431750982|ref|ZP_19539676.1| beta-galactosidase [Enterococcus faecium E2620]
gi|227178324|gb|EEI59296.1| possible beta-galactosidase [Enterococcus faecium TX1330]
gi|257833477|gb|EEV59898.1| glycosyl hydrolase [Enterococcus faecium Com12]
gi|292642358|gb|EFF60514.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
gi|430616240|gb|ELB53164.1| beta-galactosidase [Enterococcus faecium E2620]
Length = 595
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +++VRF+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|256393561|ref|YP_003115125.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256359787|gb|ACU73284.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 584
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
++T DG SL +G + SG +HY R P W + KA+ GL+ + T + WNLHE +
Sbjct: 5 DITGDGFSL--DGQPFRIVSGGLHYFRVHPAQWSDRLRKARLMGLNTIDTYIPWNLHERR 62
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG FDF G DL F+ A+GL+V LR GP+I GEW GGLP WL P + RS +
Sbjct: 63 PGTFDFGGILDLAAFLDAAAAEGLHVLLRPGPYICGEWEGGGLPSWLLADPDLALRSTDP 122
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F ++ Y I+ ++ RL ++GGP+I Q+ENEYG + +++E+
Sbjct: 123 AFLQAVEAYLDAIMPIV-LPRL-GTRGGPVIAVQVENEYGAYGSDTAYMER---LYEALT 177
Query: 207 KLAVDLQTGVPWVMCKQ-----DDAPDPVINACN-GRQCGETFAG--PNSPDKPAIWTEN 258
+D VP+ Q D A V+ N G + + A P P + E
Sbjct: 178 SRGID----VPFFTSDQPNDLADGALPGVLATANFGGKVTASLAALRAQQPTGPLMCAEF 233
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA------- 311
W ++ +G RSAED AL G+ VN+YM+HGGTNFG T A
Sbjct: 234 WNGWFDYWGGTHAQRSAEDAG--AALEEMLQAGASVNFYMFHGGTNFGFTNGANDKGTYR 291
Query: 312 YVLTGYYDQAPLDEYG 327
+T Y +PLDE G
Sbjct: 292 ATVTSYDYDSPLDEAG 307
>gi|219870459|ref|YP_002474834.1| beta-galactosidase [Haemophilus parasuis SH0165]
gi|219690663|gb|ACL31886.1| beta-galactosidase, glucosyl hydrolase family protein [Haemophilus
parasuis SH0165]
Length = 596
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 153/317 (48%), Gaps = 34/317 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ ++NG + SG++HY R P+ W + + K G + V+T V WNLH+PQP QF+F
Sbjct: 8 KDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFNF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
S R DLV+F++ + GLYV LR P+I EW +GGLP WL ++P I R ++ F +
Sbjct: 68 SKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAEI 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
RY + + + A +QGG I++ QIENEYG + Y+R L +
Sbjct: 128 DRYFQEL--LPRIAPYQITQGGNILMMQIENEYGSFGND-----KNYLRAIRALMLIHGV 180
Query: 215 GVP-------W-------VMCKQDDAPDPVINACNGRQCGET--FAGPNSPDKPAIWTEN 258
VP W + + D P + + E + + P + E
Sbjct: 181 NVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDKHGKSYPLMCMEF 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--RTASAYV--- 313
W ++ + + R A+D+A + + + +N+YM+ GGTNFG SA +
Sbjct: 241 WDGWFNRWKEPVIRRDAQDLANCTKELLER---ASINFYMFQGGTNFGFWNGCSARLDTD 297
Query: 314 ---LTGYYDQAPLDEYG 327
+T Y AP+ E+G
Sbjct: 298 LPQVTSYDYDAPVHEWG 314
>gi|402895880|ref|XP_003911040.1| PREDICTED: beta-galactosidase-1-like protein 3 [Papio anubis]
Length = 653
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 161/326 (49%), Gaps = 35/326 (10%)
Query: 19 GSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQT 78
G+ G G + T +GR +I G GSIHY R W + K + G + V T
Sbjct: 69 GTASTGRGKPHFTLEGRRFLICG-------GSIHYFRVPRAYWRDRLLKLRACGFNTVTT 121
Query: 79 LVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDV 138
V WNLHEP+ G+FDFSG DL F+ GL+V LR GP+I E GGLP WL
Sbjct: 122 YVPWNLHEPERGKFDFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQD 181
Query: 139 PGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKG 198
P ++ R+ N+ F +++Y ++ + L QGGP+I Q+ENEYG SF K
Sbjct: 182 PRLLLRTTNKGFTEAVEKYFDHLIP--RVIPLQYRQGGPVIAVQVENEYG----SF-NKD 234
Query: 199 PPYVRWAAKLAVDLQTGVPWVMCKQDDAPDP-------VINACNGRQCGE-TFAGPN--S 248
Y+ + K L+ G+ ++ D + V+ A N ++ TF +
Sbjct: 235 KTYMPYLHKAL--LRRGIVELLLTSDGEKNVLSGHTKGVLAAINLQKVQRNTFNQLHKVQ 292
Query: 249 PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRT 308
DKP + E W ++ +GD+ ++ A+++ V+ FI K + S+ N YM+HGGTNFG
Sbjct: 293 RDKPLLVMEYWVGWFDRWGDKHHVKDAKEVERAVSEFI-KYEISF-NVYMFHGGTNFGFM 350
Query: 309 ASAY-------VLTGYYDQAPLDEYG 327
A ++T Y A L E G
Sbjct: 351 NGATNFGKHTGIVTSYDYDAVLTEAG 376
>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
Length = 583
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/334 (30%), Positives = 160/334 (47%), Gaps = 35/334 (10%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG +T DG + ++G + SG+IHY R Q W + + GL+ + + WN
Sbjct: 2 GGEKVGLTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWN 61
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
LHE + G FDF+G DLV F GL V R GP+I EW +GGLP WL P +
Sbjct: 62 LHEKERGNFDFAGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHI 121
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
RS+ ++ + Y + ++ ++ A L S GGPII Q+ENEYG +++K ++
Sbjct: 122 RSNYCGYQAAVSSYFSKLLPLL--APLQHSNGGPIIAFQVENEYG----DYVDKDNEHLP 175
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPV-------------INACNGRQCGETFAGPN-SP 249
W A L +++ + + D + +N+ + + + F+ + P
Sbjct: 176 WLADL---MKSHGLFELFFISDGGHTIRKANMLKVRSTAQLNSGSFQLLAKAFSLKSLQP 232
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
+KP + TE W ++ +G + + E + L +G+ VN+YM+HGGTNFG
Sbjct: 233 NKPMLVTEFWAGWFDYWGHGRNLLNNE--VFEKTLKEILKRGASVNFYMFHGGTNFGFMN 290
Query: 310 SAYVL-TGYYD--------QAPLDEYGLLRQPKW 334
A L GYY P+DE G R KW
Sbjct: 291 GAIELEKGYYTADVTSYDYDCPVDESG-NRTEKW 323
Score = 40.8 bits (94), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 50/108 (46%), Gaps = 9/108 (8%)
Query: 584 SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFL 643
S I W+ Y + +KT I + KG +VNG+++GRYWV+
Sbjct: 484 SSITAWTNYLQTAAVLPALFKTTVKILDYPKDTFILMHGWSKGVIFVNGRNLGRYWVT-K 542
Query: 644 TPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISIDTVSVTTL 691
PQ T ++P S+L N ++ LEEE G+SI+ VS L
Sbjct: 543 GPQKT-----LYLPASWLIKGENEIIWLEEEQ---LGMSIELVSSPDL 582
>gi|424764212|ref|ZP_18191655.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
gi|402420907|gb|EJV53177.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
Length = 595
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +++VRF+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|423342145|ref|ZP_17319860.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
gi|409219016|gb|EKN11981.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
CL02T12C29]
Length = 779
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/356 (28%), Positives = 164/356 (46%), Gaps = 29/356 (8%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
Q + L LLL G + G + ++ +++G ++ + IHY R + W
Sbjct: 5 QNTAIWLTALLLFAFSGCNQKPAGEHTFAIGNKTFLLDGKPFVIKAAEIHYTRIPAEYWE 64
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFI 122
I K G++ + FWN+HE +PG+FDFSG+ D+ F + Q +Y+ LR GP++
Sbjct: 65 HRIQLCKALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKYDMYIMLRPGPYV 124
Query: 123 EGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
EW GGLP+WL I R+++ F K + I + A L ++GG II+ Q
Sbjct: 125 CSEWEMGGLPWWLLKKDDIKLRTNDPYFLERTKLFMNEIGKQL--ADLQITKGGNIIMVQ 182
Query: 183 IENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK-----QDDAPDPV---IN 232
+ENEYG + ++ V+ A T VP C Q++A D + IN
Sbjct: 183 VENEYGSYATDKEYIANIRDIVKGAGF------TDVPLFQCDWSSNFQNNALDDLVWTIN 236
Query: 233 ACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMK 290
G E F P+ P + +E W+ ++ +G + R AE + + + +
Sbjct: 237 FGTGANIDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLD--R 294
Query: 291 GSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
G + YM HGGT FG A + + + Y AP+ E G PK+ L+EL
Sbjct: 295 GISFSLYMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAG-WTTPKYFKLREL 349
Score = 44.3 bits (103), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 87/201 (43%), Gaps = 27/201 (13%)
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
S + L ++ + + NG+ +G + + S K+ L GT L+ M +
Sbjct: 420 SGTTLLITEVHDWAQVYANGKLLGRLDRRRGENSL---KLPALAAGTQLDILIEAMGRVN 476
Query: 534 DSGAYLERR--VAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
A +R+ + ++ +ELK++ +S+ + EK + P
Sbjct: 477 FDKAIHDRKGITEKVELLNESSTQELKNWQVYSFPVDYPFVKEK---------KYAP--- 524
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQ 651
G P +Y+ F+ D V +++ + GKG WVNG++IGR+W + PQ T
Sbjct: 525 -GKKLDGP-AYYRATFNLEEAGD-VFLDMQTWGKGMVWVNGKAIGRFWE--IGPQQT--- 576
Query: 652 SWYHIPRSFLKPTGNLLVLLE 672
+P +LK N +++L+
Sbjct: 577 --LFMPGCWLKKGENEIIVLD 595
>gi|431741495|ref|ZP_19530400.1| beta-galactosidase [Enterococcus faecium E2039]
gi|430601673|gb|ELB39267.1| beta-galactosidase [Enterococcus faecium E2039]
Length = 595
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +++VRF+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
Length = 780
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/325 (29%), Positives = 149/325 (45%), Gaps = 38/325 (11%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G + T + ++NG ++ + +HYPR W + I K G++ + VFWN+HE
Sbjct: 25 GGDFTVGKNTFLLNGRPFVIKAAELHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHE 84
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
+ GQFDF+G D+ F + G+YV +R GP++ EW GGLP+WL + R D
Sbjct: 85 QREGQFDFTGNNDVAAFCRLAHKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVRLRED 144
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG--------------MVEH 192
+ F +K + + + A L GGPII+ Q+ENEYG +V+
Sbjct: 145 DPYFMARVKAFEAEVGRQL--APLTIQNGGPIIMVQVENEYGSYGINKKYVSEIRDIVKA 202
Query: 193 SFLEKGPPY-VRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--P 249
S +K + WA+ + + W M N G E F P
Sbjct: 203 SGFDKVTLFQCDWASNFEHNGLDDLVWTM-----------NFGTGANIDEQFRRLKQLRP 251
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
+ P + +E W+ ++ +G R A+D+ + + KG + YM HGGT+FG A
Sbjct: 252 EAPLMCSEFWSGWFDKWGARHETRPAKDMVEGIDEML--RKGISFSLYMTHGGTSFGHWA 309
Query: 310 SAYV------LTGYYDQAPLDEYGL 328
A +T Y AP++EYG+
Sbjct: 310 GANSPGFAPDVTSYDYDAPINEYGM 334
Score = 39.3 bits (90), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 8/75 (10%)
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIP 657
Q + +Y+ FD D +NL GKG+ +VNG ++GR+W + PQ T ++P
Sbjct: 536 QNIGYYRGYFDLKKTGD-TFLNLEQWGKGQVYVNGHALGRFW--HIGPQQT-----LYLP 587
Query: 658 RSFLKPTGNLLVLLE 672
+LK N +++L+
Sbjct: 588 GCWLKKGRNEIIVLD 602
>gi|293570811|ref|ZP_06681858.1| beta-galactosidase [Enterococcus faecium E980]
gi|430840422|ref|ZP_19458347.1| beta-galactosidase [Enterococcus faecium E1007]
gi|431064256|ref|ZP_19493603.1| beta-galactosidase [Enterococcus faecium E1604]
gi|431124630|ref|ZP_19498626.1| beta-galactosidase [Enterococcus faecium E1613]
gi|431738579|ref|ZP_19527522.1| beta-galactosidase [Enterococcus faecium E1972]
gi|291609079|gb|EFF38354.1| beta-galactosidase [Enterococcus faecium E980]
gi|430495187|gb|ELA71394.1| beta-galactosidase [Enterococcus faecium E1007]
gi|430566915|gb|ELB06003.1| beta-galactosidase [Enterococcus faecium E1613]
gi|430568897|gb|ELB07927.1| beta-galactosidase [Enterococcus faecium E1604]
gi|430597307|gb|ELB35110.1| beta-galactosidase [Enterococcus faecium E1972]
Length = 595
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +++VRF+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
51196]
Length = 664
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 171/351 (48%), Gaps = 42/351 (11%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGG-----NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQM 60
+L LF L ++ + + G + + +++G + SG +HY R
Sbjct: 1 MLALFLLPVSVMAAARRGNSSALSDQRGSFRVENGKFVLDGQPFQIISGEMHYERIPRAY 60
Query: 61 WPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGP 120
W + AK GL+ + T VFWNLHEP+PG+FDFSG DL +FI++ Q GL V LR GP
Sbjct: 61 WKARLQMAKAMGLNTIATYVFWNLHEPEPGKFDFSGNADLAQFIRDAQQTGLKVLLRAGP 120
Query: 121 FIEGEWGYGGLPFWLHDVPGI--VFRSDNEPFKFHMKRYATMIVNM-MKAARLYASQGGP 177
+ EW +GG P WL P + RS++ F MK I+ + + A L GGP
Sbjct: 121 YSCAEWEFGGFPAWLMKNPKMQTALRSNDPEF---MKPAEQWILRLGREVAPLQVGYGGP 177
Query: 178 IILSQIENEYG-------MVEH---SFLEKG-PPYVRWAAKLAVDLQTG-VPWVMCKQDD 225
II QIENEYG +EH FL+ G + + A + L G +P V +
Sbjct: 178 IIGVQIENEYGDFGGDAAYLEHLKKIFLKAGFTQSLLYTANPSRALVRGSIPGVYSAVNF 237
Query: 226 APDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALF 285
AP A + + AG +P + +E WT ++ +G+ ++ ++ V F
Sbjct: 238 APGHAAQALD--SLAQLRAG-----QPLLSSEYWTGWFDHWGEP---HQSKPLSLQVKDF 287
Query: 286 IAKMK-GSYVNYYMYHGGTNFG-RTASAYV-------LTGYYDQAPLDEYG 327
++ G+ VN YM+HGGT+FG + S++ +T Y APLDE G
Sbjct: 288 NYILRHGAGVNLYMFHGGTSFGMMSGSSWTKHQFLPDVTSYDYGAPLDEAG 338
>gi|194213011|ref|XP_001503026.2| PREDICTED: beta-galactosidase-1-like protein 3-like [Equus
caballus]
Length = 880
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 159/327 (48%), Gaps = 33/327 (10%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
+ GH+ ++F GSIHY R + W + K K G + V T V WNLHEP+ G+FDFSG
Sbjct: 248 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSG 307
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
DL F+ GL+V LR GP+I E GGLP L P + R+ ++ F + +
Sbjct: 308 NLDLEAFVLTAAEIGLWVILRPGPYICSEIDLGGLPSRLLQDPQVNLRTTDKGFVEAVDK 367
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGP--PYVRWAAKLAVDLQT 214
Y +++ + L +GGPII Q+ENEYG SF + PY++ A L+
Sbjct: 368 YFDHLIS--RVVHLQYRKGGPIIAVQVENEYG----SFYKDKDYMPYLQQAL-----LKR 416
Query: 215 GVPWVMCKQDDAPD----------PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
G+ ++ D+ D IN R+ DKP + E W ++
Sbjct: 417 GIVELLLTSDNVDDVLKGYIKGVLATINMKKFRKDAFQHLYKVQRDKPIMIMEYWVGWFD 476
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGY 317
+G + ++ A D+ V+ FI K + S+ N YM+HGGTNFG A V+T Y
Sbjct: 477 TWGSKHEVKDAGDVKNTVSEFI-KFEISF-NVYMFHGGTNFGFINGAINFVKHAGVVTSY 534
Query: 318 YDQAPLDEYGLLRQPKWGHLKELHSAV 344
A L E G + K+ L++L ++
Sbjct: 535 DYDAVLTEAGDYTK-KYFKLRKLFGSI 560
Score = 43.1 bits (100), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 566 GYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGK 625
G+ + L K+ F R VPW +S P +Y+ A + + L++
Sbjct: 709 GFTIYSLEMKMSFFKRL--RYVPWRPVPNSYSGP-AFYRATLRAGSSPKDTFLRLLNWNY 765
Query: 626 GEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEE 673
G ++NG+++GRYW+ + PQ T ++P ++L P N ++L E+
Sbjct: 766 GFVFINGRNLGRYWI--IGPQET-----LYLPGAWLHPEDNEIILFEK 806
>gi|354466872|ref|XP_003495895.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Cricetulus
griseus]
Length = 761
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 163/349 (46%), Gaps = 33/349 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
++GH+ ++ GSIHY R + W + K + G + V T + WNLHE G FDFS
Sbjct: 188 LDGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQNRGTFDFSEIL 247
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL ++ GL+V LR GP+I E GGLP WL P + R+ + F + +Y
Sbjct: 248 DLEAYVSLAATLGLWVILRPGPYICAEVDLGGLPSWLLGYPELQLRTTQQEFLDAVDKYF 307
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGP--PYVRWAAKLAVDLQTGV 216
++ + L +GGP+I QIENEYG SF + G Y++ A + + G+
Sbjct: 308 DHLIP--RILPLQYLRGGPVIAVQIENEYG----SFSKDGDYMEYIKEALQ-----KRGI 356
Query: 217 PWVMCKQDDAPDPVINACNGRQCGETFAGPNSP----------DKPAIWTENWTSFYQVY 266
++ D+ + G A DKP + E WT ++ +
Sbjct: 357 VELLLTSDNHKGIQTGSVKGALTTINMASFEKDSFIKLLQMQNDKPIMVMEYWTGWFDTW 416
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G E ++SAE+I Y V+ FI G N YM+HGGTNFG A+ V+T Y
Sbjct: 417 GREHNVKSAEEIRYTVSRFIK--YGISFNMYMFHGGTNFGFINGAFHYDKHSSVVTSYDY 474
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAF 368
A L E G + K+ L++L ++ + P L ++ + + AF
Sbjct: 475 DAVLTEAGDYTE-KYFKLRKLFASASVGFLPRLPQLIPKTVYPTVGLAF 522
>gi|329927841|ref|ZP_08281902.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938242|gb|EGG34637.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 619
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 164/340 (48%), Gaps = 42/340 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T+ +++G + SG+IHY R P+ W + K K G + V+T + WN+HEPQ
Sbjct: 4 LTWGNGQYLLDGQPYRIISGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIAWNVHEPQE 63
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F FSG D+ FI+ GL+V +R PFI EW +GGLP WL I R +
Sbjct: 64 GKFSFSGMADVASFIELAGKLGLHVIVRPSPFICAEWEFGGLPGWLLGYGEIRLRCSDPL 123
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAK 207
+ + Y ++ + L +S GGPI+ Q+ENEYG +H++L+ Y+R
Sbjct: 124 YLSKVDHYYDELIP--RLVPLLSSNGGPILAVQVENEYGSYGNDHAYLD----YLR---- 173
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVI----------NACNGRQCGETFAGPNS--PDKPAIW 255
A ++ G+ ++ D D ++ G + E+F ++P +
Sbjct: 174 -AGLVRRGIDVLLFTSDGPTDEMLLGGTLNDVHATVNFGSRVEESFRKYREYRTEEPLMV 232
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLT 315
E W ++ + ++ +R A D+A + + KGS +N YM+HGGTNFG + A +
Sbjct: 233 MEFWNGWFDHWMEDHHVRDAADVAGVLDEMLE--KGSSMNMYMFHGGTNFGFYSGANHIQ 290
Query: 316 GY------YD-QAPLDEYGLLRQPKWGHLKELHSAVKLCL 348
Y YD APL E WG E + AV+ L
Sbjct: 291 TYEPTTTSYDYDAPLTE--------WGDKTEKYEAVRRVL 322
>gi|431593417|ref|ZP_19521746.1| beta-galactosidase [Enterococcus faecium E1861]
gi|430591294|gb|ELB29332.1| beta-galactosidase [Enterococcus faecium E1861]
Length = 595
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +++VRF+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINAC------------------NGRQCGETFAGPNSPDKPAIWTE 257
VP + D A V++A Q + F + + P + E
Sbjct: 182 VP--LFTSDGAWLEVLDAGILIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|431758215|ref|ZP_19546843.1| beta-galactosidase [Enterococcus faecium E3083]
gi|430617878|gb|ELB54742.1| beta-galactosidase [Enterococcus faecium E3083]
Length = 595
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 141/289 (48%), Gaps = 30/289 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W + K G + V+T + WNLHEPQ G FDFS
Sbjct: 9 EFLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G +++VRF+K Q L V LR +I EW +GGLP WL P I RS + F +K
Sbjct: 69 GFKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLK 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 129 NYYQVL--LPKLAPLQITQGGPVIMMQLENEYGSYG---MEKS--YLRQTKELMLAHSID 181
Query: 216 VPWVMCKQDDAPDPVINACN------------------GRQCGETFAGPNSPDKPAIWTE 257
+P + D A V++A Q + F + + P + E
Sbjct: 182 IP--LFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 240 YWDGWFNRWGEPIITRDPEELATEVK---EMLEIGSLNLYMFHGGTNFG 285
>gi|380512533|ref|ZP_09855940.1| beta-galactosidase [Xanthomonas sacchari NCPPB 4393]
Length = 616
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 140/289 (48%), Gaps = 31/289 (10%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G I +G + SG+IH+ R W + KA+ GL+ V+T VFWNL EP+PGQFD
Sbjct: 38 GDHFIRDGKPYQVISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVEPRPGQFD 97
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F+ E AQGL V LR GP++ EW GG P WL PG+ RS + F
Sbjct: 98 FSGNNDIAAFVDEAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAEPGMRVRSQDPRFLAA 157
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+ Y + +K RL + GGPI+ Q+ENEYG +H+++ A+
Sbjct: 158 SQAYLDALAAQVK-PRLNGN-GGPIVAVQVENEYGSYGDDHAYMR---------LNRAMF 206
Query: 212 LQTGVPWVMCKQDDAPDPVINAC-------------NGRQCGETFAGPNSPDKPAIWTEN 258
+Q G + D PD + N + + ET A P +P + E
Sbjct: 207 VQAGFDKALLFTADGPDVLANGTLPDTLAVVNFAPGDAKNAFETLAK-FRPGQPQMVGEY 265
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMK-GSYVNYYMYHGGTNFG 306
W ++ +G++ +A D + F ++ G N YM+ GGT+FG
Sbjct: 266 WAGWFDQWGEK---HAATDATKQASEFEWILRQGHSANIYMFVGGTSFG 311
>gi|410100792|ref|ZP_11295748.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
gi|409214073|gb|EKN07084.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
CL02T12C30]
Length = 779
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 162/352 (46%), Gaps = 29/352 (8%)
Query: 9 LFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKA 68
L +L+ + G G ++ ++NG I+ + IHY R + W I
Sbjct: 11 LMVMLICVLSGCKNQSGSNGTFEIGDKTFLLNGKPFIIKAAEIHYTRIPVEYWEHRIQMC 70
Query: 69 KEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGY 128
K G++ + FWN+HE +PG+FDFSG+ D+ F + Q G+Y+ LR GP++ EW
Sbjct: 71 KALGMNTICIYAFWNIHEQKPGEFDFSGQNDIAAFCRLAQKNGMYIMLRPGPYVCSEWEM 130
Query: 129 GGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG 188
GGLP+WL I R+++ F + Y I + ++ ++GG II+ Q+ENEYG
Sbjct: 131 GGLPWWLLKKEDIQLRTNDPYFIERTRIYMNEIGKQLADRQI--TRGGNIIMVQVENEYG 188
Query: 189 --MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK-----QDDAPDPVINACN---GRQ 238
+ S++ K +R A T VP C ++A D ++ N G
Sbjct: 189 SYATDKSYIAKNRDILRDAGF------TDVPLFQCDWSSNFLNNALDDLVWTVNFGTGAN 242
Query: 239 CGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNY 296
E F P+ P + +E W+ ++ +G + R AE + + + + +
Sbjct: 243 IDEQFKKLKEVRPNTPLMCSEFWSGWFDHWGRKHETRDAETMIAGLRDMLD--RNISFSL 300
Query: 297 YMYHGGTNFGR------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
YM HGGT FG A + + + Y AP+ E G PK+ L+E +
Sbjct: 301 YMTHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWA-TPKYHKLREFMA 351
Score = 42.7 bits (99), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 46/201 (22%), Positives = 88/201 (43%), Gaps = 31/201 (15%)
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
+ L + + FI+G+ +G + + FT+ K+ G L+ M +
Sbjct: 422 TTLLIDEVHDWAQVFIDGKLIGRLDRRRGE--FTI-KLPATAAGARLDILIEAMGRVNFD 478
Query: 536 GAYLERRVAGLRN----VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
A +R+ G+ N ++ + ELKD+ ++ + +K + P
Sbjct: 479 KAIHDRK--GITNKVVLITESSSDELKDWQVYNLPVDYSFVKDK---------KYTP--- 524
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQ 651
G P +Y+ F+ T D V +++ + GKG WVNG+++GR+W + PQ T
Sbjct: 525 -GKKIEAP-AYYRATFNLETPGD-VFLDMQTWGKGMVWVNGKAMGRFWE--IGPQQT--- 576
Query: 652 SWYHIPRSFLKPTGNLLVLLE 672
+P +LK N +++L+
Sbjct: 577 --LFMPGCWLKKGENEIIVLD 595
>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 633
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 147/320 (45%), Gaps = 41/320 (12%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G +NG L SG +HY R + W + AK GL+ V T +FWN+HEP+PG +D
Sbjct: 46 GDHFELNGEPVQLLSGEMHYARIPREYWRARLQMAKAMGLNTVATYIFWNVHEPKPGVYD 105
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVP--GIVFRSDNEPFK 151
FSG D+ F+K Q +GL V LR GP+ EW +GG P WL P G RS++E +
Sbjct: 106 FSGNHDVAAFVKMAQEEGLNVILRAGPYACAEWEFGGYPSWLMKDPKMGSALRSNDEVYM 165
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVD 211
++R+ + M L S GGPI+ Q+ENEYG + G A L +
Sbjct: 166 APVERWIKRLGQEM--VPLLISNGGPIVAVQVENEYG-------DFGGDKKYLAHMLEIF 216
Query: 212 LQTGVPWVMCKQDDAPDPVIN-ACNGRQCGETFAGPNS-----------PDKPAIWTENW 259
G D ++N + G G F N+ P +P +E W
Sbjct: 217 QNAGFKDSFLYTVDPSKALVNGSLEGLPSGVNFGVGNAERGLTALAHLRPGQPLFASEYW 276
Query: 260 TSFYQVYGDEARIR----SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA---- 311
++ +G R +DIAY + S +N YM+HGGT+FG + A
Sbjct: 277 PGWFDHWGHPHETRPIPPQLKDIAYTLD------HKSSINIYMFHGGTSFGFMSGASWTG 330
Query: 312 --YV--LTGYYDQAPLDEYG 327
Y+ +T Y APLDE G
Sbjct: 331 GEYLPDVTSYDYDAPLDEAG 350
>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
Length = 788
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 158/335 (47%), Gaps = 31/335 (9%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G T ++ ++NG ++ + +HYPR W I K G++ V VFWN+HE
Sbjct: 29 GGTFTTGDKTFLLNGKPFVVKAAELHYPRIPRAYWEHRIKMCKALGMNTVCLYVFWNIHE 88
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
+ G+FDF+G D+ F + Q G+YV +R GP++ EW GGLP+WL I R
Sbjct: 89 QEEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQ 148
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
+ F ++ + + + A L GGPII+ Q+ENEYG K PYV +A
Sbjct: 149 DPYFMQRVEIFEKEVGKQL--APLTIQNGGPIIMVQVENEYGS-----YGKDKPYV--SA 199
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINA-----------CNGRQCGETFA--GPNSPDKPA 253
+ ++G V Q D +N G + F G P+ P
Sbjct: 200 IRDIVRKSGFDKVSLFQCDWSSNFLNNGLDDLTWTMNFGTGANIDQQFKRLGEVRPNAPK 259
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV 313
+ +E W+ ++ +G R A+D+ + ++ KG + YM HGGT+FG A A
Sbjct: 260 MCSEFWSGWFDKWGARHETRPAKDMVEGMDEMLS--KGISFSLYMTHGGTSFGHWAGANS 317
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
+T Y AP++E+GL PK+ L+++ +
Sbjct: 318 PGFQPDVTSYDYDAPINEWGLA-TPKFYELQKMMA 351
>gi|223942939|gb|ACN25553.1| unknown [Zea mays]
Length = 199
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 116/210 (55%), Gaps = 36/210 (17%)
Query: 623 MGKGEAWVNGQSIGRYWVSFLTPQ----------------------GTPSQSWYHIPRSF 660
MGKGEAWVNGQSIGRYW + L PQ G PSQ+ YH+PRSF
Sbjct: 1 MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSF 60
Query: 661 LKPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVSDSHLPPVISWRSQNQRTLKTHKRI 720
L+P N LVL E G P IS ++C VS++H + SW SQ
Sbjct: 61 LQPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQ---------- 110
Query: 721 PGRR--PKVQIRCP-SGRKISKILFASYGNPNGNCENYAIGSCHSSNSRAIVEKACLGKR 777
P +R P +++ CP G+ IS + FAS+G P+G C +Y+ G C S+ + +IV++AC+G
Sbjct: 111 PMQRYGPALRLECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVS 170
Query: 778 SCTVPVWTEKFYGDPCPGIPKALLVDAQCT 807
S + ++G+PC G+ K+L V+A C+
Sbjct: 171 S-CSVPVSSNYFGNPCTGVTKSLAVEAACS 199
>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
Length = 580
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/297 (31%), Positives = 148/297 (49%), Gaps = 30/297 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
++Y+ + ++ G L SG++HY R P+ W + K K G + V+T + WN+HEP+
Sbjct: 4 LSYEDQHFMLEGKPIQLISGAVHYFRIVPEYWEDRLRKVKAMGCNCVETYIAWNVHEPRD 63
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQF+F G D+V FI+ Q L V +R P+I EW +GG+P WL I R +
Sbjct: 64 GQFNFDGIADVVEFIRIAQRVDLLVIVRPSPYICAEWEFGGMPAWLLK-EDIRLRCSDPR 122
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGP 199
F + Y ++ +K L ++ GGPII QIENEYG + + +E+G
Sbjct: 123 FLEKVSAYYDALIPQLKP--LLSTSGGPIIAVQIENEYGSYGNDQAYLQALRNMLVERGI 180
Query: 200 PYVRWAAKLAVD--LQTGVPWVMCKQDDAPDPVINACN-GRQCGETFAGPNS--PDKPAI 254
+ + + D LQ G+ + V+ N G + E F P+ P +
Sbjct: 181 DVLLFTSDGPADDMLQGGMT----------EGVLATVNFGSRPKEAFGKLEEYQPNAPLM 230
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
E W ++ + +E RSAED A + ++ G+ VN+YM HGGTNFG ++ A
Sbjct: 231 CMEYWNGWFDHWFEEHHTRSAEDAAQVLDEMLSM--GASVNFYMLHGGTNFGFSSGA 285
>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
CL03T12C61]
Length = 774
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 148/316 (46%), Gaps = 31/316 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ DG + ++G L G +HY R + W + +A+ GL+ + VFWN HE QP
Sbjct: 29 IKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+FDFSG+ D+ F++ Q +GLYV LR GP+ EW +GG P WL +V+RS +
Sbjct: 89 GEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDPR 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F + +RY + + A L + GG I++ Q+ENEYG Y+ +
Sbjct: 149 FLEYCERYIKALGKQL--APLTVNNGGNILMVQVENEYGSYAAD-----KEYLAALRDMI 201
Query: 210 VDLQTGVPWVMCK---QDDAP--DPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSF 262
D VP C Q +A D + NG + F + P P E + ++
Sbjct: 202 KDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPAW 261
Query: 263 YQVYGDEARI----RSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY- 317
+ V+G R AE + + + +G V+ YM+HGGTNF A GY
Sbjct: 262 FDVWGQRHSTVDYKRPAEQLDWMLG------QGVSVSMYMFHGGTNFWYMNGANTAGGYR 315
Query: 318 -----YD-QAPLDEYG 327
YD APL E+G
Sbjct: 316 PQPTSYDYDAPLGEWG 331
Score = 42.4 bits (98), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 95/223 (42%), Gaps = 33/223 (14%)
Query: 469 HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTL-EKMVHLINGTNNVSLL- 526
H ++SE+VL + LG V +I+ + + GK L + V L++G SL
Sbjct: 382 HQTTESENVLSMEDLG-VDFGYIHYQTTINKAGKQKLIIQDLRDYAVILVDGKQVASLDR 440
Query: 527 -----SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTD 581
+VM+ + + A LE V V+ G L F+ QV EKL
Sbjct: 441 RYNQNNVMLDIQKAPATLEILVENTGRVNY-GPDIL--FNRKGITNQVLCGDEKLT---- 493
Query: 582 YGSRIVPWSRY---------GSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNG 632
G I P Y G S ++K +F D +++ GKG WVNG
Sbjct: 494 -GWSITPLPLYKEKVSEMNFGESIQGKPAFHKGIFTVRQKGD-CFVDMSRWGKGAVWVNG 551
Query: 633 QSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+S+GR+W + PQ T ++P +LK N +V+ E E+
Sbjct: 552 KSLGRFWN--IGPQQT-----LYLPAPWLKEGENEIVVFEMED 587
>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
Length = 584
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 96/309 (31%), Positives = 144/309 (46%), Gaps = 26/309 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG++HY R P +W I KA+ GL+ ++T V WN H P+PG FD S
Sbjct: 10 DFLLDGRPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLS 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL RF++ V G+Y +R GP+I EW GGLP WL P + R + ++
Sbjct: 70 GGLDLDRFLRLVADAGMYAIVRPGPYICAEWDNGGLPAWLFRDPSVGVRRYEPKYLDAVR 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y T + ++ ++ +GGP++L Q+ENEYG Y++ A+ +
Sbjct: 130 EYLTKVYEVVVPHQI--DRGGPVLLVQVENEYGA-----FGDDKRYLKALAEHTREAGVT 182
Query: 216 VPWVMCKQDDAPDPVINACNGRQCGETFAG----------PNSPDKPAIWTENWTSFYQV 265
VP Q + +G +F + P P + +E W ++
Sbjct: 183 VPLTTVDQPTPEMLEAGSLDGLHRTASFGSGAEARLAILRAHQPTGPLMCSEFWNGWFDH 242
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYY 318
+G SA D A + +A VN YM+HGGTNFG T A ++T Y
Sbjct: 243 WGAHHHTTSAADSAAELDALLAAGAS--VNLYMFHGGTNFGLTNGANDKGVYQPLITSYD 300
Query: 319 DQAPLDEYG 327
APLDE G
Sbjct: 301 YDAPLDEAG 309
>gi|332376142|gb|AEE63211.1| unknown [Dendroctonus ponderosae]
Length = 659
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 162/335 (48%), Gaps = 38/335 (11%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF---------SG 96
+FSG++HY R P W + K + GL+ V+T V WN+HEP+ G FDF S
Sbjct: 41 IFSGALHYFRVHPLYWRDRLKKYRAAGLNCVETYVPWNIHEPEDGSFDFGEDPDRNDFSL 100
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
DLV+F+K Q + L+V LR GP+I EW +GGLP WL + R+ + F F+++R
Sbjct: 101 FLDLVQFLKIAQEEDLFVILRPGPYICAEWEFGGLPSWLLRHEDLKVRTSDSKFLFYVER 160
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH-------SFLEKGPPYVRWAAKLA 209
Y ++ +++ + ++GG II QIENEYG V+ ++LE ++ +
Sbjct: 161 YFKKLLALVEPLQF--TKGGSIIAVQIENEYGNVKEDDKPIDIAYLEALKDIIKKNGIVE 218
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYG 267
+ + P P + A + CG A S P KP + E WT ++ Y
Sbjct: 219 LLFTSDTP-TQGFHGALPGVLATANCDKDCGLELARLESYQPTKPLMVMEYWTGWFDHYS 277
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVL------------T 315
++ I++ E ++ L M + N YM HGGTN+G A + T
Sbjct: 278 EKHHIQTVE--QFYANLSDILMGHASFNLYMMHGGTNWGFLNGANICGATDDNSGFQPDT 335
Query: 316 GYYD-QAPLDEYGLLRQPKWGHLKELHSAV-KLCL 348
YD APL E G K+ L++L + +LC+
Sbjct: 336 SSYDYHAPLAENGDYTD-KYVQLQQLTAEYNELCI 369
>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 774
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 148/316 (46%), Gaps = 31/316 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ DG + ++G L G +HY R + W + +A+ GL+ + VFWN HE QP
Sbjct: 29 IKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+FDFSG+ D+ F++ Q +GLYV LR GP+ EW +GG P WL +V+RS +
Sbjct: 89 GEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDPR 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F + +RY + + A L + GG I++ Q+ENEYG Y+ +
Sbjct: 149 FLEYCERYIKALGKQL--APLTVNNGGNILMVQVENEYGSYAAD-----KEYLAALRDMI 201
Query: 210 VDLQTGVPWVMCK---QDDAP--DPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSF 262
D VP C Q +A D + NG + F + P P E + ++
Sbjct: 202 KDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPAW 261
Query: 263 YQVYGDEARI----RSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY- 317
+ V+G R AE + + + +G V+ YM+HGGTNF A GY
Sbjct: 262 FDVWGQRHSTVDYKRPAEQLDWMLG------QGVSVSMYMFHGGTNFWYMNGANTAGGYR 315
Query: 318 -----YD-QAPLDEYG 327
YD APL E+G
Sbjct: 316 PQPTSYDYDAPLGEWG 331
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 8/84 (9%)
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQ 651
+G S ++K +F D +++ GKG WVNG+S+GR+W + PQ T
Sbjct: 512 FGESIQGKPAFHKGIFTVRQKGD-CFVDMSRWGKGAVWVNGKSLGRFWN--IGPQQT--- 565
Query: 652 SWYHIPRSFLKPTGNLLVLLEEEN 675
++P +LK N +V+ E E+
Sbjct: 566 --LYLPAPWLKEGENEIVVFEMED 587
>gi|433461907|ref|ZP_20419504.1| beta-galactosidase [Halobacillus sp. BAB-2008]
gi|432189486|gb|ELK46587.1| beta-galactosidase [Halobacillus sp. BAB-2008]
Length = 579
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 159/327 (48%), Gaps = 18/327 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T + ++N + SG+IHY R+ P+ W + K K GL+ V+T V WNLHEP+
Sbjct: 2 LTAENGQFLLNDKPFQILSGAIHYFRTVPEHWEDRLEKLKALGLNTVETYVPWNLHEPRR 61
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F+FSG D+ FI+ GLYV +R P+I EW GGLP WL +V RS +
Sbjct: 62 GEFEFSGLADIEGFIQTAADLGLYVIVRPAPYICAEWEMGGLPSWLLKDKDVVMRSSDPV 121
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH-----SFLEKGPPYVRW 204
+ +++ Y ++ LY + GGPII QIENEYG + +FL+K Y +
Sbjct: 122 YLSYVESYYKELLPKF-VPHLYQN-GGPIIAMQIENEYGAYGNDQKYLTFLKK--QYEQH 177
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSF 262
+ G ++ +Q PD G + + F ++ P + E W +
Sbjct: 178 GLDTFLFTSDGPDFI--EQGSLPDVTTTLNFGSKVEQAFERLDAFKTGSPKMVAEFWIGW 235
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKM-KGSYVNYYMYHGGTNFGRTASAYVLTGYYDQA 321
+ + E R A D A A+F M + + VN+YM+HGGTNFG A YY
Sbjct: 236 FDYWTGEHHTRDAGDAA---AVFRELMERKASVNFYMFHGGTNFGFMNGANHYDVYYPTI 292
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCL 348
+Y L G + E ++AVK L
Sbjct: 293 TSYDYDSLLTES-GAITEKYNAVKSIL 318
Score = 43.1 bits (100), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 51/190 (26%), Positives = 79/190 (41%), Gaps = 37/190 (19%)
Query: 490 FINGEFVGSAHGKHSDKSFTL---EKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL 546
++NG + + + K TL EK+ N + +L +G + G +LE R
Sbjct: 403 YVNGTYQKTIYINDEQKKTTLVFPEKI-------NTLEILVENMGRANYGEHLEDRKGLT 455
Query: 547 RNVSIQGAKELKDFSSFSWG-YQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKT 605
+N+ L + F W Y V L I+P S + +++
Sbjct: 456 KNIW------LGEQYFFEWEMYAVEL-------------DILPESYAKQEDSRYPKFFRG 496
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG 665
FDAP G I+ KG +VNG ++GRYW + P + Y +P LK G
Sbjct: 497 TFDAP-GRHDTYIDSEGFTKGNLFVNGFNLGRYWNT-----AGPQKRIY-VPGPLLKEQG 549
Query: 666 NLLVLLEEEN 675
N LV+LE E+
Sbjct: 550 NELVILELEH 559
>gi|357450861|ref|XP_003595707.1| Beta-galactosidase [Medicago truncatula]
gi|355484755|gb|AES65958.1| Beta-galactosidase [Medicago truncatula]
Length = 308
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 149/284 (52%), Gaps = 21/284 (7%)
Query: 418 TAKLDSVEQWEEYKEAIPTYD----ETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSD 473
T L + +WE E P D + + A+ LL+Q N T ASDYLWY + +
Sbjct: 19 TCSLGNPLKWEWASE--PMQDTLLGQGTFTASKLLDQKNVTAGASDYLWYMTEVVVNDTT 76
Query: 474 --SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVG 531
+S L+V++ G +++++ING + G S +SF ++ + L GTN +SLLSV +G
Sbjct: 77 VWGKSTLQVNAKGPIIYSYINGFWWGVYDSVPSTRSFVYDEDISLKRGTNIISLLSVTLG 136
Query: 532 LPDSGAYLERRVAGL-----RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
+ +++ + G+ + +SI+ + D S +W Y+VG+ G + F D S
Sbjct: 137 KSNCSGFIDMKETGIVGGHVKLISIEYPDNVLDLSKSTWSYKVGMNGMARK-FYDPKSNG 195
Query: 587 VPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQ 646
VPW S P+TWYKT F P GS+ V ++LI + +G+AWVNGQ IGRY +
Sbjct: 196 VPWIPRNVSIGVPMTWYKTTFKTPEGSNLVVLDLIGLQRGKAWVNGQCIGRYRLG----- 250
Query: 647 GTPSQSWYHIPRSFLKPTGNLLVLLEE--ENGYPPGISIDTVSV 688
S +Y +PR F N LVL EE P +S+D +S+
Sbjct: 251 ENSSFRYYAVPRPFFNKDVNTLVLFEELGLGKGPFNVSVDIISI 294
>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
Length = 624
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 153/326 (46%), Gaps = 24/326 (7%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
DG ++G ++ SG +HYPR W + A+ GL+ V T FW+ HEP+PGQ+
Sbjct: 36 DGAHFKLDGQPFVIRSGEMHYPRIPRAAWRERLRMARAMGLNTVTTYAFWSQHEPEPGQW 95
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
FSG+ DL FIK +GL V LR GP++ E +GG P WL G+ RS + +
Sbjct: 96 SFSGQNDLRTFIKTAAEEGLNVVLRPGPYVCAEVDFGGFPAWLMRTQGLRVRSMDARYLA 155
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA---- 206
RY + + A L +S+GGPI++ Q+ENEYG +H +L +R A
Sbjct: 156 ASARYFKRLAQ--EVADLQSSRGGPILMLQLENEYGSYGRDHDYLRAVRTQMRQAGFDAP 213
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVIN---ACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
D G + D P V+N + Q P P + E W ++
Sbjct: 214 LFTSDGGAGRLFEGGTLADVP-AVVNFGGGADDAQASVQELAAWRPHGPRMAGEYWAGWF 272
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------L 314
+G++ +S E+ A V ++ +G N YM+HGGT+FG A A
Sbjct: 273 DHWGEQHHTQSPEEAARTVERMLS--QGVSFNLYMFHGGTSFGWLAGANYSGSEPYQPDT 330
Query: 315 TGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y A LDE G PK+ L+++
Sbjct: 331 TSYDYDAALDEAG-RPTPKYFALRDV 355
>gi|313240094|emb|CBY32448.1| unnamed protein product [Oikopleura dioica]
Length = 677
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 146/298 (48%), Gaps = 14/298 (4%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG +T DG + ++G + SG+IHY R Q W + + GL+ + + WN
Sbjct: 2 GGEKVGLTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWN 61
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
LHE + G FDF G DLV F GL V R GP+I EW +GGLP WL P +
Sbjct: 62 LHEKERGNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHI 121
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
RS+ ++ + Y + ++ ++ A L S GGPII Q+ENEYG +++K ++
Sbjct: 122 RSNYCGYQAAVSSYFSKLLPLL--APLQHSNGGPIIAFQVENEYG----DYVDKDNEHLP 175
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTS 261
W A L +++ + + D + A + T S P+KP + TE W
Sbjct: 176 WLADL---MKSHGLFELFFISDGGHTIRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAG 232
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVL-TGYY 318
++ +G R D+ I K +G+ VN+YM+HGGTNFG A L GYY
Sbjct: 233 WFDYWG-HGRNLLNNDVFEKTLKEILK-RGASVNFYMFHGGTNFGFMNGAIELEKGYY 288
>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Cavia porcellus]
Length = 679
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 140/303 (46%), Gaps = 22/303 (7%)
Query: 25 GGGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
G G T GR+ + GH+ ++F GSIHY R + W + K K G + V T + WN
Sbjct: 89 GLGTASTTKGRAHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYIPWN 148
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVF 143
LHEPQ G+F FSG DL F+ GL+V LR GP+I E GGLP WL P
Sbjct: 149 LHEPQRGKFVFSGNLDLEAFVLLAAEIGLWVILRPGPYICAEIDLGGLPSWLLQNPKTQL 208
Query: 144 RSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
R+ F + Y + M + L GGP+I Q+ENEYG SF G
Sbjct: 209 RTTERTFVDAVDAYFDHL--MRRMVPLQYHHGGPVIAVQVENEYG----SFNRDGQYMAY 262
Query: 204 WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA--GPNS--------PDKPA 253
L L+ G+ ++ D D V + G G NS KP
Sbjct: 263 LKEAL---LKRGIVELLFTCDYYKDVVNGSLKGVLATVNLGSLGKNSFYQLLQVQSHKPI 319
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV 313
+ E W +Y +G +SA ++A+ V+ FI G N YM+HGGTNFG +A +
Sbjct: 320 LIMEYWVGWYDSWGLPHANKSAAEVAHTVSTFIK--NGISFNVYMFHGGTNFGFINAAGI 377
Query: 314 LTG 316
+ G
Sbjct: 378 VEG 380
>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
Length = 651
Score = 143 bits (360), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 150/313 (47%), Gaps = 30/313 (9%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
D + + N +IL SG++HY R P+ W + + K GL+ V+T V WNLHE G+F
Sbjct: 60 DYKFFLDNKELRIL-SGAMHYFRIVPEYWLDRLTRMKAAGLNTVETYVPWNLHEEIHGEF 118
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
F+G D+ RF+ + GL V LR GPFI EW +GGLP WL P + RS PF
Sbjct: 119 VFTGMLDIRRFVAIAEKVGLLVILRPGPFICSEWEFGGLPSWLLRDPQMDVRSTYRPFMD 178
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDL 212
+ Y +++ ++ + GGPII QIENEYG Y++ + D
Sbjct: 179 AARSYMRSLISELEDMQY--QYGGPIIAMQIENEYGSYSDDV-----NYMQELKNIMTD- 230
Query: 213 QTGVPWVMCKQDDA----PDPV------INACNGRQCGETFAGPNS--PDKPAIWTENWT 260
+GV ++ D+ P V N N + G F + P KP + E W+
Sbjct: 231 -SGVIEILFTSDNKHGLQPGRVPGVFMTTNFKNTNEGGRMFDKLHELQPGKPLMVMEFWS 289
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------VL 314
++ + ++ S E+ A V + +GS +N YM+HGGTNFG A +
Sbjct: 290 GWFDHWEEKHHTMSLEEYASAVEYILQ--QGSSINLYMFHGGTNFGFLNGANTEPYLPTV 347
Query: 315 TGYYDQAPLDEYG 327
T Y +PL E G
Sbjct: 348 TSYDYDSPLSEAG 360
>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
Length = 602
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 98/318 (30%), Positives = 151/318 (47%), Gaps = 31/318 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+TY +L+ G + +G++HY R P W + + GL+ V T + WN HE +
Sbjct: 9 LTYSEGTLLRAGRPHQVLAGTLHYFRVHPDQWHDRLERLAAMGLNTVDTYIAWNFHERRT 68
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+ F G RD+ RF++ Q GL V +R GP+I EW GGLP WL D PG+ RS P
Sbjct: 69 GEHRFDGWRDIERFVRTAQRTGLDVIVRPGPYICAEWDNGGLPAWLTDRPGMRPRSSYAP 128
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRW--- 204
+ + R+ +++ + A L A++GGP++ Q+ENEYG +H+ Y+RW
Sbjct: 129 YLDEVARWFDVLIP--RIADLQAARGGPVVAVQVENEYGSYGDDHA-------YMRWVHD 179
Query: 205 --AAKLAVDL---QTGVPWVMCKQDDAPDPVINACNGRQCGET--FAGPNSPDKPAIWTE 257
A + +L G +M P + A G + + +P + E
Sbjct: 180 ALAGRGVTELLYTADGPTELMLDGGSLPGVLATATLGSRADQAAQLLRTRRSGEPFLCAE 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY----- 312
W ++ +G++ RS A + +A KG V+ Y HGGTNFG A A
Sbjct: 240 FWNGWFDHWGEKHHTRSVGSAAAALDEILA--KGGSVSLYPAHGGTNFGLWAGANHADGA 297
Query: 313 ---VLTGYYDQAPLDEYG 327
+T Y AP+ E+G
Sbjct: 298 LQPTVTSYDSDAPIAEHG 315
>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
Length = 608
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 106/342 (30%), Positives = 166/342 (48%), Gaps = 58/342 (16%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
DG + I+G L SG++HY R P+ W + K K GL+ ++T V WNLHEP+ +
Sbjct: 26 DGANFTIDGKPVRLLSGAMHYFRVVPEYWRDRMLKMKAAGLNTLETYVPWNLHEPEKYTY 85
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
+F G DL R++ GL+V LR GP+I EW +GG+P WL V K
Sbjct: 86 NFEGILDLGRYLDIAHEVGLWVILRPGPYICAEWEFGGIPGWLAYV------------KE 133
Query: 153 HMKRYATMIVNMMKA--ARLYA-------SQGGPIILSQIENEYGMVEHS--FLEKGPPY 201
H++ M ++ ++ RL A + GGPII QIENEYG +S ++E+
Sbjct: 134 HVRTTRPMFIDPVEVWFGRLLAEVVPRQYTNGGPIIAVQIENEYGGFSNSTEYMERLKKI 193
Query: 202 V--RWAAKLAVD-------LQTGVPWVMCK---QDDAPDPVINACNGRQCGETFAGPNSP 249
+ R +L + G+P V+ Q++A D + ++ E P
Sbjct: 194 LESRGIVELLFTSDGKGALISGGIPGVLKTVNFQNNASDKL------QKLKEI-----QP 242
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
D+P + E WT ++ +G++ + E ++ ++F G+ VN+YM+HGGTNFG
Sbjct: 243 DRPMMVMEYWTGWFDHWGEDHHLYRLESESFVHSVFYILDAGASVNFYMFHGGTNFGFMN 302
Query: 310 SAY-----------VLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
A +T Y AP+ E G L PK+ ++E+
Sbjct: 303 GANTRYKSGGRTLPTITSYDYDAPISETGDL-TPKYFKIREI 343
>gi|333384209|ref|ZP_08475850.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826788|gb|EGJ99602.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
BAA-286]
Length = 632
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 123/415 (29%), Positives = 181/415 (43%), Gaps = 85/415 (20%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G + YDG+ + I SG +HYPR Q W + K GL+ V T VFWN HE
Sbjct: 34 GGDFVYDGKPVRI-------ISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNAHE 86
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P+PG++DF+ ++L +IK +GL V LR GP++ EW +GG P+WL +V + R D
Sbjct: 87 PEPGKWDFTEDKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEEMELRRD 146
Query: 147 NEPFKFHMKRYATMIVNMM--KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---Y 201
NE F +Y + +N + + L ++GGPII+ Q ENE+G + K P +
Sbjct: 147 NEQF----LKYTQLYINRLYQEVGNLQITKGGPIIMVQAENEFG--SYVSQRKDIPLEEH 200
Query: 202 VRWAAKLAVDLQT-------------------GVPWVMCKQD-----DAPDPVINACNGR 237
R+ AK+ L+T VP + + D V+N NG
Sbjct: 201 RRYNAKIVQQLKTAGFDIPSFTSDGSWLFEGGAVPGALPTANGESNIDNLKKVVNRYNGG 260
Query: 238 Q----CGETFAGPNSPDKPAIWTENWTSFY-QVYGDEARIRSAEDIAYHVALFIAKMKGS 292
Q E + G W +W + QV SA +A ++
Sbjct: 261 QGPYMVAEFYPG---------WLAHWVEPHPQV--------SATSVARQTEKYL--QNDV 301
Query: 293 YVNYYMYHGGTNFGRTASAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKE-LHS 342
+NYYM HGGTNFG T+ A LT Y AP+ E G + PK+ L+ +
Sbjct: 302 SINYYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPVSEAGWV-TPKFDSLRNVIQK 360
Query: 343 AVKLCLKPMLSGV-LVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNL 396
V L S + L+ + +L + +G + K NN + F L
Sbjct: 361 YVDYTLPEAPSAIDLIEIPSIRLDKVATLEG-------MDFKTTENNTPLTFEQL 408
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 603 YKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLK 662
YK F+ D IN+ GKG ++NG++IGRYW ++ PQ T +IP +LK
Sbjct: 546 YKGTFNLTETGD-TFINMEDWGKGIIFINGKNIGRYW--YVGPQQT-----LYIPGVWLK 597
Query: 663 PTGNLLVLLEEENGYPPGISIDTVSVTTLC 692
N +++ E+ N P + T V L
Sbjct: 598 KGENKIIIFEQLND-KPHTEVRTTKVPVLA 626
>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
Length = 786
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 107/351 (30%), Positives = 165/351 (47%), Gaps = 33/351 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG I+ + +HYPR W + I K G++ + VFWN+HE + G+FDF
Sbjct: 41 KTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDF 100
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G D+ FI+ Q GLYV +R GP++ EW GGLP+WL I R + F M
Sbjct: 101 TGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYF---M 157
Query: 155 KRYATMIVNM-MKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
+RY + + L +GGPII+ Q+ENEYG S+ E PYV + D
Sbjct: 158 ERYRIFAKKLGEQIGDLTIEKGGPIIMVQVENEYG----SYGED-KPYVSGIRDIIRD-- 210
Query: 214 TGVPWVMCKQDD---------APDPV--INACNGRQCGETFA--GPNSPDKPAIWTENWT 260
+G V Q D D V +N G F G P+ P + +E W+
Sbjct: 211 SGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGTGANIENEFKKLGELRPESPQMCSEFWS 270
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------L 314
++ +G R ++++ + + KG + YM HGGT++G A A +
Sbjct: 271 GWFDKWGGRHETRGSKEMVGGLKEMLD--KGISFSLYMTHGGTSWGHWAGANSPGFSPDV 328
Query: 315 TGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
T Y AP++E G + PK+ L+E+ S P + +N K+Q
Sbjct: 329 TSYDYDAPINEAGQV-TPKYMELREMLSGYSDKKLPSIPKEFPVINVPKIQ 378
Score = 47.8 bits (112), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 50/209 (23%), Positives = 95/209 (45%), Gaps = 24/209 (11%)
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
+R K ++S+L ++ FING+ +GS ++ +K+ L M + +
Sbjct: 413 YRTKTPAVPTQSILTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMKE----GDQLD 468
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFS-SFSWGYQVGLLGEKLQIFTDYG 583
+L +G + G ++ +G E + S + + G QV + + QI+T
Sbjct: 469 ILVEAMGRINFGRAIK---------DFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSD 519
Query: 584 S-RIVPWSRYGSSTHQPLT-WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVS 641
S ++ +Y Q + Y+ F+ D +NL + GKG+ +VNG +IGR+W
Sbjct: 520 SYQVQKDMKYVPLKDQKVPGCYRATFNLKKTGDTF-LNLETWGKGQVYVNGHAIGRFWK- 577
Query: 642 FLTPQGTPSQSWYHIPRSFLKPTGNLLVL 670
+ PQ T ++P +LK N +++
Sbjct: 578 -IGPQQT-----LYMPGCWLKKGENEIIV 600
>gi|227538632|ref|ZP_03968681.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
gi|227241551|gb|EEI91566.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
Length = 638
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 176/684 (25%), Positives = 268/684 (39%), Gaps = 142/684 (20%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N YDG++ I SG +HY R Q W + K GL+ V T VFWN HE
Sbjct: 40 NFVYDGKTTRI-------LSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEES 92
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F G DL FIK GL+V LR GP+ EW +GG P+WL + G+ R DN
Sbjct: 93 PGNWNFEGDHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDNA 152
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKG 198
F + K+Y + + L + GGPII+ Q ENE+G + EH
Sbjct: 153 KFLEYTKKYIDRLAK--EVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAK 210
Query: 199 PPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPV------INACNGRQCGETFAGPNSPDKP 252
A V L T + + P + N N ++ + + P
Sbjct: 211 IKKQLEEAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQYNNNQGPYMV 270
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYV------NYYMYHGGTNFG 306
A + W + AE A A IA+ Y+ NYYM HGGTNFG
Sbjct: 271 AEFYPGWLDHW-----------AEPFAKVDAGRIARQTEKYLQNDISFNYYMVHGGTNFG 319
Query: 307 RTASAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLV 357
T+ A +T Y AP+ E G PK+ ++
Sbjct: 320 FTSGANYNNKSDIQPDITSYDYDAPISEAG-WATPKYDSIRT------------------ 360
Query: 358 SMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFN 417
+ Q ++ V K +N + E+P + ++ + + F+
Sbjct: 361 -----------VIQKYADYTVPAVPK----------ANPVIEIPSIKLTAVANV----FD 395
Query: 418 TAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV 477
A K T +ET L NF EQ+N A+ Y+ Y+ +F P + +
Sbjct: 396 YA-----------KSGKTTINETPL--NF--EQLN---QANGYVLYSKQFNQ-PINGK-- 434
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
LK+ L +I+G VG ++ F +M I + + +L +G + G+
Sbjct: 435 LKIDGLRDFAVVYIDGTKVGEL-----NRVFKNYEMDIDIPFNSTLQILVENMGRINYGS 489
Query: 538 YLERRVAG------LRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
+ G + ++ I G ++ G +Q S+I
Sbjct: 490 EMIHNHKGIISPVLINDMEITGDWTMQQLPMDKVPDLAGKQTAAIQNTKTNASKI----- 544
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQ 651
+ T QP+ Y+ FD D I++ GKG ++NG +IGRYW + PQ T
Sbjct: 545 -AALTGQPVL-YQGTFDLKEIGDTF-IDMEKWGKGIVFINGINIGRYWKT--GPQHT--- 596
Query: 652 SWYHIPRSFLKPTGNLLVLLEEEN 675
+IP +LK N +V+ E+ N
Sbjct: 597 --LYIPAPYLKKGSNSIVIFEQLN 618
>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
Length = 587
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 142/297 (47%), Gaps = 22/297 (7%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K + GL+ V+T + WNLHEP+ GQF F G DL RF++
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR P+I EW +GGLP WL P I R + + + +Y ++
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIP-- 138
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHS-----FLEKGPPYVRWAAKLAVDLQTGVPWVM 220
+ L S+GGP+I QIENEYG + +L+ G ++ + + G M
Sbjct: 139 RLVPLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDG--LIKRGVDVLLFTSDGPTDGM 196
Query: 221 CKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
+ P + G + E F P+ P + E W ++ + R AED
Sbjct: 197 LQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDA 256
Query: 279 AYHVALFIAKMK-GSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQAPLDEYG 327
A A+F + + VN+YM+HGGTNFG A LT Y APL E G
Sbjct: 257 A---AVFKEMLDLNASVNFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 68/156 (43%), Gaps = 25/156 (16%)
Query: 527 SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
++ + +P +GA LE V + ++ +LKD+ + G ++ Q D+
Sbjct: 429 ALPIDVPAAGAKLEIVVENMGRINY--GPKLKDYKGITEGVRM-----NNQFLYDWSIYP 481
Query: 587 VPWSRYGSSTHQPL----------TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
+P ++ QPL T+Y+ F D I L GKG WVNG ++G
Sbjct: 482 LPLDHPNAAPFQPLEGPLEQQDRPTFYRGEFLVDDIGD-TFIRLDGWGKGVVWVNGFNLG 540
Query: 637 RYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
RYW QG Q+ ++P LK N +++ E
Sbjct: 541 RYW-----EQG--PQAALYLPGPLLKQGRNEILVFE 569
>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
Length = 587
Score = 142 bits (359), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 142/297 (47%), Gaps = 22/297 (7%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K + GL+ V+T + WNLHEP+ GQF F G DL RF++
Sbjct: 21 ILSGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVR 80
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR P+I EW +GGLP WL P I R + + + +Y ++
Sbjct: 81 IAGDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIP-- 138
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHS-----FLEKGPPYVRWAAKLAVDLQTGVPWVM 220
+ L S+GGP+I QIENEYG + +L+ G ++ + + G M
Sbjct: 139 RLVPLLTSKGGPVIAMQIENEYGSYGNDTAYLEYLKDG--LIKRGVDVLLFTSDGPTDGM 196
Query: 221 CKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
+ P + G + E F P+ P + E W ++ + R AED
Sbjct: 197 LQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAEDA 256
Query: 279 AYHVALFIAKMK-GSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQAPLDEYG 327
A A+F + + VN+YM+HGGTNFG A LT Y APL E G
Sbjct: 257 A---AVFKEMLDLNASVNFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310
Score = 41.6 bits (96), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 68/156 (43%), Gaps = 25/156 (16%)
Query: 527 SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
++ + +P +GA LE V + ++ +LKD+ + G ++ Q D+
Sbjct: 429 ALPIDVPAAGAKLEIVVENMGRINY--GPKLKDYKGITEGVRM-----NNQFLYDWSIYP 481
Query: 587 VPWSRYGSSTHQPL----------TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
+P ++ QPL T+Y+ F D I L GKG WVNG ++G
Sbjct: 482 LPLDHPNAAPFQPLEGPFEQQDRPTFYRGEFYVDDIGD-TFIRLDGWGKGVVWVNGFNLG 540
Query: 637 RYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
RYW QG Q+ ++P LK N +++ E
Sbjct: 541 RYW-----EQG--PQAALYLPGPLLKQGRNEILVFE 569
>gi|257876100|ref|ZP_05655753.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
gi|257810266|gb|EEV39086.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
Length = 591
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 146/290 (50%), Gaps = 31/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + V+T + WNLHEP+ G +DF
Sbjct: 8 EDFLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+ F+K+ QA GL V LR +I EW +GGLP WL + P + RS + F +
Sbjct: 68 EGMKDICAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K L + GGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 127 RNYFQVL--LPKLVPLQITHGGPVIMMQVENEYGSYG---MEKA--YLRQTKELMEEYGI 179
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G + E F + + P +
Sbjct: 180 DVP--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCM 237
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R+ +D+A V +A GS +N YM+HGGTNFG
Sbjct: 238 EYWDGWFNRWGEPIIKRAGQDLANEVKEMLA--VGS-LNLYMFHGGTNFG 284
>gi|413922057|gb|AFW61989.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
Length = 139
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 65/100 (65%), Positives = 79/100 (79%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V+YD R+++ING R+IL SGSIHYPRSTP+MWP L+ KAK+GGLDVVQT VFWN HEP
Sbjct: 28 VSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHEPVR 87
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYG 129
GQ+ F R DLVRF+K + GLYV LRIGP++ EW +G
Sbjct: 88 GQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFG 127
>gi|395846590|ref|XP_003795986.1| PREDICTED: beta-galactosidase-1-like protein 3 [Otolemur garnettii]
Length = 681
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 108/331 (32%), Positives = 157/331 (47%), Gaps = 29/331 (8%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++F GSIHY R + W + K K G + V T V WNLHEPQ G+FDFS
Sbjct: 110 LEGHKFLIFGGSIHYFRVPREYWQDRLLKLKACGFNTVTTYVPWNLHEPQRGKFDFSENL 169
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL F+ GL+V LR GP+I E GGLP WL P + R+ + F + +Y
Sbjct: 170 DLEAFVLLAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPELKLRTTSPGFLEAVDKYF 229
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
++ + L SQGGP+I Q+ENEYG K PY+ LQ G+
Sbjct: 230 DHLIP--RVIPLQYSQGGPVIALQVENEYGAYAQDV--KYMPYLH-----KTLLQRGIVE 280
Query: 219 VMCKQDDAPD----------PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
++ D + +N R+ + KP + E W ++ +G+
Sbjct: 281 LLLTSDGEKEVLKGHIKGVLATVNLKKLRKNAFSQLYEVQRGKPLLIMEFWVGWFDRWGE 340
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-------YVLTGYYDQA 321
I +A+++ Y+V+ I K + S+ N YM+HGGTNFG A V+T Y A
Sbjct: 341 SHHITNADNLEYNVSKLI-KHEISF-NLYMFHGGTNFGFMNGASYMGRHVSVVTSYDYDA 398
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPML 352
L E G + K+ L++L V + P L
Sbjct: 399 VLTEAGDYTE-KYFKLRKLLENVSVTPLPSL 428
>gi|306832839|ref|ZP_07465973.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
gi|304424978|gb|EFM28110.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
Length = 595
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 184/394 (46%), Gaps = 52/394 (13%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
S ++G + SGSIHY R P W + + K G + V+T V WNLHEP+ G+FDF
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G DL RF+ Q GLY +R P+I EW +GGLP WL + G+ RS ++ F +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWLLE-KGVRVRSQDKGFLQVV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
KRY +++ + +L QGG I++ Q+ENEYG S+ E Y+R ++ ++L
Sbjct: 127 KRYYEVLIPRLIKHQL--DQGGNILMFQVENEYG----SYGED-KVYLRELKQMMLELGL 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFAGPN------SPDKPAIW 255
P+ D P D ++ G + E FA P +
Sbjct: 180 EEPFFTS---DGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--------R 307
E W ++ +G+ R E++A V + ++ +N YM+HGGTNFG +
Sbjct: 237 MEFWDGWFNRWGEPVIKRDPEELADAV---MEAIEIGSINLYMFHGGTNFGFMNGCSARK 293
Query: 308 TASAYVLTGYYDQAPLDEYG-------LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
+T Y A LDE G +L+ ELH A L +KP ++ ++++
Sbjct: 294 QTDLPQVTSYDYDAILDEAGNPTKKFYILQHRLKNKYPELHYATPL-VKPTMAIKDIALS 352
Query: 361 FSKLQEAFIFQGSSEC--AAFLVNKDKRNNATVY 392
+K + + EC + + N + N +T Y
Sbjct: 353 -AKTNLVSVLEDIGECHTSFYPQNMEALNQSTGY 385
>gi|270295887|ref|ZP_06202087.1| beta-galactosidase [Bacteroides sp. D20]
gi|270273291|gb|EFA19153.1| beta-galactosidase [Bacteroides sp. D20]
Length = 1106
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 159/671 (23%), Positives = 262/671 (39%), Gaps = 139/671 (20%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ + VFWN HE QPG FDF+
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ DL F + Q +YV LR GP++ EW GGLP+WL I R + F +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + + A + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 477 IFEKAVAE--QVAGMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 204 --WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENW 259
WA+ + + W M N G + FA PD P + +E W
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A D+ + ++ KG + YM HGGTN+G A A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLS--KGISFSLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLK---------ELHSAVKLCLKPMLSGVLVSMNFSKL 364
+T Y AP+ E G PK+ L+ E + V +KP+ + S F+++
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWELRKALSKYMNGEKQAKVPALIKPIR---IPSFQFTEM 697
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
F +++ + ++ N F +++Y + LP+ KT + T D+
Sbjct: 698 APLFDNLPAAKKDRNIRTMEEYNQG---FGSILYR------TTLPEMKTPSLLTVN-DAH 747
Query: 425 EQWEEYKEAIPTYDETSLRANFL--LEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSS 482
+ Y + L ++ L++ N K F P + + V +
Sbjct: 748 D-----------YAQVFLDGKYIGKLDRRNGEK--------QLEFPACPKGARLDILVEA 788
Query: 483 LGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAY-LER 541
+G + +F G + + T ++ L D Y LE
Sbjct: 789 MGRINFGRAIKDFKG---------------ITQSVELTVDIDDRPFTCNLKDWEVYNLED 833
Query: 542 RVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLT 601
+N+ Q LKD + G RI R ++P
Sbjct: 834 TYDFYKNMKFQPIGSLKD---------------------ELGQRIPGCYRATFKVNKP-- 870
Query: 602 WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL 661
SD +N + GKG +VNG ++GR W + PQ T +IP +L
Sbjct: 871 -----------SDTF-LNFETWGKGLVYVNGHAMGRIWE--IGPQQT-----LYIPGCWL 911
Query: 662 KPTGNLLVLLE 672
K N +++ +
Sbjct: 912 KKGENEVIVFD 922
>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
Length = 583
Score = 142 bits (358), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 104/334 (31%), Positives = 166/334 (49%), Gaps = 33/334 (9%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ +T +G ++G + +G++HY R P W + K K GL+ V+T V WNLHEP
Sbjct: 2 STLTIEGDHFELDGEPFRILAGAMHYFRVHPAYWKDRLLKLKAMGLNTVETYVAWNLHEP 61
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
G+F F ++ R+I+ GLYV +R GP+I EW GGLP WL P + R
Sbjct: 62 HEGEFHFGDWLNIERYIELAGELGLYVIVRPGPYICAEWEMGGLPAWLLKDPQMKLRCMY 121
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK 207
+P+ + Y + + M + L +++GGPII Q+ENEYG + Y+++ +
Sbjct: 122 QPYLDAVGEYFSQL--MHRLVPLQSTRGGPIIAMQVENEYGSYGND-----TRYLKYLEE 174
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVIN---------ACN-GRQCGETFAGPNSPDK--PAIW 255
L Q GV ++ D D ++ A N G + G+ F P +
Sbjct: 175 LL--RQCGVDVLLFTADGVADEMMQYGSLPHLFKAVNFGNRPGDAFEKLREYQTGGPLLV 232
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--RTASAY- 312
E W ++ +G+ RSA ++A + ++ +G+ VN YM+HGGTNFG A+A+
Sbjct: 233 AEFWDGWFDHWGERHHTRSAGEVARVLDDLLS--EGASVNLYMFHGGTNFGFMNGANAFP 290
Query: 313 ------VLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y APL E G + PK+ ++E+
Sbjct: 291 SPHYTPTVTSYDYDAPLSECGNI-TPKYEAMREV 323
>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
Length = 778
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 152/326 (46%), Gaps = 38/326 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ I G L G +HYPR + W + +A+ GL+ V VFWN HE QPG+FDFS
Sbjct: 38 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 97
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ FI+ Q +GLYV LR GP++ EW +GG P WL + +RS + F + +
Sbjct: 98 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
RY + + + L + GG II+ Q+ENEYG +KG Y+ + +
Sbjct: 158 RYIKELGKQL--SPLTINNGGNIIMVQVENEYGSYA---ADKG--YLAAIRDMIKEAGFN 210
Query: 216 VPWVMCK--------QDDAPDPVINACNGRQCGETFAGPNSPDK--PAIWTENWTSFYQV 265
VP C + P +N G + F + K P E + +++
Sbjct: 211 VPLFTCDGGGQVEAGHTEGALPTLNGVFGE---DIFKVIDKYQKGGPYFVAEFYPAWFDE 267
Query: 266 YGDE----ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ- 320
+G A R AE + + ++ G V+ YM+HGGTNF T A GY Q
Sbjct: 268 WGRRHSSVAYERPAEQLDWMLS------HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQP 321
Query: 321 ------APLDEYGLLRQPKWGHLKEL 340
APL E+G PK+ +E+
Sbjct: 322 TSYDYDAPLGEWGNCY-PKYHAFREV 346
Score = 39.3 bits (90), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+++ GKG WVNG+S+GR+W + PQ T ++P +LK N +V+ E E+
Sbjct: 540 VDMSQWGKGAVWVNGKSLGRFWN--IGPQQT-----LYLPAPWLKEGENEIVVFEMED 590
>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
F0087]
gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
Length = 786
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 166/351 (47%), Gaps = 33/351 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG I+ + +HYPR W + I K G++ + VFWN+HE + G+FDF
Sbjct: 41 KTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFDF 100
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G D+ FI+ Q GLYV +R GP++ EW GGLP+WL I R + F M
Sbjct: 101 TGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYF---M 157
Query: 155 KRYATMIVNM-MKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
+RY + + L +GGPII+ Q+ENEYG S+ E PYV + D
Sbjct: 158 ERYRIFAQKLGEQIGDLTIEKGGPIIMVQVENEYG----SYGED-KPYVSAIRDIIRD-- 210
Query: 214 TGVPWVMCKQDD---------APDPV--INACNGRQCGETFA--GPNSPDKPAIWTENWT 260
+G V Q D D V +N G F G P+ P + +E W+
Sbjct: 211 SGFDKVTLFQCDWSSNFTKNGLDDLVWTMNFGTGANIENEFKKLGELRPESPQMCSEFWS 270
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------L 314
++ +G R ++++ + + KG + YM HGGT++G A A +
Sbjct: 271 GWFDKWGGRHETRGSKEMVGGLKEMLD--KGISFSLYMTHGGTSWGHWAGANSPGFSPDV 328
Query: 315 TGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQ 365
T Y AP++E G + PK+ L+E+ + P + + +N K+Q
Sbjct: 329 TSYDYDAPINEAGQV-TPKYMELREMLAGYSDKKLPSIPKEIPVINVPKIQ 378
Score = 48.1 bits (113), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 95/209 (45%), Gaps = 24/209 (11%)
Query: 465 FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVS 524
+R K ++SVL ++ FING+ +GS ++ +K+ L M + +
Sbjct: 413 YRTKTPAVPTQSVLTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMKE----GDQLD 468
Query: 525 LLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFS-SFSWGYQVGLLGEKLQIFTDYG 583
+L +G + G ++ +G E + S + + G QV + + QI+T
Sbjct: 469 ILVEAMGRINFGRAIK---------DFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSD 519
Query: 584 S-RIVPWSRYGSSTHQPLT-WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVS 641
S ++ +Y Q + Y+ F+ D +NL + GKG+ +VNG +IGR+W
Sbjct: 520 SYQVQKDMKYVPLKDQKVPGCYRATFNLKKTGD-TFLNLETWGKGQVYVNGHAIGRFWK- 577
Query: 642 FLTPQGTPSQSWYHIPRSFLKPTGNLLVL 670
+ PQ T ++P +LK N +++
Sbjct: 578 -IGPQQT-----LYMPGCWLKKGENEIIV 600
>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
Length = 777
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 152/326 (46%), Gaps = 38/326 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ I G L G +HYPR + W + +A+ GL+ V VFWN HE QPG+FDFS
Sbjct: 38 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 97
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ FI+ Q +GLYV LR GP++ EW +GG P WL + +RS + F + +
Sbjct: 98 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
RY + + + L + GG II+ Q+ENEYG +KG Y+ + +
Sbjct: 158 RYIKELGKQL--SPLTINNGGNIIMVQVENEYGSYA---ADKG--YLAAIRDMIKEAGFN 210
Query: 216 VPWVMCK--------QDDAPDPVINACNGRQCGETFAGPNSPDK--PAIWTENWTSFYQV 265
VP C + P +N G + F + K P E + +++
Sbjct: 211 VPLFTCDGGGQVEAGHTEGALPTLNGVFGE---DIFKVIDKYQKGGPYFVAEFYPAWFDE 267
Query: 266 YGDE----ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ- 320
+G A R AE + + ++ G V+ YM+HGGTNF T A GY Q
Sbjct: 268 WGRRHSSVAYERPAEQLDWMLS------HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQP 321
Query: 321 ------APLDEYGLLRQPKWGHLKEL 340
APL E+G PK+ +E+
Sbjct: 322 TSYDYDAPLGEWGNCY-PKYHAFREV 346
Score = 39.3 bits (90), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+++ GKG WVNG+S+GR+W + PQ T ++P +LK N +V+ E E+
Sbjct: 540 VDMSQWGKGAVWVNGKSLGRFWN--IGPQQT-----LYLPAPWLKEGENEIVVFEMED 590
>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
intestinalis]
Length = 658
Score = 142 bits (358), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 90/289 (31%), Positives = 146/289 (50%), Gaps = 14/289 (4%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T G++ ++G + SG++HY R + W + K K GL+ ++T V WNLHEP P
Sbjct: 58 LTAQGKTFKLDGKPMTIISGAVHYFRMPREYWRDRLMKMKACGLNTIETYVPWNLHEPIP 117
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+++F+G DLV FI YV LR GP+I EW +GGLP WL P + R+ P
Sbjct: 118 GKYNFTGDLDLVHFILLAHKLEFYVLLRPGPYICSEWEFGGLPSWLLRDPKMKVRTMYPP 177
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGP--PYVR--WA 205
+ + +Y ++ +K L GGPII Q++NEYG S+ + PY++
Sbjct: 178 YIAAVTKYFNYLLPFVKP--LQYQYGGPIIAFQLDNEYG----SYFKDADYLPYLKEFLQ 231
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFY 263
K ++L + + V+ N ++ F ++ PD P + E WT ++
Sbjct: 232 NKGIIELLFISDSIEGLRQQTIPGVLKTVNFKRMENHFTDLSNMQPDAPLMVMEFWTGWF 291
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY 312
+G++ I + ++ + + +G VN+YM+ GGTNFG AY
Sbjct: 292 DWWGEKHHILTVQEFGETLNEIFS--QGGSVNFYMFFGGTNFGFMNGAY 338
>gi|336063700|ref|YP_004558559.1| beta-galactosidase [Streptococcus pasteurianus ATCC 43144]
gi|334281900|dbj|BAK29473.1| beta-galactosidase precursor [Streptococcus pasteurianus ATCC
43144]
Length = 595
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 183/394 (46%), Gaps = 52/394 (13%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
S ++G + SGSIHY R P W + + K G + V+T V WNLHEP+ G+FDF
Sbjct: 8 ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G DL RF+ Q GLY +R P+I EW +GGLP WL + G+ RS ++ F +
Sbjct: 68 TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWLLE-KGVRVRSQDKDFLQVV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
KRY ++ + +L QGG I++ Q+ENEYG S+ E Y+R ++ ++L
Sbjct: 127 KRYYEALIPRLIKHQL--DQGGNILMFQVENEYG----SYGED-KVYLRELKQMMLELGL 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFAGPN------SPDKPAIW 255
P+ D P D ++ G + E FA P +
Sbjct: 180 EEPFFTS---DGPWHTALRAGSLIEDDVLVTGNFGSKAKENFASMEMFFQQYGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--------R 307
E W ++ +G+ R E++A V + ++ +N YM+HGGTNFG +
Sbjct: 237 MEFWDGWFNRWGEPVIKRDPEELADAV---MEAIEIGSINLYMFHGGTNFGFMNGCSARK 293
Query: 308 TASAYVLTGYYDQAPLDEYG-------LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMN 360
+T Y A LDE G +L+ ELH A L +KP ++ ++++
Sbjct: 294 QTDLPQVTSYDYDAILDEAGNPTKKFYILQHRLKNKYPELHYAAPL-VKPTMAIKDIALS 352
Query: 361 FSKLQEAFIFQGSSEC--AAFLVNKDKRNNATVY 392
+K + + EC + + N + N +T Y
Sbjct: 353 -AKTNLVSVLEDIGECHTSFYPQNMEALNQSTGY 385
>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
Length = 781
Score = 142 bits (357), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 165/354 (46%), Gaps = 34/354 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++ ++NG + + +HYPR W I K G++ + VFWN+HE + G+F+F
Sbjct: 36 KTFLLNGKPFTVKAAELHYPRIPRPYWEHRIKMCKALGMNAICIYVFWNIHEQKEGEFNF 95
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G D+ F + Q G+YV +R GP++ EW GGLP+WL I R + F +
Sbjct: 96 TGNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLRERDPYFMERV 155
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVR--WAAKLAV 210
K + + + A L +GGPII+ Q+ENEYG ++ ++ + +R W + +
Sbjct: 156 KIFEDKVAEQL--APLTIQRGGPIIMVQVENEYGSYGIDKQYVGEIRDMLRQGWGNDVKM 213
Query: 211 DLQTGVPWVMCKQDDAPDPVI---NACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQV 265
W + D +I N G F S PD P + +E W+ ++
Sbjct: 214 ---FQCDWSSNFTHNGLDDLIWTMNFGTGANIDNQFKKLKSLRPDAPLMCSEFWSGWFDK 270
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------LTGYYD 319
+G R A+D+ ++ ++ KG + YM HGGT+FG A A +T Y
Sbjct: 271 WGARHETRPAQDMVNNIDEMLS--KGISFSLYMTHGGTSFGHWAGANSPGFQPDVTSYDY 328
Query: 320 QAPLDEYG-------LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQE 366
AP++EYG LLR + + +S +L P L+ + +LQE
Sbjct: 329 DAPINEYGQATAKYQLLR-----NTLQKYSDKRLPAVPQAPAPLIRVPLFQLQE 377
>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
616]
Length = 624
Score = 142 bits (357), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 154/326 (47%), Gaps = 35/326 (10%)
Query: 41 GHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDL 100
G + SG +HY R Q W + K GL+ V T VFWNLHE +PG++DFSG ++L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 101 VRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATM 160
+I+ +G+ V LR GP++ EW +GG P+WL ++PG+ R DN F + K+Y
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDR 154
Query: 161 IVNMMKAARLYASQGGPIILSQIENEYGMV-----EHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPII+ Q ENE+G + SF E + +LA D
Sbjct: 155 LYQ--EVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLA-DAGFT 211
Query: 216 VP-------WVM---CKQDDAP--DPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
VP W+ C P + + N ++ + G P A + W S
Sbjct: 212 VPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLSH- 270
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------L 314
+G+ SA +IA ++ N+YM HGGTNFG T+ A L
Sbjct: 271 --WGEPFPQVSASEIARQTEAYL--QNNVSFNFYMVHGGTNFGFTSGANYDKKRDIQPDL 326
Query: 315 TGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y AP+ E G + PK+ ++ +
Sbjct: 327 TSYDYDAPISEAGWI-TPKYDSIRSV 351
>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
MP5ACTX9]
gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
Length = 607
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/335 (28%), Positives = 146/335 (43%), Gaps = 50/335 (14%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ +T D + +++G L SG +HYPR W + KA+ GL+ V FWN HE
Sbjct: 24 HRLTTDPQHFLLDGQPFQLISGEMHYPRIPRAAWRDRLRKARAMGLNAVTVYAFWNFHEE 83
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
+ G FDF+G+RD+ F++ Q +GL+V LR GP++ EW GG P WL P + RS +
Sbjct: 84 EEGHFDFTGQRDIAEFVRIAQQEGLFVILRPGPYVCAEWDLGGYPSWLLKSPAVNLRSLD 143
Query: 148 EPFKFHMKRYATMIVNMMKA-----ARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYV 202
RY MKA A L A++GGPI+ Q+ENEYG S Y+
Sbjct: 144 -------SRYIAAADKWMKALGQQLAPLQAAKGGPILAVQVENEYGSFPDSAQPNAQAYL 196
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGR-QCGETFAGPNSPDKPAI------- 254
++ +D G + D D + G + +S A+
Sbjct: 197 DRVHQMVLD--AGFKDSLLYTGDGADVLARGTFADLTAGIDYGTGDSARSIALYKKFRPN 254
Query: 255 -----------WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGT 303
W ++W + ++V ++ D+ G ++ YM HGGT
Sbjct: 255 TNIYTAEYWDGWFDHWGAKHEVVDASIHLKEVHDVL---------TSGGSISLYMLHGGT 305
Query: 304 NFGRTASAYV--------LTGYYDQAPLDEYGLLR 330
+FG A + +T Y AP+DE G LR
Sbjct: 306 SFGWMNGANIDHNHYEPDVTSYDYDAPIDEAGQLR 340
Score = 41.6 bits (96), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 39/196 (19%), Positives = 82/196 (41%), Gaps = 30/196 (15%)
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
LK+ L +++G+ VG+ D+ + + IN + +L G +
Sbjct: 420 TLKLDRLHSYARIYLDGKLVGTL-----DRRLDQDHIDLQINKPTQLDILVENTGRVNFT 474
Query: 537 AYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
+ AG+ + + ++++ +S ++ +P + + +
Sbjct: 475 EAIRTEQAGITHQVLLNGTPVENWQIYSLPFES-----------------IPTTGFSTKP 517
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
+ Y F+ T D +++ ++ KG WVNG ++GR+W + P GT ++
Sbjct: 518 CEGPCLYHATFNLTTPVD-TYLDVHTLSKGNVWVNGHNLGRFWK--IGPLGT-----LYL 569
Query: 657 PRSFLKPTGNLLVLLE 672
P S+LKP N + +LE
Sbjct: 570 PSSWLKPGPNKIEVLE 585
>gi|260912222|ref|ZP_05918774.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
gi|260633656|gb|EEX51794.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
Length = 627
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 109/339 (32%), Positives = 152/339 (44%), Gaps = 45/339 (13%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
DG+ + NG L SG +HY R W + K GL+ V T VFWN HE +PG++
Sbjct: 39 DGQ-FVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKW 97
Query: 93 DF-SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFK 151
D+ +G R+L +F+K +G+ V LR GP+ EW +GG P+WL G+V R+DN+PF
Sbjct: 98 DWKTGNRNLRQFVKTAAEEGMLVILRPGPYCCAEWDFGGYPWWLSKAKGLVIRADNQPFL 157
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAK 207
+ Y + + M+ L ++GGPII+ Q ENE+G + LE Y +
Sbjct: 158 DSCRVYINQLASQMR--DLQITKGGPIIMVQAENEFGSYVAQRKDVPLESHRAYSAKIKQ 215
Query: 208 LAVDLQTGVPWVMCKQD--------DAPDPVINACNG----RQCGETFAGPNSPDKPAIW 255
+D VP + P N N ++ + G P A +
Sbjct: 216 QLIDAGFDVPLFTSDGSWLFKGGTIEGALPTANGENDIEKLKKVVNEYNGGKGPYMVAEF 275
Query: 256 TENWTS-----FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTAS 310
W S F QV S E I A ++ G NYYM HGGTNFG T+
Sbjct: 276 YPGWLSHWAEPFPQV--------STESIVKQTAKYLE--NGVSFNYYMVHGGTNFGFTSG 325
Query: 311 AYV---------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
A LT Y AP+ E G PK+ L+ L
Sbjct: 326 ANYTTATNLQSDLTSYDYDAPISEAG-WNTPKYDALRAL 363
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 41/75 (54%), Gaps = 8/75 (10%)
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
T Y F+ T D +N+ + GKG ++NG ++GRYW + P Q+ Y +P F
Sbjct: 541 TLYSGTFNLDTTGDTF-LNMETWGKGIVFINGFNLGRYW------KRGPQQTLY-LPGCF 592
Query: 661 LKPTGNLLVLLEEEN 675
LK N +V+ E++N
Sbjct: 593 LKKGENKIVVFEQQN 607
>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
CL07T00C01]
gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
CL07T12C05]
Length = 624
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 154/326 (47%), Gaps = 35/326 (10%)
Query: 41 GHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDL 100
G + SG +HY R Q W + K GL+ V T VFWNLHE +PG++DFSG ++L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 101 VRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATM 160
+I+ +G+ V LR GP++ EW +GG P+WL ++PG+ R DN F + K+Y
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDR 154
Query: 161 IVNMMKAARLYASQGGPIILSQIENEYGMV-----EHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPII+ Q ENE+G + SF E + +LA D
Sbjct: 155 LYQ--EVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLA-DAGFT 211
Query: 216 VP-------WVM---CKQDDAP--DPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
VP W+ C P + + N ++ + G P A + W S
Sbjct: 212 VPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLSH- 270
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------L 314
+G+ SA +IA ++ N+YM HGGTNFG T+ A L
Sbjct: 271 --WGEPFPQVSASEIARQTEAYL--QNDVSFNFYMVHGGTNFGFTSGANYDKKRDIQPDL 326
Query: 315 TGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y AP+ E G + PK+ ++ +
Sbjct: 327 TSYDYDAPISEAGWI-TPKYDSIRSV 351
>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
610]
Length = 624
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 154/326 (47%), Gaps = 35/326 (10%)
Query: 41 GHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDL 100
G + SG +HY R Q W + K GL+ V T VFWNLHE +PG++DFSG ++L
Sbjct: 35 GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 101 VRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATM 160
+I+ +G+ V LR GP++ EW +GG P+WL ++PG+ R DN F + K+Y
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDR 154
Query: 161 IVNMMKAARLYASQGGPIILSQIENEYGMV-----EHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPII+ Q ENE+G + SF E + +LA D
Sbjct: 155 LYQ--EVGPLQCTKGGPIIMVQCENEFGSYVSQRKDISFEEHRSYNAKIKGQLA-DAGFT 211
Query: 216 VP-------WVM---CKQDDAP--DPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFY 263
VP W+ C P + + N ++ + G P A + W S
Sbjct: 212 VPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYMVAEFYPGWLSH- 270
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------L 314
+G+ SA +IA ++ N+YM HGGTNFG T+ A L
Sbjct: 271 --WGEPFPQVSASEIARQTEAYL--QNDVSFNFYMVHGGTNFGFTSGANYDKKRDIQPDL 326
Query: 315 TGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y AP+ E G + PK+ ++ +
Sbjct: 327 TSYDYDAPISEAGWI-TPKYDSIRSV 351
>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
ACS-120-V-Sch1]
Length = 597
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 150/317 (47%), Gaps = 34/317 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G + SG+IHY R P+ W + K G + V+T V WN HE G+FDF
Sbjct: 8 EEFMLDGKPLKILSGAIHYFRVLPEDWEHSLYNLKALGFNAVETYVPWNFHETVEGEFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG +D+ RFI +A GLYV +R P+I EW +GGLP WL P + RS + F ++
Sbjct: 68 SGTKDIKRFIHTAEAIGLYVIIRPSPYICAEWEFGGLPAWLLTKPNLRVRSRDPQFLEYV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+RY + ++ ++ GPI++ Q+ENEYG S+ E Y+ A++ D
Sbjct: 128 ERYYDRLFEILTPLQI--DHHGPILMMQVENEYG----SYGED-KTYLSALARMMRDRGV 180
Query: 215 GVP-------WVMC-------KQDDAPDPVINACNGRQCG--ETFAGPNSPDKPAIWTEN 258
VP W C + D P + + ++ F P + E
Sbjct: 181 TVPLFTSDGSWQQCLEAGSLAEADIIPTGNFGSKSQKRLDNLHKFHQQFGKTWPLMSMEF 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR----TASAYV- 313
W ++ +GD R ++++ + +K +N YM+HGGTNFG +A +
Sbjct: 241 WDGWFNRWGDRIITRQSDELIDEIG---EVLKRGSINLYMFHGGTNFGFWNGCSARGRID 297
Query: 314 ---LTGYYDQAPLDEYG 327
+T Y APLDE G
Sbjct: 298 LPQVTSYDYDAPLDEAG 314
>gi|443689405|gb|ELT91801.1| hypothetical protein CAPTEDRAFT_23316, partial [Capitella teleta]
Length = 596
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/332 (29%), Positives = 160/332 (48%), Gaps = 28/332 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
S ++G R +FSGS HY R+ P +W + + K GL+ V T V WN HEP+ GQF
Sbjct: 8 SFYLDGRRFKIFSGSFHYFRTHPLLWGDRLLRMKAAGLNTVMTYVPWNFHEPRKGQFTLG 67
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN-EPFKFHM 154
G DLV F+++VQ GLY+ +R GP+I EW +GG P WL P + R+ + P+ +
Sbjct: 68 GLYDLVSFMEQVQKVGLYLIVRPGPYICAEWEFGGFPSWLLRDPKMNLRTSSYTPYLNEV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEY---GMVEHSFLE-KGPPYVRWAAKLAV 210
K+Y + + ++ + GGPII Q+ENE+ G+ + +L+ Y W +
Sbjct: 128 KQYLSQLFAVL--TKFTYKHGGPIIAFQVENEFGSKGVHDPEYLQFLVTQYSSWNLNELL 185
Query: 211 DLQTGVPWVMCKQDDAPD--PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
G ++ PD IN + + P++P + TE W ++ +G+
Sbjct: 186 FTSDGKKYL--SNGTLPDVLATINLNDHAKEDLEELKEFQPERPLMVTEFWAGWFDHWGE 243
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGL 328
E ++ + ++ + VN+YM+ GGTNFG A L+ D+ E L
Sbjct: 244 EHHHYGTTELERELEAILS--LNASVNFYMFIGGTNFGFWNGANYLSYNKDK----EASL 297
Query: 329 L-----------RQPKWGHLKELHSAVKLCLK 349
L +WGH+K ++ ++ LK
Sbjct: 298 LGPTVTSYDYDAAVSEWGHVKPKYNVIRNLLK 329
>gi|225407896|ref|ZP_03761085.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
gi|225042575|gb|EEG52821.1| hypothetical protein CLOSTASPAR_05117 [Clostridium asparagiforme
DSM 15981]
Length = 590
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 148/318 (46%), Gaps = 38/318 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
++G L SG++HY R P+ W + K G + V+T + WN+HEP+ G+FDFS
Sbjct: 9 EFCLDGRPVKLLSGAVHYFRLMPEYWEDCLYNLKAMGFNTVETYIPWNIHEPEEGEFDFS 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G RD+ F++ + GL+V LR PFI EW GGLP WL P + R++ F ++
Sbjct: 69 GSRDVEAFVRLAGSMGLHVILRPSPFICAEWEMGGLPAWLLRYPDMKVRTNTPLFLVKVE 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y + + A L ++GGP+IL Q+ENEYG + Y+R L
Sbjct: 129 AYYRELFRHI--ADLQITRGGPVILMQVENEYGSFGND-----KEYLRRIKSLMERFGAE 181
Query: 216 VPWVMCKQDDAPDPVINACNGRQCG------------------ETFAGPNSPDKPAIWTE 257
VP+ D + D + A + + G E F + P + E
Sbjct: 182 VPFFTS--DGSWDAALEAGSLIEDGVLATANFGSRSDENLDVLEAFFKRHGRKWPLMCME 239
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR----TASAYV 313
W ++ + ++ R AED+A V + + + +N YM+ GGTNFG +A Y
Sbjct: 240 FWDGWFNRWREKIITRDAEDLAMEVRQLLER---ASINLYMFQGGTNFGFYNGCSARGYT 296
Query: 314 ----LTGYYDQAPLDEYG 327
+T Y A L E+G
Sbjct: 297 DLPQITSYNYDAILTEWG 314
>gi|329960238|ref|ZP_08298680.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
gi|328532911|gb|EGF59688.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
Length = 778
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/348 (28%), Positives = 159/348 (45%), Gaps = 21/348 (6%)
Query: 8 CLFGLLLTTIGGSDGGGGGGNNVTYD--GRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
C+FG+ + G + T++ ++ +++G I+ + +HY R + W I
Sbjct: 7 CIFGVAVLITAIFMGCSTSNKSQTFEVGNQTFLLDGKPFIIKAAEMHYTRIPAEYWEHRI 66
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
K G++ + FWN+HE +PG+FDF G+ D+ F + Q G+Y+ LR GP++ E
Sbjct: 67 QMCKALGMNTICIYAFWNIHEQRPGEFDFKGQNDIAEFCRLAQKNGMYIMLRPGPYVCSE 126
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W GGLP+WL I R+++ F K + I + A L A +GG II+ Q+EN
Sbjct: 127 WEMGGLPWWLLKKKDIQLRTNDPYFLERTKLFMNEIGKQL--ADLQAPRGGNIIMVQVEN 184
Query: 186 EYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPV---INACNGRQCG 240
EYG V ++ VR A V L W Q + D + IN G
Sbjct: 185 EYGGYAVNKEYIANVRDIVRGAGFTDVPL-FQCDWSSTFQLNGLDDLLWTINFGTGANID 243
Query: 241 ETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM 298
F PD P + +E W+ ++ +G + R AE + + + + + YM
Sbjct: 244 AQFKSLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAETMVSGLKDMLD--RNISFSLYM 301
Query: 299 YHGGTNFGRTASA------YVLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
HGGT FG A + + Y AP+ E G PK+ L+E+
Sbjct: 302 AHGGTTFGHWGGANCPPYSAMCSSYDYDAPISEAGWA-TPKYYKLREM 348
Score = 48.9 bits (115), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 89/199 (44%), Gaps = 27/199 (13%)
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
+VL + + + +G+ +G + S+ S TL L GT L+ M +
Sbjct: 421 TVLLIDEVHDWAQVYADGKLLGRLDRRRSENSLTLPA---LKAGTQLDILVEAMGRVNFD 477
Query: 536 GAYLERR--VAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
A +R+ + ++ + KELK + +S+ +K D+ R G
Sbjct: 478 YAIHDRKGITEKVELLTEESRKELKGWQVYSFPTDADFAAQK-----DF--------RKG 524
Query: 594 SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSW 653
+ P +Y+ F+ D V +++ + GKG WVNG++IGR+W + PQ T
Sbjct: 525 NKAEGP-AYYRASFNLKETGD-VFLDMQTWGKGMVWVNGKAIGRFWE--IGPQQT----- 575
Query: 654 YHIPRSFLKPTGNLLVLLE 672
++P +LK N +V+L+
Sbjct: 576 LYMPGCWLKKGKNEIVVLD 594
>gi|313245457|emb|CBY40184.1| unnamed protein product [Oikopleura dioica]
Length = 620
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 150/312 (48%), Gaps = 40/312 (12%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+++YD ++ + L SGS+HY R + W +AK K GL+ V T V WNLHEP+
Sbjct: 9 SLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGLNGVTTYVPWNLHEPE 68
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+F FSG D+V FI + L+V LR GP+I EW +GGLP WL + R++
Sbjct: 69 PGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPAWLLRDSFMKVRTNYS 128
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM------------------- 189
+ +KR+ ++ ++K + + GGPI+ Q+ENEYGM
Sbjct: 129 GYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYGMYAGQDGAHLNTLAELLKNE 186
Query: 190 --VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN 247
VE F G W + + G+ V K + P+ + + G +
Sbjct: 187 GIVEPLFTSDGSSV--WDNEKNTIYEDGLKSVNFKSN--PEKHLKSLRG----------H 232
Query: 248 SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR 307
P++P E W ++ +G+ + D ++ + I K S +N+YM+HGGTNFG
Sbjct: 233 FPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDV-ILDHKAS-LNFYMFHGGTNFGF 290
Query: 308 TASAYVLT-GYY 318
T + GYY
Sbjct: 291 TNGGLTIARGYY 302
>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
Length = 775
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 153/322 (47%), Gaps = 30/322 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ I G L G +HYPR + W + +A+ GL+ V VFWN HE QPG+FDFS
Sbjct: 36 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 95
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ FI+ Q +GLYV LR GP++ EW +GG P WL + +RS + F + +
Sbjct: 96 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 155
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQ 213
RY + + + L + GG II+ Q+ENEYG + +L ++ A V L
Sbjct: 156 RYIKELGKQL--SPLTINNGGNIIMVQVENEYGSYAADKEYLAAIRDMIK-EAGFNVPLF 212
Query: 214 T--GVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK--PAIWTENWTSFYQVYGDE 269
T G V + P +N G + F + K P E + +++ +G
Sbjct: 213 TCDGGGQVEAGHVEGALPTLNGVFGE---DIFKVVDKYQKGGPYFVAEFYPAWFDEWGRR 269
Query: 270 ----ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ----- 320
A R AE + + ++ G V+ YM+HGGTNF T A GY Q
Sbjct: 270 HSSVAYERPAEQLDWMLS------HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYD 323
Query: 321 --APLDEYGLLRQPKWGHLKEL 340
APL E+G PK+ +E+
Sbjct: 324 YDAPLGEWGNCY-PKYHAFREV 344
Score = 39.3 bits (90), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+++ GKG WVNG+S+GR+W + PQ T ++P +LK N +V+ E E+
Sbjct: 538 VDMSQWGKGAVWVNGKSLGRFWN--IGPQQT-----LYLPAPWLKEGENEIVVFEMED 588
>gi|307188518|gb|EFN73255.1| Beta-galactosidase [Camponotus floridanus]
Length = 624
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 155/326 (47%), Gaps = 42/326 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V Y+ +++G SGS HY R+ Q W + K + GL+ V T V W+LHEP+P
Sbjct: 34 VDYENNQFLLDGKPFRYVSGSFHYFRAPRQYWRDRLRKMRAAGLNAVSTYVEWSLHEPEP 93
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFW-LHDVPGIVFRSDNE 148
GQF+++G DL+ F+ Q + L+V LR GP+I E GGLP+W L + P I R+ +
Sbjct: 94 GQFNWAGDADLIEFLNIAQEEDLFVLLRPGPYICAERDLGGLPYWLLREAPDIKLRTKDA 153
Query: 149 PFKFHMKRYATMIVNMM--KAARLYASQGGPIILSQIENEYGM-----VEHSFLEKGPPY 201
F +YAT +N + K L GGPII+ QIENEYG E++ + K
Sbjct: 154 AF----MKYATAYLNQVLEKVKPLLRGNGGPIIMVQIENEYGSYNACDTEYTDMLKEIIV 209
Query: 202 VRWAAKLAVDLQTGVPWVMCKQDDAPDPV--------INACNGRQCGETFA--GP--NSP 249
+ +K + G + + P +N N Q + GP NS
Sbjct: 210 GKVGSKALLYTTDGASASLLRCGFVPGAYATIDFGTSVNVTNSFQSMRLYQPRGPLVNSE 269
Query: 250 DKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
P W +W +Q EA ++ ++ +AL G+ VN YM++GGTNFG T+
Sbjct: 270 FYPG-WLTHWGETFQRVKTEAVTKTLREM---LAL------GASVNIYMFYGGTNFGFTS 319
Query: 310 SAY--------VLTGYYDQAPLDEYG 327
A +T Y APL E G
Sbjct: 320 GANGGVGAYSPQITSYDYDAPLTEAG 345
>gi|317479674|ref|ZP_07938798.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
gi|316904175|gb|EFV26005.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
Length = 1106
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 160/364 (43%), Gaps = 50/364 (13%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ + VFWN HE QPG FDF+
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ DL F + Q +YV LR GP++ EW GGLP+WL I R + F +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + + A + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 477 IFEKAVAE--QVAGMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 204 --WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENW 259
WA+ + + W M N G + FA PD P + +E W
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A D+ + ++ KG + YM HGGTN+G A A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLS--KGISFSLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLK---------ELHSAVKLCLKPMLSGVLVSMNFSKL 364
+T Y AP+ E G PK+ L+ E + V +KP+ + S F+++
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWELRKALSKYMNGEKQAKVPALIKPIR---IPSFQFTEM 697
Query: 365 QEAF 368
F
Sbjct: 698 APLF 701
>gi|160890905|ref|ZP_02071908.1| hypothetical protein BACUNI_03350 [Bacteroides uniformis ATCC 8492]
gi|156859904|gb|EDO53335.1| glycosyl hydrolase family 35 [Bacteroides uniformis ATCC 8492]
Length = 1106
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 160/364 (43%), Gaps = 50/364 (13%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ + VFWN HE QPG FDF+
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ DL F + Q +YV LR GP++ EW GGLP+WL I R + F +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + + A + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 477 IFEKAVAE--QVAGMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 204 --WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENW 259
WA+ + + W M N G + FA PD P + +E W
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A D+ + ++ KG + YM HGGTN+G A A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLS--KGISFSLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLK---------ELHSAVKLCLKPMLSGVLVSMNFSKL 364
+T Y AP+ E G PK+ L+ E + V +KP+ + S F+++
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWELRKALSKYMNGEKQAKVPALIKPIR---IPSFQFTEM 697
Query: 365 QEAF 368
F
Sbjct: 698 APLF 701
>gi|423303842|ref|ZP_17281841.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|423307438|ref|ZP_17285428.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
gi|392687173|gb|EIY80470.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
CL03T00C23]
gi|392690047|gb|EIY83318.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
CL03T12C37]
Length = 1106
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 160/364 (43%), Gaps = 50/364 (13%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ + VFWN HE QPG FDF+
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ DL F + Q +YV LR GP++ EW GGLP+WL I R + F +
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + + A + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 477 IFEKAVAE--QVAGMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534
Query: 204 --WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENW 259
WA+ + + W M N G + FA PD P + +E W
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A D+ + ++ KG + YM HGGTN+G A A
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLS--KGISFSLYMTHGGTNWGHWAGANSPGFAPD 641
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLK---------ELHSAVKLCLKPMLSGVLVSMNFSKL 364
+T Y AP+ E G PK+ L+ E + V +KP+ + S F+++
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWELRKALSKYMNGEKQAKVPALIKPIR---IPSFQFTEM 697
Query: 365 QEAF 368
F
Sbjct: 698 APLF 701
>gi|355690250|gb|AER99094.1| galactosidase, beta 1 [Mustela putorius furo]
Length = 648
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 148/322 (45%), Gaps = 30/322 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HEPQP
Sbjct: 23 IDYHHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 82
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG +D+ FIK GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 83 GQYKFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 142
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ Y+R+ KL
Sbjct: 143 YLAAVDKWLGVLLPRMKP--LLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQKL- 195
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS-------------PDKPAI 254
G ++ D A +P + A G F GP + P P +
Sbjct: 196 FHYHLGKDVLLFTTDGALEPFLQCGALQGLYATVDF-GPGANITAAFEVQRKSEPKGPLV 254
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G E +A + +A +G+ VN YM+ GGTNF A +
Sbjct: 255 NSEFYTGWLDHWGQPHSTVKTEVVASSLHDILA--RGANVNLYMFIGGTNFAYWNGANMP 312
Query: 314 ----LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 313 YKAQPTSYDYDAPLSEAGDLTE 334
>gi|313231869|emb|CBY08981.1| unnamed protein product [Oikopleura dioica]
Length = 664
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 150/312 (48%), Gaps = 40/312 (12%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+++YD ++ + L SGS+HY R + W +AK K GL+ V T V WNLHEP+
Sbjct: 53 SLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGLNGVTTYVPWNLHEPE 112
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG+F FSG D+V FI + L+V LR GP+I EW +GGLP WL + R++
Sbjct: 113 PGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPPWLLRDSFMKVRTNYS 172
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM------------------- 189
+ +KR+ ++ ++K + + GGPI+ Q+ENEYGM
Sbjct: 173 GYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYGMYAGQDGAHLNTLAELLKNE 230
Query: 190 --VEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN 247
VE F G W + + G+ V K + P+ + + G +
Sbjct: 231 GIVEPLFTSDGSSV--WDNEKNTIYEDGLKSVNFKSN--PEKHLKSLRG----------H 276
Query: 248 SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR 307
P++P E W ++ +G+ + D ++ + I K S +N+YM+HGGTNFG
Sbjct: 277 FPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDV-ILDHKAS-LNFYMFHGGTNFGF 334
Query: 308 TASAYVLT-GYY 318
T + GYY
Sbjct: 335 TNGGLTIARGYY 346
>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
Length = 920
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/320 (31%), Positives = 152/320 (47%), Gaps = 46/320 (14%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ +++G + SG +HYPR + W + KAK GL+ + T VFWNLHEPQ G++DFS
Sbjct: 346 AFLLDGQPFQIISGEMHYPRVPREAWRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFS 405
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+ F+K Q +GL+V LR P++ EW +GG P+WL ++ G+ RS EP +++
Sbjct: 406 GNNDIAAFVKTAQEEGLWVILRPSPYVCAEWEFGGYPYWLQNIKGLEVRS-KEP--QYLQ 462
Query: 156 RYATMIVNMMKA-ARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDL 212
Y I+ + K A L + GG I++ Q+ENEYG + +L+ + +
Sbjct: 463 AYKNYIMQVGKQLAPLQVNHGGNILMVQVENEYGAYGSDREYLD---------INRRLFI 513
Query: 213 QTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPA----------------IWT 256
+ G ++ D P+P + G G+ F N DKPA
Sbjct: 514 EAGFDGLLYTCD--PEPFL--AKGNLPGKLFTSINGLDKPARIKQLIKQNNEGKGPYFVA 569
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
E + +++ +G + AE Y L G VN YM+HGGT A
Sbjct: 570 EWYPAWFDWWGTQHHKVPAE--KYTPGLDSVLSAGMSVNMYMFHGGTTRDFMNGANYNDQ 627
Query: 314 ------LTGYYDQAPLDEYG 327
++ Y APLDE G
Sbjct: 628 NPYEPQISSYDYDAPLDEAG 647
Score = 46.2 bits (108), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 86/199 (43%), Gaps = 29/199 (14%)
Query: 475 ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPD 534
E LK+ L FING+ + + S L+ L + + +L +G +
Sbjct: 730 EGALKIKDLRDYGLVFINGKRISVLDRRLKQDSIWLK----LPDEKIQLDILVENLGRIN 785
Query: 535 SGAYLERRVAGL-RNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
G YL + G+ VS G KEL + F KL F D S + S+
Sbjct: 786 YGPYLLKNKKGITEGVSFNG-KELTGWQMF-----------KLP-FNDLNSVALKNSKTL 832
Query: 594 SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSW 653
S P+ K F T D +NL + GKG WVNG ++GRYW + PQ T
Sbjct: 833 SGA--PVL-KKGTFSLQTVGD-TYLNLGNWGKGVVWVNGHNLGRYWN--IGPQQT----- 881
Query: 654 YHIPRSFLKPTGNLLVLLE 672
++P +LK GN +++LE
Sbjct: 882 LYVPVEWLKKGGNEIIVLE 900
>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
Length = 662
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 157/322 (48%), Gaps = 29/322 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++ GSIHY R + W + K + G + V T + WNLHE + G+FDFS
Sbjct: 71 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 130
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL ++ + GL+V LR GP+I E GGLP WL P R+ N+ F + +Y
Sbjct: 131 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 190
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
++ K L GGP+I Q+ENEYG SF +K Y+ + K L+ G+
Sbjct: 191 DHLIP--KILPLQYRHGGPVIAVQVENEYG----SF-QKDRNYMNYLKKAL--LKRGIVE 241
Query: 219 VMCKQDDAPDPVINACNGRQ--------CGETFAGPN--SPDKPAIWTENWTSFYQVYGD 268
++ DD I + NG ++F + DKP + E WT +Y +G
Sbjct: 242 LLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGS 301
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-------RTASAYVLTGYYDQA 321
+ +SAE+I + V FI+ G N YM+HGGTNFG V+T Y A
Sbjct: 302 KHIEKSAEEIRHTVYKFIS--YGLSFNMYMFHGGTNFGFINGGRYENHHISVVTSYDYDA 359
Query: 322 PLDEYGLLRQPKWGHLKELHSA 343
L E G + K+ L++L ++
Sbjct: 360 VLSEAGDYTE-KYFKLRKLFAS 380
>gi|298384202|ref|ZP_06993762.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
gi|383123627|ref|ZP_09944306.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|251839745|gb|EES67828.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
gi|298262481|gb|EFI05345.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
Length = 624
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 151/325 (46%), Gaps = 33/325 (10%)
Query: 41 GHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDL 100
G + SG +HY R Q W + K GL+ V T VFWNLHE +PG++DFSG ++L
Sbjct: 35 GEEIPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94
Query: 101 VRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATM 160
+I+ +G+ V LR GP++ EW +GG P+WL ++PG+ R DN F + K+Y
Sbjct: 95 AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDR 154
Query: 161 IVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTGV 216
+ + L ++GGPII+ Q ENE+G + LE+ Y D +
Sbjct: 155 LYE--EVGDLQCTKGGPIIMVQCENEFGSYVSQRKDIPLEEHRSYNAKIKGQLADAGFTI 212
Query: 217 P-------WVM---CKQDDAP--DPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
P W+ C P + + N ++ + G P A + W S
Sbjct: 213 PLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGDKGPYMVAEFYSGWLSH-- 270
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------LT 315
+G+ SA +IA ++ N+YM HGGTNFG T+ A LT
Sbjct: 271 -WGEPFPQVSASEIARQTEAYL--QNDVSFNFYMVHGGTNFGFTSGANYDKKRDIQPDLT 327
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKEL 340
Y AP+ E G L PK+ ++ +
Sbjct: 328 SYDYDAPISEAGWL-TPKYDSIRSV 351
>gi|326922161|ref|XP_003207320.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Meleagris
gallopavo]
Length = 643
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/330 (33%), Positives = 155/330 (46%), Gaps = 29/330 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ YD + +G SGSIHY R W + K K GLD +QT V WN HE Q
Sbjct: 18 IDYDCNCFVKDGRPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLDAIQTYVPWNYHETQM 77
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G +DFSG RDL F++ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 78 GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 137
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ ++++ +++ MK LY + GGPII+ Q+ENEYG S+ Y+R K+
Sbjct: 138 YLTAVEKWMGVLLPKMK-PHLYQN-GGPIIMVQVENEYG----SYFACDYDYLRSLLKI- 190
Query: 210 VDLQTGVPWVMCKQDDAPD------------PVINACNGRQCGETFAGPNS--PDKPAIW 255
G V+ D A ++ G F S P P +
Sbjct: 191 FRQHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPTGPLVN 250
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G + ++ IA + +A +G+ VN YM+ GGTNF A +
Sbjct: 251 SEFYTGWLDHWGHRHAVVPSQTIAKTLNEILA--RGANVNLYMFIGGTNFAYWNGANMPY 308
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E G L + K+ L+E+
Sbjct: 309 MSQPTSYDYDAPLSEAGDLTE-KYFALREV 337
>gi|322703307|gb|EFY94918.1| beta-calactosidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 645
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 102/329 (31%), Positives = 153/329 (46%), Gaps = 43/329 (13%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N TYD + +++G L G + R P W + + AK GL+ + + VFWN EP
Sbjct: 33 NFTYDRHNFLLDGVPIQLIGGQMDPQRIPPAYWTQRLQMAKAMGLNTIFSYVFWNNIEPT 92
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G +DF GR D+ RF++ Q +GLYV LR GP+I GE +GG P WL +PG+ R +N+
Sbjct: 93 EGSWDFDGRNDIARFLRLAQQEGLYVVLRPGPYICGEHEWGGFPSWLAQIPGMAVRQNNK 152
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF + Y + + A + SQGGP++++Q+ENEYG SF K Y+R A +
Sbjct: 153 PFLDASRNYLEQLGKHLAATHI--SQGGPVLMTQLENEYG----SF-GKDKAYLRAMADM 205
Query: 209 AVDLQTGVPW-----------------VMCKQDDAPDPVINACNGRQCGETFAGPNSPDK 251
G + ++ + D P A + T GP +
Sbjct: 206 LKANFDGFLYTNDGGGKSYLDGGSLHGILAETDGDPKTGFAARDQYVTDPTMLGPQLDGE 265
Query: 252 PAI-WTENWTSF----YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ W ++W+S Y +A R +D+ + +A + + YM+HGGTN+G
Sbjct: 266 YYVTWIDDWSSNSPYQYTSGRPDATKRVLDDLDWILA------GNNSFSIYMFHGGTNWG 319
Query: 307 RTASAY--------VLTGYYDQAPLDEYG 327
V T Y APLDE G
Sbjct: 320 FENGGIWVDNRLNAVTTSYDYGAPLDESG 348
Score = 40.4 bits (93), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 57/195 (29%), Positives = 85/195 (43%), Gaps = 33/195 (16%)
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGL-RN 548
++NG VG H+ + V L G + + LL +G D G L + G+ N
Sbjct: 448 YVNGARVGVVDKTHAAPASV---SVDLKQG-DVLQLLVENLGRIDYGQQLREQQKGIVGN 503
Query: 549 VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFD 608
V++ G L+ +S++S L + D S P + G + +YK F
Sbjct: 504 VTVGGDAILEGWSAYSL-----PLTDLPAALADENSE-TPEIKDGGAP----VFYKGTFG 553
Query: 609 APTG-----SDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFL-- 661
P G S ++L + KG WVNG +GRYWV P QS Y +P ++L
Sbjct: 554 LPAGVGNDLSGDTFLSLPNGVKGSVWVNGHHLGRYWVV------GPQQSLY-VPGAYLYG 606
Query: 662 --KPTGNLLVLLEEE 674
KP N +V+LE E
Sbjct: 607 GNKP--NHVVVLELE 619
>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 604
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 162/343 (47%), Gaps = 43/343 (12%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
+ +T+ + ++G + SG+IHY R P+ W + K K G + V+T + WNLHEP
Sbjct: 2 SRLTWKDQKYRLDGEEFRILSGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIPWNLHEP 61
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
+ G F F G D+ RFI+ GL+V +R P+I EW +GGLP WL + DN
Sbjct: 62 REGSFRFDGFADVARFIETAGRLGLHVIVRPSPYICAEWEFGGLPAWLLKSSMGLRCMDN 121
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWA 205
E + + Y +I ++ L S+GGPII Q+ENEYG + ++L Y+R
Sbjct: 122 EYLEKVDRYYDELIPRLLP---LLDSRGGPIIAVQVENEYGSYGNDTAYL----AYLRDG 174
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVI----------NACNGRQCGETFAGPNS--PDKPA 253
++ GV ++ D D ++ G + E+ A D+P
Sbjct: 175 L-----IRRGVDCLLFTSDGPTDEMLLGGTVEGLHATVNFGSRVAESLAKYREYRQDEPL 229
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY- 312
+ E W ++ + +R A D+A + + + G+ VN YM+HGGTNFG + A
Sbjct: 230 MVMEYWLGWFDHWRKPHHVREAGDVANVLDEMLEQ--GASVNLYMFHGGTNFGFYSGANY 287
Query: 313 ------VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLK 349
+T Y APL E WG + E + A++ L+
Sbjct: 288 GEHYEPTITSYDYDAPLTE--------WGDITEKYKAIRSVLE 322
>gi|313241117|emb|CBY33414.1| unnamed protein product [Oikopleura dioica]
Length = 608
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 151/316 (47%), Gaps = 31/316 (9%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SGS+HY R + W + K K GL+ VQT + WNLHEP+ G F F D+ F+K
Sbjct: 19 ILSGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDELDVSEFLK 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR-SDNEPFKFHMKRYATMIVNM 164
+ GLYV +R GP+I EW +GG P WL ++ R + +E + ++ + T++ +
Sbjct: 79 IAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQNWFTVLFSQ 138
Query: 165 MKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQD 224
++ + S+GGPII Q+ENEY K Y+ W L D+ + +
Sbjct: 139 LRDHQW--SRGGPIISIQVENEYASY-----NKDSEYLPWVKNLLTDVGKCFLLKIINET 191
Query: 225 D--------APDPVINACNGRQCGETFAGPN--SPDKPAIWTENWTSFYQVYGDEARIRS 274
+ PD + A N + G F + P++P + TE W ++ +G + +
Sbjct: 192 NFFLKGAHLLPDTFLTA-NFQSVGNAFEVLDKLQPNRPKMVTEFWAGWFDHWGQQGH-ST 249
Query: 275 AEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVL----------TGYYDQAPLD 324
++ + GS VN YM+HGGT+FG A + L T Y APL
Sbjct: 250 LSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTTSYDYDAPLS 309
Query: 325 EYGLLRQPKWGHLKEL 340
E G L + KW +E+
Sbjct: 310 ESGDLTE-KWNVTREI 324
>gi|400603388|gb|EJP70986.1| glycoside hydrolase family 35 [Beauveria bassiana ARSEF 2860]
Length = 631
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 154/333 (46%), Gaps = 51/333 (15%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N +Y+ ++NG + G + R P+ W + A+ GL+ + + ++WNLHEP
Sbjct: 28 NFSYNRHQFLLNGQPYQIIGGQMDPQRIPPEYWTHRLKMARAMGLNTIFSYLYWNLHEPS 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++DF GR ++ F + Q +GL V LR GP+I GE +GG P WL VPG+ R +N
Sbjct: 88 PGEWDFQGRNNVAEFFRLAQEEGLKVVLRPGPYICGERDWGGFPAWLSQVPGMAVRQNNG 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
PF K Y + + + ++ +QGGPI+++Q+ENEYG SF + A L
Sbjct: 148 PFLDAAKSYINRVGKELGSLQI--TQGGPILMTQLENEYG----SFGTD----KEYLAAL 197
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQC-----GETFAGPNSPDK---------PAI 254
A L + D + G++ G + DK P +
Sbjct: 198 AAMLHDNFDVFLYTNDGGGKSYLEGGQFHGVLAVIDGDSKTGFEARDKYVTDPTSLGPQL 257
Query: 255 -------WTENWTSFY---QVYGDEARI-RSAEDIAYHVALFIAKMKGSY-VNYYMYHGG 302
W + W S Y Q G + +I ++ D+ + +A G+Y + YM+HGG
Sbjct: 258 NGEYYITWIDQWGSDYSHQQSSGSQTKIDKAVGDLDWTLA-------GNYSFSIYMFHGG 310
Query: 303 TNFGRTAS--------AYVLTGYYDQAPLDEYG 327
TNFG A V T Y APLDE G
Sbjct: 311 TNFGFENGGIRDDGPLAAVTTSYDYGAPLDESG 343
>gi|373953412|ref|ZP_09613372.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890012|gb|EHQ25909.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 610
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 142/306 (46%), Gaps = 19/306 (6%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ +++G + SG +HYPR + W + AK GL+ + T VFWNLHEPQ G FDFS
Sbjct: 34 AFMLDGKPFQMISGEMHYPRVPREAWRARMKMAKAMGLNTIGTYVFWNLHEPQKGHFDFS 93
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+ F+K + +GL+V LR P++ EW +GG P+WL + G+V RS + +
Sbjct: 94 GNNDVAEFVKIAKEEGLWVILRPSPYVCAEWEFGGYPYWLQNEKGLVVRSMEAQYIAEYR 153
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQ 213
+Y + + A L + GG I++ QIENEYG + ++L + AA L
Sbjct: 154 KYINEVGKQL--APLQINHGGNILMVQIENEYGSYGSDKAYLALNQQLFK-AAGFDGLLY 210
Query: 214 TGVPWVMCKQDDAPD--PVINACNGRQCGETFAGPNSPDKPAIWTENW-TSFYQVYGDEA 270
T P K P P IN + + N K + W +++ +G
Sbjct: 211 TCDPGADVKNGHLPGLMPAINGVDDPAKVKKIINENHNGKGPYYIAEWYPAWFDWWGASH 270
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------LTGYYDQA 321
+AE + +A G +N YM+HGGT A +T Y A
Sbjct: 271 HTVAAEKYVGRLDTVLA--AGISINMYMFHGGTTRAFMNGANYKDETPYEPQITSYDYDA 328
Query: 322 PLDEYG 327
PLDE G
Sbjct: 329 PLDEAG 334
Score = 44.7 bits (104), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 51/196 (26%), Positives = 79/196 (40%), Gaps = 26/196 (13%)
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
VLK+S L +NG+ +G+ + S T V L G + +L +G + G
Sbjct: 419 VLKLSDLRDYAVIMVNGKTIGTLDRRLKQDSMT----VTLPAGPVILDILVENMGRINFG 474
Query: 537 AYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
YL G+ E+ + F L + QI G +
Sbjct: 475 KYLLENKKGITKAVFFNGAEINKWQMFGLS-----LSDSKQIAFKAG--------VAAGG 521
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
+ P T+ K F+ +D I+L GKG WVNG ++GRYW P Q+ Y +
Sbjct: 522 NLP-TFKKGTFNLQKIAD-TYIDLSKWGKGVVWVNGHNLGRYW------NIGPEQTLY-L 572
Query: 657 PRSFLKPTGNLLVLLE 672
P +LK N +++ E
Sbjct: 573 PAEWLKKGANEIIVFE 588
>gi|86142033|ref|ZP_01060557.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
gi|85831596|gb|EAQ50052.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
MED217]
Length = 620
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 161/361 (44%), Gaps = 28/361 (7%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
Q F L+L + + + S + NG ++SG +HY R + W
Sbjct: 2 QVVRTNFFALVLIVLSFGFAQAQDDASFKIENGSFVYNGKPTPIYSGEMHYERIPKEYWR 61
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF-SGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K GL+ + T VFWN H P PG +DF SG R++ FIK + + ++V LR GP+
Sbjct: 62 HRIQMMKAMGLNTIATYVFWNYHNPAPGVWDFESGNRNVAEFIKIAKEEEMFVILRPGPY 121
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILS 181
GEW +GG P++L ++PG+ R +N F K Y + + A L + GG II++
Sbjct: 122 ACGEWEFGGYPWFLQNIPGLKVRENNAQFLAACKEYINELAK--QVAPLQVNNGGNIIMT 179
Query: 182 QIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK-----QDDAPDPVIN 232
Q+ENE+G E E Y K+ D P+ + + + V+
Sbjct: 180 QVENEFGSYVAQREDIAPEDHKAYKEAIFKMLKDAGFQAPFFTSDGAWLFEGGSLEGVLP 239
Query: 233 ACNGR----QCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAK 288
NG + N+ + P + E + + + + SA DIA +++
Sbjct: 240 TANGEGNIDNLKKVVNKFNNNEGPYMVAEFYPGWLDHWAEPFVKISASDIAKQTEVYLK- 298
Query: 289 MKGSYVNYYMYHGGTNFGRTASAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKE 339
G N+YM HGGTNFG T+ A +T Y AP+ E G + PK+ ++
Sbjct: 299 -NGVNFNFYMAHGGTNFGFTSGANYNDEHDIQPDITSYDYDAPISEAGWVT-PKYDSIRA 356
Query: 340 L 340
L
Sbjct: 357 L 357
Score = 42.7 bits (99), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 88/217 (40%), Gaps = 33/217 (15%)
Query: 460 YLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLING 519
Y+ Y RF + + LKV L ++NG+ VG ++ F +M I
Sbjct: 416 YVLYKKRFTQPITGT---LKVPGLRDFATVYVNGKKVGEL-----NRVFNSYEMPIKIPF 467
Query: 520 TNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFS-SFSWGYQVGLLGEKLQI 578
++ +L +G + GA + + G I + D+ + W E ++
Sbjct: 468 NGSLEILVENMGRINYGAEIVNNLKG-----ITAPVSINDYEITGGWEMYKAPFAEVPEV 522
Query: 579 FTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRY 638
+ T +P+ Y FD D +N+ MGKG +VNG ++GRY
Sbjct: 523 INSTEVK----------TGRPVV-YSGSFDLKKQGD-TFLNMSEMGKGIVFVNGHNLGRY 570
Query: 639 WVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
W + P Q+ Y +P +LK GN + + E+ N
Sbjct: 571 W------KVGPQQTLY-VPGCWLKKKGNTITIFEQLN 600
>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
Length = 649
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 157/322 (48%), Gaps = 29/322 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++ GSIHY R + W + K + G + V T + WNLHE + G+FDFS
Sbjct: 58 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 117
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL ++ + GL+V LR GP+I E GGLP WL P R+ N+ F + +Y
Sbjct: 118 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 177
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
++ K L GGP+I Q+ENEYG SF +K Y+ + K L+ G+
Sbjct: 178 DHLIP--KILPLQYRHGGPVIAVQVENEYG----SF-QKDRNYMNYLKKAL--LKRGIVE 228
Query: 219 VMCKQDDAPDPVINACNGRQ--------CGETFAGPN--SPDKPAIWTENWTSFYQVYGD 268
++ DD I + NG ++F + DKP + E WT +Y +G
Sbjct: 229 LLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGS 288
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-------RTASAYVLTGYYDQA 321
+ +SAE+I + V FI+ G N YM+HGGTNFG V+T Y A
Sbjct: 289 KHIEKSAEEIRHTVYKFIS--YGLSFNMYMFHGGTNFGFINGGRYENHHISVVTSYDYDA 346
Query: 322 PLDEYGLLRQPKWGHLKELHSA 343
L E G + K+ L++L ++
Sbjct: 347 VLSEAGDYTE-KYFKLRKLFAS 367
>gi|325567414|ref|ZP_08144081.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
gi|325158847|gb|EGC70993.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
Length = 591
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 144/290 (49%), Gaps = 31/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + V+T + WNLHEP+ G +DF
Sbjct: 8 EDFLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+ F+K+ Q GL V LR +I EW +GGLP WL + P + RS + F +
Sbjct: 68 EGMKDICAFVKQAQTLGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K L + GGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 127 RNYFQVL--LPKLVPLQITHGGPVIMMQVENEYGSYG---MEKA--YLRQTKELMEEYGI 179
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G + E F + + P +
Sbjct: 180 DVP--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCM 237
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 238 EYWDGWFNRWGEPIIKRDGQDLANEVKEMLA--VGS-LNLYMFHGGTNFG 284
>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
Length = 777
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 152/322 (47%), Gaps = 30/322 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ I G L G +HYPR + W + +A GL+ V VFWN HE QPG+FDFS
Sbjct: 38 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFS 97
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ FI+ Q +GLYV LR GP++ EW +GG P WL + +RS + F + +
Sbjct: 98 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQ 213
RY + + + L + GG II+ Q+ENEYG + +L ++ A V L
Sbjct: 158 RYIKELGKQL--SPLTINNGGNIIMVQVENEYGSYAADKEYLAAIRDMIK-EAGFNVPLF 214
Query: 214 T--GVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK--PAIWTENWTSFYQVYGDE 269
T G V + P +N G + F + K P E + +++ +G
Sbjct: 215 TCDGGGQVEAGHVEGALPTLNGVFGE---DIFKVVDKYQKGGPYFVAEFYPAWFDEWGRR 271
Query: 270 ----ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ----- 320
A R AE + + ++ G V+ YM+HGGTNF T A GY Q
Sbjct: 272 HSSVAYERPAEQLDWMLS------HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYD 325
Query: 321 --APLDEYGLLRQPKWGHLKEL 340
APL E+G PK+ +E+
Sbjct: 326 YDAPLGEWGNCY-PKYHAFREV 346
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+++ GKG WVNG+S+GR+W + PQ T ++P +LK N +V+ E E+
Sbjct: 540 VDMSQWGKGAVWVNGKSLGRFWN--IGPQQT-----LYLPAPWLKEGENEIVVFEMED 590
>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
CL03T12C18]
Length = 775
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 152/322 (47%), Gaps = 30/322 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ I G L G +HYPR + W + +A GL+ V VFWN HE QPG+FDFS
Sbjct: 36 TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFS 95
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ FI+ Q +GLYV LR GP++ EW +GG P WL + +RS + F + +
Sbjct: 96 GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 155
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQ 213
RY + + + L + GG II+ Q+ENEYG + +L ++ A V L
Sbjct: 156 RYIKELGKQL--SPLTINNGGNIIMVQVENEYGSYAADKEYLAAIRDMIK-EAGFNVPLF 212
Query: 214 T--GVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK--PAIWTENWTSFYQVYGDE 269
T G V + P +N G + F + K P E + +++ +G
Sbjct: 213 TCDGGGQVEAGHVEGALPTLNGVFGE---DIFKVVDKYQKGGPYFVAEFYPAWFDEWGRR 269
Query: 270 ----ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ----- 320
A R AE + + ++ G V+ YM+HGGTNF T A GY Q
Sbjct: 270 HSSVAYERPAEQLDWMLS------HGVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSYD 323
Query: 321 --APLDEYGLLRQPKWGHLKEL 340
APL E+G PK+ +E+
Sbjct: 324 YDAPLGEWGNCY-PKYHAFREV 344
Score = 39.7 bits (91), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+++ GKG WVNG+S+GR+W + PQ T ++P +LK N +V+ E E+
Sbjct: 538 VDMSQWGKGAVWVNGKSLGRFWN--IGPQQT-----LYLPAPWLKEGENEIVVFEMED 588
>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
Length = 583
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/334 (31%), Positives = 157/334 (47%), Gaps = 36/334 (10%)
Query: 28 NNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEP 87
++YD + L SG+IHY R P W + K K G + ++T V WNLHEP
Sbjct: 2 TTLSYDQGQFTMGDRPIQLISGAIHYFRVVPAYWEDRLRKIKAMGCNCIETYVAWNLHEP 61
Query: 88 QPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
+ G+F F G D+ F++ GLYV +R P+I EW +GGLP WL + R ++
Sbjct: 62 REGEFHFEGMSDVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWLLK-DDMRLRCND 120
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWA 205
F + Y ++ + L A++GGPII QIENEYG + ++L+
Sbjct: 121 PRFLEKVAAYYDALLPQLTP--LLATKGGPIIAVQIENEYGSYGNDQAYLQ--------- 169
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDP---------VINACN-GRQCGETFAGPNS--PDKPA 253
A+ A+ ++ GV ++ D D V+ N G + E F PD P
Sbjct: 170 AQRAMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPL 229
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY- 312
+ E W ++ + ++ R AED A + + G+ VN+YM HGGTNFG + A
Sbjct: 230 MCMEYWNGWFDHWFEQHHTRDAEDAARVLDDMLG--MGASVNFYMVHGGTNFGFGSGANH 287
Query: 313 ------VLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y A + E G L PK+ +E+
Sbjct: 288 SDKYEPTVTSYDYDAAISEAGDL-TPKYHAFREV 320
>gi|256840666|ref|ZP_05546174.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
gi|256737938|gb|EEU51264.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
Length = 768
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 150/333 (45%), Gaps = 45/333 (13%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + SG +HYPR Q W + + GL+ V T VFWNLHE +PG++DF G +
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
+L +I+ +GL V LR GP++ EW +GG P+WL ++PG+ R DN F K Y
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---YVRWAAKLA---VDL 212
+ + L S+GGPII+ Q ENE+G + K P + R+ AK+ D
Sbjct: 159 DKLYE--QVGDLQVSKGGPIIMVQAENEFG--SYVAQRKDIPLEEHRRYNAKIKRQLADA 214
Query: 213 QTGVPWV------MCKQDDAPDPV------INACNGRQCGETFAGPNSPDKPAI----WT 256
VP + + P + N N ++ + G P A W
Sbjct: 215 GFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
+W + D R E + F N+YM HGGTNFG T+ A
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKK 325
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 326 HDIQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|373953405|ref|ZP_09613365.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
gi|373890005|gb|EHQ25902.1| glycoside hydrolase family 35 [Mucilaginibacter paludis DSM 18603]
Length = 608
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 152/316 (48%), Gaps = 37/316 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ +++G + SG +HYPR + W + AK GL+ + T VFWNLHEPQ G+FDF
Sbjct: 32 EAFLLDGKPFQMISGEMHYPRVPRESWRARMKMAKAMGLNTIGTYVFWNLHEPQKGKFDF 91
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G D+ F++ + +GL+V LR P++ EW +GG P+WL + G+V RS + +
Sbjct: 92 TGNNDVAEFVRIAKQEGLWVILRPSPYVCAEWEFGGYPYWLQNEKGLVVRSKEAQY---L 148
Query: 155 KRYATMIVNMMKA-ARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVR 203
K Y + I + K A L + GG I++ QIENEYG + + F E G +
Sbjct: 149 KEYESYIKEVGKQLAPLQINHGGNILMVQIENEYGSYGSDKDYLAINQKLFKEAGFDGLL 208
Query: 204 WAAKLAVDLQTG-VPWVMCKQD--DAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
+ A DL G +P ++ + D PD V + G+ P A W W
Sbjct: 209 YTCDPAADLVNGHLPGLLPAVNGIDNPDKVKQIISQNHNGK------GPYYIAEWYPAW- 261
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-------RTASAY- 312
+ +G + A + + +A G +N YM+HGGT G + S Y
Sbjct: 262 --FDWWGTKHHTVPAAEYTGRLDSVLA--AGISINMYMFHGGTTRGFMNGANYKDTSPYE 317
Query: 313 -VLTGYYDQAPLDEYG 327
++ Y APLDE G
Sbjct: 318 PQVSSYDYDAPLDEAG 333
Score = 40.8 bits (94), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 44/196 (22%), Positives = 83/196 (42%), Gaps = 26/196 (13%)
Query: 477 VLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSG 536
+LK+ L +NG+ VG+ + + S ++ V G + +L +G + G
Sbjct: 418 LLKIKELRDYAVVMLNGKTVGTLDRRLNQDSLQIKLPV----GAVVLDILVENLGRINFG 473
Query: 537 AYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSST 596
YL + G+ + +++ ++ +S + E + + + GSST
Sbjct: 474 KYLLQNKKGITEKVLFNTQQVNNWQMYSLPFN---HAEAINL------------KSGSST 518
Query: 597 HQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHI 656
K+ + + +++ GKG WVNG ++GRYW Q P Q+ Y +
Sbjct: 519 MGTAPVIKSGYFNLQKTGDTYLDMRKWGKGLVWVNGHNLGRYW------QVGPQQTLY-V 571
Query: 657 PRSFLKPTGNLLVLLE 672
P +LK N + +LE
Sbjct: 572 PAEWLKKGQNEVRVLE 587
>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
Length = 688
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 154/322 (47%), Gaps = 29/322 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++ GSIHY R + W + K + G + V T + WNLHE + G+FDFS
Sbjct: 97 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 156
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL ++ + GL+V LR GP+I E GGLP WL P R+ N+ F + +Y
Sbjct: 157 DLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYF 216
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
++ K L GGP+I Q+ENEYG +K Y+ + K L+ G+
Sbjct: 217 DHLIP--KILPLQYRHGGPVIAVQVENEYGS-----FQKDRNYMNYLKKAL--LKRGIVE 267
Query: 219 VMCKQDDAPDPVINACNGRQCG---ETFAGPN-------SPDKPAIWTENWTSFYQVYGD 268
++ DD I + NG +F + DKP + E WT +Y +G
Sbjct: 268 LLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGS 327
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-------RTASAYVLTGYYDQA 321
+ +SAE+I + V FI+ G N YM+HGGTNFG V+T Y A
Sbjct: 328 KHIEKSAEEIRHTVYKFIS--YGLSFNMYMFHGGTNFGFINGGRYENHHISVVTSYDYDA 385
Query: 322 PLDEYGLLRQPKWGHLKELHSA 343
L E G + K+ L++L ++
Sbjct: 386 VLSEAGDYTE-KYFKLRKLFAS 406
>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1106
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 95/327 (29%), Positives = 149/327 (45%), Gaps = 39/327 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ V VFWN HEPQPG +DF+
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
+ DL F + Q +YV LR GP++ EW GGLP+WL + R + F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + +K L + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 476 LFEEAVAKQVK--NLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533
Query: 204 ---WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTEN 258
WA+ ++ + W M N G + FA P+ P + +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W+ ++ +G R A D+ + ++ +G + YM HGGTN+G A A
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLS--RGISFSLYMTHGGTNWGHWAGANSPGFAP 640
Query: 314 -LTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y AP+ E G PK+ L+E
Sbjct: 641 DVTSYDYDAPISESGQT-TPKYWALRE 666
>gi|423331257|ref|ZP_17309041.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
gi|409230553|gb|EKN23415.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
CL03T12C09]
Length = 768
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 150/333 (45%), Gaps = 45/333 (13%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + SG +HYPR Q W + + GL+ V T VFWNLHE +PG++DF G +
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
+L +I+ +GL V LR GP++ EW +GG P+WL ++PG+ R DN F K Y
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---YVRWAAKLAVDLQTG 215
+ + L S+GGPII+ Q ENE+G + K P + R+ AK+ L
Sbjct: 159 DKLYE--QVGDLQVSKGGPIIMVQAENEFG--SYVAQRKDIPLEEHRRYNAKIKRQLADA 214
Query: 216 ---VPWV------MCKQDDAPDPV------INACNGRQCGETFAGPNSPDKPAI----WT 256
VP + + P + N N ++ + G P A W
Sbjct: 215 GFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
+W + D R E + F N+YM HGGTNFG T+ A
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKK 325
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 326 HDIQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|298376422|ref|ZP_06986377.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
gi|298266300|gb|EFI07958.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
Length = 768
Score = 140 bits (352), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 150/333 (45%), Gaps = 45/333 (13%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + SG +HYPR Q W + + GL+ V T VFWNLHE +PG++DF G +
Sbjct: 39 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
+L +I+ +GL V LR GP++ EW +GG P+WL ++PG+ R DN F K Y
Sbjct: 99 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---YVRWAAKLAVDLQTG 215
+ + L S+GGPII+ Q ENE+G + K P + R+ AK+ L
Sbjct: 159 DKLYE--QVGDLQVSKGGPIIMVQAENEFG--SYVAQRKDIPLEEHRRYNAKIKRQLADA 214
Query: 216 ---VPWV------MCKQDDAPDPV------INACNGRQCGETFAGPNSPDKPAI----WT 256
VP + + P + N N ++ + G P A W
Sbjct: 215 GFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 274
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
+W + D R E + F N+YM HGGTNFG T+ A
Sbjct: 275 MHWAEPFPDISDSGIARQTETYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKK 325
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 326 HDIQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357
>gi|187736173|ref|YP_001878285.1| beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
gi|187426225|gb|ACD05504.1| Beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
Length = 780
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 145/308 (47%), Gaps = 18/308 (5%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ +++G + SG +HYPR Q W + K G++ V T +FWN+HEP+PG++DF
Sbjct: 39 ENFLMDGKPVKIISGEMHYPRVPRQHWKDRFQRIKAMGMNTVCTYLFWNVHEPEPGKWDF 98
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG D V FIKE Q GL+V +R GP++ EW +GG P WL + RS + F
Sbjct: 99 SGNLDFVEFIKEAQKAGLWVIVRPGPYVCAEWEFGGFPGWLLKDEDLKVRSQDPRFLEPA 158
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDL 212
Y + +M++ L ++GGPII++Q+ENEYG + +++K +R V
Sbjct: 159 MAYLKKVCSMLEP--LQITKGGPIIMAQVENEYGSYGSDKDYVKKHLDVIRKELPGVVPF 216
Query: 213 QTGVP--WVMCKQDDAPD--PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
+ P W M K P P +N G + + P I E W ++ +G
Sbjct: 217 TSDGPNDW-MIKNGTLPGVVPAMNFGGGAKGAFANLEKHKGKTPRINGEFWVGWFDHWGK 275
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-----RTASAYV--LTGYYDQA 321
S E + + N +M HGGT+FG AY +T Y A
Sbjct: 276 PKNGGSTEGFNRDLKWMLENNVSP--NLFMAHGGTSFGFMNGANWEGAYTPDVTNYDYGA 333
Query: 322 PLDEYGLL 329
P+ E G L
Sbjct: 334 PISENGTL 341
Score = 48.1 bits (113), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 42/199 (21%), Positives = 87/199 (43%), Gaps = 26/199 (13%)
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
LK++++ +++G+ G+A ++ S + + +G + V + +G + G
Sbjct: 425 LKMNNMQDRAIVYVDGKRQGAADRRYKQDSCD----IVIPSGLHTVDIFVENMGRINFGG 480
Query: 538 YLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH 597
++ G+R K+L++F ++ F G ++P+S +
Sbjct: 481 QIQGERKGIRGPITLDGKKLENFLIYN--------------FPCKGVELIPFSGKKPAGD 526
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIP 657
QP+ +++ F+ D KG WVNG+++GR+W SQ + P
Sbjct: 527 QPV-FHRGYFNVSNPKDTYLDMRDGWKKGVVWVNGRNLGRFWF-------IGSQQALYCP 578
Query: 658 RSFLKPTGNLLVLLEEENG 676
+LKP N +V+L+ + G
Sbjct: 579 GEYLKPGKNEIVVLDVDGG 597
>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM
17393]
gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
Length = 1106
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 95/327 (29%), Positives = 149/327 (45%), Gaps = 39/327 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ V VFWN HEPQPG +DF+
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
+ DL F + Q +YV LR GP++ EW GGLP+WL + R + F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + +K L + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533
Query: 204 ---WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTEN 258
WA+ ++ + W M N G + FA P+ P + +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W+ ++ +G R A D+ + ++ +G + YM HGGTN+G A A
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLS--RGISFSLYMTHGGTNWGHWAGANSPGFAP 640
Query: 314 -LTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y AP+ E G PK+ L+E
Sbjct: 641 DVTSYDYDAPISESGQT-TPKYWALRE 666
>gi|288928311|ref|ZP_06422158.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
gi|288331145|gb|EFC69729.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
F0108]
Length = 674
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 109/349 (31%), Positives = 159/349 (45%), Gaps = 65/349 (18%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
DG+ + NG L SG +HY R W + K GL+ V T VFWN HE +PG++
Sbjct: 86 DGQ-FVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKW 144
Query: 93 DF-SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFK 151
D+ +G R+L +F+K +G+ V LR GP+ EW +GG P+WL G+V R+DN+PF
Sbjct: 145 DWKTGNRNLRQFVKTAAEEGMLVILRPGPYCCAEWEFGGYPWWLSKAKGLVIRADNQPFL 204
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG-------------------MVEH 192
+ Y + + M+ ++ ++GGPII+ Q ENE+G ++
Sbjct: 205 DSCRVYINQLASQMRDLQI--TKGGPIIMVQAENEFGSYVAQRKDIPLETHRAYSAKIKQ 262
Query: 193 SFLEKG---PPYV---RWAAKLAVDLQTGVPWVMCKQD-DAPDPVINACNGRQ----CGE 241
L+ G P + W K ++ +P + D + V+N NG + E
Sbjct: 263 QLLDAGFDVPLFTSDGSWLFKGGT-IEGALPTANGESDIEKLKKVVNEYNGGKGPYMVAE 321
Query: 242 TFAGPNSPDKPAIWTENWTS-FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
+ G W +W F QV S E I A ++ G NYYM H
Sbjct: 322 FYPG---------WLSHWAEPFPQV--------STESIVKQTAKYLE--NGISFNYYMVH 362
Query: 301 GGTNFGRTASAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
GGTNFG T+ A LT Y AP+ E G PK+ L+ L
Sbjct: 363 GGTNFGFTSGANYTTATNLQPDLTSYDYDAPISEAG-WNTPKYDALRAL 410
Score = 43.9 bits (102), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 8/78 (10%)
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
T Y F+ T D +N+ + GKG +VNG ++GRYW + P Q+ Y +P F
Sbjct: 588 TLYSGTFNLDTTGD-TFLNMETWGKGIVFVNGINLGRYW------KRGPQQTLY-LPGCF 639
Query: 661 LKPTGNLLVLLEEENGYP 678
LK N +V+ E++N P
Sbjct: 640 LKKGENKIVVFEQQNDTP 657
>gi|301309736|ref|ZP_07215675.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|423340209|ref|ZP_17317948.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
gi|300831310|gb|EFK61941.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
gi|409227644|gb|EKN20540.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
CL09T03C24]
Length = 765
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 103/333 (30%), Positives = 150/333 (45%), Gaps = 45/333 (13%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + SG +HYPR Q W + + GL+ V T VFWNLHE +PG++DF G +
Sbjct: 36 VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 95
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
+L +I+ +GL V LR GP++ EW +GG P+WL ++PG+ R DN F K Y
Sbjct: 96 NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 155
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP---YVRWAAKLAVDLQTG 215
+ + L S+GGPII+ Q ENE+G + K P + R+ AK+ L
Sbjct: 156 DKLYE--QVGDLQVSKGGPIIMVQAENEFG--SYVAQRKDIPLEEHRRYNAKIKRQLADA 211
Query: 216 ---VPWV------MCKQDDAPDPV------INACNGRQCGETFAGPNSPDKPAI----WT 256
VP + + P + N N ++ + G P A W
Sbjct: 212 GFNVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVGPYMVAEFYPGWL 271
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
+W + D R E + F N+YM HGGTNFG T+ A
Sbjct: 272 MHWAEPFPDISDSGIARQTETYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKK 322
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 323 HDIQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 354
>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
Length = 604
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVERFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|420261585|ref|ZP_14764229.1| glycosyl hydrolase [Enterococcus sp. C1]
gi|394771519|gb|EJF51280.1| glycosyl hydrolase [Enterococcus sp. C1]
Length = 591
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 144/290 (49%), Gaps = 31/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + V+T + WNLHEP+ G +DF
Sbjct: 8 EDFLLDGKPIKLISGAIHYFRMTPVQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+ F+K+ Q GL V LR +I EW +GGLP WL + P + RS + F +
Sbjct: 68 EGMKDICAFVKQAQTIGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K L + GGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 127 RNYFQVL--LPKLVPLQITHGGPVIMMQVENEYGSYG---MEKA--YLRQTKELMEEYGI 179
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G + E F + + P +
Sbjct: 180 DVP--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCM 237
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 238 EYWDGWFNRWGEPIIKRDGQDLANEVKEMLA--VGS-LNLYMFHGGTNFG 284
>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1106
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 95/327 (29%), Positives = 149/327 (45%), Gaps = 39/327 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + +HYPR W + I K G++ V VFWN HEPQPG +DF+
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
+ DL F + Q +YV LR GP++ EW GGLP+WL + R + F +
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVR---------- 203
+ + +K L + GGPII+ Q+ENEYG + ++ + VR
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNGIALF 533
Query: 204 ---WAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTEN 258
WA+ ++ + W M N G + FA P+ P + +E
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W+ ++ +G R A D+ + ++ +G + YM HGGTN+G A A
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLS--RGISFSLYMTHGGTNWGHWAGANSPGFAP 640
Query: 314 -LTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y AP+ E G PK+ L+E
Sbjct: 641 DVTSYDYDAPISESGQT-TPKYWALRE 666
>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
Length = 604
Score = 139 bits (351), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|194213013|ref|XP_001503036.2| PREDICTED: LOW QUALITY PROTEIN: galactosidase, beta 1-like 2 [Equus
caballus]
Length = 663
Score = 139 bits (350), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 98/286 (34%), Positives = 135/286 (47%), Gaps = 26/286 (9%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GS+HY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 91 IFGGSVHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGRFDFSGNLDLEAFVL 150
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL G+ R+ + F + Y + M
Sbjct: 151 TAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDSGMRLRTTYKGFTNAVDLYFDHL--MP 208
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 209 RVVPLQYKHGGPIIAVQVENEYGSY-----NKDPTYMPYIKKALED--RGIEELLLTSDN 261
Query: 226 -------APDPVINACNGR-----QCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
A D V+ N + Q TF +P + E WT ++ +G I
Sbjct: 262 KDGLSSGAVDGVLATINLQSQHDLQLLSTFLFTVQGARPKMVMEYWTGWFDSWGGTHNIL 321
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD 319
+ ++ V+ I GS +N YM+HGGTNFG A YYD
Sbjct: 322 DSSEVLKTVSAIID--AGSSINLYMFHGGTNFGFINGA---MHYYD 362
>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
Length = 604
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 604
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 18 EEFLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 189
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 246
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 303
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 304 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|395803570|ref|ZP_10482814.1| beta-galactosidase [Flavobacterium sp. F52]
gi|395434124|gb|EJG00074.1| beta-galactosidase [Flavobacterium sp. F52]
Length = 617
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 104/333 (31%), Positives = 159/333 (47%), Gaps = 35/333 (10%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF-SGRRDLVRFI 104
+ SG +HY R + W + K GL+ V T VFWN HE +PG +DF +G RDL F+
Sbjct: 43 IHSGEMHYERIPKEYWRHRLQMLKAMGLNTVATYVFWNYHEIEPGVWDFKTGNRDLAEFL 102
Query: 105 KEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNM 164
+ +++GLYV LR GP+ GEW +GG P+WL + P +V R++N+ F K Y + +
Sbjct: 103 RIAKSEGLYVILRPGPYACGEWEFGGYPWWLQNNPDLVIRTNNKAFLDACKTYLEHLYAV 162
Query: 165 MKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK 222
+K +A+QGGPII+ Q ENE+G + + + + A + +TG P
Sbjct: 163 VKGN--FANQGGPIIMVQAENEFGSYVSQRTDISAEDHKAYKTAIYNILKETGFPEPFFT 220
Query: 223 QDDA-------PDPVINACNGRQCGETFAGPNSPDK------PAIWTENWTSFYQVYGDE 269
D + + V+ NG E DK P + E + + + +
Sbjct: 221 SDGSWLFEGGMVEGVLPTANGESNIENLK--KQVDKYHKGQGPYMVAEFYPGWLDHWAEP 278
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------LTGYYDQ 320
+E+IA ++ G NYYM HGGTNFG T+ A +T Y
Sbjct: 279 FVKIGSEEIASQTKKYLD--AGVSFNYYMAHGGTNFGFTSGANYNEESDIQPDITSYDYD 336
Query: 321 APLDEYGLLRQPKWGHLKEL---HSAVKLCLKP 350
AP+ E G PK+ ++++ +S KL P
Sbjct: 337 APISEAG-WATPKFMAIRDVMQKYSKTKLAAIP 368
>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
15897]
gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
Length = 577
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 149/317 (47%), Gaps = 34/317 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
II+G + + SG++HY R P+ W + K+ G + V+T + WNLHEP G+FDF
Sbjct: 8 EDFIIDGQKTKIISGAVHYFRIVPEYWEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G++D+ F++ + GLYV +R P+I EW GGLP WL I R+++ + H+
Sbjct: 68 DGQKDVCAFLELAKKLGLYVIIRPSPYICSEWELGGLPAWLLKDSDIRLRTNDSVYMKHL 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y +++ M+ A+ ++ G IIL+Q+ENEYG Y++ K+ +
Sbjct: 128 EEYYAVLLPMI--AKYQINREGTIILAQLENEYGSYNQD-----KDYLKALLKMMREYGI 180
Query: 215 GVP-------W-------VMCKQDDAPDPVI--NACNGRQCGETFAGPNSPDKPAIWTEN 258
VP W + ++D P NA + F + P + E
Sbjct: 181 EVPIFTADGTWEEALEAGSLFEEDVFPTGNFGSNAKENIAVLKEFMKKHQIVAPIMCMEF 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--------RTAS 310
W ++ + E R E++ I GS +N+YM+HGGTNFG +
Sbjct: 241 WDGWFNRWNMEIVKRDPEELVQSAKEMID--LGS-INFYMFHGGTNFGWMNGCSARKEHD 297
Query: 311 AYVLTGYYDQAPLDEYG 327
+T Y A L EYG
Sbjct: 298 LPQITSYDYDAILTEYG 314
>gi|350588684|ref|XP_003130139.3| PREDICTED: galactosidase, beta 1-like 3 [Sus scrofa]
Length = 656
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 107/331 (32%), Positives = 155/331 (46%), Gaps = 29/331 (8%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH ++ GSIHY R + W + K K G + V T V WNLHEP+ G+FDFSG
Sbjct: 84 LEGHEFLILGGSIHYFRVPRESWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSGNL 143
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
D+ FI GL+V LR GP+I E GGLP L P R+ N F + Y
Sbjct: 144 DMEAFILLAAEVGLWVILRPGPYICSEIDLGGLPSRLLQDPTSQLRTTNHSFIEAVDEYL 203
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
++ + L +GGPII Q+ENEYG E PY+ A L+ G+
Sbjct: 204 DHLI--ARVVPLQYRKGGPIIAVQVENEYGSFHKD--EAYMPYLHKAL-----LKRGIVE 254
Query: 219 VMCKQDDAPDPVINACNGRQCG---ETFAGPNSPD-------KPAIWTENWTSFYQVYGD 268
++ D+ + + G ++F D KP + E W ++ +G+
Sbjct: 255 LLLTSDNTNEVLKGHIKGVLATVNMKSFKEGEFKDLYQVQSNKPILIMEFWVGWFDTWGN 314
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQA 321
+ +R A D+ + FI +++ S+ N YM+HGGTNFG A V+T Y A
Sbjct: 315 KHAVRDAIDVENTIFDFI-RLEISF-NVYMFHGGTNFGFMNGATYFEQHRGVVTSYDYDA 372
Query: 322 PLDEYGLLRQPKWGHLKELHSAVKLCLKPML 352
L E G PK+ L+EL ++ + P L
Sbjct: 373 VLTEAGDY-TPKFFKLRELFKSIFVTPLPAL 402
>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 594
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|189096261|pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
Thetaiotaomicron
Length = 612
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/323 (29%), Positives = 153/323 (47%), Gaps = 29/323 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++NG ++ + IHYPR + W I K G + + VFWN HEP+ G++DF+
Sbjct: 14 TFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPEEGRYDFA 73
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G++D+ F + Q G YV +R GP++ EW GGLP+WL I R + + +K
Sbjct: 74 GQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDPYYXERVK 133
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVDLQ 213
+ + + A L S+GG II Q+ENEYG ++ ++ + V+ A
Sbjct: 134 LFLNEVGKQL--ADLQISKGGNIIXVQVENEYGAFGIDKPYISEIRDXVKQAGF------ 185
Query: 214 TGVPWVMCK-----QDDAPDPV---INACNGRQCGETFAGPNS--PDKPAIWTENWTSFY 263
TGVP C +++A D + IN G E F PD P +E W+ ++
Sbjct: 186 TGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEFWSGWF 245
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------VLTGY 317
+G + RSAE++ + + + Y HGGT+FG A T Y
Sbjct: 246 DHWGAKHETRSAEELVKGXKEXLD--RNISFSLYXTHGGTSFGHWGGANFPNFSPTCTSY 303
Query: 318 YDQAPLDEYGLLRQPKWGHLKEL 340
AP++E G + PK+ ++ L
Sbjct: 304 DYDAPINESGKV-TPKYLEVRNL 325
Score = 41.2 bits (95), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 53/225 (23%), Positives = 96/225 (42%), Gaps = 32/225 (14%)
Query: 454 TKDASDYLWYN--FRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLE 511
T +A D W + +R SD E L ++ F+NG+ + + K +
Sbjct: 374 TXEAFDQGWGSILYRTSLSASDKEQTLLITEAHDWAQVFLNGKKLATLS---RLKGEGVV 430
Query: 512 KMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQ---GAKELKDFSSFSWGYQ 568
K+ L G + + +L G + G + V +Q G + +KD+ ++
Sbjct: 431 KLPPLKEG-DRLDILVEAXGRXNFGKGIYDWKGITEKVELQSDKGVELVKDWQVYT---- 485
Query: 569 VGLLGEKLQIFTDYG-SRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGE 627
I DY +R + + ++ +QP +Y++ F+ D +N + KG
Sbjct: 486 ---------IPVDYSFARDKQYKQQENAENQP-AYYRSTFNLNELGD-TFLNXXNWSKGX 534
Query: 628 AWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
WVNG +IGRYW + PQ T ++P +LK N +++L+
Sbjct: 535 VWVNGHAIGRYWE--IGPQQT-----LYVPGCWLKKGENEIIILD 572
>gi|256396208|ref|YP_003117772.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
gi|256362434|gb|ACU75931.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
Length = 625
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 99/333 (29%), Positives = 151/333 (45%), Gaps = 31/333 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T DG + G + S +IHY R P +W + + + G + V+ + WN H+P P
Sbjct: 7 LTIDGGRFLRGGREHRIVSAAIHYFRIHPDLWRDRLQRLRAMGCNTVECYIAWNFHQPTP 66
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
F G RD+ F++ G V R GP+I EW +GGLP WL + R+ +
Sbjct: 67 AAPRFDGWRDVAGFVRLAGELGFDVIARPGPYICAEWDFGGLPAWLLADENVRLRTTDPV 126
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG-------MVEHSFLEKGPPYV 202
+ + + ++ ++ A L A++GGP++ QIENEYG ++H L KG +
Sbjct: 127 YLAAVDAWFDELIPVL--AELQATRGGPVVAVQIENEYGSFGADPDYLDH--LRKG--LI 180
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN--SPDKPAIWTENWT 260
+ G +M PD + G + E FA PD P + E W
Sbjct: 181 ERGVDTLLFTSDGPQELMLAGGTVPDVLATVNFGSRADEAFATLRRVRPDDPPVCMEFWN 240
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------- 312
++ +G+ RSA+D A + +A G VN+YM HGGTNFG A A
Sbjct: 241 GWFDHFGEPHHTRSAQDAARSLDEILA--AGGSVNFYMGHGGTNFGFWAGANHSGVGTGD 298
Query: 313 -----VLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y AP+ E G L PK+ +E+
Sbjct: 299 PGYQPTITSYDYDAPVGEAGEL-TPKFHLFREV 330
>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
Length = 919
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 105/357 (29%), Positives = 166/357 (46%), Gaps = 31/357 (8%)
Query: 16 TIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDV 75
TI ++G V Y+ S ING + L S +IHY R + W ++ KAK G++
Sbjct: 4 TIVQTNGLPHKNTAVQYNAFSYNINGEQVFLNSAAIHYFRMPKEEWREVLVKAKLAGMNC 63
Query: 76 VQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWL 135
V T WN+HEP+ G+++F G D F+ GL+V R GPFI EW +GG P+WL
Sbjct: 64 VDTYFAWNVHEPEEGEWNFEGDNDCGAFLDLCHELGLWVIARPGPFICAEWDFGGFPYWL 123
Query: 136 HDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFL 195
+ + FR+ + + ++ RY I+ +++ + A GG +IL Q+ENEYG +
Sbjct: 124 NTKKDMKFRAFDMQYLTYVDRYMDRIIPIIRDREINA--GGSVILVQVENEYGYLASD-- 179
Query: 196 EKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETF-AGPN------- 247
E Y+ + +D VP + C + G G F +G +
Sbjct: 180 EVARDYMLHLRDVMLDRGVMVPLITC---------VGGAEGTVEGANFWSGADHHYNNLV 230
Query: 248 --SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM----YHG 301
PD P I TE WT +++ +G A + + L + + V++YM +
Sbjct: 231 QKQPDTPKIVTEFWTGWFEHWGAPAATQKTAALYEKRMLESLRAGFTGVSHYMFFGGTNF 290
Query: 302 GTNFGRTASA---YVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGV 355
G GRT A +++T Y APL EYG + K+ K + V+ +L+ V
Sbjct: 291 GGYGGRTVGASDIFMVTSYDYDAPLSEYGRVTD-KYNTAKRMSYFVQATESVLLNAV 346
Score = 44.3 bits (103), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 43/96 (44%), Gaps = 13/96 (13%)
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPV----AINLISMGKGEAWVNGQSIGRYWVSFLTPQG 647
Y T P+ W+ FD P V + L M KG W+NG +GRYW Q
Sbjct: 818 YAGDTGVPV-WHTVQFDKPELPADVNAKLKLRLTGMSKGTLWLNGIDLGRYW------QV 870
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISI 683
P + Y IP ++LK N LVL +E P + +
Sbjct: 871 GPQED-YKIPMAWLKDR-NELVLFDENGASPSKVRL 904
>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 775
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 149/330 (45%), Gaps = 34/330 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V + + ING L G +HYPR + W + +A+ GL+ V VFWN HE QP
Sbjct: 30 VKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRARAMGLNTVSAYVFWNFHERQP 89
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G FDFSG+ D+ F++ Q +GLYV LR GP++ EW +GG P WL + +RS +
Sbjct: 90 GVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDLTYRSKDPR 149
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F + +RY + + A L + GG II+ Q+ENEYG Y+ +
Sbjct: 150 FMSYCERYIKELGKQL--APLTINNGGNIIMVQVENEYGSYAAD-----KEYLAAIRDML 202
Query: 210 VDLQTGVPWVMCK---QDDAPD-----PVINACNGRQCGETFAGPNSPDKPAIWTENWTS 261
+ VP C Q +A P +N G + P P E + +
Sbjct: 203 QEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFK-IVDKYHPGGPYFVAEFYPA 261
Query: 262 FYQVYGDE----ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY 317
++ +G A R AE + + + G V+ YM+HGGTNF A G+
Sbjct: 262 WFDEWGKRHSSVAYERPAEQLDWMLG------HGVSVSMYMFHGGTNFWYMNGANTSGGF 315
Query: 318 YDQ-------APLDEYGLLRQPKWGHLKEL 340
Q APL E+G PK+ +E+
Sbjct: 316 RPQPTSYDYDAPLGEWGNCY-PKYHAFREI 344
Score = 40.4 bits (93), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+++ GKG WVNG+S+GR+W + PQ T +IP +LK N +V+ E E+
Sbjct: 538 VDMSQWGKGAVWVNGKSLGRFWN--IGPQQT-----LYIPAPWLKKGENEIVVFEMED 588
>gi|257866484|ref|ZP_05646137.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257873001|ref|ZP_05652654.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
gi|257800442|gb|EEV29470.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
gi|257807165|gb|EEV35987.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
Length = 591
Score = 139 bits (350), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 144/290 (49%), Gaps = 31/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R T W + K G + V+T + WNLHEP+ G +DF
Sbjct: 8 EDFLLDGKPIKLISGAIHYFRMTSAQWADSLYNLKALGANTVETYIPWNLHEPREGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+ F+K+ QA GL V LR +I EW +GGLP WL + P + RS + F +
Sbjct: 68 EGMKDIFAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K L + GGP+I+ Q+ENEYG +EK Y+R +L +
Sbjct: 127 RNYFQVL--LPKLVPLQITHGGPVIMMQVENEYGSYG---MEKA--YLRQTKELMEECGI 179
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G + E F + + P +
Sbjct: 180 DVP--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCM 237
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 238 EYWDGWFNRWGEPIIKRDGQDLANEVKEMLA--VGS-LNLYMFHGGTNFG 284
>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
Length = 593
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 138/284 (48%), Gaps = 26/284 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
I+ ++ + SG++HY R P W + K G + V+T + WN+HEP G+FDF G +
Sbjct: 12 IDDNKFKILSGAVHYFRIHPSQWGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIK 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
D+ +FIK + GLYV LR P+I EW +GGLP WL I RS ++ F ++ Y
Sbjct: 72 DIEKFIKISEKLGLYVILRPTPYICAEWEFGGLPAWLLKDKEIKLRSSDDNFIEKLRNYY 131
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVP- 217
+ + + + ++GGP+++ Q+ENEYG + Y+R A + + VP
Sbjct: 132 NDL--LPRLVKYQVTKGGPVLMMQVENEYGSYGNE-----KEYLRIVASIMKENGVDVPL 184
Query: 218 ------WV---MCKQDDAPDPVINACNGRQCGET------FAGPNSPDKPAIWTENWTSF 262
W+ C D ++ G + E F N + P + E W +
Sbjct: 185 FTSDGTWIEALECGSLIEDDIFVSGNFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDGW 244
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ +G++ R + D+A V +K +N YM+ GGTNFG
Sbjct: 245 FNRWGEDIIRRDSIDLAEDVK---EMLKIGSINLYMFRGGTNFG 285
>gi|402895882|ref|XP_003911041.1| PREDICTED: beta-galactosidase-1-like protein 2 [Papio anubis]
Length = 636
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 131/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMAYVKKALED--RGIVELLLTSDN 233
Query: 226 APD----------PVINACNGR--QCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
IN + R Q TF +P + E WT ++ +G I
Sbjct: 234 KDGLSKGIVQGVLATINLQSTRELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 294 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 324
>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 604
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 604
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
Length = 604
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFDMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALA--LGS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
Length = 604
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 604
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
Length = 604
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 604
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|313238883|emb|CBY13879.1| unnamed protein product [Oikopleura dioica]
Length = 601
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 150/316 (47%), Gaps = 31/316 (9%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SGS+HY R + W + K K GL+ VQT + WNLHEP+ G F F D+ F+K
Sbjct: 19 ILSGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDELDVSEFLK 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR-SDNEPFKFHMKRYATMIVNM 164
+ GLYV +R GP+I EW +GG P WL ++ R + +E + ++ + T++ +
Sbjct: 79 IAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQNWFTVLFSQ 138
Query: 165 MKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQD 224
++ + S+GGPII Q+ENEY K Y+ W L D+ + +
Sbjct: 139 LRDHQW--SRGGPIISIQVENEYASY-----NKDSEYLPWVKNLLTDVGKCFLLKIINET 191
Query: 225 D--------APDPVINACNGRQCGETFAGPN--SPDKPAIWTENWTSFYQVYGDEARIRS 274
+ PD + A N + G F + P++P + TE W ++ +G +
Sbjct: 192 NFFLKGAHLLPDTFLTA-NFQSVGNAFEVLDKLQPNRPKMVTEFWAGWFDHWGQQGH-SL 249
Query: 275 AEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVL----------TGYYDQAPLD 324
++ + GS VN YM+HGGT+FG A + L T Y APL
Sbjct: 250 LSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTTSYDYDAPLS 309
Query: 325 EYGLLRQPKWGHLKEL 340
E G L + KW +E+
Sbjct: 310 ESGDLTE-KWNVTREI 324
>gi|320109257|ref|YP_004184847.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
gi|319927778|gb|ADV84853.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
Length = 640
Score = 139 bits (349), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 163/352 (46%), Gaps = 42/352 (11%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G ++G + +G +HY R W + KAK GL+ + T VFWN+HEP+PG +D
Sbjct: 30 GDHFELDGKPFRILTGEMHYARIPRARWDDAMQKAKALGLNAITTYVFWNVHEPRPGVYD 89
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F+G+ DL ++ Q GL V LR GP+ EW +GG P WL P +V RS + F
Sbjct: 90 FTGQNDLGEYLAAAQRAGLKVILRPGPYACAEWEFGGYPAWLIKDPTVVVRSSDPKF--- 146
Query: 154 MKRYATMIVNMMKAARLY-ASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA---- 206
MK A + + + Y A+ GGPII Q+ENEYG +H+++E+ V +
Sbjct: 147 MKPVAKWFHRLGQEVQPYLAANGGPIIAVQVENEYGSFGNDHAYMEQMKDLVISSGIGGK 206
Query: 207 --KLAVD-------------LQTGVPWVMCKQDDAPD-PVINACNGRQCGETFAGPNS-- 248
K AVD L T V P+ P + G Q A +
Sbjct: 207 NPKKAVDEDGKNVPQDTGTMLYTADGGVQLPNGTLPELPAVVNFGGGQAKSELARYEAFR 266
Query: 249 PDKPAIWTENWTSFYQVYG-DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR 307
P+ P + E W ++ +G + + +AE +A + + +G V+ YM +GGT+FG
Sbjct: 267 PNGPRMVGEYWAGWFDHWGNNHQKTNAAEQVAEYEYML---KRGYSVSLYMLYGGTSFGW 323
Query: 308 TASAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKP 350
A A +T Y AP+DE G PK+ L+E+ V P
Sbjct: 324 MAGANSGDKAPYEPDVTSYDYDAPIDERG-NPTPKYFALREVIQRVTGITPP 374
>gi|432894411|ref|XP_004075980.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oryzias
latipes]
Length = 640
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 102/326 (31%), Positives = 151/326 (46%), Gaps = 37/326 (11%)
Query: 21 DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLV 80
+G +N T + + +I G GSIHY R W + K K GL+ + T V
Sbjct: 46 EGLKADSSNFTLERKPFLILG-------GSIHYFRVPKAYWEDRLLKLKACGLNTLTTYV 98
Query: 81 FWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPG 140
WNLHEP+ G FDF G DL ++ + G++V LR GP+I EW GGLP WL
Sbjct: 99 PWNLHEPERGVFDFEGELDLEAYLGLAASLGIWVILRPGPYICAEWDLGGLPSWLLRDQN 158
Query: 141 IVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP 200
+ R+ F + Y ++ K A S+GGPII Q+ENEYG ++ E+ P
Sbjct: 159 MRLRTTYPGFTAAVDSYFDHLIK--KVAPYQYSRGGPIIAVQVENEYG--SYAMDEEYMP 214
Query: 201 YVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPN----------SPD 250
+++ A L G+ ++ D+ + G F + P
Sbjct: 215 FIKEAL-----LSRGITELLVTSDNKDGLKLGGVKGALETINFQKLDPEEIKYLEKIQPQ 269
Query: 251 KPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF----- 305
KP + E W+ ++ ++G + AE++ V I K+ S +N YM+HGGTNF
Sbjct: 270 KPKMVMEYWSGWFDLWGGLHHVFPAEEMM-AVVTEILKLDMS-INLYMFHGGTNFGFMSG 327
Query: 306 ----GRTASAYVLTGYYDQAPLDEYG 327
GR + A ++T Y APL E G
Sbjct: 328 AFAVGRPSPAPMVTSYDYDAPLSEAG 353
Score = 40.0 bits (92), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 77/187 (41%), Gaps = 26/187 (13%)
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
F+ +FVG K + S K G + LL G + G L+ + GL
Sbjct: 456 VFVEKQFVGVLDYKEQELSIPDGK------GKRTLGLLVENCGRVNYGKTLDEQRKGLVG 509
Query: 549 VSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL--TWYKTV 606
A L+DF S L + D+ SR+ +++ S +P +++T
Sbjct: 510 DIQLNANILRDFMIHS-----------LDMKPDFVSRLQSSAQWKSMREKPSFPAFFQTK 558
Query: 607 FDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGN 666
+ + L KG +VNG+++GRYW + PQ T ++P ++L N
Sbjct: 559 LYLSSSPKDTFLKLPGWSKGVVFVNGKNLGRYWS--VGPQQT-----LYVPGAWLNRWDN 611
Query: 667 LLVLLEE 673
+++ EE
Sbjct: 612 EIIVFEE 618
>gi|157824103|ref|NP_001101662.1| beta-galactosidase precursor [Rattus norvegicus]
gi|149018351|gb|EDL76992.1| galactosidase, beta 1 (mapped) [Rattus norvegicus]
Length = 647
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 149/322 (46%), Gaps = 30/322 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GLD +QT V WN HEPQP
Sbjct: 35 LDYKRDRFLKDGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLDAIQTYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+DFSG RD+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 95 GQYDFSGDRDVEHFIQLAHQLGLLVILRPGPYICAEWDMGGLPAWLLEKESIVLRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK RL GGPII Q+ENEYG S+ Y+R+
Sbjct: 155 YLAAVDKWLAVLLPKMK--RLLYQNGGPIITVQVENEYG----SYFACDYNYLRFLEH-R 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--------------PDKPAIW 255
G ++ D A + ++ + T + P P I
Sbjct: 208 FRYHLGNDIILFTTDGAAEKLLKCGTLQDLYATVDFGTTGNITRAFLIQRNFEPKGPLIN 267
Query: 256 TENWTSFYQVYGD-EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G +++ + + +A +L+ G+ VN YM+ GGTNF A +
Sbjct: 268 SEFYTGWLDHWGQPHSKVNTKKLVA---SLYNLLAYGASVNLYMFIGGTNFAYWNGANMP 324
Query: 314 ----LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 325 YAPQPTSYDYDAPLSEAGDLTE 346
>gi|156552637|ref|XP_001603160.1| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
Length = 629
Score = 139 bits (349), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 106/336 (31%), Positives = 156/336 (46%), Gaps = 38/336 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ +++G SGS HY R+ Q W ++ K + GGL+ V T V W++HEP+
Sbjct: 33 IDYENDQFLLDGKPFRYVSGSFHYFRTPRQHWRGILRKMRAGGLNAVSTYVEWSMHEPEF 92
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFW-LHDVPGIVFRSDNE 148
Q+ + G D+V FIK Q + L+V LR GP+I E +GG P+W L VP I R+ +E
Sbjct: 93 DQWVWDGDADIVEFIKIAQEEDLFVILRPGPYICAERDFGGFPYWLLSRVPDIKLRTKDE 152
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG-------MVEHSFLEKGPPY 201
+ F+ +R+ I+ K L GGPII+ Q+ENEYG + E +
Sbjct: 153 RYVFYAERFLNEILRRTKP--LLRGNGGPIIMVQVENEYGSFYACDDQYKSKMYEIFHRH 210
Query: 202 VRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA--GPNSPDKPAI----- 254
V+ A L + + C I+ NG + SP P +
Sbjct: 211 VKNDAVLFTTDGSARSMLKCGSIPGVYATIDFGNGANVPFNYKIMREFSPKGPLVNSEYY 270
Query: 255 --WTENW-TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
W +W SF +V E +AY+V+ VN YMY+GGTNF T+ A
Sbjct: 271 PGWLTHWGESFQRVNSHNVAKTLDEMLAYNVS----------VNIYMYYGGTNFAFTSGA 320
Query: 312 YV-------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+ LT Y APL E G PK+ L+++
Sbjct: 321 NINEHYWPQLTSYDYDAPLTEAG-DPTPKYFELRDV 355
Score = 43.1 bits (100), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 36/57 (63%), Gaps = 6/57 (10%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEE 674
+N GKG A++NG ++GRYW S L PQ T ++P ++LK N LVLLE++
Sbjct: 560 LNTQGWGKGVAYINGFNLGRYWPS-LGPQVT-----LYVPATYLKKGKNSLVLLEQD 610
>gi|315647882|ref|ZP_07900983.1| Beta-galactosidase [Paenibacillus vortex V453]
gi|315276528|gb|EFU39871.1| Beta-galactosidase [Paenibacillus vortex V453]
Length = 587
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 147/298 (49%), Gaps = 24/298 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG++HY R P+ W + K K G + V+T + WNLHEP+ GQF F G DL F++
Sbjct: 21 ILSGAVHYFRIVPEYWEDRLMKLKACGFNTVETYIPWNLHEPKEGQFTFDGIADLEGFVQ 80
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
+ GL+V LR P+I EW +GGLP WL P I R + + + Y ++
Sbjct: 81 KAGHLGLHVILRPSPYICAEWEFGGLPAWLLQYPDIHLRCMDPVYLEKVDHYYDELIP-- 138
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWA-AKLAVDL----QTGVPW 218
+ L S+GGP+I QIENEYG + ++LE Y++ + VD+ G
Sbjct: 139 RIVPLLTSKGGPVIAIQIENEYGSYGNDTAYLE----YLKDGLSARGVDVLLFTSDGPTD 194
Query: 219 VMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAE 276
M + P+ + G + GE FA + P + E W ++ + RS+E
Sbjct: 195 GMLQGGTVPNVLATVNFGSRPGEAFAKLREYRTEDPLMCMEYWNGWFDHWLKPHHTRSSE 254
Query: 277 DIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQAPLDEYG 327
++A V + ++ S VN+YM+HGGTNFG A +T Y APL E G
Sbjct: 255 EVA-QVFEEMLRLNAS-VNFYMFHGGTNFGFYNGANDQEKYEPTVTSYDYDAPLSECG 310
Score = 39.7 bits (91), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 65/156 (41%), Gaps = 25/156 (16%)
Query: 527 SVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
++ + +P +GA LE V + V+ +LKD+ + G ++ Q D+
Sbjct: 429 ALQLDIPAAGAKLEIVVENMGRVNY--GPKLKDYKGITEGARM-----NNQFLFDWSIYP 481
Query: 587 VPWSRYGSSTHQPL----------TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIG 636
+P +++ Q L T+Y F D I L GKG W+NG ++G
Sbjct: 482 LPLENPNTASFQALEGALDQQDRPTFYTGEFTVDEIGD-TFIRLDGWGKGVVWINGFNLG 540
Query: 637 RYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
RYW PQ T ++P LK N + + E
Sbjct: 541 RYWKE--GPQAT-----LYVPGPLLKQGRNAITVFE 569
>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 586
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 146/310 (47%), Gaps = 26/310 (8%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ +++G + SG++HY R P +W I KA+ GL+ ++T V WN H P+ G FD
Sbjct: 9 QDFLLDGEPLQILSGALHYFRVHPDLWADRIRKARLMGLNTIETYVAWNAHAPERGVFDL 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
+G DL RF+ V A+GL+ +R GP+I EW GGLP WL PG+ R+ + +
Sbjct: 69 TGNLDLGRFLDLVAAEGLHAIVRPGPYICAEWDNGGLPAWLMATPGVGVRTAEPQYLEAI 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y I+ ++ ++ ++GGP+++ Q+ENEYG Y+R + +
Sbjct: 129 AGYYDEILAVVAPRQV--TRGGPVLMVQVENEYGAYGDD-----ADYLRALVTMMRERGI 181
Query: 215 GVPWVMCKQDD--------APDPVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQ 264
VP C Q + P+ A G + E + P P + E W ++
Sbjct: 182 EVPLTTCDQANDEMLGRGGLPELHKTATFGSRSPERLETLRRHQPTGPLMCMEYWDGWFD 241
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGY 317
+G++ + + L + +G+ N YM+HGGTN G T A + T Y
Sbjct: 242 SWGEQH--HTTDAAEAAADLDLLLSQGASANLYMFHGGTNLGFTNGANDKGTYLPITTSY 299
Query: 318 YDQAPLDEYG 327
APL E G
Sbjct: 300 DYDAPLAEDG 309
>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
Length = 594
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
Length = 594
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
Length = 604
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 158/345 (45%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVERFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--VNGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
Length = 594
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|422727867|ref|ZP_16784288.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315151617|gb|EFT95633.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 593
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 142/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
Length = 591
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 96/309 (31%), Positives = 138/309 (44%), Gaps = 26/309 (8%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+++G + SG+IHY R P W I KA+ GL+ ++T V WN HEP GQ+ +
Sbjct: 10 DFLLDGRPHRILSGAIHYFRIHPDQWADRIHKARLMGLNTIETYVAWNAHEPVEGQWSWE 69
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G DL F+K V +G++ +R P+I EW GGLP WL R D F ++
Sbjct: 70 GGLDLAAFLKAVADEGMHAIVRPAPYICAEWDNGGLPAWLFGEKAAGVRRDEPVFMAAVQ 129
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y + +++ +++ GGP+IL QIENEYG P Y+R +
Sbjct: 130 AYLRRVYEVIEPLQIH--HGGPVILVQIENEYGAYGSD-----PEYLRKLVDITSSAGIT 182
Query: 216 VPWVMCKQDD--------APDPVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQV 265
VP Q + P + G + E A + P P + E W ++
Sbjct: 183 VPLTTVDQPEDGMLAAGSLPGLLRTGSFGSRSPERLATLRRHQPTGPLMCMEYWNGWFDD 242
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYY 318
+G AE A + + G+ VN YM GGTNFG T A ++T Y
Sbjct: 243 WGTPHHTTDAEASAADLDALLG--SGASVNLYMLCGGTNFGLTNGANDKGTYEPIVTSYD 300
Query: 319 DQAPLDEYG 327
APLDE G
Sbjct: 301 YDAPLDEAG 309
>gi|229548754|ref|ZP_04437479.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257421063|ref|ZP_05598053.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|312951816|ref|ZP_07770707.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|422691033|ref|ZP_16749073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|422707894|ref|ZP_16765431.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
gi|229306094|gb|EEN72090.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
gi|257162887|gb|EEU92847.1| glycosyl hydrolase [Enterococcus faecalis X98]
gi|310630219|gb|EFQ13502.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
gi|315154243|gb|EFT98259.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
gi|315154885|gb|EFT98901.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
Length = 593
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 142/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 594
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|224027078|ref|ZP_03645444.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
gi|224020314|gb|EEF78312.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
18228]
Length = 783
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 149/319 (46%), Gaps = 19/319 (5%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ ++NG ++ + IHY R + W I K G++ + FWN+HE +PG+FDF
Sbjct: 38 KEFLLNGKPFLIKAAEIHYTRIPAEYWEHRIEMCKALGMNTICIYAFWNIHEQRPGEFDF 97
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G+ D+ RF + Q G+Y+ LR GP++ EW GGLP+WL I R+ + F
Sbjct: 98 EGQNDVARFCRLAQKHGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIALRTSDPYFLERT 157
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDL 212
K + + + A L A +GG II+ Q+ENEYG + ++ VR A V L
Sbjct: 158 KIFMNELGKQL--ADLQAPRGGNIIMVQVENEYGAYAEDKEYIASIRDIVRGAGFTDVPL 215
Query: 213 QTGVPWVMCKQDDAPDPV---INACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQVYG 267
W Q + D + IN G + F P+ P + +E W+ ++ +G
Sbjct: 216 -FQCDWASTFQRNGLDDLLWTINFGTGADIDQQFKALREARPETPLMCSEYWSGWFDHWG 274
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-----TASAYVLTGYYD-QA 321
+ R A+ + + + + + YM HGGT FG + S + YD A
Sbjct: 275 RKHETRPADVMVKGIKDMMD--RNISFSLYMTHGGTTFGHWGGANSPSYSAMCSSYDYDA 332
Query: 322 PLDEYGLLRQPKWGHLKEL 340
P+ E G PK+ L++L
Sbjct: 333 PISEAGWA-TPKYYQLRDL 350
>gi|344291571|ref|XP_003417508.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
3-like [Loxodonta africana]
Length = 770
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/324 (32%), Positives = 156/324 (48%), Gaps = 34/324 (10%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++F GSIHY R W + K K G + + T V WNLHEP+ G+FDFSG
Sbjct: 202 LEGHKFLIFGGSIHYFRVPRAYWRDRLLKLKACGFNTLTTYVPWNLHEPERGKFDFSGNL 261
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL FI GL+V LR GP+I E GGLP WL P + +R +
Sbjct: 262 DLEAFIWMAAELGLWVILRPGPYICSEIDLGGLPSWLLQDPDLNWRHTX------LVTQX 315
Query: 159 TMIVNMM-KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVP 217
++ +++ + L +GGPII Q+ENEYG + PYV+ A LQ G+
Sbjct: 316 SLFDHLIPRVVPLQYHRGGPIIAVQVENEYGSYNKD--KDYMPYVQQAL-----LQRGIV 368
Query: 218 WVMCKQDDAPD----------PVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
++ D+ D +N + + +KP + E W ++ +G
Sbjct: 369 ELLLTSDNERDVLKGYIKGVLATVNMKTLSRDAFSLLNKAQSEKPIMIMEFWVGWFDTWG 428
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQ 320
++ +R A+++ + V FI K + S+ N YM+HGGTNFG A V+T Y
Sbjct: 429 NQHFLRDAKEVEHTVLEFI-KAEISF-NAYMFHGGTNFGFMNGATYLGKHRGVVTSYDYD 486
Query: 321 APLDEYGLLRQPKWGHLKELHSAV 344
A L E G + K+ L++L +V
Sbjct: 487 AVLTEAGDYTE-KYFKLRKLFGSV 509
>gi|257415380|ref|ZP_05592374.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257157208|gb|EEU87168.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 593
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 142/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|384209874|ref|YP_005595594.1| beta-galactosidase [Brachyspira intermedia PWS/A]
gi|343387524|gb|AEM23014.1| beta-galactosidase [Brachyspira intermedia PWS/A]
Length = 592
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/310 (31%), Positives = 138/310 (44%), Gaps = 31/310 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
I+NG L SG+IHY R + W + K G + V+T + WN+HE G FDF
Sbjct: 8 EDFILNGKPIKLLSGAIHYFRFVEEYWEDCLYNLKAAGFNTVETYIPWNIHEIDEGVFDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG +D+ FIK Q L V LR P+I EW +GGLP WL + R++ E F +
Sbjct: 68 SGNKDIASFIKLAQKMDLLVILRPTPYICAEWEFGGLPAWLLRYDNMKVRTNTELFLSKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y + + A L ++ GP+I+ QIENEYG + Y++ L V
Sbjct: 128 DAYYKELFKQI--ADLQITRNGPVIMMQIENEYGSFGND-----KEYLKALKNLMVKHGA 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGETFAGPNS------PDKPAIWT 256
VP + D A D V+ A G Q E+F P +
Sbjct: 181 EVP--LFTSDGAWDAVLEAGTLVDDGILATVNFGSQAKESFDATEKFFERKGIKNPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTG 316
E W ++ ++ + R A+D V I +GS +N YM+ GGTNFG V TG
Sbjct: 239 EFWDGWFNLWKEPIIKRDADDFIMEVKEIIK--RGS-INLYMFIGGTNFGFYNGTSV-TG 294
Query: 317 YYDQAPLDEY 326
Y D + Y
Sbjct: 295 YTDFPQITSY 304
Score = 47.0 bits (110), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 89/199 (44%), Gaps = 30/199 (15%)
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLER--RVA 544
+H ++NGE+ G K+ D+ +M H NG N + LL VG + G L+ +V
Sbjct: 411 VHFYLNGEYKGV---KYQDELIEPIEM-HFNNGDNVLELLVENVGRVNYGYKLQECSQVK 466
Query: 545 GLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYK 604
G+R + + F G++ L D+ S+ + + P ++Y+
Sbjct: 467 GIR-IGVMAD------IHFETGWEQYALPLDNIKDVDFSSKWIE--------NTP-SFYR 510
Query: 605 TVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPT 664
FD +D ++ +GKG A++NG ++GRYW + +IP LK
Sbjct: 511 YEFDVKEPADTF-LDCSKLGKGAAFINGFNLGRYW-------SEGPVCYLYIPAPLLKTG 562
Query: 665 GNLLVLLEEENGYPPGISI 683
N +++ E EN + I++
Sbjct: 563 KNEIIIFETENVFADTIAL 581
>gi|449493221|ref|XP_002196735.2| PREDICTED: beta-galactosidase [Taeniopygia guttata]
Length = 636
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 147/321 (45%), Gaps = 28/321 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ YD + +G SGSIHY R P W + K K GLD +QT V WN HEPQ
Sbjct: 11 IDYDSNCFVKDGKPFRYISGSIHYSRVPPYYWKDRLLKMKMAGLDAIQTYVPWNYHEPQM 70
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G +DF G +DL F++ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 71 GTYDFFGGKDLQYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSSDSD 130
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEK--------- 197
+ ++R+ +++ M+ LY GGPII+ Q+ENEYG ++++L
Sbjct: 131 YLEAVERWMGVLLPKMR-PYLY-QNGGPIIMVQVENEYGSYFACDYNYLRFLLKLFRLHL 188
Query: 198 GPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIW 255
G V + A + C ++ G F S P P +
Sbjct: 189 GDEVVLFTTDGASQFH-----LKCGALQGLYATVDFAPGANVTAAFLAQRSSEPKGPLVN 243
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G + A+ IA + +A G+ VN YM+ GGTNF A +
Sbjct: 244 SEFYTGWLDHWGHHHSVVPAQTIAKTLNEILA--SGANVNLYMFIGGTNFAYWNGANMPY 301
Query: 314 ---LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 302 MPQPTSYDYDAPLSEAGDLTE 322
>gi|449672638|ref|XP_002158331.2| PREDICTED: beta-galactosidase-1-like protein 2-like [Hydra
magnipapillata]
Length = 476
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 118/412 (28%), Positives = 176/412 (42%), Gaps = 65/412 (15%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
+GR+ + + + SGS+HY R + W + K K GL+ V + WNLHEP+PG F
Sbjct: 48 NGRNFTLKREKFRIMSGSMHYFRIPFRKWSDRLLKLKAMGLNTVDIYIPWNLHEPEPGHF 107
Query: 93 DFSGRR-DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFK 151
DFS + +L F+ +Q GLY +R GP+I E GGLP WL + RS F
Sbjct: 108 DFSSDQLNLSEFLYLLQGYGLYAVIRPGPYICAELDLGGLPSWLLRDKNMKLRSLYPGFI 167
Query: 152 FHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVD 211
++RY + +++ + S GGPII QIENEYG+ + Y+++ ++ +
Sbjct: 168 EPVERYFKQLFAILQPFQF--SYGGPIIAFQIENEYGVYDQDV-----NYMKYLKEIYIS 220
Query: 212 LQTGVPWVMCKQDDA-----PDPVINACN-----GRQCGETFAGPNSPDKPAIWTENWTS 261
+ +C + V+ N + + PDKP TE W
Sbjct: 221 NGLSELFFVCDNKQGLGKYKLEGVLQTINFMWLDAKGMIDKLEAV-QPDKPVFVTELWDG 279
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA---------- 311
++ +G+ I D A +AL +G+ N YM+HGGTNFG A
Sbjct: 280 WFDHWGENHHIVKTADAA--LALEYVIKRGASFNLYMFHGGTNFGFINGANANNDGSNYQ 337
Query: 312 YVLTGYYDQAPLDEYGLLRQ-------------PKWGHLKEL-----------HSAVKLC 347
+T Y AP+ E G L Q PK K L + +KL
Sbjct: 338 STITSYDYDAPVSETGHLSQKFDELKLTIKNNAPKGAVPKTLPWIPDDSPYTGYGMIKLT 397
Query: 348 LKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYE 399
+ LS +L +NF K Q+ + N NNA F ++YE
Sbjct: 398 TQMDLSEILKHVNFKKYQQVVNME----------NLSINNNAGQSFGYIVYE 439
>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
Length = 594
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 594
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|313231409|emb|CBY08524.1| unnamed protein product [Oikopleura dioica]
Length = 493
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 141/290 (48%), Gaps = 26/290 (8%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
++G + L SGSIHY R + W + K K GL+ V+ V WNLHEP G+F+FSG
Sbjct: 65 LDGEKITLVSGSIHYFRVPNEYWLDRLTKLKYAGLNTVELYVSWNLHEPYSGEFNFSGDL 124
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
D+VRFI+ GL+V R GP+I EW +GG P+WL + R+ + ++++
Sbjct: 125 DVVRFIEMAGELGLHVLFRPGPYICAEWEWGGHPYWLLHDTDMKVRTTYPGYLEAVEKFY 184
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKG---PPYVRWAAKLAVDLQ-- 213
+ + + L GGPII QIENEY +F E G P ++ W + D Q
Sbjct: 185 SELFG--RVNHLMYRNGGPIIAVQIENEYAGFADAF-EIGPLDPGFLTWLRQTIKDQQCE 241
Query: 214 -----TGVPWVMCKQDDAPDP-------VINACNGRQCGETFAGPNSPDKPAIWTENWTS 261
+ W K + DP V+ A E N P KP + E W+
Sbjct: 242 ELLFTSDGGWDFYKYELEGDPYGLNFDDVLRANYWLNILEN----NQPGKPKMVMEWWSG 297
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
++ +G + +A+ ++ ++ + + VNYYM+HGGTNFG A
Sbjct: 298 WFDFWGYHHQGTTADSFEENLRAILS--QNASVNYYMFHGGTNFGYMNGA 345
>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 594
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
Length = 594
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 148/330 (44%), Gaps = 34/330 (10%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V + + ING L G +HYPR + W + +A GL+ V VFWN HE QP
Sbjct: 30 VKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRAHAMGLNTVSAYVFWNFHERQP 89
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G FDFSG+ D+ F++ Q +GLYV LR GP++ EW +GG P WL + +RS +
Sbjct: 90 GVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDLTYRSKDPR 149
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
F + +RY + + A L + GG II+ Q+ENEYG Y+ +
Sbjct: 150 FMSYCERYIKELGKQL--APLTINNGGNIIMVQVENEYGSYAAD-----KEYLAAIRDML 202
Query: 210 VDLQTGVPWVMCK---QDDAPD-----PVINACNGRQCGETFAGPNSPDKPAIWTENWTS 261
+ VP C Q +A P +N G + P P E + +
Sbjct: 203 QEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFK-IVDKYHPGGPYFVAEFYPA 261
Query: 262 FYQVYGDE----ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY 317
++ +G A R AE + + + G V+ YM+HGGTNF A G+
Sbjct: 262 WFDEWGKRHSSVAYERPAEQLDWMLG------HGVSVSMYMFHGGTNFWYMNGANTSGGF 315
Query: 318 YDQ-------APLDEYGLLRQPKWGHLKEL 340
Q APL E+G PK+ +E+
Sbjct: 316 RPQPTSYDYDAPLGEWGNCY-PKYHAFREI 344
Score = 40.4 bits (93), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 34/58 (58%), Gaps = 7/58 (12%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEEN 675
+++ GKG WVNG+S+GR+W + PQ T +IP +LK N +V+ E E+
Sbjct: 538 VDMSQWGKGAVWVNGKSLGRFWN--IGPQQT-----LYIPAPWLKKGENEIVVFEMED 588
>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
Length = 651
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/347 (32%), Positives = 159/347 (45%), Gaps = 27/347 (7%)
Query: 12 LLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEG 71
LLL + G G V Y +G + SGSIHY R W + K
Sbjct: 10 LLLLMLFGRSLGESPSFTVDYQNDCFRKDGEKFQYISGSIHYNRIPRVYWKDRLLKMYMA 69
Query: 72 GLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGL 131
GL+ +QT V WN HE PG ++FSG RDL F+K Q GL V LR GP+I EW GGL
Sbjct: 70 GLNAIQTYVPWNYHEEVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGPYICAEWDMGGL 129
Query: 132 PFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVE 191
P WL IV RS + + + ++ ++ M+K LY GGPII Q+ENEYG
Sbjct: 130 PAWLLKKKDIVLRSTDPDYIAAVDKWMGKLLPMIK-PYLY-QNGGPIITVQVENEYG--- 184
Query: 192 HSFLEKGPPYVRWAAKL-------AVDLQT----GVPWVMCKQDDAPDPVINACNGRQCG 240
S+ Y+R +KL V L T G+ ++ C ++ G
Sbjct: 185 -SYFACDYNYMRHLSKLFRSYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGPGANVT 243
Query: 241 ETFAGPN--SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYM 298
F P P + +E +T + +G + S +A ++ + + G+ VN YM
Sbjct: 244 AAFEPQRQVQPHGPLVNSEFYTGWLDHWGSRHSVVSPTQVAKALSEML--LMGANVNLYM 301
Query: 299 YHGGTNFG-----RTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+ GGTNFG T A T Y APL E G L + K+ ++E+
Sbjct: 302 FIGGTNFGYWNGANTPYAAQPTSYDYDAPLTEAGDLTE-KYFAIREV 347
>gi|422694237|ref|ZP_16752232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
gi|315148319|gb|EFT92335.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
Length = 593
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 142/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIHREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
Length = 628
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 147/330 (44%), Gaps = 41/330 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D+
Sbjct: 157 RLYK--EVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADVGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNG-------RQCGETFAGPNSPDKPAI----WTENW 259
VP + A + NG ++ + + P A W +W
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW 274
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ G R E + F N+YM HGGTNFG T+ A
Sbjct: 275 AEPFPQIGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y AP+ E G + PK+ ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|319900291|ref|YP_004160019.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
gi|319415322|gb|ADV42433.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
Length = 629
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 151/333 (45%), Gaps = 45/333 (13%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + + SG +HY R Q W + K GL+ V T VFWN HE +PG++DF+G +
Sbjct: 38 LNGKQTPILSGEMHYARIPHQYWRHRLQMMKGMGLNAVATYVFWNHHETEPGKWDFTGDK 97
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
+L +IK +G+ V LR GP++ EW +GG P+WL +VPG+ R DN F H + Y
Sbjct: 98 NLAEYIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVPGMEIRRDNPQFLKHTEAYI 157
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ + L ++GGPI++ Q ENE+G + L++ Y + D
Sbjct: 158 QRLYK--EVGHLQCTKGGPIVMVQCENEFGSYVAQRKDITLQEHRAYNAKIKQQLADAGF 215
Query: 215 GVP-------WVM-CKQDDAPDPVINA----CNGRQCGETFAGPNSPDKPAIWTENWTSF 262
VP W+ + P N N ++ + G P A + W S
Sbjct: 216 DVPLFTSDGSWLFEGGSTEGALPTANGETDIANLKKVVNQYHGGQGPYMVAEFYPGWLSH 275
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYV------NYYMYHGGTNFGRTASAYV--- 313
+ AE A +A+ SY+ N YM HGGTNFG T+ A
Sbjct: 276 W-----------AEPFPQVSASSVARTTESYLKNDVSFNVYMVHGGTNFGFTSGANYDKK 324
Query: 314 ------LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 325 RDIQPDLTSYDYDAPISEAGWV-TPKYDSIRAV 356
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 109/268 (40%), Gaps = 32/268 (11%)
Query: 414 VAFNTAKLDSVEQWEEYKEAI-PTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPS 472
+ + KLD V Y E PT ++T + EQ+N Y+ Y F
Sbjct: 375 IEIPSIKLDKVTDMLAYTETTEPTVNDTPM----TFEQLN---QGYGYVLYTRHFNQPIG 427
Query: 473 DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGL 532
+ L++ L +I+GE G + + +++++E V N T + +L +G
Sbjct: 428 GT---LQIDGLRDYAVVYIDGEKAGVLN--RNTQTYSMEIDVPF-NAT--LQILVENMGR 479
Query: 533 PDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVP--WS 590
+ G+ + G+ + G KE+ + W L K G P +
Sbjct: 480 INYGSEIVHNTKGIISPVTIGGKEI----TGGWN-MYPLPMSKAPEAAKAGRNAYPNTSA 534
Query: 591 RYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPS 650
+ G P+ + T TG I++ GKG +VNG +IGRYW Q P
Sbjct: 535 QAGKLKGSPVAYEGTFTLNRTGD--TFIDMEDWGKGIIFVNGINIGRYW------QAGPQ 586
Query: 651 QSWYHIPRSFLKPTGNLLVLLEEENGYP 678
Q+ Y IP +LK N +V+ E+ N P
Sbjct: 587 QTLY-IPGVWLKKGENKIVIFEQLNEKP 613
>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
Length = 594
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|29375402|ref|NP_814556.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|29342862|gb|AAO80626.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
Length = 592
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 128 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 181 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 285
>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
Length = 594
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|336428330|ref|ZP_08608312.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336005980|gb|EGN36021.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 583
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 142/299 (47%), Gaps = 29/299 (9%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
++G + SG++HY R P+ W + K K G + V+T V WN+HEPQ G+F F G
Sbjct: 14 LDGKPFKIISGAVHYFRIVPEYWRDRLEKLKAMGANTVETYVPWNMHEPQKGKFVFEGML 73
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
D+ RFI Q GLYV +R P+I EW +GGLP WL G+ R EPF ++ Y
Sbjct: 74 DISRFILLAQELGLYVIVRPSPYICAEWEFGGLPAWLLKEDGMRLRGCYEPFLEAVREYY 133
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW 218
+++ ++ +++ GGP+IL Q+ENEYG + Y+ +L +D VP
Sbjct: 134 SVLFPILVPLQIH--HGGPVILMQVENEYG-----YYGDDTRYMETMKQLMLDNGAEVPL 186
Query: 219 VMCKQDDAPDPVINACN-----------GRQCGETFA--GPNSPDKPAIWTENWTSFYQV 265
V D P +C G + E F + P + TE W ++
Sbjct: 187 VTS---DGPMDESLSCGRLPGVLPTGNFGSKTEERFEVLKKYTEGGPLMCTEFWVGWFDH 243
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLD 324
+G+ +R ++ ++ +VN YM+ GGTNFG + YYD+ D
Sbjct: 244 WGNGGHMRG--NLEESTKDLDKMLEMGHVNIYMFEGGTNFGFMNG----SNYYDELTPD 296
>gi|227554928|ref|ZP_03984975.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|422713751|ref|ZP_16770500.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422716430|ref|ZP_16773136.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|227175936|gb|EEI56908.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|315575268|gb|EFU87459.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315581351|gb|EFU93542.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
Length = 593
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
Length = 784
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 153/326 (46%), Gaps = 19/326 (5%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G + T + ++NG ++ + +HYPR W + I K G++ + VFWN+HE
Sbjct: 27 GGDFTAGKNTFLLNGQPFVVKAAELHYPRIPRPYWDQRIKMCKALGMNTICLYVFWNIHE 86
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
Q ++DF+G D+ F + Q G+YV +R GP++ EW GGLP+WL I R D
Sbjct: 87 QQESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRED 146
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRW 204
+ F +K + + + A L GGPII+ Q+ENEYG V ++ + V+
Sbjct: 147 DPYFLARVKAFEAEVGRQL--APLTIQNGGPIIMVQVENEYGSYGVNKQYVSQIRDIVKA 204
Query: 205 AAKLAVDLQTGVPWVMCKQDDAPDPVI---NACNGRQCGETFAGPNS--PDKPAIWTENW 259
+ V L W + + D ++ N G F P+ P + +E W
Sbjct: 205 SGFDKVTL-FQCDWASNFEKNGLDDLLWTMNFGTGSNIDAQFKRLKQLRPETPLMCSEFW 263
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ ++ +G R A+ + + ++ K + YM HGGT+FG A A
Sbjct: 264 SGWFDKWGARHETRPAKAMVEGINEMLS--KNISFSLYMTHGGTSFGHWAGANSPGFAPD 321
Query: 314 LTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y AP++EYG PK+ L++
Sbjct: 322 VTSYDYDAPINEYGHA-TPKFWELRK 346
>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
Length = 594
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|22760570|dbj|BAC11247.1| unnamed protein product [Homo sapiens]
Length = 636
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 131/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 233
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 234 KDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 294 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 324
>gi|257418414|ref|ZP_05595408.1| beta-galactosidase [Enterococcus faecalis T11]
gi|257160242|gb|EEU90202.1| beta-galactosidase [Enterococcus faecalis T11]
Length = 592
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 128 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 181 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 285
>gi|426371167|ref|XP_004052524.1| PREDICTED: beta-galactosidase-1-like protein 2 [Gorilla gorilla
gorilla]
Length = 678
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 131/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 105 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 164
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 165 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 222
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 223 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 275
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 276 KDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 335
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 336 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 366
>gi|384512509|ref|YP_005707602.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430358961|ref|ZP_19425649.1| beta-galactosidase [Enterococcus faecalis OG1X]
gi|327534398|gb|AEA93232.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429513519|gb|ELA03099.1| beta-galactosidase [Enterococcus faecalis OG1X]
Length = 592
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKKKGVRLRSTDPIFMTKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 128 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 181 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 285
>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 604
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 18 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 77
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 78 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 136
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 137 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 189
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 190 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQVFFEEHGKKWPLMC 246
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 247 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 303
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 304 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|312901648|ref|ZP_07760918.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
gi|311291259|gb|EFQ69815.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
Length = 593
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|256761574|ref|ZP_05502154.1| beta-galactosidase [Enterococcus faecalis T3]
gi|422736227|ref|ZP_16792491.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|256682825|gb|EEU22520.1| beta-galactosidase [Enterococcus faecalis T3]
gi|315166978|gb|EFU10995.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 593
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|31543093|ref|NP_612351.2| beta-galactosidase-1-like protein 2 precursor [Homo sapiens]
gi|74728154|sp|Q8IW92.1|GLBL2_HUMAN RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|26251705|gb|AAH40641.1| Galactosidase, beta 1-like 2 [Homo sapiens]
gi|119588247|gb|EAW67843.1| hypothetical protein BC008326, isoform CRA_b [Homo sapiens]
gi|119588248|gb|EAW67844.1| hypothetical protein BC008326, isoform CRA_b [Homo sapiens]
Length = 636
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 131/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 233
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 234 KDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 294 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 324
>gi|37182117|gb|AAQ88861.1| HYDRL-14 [Homo sapiens]
Length = 636
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 131/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 233
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 234 KDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 294 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 324
>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
Length = 604
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/337 (31%), Positives = 157/337 (46%), Gaps = 26/337 (7%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYV 202
S+N + H+ Y +++ + +L GG I++ QIENEYG E ++L +
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--VNGGNILMIQIENEYGSFGEEKAYLRAIRDLM 184
Query: 203 RWAAKLAVDLQTGVPWVMCKQDDA---PDPVINACNGRQCGETFA------GPNSPDKPA 253
A+ + PW + + D ++ G + E F + P
Sbjct: 185 IARGVTALFFTSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPL 244
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR------ 307
+ E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 245 MCMEFWDGWFNRWKEPIIKRDPQELAESVREALA--LGS-INLYMFHGGTNFGFMNGCSA 301
Query: 308 --TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 302 RGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|332264034|ref|XP_003281053.1| PREDICTED: beta-galactosidase-1-like protein 2 [Nomascus
leucogenys]
Length = 679
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 131/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 106 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 165
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 166 MAAEIGLWVILRPGPYICSELDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 223
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 224 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 276
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 277 KDGLSKGVVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 336
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 337 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 367
>gi|358415935|ref|XP_600640.6| PREDICTED: uncharacterized protein LOC522360 [Bos taurus]
Length = 1360
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/308 (32%), Positives = 135/308 (43%), Gaps = 28/308 (9%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
+ GH ++ GS+HY R W + K + G + V T V WNLHEP+ G FDFSG
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
DL FI + GL+V LR GP+I E GGLP WL P R+ N F + +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNK 440
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGV 216
Y ++ + A L QGGPII Q+ENEYG E PY+ A + Q G+
Sbjct: 441 YFDHLIP--RVALLQYLQGGPIIAVQVENEYGFFYKD--EAYMPYLLQALQ-----QRGI 491
Query: 217 PWVMCKQDDAPDPVINACNGRQCGETFAGPN----------SPDKPAIWTENWTSFYQVY 266
++ D + + G G KP + E W ++ +
Sbjct: 492 GGLLLTADSTEEVMRGHIKGVLASINMKGFKVDSFKHLYKLQRHKPILIMEFWVGWFDTW 551
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G + R+ ++ V+ FI G N YM+HGGTNFG A V T Y
Sbjct: 552 GIDHRVMGVNEVEKSVSEFI--RYGISFNVYMFHGGTNFGFMNGATSFEKHRGVTTSYDY 609
Query: 320 QAPLDEYG 327
A L E G
Sbjct: 610 DAVLTEAG 617
>gi|397699203|ref|YP_006536991.1| beta-galactosidase [Enterococcus faecalis D32]
gi|397335842|gb|AFO43514.1| beta-galactosidase [Enterococcus faecalis D32]
Length = 593
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 92/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
Length = 592
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 157/665 (23%), Positives = 257/665 (38%), Gaps = 123/665 (18%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ +++G + SGSIHY R P+ W + K K G + V+T + WN+ EP+ G+F F
Sbjct: 9 TFLLDGKPFQIISGSIHYFRVVPEYWQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFD 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D +F+ Q GLY +R P+I EW GGLP W+ VPG+ R NEP+ +++
Sbjct: 69 GLCDFEKFLDLAQKLGLYAIVRPSPYICAEWELGGLPSWIFTVPGLEPRCKNEPYYQNVR 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y +++ + ++ +GG IIL QIENEYG + K Y+ + L +
Sbjct: 129 DYYKVLLPRLVNHQI--DKGGNIILMQIENEYG-----YYGKDMSYMHFLEGLMREGGIT 181
Query: 216 VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSP--------------DKPAIWTENWTS 261
VP+V + C+G F P P + E W
Sbjct: 182 VPFVTSDGPWGKMFIHGQCDGALPTGNFGSHARPLFANMKRMMKKTGNRGPLMCMEFWIG 241
Query: 262 FYQVYGDEAR-----IRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-RTASAYV-- 313
++ +G++ R+ +D+ Y +K VN+YM+HGGTNFG S Y
Sbjct: 242 WFDAWGNKEHKTSKLKRNIKDLNYM-------LKKGNVNFYMFHGGTNFGFMNGSNYFTK 294
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFI 369
T Y APL E G + + + S +K + +E +
Sbjct: 295 LTPDTTSYDYDAPLSEDGKITE----KYRTFQSIIK--------------KYRDFEEMPL 336
Query: 370 FQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEE 429
+ A V K SI + T+A AK SVE+
Sbjct: 337 STKIEQKAYGKVKAGK------------------SIKLFDILDTLA--VAKTSSVEK--- 373
Query: 430 YKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHA 489
L M + Y+ Y + P+ S + LK+ +H
Sbjct: 374 ------------------LTGMEASGQDYGYILYKTKV---PAASNT-LKIEDGLDRIHE 411
Query: 490 FINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNV 549
F NGE K + K L + + ++LL +G + + + G+
Sbjct: 412 FKNGELKAVLFDKETAKPVELT-----LASGDELTLLVENLGRVNFATKIPFQRKGILGR 466
Query: 550 SIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDA 609
+ K L D++ ++ L + D+ + G T T + D
Sbjct: 467 VLADEKPLTDWTYYNLNLDKAQLSK-----IDWNKAEEGIAGTGKITSPSFTHMTLMVDK 521
Query: 610 PTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLV 669
+ ++ GKG ++NG ++GR+W + P + Y +P LK N ++
Sbjct: 522 ACDT---YLDFTGWGKGCIFLNGFNLGRFW------EIGPQKRLY-VPAPLLKEGENEII 571
Query: 670 LLEEE 674
+ E E
Sbjct: 572 IFETE 576
>gi|119588246|gb|EAW67842.1| hypothetical protein BC008326, isoform CRA_a [Homo sapiens]
Length = 643
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/297 (32%), Positives = 139/297 (46%), Gaps = 25/297 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL F+
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFVL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 233
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 234 KDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLR 330
+ ++ V+ + GS +N YM+HGGTNFG A Y ++ + YG R
Sbjct: 294 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFGFMNGAMHFHDY--KSDVTSYGKAR 346
>gi|255652865|ref|NP_001157373.1| beta-galactosidase [Bombyx mori]
gi|239938036|gb|ACS36117.1| beta-galactosidase [Bombyx mori]
Length = 606
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 164/676 (24%), Positives = 269/676 (39%), Gaps = 132/676 (19%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G+N++ G +I+G + SGS+HY R W + K K GL+ V T V W+ HE
Sbjct: 3 GHNISIVGDKFMIDGKPLHIISGSLHYFRVPAVYWRDRLHKFKAAGLNTVATYVEWSYHE 62
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFW-LHDVPGIVFRS 145
P+ Q++F G RDLVRF++ GL+V LR+GP+I E GGLP+W L P I R+
Sbjct: 63 PEEKQYNFEGDRDLVRFVQTAAEVGLHVLLRVGPYICAERDLGGLPYWLLGKYPNIKLRT 122
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
++ F + + + + L GGPIIL Q+ENEYG + K
Sbjct: 123 TDKDFIAESDIWLKKLFE--QVSHLLFGNGGPIILVQVENEYGSYDSDLAYKE------K 174
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK-------------P 252
+ + G ++ D G F + P + P
Sbjct: 175 MRDLISAHVGDKALLYTTDGPSLVGAGMIPGVHATIDFGVTSQPTEQFDSLFHLRPAPGP 234
Query: 253 AIWTENWTSFYQVYGDE-ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
+ +E + + +G+ AR+ + + + + + K+ +VN+Y++ GG+NF T+ A
Sbjct: 235 LMNSEFYPGWLTHWGERMARVGTNDIVLTLRNMIVNKI---HVNFYVFFGGSNFEFTSGA 291
Query: 312 YV-------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKL 364
+T Y APL E G PK+ ++E L +NF
Sbjct: 292 NFDGTYQPDITSYDYDAPLSEAG-DPTPKYYAIRE---------------TLKQLNFV-- 333
Query: 365 QEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSV 424
D++ Y P++ + ++ + D
Sbjct: 334 -------------------DEKIEPPQPSPKGRYGAVPVAAKL-----SIMSPKGRCDLG 369
Query: 425 EQWEEYKEA-IPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSL 483
+++E+ +PT++E R+ +L + +++E VL ++
Sbjct: 370 KRYEDVSGGTLPTFEELRQRSGLVLYETTL------------------NETEGVLVLNKP 411
Query: 484 GHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMV---GLPDSGAYLE 540
++ F++G+ G H K HL + S LS++V G + G L
Sbjct: 412 RDLVFVFVDGKPQGVLSRMH--------KKYHLRISSTAGSKLSLLVENQGRINYGTLLH 463
Query: 541 RRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
R L V Y ++G K I T Y V ++ S Q
Sbjct: 464 DRKGILSEVI----------------YNNKVIGGKWSI-TGYPLETVQFNSSVSEVTQGP 506
Query: 601 TWYKTVFDAPTGSDPVAINLISMG--KGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPR 658
T+Y+ F P G P+ L + G KG WVNG ++GRYW G Q ++P
Sbjct: 507 TFYEGTFVLPEGQKPLDTFLDTTGWDKGYVWVNGHNLGRYW------PGVGPQVTLYVPG 560
Query: 659 SFL--KPTGNLLVLLE 672
+L P N+L +LE
Sbjct: 561 VWLLEAPQPNVLQILE 576
>gi|403304858|ref|XP_003942999.1| PREDICTED: beta-galactosidase-1-like protein 2 [Saimiri boliviensis
boliviensis]
Length = 636
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/273 (34%), Positives = 131/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL FI
Sbjct: 63 IFGGSIHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 MASEIGLWVILRPGPYICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 233
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 234 KDGLSKGIVHGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 294 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 324
>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
Length = 787
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 101/341 (29%), Positives = 156/341 (45%), Gaps = 37/341 (10%)
Query: 7 LCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIA 66
L + LLLT + G + T ++ ++NG ++ + +HYPR W I
Sbjct: 6 LLITALLLTFAQFASAG-----DFTVGNKTFLLNGEPFVVKAAEVHYPRIPRPYWEHRIK 60
Query: 67 KAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEW 126
K G++ + VFWN+HE + GQFDF+ D+ F + Q G+YV +R GP++ EW
Sbjct: 61 MCKALGMNTLCIYVFWNIHEQREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGPYVCAEW 120
Query: 127 GYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENE 186
GGLP+WL I R + F +K + + + A L GGPII+ Q+ENE
Sbjct: 121 EMGGLPWWLLKKKDIRLRERDPYFLERVKIFEQKVGEQL--APLTIQNGGPIIMVQVENE 178
Query: 187 YGMVEHSFLEKGPPYVR---------WAAKLAVDLQTGVPWVMCKQDDAPDPVI---NAC 234
YG S+ E PYV + KL + W + + D ++ N
Sbjct: 179 YG----SYGED-KPYVSEIRDCLRGIYGEKLTL---FQCDWSSNFERNGLDDLVWTMNFG 230
Query: 235 NGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGS 292
G FA P+ P + +E W+ ++ +G R A+D+ + ++ K
Sbjct: 231 TGANIDHEFARLKQLRPNAPLMCSEFWSGWFDKWGANHETRPAKDMVDGMDEMLS--KNI 288
Query: 293 YVNYYMYHGGTNFGRTASAYV------LTGYYDQAPLDEYG 327
+ YM HGGT+FG A A +T Y AP++EYG
Sbjct: 289 SFSLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDAPINEYG 329
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 45/208 (21%), Positives = 88/208 (42%), Gaps = 29/208 (13%)
Query: 473 DSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGL 532
D+ SVL ++ FI+ ++G ++KS L + + +L +G
Sbjct: 410 DTPSVLTLNDGHDFAQVFIDSTYIGKIDRVRNEKSLLLPA----VKKGQELKILIEAMGR 465
Query: 533 PDSGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRY 592
+ G ++ +V++ K+ G+++ ++ IFT S
Sbjct: 466 INFGRAIKDYKGITESVTLSTDKD---------GHELIWNLKRWDIFTIPDSYAAAKKAL 516
Query: 593 GSSTHQPLT--------WYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLT 644
++ LT +Y+ F+ D +N+ + GKG+ +VNG +IGR+W +
Sbjct: 517 DTAKRDSLTKMVFKGSGYYRGYFNLKRVGDTF-LNMENWGKGQVYVNGHAIGRFWS--IG 573
Query: 645 PQGTPSQSWYHIPRSFLKPTGNLLVLLE 672
PQ T ++P +LK N +V+L+
Sbjct: 574 PQQT-----LYVPGCWLKKGKNEVVVLD 596
>gi|114641374|ref|XP_001157987.1| PREDICTED: galactosidase, beta 1-like 2 isoform 2 [Pan troglodytes]
Length = 636
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/285 (32%), Positives = 135/285 (47%), Gaps = 23/285 (8%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + ++ G +F GSIHY R + W + K K GL+ + T V WNLHEP+ +FD
Sbjct: 51 GWNFVLEGSTFWIFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERSKFD 110
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG DL F+ GL+V LR GP+I E GGLP WL PG+ R+ + F
Sbjct: 111 FSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEA 170
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
+ Y + M + L +GGPII Q+ENEYG K P Y+ + K D
Sbjct: 171 VDLYFDHL--MSRVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED-- 221
Query: 214 TGVPWVMCKQDDAP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTS 261
G+ ++ D+ V+ N + E TF +P + E WT
Sbjct: 222 RGIVELLLTSDNKDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTG 281
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
++ +G I + ++ V+ + GS +N YM+HGGTNFG
Sbjct: 282 WFDSWGGPHNILDSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 324
>gi|156375241|ref|XP_001629990.1| predicted protein [Nematostella vectensis]
gi|156217002|gb|EDO37927.1| predicted protein [Nematostella vectensis]
Length = 578
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/304 (32%), Positives = 148/304 (48%), Gaps = 32/304 (10%)
Query: 58 PQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLR 117
P+ W + K K GL+ V+T V WNLHE F F D+V+F+ Q GL+V +R
Sbjct: 2 PEYWADRLKKLKAMGLNTVETYVAWNLHEQVKENFKFKDEVDIVKFVNLAQELGLHVIIR 61
Query: 118 IGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGP 177
GP+I EW GGLP WL + P + RS PF +++Y + + ++ + S+GGP
Sbjct: 62 PGPYICSEWDLGGLPSWLLNDPNMRLRSTYGPFMEAVEKYFSKLFALLTPLQF--SRGGP 119
Query: 178 IILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGR 237
II Q+ENEY V+ E Y+ KL L+ G ++ DD +
Sbjct: 120 IIAWQVENEYASVQE---EVDNHYMELLHKLM--LKNGATELLFTSDDV--GYTKRYPIK 172
Query: 238 QCGETFAGPN---------SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAK 288
G + N PDKP + TE W+ ++ +G++ + + E + I
Sbjct: 173 LDGGKYMSFNKWFCLFLHFQPDKPIMVTEYWSGWFDHWGEKHHVLNTERKMINEVKDILD 232
Query: 289 MKGSYVNYYMYHGGTNFG-----RTASAYVLTGY------YD-QAPLDEYGLLRQPKWGH 336
M G+ +N+YM+HGGTNFG TA + GY YD APL E G + PK+
Sbjct: 233 M-GASINFYMFHGGTNFGFMNGANTAGNRIDDGYQPDVTSYDYDAPLSEAGDI-TPKYKA 290
Query: 337 LKEL 340
L++L
Sbjct: 291 LRKL 294
>gi|3025876|gb|AAC12775.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 662
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 152/331 (45%), Gaps = 31/331 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HEPQP
Sbjct: 29 IDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 88
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG +D+ FIK GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 89 GQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 148
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ Y+R+ KL
Sbjct: 149 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL- 201
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS-------------PDKPAI 254
G ++ D A + + A G F GP + P P +
Sbjct: 202 FHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDF-GPGANITAAFQIQRKSEPKGPLV 260
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G E +A + +A G+ VN YM+ GGTNF A +
Sbjct: 261 NSEFYTGWLDHWGQPHSTVRTEVVASSLHDILA--HGANVNLYMFIGGTNFAYWNGANMP 318
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E G L + K+ L+E+
Sbjct: 319 YQAQPTSYDYDAPLSEAGDLTE-KYFALREV 348
>gi|357409426|ref|YP_004921162.1| glycoside hydrolase 35 [Streptomyces flavogriseus ATCC 33331]
gi|320006795|gb|ADW01645.1| glycoside hydrolase family 35 [Streptomyces flavogriseus ATCC
33331]
Length = 628
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 155/330 (46%), Gaps = 39/330 (11%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN---LHEPQP 89
DGR L G + SGS+HY R P +W I + + GL+ V T V WN LHE +
Sbjct: 37 DGR-LYRGGVPHRILSGSLHYFRVHPDLWQDRIRRIADLGLNTVDTYVPWNFHQLHEDRS 95
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
+FD G RDL RFI+ V +GL V +R GP+I EW GGLP WL + RS +
Sbjct: 96 PRFD--GWRDLERFIRTVGEEGLDVVVRPGPYICAEWSNGGLPSWL-TAKDLAIRSSDPA 152
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAK 207
F + R+ ++ + A L AS+GGP++ Q+ENE+G +H+ YVRW
Sbjct: 153 FTTAVARWFDHLIPRL--ATLQASRGGPVVAVQVENEFGSYGDDHA-------YVRWCRD 203
Query: 208 LAVD--------LQTGVPWVMCKQDDAPDPVINACNGR--QCGETFAGPNSPDKPAIWTE 257
V+ G +M P + A G + P++P + E
Sbjct: 204 ALVERGIGELLFTADGPTELMLDGGTLPGTLTAATLGSKPEAARRLLVSRRPEEPFLVAE 263
Query: 258 NWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY----- 312
W ++ +G+ +R E A H I GS V+ YM HGGTNFG A A
Sbjct: 264 FWNGWFDHWGERHHVRGVES-AVHTLRGIIADHGS-VSIYMAHGGTNFGLWAGANESDGR 321
Query: 313 ---VLTGYYDQAPLDEYGLLRQPKWGHLKE 339
V+T Y AP+ E G L PK+ ++E
Sbjct: 322 LEPVVTSYDSDAPIAEDGRL-TPKFFAMRE 350
>gi|26345448|dbj|BAC36375.1| unnamed protein product [Mus musculus]
Length = 682
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 151/330 (45%), Gaps = 19/330 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 35 LDYSRDRFLKDGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 95 GQYEFSGDRDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAA 206
+ + ++ +++ MK L GGPII Q+ENEYG ++ +L R+
Sbjct: 155 YLVAVDKWLAVLLPKMKP--LLYQNGGPIITVQVENEYGSYFACDYDYLRFLVHRFRYHL 212
Query: 207 KLAVDLQT--GVPWVMCKQDDAPD--PVINACNGRQCGETFAGPN--SPDKPAIWTENWT 260
V L T G M K D ++ G + F P P I +E +T
Sbjct: 213 GNDVILFTTDGASEKMLKCGTLQDLYATVDFGTGNNITQAFLVQRKFEPKGPLINSEFYT 272
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF-----GRTASAYVLT 315
+ +G + +A +L+ +G+ VN YM+ GGTNF T T
Sbjct: 273 GWLDHWGKPHSTVKTKTLA--TSLYNLLARGANVNLYMFIGGTNFAYWNGANTPYEPQPT 330
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
Y APL E G L + K+ L+E+ K
Sbjct: 331 SYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
Length = 594
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 152/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--VNGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|26339346|dbj|BAC33344.1| unnamed protein product [Mus musculus]
Length = 756
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 151/330 (45%), Gaps = 19/330 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 35 LDYSRDRFLKDGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 95 GQYEFSGDRDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAA 206
+ + ++ +++ MK L GGPII Q+ENEYG ++ +L R+
Sbjct: 155 YLVAVDKWLAVLLPKMKP--LLYQNGGPIITVQVENEYGSYFACDYDYLRFLVHRFRYHL 212
Query: 207 KLAVDLQT--GVPWVMCKQDDAPD--PVINACNGRQCGETFAGPN--SPDKPAIWTENWT 260
V L T G M K D ++ G + F P P I +E +T
Sbjct: 213 GNDVILFTTDGASEKMLKCGTLQDLYATVDFGTGNNITQAFLVQRKFEPKGPLINSEFYT 272
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF-----GRTASAYVLT 315
+ +G + +A +L+ +G+ VN YM+ GGTNF T T
Sbjct: 273 GWLDHWGKPHSTVKTKTLA--TSLYNLLARGANVNLYMFIGGTNFAYWNGANTPYEPQPT 330
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
Y APL E G L + K+ L+E+ K
Sbjct: 331 SYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|313149603|ref|ZP_07811796.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
gi|313138370|gb|EFR55730.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
Length = 628
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 145/331 (43%), Gaps = 43/331 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGNLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNGRQCGETF----------AGPNSPDK--PAIWTEN 258
VP + A + NG E GP + P W +
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDGKGPYMVAEFYPG-WLSH 273
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W + G R E + F N+YM HGGTNFG T+ A
Sbjct: 274 WAEPFPQVGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRD 324
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 325 IQPDLTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|297483826|ref|XP_002693891.1| PREDICTED: galactosidase, beta 1-like 3 [Bos taurus]
gi|296479482|tpg|DAA21597.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 899
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/308 (32%), Positives = 135/308 (43%), Gaps = 28/308 (9%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
+ GH ++ GS+HY R W + K + G + V T V WNLHEP+ G FDFSG
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
DL FI + GL+V LR GP+I E GGLP WL P R+ N F + +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNK 440
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGV 216
Y ++ + A L QGGPII Q+ENEYG E PY+ A + Q G+
Sbjct: 441 YFDHLIP--RVALLQYLQGGPIIAVQVENEYGFFYKD--EAYMPYLLQALQ-----QRGI 491
Query: 217 PWVMCKQDDAPDPVINACNGRQCGETFAGPN----------SPDKPAIWTENWTSFYQVY 266
++ D + + G G KP + E W ++ +
Sbjct: 492 GGLLLTADSTEEVMRGHIKGVLASINMKGFKVDSFKHLYKLQRHKPILIMEFWVGWFDTW 551
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G + R+ ++ V+ FI G N YM+HGGTNFG A V T Y
Sbjct: 552 GIDHRVMGVNEVEKSVSEFI--RYGISFNVYMFHGGTNFGFMNGATSFEKHRGVTTSYDY 609
Query: 320 QAPLDEYG 327
A L E G
Sbjct: 610 DAVLTEAG 617
>gi|332187631|ref|ZP_08389367.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
gi|332012379|gb|EGI54448.1| glycosyl hydrolases 35 family protein [Sphingomonas sp. S17]
Length = 613
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 161/353 (45%), Gaps = 20/353 (5%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
++ L+ TI + G ++ T G + +G + S +HY R W +
Sbjct: 7 MMVAASALVPTIASAQGTTPA-HSFTVQGNGFLKDGKPYQVISAEMHYTRIPRAYWRDRL 65
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
KAK GL+ + T FWN HEP+PG +DF+G+ D+ FI++ QA+GL V LR GP++ E
Sbjct: 66 RKAKAMGLNTITTYSFWNAHEPRPGTYDFTGQNDIAAFIRDAQAEGLDVILRPGPYVCAE 125
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W GG P WL ++ RS + + + R+ + +K L GGPI+ Q+EN
Sbjct: 126 WELGGYPSWLLKDRNLLLRSTDPKYTAAVDRWLARLGQEVKP--LLLRNGGPIVAIQLEN 183
Query: 186 EYGMV--EHSFLEK-GPPYVRWAAKLAVDLQTGVPWVMCKQD--DAPDPVINACNGRQCG 240
EYG + ++LE Y R V + + K + P V G Q
Sbjct: 184 EYGAFGSDKAYLEGLKASYQRAGLADGVLFTSNQAGDLAKGSLPEVPSVVNFGSGGAQNA 243
Query: 241 ETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYH 300
PD + E W ++ +G++ + A + + +G V+ YM+H
Sbjct: 244 VAKLEAFRPDGLRMVGEYWAGWFDKWGEDHHETDGKKEAEELGFMLK--RGYSVSLYMFH 301
Query: 301 GGTNFG--RTASAYVLTGY------YD-QAPLDEYGLLRQPKWGHLKELHSAV 344
GGT FG A ++ T Y YD APLDE G R K+G L + + V
Sbjct: 302 GGTTFGWMNGADSHTGTDYHPDTTSYDYNAPLDEAGNPRY-KYGLLASVIAEV 353
>gi|336424850|ref|ZP_08604882.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336013315|gb|EGN43197.1| hypothetical protein HMPREF0994_00888 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 596
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 133/286 (46%), Gaps = 30/286 (10%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+NG + SG IHY R P+ W + K KE G + V+T + WN+HEP G+FDF G
Sbjct: 16 LNGEPFQIISGGIHYFRILPEYWEDRLQKLKELGCNTVETYIPWNMHEPVKGKFDFYGEH 75
Query: 99 -----DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
D+V F++ Q GL+V LR P+I EW +GGLPFWL + R+ +E + H
Sbjct: 76 VHGMLDVVSFVRTAQRLGLWVILRPSPYICAEWDFGGLPFWLMAGEEMDLRTSDERYLRH 135
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
++ Y ++ ++ A L QGGP+++ Q+ENEYG + Y+ + +
Sbjct: 136 VRDYYDRLMPLL--APLQIDQGGPVLMLQVENEYGSFGND-----KKYLESLRDMMRERG 188
Query: 214 TGVPWVMCKQDDAPD-------------PVINACNGRQCGETFAGPNSPDKPAIWTENWT 260
VP D PD P N +G + + P + TE W
Sbjct: 189 ITVPLFAS---DGPDHNMLANTKTEGIFPTANFGSGASKAFSILEEYTDGGPCMCTEFWI 245
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
++ + DE + A I ++ VN YM+ GGTNFG
Sbjct: 246 GWFDAWHDEVHHEGDTETAVKELENILELGN--VNIYMFEGGTNFG 289
>gi|148677363|gb|EDL09310.1| galactosidase, beta 1, isoform CRA_b [Mus musculus]
Length = 669
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 151/330 (45%), Gaps = 19/330 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 50 LDYSRDRFLKDGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 109
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 110 GQYEFSGDRDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDPD 169
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAA 206
+ + ++ +++ MK L GGPII Q+ENEYG ++ +L R+
Sbjct: 170 YLVAVDKWLAVLLPKMKP--LLYQNGGPIITVQVENEYGSYFACDYDYLRFLVHRFRYHL 227
Query: 207 KLAVDLQT--GVPWVMCKQDDAPD--PVINACNGRQCGETFAGPN--SPDKPAIWTENWT 260
V L T G M K D ++ G + F P P I +E +T
Sbjct: 228 GNDVILFTTDGASEKMLKCGTLQDLYATVDFGTGNNITQAFLVQRKFEPKGPLINSEFYT 287
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF-----GRTASAYVLT 315
+ +G + +A +L+ +G+ VN YM+ GGTNF T T
Sbjct: 288 GWLDHWGKPHSTVKTKTLA--TSLYNLLARGANVNLYMFIGGTNFAYWNGANTPYEPQPT 345
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
Y APL E G L + K+ L+E+ K
Sbjct: 346 SYDYDAPLSEAGDLTK-KYFALREVIQMFK 374
>gi|83415088|ref|NP_001032730.1| beta-galactosidase precursor [Canis lupus familiaris]
gi|94730362|sp|Q9TRY9.3|BGAL_CANFA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|76470548|gb|ABA43388.1| lysosomal beta-galactosidase [Canis lupus familiaris]
Length = 668
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/331 (32%), Positives = 152/331 (45%), Gaps = 31/331 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HEPQP
Sbjct: 35 IDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG +D+ FIK GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 95 GQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ Y+R+ KL
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL- 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS-------------PDKPAI 254
G ++ D A + + A G F GP + P P +
Sbjct: 208 FHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDF-GPGANITAAFQIQRKSEPKGPLV 266
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G E +A + +A G+ VN YM+ GGTNF A +
Sbjct: 267 NSEFYTGWLDHWGQPHSTVRTEVVASSLHDILA--HGANVNLYMFIGGTNFAYWNGANMP 324
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E G L + K+ L+E+
Sbjct: 325 YQAQPTSYDYDAPLSEAGDLTE-KYFALREV 354
>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
Length = 604
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/345 (31%), Positives = 159/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V W+L
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|134096920|ref|YP_001102581.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|291006638|ref|ZP_06564611.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
gi|133909543|emb|CAL99655.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
Length = 594
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 101/338 (29%), Positives = 158/338 (46%), Gaps = 46/338 (13%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T G +++G + +G +HY R+ P W + + + GL+ V T V WN HEP+
Sbjct: 17 LTVRGNEFLLDGEPFRIIAGEMHYFRTHPDQWRNRLDRMRALGLNSVDTYVAWNFHEPRR 76
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+ DF+G RD+VRF++ GL V +R GP+I EW +GGLP WL S N P
Sbjct: 77 GEVDFTGWRDVVRFVETAAEAGLKVIIRPGPYICAEWDFGGLPAWL-------LESGNPP 129
Query: 150 FKFHMKRYATMIVN-----MMKAARLYASQGGPIILSQIENEYG----------MVEHSF 194
+ Y + + + + A L A++GGP++ Q+ENEYG +
Sbjct: 130 LRCSDPAYTELTLRWFDELLPRLAPLQATRGGPVLAFQVENEYGSYGNDQTHLEQLRAGM 189
Query: 195 LEKGPPYVRWAAKLAVD--LQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK 251
LE+G + + + D L+ G +P + + A DP R+ P+
Sbjct: 190 LERGIDSLLFCSNGPSDYMLRGGNLPDTLATVNFAGDPTAPFEALREY--------QPEG 241
Query: 252 PAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
P TE W ++ +G+E + A HV +A G+ V+ YM GGTNFG A A
Sbjct: 242 PLWCTEFWDGWFDHWGEEHHTTDPVETAGHVDRMLA--AGASVSLYMAVGGTNFGWWAGA 299
Query: 312 Y----------VLTGYYDQAPLDEYGLLRQPKWGHLKE 339
+T Y +P+ E G L + K+ ++E
Sbjct: 300 NYDTSKDQYQPTITSYDYDSPIGEAGELTE-KFQRIRE 336
>gi|375360076|ref|YP_005112848.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
gi|383119863|ref|ZP_09940600.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|251944025|gb|EES84544.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
gi|301164757|emb|CBW24316.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
Length = 628
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 146/330 (44%), Gaps = 41/330 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNG-------RQCGETFAGPNSPDKPAI----WTENW 259
VP + A + NG ++ + + P A W +W
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW 274
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ G R E + F N+YM HGGTNFG T+ A
Sbjct: 275 AEPFPQIGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y AP+ E G + PK+ ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|60683238|ref|YP_213382.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
gi|60494672|emb|CAH09473.1| putative exported beta-galactosidase [Bacteroides fragilis NCTC
9343]
Length = 628
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 146/330 (44%), Gaps = 41/330 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNG-------RQCGETFAGPNSPDKPAI----WTENW 259
VP + A + NG ++ + + P A W +W
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW 274
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ G R E + F N+YM HGGTNFG T+ A
Sbjct: 275 AEPFPQIGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y AP+ E G + PK+ ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|6753190|ref|NP_033882.1| beta-galactosidase precursor [Mus musculus]
gi|114944|sp|P23780.1|BGAL_MOUSE RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|192187|gb|AAA37293.1| beta-galactosidase [Mus musculus]
gi|74143070|dbj|BAE42549.1| unnamed protein product [Mus musculus]
Length = 647
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 151/330 (45%), Gaps = 19/330 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 35 LDYSRDRFLKDGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 95 GQYEFSGDRDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAA 206
+ + ++ +++ MK L GGPII Q+ENEYG ++ +L R+
Sbjct: 155 YLVAVDKWLAVLLPKMKP--LLYQNGGPIITVQVENEYGSYFACDYDYLRFLVHRFRYHL 212
Query: 207 KLAVDLQT--GVPWVMCKQDDAPD--PVINACNGRQCGETFAGPN--SPDKPAIWTENWT 260
V L T G M K D ++ G + F P P I +E +T
Sbjct: 213 GNDVILFTTDGASEKMLKCGTLQDLYATVDFGTGNNITQAFLVQRKFEPKGPLINSEFYT 272
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF-----GRTASAYVLT 315
+ +G + +A +L+ +G+ VN YM+ GGTNF T T
Sbjct: 273 GWLDHWGKPHSTVKTKTLA--TSLYNLLARGANVNLYMFIGGTNFAYWNGANTPYEPQPT 330
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
Y APL E G L + K+ L+E+ K
Sbjct: 331 SYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|395846556|ref|XP_003795969.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Otolemur
garnettii]
Length = 633
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 96/296 (32%), Positives = 134/296 (45%), Gaps = 23/296 (7%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G++ I+ +F GSIHY R + W + K K GL+ + T V WNLHEPQ G+FD
Sbjct: 51 GQNFILEDAPFWIFGGSIHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPQRGKFD 110
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG DL F+ GL+V LR GP+I E GGLP WL PG+ R+ + F
Sbjct: 111 FSGNLDLEAFVLLAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEA 170
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
+ Y + M + L GGPII Q+ENEYG K P Y+ + K D
Sbjct: 171 VDLYFDHL--MSRVVPLQYKHGGPIIAVQVENEYGS-----YYKDPAYMPYVKKALED-- 221
Query: 214 TGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPD------------KPAIWTENWTS 261
G+ ++ D+ +G P +P + TE WT
Sbjct: 222 RGIVELLFTSDNKDGLRKGIIHGVLATINLQSPQELQLLTTLLVSIQGVQPKMVTEYWTG 281
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY 317
++ +G I + ++ V+ + GS +N YM+HGGTNFG A Y
Sbjct: 282 WFDSWGGPHNILDSSEVLKTVSAIVD--TGSSINLYMFHGGTNFGFINGAMHFQDY 335
>gi|444724418|gb|ELW65022.1| Beta-galactosidase-1-like protein 2 [Tupaia chinensis]
Length = 656
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/306 (31%), Positives = 142/306 (46%), Gaps = 25/306 (8%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G++ ++ +F GSIHY R + W + K K G++ + T V WNLHEP+ G+FD
Sbjct: 67 GQNFMLEDSTFWIFGGSIHYFRVPKEYWRDRLLKMKACGMNTLTTYVPWNLHEPERGKFD 126
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG DL FI GL+V LR GP++ E GGLP WL PG+ R+ + F
Sbjct: 127 FSGNLDLEAFILLAAELGLWVILRPGPYVCSEIDLGGLPSWLLQDPGMRLRTTYKGFTEA 186
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
+ Y + M + L GGPII Q+ENEYG K P Y+ + K D
Sbjct: 187 VDLYFDHL--MSRVVPLQYKHGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED-- 237
Query: 214 TGVPWVMCKQDD--------APDPV----INACNGRQCGETFAGPNSPDKPAIWTENWTS 261
G+ ++ D+ P + + + + Q TF +P + E WT
Sbjct: 238 RGIVELLLTSDNKDGLSKGVVPGALATINLQSQHELQLLNTFLVNAQVVQPKMVMEYWTG 297
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQA 321
++ +G I + ++ V+ + GS +N YM+HGGTNFG A Y A
Sbjct: 298 WFDSWGGPHHILDSSEVLKTVSALVD--AGSSINLYMFHGGTNFGFMNGAMHFHDY--SA 353
Query: 322 PLDEYG 327
+ YG
Sbjct: 354 DVTSYG 359
>gi|395816938|ref|XP_003781939.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Otolemur
garnettii]
Length = 669
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 146/317 (46%), Gaps = 18/317 (5%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HEPQ
Sbjct: 33 KIDYSRDRFLKDGQPFRYISGSIHYSRLPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 92
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG++ FS D+ FI+ GL V LR GP+I EW GGLP WL + ++ RS +
Sbjct: 93 PGKYQFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKESMILRSSDP 152
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWA 205
+ + ++ +++ MK L GGPII Q+ENEYG +H ++ R+
Sbjct: 153 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIISVQVENEYGSYFTCDHDYMRFLLKRFRYY 210
Query: 206 AKLAVDLQT--GV--PWVMCKQDDAPDPVINACNGRQCGETFA--GPNSPDKPAIWTENW 259
V L T G+ ++ C ++ G F + P P I +E +
Sbjct: 211 LGDDVVLFTTDGIFEKYLNCGALQGLYATVDFGTGVNITAAFKLQRKSEPKGPLINSEFY 270
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-----L 314
T + +G ED+A+ +LF +G+ VN YM+ GGTNF A +
Sbjct: 271 TGWLDHWGQPHSTVKTEDVAF--SLFDILARGASVNLYMFTGGTNFAYWNGANIPYSAQP 328
Query: 315 TGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 329 TSYDYDAPLSEAGDLTE 345
>gi|336412039|ref|ZP_08592497.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|423261296|ref|ZP_17242197.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|423267821|ref|ZP_17246801.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|423272270|ref|ZP_17251238.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|423276726|ref|ZP_17255658.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|423283105|ref|ZP_17261990.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
gi|335939211|gb|EGN01088.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
gi|387774329|gb|EIK36442.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
CL07T00C01]
gi|392695462|gb|EIY88674.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
CL05T00C42]
gi|392695591|gb|EIY88799.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
CL07T12C05]
gi|392696055|gb|EIY89256.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
CL05T12C13]
gi|404581379|gb|EKA86078.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
615]
Length = 628
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 146/330 (44%), Gaps = 41/330 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNG-------RQCGETFAGPNSPDKPAI----WTENW 259
VP + A + NG ++ + + P A W +W
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW 274
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ G R E + F N+YM HGGTNFG T+ A
Sbjct: 275 AEPFPQIGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y AP+ E G + PK+ ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
Length = 628
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 146/330 (44%), Gaps = 41/330 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNG-------RQCGETFAGPNSPDKPAI----WTENW 259
VP + A + NG ++ + + P A W +W
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW 274
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ G R E + F N+YM HGGTNFG T+ A
Sbjct: 275 AEPFPQIGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y AP+ E G + PK+ ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
Length = 673
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 159/341 (46%), Gaps = 28/341 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+G + +G SGSIHY R W + K K GL+ ++T V WN HEP P
Sbjct: 63 IDYEGDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLFKMKMAGLNAIETYVPWNFHEPFP 122
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG +DL F++ V GL V LR GP+I EW GGLP WL + I RS +
Sbjct: 123 GQYQFSGEQDLEYFLQLVHEVGLLVILRPGPYICAEWDMGGLPVWLLEKKSIFLRSSDPD 182
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK LY + GGPII Q+ENEYG S+ Y+R+ K+
Sbjct: 183 YLKAVDKWLEVLLPKMK-PYLYQN-GGPIITVQVENEYG----SYFACDYNYLRFLLKV- 235
Query: 210 VDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--------------PDKPAIW 255
G V+ D A + + + T S P P +
Sbjct: 236 FRQHLGEEVVLFTTDGAGENYLKCGTLQDLYATVDFGTSSNITQAFMIQRKVEPKGPLVN 295
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G+ + S ++I + ++ +G+ VN YM+ GGTNFG A +
Sbjct: 296 SEFYTGWLDHWGESHQTVSTKNIVASLTDMLS--RGANVNLYMFIGGTNFGFWNGANMPY 353
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPM 351
T Y APL E G L + + + + KL P+
Sbjct: 354 LPQPTSYDYDAPLSEAGDLTEKYYAVREAIGKFEKLPEGPI 394
>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 11122]
Length = 613
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 149/358 (41%), Gaps = 39/358 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ
Sbjct: 31 NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQ 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQFDFSG D+ F++E AQGL V LR GP+ EW GG P WL I RS +
Sbjct: 91 QGQFDFSGNNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDP 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F + Y + N ++ L GGPII Q+ENEYG +H+++ A
Sbjct: 151 RFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------AD 199
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAI 254
A+ ++ G + D D + N P PD+P +
Sbjct: 200 NRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRM 259
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
E W ++ +G A A + +G N YM+ GGT+FG A
Sbjct: 260 VGEYWAGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFGFMNGANFQ 317
Query: 314 ----------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 318 NNPSDHYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGIQPPALPATIATTTL 374
>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 604
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 109/345 (31%), Positives = 158/345 (45%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++N + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNDQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|192185|gb|AAA37292.1| acid beta-galactosidase [Mus musculus]
gi|148677364|gb|EDL09311.1| galactosidase, beta 1, isoform CRA_c [Mus musculus]
Length = 647
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 151/330 (45%), Gaps = 19/330 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 35 LDYSRDRFLKDGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 95 GQYEFSGDRDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAA 206
+ + ++ +++ MK L GGPII Q+ENEYG ++ +L R+
Sbjct: 155 YLVAVDKWLAVLLPKMKP--LLYQNGGPIITVQVENEYGSYFACDYDYLRFLVHRFRYHL 212
Query: 207 KLAVDLQT--GVPWVMCKQDDAPD--PVINACNGRQCGETFAGPN--SPDKPAIWTENWT 260
V L T G M K D ++ G + F P P I +E +T
Sbjct: 213 GNDVILFTTDGASEKMLKCGTLQDLYATVDFGTGNNITQAFLVQRKFEPKGPLINSEFYT 272
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF-----GRTASAYVLT 315
+ +G + +A +L+ +G+ VN YM+ GGTNF T T
Sbjct: 273 GWLDHWGKPHSTVKTKTLA--TSLYNLLARGANVNLYMFIGGTNFAYWNGANTPYEPQPT 330
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
Y APL E G L + K+ L+E+ K
Sbjct: 331 SYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|424665121|ref|ZP_18102157.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
gi|404574985|gb|EKA79730.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
616]
Length = 628
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 145/331 (43%), Gaps = 43/331 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNGRQCGETF----------AGPNSPDK--PAIWTEN 258
VP + A + NG E GP + P W +
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDGKGPYMVAEFYPG-WLSH 273
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W + G R E + F N+YM HGGTNFG T+ A
Sbjct: 274 WAEPFPQVGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRD 324
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 325 IQPDLTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|22137334|gb|AAH28875.1| Galactosidase, beta 1 [Mus musculus]
Length = 647
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 151/330 (45%), Gaps = 19/330 (5%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 35 LDYSRDRFLKDGQPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 95 GQYEFSGDRDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAA 206
+ + ++ +++ MK L GGPII Q+ENEYG ++ +L R+
Sbjct: 155 YLVAVDKWLAVLLPKMKP--LLYQNGGPIITVQVENEYGSYFACDYDYLRFLVHRFRYHL 212
Query: 207 KLAVDLQT--GVPWVMCKQDDAPD--PVINACNGRQCGETFAGPN--SPDKPAIWTENWT 260
V L T G M K D ++ G + F P P I +E +T
Sbjct: 213 GNDVILFTTDGASEKMLKCGTLQDLYATVDFGTGNNITQAFLVQRKFEPKGPLINSEFYT 272
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF-----GRTASAYVLT 315
+ +G + +A +L+ +G+ VN YM+ GGTNF T T
Sbjct: 273 GWLDHWGKPHSTVKTKTLA--TSLYNLLARGANVNLYMFIGGTNFAYWNGANTPYEPQPT 330
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
Y APL E G L + K+ L+E+ K
Sbjct: 331 SYDYDAPLSEAGDLTK-KYFALREVIQMFK 359
>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
Length = 594
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 103/327 (31%), Positives = 151/327 (46%), Gaps = 25/327 (7%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V WNLHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDL 212
Y +++ + +L GG I++ QIENEYG E ++L + A+
Sbjct: 127 AEYYDVLMEKIVPHQL--VNGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFF 184
Query: 213 QTGVPWVMCKQDDA---PDPVINACNGRQCGETFA------GPNSPDKPAIWTENWTSFY 263
+ PW + + D ++ G + E F + P + E W ++
Sbjct: 185 TSDGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWF 244
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR--------TASAYVLT 315
+ + R +++A V +A GS +N YM+HGGTNFG T +T
Sbjct: 245 NRWKEPIIKRDPQELAESVREALA--LGS-INLYMFHGGTNFGFMNGCSARGTIDLPQIT 301
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHS 342
Y APLDE G + + K LH
Sbjct: 302 SYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
Length = 613
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 162/656 (24%), Positives = 258/656 (39%), Gaps = 99/656 (15%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ GQFD
Sbjct: 36 GTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 95
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F++E AQGL V LR GP+ EW GG P WL I RS + F
Sbjct: 96 FSGNNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAA 155
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFL-EKGPPYVRWAAKLAV 210
+ Y + ++ L GGPII Q+ENEYG +H+++ + YV+ A+
Sbjct: 156 SQAYLDAVAKQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL 213
Query: 211 DLQTGVPWVMCKQDDAPD--PVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQVY 266
L T M PD V+N G + F PD+P + E W ++ +
Sbjct: 214 -LFTSDGAEMLANGTLPDTLAVVNFAPG-EAKSAFDKLIAFRPDQPRMVGEYWAGWFDHW 271
Query: 267 GDEARIRSAEDIAYHVALFIAKMK-GSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDE 325
G + +A D F ++ G N YM+ GGT+FG ++ + P D
Sbjct: 272 G---KPHAATDATQQAEEFEWILRQGHSANLYMFIGGTSFG-----FMNGANFQNNPSDH 323
Query: 326 YGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDK 385
Y S +A + + A F + +D
Sbjct: 324 Y------------------------------APQTTSYDYDAIVDEAGRPTAKFALMRDA 353
Query: 386 RNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSLRAN 445
T + P++ + LPD T +S W+ I D +
Sbjct: 354 IARVTGVQPPALPA--PIATTTLPD-------TPLRESASLWDNLPAPI-AIDTPQPMEH 403
Query: 446 FLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSD 505
F DY + +R + L + + V H +++ VGS +
Sbjct: 404 F----------GQDYGYILYRTTVT-GPRKGPLYLGDVRDVAHVYLDQTPVGSVERRLQQ 452
Query: 506 KSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSSFSW 565
S T V + G + + +L G + G + AGL + + G ++L + +F
Sbjct: 453 VSTT----VDIPAGHHTLDVLVENSGRINYGTRMADGRAGLVDPVLLGNQQLTGWQAFP- 507
Query: 566 GYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGK 625
L + T I W+R Q +++ T +D +++ + GK
Sbjct: 508 ----------LPMRTP--DSIRGWTR---KAVQGPAFHRGTVRIGTPAD-TYLDMRAFGK 551
Query: 626 GEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGI 681
G AW NG ++GR+W Q+ + P F + N +V+ + ++ P +
Sbjct: 552 GFAWANGVNLGRHW-------NIGPQTALYFPAPFQRRGDNTVVVFDLDDVATPSV 600
>gi|423280524|ref|ZP_17259436.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
gi|404583731|gb|EKA88404.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
610]
Length = 628
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 145/331 (43%), Gaps = 43/331 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNGRQCGETF----------AGPNSPDK--PAIWTEN 258
VP + A + NG E GP + P W +
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDGKGPYMVAEFYPG-WLSH 273
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV----- 313
W + G R E + F N+YM HGGTNFG T+ A
Sbjct: 274 WAEPFPQVGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRD 324
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
LT Y AP+ E G + PK+ ++ +
Sbjct: 325 IQPDLTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
Length = 604
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 109/345 (31%), Positives = 158/345 (45%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++NG + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGG NF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGINF 293
Query: 306 GR--------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G T +T Y APLDE G + + K LH
Sbjct: 294 GFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|424687003|ref|ZP_18123658.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402366194|gb|EJV00591.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
Length = 593
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V + GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLT--VGS-LNLYMFHGGTNFG 286
>gi|227517783|ref|ZP_03947832.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|424678087|ref|ZP_18114931.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681129|ref|ZP_18117923.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685648|ref|ZP_18122340.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424689662|ref|ZP_18126226.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424693525|ref|ZP_18129955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424698239|ref|ZP_18134537.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424701365|ref|ZP_18137539.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424702750|ref|ZP_18138894.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424711867|ref|ZP_18144074.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424717978|ref|ZP_18147248.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424722429|ref|ZP_18151489.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723619|ref|ZP_18152577.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733091|ref|ZP_18161660.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424746203|ref|ZP_18174452.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424755204|ref|ZP_18183090.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|227074744|gb|EEI12707.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|402351976|gb|EJU86842.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402352513|gb|EJU87362.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358223|gb|EJU92905.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402367111|gb|EJV01460.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402371797|gb|EJV05943.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402373001|gb|EJV07093.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402373959|gb|EJV08006.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402382684|gb|EJV16335.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402383232|gb|EJV16843.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402386182|gb|EJV19689.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402388743|gb|EJV22170.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402392403|gb|EJV25665.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402397550|gb|EJV30559.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402397571|gb|EJV30579.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402401167|gb|EJV33955.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 593
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V + GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLT--VGS-LNLYMFHGGTNFG 286
>gi|91078180|ref|XP_967491.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Tribolium
castaneum]
gi|270002868|gb|EEZ99315.1| beta-galactosidase-like protein [Tribolium castaneum]
Length = 630
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 168/361 (46%), Gaps = 59/361 (16%)
Query: 15 TTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLD 74
T+ G SDG N T + + L I FSG++HY R Q W + K + GL+
Sbjct: 12 TSSGISDGLSTKQTNFTLNNKPLTI-------FSGALHYFRVPQQYWRDRLRKIRAAGLN 64
Query: 75 VVQTLVFWNLHEPQPGQFDF-SGRRD------LVRFIKEVQAQGLYVCLRIGPFIEGEWG 127
V+T V WNLHEPQ G +DF G D L +F+K Q + L +R GP+I EW
Sbjct: 65 TVETYVPWNLHEPQIGIYDFGQGGSDFSEFLYLEKFLKLAQEEDLLAIVRPGPYICAEWD 124
Query: 128 YGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEY 187
+GGLP WL + R+ F H+ R+ T ++ ++ A + ++GGPI+ Q+ENEY
Sbjct: 125 FGGLPSWLLR-ENVKVRTSEPKFMSHVTRFFTRLLPILAALQF--TKGGPIVAFQVENEY 181
Query: 188 GMVEHS-----------FLEKGPPYVRWAAKLAVDLQTG-VPWVMCK---QDDAPDPVIN 232
G +++ F E G + + + + +G +P ++ QDDA + +
Sbjct: 182 GNTKNNDTEYLTNLKVLFEENGIRELLFTSDTPSNGFSGTLPGILATANFQDDARNEL-- 239
Query: 233 ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGS 292
PDKP + E WT ++ + ++ RS++ A+ L + S
Sbjct: 240 ---------ALLRKYQPDKPLMVMEYWTGWFDHWTEKHHQRSSQ--AFGAVLDEILSENS 288
Query: 293 YVNYYMYHGGTNFGRTASAYV-------------LTGYYDQAPLDEYGLLRQPKWGHLKE 339
VN YM+HGGTN+G A + T Y APL E G K+ +KE
Sbjct: 289 SVNMYMFHGGTNWGFLNGANIKDLTTDNSAYQPDTTSYDYDAPLSEAGDYTD-KYHKVKE 347
Query: 340 L 340
L
Sbjct: 348 L 348
>gi|424760912|ref|ZP_18188500.1| putative beta-galactosidase [Enterococcus faecalis R508]
gi|402402633|gb|EJV35336.1| putative beta-galactosidase [Enterococcus faecalis R508]
Length = 593
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V + GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLT--VGS-LNLYMFHGGTNFG 286
>gi|328956117|ref|YP_004373450.1| beta-galactosidase [Coriobacterium glomerans PW2]
gi|328456441|gb|AEB07635.1| Beta-galactosidase [Coriobacterium glomerans PW2]
Length = 597
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 159/371 (42%), Gaps = 43/371 (11%)
Query: 27 GNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHE 86
G++ DGR I SG+IHY R P W + K G + V+T + WN+HE
Sbjct: 7 GSDFYMDGRPFQIR-------SGAIHYFRLHPDDWEHSLYNLKAMGFNTVETYIPWNMHE 59
Query: 87 PQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSD 146
P +F + D RF+ GL+ +R PFI EW +GGLP WL G+ RS+
Sbjct: 60 PHKDEFRITAETDFERFLGLASDLGLWAIVRPSPFICAEWEFGGLPAWLLAERGMRIRSN 119
Query: 147 NEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAA 206
+ F + Y M+ M A+ ++G II+ QIENEYG S+ E Y+R
Sbjct: 120 DPRFLERLALYYDML--MPHLAKHQITRGANIIMMQIENEYG----SYCEDS-DYMRSVR 172
Query: 207 KLAVDLQTGV-------PWVMCKQDDA--PDPVINACN-GRQCGETFAGPNSPDK----- 251
L V+ V PW C++ + D V+ N G E FA K
Sbjct: 173 DLMVERGIDVKLCTSDGPWRACQRAGSLIEDNVLATGNFGSHATENFAALKGFHKEHGKT 232
Query: 252 -PAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG---- 306
P + E W ++ +G+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 233 WPLMCMEFWAGWFNRWGESVVRRDPEELARSVR---EALREGSINLYMFHGGTNFGFMNG 289
Query: 307 ----RTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV--KLCLKPMLSGVLVSMN 360
+ +T Y APLDE G + + + + P + G L M
Sbjct: 290 CSARHDHDLHQITSYDYDAPLDEAGNPTEKFYALQRMVREDFPDARTASPRIKGTLAPMT 349
Query: 361 FSKLQEAFIFQ 371
+ A +F+
Sbjct: 350 LERCGLAGLFE 360
>gi|167524869|ref|XP_001746770.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775040|gb|EDQ88666.1| predicted protein [Monosiga brevicollis MX1]
Length = 600
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 152/320 (47%), Gaps = 32/320 (10%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
++ GH ++SGS+HY R + W + AK GL+ + T V WN HE PG FDF
Sbjct: 59 FLLYGHPFDIWSGSLHYFRIPAEYWLDRLEMAKHMGLNTISTYVPWNFHEVGPGSFDFET 118
Query: 97 R-RDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
DL RF+ GL V +R P+I EW +GGLP L P + RS N+ F ++
Sbjct: 119 HAHDLARFLNLAHEVGLRVLIRPSPYICAEWDFGGLPARLMANPDLELRSSNDAFLDEVE 178
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVDLQ 213
RY ++ +++ L AS GGPII +ENEYG + +L+ A +A+
Sbjct: 179 RYYDALMPILRP--LQASNGGPIIAFYVENEYGSYGADRDYLQ---------ALVAMMRD 227
Query: 214 TGVPWVMCKQDDAPDPVINACNGRQCGETFA----------GPNSPDKPAIWTENWTSFY 263
G+ M D+A A G F PD+P + +E WT ++
Sbjct: 228 RGIVEQMFTCDNAQGLSRGALPGALQTINFQDNVERHLDQLAHFQPDQPLMVSEYWTGWF 287
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-----LTGYY 318
G+E +ED+ + + +G+ N Y++HGGT+FG A A +T Y
Sbjct: 288 DHDGEEHHTFDSEDLVEGLQKILD--RGASFNLYVFHGGTSFGWNAGANSPYAPDITSYD 345
Query: 319 DQAPLDEYGLLRQPKWGHLK 338
APL E+G + PK+ ++
Sbjct: 346 YDAPLSEHGQV-TPKYEDIQ 364
>gi|348508360|ref|XP_003441722.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 648
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 145/302 (48%), Gaps = 30/302 (9%)
Query: 45 ILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFI 104
++ GSIHY R W + K K GL+ + T V WNLHEP+ G F F + DL ++
Sbjct: 72 LILGGSIHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGVFKFDDQLDLEAYL 131
Query: 105 KEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNM 164
+ + GL+V LR GP+I EW GGLP WL P + R+ F + + + ++
Sbjct: 132 RLAASLGLWVILRPGPYICAEWDLGGLPSWLLRDPQMKLRTTYSGFTYAVNSFFDEVIK- 190
Query: 165 MKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQD 224
KA S+GGPII Q+ENEYG ++ E P+++ A L G+ ++ D
Sbjct: 191 -KAVPHQYSKGGPIIAVQVENEYG--SYATDENYMPFIKEAL-----LSRGITELLLTSD 242
Query: 225 DAPDPVINACNGRQCGETFAGPN----------SPDKPAIWTENWTSFYQVYGDEARIRS 274
+ + G F + P +P + E W+ ++ ++G + +
Sbjct: 243 NKDGLKLGGVKGALETINFQKLDPDEIKYLEQIQPQQPKMVMEYWSGWFDLWGGLHHVYT 302
Query: 275 AEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY---------VLTGYYDQAPLDE 325
AE++ V I K+ S +N YM+HGGTNFG + A+ ++T Y APL E
Sbjct: 303 AEEMI-PVVTEILKLDMS-INLYMFHGGTNFGFMSGAFAVGLPAPKPMVTSYDYDAPLSE 360
Query: 326 YG 327
G
Sbjct: 361 AG 362
>gi|348508362|ref|XP_003441723.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
niloticus]
Length = 605
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/322 (30%), Positives = 145/322 (45%), Gaps = 19/322 (5%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
D + G + GS+HY R W + K K GL+ + T V WNLHEP+ G F
Sbjct: 10 DSSQFTLEGKPFRILGGSVHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGTF 69
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
+F + DL ++ GL+V LR GP+I EW GGLP WL + R+ F
Sbjct: 70 NFQDQLDLKAYVSLAAQLGLWVILRPGPYICAEWDLGGLPSWLLQDEEMQLRTTYPGFVN 129
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAK---LA 209
+ Y +++++K L GGPII Q+ENEYG +K P+++ + +
Sbjct: 130 AVNLYFDKLISVIKP--LMFEGGGPIIAVQVENEYGSFAKD--DKYMPFIKNCLQSRGIK 185
Query: 210 VDLQTGVPW--VMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYG 267
L T W + C + +N P KP + E W+ ++ V+G
Sbjct: 186 ELLMTSDNWEGLRCGGVEGALKTVNLQRLSFGAIQHLADIQPQKPLMVMEYWSGWFDVWG 245
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQ------- 320
+ + AED+ V+ + +G +N YM+HGGT FG A Y Q
Sbjct: 246 EHHHVFYAEDMLAVVSEILD--RGVSINLYMFHGGTTFGFMNGAMDFGTYKSQVTSYDYD 303
Query: 321 APLDEYGLLRQPKWGHLKELHS 342
APL E G PK+ HL+ L S
Sbjct: 304 APLSEAGDC-TPKYHHLRNLFS 324
Score = 40.8 bits (94), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 76/189 (40%), Gaps = 28/189 (14%)
Query: 489 AFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRN 548
F+N E VG K T E + G +S L G + G L+ + G+
Sbjct: 413 VFVNRECVGCLDYK------THEVAIPDGKGERTLSFLVENCGRVNYGKALDEQRKGIVG 466
Query: 549 VSIQGAKELKDFSSFSWGYQ---VGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKT 605
+ L+ FS + + L Q TD+ S VP +
Sbjct: 467 DIVLNNTPLRGFSISCLDMKPSFIKRLTNSGQWKTDFKSHCVP----------GFFQARL 516
Query: 606 VFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTG 665
D P ++L S GKG +VNGQ++GRYW F+ P Q + ++P +L+
Sbjct: 517 CVDGPPKD--TFVSLRSWGKGVIFVNGQNLGRYW--FIGP-----QHFLYLPAPWLRSGE 567
Query: 666 NLLVLLEEE 674
N +++ EE+
Sbjct: 568 NEIIVFEEQ 576
>gi|295689222|ref|YP_003592915.1| beta-galactosidase [Caulobacter segnis ATCC 21756]
gi|295431125|gb|ADG10297.1| Beta-galactosidase [Caulobacter segnis ATCC 21756]
Length = 617
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 105/334 (31%), Positives = 151/334 (45%), Gaps = 37/334 (11%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + +G + S +HY R W + KAK GL+ + T FWN+HEP+PG +D
Sbjct: 38 GAGFLKDGAPHQVISAEMHYVRIPRAYWRDRLQKAKTMGLNTITTYAFWNVHEPRPGVYD 97
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F+G+ DL FI+ QA+GL V LR GP++ EW GG P WL ++ RS +
Sbjct: 98 FTGQNDLAAFIRAAQAEGLDVILRPGPYVCSEWELGGYPSWLLKDRNVLLRSTEPQYAAA 157
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
++R+ + +K L GGPI+ Q+ENEYG + ++LE R A
Sbjct: 158 VERWMARLGREVKP--LLLKNGGPIVAIQLENEYGAFGDDKAYLEGLEATYRRAG----- 210
Query: 212 LQTGVPWVMCKQDD--------APDPVINACNGRQCG----ETFAGPNSPDKPAIWTENW 259
L GV + + D P V G + ETF PD + E W
Sbjct: 211 LADGVLFTSNQASDLAKGSLPHLPSMVNFGSGGAEKSVAQLETF----RPDGLRMVGEYW 266
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTG--- 316
++ +G+E A + + +G V+ YM+HGGT+FG A TG
Sbjct: 267 AGWFDKWGEEHHETDGRKEAEELRFML--QRGYSVSLYMFHGGTSFGWMNGADSHTGKDY 324
Query: 317 ------YYDQAPLDEYGLLRQPKWGHLKELHSAV 344
Y APLDE G R K+G L + + V
Sbjct: 325 HPDTTSYDYDAPLDEAGAPRY-KYGLLASVIAEV 357
>gi|255971270|ref|ZP_05421856.1| beta-galactosidase [Enterococcus faecalis T1]
gi|255962288|gb|EET94764.1| beta-galactosidase [Enterococcus faecalis T1]
Length = 593
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTRQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V + GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLT--VGS-LNLYMFHGGTNFG 286
>gi|256959941|ref|ZP_05564112.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293384307|ref|ZP_06630193.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388457|ref|ZP_06632963.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907112|ref|ZP_07766105.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312979309|ref|ZP_07791007.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|256950437|gb|EEU67069.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291078380|gb|EFE15744.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291082147|gb|EFE19110.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626889|gb|EFQ10172.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311287903|gb|EFQ66459.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
Length = 593
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 142/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y++ ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLQQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|255973889|ref|ZP_05424475.1| beta-galactosidase [Enterococcus faecalis T2]
gi|307284354|ref|ZP_07564519.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|255966761|gb|EET97383.1| beta-galactosidase [Enterococcus faecalis T2]
gi|306503294|gb|EFM72546.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
Length = 593
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTRQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V + GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLT--VGS-LNLYMFHGGTNFG 286
>gi|257090118|ref|ZP_05584479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|256998930|gb|EEU85450.1| beta-galactosidase [Enterococcus faecalis CH188]
Length = 594
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/335 (31%), Positives = 153/335 (45%), Gaps = 41/335 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P W + K G + V+T V W+LHEPQ G F F
Sbjct: 8 EEFLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+K Q GLY +R P+I EW +GG P WL + PG + RS+N + H+
Sbjct: 68 EGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-RSNNPTYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L + GG I++ QIENEYG SF E+ Y+R L +
Sbjct: 127 AEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRAIRDLMIARGV 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------GPNSPDKPAIW 255
P+ D P D ++ G + E F + P +
Sbjct: 180 TAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR-------- 307
E W ++ + + R +++A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNFGFMNGCSARG 293
Query: 308 TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
T +T Y APLDE G + + K LH
Sbjct: 294 TIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 328
>gi|163790001|ref|ZP_02184436.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
gi|159874701|gb|EDP68770.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
Length = 595
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 148/317 (46%), Gaps = 34/317 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R P+ W + K G + V+T + WN+HE + ++DF
Sbjct: 8 EEFLLNGEPFKIISGAIHYFRILPEDWYHSLYNLKALGFNTVETYIPWNVHETKEREYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG+ D+ RF++ + GL+V LR P+I EW +GGLP WL + RS + F +
Sbjct: 68 SGQLDIQRFVQTAKELGLFVILRPSPYICAEWEFGGLPAWLLTYKNMRIRSSDPQFIEKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y + + L + GGP+I+ Q+ENEYG S+ E Y++ +L ++L
Sbjct: 128 SSYYKKLFEQI--VPLQVTSGGPVIMMQLENEYG----SYGED-KEYLKTLYELMLELGV 180
Query: 215 GVP-------WVMCKQDDAP---DPVINACNGRQCGETFAG------PNSPDKPAIWTEN 258
VP W ++ D + G Q E F + P + E
Sbjct: 181 TVPIFTSDGAWKATQEAGTMTDLDILTTGNFGSQSKENFKNLKEFHESKGKNWPLMCMEY 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--RTASAYV--- 313
W ++ + D R A+D+ V +K +N YM+HGGTNFG SA +
Sbjct: 241 WGGWFNRWNDPIIKRDAQDLTNDVK---EALKIGSLNLYMFHGGTNFGFMNGCSARLGKD 297
Query: 314 ---LTGYYDQAPLDEYG 327
LT Y APL+E G
Sbjct: 298 LPQLTSYDYDAPLNEQG 314
Score = 40.8 bits (94), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 48/114 (42%), Gaps = 25/114 (21%)
Query: 586 IVPWSRYGSSTHQPLT------W---------YKTVFDAPTGSDPVAINLISMGKGEAWV 630
I W +Y +PLT W YK D P + IN+ GKG V
Sbjct: 479 ITDWEQYSLDFLKPLTIDFNEEWKENAPSFYQYKVTIDTP---EDTFINMELFGKGIVLV 535
Query: 631 NGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
NG +IGR+W P+ S Y P+S K N +++ E E + IS++
Sbjct: 536 NGFNIGRFW------NVGPTLSLY-APKSLFKKGENEIIVFETEGIWSETISLE 582
>gi|327282153|ref|XP_003225808.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
Length = 649
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 108/334 (32%), Positives = 155/334 (46%), Gaps = 27/334 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GLD +QT V WN HEP+
Sbjct: 32 IDYGHNCFLKDGQPFRYISGSIHYSRIPRYYWKDRLLKMKMAGLDAIQTYVPWNFHEPER 91
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G ++F+G RDL F++ Q GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 92 GVYNFTGDRDLEYFLQLAQEVGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDPD 151
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL- 208
+ + + + + MK LY GGPII+ Q+ENEYG S+ Y+R+ L
Sbjct: 152 YLTAVGSWMGIFLPKMK-PHLY-QNGGPIIMVQVENEYG----SYFACDFDYLRYLQNLF 205
Query: 209 ------AVDLQT----GVPWVMCKQDDAPDPVINACNGRQCGETFAGP--NSPDKPAIWT 256
V L T + ++ C ++ GR F+ P P + +
Sbjct: 206 RQYLGDEVVLFTTDGASMFYLRCGALQGLYSTVDFGPGRNVTAAFSTQRHTEPKGPLVNS 265
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--- 313
E +T + +G A +A ++ +A G+ VN YM+ GGTNFG A +
Sbjct: 266 EFYTGWLDHWGHRHITVPASIVAKSLSEILA--SGANVNMYMFIGGTNFGYWNGANMPYM 323
Query: 314 --LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
T Y APL E G L + K+ ++E+ K
Sbjct: 324 AQPTSYDYDAPLSEAGDLTE-KYFAIREVIGMFK 356
>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB1386]
Length = 613
Score = 136 bits (343), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 149/358 (41%), Gaps = 39/358 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ
Sbjct: 31 NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQ 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQFDFSG D+ F++E AQGL V LR GP+ EW GG P WL I RS +
Sbjct: 91 QGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDP 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F + Y + N ++ L GGPII Q+ENEYG +H+++ A
Sbjct: 151 RFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------AD 199
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAI 254
A+ ++ G + D D + N P PD+P +
Sbjct: 200 NRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRM 259
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
E W ++ +G A A + +G N YM+ GGT+FG A
Sbjct: 260 VGEYWAGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFGFMNGANFQ 317
Query: 314 ----------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 318 NNPSDHYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGVQPPALPAPIATTTL 374
>gi|423252157|ref|ZP_17233159.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|423252477|ref|ZP_17233408.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
gi|392647903|gb|EIY41596.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
CL03T00C08]
gi|392660553|gb|EIY54162.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
CL03T12C07]
Length = 628
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 99/330 (30%), Positives = 146/330 (44%), Gaps = 41/330 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
NG + SG +HY R Q W + K GL+ V T VFWNLHEP+PG++DF+G ++
Sbjct: 37 NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
L FIK +G+ V LR GP++ EW +GG P+WL +V G+ R DN F + K Y
Sbjct: 97 LAEFIKIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYID 156
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYG----MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ + L ++GGPI++ Q ENE+G + LE+ Y + D
Sbjct: 157 RLYK--EVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFN 214
Query: 216 VPWVMCK-----QDDAPDPVINACNG-------RQCGETFAGPNSPDKPAI----WTENW 259
VP + A + NG ++ + + P A W +W
Sbjct: 215 VPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHW 274
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
+ G R E + F N+YM HGGTNFG T+ A
Sbjct: 275 AEPFPQIGASGIARQTEKYLQNDVSF---------NFYMVHGGTNFGFTSGANYDKKRDI 325
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+T Y AP+ E G + PK+ ++ +
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV 354
>gi|255602598|ref|XP_002537886.1| beta-galactosidase, putative [Ricinus communis]
gi|223514710|gb|EEF24497.1| beta-galactosidase, putative [Ricinus communis]
Length = 91
Score = 136 bits (343), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 60/71 (84%), Positives = 65/71 (91%)
Query: 59 QMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRI 118
QMWP LI KAKEGGLDV+QT VFWNLHEPQPGQ+DFSGR DLV+F+KE+QAQGLYVCLRI
Sbjct: 17 QMWPSLIGKAKEGGLDVIQTYVFWNLHEPQPGQYDFSGRYDLVKFVKEIQAQGLYVCLRI 76
Query: 119 GPFIEGEWGYG 129
GPFIE EW YG
Sbjct: 77 GPFIESEWTYG 87
>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
859]
Length = 613
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 149/358 (41%), Gaps = 39/358 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ
Sbjct: 31 NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQ 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQFDFSG D+ F++E AQGL V LR GP+ EW GG P WL I RS +
Sbjct: 91 QGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDP 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F + Y + N ++ L GGPII Q+ENEYG +H+++ A
Sbjct: 151 RFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------AD 199
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAI 254
A+ ++ G + D D + N P PD+P +
Sbjct: 200 NRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRM 259
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
E W ++ +G A A + +G N YM+ GGT+FG A
Sbjct: 260 VGEYWAGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFGFMNGANFQ 317
Query: 314 ----------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 318 NNPSDHYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGVQPPALPAPIATTTL 374
>gi|387790696|ref|YP_006255761.1| beta-galactosidase [Solitalea canadensis DSM 3403]
gi|379653529|gb|AFD06585.1| beta-galactosidase [Solitalea canadensis DSM 3403]
Length = 790
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 145/312 (46%), Gaps = 30/312 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
++NG ++ +G IH+PR + W I K G++ + +FWN HE +P QFDF+
Sbjct: 44 EFLLNGKPFLIRAGEIHFPRIPREYWDHRIKLCKAMGMNTICIYLFWNFHEQKPDQFDFT 103
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G++D+ F+K VQA G+Y +R GP+ EW GGLP+WL P + R+ + ++ M+
Sbjct: 104 GQKDVAAFVKLVQANGMYCIVRPGPYACAEWDMGGLPWWLLKKPDLKVRTLED--RYFME 161
Query: 156 RYATMIVNMMKA-ARLYASQGGPIILSQIENEYGMVEHS--FLEKGPPYVRWAAKLAVDL 212
R A + + K A L GG II+ Q+ENEY +S +++ ++ A V L
Sbjct: 162 RSAKYLKEVGKQLALLQIQNGGNIIMVQVENEYAAFGNSAEYMDANRKNLKDAGFNKVQL 221
Query: 213 QTGVPWVMCKQDDAPDP----VINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQVY 266
W DP +N G + F G P P + +E WT ++ +
Sbjct: 222 MR-CDWSSTFNSYITDPEVAITLNFGAGSDVDKQFKGFQEKHPTAPLMCSEYWTGWFDHW 280
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSY-----VNYYMYHGGTNFGRTASA-----YVLTG 316
G RS + FI +K + YM HGGT FG+ A +
Sbjct: 281 GRPHETRS-------INSFIGSLKDMMDRKISFSLYMAHGGTTFGQWGGANSPPYSAMVA 333
Query: 317 YYD-QAPLDEYG 327
YD AP+ E G
Sbjct: 334 SYDYNAPIGEQG 345
Score = 45.8 bits (107), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 61/254 (24%), Positives = 110/254 (43%), Gaps = 43/254 (16%)
Query: 430 YKEAIPTYDETSL-RANFLLEQMNTTKDASDYLW--YNFRFKHDPSDSESVLKVSSLGHV 486
++EA P +D +A+ +++ M + D W N+R S + L ++ +
Sbjct: 385 FEEAAPLFDNLPPGKASEIIKPM----EMFDQGWGRINYRTNLTASTTPRKLIITEVHDW 440
Query: 487 LHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMV---GLPDSGAYLERRV 543
FING+ VG + +D T+E I T ++L ++V G + G + R
Sbjct: 441 AQVFINGKLVGKLDRRRADS--TIE-----IPATKAGAVLDILVEATGRVNFGEAVIDRK 493
Query: 544 AGLRNVSIQG---AKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPL 600
V I +ELK+++ +++ DY + +++
Sbjct: 494 GITEKVEISDGSTVQELKNWTVYNFP-------------VDY--QFQANAKFVKQKVNGP 538
Query: 601 TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSF 660
WY+ F+ D I+L + GKG WVNG +IGR+W + PQ T + +P +
Sbjct: 539 AWYRAKFNLNQTGD-TYIDLSTWGKGMIWVNGYNIGRFWK--IGPQQT-----FLMPGVW 590
Query: 661 LKPTGNLLVLLEEE 674
LK N +++L+ E
Sbjct: 591 LKRGMNEIIILDLE 604
>gi|390469877|ref|XP_002807335.2| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Callithrix jacchus]
Length = 718
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 130/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL FI
Sbjct: 145 IFGGSIHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIL 204
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+ LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 205 MASEIGLWXILRPGPYICSEIDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 262
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 263 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 315
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 316 KDGLSKGIVHGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 375
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 376 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 406
>gi|440800373|gb|ELR21412.1| lysosomal betagalactosidase, partial [Acanthamoeba castellanii str.
Neff]
Length = 604
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 144/315 (45%), Gaps = 40/315 (12%)
Query: 40 NGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
+G + SGSIHY RS P+ WP + + GL+ V T V WNLHEP PGQ+DFSGR D
Sbjct: 36 DGQEFRIVSGSIHYFRSLPEQWPARLRTLRSCGLNTVTTYVPWNLHEPTPGQYDFSGRLD 95
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
+VRFI+ Q +G V +R P+I E +GGLP WL + G+ R + + +KR +
Sbjct: 96 IVRFIEAAQQEGFLVIVRPPPYICAELEFGGLPAWLLNEEGLQLRCSDPKY---LKRVDS 152
Query: 160 MIVNMMKAARLYA-SQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDL-QTGVP 217
+ + + Y S+GGPII Q+ENEYG + L Y+R L + Q +
Sbjct: 153 FLDHFLPMLATYQYSRGGPIIAMQVENEYGSYGNDHL-----YLR---HLELKFRQHQID 204
Query: 218 WVMCKQDDAPD--------PVINACNGRQCGETFAG------PNSPDKPAIWTENWTSFY 263
++ + A D P + G G P P TE W ++
Sbjct: 205 AILFSSNGAGDQMFVGGALPSLLRTVNFGTGADVEGNLKVLRKYQPSGPLFVTEFWDGWF 264
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY----------- 312
+G+E + + ++ + VN YM GGTNFG T A
Sbjct: 265 DHWGEEHHTTTPTQSMKTLEAILS--NNASVNLYMAFGGTNFGFTNGANKGYGETDPYQP 322
Query: 313 VLTGYYDQAPLDEYG 327
T Y AP++E G
Sbjct: 323 TTTSYDYDAPVNESG 337
>gi|307289489|ref|ZP_07569436.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|422703871|ref|ZP_16761687.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
gi|306499556|gb|EFM68926.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
gi|315164595|gb|EFU08612.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
Length = 593
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL + RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKSVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
Length = 587
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 161/360 (44%), Gaps = 42/360 (11%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K K GL+ V+T + WN HEP G+F+FSG D+ FI
Sbjct: 20 ILSGAIHYFRVVPEYWEDRLLKLKACGLNTVETYIPWNWHEPDEGRFNFSGMADIEAFIT 79
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V +R P+I EW +GGLP WL P + R + F + Y ++
Sbjct: 80 LAGKLGLHVIVRPSPYICAEWEFGGLPAWLLQDPHMQLRCLDPKFLKKVDAYYDELIP-- 137
Query: 166 KAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVRWAAKLAVDLQTG 215
+ L ++ GGPII QIENEYG ++ + + +G + + + D
Sbjct: 138 RLVPLLSTNGGPIIAVQIENEYGSYGNDTAYLQYLQEALIARGVDVLLFTSDGPTD---- 193
Query: 216 VPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIR 273
M + P G + E FA + P + E W ++ + R
Sbjct: 194 ---GMLQGGTVPGVTATVNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMKPHHTR 250
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQAPLDEY 326
+ED A A +A G+ VN+YM+HGGTNFG A +T Y APL E
Sbjct: 251 DSEDAASVFAEMLA--LGASVNFYMFHGGTNFGFYNGANYHDKYEPTITSYDYDAPLSEC 308
Query: 327 GLLRQPKWGHLKEL---HSAVKLCLKPMLS--------GVLVSMNFSKLQEAFIFQGSSE 375
G + K+ ++++ H V+L P L G + +++ L E SSE
Sbjct: 309 GDVTT-KYEAVRQVIAKHQGVELGDLPALPDPVRKKAYGTVSMTSYADLLENLPVLASSE 367
>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
Length = 570
Score = 136 bits (342), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 142/309 (45%), Gaps = 37/309 (11%)
Query: 58 PQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLR 117
P+ W + K K GL+ V+T V WNLHE F F D+V+F+K Q GLYV +R
Sbjct: 2 PEYWKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVIIR 61
Query: 118 IGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGP 177
GP+I EW GGLP WL P + R+ PF + RY + ++ L QGGP
Sbjct: 62 PGPYICAEWDLGGLPSWLLSDPEMKLRTSYGPFMEAVDRYFQKLFPLLTP--LQYCQGGP 119
Query: 178 IILSQIENEYGMVEHSFLEK-GPPYVRWAAKLAVDLQTGVPWVMCKQDD----APDPV-- 230
II QIENEY SF +K Y+ K+ V + GV ++ D+ P+
Sbjct: 120 IIAWQIENEYS----SFDKKVDMTYMELLQKMMV--KNGVTEMLLMSDNLFSMKTHPINL 173
Query: 231 ----INACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFI 286
IN + PDKP + TE W ++ V+G + I E + +
Sbjct: 174 VLKTINLQKNVKDALLQLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKDLF 233
Query: 287 AKMKGSYVNYYMYHGGTNFGRTASAYV---------------LTGYYDQAPLDEYGLLRQ 331
+ G+ +N+YM+HGGTNFG A +T Y APL E G +
Sbjct: 234 S--LGASINFYMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESGDI-T 290
Query: 332 PKWGHLKEL 340
PK+ L++
Sbjct: 291 PKYKALRKF 299
>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
GSPB2388]
Length = 613
Score = 136 bits (342), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 93/292 (31%), Positives = 129/292 (44%), Gaps = 27/292 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ
Sbjct: 31 NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQ 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQFDFSG D+ F++E AQGL V LR GP+ EW GG P WL I RS +
Sbjct: 91 QGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDP 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F + Y + N ++ L GGPII Q+ENEYG +H+++ A
Sbjct: 151 RFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------AD 199
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAI 254
A+ ++ G + D D + N P PD+P +
Sbjct: 200 NRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRM 259
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G A A + +G N YM+ GGT+FG
Sbjct: 260 VGEYWAGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFG 309
>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
Length = 604
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 109/345 (31%), Positives = 160/345 (46%), Gaps = 42/345 (12%)
Query: 26 GGNNVTYDGRS-LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
GGN ++ + ++N + SG+IHY R P W + K G + V+T V WNL
Sbjct: 8 GGNVDRFEIKEEFLLNDQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNL 67
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HEPQ G F F G DL RF+K Q GLY +R P+I EW +GG P WL + PG + R
Sbjct: 68 HEPQKGTFHFEGILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPGRM-R 126
Query: 145 SDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
S+N + H+ Y +++ + +L + GG I++ QIENEYG SF E+ Y+R
Sbjct: 127 SNNPTYLKHVAEYYDVLMEKIVPHQL--ANGGNILMIQIENEYG----SFGEE-KAYLRA 179
Query: 205 AAKLAVDLQTGVPWVMCKQDDAP-------------DPVINACNGRQCGETFA------G 245
L + P+ D P D ++ G + E F
Sbjct: 180 IRDLMIARGVTAPFFTS---DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFE 236
Query: 246 PNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF 305
+ P + E W ++ + + R +++A V +A GS +N YM+HGGTNF
Sbjct: 237 EHGKKWPLMCMEFWDGWFNRWKEPIIKRDPQELAESVREALAL--GS-INLYMFHGGTNF 293
Query: 306 ----GRTASAYV----LTGYYDQAPLDEYGLLRQPKWGHLKELHS 342
G +A + +T Y APLDE G + + K LH
Sbjct: 294 EFMNGCSARGTIDLPQITSYDYDAPLDEQGNPTEKYFALQKMLHE 338
>gi|422700666|ref|ZP_16758509.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
gi|315170851|gb|EFU14868.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
Length = 593
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 142/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A + +QGGP+I+ Q+ENEYG +EK Y++ ++ +L
Sbjct: 129 RNYFQVL--LPKLAPMQITQGGPVIMMQVENEYGSYG---MEKA--YLQQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|395520729|ref|XP_003764476.1| PREDICTED: beta-galactosidase-1-like protein 2 [Sarcophilus
harrisii]
Length = 704
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 157/329 (47%), Gaps = 29/329 (8%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
+G + ++ G +F GSIHY R + W + K K GL+ + T + WNLHEP+ G+F
Sbjct: 118 EGPNFLLEGSHFQIFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYIPWNLHEPERGKF 177
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
+FSG D+ F++ GL+V LR GP+I EW GGLP WL + R+ F
Sbjct: 178 NFSGNLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYAGFLK 237
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDL 212
+ RY ++ + L QGGPII Q+ENEYG + PY++ A +
Sbjct: 238 AVDRYFNHLIP--RVVPLQYKQGGPIIAVQVENEYGSYDKD--SNYMPYIKKAL-----M 288
Query: 213 QTGVPWVMCKQDDAP-------DPVINACNGRQCGE---TFAGPNSPDKPAIWTENWTSF 262
G+ ++ D+ + V+ N + + +KP + TE WT +
Sbjct: 289 SRGINELLMTSDNKDGLSGGYLEGVLATVNLKHVDSMIFNYLHSFQENKPTMVTEYWTGW 348
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-----YV--LT 315
+ +G I A+D+ V+ I G+ +N YM+HGGTNFG A Y+ +T
Sbjct: 349 FDTWGGPHNIVDADDVVVTVSSII--QMGASLNLYMFHGGTNFGFMNGAQHFGEYLADVT 406
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAV 344
Y A L E G PK+ L+E S +
Sbjct: 407 SYDYDAILTEAGDY-TPKFFKLREFFSTI 434
>gi|348575339|ref|XP_003473447.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cavia
porcellus]
Length = 740
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 155/347 (44%), Gaps = 34/347 (9%)
Query: 3 QCQLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWP 62
+C L LFGL T + + Y + +G SGSIHY R W
Sbjct: 92 KCSLGPLFGLXNATQRMFE--------IDYSRDCFLKDGQPFRYISGSIHYSRVPRFYWA 143
Query: 63 RLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFI 122
+ K K GL+ +QT V WN HEPQPG ++FSG D+ F++ GL V LR GP+I
Sbjct: 144 DRLLKMKMAGLNAIQTYVPWNFHEPQPGHYEFSGDHDVEYFLQLAHKLGLLVILRPGPYI 203
Query: 123 EGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
EW GGLP WL + IV RS + + + ++ +++ MK L GGPII Q
Sbjct: 204 CAEWDMGGLPAWLLEKQSIVLRSSDPDYLASVDKWLGVLLPKMKP--LLYQNGGPIITVQ 261
Query: 183 IENEYGMVEHSFLEKGPPYVRWAAK-----LAVDL---QTGVP---WVMCKQDDAPDPVI 231
+ENEYG S+ Y+R+ K L D+ T P ++ C +
Sbjct: 262 VENEYG----SYFACDYNYLRFLQKHFHYHLGDDVLLFTTDGPRQEYLRCGTLQGLYATV 317
Query: 232 NACNGRQCGETF--AGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
+ G + F P P I +E +T + +G+ E + ++ +A
Sbjct: 318 DFGVGSNITDAFLVQRKAEPKGPLINSEFYTGWLDHWGERHWTVKTEAVVSSLSDMLA-- 375
Query: 290 KGSYVNYYMYHGGTNF-----GRTASAYVLTGYYDQAPLDEYGLLRQ 331
+G VN YM+ GGTNF T A T Y APL E G L +
Sbjct: 376 QGXNVNMYMFIGGTNFAYWNGANTPYAAQPTSYDYDAPLSEAGDLTE 422
>gi|295113973|emb|CBL32610.1| Beta-galactosidase [Enterococcus sp. 7L76]
Length = 592
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 141/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL + RS + F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 128 RNYFQVL--LPKLAPLQITQGGPVIMIQVENEYGSYG---MEKA--YLRQTKQIMEELGI 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 181 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 285
>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
Length = 612
Score = 135 bits (341), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 153/617 (24%), Positives = 242/617 (39%), Gaps = 97/617 (15%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G I +G L SG+IH+ R W + KA+ GL+ V+T VFWNL E + GQFD
Sbjct: 32 GTQFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFD 91
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F+G D+ F++E +QGL V LR GP++ EW GG P WL P + RS + F
Sbjct: 92 FTGNNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDA 151
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+RY + ++ L S GGPII Q+ENEYG +H +L+ A
Sbjct: 152 SQRYLEALGTQVRP--LLNSNGGPIIAMQVENEYGSYGDDHGYLQAVRALFIKAGLGGAL 209
Query: 212 LQTGVPWVMCKQDDAPDPVINACN-----GRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
L T M PD V+ A N +Q + A P +P + E W ++ +
Sbjct: 210 LFTSDGAQMLGNGTLPD-VLAAVNVAPGEAKQALDKLA-TFHPGQPQLVGEYWAGWFDQW 267
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G A+ A + + +G +N YM+ GGT+FG ++ + P D Y
Sbjct: 268 GKPHAQTDAKQQADEIEWML--RQGHSINLYMFVGGTSFG-----FMNGANFQGGPGDHY 320
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
S S ++ +A + + F + +D
Sbjct: 321 --------------------------SPQTTSYDY----DAALDEAGRPMPKFALFRDVI 350
Query: 387 NNATVYFSNLMYELPPLSISI----LPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSL 442
T + PPL + LPD A S W+ A+ T +
Sbjct: 351 TGVT------GLQPPPLPAATRFIDLPDTPLRA-------SASLWDNLPAAVATSADP-- 395
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGK 502
+ M A Y+ Y H P L + + H +++ FVG A +
Sbjct: 396 ------QPMERYGQAYGYILYRTTI-HGPRKGR--LYLGEVRDDAHVYVDRLFVGRAERR 446
Query: 503 HSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSS 562
+ V + +GT+ + +L G + G +L AGL + + + ++ +
Sbjct: 447 RQQ----VWVEVDIPSGTHRLDVLVENSGRVNYGPHLADGRAGLIGPVMLNHERVNNWET 502
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLIS 622
F Q E + +T + G + H+ + +T D +++ +
Sbjct: 503 FLLPLQT---PEAIHGWTTAPMQ-------GPAFHRGTLFIRTPGD-------TFLDMEA 545
Query: 623 MGKGEAWVNGQSIGRYW 639
KG W NG +GRYW
Sbjct: 546 FSKGVTWANGHMLGRYW 562
>gi|164519029|ref|NP_001019529.2| beta-galactosidase-1-like protein 3 precursor [Rattus norvegicus]
Length = 644
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 101/324 (31%), Positives = 156/324 (48%), Gaps = 33/324 (10%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++ GSIHY R + W + K + G + V T + WNLHE + G+FDFS
Sbjct: 71 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 130
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL ++ + GL+V LR GP+I E GGLP WL PG R+ N+ F + +Y
Sbjct: 131 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 190
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
++ K L +GGP+I Q+ENEYG + +++E Y++ A L G+
Sbjct: 191 DHLIP--KILPLQYRRGGPVIAVQVENEYGSFRNDKNYME----YIKKAL-----LNRGI 239
Query: 217 PWVMCKQDDAPDPVINACNGRQC--------GETFAGPN--SPDKPAIWTENWTSFYQVY 266
++ D+ I + G ++F + DKP + E WT +Y +
Sbjct: 240 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 299
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G + +SA +I + F + G N YM+HGGTNFG Y V+T Y
Sbjct: 300 GSKHTEKSANEIRRTIYRFFSY--GLSFNVYMFHGGTNFGFINGGYHENGHTNVVTSYDY 357
Query: 320 QAPLDEYGLLRQPKWGHLKELHSA 343
A L E G + K+ L++L ++
Sbjct: 358 DAVLSEAGDYTE-KYFKLRKLFAS 380
>gi|149027890|gb|EDL83350.1| similar to Hypothetical protein MGC47419 (predicted) [Rattus
norvegicus]
Length = 394
Score = 135 bits (341), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 100/300 (33%), Positives = 141/300 (47%), Gaps = 31/300 (10%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL FI
Sbjct: 79 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIW 138
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL P + R+ F + Y + M
Sbjct: 139 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYPGFTKAVDLYFDHL--MS 196
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
+ L GGPII Q+ENEYG +H+++ PY++ A + G+ ++
Sbjct: 197 RVVPLQYKHGGPIIAVQVENEYGSYNGDHAYM----PYIKKALE-----DRGIIEMLLTS 247
Query: 224 DDAP-------DPVINACNGRQCGETFAGPNS------PDKPAIWTENWTSFYQVYGDEA 270
D+ D V+ N Q + NS +P + E WT ++ +G
Sbjct: 248 DNKDGLEKGVVDGVLATIN-LQSQQELVALNSILLSIQGIQPKMVMEYWTGWFDSWGGSH 306
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLR 330
I + ++ V+ I GS +N YM+HGGTNFG A Y +A + YG LR
Sbjct: 307 NILDSSEVLQTVSAIIK--DGSSINLYMFHGGTNFGFINGAMHFGDY--KADVTSYGKLR 362
>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 640
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 174/374 (46%), Gaps = 46/374 (12%)
Query: 1 MGQCQLLCLFGLLLTTIGGSDGGGGGGNN----VTYDGRSLIINGHRKILFSGSIHYPRS 56
+G C LF +L S NN V Y+ + +G SG +HY R
Sbjct: 4 IGVCCFWSLFVFVLCDTSNS------TNNRTFIVDYEKNEFLKDGEVFRYVSGDLHYFRV 57
Query: 57 TPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCL 116
W I K K GL+ + T V W+LHEP PG ++F G DL FIK +Q +G+Y+ L
Sbjct: 58 PKSYWKDRIQKIKAAGLNAITTYVEWSLHEPFPGTYNFEGMADLEYFIKLIQDEGMYLLL 117
Query: 117 RIGPFIEGEWGYGGLPFWLHDV-PGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQG 175
R GP+I E +GG P+WL +V P R+++ +K ++ ++ ++++ M+ LY + G
Sbjct: 118 RPGPYICAERDFGGFPYWLLNVTPKGSLRTNDSSYKKYVSQWFSVLMKKMQ-PHLYGN-G 175
Query: 176 GPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL----AVDLQTGVPWVMCKQDD---APD 228
G II+ Q+ENEYG S+ Y W L D +C+Q D P
Sbjct: 176 GNIIMVQVENEYG----SYYACDSDYKLWLRDLLKGYVEDKALLYTIDICRQRDFDCGPI 231
Query: 229 PVINA-------CNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYH 281
P + A N C + F P++ +E + + + + +++D+ H
Sbjct: 232 PEVYATVDFGISVNAATCFD-FLKNYQKGGPSVNSEFYPGWLAHWQEPHPKVNSDDVVNH 290
Query: 282 VALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------------LTGYYDQAPLDEYGLL 329
+ ++ + S+ ++YM+HGGTNFG T+ A LT Y AP+ E G L
Sbjct: 291 MKSMLS-LNASF-SFYMFHGGTNFGFTSGANTNESDANIGYLPQLTSYDYDAPITEAGDL 348
Query: 330 RQPKWGHLKELHSA 343
+ + + L +A
Sbjct: 349 TEKYFKIKQTLENA 362
Score = 40.4 bits (93), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 49/100 (49%), Gaps = 12/100 (12%)
Query: 602 WYKTVFDAP---TGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPR 658
+Y+T F P T + ++ KG A++N ++GRYW P P + Y +P
Sbjct: 546 FYRTQFTLPEDYTSTLDTYLDTSGWTKGVAFLNDINLGRYW-----PLAGPQITLY-VPA 599
Query: 659 SFLK--PTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
SFLK P N LV+ E E P +SI V T+ G ++
Sbjct: 600 SFLKPPPAVNTLVMFELERA-PQDLSIKFVDKPTINGPIN 638
>gi|91078184|ref|XP_967722.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Tribolium
castaneum]
gi|270002869|gb|EEZ99316.1| beta-galactosidase-like protein [Tribolium castaneum]
Length = 624
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 99/326 (30%), Positives = 153/326 (46%), Gaps = 27/326 (8%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG + ++ + +N L+SG++HY R Q W + K + GL+ V+T V WN
Sbjct: 12 GGVTSGLSTNQSYFTLNSKNITLYSGALHYFRVPQQYWRDRLRKLRAAGLNTVETYVPWN 71
Query: 84 LHEPQPGQF-------DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLH 136
LHEPQ G + DFS L +F+K Q + L +R GP+I EW +GGLP WL
Sbjct: 72 LHEPQIGNYDFGDGGSDFSNFLHLEKFLKLAQEEDLLAIVRPGPYICAEWDFGGLPSWLL 131
Query: 137 DVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEH---- 192
+ R+ F H+ R+ T ++ ++ A + ++GGPI+ Q+ENEYG E
Sbjct: 132 R-DNVKVRTSEPKFMSHVTRFFTRLLPILAALQF--TKGGPIVAFQVENEYGSTEELGKF 188
Query: 193 ----SFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFA--GP 246
++++ +R + + + P + P+ A R G+ F G
Sbjct: 189 APDKLYIKQLSDLMRKFGLVELLFTSDSPSQHGDRGTLPELFQTANFARDPGKEFQALGE 248
Query: 247 NSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+P + E WT ++ +G+ R+ + + V I K S VN YM+HGGT+FG
Sbjct: 249 YQKSRPTMAMEFWTGWFDHWGEGHNRRNNTEFSL-VLNEILKYPAS-VNMYMFHGGTSFG 306
Query: 307 RTASAYV-----LTGYYDQAPLDEYG 327
A V T Y APL E G
Sbjct: 307 FLNGANVPYQPDTTSYDYDAPLTENG 332
>gi|294779195|ref|ZP_06744602.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
gi|294453706|gb|EFG22101.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
Length = 592
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 140/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 8 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL + RS + F +
Sbjct: 68 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 128 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 181 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 239 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 285
>gi|257083732|ref|ZP_05578093.1| beta-galactosidase [Enterococcus faecalis Fly1]
gi|256991762|gb|EEU79064.1| beta-galactosidase [Enterococcus faecalis Fly1]
Length = 593
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 90/290 (31%), Positives = 142/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP+ W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL G+ RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K + L +QGGP+I+ Q+ENEYG +EK Y++ ++ +L
Sbjct: 129 RNYFQVL--LPKLSPLQITQGGPVIMMQVENEYGSYG---MEKA--YLQQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|81889875|sp|Q5XIL5.1|GLBL3_RAT RecName: Full=Beta-galactosidase-1-like protein 3
gi|53734228|gb|AAH83665.1| Galactosidase, beta 1-like 3 [Rattus norvegicus]
Length = 631
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 101/324 (31%), Positives = 156/324 (48%), Gaps = 33/324 (10%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ GH+ ++ GSIHY R + W + K + G + V T + WNLHE + G+FDFS
Sbjct: 58 LEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEIL 117
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL ++ + GL+V LR GP+I E GGLP WL PG R+ N+ F + +Y
Sbjct: 118 DLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDKYF 177
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
++ K L +GGP+I Q+ENEYG + +++E Y++ A L G+
Sbjct: 178 DHLIP--KILPLQYRRGGPVIAVQVENEYGSFRNDKNYME----YIKKAL-----LNRGI 226
Query: 217 PWVMCKQDDAPDPVINACNGRQC--------GETFAGPN--SPDKPAIWTENWTSFYQVY 266
++ D+ I + G ++F + DKP + E WT +Y +
Sbjct: 227 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 286
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYD 319
G + +SA +I + F + G N YM+HGGTNFG Y V+T Y
Sbjct: 287 GSKHTEKSANEIRRTIYRFFSY--GLSFNVYMFHGGTNFGFINGGYHENGHTNVVTSYDY 344
Query: 320 QAPLDEYGLLRQPKWGHLKELHSA 343
A L E G + K+ L++L ++
Sbjct: 345 DAVLSEAGDYTE-KYFKLRKLFAS 367
>gi|76636681|ref|XP_597358.2| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|297483828|ref|XP_002693892.1| PREDICTED: galactosidase, beta 1-like 2 [Bos taurus]
gi|296479483|tpg|DAA21598.1| TPA: galactosidase, beta 1-like [Bos taurus]
Length = 758
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 146/325 (44%), Gaps = 30/325 (9%)
Query: 12 LLLTTIGGSDGGGGGGN-------NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
L+ +++ G D G G + + DG++ + +F GS+HY R W
Sbjct: 144 LVCSSLAGLDWSGLGASLWRRRHLGLRADGQNFKLENSAFWIFGGSVHYFRVPRAYWRDR 203
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
+ K + GL+ + T V WNLHEP+ G FDFSG DL FI GL+V LR GP+I
Sbjct: 204 LLKLRACGLNTLTTYVPWNLHEPERGTFDFSGNLDLEAFILLAAEVGLWVILRPGPYICS 263
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIE 184
E GGLP WL P + R+ + F + Y + M++ L GGPII Q+E
Sbjct: 264 EVDLGGLPSWLLRDPDMRLRTTYKGFTEAVDLYFDHL--MLRVVPLQYKHGGPIIAVQVE 321
Query: 185 NEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD-------APDPVINACNGR 237
NEYG K P Y+ + K D G+ ++ D+ D V+ N +
Sbjct: 322 NEYGSY-----NKDPAYMPYIKKALQD--RGIAELLLTSDNQGGLKSGVLDGVLATINLQ 374
Query: 238 QCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGS 292
E T +P + E WT ++ +G I + ++ V+ + GS
Sbjct: 375 SQSELQLFTTILLGAQGSQPKMVMEYWTGWFDSWGGPHYILDSSEVLNTVSAIVK--AGS 432
Query: 293 YVNYYMYHGGTNFGRTASAYVLTGY 317
+N YM+HGGTNFG A Y
Sbjct: 433 SINLYMFHGGTNFGFIGGAMHFQDY 457
>gi|256957323|ref|ZP_05561494.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|257077681|ref|ZP_05572042.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|307270129|ref|ZP_07551446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|422710565|ref|ZP_16767610.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|422721468|ref|ZP_16778057.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422867159|ref|ZP_16913760.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
gi|256947819|gb|EEU64451.1| beta-galactosidase [Enterococcus faecalis DS5]
gi|256985711|gb|EEU73013.1| beta-galactosidase [Enterococcus faecalis JH1]
gi|306513498|gb|EFM82113.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
gi|315031294|gb|EFT43226.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315035298|gb|EFT47230.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
gi|329577710|gb|EGG59137.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
Length = 593
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 140/290 (48%), Gaps = 30/290 (10%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG+IHY R TP W + K G + V+T + WN+HEP+ G +DF
Sbjct: 9 EDFLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDF 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +++ F++ + L V LR +I EW +GGLP WL + RS + F +
Sbjct: 69 EGMKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y ++ + K A L +QGGP+I+ Q+ENEYG +EK Y+R ++ +L
Sbjct: 129 RNYFQVL--LPKLAPLQITQGGPVIMMQVENEYGSYG---MEKA--YLRQTKQIMEELGI 181
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D A + V++A G E F + P +
Sbjct: 182 EVP--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCM 239
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R D+A V +A GS +N YM+HGGTNFG
Sbjct: 240 EYWDGWFNRWGEPVIQREGTDLAKEVKDMLA--VGS-LNLYMFHGGTNFG 286
>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
CL03T12C04]
Length = 725
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 151/307 (49%), Gaps = 30/307 (9%)
Query: 51 IHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQ 110
+HYPR + W + +A+ GL+ V VFWN HE QPG+FDF+G+ D+ F++ Q +
Sbjct: 1 MHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEE 60
Query: 111 GLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARL 170
GLYV LR GP++ EW +GG P WL +++RS + F + +RY + + + L
Sbjct: 61 GLYVILRPGPYVCAEWDFGGYPSWLLKEKDMIYRSKDPRFLSYCERYIKELGKQLSS--L 118
Query: 171 YASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQT--GVPWVMCKQDDA 226
+ GG II+ Q+ENEYG + +L ++ A V L T G V +
Sbjct: 119 TINNGGNIIMVQVENEYGSYAADKEYLAAIRDMIK-EAGFNVPLFTCDGGGQVEAGHIEG 177
Query: 227 PDPVINACNGRQCGETFAGPNSPDK--PAIWTENWTSFYQVYGDE----ARIRSAEDIAY 280
P +N G + F ++ K P E + +++ +G A R AE + +
Sbjct: 178 ALPTLNGVFGE---DIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQLDW 234
Query: 281 HVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY------YD-QAPLDEYGLLRQPK 333
++ G V+ YM+HGGTNF T A GY YD APL E+G PK
Sbjct: 235 MLS------HGVSVSMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWGNCY-PK 287
Query: 334 WGHLKEL 340
+ +E+
Sbjct: 288 YHAFREV 294
>gi|456387967|gb|EMF53457.1| glycosyl hydrolase family 42 [Streptomyces bottropensis ATCC 25435]
Length = 591
Score = 135 bits (340), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 103/322 (31%), Positives = 156/322 (48%), Gaps = 38/322 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
T DG +++G + SG++HY R P +W + KA+ GL+ V+T V WNLH+P
Sbjct: 7 TTTSDG--FLLHGEPFRIISGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQPD 64
Query: 89 PGQ-FDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDN 147
P G DL R+++ +A+GL+V LR GP+I EW GGLP WL P I RS +
Sbjct: 65 PDSPLVLDGLLDLPRYLRLARAEGLHVLLRPGPYICAEWDGGGLPSWLTSDPDIRLRSSD 124
Query: 148 EPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWA 205
F + Y + + + A+ GP+I Q+ENEYG + ++L+ +V A
Sbjct: 125 PRFTAALDGY--LDILLPPLLPYMAANDGPVIAVQVENEYGAYGDDTAYLK----HVHQA 178
Query: 206 AKLAVDLQTGVPWVMCKQDDA-----------PDPVINACNGRQCGETFAG--PNSPDKP 252
+ GV ++ D A P + A G + E+ A + P+ P
Sbjct: 179 LR-----ARGVEELLFTCDQAGSGHHLAAGSLPGVLSTATFGGKIEESLAALRAHMPEGP 233
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY 312
+ +E W ++ +G+E +R A A + +A G+ VN YM+HGGTNFG T A
Sbjct: 234 LMCSEFWIGWFDHWGEEHHVRDAAGAAADLDKLLA--AGASVNIYMFHGGTNFGFTNGAN 291
Query: 313 -------VLTGYYDQAPLDEYG 327
++T Y A L E G
Sbjct: 292 HDQCYAPIVTSYDYDAALTESG 313
Score = 43.1 bits (100), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 56/134 (41%), Gaps = 29/134 (21%)
Query: 557 LKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPV 616
+++ ++G ++G L T G+ ++ W + PL TV AP + PV
Sbjct: 447 VENMGGVNYGPRIGAAKGLLGPVTFNGTALLGWDAH----RLPLADLSTVPFAPADAAPV 502
Query: 617 AINLISMG------------------KGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPR 658
+ G KG+AW+NG +GRYW P ++ Y +P
Sbjct: 503 TVPAFHQGTFEVDTPADTFLSLPGWTKGQAWINGFHLGRYW------NRGPQRTLY-VPG 555
Query: 659 SFLKPTGNLLVLLE 672
L+P N LVLLE
Sbjct: 556 PVLRPGANDLVLLE 569
>gi|22760724|dbj|BAC11309.1| unnamed protein product [Homo sapiens]
Length = 636
Score = 135 bits (340), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/273 (33%), Positives = 130/273 (47%), Gaps = 23/273 (8%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG D F+
Sbjct: 63 IFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDQEAFVL 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL PG+ R+ + F + Y + M
Sbjct: 123 MAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEAVDLYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L +GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 181 RVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED--RGIVELLLTSDN 233
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
V+ N + E TF +P + E WT ++ +G I
Sbjct: 234 KDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ ++ V+ + GS +N YM+HGGTNFG
Sbjct: 294 DSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 324
>gi|298481696|ref|ZP_06999887.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
gi|298272237|gb|EFI13807.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
Length = 778
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 175/397 (44%), Gaps = 52/397 (13%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++L + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDFSG+ D+ F K Q G+YV +R GP+
Sbjct: 61 SHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL + R+ + ++M+R + + K A L ++GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDVALRTLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYGTD-----KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG-------LLRQ----- 331
+ + YM HGGT FG A + + + Y AP+ E G LLR
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTEKFFLLRDLLKNY 350
Query: 332 -PKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
P L E+ +A+ + P + V+ FS L EA
Sbjct: 351 LPAGESLPEVPAALPVIEIPEIHFNKVAPLFSNLPEA 387
>gi|344248604|gb|EGW04708.1| Beta-galactosidase [Cricetulus griseus]
Length = 650
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 146/321 (45%), Gaps = 28/321 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 16 LDYNQDRFLKDGLPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 75
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 76 GQYEFSGDRDVEYFIHLAHKLGLLVILRPGPYICAEWDMGGLPAWLLEKESIVLRSSDPD 135
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ T+++ MK L GGPII Q+ENEYG S+ Y+R+ A
Sbjct: 136 YLAAVDKWLTVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFACDYDYLRFLAH-R 188
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS------------PDKPAIW 255
G ++ D A + + G F + P P I
Sbjct: 189 FRYHLGNDVLLFTTDGANENFLRCGTLQGLYATVDFGAVKNITQAFLIQRKFEPKGPLIN 248
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G+ E +A +L+ +G+ VN YM+ GGTNF A +
Sbjct: 249 SEFYTGWLDHWGEPHYTVKTEIVA--ASLYDLLARGASVNLYMFIGGTNFAYWNGANIPY 306
Query: 314 ---LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 307 AAQPTSYDYDAPLSEAGDLTE 327
>gi|296086917|emb|CBI33129.3| unnamed protein product [Vitis vinifera]
Length = 186
Score = 135 bits (339), Expect = 1e-28, Method: Composition-based stats.
Identities = 56/110 (50%), Positives = 82/110 (74%)
Query: 73 LDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLP 132
++V++T VFW HE PG + F G DL++F+K VQ G+++ L IGPF+ EW + G+P
Sbjct: 69 INVIETYVFWIGHELSPGNYYFGGWYDLLKFVKIVQQDGMWLILHIGPFVAAEWNFDGIP 128
Query: 133 FWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQ 182
WLH V G VFR+++EPFK+HM+++ T+IVN+MK +L+ASQGGPI L+
Sbjct: 129 VWLHYVLGTVFRTNSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPINLAH 178
>gi|291530918|emb|CBK96503.1| Beta-galactosidase [Eubacterium siraeum 70/3]
Length = 579
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 150/320 (46%), Gaps = 25/320 (7%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
++G + SGSIHY R+ P+ W + K G + V+T + WN HE + G F++ G
Sbjct: 12 LDGKPFKVISGSIHYFRTVPEYWQDRLEKLVNIGCNTVETYIPWNFHETEKGNFNWDGMH 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
D+ RFI+ GLY+ +R P+I EW +GGLP WL + R +P+ + Y
Sbjct: 72 DICRFIELADKLGLYMIIRPSPYICSEWEFGGLPAWLLKDRSMRLRCSYKPYLNAVDNYY 131
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
+++ M K A GG II+ QIENEYG + S+LE +R + +
Sbjct: 132 SVL--MPKLAPYQIDNGGNIIMMQIENEYGYYGNDTSYLEFLRDTMRKYGITVPFVTSDG 189
Query: 217 PW----VMCKQDDAPDPVINACNGR--QCGET--FAGPNSPDKPAIWTENWTSFYQVYGD 268
PW D P N + Q GE F G KP + E W ++ V+G+
Sbjct: 190 PWSEFVFKSGMVDGALPTGNFGSSAEWQLGEMRRFIGEG---KPLMCMEFWNGWFDVWGE 246
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG------RTASAYVLTGYYDQAP 322
E I + E A + +K +N+YM+ GGTNFG ++T Y AP
Sbjct: 247 EHNITAPEKAAQELDTL---LKNGSMNFYMFEGGTNFGFMSGKNNEKKTGIVTSYDYDAP 303
Query: 323 LDEYGLLRQPKWGHLKELHS 342
L E G + + K+ KE+ S
Sbjct: 304 LTEDGRITE-KYEKCKEVIS 322
>gi|354472811|ref|XP_003498630.1| PREDICTED: beta-galactosidase [Cricetulus griseus]
Length = 681
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 146/321 (45%), Gaps = 28/321 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ + +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 47 LDYNQDRFLKDGLPFRYISGSIHYFRIPRFYWEDRLLKMKMAGLNAIQMYVPWNFHEPQP 106
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ++FSG RD+ FI GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 107 GQYEFSGDRDVEYFIHLAHKLGLLVILRPGPYICAEWDMGGLPAWLLEKESIVLRSSDPD 166
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ T+++ MK L GGPII Q+ENEYG S+ Y+R+ A
Sbjct: 167 YLAAVDKWLTVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFACDYDYLRFLAH-R 219
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS------------PDKPAIW 255
G ++ D A + + G F + P P I
Sbjct: 220 FRYHLGNDVLLFTTDGANENFLRCGTLQGLYATVDFGAVKNITQAFLIQRKFEPKGPLIN 279
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G+ E +A +L+ +G+ VN YM+ GGTNF A +
Sbjct: 280 SEFYTGWLDHWGEPHYTVKTEIVA--ASLYDLLARGASVNLYMFIGGTNFAYWNGANIPY 337
Query: 314 ---LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 338 AAQPTSYDYDAPLSEAGDLTE 358
>gi|324509196|gb|ADY43870.1| Beta-galactosidase [Ascaris suum]
Length = 639
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 154/319 (48%), Gaps = 31/319 (9%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
++ Y + +++G SGSIHY R P W +++ + GL+ +Q + WN HE
Sbjct: 28 SIDYVNKRFLLDGQPFRYISGSIHYFRVHPDQWNDRLSRMRAAGLNAIQFYIPWNFHEIY 87
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
G F G R++ RF+ LY +RIGP+I GEW GGLP+WL I R+ ++
Sbjct: 88 EGVIGFDGGRNITRFLSLAAQNELYALVRIGPYICGEWENGGLPWWLLKYDDIKMRTSDK 147
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR--WAA 206
F ++R+ +++ ++K + GGPI++ Q+ENEYG K ++R
Sbjct: 148 RFIRAVERWFGVLLPILKPS--LRKNGGPILMIQVENEYGSFTEGCDRKYTTFLRDLTIK 205
Query: 207 KLAVD-------------LQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNS--PD 250
L D L+ G +P V D P+ + Q + FA S P+
Sbjct: 206 HLGDDVVLYTTDGANNQSLKCGSIPGVFATVDFGPN------SEEQIDKNFATQRSYEPN 259
Query: 251 KPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNF----G 306
P + +E + + + + RI + D + + ++ K+ S+ NYYM++GGTNF G
Sbjct: 260 GPLVNSEFYPGWIVTWSQKGRIDPSVDEIINGSKYMFKLGASF-NYYMFYGGTNFAFWNG 318
Query: 307 RTASAYVLTGYYDQAPLDE 325
++ V+T Y APL E
Sbjct: 319 AETTSAVITSYDYFAPLTE 337
>gi|336404675|ref|ZP_08585368.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
gi|335941579|gb|EGN03432.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
Length = 778
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 174/397 (43%), Gaps = 52/397 (13%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++L + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDFSG+ D+ F K Q G+YV +R GP+
Sbjct: 61 SHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL + R+ + ++M+R + + K A L +GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDVALRTLD---PYYMERVGIFMKEVGKQLAPLQVDKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYGTD-----KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG-------LLRQ----- 331
+ + YM HGGT FG A + + + Y AP+ E G LLR
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTEKFFLLRDLLKNY 350
Query: 332 -PKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
P L E+ +A+ + P + V+ FS L EA
Sbjct: 351 LPAGESLPEVPAALPVIEIPEIHFNKVAPLFSNLPEA 387
>gi|241156773|ref|XP_002407847.1| beta-galactosidase precursor, putative [Ixodes scapularis]
gi|215494239|gb|EEC03880.1| beta-galactosidase precursor, putative [Ixodes scapularis]
Length = 388
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 156/316 (49%), Gaps = 22/316 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ + +G + SGS+HY R+ P+ W + K GL+ +QT + W+ HEP+
Sbjct: 35 IDYENNCFLKDGEPFQIISGSMHYFRTLPEQWEDRLTTMKTAGLNTLQTYIEWSSHEPEN 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIV-FRSDNE 148
GQ+DF G+ D+V+FIK + G V LR GPFI+ E GG P+WL V RS ++
Sbjct: 95 GQYDFEGQEDIVKFIKIAERLGFLVILRPGPFIDAERDMGGFPYWLLSEDNTVRLRSSDQ 154
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV-EHSFLEKGPPYVRWAAK 207
+ ++ RY + ++ ++K S GGP+++ Q+ENEYG E F+
Sbjct: 155 RYLKYVDRYFSKLLPLLKPLLY--SNGGPVLMLQVENEYGSYHECDFVYTAHLKDLMRRH 212
Query: 208 LAVDL------QTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDK--PAIWTENW 259
L D+ G ++ C ++D ++ G +FA P + +E +
Sbjct: 213 LGPDVLLYTTDGNGDRYLKCGKNDGAYTTVDFGPGSDVVASFAAQRRHQDRGPLMNSEFY 272
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD 319
+ + +GD+ +A +A + + M S VN Y++HGG++FG TA A + G Y
Sbjct: 273 SGWLDNWGDKHWEGNASAVAETLREMLT-MNAS-VNIYVFHGGSSFGCTAGANLDKGVYS 330
Query: 320 --------QAPLDEYG 327
AP++E G
Sbjct: 331 PNPTSYDYDAPMNEAG 346
>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
Length = 586
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 153/316 (48%), Gaps = 29/316 (9%)
Query: 45 ILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFI 104
++ GSIHY R + W + K + G + V T + WNLHE + G+FDFS DL ++
Sbjct: 1 MIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYV 60
Query: 105 KEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNM 164
+ GL+V LR GP+I E GGLP WL P R+ N+ F + +Y ++
Sbjct: 61 LLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIP- 119
Query: 165 MKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQD 224
K L GGP+I Q+ENEYG SF +K Y+ + K L+ G+ ++ D
Sbjct: 120 -KILPLQYRHGGPVIAVQVENEYG----SF-QKDRNYMNYLKKAL--LKRGIVELLLTSD 171
Query: 225 DAPDPVINACNGRQ--------CGETFAGPN--SPDKPAIWTENWTSFYQVYGDEARIRS 274
D I + NG ++F + DKP + E WT +Y +G + +S
Sbjct: 172 DKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKS 231
Query: 275 AEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-------RTASAYVLTGYYDQAPLDEYG 327
AE+I + V FI+ G N YM+HGGTNFG V+T Y A L E G
Sbjct: 232 AEEIRHTVYKFIS--YGLSFNMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAG 289
Query: 328 LLRQPKWGHLKELHSA 343
+ K+ L++L ++
Sbjct: 290 DYTE-KYFKLRKLFAS 304
>gi|397498763|ref|XP_003820147.1| PREDICTED: beta-galactosidase-1-like protein 2 [Pan paniscus]
Length = 720
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 134/285 (47%), Gaps = 23/285 (8%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + ++ +F GSIHY R + W + K K GL+ + T V WNLHEP+ +FD
Sbjct: 135 GWNFVLEDSSFRIFGGSIHYFRVPREYWRDRLLKMKACGLNTLTTYVPWNLHEPERSKFD 194
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG DL F+ GL+V LR GP+I E GGLP WL PG+ R+ + F
Sbjct: 195 FSGNLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPGMRLRTTYKGFTEA 254
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
+ Y + M + L +GGPII Q+ENEYG K P Y+ + K D
Sbjct: 255 VDLYFDHL--MSRVVPLQYKRGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED-- 305
Query: 214 TGVPWVMCKQDDAP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTS 261
G+ ++ D+ V+ N + E TF +P + E WT
Sbjct: 306 RGIVELLLTSDNKDGLSKGIVQGVLATINLQSTHELQLLTTFLFNVQGTQPKMVMEYWTG 365
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
++ +G I + ++ V+ + GS +N YM+HGGTNFG
Sbjct: 366 WFDSWGGPHNILDSSEVLKTVSAIVD--AGSSINLYMFHGGTNFG 408
>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
941]
Length = 613
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/358 (29%), Positives = 148/358 (41%), Gaps = 39/358 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N G +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ
Sbjct: 31 NFGTQGTQFARDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQ 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQFDFSG D+ F++E AQGL V LR GP+ EW GG P WL I RS +
Sbjct: 91 QGQFDFSGHNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDP 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F + Y + N ++ L GGPII Q+ENEYG +H+++ A
Sbjct: 151 RFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------AD 199
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAI 254
A+ ++ G + D D + N P PD+P +
Sbjct: 200 NRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRM 259
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
E W ++ +G A A + +G N YM+ GGT+FG A
Sbjct: 260 VGEYWAGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFGFMNGANFQ 317
Query: 314 ----------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 318 NNPSDHYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGVQPPALPAPIATTTL 374
>gi|125556151|gb|EAZ01757.1| hypothetical protein OsI_23786 [Oryza sativa Indica Group]
Length = 101
Score = 134 bits (338), Expect = 1e-28, Method: Composition-based stats.
Identities = 59/96 (61%), Positives = 70/96 (72%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MWP LI KAKEGGLD ++T VFWN HEP Q++F G D+VRF KE+Q GLY LRIG
Sbjct: 1 MWPDLIKKAKEGGLDAIETYVFWNGHEPHRRQYNFVGNYDIVRFFKEIQNAGLYAILRIG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
P+I GEW YGGLP WL D+PG+ FR N PF+ +K
Sbjct: 61 PYICGEWNYGGLPAWLRDIPGMQFRLHNAPFESVLK 96
>gi|296475022|tpg|DAA17137.1| TPA: galactosidase, beta 1 precursor [Bos taurus]
Length = 653
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 156/330 (47%), Gaps = 29/330 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HE QP
Sbjct: 33 IDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQP 92
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+++FSG D+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 93 GRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDPD 152
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ M+ L GGPII Q+ENEYG S+L Y+R+ K
Sbjct: 153 YLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFLQKRF 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFA-GPN-----------SPDKPAIW 255
D G ++ D + ++ A G F+ G N P P +
Sbjct: 207 HD-HLGEDVLLFTTDGVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQRKFEPTGPLVN 265
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G S++ +A+ + +A G+ VN YM+ GGTNF A +
Sbjct: 266 SEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLA--LGANVNMYMFIGGTNFAYWNGANIPY 323
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E G L + K+ L+++
Sbjct: 324 QPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|300770171|ref|ZP_07080050.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
gi|300762647|gb|EFK59464.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
Length = 638
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 169/681 (24%), Positives = 266/681 (39%), Gaps = 136/681 (19%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N YDG++ I SG +HY R Q W + K GL+ V T VFWN HE
Sbjct: 40 NFVYDGKATRI-------LSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEES 92
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++F G DL FIK GL+V LR GP+ EW +GG P+WL + G+ R DN
Sbjct: 93 PGNWNFEGDHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDNA 152
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKG 198
F + K+Y + + L + GGPII+ Q ENE+G + EH
Sbjct: 153 KFLEYTKKYIDRLAK--EVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAK 210
Query: 199 PPYVRWAAKLAVDLQTGVPWVMCKQDDAPDPV------INACNGRQCGETFAGPNSPDKP 252
A V L T + + P + N N ++ + + P
Sbjct: 211 IKKQLEEAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQYNNNQGPYMV 270
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYV------NYYMYHGGTNFG 306
A + W + AE A A IA+ Y+ NYYM HGGTNFG
Sbjct: 271 AEFYPGWLDHW-----------AEPFAKVDAGRIARQTEKYLQNDISFNYYMVHGGTNFG 319
Query: 307 RTASAYV---------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLV 357
T+ A +T Y AP+ E G PK+ ++
Sbjct: 320 FTSGANYNNKSDIQPDITSYDYDAPISEAG-WTTPKYDSIRT------------------ 360
Query: 358 SMNFSKLQEAFIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFN 417
+ Q ++ + K +N + E+P + ++ + + F+
Sbjct: 361 -----------VIQKYADYTVPAIPK----------ANPVIEIPSIKLTAVANV----FD 395
Query: 418 TAKLDSVEQWEEYKEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESV 477
A K A T +ET L NF EQ++ A+ Y+ Y+ +F P + +
Sbjct: 396 YA-----------KSAKTTINETPL--NF--EQLD---QANGYVLYSKQFNQ-PINGK-- 434
Query: 478 LKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGA 537
LK+ L +I+G VG ++ F +M I + + +L +G + G+
Sbjct: 435 LKIDGLRDFAVVYIDGTKVGEL-----NRVFKNYEMDIDIPFNSTLQILVENMGRINYGS 489
Query: 538 YLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTH 597
+ G+ + + E+ + W Q L +K+ + + ++ +S
Sbjct: 490 EIIHNHKGIISPVLINDMEI----TGDWTMQ-QLPMDKVPDLAGKQTATIQNTKVNTSKI 544
Query: 598 QPLTWYKTVFDAPTGSDPVAINLISMGK---GEAWVNGQSIGRYWVSFLTPQGTPSQSWY 654
L ++ + I M K G ++NG +IGRYW + PQ T
Sbjct: 545 ATLKGQPVLYQGTFDLKEIGDTFIDMEKWGKGIVFINGINIGRYWKT--GPQHT-----L 597
Query: 655 HIPRSFLKPTGNLLVLLEEEN 675
+IP +LK N +V+ E+ N
Sbjct: 598 YIPGPYLKKGSNSIVIFEQLN 618
>gi|71731106|gb|EAO33173.1| Beta-galactosidase [Xylella fastidiosa subsp. sandyi Ann-1]
Length = 612
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 107/340 (31%), Positives = 152/340 (44%), Gaps = 25/340 (7%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G I +G L SG+IH+ R W + KA+ GL+ V+T VFWNL E + GQFD
Sbjct: 32 GTQFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFD 91
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F+G D+ F++E +QGL V LR GP++ EW GG P WL P + RS + F
Sbjct: 92 FTGNNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDA 151
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+RY + ++ L GGPII Q+ENEYG +H +L+ A
Sbjct: 152 SQRYLEALGTQVRP--LLNGNGGPIIAVQVENEYGSYGDDHGYLQAVHALFIKAGLGGAL 209
Query: 212 LQTGVPWVMCKQDDAPDPVINACN-----GRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
L T M PD V+ A N +Q + A P +P + E W ++ +
Sbjct: 210 LFTADGAQMLGNGTLPD-VLAAVNFAPGEAKQALDKLA-TFHPGQPQLVGEYWAGWFDQW 267
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-----------LT 315
G A+ A + + +G +N YM+ GGT+FG A T
Sbjct: 268 GKPHAQTDAKQQADEIEWML--RQGHSINLYMFVGGTSFGFMNGANFQGGPGDHYSPQTT 325
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGV 355
Y A LDE G PK+ +++ + V P L G
Sbjct: 326 SYDYDAVLDEAG-RPMPKFALFRDVITRVTGLQPPPLPGA 364
>gi|357391354|ref|YP_004906195.1| putative beta-galactosidase [Kitasatospora setae KM-6054]
gi|311897831|dbj|BAJ30239.1| putative beta-galactosidase [Kitasatospora setae KM-6054]
Length = 588
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 100/330 (30%), Positives = 149/330 (45%), Gaps = 44/330 (13%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+TYD ++G + SG++HY RS P+ W +A + GL+ V+T V WNLHEP P
Sbjct: 2 LTYDSTGFRLDGRPLRVLSGAVHYFRSRPEQWADRLAAVRAMGLNTVETYVPWNLHEPAP 61
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+F G +L F+ E + QGL+ +R GP+I EW GGLP WL G R+ +
Sbjct: 62 GRFARVG--ELGAFLDEARRQGLWTIVRPGPYICAEWDNGGLPGWLTARLGRRVRTGDPE 119
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGP 199
F + + +++ + R + G +++ Q+ENEYG + E+G
Sbjct: 120 FLAAVGAFFDVLLPQV-VERQWGRPDGSVLMVQVENEYGAFGSDAGYLAALARGLRERGV 178
Query: 200 PYVRWAAKLAVD--LQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWT 256
+ + D L G VP V+ + DP R+ + P+ P
Sbjct: 179 SVPLFTSDGPEDHMLAAGTVPGVLATVNFGSDPERGFAALRR--------HRPEDPPFCM 230
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY---- 312
E W ++ +G R A+D A + +A G VN YM HGGT+FG +A A
Sbjct: 231 EFWNGWFDQWGRPHHTRGADDAADSLRRILA--AGGSVNLYMAHGGTSFGTSAGANHADP 288
Query: 313 --------------VLTGYYDQAPLDEYGL 328
+T Y APLDE GL
Sbjct: 289 PFNSTDWTHSPYQPTVTSYDYDAPLDERGL 318
>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
Length = 612
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 152/617 (24%), Positives = 241/617 (39%), Gaps = 97/617 (15%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G I +G L SG+IH+ R W + KA+ GL+ V+T VFWNL E + GQFD
Sbjct: 32 GTQFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFD 91
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F+G D+ F++E +QGL V LR GP++ EW GG P WL P + RS + F
Sbjct: 92 FTGNNDISAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDA 151
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+RY + ++ L GGPII Q+ENEYG +H +L+ A
Sbjct: 152 SQRYLEALGTQVRP--LLNGNGGPIIAVQVENEYGSYGDDHGYLQAVRALFIKAGLGGAL 209
Query: 212 LQTGVPWVMCKQDDAPDPVINACN-----GRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
L T M PD V+ A N +Q + A P +P + E W ++ +
Sbjct: 210 LFTADGAQMLGNGTLPD-VLAAVNVAPGEAKQALDKLA-TFHPGQPQLVGEYWAGWFDQW 267
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G A+ A + + +G +N YM+ GGT+FG ++ + P D Y
Sbjct: 268 GKPHAQTDAKQQADEIEWML--RQGHSINLYMFVGGTSFG-----FMNGANFQGGPSDHY 320
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
S S ++ +A + + F++ +D
Sbjct: 321 --------------------------SPQTTSYDY----DAALDEAGRPMPKFVLFRDVI 350
Query: 387 NNATVYFSNLMYELPPLSISI----LPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSL 442
T + PPL + LP NT S W+ A+ T +
Sbjct: 351 TRVT------GLQPPPLPAATRFIDLP-------NTPLRASASLWDNLPAAVATTADP-- 395
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGK 502
+ M A Y+ Y H P + L + + +++ FVG A +
Sbjct: 396 ------QPMERYGQAYGYILYRTTL-HGP--RKGTLYLGEVRDDARVYVDRLFVGRAERR 446
Query: 503 HSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSS 562
S V + +G + + +L G + G +L AGL + + + + ++ +
Sbjct: 447 RQQVSVE----VDIPSGAHRLDVLVENSGRVNYGPHLADGRAGLIDPVMLNHERVNNWET 502
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLIS 622
F Q E + +G P G + H+ +T D +++ +
Sbjct: 503 FLLPLQT---PEAI-----HGWTTAPMQ--GPAFHRGTLLIRTPGD-------TFLDMAA 545
Query: 623 MGKGEAWVNGQSIGRYW 639
KG W NG +GRYW
Sbjct: 546 FSKGVTWANGHLLGRYW 562
>gi|271968683|ref|YP_003342879.1| beta-galactosidase [Streptosporangium roseum DSM 43021]
gi|270511858|gb|ACZ90136.1| Beta-galactosidase [Streptosporangium roseum DSM 43021]
Length = 576
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 169/669 (25%), Positives = 260/669 (38%), Gaps = 141/669 (21%)
Query: 31 TYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPG 90
+ D S ++G + SG++HY R + W +A + GL+ V+T V WNLHEP PG
Sbjct: 5 SVDDGSFQLDGTPFRVLSGALHYFRVHREQWGHRLAMLRAMGLNTVETYVPWNLHEPWPG 64
Query: 91 QFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPF 150
DF +L F+ A+GL +R GP+I EW GGLP WL G + SD E +
Sbjct: 65 --DFRRVEELGAFLDAAAAEGLLAIVRPGPYICAEWDNGGLPVWLT---GHLRTSDPE-Y 118
Query: 151 KFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEK-GPPYVRWAAK 207
H+ RY I + + A ++GG +I+ Q+ENEYG +H++L VR +
Sbjct: 119 LAHVDRYLDRI--LPQVAERQVTRGGNVIMVQVENEYGSYGSDHAYLRHLADGLVRRGIE 176
Query: 208 LAVDLQTGVPWVMCKQDDAPDPVINACN-GRQCGETFAG--PNSPDKPAIWTENWTSFYQ 264
+ + G P D V+ N G + + FA + PD P E W ++
Sbjct: 177 VPLFTSDG-PADHYLTGGTIDGVLATVNFGSEPEQAFATLRAHRPDDPLFCMEFWCGWFD 235
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------------ 312
+G E +R D A + +A G+ VN YM HGG+N G A A
Sbjct: 236 HWGHEHVVRDPHDAADTLERILA--AGASVNLYMAHGGSNPGTRAGANRDGAQADGGWRP 293
Query: 313 VLTGYYDQAPLDEYGLLRQPKWGHLKELHSAV--KLCLKPMLSGVLVSMNFSKLQEAFIF 370
+T Y AP+DE G + W +E+ SA +L P + V
Sbjct: 294 TVTSYDYDAPIDERGAPTEKFW-RFREVLSAYNEELPEVPAVPAV--------------- 337
Query: 371 QGSSECAAFLVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEY 430
LPP ++ P+ + LD + + E
Sbjct: 338 -----------------------------LPPATLH--PEGSVLLRQA--LDVLARPEVV 364
Query: 431 KEAIPTYDETSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAF 490
PT++E L +L + Y L + + H F
Sbjct: 365 APVPPTFEELGLEHGLVLYRTTVPGPREPY----------------PLTLREVRDRAHVF 408
Query: 491 INGEFVGSAH-------GKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRV 543
++G G G + S +E +V + TN LL GL + ++ +
Sbjct: 409 VDGRPAGVVERDAEVLPGPVAGGSAVVEVLVESMGRTNYGPLLGERKGLLGGILHHQQYL 468
Query: 544 AGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWY 603
G +I L+D S+ ++G G+ P ++
Sbjct: 469 HGYGARAIP----LEDVSALAFG-------------------------QGTVDEAP-AFF 498
Query: 604 KTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKP 663
+TV + +D + L GKG WVNG +GRYW P ++ Y +P L+
Sbjct: 499 RTVLEVTEPAD-AFLMLPGWGKGYVWVNGVLLGRYW------DRGPQRTLY-VPAPLLRA 550
Query: 664 TGNLLVLLE 672
GN +V LE
Sbjct: 551 GGNEIVHLE 559
>gi|28199702|ref|NP_780016.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182682446|ref|YP_001830606.1| beta-galactosidase [Xylella fastidiosa M23]
gi|386083781|ref|YP_006000063.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|417557800|ref|ZP_12208811.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
gi|28057823|gb|AAO29665.1| beta-galactosidase [Xylella fastidiosa Temecula1]
gi|182632556|gb|ACB93332.1| Beta-galactosidase [Xylella fastidiosa M23]
gi|307578728|gb|ADN62697.1| Beta-galactosidase [Xylella fastidiosa subsp. fastidiosa GB514]
gi|338179583|gb|EGO82518.1| Beta-galactosidase [Xylella fastidiosa EB92.1]
Length = 612
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 152/617 (24%), Positives = 241/617 (39%), Gaps = 97/617 (15%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G I +G L SG+IH+ R W + KA+ GL+ V+T VFWNL E + GQFD
Sbjct: 32 GTQFIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFD 91
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
F+G D+ F++E +QGL V LR GP++ EW GG P WL P + RS + F
Sbjct: 92 FTGNNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDA 151
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+RY + ++ L GGPII Q+ENEYG +H +L+ A
Sbjct: 152 SQRYLEALGTQVRP--LLNGNGGPIIAVQVENEYGSYGDDHGYLQAVRALFIKAGLGGAL 209
Query: 212 LQTGVPWVMCKQDDAPDPVINACN-----GRQCGETFAGPNSPDKPAIWTENWTSFYQVY 266
L T M PD V+ A N +Q + A P +P + E W ++ +
Sbjct: 210 LFTADGAQMLGNGTLPD-VLAAVNVAPGEAKQALDKLA-TFHPGQPQLVGEYWAGWFDQW 267
Query: 267 GDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAPLDEY 326
G A+ A + + +G +N YM+ GGT+FG ++ + P D Y
Sbjct: 268 GKPHAQTDAKQQADEIEWML--RQGHSINLYMFVGGTSFG-----FMNGANFQGGPSDHY 320
Query: 327 GLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAFLVNKDKR 386
S S ++ +A + + F + +D
Sbjct: 321 --------------------------SPQTTSYDY----DAVLDEAGRPMPKFALFRDVI 350
Query: 387 NNATVYFSNLMYELPPLSISI----LPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDETSL 442
T + PPL + LPD A S W+ A+ T +
Sbjct: 351 TRVT------GLQPPPLPAASRFIDLPDTPLRA-------SASLWDNLPAAVATTADP-- 395
Query: 443 RANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGK 502
+ M A Y+ Y H P L + + H +++ FVG A +
Sbjct: 396 ------QPMERYGQAYGYILYRTTL-HGPRKGR--LYLGEVRDDAHVYVDRLFVGRAERR 446
Query: 503 HSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKDFSS 562
+ V + +GT+ + +L G + G +L AGL + + + ++ +
Sbjct: 447 RQQ----VWVEVDIPSGTHCLDVLVENSGRVNYGPHLADGRAGLIGPVMLNHERVNNWET 502
Query: 563 FSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLIS 622
F Q E + +T + G + H+ + +T D +++ +
Sbjct: 503 FLLPLQT---PEAIHGWTTAPMQ-------GPAFHRGTLFIRTPGD-------TFLDMEA 545
Query: 623 MGKGEAWVNGQSIGRYW 639
KG W NG +GRYW
Sbjct: 546 FSKGVTWANGHMLGRYW 562
>gi|374312360|ref|YP_005058790.1| glycoside hydrolase family protein [Granulicella mallensis
MP5ACTX8]
gi|358754370|gb|AEU37760.1| glycoside hydrolase family 35 [Granulicella mallensis MP5ACTX8]
Length = 627
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 145/319 (45%), Gaps = 40/319 (12%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG + Y R W + KA GL+ + VFWN+HEP P +DFSG+ D+ F++
Sbjct: 55 IVSGELEYARIPRPYWRDRLRKAHAMGLNAITIYVFWNIHEPTPEVYDFSGQNDVAEFVR 114
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
E Q +GLYV LR GP++ EW GG P WL + RS FK R+ M+
Sbjct: 115 EAQQEGLYVILRPGPYVCAEWDLGGYPAWLLKDHEMKLRSLQPEFKAAATRW--MLRLGQ 172
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L AS+GGPI+ Q+ENEYG SF + Y++W +L LQ G + D
Sbjct: 173 ELTPLQASRGGPILAVQVENEYG----SFGDDH-EYMKWVHELV--LQAGFGGSLLYTGD 225
Query: 226 APDPVINACNGRQCGETFAGPN----------------SPDKPAIWTENWTSFYQVYGDE 269
D + FAG + P P E W ++ +G++
Sbjct: 226 GADVLKQGT----LPSVFAGIDFGTGDAARSIKLYKAFRPQTPVYVAEYWDGWFDHWGEK 281
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY--------VLTGYYDQA 321
++ A + + +G ++ YM HGGT+FG A ++ Y A
Sbjct: 282 HQLTDAAKQETEIRSMLE--QGDSISLYMVHGGTSFGWMNGANNDHDGYQPDVSSYDYDA 339
Query: 322 PLDEYGLLRQPKWGHLKEL 340
PLDE G R PK+ L+ +
Sbjct: 340 PLDESGRPR-PKYFRLRNI 357
>gi|158455090|gb|AAI40686.2| Galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 156/330 (47%), Gaps = 29/330 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HE QP
Sbjct: 33 IDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQP 92
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+++FSG D+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 93 GRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDPD 152
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ M+ L GGPII Q+ENEYG S+L Y+R+ K
Sbjct: 153 YLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFLQKRF 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFA-GPN-----------SPDKPAIW 255
D G ++ D + ++ A G F+ G N P P +
Sbjct: 207 HD-HLGEDVLLFTTDGVNERLLQCGALQGLYATLDFSPGTNLTAAFMLQRKFEPTGPLVN 265
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G S++ +A+ + +A G+ VN YM+ GGTNF A +
Sbjct: 266 SEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLA--LGANVNMYMFIGGTNFAYWNGANIPY 323
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E G L + K+ L+++
Sbjct: 324 QPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|440698010|ref|ZP_20880386.1| glycosyl hydrolase family 35 [Streptomyces turgidiscabies Car8]
gi|440279645|gb|ELP67504.1| glycosyl hydrolase family 35 [Streptomyces turgidiscabies Car8]
Length = 586
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 146/316 (46%), Gaps = 28/316 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
T DG +++G + SG++HY R P W + KA+ GL+ V+T V WNLH+P+
Sbjct: 5 TTTSDG--FLLHGEPFRIISGAMHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPE 62
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG G DL R+++ QA+GL+V LR GPFI EW GGLP WL P I RS +
Sbjct: 63 PGTLALDGILDLPRYLRLAQAEGLHVLLRPGPFICAEWDGGGLPSWLTTDPDIRLRSSDP 122
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
F + RY +++ + A GGP+I Q+ENEYG Y+ A+
Sbjct: 123 RFTGAIDRYLDLLLPPLLPY--LAESGGPVIAVQVENEYGAYGDDAA-----YLEHLAEA 175
Query: 209 AVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAG----------PNSPDKPAIWTEN 258
G C Q + + G TF + P+ P + E
Sbjct: 176 LRSRGIGELLFTCDQANPEHLAAGSLPGVLTTGTFGSKVAASLEQLRAHQPEGPLMCAEF 235
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------ 312
W ++ +G+E R A D A + ++ G+ VN YM+HGGTNF T A
Sbjct: 236 WIGWFDHWGEEHHTRDAADAAADLDRLLS--AGASVNIYMFHGGTNFAFTNGANHDHAYQ 293
Query: 313 -VLTGYYDQAPLDEYG 327
++T Y A L E G
Sbjct: 294 PMVTSYDYDAALSENG 309
>gi|296399387|gb|ADH10509.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 148/321 (46%), Gaps = 28/321 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ S + +G SGSIHY R W + K K GLD +QT V WN HEP+
Sbjct: 9 IDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHEPRM 68
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G +DF G +DL F++ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 69 GTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSSDSD 128
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ ++R+ +++ M+ LY GGPII+ Q+ENEYG S+ Y+R
Sbjct: 129 YLEAVERWMGVLLPKMR-PYLY-QNGGPIIMVQVENEYG----SYFACDYDYLR-FLLKL 181
Query: 210 VDLQTGVPWVMCKQDDAPD------------PVINACNGRQCGETFAGPNS--PDKPAIW 255
L G V+ D A ++ G F S P P +
Sbjct: 182 FRLHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPMGPLVN 241
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G + AE +A + +A +G+ VN YM+ GGTNF A +
Sbjct: 242 SEFYTGWLDHWGHRHSVVPAETVAKTLNEILA--RGANVNLYMFIGGTNFAYWNGANMPY 299
Query: 314 ---LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 300 MPQPTSYDYDAPLSEAGDLTE 320
>gi|344291569|ref|XP_003417507.1| PREDICTED: beta-galactosidase-1-like protein 2 [Loxodonta africana]
Length = 650
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 135/290 (46%), Gaps = 23/290 (7%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G++ ++ +F GS+HY R Q W + K K GL+ + T V WNLHEP+ G+FD
Sbjct: 65 GQNFMLESSTFWIFGGSVHYFRVPRQYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFD 124
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG DL FI GL+V LR GP+I E GGLP WL P + R+ + F
Sbjct: 125 FSGNLDLEAFIWMAAELGLWVILRPGPYICSEIDLGGLPSWLLQDPNMKLRTTYKGFTEA 184
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ 213
+ Y ++ + L GGPII Q+ENEYG K P Y+ + K D
Sbjct: 185 VDLYFDHLI--ARVVPLQYKLGGPIIAVQVENEYGSY-----NKDPAYMPYVKKALED-- 235
Query: 214 TGVPWVMCKQDDAP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTS 261
G+ ++ D+ V+ N + E TF +P + E WT
Sbjct: 236 RGIVELLLTSDNKDGLSKGVIHGVLATINLQSQQELHLLTTFLLNAQGIQPKMVMEYWTG 295
Query: 262 FYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
++ +G I + ++ V+ I GS +N YM+HGGTNFG A
Sbjct: 296 WFDSWGGPHNILDSSEVLKTVSAIID--AGSSINLYMFHGGTNFGFINGA 343
>gi|327283884|ref|XP_003226670.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Anolis
carolinensis]
Length = 584
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/271 (33%), Positives = 127/271 (46%), Gaps = 20/271 (7%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ GS+HY R + W + K K GL+ V T V WNLHE G+FDFSG DL FIK
Sbjct: 29 ILGGSLHYFRIPREYWKDRLMKMKACGLNTVTTYVPWNLHEAIRGKFDFSGNLDLQVFIK 88
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
+ GL+V LR GP+I EW GGLP WL P + R+ F + Y ++
Sbjct: 89 MAEEVGLWVILRPGPYICSEWDLGGLPSWLLQDPEMQLRTTYRGFTEAVDNYFDRLIP-- 146
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQD- 224
+ L GGPII Q+ENEYG P Y+ + K+A+ + V +M +
Sbjct: 147 QVVPLQYKYGGPIIAVQVENEYGSYAQD-----PSYMTY-IKMALTSRKIVEMLMTSDNH 200
Query: 225 --------DAPDPVINACNGRQCGETFAGPNSPDK-PAIWTENWTSFYQVYGDEARIRSA 275
D IN F + +K P + E WT ++ +G + A
Sbjct: 201 DGLVSGTVDGALATINFQKLDTAIMVFLSTDQRNKMPKMVMEYWTGWFDSWGGLHHVFDA 260
Query: 276 EDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+D+ V I G+ +N YM+HGGTNFG
Sbjct: 261 DDMVQTVGKVIK--LGASINLYMFHGGTNFG 289
>gi|1352080|sp|P48982.1|BGAL_XANMN RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
gi|1045034|gb|AAC41485.1| beta-galactosidase [Xanthomonas axonopodis pv. manihotis]
Length = 598
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 127/287 (44%), Gaps = 27/287 (9%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ GQFD
Sbjct: 34 GTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 93
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F+KE AQGL V LR GP+ EW GG P WL I RS + F
Sbjct: 94 FSGNNDVAAFVKEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAA 153
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+ Y + ++ L GGPII Q+ENEYG +H+++ A A+
Sbjct: 154 SQAYLDALAKQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------ADNRAMY 202
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAIWTENW 259
++ G + D D + N P PD+P + E W
Sbjct: 203 VKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYW 262
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
++ +G A A + +G N YM+ GGT+FG
Sbjct: 263 AGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFG 307
>gi|296399420|gb|ADH10537.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
Length = 571
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 148/321 (46%), Gaps = 28/321 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ S + +G SGSIHY R W + K K GLD +QT V WN HEP+
Sbjct: 9 IDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHEPRM 68
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G +DF G +DL F++ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 69 GTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSSDSD 128
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ ++R+ +++ M+ LY GGPII+ Q+ENEYG S+ Y+R
Sbjct: 129 YLEAVERWMGVLLPKMR-PYLY-QNGGPIIMVQVENEYG----SYFACDYDYLR-FLLKL 181
Query: 210 VDLQTGVPWVMCKQDDAPD------------PVINACNGRQCGETFAGPNS--PDKPAIW 255
L G V+ D A ++ G F S P P +
Sbjct: 182 FRLHLGHEVVLFTTDGASQFHLKCGALQGLYATVDFAPGGNVTAAFLAQRSSEPMGPLVN 241
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G + AE +A + +A +G+ VN YM+ GGTNF A +
Sbjct: 242 SEFYTGWLDHWGHRHSVVPAETVAKTLNEILA--RGANVNLYMFIGGTNFAYWNGANMPY 299
Query: 314 ---LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 300 MPQPTSYDYDAPLSEAGDLTE 320
>gi|78042544|ref|NP_001030215.1| beta-galactosidase precursor [Bos taurus]
gi|75057630|sp|Q58D55.1|BGAL_BOVIN RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|61554628|gb|AAX46589.1| galactosidase, beta 1 [Bos taurus]
gi|148839051|dbj|BAF64285.1| galactosidase, beta 1 [Bos taurus]
Length = 653
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 156/330 (47%), Gaps = 29/330 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HE QP
Sbjct: 33 IDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQP 92
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+++FSG D+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 93 GRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDPD 152
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ M+ L GGPII Q+ENEYG S+L Y+R+ K
Sbjct: 153 YLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFLQKRF 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFA-GPN-----------SPDKPAIW 255
D G ++ D + ++ A G F+ G N P P +
Sbjct: 207 HD-HLGEDVLLFTTDGVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQRKFEPTGPLVN 265
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G S++ +A+ + +A G+ VN YM+ GGTNF A +
Sbjct: 266 SEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLA--LGANVNMYMFIGGTNFAYWNGANIPY 323
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E G L + K+ L+++
Sbjct: 324 QPQPTSYDYDAPLSEAGDLTE-KYFALRDI 352
>gi|171683861|ref|XP_001906872.1| hypothetical protein [Podospora anserina S mat+]
gi|170941891|emb|CAP67543.1| unnamed protein product [Podospora anserina S mat+]
Length = 1082
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 126/410 (30%), Positives = 181/410 (44%), Gaps = 58/410 (14%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHY---PRSTPQMWPRLIAKAKEGGLDVVQTLV 80
G G VTYD SL + G R +L+SG HY PRS P++W ++ K K G + V V
Sbjct: 95 GWQGPAVTYDNNSLSVYGERIMLYSGEFHYFRLPRS-PELWCDVLVKIKAMGFNAVSIYV 153
Query: 81 FWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPG 140
W + EP G++D G DL FI Q GLYV R GP+I GE GGLP WL
Sbjct: 154 PWMMLEPLRGEWDEVGWFDLDLFIGFAQTNGLYVIARPGPYINGEVTGGGLPGWLQRTTP 213
Query: 141 IVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPP 200
+ +D E F + Y + N+M A+ GGP+IL Q+ENEY M S+ KG P
Sbjct: 214 TLRTADLE-FLQAAENYVVRVANLM--AKWQVDNGGPVILYQVENEYTMSTDSY--KGFP 268
Query: 201 ---YVRWAAKLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET-------------FA 244
Y++W + A + +P + +DA P N+ G GE +
Sbjct: 269 DNGYMQWLIEKAKNASITIPII---NNDAW-PAGNSRPGIGVGEVDIYGHDLYPFGLDCS 324
Query: 245 GPNSPDKPAIWTENWTSF----------------YQVYG----DEARIRSAEDIAYHVAL 284
+ P+ A +T+ W+ Y +G DE ++ +D+ V
Sbjct: 325 AKDWPEN-ATYTDLWSKHIGMSPGTPYTIPEGGAYDTWGSVGYDEC-VKLFDDVQARVLF 382
Query: 285 FIAKMKGSYV-NYYMYHGGTNFGRTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSA 343
+ G V N YM GGTN+G YV T Y A + E + +PK+ LK +
Sbjct: 383 KNSYAAGVKVFNVYMIFGGTNWGNLGDPYVYTSYDYGAAIAEDRTIGRPKYSELKLQANF 442
Query: 344 VKLCLKPMLSGVLVSMNFSKLQEAFI-FQGSSECAAFLVNKDKRNNATVY 392
K+ G L +M F + E + FQ +S + + + T Y
Sbjct: 443 FKVS-----PGYLAAMPFENMTEGIVGFQMNSTDDKLVATQLTGDFGTFY 487
>gi|319893645|ref|YP_004150520.1| beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|386318129|ref|YP_006014292.1| glycosyl hydrolase [Staphylococcus pseudintermedius ED99]
gi|317163341|gb|ADV06884.1| Beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
gi|323463300|gb|ADX75453.1| glycosyl hydrolase, family 35 [Staphylococcus pseudintermedius
ED99]
Length = 590
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 138/301 (45%), Gaps = 24/301 (7%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R W + K G + V+T V WN HE ++DF G +DL FI+
Sbjct: 19 ILSGAIHYFRIPKDDWEDSLYNLKALGFNTVETYVPWNFHETIENEYDFKGHKDLKHFIE 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GLYV +R P+I EW +GG P WL + + RS +E + +K+Y + ++
Sbjct: 79 LAAKLGLYVIVRPSPYICAEWEFGGFPAWLLNDRTMRIRSRDEKYLEKVKKYYHELFKIL 138
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
++ QGGPII+ Q+ENEYG +H +L +R + W C +
Sbjct: 139 TPLQI--DQGGPIIMMQVENEYGSFGQDHDYLRSLAHMMREEGVTVPFFTSDGAWDQCLR 196
Query: 224 -----DDAPDPVIN----ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRS 274
+D P N + +TF S P + E W ++ +G+ R
Sbjct: 197 AGSLIEDDILPTGNFGSRTVQNFENLKTFQQEFSKKWPLMCMEFWDGWFNRWGEPVIKRD 256
Query: 275 AEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR--------TASAYVLTGYYDQAPLDEY 326
++D+A V +K +N YM+HGGTNFG T +T Y APLDE
Sbjct: 257 SDDLAEEVR---DAVKLGSLNLYMFHGGTNFGFWNGCSARGTKDLPQVTSYDYHAPLDEA 313
Query: 327 G 327
G
Sbjct: 314 G 314
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 9/89 (10%)
Query: 595 STHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWY 654
S QP +YK FD S+ I++ GKG VNG +IGRYW + PSQS Y
Sbjct: 502 SEQQP-AFYKYTFDLAE-SNNTHIDVSGFGKGVVLVNGFNIGRYW------EIGPSQSLY 553
Query: 655 HIPRSFLKPTGNLLVLLEEENGYPPGISI 683
IP++FLK N +++ + E YP I +
Sbjct: 554 -IPKAFLKQGQNEIIVFDSEGKYPESIQL 581
>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
ICPB 10535]
Length = 613
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 149/358 (41%), Gaps = 39/358 (10%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
N G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ
Sbjct: 31 NFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQ 90
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
GQFDFSG D+ F++E AQGL + LR GP+ EW GG P WL I RS +
Sbjct: 91 QGQFDFSGNNDVAAFVREAAAQGLNIILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDP 150
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAA 206
F + Y + N ++ L GGPII Q+ENEYG +H+++ A
Sbjct: 151 RFLAASQAYLDALANQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------AD 199
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAI 254
A+ ++ G + D D + N P PD+P +
Sbjct: 200 NRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRM 259
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
E W ++ +G A A + +G + YM+ GGT+FG A
Sbjct: 260 VGEYWAGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSASLYMFIGGTSFGFMNGANFQ 317
Query: 314 ----------LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 318 NNPSDHYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGVQTPALPAPIATTTL 374
>gi|237734327|ref|ZP_04564808.1| beta-galactosidase [Mollicutes bacterium D7]
gi|365831197|ref|ZP_09372750.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|374624872|ref|ZP_09697289.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
gi|229382557|gb|EEO32648.1| beta-galactosidase [Coprobacillus sp. D7]
gi|365262188|gb|EHM92085.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
gi|373916155|gb|EHQ47903.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
8_2_54BFAA]
Length = 584
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 100/337 (29%), Positives = 159/337 (47%), Gaps = 39/337 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ ING++ + SG++HY R P+ W + K G + V+T V WNLHEP G++DF
Sbjct: 8 KEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG +D+ F+K + L+V LR P+I EW GGLP WL P I R++++ + +
Sbjct: 68 SGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKCL 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+Y +++ + K ++ +Q GPIIL+Q+ENEYG S+ E Y+ ++
Sbjct: 128 DQYFSIL--LPKLSKYQITQNGPIILAQLENEYG----SYGED-KEYLLAVYQMMRKYGI 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D +NA + G Q E F + P +
Sbjct: 181 EVP--LFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESHQITAPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--------RT 308
E W ++ + E R ++ ++ GS VN+YM+ GGTNFG +
Sbjct: 239 EFWDGWFNRWNQEIIKRDPQEFVNSAQEMLS--LGS-VNFYMFQGGTNFGWMNGCSARKE 295
Query: 309 ASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
+T Y A L EYG + K+ L+E+ + K
Sbjct: 296 HDLPQITSYDYDAILTEYG-AKTEKYHLLREVITGKK 331
>gi|432954511|ref|XP_004085513.1| PREDICTED: beta-galactosidase-like [Oryzias latipes]
Length = 653
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 107/330 (32%), Positives = 153/330 (46%), Gaps = 27/330 (8%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
++ Y+ +G R SGSIHY R W + K GL+ +QT + WN HE
Sbjct: 29 SLDYNADCFRKDGQRFRFISGSIHYSRIPRVYWKDRLVKMYMAGLNAIQTYIPWNYHEES 88
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PG ++FSG RD+ F+K Q GL V LR GP+I EW GGLP WL IV RS +
Sbjct: 89 PGMYNFSGDRDVEYFLKLAQDIGLLVILRPGPYICAEWEMGGLPAWLLSKKDIVLRSSDP 148
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
+ + + ++ MMK LY GGPII Q+ENEYG S+ Y+R KL
Sbjct: 149 DYVAAVDTWMGKLLPMMK-PYLY-QNGGPIITVQVENEYG----SYFACDYNYMRHLTKL 202
Query: 209 -------AVDLQT----GVPWVMCKQDDAPDPVINACNGRQCGETFAGPN--SPDKPAIW 255
V L T G+ ++ C ++ G F P P +
Sbjct: 203 FRSHLGEDVVLFTTDGAGLNYLKCGAIQGLYATVDFGPGSNITAAFEAQRHAEPHGPLVN 262
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-----RTAS 310
+E +T + +G + S + +A + +A G+ VN YM+ GGTNFG +
Sbjct: 263 SEFYTGWLDHWGSRHSVVSPDLVAKSLNQQLA--MGANVNMYMFIGGTNFGYWNGANSPY 320
Query: 311 AYVLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+ T Y APL E G L + K+ ++E+
Sbjct: 321 SAQPTSYDYDAPLTEAGDLTE-KYFAIREV 349
>gi|423215069|ref|ZP_17201597.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692332|gb|EIY85570.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
CL03T12C04]
Length = 778
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 174/397 (43%), Gaps = 52/397 (13%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++L + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDF+G+ D+ F K Q G+YV +R GP+
Sbjct: 61 SHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFAGQNDIAAFCKLAQQHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL + R+ + ++M+R + + K A L +GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDVALRTLD---PYYMERVGIFMKEVGKQLAPLQVDKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYGTD-----KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG-------LLRQ----- 331
+ + YM HGGT FG A + + + Y AP+ E G LLR
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTEKFFLLRDLLKNY 350
Query: 332 -PKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
P L E+ +A+ + P + V+ FS L EA
Sbjct: 351 LPAGESLPEVPAALPVIEIPEIHFNKVAPLFSNLPEA 387
>gi|440904150|gb|ELR54700.1| Beta-galactosidase, partial [Bos grunniens mutus]
Length = 659
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 105/330 (31%), Positives = 156/330 (47%), Gaps = 29/330 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HE QP
Sbjct: 39 IDYRRNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVAWNFHELQP 98
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G+++FSG D+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 99 GRYNFSGDHDVEHFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKKSIVLRSSDPD 158
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ M+ L GGPII Q+ENEYG S+L Y+R+ K
Sbjct: 159 YLAAVDKWLGVLLPKMRP--LLYKNGGPIITVQVENEYG----SYLSCDYDYLRFLQKRF 212
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFA-GPN-----------SPDKPAIW 255
D G ++ D + ++ A G F+ G N P P +
Sbjct: 213 HD-HLGEDVLLFTTDGVNERLLQCGALQGLYATVDFSPGTNLTAAFMLQRKFEPTGPLVN 271
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV-- 313
+E +T + +G S++ +A+ + +A G+ VN YM+ GGTNF A +
Sbjct: 272 SEFYTGWLDHWGQRHSTVSSKAVAFTLHDMLA--LGANVNMYMFIGGTNFAYWNGANIPY 329
Query: 314 ---LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E G L + K+ L+++
Sbjct: 330 QPQPTSYDYDAPLSEAGDLTE-KYFALRDI 358
>gi|393782614|ref|ZP_10370797.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
gi|392672841|gb|EIY66307.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
CL02T12C01]
Length = 605
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 102/329 (31%), Positives = 154/329 (46%), Gaps = 36/329 (10%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF-SGRRDLVRFI 104
+ SG IH R + W + I K G + V + WN HE +PG FDF +G ++L +FI
Sbjct: 48 IISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGNKNLEKFI 107
Query: 105 KEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNM 164
+ VQ +G+++ R GP++ GEW +GGLP +L +P I R + + ++RY I +
Sbjct: 108 QTVQDEGMFLLFRPGPYVCGEWDFGGLPPYLLSIPDIKIRCMDTRYTAAVERYVDKIAPI 167
Query: 165 MKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW------ 218
+K + + GGPII+ Q+ENEYG + + Y++W L D VP+
Sbjct: 168 IKKYEI--TNGGPIIMVQVENEYGSYGNDRI-----YMKWMHDLWRDKGIEVPFYTADGA 220
Query: 219 --VMCKQDDAPDPVIN---ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
M + P I A + + E PD +E + + + +E +
Sbjct: 221 TPYMLEAGTLPGVAIGLDPAASKAEFDEALK--VHPDASVFCSELYPGWLTHWREEWQHP 278
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV---------LTGYYDQAPLD 324
S E I V + G NYY+ HGGTNFG A A +T Y AP++
Sbjct: 279 SIEKITTDVKWLLD--NGKSFNYYVIHGGTNFGFWAGANSPQPGTYQPDVTSYDYDAPIN 336
Query: 325 EYGLLRQPKWGHLKEL---HSAVKLCLKP 350
E G PK+ L+EL +S KL P
Sbjct: 337 EMG-QATPKYMALRELTQKYSKKKLAPIP 364
>gi|62859689|ref|NP_001015958.1| galactosidase, beta 1-like precursor [Xenopus (Silurana)
tropicalis]
gi|89271933|emb|CAJ82193.1| galactosidase, beta 1 [Xenopus (Silurana) tropicalis]
Length = 648
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 100/296 (33%), Positives = 137/296 (46%), Gaps = 18/296 (6%)
Query: 48 SGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEV 107
SGSIHY R W + K K GLD + T V WN HE +PG ++FSG D+ F+K
Sbjct: 50 SGSIHYSRVPQYYWKDRLLKMKMAGLDAIYTYVPWNFHETKPGVYNFSGDHDIESFLKLA 109
Query: 108 QAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA 167
GL V LR GP+I EW GGLP WL IV RS + + + + + + MK
Sbjct: 110 NEIGLLVILRAGPYICAEWDMGGLPAWLLAKESIVLRSSDPDYLQAVDNWMGVFLPKMKP 169
Query: 168 ARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAAKLAVDLQT----GVPWVM 220
GGPII Q+ENEYG ++++L R V L T G+ +V
Sbjct: 170 --FLYHNGGPIISVQVENEYGSYFTCDYNYLRHLLQLFRHHLGDEVVLFTTDGSGLQYVR 227
Query: 221 CKQDDAPDPVINACNGRQCGETFAGPN--SPDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
C ++ G ETF+ P P + +E +T + +G+ + + E +
Sbjct: 228 CGTIQGLYTTVDFGPGSNVTETFSVQRYCEPKGPLVNSEFYTGWLDHWGEPHSVVATEMV 287
Query: 279 AYHVALFIAKMKGSYVNYYMYHGGTNFG-----RTASAYVLTGYYDQAPLDEYGLL 329
+ +A G+ VN YM+ GGTNFG T A T Y APL E G L
Sbjct: 288 TKSLDEILA--HGANVNMYMFIGGTNFGYWNGANTPYAPQPTSYDYDAPLSEAGDL 341
>gi|393785841|ref|ZP_10373985.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
CL02T12C05]
gi|392660955|gb|EIY54552.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
CL02T12C05]
Length = 605
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 112/376 (29%), Positives = 167/376 (44%), Gaps = 38/376 (10%)
Query: 1 MGQCQLLCLFGLLLTTIGGS--DGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTP 58
M + L L L L T GG+ G ++ ++ + SG IH R
Sbjct: 1 MKKKLLTFLMALALLTGGGALVQAQTKGTHSFRLGDNQFWLDDKPFQIISGEIHPSRIPA 60
Query: 59 QMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF-SGRRDLVRFIKEVQAQGLYVCLR 117
+ W + I K G + V + WN HE +PG FDF +G +DL +FI+ VQ + +++ R
Sbjct: 61 EYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGNKDLEKFIRTVQEEDMFLLFR 120
Query: 118 IGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGP 177
GP++ GEW +GGLP +L P I R + + ++RYAT I ++K + + GGP
Sbjct: 121 PGPYVCGEWDFGGLPAYLLSTPDIKIRCMDPRYTTAVERYATAIAPIIK--KYEVTNGGP 178
Query: 178 IILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPW--------VMCKQDDAPDP 229
II+ Q+ENEYG + Y++W L D VP+ M + P
Sbjct: 179 IIMVQVENEYGSYGNDRT-----YMKWIHDLWRDKGIEVPFYTADGATPYMLEAGTLPGV 233
Query: 230 VIN---ACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFI 286
I A + + E PD +E + + + + + S E I V +
Sbjct: 234 AIGLDPAASKAEFDEALK--VHPDASVFCSELYPGWLTHWRENWQHPSIEKITTDVKWLL 291
Query: 287 AKMKGSYVNYYMYHGGTNFGRTASAYV---------LTGYYDQAPLDEYGLLRQPKWGHL 337
G NYY+ HGGTNFG A A +T Y AP++E G PK+ L
Sbjct: 292 D--NGKSFNYYVIHGGTNFGFWAGANSPQPGIYQPDVTSYDYDAPINEMG-QATPKYMAL 348
Query: 338 KEL---HSAVKLCLKP 350
+EL +S KL P
Sbjct: 349 RELTQKYSKKKLAPIP 364
>gi|123788298|sp|Q3UPY5.1|GLBL2_MOUSE RecName: Full=Beta-galactosidase-1-like protein 2; Flags: Precursor
gi|74224567|dbj|BAE25259.1| unnamed protein product [Mus musculus]
Length = 636
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 142/317 (44%), Gaps = 29/317 (9%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL FI+
Sbjct: 63 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 122
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL P + R+ F ++ Y + M
Sbjct: 123 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVELYFDHL--MS 180
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L GGPII Q+ENEYG PY++ A + G+ ++ D+
Sbjct: 181 RVVPLQYKHGGPIIAVQVENEYGSYNKD--RAYMPYIKKALE-----DRGIIEMLLTSDN 233
Query: 226 AP-------DPVINACNGRQCGETFAGPN-----SPDKPAIWTENWTSFYQVYGDEARIR 273
D V+ N + E A +P + E WT ++ +G I
Sbjct: 234 KDGLEKGVVDGVLATINLQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHNIL 293
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY-YDQAPLDEYGLLRQ- 331
+ ++ V+ I GS +N YM+HGGTNFG A Y D D +L +
Sbjct: 294 DSSEVLQTVSAIIKD--GSSINLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTEA 351
Query: 332 ----PKWGHLKELHSAV 344
K+ L+EL V
Sbjct: 352 GDYTAKYTKLRELFGTV 368
>gi|311264379|ref|XP_003130137.1| PREDICTED: galactosidase, beta 1-like 2 [Sus scrofa]
Length = 635
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 142/317 (44%), Gaps = 29/317 (9%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GS+HY R W + K K GL+ + T V WNLHEP+ G+FDFSG D+ FI
Sbjct: 62 IFGGSVHYFRVPRAYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDMEAFIL 121
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL + R+ E F + Y + M
Sbjct: 122 LAAEVGLWVILRPGPYICSEIDLGGLPSWLLQDSSMKLRTTYEGFTKAVDLYFDHL--MA 179
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L GGPII Q+ENEYG K P Y+ + K D G+ ++ D+
Sbjct: 180 RVVPLQYKNGGPIIAVQVENEYGSY-----NKDPAYMPYIKKALED--RGIVELLLTSDN 232
Query: 226 AP-------DPVINACNGRQCGE-----TFAGPNSPDKPAIWTENWTSFYQVYGDEARIR 273
D V+ N + E F +P + E WT ++ +G I
Sbjct: 233 EDGLSKGTVDGVLATINLQSQNELRLLHNFLQSVQGVRPKMVMEYWTGWFDSWGGPHHIL 292
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYY-DQAPLDEYGLLRQ- 331
++ V+ I G+ +N YM+HGGTNFG A Y D D +L +
Sbjct: 293 DTSEVLRTVSAIID--AGASINLYMFHGGTNFGFINGAMHFQDYMSDVTSYDYDAVLTEA 350
Query: 332 ----PKWGHLKELHSAV 344
PK+ L+EL ++
Sbjct: 351 GDYTPKYIRLRELFGSI 367
>gi|285018987|ref|YP_003376698.1| beta-galactosidase [Xanthomonas albilineans GPE PC73]
gi|283474205|emb|CBA16706.1| putative beta-galactosidase protein [Xanthomonas albilineans GPE
PC73]
Length = 614
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 151/620 (24%), Positives = 239/620 (38%), Gaps = 103/620 (16%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G NG + SG+IH+ R W + KA+ GL+ V+T VFWNL EP+PGQFD
Sbjct: 36 GDHFTRNGTPYQIISGAIHFQRIPRAYWNDRLQKARAMGLNTVETYVFWNLIEPRPGQFD 95
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ FI AQGL V LR GP++ EW GG P WL PG+ RS + F
Sbjct: 96 FSGNNDIAAFIDAAAAQGLNVILRPGPYVCAEWEAGGYPAWLFAEPGMRVRSQDPRFLAA 155
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVD 211
+ Y + +K RL + GGP+I Q+ENEYG +H+++ A A+
Sbjct: 156 SRAYLDALGAQVK-PRLNGN-GGPVIAVQVENEYGSYNYDHAYMR---------ANRAMY 204
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAIWTENW 259
+Q G + D PD + N GP P +P + E W
Sbjct: 205 VQAGFDKAVLFTADGPDVLANGTLPNTLAVVNFGPGDAKTAFQTLAKFRPGQPQMVGEYW 264
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYD 319
++ +GD+ +A A + +G N YM+ GGT+FG ++ +
Sbjct: 265 AGWFDQWGDKHAATNAAKQASEFEWIL--RQGHSANIYMFVGGTSFG-----FMNGANFQ 317
Query: 320 QAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAFIFQGSSECAAF 379
+ P D Y S +A + + F
Sbjct: 318 KNPTDHY------------------------------APQTTSYDYDAVLDEAGRPTPKF 347
Query: 380 LVNKDKRNNATVYFSNLMYELPPLSISILPDCKTVAFNTAKLDSVEQWEEYKEAIPTYDE 439
+ +D T + P + LPD T +S W+ A T D
Sbjct: 348 ALFRDAIARVTGIQPPALPA--PQHFADLPD-------TPLRESASLWDNLPPAAATTD- 397
Query: 440 TSLRANFLLEQMNTTKDASDYLWYNFRFKHDPSDSESVLKVSSLGHVLHAFINGEFVGSA 499
+ + M A Y+ Y S + +V V +++ GSA
Sbjct: 398 -------IPQPMERYGQAYGYILYRTSVTGPRKGSLYLGEVRDYARV---YVDRTLAGSA 447
Query: 500 HGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAGLRNVSIQGAKELKD 559
+ + V + GT+ + +L G + G +L AGL + + + L
Sbjct: 448 DRRRQQVAVD----VDIPAGTHTLDVLVENNGRINYGTHLPDGRAGLVDPVLLDGQPLTG 503
Query: 560 FSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYGSSTHQPLTWYKTVFDAPTGSDPVAIN 619
+ +F + D S + W+ ++ +++ T +D ++
Sbjct: 504 WQTFP-------------LPMDDASTLHGWT---TAKVDGPAFHRGTLKIATPAD-TFLD 546
Query: 620 LISMGKGEAWVNGQSIGRYW 639
+ + GKG AW NG ++GR+W
Sbjct: 547 MRAFGKGFAWANGHNLGRHW 566
>gi|348172902|ref|ZP_08879796.1| beta-galactosidase [Saccharopolyspora spinosa NRRL 18395]
Length = 633
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 98/316 (31%), Positives = 149/316 (47%), Gaps = 21/316 (6%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+T G +++G + +G +HY R+ P W +A+ + GL+ V T V WN HEP+
Sbjct: 42 LTVRGDQFLLDGEPFRIVAGEMHYFRTHPDHWRDRLARMRALGLNTVDTYVAWNFHEPRR 101
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
G DFS RDLVRF++ GL V +R GP+I EW +GGLP WL P + R D
Sbjct: 102 GAVDFSSWRDLVRFVETAAEVGLKVAVRPGPYICAEWDFGGLPAWLLADPDLPLRCDETA 161
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAK 207
+ + + ++ + + A L A++GGP+I Q+ENEYG + + L+ +R
Sbjct: 162 YPDLVDEWFGVL--LPRLAPLQATRGGPVIAFQVENEYGSYANDQAHLDHLRKTMRDNGI 219
Query: 208 LAVDLQTGVP--WVMCKQDDAPDPVINACNGRQCGETFAGPN--SPDKPAIWTENWTSFY 263
++ + P W M + + PD + G E FA P+ P TE W ++
Sbjct: 220 DSLLYCSNGPSEW-MLRGGNLPDVLATVNFGGDPTEPFAALRRYQPEGPLWCTEFWDGWF 278
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY----------V 313
+G+ + A V +A + V+ YM G TNFG A A
Sbjct: 279 DHWGEPHHTTDPVETAADVEKILAAK--ASVSLYMAVGSTNFGWWAGANFDEANGTYQPT 336
Query: 314 LTGYYDQAPLDEYGLL 329
+T Y AP+ E G L
Sbjct: 337 ITSYDYDAPIGEAGEL 352
>gi|220914306|ref|YP_002489615.1| beta-galactosidase [Arthrobacter chlorophenolicus A6]
gi|219861184|gb|ACL41526.1| Beta-galactosidase [Arthrobacter chlorophenolicus A6]
Length = 586
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/312 (32%), Positives = 147/312 (47%), Gaps = 30/312 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
R +++G + SG+IHY R P +W I KA+ GL+ ++T V WN H PG F
Sbjct: 9 RDFLLDGEPFRILSGAIHYFRVHPDLWADRIRKARLMGLNTIETYVPWNEHSSTPGAFRT 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+ V A+G+ +R GP+I EW GGLP WL P I RS + +
Sbjct: 69 DGGLDLGRFLDLVAAEGMQGIVRPGPYICAEWDNGGLPAWLFTDPSIGVRSSEPGYLAAV 128
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDL 212
+ ++ ++ ++ ++GGP+IL QIENEYG + ++L+ V A + V+
Sbjct: 129 DGFMDRLLPIVVERQI--TRGGPVILFQIENEYGAYGSDKAYLQH---LVDTATRAGVE- 182
Query: 213 QTGVPWVMCKQ------DDAPDPVINACN--GRQCGE--TFAGPNSPDKPAIWTENWTSF 262
VP C Q +D P ++ G + E F PD P + E W +
Sbjct: 183 ---VPLFTCDQPFETMIEDGSLPGLHKTGTFGSRADERLAFLRERQPDGPLMCAEFWNGW 239
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLT 315
+ +G + + A L G+ VN YM+HGGTNFG T A +T
Sbjct: 240 FDNWG--THHHTTDAAASAAELDALLAAGASVNIYMFHGGTNFGFTNGANDKGIYEPTIT 297
Query: 316 GYYDQAPLDEYG 327
Y APL E G
Sbjct: 298 SYDYDAPLSEDG 309
>gi|423301385|ref|ZP_17279409.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
gi|408471986|gb|EKJ90515.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
CL09T03C10]
Length = 779
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 166/374 (44%), Gaps = 48/374 (12%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G N DG+ ++ + +HY R W I K G++ + +FWN+
Sbjct: 32 AGKNTFLLDGKPFVVK-------AAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNI 84
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HE + G+FDF+G+ D+ F + Q G+YV +R GP++ EW GGLP+WL I R
Sbjct: 85 HEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALR 144
Query: 145 SDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
+ + ++M+R + + K A L ++GG II+ Q+ENEYG + PYV
Sbjct: 145 TLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSYGIN-----KPYVS 196
Query: 204 WAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINACN---GRQCGETFAGPNS--PDKP 252
L + T VP C ++A D +I N G + F P+ P
Sbjct: 197 AVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETP 256
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR----- 307
+ +E W+ ++ +G + R A+D+ + + + + YM HGGT FG
Sbjct: 257 LMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD--RNISFSLYMTHGGTTFGHWGGAN 314
Query: 308 -TASAYVLTGYYDQAPLDEYG-------LLRQ------PKWGHLKELHSAVKLCLKPMLS 353
A + + + Y AP+ E G LLR P L E+ +A+ + P
Sbjct: 315 NPAYSAMCSSYDYDAPISEAGWTTEKYFLLRDLLKNYLPAGAALPEVPAALPVMEIPEFH 374
Query: 354 GVLVSMNFSKLQEA 367
V+ FS L EA
Sbjct: 375 FTKVAPLFSNLPEA 388
>gi|423346501|ref|ZP_17324189.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
gi|409219652|gb|EKN12612.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
CL03T12C32]
Length = 780
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/325 (29%), Positives = 154/325 (47%), Gaps = 33/325 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ +++G ++ + IHY R + W I K G++ + FWN+HE +PG+FDF
Sbjct: 39 TFLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFK 98
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G+ D+ F + Q +G+Y+ LR GP++ EW GGLP+WL I R+++ F K
Sbjct: 99 GQNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTK 158
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG--MVEHSFLEKGPPYVRWAAKLAVDLQ 213
+ I + A L ++GG II+ Q+ENEYG + +++ +R A K A
Sbjct: 159 LFMNEIGKQL--ADLQVTRGGNIIMVQVENEYGAYATDKAYIAN----IRDAVKAAG--F 210
Query: 214 TGVPWVMCK-----QDDAPDPV---INACNGRQCGETFAGPNS--PDKPAIWTENWTSFY 263
T VP C Q + D + IN G F PD P + +E W+ ++
Sbjct: 211 TDVPLFQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWF 270
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNY--YMYHGGTNFGR------TASAYVLT 315
+G + R A + I M ++++ YM HGGT FG A + + +
Sbjct: 271 DHWGRKHETRDAGVMVSG----IKDMLDRHISFSLYMAHGGTTFGHWGGANSPAYSAMCS 326
Query: 316 GYYDQAPLDEYGLLRQPKWGHLKEL 340
Y AP+ E G PK+ L+EL
Sbjct: 327 SYDYDAPISEAGWA-TPKYYKLREL 350
Score = 43.5 bits (101), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 86/199 (43%), Gaps = 27/199 (13%)
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
+ L + + F +G+ +G + + + L L GT L+ M +
Sbjct: 423 TTLLIDEVHDWAQVFADGKLLGRLDRRRGESTVVLPA---LAAGTRLDILVEAMGRVNFD 479
Query: 536 GAYLERR--VAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
A +R+ + +S G +EL+D+ +S+ + +K Y + G
Sbjct: 480 VAIHDRKGITDKVELISDTGRQELEDWQVYSFPVDYAFVQDK-----KYAA--------G 526
Query: 594 SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSW 653
P +Y+T F+ D V +++ + GKG WVNG+++GR+W + PQ T
Sbjct: 527 DKLDGP-AYYRTTFELDEVGD-VFLDMQTWGKGMVWVNGKAMGRFWE--IGPQQT----- 577
Query: 654 YHIPRSFLKPTGNLLVLLE 672
+P +LK N +++L+
Sbjct: 578 LFMPGCWLKKGKNEIIILD 596
>gi|313202559|ref|YP_004041216.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
gi|312441875|gb|ADQ78231.1| glycoside hydrolase family 35 [Paludibacter propionicigenes WB4]
Length = 786
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/359 (28%), Positives = 158/359 (44%), Gaps = 36/359 (10%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
++NG I+ +G +HY R W I K G++ + +FWN+HE PG FDF
Sbjct: 39 EFMLNGKPYIIRAGELHYTRIPKAYWDHRIKMCKAMGMNTICIYLFWNIHEQTPGVFDFK 98
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP-FKFHM 154
G+ D+ F++ +Q G+Y +R GP++ EW GGLP+WL + RS ++ F
Sbjct: 99 GQNDVAEFVRLIQQNGMYCIVRPGPYVCAEWDMGGLPWWLLKKKDLQVRSLSDSYFMEQT 158
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVDL 212
K+Y + A L GG II+ Q+ENEYG + ++E VR A V L
Sbjct: 159 KKYLNEAGKQL--APLQIQNGGNIIMVQVENEYGTWGSDSKYMETMRNNVRQAGFGKVQL 216
Query: 213 QTGVPWVMCKQDDAPDPVINACN---GRQCGETFA--GPNSPDKPAIWTENWTSFYQVYG 267
W D +NA N G + F +PD P + E WT ++ +G
Sbjct: 217 -LRCDWSSNFFHYKLDGAVNALNFGAGSNIDDQFKKFKEMNPDSPLMCGEYWTGWFDQWG 275
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSY-----VNYYMYHGGTNFGRTASAYV-----LTGY 317
R + FI +K + YM HGGT++G+ A A T
Sbjct: 276 RPHETR-------EINSFIGSLKDMMDKRISFSLYMAHGGTSYGQWAGANAPAYAPTTSS 328
Query: 318 YD-QAPLDEYG-------LLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEAF 368
YD AP+DE G +R +L+E S + P ++ + ++ F++ F
Sbjct: 329 YDYNAPIDEAGNPTDKFYAIRDLLKNYLQEGESLPAIPQNPEITITIPTIKFTQTANVF 387
>gi|154490061|ref|ZP_02030322.1| hypothetical protein PARMER_00290 [Parabacteroides merdae ATCC
43184]
gi|423723056|ref|ZP_17697209.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
gi|154089210|gb|EDN88254.1| glycosyl hydrolase family 35 [Parabacteroides merdae ATCC 43184]
gi|409241481|gb|EKN34249.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
CL09T00C40]
Length = 780
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/355 (29%), Positives = 165/355 (46%), Gaps = 36/355 (10%)
Query: 6 LLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLI 65
+ C LLL+ G G ++ + + +++G ++ + IHY R + W I
Sbjct: 12 ITCCVILLLS---GCSPRQGEKHDFSIGKGTFLLDGKPFVIKAAEIHYTRIPAEYWQHRI 68
Query: 66 AKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGE 125
K G++ + FWN+HE +PG+FDF G+ D+ F + Q +G+Y+ LR GP++ E
Sbjct: 69 QMCKALGMNTICIYAFWNIHEQKPGEFDFKGQNDIAAFCRLAQKEGMYIMLRPGPYVCSE 128
Query: 126 WGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIEN 185
W GGLP+WL I R+++ F K + I + A L ++GG II+ Q+EN
Sbjct: 129 WEMGGLPWWLLKKEDIKLRTNDPYFLERTKLFMNEIGKQL--ADLQVTRGGNIIMVQVEN 186
Query: 186 EYG--MVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCK-----QDDAPDPV---INACN 235
EYG + +++ +R A K A T VP C Q + D + IN
Sbjct: 187 EYGAYATDKAYIAN----IRDAVKAAG--FTDVPLFQCDWSSTFQLNGLDDLVWTINFGT 240
Query: 236 GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSY 293
G F PD P + +E W+ ++ +G + R A + I M +
Sbjct: 241 GANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGRKHETRDAGVMVSG----IKDMLDRH 296
Query: 294 VNY--YMYHGGTNFGR------TASAYVLTGYYDQAPLDEYGLLRQPKWGHLKEL 340
+++ YM HGGT FG A + + + Y AP+ E G PK+ L+EL
Sbjct: 297 ISFSLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPISEAGWA-TPKYYKLREL 350
Score = 43.5 bits (101), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 86/199 (43%), Gaps = 27/199 (13%)
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
+ L + + F +G+ +G + + + L L GT L+ M +
Sbjct: 423 TTLLIDEVHDWAQVFADGKLLGRLDRRRGENTVVLPA---LAAGTRLDILVEAMGRVNFD 479
Query: 536 GAYLERR--VAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
A +R+ + +S G +EL+D+ +S+ + +K Y + G
Sbjct: 480 VAIHDRKGITDKVELISDTGRQELEDWQVYSFPVDYAFVQDK-----KYAA--------G 526
Query: 594 SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSW 653
P +Y+T F+ D V +++ + GKG WVNG+++GR+W + PQ T
Sbjct: 527 DKLDGP-AYYRTTFELDEVGD-VFLDMQTWGKGMVWVNGKAMGRFWE--IGPQQT----- 577
Query: 654 YHIPRSFLKPTGNLLVLLE 672
+P +LK N +++L+
Sbjct: 578 LFMPGCWLKKGKNEIIILD 596
>gi|298204831|emb|CBI25664.3| unnamed protein product [Vitis vinifera]
Length = 118
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 55/111 (49%), Positives = 80/111 (72%)
Query: 60 MWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIG 119
MW L+ AKEGG+DV++T VFWN HE PG + F G DL++F+K VQ G+Y+ LR G
Sbjct: 1 MWSGLVKTAKEGGIDVIETYVFWNGHELSPGNYYFGGWYDLLKFVKIVQQDGMYLILRFG 60
Query: 120 PFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARL 170
PF+ EW + G+ WLH +PG VF +++EPF +HM+++ T++VN+MK +L
Sbjct: 61 PFVVAEWNFSGVLVWLHYMPGTVFWTNSEPFNYHMQKFMTLVVNIMKKEKL 111
>gi|256424388|ref|YP_003125041.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256039296|gb|ACU62840.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 586
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 94/306 (30%), Positives = 148/306 (48%), Gaps = 17/306 (5%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ +++ + SG +H R + W I AK G + + VFWN HE + G+FDF
Sbjct: 17 KDFLLDSKPYQIISGEMHPARIPKEYWRHRIQMAKAMGCNTIAAYVFWNYHEQEEGKFDF 76
Query: 95 -SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
S RD+V FIK VQ +G++V LR GP++ EW +GGLP +L +P I R + +
Sbjct: 77 TSENRDIVAFIKMVQEEGMWVMLRPGPYVCAEWEFGGLPPYLLRIPDIKVRCMDPRYIAA 136
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHS---FLEKGPPYVRWAAKLAV 210
+RY + +K ++ + GGPI++ Q+ENEYG + L+ +V+ +
Sbjct: 137 TERYIKALSEEVKPLQI--TNGGPIVMVQVENEYGSFGNDREYMLKVKDMWVQNGINVPF 194
Query: 211 DLQTGVPWVMCKQDDAPDPVINACNGRQCGE-TFAGPNSPDKPAIWTENWTSFYQVYGDE 269
G + + P I +G G+ A +PD P+ +E++ + +G++
Sbjct: 195 YTADGPVSALLEAGSVPGAAIGLDSGSSEGDFAAAEKQNPDVPSFSSESYPGWLTHWGEK 254
Query: 270 ARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--------LTGYYDQA 321
I V F+ K S+ N Y+ HGGTNFG TA A LT Y A
Sbjct: 255 WARPDKAGIVKEVK-FLMDTKRSF-NLYVIHGGTNFGFTAGANSGGKGYEPDLTSYDYDA 312
Query: 322 PLDEYG 327
P++E G
Sbjct: 313 PINEQG 318
>gi|255692586|ref|ZP_05416261.1| beta-galactosidase [Bacteroides finegoldii DSM 17565]
gi|260621643|gb|EEX44514.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
Length = 779
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 166/374 (44%), Gaps = 48/374 (12%)
Query: 25 GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNL 84
G N DG+ ++ + +HY R W I K G++ + +FWN+
Sbjct: 32 AGKNTFLLDGKPFVVK-------AAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNI 84
Query: 85 HEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFR 144
HE + G+FDF+G+ D+ F + Q G+YV +R GP++ EW GGLP+WL I R
Sbjct: 85 HEQEEGKFDFTGQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKRDIALR 144
Query: 145 SDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVR 203
+ + ++M+R + + K A L ++GG II+ Q+ENEYG + PYV
Sbjct: 145 TLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQVENEYGSYGIN-----KPYVS 196
Query: 204 WAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINACN---GRQCGETFAGPNS--PDKP 252
L + T VP C ++A D +I N G + F P+ P
Sbjct: 197 AVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETP 256
Query: 253 AIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR----- 307
+ +E W+ ++ +G + R A+D+ + + + + YM HGGT FG
Sbjct: 257 LMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD--RNISFSLYMTHGGTTFGHWGGAN 314
Query: 308 -TASAYVLTGYYDQAPLDEYG-------LLRQ------PKWGHLKELHSAVKLCLKPMLS 353
A + + + Y AP+ E G LLR P L E+ +A+ + P
Sbjct: 315 NPAYSAMCSSYDYDAPISEAGWTTEKYFLLRDLLKNYLPAGAALPEVPAALPVIEIPEFH 374
Query: 354 GVLVSMNFSKLQEA 367
V+ FS L EA
Sbjct: 375 FTKVAPLFSNLPEA 388
>gi|24418925|ref|NP_722498.1| beta-galactosidase-1-like protein 2 [Mus musculus]
gi|23512349|gb|AAH38479.1| Galactosidase, beta 1-like 2 [Mus musculus]
gi|148693361|gb|EDL25308.1| cDNA sequence BC038479, isoform CRA_b [Mus musculus]
Length = 652
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 101/317 (31%), Positives = 141/317 (44%), Gaps = 29/317 (9%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL FI+
Sbjct: 79 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 138
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL P + R+ F + Y + M
Sbjct: 139 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYHGFTKAVDLYFDHL--MS 196
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQDD 225
+ L GGPII Q+ENEYG PY++ A + G+ ++ D+
Sbjct: 197 RVVPLQYKHGGPIIAVQVENEYGSYNKD--RAYMPYIKKALE-----DRGIIEMLLTSDN 249
Query: 226 AP-------DPVINACNGRQCGETFAGPN-----SPDKPAIWTENWTSFYQVYGDEARIR 273
D V+ N + E A +P + E WT ++ +G I
Sbjct: 250 KDGLEKGVVDGVLATINLQSQQELMALNTVLLSIQGIQPKMVMEYWTGWFDSWGGSHNIL 309
Query: 274 SAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY-YDQAPLDEYGLLRQ- 331
+ ++ V+ I GS +N YM+HGGTNFG A Y D D +L +
Sbjct: 310 DSSEVLQTVSAIIKD--GSSINLYMFHGGTNFGFINGAMHFNDYKADVTSYDYDAILTEA 367
Query: 332 ----PKWGHLKELHSAV 344
K+ L+EL V
Sbjct: 368 GDYTAKYTKLRELFGTV 384
>gi|16611713|gb|AAL27306.1|AF376481_1 BgaC [Carnobacterium maltaromaticum]
Length = 586
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 99/309 (32%), Positives = 142/309 (45%), Gaps = 19/309 (6%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG+IHY R P+ W + K G + V+T V WN HEP+ GQ+ FS DL RFI+
Sbjct: 19 IISGAIHYFRVVPEYWEHRLKLLKNMGCNTVETYVAWNQHEPKKGQYVFSDALDLRRFIQ 78
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
+ GL V LR P+I E+ +GGLP WL + RS PF ++ Y +
Sbjct: 79 LADSLGLKVILRPSPYICAEFEFGGLPAWLLKDRHMRVRSTYPPFMERVRLYYRELFK-- 136
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPW-VMCK 222
+ L + GGPIIL Q+ENEYG E +L++ ++ + + PW M +
Sbjct: 137 EVIDLQITSGGPIILMQVENEYGGYGSEKKYLQELVTMMKENGVTVPLVTSDGPWGDMLE 196
Query: 223 QDDAPDPVINACN-GRQCGETF---AGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
+ + N G E F A P + E W ++ + D+ D+
Sbjct: 197 NGSLQESALPTVNCGSAIPEHFDRLAAFKQKKGPLMVMEYWIGWFDAWQDKK--HHTTDV 254
Query: 279 AYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVL-------TGYYDQAPLDEYGLLRQ 331
V +K VN+YM+HGGTNFG A T Y APL+EYG +
Sbjct: 255 KSSVESLEEILKRGSVNFYMFHGGTNFGFMNGANYYGKLLPDTTSYDYDAPLNEYG-EQT 313
Query: 332 PKWGHLKEL 340
K+ KE+
Sbjct: 314 EKYKAFKEV 322
>gi|257870316|ref|ZP_05649969.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
gi|257804480|gb|EEV33302.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
Length = 593
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 102/320 (31%), Positives = 149/320 (46%), Gaps = 41/320 (12%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG L SG+IHY R P W + K G + V+T V WNLHEP G F F
Sbjct: 8 EEFLMNGSPFKLLSGAIHYFRVHPDDWEHSLYNLKALGFNTVETYVPWNLHEPHKGLFQF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL RF+ Q GLYV LR P+I EW +GGLP WL G + R+ + + H+
Sbjct: 68 EGILDLERFLSLAQELGLYVILRPSPYICAEWEFGGLPAWLLKESGRL-RACDPSYLAHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
Y +++ + +L S GG I++ Q+ENEYG S+ E+ Y+R ++ ++
Sbjct: 127 AEYYDVLLPKIIPYQL--SHGGNILMIQVENEYG----SYGEE-KAYLRAIKEMLINRGI 179
Query: 215 GVPWVMCKQDDAP-------------DPVINACNGRQCGETFAG------PNSPDKPAIW 255
+P D P D ++ G + E FA ++ P +
Sbjct: 180 DMPLFTS---DGPWQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGR----TASA 311
E W ++ + + R +D+A V ++ VN YM+HGGTNFG +A
Sbjct: 237 MEFWDGWFNRWNEPIIRRDPDDLAESVK---EALEIGSVNLYMFHGGTNFGFMNGCSARG 293
Query: 312 YV----LTGYYDQAPLDEYG 327
V +T Y APLDE G
Sbjct: 294 AVDLPQVTSYDYDAPLDEQG 313
>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
Length = 651
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 130/288 (45%), Gaps = 29/288 (10%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ GQFD
Sbjct: 74 GTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 133
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F++E AQGL V LR GP+ EW GG P WL I RS + F
Sbjct: 134 FSGNNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAA 193
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+ Y + ++ L GGPII Q+ENEYG +H+++ A A+
Sbjct: 194 SQAYLDAVAKQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------ADNRAMY 242
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAIWTENW 259
++ G + D D + N P PD+P + E W
Sbjct: 243 VKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYW 302
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMK-GSYVNYYMYHGGTNFG 306
++ +G + +A D F ++ G N YM+ GGT+FG
Sbjct: 303 AGWFDHWG---KPHAATDATQQAEEFEWILRQGHSANLYMFIGGTSFG 347
>gi|78048770|ref|YP_364945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78037200|emb|CAJ24945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
85-10]
Length = 650
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 104/353 (29%), Positives = 147/353 (41%), Gaps = 39/353 (11%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ GQFD
Sbjct: 73 GTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 132
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F++E AQGL V LR GP+ EW GG P WL I RS + F
Sbjct: 133 FSGNNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAA 192
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+ Y + ++ L GGPII Q+ENEYG +H+++ A A+
Sbjct: 193 SQSYLDALAKQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------ADNRAMY 241
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAIWTENW 259
++ G + D D + N P PD+P + E W
Sbjct: 242 VKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYW 301
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
++ +G A A + +G N YM+ GGT+FG A
Sbjct: 302 AGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFGFMNGANFQNNPSD 359
Query: 314 -----LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNF 361
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 360 HYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGVQPPALPAPIATATL 411
>gi|357132771|ref|XP_003568002.1| PREDICTED: beta-galactosidase 8-like [Brachypodium distachyon]
Length = 674
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 167/375 (44%), Gaps = 66/375 (17%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG +G + +G R + G +HY R P+ W + +AK GL+ VQT V WN
Sbjct: 27 GGASRRFWIEGDAFRKDGERFQIVGGDVHYFRIVPEYWKDRLLRAKALGLNTVQTYVPWN 86
Query: 84 LHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDV-PGIV 142
LHEP+P ++F+G D+ +++ + V LR+GP+I GEW GG P WL + P +
Sbjct: 87 LHEPEPQSWEFNGFADIESYLRLAHELEMLVMLRVGPYICGEWDLGGFPPWLLTIEPALK 146
Query: 143 FRSDNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG-------------M 189
RS + + ++R+ ++ + K A L S GGPII+ QIENE+G +
Sbjct: 147 LRSSDSAYLSLVERWWKVL--LPKVAPLLYSNGGPIIMVQIENEFGSFGDDKNYLHYLVL 204
Query: 190 VEHSFLEKGPPYVRWAAK------------------LAVDLQTGVPWVMCKQDDAPDPVI 231
+ +L G + + AVD TG D P P+
Sbjct: 205 LARRYL--GNDIILYTTDGGTIGTLKNGSIHQDDVFAAVDFSTG---------DDPWPIF 253
Query: 232 NACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKG 291
Q F G ++P + E +T + +G+ A A + + + G
Sbjct: 254 RL----QKEYNFPGKSAP----LTAEFYTGWLTHWGESIATTDASSTAKALKSILCR-NG 304
Query: 292 SYVNYYMYHGGTNF--------GRTASAYV--LTGYYDQAPLDEYGLLRQPKWGHLKE-L 340
S V YM HGGTNF G+ SAY LT Y AP+ E+G + PK+ L+ +
Sbjct: 305 SAV-LYMAHGGTNFGFYNGANTGQNESAYKADLTSYDYDAPIKEHGDVHNPKYKALRSVI 363
Query: 341 HSAVKLCLKPMLSGV 355
H L P+ + +
Sbjct: 364 HECTGTPLHPLPANI 378
>gi|419799561|ref|ZP_14324899.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
gi|385697826|gb|EIG28233.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
Length = 595
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 133/279 (47%), Gaps = 17/279 (6%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ G + SG+IHY R P W + K G + V+T V WN+HEP+ GQFDFSGR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL RFI+ Q+ GLY+ +R PFI EW +GGLP WL + + RS + F + RY
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDMRIRSSDPVFIEAVDRYY 130
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
++ ++ R QGGPI++ Q+ENEYG + ++L ++ +
Sbjct: 131 DHLLGLL--TRYQVDQGGPILMMQVENEYGSYGEDKAYLRAIRDLMKEKGVTCPLFTSDG 188
Query: 217 PWVMCKQDD---APDPVINACNGRQCG------ETFAGPNSPDKPAIWTENWTSFYQVYG 267
PW + D + G + + F P + E W ++ +
Sbjct: 189 PWRATLRAGNLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 249 EPVIQREPEELAEAVH---EVLELGSINLYMFHGGTNFG 284
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGY 677
+++ GKG A+VNG ++GR+W + P+ S Y +P FLK N L++ E E Y
Sbjct: 523 LDMTGFGKGVAFVNGHNLGRFW------EVGPTTSLY-VPHGFLKEGANSLIVFETEGRY 575
Query: 678 PPGISI 683
+ +
Sbjct: 576 QETLQL 581
>gi|332672111|ref|YP_004455119.1| beta-galactosidase [Cellulomonas fimi ATCC 484]
gi|332341149|gb|AEE47732.1| Beta-galactosidase [Cellulomonas fimi ATCC 484]
Length = 583
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 138/300 (46%), Gaps = 29/300 (9%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ SG++HY R P W + +A+E GL+ ++T + WN H P G+F G DL RF+
Sbjct: 20 ILSGALHYFRHHPDQWRDRLTRARELGLNTIETYIPWNAHSPARGEFRTDGILDLGRFLD 79
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
EV AQG++ +R GP+I EW GGLP WL G R + ++ Y + ++
Sbjct: 80 EVAAQGMWAIVRPGPYICAEWTGGGLPGWLF-TAGAAVRRHEPTYLAAIQDYYEAVAGIV 138
Query: 166 KAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVD---------LQTGV 216
++ +GGP++L Q+ENEYG Y+R KL + +
Sbjct: 139 APRQV--DRGGPVVLVQVENEYGAYGDD-----KDYLRALVKLLRESGITTPLTTIDQPE 191
Query: 217 PWVMCKQDDAPDPVINACNGRQCGETFAG--PNSPDKPAIWTENWTSFYQVYGDEARIRS 274
PW M + P+ G + E A + P P + E W ++ +G
Sbjct: 192 PW-MLENGSLPELHKTGSFGSRAAERLATLREHQPTGPLMCAEFWDGWFDSWGLHHHTTD 250
Query: 275 AEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYYDQAPLDEYG 327
A A+ + +A G+ VN YM GGTNFG T A ++T Y APLDE G
Sbjct: 251 AAASAHELDTLLA--AGASVNLYMVCGGTNFGFTNGANDKGTYVPIVTSYDYDAPLDEAG 308
>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 613
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 130/288 (45%), Gaps = 29/288 (10%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + +G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ GQFD
Sbjct: 36 GTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 95
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F++E AQGL V LR GP+ EW GG P WL I RS + F
Sbjct: 96 FSGNNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAA 155
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+ Y + ++ L GGPII Q+ENEYG +H+++ A A+
Sbjct: 156 SQAYLDAVAKQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------ADNRAMY 204
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAIWTENW 259
++ G + D D + N P PD+P + E W
Sbjct: 205 VKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYW 264
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMK-GSYVNYYMYHGGTNFG 306
++ +G + +A D F ++ G N YM+ GGT+FG
Sbjct: 265 AGWFDHWG---KPHAATDATQQAEEFEWILRQGHSANLYMFIGGTSFG 309
>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
Length = 588
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 142/302 (47%), Gaps = 29/302 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++G + SG+IHY R P+ W + K K G + V+T + WN+HEP+ G+F F
Sbjct: 16 NFYLDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFE 75
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+ RF+K Q GLYV LR P+I EW +GGLP WL G+ R PF H++
Sbjct: 76 GMLDIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQ 135
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y +++ + ++ + GGP+IL Q+ENEYG + + + +Q G
Sbjct: 136 DYYDVLLKKIVPYQI--NYGGPVILMQVENEYGYYAND--------REYLLAMRDKMQKG 185
Query: 216 VPWVMCKQDDAP-DPVINACN----------GRQCGETFA--GPNSPDKPAIWTENWTSF 262
V D P + +N + G + E F + P + TE W +
Sbjct: 186 GVVVPLVTSDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFWVGW 245
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAP 322
+ +G+ + ++ V ++ +VN YM+ GGTNFG + YYD+
Sbjct: 246 FDHWGNGGHMTG--NLEESVKDLDKMLELGHVNIYMFEGGTNFGFMNGS----NYYDELT 299
Query: 323 LD 324
D
Sbjct: 300 PD 301
>gi|167755577|ref|ZP_02427704.1| hypothetical protein CLORAM_01091 [Clostridium ramosum DSM 1402]
gi|167704516|gb|EDS19095.1| glycosyl hydrolase family 35 [Clostridium ramosum DSM 1402]
Length = 584
Score = 133 bits (334), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 100/337 (29%), Positives = 158/337 (46%), Gaps = 39/337 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ ING++ + SG++HY R P+ W + K G + V+T V WNLHEP G++DF
Sbjct: 8 KEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG +D+ F+K + L+V LR P+I EW GGLP WL P I R++++ + +
Sbjct: 68 SGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKCL 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+Y +++ + K ++ +Q GPIIL+Q+ENEYG S+ E Y+ ++
Sbjct: 128 DQYFSIL--LPKLSKYQITQNGPIILAQLENEYG----SYGED-KEYLLAVYQMMRKYGI 180
Query: 215 GVPWVMCKQDDAPDPVINACN------------GRQCGET------FAGPNSPDKPAIWT 256
VP + D +NA + G Q E F P +
Sbjct: 181 EVP--LFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKFMESYQITAPLMCM 238
Query: 257 ENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--------RT 308
E W ++ + E R ++ ++ GS VN+YM+ GGTNFG +
Sbjct: 239 EFWDGWFNRWNQEIIKRDPQEFVNSAQEMLS--LGS-VNFYMFQGGTNFGWMNGCSARKE 295
Query: 309 ASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVK 345
+T Y A L EYG + K+ L+E+ + K
Sbjct: 296 HDLPQITSYDYDAILTEYG-AKTEKYHLLREVITGKK 331
>gi|193690496|ref|XP_001952133.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 635
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 158/325 (48%), Gaps = 36/325 (11%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ + +G SGS+HY R W I K K GL+ + T V W+LHEP P
Sbjct: 27 IDYENNEFLKDGKVFRYVSGSLHYFRIPQLYWKDRIQKMKAAGLNTITTYVEWSLHEPFP 86
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDV-PGIVFRSDNE 148
G +DF G DL FI+ ++ + +Y+ LR GP+I E +GG P+WL +V P R++N
Sbjct: 87 GVYDFEGIADLEYFIELIKNENMYLILRPGPYICAERDFGGFPYWLLNVTPKRSLRTNNS 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
+K ++ ++ ++++ +++ LY + GG IIL Q+ENEYG S+ Y W L
Sbjct: 147 SYKKYVSKWFSVLMPIIQ-PHLYGN-GGNIILVQVENEYG----SYYACDSEYKLWIRDL 200
Query: 209 --AVDLQTGVPWVM--CKQ---DDAPDPVINAC-------NGRQCGETFAGPNSPDKPAI 254
+ V + + C Q D P + A N QC + F P +
Sbjct: 201 FRSYVENKAVLFTIDGCGQSYFDCGVIPEVYATVDFGISSNASQCFD-FMRKVQKGGPLV 259
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E + + + + I + D+ + + +A M S+ ++YM+HGGTNFG T+ A
Sbjct: 260 NSEFYPGWLTHWQESESIVNTTDVVKQMKVMLA-MNASF-SFYMFHGGTNFGFTSGANTN 317
Query: 314 -----------LTGYYDQAPLDEYG 327
LT Y APLDE G
Sbjct: 318 DTKESIGYLPQLTSYDYNAPLDEAG 342
>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
Length = 581
Score = 133 bits (334), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 142/302 (47%), Gaps = 29/302 (9%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ ++G + SG+IHY R P+ W + K K G + V+T + WN+HEP+ G+F F
Sbjct: 9 NFYLDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFE 68
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+ RF+K Q GLYV LR P+I EW +GGLP WL G+ R PF H++
Sbjct: 69 GMLDIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQ 128
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQTG 215
Y +++ + ++ + GGP+IL Q+ENEYG + + + +Q G
Sbjct: 129 DYYDVLLKKIVPYQI--NYGGPVILMQVENEYGYYAND--------REYLLAMRDKMQKG 178
Query: 216 VPWVMCKQDDAP-DPVINACN----------GRQCGETFA--GPNSPDKPAIWTENWTSF 262
V D P + +N + G + E F + P + TE W +
Sbjct: 179 GVVVPLVTSDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFWVGW 238
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGYYDQAP 322
+ +G+ + ++ V ++ +VN YM+ GGTNFG + YYD+
Sbjct: 239 FDHWGNGGHMTG--NLEESVKDLDKMLELGHVNIYMFEGGTNFGFMNGS----NYYDELT 292
Query: 323 LD 324
D
Sbjct: 293 PD 294
>gi|62321383|dbj|BAD94714.1| beta-galactosidase [Arabidopsis thaliana]
Length = 199
Score = 132 bits (333), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 76/196 (38%), Positives = 117/196 (59%), Gaps = 26/196 (13%)
Query: 512 KMVHLINGTNNVSLLSVMVGLPDSGAYLERRVAG-LRNVSIQGAKE-LKDFSSFSWGYQV 569
+ + L G N ++LLSV VGLP+ G + E+ G L V+++G D S + W Y++
Sbjct: 2 QKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGALGPVTLKGVNSGTWDMSKWKWSYKI 61
Query: 570 GLLGEKLQIFTDYGSRIVPWSRYGS--STHQPLTWYKTVFDAPTGSDPVAINLISMGKGE 627
G+ GE L + T+ S V W++ GS + QPLTWYK+ F P G++P+A+++ +MGKG+
Sbjct: 62 GVKGEALSLHTNTESSGVRWTQ-GSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQ 120
Query: 628 AWVNGQSIGRYWVSF--------------------LTPQGTPSQSWYHIPRSFLKPTGNL 667
W+NG++IGR+W ++ L+ G SQ WYH+PRS+LK + NL
Sbjct: 121 VWINGRNIGRHWPAYKAQGSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNL 179
Query: 668 LVLLEEENGYPPGISI 683
+V+ EE G P GIS+
Sbjct: 180 IVVFEELGGDPNGISL 195
>gi|295086466|emb|CBK67989.1| Beta-galactosidase [Bacteroides xylanisolvens XB1A]
Length = 778
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 154/344 (44%), Gaps = 39/344 (11%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++L + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVILFSSAQAQTTAHKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDFSG+ D+ F K Q G+YV +R GP+
Sbjct: 61 SHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCKLAQQHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL + R+ + ++M+R + + K A L +GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDVALRTLD---PYYMERVGIFMKEVGKQLAPLQVDKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYGTD-----KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG 327
+ + YM HGGT FG A + + + Y AP+ E G
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
Score = 40.0 bits (92), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 45/199 (22%), Positives = 90/199 (45%), Gaps = 26/199 (13%)
Query: 476 SVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPDS 535
+VLK++ + + G+ + + + + TL L GT L+ M +
Sbjct: 420 TVLKITEVHDWAQIYAGGKLLARLDRRKGEFTTTLPA---LKKGTQLDILVEAMGRVNFD 476
Query: 536 GAYLERR--VAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSRYG 593
+ +R+ + VS AKELK+++ +++ + +K + D ++I+P+
Sbjct: 477 KSIHDRKGITEKVELVSGNQAKELKNWTVYNFPVDYSFIKDKK--YND--TKILPFMP-- 530
Query: 594 SSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSW 653
+YK+ F D +++ + GKG WVNG ++GR+W + PQ T
Sbjct: 531 -------AYYKSTFKLDKVGDTF-LDMSTWGKGMVWVNGHAMGRFWE--IGPQQT----- 575
Query: 654 YHIPRSFLKPTGNLLVLLE 672
+P +LK N +++L+
Sbjct: 576 LFMPGCWLKEGENEILVLD 594
>gi|354490770|ref|XP_003507529.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
2-like [Cricetulus griseus]
Length = 689
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 95/287 (33%), Positives = 135/287 (47%), Gaps = 29/287 (10%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+F GS+HY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL FI+
Sbjct: 116 IFGGSVHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIQ 175
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL P + R+ F + Y + M
Sbjct: 176 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPNMKLRTTYYGFTKAVDLYFDHL--MS 233
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAK--------LAVDLQTG 215
+ L GGPII Q+ENEYG +H+++ PY++ A + L D + G
Sbjct: 234 RVVPLQYKHGGPIIAVQVENEYGSYYKDHAYM----PYIKKALEDRGIIEMLLTSDNKDG 289
Query: 216 VPWVMCKQDDAPDPVINACNGRQCGETFAGPN-----SPDKPAIWTENWTSFYQVYGDEA 270
+ Q V+ N + E A + +P + E WT ++ +G
Sbjct: 290 L------QKGVVSGVLATINLQSQQELKALSSVLLSIQGIQPKMVMEYWTGWFDSWGGPH 343
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYVLTGY 317
I + ++ V+ I GS +N YM+HGGTNFG A Y
Sbjct: 344 NILDSSEVLQTVSAIIK--SGSSINLYMFHGGTNFGFINGAMHFNDY 388
>gi|1911627|gb|AAB50770.1| beta-galactosidase [dogs, spleen, Peptide Partial, 667 aa]
Length = 667
Score = 132 bits (333), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 107/331 (32%), Positives = 150/331 (45%), Gaps = 31/331 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY W + K K GL+ +QT V WN HEPQP
Sbjct: 34 IDYSHNRFLKDGQPFRYISGSIHYSHVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 93
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG +D+ FIK GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 94 GQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 153
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ Y+R+ KL
Sbjct: 154 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL- 206
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS-------------PDKPAI 254
G ++ D A + + A G F GP + P P +
Sbjct: 207 FHHHLGNDVLLFTTDGANELFLQCGALQGLYATVDF-GPGANITAAFQIQRKSEPKGPLV 265
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G E +A + +A G+ VN YM+ GGTNF A +
Sbjct: 266 NSEFYTGWLDHWGQPHSTVRTEVVASSLHDILA--HGANVNLYMFIGGTNFAYWNGANMP 323
Query: 314 ----LTGYYDQAPLDEYGLLRQPKWGHLKEL 340
T Y APL E L + K+ L+E+
Sbjct: 324 YQAQPTSYDYDAPLSEAADLTE-KYFALREV 353
>gi|300795929|ref|NP_001178947.1| beta-galactosidase-1-like protein 2 [Rattus norvegicus]
Length = 652
Score = 132 bits (333), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 93/276 (33%), Positives = 132/276 (47%), Gaps = 29/276 (10%)
Query: 46 LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIK 105
+ GSIHY R + W + K K GL+ + T V WNLHEP+ G+FDFSG DL FI
Sbjct: 79 ILGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYVPWNLHEPERGKFDFSGNLDLEAFIW 138
Query: 106 EVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMM 165
GL+V LR GP+I E GGLP WL P + R+ F + Y + M
Sbjct: 139 LAAKIGLWVILRPGPYICSEIDLGGLPSWLLQDPDMKLRTTYPGFTKAVDLYFDHL--MS 196
Query: 166 KAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVPWVMCKQ 223
+ L GGPII Q+ENEYG +H+++ PY++ A + G+ ++
Sbjct: 197 RVVPLQYKHGGPIIAVQVENEYGSYNGDHAYM----PYIKKALE-----DRGIIEMLLTS 247
Query: 224 DDAP-------DPVINACNGRQCGETFAGPNS------PDKPAIWTENWTSFYQVYGDEA 270
D+ D V+ N Q + NS +P + E WT ++ +G
Sbjct: 248 DNKDGLEKGVVDGVLATIN-LQSQQELVALNSILLSIQGIQPKMVMEYWTGWFDSWGGSH 306
Query: 271 RIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
I + ++ V+ I GS +N YM+HGGTNFG
Sbjct: 307 NILDSSEVLQTVSAIIK--DGSSINLYMFHGGTNFG 340
>gi|294812047|ref|ZP_06770690.1| Beta-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|326440560|ref|ZP_08215294.1| putative beta-galactosidase [Streptomyces clavuligerus ATCC 27064]
gi|294324646|gb|EFG06289.1| Beta-galactosidase [Streptomyces clavuligerus ATCC 27064]
Length = 582
Score = 132 bits (333), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 100/311 (32%), Positives = 152/311 (48%), Gaps = 25/311 (8%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
R +++G L SG++HY R W +A + GL+ V+T V WNLHEP+PG+++
Sbjct: 9 RDFLLDGRPVRLLSGALHYFRVHEAQWGHRLAMLRAMGLNCVETYVPWNLHEPEPGRYED 68
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
L RF+ +A GL+ +R GP+I EW GGLP WL G R+ +E F +
Sbjct: 69 P--EALGRFLDAARAAGLWAIVRPGPYICAEWENGGLPHWLTGPLGRRTRTADEEFLVPV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGM--VEHSFLEKGPPYVRWAAKLAVDL 212
+R+ ++ + ++ +GGP+++ QIENEYG + +L + +R A+ L V L
Sbjct: 127 ERWFARLLPQVVERQI--DRGGPVLMVQIENEYGSWGSDARYLRRIERALR-ASGLVVPL 183
Query: 213 QT--GVPWVMCKQDDAPDPV--INACNGRQCGETFAGPNSPDKPAIWTENWTSFYQVYGD 268
T G M P + +N +G + + P P + E W ++ +GD
Sbjct: 184 FTSDGPEDHMLTGGSVPGALATVNFGSGARAAFGTLRGHRPSGPLMCMEFWCGWFDHWGD 243
Query: 269 EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY------------VLTG 316
E +R A++ A AL G+ VN YM HGG+NFG A A T
Sbjct: 244 EHAVRDADEAAD--ALREILECGASVNVYMAHGGSNFGGWAGANRSGEVQDGALEPTATS 301
Query: 317 YYDQAPLDEYG 327
Y AP+DE G
Sbjct: 302 YDYDAPIDEAG 312
>gi|336410484|ref|ZP_08590961.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
gi|335944314|gb|EGN06136.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
Length = 769
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 159/373 (42%), Gaps = 41/373 (10%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
N T + ++NG + + +HY R W I K G++ + VFWN+H
Sbjct: 17 AAQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIH 76
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
E GQFDF+G+ D+ F + Q G+YV +R GP++ EW GGLP+WL IV R+
Sbjct: 77 EQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRT 136
Query: 146 DNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRW 204
+ F M+R A + + K A L ++GG II+ Q+ENEYG PYV
Sbjct: 137 LDPYF---MERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYAVD-----KPYVSA 188
Query: 205 AAKLAVDLQ-TGVPWVMCKQDDAPDP--------VINACNGRQCGETFAGPNS--PDKPA 253
+ T VP C D IN G + F P+ P
Sbjct: 189 IRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLREARPETPL 248
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-- 311
+ +E W+ ++ +G + R A+ + + + + + YM HGGT FG A
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLD--RNISFSLYMAHGGTTFGHWGGANN 306
Query: 312 ----YVLTGYYDQAPL-------DEYGLLRQ------PKWGHLKELHSAVKLCLKPMLSG 354
+ + Y AP+ D+Y LLR P L E+ A + P +
Sbjct: 307 PSYSAMCSSYDYDAPISEPGWTTDKYFLLRDLLKNYLPAGEQLPEIPEAFPVIEIPEVEF 366
Query: 355 VLVSMNFSKLQEA 367
V+ FS L EA
Sbjct: 367 TQVAPLFSNLPEA 379
Score = 44.7 bits (104), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 45/207 (21%), Positives = 93/207 (44%), Gaps = 29/207 (14%)
Query: 469 HDPSDSESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSV 528
+P ++ + +K++ + F +G+ + + + F L+ + L GT L+
Sbjct: 405 QEPVENGTTMKITEVHDWAQVFADGKLLARLDRRRGE--FALQ-LPALKKGTRIDILVEA 461
Query: 529 MVGLPDSGAYLERRVAGLRNVSIQGAK--ELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI 586
M + + +R+ + ++G + ELK+++ +S+ + +K
Sbjct: 462 MGRVNFDESIHDRKGITEKVELVRGKQSAELKNWTVYSFPVDYSFVQDK----------- 510
Query: 587 VPWSRYGSSTHQPL-TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTP 645
RY + T Q + +Y+T F D +++ + GKG WVNG +IGR+W + P
Sbjct: 511 ----RYKNGTAQTMPAYYRTTFRLDKVGDTF-LDMSTWGKGMVWVNGLAIGRFWE--IGP 563
Query: 646 QGTPSQSWYHIPRSFLKPTGNLLVLLE 672
Q T +P +LK N +++L+
Sbjct: 564 QQT-----LFMPGCWLKEGENEIIVLD 585
>gi|153808925|ref|ZP_01961593.1| hypothetical protein BACCAC_03226 [Bacteroides caccae ATCC 43185]
gi|149128258|gb|EDM19477.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
Length = 778
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 104/394 (26%), Positives = 172/394 (43%), Gaps = 41/394 (10%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+L+ L L I S + +++G ++ + +HY R W
Sbjct: 4 RLIALLVLFTVVIFSSAQAQTTARKFEAGKNTFLLDGEPFVVKAAELHYTRIPQAYWEHR 63
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I K G++ + +FWN+HE + G+FDFSG+ D+ F + Q G+YV +R GP++
Sbjct: 64 IEMCKTLGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCA 123
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIILSQI 183
EW GGLP+WL + R+ + ++M+R + + K A L ++GG II+ Q+
Sbjct: 124 EWEMGGLPWWLLKKKDVALRTLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQV 180
Query: 184 ENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPV---INAC 234
ENEY S PYV L + T VP C ++A + + +N
Sbjct: 181 ENEY-----SSYATDKPYVAAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNFG 235
Query: 235 NGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGS 292
G + F P+ P + +E W+ ++ +G + R A+D+ + + +
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD--RNI 293
Query: 293 YVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG-------LLRQ------PK 333
+ YM HGGT FG A + + + Y AP+ E G LLR P
Sbjct: 294 SFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTEKYFLLRDLLKTYLPA 353
Query: 334 WGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
L E+ +A+ + P ++ FS L EA
Sbjct: 354 GEALPEIPAALPVIEIPEFHFTKIAPLFSNLPEA 387
>gi|337283005|ref|YP_004622476.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
gi|335370598|gb|AEH56548.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
Length = 595
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 134/279 (48%), Gaps = 17/279 (6%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ G + SG+IHY R P W + K G + V+T V WN+HEP+ GQFDFSGR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL RFI+ Q+ GLY+ +R PFI EW +GGLP WL + + RS + F + RY
Sbjct: 72 DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDMRIRSSDPAFIEAVDRYY 130
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
++ ++ ++ QGGPI++ Q+ENEYG + ++L ++ +
Sbjct: 131 DHLLGLLTPYQV--DQGGPILMMQVENEYGSYGEDKAYLRAIRDLMKKKGVTCPLFTSDG 188
Query: 217 PWVMCKQDDA---PDPVINACNGRQCG------ETFAGPNSPDKPAIWTENWTSFYQVYG 267
PW + D + G + + F P + E W ++ +
Sbjct: 189 PWRAALRAGTLIEEDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 249 EPVIQREPEELAEAVH---EVLELGSINLYMFHGGTNFG 284
Score = 41.2 bits (95), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 35/66 (53%), Gaps = 7/66 (10%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGY 677
+++ GKG +VNG ++GR+W + P+ S Y +P FLK N L++ E E Y
Sbjct: 523 LDMTGFGKGVVFVNGHNLGRFW------EVGPTTSLY-VPHGFLKEGANSLIVFETEGRY 575
Query: 678 PPGISI 683
+ +
Sbjct: 576 QETLQL 581
>gi|423220237|ref|ZP_17206732.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
gi|392623314|gb|EIY17417.1| hypothetical protein HMPREF1061_03505 [Bacteroides caccae
CL03T12C61]
Length = 778
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 104/394 (26%), Positives = 172/394 (43%), Gaps = 41/394 (10%)
Query: 5 QLLCLFGLLLTTIGGSDGGGGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRL 64
+L+ L L I S + +++G ++ + +HY R W
Sbjct: 4 RLIALLVLFTVVIFSSAQAQTTARKFEAGKNTFLLDGEPFVVKAAELHYTRIPQAYWEHR 63
Query: 65 IAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEG 124
I K G++ + +FWN+HE + G+FDFSG+ D+ F + Q G+YV +R GP++
Sbjct: 64 IEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPYVCA 123
Query: 125 EWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIILSQI 183
EW GGLP+WL + R+ + ++M+R + + K A L ++GG II+ Q+
Sbjct: 124 EWEMGGLPWWLLKKKDVALRTLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIMVQV 180
Query: 184 ENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPV---INAC 234
ENEY S PYV L + T VP C ++A + + +N
Sbjct: 181 ENEY-----SSYATDKPYVAAVRDLVRESGFTDVPLFQCDWSSNFTNNALEDLLWTVNFG 235
Query: 235 NGRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGS 292
G + F P+ P + +E W+ ++ +G + R A+D+ + + +
Sbjct: 236 TGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD--RNI 293
Query: 293 YVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG-------LLRQ------PK 333
+ YM HGGT FG A + + + Y AP+ E G LLR P
Sbjct: 294 SFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAGWTTEKYFLLRDLLKTYLPA 353
Query: 334 WGHLKELHSAVKLCLKPMLSGVLVSMNFSKLQEA 367
L E+ +A+ + P ++ FS L EA
Sbjct: 354 GEALPEIPAALPVIEIPEFHFTKIAPLFSNLPEA 387
>gi|57619080|ref|NP_001009860.1| beta-galactosidase precursor [Felis catus]
gi|5915775|sp|O19015.1|BGAL_FELCA RecName: Full=Beta-galactosidase; AltName: Full=Acid
beta-galactosidase; Short=Lactase; Flags: Precursor
gi|2547317|gb|AAB81350.1| lysosomal beta-galactosidase [Felis catus]
Length = 669
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 107/325 (32%), Positives = 148/325 (45%), Gaps = 40/325 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HEPQP
Sbjct: 35 IDYGHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG D+ F+K GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 95 GQYQFSGEHDVEYFLKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ Y+R+ +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQRRF 208
Query: 210 VD------------------LQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPD 250
D LQ G + + D PD I A Q + P
Sbjct: 209 RDHLGGDVLLFTTDGAHEKFLQCGALQGIYATVDFGPDANITAAFQIQRK------SEPR 262
Query: 251 KPAIWTENWTSFYQVYGD-EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
P + +E +T + +G +R+R+ E +A + +A G+ VN YM+ GGTNF
Sbjct: 263 GPLVNSEFYTGWLDHWGQPHSRVRT-EVVASSLHDVLA--HGANVNLYMFIGGTNFAYWN 319
Query: 310 SAYV-----LTGYYDQAPLDEYGLL 329
A + T Y APL E G L
Sbjct: 320 GANIPYQPQPTSYDYDAPLSEAGDL 344
>gi|432108623|gb|ELK33326.1| Beta-galactosidase [Myotis davidii]
Length = 739
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 144/322 (44%), Gaps = 30/322 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y+ +G SGSIHY R W + K K GL+ +Q V WN HEPQP
Sbjct: 39 IDYNHNCFRKDGQPFRYISGSIHYFRVPRFYWQDRLLKMKMAGLNAIQIYVPWNFHEPQP 98
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FS D+ FI+ GL V LR GP+I EW GGLP WL + IV RS +
Sbjct: 99 GQYQFSEEHDVEHFIQLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKENIVLRSSDPD 158
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + + +I+ MK L GGPII Q+ENEYG S+ Y+R+ K
Sbjct: 159 YLAAVDTWLGVILPKMKP--LLYQNGGPIITVQVENEYG----SYFSCDYDYLRFLQK-R 211
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS-------------PDKPAI 254
G V+ D + ++ A G F GP + P P I
Sbjct: 212 FHYHLGNDVVLFTTDGEMEKLMQCGALQGLYATVDF-GPGANITKAFLIQRKYEPKGPLI 270
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G E +A + +A +G+ VN YM+ GGTNFG A +
Sbjct: 271 NSEFYTGWLDHWGQPHSTVKTEVVASSLQDILA--RGANVNLYMFIGGTNFGYWNGANMP 328
Query: 314 ----LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 329 YQPQPTSYDYDAPLSEAGDLTE 350
>gi|325925751|ref|ZP_08187124.1| beta-galactosidase [Xanthomonas perforans 91-118]
gi|325543808|gb|EGD15218.1| beta-galactosidase [Xanthomonas perforans 91-118]
Length = 611
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 147/350 (42%), Gaps = 39/350 (11%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + +G + SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ GQFD
Sbjct: 34 GTQFVRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 93
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F++E AQGL V LR GP+ EW GG P WL I RS + F
Sbjct: 94 FSGNNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAA 153
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+ Y + ++ L GGPII Q+ENEYG +H+++ A A+
Sbjct: 154 SQSYLDALAKQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------ADNRAMY 202
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAIWTENW 259
++ G + D D + N P PD+P + E W
Sbjct: 203 VKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYW 262
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
++ +G A A + +G N YM+ GGT+FG A
Sbjct: 263 AGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFGFMNGANFQNNPSD 320
Query: 314 -----LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVS 358
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 321 HYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGVQPPALPAPIAT 369
>gi|346725882|ref|YP_004852551.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
gi|346650629|gb|AEO43253.1| beta-galactosidase [Xanthomonas axonopodis pv. citrumelo F1]
Length = 611
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 146/350 (41%), Gaps = 39/350 (11%)
Query: 34 GRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFD 93
G + G L SG+IH+ R W + KA+ GL+ V+T VFWNL EPQ GQFD
Sbjct: 34 GTQFVRAGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFD 93
Query: 94 FSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFH 153
FSG D+ F++E AQGL V LR GP+ EW GG P WL I RS + F
Sbjct: 94 FSGNNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAA 153
Query: 154 MKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVD 211
+ Y + ++ L GGPII Q+ENEYG +H+++ A A+
Sbjct: 154 SQSYLDALAKQVQP--LLNHNGGPIIAVQVENEYGSYADDHAYM---------ADNRAMY 202
Query: 212 LQTGVPWVMCKQDDAPDPVINACNGRQCGETFAGPNS------------PDKPAIWTENW 259
++ G + D D + N P PD+P + E W
Sbjct: 203 VKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYW 262
Query: 260 TSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV------ 313
++ +G A A + +G N YM+ GGT+FG A
Sbjct: 263 AGWFDHWGKPHAATDARQQAEEFEWIL--RQGHSANLYMFIGGTSFGFMNGANFQNNPSD 320
Query: 314 -----LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVS 358
T Y A LDE G PK+ +++ + V P L + +
Sbjct: 321 HYAPQTTSYDYDAILDEAG-HPTPKFALMRDAIARVTGVQPPALPAPIAT 369
>gi|2623150|gb|AAB86405.1| mutant lysosomal beta-galactosidase [Felis catus]
Length = 669
Score = 132 bits (332), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 107/325 (32%), Positives = 148/325 (45%), Gaps = 40/325 (12%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +QT V WN HEPQP
Sbjct: 35 IDYGHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG D+ F+K GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 95 GQYQFSGEHDVEYFLKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ Y+R+ +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQRRF 208
Query: 210 VD------------------LQTG-VPWVMCKQDDAPDPVINACNGRQCGETFAGPNSPD 250
D LQ G + + D PD I A Q + P
Sbjct: 209 RDHLGGDVLLFTTDGAHEKFLQCGALQGIYATVDFGPDANITAAFQIQRK------SEPR 262
Query: 251 KPAIWTENWTSFYQVYGD-EARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTA 309
P + +E +T + +G +R+R+ E +A + +A G+ VN YM+ GGTNF
Sbjct: 263 GPLVNSEFYTGWLDHWGQPHSRVRT-EVVASSLHDVLA--HGANVNLYMFIGGTNFAYWN 319
Query: 310 SAYV-----LTGYYDQAPLDEYGLL 329
A + T Y APL E G L
Sbjct: 320 GANIPYQPQPTSYDYDAPLSEAGDL 344
>gi|387878583|ref|YP_006308886.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
gi|386792040|gb|AFJ25075.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
Length = 595
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 132/279 (47%), Gaps = 17/279 (6%)
Query: 39 INGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRR 98
+ G + SG+IHY R P W + K G + V+T V WN+HEP+ GQFDFSGR
Sbjct: 12 LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71
Query: 99 DLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYA 158
DL RFI+ Q+ GLY+ +R PFI EW +GGLP WL + + RS + F + RY
Sbjct: 72 DLERFIQIAQSLGLYMIVRPSPFICAEWEFGGLPAWLLE-EDMRIRSSDPAFIEAVDRYY 130
Query: 159 TMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGV 216
++ ++ R QGGPI++ Q+ENEYG + +L ++ +
Sbjct: 131 DHLLGLL--TRYQVDQGGPILMMQVENEYGSYGEDKVYLRAIRDLMKKKGVTCPLFTSDG 188
Query: 217 PWVMCKQDDA---PDPVINACNGRQCG------ETFAGPNSPDKPAIWTENWTSFYQVYG 267
PW + D + G + + F P + E W ++ +
Sbjct: 189 PWRATLRAGTLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248
Query: 268 DEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
+ R E++A V ++ +N YM+HGGTNFG
Sbjct: 249 EPVIQREPEELAEAVH---EVLELGSINLYMFHGGTNFG 284
Score = 42.7 bits (99), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 7/66 (10%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGY 677
+++ GKG A+VNG ++GR+W + P+ S Y +P FLK N L++ E E Y
Sbjct: 523 LDMTGFGKGVAFVNGHNLGRFW------EVGPTTSLY-VPHGFLKEGANSLIVFETEGRY 575
Query: 678 PPGISI 683
+ +
Sbjct: 576 QETLQL 581
>gi|301767332|ref|XP_002919083.1| PREDICTED: beta-galactosidase-like [Ailuropoda melanoleuca]
Length = 668
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 145/322 (45%), Gaps = 30/322 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q+ V WN HEPQP
Sbjct: 35 IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 94
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG D+ FIK GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 95 GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ ++R+ KL
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFSCDYDHLRFLQKL- 207
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS-------------PDKPAI 254
G ++ D A + + A G F GP + P P +
Sbjct: 208 FHYHLGNDVLLFTTDGAHEMFLKCGALQGLYATVDF-GPGANITAAFEIQRKSEPRGPLV 266
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G E +A AL +G+ VN YM+ GGTNF A +
Sbjct: 267 NSEFYTGWLDHWGQPHSTAKTEVVA--SALHEILSRGANVNLYMFIGGTNFAYWNGANMP 324
Query: 314 ----LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 325 YQAQPTSYDYDAPLSEAGDLTE 346
>gi|281352249|gb|EFB27833.1| hypothetical protein PANDA_007660 [Ailuropoda melanoleuca]
Length = 626
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 145/322 (45%), Gaps = 30/322 (9%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
+ Y + +G SGSIHY R W + K K GL+ +Q+ V WN HEPQP
Sbjct: 8 IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 67
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEP 149
GQ+ FSG D+ FIK GL V LR GP+I EW GGLP WL I+ RS +
Sbjct: 68 GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 127
Query: 150 FKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLA 209
+ + ++ +++ MK L GGPII Q+ENEYG S+ ++R+ KL
Sbjct: 128 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYG----SYFSCDYDHLRFLQKL- 180
Query: 210 VDLQTGVPWVMCKQDDAPDPVIN--ACNGRQCGETFAGPNS-------------PDKPAI 254
G ++ D A + + A G F GP + P P +
Sbjct: 181 FHYHLGNDVLLFTTDGAHEMFLKCGALQGLYATVDF-GPGANITAAFEIQRKSEPRGPLV 239
Query: 255 WTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV- 313
+E +T + +G E +A AL +G+ VN YM+ GGTNF A +
Sbjct: 240 NSEFYTGWLDHWGQPHSTAKTEVVA--SALHEILSRGANVNLYMFIGGTNFAYWNGANMP 297
Query: 314 ----LTGYYDQAPLDEYGLLRQ 331
T Y APL E G L +
Sbjct: 298 YQAQPTSYDYDAPLSEAGDLTE 319
>gi|229545563|ref|ZP_04434288.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256619317|ref|ZP_05476163.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256853375|ref|ZP_05558745.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256964870|ref|ZP_05569041.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|257090147|ref|ZP_05584508.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|294614275|ref|ZP_06694194.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|307272958|ref|ZP_07554205.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|307277803|ref|ZP_07558888.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|307291733|ref|ZP_07571605.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|384518848|ref|YP_005706153.1| beta-galactosidase [Enterococcus faecalis 62]
gi|422685728|ref|ZP_16743941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|422689100|ref|ZP_16747212.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|422720655|ref|ZP_16777264.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|422731066|ref|ZP_16787446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|422739263|ref|ZP_16794446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|430849460|ref|ZP_19467237.1| glycosyl hydrolase [Enterococcus faecium E1185]
gi|229309303|gb|EEN75290.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
gi|256598844|gb|EEU18020.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
gi|256711834|gb|EEU26872.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
gi|256955366|gb|EEU71998.1| beta-galactosidase [Enterococcus faecalis HIP11704]
gi|256998959|gb|EEU85479.1| beta-galactosidase [Enterococcus faecalis CH188]
gi|291592934|gb|EFF24524.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
gi|306497185|gb|EFM66730.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
gi|306505543|gb|EFM74728.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
gi|306510572|gb|EFM79595.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
gi|315029440|gb|EFT41372.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
gi|315032046|gb|EFT43978.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
gi|315144925|gb|EFT88941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
gi|315162898|gb|EFU06915.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
gi|315577862|gb|EFU90053.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
gi|323480981|gb|ADX80420.1| beta-galactosidase [Enterococcus faecalis 62]
gi|430537598|gb|ELA77922.1| glycosyl hydrolase [Enterococcus faecium E1185]
Length = 611
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 140/291 (48%), Gaps = 33/291 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + ++T + WNLHEP G +DF
Sbjct: 8 EEFLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+V F+ Q GL V LR +I EW +GGLP WL + RS + F +
Sbjct: 68 EGMKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSF-LEKGPPYVRWAAKLAVDLQ 213
+ Y +++ + K L + GGP+I+ Q+ENEYG S+ +EK Y+R ++ +
Sbjct: 127 RTYFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEK--EYLRQTKQVMEEFG 178
Query: 214 TGVPWVMCKQDDAPDPVINACN------------GRQCGE------TFAGPNSPDKPAIW 255
VP + D A + V++ G E F + P +
Sbjct: 179 IDVP--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEYWDGWFNRWGEPIIKRDGQDLANEVKDMLA--LGS-LNLYMFHGGTNFG 284
>gi|312903586|ref|ZP_07762766.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
gi|310633462|gb|EFQ16745.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
Length = 611
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 140/291 (48%), Gaps = 33/291 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + ++T + WNLHEP G +DF
Sbjct: 8 EEFLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+V F+ Q GL V LR +I EW +GGLP WL + RS + F +
Sbjct: 68 EGMKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSF-LEKGPPYVRWAAKLAVDLQ 213
+ Y +++ + K L + GGP+I+ Q+ENEYG S+ +EK Y+R ++ +
Sbjct: 127 RTYFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEK--EYLRQTKQVMEEFG 178
Query: 214 TGVPWVMCKQDDAPDPVINACN------------GRQCGE------TFAGPNSPDKPAIW 255
VP + D A + V++ G E F + P +
Sbjct: 179 IDVP--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEYWDGWFNRWGEPIIKRDGQDLANEVKDMLA--LGS-LNLYMFHGGTNFG 284
>gi|237719727|ref|ZP_04550208.1| beta-galactosidase [Bacteroides sp. 2_2_4]
gi|229450996|gb|EEO56787.1| beta-galactosidase [Bacteroides sp. 2_2_4]
Length = 778
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 157/344 (45%), Gaps = 39/344 (11%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++ + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDFSG+ D+ F + Q G+YV +R GP+
Sbjct: 61 EHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIATFCRAAQKHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL I R+ + ++M+R + + K A L ++GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDIALRTLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG ++K PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYG---IDK--PYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG 327
+ + YM HGGT FG A + + + Y AP+ E G
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|421514041|ref|ZP_15960756.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
gi|401672838|gb|EJS79281.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
Length = 611
Score = 132 bits (331), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 140/291 (48%), Gaps = 33/291 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + ++T + WNLHEP G +DF
Sbjct: 8 EEFLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+V F+ Q GL V LR +I EW +GGLP WL + RS + F +
Sbjct: 68 EGMKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSF-LEKGPPYVRWAAKLAVDLQ 213
+ Y +++ + K L + GGP+I+ Q+ENEYG S+ +EK Y+R ++ +
Sbjct: 127 RTYFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEK--EYLRQTKQVMEEFG 178
Query: 214 TGVPWVMCKQDDAPDPVINACN------------GRQCGE------TFAGPNSPDKPAIW 255
VP + D A + V++ G E F + P +
Sbjct: 179 IDVP--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEYWDGWFNRWGEPIIKRDGQDLANEVKDMLA--LGS-LNLYMFHGGTNFG 284
>gi|148231352|ref|NP_001080304.1| galactosidase, beta 1-like 2 [Xenopus laevis]
gi|28422231|gb|AAH46858.1| Loc89944-prov protein [Xenopus laevis]
Length = 634
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 153/324 (47%), Gaps = 35/324 (10%)
Query: 37 LIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSG 96
++NG + GS+HY R W + K K G++ + T V WNLHEP+ G+FDFS
Sbjct: 51 FLLNGIPYRILGGSMHYFRVPMPYWRDRMKKMKACGINTLTTYVPWNLHEPRKGKFDFSK 110
Query: 97 RRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKR 156
D+ F+ GL+V LR GP+I EW GGLP WL + R+ F +
Sbjct: 111 DLDISEFLAIASEMGLWVILRPGPYICAEWDLGGLPSWLLRDKDMKLRTTYRGFTEATEA 170
Query: 157 YATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVRWAA 206
Y ++ + A+ S GGPII Q+ENEYG ++++ +EKG + +
Sbjct: 171 YLDELIP--RIAKYQYSNGGPIIAVQVENEYGSYAKDANYMEFIKNALVEKGIVELLLTS 228
Query: 207 KLAVDLQTGVPWVMCKQDDAPDPVINACNGRQCGET-FAGPNS--PDKPAIWTENWTSFY 263
L +G + + V+ N ++ F+ NS +KP + E WT ++
Sbjct: 229 DNKDGLSSG----------SLENVLATVNFQKIEPVLFSYLNSIQSNKPVMVMEFWTGWF 278
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTG 316
+G + I +++ V+ + + G+ +N YM+HGGTNFG A +T
Sbjct: 279 DYWGGKHHIFDVDEMISTVSEVLNR--GASINLYMFHGGTNFGFMNGALHFHEYRPDITS 336
Query: 317 YYDQAPLDEYGLLRQPKWGHLKEL 340
Y APL E G K+ L+EL
Sbjct: 337 YDYDAPLTEAGDYTS-KYFKLREL 359
>gi|160885481|ref|ZP_02066484.1| hypothetical protein BACOVA_03481 [Bacteroides ovatus ATCC 8483]
gi|423290348|ref|ZP_17269197.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
gi|156109103|gb|EDO10848.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
gi|392665735|gb|EIY59258.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
CL02T12C04]
Length = 778
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 154/344 (44%), Gaps = 39/344 (11%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++ + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDFSG+ D+ F + Q G+YV +R GP+
Sbjct: 61 EHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL + R+ + ++M+R + + K A L ++GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDVALRTLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYGTD-----KPYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG 327
+ + YM HGGT FG A + + + Y AP+ E G
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEAG 334
>gi|257143787|emb|CAZ44333.1| beta-D-galactosidase [Paenibacillus thiaminolyticus]
Length = 583
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 153/322 (47%), Gaps = 37/322 (11%)
Query: 41 GHRKI-LFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRD 99
G R I L SG+IHY R P W + K K G + ++T V WN+HEP+ G+F F D
Sbjct: 14 GDRPIQLISGAIHYFRIVPAYWEDRLRKIKAMGCNCIETYVAWNVHEPREGEFHFERMAD 73
Query: 100 LVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYAT 159
+ F++ GLYV +R P+I EW +GGLP WL + R ++ F + Y
Sbjct: 74 VAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWLLK-DDMRLRCNDPRFLEKVSAYYD 132
Query: 160 MIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDLQTGVP 217
++ + L A++GGPII QIENEYG + ++L+ A+ A+ ++ GV
Sbjct: 133 ALLPQLTP--LLATKGGPIIAVQIENEYGSYGNDQAYLQ---------AQRAMLIERGVD 181
Query: 218 WVMCKQDDAPDP---------VINACN-GRQCGETFAGPNS--PDKPAIWTENWTSFYQV 265
++ D D V+ N G + E F PD P + E W ++
Sbjct: 182 VLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMCMEYWNGWFDH 241
Query: 266 YGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAY-------VLTGYY 318
+ + R A+D A + + G+ VN+YM HGGTNFG + A +T Y
Sbjct: 242 WFEPHHTRDAKDAARVLDDMLG--MGASVNFYMVHGGTNFGFGSGANHSDKYEPTVTSYD 299
Query: 319 DQAPLDEYGLLRQPKWGHLKEL 340
A + E G L PK+ +E+
Sbjct: 300 YDAAISEAGDL-TPKYHAFREV 320
>gi|29376389|ref|NP_815543.1| glycosyl hydrolase [Enterococcus faecalis V583]
gi|227519038|ref|ZP_03949087.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227553661|ref|ZP_03983710.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256961654|ref|ZP_05565825.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|293383358|ref|ZP_06629271.1| beta-galactosidase [Enterococcus faecalis R712]
gi|293388990|ref|ZP_06633475.1| beta-galactosidase [Enterococcus faecalis S613]
gi|312907816|ref|ZP_07766806.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|312910433|ref|ZP_07769280.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|422714340|ref|ZP_16771066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|422715597|ref|ZP_16772313.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|424676484|ref|ZP_18113355.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|424681702|ref|ZP_18118489.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|424685588|ref|ZP_18122282.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|424686206|ref|ZP_18122874.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|424690524|ref|ZP_18127059.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|424694932|ref|ZP_18131318.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|424696643|ref|ZP_18132984.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|424700339|ref|ZP_18136532.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|424703758|ref|ZP_18139884.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|424712611|ref|ZP_18144783.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|424718249|ref|ZP_18147501.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|424721894|ref|ZP_18150963.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|424723972|ref|ZP_18152924.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|424733572|ref|ZP_18162127.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|424741709|ref|ZP_18170052.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|424751990|ref|ZP_18179997.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
gi|29343852|gb|AAO81613.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
gi|227073538|gb|EEI11501.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
gi|227177203|gb|EEI58175.1| possible beta-galactosidase [Enterococcus faecalis HH22]
gi|256952150|gb|EEU68782.1| beta-galactosidase [Enterococcus faecalis Merz96]
gi|291079149|gb|EFE16513.1| beta-galactosidase [Enterococcus faecalis R712]
gi|291081771|gb|EFE18734.1| beta-galactosidase [Enterococcus faecalis S613]
gi|310626177|gb|EFQ09460.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
gi|311289706|gb|EFQ68262.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
gi|315575942|gb|EFU88133.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
gi|315580774|gb|EFU92965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
gi|402350621|gb|EJU85522.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
gi|402356496|gb|EJU91227.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
gi|402358329|gb|EJU93003.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
gi|402364102|gb|EJU98549.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
gi|402367740|gb|EJV02077.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
gi|402369105|gb|EJV03397.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
gi|402374029|gb|EJV08075.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
gi|402377412|gb|EJV11319.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
gi|402379869|gb|EJV13650.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
gi|402382152|gb|EJV15835.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
gi|402384002|gb|EJV17579.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
gi|402390099|gb|EJV23464.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
gi|402391584|gb|EJV24885.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
gi|402396442|gb|EJV29504.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
gi|402401146|gb|EJV33935.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
gi|402404973|gb|EJV37581.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
Length = 611
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 140/291 (48%), Gaps = 33/291 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + ++T + WNLHEP G +DF
Sbjct: 8 EEFLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+V F+ Q GL V LR +I EW +GGLP WL + RS + F +
Sbjct: 68 EGMKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSF-LEKGPPYVRWAAKLAVDLQ 213
+ Y +++ + K L + GGP+I+ Q+ENEYG S+ +EK Y+R ++ +
Sbjct: 127 RTYFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEK--EYLRQTKQVMEEFG 178
Query: 214 TGVPWVMCKQDDAPDPVINACN------------GRQCGE------TFAGPNSPDKPAIW 255
VP + D A + V++ G E F + P +
Sbjct: 179 IDVP--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEYWDGWFNRWGEPIIKRDGQDLANEVKDMLA--LGS-LNLYMFHGGTNFG 284
>gi|307275710|ref|ZP_07556850.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
gi|306507586|gb|EFM76716.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
Length = 611
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 140/291 (48%), Gaps = 33/291 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+++G L SG+IHY R TP W + K G + ++T + WNLHEP G +DF
Sbjct: 8 EEFLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G +D+V F+ Q GL V LR +I EW +GGLP WL + RS + F +
Sbjct: 68 EGMKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWLLK-EHVRLRSTDPRFIAKV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSF-LEKGPPYVRWAAKLAVDLQ 213
+ Y +++ + K L + GGP+I+ Q+ENEYG S+ +EK Y+R ++ +
Sbjct: 127 RTYFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEK--EYLRQTKQVMEEFG 178
Query: 214 TGVPWVMCKQDDAPDPVINACN------------GRQCGE------TFAGPNSPDKPAIW 255
VP + D A + V++ G E F + P +
Sbjct: 179 IDVP--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMC 236
Query: 256 TENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG 306
E W ++ +G+ R +D+A V +A GS +N YM+HGGTNFG
Sbjct: 237 MEYWDGWFNRWGEPIIKRDGQDLANEVKDMLA--LGS-LNLYMFHGGTNFG 284
>gi|325261840|ref|ZP_08128578.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
gi|324033294|gb|EGB94571.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
Length = 581
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/325 (29%), Positives = 146/325 (44%), Gaps = 27/325 (8%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
I+ + + SG +HY R + W + K K G + V+T + WNLHE + G+F F
Sbjct: 8 EDFYIDNQKVKIISGGVHYFRIMAEYWKDCLLKLKAFGCNTVETYIPWNLHEKEKGEFCF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G D+ +F+ + GLYV LR P+I EW +GGLP+WL G+ R +PF H+
Sbjct: 68 EGNLDITKFVHIAKDLGLYVILRPSPYICAEWEFGGLPYWLLKEDGMRLRCSYKPFLKHV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
+ Y + ++ A L ++GGP+I+ Q+ENEYG + L Y++ V
Sbjct: 128 EEYYHRLFEVI--APLQYTKGGPVIMMQVENEYGYYGNDTL-----YLKTLQDFMVSYGC 180
Query: 215 GVPWVM----------CKQDDAPDPVINACNGRQCGETFAGPNSPDKPAIWTENWTSFYQ 264
VP V C + + N + + +KP + E W ++
Sbjct: 181 EVPLVTSDGPWGDAFDCGKLEGVLQTGNFGSKSRQQLQIMRDKIGNKPLMCMEFWVGWFD 240
Query: 265 VYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG-RTASAYV------LTGY 317
+G ED + ++ +VN YM+ GGTNFG S Y +T Y
Sbjct: 241 SWGQTE--HKQEDPNKNAENLDEILESGHVNIYMFMGGTNFGFMNGSNYYDVLTPDVTSY 298
Query: 318 YDQAPLDEYGLLRQPKWGHLKELHS 342
A L E G L PK+ LK + S
Sbjct: 299 DYDALLTEAGDL-TPKYELLKNVVS 322
>gi|423294349|ref|ZP_17272476.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
gi|392675540|gb|EIY68981.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
CL03T12C18]
Length = 778
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 157/344 (45%), Gaps = 39/344 (11%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++ + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDFSG+ D+ F + Q G+YV +R GP+
Sbjct: 61 EHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL I R+ + ++M+R + + K A L ++GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDIALRTLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG ++K PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYG---IDK--PYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG 327
+ + YM HGGT FG A + + + Y AP+ E G
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|62955063|ref|NP_001017547.1| beta-galactosidase precursor [Danio rerio]
gi|62089564|gb|AAH92166.1| Galactosidase, beta 1 [Danio rerio]
gi|182890870|gb|AAI65636.1| Glb1 protein [Danio rerio]
Length = 651
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 113/363 (31%), Positives = 162/363 (44%), Gaps = 43/363 (11%)
Query: 29 NVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQ 88
+V Y + +G SGSIHY R W + K GL+ +QT V WN HE
Sbjct: 27 SVDYHRNCFLKDGEPFRYISGSIHYSRIPRVYWKDRLLKMYMAGLNAIQTYVPWNFHEAV 86
Query: 89 PGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNE 148
PGQ+DFSG RDL +F++ Q GL V +R GP+I EW GGLP WL IV RS +
Sbjct: 87 PGQYDFSGDRDLEQFLQLCQDIGLLVIMRPGPYICAEWDMGGLPAWLLKKKDIVLRSSDP 146
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKL 208
+ + ++ ++ ++K R GGPII Q+ENEYG S+ Y+R ++L
Sbjct: 147 DYLAAVDKWMGKLLPIIK--RYLYQNGGPIITVQVENEYG----SYFACDFNYMRHLSQL 200
Query: 209 --------AVDLQT---GVPWVMCKQ--------DDAPDPVINACNGRQCGETFAGP--N 247
AV T G+ ++ C D P + A Q GP N
Sbjct: 201 FRFYLGEEAVLFTTDGAGLGYLKCGSLQGLYATVDFGPGANVTAAFEAQRHVEPRGPLVN 260
Query: 248 SPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG- 306
S P W ++W + V A +++ +I G+ VN YM+ GGTNFG
Sbjct: 261 SEFYPG-WLDHWGEKHSVVPTSAVVKTLNEIL---------EIGANVNLYMFIGGTNFGY 310
Query: 307 ----RTASAYVLTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGVLVSMNFS 362
T T Y +PL E G L + K+ ++E+ K + +L +
Sbjct: 311 WNGANTPYGPQPTSYDYDSPLTEAGDLTE-KYFAIREVIKMYKDVPEGILPPSTPKFAYG 369
Query: 363 KLQ 365
K+Q
Sbjct: 370 KVQ 372
>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
Length = 610
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 141/314 (44%), Gaps = 35/314 (11%)
Query: 36 SLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFS 95
+ +++G + SG IHYPR + W + AK GL+ + T VFWN+HEP+ GQ+DFS
Sbjct: 32 AFLLDGKPLQMISGEIHYPRVPRECWRDRMKMAKAMGLNTIGTYVFWNVHEPEKGQYDFS 91
Query: 96 GRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMK 155
G D+ F+K + + L+V LR P++ EW +GG P+WL ++ G+ RS EP
Sbjct: 92 GNNDIAAFVKMAKEEDLWVVLRPSPYVCAEWEFGGYPYWLQEIKGLKVRS-KEPQYLEAY 150
Query: 156 RYATMIVNMMKAARLYASQGGPIILSQIENEYG----------MVEHSFLEKGPPYVRWA 205
R M V + + L + GG I++ QIENEYG + F+E G + +
Sbjct: 151 RNYIMAVG-KQLSPLLVTHGGNILMVQIENEYGSYSDDKDYLDINRKMFVEAGFDGLLYT 209
Query: 206 AKLAVDLQTG-VPWVMCKQDDAPDP--VINACNGRQCGETFAGPNSPDKPAIWTENWTSF 262
++ G +P ++ + DP V N G+ P A W W +
Sbjct: 210 CDPKAAIKNGHLPGLLPAINGVDDPLQVKQLINENHSGK------GPYYIAEWYPAWFDW 263
Query: 263 YQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASAYV--------- 313
+ R Y L G +N YM+HGGT G A
Sbjct: 264 WGTKHHTVPYRQ-----YLGKLDSVLAAGISINMYMFHGGTTRGFMNGANANDADPYEPQ 318
Query: 314 LTGYYDQAPLDEYG 327
++ Y APLDE G
Sbjct: 319 ISSYDYDAPLDEAG 332
Score = 45.4 bits (106), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 82/203 (40%), Gaps = 37/203 (18%)
Query: 475 ESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLPD 534
+ +L++ L +NG+ G + S L+ L G + LL +G +
Sbjct: 415 KGLLQLKELRDYCVVMVNGKRAGVLDRRSKRDSIALD----LPAGKVKLDLLVENLGRIN 470
Query: 535 SGAYLERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRI---VPWSR 591
G YL G+ + +ELK + Q GL +KL G + VP R
Sbjct: 471 FGPYLLSNRKGITEKVLFDRQELKGWQ------QYGLPFDKLPAVAAKGIKAGANVPTYR 524
Query: 592 YGSSTHQPL--TWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTP 649
G+ T TW +++ + GKG W+NG +GRYW Q P
Sbjct: 525 QGTFTLDKTGDTW---------------LDMSNWGKGAVWINGHHLGRYW------QVGP 563
Query: 650 SQSWYHIPRSFLKPTGNLLVLLE 672
Q+ Y +P +LK N +V++E
Sbjct: 564 QQTIY-VPAEWLKKGMNDIVIME 585
>gi|328958462|ref|YP_004375848.1| beta-galactosidase [Carnobacterium sp. 17-4]
gi|328674786|gb|AEB30832.1| beta-galactosidase [Carnobacterium sp. 17-4]
Length = 589
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 96/319 (30%), Positives = 150/319 (47%), Gaps = 38/319 (11%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
++NG + SG++HY R P+ W + K G + V+T + WN+HEP+ G++ F
Sbjct: 8 EDFLLNGEPFKITSGAVHYFRVLPEDWYHSLYNLKALGFNTVETYIPWNVHEPKEGEYQF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
SG+ D+ +F++ + GL+V LR P+I EW +GGLP WL ++ RS + F +
Sbjct: 68 SGQWDIKKFVQLAEELGLFVILRPSPYICAEWEFGGLPAWLLTYKDMLIRSSDPVFIEKV 127
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQT 214
RY ++ + L GGP+I+ Q+ENEYG S+ E Y+R +L + L
Sbjct: 128 SRYYKELLKQITP--LQVDHGGPVIMMQLENEYG----SYGED-KEYLRTLYELMLKLGV 180
Query: 215 GVP-------WVMCKQDDAP---DPVINACNGRQCGETFAGPNSPDK------PAIWTEN 258
+P W ++ D + G + E F + P + E
Sbjct: 181 TIPIFTSDGAWRATQEAGTMTDLDILTTGNFGSRSKENFKELKEFHESKGKKWPLMCMEY 240
Query: 259 WTSFYQVYGDEARIRSAEDIAYHV--ALFIAKMKGSYVNYYMYHGGTNFG--RTASAYV- 313
W ++ + D R A ++ V AL I + N YM+HGGTNFG SA +
Sbjct: 241 WDGWFNRWNDPIIKRDALELTQDVKEALEIGSL-----NLYMFHGGTNFGFMNGCSARLR 295
Query: 314 -----LTGYYDQAPLDEYG 327
+T Y APL+E G
Sbjct: 296 KDLPQVTSYDYDAPLNEQG 314
Score = 42.4 bits (98), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 51/213 (23%), Positives = 86/213 (40%), Gaps = 30/213 (14%)
Query: 474 SESVLKVSSLGHVLHAFINGEFVGSAHGKHSDKSFTLEKMVHLINGTNNVSLLSVMVGLP 533
E +V LH F+N E + + + + + I+G+N + +L +G
Sbjct: 398 DEEFYRVIDGSDRLHFFLNEEKIATQYQEEIGEKI----YASPISGSNQLDVLVENMGRV 453
Query: 534 DSGAYL--ERRVAGLRNVSIQGAKELKDFSSFSWGYQVGLLGEKLQIFTDYGSRIVPWSR 591
+ G L + + G+R + + ++ +S + E L I D W
Sbjct: 454 NYGHKLLADTQQKGIRRGVMSDLHFITNWEQYSLDF-----SEPLSIDFD-----KEWKE 503
Query: 592 YGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQ 651
S +Q YK DAP + IN+ GKG VNG +IGR+W P+
Sbjct: 504 NSPSFYQ----YKVTIDAP---EDTFINMELFGKGIVLVNGFNIGRFW------NVGPTL 550
Query: 652 SWYHIPRSFLKPTGNLLVLLEEENGYPPGISID 684
S Y P S + N +++ E E + IS++
Sbjct: 551 SLY-APMSLFRKGENEIIVFETEGIWSKSISLE 582
>gi|189217683|ref|NP_001121284.1| galactosidase, beta 1-like precursor [Xenopus laevis]
gi|115527881|gb|AAI24928.1| LOC100158367 protein [Xenopus laevis]
Length = 645
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/312 (33%), Positives = 142/312 (45%), Gaps = 19/312 (6%)
Query: 48 SGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEV 107
SGSIHY R W + K K GLD + T V WN HE +PG ++FSG D+ F+K
Sbjct: 48 SGSIHYSRIPQFYWKDRLLKMKMAGLDAIYTYVPWNFHETKPGVYNFSGDHDIESFLKLA 107
Query: 108 QAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA 167
GL V LR GP+I EW GGLP WL IV RS + + + + + + MK
Sbjct: 108 NEIGLLVILRAGPYICAEWDMGGLPAWLLAKESIVLRSSDPDYLQAVDNWMGVFLPKMKP 167
Query: 168 ARLYASQGGPIILSQIENEYG---MVEHSFLEKGPPYVRWAAKLAVDLQT----GVPWVM 220
L GGPII Q+ENEYG ++++L R V L T + V
Sbjct: 168 --LLYHNGGPIISVQVENEYGSYFTCDYNYLRHLLQLFRHHLGDEVILFTTDGSALQLVR 225
Query: 221 CKQDDAPDPVINACNGRQCGETFAGPN--SPDKPAIWTENWTSFYQVYGDEARIRSAEDI 278
C ++ G ETF P P I +E +T + +G+ + + E +
Sbjct: 226 CGTIQGLYTTVDFGPGSNITETFLVQRHCEPKGPLINSEFYTGWLDHWGEPHSVVATERV 285
Query: 279 AYHVALFIAKMKGSYVNYYMYHGGTNFG-----RTASAYVLTGYYDQAPLDEYGLLRQPK 333
+ +A G+ VN YM+ GGTNFG T A T Y APL E G L K
Sbjct: 286 TKSLDEILA--IGASVNMYMFIGGTNFGYWNGANTPYAPQPTSYDYDAPLSEAGDLTD-K 342
Query: 334 WGHLKELHSAVK 345
+ ++E+ K
Sbjct: 343 YFAIREVIKKYK 354
Score = 39.7 bits (91), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 58/207 (28%), Positives = 82/207 (39%), Gaps = 47/207 (22%)
Query: 508 FTLEKMVHLINGTNNVSLLSVMVGLPDSG---------AYLERRVAGLRNVSIQGAKELK 558
F L + IN +N +L ++ G+ D LER NV+ EL
Sbjct: 411 FVLYRTTLPINCSNPTTLTTLFNGVRDRAYVMVNGVPQGVLERDKQTAINVTGAAGAELD 470
Query: 559 ---------DFSSFSWGYQ-----VGLLGEKLQIFT----DYGSRIVPWSRYGSSTHQPL 600
+F ++ ++ V L GE L +T D GS I S S+ H P
Sbjct: 471 LLVESMGRVNFGRYNNDFKGLLTNVTLNGETLVNWTMYPLDIGSAIN--SGLLSTIHSPY 528
Query: 601 T-------WYKTVFDAPTG----SDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQGTP 649
T +YK PTG I KG+ W+NG ++GRYW P P
Sbjct: 529 TSTFSAPTFYKGSLIIPTGIPQLPQDTFIQFPGWTKGQIWINGFNLGRYW-----PVRGP 583
Query: 650 SQSWYHIPRSFLKPTG-NLLVLLEEEN 675
+ Y +PR+ L T N + +LE EN
Sbjct: 584 QVTLY-VPRNILTTTQINNITVLELEN 609
>gi|383110805|ref|ZP_09931623.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
gi|313694380|gb|EFS31215.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
Length = 778
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/344 (28%), Positives = 157/344 (45%), Gaps = 39/344 (11%)
Query: 6 LLCLFGLLLTTIGGSDGGG----GGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMW 61
LL LF ++ + + G N DG+ ++ + +HY R W
Sbjct: 8 LLVLFTVIFFSTAQAQTTARKFEAGKNTFLLDGKPFVVK-------AAELHYTRIPQAYW 60
Query: 62 PRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPF 121
I K G++ + +FWN+HE + G+FDFSG+ D+ F + Q G+YV +R GP+
Sbjct: 61 EHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFSGQNDIAAFCRAAQKHGMYVIVRPGPY 120
Query: 122 IEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHMKRYATMIVNMMKA-ARLYASQGGPIIL 180
+ EW GGLP+WL I R+ + ++M+R + + K A L ++GG II+
Sbjct: 121 VCAEWEMGGLPWWLLKKKDIALRTLD---PYYMERVGIFMKEVGKQLAPLQVNKGGNIIM 177
Query: 181 SQIENEYGMVEHSFLEKGPPYVRWAAKLAVDLQ-TGVPWVMCK-----QDDAPDPVINAC 234
Q+ENEYG ++K PYV L + T VP C ++A D +I
Sbjct: 178 VQVENEYGSYG---IDK--PYVSAVRDLVRESGFTDVPLFQCDWSSNFTNNALDDLIWTV 232
Query: 235 N---GRQCGETFAGPNS--PDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKM 289
N G + F P+ P + +E W+ ++ +G + R A+D+ + +
Sbjct: 233 NFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFDHWGRKHETRPAKDMVQGIKDMLD-- 290
Query: 290 KGSYVNYYMYHGGTNFGR------TASAYVLTGYYDQAPLDEYG 327
+ + YM HGGT FG A + + + Y AP+ E G
Sbjct: 291 RNISFSLYMTHGGTTFGHWGGANNPAYSAMCSSYDYDAPISEPG 334
>gi|222152241|ref|YP_002561416.1| beta-galactosidase [Streptococcus uberis 0140J]
gi|222113052|emb|CAR40398.1| putative beta-galactosidase precursor [Streptococcus uberis 0140J]
Length = 594
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 125/450 (27%), Positives = 196/450 (43%), Gaps = 41/450 (9%)
Query: 35 RSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQFDF 94
+ ++G + SGSIHY R P+ W R + K G + V+T V WNLHEPQ G F F
Sbjct: 8 ENFYLDGKPFKILSGSIHYFRVAPEAWYRSLYNLKALGFNTVETYVPWNLHEPQKGNFHF 67
Query: 95 SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKFHM 154
G DL F+ Q GLY +R P+I EW +GGLP WL + P I RS + + H+
Sbjct: 68 DGLADLEGFLDLAQELGLYAIVRPSPYICAEWEFGGLPGWLLNEP-IRVRSRDPKYLKHV 126
Query: 155 KRYATMIVNMMKAARLYASQGGPIILSQIENEYGMV--EHSFLEKGPPYVRWAAKLAVDL 212
K Y ++ M K + GG I++ Q+ENEYG + +L + +R A
Sbjct: 127 KDYYDVL--MPKLVKRQLENGGNILMFQVENEYGSYGEDKDYLRELMTMMRQLGVTAPLF 184
Query: 213 QTGVPWVMCKQDDA--PDPVINACN-------GRQCGETFAGPNSPDKPAIWTENWTSFY 263
+ PW + + D V+ N + + F N+ P + E W ++
Sbjct: 185 TSDGPWHATLRSGSLIEDDVLVTGNFGSKAKINFESMKAFFKENNKKWPLMCMEFWIGWF 244
Query: 264 QVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFG--RTASAYV------LT 315
+ + R ++ + + ++ +N YM+HGGTNFG ASA + +T
Sbjct: 245 NRWKEPIIRRDPKET---IDAIMEVLEEGSINLYMFHGGTNFGFMNGASARLQQDLPQVT 301
Query: 316 GYYDQAPLDE-------YGLLRQPKWGHLKELHSAVKLCLKPM-LSGVLVSMNFSKLQEA 367
Y A LDE Y LL++ + LH L K + + G+ ++ + L E
Sbjct: 302 SYDYDAILDEAGNPTPKYFLLQERLQKNFPNLHFDKPLENKTIAIKGIALTEKVN-LVET 360
Query: 368 FIFQGSSECAAFLVNKDKRNNATVYFSNLMYELPP------LSISILPDCKTVAFNTAKL 421
+ A + VN + N T Y Y LP L + D V N +
Sbjct: 361 LDSISTLTEAFYPVNMESLNQTTGYILYRTY-LPKDNARERLRLIDARDRAKVYLNNRLI 419
Query: 422 DSVEQWEEYKEAIPTYDETSLRANFLLEQM 451
++ Q+E + I + + + + L+E M
Sbjct: 420 ETQYQFEIGNDIIIEQETENNQLDILIENM 449
Score = 39.7 bits (91), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 32/66 (48%), Gaps = 7/66 (10%)
Query: 618 INLISMGKGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPRSFLKPTGNLLVLLEEENGY 677
++L GKG A++N +GR+W P S Y +P SFLK N LV+ E E
Sbjct: 522 LDLSQFGKGVAYINNNHLGRFW------NVGPHLSLY-VPESFLKLGKNRLVIFETEGQM 574
Query: 678 PPGISI 683
P I
Sbjct: 575 TPSIQF 580
>gi|321461557|gb|EFX72588.1| hypothetical protein DAPPUDRAFT_58801 [Daphnia pulex]
Length = 648
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/334 (29%), Positives = 150/334 (44%), Gaps = 33/334 (9%)
Query: 24 GGGGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWN 83
GG + + ++NG +FSG++HY R P W + K + G+ VV+T V WN
Sbjct: 23 GGVTSGLVPTSNGFLLNGKPFRIFSGAVHYFRVHPAYWRDRLRKLRAAGITVVETYVAWN 82
Query: 84 LHEPQPGQFDF-------SGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLH 136
LHEPQ FDF S DL FI+ + L+V LR GP+I EW +GGLP WL
Sbjct: 83 LHEPQKNVFDFGKGNNDMSIFLDLKLFIQTAYEEDLFVILRPGPYICSEWDFGGLPSWLL 142
Query: 137 DVPGIVFRSDNEPFKFHMKRYATMIVNMMKAARLYASQG-GPIILSQIENEYGMVEHSFL 195
P + R+ P+ + +Y + N++ + +S G GPII Q+ENEYG +
Sbjct: 143 RDPTMHVRTSYGPYVDRVDKYLEKLSNLVNHMQFTSSYGKGPIIAFQVENEYGSFGYQDH 202
Query: 196 EKGPPYVRWAAKLAVDLQTGVPWVMCKQDDAPD-------PVINACNGRQCGET----FA 244
+ Y++ + L G+ + D P + Q G T
Sbjct: 203 PRDKAYLQHLSDKMKSL--GLKELFFTSDSPAGYLDWGSIPGVLQTANFQSGATQEFKML 260
Query: 245 GPNSPDKPAIWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTN 304
P+ P + TE W+ ++ + + R + + + +L + V++YM+HGGTN
Sbjct: 261 QELQPNMPLMVTEFWSGWFDHWTQDFR-KGLKLKDFETSLMEILSFDASVSFYMFHGGTN 319
Query: 305 FGRTASAYV-----------LTGYYDQAPLDEYG 327
FG A V +T Y APL E G
Sbjct: 320 FGFMNGANVRKEYPGGYLPDITSYDYDAPLSEAG 353
Score = 40.0 bits (92), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 7/86 (8%)
Query: 588 PWSRYGSSTHQPLTWYKTVFDAPTGSDPVAINLISMGKGEAWVNGQSIGRYWVSFLTPQG 647
P+S+ + PL ++ A SD I++ S GKG +VNG ++GRYW S++ PQ
Sbjct: 546 PFSKRSAGQPGPLLVRASLIVAGPISD-TFIDMSSWGKGVVFVNGFNLGRYW-SYMGPQK 603
Query: 648 TPSQSWYHIPRSFLKPTGNLLVLLEE 673
T ++P LK N +V+ E+
Sbjct: 604 T-----LYLPAPLLKRGENTIVIYEQ 624
>gi|301763008|ref|XP_002916930.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Ailuropoda
melanoleuca]
Length = 688
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 135/291 (46%), Gaps = 23/291 (7%)
Query: 33 DGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQPGQF 92
+G+ ++ +F GS+HY R + W + K K GL+ + T V WNLHEP+ G+F
Sbjct: 102 NGQYFMLEDSTFWIFGGSMHYFRVPKEYWRDRLLKMKACGLNTLTTYVPWNLHEPERGKF 161
Query: 93 DFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRSDNEPFKF 152
DFSG DL F+ GL+V LR GP+I E GGLP WL G+ R+ + F
Sbjct: 162 DFSGNLDLEAFVLMAAEIGLWVILRPGPYICSEIDLGGLPSWLLQDSGMRLRTTYKGFTE 221
Query: 153 HMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWAAKLAVDL 212
+ Y + M + L GGPII Q+ENEYG + P Y+ + K D
Sbjct: 222 AVDLYFDHL--MSRVVPLQYKHGGPIIAVQVENEYGSY-----NRDPAYMPYIKKALED- 273
Query: 213 QTGVPWVMCKQDDAP-------DPVINACNGR-----QCGETFAGPNSPDKPAIWTENWT 260
G+ ++ D+ D V+ N + Q F +P + E WT
Sbjct: 274 -RGIVELLLTSDNKDGLQKGVMDGVLATINLQSQHELQLLTNFLLSVQRVQPKMVMEYWT 332
Query: 261 SFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA 311
++ +G I + ++ V+ + GS +N YM+HGGTNFG A
Sbjct: 333 GWFDSWGGPHNILDSSEVLKTVSAILD--AGSSINLYMFHGGTNFGFINGA 381
>gi|328721397|ref|XP_003247292.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
Length = 628
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 167/349 (47%), Gaps = 28/349 (8%)
Query: 30 VTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLHEPQP 89
V Y+ + +G SGS+HY R W I K K GL+ + T V W+LHEP P
Sbjct: 17 VDYERNEFLKDGQVFRYVSGSLHYFRVPKPYWKDRIQKMKAAGLNAISTYVEWSLHEPYP 76
Query: 90 GQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHD-VPGIVFRSDNE 148
G+++F DL F++ V+ +G+Y+ LR GP+I E +GG PFWL + VP R+++
Sbjct: 77 GEYNFDDIADLEYFLQLVKDEGMYLLLRPGPYICAERDFGGFPFWLLNVVPKKRLRTNDP 136
Query: 149 PFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYG---MVEHSFL----EKGPPY 201
+K ++ ++ ++ M K R GG II+ Q+ENEYG + ++ + Y
Sbjct: 137 SYKHYVTKWFNVL--MPKIDRFLYGNGGNIIMVQVENEYGSYNACDQEYMLWLRDLYKRY 194
Query: 202 VRWAAKLAVDLQTGVPWVMCKQ-DDAPDPVINACNGRQCGETFAGPNSPDK--PAIWTEN 258
V + A L G + C D V + + + F + K P + +E
Sbjct: 195 VGYKALLYTTDGCGYSYFTCGAIPDVYATVDFGASVKDVSQCFKYMRTTQKRGPLVNSEY 254
Query: 259 WTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA------- 311
+ + + + + + S+ ++ + +A + +N+YM+HGGTNFG T+ A
Sbjct: 255 YAGWLSHWREPSPVISSYEVVETMKDMLA--LNASINFYMFHGGTNFGFTSGANKYESLK 312
Query: 312 ---YV--LTGYYDQAPLDEYGLLRQPKWGHLKELHSAVKLCLKPMLSGV 355
Y+ LT Y +PLDE G + K+ +K+L + +S V
Sbjct: 313 NPDYLPQLTSYDYNSPLDEAGDPTE-KYFKIKKLLEGTNFIVSNEISPV 360
Score = 47.0 bits (110), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 52/100 (52%), Gaps = 12/100 (12%)
Query: 602 WYKTVFDAPTG-SDPVAINLISMG--KGEAWVNGQSIGRYWVSFLTPQGTPSQSWYHIPR 658
+YKT F P G + P+ L G KG A+VNG +IGRYW P P + Y +P
Sbjct: 530 FYKTQFKLPDGLTKPLDTYLDVTGWKKGVAFVNGINIGRYW-----PSAGPQITLY-VPA 583
Query: 659 SFL--KPTGNLLVLLEEENGYPPGISIDTVSVTTLCGHVS 696
+FL +P N +V+LE E G P +SI L G ++
Sbjct: 584 TFLIPQPGLNTIVMLELE-GVPENLSISLTDKPILFGPIN 622
>gi|148273884|ref|YP_001223445.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
gi|147831814|emb|CAN02784.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
michiganensis NCPPB 382]
Length = 599
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 149/321 (46%), Gaps = 37/321 (11%)
Query: 26 GGNNVTYDGRSLIINGHRKILFSGSIHYPRSTPQMWPRLIAKAKEGGLDVVQTLVFWNLH 85
G ++ DGR HR I +G++HY R P W I KA+ GLD ++T V WN H
Sbjct: 14 GTDDFELDGRP-----HRVI--AGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAH 66
Query: 86 EPQPGQFDFSGRRDLVRFIKEVQAQGLYVCLRIGPFIEGEWGYGGLPFWLHDVPGIVFRS 145
P+ G FD S DL RF+ V A+G++ +R GP+I EW GGLP WL + P + R
Sbjct: 67 SPERGAFDTSAGLDLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFEDPAVGVRR 126
Query: 146 DNEPFKFHMKRYATMIVNMMKAARLYASQGGPIILSQIENEYGMVEHSFLEKGPPYVRWA 205
+ + + + ++ ++ GGP+IL QIENEYG Y+R
Sbjct: 127 SEPLYLAAVDEFLRRVYEIVAPRQI--DMGGPVILVQIENEYGAYGDD-----ADYLRHL 179
Query: 206 AKLAVDLQTGVPWVMCKQDDAPDPVINACN----------GRQCGETFAG--PNSPDKPA 253
L ++G+ + D D +++ + G + E A + P P
Sbjct: 180 VDLT--RESGIIVPLTTVDQPTDEMLSRGSLDELHRTGSFGSRATERLATLRRHQPTGPL 237
Query: 254 IWTENWTSFYQVYGDEARIRSAEDIAYHVALFIAKMKGSYVNYYMYHGGTNFGRTASA-- 311
+ +E W ++ +G+ SA D A + +A VN YM+HGGTNFG T A
Sbjct: 238 MCSEFWDGWFDHWGEHHHTTSAADAAAELDALLAAGAS--VNIYMFHGGTNFGFTNGANH 295
Query: 312 -----YVLTGYYDQAPLDEYG 327
+T Y APLDE G
Sbjct: 296 KGTYQSHVTSYDYDAPLDETG 316
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,108,900,063
Number of Sequences: 23463169
Number of extensions: 643388021
Number of successful extensions: 1576613
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2127
Number of HSP's successfully gapped in prelim test: 437
Number of HSP's that attempted gapping in prelim test: 1564339
Number of HSP's gapped (non-prelim): 5389
length of query: 807
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 656
effective length of database: 8,816,256,848
effective search space: 5783464492288
effective search space used: 5783464492288
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)