BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037925
         (821 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224116208|ref|XP_002317239.1| predicted protein [Populus trichocarpa]
 gi|222860304|gb|EEE97851.1| predicted protein [Populus trichocarpa]
          Length = 849

 Score = 1388 bits (3593), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 644/816 (78%), Positives = 724/816 (88%), Gaps = 3/816 (0%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYDH+ALVIDGKRRVLQSGSIHYPR+TPEVWPE+IRKSKEGGL+VIETYVFWNYHEP+R
Sbjct: 36  VTYDHKALVIDGKRRVLQSGSIHYPRTTPEVWPEIIRKSKEGGLDVIETYVFWNYHEPVR 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQYYFEGRFDLVRFVKTVQEAGLF+HLRIGPYACAEWNYGGFP+WLHFIPG+QFRT+N+ 
Sbjct: 96  GQYYFEGRFDLVRFVKTVQEAGLFVHLRIGPYACAEWNYGGFPLWLHFIPGVQFRTSNDI 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  MK FL KI+DLMK +NLFASQGGPIILAQVENEYGNV+WAYGVGGELYVKWAA+TA
Sbjct: 156 FKNAMKSFLTKIVDLMKDDNLFASQGGPIILAQVENEYGNVQWAYGVGGELYVKWAAETA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           ++LNT+VPWVMC QEDAPDP+INTCNGFYCD FTPNSPSKP MWTENYSGWFL+FGYAVP
Sbjct: 216 ISLNTTVPWVMCVQEDAPDPVINTCNGFYCDQFTPNSPSKPKMWTENYSGWFLAFGYAVP 275

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
           +RPVEDLAFAVARFFE GG+FQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ
Sbjct: 276 YRPVEDLAFAVARFFEYGGSFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 335

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PKWGHLR+LH AIK CEEYL+SSDP HQ+LG KLEAH+Y+K SNDCAAFLANYDS SDAN
Sbjct: 336 PKWGHLRDLHSAIKQCEEYLVSSDPVHQQLGNKLEAHVYYKHSNDCAAFLANYDSGSDAN 395

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           VTFNGN YFLPAWSVSIL DCKNV+FNTAKV++QR+ GD  F++   V+  L+A+S +SW
Sbjct: 396 VTFNGNTYFLPAWSVSILADCKNVIFNTAKVVTQRHIGDALFSRSTTVDGNLVAASPWSW 455

Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAAL 484
           Y+E+VGI GN SF +P L EQINTTKDTSD+LWY+ S++V  GQ KE  LNIESLGHAAL
Sbjct: 456 YKEEVGIWGNNSFTKPGLLEQINTTKDTSDFLWYSTSLYVEAGQDKEHLLNIESLGHAAL 515

Query: 485 VFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS 544
           VFVNK+ VAFGYGNHD A+F + ++I L EG NTLD+LSM++G+QNYG WFDV GAG+ S
Sbjct: 516 VFVNKRFVAFGYGNHDDASFSLTREISLEEGNNTLDVLSMLIGVQNYGPWFDVQGAGIHS 575

Query: 545 VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTT 604
           V L+DL   K+DLSSG+W YQVG+EGEY+GLD +SLANSS W QG++LPVNKSLIWYK T
Sbjct: 576 VFLVDLHKSKKDLSSGKWTYQVGLEGEYLGLDNVSLANSSLWSQGTSLPVNKSLIWYKAT 635

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
            +APEG GPLALNLASMGKGQAW+NGQSIGRYWSAYL+PS GCT  CDYRG+Y++ KCQK
Sbjct: 636 IIAPEGNGPLALNLASMGKGQAWINGQSIGRYWSAYLSPSAGCTDNCDYRGAYNSFKCQK 695

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVD 724
            CGQPAQTLYHIPRTWVHPGENLLV+HEELGGDPS+ISLLT+TGQ ICS VSE DPPP D
Sbjct: 696 KCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSQISLLTRTGQDICSIVSEDDPPPAD 755

Query: 725 SWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPIVQKAC 784
           SWKPNL  +S SP+VRL CE GWHIAAINFAS+G PEG CG+F PG CH D+L IVQKAC
Sbjct: 756 SWKPNLEFMSQSPEVRLTCEHGWHIAAINFASFGTPEGKCGTFTPGNCHADMLTIVQKAC 815

Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +G   CSIP+S+A LG     CPG++K   VEA CS
Sbjct: 816 IGHERCSIPISAAKLG---DPCPGVVKRFVVEALCS 848


>gi|255560830|ref|XP_002521428.1| beta-galactosidase, putative [Ricinus communis]
 gi|223539327|gb|EEF40918.1| beta-galactosidase, putative [Ricinus communis]
          Length = 841

 Score = 1344 bits (3479), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 638/819 (77%), Positives = 710/819 (86%), Gaps = 5/819 (0%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V+YDHRALVIDGKRRVLQSGSIHYPR+TPEVWP++IRKSKEGGL+VIETYVFWNYHE
Sbjct: 27  SGKVSYDHRALVIDGKRRVLQSGSIHYPRTTPEVWPDIIRKSKEGGLDVIETYVFWNYHE 86

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQYYFEGRFDLVRFVKT+QEAGL +HLRIGPYACAEWNYGGFP+WLHFIPGIQFRTT
Sbjct: 87  PVKGQYYFEGRFDLVRFVKTIQEAGLLVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTT 146

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  FKEEMK FL KI+++MK+ENLFASQGGPIILAQVENEYGNVEWAYG  GELYVKWAA
Sbjct: 147 NELFKEEMKLFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVEWAYGAAGELYVKWAA 206

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           +TAV+LNTSVPWVMC Q DAPDPIINTCNGFYCD F+PNSPSKP MWTENYSGWFLSFGY
Sbjct: 207 ETAVSLNTSVPWVMCAQVDAPDPIINTCNGFYCDRFSPNSPSKPKMWTENYSGWFLSFGY 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           A+P+RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF
Sbjct: 267 AIPYRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IRQPKWGHLR+LHKAIK CEE+LISSDP HQ+LG  LEAHIY+KSSNDCAAFLANYDSSS
Sbjct: 327 IRQPKWGHLRDLHKAIKQCEEHLISSDPIHQQLGNNLEAHIYYKSSNDCAAFLANYDSSS 386

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DANVTFNGN+YFLPAWSVSILPDCKNV+FNTAKV+   N GD  FA   +VNE+ L    
Sbjct: 387 DANVTFNGNIYFLPAWSVSILPDCKNVIFNTAKVLI-LNLGDDFFAHSTSVNEIPLEQIV 445

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           +SWY+E+VGI GN SF  P L EQINTTKD SD+LWY+ SI V   Q K++ LNIESLGH
Sbjct: 446 WSWYKEEVGIWGNNSFTAPGLLEQINTTKDISDFLWYSTSISVNADQVKDIILNIESLGH 505

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
           AALVFVNK LV   YGNHD A+F + +KI L EG NTLD+LSMM+G+QNYG WFDV GAG
Sbjct: 506 AALVFVNKVLVG-KYGNHDDASFSLTEKISLIEGNNTLDLLSMMIGVQNYGPWFDVQGAG 564

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
           +++V+L+     K DLSS +W YQVG+EGEY GLDK+SLANSS W QG++ P+NKSLIWY
Sbjct: 565 IYAVLLVGQSKVKIDLSSEKWTYQVGLEGEYFGLDKVSLANSSLWTQGASPPINKSLIWY 624

Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
           K TF+APEGKGPLALNLA MGKGQAWVNGQSIGRYW AYL+PSTGC   CDYRG+YD+ K
Sbjct: 625 KGTFVAPEGKGPLALNLAGMGKGQAWVNGQSIGRYWPAYLSPSTGCNDSCDYRGAYDSFK 684

Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
           C K CGQPAQTLYHIPRTWVHPGENLLV+HEELGGDPSKIS+LT+TG  ICS VSE DPP
Sbjct: 685 CLKKCGQPAQTLYHIPRTWVHPGENLLVLHEELGGDPSKISVLTRTGHEICSIVSEDDPP 744

Query: 722 PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPIVQ 781
           P DSWK +    S +P+VRL CE+GWHI +INFAS+G P G CG+F PG+CH D+L IVQ
Sbjct: 745 PADSWKSSSEFKSQNPEVRLTCEQGWHIKSINFASFGTPAGICGTFNPGSCHADMLDIVQ 804

Query: 782 KACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           KAC+GQ  CSI +S+A LG     CPG+LK  AVEA CS
Sbjct: 805 KACIGQEGCSISISAANLG---DPCPGVLKRFAVEARCS 840


>gi|116787095|gb|ABK24373.1| unknown [Picea sitchensis]
          Length = 861

 Score = 1045 bits (2703), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 504/843 (59%), Positives = 621/843 (73%), Gaps = 33/843 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHR+L+IDG+RRVL SGSIHYPRSTPE+WP++I+K+K+GGL+VIE+YVFWN HE
Sbjct: 28  AANVTYDHRSLLIDGQRRVLISGSIHYPRSTPEMWPDIIQKAKDGGLDVIESYVFWNMHE 87

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P + +YYFE RFDLV+FVK VQ+AGL +HLRIGPYACAEWNYGGFPVWLH IPGI FRT 
Sbjct: 88  PKQNEYYFEDRFDLVKFVKIVQQAGLLVHLRIGPYACAEWNYGGFPVWLHLIPGIHFRTD 147

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF AKI+D+MKQE LFASQGGPIILAQ+ENEYGN++  YG  G+ YVKWAA
Sbjct: 148 NEPFKNEMQRFTAKIVDMMKQEKLFASQGGPIILAQIENEYGNIDGPYGAAGKSYVKWAA 207

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV LNT VPWVMCQQ DAPDPIINTCNGFYCD FTPNSP+KP MWTEN+SGWFLSFG 
Sbjct: 208 SMAVGLNTGVPWVMCQQADAPDPIINTCNGFYCDAFTPNSPNKPKMWTENWSGWFLSFGG 267

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            +PFRP EDLAF+VARFF+ GGTFQNYYMY GGTNFGRT GGP +ATSYDYDAPIDEYG 
Sbjct: 268 RLPFRPTEDLAFSVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYGI 327

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHL+ELHKAIKLCE  L++++  +  LG+ LEAH+Y   S  CAAFLAN ++ S
Sbjct: 328 VRQPKWGHLKELHKAIKLCEAALVNAESNYTSLGSGLEAHVYSPGSGTCAAFLANSNTQS 387

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS- 420
           DA V FNGN Y LPAWSVSILPDCKNVVFNTAK+ SQ        + Q N   L+LA S 
Sbjct: 388 DATVKFNGNSYHLPAWSVSILPDCKNVVFNTAKIGSQTT------SVQMNPANLILAGSN 441

Query: 421 -----------AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ- 468
                      ++SW  E++GI G+ +F +P L EQINTT D+SDYLWYT SI V   + 
Sbjct: 442 SMKGTDSANAASWSWLHEQIGIGGSNTFSKPGLLEQINTTVDSSDYLWYTTSIQVDDNEP 501

Query: 469 ----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
               G +  L+++SLGHA  VF+N +    G G+   +   +   I L  G N +D+LS+
Sbjct: 502 FLHNGTQPVLHVQSLGHALHVFINGEFAGRGAGSSSSSKIALQTPITLKSGKNNIDLLSI 561

Query: 525 MVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS 583
            VGLQNYG++FD  GAG+   VIL   K+G+ DLS+ +W YQ+G+ GE +G+       S
Sbjct: 562 TVGLQNYGSFFDTWGAGITGPVILQGFKDGEHDLSTQQWTYQIGLTGEQLGIYSGDTKAS 621

Query: 584 SFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAP 643
           + W  GS LP  + +IWYKT F AP G  P+ALNL  MGKG AWVNGQSIGRYW +Y+A 
Sbjct: 622 AQWVAGSDLPTKQPMIWYKTNFDAPSGNDPVALNLLGMGKGVAWVNGQSIGRYWPSYIAS 681

Query: 644 STGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
            +GCT  CDYRG+Y ++KCQ +CGQP+Q LYH+PR+W+ P  N+LV+ EELGGDP++IS 
Sbjct: 682 QSGCTDSCDYRGAYSSTKCQTNCGQPSQKLYHVPRSWIQPTGNVLVLFEELGGDPTQISF 741

Query: 704 LTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLACERGWH-IAAINFASYG 758
           +T++   +C+ VSE   PPVDSWK +    L V     +++L C    H I +I FAS+G
Sbjct: 742 MTRSVGSLCAQVSETHLPPVDSWKSSATSGLEVNKPKAELQLHCPSSRHLIKSIKFASFG 801

Query: 759 IPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
             +G+CGSF  G C+ +  + IV++AC+G+  CS+ VS    G     C G +K LAVEA
Sbjct: 802 TSKGSCGSFTYGHCNTNSTMSIVEEACIGRESCSVEVSIEKFG---DPCKGTVKNLAVEA 858

Query: 818 HCS 820
            CS
Sbjct: 859 SCS 861


>gi|356539132|ref|XP_003538054.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
          Length = 836

 Score = 1033 bits (2672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 514/823 (62%), Positives = 609/823 (73%), Gaps = 15/823 (1%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 24  ANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY FEGR DLV+FVK V  AGL++HLRIGPYACAEWNYGGFP+WLHFIPGIQFRT N
Sbjct: 84  VRGQYNFEGRGDLVKFVKVVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PF+ EMK+F AKI+DLMKQENL+ASQGGPIIL+Q+ENEYGN+E  YG   + Y+KWAA 
Sbjct: 144 KPFEAEMKQFTAKIVDLMKQENLYASQGGPIILSQIENEYGNIEADYGPAAKSYIKWAAS 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L T VPWVMCQQ++APDPIIN CNGFYCD F PNS +KP +WTE Y+GWFL+FG A
Sbjct: 204 MATSLGTGVPWVMCQQQNAPDPIINACNGFYCDQFKPNSNTKPKIWTEGYTGWFLAFGDA 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGR +GGP VA+SYDYDAPIDEYGFI
Sbjct: 264 VPHRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRASGGPFVASSYDYDAPIDEYGFI 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL+++HKAIKLCEE LI++DPT   LG  +EA +Y K+   CAAFLAN  ++SD
Sbjct: 324 RQPKWGHLKDVHKAIKLCEEALIATDPTITSLGPNIEAAVY-KTGVVCAAFLANI-ATSD 381

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A VTFNGN Y LPAWSVSILPDCKNVV NTAK+ S            K+V  L  + S +
Sbjct: 382 ATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKITSASMISSFTTESLKDVGSLDDSGSRW 441

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
           SW  E +GIS   SF    L EQINTT D SDYLWY+ SI +    G + FL+I+SLGHA
Sbjct: 442 SWISEPIGISKADSFSTFGLLEQINTTADRSDYLWYSLSIDL--DAGAQTFLHIKSLGHA 499

Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
              F+N KL   G GNH+ AN  ++  I L  G NT+D+LS+ VGLQNYGA+FD  GAG+
Sbjct: 500 LHAFINGKLAGSGTGNHEKANVEVDIPITLVSGKNTIDLLSLTVGLQNYGAFFDTWGAGI 559

Query: 543 FS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
              VIL  LKNG   DLSS +W YQVG++ E +GL   S   S  W   STLP N+ L W
Sbjct: 560 TGPVILKCLKNGSNVDLSSKQWTYQVGLKNEDLGL---SSGCSGQWNSQSTLPTNQPLTW 616

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKT F+AP G  P+A++   MGKG+AWVNGQSIGRYW  Y +P  GCT  C+YRG+YDAS
Sbjct: 617 YKTNFVAPSGNNPVAIDFTGMGKGEAWVNGQSIGRYWPTYASPKGGCTDSCNYRGAYDAS 676

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
           KC K+CG+P+QTLYH+PR+W+ P  N LV+ EE GG+P +IS  TK    +CS VSE+ P
Sbjct: 677 KCLKNCGKPSQTLYHVPRSWLRPDRNTLVLFEESGGNPKQISFATKQIGSVCSHVSESHP 736

Query: 721 PPVDSWKPNL-GVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGACHMD-VL 777
           PPVDSW  N        P V L C      +++I FAS+G P G CG+F+ G C  +  L
Sbjct: 737 PPVDSWNSNTESGRKVVPVVSLECPYPNQVVSSIKFASFGTPLGTCGNFKHGLCSSNKAL 796

Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            IVQKAC+G   C I +S    G     C G+ K+LAVEA C+
Sbjct: 797 SIVQKACIGSSSCRIELSVNTFG---DPCKGVAKSLAVEASCA 836


>gi|356539454|ref|XP_003538213.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
          Length = 838

 Score = 1030 bits (2663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/823 (61%), Positives = 614/823 (74%), Gaps = 14/823 (1%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 25  ANVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 84

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           ++GQY FEGR DLV+FVK V  AGL++HLRIGPYACAEWNYGGFP+WLHFIPGIQFRT N
Sbjct: 85  VQGQYNFEGRADLVKFVKAVAAAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIQFRTDN 144

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PF+ EMKRF  KI+D+MKQE+L+ASQGGPIIL+QVENEYGN++ AYG   + Y+KWAA 
Sbjct: 145 KPFEAEMKRFTVKIVDMMKQESLYASQGGPIILSQVENEYGNIDAAYGPAAKSYIKWAAS 204

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG A
Sbjct: 205 MATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNAKPKMWTENWSGWFLSFGGA 264

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGRT GGP ++TSYDYDAPID+YG I
Sbjct: 265 VPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDQYGII 324

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL+++HKAIKLCEE LI++DPT    G  +EA +Y K+ + CAAFLAN  ++SD
Sbjct: 325 RQPKWGHLKDVHKAIKLCEEALIATDPTITSPGPNIEAAVY-KTGSICAAFLANI-ATSD 382

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ-QKNVNELLLASSA 421
           A VTFNGN Y LPAWSVSILPDCKNVV NTAK+ S            ++ V  L  + S 
Sbjct: 383 ATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSASMISSFTTESFKEEVGSLDDSGSG 442

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           +SW  E +GIS + SF +  L EQINTT D SDYLWY+ SI V    G +  L+IESLGH
Sbjct: 443 WSWISEPIGISKSDSFSKFGLLEQINTTADKSDYLWYSISIDVEGDSGSQTVLHIESLGH 502

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
           A   F+N K+   G GN   A   ++  + L  G N++D+LS+ VGLQNYGA+FD  GAG
Sbjct: 503 ALHAFINGKIAGSGTGNSGKAKVNVDIPVTLVAGKNSIDLLSLTVGLQNYGAFFDTWGAG 562

Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           +   VIL  LKNG   DLSS +W YQVG++ E +G    S  +S  W   STLP N+SLI
Sbjct: 563 ITGPVILKGLKNGSTVDLSSQQWTYQVGLKYEDLGP---SNGSSGQWNSQSTLPTNQSLI 619

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKT F+AP G  P+A++   MGKG+AWVNGQSIGRYW  Y++P+ GCT  C+YRG+Y +
Sbjct: 620 WYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVSPNGGCTDSCNYRGAYSS 679

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
           SKC K+CG+P+QTLYHIPR+W+ P  N LV+ EE GGDP++IS  TK    +CS VSE+ 
Sbjct: 680 SKCLKNCGKPSQTLYHIPRSWLQPDSNTLVLFEESGGDPTQISFATKQIGSMCSHVSESH 739

Query: 720 PPPVDSWKPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGACHMD-VL 777
           PPPVD W  + G     P + L C      I++I FAS+G P G CG+F+ G C  +  L
Sbjct: 740 PPPVDLWNSDKG-RKVGPVLSLECPYPNQLISSIKFASFGTPYGTCGNFKHGRCRSNKAL 798

Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            IVQKAC+G   C I +S    G     C G+ K+LAVEA C+
Sbjct: 799 SIVQKACIGSSSCRIGISINTFG---DPCKGVTKSLAVEASCA 838


>gi|359478691|ref|XP_002285084.2| PREDICTED: beta-galactosidase 8-like [Vitis vinifera]
 gi|297746241|emb|CBI16297.3| unnamed protein product [Vitis vinifera]
          Length = 846

 Score = 1021 bits (2641), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 501/834 (60%), Positives = 609/834 (73%), Gaps = 23/834 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            ++ VTYDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 22  FASTVTYDHRALVIDGKRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLH 81

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R QY F+GR DLV+FVKTV EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGIQFRT
Sbjct: 82  EPVRRQYDFKGRNDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIQFRT 141

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFKEEM+ F AKI+D+MK+ENL+ASQGGPIIL+Q+ENEYGN++ AYG   + Y++WA
Sbjct: 142 DNGPFKEEMQIFTAKIVDMMKKENLYASQGGPIILSQIENEYGNIDSAYGSAAKSYIQWA 201

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A +L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS  KP MWTEN++GWFLSFG
Sbjct: 202 ASMATSLDTGVPWVMCQQADAPDPMINTCNGFYCDQFTPNSVKKPKMWTENWTGWFLSFG 261

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RPVED+AFAVARFF+ GGTFQNYYMY GGTNFGRT GGP +ATSYDYDAPIDEYG
Sbjct: 262 GAVPYRPVEDIAFAVARFFQLGGTFQNYYMYHGGTNFGRTTGGPFIATSYDYDAPIDEYG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL++LHKAIKLCE  LI++DPT   LG  LEA +Y   +  CAAFLAN  ++
Sbjct: 322 LLRQPKWGHLKDLHKAIKLCEAALIATDPTITSLGTNLEASVYKTGTGSCAAFLANVRTN 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           SDA V F+GN Y LPAWSVSILPDCKNV  NTA++ S       P   Q+++   + +S 
Sbjct: 382 SDATVNFSGNSYHLPAWSVSILPDCKNVALNTAQINSM---AVMPRFMQQSLKNDIDSSD 438

Query: 421 AF----SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
            F    SW +E VGIS N +F +  L EQIN T D SDYLWY+ S  +   +     G +
Sbjct: 439 GFQSGWSWVDEPVGISKNNAFTKLGLLEQINITADKSDYLWYSLSTEIQGDEPFLEDGSQ 498

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L++ESLGHA   F+N KL   G GN   A   ++  + L  G NT+D+LS+ VGLQNY
Sbjct: 499 TVLHVESLGHALHAFINGKLAGSGTGNSGNAKVTVDIPVTLIHGKNTIDLLSLTVGLQNY 558

Query: 532 GAWFDVAGAGLFSVI-LIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
           GA++D  GAG+   I L  L NG   DLSS +W YQVG++GE +GL      +SS W  G
Sbjct: 559 GAFYDKQGAGITGPIKLKGLANGTTVDLSSQQWTYQVGLQGEELGLPS---GSSSKWVAG 615

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           STLP  + LIWYKTTF AP G  P+AL+   MGKG+AWVNGQSIGRYW AY++ + GCT 
Sbjct: 616 STLPKKQPLIWYKTTFDAPAGNDPVALDFMGMGKGEAWVNGQSIGRYWPAYVSSNGGCTS 675

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            C+YRG Y ++KC K+CG+P+Q LYH+PR+W+ P  N LV+ EE+GGDP++IS  TK  +
Sbjct: 676 SCNYRGPYSSNKCLKNCGKPSQQLYHVPRSWLQPSGNTLVLFEEIGGDPTQISFATKQVE 735

Query: 710 HICSFVSEADPPPVDSWKPNLGV-VSSSPQVRLACE-RGWHIAAINFASYGIPEGNCGSF 767
            +CS VSE  P PVD W  +L     SSP + L C      I++I FAS+G P G CGSF
Sbjct: 736 SLCSRVSEYHPLPVDMWGSDLTTGRKSSPMLSLECPFPNQVISSIKFASFGTPRGTCGSF 795

Query: 768 RPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
               C     L IVQ+AC+G   CSI VS    G     C G+ K+LAVEA C+
Sbjct: 796 SHSKCSSRTALSIVQEACIGSKSCSIGVSIDTFG---DPCSGIAKSLAVEASCT 846


>gi|356550171|ref|XP_003543462.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
          Length = 840

 Score = 1016 bits (2626), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 507/824 (61%), Positives = 609/824 (73%), Gaps = 13/824 (1%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN +EP
Sbjct: 24  ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLNEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY F+GR DLV+FVKTV  AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 84  VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF AKI+D++K+ENL+ASQGGP+IL+Q+ENEYGN++ AYG  G+ Y+KWAA 
Sbjct: 144 EPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNIDSAYGAAGKSYIKWAAT 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLPFGGA 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGII 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL+E+HKAIKLCEE LI++DPT   LG  LEA +Y K+ + CAAFLAN D+ SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVDTKSD 382

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
             V F+GN Y LPAWSVSILPDCKNVV NTAK+ S            K ++     +S+ 
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISSFTTESLKEDIGSSEASSTG 442

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           +SW  E VGIS   SF +  L EQINTT D SDYLWY+ SI      G +  L+IESLGH
Sbjct: 443 WSWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSIDYKGDAGSQTVLHIESLGH 502

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
           A   F+N KL     GN     F ++  + L  G NT+D+LS+ VGLQNYGA+FD  GAG
Sbjct: 503 ALHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGAG 562

Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           +   VIL  L NG   DLS  +W YQVG++GE +GL   S  +S  W   ST P N+ LI
Sbjct: 563 ITGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGL---SSGSSGQWNSQSTFPKNQPLI 619

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKTTF AP G  P+A++   MGKG+AWVNGQSIGRYW  Y+A   GCT  C+YRG Y A
Sbjct: 620 WYKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSA 679

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
           SKC+++CG+P+QTLYH+PR+W+ P  N+LV+ EE GGDP++IS +TK  + +C+ VS++ 
Sbjct: 680 SKCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSH 739

Query: 720 PPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-V 776
           PPPVD W  +        P + L C      I++I FASYG P G CG+F  G C  +  
Sbjct: 740 PPPVDLWNSDTESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKA 799

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           L IVQKAC+G   CS+ VSS   G     C G+ K+LAVEA C+
Sbjct: 800 LSIVQKACIGSSSCSVGVSSETFG---NPCRGVAKSLAVEATCA 840


>gi|356550173|ref|XP_003543463.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
          Length = 830

 Score = 1011 bits (2615), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 507/823 (61%), Positives = 608/823 (73%), Gaps = 21/823 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN +EP
Sbjct: 24  ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLNEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY F+GR DLV+FVKTV  AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 84  VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF AKI+D++K+ENL+ASQGGP+IL+Q+ENEYGN++ AYG  G+ Y+KWAA 
Sbjct: 144 EPFKAEMKRFTAKIVDMIKEENLYASQGGPVILSQIENEYGNIDSAYGAAGKSYIKWAAT 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLPFGGA 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGII 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL+E+HKAIKLCEE LI++DPT   LG  LEA +Y K+ + CAAFLAN D+ SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVDTKSD 382

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F+GN Y LPAWSVSILPDCKNVV NTAKV               ++   L +S+ +
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKVC---------LTNFISMFMWLPSSTGW 433

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
           SW  E VGIS   SF +  L EQINTT D SDYLWY+ SI      G +  L+IESLGHA
Sbjct: 434 SWISEPVGISKADSFPQTGLLEQINTTADKSDYLWYSLSIDYKGDAGSQTVLHIESLGHA 493

Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
              F+N KL     GN     F ++  + L  G NT+D+LS+ VGLQNYGA+FD  GAG+
Sbjct: 494 LHAFINGKLAGSQTGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGAGI 553

Query: 543 FS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
              VIL  L NG   DLS  +W YQVG++GE +GL   S  +S  W   ST P N+ LIW
Sbjct: 554 TGPVILKGLANGNTLDLSYQKWTYQVGLKGEDLGL---SSGSSGQWNSQSTFPKNQPLIW 610

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKTTF AP G  P+A++   MGKG+AWVNGQSIGRYW  Y+A   GCT  C+YRG Y AS
Sbjct: 611 YKTTFAAPSGSDPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASDAGCTDSCNYRGPYSAS 670

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
           KC+++CG+P+QTLYH+PR+W+ P  N+LV+ EE GGDP++IS +TK  + +C+ VS++ P
Sbjct: 671 KCRRNCGKPSQTLYHVPRSWLKPSGNILVLFEEKGGDPTQISFVTKQTESLCAHVSDSHP 730

Query: 721 PPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-VL 777
           PPVD W  +        P + L C      I++I FASYG P G CG+F  G C  +  L
Sbjct: 731 PPVDLWNSDTESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKAL 790

Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            IVQKAC+G   CS+ VSS   G     C G+ K+LAVEA C+
Sbjct: 791 SIVQKACIGSSSCSVGVSSETFG---NPCRGVAKSLAVEATCA 830


>gi|297822423|ref|XP_002879094.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
 gi|297324933|gb|EFH55353.1| beta-glactosidase 8 [Arabidopsis lyrata subsp. lyrata]
          Length = 846

 Score = 1009 bits (2608), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 493/837 (58%), Positives = 615/837 (73%), Gaps = 31/837 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 23  AVNVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIKKSKDGGLDVIETYVFWSGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P + +Y FEGR+DLV+FVK V+EAGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT 
Sbjct: 83  PEKNKYNFEGRYDLVKFVKLVEEAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFKEEM+RF  KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG   ++Y+KW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKIYIKWSA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFL FG 
Sbjct: 203 SMALSLDTGVPWNMCQQADAPDPMINTCNGFYCDQFTPNSNSKPKMWTENWSGWFLGFGD 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG 
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIKLCE+ LI++DPT   LG+ LEA +Y  +S  CAAFLAN  + S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGTKS 382

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DA V+FNG  Y LPAWSVSILPDCKNV FNTAK+    N+   P A  +   +    SSA
Sbjct: 383 DATVSFNGESYHLPAWSVSILPDCKNVAFNTAKI----NSATEPTAFARQSLKPDGGSSA 438

Query: 422 -----FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKE 471
                +S+ +E +GIS   +F++P L EQINTT D SDYLWY+  + +        +G +
Sbjct: 439 ELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSK 498

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L+IESLG     F+N KL   G+G    +   ++  I L  G NT+D+LS+ VGL NY
Sbjct: 499 AVLHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLAAGKNTVDLLSVTVGLANY 555

Query: 532 GAWFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
           GA+FD+ GAG+   V L   K G   DL+S +W YQVG++GE  GL  +   +SS W   
Sbjct: 556 GAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSK 612

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           S LP  + LIWYKTTF AP G  P+A++    GKG AWVNGQSIGRYW   +A + GCT 
Sbjct: 613 SPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTD 672

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TG 708
            CDYRGSY A+KC K+CG+P+QTLYH+PR+W+ P  N LV+ EE+GGDP++IS  TK TG
Sbjct: 673 SCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQTG 732

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNC 764
            ++C  VS++ PPPVD+W  +  + +   + P + L C      I++I FAS+G P+G C
Sbjct: 733 SNLCLMVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPVSTQVISSIKFASFGTPQGTC 792

Query: 765 GSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           GSF  G C+    L +VQKAC+G   C++ VS+   G     C G++K+LAVEA CS
Sbjct: 793 GSFTHGHCNSSRSLSVVQKACIGSRSCNVEVSTRVFGE---PCRGVIKSLAVEASCS 846


>gi|30683905|ref|NP_850121.1| beta-galactosidase 8 [Arabidopsis thaliana]
 gi|152013364|sp|Q9SCV4.2|BGAL8_ARATH RecName: Full=Beta-galactosidase 8; Short=Lactase 8; AltName:
           Full=Protein AR782; Flags: Precursor
 gi|330253033|gb|AEC08127.1| beta-galactosidase 8 [Arabidopsis thaliana]
          Length = 852

 Score = 1006 bits (2600), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/835 (58%), Positives = 612/835 (73%), Gaps = 27/835 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 29  AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P + +Y FEGR+DLV+FVK   +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT 
Sbjct: 89  PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFKEEM+RF  KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG   + Y+KW+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG 
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG 
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIKLCE+ LI++DPT   LG+ LEA +Y   S  CAAFLAN D+ S
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLA 418
           DA VTFNG  Y LPAWSVSILPDCKNV FNTAK+ S   +    FA+Q    +       
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPDGGSSAEL 446

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
            S +S+ +E +GIS   +F++P L EQINTT D SDYLWY+    +        +G +  
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L+IESLG     F+N KL   G+G    +   ++  I L  G NT+D+LS+ VGL NYGA
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGA 563

Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           +FD+ GAG+   V L   K G   DL+S +W YQVG++GE  GL  +   +SS W   S 
Sbjct: 564 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSP 620

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           LP  + LIWYKTTF AP G  P+A++    GKG AWVNGQSIGRYW   +A + GCT+ C
Sbjct: 621 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESC 680

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQH 710
           DYRGSY A+KC K+CG+P+QTLYH+PR+W+ P  N+LV+ EE+GGDP++IS  TK TG +
Sbjct: 681 DYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSN 740

Query: 711 ICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGS 766
           +C  VS++ PPPVD+W  +  + +   + P + L C      I +I FAS+G P+G CGS
Sbjct: 741 LCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGS 800

Query: 767 FRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           F  G C+    L +VQKAC+G   C++ VS+   G     C G++K+LAVEA CS
Sbjct: 801 FTQGHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 852


>gi|6686888|emb|CAB64744.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 852

 Score = 1006 bits (2600), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/835 (58%), Positives = 612/835 (73%), Gaps = 27/835 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 29  AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 88

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P + +Y FEGR+DLV+FVK   +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT 
Sbjct: 89  PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 148

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFKEEM+RF  KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG   + Y+KW+A
Sbjct: 149 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 208

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG 
Sbjct: 209 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 268

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG 
Sbjct: 269 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 328

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIKLCE+ LI++DPT   LG+ LEA +Y   S  CAAFLAN D+ S
Sbjct: 329 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 388

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLA 418
           DA VTFNG  Y LPAWSVSILPDCKNV FNTAK+ S   +    FA+Q    +       
Sbjct: 389 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPDGGSSAEL 446

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
            S +S+ +E +GIS   +F++P L EQINTT D SDYLWY+    +        +G +  
Sbjct: 447 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 506

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L+IESLG     F+N KL   G+G    +   ++  I L  G NT+D+LS+ VGL NYGA
Sbjct: 507 LHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGA 563

Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           +FD+ GAG+   V L   K G   DL+S +W YQVG++GE  GL  +   +SS W   S 
Sbjct: 564 FFDLMGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSP 620

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           LP  + LIWYKTTF AP G  P+A++    GKG AWVNGQSIGRYW   +A + GCT+ C
Sbjct: 621 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESC 680

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQH 710
           DYRGSY A+KC K+CG+P+QTLYH+PR+W+ P  N+LV+ EE+GGDP++IS  TK TG +
Sbjct: 681 DYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSN 740

Query: 711 ICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGS 766
           +C  VS++ PPPVD+W  +  + +   + P + L C      I +I FAS+G P+G CGS
Sbjct: 741 LCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGS 800

Query: 767 FRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           F  G C+    L +VQKAC+G   C++ VS+   G     C G++K+LAVEA CS
Sbjct: 801 FTQGHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 852


>gi|334184536|ref|NP_001189624.1| beta-galactosidase 8 [Arabidopsis thaliana]
 gi|330253034|gb|AEC08128.1| beta-galactosidase 8 [Arabidopsis thaliana]
          Length = 846

 Score = 1006 bits (2600), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/835 (58%), Positives = 612/835 (73%), Gaps = 27/835 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 23  AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P + +Y FEGR+DLV+FVK   +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT 
Sbjct: 83  PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFKEEM+RF  KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG   + Y+KW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG 
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG 
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIKLCE+ LI++DPT   LG+ LEA +Y   S  CAAFLAN D+ S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLA 418
           DA VTFNG  Y LPAWSVSILPDCKNV FNTAK+ S   +    FA+Q    +       
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPDGGSSAEL 440

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
            S +S+ +E +GIS   +F++P L EQINTT D SDYLWY+    +        +G +  
Sbjct: 441 GSQWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAV 500

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L+IESLG     F+N KL   G+G    +   ++  I L  G NT+D+LS+ VGL NYGA
Sbjct: 501 LHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGA 557

Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           +FD+ GAG+   V L   K G   DL+S +W YQVG++GE  GL  +   +SS W   S 
Sbjct: 558 FFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSP 614

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           LP  + LIWYKTTF AP G  P+A++    GKG AWVNGQSIGRYW   +A + GCT+ C
Sbjct: 615 LPTKQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESC 674

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQH 710
           DYRGSY A+KC K+CG+P+QTLYH+PR+W+ P  N+LV+ EE+GGDP++IS  TK TG +
Sbjct: 675 DYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSN 734

Query: 711 ICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGS 766
           +C  VS++ PPPVD+W  +  + +   + P + L C      I +I FAS+G P+G CGS
Sbjct: 735 LCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGS 794

Query: 767 FRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           F  G C+    L +VQKAC+G   C++ VS+   G     C G++K+LAVEA CS
Sbjct: 795 FTQGHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 846


>gi|4510395|gb|AAD21482.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 839

 Score = 1002 bits (2591), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/832 (59%), Positives = 610/832 (73%), Gaps = 28/832 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRALVIDGKR+VL SGSIHYPRSTPE+WPELI+KSK+GGL+VIETYVFW+ HE
Sbjct: 23  AANVTYDHRALVIDGKRKVLISGSIHYPRSTPEMWPELIQKSKDGGLDVIETYVFWSGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P + +Y FEGR+DLV+FVK   +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT 
Sbjct: 83  PEKNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFKEEM+RF  KI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ AYG   + Y+KW+A
Sbjct: 143 NEPFKEEMQRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSAYGAAAKSYIKWSA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A++L+T VPW MCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFL FG 
Sbjct: 203 SMALSLDTGVPWNMCQQTDAPDPMINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGD 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG 
Sbjct: 263 PSPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFDRTSGGPLISTSYDYDAPIDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIKLCE+ LI++DPT   LG+ LEA +Y   S  CAAFLAN D+ S
Sbjct: 323 LRQPKWGHLRDLHKAIKLCEDALIATDPTITSLGSNLEAAVYKTESGSCAAFLANVDTKS 382

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DA VTFNG  Y LPAWSVSILPDCKNV FNTAKV   + N         +  EL    S 
Sbjct: 383 DATVTFNGKSYNLPAWSVSILPDCKNVAFNTAKV---KFNSISKTPDGGSSAEL---GSQ 436

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFLNI 476
           +S+ +E +GIS   +F++P L EQINTT D SDYLWY+    +        +G +  L+I
Sbjct: 437 WSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRTDIKGDETFLDEGSKAVLHI 496

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           ESLG     F+N KL   G+G    +   ++  I L  G NT+D+LS+ VGL NYGA+FD
Sbjct: 497 ESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVTGTNTIDLLSVTVGLANYGAFFD 553

Query: 537 VAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           + GAG+   V L   K G   DL+S +W YQVG++GE  GL  +   +SS W   S LP 
Sbjct: 554 LVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLATV---DSSEWVSKSPLPT 610

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + LIWYKTTF AP G  P+A++    GKG AWVNGQSIGRYW   +A + GCT+ CDYR
Sbjct: 611 KQPLIWYKTTFDAPSGSEPVAIDFTGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYR 670

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQHICS 713
           GSY A+KC K+CG+P+QTLYH+PR+W+ P  N+LV+ EE+GGDP++IS  TK TG ++C 
Sbjct: 671 GSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCL 730

Query: 714 FVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
            VS++ PPPVD+W  +  + +   + P + L C      I +I FAS+G P+G CGSF  
Sbjct: 731 TVSQSHPPPVDTWTSDSKISNRNRTRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQ 790

Query: 770 GACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G C+    L +VQKAC+G   C++ VS+   G     C G++K+LAVEA CS
Sbjct: 791 GHCNSSRSLSLVQKACIGLRSCNVEVSTRVFGE---PCRGVVKSLAVEASCS 839


>gi|357453873|ref|XP_003597217.1| Beta-galactosidase [Medicago truncatula]
 gi|355486265|gb|AES67468.1| Beta-galactosidase [Medicago truncatula]
          Length = 833

 Score =  999 bits (2583), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 495/826 (59%), Positives = 606/826 (73%), Gaps = 20/826 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 20  TNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEP 79

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           ++GQY F+GR DLV+FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 80  VKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 139

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++  YG  G+ Y+ WAA 
Sbjct: 140 EPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWAAK 199

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG A
Sbjct: 200 MATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLSFGGA 259

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R+ GGP +ATSYDYDAPIDEYG I
Sbjct: 260 VPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDYDAPIDEYGII 319

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQ KWGHL+++HKAIKLCEE LI++DP    LG  LEA +Y K+ + CAAFLAN D+ +D
Sbjct: 320 RQQKWGHLKDVHKAIKLCEEALIATDPKISSLGQNLEAAVY-KTGSVCAAFLANVDTKND 378

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F+GN Y LPAWSVSILPDCKNVV NTAK+ S     +      ++++ L  +SS +
Sbjct: 379 KTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASAISNF---VTEDISSLETSSSKW 435

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
           SW  E VGIS +    +  L EQINTT D SDYLWY+ S+ +    G +  L+IESLGHA
Sbjct: 436 SWINEPVGISKDDILSKTGLLEQINTTADRSDYLWYSLSLDLADDPGSQTVLHIESLGHA 495

Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
              F+N KL     GN D +   ++  I L  G N +D+LS+ VGLQNYGA+FD  GAG+
Sbjct: 496 LHAFINGKLAGNQAGNSDKSKLNVDIPIALVSGKNKIDLLSLTVGLQNYGAFFDTVGAGI 555

Query: 543 FS-VILIDLKNGKR--DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
              VIL  LKNG    DLSS +W YQ+G++GE +    +S  +S  W   ST P N+ L+
Sbjct: 556 TGPVILKGLKNGNNTLDLSSRKWTYQIGLKGEDL---GLSSGSSGGWNSQSTYPKNQPLV 612

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKT F AP G  P+A++   MGKG+AWVNGQSIGRYW  Y+A + GCT  C+YRG Y +
Sbjct: 613 WYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTS 672

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
           SKC+K+CG+P+QTLYH+PR+++ P  N LV+ EE GGDP++IS  TK  + +CS VS++ 
Sbjct: 673 SKCRKNCGKPSQTLYHVPRSFLKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSH 732

Query: 720 PPPVDSWKPNL---GVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSFRPGACHMD 775
           PP +D W  +    G V   P + L+C      I++I FASYG P G CG+F  G C  +
Sbjct: 733 PPQIDLWNQDTESGGKV--GPALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSN 790

Query: 776 -VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             L IV+KAC+G   CS+ VS+   G     C G+ K+LAVEA C+
Sbjct: 791 KALSIVKKACIGSRSCSVGVSTDTFG---DPCRGVPKSLAVEATCA 833


>gi|357472237|ref|XP_003606403.1| Beta-galactosidase [Medicago truncatula]
 gi|355507458|gb|AES88600.1| Beta-galactosidase [Medicago truncatula]
          Length = 839

 Score =  998 bits (2581), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 500/824 (60%), Positives = 600/824 (72%), Gaps = 14/824 (1%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           +NVTYDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GG++VIETYVFWN HEP
Sbjct: 24  SNVTYDHRALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY FEGR DLV FVK V  AGL++HLRIGPY CAEWNYGGFP+WLHFI GI+FRT N
Sbjct: 84  VRGQYNFEGRGDLVGFVKAVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF AKI+D+MKQENL+ASQGGPIIL+Q+ENEYGN++       + Y+ WAA 
Sbjct: 144 EPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAAS 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+T VPW+MCQQ +APDPIINTCN FYCD FTPNS +KP MWTEN+SGWFL+FG A
Sbjct: 204 MATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGA 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNFGRT GGP ++TSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDI 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LHKAIKLCEE LI+SDPT    G  LE  +Y K+   C+AFLAN    SD
Sbjct: 324 RQPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVY-KTGAVCSAFLANI-GMSD 381

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
           A VTFNGN Y LP WSVSILPDCKNVV NTAKV +            K  V+ L  +SS 
Sbjct: 382 ATVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISSFATESLKEKVDSLDSSSSG 441

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           +SW  E VGIS   +F +  L EQINTT D SDYLWY+ SI      G +  L+IESLGH
Sbjct: 442 WSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSIVYEDNAGDQPVLHIESLGH 501

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
           A   FVN KL     G+   A   ++  I L  G NT+D+LS+ VGLQNYGA++D  GAG
Sbjct: 502 ALHAFVNGKLAGSKAGSSGNAKVNVDIPITLVTGKNTIDLLSLTVGLQNYGAFYDTVGAG 561

Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           +   VIL  LKNG   DL+S +W YQVG++GE++GL   S  N   W   S LP N+ L 
Sbjct: 562 ITGPVILKGLKNGSSVDLTSQQWTYQVGLQGEFVGL---SSGNVGQWNSQSNLPANQPLT 618

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKT F+AP G  P+A++   MGKG+AWVNGQSIGRYW  Y++P++GCT  C+YRG+Y A
Sbjct: 619 WYKTNFVAPSGSNPVAIDFTGMGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSA 678

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
           SKC K+CG+P+QTLYH+PR W+ P  N  V+ EE GGDP+KIS  TK  + +CS V+E+ 
Sbjct: 679 SKCLKNCGKPSQTLYHVPRAWLKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTESH 738

Query: 720 PPPVDSWKPNL-GVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGACHMD-V 776
           PPPVD+W  N        P + L C      I++I FAS+G P G CG++  G+C  +  
Sbjct: 739 PPPVDTWNSNAESERKVGPVLSLECPYPNQAISSIKFASFGTPRGTCGNYNHGSCSSNRA 798

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           L IVQKAC+G   C+I VS    G     C G+ K+LAVEA C+
Sbjct: 799 LSIVQKACIGSSSCNIGVSINTFG---NPCRGVTKSLAVEAACT 839


>gi|449462081|ref|XP_004148770.1| PREDICTED: beta-galactosidase 8-like [Cucumis sativus]
          Length = 844

 Score =  998 bits (2579), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/831 (58%), Positives = 608/831 (73%), Gaps = 22/831 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L+ NVTYDHRALVIDGKR+VL SGS+HYPRSTPE+WP +I+KSK+GGL+VIETYVFWN H
Sbjct: 23  LAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNLH 82

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R QY FEGR DLV+F+K V  AGL++H+RIGPY CAEWNYGGFPVWLHF+PG+QFRT
Sbjct: 83  EPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK EMKRF AKI+D++KQE L+ASQGGPIIL+Q+ENEYGNV+ ++G   + YV+WA
Sbjct: 143 DNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQWA 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A +LNT VPWVMC Q DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 203 ATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A+P+RPVEDLAFAVARF++TGG+ QNYYMY GGTNFGRT+GGP +ATSYDYDAPIDEYG
Sbjct: 263 GALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHLR++HKAIK+CEE L+S+DP    LG  LEA +Y KS + C+AFLAN D+ 
Sbjct: 323 LVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVY-KSGSQCSAFLANVDTQ 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK---VISQRNNGDHPFAQQKNVNELLL 417
           SD  VTFNGN Y LPAWSVSILPDCKNVV NTAK   V ++ +  + P     + +E   
Sbjct: 382 SDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAF- 440

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             S +SW +E +GIS N SF    L+EQINTT D SDYLWY+ S  +   +     G   
Sbjct: 441 -DSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNT 499

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L+++SLGH   VF+NKKL   G G+   +   ++  I L  G NT+D+LS+ VGLQNYG
Sbjct: 500 VLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYG 559

Query: 533 AWFDVAGAGLFSVILIDLK--NGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           A+F++ GAG+   + ++ +  N   DLSSG+W YQ+G+EGE +GL      ++S W    
Sbjct: 560 AFFELRGAGVTGPVKLENQKNNITVDLSSGQWTYQIGLEGEDLGLPS---GSTSQWLSQP 616

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            LP NK L WYKTTF AP G  PLAL+    GKG+AW+NG SIGRYW +Y+A S  CT  
Sbjct: 617 NLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIA-SGQCTSY 675

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDY+G+Y A+KC ++CG+P+QTLYH+P++W+ P  N LV+ EE+G DP++++  +K    
Sbjct: 676 CDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLGS 735

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
           +CS VSE+ PPPV+ W  +     + P + L C      I++I FAS+G P G CGSF  
Sbjct: 736 LCSHVSESHPPPVEMWSSDSKQQKTGPVLSLECPSPSQVISSIKFASFGTPRGTCGSFSH 795

Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G C   + L IVQKAC+G   CSI VS    G     C G  K+LAVEA+C
Sbjct: 796 GQCSTRNALSIVQKACIGSKSCSIDVSIKAFG---DPCRGKTKSLAVEAYC 843


>gi|449525184|ref|XP_004169598.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 8-like [Cucumis
           sativus]
          Length = 844

 Score =  997 bits (2578), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 490/831 (58%), Positives = 608/831 (73%), Gaps = 22/831 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L+ NVTYDHRALVIDGKR+VL SGS+HYPRSTPE+WP +I+KSK+GGL+VIETYVFWN H
Sbjct: 23  LAVNVTYDHRALVIDGKRKVLVSGSLHYPRSTPEMWPGIIQKSKDGGLDVIETYVFWNLH 82

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R QY FEGR DLV+F+K V  AGL++H+RIGPY CAEWNYGGFPVWLHF+PG+QFRT
Sbjct: 83  EPVRNQYDFEGRKDLVKFIKLVGAAGLYVHVRIGPYVCAEWNYGGFPVWLHFVPGVQFRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK EMKRF AKI+D++KQE L+ASQGGPIIL+Q+ENEYGNV+ ++G   + YV+WA
Sbjct: 143 DNEPFKAEMKRFTAKIVDVLKQEKLYASQGGPIILSQIENEYGNVQSSFGSAAKSYVQWA 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A +LNT VPWVMC Q DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 203 ATMATSLNTGVPWVMCNQPDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLSFG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A+P+RPVEDLAFAVARF++TGG+ QNYYMY GGTNFGRT+GGP +ATSYDYDAPIDEYG
Sbjct: 263 GALPYRPVEDLAFAVARFYQTGGSLQNYYMYHGGTNFGRTSGGPFIATSYDYDAPIDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHLR++HKAIK+CEE L+S+DP    LG  LEA +Y KS + C+AFLAN D+ 
Sbjct: 323 LVRQPKWGHLRDVHKAIKMCEEALVSTDPAVTSLGPNLEATVY-KSGSQCSAFLANVDTQ 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK---VISQRNNGDHPFAQQKNVNELLL 417
           SD  VTFNGN Y LPAWSVSILPDCKNVV NTAK   V ++ +  + P     + +E   
Sbjct: 382 SDKTVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSVTTRPSFSNQPLKVDVSASEAF- 440

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             S +SW +E +GIS N SF    L+EQINTT D SDYLWY+ S  +   +     G   
Sbjct: 441 -DSGWSWIDEPIGISKNNSFANLGLSEQINTTADKSDYLWYSLSTDIKGDEPYLANGSNT 499

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L+++SLGH   VF+NKKL   G G+   +   ++  I L  G NT+D+LS+ VGLQNYG
Sbjct: 500 VLHVDSLGHVLHVFINKKLAGSGKGSGGSSKVSLDIPITLVPGKNTIDLLSLTVGLQNYG 559

Query: 533 AWFDVAGAGLFS-VILIDLKNG-KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           A+F++ GAG+   V L + KN    DLSSG+W YQ+G+EGE +GL      ++S W    
Sbjct: 560 AFFELRGAGVTGPVKLENXKNNITVDLSSGQWTYQIGLEGEDLGLPS---GSTSQWLSQP 616

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            LP NK L WYKTTF AP G  PLAL+    GKG+AW+NG SIGRYW +Y+A S  CT  
Sbjct: 617 NLPKNKPLTWYKTTFDAPAGSDPLALDFTGFGKGEAWINGHSIGRYWPSYIA-SGQCTSY 675

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDY+G+Y A+KC ++CG+P+QTLYH+P++W+ P  N LV+ EE+G DP++++  +K    
Sbjct: 676 CDYKGAYSANKCLRNCGKPSQTLYHVPQSWLKPTGNTLVLFEEIGSDPTRLTFASKQLGS 735

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
           +CS VSE+ PPPV+ W  +     + P + L C      I++I FAS+G P G CGSF  
Sbjct: 736 LCSHVSESHPPPVEMWSSDSKQQKTGPVLSLECPSPSQVISSIKFASFGTPRGTCGSFSH 795

Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G C   + L IVQKAC+G   CSI VS    G     C G  K+LAVEA+C
Sbjct: 796 GQCSTRNALSIVQKACIGSKSCSIDVSIKAFG---DPCRGKTKSLAVEAYC 843


>gi|255578884|ref|XP_002530296.1| beta-galactosidase, putative [Ricinus communis]
 gi|223530194|gb|EEF32103.1| beta-galactosidase, putative [Ricinus communis]
          Length = 842

 Score =  995 bits (2573), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 488/832 (58%), Positives = 610/832 (73%), Gaps = 22/832 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            +ANVTYDHRAL+IDGKRRVL SGSIHYPRSTPE+WP LI+KSK+GGL+VIETYVFWN H
Sbjct: 21  FAANVTYDHRALLIDGKRRVLISGSIHYPRSTPEMWPGLIQKSKDGGLDVIETYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R QY FEGR+DLV+FVK V EAGL++H+RIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 81  EPVRNQYNFEGRYDLVKFVKLVAEAGLYVHIRIGPYVCAEWNYGGFPLWLHFIPGIKFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK EM+RF AKI+D+MKQE L+ASQGGPIIL+Q+ENEYGN++ A+G   + Y+ WA
Sbjct: 141 DNEPFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSAFGPAAKTYINWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A++L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWF SFG
Sbjct: 201 AGMAISLDTGVPWVMCQQADAPDPVINTCNGFYCDQFTPNSKNKPKMWTENWSGWFQSFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RPVEDLAFAVARF++  GTFQNYYMY GGTNFGRT GGP ++TSYDYDAP+DEYG
Sbjct: 261 GAVPYRPVEDLAFAVARFYQLSGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL+++HKAIKLCEE LI++DPT   LG+ LEA +Y K+ + CAAFLAN  ++
Sbjct: 321 LLRQPKWGHLKDVHKAIKLCEEALIATDPTTTSLGSNLEATVY-KTGSLCAAFLANI-AT 378

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLL 417
           +D  VTFNGN Y LPAWSVSILPDCKNV  NTAK+ S        FA+Q    +V+    
Sbjct: 379 TDKTVTFNGNSYNLPAWSVSILPDCKNVALNTAKINSVTIVPS--FARQSLVGDVDSSKA 436

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             S +SW  E VGIS N +FV+  L EQINTT D SDYLWY+ S ++   +     G + 
Sbjct: 437 IGSGWSWINEPVGISKNDAFVKSGLLEQINTTADKSDYLWYSLSTNIKGDEPFLEDGSQT 496

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L++ESLGHA   F+N KL   G G    A   ++  I L  G NT+D+LS+ VGLQNYG
Sbjct: 497 VLHVESLGHALHAFINGKLAGSGTGKSSNAKVTVDIPITLTPGKNTIDLLSLTVGLQNYG 556

Query: 533 AWFDVAGAGLFSVILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           A++++ GAG+   + +  +NG   DLSS +W YQ+G++GE  G+   S +    W    T
Sbjct: 557 AFYELTGAGITGPVKLKAQNGNTVDLSSQQWTYQIGLKGEDSGISSGSSSE---WVSQPT 613

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           LP N+ LIWYKT+F AP G  P+A++   MGKG+AWVNGQSIGRYW   ++PS+GC   C
Sbjct: 614 LPKNQPLIWYKTSFDAPAGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTNVSPSSGCADSC 673

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           +YRG Y ++KC K+CG+P+QT YHIPR+W+    N+LV+ EE+GGDP++I+  T+    +
Sbjct: 674 NYRGGYSSNKCLKNCGKPSQTFYHIPRSWIKSSGNILVLLEEIGGDPTQIAFATRQVGSL 733

Query: 712 CSFVSEADPPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
           CS VSE+ P PVD W  +  G   S P + L C      I++I FAS+G P G+CGS+  
Sbjct: 734 CSHVSESHPQPVDMWNTDSEGGKRSGPVLSLQCPHPDKVISSIKFASFGTPHGSCGSYSH 793

Query: 770 GAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G C     L IVQKACVG   C++ VS    G     C G+ K+LAVEA C+
Sbjct: 794 GKCSSTSALSIVQKACVGSKSCNVGVSINTFG---DPCRGVKKSLAVEASCT 842


>gi|56201401|dbj|BAD20774.2| beta-galactosidase [Raphanus sativus]
          Length = 851

 Score =  995 bits (2572), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/831 (58%), Positives = 605/831 (72%), Gaps = 25/831 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYDHRALVIDGKR++L SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP 
Sbjct: 32  SVTYDHRALVIDGKRKILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNGHEPE 91

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           + +Y FEGR+DLV+FVK   +AGL++HLRIGPYACAEWNYGGFPVWLHF+PGI+FRT N 
Sbjct: 92  KNKYNFEGRYDLVKFVKLAAKAGLYVHLRIGPYACAEWNYGGFPVWLHFVPGIKFRTDNE 151

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK EM+RF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++ +YG  G+ Y+KW+A  
Sbjct: 152 PFKAEMQRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSSYGAAGKSYMKWSASM 211

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A++L+T VPW MCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFL FG   
Sbjct: 212 ALSLDTGVPWNMCQQGDAPDPIINTCNGFYCDQFTPNSNNKPKMWTENWSGWFLGFGEPS 271

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF RT+GGPL++TSYDYDAPIDEYG +R
Sbjct: 272 PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFERTSGGPLISTSYDYDAPIDEYGLLR 331

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHLR+LHKAIKLCE+ LI++DP    LG+ LEA +Y  S+  CAAFLAN  + SDA
Sbjct: 332 QPKWGHLRDLHKAIKLCEDALIATDPKITSLGSNLEAAVYKTSTGSCAAFLANIGTKSDA 391

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLASS 420
            VTFNG  Y LPAWSVSILPDCKNV FNTAK+ S   +    FA+Q    N +      S
Sbjct: 392 TVTFNGKSYRLPAWSVSILPDCKNVAFNTAKINSATES--TAFARQSLKPNADSSAELGS 449

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFLN 475
            +S+ +E VGIS   +FV+P L EQINTT D SDYLWY+  + +        +G +  L+
Sbjct: 450 QWSYIKEPVGISKADAFVKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEGSKAVLH 509

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           ++S+G     F+N KL   G G    +   ++  I L  G NT+D+LS+ VGL NYG +F
Sbjct: 510 VQSIGQLVYAFINGKLAGSGNGKQKIS---LDIPINLVTGKNTIDLLSVTVGLANYGPFF 566

Query: 536 DVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           D+ GAG+   V L   K G   DLSS +W YQVG++GE  GL      +SS W   S LP
Sbjct: 567 DLTGAGITGPVSLKSAKTGSSTDLSSQQWTYQVGLKGEDKGLGS---GDSSEWVSNSPLP 623

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
            ++ LIWYKTTF AP G  P+A++    GKG AWVNGQSIGRYW   +A + GC   CDY
Sbjct: 624 TSQPLIWYKTTFDAPSGSDPVAIDFTGTGKGIAWVNGQSIGRYWPTSIARTDGCVGSCDY 683

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK-TGQHIC 712
           RGSY ++KC K+CG+P+QTLYH+PR+W+ P  N LV+ EE+GGDP+KIS  TK TG ++C
Sbjct: 684 RGSYRSNKCLKNCGKPSQTLYHVPRSWIKPSGNTLVLLEEMGGDPTKISFATKQTGSNLC 743

Query: 713 SFVSEADPPPVDSWKPNLGVVS-SSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPG 770
             VS++ P PVD+W  +    + +SP + L C      I++I FAS+G P G CGSF  G
Sbjct: 744 LTVSQSHPAPVDTWISDSKFSNRTSPVLSLKCPVSTQVISSIRFASFGTPTGTCGSFSYG 803

Query: 771 AC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            C     L +VQKACVG   C + VS+   G     C G++K+LAVEA C+
Sbjct: 804 HCSSARSLSVVQKACVGSRSCKVEVSTRVFGE---PCRGVVKSLAVEASCA 851


>gi|14970841|emb|CAC44501.1| beta-galactosidase [Fragaria x ananassa]
          Length = 840

 Score =  993 bits (2568), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 494/824 (59%), Positives = 598/824 (72%), Gaps = 21/824 (2%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V+YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP+
Sbjct: 29  TVSYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEPV 88

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY FEGR DLV FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT N 
Sbjct: 89  RGQYNFEGRNDLVGFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNE 148

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P+K EM RF AKI+++MK E L+ASQGGPIIL+Q+ENEYGN++ AYG   + Y+ WAA+ 
Sbjct: 149 PYKAEMHRFTAKIVEMMKNEKLYASQGGPIILSQIENEYGNIDKAYGPAAKTYINWAANM 208

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+L+T VPWVMCQQ DAP  +INTCNGFYCD F+PNS S P +WTEN+SGWFLSFG AV
Sbjct: 209 AVSLDTGVPWVMCQQADAPSSVINTCNGFYCDQFSPNSNSTPKIWTENWSGWFLSFGGAV 268

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGR++GGP +ATSYDYDAP+DEYG +R
Sbjct: 269 PQRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSSGGPFIATSYDYDAPLDEYGLLR 328

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHL+++HKAIKLCE  ++++DPT   LG  +EA +Y K+ + C+AFLAN D+ SDA
Sbjct: 329 QPKWGHLKDVHKAIKLCEPAMVATDPTISSLGQNIEAAVY-KTGSVCSAFLANVDTKSDA 387

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLLASS 420
            VTFNGN Y LPAWSVSILPDCKNVV NTAK+ +        F +Q    +V       S
Sbjct: 388 TVTFNGNSYQLPAWSVSILPDCKNVVINTAKINTATMVPS--FTRQSISADVEPTEAVGS 445

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +SW  E VGIS   +F R  L EQINTT D SDYLWY+ SI V  G   +  L+++SLG
Sbjct: 446 GWSWINEPVGISKGDAFTRVGLLEQINTTADKSDYLWYSTSIDVKGGYKAD--LHVQSLG 503

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   FVN KL   G GN   A   +   +E   G NT+D+LS+ VGLQNYGA+FD+ GA
Sbjct: 504 HALHAFVNGKLAGSGTGNSGNAKVSVEIPVEFASGKNTIDLLSLTVGLQNYGAFFDLVGA 563

Query: 541 GLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
           G+   V L    NG   DLSS +W YQ+G++GE    D+   + SS W    TLP N+ L
Sbjct: 564 GITGPVQLKGSANGTTIDLSSQQWTYQIGLKGE----DEDLPSGSSQWISQPTLPKNQPL 619

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYKT F AP G  P+AL+   MGKG+AWVNGQSIGRYW   +AP TGCT  C+YRG+Y 
Sbjct: 620 TWYKTQFDAPGGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNVAPKTGCT-DCNYRGAYS 678

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
           A KC+K+CG P+Q LYH+PR+W+    N LV+ EE+GGDP+++S  T+  + +CS VSE+
Sbjct: 679 ADKCRKNCGMPSQKLYHVPRSWMKSSGNTLVLFEEVGGDPTQLSFATRQVESLCSHVSES 738

Query: 719 DPPPVDSWKPNLGVVSSS-PQVRLACE-RGWHIAAINFASYGIPEGNCGSFRPGACHMD- 775
            P PVD W  +    S S P++ L C      I++I FASYG P G CGSF  G+C    
Sbjct: 739 HPSPVDMWSSDSKAGSKSRPRLSLECPFPNQVISSIKFASYGRPSGTCGSFSHGSCRSSR 798

Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            L IVQKACVG   CSI VS+   G     C GL K+LAVEA C
Sbjct: 799 ALSIVQKACVGSKSCSIEVSTHTFG---DPCKGLAKSLAVEASC 839


>gi|224106752|ref|XP_002314274.1| predicted protein [Populus trichocarpa]
 gi|222850682|gb|EEE88229.1| predicted protein [Populus trichocarpa]
          Length = 849

 Score =  991 bits (2563), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 485/830 (58%), Positives = 603/830 (72%), Gaps = 24/830 (2%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRAL+IDGKRRVL SGSIHYPRST E+W +LI+KSK+GGL+VIETYVFWN HEP+
Sbjct: 31  NVTYDHRALLIDGKRRVLVSGSIHYPRSTVEMWADLIQKSKDGGLDVIETYVFWNAHEPV 90

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           + QY FEGR+DLV+F+K V EAGL+ HLRIGPY CAEWNYGGFP+WLHF+PGI+FRT N 
Sbjct: 91  QNQYNFEGRYDLVKFIKLVGEAGLYAHLRIGPYVCAEWNYGGFPLWLHFVPGIKFRTDNE 150

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK EM+RF AKI+D+MKQE L+ASQGGPIIL+Q+ENEYGN++ +YG   + Y+ WAA  
Sbjct: 151 PFKAEMQRFTAKIVDMMKQEKLYASQGGPIILSQIENEYGNIDSSYGPAAKSYINWAASM 210

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG AV
Sbjct: 211 AVSLDTGVPWVMCQQADAPDPIINTCNGFYCDQFTPNSKNKPKMWTENWSGWFLSFGGAV 270

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+RPVEDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP ++TSYDYDAP+DEYG  R
Sbjct: 271 PYRPVEDLAFAVARFYQLGGTFQNYYMYHGGTNFGRSTGGPFISTSYDYDAPLDEYGLTR 330

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHL++LHK+IKLCEE L+++DP    LG  LEA +Y   +  C+AFLAN+  +SD 
Sbjct: 331 QPKWGHLKDLHKSIKLCEEALVATDPVTSSLGQNLEATVYKTGTGLCSAFLANF-GTSDK 389

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS---S 420
            V FNGN Y LP WSVSILPDCKNV  NTAK+ S     +  F  Q  + +   A    S
Sbjct: 390 TVNFNGNSYNLPGWSVSILPDCKNVALNTAKINSMTVIPN--FVHQSLIGDADSADTLGS 447

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++SW  E VGIS N +FV+P L EQINTT D SDYLWY+ S  +   +     G +  L+
Sbjct: 448 SWSWIYEPVGISKNDAFVKPGLLEQINTTADKSDYLWYSLSTVIKDNEPFLEDGSQTVLH 507

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           +ESLGHA   FVN KL   G GN   A   +   + L  G NT+D+LS+  GLQNYGA+F
Sbjct: 508 VESLGHALHAFVNGKLAGSGTGNAGNAKVAVEIPVTLLPGKNTIDLLSLTAGLQNYGAFF 567

Query: 536 DVAGAGLFSVILID-LKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++ GAG+   + ++ LKNG   DLSS +W YQ+G++GE +GL     + +S W     LP
Sbjct: 568 ELEGAGITGPVKLEGLKNGTTVDLSSLQWTYQIGLKGEELGLS----SGNSQWVTQPALP 623

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + LIWYKT+F AP G  P+A++ + MGKG+AWVNGQSIGRYW   ++P++GC+  C+Y
Sbjct: 624 TKQPLIWYKTSFNAPAGNDPIAIDFSGMGKGEAWVNGQSIGRYWPTKVSPTSGCS-NCNY 682

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
           RGSY +SKC K+C +P+QTLYH+PR+WV    N LV+ EE+GGDP++I+  TK    +CS
Sbjct: 683 RGSYSSSKCLKNCAKPSQTLYHVPRSWVESSGNTLVLFEEIGGDPTQIAFATKQSASLCS 742

Query: 714 FVSEADPPPVDSWKPNL-GVVSSSPQVRLACE-RGWHIAAINFASYGIPEGNCGSFRPGA 771
            VSE+ P PVD W  N      + P + L C      I++I FAS+G P G CGSF  G 
Sbjct: 743 HVSESHPLPVDMWSSNSEAERKAGPVLSLECPFPNQVISSIKFASFGTPRGTCGSFSHGQ 802

Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           C     L IVQKAC+G   CSI  S++  G     C G+ K+LAVEA C+
Sbjct: 803 CKSTRALSIVQKACIGSKSCSIGASASTFG---DPCRGVAKSLAVEASCA 849


>gi|356543464|ref|XP_003540180.1| PREDICTED: beta-galactosidase 8-like isoform 1 [Glycine max]
          Length = 840

 Score =  990 bits (2560), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 498/824 (60%), Positives = 596/824 (72%), Gaps = 13/824 (1%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 24  ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY F+GR DLV+FVKTV  AGL++HLRIGPY CAEWNYGGFPVWLHFIPGI+FRT N
Sbjct: 84  VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPVWLHFIPGIKFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF AKI+D++KQE L+ASQGGP+IL+Q+ENEYGN++ AYG  G+ Y+KWAA 
Sbjct: 144 EPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNIDTAYGAAGKSYIKWAAT 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+T VPWVMC Q DAPDPIINT NGFY D FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTKPKMWTENWSGWFLVFGGA 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R +GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGPFIATSYDYDAPIDEYGII 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL+E+HKAIKLCEE LI++DPT   LG  LEA +Y K+ + CAAFLAN  + SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVGTKSD 382

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
             V F+GN Y LPAWSVSILPDCK+VV NTAK+ S            K ++     +S+ 
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTESSKEDIGSSEASSTG 442

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           +SW  E VGIS   SF +  L EQINTT D SDYLWY+ SI        +  L+IESLGH
Sbjct: 443 WSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSIDYKADASSQTVLHIESLGH 502

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
           A   F+N KL     GN     F ++  + L  G NT+D+LS+ VGLQNYGA+FD  G G
Sbjct: 503 ALHAFINGKLAGSQPGNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGAFFDTWGVG 562

Query: 542 LFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           +   VIL    NG   DLSS +W YQVG++GE +GL   S  +S  W   ST P N+ L 
Sbjct: 563 ITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGL---SSGSSGQWNLQSTFPKNQPLT 619

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKTTF AP G  P+A++   MGKG+AWVNGQ IGRYW  Y+A    CT  C+YRG Y A
Sbjct: 620 WYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDSCNYRGPYSA 679

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
           SKC+K+C +P+QTLYH+PR+W+ P  N+LV+ EE GGDP++IS +TK  + +C+ VS++ 
Sbjct: 680 SKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTESLCAHVSDSH 739

Query: 720 PPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-V 776
           PPPVD W           P + L C      I++I FASYG P G CG+F  G C  +  
Sbjct: 740 PPPVDLWNSETESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYHGRCSSNKA 799

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           L IVQKAC+G   CS+ VSS   G     C G+ K+LAVEA C+
Sbjct: 800 LSIVQKACIGSSSCSVGVSSDTFG---DPCRGMAKSLAVEATCA 840


>gi|61162203|dbj|BAD91083.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 842

 Score =  989 bits (2556), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/831 (59%), Positives = 597/831 (71%), Gaps = 23/831 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HE 
Sbjct: 20  AKVTYDHRALVIDGKRRVLVSGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEA 79

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY F GR DLV+FVKTV EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGIQ RT N
Sbjct: 80  VRGQYDFGGRKDLVKFVKTVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIQLRTDN 139

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EM+RF AKI+D+MK+E L+ASQGGPIIL+Q+ENEYGN++ AYG   + Y+KWAAD
Sbjct: 140 EPFKAEMQRFTAKIVDMMKKEKLYASQGGPIILSQIENEYGNIDRAYGAAAQTYIKWAAD 199

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-PIMWTENYSGWFLSFGY 241
            AV+L+T VPWVMCQQ+DAP  +I+TCNGFYCD +TP  P K P MWTEN+SGWFLSFG 
Sbjct: 200 MAVSLDTGVPWVMCQQDDAPPSVISTCNGFYCDQWTPRLPEKRPKMWTENWSGWFLSFGG 259

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RPVEDLAFAVARFF+ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG 
Sbjct: 260 AVPQRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGL 319

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHL+++HKAIKLCEE ++++DP +   G  +EA +Y K+ + CAAFLAN D+ S
Sbjct: 320 LRQPKWGHLKDVHKAIKLCEEAMVATDPKYSSFGPNVEATVY-KTGSACAAFLANSDTKS 378

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR---NNGDHPFAQQKNVNELLLA 418
           DA VTFNGN Y LPAWSVSILPDCKNVV NTAK+ S     +   H      + +E L  
Sbjct: 379 DATVTFNGNSYHLPAWSVSILPDCKNVVLNTAKINSAAMIPSFMHHSVLDDIDSSEAL-- 436

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG-----QGKEVF 473
            S +SW  E VGIS   +F R  L EQINTT D SDYLWY+ SI V         G +  
Sbjct: 437 GSGWSWINEPVGISKKDAFTRVGLLEQINTTADKSDYLWYSLSIDVTSSDTFLQDGSQTI 496

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L++ESLGHA   F+N K    G    +     ++  +    G NT+D+LS+ +GLQNYGA
Sbjct: 497 LHVESLGHALHAFINGKPAGRGIITANNGKISVDIPVTFASGKNTIDLLSLTIGLQNYGA 556

Query: 534 WFDVAGAGLFS-VILIDLKNG-KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           +FD +GAG+   V L  LKNG   DLSS  W YQ+G++GE  G    S +    W    T
Sbjct: 557 FFDKSGAGITGPVQLKGLKNGTTTDLSSQRWTYQIGLQGEDSGFSSGSSSQ---WISQPT 613

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           LP  + L WYK TF AP+G  P+AL+   MGKG+AWVNGQSIGRYW    AP++GC   C
Sbjct: 614 LPKKQPLTWYKATFNAPDGSNPVALDFTGMGKGEAWVNGQSIGRYWPTNNAPTSGCPDSC 673

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           ++RG YD++KC+K+CG+P+Q LYH+PR+W+ P  N LV+ EE+GGDP++IS  T+  + +
Sbjct: 674 NFRGPYDSNKCRKNCGKPSQELYHVPRSWLKPSGNTLVLFEEIGGDPTQISFATRQIESL 733

Query: 712 CSFVSEADPPPVDSWKPNLGVVSS-SPQVRLACE-RGWHIAAINFASYGIPEGNCGSFRP 769
           CS VSE+ P PVD+W  +        P + L C      I++I FASYG P+G CGSF  
Sbjct: 734 CSHVSESHPSPVDTWSSDSKAGRKLGPVLSLECPFPNQVISSIKFASYGKPQGTCGSFSH 793

Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G C     L IVQKACVG   CSI VS    G     C G+ K+LAVEA C
Sbjct: 794 GQCKSTSALSIVQKACVGSKSCSIEVSVKTFG---DPCKGVAKSLAVEASC 841


>gi|152013362|sp|Q10NX8.2|BGAL6_ORYSJ RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
           Precursor
          Length = 858

 Score =  988 bits (2555), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/836 (57%), Positives = 595/836 (71%), Gaps = 24/836 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 30  AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
            +RGQY FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT 
Sbjct: 90  AVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 149

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  FK EM+RF  K++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++WAA
Sbjct: 150 NEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 209

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLSFG 
Sbjct: 210 GMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGG 269

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG 
Sbjct: 270 AVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGM 329

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
           +RQPKWGHLR++HKAIKLCE  LI+++P++  LG   EA +Y  + N  CAAFLAN D+ 
Sbjct: 330 VRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQ 389

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNE 414
           SD  V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ      R+ G        ++  
Sbjct: 390 SDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLIT 449

Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
             LA++ +S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V   +G E +L
Sbjct: 450 PELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDEPYL 506

Query: 475 N-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
           N       + SLGH   +++N KL     G+   +   +   + L  G N +D+LS  VG
Sbjct: 507 NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVG 566

Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
           L NYGA+FD+ GAG+   + +   NG  +LSS +W YQ+G+ GE + L   S A S  W 
Sbjct: 567 LSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SPEWV 625

Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
             +  P N+ LIWYKT F AP G  P+A++   MGKG+AWVNGQSIGRYW   LAP +GC
Sbjct: 626 SDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGC 685

Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
              C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS  T+ 
Sbjct: 686 VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQ 745

Query: 708 GQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCG 765
              IC+ VSE  P  +DSW  P     +  P +RL C R G  I+ I FAS+G P G CG
Sbjct: 746 TSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCG 805

Query: 766 SFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           ++  G C     L +VQ+ACVG   CS+PVSS   G     C G+ K+L VEA CS
Sbjct: 806 NYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 858


>gi|115451981|ref|NP_001049591.1| Os03g0255100 [Oryza sativa Japonica Group]
 gi|108707232|gb|ABF95027.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113548062|dbj|BAF11505.1| Os03g0255100 [Oryza sativa Japonica Group]
 gi|215695246|dbj|BAG90437.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 956

 Score =  988 bits (2554), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/836 (57%), Positives = 595/836 (71%), Gaps = 24/836 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 128 AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 187

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
            +RGQY FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+FRT 
Sbjct: 188 AVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKFRTD 247

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  FK EM+RF  K++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++WAA
Sbjct: 248 NEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 307

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLSFG 
Sbjct: 308 GMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGG 367

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG 
Sbjct: 368 AVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGM 427

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
           +RQPKWGHLR++HKAIKLCE  LI+++P++  LG   EA +Y  + N  CAAFLAN D+ 
Sbjct: 428 VRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQ 487

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNE 414
           SD  V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ      R+ G        ++  
Sbjct: 488 SDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLIT 547

Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
             LA++ +S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V   +G E +L
Sbjct: 548 PELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDEPYL 604

Query: 475 N-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
           N       + SLGH   +++N KL     G+   +   +   + L  G N +D+LS  VG
Sbjct: 605 NGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVG 664

Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
           L NYGA+FD+ GAG+   + +   NG  +LSS +W YQ+G+ GE + L   S A S  W 
Sbjct: 665 LSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SPEWV 723

Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
             +  P N+ LIWYKT F AP G  P+A++   MGKG+AWVNGQSIGRYW   LAP +GC
Sbjct: 724 SDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGC 783

Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
              C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS  T+ 
Sbjct: 784 VNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQ 843

Query: 708 GQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCG 765
              IC+ VSE  P  +DSW  P     +  P +RL C R G  I+ I FAS+G P G CG
Sbjct: 844 TSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCG 903

Query: 766 SFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           ++  G C     L +VQ+ACVG   CS+PVSS   G     C G+ K+L VEA CS
Sbjct: 904 NYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 956


>gi|61614851|gb|AAQ21371.2| beta-galactosidase [Sandersonia aurantiaca]
          Length = 818

 Score =  986 bits (2548), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 484/822 (58%), Positives = 597/822 (72%), Gaps = 18/822 (2%)

Query: 13  VIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR 72
           VIDG RRVL SGSIHYPRSTPE+WP+LI KSK GGL++IETYVFW+ HEP++GQY F+GR
Sbjct: 1   VIDGTRRVLISGSIHYPRSTPEMWPDLIDKSKSGGLDIIETYVFWDLHEPLQGQYDFQGR 60

Query: 73  FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF 132
            DLVRF+KTV EAGL++HLRIGPYACAEWNYGGFP+WLHFIPGI+FRT N PFK+EM+RF
Sbjct: 61  KDLVRFIKTVGEAGLYVHLRIGPYACAEWNYGGFPLWLHFIPGIKFRTDNKPFKDEMQRF 120

Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
             KI+DLMKQENL+ASQGGPIIL+Q+ENEYGN+++AYG   + Y+ WAA  A +L+T VP
Sbjct: 121 TTKIVDLMKQENLYASQGGPIILSQIENEYGNIDFAYGAAAKSYINWAASMATSLDTGVP 180

Query: 193 WVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLA 252
           WVMCQQ DAPDPIINTCNGFYCD F+PNS +KP +WTEN+SGWFLSFG  VP RPVEDLA
Sbjct: 181 WVMCQQTDAPDPIINTCNGFYCDQFSPNSNNKPKIWTENWSGWFLSFGGPVPQRPVEDLA 240

Query: 253 FAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
           FAVARFF+ GGTFQNYYMY  G NFG T+GGP +ATSYDYDAPIDEYG  RQPKWGHL+E
Sbjct: 241 FAVARFFQRGGTFQNYYMYTWGNNFGHTSGGPFIATSYDYDAPIDEYGITRQPKWGHLKE 300

Query: 313 LHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVY 372
           LHKAIKLCE  L+++D    +LG  LEAH+Y  +S  CAAFLAN  + SDA VTFNG  Y
Sbjct: 301 LHKAIKLCEPALVATDHHTLRLGPNLEAHVYKTASGVCAAFLANIGTQSDATVTFNGKSY 360

Query: 373 FLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNV--NELLLASSAF----SWYE 426
            LPAWSVSILPDC+ VVFNTA++ SQ  + +  +   +++  ++ + +S  F    S+  
Sbjct: 361 SLPAWSVSILPDCRTVVFNTAQINSQAIHSEMKYLNSESLTSDQQIGSSEVFQSDWSFVI 420

Query: 427 EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFLNIESLGH 481
           E VGIS + +  +  L EQINTT D SDYLWY+ SI +         G +  L+ ESLGH
Sbjct: 421 EPVGISKSNAIRKTGLLEQINTTADVSDYLWYSISIAIDGDEPFLSNGTQSNLHAESLGH 480

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
               FVN KL   G GN   A  +  K I L  G N++D+LS  VGLQNYGA+FD+ GAG
Sbjct: 481 VLHAFVNGKLAGSGIGNSGNAKIIFEKLIMLTPGNNSIDLLSATVGLQNYGAFFDLMGAG 540

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
           +   + +  +NG  DLSS  W YQ+G++GE + L + S  + S W   STLP N+ LIWY
Sbjct: 541 ITGPVKLKGQNGTLDLSSNAWTYQIGLKGEDLSLHENS-GDVSQWISESTLPKNQPLIWY 599

Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
           KTTF AP+G  P+A++   MGKG+AWVNGQSIGRYW  Y +P  GC+  C+YRG Y ASK
Sbjct: 600 KTTFNAPDGNDPVAIDFTGMGKGEAWVNGQSIGRYWPTYSSPQNGCSTACNYRGPYSASK 659

Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
           C K+CG+P+Q LYH+PR+++    N LV+ EE+GGDP++ISL TK    +C+ VSE+ P 
Sbjct: 660 CIKNCGKPSQILYHVPRSFIQSESNTLVLFEEMGGDPTQISLATKQMTSLCAHVSESHPA 719

Query: 722 PVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLP 778
           PVD+W         S P ++L C      I++I FAS+G P G CGSF    C    VL 
Sbjct: 720 PVDTWLSLQQKGKKSGPTIQLECPYPNQVISSIKFASFGTPSGMCGSFNHSQCSSASVLA 779

Query: 779 IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +VQKACVG   CS+ +SS  LG     C G++K+LAVEA CS
Sbjct: 780 VVQKACVGSKRCSVGISSKTLG---DPCRGVIKSLAVEAACS 818


>gi|125543160|gb|EAY89299.1| hypothetical protein OsI_10800 [Oryza sativa Indica Group]
          Length = 861

 Score =  985 bits (2547), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 484/839 (57%), Positives = 596/839 (71%), Gaps = 27/839 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 30  AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89

Query: 62  PIRGQ---YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF 118
           P+RGQ   Y FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+F
Sbjct: 90  PVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKF 149

Query: 119 RTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK 178
           RT N  FK EM+RF  K++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++
Sbjct: 150 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 209

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           WAA  AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLS
Sbjct: 210 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 269

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
           FG AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDE
Sbjct: 270 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 329

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANY 357
           YG +RQPKWGHLR++HKAIKLCE  LI+++P++  LG   EA +Y  + N  CAAFLAN 
Sbjct: 330 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANV 389

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKN 411
           D+ SD  V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ      R+ G        +
Sbjct: 390 DAQSDKAVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDS 449

Query: 412 VNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE 471
           +    LA++ +S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V   +G E
Sbjct: 450 LITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDE 506

Query: 472 VFLN-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
            +LN       + SLGH   V++N KL     G+   +   +   + L  G N +D+LS 
Sbjct: 507 PYLNGSQSNLLVNSLGHVLQVYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLST 566

Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
            VGL NYGA+FD+ GAG+   + +   NG  +LSS +W YQ+G+ GE + L   S A S 
Sbjct: 567 TVGLSNYGAFFDLIGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SP 625

Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
            W   +  P N+ LIWYKT F AP G  P+A++   MGKG+AWVNGQSIGRYW   LAP 
Sbjct: 626 EWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQ 685

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
           +GC   C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS  
Sbjct: 686 SGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFT 745

Query: 705 TKTGQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEG 762
           T+    IC+ VSE  P  +DSW  P     +  P +RL C R G  I+ I FAS+G P G
Sbjct: 746 TRQTSSICAHVSEMHPAQIDSWISPQQTSQTPGPALRLECPREGQVISNIKFASFGTPSG 805

Query: 763 NCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CG++  G C     L +VQ+ACVG   CS+PVSS   G     C G+ K+L VEA CS
Sbjct: 806 TCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 861


>gi|125583741|gb|EAZ24672.1| hypothetical protein OsJ_08441 [Oryza sativa Japonica Group]
          Length = 861

 Score =  983 bits (2542), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/839 (57%), Positives = 595/839 (70%), Gaps = 27/839 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRA+VIDG RRVL SGSIHYPRSTP++WP LI+KSK+GGL+VIETYVFW+ HE
Sbjct: 30  AANVTYDHRAVVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKSKDGGLDVIETYVFWDIHE 89

Query: 62  PIRGQ---YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF 118
            +RGQ   Y FEGR DLVRFVK V +AGL++HLRIGPY CAEWNYGGFPVWLHF+PGI+F
Sbjct: 90  AVRGQAQQYDFEGRKDLVRFVKAVADAGLYVHLRIGPYVCAEWNYGGFPVWLHFVPGIKF 149

Query: 119 RTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK 178
           RT N  FK EM+RF  K++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++
Sbjct: 150 RTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMR 209

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           WAA  AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLS
Sbjct: 210 WAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLS 269

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
           FG AVP+RP EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDE
Sbjct: 270 FGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDE 329

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANY 357
           YG +RQPKWGHLR++HKAIKLCE  LI+++P++  LG   EA +Y  + N  CAAFLAN 
Sbjct: 330 YGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANV 389

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKN 411
           D+ SD  V FNGN Y LPAWSVSILPDCKNVV NTA++ SQ      R+ G        +
Sbjct: 390 DAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDS 449

Query: 412 VNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE 471
           +    LA++ +S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V   +G E
Sbjct: 450 LITPELATAGWSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDE 506

Query: 472 VFLN-------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
            +LN       + SLGH   +++N KL     G+   +   +   + L  G N +D+LS 
Sbjct: 507 PYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLST 566

Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
            VGL NYGA+FD+ GAG+   + +   NG  +LSS +W YQ+G+ GE + L   S A S 
Sbjct: 567 TVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SP 625

Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
            W   +  P N+ LIWYKT F AP G  P+A++   MGKG+AWVNGQSIGRYW   LAP 
Sbjct: 626 EWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQ 685

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
           +GC   C+YRG+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS  
Sbjct: 686 SGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFT 745

Query: 705 TKTGQHICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEG 762
           T+    IC+ VSE  P  +DSW  P     +  P +RL C R G  I+ I FAS+G P G
Sbjct: 746 TRQTSSICAHVSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSG 805

Query: 763 NCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CG++  G C     L +VQ+ACVG   CS+PVSS   G     C G+ K+L VEA CS
Sbjct: 806 TCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 861


>gi|226503159|ref|NP_001146370.1| uncharacterized protein LOC100279948 precursor [Zea mays]
 gi|219886857|gb|ACL53803.1| unknown [Zea mays]
 gi|414865885|tpg|DAA44442.1| TPA: beta-galactosidase [Zea mays]
          Length = 852

 Score =  982 bits (2538), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 488/834 (58%), Positives = 600/834 (71%), Gaps = 23/834 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRALVIDG RRVL SGSIHYPRSTP++WP LI+K+K+GGL+VIETYVFW+ HE
Sbjct: 27  AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQY FEGR DL  FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT 
Sbjct: 87  PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF AK++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++WAA
Sbjct: 147 NEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAA 206

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG 
Sbjct: 207 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN  R++GGP +ATSYDYDAPIDEYG 
Sbjct: 267 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR++HKAIKLCE  LI++DP++  LG  +EA +Y K  + CAAFLAN D  S
Sbjct: 327 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVY-KVGSVCAAFLANIDGQS 385

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
           D  VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ    +  + +  NV         
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 445

Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
            LA S +S+  E VGI+ + +  +  L EQINTT D SD+LWY+ SI V   +G E +LN
Sbjct: 446 ELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 502

Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
                  + SLGH   V++N K+     G+   +     K IEL  G N +D+LS  VGL
Sbjct: 503 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 562

Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
            NYGA+FD+ GAG+   + +   NG  DLSS EW YQ+G+ GE + L   S A S  W  
Sbjct: 563 SNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 621

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
            +  P+N  LIWYKT F  P G  P+A++   MGKG+AWVNGQSIGRYW   LAP +GC 
Sbjct: 622 ANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCV 681

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
             C+YRG+Y +SKC K CGQP+QTLYH+PR+++ PG N LV+ E  GGDPSKIS + +  
Sbjct: 682 NSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFVMRQT 741

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSF 767
             +C+ VSEA P  +DSW     +    P +RL C + G  I+++ FAS+G P G CGS+
Sbjct: 742 GSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLECPKEGQVISSVKFASFGTPSGTCGSY 801

Query: 768 RPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             G C     L IVQ+AC+G   CS+PVSS Y G     C G+ K+LAVEA CS
Sbjct: 802 SHGECSSTQALSIVQEACIGVSSCSVPVSSNYFG---NPCTGVTKSLAVEAACS 852


>gi|356543466|ref|XP_003540181.1| PREDICTED: beta-galactosidase 8-like isoform 2 [Glycine max]
          Length = 848

 Score =  981 bits (2535), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 497/832 (59%), Positives = 595/832 (71%), Gaps = 21/832 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANV YDHRALVIDGKRRVL SGSIHYPRSTPE+WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 24  ANVEYDHRALVIDGKRRVLISGSIHYPRSTPEMWPDLIQKSKDGGLDVIETYVFWNLHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY F+GR DLV+FVKTV  AGL++HLRIGPY CAEWNYGGFPVWLHFIPGI+FRT N
Sbjct: 84  VRGQYDFDGRKDLVKFVKTVAAAGLYVHLRIGPYVCAEWNYGGFPVWLHFIPGIKFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF AKI+D++KQE L+ASQGGP+IL+Q+ENEYGN++ AYG  G+ Y+KWAA 
Sbjct: 144 EPFKAEMKRFTAKIVDMIKQEKLYASQGGPVILSQIENEYGNIDTAYGAAGKSYIKWAAT 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+T VPWVMC Q DAPDPIINT NGFY D FTPNS +KP MWTEN+SGWFL FG A
Sbjct: 204 MATSLDTGVPWVMCLQADAPDPIINTWNGFYGDEFTPNSNTKPKMWTENWSGWFLVFGGA 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R +GGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRASGGPFIATSYDYDAPIDEYGII 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL+E+HKAIKLCEE LI++DPT   LG  LEA +Y K+ + CAAFLAN  + SD
Sbjct: 324 RQPKWGHLKEVHKAIKLCEEALIATDPTITSLGPNLEAAVY-KTGSVCAAFLANVGTKSD 382

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASSA 421
             V F+GN Y LPAWSVSILPDCK+VV NTAK+ S            K ++     +S+ 
Sbjct: 383 VTVNFSGNSYHLPAWSVSILPDCKSVVLNTAKINSASAISSFTTESSKEDIGSSEASSTG 442

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           +SW  E VGIS   SF +  L EQINTT D SDYLWY+ SI        +  L+IESLGH
Sbjct: 443 WSWISEPVGISKTDSFSQTGLLEQINTTADKSDYLWYSLSIDYKADASSQTVLHIESLGH 502

Query: 482 AALVFVNKKLVA--------FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           A   F+N KL              N     F ++  + L  G NT+D+LS+ VGLQNYGA
Sbjct: 503 ALHAFINGKLAGKYKLKHSQLIICNSGKYKFTVDIPVTLVAGKNTIDLLSLTVGLQNYGA 562

Query: 534 WFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           +FD  G G+   VIL    NG   DLSS +W YQVG++GE +GL   S  +S  W   ST
Sbjct: 563 FFDTWGVGITGPVILKGFANGNTLDLSSQKWTYQVGLQGEDLGL---SSGSSGQWNLQST 619

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
            P N+ L WYKTTF AP G  P+A++   MGKG+AWVNGQ IGRYW  Y+A    CT  C
Sbjct: 620 FPKNQPLTWYKTTFSAPSGSDPVAIDFTGMGKGEAWVNGQRIGRYWPTYVASDASCTDSC 679

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           +YRG Y ASKC+K+C +P+QTLYH+PR+W+ P  N+LV+ EE GGDP++IS +TK  + +
Sbjct: 680 NYRGPYSASKCRKNCEKPSQTLYHVPRSWLKPSGNILVLFEERGGDPTQISFVTKQTESL 739

Query: 712 CSFVSEADPPPVDSWKPNL-GVVSSSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRP 769
           C+ VS++ PPPVD W           P + L C      I++I FASYG P G CG+F  
Sbjct: 740 CAHVSDSHPPPVDLWNSETESGRKVGPVLSLTCPHDNQVISSIKFASYGTPLGTCGNFYH 799

Query: 770 GACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G C  +  L IVQKAC+G   CS+ VSS   G     C G+ K+LAVEA C+
Sbjct: 800 GRCSSNKALSIVQKACIGSSSCSVGVSSDTFG---DPCRGMAKSLAVEATCA 848


>gi|326506982|dbj|BAJ95568.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 853

 Score =  980 bits (2533), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/832 (56%), Positives = 592/832 (71%), Gaps = 18/832 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYDHRALVIDG RRVL SGSIHYPRSTP++WP L++K+K+GGL+V+ETYVFW+ HE
Sbjct: 27  ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDVHE 86

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQY FEGR DLVRFVK   +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT 
Sbjct: 87  PVRGQYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTD 146

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF  K++  MK   L+ASQGGPIIL+Q+ENEYGN+  +YG  G+ Y++WAA
Sbjct: 147 NEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAA 206

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPWVMCQQ DAP+P+INTCNGFYCD FTP+ PS+P +WTEN+SGWFLSFG 
Sbjct: 207 GMAVALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGG 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG 
Sbjct: 267 AVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGL 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR++HKAIK+CE  LI++DP++  LG   EAH+Y KS + CAAFLAN D  S
Sbjct: 327 VRQPKWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVY-KSGSLCAAFLANIDDQS 385

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNEL 415
           D  VTFNG  Y LPAWSVSILPDCKNVV NTA++ SQ      RN G    A   +  E 
Sbjct: 386 DKTVTFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEA 445

Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKE 471
            LA+S++S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V  G+    G +
Sbjct: 446 ELAASSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQ 505

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L + SLGH   VF+N KL     G+   +   +   + L  G N +D+LS  VGL NY
Sbjct: 506 SNLLVNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNY 565

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           GA+FD+ GAG+   + +    G  DLSS EW YQ+G+ GE + L   S A S  W   ++
Sbjct: 566 GAFFDLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEA-SPEWVSDNS 624

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
            P N  L WYK+ F AP G  P+A++   MGKG+AWVNGQSIGRYW   +AP +GC   C
Sbjct: 625 YPTNNPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSGCVNSC 684

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           +YRGSY A+KC K CGQP+Q LYH+PR+++ PG N +V+ E+ GG+PSKIS  TK  + +
Sbjct: 685 NYRGSYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQTESV 744

Query: 712 CSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRP 769
           C+ VSE  P  +DSW      +  S P +RL C + G  I++I FAS+G P G CGS+  
Sbjct: 745 CAHVSEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFASFGTPSGTCGSYSH 804

Query: 770 GAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G C     L + Q+ACVG   CS+PVS+   G     C G+ K+L VEA CS
Sbjct: 805 GECSSSQALAVAQEACVGVSSCSVPVSAKNFG---DPCRGVTKSLVVEAACS 853


>gi|242036283|ref|XP_002465536.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
 gi|241919390|gb|EER92534.1| hypothetical protein SORBIDRAFT_01g040750 [Sorghum bicolor]
          Length = 860

 Score =  978 bits (2527), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 481/835 (57%), Positives = 604/835 (72%), Gaps = 24/835 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYDHRALVIDG RRVL SGSIHYPRSTP++WP +I+K+K+GGL+VIETYVFW+ HE
Sbjct: 34  ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGIIQKAKDGGLDVIETYVFWDIHE 93

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQY FEGR DL  FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT 
Sbjct: 94  PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 153

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF AK++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++WAA
Sbjct: 154 NEPFKTEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAA 213

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A++L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG 
Sbjct: 214 GMAISLDTGVPWVMCQQTDAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 273

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN  R++GGP +ATSYDYDAPIDEYG 
Sbjct: 274 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 333

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +R+PKWGHLR++HKAIKLCE  LI++DP++  LG   EA +Y K+ + CAAFLAN D  S
Sbjct: 334 VREPKWGHLRDVHKAIKLCEPALIATDPSYTSLGQNAEAAVY-KTGSVCAAFLANIDGQS 392

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
           D  VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ  + +  + +  N+         
Sbjct: 393 DKTVTFNGRMYRLPAWSVSILPDCKNVVLNTAQINSQVTSSEMRYLESSNMASDGSFITP 452

Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
            LA S +S+  E VGI+ + +  +  L EQINTT D SD+LWY+ SI V   +G E +LN
Sbjct: 453 ELAVSGWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 509

Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
                  + SLGH   V++N K+     G+   +     K IEL  G N +D+LS  VGL
Sbjct: 510 GSQSNLVVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 569

Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
            NYGA+FD+ GAG+   + +   NG  DLSS EW YQ+G+ GE + L   S A S  W  
Sbjct: 570 SNYGAFFDLVGAGITGPVKLSGTNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 628

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
            +  P+N+ LIWYKT F  P G  P+A++   MGKG+AWVNGQSIGRYW   LAP +GC 
Sbjct: 629 ANAYPINQPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCV 688

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
             C+YRGSY+++KC K CGQP+QTLYH+PR+++ PG N +V+ E+ GGDPSKIS + +  
Sbjct: 689 NSCNYRGSYNSNKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFVIRQT 748

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSS-SPQVRLACER-GWHIAAINFASYGIPEGNCGS 766
             +C+ VSE  P  +DSW  +   +    P++RL C + G  I++I FAS+G P G CGS
Sbjct: 749 GSVCAQVSEEHPAQIDSWNSSQQTMQRYGPELRLECPKDGQVISSIKFASFGTPSGTCGS 808

Query: 767 FRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +  G C     L +VQ+AC+G   CS+PVSS Y G     C G+ K+LAVEA CS
Sbjct: 809 YSHGECSSTQALSVVQEACIGVSSCSVPVSSNYFG---NPCTGVTKSLAVEAACS 860


>gi|385203117|gb|ADO34790.3| beta-galactosidase STBG5 [Solanum lycopersicum]
          Length = 852

 Score =  977 bits (2526), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 478/833 (57%), Positives = 600/833 (72%), Gaps = 22/833 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            +ANVTYDHRALV+DG+RRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 29  FAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R QY FEGR DL+ FVK V++AGLF+H+RIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 89  EPVRNQYDFEGRKDLINFVKLVEKAGLFVHIRIGPYVCAEWNYGGFPLWLHFIPGIEFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVK 178
            N PFK EMKRF AKI+D++KQENL+ASQGGP+IL+Q+ENEYGN  +E  YG   + YV 
Sbjct: 149 DNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIESRYGPRAKPYVN 208

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           WAA  A +LNT VPWVMCQQ DAP  +INTCNGFYCD F  NS   P MWTEN++GWFLS
Sbjct: 209 WAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQNSDKTPKMWTENWTGWFLS 268

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
           FG  VP+RPVED+AFAVARFF+ GGTFQNYYMY GGTNFGRT+GGP +ATSYDYDAP+DE
Sbjct: 269 FGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRTSGGPFIATSYDYDAPLDE 328

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
           YG I QPKWGHL++LHKAIKLCE  +++++P    LG+ +E  +Y K+ + CAAFLAN  
Sbjct: 329 YGLINQPKWGHLKDLHKAIKLCEAAMVATEPNITSLGSNIEVSVY-KTDSQCAAFLANTA 387

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           + SDA V+FNGN Y LP WSVSILPDCKNV F+TAK+ S        F  + +  +    
Sbjct: 388 TQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTIST--FVTRSSEADASGG 445

Query: 419 S-SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           S S ++   E VGIS   +F R  L EQINTT D SDYLWY+ S+++   +     G   
Sbjct: 446 SLSGWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDEPFLQDGSAT 505

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L++++LGH    ++N KL   G GN   +NF I   + L  G N +D+LS  VGLQNYG
Sbjct: 506 VLHVKTLGHVLHAYINGKLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYG 565

Query: 533 AWFDVAGAGLFS-VILIDLKNGK-RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           A+FD+ GAG+   V L   KNG   DLSS +W YQVG++GE +GL   S   S+ WK  +
Sbjct: 566 AFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGL---SNGGSTLWKSQT 622

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            LP N+ LIWYK +F AP G  PL+++   MGKG+AWVNGQSIGR+W AY+AP+ GCT  
Sbjct: 623 ALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDP 682

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C+YRG Y+A KC K+CG+P+Q LYH+PR+W+    N+LV+ EE+GGDP+K+S  T+  Q 
Sbjct: 683 CNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQS 742

Query: 711 ICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFR 768
           +CS +S+A P P+D W   +     S P + L C      I++I FAS+G P+G CGSF 
Sbjct: 743 VCSRISDAHPLPIDMWASEDDARKKSGPTLSLECPHPNQVISSIKFASFGTPQGTCGSFI 802

Query: 769 PGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G C   + L IV+KAC+G   CS+ VS    G     C G+ K+LAVEA C+
Sbjct: 803 HGRCSSSNALSIVKKACIGSKSCSLGVSINAFG---DPCKGVAKSLAVEASCT 852


>gi|350537827|ref|NP_001234312.1| TBG5 protein precursor [Solanum lycopersicum]
 gi|7939623|gb|AAF70824.1|AF154423_1 putative beta-galactosidase [Solanum lycopersicum]
          Length = 852

 Score =  975 bits (2520), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 477/833 (57%), Positives = 598/833 (71%), Gaps = 22/833 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            +ANVTYDHRALV+DG+RRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 29  FAANVTYDHRALVVDGRRRVLISGSIHYPRSTPDMWPDLIQKSKDGGLDVIETYVFWNLH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R QY FEGR DL+ FVK V+ AGLF+H+RIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 89  EPVRNQYDFEGRKDLINFVKLVERAGLFVHIRIGPYVCAEWNYGGFPLWLHFIPGIEFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVK 178
            N PFK EMKRF AKI+D++KQENL+ASQGGP+IL+Q+ENEYGN  +E  YG   + YV 
Sbjct: 149 DNEPFKAEMKRFTAKIVDMIKQENLYASQGGPVILSQIENEYGNGDIESRYGPRAKPYVN 208

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           WAA  A +LNT VPWVMCQQ DAP  +INTCNGFYCD F  NS   P MWTEN++GWFLS
Sbjct: 209 WAASMATSLNTGVPWVMCQQPDAPPSVINTCNGFYCDQFKQNSDKTPKMWTENWTGWFLS 268

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
           FG  VP+RPVED+AFAVARFF+ GGTFQNYYMY GGTNFGRT+GGP +ATSYDYDAP+DE
Sbjct: 269 FGGPVPYRPVEDIAFAVARFFQRGGTFQNYYMYHGGTNFGRTSGGPFIATSYDYDAPLDE 328

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
           YG I QPKWGHL++LHKAIKLCE  +++++P    LG+ +E  +Y K+ + CAAFLAN  
Sbjct: 329 YGLINQPKWGHLKDLHKAIKLCEAAMVATEPNVTSLGSNIEVSVY-KTDSQCAAFLANTA 387

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           + SDA V+FNGN Y LP WSVSILPDCKNV F+TAK+ S        F  + +  +    
Sbjct: 388 TQSDAAVSFNGNSYHLPPWSVSILPDCKNVAFSTAKINSASTIST--FVTRSSEADASGG 445

Query: 419 S-SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           S S ++   E VGIS   +F R  L EQINTT D SDYLWY+ S+++   +     G   
Sbjct: 446 SLSGWTSVNEPVGISNENAFTRMGLLEQINTTADKSDYLWYSLSVNIKNDEPFLQDGSAT 505

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L++++LGH    ++N +L   G GN   +NF I   + L  G N +D+LS  VGLQNYG
Sbjct: 506 VLHVKTLGHVLHAYINGRLSGSGKGNSRHSNFTIEVPVTLVPGENKIDLLSATVGLQNYG 565

Query: 533 AWFDVAGAGLFS-VILIDLKNGK-RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           A+FD+ GAG+   V L   KNG   DLSS +W YQVG++GE +GL   S   S+ WK  +
Sbjct: 566 AFFDLKGAGITGPVQLKGFKNGSTTDLSSKQWTYQVGLKGEDLGL---SNGGSTLWKSQT 622

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            LP N+ LIWYK +F AP G  PL+++   MGKG+AWVNGQSIGR+W AY+AP+ GCT  
Sbjct: 623 ALPTNQPLIWYKASFDAPAGDTPLSMDFTGMGKGEAWVNGQSIGRFWPAYIAPNDGCTDP 682

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C+YRG Y+A KC K+CG+P+Q LYH+PR+W+    N+LV+ EE+GGDP+K+S  T+  Q 
Sbjct: 683 CNYRGGYNAEKCLKNCGKPSQLLYHVPRSWLKSSGNVLVLFEEMGGDPTKLSFATREIQS 742

Query: 711 ICSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFR 768
           +CS  S+A P P+D W   +     S P + L C      I++I FAS+G P+G CGSF 
Sbjct: 743 VCSRTSDAHPLPIDMWASEDDARKKSGPTLSLECPHPNQVISSIKFASFGTPQGTCGSFI 802

Query: 769 PGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G C   + L IV+KAC+G   CS+ VS    G     C G+ K+LAVEA C+
Sbjct: 803 HGRCSSSNALSIVKKACIGSKSCSLGVSINAFG---DPCKGVAKSLAVEASCT 852


>gi|357113057|ref|XP_003558321.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 6-like
           [Brachypodium distachyon]
          Length = 852

 Score =  974 bits (2519), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 471/832 (56%), Positives = 585/832 (70%), Gaps = 18/832 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYDHRALVIDG RRVL SGSIHYPRSTP++WP L++K+K+GGL+V+ETYVFW+ HE
Sbjct: 26  ATNVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLMQKAKDGGLDVVETYVFWDIHE 85

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
               QY FEGR DLVRFVK   + GL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT 
Sbjct: 86  TATXQYDFEGRKDLVRFVKAAADTGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 145

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF  K++  MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++WAA
Sbjct: 146 NEPFKTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKSYIRWAA 205

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP +WTEN+SGWFLSFG 
Sbjct: 206 GMAVALDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSNSKPKLWTENWSGWFLSFGG 265

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG 
Sbjct: 266 AVPYRPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGL 325

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHL+++HKAIK CE  LI++DP++  +G   EAH+Y K+ + CAAFLAN D+ S
Sbjct: 326 VRQPKWGHLKDVHKAIKQCEPALIATDPSYMSMGQNAEAHVY-KAGSVCAAFLANMDTQS 384

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNEL 415
           D  VTFNGN Y LPAWSVSILPDCKNVV NTA++ SQ      R+ G    A   +  E 
Sbjct: 385 DKTVTFNGNAYKLPAWSVSILPDCKNVVLNTAQINSQTTTSEMRSLGSSTKASDGSSIET 444

Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKE 471
            LA S +S+  E VGI+   +  +P L EQINTT D SD+LWY+ S+ V  G+    G +
Sbjct: 445 ELALSGWSYAIEPVGITTENALTKPGLMEQINTTADASDFLWYSTSVVVKGGEPYLNGSQ 504

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L + SLGH    ++N K      G+   +   +   I L  G N +D+LS  VGL NY
Sbjct: 505 SNLLVNSLGHVLQAYINGKFAGSAKGSATSSLISLQTPITLVPGKNKIDLLSGTVGLSNY 564

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           GA+FD+ GAG+   + +    G  DLSS +W YQVG+ GE + L   S A S  W     
Sbjct: 565 GAFFDLVGAGITGPVKLSGPKGVLDLSSTDWTYQVGLRGEGLHLYNPSEA-SPEWVSDKA 623

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
            P N+ LIWYK+ F  P G  P+A++   MGKG+AWVNGQSIGRYW   LAP +GC   C
Sbjct: 624 YPTNQPLIWYKSKFTTPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSC 683

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           +YRG Y +SKC K CGQP+QTLYH+PR+++ PG N +V+ E+ GGDPSKIS  TK    +
Sbjct: 684 NYRGPYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDIVLFEQFGGDPSKISFTTKQTASV 743

Query: 712 CSFVSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRP 769
           C+ VSE  P  +DSW  P   V  S P +RL C + G  I++I FAS+G P G CG++  
Sbjct: 744 CAHVSEDHPDQIDSWISPQQKVQRSGPALRLECPKAGQVISSIKFASFGTPSGTCGNYNH 803

Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G C     L + Q+AC+G   CS+PVS+   G     C G+ K+L VEA CS
Sbjct: 804 GECSSPQALAVAQEACIGVSSCSVPVSTKNFG---DPCTGVTKSLVVEAACS 852


>gi|357453869|ref|XP_003597215.1| Beta-galactosidase [Medicago truncatula]
 gi|355486263|gb|AES67466.1| Beta-galactosidase [Medicago truncatula]
          Length = 866

 Score =  966 bits (2498), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 489/850 (57%), Positives = 598/850 (70%), Gaps = 41/850 (4%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 20  TNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEP 79

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           ++GQY F+GR DLV+FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT N
Sbjct: 80  VKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTDN 139

Query: 123 NPFK--EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            PFK   EMKRF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYG+++ AYG  G+ Y+ WA
Sbjct: 140 EPFKVEAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWA 199

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A +L+T VPWVMCQQEDAPD IINTCNGFYCD FTPNS +KP MWTEN+S W+L FG
Sbjct: 200 AKMATSLDTGVPWVMCQQEDAPDSIINTCNGFYCDQFTPNSNTKPKMWTENWSAWYLLFG 259

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYM---------------------YFGGTNFGR 279
              P RPVEDLAFAVARFF+ GGTFQNYYM                     Y GGTNF R
Sbjct: 260 GGFPHRPVEDLAFAVARFFQRGGTFQNYYMVLQPEMFFTSSIYYMVLFLRPYHGGTNFDR 319

Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
           + GGP +ATSYD+DAPIDEYG IRQPKWGHL++LHKA+KLCEE LI+++P    LG  LE
Sbjct: 320 STGGPFIATSYDFDAPIDEYGIIRQPKWGHLKDLHKAVKLCEEALIATEPKITSLGPNLE 379

Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR 399
           A +Y K+ + CAAFLAN D+ SD  V F+GN Y LPAWSVSILPDCKNVV NTAK+ S  
Sbjct: 380 AAVY-KTGSVCAAFLANVDTKSDKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSAS 438

Query: 400 NNGDHPFAQQK-NVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
              +      K +++ L  +SS +SW  E VGIS +  F +  L EQIN T D SDYLWY
Sbjct: 439 AISNFVTKSSKEDISSLETSSSKWSWINEPVGISKDDIFSKTGLLEQINITADRSDYLWY 498

Query: 459 TASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINT 518
           + S+ +    G +  L+IESLGHA   FVN KL     GN D     ++  I++  G N 
Sbjct: 499 SLSVDLKDDLGSQTVLHIESLGHALHAFVNGKLAGSHTGNKDKPKLNVDIPIKVIYGNNQ 558

Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKR--DLSSGEWIYQVGVEGEYIGL 575
           +D+LS+ VGLQNYGA+FD  GAG+   V L  LKNG    DLSS +W YQVG++GE +GL
Sbjct: 559 IDLLSLTVGLQNYGAFFDRWGAGITGPVTLKGLKNGNNTLDLSSQKWTYQVGLKGEDLGL 618

Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
              S  +S  W   ST P N+ LIWYKT F AP G  P+A++   MGKG+AWVNGQSIGR
Sbjct: 619 ---SSGSSEGWNSQSTFPKNQPLIWYKTNFDAPSGSNPVAIDFTGMGKGEAWVNGQSIGR 675

Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
           YW  Y+A +  CT  C+YRG +  +KC  +CG+P+QTLYH+PR+++ P  N LV+ EE G
Sbjct: 676 YWPTYVASNADCTDSCNYRGPFTQTKCHMNCGKPSQTLYHVPRSFLKPNGNTLVLFEENG 735

Query: 696 GDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSPQVRLAC-ERGWHIAA 751
           GDP++I+  TK  + +C+ VS++ PP +D W  +    G V   P + L C      I +
Sbjct: 736 GDPTQIAFATKQLESLCAHVSDSHPPQIDLWNQDTTSWGKVG--PALLLNCPNHNQVIFS 793

Query: 752 INFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLL 810
           I FASYG P G CG+F  G C  +  L IV+KAC+G   CSI VS+   G     C G+ 
Sbjct: 794 IKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSIGVSTDTFG---DPCRGVP 850

Query: 811 KALAVEAHCS 820
           K+LAVEA C+
Sbjct: 851 KSLAVEATCA 860


>gi|414865886|tpg|DAA44443.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
          Length = 830

 Score =  939 bits (2427), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 475/834 (56%), Positives = 582/834 (69%), Gaps = 45/834 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRALVIDG RRVL SGSIHYPRSTP++WP LI+K+K+GGL+VIETYVFW+ HE
Sbjct: 27  AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQY FEGR DL  FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT 
Sbjct: 87  PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF AKI                      ENEYGN++ AYG  G+ Y++WAA
Sbjct: 147 NEPFKAEMQRFTAKI----------------------ENEYGNIDSAYGAPGKAYMRWAA 184

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG 
Sbjct: 185 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 244

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN  R++GGP +ATSYDYDAPIDEYG 
Sbjct: 245 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 304

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR++HKAIKLCE  LI++DP++  LG  +EA +Y K  + CAAFLAN D  S
Sbjct: 305 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVY-KVGSVCAAFLANIDGQS 363

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
           D  VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ    +  + +  NV         
Sbjct: 364 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 423

Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
            LA S +S+  E VGI+ + +  +  L EQINTT D SD+LWY+ SI V   +G E +LN
Sbjct: 424 ELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 480

Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
                  + SLGH   V++N K+     G+   +     K IEL  G N +D+LS  VGL
Sbjct: 481 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 540

Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
            NYGA+FD+ GAG+   + +   NG  DLSS EW YQ+G+ GE + L   S A S  W  
Sbjct: 541 SNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 599

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
            +  P+N  LIWYKT F  P G  P+A++   MGKG+AWVNGQSIGRYW   LAP +GC 
Sbjct: 600 ANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCV 659

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
             C+YRG+Y +SKC K CGQP+QTLYH+PR+++ PG N LV+ E  GGDPSKIS + +  
Sbjct: 660 NSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEHFGGDPSKISFVMRQT 719

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSF 767
             +C+ VSEA P  +DSW     +    P +RL C + G  I+++ FAS+G P G CGS+
Sbjct: 720 GSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLECPKEGQVISSVKFASFGTPSGTCGSY 779

Query: 768 RPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             G C     L IVQ+AC+G   CS+PVSS Y G     C G+ K+LAVEA CS
Sbjct: 780 SHGECSSTQALSIVQEACIGVSSCSVPVSSNYFG---NPCTGVTKSLAVEAACS 830


>gi|108707233|gb|ABF95028.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
          Length = 796

 Score =  928 bits (2399), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 454/803 (56%), Positives = 564/803 (70%), Gaps = 24/803 (2%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +WP LI+KSK+GGL+VIETYVFW+ HE +RGQY FEGR DLVRFVK V +AGL++HLRIG
Sbjct: 1   MWPGLIQKSKDGGLDVIETYVFWDIHEAVRGQYDFEGRKDLVRFVKAVADAGLYVHLRIG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           PY CAEWNYGGFPVWLHF+PGI+FRT N  FK EM+RF  K++D MK   L+ASQGGPII
Sbjct: 61  PYVCAEWNYGGFPVWLHFVPGIKFRTDNEAFKAEMQRFTEKVVDTMKGAGLYASQGGPII 120

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
           L+Q+ENEYGN++ AYG  G+ Y++WAA  AV+L+T VPWVMCQQ DAPDP+INTCNGFYC
Sbjct: 121 LSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLDTGVPWVMCQQSDAPDPLINTCNGFYC 180

Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
           D FTPNS SKP MWTEN+SGWFLSFG AVP+RP EDLAFAVARF++ GGTFQNYYMY GG
Sbjct: 181 DQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPAEDLAFAVARFYQRGGTFQNYYMYHGG 240

Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
           TNFGR+ GGP +ATSYDYDAPIDEYG +RQPKWGHLR++HKAIKLCE  LI+++P++  L
Sbjct: 241 TNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWGHLRDVHKAIKLCEPALIAAEPSYSSL 300

Query: 335 GAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTA 393
           G   EA +Y  + N  CAAFLAN D+ SD  V FNGN Y LPAWSVSILPDCKNVV NTA
Sbjct: 301 GQNTEATVYQTADNSICAAFLANVDAQSDKTVKFNGNTYKLPAWSVSILPDCKNVVLNTA 360

Query: 394 KVISQ------RNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQIN 447
           ++ SQ      R+ G        ++    LA++ +S+  E VGI+   +  +P L EQIN
Sbjct: 361 QINSQVTTSEMRSLGSSIQDTDDSLITPELATAGWSYAIEPVGITKENALTKPGLMEQIN 420

Query: 448 TTKDTSDYLWYTASIHVMPGQGKEVFLN-------IESLGHAALVFVNKKLVAFGYGNHD 500
           TT D SD+LWY+ SI V   +G E +LN       + SLGH   +++N KL     G+  
Sbjct: 421 TTADASDFLWYSTSIVV---KGDEPYLNGSQSNLLVNSLGHVLQIYINGKLAGSAKGSAS 477

Query: 501 FANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG 560
            +   +   + L  G N +D+LS  VGL NYGA+FD+ GAG+   + +   NG  +LSS 
Sbjct: 478 SSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAFFDLVGAGVTGPVKLSGPNGALNLSST 537

Query: 561 EWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLAS 620
           +W YQ+G+ GE + L   S A S  W   +  P N+ LIWYKT F AP G  P+A++   
Sbjct: 538 DWTYQIGLRGEDLHLYNPSEA-SPEWVSDNAYPTNQPLIWYKTKFTAPAGDDPVAIDFTG 596

Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
           MGKG+AWVNGQSIGRYW   LAP +GC   C+YRG+Y ++KC K CGQP+QTLYH+PR++
Sbjct: 597 MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSNKCLKKCGQPSQTLYHVPRSF 656

Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW-KPNLGVVSSSPQV 739
           + PG N LV+ E+ GGDPS IS  T+    IC+ VSE  P  +DSW  P     +  P +
Sbjct: 657 LQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAHVSEMHPAQIDSWISPQQTSQTQGPAL 716

Query: 740 RLACER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSA 797
           RL C R G  I+ I FAS+G P G CG++  G C     L +VQ+ACVG   CS+PVSS 
Sbjct: 717 RLECPREGQVISNIKFASFGTPSGTCGNYNHGECSSSQALAVVQEACVGMTNCSVPVSSN 776

Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
             G     C G+ K+L VEA CS
Sbjct: 777 NFG---DPCSGVTKSLVVEAACS 796


>gi|168001886|ref|XP_001753645.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695052|gb|EDQ81397.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 929

 Score =  906 bits (2341), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 453/851 (53%), Positives = 577/851 (67%), Gaps = 41/851 (4%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD RAL+I+G+RR+L S  IHYPR+TPE+WP L++KSKEGG +V+++YVFWN HEP 
Sbjct: 34  NVTYDQRALIINGQRRMLISAGIHYPRATPEMWPSLVQKSKEGGADVVQSYVFWNGHEPK 93

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEGR+DLV+F+K VQ+AGL+ HLRIGPY CAEWN+GGFP WL  IPGI FRT N 
Sbjct: 94  QGQYNFEGRYDLVKFIKVVQQAGLYFHLRIGPYVCAEWNFGGFPYWLKDIPGIVFRTDNE 153

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+ F++KI++LMK+  LFA QGGPII+AQ+ENEYGN+EWA+G GG+ Y  WAA+ 
Sbjct: 154 PFKVAMEGFVSKIVNLMKENQLFAWQGGPIIMAQIENEYGNIEWAFGDGGKRYAMWAAEL 213

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+  VPWVMCQQ+DAP  IINTCNG+YCDGF  N+ +KP  WTE+++GWF  +G +V
Sbjct: 214 ALGLDAGVPWVMCQQDDAPGNIINTCNGYYCDGFKANTATKPAFWTEDWNGWFQYWGQSV 273

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVED AFA+ARFF+ GG+FQNYYMYFGGTNF RTAGGP + TSYDYDAP+DEYG IR
Sbjct: 274 PHRPVEDNAFAIARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTSYDYDAPLDEYGLIR 333

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           QPKWGHLR+LH AIKLCE  L + D  P    LG  +EAH+Y      CAAFLAN DS  
Sbjct: 334 QPKWGHLRDLHAAIKLCEPALTAVDEVPLSTWLGPNVEAHVY-SGRGQCAAFLANIDSWK 392

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS-- 419
            A V F G  Y LP WSVSILPDCKNVVFNTA+V +Q         + K   E+++ S  
Sbjct: 393 IATVQFKGKAYVLPPWSVSILPDCKNVVFNTAQVGAQTTLTRMTIVRSKLEGEVVMPSNM 452

Query: 420 -----------SAFSWYE--EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
                      S   W    E VGI G  + V   L EQ+N TKD++DYLWY+ SI V  
Sbjct: 453 LRKHAPESIVGSGLKWEASVEPVGIRGAATLVSNRLLEQLNITKDSTDYLWYSISIKVSV 512

Query: 465 -----MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
                +     +  L + S+  A  +FVN++LV    G    ++  + + + L EG N +
Sbjct: 513 EAVTALSKTKSQAILVLGSMRDAVHIFVNRQLVGSAMG----SDVQVVQPVPLKEGKNDI 568

Query: 520 DILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
           D+LSM VGLQNYGA+ +  GAG+  S +L  L +G  DLS+  W YQVG++GE   L + 
Sbjct: 569 DLLSMTVGLQNYGAYLETWGAGIRGSALLRGLPSGVLDLSTERWSYQVGIQGEEKRLFET 628

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
             A+   W   S+ P   +L WYKTTF AP+G  P+AL+L SMGKGQAWVNG  +GRYW 
Sbjct: 629 GTADGIQWDSSSSFPNASALTWYKTTFDAPKGTDPVALDLGSMGKGQAWVNGHHMGRYWP 688

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT-----LYHIPRTWVHPGENLLVIHEE 693
           + LA  +GC+  CDYRG+YDA KC+ +CG+P+Q      +YHIPR W+    NLLV+ EE
Sbjct: 689 SVLASQSGCS-TCDYRGAYDADKCRTNCGKPSQRWQYVDMYHIPRAWLQLSNNLLVLFEE 747

Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSPQVRLACERGWHIA 750
           +GGD SK+SL+T++   +C+ V E+ PPPV  W  N     + S S +  L C  G HI 
Sbjct: 748 IGGDVSKVSLVTRSAPAVCTHVHESQPPPVLFWPANSSMDAMSSRSGEAVLECIAGQHIR 807

Query: 751 AINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGL 809
            I FAS+G P+G+CG+F+ G CH M  L + +KAC+G   CSIPV     G     CP +
Sbjct: 808 HIKFASFGNPKGSCGNFQRGTCHAMKSLEVARKACMGMHRCSIPVQWQTFG-EFDPCPDV 866

Query: 810 LKALAVEAHCS 820
            K+LAV+  CS
Sbjct: 867 SKSLAVQVFCS 877


>gi|148906967|gb|ABR16628.1| unknown [Picea sitchensis]
          Length = 836

 Score =  906 bits (2341), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 448/832 (53%), Positives = 560/832 (67%), Gaps = 28/832 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYDH+ALVI+G+RR+L SGSIHYPRST E+WP+L RK+K+GGL+VI+TYVFWN H
Sbjct: 21  VECGVTYDHKALVINGERRILISGSIHYPRSTAEMWPDLFRKAKDGGLDVIQTYVFWNMH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGRFDLV+FVK  QEAGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81  EPSPGNYNFEGRFDLVKFVKLAQEAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  K++DLMK E LF SQGGPIILAQVENEY   E  YG+ G  Y+ WA
Sbjct: 141 DNEPFKNAMEGFTKKVVDLMKSEGLFESQGGPIILAQVENEYKPEEMEYGLAGAQYMNWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV ++T VPWVMC+Q+DAPDP+INTCNGFYCD F PN P KP MWTE +SGW+  FG
Sbjct: 201 AQMAVGMDTGVPWVMCKQDDAPDPVINTCNGFYCDNFVPNKPYKPTMWTEAWSGWYTEFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A P RPVEDLAFAVARFF  GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 261 GASPHRPVEDLAFAVARFFVKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPKWGHL+ELHKAIKLCE  L+S DP    LG   +A++Y   + +CAAF+ NYDS+
Sbjct: 321 LIRQPKWGHLKELHKAIKLCEPALVSGDPVVTSLGHFQQAYVYSAGAGNCAAFIVNYDSN 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S   V FNG  Y +  WSVSILPDC+NVVFNTAKV  Q        +Q K     +    
Sbjct: 381 SVGRVIFNGQRYKIAPWSVSILPDCRNVVFNTAKVDVQT-------SQMK-----MTPVG 428

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
            F W   +E +    + S     L EQIN T+D +DYLWY  S+ V   +     G    
Sbjct: 429 GFGWESIDENIASFEDNSISAVGLLEQINITRDNTDYLWYITSVEVDEDEPFIKNGGLPV 488

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L ++S G A  VF+N  L    YG  +      +  + LN G N + +LSM VGLQN G 
Sbjct: 489 LTVQSAGDALHVFINDDLAGSQYGRKENPKVRFSSGVRLNVGTNKISLLSMTVGLQNIGP 548

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F++A AG+   + L   K+G RDLSS  W YQ+G++GE + L   S  N+  W +G  +
Sbjct: 549 HFEMANAGVLGPITLSGFKDGTRDLSSQRWSYQIGLKGETMNL-HTSGDNTVEWMKGVAV 607

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
           P ++ L WYK  F AP G+ PL L+L+SMGKGQAWVNGQSIGRYW +YLA    C+  C 
Sbjct: 608 PQSQPLRWYKAEFDAPAGEDPLGLDLSSMGKGQAWVNGQSIGRYWPSYLAEGV-CSDGCS 666

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y G+Y   KC  +CGQ +Q  YH+PR+W+ P  N LV+ EE+GG+PS +SL+T++   +C
Sbjct: 667 YEGTYRPHKCDTNCGQSSQRWYHVPRSWLQPSGNTLVLFEEIGGNPSGVSLVTRSVDSVC 726

Query: 713 SFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           + VSE+    ++ W+            P+V L C +G  I+AI FAS+G P+G CGSF+ 
Sbjct: 727 AHVSESHSQSINFWRLESTDQVQKLHIPKVHLQCSKGQRISAIKFASFGTPQGLCGSFQQ 786

Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G CH  + +  +QK C+G  +CS+ VS    G     CPG+ K +A+EA CS
Sbjct: 787 GDCHSPNSVATIQKKCMGLRKCSLSVSEKIFG--GDPCPGVRKGVAIEAVCS 836


>gi|157313304|gb|ABV32545.1| beta-galactosidase protein 2 [Prunus persica]
          Length = 841

 Score =  892 bits (2305), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 448/837 (53%), Positives = 573/837 (68%), Gaps = 40/837 (4%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YD +A+VI+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 26  ASVSYDSKAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFE  +DLV+F+K +Q+AGL++HLRIGPY CAEWN+GGFPVWL +IPGIQFRT N
Sbjct: 86  SPGKYYFEDNYDLVKFIKLIQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIQFRTDN 145

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK +M+RF  KI+++MK E LF SQGGPIIL+Q+ENEYG +E+  G  G++Y  WAA 
Sbjct: 146 GPFKAQMQRFTTKIVNMMKAERLFQSQGGPIILSQIENEYGPMEYELGAPGKVYTDWAAH 205

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN   KP MWTE ++GW+  FG A
Sbjct: 206 MALGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWYTEFGGA 265

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 266 VPSRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+AIKLCE  L+S+DPT   LG   EAH++   S  CAAFLANY+  S 
Sbjct: 326 RQPKWGHLKDLHRAIKLCEPALVSADPTVTPLGTYQEAHVFKSKSGACAAFLANYNPRSF 385

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILPDCKN V+NTA+V +Q        AQ K     L    AF
Sbjct: 386 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------AQMKMPRVPL--HGAF 436

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           SW  Y ++     + SF    L EQINTT+D+SDYLWY   + + P +     GK   L 
Sbjct: 437 SWQAYNDETATYADTSFTTAGLLEQINTTRDSSDYLWYLTDVKIDPNEEFLRSGKYPVLT 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I S GHA  VF+N +L    YG+ +F     ++ + L  GIN + +LS+ VGL N G  F
Sbjct: 497 ILSAGHALRVFINGQLAGTSYGSLEFPKLTFSQGVNLRAGINQIALLSIAVGLPNVGPHF 556

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   VIL  L  G+RDLS  +W Y+VG++GE + L  +S ++S  W QGS +  
Sbjct: 557 ETWNAGVLGPVILNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWIQGSLVTR 616

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLAL++ SMGKGQ W+NG+SIGRYW AY A  +G    C+Y 
Sbjct: 617 RQPLTWYKTTFNAPAGNSPLALDMGSMGKGQVWINGRSIGRYWPAYKA--SGSCGACNYA 674

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           GSY   KC  +CG+ +Q  YH+PRTW++P  NLLV+ EE GGDP+ I L+ +    IC+ 
Sbjct: 675 GSYHEKKCLSNCGEASQRWYHVPRTWLNPTGNLLVVLEEWGGDPNGIFLVRREIDSICAD 734

Query: 715 VSEADPPPVDSWKPNL--------GVVSS--SPQVRLACERGWHIAAINFASYGIPEGNC 764
           + E        W+PNL        G V     P+  L+C  G  I++I FAS+G PEG C
Sbjct: 735 IYE--------WQPNLMSWQMQASGKVKKPVRPKAHLSCGPGQKISSIKFASFGTPEGGC 786

Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           GSFR G+CH  +     Q++C+GQ  CS+ V+    G     CP ++K L+VEA CS
Sbjct: 787 GSFREGSCHAHNSYDAFQRSCIGQNSCSVTVAPENFG--GDPCPNVMKKLSVEAICS 841


>gi|224087947|ref|XP_002308268.1| predicted protein [Populus trichocarpa]
 gi|222854244|gb|EEE91791.1| predicted protein [Populus trichocarpa]
          Length = 838

 Score =  890 bits (2301), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/832 (51%), Positives = 569/832 (68%), Gaps = 29/832 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+V+YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GG++VI+TYVFWN H
Sbjct: 24  VTASVSYDHKAVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGVDVIQTYVFWNGH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G YYFE R+DLV+F+K VQ+AGL+LHLRIGPY CAEWN+GGFPVWL ++PGI+FRT
Sbjct: 84  EPSPGNYYFEDRYDLVKFIKLVQQAGLYLHLRIGPYICAEWNFGGFPVWLKYVPGIEFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK E LF +QGGPIIL+Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 144 DNGPFKAAMQKFTEKIVGMMKSEKLFENQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           AD AV L T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN   KP +WTE ++GW+  FG
Sbjct: 204 ADMAVKLGTGVPWIMCKQEDAPDPMIDTCNGFYCENFKPNKDYKPKIWTEAWTGWYTEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP ED+AF+VARF + GG++ NYYMY GGTNFGRTAGGP +ATSYDYDAP+DE+G
Sbjct: 264 GAVPHRPAEDMAFSVARFIQNGGSYINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PKWGHLR+LHKAIKLCE  L+S DPT   LG+  EAH++ KS + CAAFLANYD+ 
Sbjct: 324 LPREPKWGHLRDLHKAIKLCEPALVSVDPTVTSLGSNQEAHVF-KSKSVCAAFLANYDTK 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               VTF    Y LP WSVSILPDCK  V+NTA++ SQ        +Q K    ++ ASS
Sbjct: 383 YSVKVTFGNGQYELPPWSVSILPDCKTAVYNTARLGSQS-------SQMK----MVPASS 431

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           +FSW    EE      + +     L EQIN T+D +DYLWY   + +   +     G+  
Sbjct: 432 SFSWQSYNEETASADDDDTTTMNGLWEQINVTRDATDYLWYLTDVKIDADEGFLKSGQNP 491

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L I S GHA  VF+N +L    YG         ++ I+L EGIN + +LS+ VGL N G
Sbjct: 492 LLTIFSAGHALHVFINGQLAGTAYGGLSNPKLTFSQNIKLTEGINKISLLSVAVGLPNVG 551

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   + L  L  G RDLS  +W Y++G++GE + L   S + S  W +GS 
Sbjct: 552 LHFETWNAGVLGPITLKGLNEGTRDLSGQKWSYKIGLKGESLSLHTASGSESVEWVEGSL 611

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   ++L WYKT F AP+G  PLAL+++SMGKGQ W+NGQ+IGR+W  Y+A   G    C
Sbjct: 612 LAQKQALTWYKTAFDAPQGNDPLALDMSSMGKGQMWINGQNIGRHWPGYIA--HGSCGDC 669

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           +Y G++D  KC+ +CG+P+Q  YH+PR+W+ P  NLL + EE GGDP+ IS + +T   +
Sbjct: 670 NYAGTFDDKKCRTNCGEPSQRWYHVPRSWLKPSGNLLAVFEEWGGDPTGISFVKRTTASV 729

Query: 712 CSFVSEADPPPVDSWK--PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           C+ + E   P + +W+   +  V+S  P+  L C  G  I+ I FAS+G+P+G CGSFR 
Sbjct: 730 CADIFEGQ-PALKNWQAIASGKVISPQPKAHLWCPTGQKISQIKFASFGMPQGTCGSFRE 788

Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G+CH        ++ CVG+  CS+ V+    G     CP   K L+VEA CS
Sbjct: 789 GSCHAHKSYDAFERNCVGKQSCSVTVAPEVFG--GDPCPDSAKKLSVEAVCS 838


>gi|242055159|ref|XP_002456725.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
 gi|241928700|gb|EES01845.1| hypothetical protein SORBIDRAFT_03g041450 [Sorghum bicolor]
          Length = 843

 Score =  888 bits (2295), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 438/827 (52%), Positives = 554/827 (66%), Gaps = 17/827 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           ++NVTYDHR+L+I G+RR++ S SIHYPRS PE+WP+L+ ++K+GG + IETYVFWN HE
Sbjct: 26  ASNVTYDHRSLIISGRRRLIISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQYYFE RFDLVRFVK V++AGL L LRIGP+  AEWN+GG PVWLH++PG  FRT 
Sbjct: 86  IAPGQYYFEDRFDLVRFVKVVKDAGLLLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTD 145

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-EWAYGVGGELYVKWA 180
           N PFK  MK F   I+++MK+E LFASQGG IILAQ+ENEYG+  E AY  GG+ Y  WA
Sbjct: 146 NEPFKSHMKSFTTYIVNMMKKEQLFASQGGNIILAQIENEYGDYYEQAYAPGGKPYAMWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV  NT VPW+MCQ+ DAPDP+IN+CNGFYCDGF PNSP+KP +WTEN+ GWF +FG
Sbjct: 206 ASMAVAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKLWTENWPGWFQTFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            + P RP ED+AFAVARFFE GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKW HLR+LHK+I+LCE  L+  + T   LG K EA IY   S  C AFLAN DS+
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +D  VTF    Y LPAWSVSILPDC+NVVFNTAKV SQ        +    V E L AS 
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQT-------SMVAMVPESLQASK 438

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP--GQGKEVFLNI 476
              W  + E+ GI G   FVR    + INTTKD++DYLWYT S  V     +G  V LNI
Sbjct: 439 PERWNIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDESYSKGSHVVLNI 498

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GH    F+N + +   YGN   ++F +   I L  G N L +LSM VGLQN G  ++
Sbjct: 499 DSKGHGVHAFLNNEFIGSAYGNGSQSSFSVKLPINLRTGKNELALLSMTVGLQNAGFSYE 558

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             GAG  +V +  ++NG  +LSS  W Y++G+EGEY  L K    N+  W   S  P N+
Sbjct: 559 WIGAGFTNVNISGVRNGTINLSSNNWAYKIGLEGEYYSLFKPDQRNNQRWIPQSEPPKNQ 618

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK     P+G  P+ +++ SMGKG  W+NG +IGRYW    +    CT  CDYRG 
Sbjct: 619 PLTWYKVNVDVPQGDDPVGIDMQSMGKGLVWLNGNAIGRYWPRTSSIDDRCTPSCDYRGE 678

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           ++ +KC+  CGQP Q  YHIPR+W HP  N+LVI EE GGDP+KI+   +    +CSFVS
Sbjct: 679 FNPNKCRTGCGQPTQRWYHIPRSWFHPSGNILVIFEEKGGDPTKITFSRRAVTSVCSFVS 738

Query: 717 EADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
           E  P   ++SW  +     +SP + +L+C  G +I+++ FAS G P G C S++ G+CH 
Sbjct: 739 EHFPSIDLESWDGSATNEGTSPAKAQLSCPIGKNISSLKFASLGTPSGTCRSYQKGSCHH 798

Query: 775 -DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            + L +V+KAC+    C++ +S    G     CPG+ K LA+EA CS
Sbjct: 799 PNSLSVVEKACLNTNSCTVSLSDESFG--KDLCPGVTKTLAIEADCS 843


>gi|114217395|dbj|BAF31233.1| beta-D-galactosidase [Persea americana]
          Length = 849

 Score =  887 bits (2291), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/829 (52%), Positives = 559/829 (67%), Gaps = 27/829 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + +V+YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 36  TCSVSYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 95

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFEGR+DLV+F+K V+EAGL++HLRIGPYACAEWN+GGFPVWL +IPGI FRT 
Sbjct: 96  PSPGEYYFEGRYDLVKFIKLVKEAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRTD 155

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M  F  KI+D+MK+E LF +QGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA
Sbjct: 156 NEPFKTAMAGFTKKIVDMMKEEELFETQGGPIILSQIENEYGPVEWEIGAPGQAYTKWAA 215

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + AV L T VPWVMC+Q+DAPDPIINTCN  YCD F+PN   KP MWTE ++ WF +FG 
Sbjct: 216 NMAVGLGTGVPWVMCKQDDAPDPIINTCNDHYCDWFSPNKNYKPTMWTEAWTSWFTAFGG 275

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP+RP ED+AFA+A+F + GG+F NYYMY GGTNFGRTAGGP VATSYDYDAPIDEYG 
Sbjct: 276 PVPYRPAEDMAFAIAKFIQRGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEYGL 335

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IRQPKWGHL++LHKAIK+CE  L+S DP    LG+  E+H++   S DCAAFLANYD  S
Sbjct: 336 IRQPKWGHLKDLHKAIKMCEAALVSGDPIVTSLGSSQESHVFKSESGDCAAFLANYDEKS 395

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V F G  Y LP WS+SILPDC N VFNTA+V           AQ  ++    +    
Sbjct: 396 FAKVAFQGMHYNLPPWSISILPDCVNTVFNTARV----------GAQTSSMTMTSVNPDG 445

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           FSW  Y E+     + S     L EQIN T+D +DYLWYT  I + P +     G+   L
Sbjct: 446 FSWETYNEETASYDDASITMEGLLEQINVTRDVTDYLWYTTDITIDPNEGFLKNGEYPVL 505

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + S GHA  +F+N +L    YG+ D         ++L  G N + +LS+ VGL N GA 
Sbjct: 506 TVMSAGHALHIFINGELSGTVYGSVDNPKLTYTGSVKLLAGNNKISVLSIAVGLPNIGAH 565

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+    G+   V+L  L  G+RDLS   W Y++G++GE + L  ++ ++S  W   S + 
Sbjct: 566 FETWNTGVLGPVVLNGLNEGRRDLSWQNWSYKIGLKGEALQLHSLTGSSSVEWS--SLIA 623

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYKTTF APEG GP AL+++ MGKGQ W+NGQSIGRYW AY A   G   +C Y
Sbjct: 624 QKQPLTWYKTTFNAPEGNGPFALDMSMMGKGQIWINGQSIGRYWPAYKA--YGNCGECSY 681

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G Y+  KC  +CG+ +Q  YH+P +W++P  NLLV+ EE GGDP+ ISL+ +T    C+
Sbjct: 682 TGRYNEKKCLANCGEASQRWYHVPSSWLYPTANLLVVFEEWGGDPTGISLVRRTTGSACA 741

Query: 714 FVSEADPPPVDSWKPNLGVVS--SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           F+SE  P        + G       P+  L+C  G  I++I FAS+G P+G CG+F  G+
Sbjct: 742 FISEWHPTLRKWHIKDYGRAERPRRPKAHLSCADGQKISSIKFASFGTPQGVCGNFTEGS 801

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           CH      I +K CVGQ  CS+ +S    G     CP ++K LAVEA C
Sbjct: 802 CHAHKSYDIFEKNCVGQQWCSVTISPDVFG--GDPCPNVMKNLAVEAIC 848


>gi|356561185|ref|XP_003548865.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
          Length = 848

 Score =  885 bits (2288), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/832 (53%), Positives = 564/832 (67%), Gaps = 30/832 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+V+ETYVFWN HEP
Sbjct: 25  ASVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGLDVVETYVFWNVHEP 84

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 85  SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+ENEYG      G  G+ YV WAA 
Sbjct: 145 EPFKTAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGDAGQNYVNWAAK 204

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV + T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP++WTE +SGWF  FG  
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +  RPV+DLAFAVARF   GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG I
Sbjct: 265 IHKRPVQDLAFAVARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELH+AIK+CE  L+S+DP    LG   +AH+Y   S DCAAFL+NYDS S 
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIITSLGESQQAHVYTTESGDCAAFLSNYDSKSS 384

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LP WSVSILPDC+NVVFNTAKV            Q   +  L   +  F
Sbjct: 385 ARVMFNNMHYNLPPWSVSILPDCRNVVFNTAKV----------GVQTSQMQMLPTNTQLF 434

Query: 423 SW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW  ++E V  +  + + + P L EQIN TKD SDYLWY  S+ +   +     G+   L
Sbjct: 435 SWESFDEDVYSVDDSSAIMAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPTL 494

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            ++S GHA  VF+N +L    YG  ++  F+   K+ L  GIN + +LS+ +GL N G  
Sbjct: 495 IVQSRGHAVHVFINGQLSGSAYGTREYRRFMYTGKVNLRAGINRIALLSVAIGLPNVGEH 554

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TL 592
           F+    G+   V L  L  GK DLS  +W YQVG++GE + L   +  +S  W Q +  +
Sbjct: 555 FESWSTGILGPVALHGLDQGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVV 614

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
             N+ L W+KT F APEG  PLAL++  MGKGQ W+NGQSIGRYW+ +   +TG    C+
Sbjct: 615 QRNQPLTWHKTHFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTTF---ATGNCNDCN 671

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y GS+   KCQ  CGQP Q  YH+PR+W+ P +NLLVI EELGG+PSKISL+ ++   +C
Sbjct: 672 YAGSFRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVIFEELGGNPSKISLVKRSVSSVC 731

Query: 713 SFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           + VSE   P + +W       S     P+V L C  G  I++I FAS+G P G CG++  
Sbjct: 732 ADVSEYH-PNIKNWHIESYGKSEEFHPPKVHLHCSPGQTISSIKFASFGTPLGTCGNYEQ 790

Query: 770 GACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           GACH      I++K C+G+  C++ VS++  G     CP +LK L+VEA C+
Sbjct: 791 GACHSPASYAILEKRCIGKPRCTVTVSNSNFG--QDPCPKVLKRLSVEAVCA 840


>gi|118488890|gb|ABK96254.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 846

 Score =  885 bits (2287), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/829 (52%), Positives = 564/829 (68%), Gaps = 20/829 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+V+YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 29  VTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLV+FVK  +EAGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 89  EPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M++F  KI+++MK E LF +QGGPIIL+Q+ENEYG +E+  G  G+ Y KWA
Sbjct: 149 DNGPFKAQMQKFTTKIVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWA 208

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 209 AEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFG 268

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 269 GPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 328

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL++LH+AIKLCE  L+S D T   LG   EAH+++  +  CAAFLANY   
Sbjct: 329 LLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQR 388

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V+F    Y LP WS+SILPDCKN V+NTA+V +Q        A+ K     +    
Sbjct: 389 SFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMTPVPMHGGF 441

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++  Y E+   SG+ +F    L EQINTT+D SDYLWY   +H+ P +     GK   L 
Sbjct: 442 SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLG 501

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VF+N +L    YG+ DF      + ++L  G+N + +LS+ VGL N G  F
Sbjct: 502 VLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHF 561

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V L  L  G+RDLS  +W Y++G+ GE +GL  IS ++S  W +GS +  
Sbjct: 562 ETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSLVAQ 621

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQ +GR+W AY A  +G    C Y 
Sbjct: 622 RQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA--SGTCGDCSYI 679

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G+Y+  KC  +CG+ +Q  YH+P++W+ P  NLLV+ EE GGDP+ ISL+ +    +C+ 
Sbjct: 680 GTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDVDSVCAD 739

Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + E  P  ++      G V+    P+  L+C  G  I +I FAS+G PEG CGS+R G+C
Sbjct: 740 IYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYRQGSC 799

Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H           CVGQ  CS+ V+    G     C  ++K LAVEA CS
Sbjct: 800 HAFHSYDAFNNLCVGQNSCSVTVAPEMFG--GDPCLNVMKKLAVEAICS 846


>gi|224134551|ref|XP_002327432.1| predicted protein [Populus trichocarpa]
 gi|222835986|gb|EEE74407.1| predicted protein [Populus trichocarpa]
          Length = 839

 Score =  884 bits (2284), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/829 (51%), Positives = 564/829 (68%), Gaps = 20/829 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+V+YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 22  VTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 81

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLV+FVK  +EAGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 82  EPSPGKYYFEGNYDLVKFVKLAKEAGLYVHLRIGPYICAEWNFGGFPVWLKYIPGINFRT 141

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M++F  K++++MK E LF +QGGPIIL+Q+ENEYG +E+  G  G+ Y KWA
Sbjct: 142 DNGPFKAQMQKFTTKVVNMMKAERLFETQGGPIILSQIENEYGPMEYEIGSPGKAYTKWA 201

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 202 AEMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFG 261

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 262 GPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL++LH+AIKLCE  L+S D T   LG   EAH+++  +  CAAFLANY   
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANYHQR 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V+F    Y LP WS+SILPDCKN V+NTA+V +Q        A+ K     +    
Sbjct: 382 SFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMTPVPMHGGF 434

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++  Y E+   SG+ +F    L EQINTT+D SDYLWY   +H+ P +     GK   L 
Sbjct: 435 SWQAYNEEPSASGDSTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLRSGKYPVLG 494

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VF+N +L    YG+ DF      + ++L  G+N + +LS+ VGL N G  F
Sbjct: 495 VLSAGHALHVFINGQLSGTAYGSLDFPKLTFTQGVKLRAGVNKISLLSIAVGLPNVGPHF 554

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V L  L  G+RDLS  +W Y++G+ GE +GL  IS ++S  W +GS +  
Sbjct: 555 ETWNAGILGPVTLNGLNEGRRDLSWQKWSYKIGLHGEALGLHSISGSSSVEWAEGSLVAQ 614

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQ +GR+W AY A  +G    C Y 
Sbjct: 615 RQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA--SGTCGDCSYI 672

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G+Y+  KC  +CG+ +Q  YH+P++W+ P  NLLV+ EE GGDP+ ISL+ +    +C+ 
Sbjct: 673 GTYNEKKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGISLVRRDVDSVCAD 732

Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + E  P  ++      G V+    P+  L+C  G  I +I FAS+G PEG CGS+R G+C
Sbjct: 733 IYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYRQGSC 792

Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H           CVGQ  CS+ V+    G     C  ++K LAVEA CS
Sbjct: 793 HAFHSYDAFNNLCVGQNSCSVTVAPEMFG--GDPCLNVMKKLAVEAICS 839


>gi|297738667|emb|CBI27912.3| unnamed protein product [Vitis vinifera]
          Length = 833

 Score =  883 bits (2282), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/830 (52%), Positives = 562/830 (67%), Gaps = 27/830 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+VTYD R+ +I+G+R++L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 20  SASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 79

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P RG+YYFEGR+DLVRF+K VQ AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 80  PSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTD 139

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F  KI+D+MK E LF  QGGPII++Q+ENEYG VE+  G  G+ Y KWAA
Sbjct: 140 NGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWAA 199

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + AV L T VPWVMC+QEDAPDP+I+ CNGFYC+ F PN   KP M+TE ++GW+  FG 
Sbjct: 200 EMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDYKPKMFTEAWTGWYTEFGG 259

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           A+P RP EDLA++VARF +  G+F NYYMY GGTNFGRTAGGP ++TSYDYDAPIDEYG 
Sbjct: 260 AIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGL 319

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
             +PKWGHLR+LHKAIKLCE  L+S+DPT   LG  LEAH+Y   S  CAAFLANYD  S
Sbjct: 320 PSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANYDPKS 379

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A VTF    Y LP WSVSILPDCKNVVFNTA++ +Q        + Q  +N +    S 
Sbjct: 380 SAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQ--------SSQMKMNPV----ST 427

Query: 422 FSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
           FSW  Y E+   +        D L EQIN T+DT+DYLWY   +H+ P +     G+   
Sbjct: 428 FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYPV 487

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S GHA  VF+N +L    YG         +  ++L  G N + +LS+ +GL N G 
Sbjct: 488 LTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGL 547

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   V L  L  G  D+SS +W Y++G++GE + L  I+ ++S  W +GS L
Sbjct: 548 HFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSLL 607

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYKTTF AP G  PLAL+++SMGKGQ W+NG+SIGR+W AY A   G    C+
Sbjct: 608 AQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWPAYTA--HGNCNGCN 665

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y G ++  KCQ  CG P+Q  YH+PR+W+ P  N L++ EELGG+P+ I+L+ +T   +C
Sbjct: 666 YAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVC 725

Query: 713 SFVSEADPPPVDSWKPNLGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           + + E  P   +S       V+S   +  L C  G  I+ I FAS+G+P+G CGSFR G+
Sbjct: 726 ADIFEGQPSLKNSQIIGSSKVNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGS 785

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH       +Q+ C+G+  CS+ V+    G     CPG +K L+VEA CS
Sbjct: 786 CHAHKSYDALQRNCIGKQSCSVSVAPEVFG--GDPCPGSMKKLSVEALCS 833


>gi|226494417|ref|NP_001151478.1| LOC100285111 precursor [Zea mays]
 gi|195647054|gb|ACG42995.1| beta-galactosidase precursor [Zea mays]
          Length = 844

 Score =  883 bits (2282), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 438/828 (52%), Positives = 549/828 (66%), Gaps = 18/828 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           ++NVTYDHR+L+I G+RR++ S SIHYPRS PE+WP+L+ ++K+GG + IETYVFWN HE
Sbjct: 26  ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQYYFE RFDLVRFVK V++AGL L LRIGPY  AEWNYGG PVWLH++PG  FRT 
Sbjct: 86  IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-EWAYGVGGELYVKWA 180
           N PFK  MK F   I+D+MK+E LFASQGG IILAQ+ENEYG+  E AYG GG+ Y  WA
Sbjct: 146 NEPFKNHMKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+  NT VPW+MCQ+ DAPDP+IN+CNGFYCDGF PNSP+KP +WTEN+ GWF +FG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            + P RP ED+AFAVARFFE GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKW HLRELHK+I+LCE  L+  + T   LG K EA IY   S  C AFLAN DS+
Sbjct: 326 LRRFPKWAHLRELHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +D  VTF    Y LPAWSVSILPDC+NVVFNTAKV SQ        +    V E L AS 
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQT-------SMVTMVPESLQASK 438

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLN 475
              W  + E+ GI G   FVR    + INTTKD++DYLWYT S  V      +G    LN
Sbjct: 439 PERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGSYSSKGSHAVLN 498

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I+S GH    F+N  L+   YGN   + F +   I L  G N L +LSM VGLQN G  +
Sbjct: 499 IDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLTINLRTGKNELALLSMTVGLQNAGFAY 558

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           +  GAG  +V +  ++ G  DLSS  W Y++G+EGEY  L K    N+  W   S  P N
Sbjct: 559 EWIGAGFTNVNISGVRTGIIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEPPKN 618

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK     P+G  P+ +++ SMGKG AW+NG +IGRYW    + +  CT  C+YRG
Sbjct: 619 QPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCNYRG 678

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           ++   KC+  CGQP Q  YHIPR+W HP  N+LV+ EE GGDP+KI+   +    +CSFV
Sbjct: 679 TFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVTSVCSFV 738

Query: 716 SEADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           SE  P   ++SW  +     + P + +L+C  G  I+++ FAS G P G C S++ G CH
Sbjct: 739 SEHFPSIDLESWDESAMNEGTPPAKAQLSCPEGKSISSVKFASLGNPSGTCRSYQMGRCH 798

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             + L +V+KAC+    C++ ++    G     C G+ K LA+EA CS
Sbjct: 799 HPNSLSVVEKACLNTNSCTVSLTDESFG--KDLCHGVTKTLAIEADCS 844


>gi|414879448|tpg|DAA56579.1| TPA: beta-galactosidase isoform 1 [Zea mays]
 gi|414879449|tpg|DAA56580.1| TPA: beta-galactosidase isoform 2 [Zea mays]
          Length = 844

 Score =  883 bits (2281), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/828 (52%), Positives = 549/828 (66%), Gaps = 18/828 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           ++NVTYDHR+L+I G+RR++ S SIHYPRS PE+WP+L+ ++K+GG + IETYVFWN HE
Sbjct: 26  ASNVTYDHRSLIISGRRRLVISTSIHYPRSVPEMWPKLVAEAKDGGADCIETYVFWNGHE 85

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQYYFE RFDLVRFVK V++AGL L LRIGPY  AEWNYGG PVWLH++PG  FRT 
Sbjct: 86  IAPGQYYFEDRFDLVRFVKVVRDAGLLLILRIGPYVAAEWNYGGVPVWLHYVPGTVFRTN 145

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-EWAYGVGGELYVKWA 180
           N PFK  +K F   I+D+MK+E LFASQGG IILAQ+ENEYG+  E AYG GG+ Y  WA
Sbjct: 146 NEPFKNHVKSFTTYIVDMMKKEQLFASQGGNIILAQIENEYGDYYEQAYGAGGKPYAMWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+  NT VPW+MCQ+ DAPDP+IN+CNGFYCDGF PNSP+KP +WTEN+ GWF +FG
Sbjct: 206 ASMALAQNTGVPWIMCQESDAPDPVINSCNGFYCDGFQPNSPTKPKIWTENWPGWFQTFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            + P RP ED+AFAVARFFE GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG
Sbjct: 266 ESNPHRPPEDVAFAVARFFEKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKW HLR+LHK+I+LCE  L+  + T   LG K EA IY   S  C AFLAN DS+
Sbjct: 326 LRRFPKWAHLRDLHKSIRLCEHTLLYGNTTFLSLGPKQEADIYSDQSGGCVAFLANIDSA 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +D  VTF    Y LPAWSVSILPDC+NVVFNTAKV SQ        +    V E L AS 
Sbjct: 386 NDKVVTFRNRQYDLPAWSVSILPDCRNVVFNTAKVQSQT-------SMVTMVPESLQASK 438

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLN 475
              W  + E+ GI G   FVR    + INTTKD++DYLWYT S  V      +G    LN
Sbjct: 439 PERWSIFRERTGIWGKNDFVRNGFVDHINTTKDSTDYLWYTTSFSVDGSYSSKGSHAVLN 498

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I+S GH    F+N  L+   YGN   + F +   I L  G N L +LSM VGLQN G  +
Sbjct: 499 IDSNGHGVHAFLNNVLIGSAYGNGSQSRFSVKLPINLRTGKNELALLSMTVGLQNAGFAY 558

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           +  GAG  +V +  ++ G  DLSS  W Y++G+EGEY  L K    N+  W   S  P N
Sbjct: 559 EWIGAGFTNVNISGVRTGTIDLSSNNWAYKIGLEGEYYNLFKPDQTNNQRWIPQSEPPKN 618

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK     P+G  P+ +++ SMGKG AW+NG +IGRYW    + +  CT  C+YRG
Sbjct: 619 QPLTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSINDRCTPSCNYRG 678

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           ++   KC+  CGQP Q  YHIPR+W HP  N+LV+ EE GGDP+KI+   +    +CSFV
Sbjct: 679 TFIPDKCRTGCGQPTQRWYHIPRSWFHPSGNILVVFEEKGGDPTKITFSRRAVTSVCSFV 738

Query: 716 SEADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           SE  P   ++SW  +     + P + +L C  G  I+++ FAS G P G C S++ G CH
Sbjct: 739 SEHFPSIDLESWDESAMTEGTPPAKAQLFCPEGKSISSVKFASLGNPSGTCRSYQMGRCH 798

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             + L +V+KAC+    C++ ++    G     CPG+ K LA+EA CS
Sbjct: 799 HPNSLSVVEKACLNTNSCTVSLTDESFG--KDLCPGVTKTLAIEADCS 844


>gi|225444920|ref|XP_002282132.1| PREDICTED: beta-galactosidase [Vitis vinifera]
          Length = 836

 Score =  883 bits (2281), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/830 (52%), Positives = 562/830 (67%), Gaps = 27/830 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+VTYD R+ +I+G+R++L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23  SASVTYDKRSFIINGQRKILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P RG+YYFEGR+DLVRF+K VQ AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 83  PSRGKYYFEGRYDLVRFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F  KI+D+MK E LF  QGGPII++Q+ENEYG VE+  G  G+ Y KWAA
Sbjct: 143 NGPFKVAMQGFTQKIVDMMKSEKLFQPQGGPIIMSQIENEYGPVEYEIGAPGKAYTKWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + AV L T VPWVMC+QEDAPDP+I+ CNGFYC+ F PN   KP M+TE ++GW+  FG 
Sbjct: 203 EMAVQLGTGVPWVMCKQEDAPDPVIDACNGFYCENFFPNKDYKPKMFTEAWTGWYTEFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           A+P RP EDLA++VARF +  G+F NYYMY GGTNFGRTAGGP ++TSYDYDAPIDEYG 
Sbjct: 263 AIPNRPAEDLAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
             +PKWGHLR+LHKAIKLCE  L+S+DPT   LG  LEAH+Y   S  CAAFLANYD  S
Sbjct: 323 PSEPKWGHLRDLHKAIKLCEPALVSADPTVTYLGTNLEAHVYKAKSGACAAFLANYDPKS 382

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A VTF    Y LP WSVSILPDCKNVVFNTA++ +Q        + Q  +N +    S 
Sbjct: 383 SAKVTFGNTQYDLPPWSVSILPDCKNVVFNTARIGAQ--------SSQMKMNPV----ST 430

Query: 422 FSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
           FSW  Y E+   +        D L EQIN T+DT+DYLWY   +H+ P +     G+   
Sbjct: 431 FSWQSYNEETASAYTEDTTTMDGLLEQINITRDTTDYLWYMTEVHIKPDEGFLKTGQYPV 490

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S GHA  VF+N +L    YG         +  ++L  G N + +LS+ +GL N G 
Sbjct: 491 LTVMSAGHALHVFINGQLSGTVYGELSNPKVTFSDNVKLTVGTNKISLLSVAMGLPNVGL 550

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   V L  L  G  D+SS +W Y++G++GE + L  I+ ++S  W +GS L
Sbjct: 551 HFETWNAGVLGPVTLKGLNEGTVDMSSWKWSYKIGLKGEALNLQAITGSSSDEWVEGSLL 610

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYKTTF AP G  PLAL+++SMGKGQ W+NG+SIGR+W AY A   G    C+
Sbjct: 611 AQKQPLTWYKTTFNAPGGNDPLALDMSSMGKGQIWINGESIGRHWPAYTA--HGNCNGCN 668

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y G ++  KCQ  CG P+Q  YH+PR+W+ P  N L++ EELGG+P+ I+L+ +T   +C
Sbjct: 669 YAGIFNDKKCQTGCGGPSQRWYHVPRSWLKPSGNQLIVFEELGGNPAGITLVKRTMDRVC 728

Query: 713 SFVSEADPPPVDSWKPNLGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           + + E  P   +S       V+S   +  L C  G  I+ I FAS+G+P+G CGSFR G+
Sbjct: 729 ADIFEGQPSLKNSQIIGSSKVNSLQSKAHLWCAPGLKISKIQFASFGVPQGTCGSFREGS 788

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH       +Q+ C+G+  CS+ V+    G     CPG +K L+VEA CS
Sbjct: 789 CHAHKSYDALQRNCIGKQSCSVSVAPEVFG--GDPCPGSMKKLSVEALCS 836


>gi|227053553|gb|ACP18875.1| beta-galactosidase pBG(a) [Carica papaya]
          Length = 836

 Score =  882 bits (2280), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 446/830 (53%), Positives = 566/830 (68%), Gaps = 24/830 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YDH+A+ I+GKRR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 19  ASVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 78

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYF G +DLVRF+K V++AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 79  SPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNN 138

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+RF  KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y +WAA 
Sbjct: 139 GPFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQ 198

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV L T VPWVMC+Q+DAPDPIIN+CNGFYCD F+PN   KP MWTE ++GWF  FG A
Sbjct: 199 MAVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 258

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 259 VPYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLV 318

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+AIKLCE  L+S DP+   LG   EAH++      CAAFLANY+  S 
Sbjct: 319 RQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSF 378

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILPDCKN V+NTA+V +Q        A+ K V   +    AF
Sbjct: 379 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMVP--VPIHGAF 429

Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW    EE    +G RSF    L EQINTT+D SDYLWY+  + + P +     GK   L
Sbjct: 430 SWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPTL 489

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + S GHA  VFVN +L    YG+ +F     +K + L  GIN + ILS+ VGL N G  
Sbjct: 490 TVLSAGHALHVFVNDQLSGTAYGSLEFPKITFSKGVNLRAGINKISILSIAVGLPNVGPH 549

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+   AG+   V L  L  G+RDLS  +W Y+VGVEGE + L  +S ++S  W  GS + 
Sbjct: 550 FETWNAGVLGPVTLNGLNEGRRDLSWQKWSYKVGVEGEAMSLHSLSGSSSVEWTAGSFVA 609

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+KTTF AP G  PLAL++ SMGKGQ W+NG+SIGR+W AY A  +G    CDY
Sbjct: 610 RRQPLTWFKTTFNAPAGNSPLALDMNSMGKGQIWINGKSIGRHWPAYKA--SGSCGWCDY 667

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G+++  KC  +CG+ +Q  YH+PR+W +P  NLLV+ EE GGDP+ ISL+ +    +C+
Sbjct: 668 AGTFNEKKCLSNCGEASQRWYHVPRSWPNPTGNLLVVFEEWGGDPNGISLVRREVDSVCA 727

Query: 714 FVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
            + E  P  ++      G V+    P+  L C  G  I+++ FAS+G PEG CGS+R G+
Sbjct: 728 DIYEWQPTLMNYQMQASGKVNKPLRPKAHLQCGPGQKISSVKFASFGTPEGACGSYREGS 787

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH        ++ CVGQ  CS+ V    +     A P ++K LAVE  CS
Sbjct: 788 CHAHHSYDAFERLCVGQNWCSVTVVPRNVSGEIPA-PSVMKKLAVEVVCS 836


>gi|157313306|gb|ABV32546.1| beta-galactosidase protein 1 [Prunus persica]
          Length = 836

 Score =  882 bits (2278), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/826 (51%), Positives = 565/826 (68%), Gaps = 21/826 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+KSK+GGL+VI+TYVFWN HE
Sbjct: 25  TASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 85  PSPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTD 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 145 NEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN   KP MWTE ++GW+  FG 
Sbjct: 205 QMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGG 264

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 265 AVPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 324

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R+PKWGHLR+LHKAIK  E  L+S++P+   LG   EAH++ KS + CAAFLANYD+ S
Sbjct: 325 PREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHVF-KSKSGCAAFLANYDTKS 383

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V+F    Y LP W +SILPDCK  V+NTA++ SQ        + Q  +  +  A   
Sbjct: 384 SAKVSFGNGQYELPPWPISILPDCKTAVYNTARLGSQ--------SSQMKMTPVKSALPW 435

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
            S+ EE      + +     L EQIN T+DT+DYLWY   I + P +     G+   L I
Sbjct: 436 QSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDITISPDEGFIKRGESPLLTI 495

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VF+N +L    YG  +      ++ ++   GIN L +LS+ VGL N G  F+
Sbjct: 496 YSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKPRSGINKLALLSISVGLPNVGLHFE 555

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   V L  L +G  D+S  +W Y++G++GE +GL  +S ++S  W +G ++   
Sbjct: 556 TWNAGVLGPVTLKGLNSGTWDMSRWKWTYKIGLKGEALGLHTVSGSSSVEWAEGPSMAQK 615

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK TF AP G GPLAL+++SMGKGQ W+NGQSIGR+W AY A   G    C Y G
Sbjct: 616 QPLTWYKATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTA--RGNCGNCYYAG 673

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           +YD  KC+ HCG+P+Q  YH+PR+W+ P  NLLV+ EE GGDP+KISL+ +    +C+ +
Sbjct: 674 TYDDKKCRTHCGEPSQRWYHVPRSWLTPSGNLLVVFEEWGGDPTKISLVERRTSSVCADI 733

Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
            E  P   +S K   G + + P+  L C  G  I+ I FASYG+P+G CGSF+ G+CH  
Sbjct: 734 FEGQPTLTNSQKLASGKL-NRPKAHLWCPPGQVISDIKFASYGLPQGTCGSFQEGSCHAH 792

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                 ++ C+G+  CS+ V+    G     CPG  K L+VEA CS
Sbjct: 793 KSYDAPKRNCIGKQSCSVAVAPEVFG--GDPCPGSTKKLSVEAVCS 836


>gi|255546097|ref|XP_002514108.1| beta-galactosidase, putative [Ricinus communis]
 gi|223546564|gb|EEF48062.1| beta-galactosidase, putative [Ricinus communis]
          Length = 840

 Score =  882 bits (2278), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/826 (51%), Positives = 561/826 (67%), Gaps = 21/826 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A V+YDHRA+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 28  ATVSYDHRAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 87

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G YYFE R+DLV+F+K VQ AGL++HLRIGPY CAEWN+GGFPVWL ++PGI+FRT N
Sbjct: 88  SPGNYYFEDRYDLVKFIKVVQAAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIEFRTDN 147

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW  G  G+ Y KWAAD
Sbjct: 148 GPFKAAMQKFTEKIVSMMKSEKLFESQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAAD 207

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV L T VPWVMC+Q+DAPDP+INTCNGFYC+ F PN   KP +WTEN++GW+  FG A
Sbjct: 208 MAVKLGTGVPWVMCKQDDAPDPVINTCNGFYCENFKPNKDYKPKLWTENWTGWYTEFGGA 267

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+ G  +ATSYDYDAP+DEYG  
Sbjct: 268 VPYRPAEDLAFSVARFIQNGGSFMNYYMYHGGTNFGRTSAGLFIATSYDYDAPLDEYGLT 327

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R PKWGHLR+LHKAIKLCE  L+S DPT + LG+  EAH++ +S + CAAFLANYD+   
Sbjct: 328 RDPKWGHLRDLHKAIKLCEPALVSVDPTVKSLGSNQEAHVF-QSKSSCAAFLANYDTKYS 386

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             VTF    Y LP WS+SILPDCK  VFNTA++ +Q        + Q  +  +  A S  
Sbjct: 387 VKVTFGNGQYDLPPWSISILPDCKTAVFNTARLGAQ--------SSQMKMTPVGGALSWQ 438

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
           S+ EE      + +     L EQIN T+D SDYLWY  ++++   +     G    L I 
Sbjct: 439 SYIEEAATGYTDDTTTLEGLWEQINVTRDASDYLWYMTNVNIDSDEGFLKNGDSPVLTIF 498

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GH+  VF+N +L    YG+ +      ++ ++L  GIN + +LS+ VGL N G  F+ 
Sbjct: 499 SAGHSLHVFINGQLAGTVYGSLENPKLTFSQNVKLTAGINKISLLSVAVGLPNVGVHFEK 558

Query: 538 AGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             AG+   V L  L  G RDLS  +W Y++G++GE + L  ++ ++S  W +GS     +
Sbjct: 559 WNAGILGPVTLKGLNEGTRDLSGWKWSYKIGLKGEALSLHTVTGSSSVEWVEGSLSAKKQ 618

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK TF APEG  P+AL+++SMGKGQ WVNGQSIGR+W AY A   G    C+Y G+
Sbjct: 619 PLTWYKATFDAPEGNDPVALDMSSMGKGQIWVNGQSIGRHWPAYTA--RGSCSACNYAGT 676

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           YD  KC+ +CG+P+Q  YH+PR+W++P  NLLV+ EE GG+PS ISL+ +T   +C+ + 
Sbjct: 677 YDDKKCRSNCGEPSQRWYHVPRSWLNPSGNLLVVFEEWGGEPSGISLVKRTTGSVCADIF 736

Query: 717 EADPPPVDSWKPNLGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
           E  P   +     LG +    P+  L C  G  I+ I FASYG P+G CGSF+ G+CH  
Sbjct: 737 EGQPALKNWQMIALGRLDHLQPKAHLWCPHGQKISKIKFASYGSPQGTCGSFKAGSCHAH 796

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                 +K C+G+  CS+ V++   G     CP   K L+VEA C+
Sbjct: 797 KSYDAFEKKCIGKQSCSVTVAAEVFG--GDPCPDSSKKLSVEAVCT 840


>gi|356556730|ref|XP_003546676.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
          Length = 840

 Score =  880 bits (2275), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/827 (52%), Positives = 570/827 (68%), Gaps = 22/827 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YD +A+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 27  ASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 86

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFEG +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 87  SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK +M++F  KI+DLMK E L+ SQGGPII++Q+ENEYG +E+  G  G+ Y KWAA+
Sbjct: 147 EPFKHQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENEYGPMEYEIGAAGKAYTKWAAE 206

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L T VPWVMC+Q+D PDP+INTCNGFYCD F+PN   KP MWTE ++GWF  FG  
Sbjct: 207 MAMGLGTGVPWVMCKQDDTPDPLINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGP 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 267 VPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 326

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+AIKLCE  L+S DPT  K+G   EAH++   S  CAAFLANY+  S 
Sbjct: 327 RQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAHVFKSKSGACAAFLANYNPKSY 386

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILPDCKN V+NTA+V SQ        AQ K     +     F
Sbjct: 387 ATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQS-------AQMKMTR--VPIHGGF 437

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           SW  + E+   + + SF    L EQ+NTT+D SDYLWY+  + + P +     GK+  L 
Sbjct: 438 SWLSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYSTDVVLDPNEGFLRNGKDPVLT 497

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VF+N +L    YG+ +F     N+ ++L  G+N + +LS+ VGL N G  F
Sbjct: 498 VFSAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRAGVNKISLLSVAVGLPNVGPHF 557

Query: 536 DVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   I L  L  G+RDLS  +W Y+VG++GE + L  +S ++S  W QGS +  
Sbjct: 558 ETWNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGEILSLHSLSGSSSVEWIQGSLVSQ 617

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQ++GRYW AY A  +G    CDY 
Sbjct: 618 RQPLTWYKTTFDAPAGTAPLALDMDSMGKGQVWLNGQNLGRYWPAYKA--SGTCDYCDYA 675

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G+Y+ +KC+ +CG+ +Q  YH+P++W+ P  NLLV+ EELGGDP+ I L+ +    +C+ 
Sbjct: 676 GTYNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEELGGDPNGIFLVRRDIDSVCAD 735

Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
           + E  P  +       G     P+V L+C  G  I++I FAS+G P G+CG+F  G+CH 
Sbjct: 736 IYEWQPNLISYQMQTSGKAPVRPKVHLSCSPGQKISSIKFASFGTPAGSCGNFHEGSCHA 795

Query: 775 -DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                  ++ CVGQ  C++ VS    G     CP +LK L+VEA CS
Sbjct: 796 HKSYDAFERNCVGQNWCTVTVSPENFG--GDPCPNVLKKLSVEAICS 840


>gi|168045621|ref|XP_001775275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673356|gb|EDQ59880.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 916

 Score =  880 bits (2275), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/852 (51%), Positives = 579/852 (67%), Gaps = 44/852 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD RA++IDG+RR+L S  IHYPR+TPE+WP +I+ +K+GG +V++TYVFWN HEP 
Sbjct: 31  NVTYDQRAVLIDGERRMLISAGIHYPRATPEMWPSIIQHAKDGGADVVQTYVFWNGHEPE 90

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEGR+DLV+F+K V++AGL+ HLRIGPY CAEWN+GGFP WL  IPGI FRT N 
Sbjct: 91  QGQYNFEGRYDLVKFIKLVKQAGLYFHLRIGPYVCAEWNFGGFPYWLKEIPGIVFRTDNE 150

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+ F +KI++LMK+  LF+ QGGPII+AQ+ENEYG++E  +G GG+ YV+WAAD 
Sbjct: 151 PFKVAMQGFTSKIVNLMKENELFSWQGGPIIMAQIENEYGDIESQFGDGGKRYVQWAADM 210

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A++L+T VPW+MC+QEDAP  IINTCNGFYCDG+ PN+  KPI+WTE+++GWF ++G A 
Sbjct: 211 ALSLDTRVPWIMCKQEDAPANIINTCNGFYCDGWKPNTALKPILWTEDWNGWFQNWGQAA 270

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVED AFAVARFF+ GG+FQNYYMYFGGTNF RTAGGP + T+YDYDAPIDEYG IR
Sbjct: 271 PHRPVEDNAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFMTTTYDYDAPIDEYGLIR 330

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQK--LGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           QPKWGHL++LH AIKLCE  L + D   Q   +G+  EAH Y  ++  CAAFLAN DS +
Sbjct: 331 QPKWGHLKDLHAAIKLCEPALTAVDTVPQSTWIGSNQEAHEY-SANGHCAAFLANIDSEN 389

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V F G  Y LPAWSVSILPDCKNV FNTA++ +Q        A   +  ++ L S+ 
Sbjct: 390 SVTVQFQGESYVLPAWSVSILPDCKNVAFNTAQIGAQTTVTRMRIAPSNSRGDIFLPSNT 449

Query: 422 --------------FSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM 465
                           W    E  GI G+ + V   L EQ+N TKDTSDYLWY+ SI + 
Sbjct: 450 LVHDHISDGGVFANLKWQASAEPFGIRGSGTTVSNSLLEQLNITKDTSDYLWYSTSITIT 509

Query: 466 PG------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
                    G E  L + ++  A  +FVN KL     G     N  + + I L +G N++
Sbjct: 510 SEGVTSDVSGTEANLVLGTMRDAVHIFVNGKLAGSAMG----WNIQVVQPITLKDGKNSI 565

Query: 520 DILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
           D+LSM +GLQNYGA+ +  GAG+  SV +  L  G   LS+ EW YQVG+ GE + L   
Sbjct: 566 DLLSMTLGLQNYGAYLETWGAGIRGSVSVTGLPYGNLSLSTAEWSYQVGLRGEELKLFHN 625

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
             A+   W   S+      L WYKTTF AP G  P+AL+L SMGKGQAW+NG  +GRY+ 
Sbjct: 626 GTADGFSWDS-SSFTNASYLTWYKTTFDAPGGTDPVALDLGSMGKGQAWINGHHLGRYF- 683

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQ-------TLYHIPRTWVHPGENLLVIH 691
             +AP +GC + CDYRG+Y+ +KC+ +CG+P+Q        +YHIPR W+    NLLV+ 
Sbjct: 684 LMVAPQSGC-ETCDYRGAYNTNKCRTNCGEPSQRWQVIHFQMYHIPRAWLQATGNLLVLF 742

Query: 692 EELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGV--VSSSPQVRLACERGWHI 749
           EE+GGD SK+S++T++   +C+ ++E+ PPP+ +W+P+  +   ++  ++ L C  G HI
Sbjct: 743 EEIGGDISKVSVVTRSAHAVCAHINESQPPPIRTWRPHRSIDAFNNPAEMLLECAAGQHI 802

Query: 750 AAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPG 808
             I FAS+G P G+CG F+ G CH +  +  V+K C+G+ +C IPV   + G S   CPG
Sbjct: 803 TKIKFASFGNPRGSCGHFQHGTCHANKSMEAVRKVCIGKQQCYIPVQRKFFG-SIDPCPG 861

Query: 809 LLKALAVEAHCS 820
           + K+LAV+ HCS
Sbjct: 862 VSKSLAVQVHCS 873


>gi|14970839|emb|CAC44500.1| beta-galactosidase [Fragaria x ananassa]
          Length = 843

 Score =  879 bits (2272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/839 (51%), Positives = 564/839 (67%), Gaps = 40/839 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD +A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+++K+GGL+VI+TYVFWN H
Sbjct: 26  VRASVSYDSKAIVINGQRRILISGSIHYPRSTPEMWPDLIQRAKDGGLDVIQTYVFWNGH 85

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFE  +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGIQFRT
Sbjct: 86  EPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIQFRT 145

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK++M+RF  KI+++MK E LF S GGPIIL+Q+ENEYG +E+  G  G+ Y  WA
Sbjct: 146 DNGPFKDQMQRFTTKIVNMMKAERLFESHGGPIILSQIENEYGPMEYEIGAPGKAYTDWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+Q+DAPDP+IN CNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 206 AQMAVGLGTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP EDLAF+VA+F + GG F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 266 GAVPYRPAEDLAFSVAKFLQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL++LH+AIKLCE  L+SSDPT   LG   EAH++  +S  CAAFLANY+  
Sbjct: 326 LLRQPKWGHLKDLHRAIKLCEPALVSSDPTVTPLGTYQEAHVFKSNSGACAAFLANYNRK 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F    Y LP WS+SILPDCKN V+NTA++ +Q      P          +    
Sbjct: 386 SFAKVAFGNMHYNLPPWSISILPDCKNTVYNTARIGAQTARMKMP---------RVPIHG 436

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
            FSW  Y ++     + SF    L EQIN T+D +DYLWY   + + P +     G    
Sbjct: 437 GFSWQAYNDETATYSDTSFTTAGLLEQINITRDATDYLWYMTDVKIDPSEDFLRSGNYPV 496

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S GHA  VF+N +L    YG+ +       + + L  GIN + +LS+ VGL N G 
Sbjct: 497 LTVLSAGHALRVFINGQLAGTAYGSLETPKLTFKQGVNLRAGINQIALLSIAVGLPNVGP 556

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   VIL  L  G+RDLS  +W Y++G++GE + L  ++ ++S  W +GS +
Sbjct: 557 HFETWNAGILGPVILNGLNEGRRDLSWQKWSYKIGLKGEALSLHSLTGSSSVEWTEGSFV 616

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYKTTF  P G  PLAL++ SMGKGQ W+N +SIGRYW AY A  +G   +C+
Sbjct: 617 AQRQPLTWYKTTFNRPAGNSPLALDMGSMGKGQVWINDRSIGRYWPAYKA--SGTCGECN 674

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y G++   KC  +CG+ +Q  YH+PR+W++P  NLLV+ EE GGDP+ I L+ +    +C
Sbjct: 675 YAGTFSEKKCLSNCGEASQRWYHVPRSWLNPTGNLLVVLEEWGGDPNGIFLVRREVDSVC 734

Query: 713 SFVSEADPPPVDSWKPNL--------GVVSS--SPQVRLACERGWHIAAINFASYGIPEG 762
           + + E        W+PNL        G V+    P+  L+C  G  I++I FAS+G PEG
Sbjct: 735 ADIYE--------WQPNLMSWQMQVSGRVNKPLRPKAHLSCGPGQKISSIKFASFGTPEG 786

Query: 763 NCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CGSFR G CH        +++C+GQ  CS+ VS    G     CP ++K L+VEA CS
Sbjct: 787 VCGSFREGGCHAHKSYNAFERSCIGQNSCSVTVSPENFG--GDPCPNVMKKLSVEAICS 843


>gi|224082924|ref|XP_002306893.1| predicted protein [Populus trichocarpa]
 gi|222856342|gb|EEE93889.1| predicted protein [Populus trichocarpa]
          Length = 853

 Score =  879 bits (2272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/834 (52%), Positives = 554/834 (66%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +A++IDG+RR+L SGSIHYPRSTP++W +L++K+K+GGL+VI+TYVFWN H
Sbjct: 24  IHCTVTYDKKAIIIDGQRRILISGSIHYPRSTPDMWEDLVQKAKDGGLDVIDTYVFWNVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGRFDLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84  EPSPGNYNFEGRFDLVRFIKTVQKGGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK E LF SQGGPII +Q+ENEYG    A+G  G  Y+ WA
Sbjct: 144 DNGPFKAAMQGFTQKIVQMMKDERLFQSQGGPIIFSQIENEYGPESRAFGAAGHSYINWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+++DAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF  FG
Sbjct: 204 AQMAVGLKTGVPWVMCKEDDAPDPVINTCNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A   RPV+DLAFAVARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GAFHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IR+PK+GHL+ELH+AIKLCE  L+SSDPT   LG   +AH++      C+AFLANY + 
Sbjct: 324 LIREPKYGHLKELHRAIKLCEHELVSSDPTITLLGTYQQAHVFSSGKRSCSAFLANYHTQ 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV            Q  +V  L   S 
Sbjct: 384 SAARVMFNNMHYVLPPWSISILPDCRNVVFNTAKV----------GVQTSHVQMLPTGSR 433

Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            FSW  Y+E +   G  S +    L EQIN T+DT+DYLWY  S+++ P +     G+  
Sbjct: 434 FFSWESYDEDISSLGASSRMTALGLMEQINVTRDTTDYLWYITSVNINPSESFLRGGQWP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L +ES GHA  VF+N +     +G  +   F     + L  G N + +LS+ VGL N G
Sbjct: 494 TLTVESAGHALHVFINGQFSGSAFGTRENREFTFTGPVNLRAGTNRIALLSIAVGLPNVG 553

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             ++    G+   V+L  L  G +DL+  +W YQVG++GE + L   + A+S  W QGS 
Sbjct: 554 VHYETWKTGILGPVMLHGLNQGNKDLTWQQWSYQVGLKGEAMNLVSPNRASSVDWIQGSL 613

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
               + L WYK  F AP G  PLAL++ SMGKGQ W+NGQSIGRYW +Y   + G    C
Sbjct: 614 ATRQQPLKWYKAYFDAPGGNEPLALDMRSMGKGQVWINGQSIGRYWLSY---AKGDCSSC 670

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
            Y G++   KCQ  CGQP Q  YH+PR+W+ P +NLLVI EELGGD SKISL+ ++   +
Sbjct: 671 GYSGTFRPPKCQLGCGQPTQRWYHVPRSWLKPKQNLLVIFEELGGDASKISLVKRSTTSV 730

Query: 712 CSFVSEADPPPVDSWKPNLGVVSS----SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           C+   E   P ++++       S       +V L C  G  I+AINFAS+G P G CGSF
Sbjct: 731 CADAFEHH-PTIENYNTESNGESERNLHQAKVHLRCAPGQSISAINFASFGTPTGTCGSF 789

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH  +   +V+K C+G+  C + +S++  G  A  CP  LK L+VEA CS
Sbjct: 790 QEGTCHAPNSHSVVEKKCIGRESCMVAISNSNFG--ADPCPSKLKKLSVEAVCS 841


>gi|115441369|ref|NP_001044964.1| Os01g0875500 [Oryza sativa Japonica Group]
 gi|75103778|sp|Q5N8X6.1|BGAL3_ORYSJ RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
           Precursor
 gi|56784847|dbj|BAD82087.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|113534495|dbj|BAF06878.1| Os01g0875500 [Oryza sativa Japonica Group]
 gi|222619622|gb|EEE55754.1| hypothetical protein OsJ_04267 [Oryza sativa Japonica Group]
          Length = 851

 Score =  879 bits (2272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/826 (52%), Positives = 547/826 (66%), Gaps = 16/826 (1%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +++VTYDHR+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 35  NSSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 94

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +GQYYFE RFDLVRF K V++AGL++ LRIGP+  AEW +GG PVWLH+ PG  FRT 
Sbjct: 95  PAQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTN 154

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  MKRF   I+D+MK+E  FASQGG IILAQVENEYG++E AYG G + Y  WAA
Sbjct: 155 NEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAA 214

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A+  NT VPW+MCQQ DAPDP+INTCN FYCD F PNSP+KP  WTEN+ GWF +FG 
Sbjct: 215 SMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGE 274

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           + P RP ED+AF+VARFF  GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG 
Sbjct: 275 SNPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGL 334

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R PKW HLR+LHK+IKL E  L+  + +   LG + EA +Y   S  C AFL+N DS  
Sbjct: 335 RRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEK 394

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  VTF    Y LPAWSVSILPDCKNV FNTAKV SQ    D        V   L +S  
Sbjct: 395 DKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM-------VPANLESSKV 447

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIE 477
             W  + EK GI GN   VR    + INTTKD++DYLWYT S  V      G    L+IE
Sbjct: 448 DGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIE 507

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA   F+N +L+   YGN   +NF +   + L  G N L +LSM VGLQN G  ++ 
Sbjct: 508 SKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEW 567

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
           AGAG+ SV +  ++N   DLSS +W Y++G+EGEY  L K        W   S  P N+ 
Sbjct: 568 AGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQP 627

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           + WYK     P+G  P+ L++ SMGKG AW+NG +IGRYW      S  CT  CDYRG++
Sbjct: 628 MTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTF 687

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
             +KC++ CGQP Q  YH+PR+W HP  N LVI EE GGDP+KI+   +T   +CSFVSE
Sbjct: 688 SPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSE 747

Query: 718 ADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
             P   ++SW  N       + +V+L+C +G  I+++ F S+G P G C S++ G+CH  
Sbjct: 748 HYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCHHP 807

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + + +V+KAC+    C++ +S    G     CPG+ K LA+EA CS
Sbjct: 808 NSISVVEKACLNMNGCTVSLSDE--GFGEDLCPGVTKTLAIEADCS 851


>gi|215734965|dbj|BAG95687.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 919

 Score =  879 bits (2270), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/826 (52%), Positives = 547/826 (66%), Gaps = 16/826 (1%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +++VTYDHR+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 103 NSSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 162

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +GQYYFE RFDLVRF K V++AGL++ LRIGP+  AEW +GG PVWLH+ PG  FRT 
Sbjct: 163 PAQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTN 222

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  MKRF   I+D+MK+E  FASQGG IILAQVENEYG++E AYG G + Y  WAA
Sbjct: 223 NEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAA 282

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A+  NT VPW+MCQQ DAPDP+INTCN FYCD F PNSP+KP  WTEN+ GWF +FG 
Sbjct: 283 SMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGE 342

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           + P RP ED+AF+VARFF  GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG 
Sbjct: 343 SNPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGL 402

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R PKW HLR+LHK+IKL E  L+  + +   LG + EA +Y   S  C AFL+N DS  
Sbjct: 403 RRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEK 462

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  VTF    Y LPAWSVSILPDCKNV FNTAKV SQ    D        V   L +S  
Sbjct: 463 DKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM-------VPANLESSKV 515

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIE 477
             W  + EK GI GN   VR    + INTTKD++DYLWYT S  V      G    L+IE
Sbjct: 516 DGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIE 575

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA   F+N +L+   YGN   +NF +   + L  G N L +LSM VGLQN G  ++ 
Sbjct: 576 SKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEW 635

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
           AGAG+ SV +  ++N   DLSS +W Y++G+EGEY  L K        W   S  P N+ 
Sbjct: 636 AGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQP 695

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           + WYK     P+G  P+ L++ SMGKG AW+NG +IGRYW      S  CT  CDYRG++
Sbjct: 696 MTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTF 755

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
             +KC++ CGQP Q  YH+PR+W HP  N LVI EE GGDP+KI+   +T   +CSFVSE
Sbjct: 756 SPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSE 815

Query: 718 ADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
             P   ++SW  N       + +V+L+C +G  I+++ F S+G P G C S++ G+CH  
Sbjct: 816 HYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFVSFGNPSGTCRSYQQGSCHHP 875

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + + +V+KAC+    C++ +S    G     CPG+ K LA+EA CS
Sbjct: 876 NSISVVEKACLNMNGCTVSLSDE--GFGEDLCPGVTKTLAIEADCS 919


>gi|218189464|gb|EEC71891.1| hypothetical protein OsI_04635 [Oryza sativa Indica Group]
          Length = 851

 Score =  879 bits (2270), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/826 (52%), Positives = 547/826 (66%), Gaps = 16/826 (1%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +++VTYD R+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 35  NSSVTYDQRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 94

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +GQYYFE RFDLVRF K V++AGL++ LRIGP+  AEW +GG PVWLH+ PG  FRT 
Sbjct: 95  PAQGQYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEWTFGGVPVWLHYAPGTVFRTN 154

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  MKRF   I+D+MK+E  FASQGG IILAQVENEYG++E AYG G + Y  WAA
Sbjct: 155 NEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENEYGDMEQAYGAGAKPYAMWAA 214

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A+  NT VPW+MCQQ DAPDP+INTCN FYCD F PNSP+KP  WTEN+ GWF +FG 
Sbjct: 215 SMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNSPTKPKFWTENWPGWFQTFGE 274

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           + P RP ED+AF+VARFF  GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG 
Sbjct: 275 SNPHRPPEDVAFSVARFFGKGGSLQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGL 334

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R PKW HLR+LHK+IKL E  L+  + +   LG + EA +Y   S  C AFL+N DS  
Sbjct: 335 RRLPKWAHLRDLHKSIKLGEHTLLYGNSSFVSLGPQQEADVYTDQSGGCVAFLSNVDSEK 394

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  VTF    Y LPAWSVSILPDCKNV FNTAKV SQ    D        V   L +S  
Sbjct: 395 DKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQTLMMDM-------VPANLESSKV 447

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIE 477
             W  + EK GI GN   VR    + INTTKD++DYLWYT S  V      G    L+IE
Sbjct: 448 DGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLWYTTSFDVDGSHLAGGNHVLHIE 507

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA   F+N +L+   YGN   +NF +   + L  G N L +LSM VGLQN G  ++ 
Sbjct: 508 SKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAGKNKLSLLSMTVGLQNGGPMYEW 567

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
           AGAG+ SV +  ++N   DLSS +W Y++G+EGEY  L K        W   S  P N+ 
Sbjct: 568 AGAGITSVKISGMENRIIDLSSNKWEYKIGLEGEYYSLFKADKGKDIRWMPQSEPPKNQP 627

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           + WYK     P+G  P+ L++ SMGKG AW+NG +IGRYW      S  CT  CDYRG++
Sbjct: 628 MTWYKVNVDVPQGDDPVGLDMQSMGKGLAWLNGNAIGRYWPRISPVSDRCTSSCDYRGTF 687

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
             +KC++ CGQP Q  YH+PR+W HP  N LVI EE GGDP+KI+   +T   +CSFVSE
Sbjct: 688 SPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKGGDPTKITFSRRTVASVCSFVSE 747

Query: 718 ADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
             P   ++SW  N       + +V+L+C +G  I+++ FAS+G P G C S++ G+CH  
Sbjct: 748 HYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVKFASFGNPSGTCRSYQQGSCHHP 807

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + + +V+KAC+    C++ +S    G     CPG+ K LA+EA CS
Sbjct: 808 NSISVVEKACLNMNGCTLSLSDE--GFGEDLCPGVTKTLAIEADCS 851


>gi|356502950|ref|XP_003520277.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
          Length = 848

 Score =  878 bits (2269), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 439/829 (52%), Positives = 560/829 (67%), Gaps = 24/829 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI K+KEGG++V+ETYVFWN HEP
Sbjct: 25  ASVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLILKAKEGGIDVVETYVFWNVHEP 84

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 85  SPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 144

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+ENEYG      G  G+ YV WAA 
Sbjct: 145 EPFKRAMQGFTEKIVGMMKSERLFESQGGPIILSQIENEYGAQSKLQGAAGQNYVNWAAK 204

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV + T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP++WTE +SGWF  FG  
Sbjct: 205 MAVEMGTGVPWVMCKEDDAPDPVINTCNGFYCDKFTPNRPYKPMIWTEAWSGWFTEFGGP 264

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +  RPV+DLAFA ARF   GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG I
Sbjct: 265 IHKRPVQDLAFAAARFIIRGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLI 324

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELH+AIK+CE  L+S+DP    LG   +AH+Y   S DCAAFL+NYDS S 
Sbjct: 325 RQPKYGHLKELHRAIKMCERALVSTDPIVTSLGEFQQAHVYTTESGDCAAFLSNYDSKSS 384

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LP WSVSILPDC+NVVFNTAKV  Q +       Q    N  L +  +F
Sbjct: 385 ARVMFNNMHYSLPPWSVSILPDCRNVVFNTAKVGVQTSQ-----MQMLPTNTQLFSWESF 439

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
              E+   +  + +   P L EQIN TKD SDYLWY  S+ +   +     G+   L ++
Sbjct: 440 D--EDIYSVDESSAITAPGLLEQINVTKDASDYLWYITSVDIGSSESFLRGGELPTLIVQ 497

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA  VF+N +L    +G  ++  F    K+ L  GIN + +LS+ +GL N G  F+ 
Sbjct: 498 STGHAVHVFINGQLSGSAFGTREYRRFTYTGKVNLLAGINRIALLSVAIGLPNVGEHFES 557

Query: 538 AGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVN 595
              G+   V L  L  GK DLS  +W YQVG++GE + L   +  +S  W Q +  +  N
Sbjct: 558 WSTGILGPVALHGLDKGKWDLSGQKWTYQVGLKGEAMDLASPNGISSVAWMQSAIVVQRN 617

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L W+KT F APEG  PLAL++  MGKGQ W+NGQSIGRYW+A+   +TG    C+Y G
Sbjct: 618 QPLTWHKTYFDAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAF---ATGNCNDCNYAG 674

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           S+   KCQ  CGQP Q  YH+PR+W+   +NLLVI EELGG+PSKISL+ ++   +C+ V
Sbjct: 675 SFRPPKCQLGCGQPTQRWYHVPRSWLKTTQNLLVIFEELGGNPSKISLVKRSVSSVCADV 734

Query: 716 SEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           SE   P + +W       S     P+V L C  G  I++I FAS+G P G CG++  GAC
Sbjct: 735 SEYH-PNIKNWHIESYGKSEEFRPPKVHLHCSPGQTISSIKFASFGTPLGTCGNYEQGAC 793

Query: 773 HMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H      I++K C+G+  C++ VS++  G     CP +LK L+VEA C+
Sbjct: 794 HSPASYVILEKRCIGKPRCTVTVSNSNFG--QDPCPKVLKRLSVEAVCA 840


>gi|297743077|emb|CBI35944.3| unnamed protein product [Vitis vinifera]
          Length = 841

 Score =  878 bits (2268), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/831 (52%), Positives = 567/831 (68%), Gaps = 26/831 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+V+YD RA+VI+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 26  VTASVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 85

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP +G+YYFEGR+DLVRF+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++ GI FRT
Sbjct: 86  EPSQGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRT 145

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+RF  KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y +WA
Sbjct: 146 NNEPFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 206 AKMAVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DE+G
Sbjct: 266 GAVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL++LH+AIKLCE  LIS DPT   LG   EAH++H  S  CAAFLANY+  
Sbjct: 326 LLRQPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPR 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V+F    Y LP WS+SILPDCKN V+NTA++             Q    ++   S 
Sbjct: 386 SYAKVSFRNMHYNLPPWSISILPDCKNTVYNTARL-----------GAQSATMKMTPVSG 434

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
            F W  Y E+     + SF    L EQINTT+D SDYLWY+  + +         G+   
Sbjct: 435 RFGWQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPV 494

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S GHA  VF+N +L    YG+ +      ++ ++L  G+NT+ +LS+ VGL N G 
Sbjct: 495 LTVLSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGP 554

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   V L  L  G+RDLS  +W Y+VG++GE + L  +S ++S  W +GS +
Sbjct: 555 HFETWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLM 614

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQ++GRYW AY A  TG    C+
Sbjct: 615 ARGQPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA--TGGCGDCN 672

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y G+Y   KC  +CG+P+Q  YH+P +W+ P  NLLV+ EE GG+P+ ISL+ +  + +C
Sbjct: 673 YAGTYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIESVC 732

Query: 713 SFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           + + E  P  ++      G V+    P+  L C  G  I++I FAS+G PEG CGS+R G
Sbjct: 733 ADIYEWQPTLMNYEMQASGKVNKPLRPKAHLWCAPGQKISSIKFASFGTPEGVCGSYREG 792

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +CH        +++C+G   CS+ V+    G     CP ++K L+VEA CS
Sbjct: 793 SCHAHKSYDAFERSCIGMNSCSVTVAPEIFG--GDPCPSVMKKLSVEAICS 841


>gi|356550446|ref|XP_003543598.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
          Length = 841

 Score =  878 bits (2268), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/825 (52%), Positives = 570/825 (69%), Gaps = 18/825 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YD +A+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 28  ASVSYDSKAITINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEP 87

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFEG +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 88  SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 147

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK +M++F  KI+DLMK E L+ SQGGPII++Q+ENEYG +E+  G  G+ Y KWAA+
Sbjct: 148 EPFKVQMQKFTTKIVDLMKAERLYESQGGPIIMSQIENEYGPMEYEIGAAGKAYTKWAAE 207

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L T VPW+MC+Q+D PDP+INTCNGFYCD F+PN   KP MWTE ++GWF  FG  
Sbjct: 208 MAMELGTGVPWIMCKQDDTPDPLINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGP 267

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 268 VPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 327

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+AIKLCE  L+S DPT  K+G   EAH++   S  CAAFLANY+  S 
Sbjct: 328 RQPKWGHLKDLHRAIKLCEPALVSGDPTVTKIGNYQEAHVFKSMSGACAAFLANYNPKSY 387

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILP+CKN V+NTA+V SQ        AQ K     +    ++
Sbjct: 388 ATVAFGNMHYNLPPWSISILPNCKNTVYNTARVGSQS-------AQMKMTRVPIHGGLSW 440

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
             + E+   + + SF    L EQ+NTT+D SDYLWY+  + + P +     GK+  L + 
Sbjct: 441 LSFNEETTTTDDSSFTMTGLLEQLNTTRDLSDYLWYSTDVVLDPNEGFLRNGKDPVLTVF 500

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA  VF+N +L    YG+ +F     N+ ++L  G+N + +LS+ VGL N G  F+ 
Sbjct: 501 SAGHALHVFINGQLSGTAYGSLEFPKLTFNEGVKLRTGVNKISLLSVAVGLPNVGPHFET 560

Query: 538 AGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             AG+   I L  L  G+RDLS  +W Y+VG++GE + L  +  ++S  W QGS +   +
Sbjct: 561 WNAGVLGPISLSGLNEGRRDLSWQKWSYKVGLKGETLSLHSLGGSSSVEWIQGSLVSQRQ 620

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYKTTF AP+G  PLAL++ SMGKGQ W+NGQ++GRYW AY A  +G    CDY G+
Sbjct: 621 PLTWYKTTFDAPDGTAPLALDMNSMGKGQVWLNGQNLGRYWPAYKA--SGTCDYCDYAGT 678

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           Y+ +KC+ +CG+ +Q  YH+P++W+ P  NLLV+ EELGGD + ISL+ +    +C+ + 
Sbjct: 679 YNENKCRSNCGEASQRWYHVPQSWLKPTGNLLVVFEELGGDLNGISLVRRDIDSVCADIY 738

Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV 776
           E  P  +       G     P+V L+C  G  I++I FAS+G P G+CG+F  G+CH  +
Sbjct: 739 EWQPNLISYQMQTSGKAPVRPKVHLSCSPGQKISSIKFASFGTPVGSCGNFHEGSCHAHM 798

Query: 777 -LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                ++ CVGQ  C++ VS    G     CP +LK L+VEA CS
Sbjct: 799 SYDAFERNCVGQNLCTVAVSPENFG--GDPCPNVLKKLSVEAICS 841


>gi|297829920|ref|XP_002882842.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328682|gb|EFH59101.1| hypothetical protein ARALYDRAFT_897617 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 847

 Score =  877 bits (2267), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 437/829 (52%), Positives = 556/829 (67%), Gaps = 20/829 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +S +V+YD RA+ I+GKRR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+VI+TYVFWN H
Sbjct: 30  VSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGH 89

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLVRFVK VQ++GL+LHLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 90  EPSPGKYYFEGNYDLVRFVKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 149

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M+RF  KI+++MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y  WA
Sbjct: 150 DNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWA 209

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 210 AKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 270 GPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 329

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             RQPKWGHL++LH+AIKLCE  L+S +PT   LG   EAH+Y   S  C+AFLANY+  
Sbjct: 330 LERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKAKSGACSAFLANYNPK 389

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V+F  N Y LP WS+SILPDCKN V+NTA+V +Q        ++ K V   +    
Sbjct: 390 SYAKVSFGSNHYNLPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGL 442

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++  Y E      + SF    L EQINTT+DTSDYLWY   + +   +     G    L 
Sbjct: 443 SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKIDANEGFLRNGDLPTLT 502

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VF+N +L    YG+ D       K + L  G N + ILS+ VGL N G  F
Sbjct: 503 VLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V L  L  G+RDLS  +W Y+VG++GE + L  +S ++S  W +G+ +  
Sbjct: 563 ETWNAGVLGPVSLNGLSGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLA+++ SMGKGQ W+NGQS+GR+W AY A   G   +C Y 
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYT 680

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G++   KC ++CG+ +Q  YH+PR+W+ P  NLLV+ EE GGDP+ ISL+ +    +C+ 
Sbjct: 681 GTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGISLVRREVDSVCAD 740

Query: 715 VSEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + E     V+      G V+    P+V L C  G  I  + FAS+G PEG CGS+R G+C
Sbjct: 741 IYEWQSTLVNYQLHASGKVNKPLHPKVHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSC 800

Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H         K CVGQ  CS+ V+    G     CP ++K LAVEA C+
Sbjct: 801 HDHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 847


>gi|165906266|gb|ABY71826.1| beta-galactosidase [Prunus salicina]
          Length = 836

 Score =  877 bits (2266), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/826 (51%), Positives = 564/826 (68%), Gaps = 21/826 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+KSK+GGL+VI+TYVFWN HE
Sbjct: 25  TASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKSKDGGLDVIQTYVFWNGHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 85  PSPGKYYFEDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTD 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 145 NEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN   KP MWTE ++GW+  FG 
Sbjct: 205 QMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEFGG 264

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 265 AVPTRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 324

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R+PKWGHLR+LHKAIK  E  L+S++P+   LG   EAH++ KS + CAAFLANYD+ S
Sbjct: 325 PREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVF-KSKSGCAAFLANYDTKS 383

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V+F    Y LP WS+SILPDC+  V+NTA++ SQ        + Q  +  +  A   
Sbjct: 384 SAKVSFGNGQYELPPWSISILPDCRTAVYNTARLGSQ--------SSQMKMTPVKSALPW 435

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
            S+ EE      + +     L EQIN T+DT+DY WY   I + P +     G+   L I
Sbjct: 436 QSFIEESASSDESDTTTLDGLWEQINVTRDTTDYSWYMTDITISPDEGFIKRGESPLLTI 495

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VF+N +L    YG  +      ++ ++L  GIN L +LS+ VGL N G  F+
Sbjct: 496 YSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGINKLALLSISVGLPNVGLHFE 555

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   V L  L +G  D+S  +W Y+VG++GE +GL  +S ++S  W +G ++   
Sbjct: 556 TWNAGVLGPVTLKGLNSGTWDMSRWKWTYKVGLKGEALGLHTVSGSSSVEWAEGPSMAQK 615

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WY+ TF AP G GPLAL+++SMGKGQ W+NGQSIGR+W AY A   G    C Y G
Sbjct: 616 QPLTWYRATFNAPPGNGPLALDMSSMGKGQIWINGQSIGRHWPAYTA--RGNCGNCYYAG 673

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           +YD  KC+ HCG+P+Q  YH+PR+W+    NLLV+ EE GGDP+KISL+ +    +C+ +
Sbjct: 674 TYDDKKCRTHCGEPSQRWYHVPRSWLTTSGNLLVVFEEWGGDPTKISLVERRTSSVCADI 733

Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
            E  P   +S K   G + + P+  L C  G  I+ I FASYG+ +G CGSF+ G+CH  
Sbjct: 734 FEGQPTLTNSQKLASGKL-NRPKAHLWCPPGQVISDIKFASYGLSQGTCGSFQEGSCHAH 792

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                 ++ C+G+  CS+ V+    G     CPG  K L+VEA CS
Sbjct: 793 KSYDAPKRNCIGKQSCSVTVAPEVFG--GDPCPGSTKKLSVEAVCS 836


>gi|1168654|sp|P45582.1|BGAL_ASPOF RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
 gi|452712|emb|CAA54525.1| beta-galactosidase [Asparagus officinalis]
          Length = 832

 Score =  877 bits (2265), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/833 (52%), Positives = 562/833 (67%), Gaps = 38/833 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH++++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 23  VTASVTYDHKSVIINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 82

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYF GR+DLVRF+K V++AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83  EPSPGQYYFGGRYDLVRFLKLVKQAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M +F  KI+ +MK E L+ +QGGPIIL+Q+ENEYG VE+  G  G+ Y  WA
Sbjct: 143 DNGPFKAAMGKFTEKIVSMMKAEGLYETQGGPIILSQIENEYGPVEYYDGAAGKSYTNWA 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV LNT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN  +KP MWTE ++GWF  FG
Sbjct: 203 AKMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKDNKPKMWTEAWTGWFTGFG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP ED+AFAVARF + GG+F NYYMY GGTNFGRTAGGP ++TSYDYDAPIDEYG
Sbjct: 263 GAVPQRPAEDMAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFISTSYDYDAPIDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHLR+LHKAIKLCE  L+S +PT   LG   E+++Y +S + CAAFLAN++S 
Sbjct: 323 LLRQPKWGHLRDLHKAIKLCEPALVSGEPTITSLGQNQESYVY-RSKSSCAAFLANFNSR 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
             A VTFNG  Y LP WSVSILPDCK  VFNTA+V +Q       +              
Sbjct: 382 YYATVTFNGMHYNLPPWSVSILPDCKTTVFNTARVGAQTTTMKMQYL------------G 429

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
            FSW  Y E      + +F +  L EQ++TT D SDYLWYT  + +   +     GK  +
Sbjct: 430 GFSWKAYTEDTDALNDNTFTKDGLVEQLSTTWDRSDYLWYTTYVDIAKNEEFLKTGKYPY 489

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S GHA  VF+N +L    YG+ D      +   +L  G N + ILS+ VGL N G 
Sbjct: 490 LTVMSAGHAVHVFINGQLSGTAYGSLDNPKLTYSGSAKLWAGSNKISILSVSVGLPNVGN 549

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+    G+   V L  L  GKRDLS  +W YQ+G+ GE + L  ++ +++  W + S  
Sbjct: 550 HFETWNTGVLGPVTLTGLNEGKRDLSLQKWTYQIGLHGETLSLHSLTGSSNVEWGEASQ- 608

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYKT F AP G  PLAL++ +MGKGQ W+NGQSIGRYW AY A  +G    CD
Sbjct: 609 --KQPLTWYKTFFNAPPGNEPLALDMNTMGKGQIWINGQSIGRYWPAYKA--SGSCGSCD 664

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           YRG+Y+  KC  +CG+ +Q  YH+PR+W+ P  N LV+ EE GGDP+ IS++ ++   +C
Sbjct: 665 YRGTYNEKKCLSNCGEASQRWYHVPRSWLIPTGNFLVVLEEWGGDPTGISMVKRSVASVC 724

Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + V E   P +D+W+         P+V L+C+ G  ++ I FAS+G P+G CGSF  G+C
Sbjct: 725 AEVEELQ-PTMDNWRTK---AYGRPKVHLSCDPGQKMSKIKFASFGTPQGTCGSFSEGSC 780

Query: 773 HM----DVLPI--VQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           H     D      + + CVGQ  CS+ V+    G     CPG +K LAVEA C
Sbjct: 781 HAHKSYDAFEQEGLMQNCVGQEFCSVNVAPEVFG--GDPCPGTMKKLAVEAIC 831


>gi|449458175|ref|XP_004146823.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
 gi|449515710|ref|XP_004164891.1| PREDICTED: beta-galactosidase 1-like [Cucumis sativus]
          Length = 841

 Score =  877 bits (2265), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/829 (52%), Positives = 564/829 (68%), Gaps = 20/829 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD +A++I+G RR+L SGSIHYPRST E+WP+LI+K+KEGGL+VIETYVFWN H
Sbjct: 24  VQASVSYDSKAIIINGHRRILISGSIHYPRSTSEMWPDLIQKAKEGGLDVIETYVFWNGH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLVRFVK V +AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 84  EPEPGKYYFEGNYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M+RF  KI+++MK E L+ SQGGPIIL+Q+ENEYG +E+  G  G+ Y KWA
Sbjct: 144 DNAPFKFQMERFTRKIVNMMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYSKWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+ L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 204 AQMALGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTQFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP ED+AFAVARF + GG   NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 264 GAVPHRPAEDMAFAVARFIQKGGALINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL++L++AIKLCE  L+S DP   +LG   EAH++   S  CAAFL+NY+  
Sbjct: 324 LLRQPKWGHLKDLNRAIKLCEPALVSGDPIVTRLGNYQEAHVFKSKSGACAAFLSNYNPR 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F    Y +P WS+SILPDCKN VFNTA+V +Q        A  K     +  S 
Sbjct: 384 SYATVAFGNMHYNIPPWSISILPDCKNTVFNTARVGAQT-------AIMKMSPVPMHESF 436

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++  Y E+      ++F    L EQINTT+D +DYLWYT  +H+   +     GK   L 
Sbjct: 437 SWQAYNEEPASYNEKAFTTVGLLEQINTTRDATDYLWYTTDVHIDANEGFLRSGKYPVLT 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VFVN +L    YG+ DF     ++ + L  G N + +LS+ VGL N G  F
Sbjct: 497 VLSAGHAMHVFVNGQLAGTAYGSLDFPKLTFSRGVNLRAGNNKIALLSIAVGLPNVGPHF 556

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           ++  AG+   V L  L  G+RDL+  +W Y++G++GE + L  +S ++S  W QGS +  
Sbjct: 557 EMWNAGILGPVNLNGLDEGRRDLTWQKWTYKIGLDGEAMSLHSLSGSSSVEWIQGSLVAQ 616

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L W+KTTF AP G  PLAL++ SMGKGQ W+NGQS+GRYW AY   STG    CDY 
Sbjct: 617 KQPLTWFKTTFNAPAGNSPLALDMGSMGKGQIWLNGQSLGRYWPAY--KSTGSCGSCDYT 674

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G+Y+  KC  +CG+ +Q  YH+PR+W++P  NLLV+ EE GGDP+ I L+ +    +C  
Sbjct: 675 GTYNEKKCSSNCGEASQRWYHVPRSWLNPTGNLLVVFEEWGGDPNGIHLVRRDVDSVCVN 734

Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           ++E  P  ++    + G V+    P+  L+C  G  I+++ FAS+G PEG CGSFR G+C
Sbjct: 735 INEWQPTLMNWQMQSSGKVNKPLRPKAHLSCGPGQKISSVKFASFGTPEGECGSFREGSC 794

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H        Q+ CVGQ  C++ V+    G     CP ++K L+VE  CS
Sbjct: 795 HAHHSYDAFQRTCVGQNFCTVTVAPEMFG--GDPCPNVMKKLSVEVICS 841


>gi|359482511|ref|XP_002279310.2| PREDICTED: beta-galactosidase-like [Vitis vinifera]
          Length = 828

 Score =  877 bits (2265), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/828 (52%), Positives = 565/828 (68%), Gaps = 26/828 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YD RA+VI+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN HEP 
Sbjct: 16  NVSYDRRAIVINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPS 75

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +G+YYFEGR+DLVRF+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++ GI FRT N 
Sbjct: 76  QGKYYFEGRYDLVRFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVQGINFRTNNE 135

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+RF  KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y +WAA  
Sbjct: 136 PFKWHMQRFTKKIVDMMKSEGLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTEWAAKM 195

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF  FG AV
Sbjct: 196 AVGLGTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGAV 255

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DE+G +R
Sbjct: 256 PHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEFGLLR 315

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHL++LH+AIKLCE  LIS DPT   LG   EAH++H  S  CAAFLANY+  S A
Sbjct: 316 QPKWGHLKDLHRAIKLCEPALISGDPTVTSLGNYEEAHVFHSKSGACAAFLANYNPRSYA 375

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V+F    Y LP WS+SILPDCKN V+NTA++             Q    ++   S  F 
Sbjct: 376 KVSFRNMHYNLPPWSISILPDCKNTVYNTARL-----------GAQSATMKMTPVSGRFG 424

Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
           W  Y E+     + SF    L EQINTT+D SDYLWY+  + +   +     G+   L +
Sbjct: 425 WQSYNEETASYDDSSFAAVGLLEQINTTRDVSDYLWYSTDVKIGYNEGFLKSGRYPVLTV 484

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VF+N +L    YG+ +      ++ ++L  G+NT+ +LS+ VGL N G  F+
Sbjct: 485 LSAGHALHVFINGRLSGTAYGSLENPKLTFSQGVKLRAGVNTIALLSIAVGLPNVGPHFE 544

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   V L  L  G+RDLS  +W Y+VG++GE + L  +S ++S  W +GS +   
Sbjct: 545 TWNAGVLGPVSLNGLNEGRRDLSWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLMARG 604

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQ++GRYW AY A  TG    C+Y G
Sbjct: 605 QPLTWYKTTFNAPGGNTPLALDMGSMGKGQIWINGQNVGRYWPAYKA--TGGCGDCNYAG 662

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           +Y   KC  +CG+P+Q  YH+P +W+ P  NLLV+ EE GG+P+ ISL+ +  + +C+ +
Sbjct: 663 TYSEKKCLSNCGEPSQRWYHVPHSWLSPTGNLLVVFEESGGNPAGISLVEREIESVCADI 722

Query: 716 SEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
            E  P  ++      G V+    P+  L C  G  I++I FAS+G PEG CGS+R G+CH
Sbjct: 723 YEWQPTLMNYEMQASGKVNKPLRPKAHLWCAPGQKISSIKFASFGTPEGVCGSYREGSCH 782

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                   +++C+G   CS+ V+    G     CP ++K L+VEA CS
Sbjct: 783 AHKSYDAFERSCIGMNSCSVTVAPEIFG--GDPCPSVMKKLSVEAICS 828


>gi|255572957|ref|XP_002527409.1| beta-galactosidase, putative [Ricinus communis]
 gi|223533219|gb|EEF34975.1| beta-galactosidase, putative [Ricinus communis]
          Length = 845

 Score =  876 bits (2264), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/825 (52%), Positives = 565/825 (68%), Gaps = 24/825 (2%)

Query: 7   YDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQ 66
           YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN HEP  G+
Sbjct: 34  YDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGHEPSPGK 93

Query: 67  YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
           YYFEG +DLV+F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PFK
Sbjct: 94  YYFEGNYDLVKFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGINFRTDNGPFK 153

Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
            +M+RF  KI+++MK E LF SQGGPIIL+Q+ENEYG +E+  G  G+ Y KWAA  AV 
Sbjct: 154 AQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGQAYSKWAAKMAVG 213

Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
           L T VPWVMC+Q+DAPDP+INTCNGFYCD F+PN P KP MWTE ++GWF  FG AVP+R
Sbjct: 214 LGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKPYKPKMWTEAWTGWFTEFGGAVPYR 273

Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPK 306
           P EDLAF+VARF + GG F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +RQPK
Sbjct: 274 PAEDLAFSVARFIQKGGAFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLLRQPK 333

Query: 307 WGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVT 366
           WGHL++LH+AIKLCE  L+S  P+   LG   EAH++   S  CAAFLANY+  S A V+
Sbjct: 334 WGHLKDLHRAIKLCEPALVSGAPSVMPLGNYQEAHVFKSKSGACAAFLANYNQRSFAKVS 393

Query: 367 FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW-- 424
           F    Y LP WS+SILPDCKN V+NTA++ +Q        A+ K     +     FSW  
Sbjct: 394 FGNMHYNLPPWSISILPDCKNTVYNTARIGAQS-------ARMK--MSPIPMRGGFSWQA 444

Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESL 479
           Y E+    G+ +F+   L EQINTT+D SDYLWY+  + +   +     GK   L + S 
Sbjct: 445 YSEEASTEGDNTFMMVGLLEQINTTRDVSDYLWYSTDVRIDSNEGFLRSGKYPVLTVLSA 504

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
           GHA  VFVN +L    YG+ +      ++ +++  GIN + +LS+ VGL N G  F+   
Sbjct: 505 GHALHVFVNGQLSGTAYGSLESPKLTFSQGVKMRAGINRIYLLSIAVGLPNVGPHFETWN 564

Query: 540 AGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
           AG+   V L  L  G+RDLS  +W Y++G+ GE + L  +S ++S  W QGS +   + L
Sbjct: 565 AGVLGPVTLNGLNEGRRDLSWQKWTYKIGLHGEALSLHSLSGSSSVEWAQGSFVSRKQPL 624

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
           +WYKTTF AP G  PLAL++ SMGKGQ W+NGQS+GRYW AY A  +G    C+Y G+++
Sbjct: 625 MWYKTTFNAPAGNSPLALDMGSMGKGQVWINGQSVGRYWPAYKA--SGNCGVCNYAGTFN 682

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
             KC  +CG+ +Q  YH+PR+W++   NLLV+ EE GGDP+ ISL+ +    +C+ + E 
Sbjct: 683 EKKCLTNCGEASQRWYHVPRSWLNTAGNLLVVFEEWGGDPNGISLVRREVDSVCADIYEW 742

Query: 719 DPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MD 775
            P  ++    + G V+    P+V L C  G  I+ I FAS+G PEG CGS+R G+CH   
Sbjct: 743 QPTLMNYMMQSSGKVNKPLRPKVHLQCGAGQKISLIKFASFGTPEGVCGSYRQGSCHAFH 802

Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                 + CVGQ  CS+ V+    G     CP ++K LAVEA CS
Sbjct: 803 SYDAFNRLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCS 845


>gi|357131396|ref|XP_003567324.1| PREDICTED: beta-galactosidase 3-like [Brachypodium distachyon]
          Length = 916

 Score =  875 bits (2262), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/826 (51%), Positives = 547/826 (66%), Gaps = 17/826 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + VTYD R+L+I G+RR+L S SIHYPRS P +WP+L+ ++K+GG + IETYVFWN HE 
Sbjct: 100 SGVTYDGRSLIISGRRRLLISTSIHYPRSVPAMWPKLVAEAKDGGADCIETYVFWNGHET 159

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFE RFDLVRF K V++AGL+L LRIGP+  AEWN+GG PVWLH+IPG  FRT N
Sbjct: 160 APGEYYFEDRFDLVRFAKVVKDAGLYLMLRIGPFVAAEWNFGGVPVWLHYIPGAVFRTNN 219

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  MK F  KI+D+MK+E  FASQGG IILAQ+ENEYG+ E AYG  G+ Y  WAA 
Sbjct: 220 EPFKSHMKSFTTKIVDMMKRERFFASQGGHIILAQIENEYGDTEQAYGADGKAYAMWAAS 279

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+  NT VPW+MCQQ DAP+ +INTCN FYCD F  NSP+KP +WTEN+ GWF +FG +
Sbjct: 280 MALAQNTGVPWIMCQQYDAPEHVINTCNSFYCDQFKTNSPTKPKIWTENWPGWFQTFGES 339

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P RP ED+AF+VARFF+ GG+ QNYY+Y GGTNFGRT GGP + TSYDYDAPIDEYG  
Sbjct: 340 NPHRPPEDVAFSVARFFQKGGSVQNYYVYHGGTNFGRTTGGPFITTSYDYDAPIDEYGLT 399

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R PKW HLR+LHK+IKLCE  L+  + T   LG K EA +Y   S  C AFLAN D  +D
Sbjct: 400 RLPKWAHLRDLHKSIKLCEHSLLYGNLTSLSLGTKQEADVYTDHSGGCVAFLANIDPEND 459

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             VTF    Y LPAWSVSILPDCKN VFNTAKV SQ    D        V E L ++   
Sbjct: 460 TVVTFRSRQYDLPAWSVSILPDCKNAVFNTAKVQSQTLMVDM-------VPETLQSTKPD 512

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIE 477
            W  + EK GI     F+R    + INTTKD++DYLW+T S +V    P  G    L+I+
Sbjct: 513 RWSIFREKTGIWDKNDFIRNGFVDHINTTKDSTDYLWHTTSFNVDRSYPTNGNRELLSID 572

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA   F+N +L+   YGN   ++F ++  I+L  G N + +LSM VGLQN G  ++ 
Sbjct: 573 SKGHAVHAFLNNELIGSAYGNGSKSSFNVHMPIKLKPGKNEIALLSMTVGLQNAGPHYEW 632

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
            GAGL SV +  +KNG  DLSS  W Y++G+EGE+ GL K    N+  W   S  P  + 
Sbjct: 633 VGAGLTSVNISGMKNGSIDLSSNNWAYKIGLEGEHYGLFKPDQGNNQRWSPQSEPPKGQP 692

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L WYK     P+G  P+ +++ SMGKG AW+NG +IGRYW    +    CT  C+YRG +
Sbjct: 693 LTWYKVNVDVPQGDDPVGIDMQSMGKGLAWLNGNAIGRYWPRTSSSDDRCTPSCNYRGPF 752

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
           + SKC+  CG+P Q  YH+PR+W HP  N LV+ EE GGDP+KI+   +    +CSFVSE
Sbjct: 753 NPSKCRTGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATKVCSFVSE 812

Query: 718 ADPP-PVDSWKPNLGVV-SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
             P   ++SW  ++      + +V+L+C +G +I+++ FAS+G P G C S++ G CH  
Sbjct: 813 NYPSIDLESWDKSISDDGKDTAKVQLSCPKGKNISSVKFASFGDPSGTCRSYQQGRCHHP 872

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             L +V+KAC+    C++ +S    G     CPG+ K LA+EA CS
Sbjct: 873 SSLSVVEKACLNINSCTVSLSDE--GFGKDLCPGVAKTLAIEADCS 916


>gi|61162201|dbj|BAD91082.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 854

 Score =  875 bits (2260), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/828 (51%), Positives = 560/828 (67%), Gaps = 26/828 (3%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A+VI+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+V+ETYVFWN HEP  
Sbjct: 28  VTYDRKAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVHEPTP 87

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLVRF+KT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT N P
Sbjct: 88  GNYNFEGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+ LMK E+LF SQGGPIIL+Q+ENEYG     +G  G  Y+ WAA+ A
Sbjct: 148 FKRAMQGFTQKIVGLMKSESLFESQGGPIILSQIENEYGAQSKLFGAAGHNYITWAAEMA 207

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V L+T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP +WTE +SGWF  FG  + 
Sbjct: 208 VGLDTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPYKPTIWTETWSGWFTEFGGPIH 267

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPV+DLA+AVA F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG IRQ
Sbjct: 268 QRPVQDLAYAVATFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLIRQ 327

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL+ELHKAIK+CE  L+S+DP    LG   +A++Y   S DC+AFL+N+DS S A 
Sbjct: 328 PKYGHLKELHKAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSKSAAR 387

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V FN   Y LP WS+SILPDC+NVVFNTAKV  Q +       Q    N  +L+  ++  
Sbjct: 388 VMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQTSQ-----MQMLPTNIPMLSWESYD- 441

Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESL 479
            E+   +  + +   P L EQIN T+D++DYLWY  S+ +   +     G+   L ++S 
Sbjct: 442 -EDLTSMDDSSTMTAPGLLEQINVTRDSTDYLWYITSVDIDSSESFLHGGELPTLIVQST 500

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
           GHA  +F+N +L    +G  +   F    K+ L  G N + +LS+ VGL N G  F+   
Sbjct: 501 GHAVHIFINGQLTGSAFGTRESRRFTYTGKVNLRAGTNKIALLSVAVGLPNVGGHFEAWN 560

Query: 540 AGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS- 597
            G+   V L  L  GK DLS  +W YQVG++GE + L   +  +S  W  GS +   K  
Sbjct: 561 TGILGPVALHGLNQGKWDLSWQKWTYQVGLKGEAMNLVSQNAFSSVEWISGSLIAQKKQQ 620

Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L W+KT F  PEG  PLAL++  MGKGQ W+NGQSIGRYW+A+   + G    C Y G 
Sbjct: 621 PLTWHKTIFNEPEGSEPLALDMEGMGKGQIWINGQSIGRYWTAF---ANGNCNGCSYAGG 677

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           +  +KCQ  CG+P Q  YH+PR+W+ P +NLLV+ EELGGDPS+ISL+ +    +CS V+
Sbjct: 678 FRPTKCQSGCGKPTQRYYHVPRSWLKPTQNLLVLFEELGGDPSRISLVKRAVSSVCSEVA 737

Query: 717 EADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           E   P + +W   + G V    SP+V L C  G  I++I FAS+G P G CGS++ G CH
Sbjct: 738 EYH-PTIKNWHIESYGKVEDFHSPKVHLRCNPGQAISSIKFASFGTPLGTCGSYQEGTCH 796

Query: 774 MDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                 +VQK C+G+  C++ +S++  G     CP +LK L+VEA C+
Sbjct: 797 ATTSYSVVQKKCIGKQRCAVTISNSNFG---DPCPKVLKRLSVEAVCA 841


>gi|297798272|ref|XP_002867020.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312856|gb|EFH43279.1| hypothetical protein ARALYDRAFT_491000 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 853

 Score =  874 bits (2257), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/831 (52%), Positives = 555/831 (66%), Gaps = 24/831 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +AL+I+G+RR+L SGSIHYPRSTP++W  LI+K+K+GG++VIETYVFWN H
Sbjct: 26  VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGIDVIETYVFWNLH 85

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 86  EPTPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 145

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK F  +I++LMK ENLF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 146 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+   T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF  FG
Sbjct: 206 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 266 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IR+PK+GHL+ELH+AIK+CE+ L+S+DP    +G K +AH+Y   S DC+AFLANYD+ 
Sbjct: 326 LIREPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+N VFNTAKV  Q +  +      KN         
Sbjct: 386 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKNFQWQ----- 440

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
             S+ E+   +  + +F    L EQIN T+DTSDYLWY  S+ +   +     G+   L 
Sbjct: 441 --SYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSVDIGDTESFLHGGELPTLI 498

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I+S GHA  +FVN +L    +G      F    KI L+ G N + +LS+ VGL N G  F
Sbjct: 499 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 558

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
           +    G+   V L  L  GKRDLS  +W YQVG++GE + L   +   S  W   S T+ 
Sbjct: 559 ESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLAFPTNTRSIGWMDASLTVQ 618

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+KT F APEG  PLAL++  MGKGQ WVNG+SIGRYW+A+   +TG   +C Y
Sbjct: 619 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSQCSY 675

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G+Y  +KCQ  CGQP Q  YH+PR+W+ P +NLLVI EELGG+PS +SL+ ++   +C+
Sbjct: 676 TGTYKPNKCQTGCGQPTQRYYHVPRSWLKPSQNLLVIFEELGGNPSSVSLVKRSVSGVCA 735

Query: 714 FVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            VSE   P + +W+      G     P+V L C  G  IA+I FAS+G P G CGS++ G
Sbjct: 736 EVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQG 794

Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH      I+++ CVG+  C++ +S+   G     CP +LK L VEA C+
Sbjct: 795 ECHAATSYAILERKCVGKARCAVTISNTNFG--KDPCPNVLKRLTVEAVCA 843


>gi|15231354|ref|NP_187988.1| beta galactosidase 1 [Arabidopsis thaliana]
 gi|75274602|sp|Q9SCW1.1|BGAL1_ARATH RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
           Precursor
 gi|6686874|emb|CAB64737.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|9294020|dbj|BAB01923.1| beta-galactosidase [Arabidopsis thaliana]
 gi|332641886|gb|AEE75407.1| beta galactosidase 1 [Arabidopsis thaliana]
          Length = 847

 Score =  874 bits (2257), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/829 (52%), Positives = 555/829 (66%), Gaps = 20/829 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +S +V+YD RA+ I+GKRR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+VI+TYVFWN H
Sbjct: 30  VSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGH 89

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLV+FVK VQ++GL+LHLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 90  EPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 149

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M+RF  KI+++MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y  WA
Sbjct: 150 DNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWA 209

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 210 AKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 270 GPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 329

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             RQPKWGHL++LH+AIKLCE  L+S +PT   LG   EAH+Y   S  C+AFLANY+  
Sbjct: 330 LERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPK 389

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V+F  N Y LP WS+SILPDCKN V+NTA+V +Q        ++ K V   +    
Sbjct: 390 SYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGL 442

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++  Y E      + SF    L EQINTT+DTSDYLWY   + V   +     G    L 
Sbjct: 443 SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLT 502

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VF+N +L    YG+ D       K + L  G N + ILS+ VGL N G  F
Sbjct: 503 VLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V L  L  G+RDLS  +W Y+VG++GE + L  +S ++S  W +G+ +  
Sbjct: 563 ETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLA+++ SMGKGQ W+NGQS+GR+W AY A   G   +C Y 
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYT 680

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G++   KC ++CG+ +Q  YH+PR+W+ P  NLLV+ EE GGDP+ I+L+ +    +C+ 
Sbjct: 681 GTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCAD 740

Query: 715 VSEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + E     V+      G V+    P+  L C  G  I  + FAS+G PEG CGS+R G+C
Sbjct: 741 IYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSC 800

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H         K CVGQ  CS+ V+    G     CP ++K LAVEA C+
Sbjct: 801 HAHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 847


>gi|114217393|dbj|BAF31232.1| beta-D-galactosidase [Persea americana]
          Length = 889

 Score =  873 bits (2256), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 448/857 (52%), Positives = 574/857 (66%), Gaps = 49/857 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+IDGKRR+L S  IHYPR+TPE+WP+LI KSKEGG ++I+TY FWN HEPI
Sbjct: 30  NVSYDHRALIIDGKRRMLISSGIHYPRATPEMWPDLIAKSKEGGADLIQTYAFWNGHEPI 89

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY FEGR+D+V+F+K    AGL+ HLRIGPY CAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 90  RGQYNFEGRYDIVKFIKLAGSAGLYFHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 149

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P+K+EM+RF+ KI+DLM+QE LF+ QGGPIIL Q+ENEYGN+E  YG  G+ YVKWAAD 
Sbjct: 150 PYKDEMQRFVKKIVDLMRQEMLFSWQGGPIILLQIENEYGNIERLYGQRGKDYVKWAADM 209

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMC+Q DAP+ II+ CN FYCDGF PNS  KP +WTE+++GW+ S+G  V
Sbjct: 210 AIGLGAGVPWVMCRQTDAPENIIDACNAFYCDGFKPNSYRKPALWTEDWNGWYTSWGGRV 269

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVED AFAVARFF+ GG++ NYYM+FGGTNFGRT+GGP   TSYDYDAPIDEYG + 
Sbjct: 270 PHRPVEDNAFAVARFFQRGGSYHNYYMFFGGTNFGRTSGGPFYVTSYDYDAPIDEYGLLS 329

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSS-------------N 348
           QPKWGHL++LH AIKLCE  L++ D  P + +LG   EAH+Y  SS              
Sbjct: 330 QPKWGHLKDLHSAIKLCEPALVAVDDAPQYIRLGPMQEAHVYRHSSYVEDQSSSTLGNGT 389

Query: 349 DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ 408
            C+AFLAN D  + ANV F G VY LP WSVSILPDCKNV FNTAKV SQ +     F+ 
Sbjct: 390 LCSAFLANIDEHNSANVKFLGQVYSLPPWSVSILPDCKNVAFNTAKVASQISVKTVEFSS 449

Query: 409 Q--KNVNE---LLL------ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLW 457
              +N  E   LLL       S+ +   +E +G  G  +F    + E +N TKDTSDYLW
Sbjct: 450 PFIENTTEPGYLLLHDGVHHISTNWMILKEPIGEWGGNNFTAEGILEHLNVTKDTSDYLW 509

Query: 458 YTASIHVMPG-----QGKEVF--LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKI 510
           Y   +H+        +  EV   L I+S+     +FVN +L     G+H      + + +
Sbjct: 510 YIMRLHISDEDISFWEASEVSPKLIIDSMRDVVRIFVNGQLA----GSHVGRWVRVEQPV 565

Query: 511 ELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVE 569
           +L +G N L ILS  VGLQNYGA+ +  GAG    I L  LK+G+ DL++  W+YQVG+ 
Sbjct: 566 DLVQGYNELAILSETVGLQNYGAFLEKDGAGFKGQIKLTGLKSGEYDLTNSLWVYQVGLR 625

Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
           GE++ +  +    S+ W       V  +  WYKT F AP+GK P++L L SMGKGQAWVN
Sbjct: 626 GEFMKIFSLEEHESADWVDLPNDSVPSAFTWYKTFFDAPQGKDPVSLYLGSMGKGQAWVN 685

Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
           G SIGRYWS  +AP  GC + CDYRG+Y  SKC  +CG+P Q+ YHIPR+W+ P +NLLV
Sbjct: 686 GHSIGRYWS-LVAPVDGC-QSCDYRGAYHESKCATNCGKPTQSWYHIPRSWLQPSKNLLV 743

Query: 690 IHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLAC 743
           I EE GG+P +IS+   +   IC+ VSE+  PP+  W         + + ++ P++ L C
Sbjct: 744 IFEETGGNPLEISVKLHSTSSICTKVSESHYPPLHLWSHKDIVNGKVSISNAVPEIHLQC 803

Query: 744 ERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVS 802
           + G  I++I FAS+G P+G+C  F  G CH  +   +V +AC G+  CSI VS+   G  
Sbjct: 804 DNGQRISSIMFASFGTPQGSCQRFSQGDCHAPNSFSVVSEACQGRNNCSIGVSNKVFG-- 861

Query: 803 AGACPGLLKALAVEAHC 819
              C G++K LAVEA C
Sbjct: 862 GDPCRGVVKTLAVEAKC 878


>gi|350537661|ref|NP_001234303.1| beta-galactosidase precursor [Solanum lycopersicum]
 gi|7939619|gb|AAF70822.1|AF154421_1 beta-galactosidase [Solanum lycopersicum]
 gi|4138137|emb|CAA10173.1| ss-galactosidase [Solanum lycopersicum]
          Length = 838

 Score =  873 bits (2256), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/830 (51%), Positives = 567/830 (68%), Gaps = 26/830 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A+V+YDHRA++++G+RR+L SGS+HYPRSTPE+WP +I+K+KEGG++VI+TYVFWN HE
Sbjct: 24  TASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHE 83

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +G+YYFEGR+DLV+F+K V +AGL++HLR+GPYACAEWN+GGFPVWL ++PGI FRT 
Sbjct: 84  PQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTD 143

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F AKI+++MK E L+ +QGGPIIL+Q+ENEYG +EW  G  G+ Y +WAA
Sbjct: 144 NGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAA 203

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN   KP +WTE ++ WF  FG 
Sbjct: 204 KMAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGN 263

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP+RP EDLAF+VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 264 PVPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 323

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHL++LH+AIKLCE  L+S DP    LG + EAH++   +  CAAFLANYD  S
Sbjct: 324 LRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHS 383

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V+F    Y LP WS+SILPDCKN VFNTA++ +Q        AQ K    +   S  
Sbjct: 384 FATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQS-------AQMK----MTPVSRG 432

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
             W  + E+     + SF    L EQINTT+D SDYLWY+  + +   +     GK  +L
Sbjct: 433 LPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWL 492

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VFVN +L    YG+ +      +K + L  G+N + +LS+ VGL N G  
Sbjct: 493 TIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPH 552

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+   AG+   V L  L  GKRDL+  +W Y+VG++GE + L  +S ++S  W +GS + 
Sbjct: 553 FETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLVA 612

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK+TF AP G  PLAL+L +MGKGQ W+NGQS+GRYW  Y A  +G    C+Y
Sbjct: 613 QRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKA--SGNCGACNY 670

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G ++  KC  +CG+ +Q  YH+PR+W++P  NLLV+ EE GG+P  ISL+ +    +C+
Sbjct: 671 AGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVCA 730

Query: 714 FVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
            ++E  P  V+      G V     P+  L+C  G  I +I FAS+G P+G CGSFR G+
Sbjct: 731 DINEWQPQLVNWQMQASGKVDKPLRPKAHLSCASGQKITSIKFASFGTPQGVCGSFREGS 790

Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH        ++ C+GQ  CS+PV+    G     CP ++K L+VE  CS
Sbjct: 791 CHAFHSYDAFERYCIGQNSCSVPVTPEIFG--GDPCPHVMKKLSVEVICS 838


>gi|326503960|dbj|BAK02766.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 845

 Score =  873 bits (2256), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/828 (51%), Positives = 548/828 (66%), Gaps = 21/828 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + VTYDHR+LVI G+RR+L S SIHYPRS P +WP+L+ ++KEGG + IETYVFWN HE 
Sbjct: 29  SGVTYDHRSLVISGRRRLLISASIHYPRSVPAMWPKLVAEAKEGGADCIETYVFWNGHET 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFE RFDLV+F + V++AGLFL LRIGP+  AEWN+GG P WLH+IPG  FRT N
Sbjct: 89  APGKYYFEDRFDLVQFARVVKDAGLFLMLRIGPFVAAEWNFGGVPAWLHYIPGTVFRTNN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  MK F  KI+D+MK++  FASQGG IILAQ+ENEYG  + AYG GG+ Y  WA  
Sbjct: 149 EPFKSHMKSFTTKIVDMMKEQRFFASQGGHIILAQIENEYGYYQQAYGAGGKAYAMWAGS 208

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A   NT VPW+MCQQ D PD +INTCN FYCD F PNSP++P +WTEN+ GWF +FG +
Sbjct: 209 MAQAQNTGVPWIMCQQYDVPDRVINTCNSFYCDQFKPNSPTQPKIWTENWPGWFQTFGES 268

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P RP ED+AF+VARFF  GG+ QNYY+Y GGTNF RTAGGP + TSYDYDAPIDEYG  
Sbjct: 269 NPHRPPEDVAFSVARFFGKGGSVQNYYVYHGGTNFDRTAGGPFITTSYDYDAPIDEYGLR 328

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R PKW HL+ELH++IKLCE  L+  + T   LG + EA +Y   S  C AFLAN DS  D
Sbjct: 329 RLPKWAHLKELHQSIKLCEHSLLFGNSTLLSLGPQQEADVYTDHSGGCVAFLANIDSEKD 388

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             VTF    Y LPAWSVSILPDCKNVVFNTAKV SQ    D        V   L AS   
Sbjct: 389 RVVTFRNRQYDLPAWSVSILPDCKNVVFNTAKVRSQTLMVDM-------VPGTLQASKPD 441

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIE 477
            W  + E++G+     FVR +  + INTTKD++DYLW+T S  V    P  G    LNI+
Sbjct: 442 QWSIFTERIGVWDKNDFVRNEFVDHINTTKDSTDYLWHTTSFDVDRNYPSSGNHPVLNID 501

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA   F+N  L+   YGN   ++F  +  I L  G N + ILSM VGL++ G +++ 
Sbjct: 502 SKGHAVHAFLNNMLIGSAYGNGSESSFSAHMPINLKAGKNEIAILSMTVGLKSAGPYYEW 561

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
            GAGL SV +  +KNG  DLSS  W Y+VG+EGE+ GL K    N+  W+  S  P ++ 
Sbjct: 562 VGAGLTSVNISGMKNGTTDLSSNNWAYKVGLEGEHYGLFKHDQGNNQRWRPQSQPPKHQP 621

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L WYK     P+G  P+ L++ SMGKG  W+NG +IGRYW      +  CT  CDYRG +
Sbjct: 622 LTWYKVNVDVPQGDDPVGLDMQSMGKGLVWLNGNAIGRYWPRTSPTNDRCTTSCDYRGKF 681

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
             +KC+  CG+P Q  YH+PR+W HP  N LV+ EE GGDP+KI+   +    +CSFVSE
Sbjct: 682 SPNKCRVGCGKPTQRWYHVPRSWFHPSGNTLVVFEEQGGDPTKITFSRRVATSVCSFVSE 741

Query: 718 ADPP-PVDSWKPNL---GVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
             P   ++SW  ++   G V++  +V+L+C +G +I+++ FAS+G P G C S++ G+CH
Sbjct: 742 NYPSIDLESWDKSISDDGRVAA--KVQLSCPKGKNISSVKFASFGDPSGTCRSYQQGSCH 799

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             D + +V+KAC+    C++ +S    G     CPG+ K LA+EA CS
Sbjct: 800 HPDSVSVVEKACMNMNSCTVSLSDE--GFGEDPCPGVTKTLAIEADCS 845


>gi|308550948|gb|ADO34788.1| beta-galactosidase STBG3 [Solanum lycopersicum]
          Length = 838

 Score =  873 bits (2255), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/830 (51%), Positives = 567/830 (68%), Gaps = 26/830 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A+V+YDHRA++++G+RR+L SGS+HYPRSTPE+WP +I+K+KEGG++VI+TYVFWN HE
Sbjct: 24  TASVSYDHRAIIVNGQRRILISGSVHYPRSTPEMWPGIIQKAKEGGVDVIQTYVFWNGHE 83

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +G+YYFEGR+DLV+F+K V +AGL++HLR+GPYACAEWN+GGFPVWL ++PGI FRT 
Sbjct: 84  PQQGKYYFEGRYDLVKFIKLVHQAGLYVHLRVGPYACAEWNFGGFPVWLKYVPGISFRTD 143

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F AKI+++MK E L+ +QGGPIIL+Q+ENEYG +EW  G  G+ Y +WAA
Sbjct: 144 NGPFKAAMQKFTAKIVNMMKAERLYETQGGPIILSQIENEYGPMEWELGAPGKSYAQWAA 203

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN   KP +WTE ++ WF  FG 
Sbjct: 204 KMAVGLDTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKIWTEAWTAWFTGFGN 263

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP+RP EDLAF+VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 264 PVPYRPAEDLAFSVAKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 323

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHL++LH+AIKLCE  L+S DP    LG + EAH++   +  CAAFLANYD  S
Sbjct: 324 LRQPKWGHLKDLHRAIKLCEPALVSGDPAVTALGHQQEAHVFRSKAGSCAAFLANYDQHS 383

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V+F    Y LP WS+SILPDCKN VFNTA++ +Q        AQ K    +   S  
Sbjct: 384 FATVSFANRHYNLPPWSISILPDCKNTVFNTARIGAQS-------AQMK----MTPVSRG 432

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
             W  + E+     + SF    L EQINTT+D SDYLWY+  + +   +     GK  +L
Sbjct: 433 LPWQSFNEETSSYEDSSFTVVGLLEQINTTRDVSDYLWYSTDVKIDSREKFLRGGKWPWL 492

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VFVN +L    YG+ +      +K + L  G+N + +LS+ VGL N G  
Sbjct: 493 TIMSAGHALHVFVNGQLAGTAYGSLEKPKLTFSKAVNLRAGVNKISLLSIAVGLPNIGPH 552

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+   AG+   V L  L  GKRDL+  +W Y+VG++GE + L  +S ++S  W +GS + 
Sbjct: 553 FETWNAGVLGPVSLTGLDEGKRDLTWQKWSYKVGLKGEALSLHSLSGSSSVEWVEGSLVA 612

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK+TF AP G  PLAL+L +MGKGQ W+NGQS+GRYW  Y A  +G    C+Y
Sbjct: 613 QRQPLTWYKSTFNAPAGNDPLALDLNTMGKGQVWINGQSLGRYWPGYKA--SGNCGACNY 670

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G ++  KC  +CG+ +Q  YH+PR+W++P  NLLV+ EE GG+P  ISL+ +    +C+
Sbjct: 671 AGWFNEKKCLSNCGEASQRWYHVPRSWLYPTGNLLVLFEEWGGEPHGISLVKREVASVCA 730

Query: 714 FVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
            ++E  P  V+      G V     P+  L+C  G  I +I FAS+G P+G CGSFR G+
Sbjct: 731 DINEWQPQLVNWQMQASGKVDKPLRPKAHLSCAPGQKITSIKFASFGTPQGVCGSFREGS 790

Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH        ++ C+GQ  CS+PV+    G     CP ++K L+VE  CS
Sbjct: 791 CHAFHSYDAFERYCIGQNSCSVPVTPEIFG--GDPCPHVMKKLSVEVICS 838


>gi|20260596|gb|AAM13196.1| galactosidase, putative [Arabidopsis thaliana]
          Length = 847

 Score =  873 bits (2255), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/829 (52%), Positives = 555/829 (66%), Gaps = 20/829 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +S +V+YD RA+ I+GKRR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+VI+TYVFWN H
Sbjct: 30  VSGSVSYDSRAITINGKRRILISGSIHYPRSTPEMWPDLIRKAKEGGLDVIQTYVFWNGH 89

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLV+FVK VQ++GL+LHLRIGPY CAEWN+GGFPVWL +IPGI FRT
Sbjct: 90  EPSPGKYYFEGNYDLVKFVKLVQQSGLYLHLRIGPYVCAEWNFGGFPVWLKYIPGISFRT 149

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M+RF  KI+++MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y  WA
Sbjct: 150 DNGPFKAQMQRFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGAPGRSYTNWA 209

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+Q+DAPDPIIN CNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 210 AKMAVGLGTGVPWVMCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 270 GPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 329

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             RQPKWGHL++LH+AIKLCE  L+S +PT   LG   EAH+Y   S  C+AFLANY+  
Sbjct: 330 LERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPK 389

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V+F  N Y LP WS+SILPDCKN V+NTA+V +Q        ++ K V   +    
Sbjct: 390 SYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGL 442

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++  Y E      + SF    L EQINTT+DTSDYLWY   + V   +     G    L 
Sbjct: 443 SWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLT 502

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  +F+N +L    YG+ D       K + L  G N + ILS+ VGL N G  F
Sbjct: 503 VLSAGHAMHLFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHF 562

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V L  L  G+RDLS  +W Y+VG++GE + L  +S ++S  W +G+ +  
Sbjct: 563 ETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQ 622

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLA+++ SMGKGQ W+NGQS+GR+W AY A   G   +C Y 
Sbjct: 623 KQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYT 680

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G++   KC ++CG+ +Q  YH+PR+W+ P  NLLV+ EE GGDP+ I+L+ +    +C+ 
Sbjct: 681 GTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCAD 740

Query: 715 VSEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + E     V+      G V+    P+  L C  G  I  + FAS+G PEG CGS+R G+C
Sbjct: 741 IYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSC 800

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H         K CVGQ  CS+ V+    G     CP ++K LAVEA C+
Sbjct: 801 HAHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 847


>gi|359476858|ref|XP_002274449.2| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
          Length = 898

 Score =  872 bits (2253), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/834 (51%), Positives = 559/834 (67%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W ++I+K+K+GGL+V+ETYVFWN H
Sbjct: 77  IQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVH 136

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRF++TVQ+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 137 EPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 196

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ LMK E LF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 197 DNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWA 256

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP +WTE +SGWF  FG
Sbjct: 257 ANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPTIWTEAWSGWFNEFG 316

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 317 GPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 376

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPK+GHL+ELH++IKLCE  L+S+DP    LG+  +AH+Y   + DCAAFL+NYD+ 
Sbjct: 377 LVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSDAGDCAAFLSNYDTK 436

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+N VFNTAKV            Q  ++  L   + 
Sbjct: 437 SSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKV----------GVQTAHMEMLPTNAE 486

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y+E +  +  + +F    L EQIN T+D SDYLWY   I +   +     G+  
Sbjct: 487 MLSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGSSESFLRGGELP 546

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L +++ GHA  VF+N +L    +G  ++  F   +K+ L+ G NT+ +LS+ VGL N G
Sbjct: 547 TLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIALLSVAVGLPNVG 606

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  L  GK DLS   W Y+VG++GE + L   +  +S  W QGS 
Sbjct: 607 GHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLVSPNGISSVDWMQGSL 666

Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
               +  L W+K  F APEG  PLAL++  MGKGQ W+NGQSIGRYW+AY   + G  + 
Sbjct: 667 AAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGRYWTAY---ANGNCQG 723

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y   KCQ  CGQP Q  YH+PR+W+ P +NLLV+ EELGGDPS+ISL+ ++   
Sbjct: 724 CSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGGDPSRISLVRRSMTS 783

Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+ V E   P + +W   + G       P+V L C  G  I++I FASYG P G CGSF
Sbjct: 784 VCADVFEYH-PNIKNWHIESYGKTEELHKPKVHLRCGPGQSISSIKFASYGTPLGTCGSF 842

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             G CH  D   IV+K C+G+  C++ +S+     +   CP +LK L+VEA C+
Sbjct: 843 EQGPCHAPDSYAIVEKRCIGRQRCAVTISNT--NFAQDPCPNVLKRLSVEAVCA 894


>gi|326512146|dbj|BAJ96054.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 847

 Score =  872 bits (2253), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/834 (52%), Positives = 556/834 (66%), Gaps = 39/834 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A++I+G+RR+L SGSIHYPRSTPE+W  LI+K+K+GGL+VI+TYVFWN HEP  
Sbjct: 32  VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLV+F+KT Q+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 92  GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+ +MK E LFASQGGPIIL+Q+ENEYG  E  +G  G+ Y  WAA  A
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V L+T VPWVMC+QEDAPDP+IN CNGFYCD FTPN+PSKP MWTE ++GWF  FG  + 
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDL+FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG  R+
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL+ELHKAIKLCE+ L+S DPT   LG+  EAH+Y +S + CAAFLANY+S+S A 
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVY-RSPSGCAAFLANYNSNSHAK 390

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           + F+   Y LP WS+SILPDCK VV+NTA V            Q   +      +S+  W
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATV----------GVQTSQMQMWSDGASSMMW 440

Query: 425 --YEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E+VG ++         L EQ+N T+DTSDYLWY  S+ V P +     GK + L +
Sbjct: 441 ERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTV 500

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  +FVN +L     G  +         ++L  G N + +LS+  GL N G  ++
Sbjct: 501 QSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYE 560

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   V+L  L  G RDL+   W YQVG++GE + L+ +  A+S  W QGS +  N
Sbjct: 561 TWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQN 620

Query: 596 K-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           +  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRY  AY   +TG  K C Y 
Sbjct: 621 QMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAY---ATGDCKDCSYT 677

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           GS+ A KCQ  CGQP Q  YH+P++W+ P  NLLV+ EELGGD SKISL+ ++  ++C+ 
Sbjct: 678 GSFRAIKCQAGCGQPTQRWYHVPKSWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVCAD 737

Query: 715 VSEADPPPVDSW--------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
           VSE   P + +W        KP L       +V L C  G  I+AI FAS+G P G CGS
Sbjct: 738 VSEFH-PSIKNWQTENSGEAKPEL----RRSKVHLRCAPGQSISAIKFASFGTPLGTCGS 792

Query: 767 FRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           F  G CH      V + C+G+  C++ +S    G     CP ++K +AVEA CS
Sbjct: 793 FEQGQCHSTKSQTVLENCIGKQRCAVTISPDNFG--GDPCPNVMKRVAVEAVCS 844


>gi|356522482|ref|XP_003529875.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
          Length = 845

 Score =  872 bits (2252), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/836 (51%), Positives = 564/836 (67%), Gaps = 36/836 (4%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V+YDH+A+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HE
Sbjct: 29  SASVSYDHKAITINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHE 88

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYF G +DLVRF+K VQ+AGL+++LRIGPY CAEWN+GGFPVWL +IPGI FRT 
Sbjct: 89  PSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISFRTD 148

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK +M++F  KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y +WAA
Sbjct: 149 NGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRAYTQWAA 208

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L T VPW+MC+QEDAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF  FG 
Sbjct: 209 HMAVGLGTGVPWIMCKQEDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGG 268

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 269 AVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 328

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            RQPKWGHL++LH+AIKLCE  L+S DPT Q+LG   EAH++   S  CAAFLANY+  S
Sbjct: 329 PRQPKWGHLKDLHRAIKLCEPALVSGDPTVQQLGNYEEAHVFRSKSGACAAFLANYNPQS 388

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V F    Y LP WS+SILP+CK+ V+NTA+V SQ           K     +    +
Sbjct: 389 YATVAFGNQRYNLPPWSISILPNCKHTVYNTARVGSQSTT-------MKMTRVPIHGGLS 441

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
           +  + E+   + + SF    L EQIN T+D SDYLWY+  + +   +     GK   L +
Sbjct: 442 WKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTV 501

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VF+N +L    YG+ +      ++ + L  G+N + +LS+ VGL N G  F+
Sbjct: 502 LSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFE 561

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   + L  L  G+RDL+  +W Y+VG++GE + L  +S ++S  W QG  +   
Sbjct: 562 RWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSRR 621

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQS+GRYW AY A  +G    C+Y G
Sbjct: 622 QPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKA--SGSCGYCNYAG 679

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           +Y+  KC  +CGQ +Q  YH+P +W+ P  NLLV+ EELGGDP+ I L+ +    +C+ +
Sbjct: 680 TYNEKKCGSNCGQASQRWYHVPHSWLKPTGNLLVVFEELGGDPNGIFLVRRDIDSVCADI 739

Query: 716 SEADPPPVDSWKPNL--------GVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCG 765
            E        W+PNL        G V S   P+  L+C  G  I++I FAS+G P G+CG
Sbjct: 740 YE--------WQPNLVSYDMQASGKVRSPVRPKAHLSCGPGQKISSIKFASFGTPVGSCG 791

Query: 766 SFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           ++R G+CH        QK CVGQ  C++ VS    G     CP ++K L+VEA C+
Sbjct: 792 NYREGSCHAHKSYDAFQKNCVGQSWCTVTVSPEIFG--GDPCPSVMKKLSVEAICT 845


>gi|297735069|emb|CBI17431.3| unnamed protein product [Vitis vinifera]
          Length = 845

 Score =  871 bits (2251), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/834 (51%), Positives = 559/834 (67%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W ++I+K+K+GGL+V+ETYVFWN H
Sbjct: 24  IQCSVTYDRKAIVINGQRRILISGSIHYPRSTPDMWEDIIQKAKDGGLDVVETYVFWNVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRF++TVQ+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84  EPSPGSYNFEGRYDLVRFIRTVQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ LMK E LF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 144 DNEPFKRAMQGFTEKIVGLMKSERLFESQGGPIILSQIENEYGVQSKLLGDAGHDYMTWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP +WTE +SGWF  FG
Sbjct: 204 ANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNKPYKPTIWTEAWSGWFNEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPLHQRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPK+GHL+ELH++IKLCE  L+S+DP    LG+  +AH+Y   + DCAAFL+NYD+ 
Sbjct: 324 LVRQPKYGHLKELHRSIKLCERALVSADPIVSSLGSFQQAHVYSSDAGDCAAFLSNYDTK 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+N VFNTAKV            Q  ++  L   + 
Sbjct: 384 SSARVMFNNMHYNLPPWSISILPDCRNAVFNTAKV----------GVQTAHMEMLPTNAE 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y+E +  +  + +F    L EQIN T+D SDYLWY   I +   +     G+  
Sbjct: 434 MLSWESYDEDISSLDDSSTFTTLGLLEQINVTRDASDYLWYITRIDIGSSESFLRGGELP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L +++ GHA  VF+N +L    +G  ++  F   +K+ L+ G NT+ +LS+ VGL N G
Sbjct: 494 TLILQTTGHAVHVFINGQLTGSAFGTREYRRFTFTEKVNLHAGTNTIALLSVAVGLPNVG 553

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  L  GK DLS   W Y+VG++GE + L   +  +S  W QGS 
Sbjct: 554 GHFETWNTGILGPVALHGLNQGKWDLSWQRWTYKVGLKGEAMNLVSPNGISSVDWMQGSL 613

Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
               +  L W+K  F APEG  PLAL++  MGKGQ W+NGQSIGRYW+AY   + G  + 
Sbjct: 614 AAQRQQPLTWHKAFFNAPEGDEPLALDMEGMGKGQVWINGQSIGRYWTAY---ANGNCQG 670

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y   KCQ  CGQP Q  YH+PR+W+ P +NLLV+ EELGGDPS+ISL+ ++   
Sbjct: 671 CSYSGTYRPPKCQLGCGQPTQRWYHVPRSWLKPTQNLLVVFEELGGDPSRISLVRRSMTS 730

Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+ V E   P + +W   + G       P+V L C  G  I++I FASYG P G CGSF
Sbjct: 731 VCADVFEYH-PNIKNWHIESYGKTEELHKPKVHLRCGPGQSISSIKFASYGTPLGTCGSF 789

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             G CH  D   IV+K C+G+  C++ +S+     +   CP +LK L+VEA C+
Sbjct: 790 EQGPCHAPDSYAIVEKRCIGRQRCAVTISNT--NFAQDPCPNVLKRLSVEAVCA 841


>gi|326515822|dbj|BAK07157.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 847

 Score =  871 bits (2250), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/834 (52%), Positives = 555/834 (66%), Gaps = 39/834 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A++I+G+RR+L SGSIHYPRSTPE+W  LI+K+K+GGL+VI+TYVFWN HEP  
Sbjct: 32  VTYDRKAVLINGQRRILFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIQTYVFWNGHEPTP 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLV+F+KT Q+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 92  GSYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 151

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+ +MK E LFASQGGPIIL+Q+ENEYG  E  +G  G+ Y  WAA  A
Sbjct: 152 FKAAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEEKEFGAAGKSYSDWAAKMA 211

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V L+T VPWVMC+QEDAPDP+IN CNGFYCD FTPN+PSKP MWTE ++GWF  FG  + 
Sbjct: 212 VGLDTGVPWVMCKQEDAPDPVINACNGFYCDAFTPNTPSKPTMWTEAWTGWFTEFGGTIR 271

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDL+FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG  R+
Sbjct: 272 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 331

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL+ELHKAIKLCE+ L+S DPT   LG+  EAH+Y +S + CAAFLANY+S+S A 
Sbjct: 332 PKYGHLKELHKAIKLCEQALVSVDPTVTSLGSMQEAHVY-RSPSGCAAFLANYNSNSHAK 390

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           + F+   Y LP WS+SILPDCK VV+NTA V            Q   +      +S+  W
Sbjct: 391 IVFDNEHYSLPPWSISILPDCKTVVYNTATV----------GVQTSQMQMWSDGASSMMW 440

Query: 425 --YEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E+VG ++         L EQ+N T+DTSDYLWY  S+ V P +     GK + L +
Sbjct: 441 ERYDEEVGSLAAAPLLTTTGLLEQLNATRDTSDYLWYMTSVDVSPSEKSLQGGKPLSLTV 500

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  +FVN +L     G  +         ++L  G N + +LS+  GL N G  ++
Sbjct: 501 QSAGHALHIFVNGQLQGSASGTREDKRISYKGDVKLRAGTNKISLLSVACGLPNIGVHYE 560

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   V+L  L  G RDL+   W YQVG++GE + L+ +  A+S  W QGS +  N
Sbjct: 561 TWNTGVNGPVVLHGLDEGSRDLTWQTWTYQVGLKGEQMNLNSLEGASSVEWMQGSLIAQN 620

Query: 596 K-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           +  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRY  AY   +TG  K C Y 
Sbjct: 621 QMPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYSLAY---ATGDCKDCSYT 677

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           GS+ A KCQ  CGQP Q  YH+P+ W+ P  NLLV+ EELGGD SKISL+ ++  ++C+ 
Sbjct: 678 GSFRAIKCQAGCGQPTQRWYHVPKPWLQPTRNLLVVFEELGGDTSKISLVKRSVSNVCAD 737

Query: 715 VSEADPPPVDSW--------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
           VSE   P + +W        KP L       +V L C  G  I+AI FAS+G P G CGS
Sbjct: 738 VSEFH-PSIKNWQTENSGEAKPEL----RRSKVHLRCAPGQSISAIKFASFGTPLGTCGS 792

Query: 767 FRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           F  G CH      V + C+G+  C++ +S    G     CP ++K +AVEA CS
Sbjct: 793 FEQGQCHSTKSQTVLENCIGKQRCAVTISPDNFG--GDPCPNVMKRVAVEAVCS 844


>gi|15081596|gb|AAK81874.1| putative beta-galactosidase BG1 [Vitis vinifera]
          Length = 854

 Score =  870 bits (2249), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/834 (51%), Positives = 557/834 (66%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W +LIRK+K+GGL+VI+TY+FWN H
Sbjct: 25  IQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL F+PGI FRT
Sbjct: 85  EPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 145 NNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE +SGWF  FG
Sbjct: 205 AKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELHKAIKLCE  ++S+DPT   LG+  +AH++     +CAAFL+NY+  
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LPAWS+SILPDC+ VVFNTA+V            Q  ++      S 
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARV----------GVQTSHMRMFPTNSK 434

Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E +   G+  +     L EQIN T+D++DYLWY  S+++   +     G+  
Sbjct: 435 LHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTP 494

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  VF+N +     YG  +   F       L+ G N + +LS+ VGL N G
Sbjct: 495 TLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVG 554

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V+L  +  GKRDLS  +W YQVG++GE + L   +  ++  W +GS 
Sbjct: 555 LHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSL 614

Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
               +  L WYK  F APEG  PLAL++ SMGKGQ W+NGQSIGRYW AY   + G    
Sbjct: 615 AAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNV 671

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y   KCQ  CG P Q  YH+PR+W+ P +NLL+I EELGGD SKI+L+ +  + 
Sbjct: 672 CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKS 731

Query: 711 ICSFVSEADPPPVDSW---KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+  +E   P +++W    P+         V L C  G  I+ I FAS+G P G CGSF
Sbjct: 732 VCADANEHH-PTLENWHTESPSESEELHQASVHLQCAPGQSISTIMFASFGTPSGTCGSF 790

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH  +   I++K C+GQ +CS+P+S++Y G  A  CP +LK L+VEA CS
Sbjct: 791 QKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFG--ADPCPNVLKRLSVEAACS 842


>gi|147818153|emb|CAN78072.1| hypothetical protein VITISV_013292 [Vitis vinifera]
          Length = 854

 Score =  870 bits (2249), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/834 (51%), Positives = 557/834 (66%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W +LIRK+K+GGL+VI+TY+FWN H
Sbjct: 25  IQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL F+PGI FRT
Sbjct: 85  EPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 145 NNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE +SGWF  FG
Sbjct: 205 AKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELHKAIKLCE  ++S+DPT   LG+  +AH++     +CAAFL+NY+  
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LPAWS+SILPDC+ VVFNTA+V            Q  ++      S 
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARV----------GVQTSHMRMFPTNSK 434

Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E +   G+  +     L EQIN T+D++DYLWY  S+++   +     G+  
Sbjct: 435 LHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTP 494

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  VF+N +     YG  +   F       L+ G N + +LS+ VGL N G
Sbjct: 495 TLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVG 554

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V+L  +  GKRDLS  +W YQVG++GE + L   +  ++  W +GS 
Sbjct: 555 LHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSL 614

Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
               +  L WYK  F APEG  PLAL++ SMGKGQ W+NGQSIGRYW AY   + G    
Sbjct: 615 AAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNV 671

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y   KCQ  CG P Q  YH+PR+W+ P +NLL+I EELGGD SKI+L+ +  + 
Sbjct: 672 CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKS 731

Query: 711 ICSFVSEADPPPVDSW---KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+  +E   P +++W    P+         V L C  G  I+ I FAS+G P G CGSF
Sbjct: 732 VCADANEHH-PTLENWHTESPSESEELHZASVHLQCAPGQSISTIMFASFGTPSGTCGSF 790

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH  +   I++K C+GQ +CS+P+S++Y G  A  CP +LK L+VEA CS
Sbjct: 791 QKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFG--ADPCPNVLKRLSVEAACS 842


>gi|225458151|ref|XP_002280715.1| PREDICTED: beta-galactosidase 3 [Vitis vinifera]
 gi|302142564|emb|CBI19767.3| unnamed protein product [Vitis vinifera]
          Length = 854

 Score =  870 bits (2249), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/834 (51%), Positives = 557/834 (66%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A+VI+G+RR+L SGSIHYPRSTP++W +LIRK+K+GGL+VI+TY+FWN H
Sbjct: 25  IQCSVTYDKKAIVINGQRRILISGSIHYPRSTPDMWEDLIRKAKDGGLDVIDTYIFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL F+PGI FRT
Sbjct: 85  EPSPGNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKFVPGISFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 145 NNEPFKMAMQGFTQKIVHMMKSENLFASQGGPIILSQIENEYGPESRELGAAGHAYINWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE +SGWF  FG
Sbjct: 205 AKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDAFSPNKPYKPRIWTEAWSGWFTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHRRPVQDLAFGVARFIQNGGSFVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELHKAIKLCE  ++S+DPT   LG+  +AH++     +CAAFL+NY+  
Sbjct: 325 LIRQPKYGHLKELHKAIKLCEHAVVSADPTVISLGSYQQAHVFSSGRGNCAAFLSNYNPK 384

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LPAWS+SILPDC+ VVFNTA+V            Q  ++      S 
Sbjct: 385 SSARVIFNNVHYDLPAWSISILPDCRTVVFNTARV----------GVQTSHMRMFPTNSK 434

Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E +   G+  +     L EQIN T+D++DYLWY  S+++   +     G+  
Sbjct: 435 LHSWETYGEDISSLGSSGTMTAGGLLEQINITRDSTDYLWYMTSVNIDSSESFLRRGQTP 494

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  VF+N +     YG  +   F       L+ G N + +LS+ VGL N G
Sbjct: 495 TLTVQSKGHAVHVFINGQYSGSAYGTRENRKFTYTGAANLHAGTNRIALLSIAVGLPNVG 554

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V+L  +  GKRDLS  +W YQVG++GE + L   +  ++  W +GS 
Sbjct: 555 LHFETWKTGILGPVLLHGIDQGKRDLSWQKWSYQVGLKGEAMNLVSPNGVSAVEWVRGSL 614

Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
               +  L WYK  F APEG  PLAL++ SMGKGQ W+NGQSIGRYW AY   + G    
Sbjct: 615 AAQGQQPLKWYKAYFNAPEGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNV 671

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y   KCQ  CG P Q  YH+PR+W+ P +NLL+I EELGGD SKI+L+ +  + 
Sbjct: 672 CSYSGTYRPPKCQHGCGHPTQRWYHVPRSWLKPTQNLLIIFEELGGDASKIALMKRAMKS 731

Query: 711 ICSFVSEADPPPVDSW---KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+  +E   P +++W    P+         V L C  G  I+ I FAS+G P G CGSF
Sbjct: 732 VCADANEHH-PTLENWHTESPSESEELHEASVHLQCAPGQSISTIMFASFGTPSGTCGSF 790

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH  +   I++K C+GQ +CS+P+S++Y G  A  CP +LK L+VEA CS
Sbjct: 791 QKGTCHAPNSQAILEKNCIGQEKCSVPISNSYFG--ADPCPNVLKRLSVEAACS 842


>gi|18419821|ref|NP_568001.1| beta-galactosidase 3 [Arabidopsis thaliana]
 gi|75202767|sp|Q9SCV9.1|BGAL3_ARATH RecName: Full=Beta-galactosidase 3; Short=Lactase 3; Flags:
           Precursor
 gi|6686878|emb|CAB64739.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|15810493|gb|AAL07134.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|20259271|gb|AAM14371.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|332661246|gb|AEE86646.1| beta-galactosidase 3 [Arabidopsis thaliana]
          Length = 856

 Score =  870 bits (2248), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/831 (52%), Positives = 554/831 (66%), Gaps = 24/831 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 29  VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89  EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK F  +I++LMK ENLF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 208

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+   T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF  FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 268

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+AIK+CE+ L+S+DP    +G K +AH+Y   S DC+AFLANYD+ 
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 388

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+N VFNTAKV  Q +  +      KN         
Sbjct: 389 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKNFQWE----- 443

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
             S+ E+   +  + +F    L EQIN T+DTSDYLWY  S+ +   +     G+   L 
Sbjct: 444 --SYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLI 501

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I+S GHA  +FVN +L    +G      F    KI L+ G N + +LS+ VGL N G  F
Sbjct: 502 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 561

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
           +    G+   V L  L  GK DLS  +W YQVG++GE + L   +   S  W   S T+ 
Sbjct: 562 ESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQ 621

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+KT F APEG  PLAL++  MGKGQ WVNG+SIGRYW+A+   +TG    C Y
Sbjct: 622 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSY 678

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G+Y  +KCQ  CGQP Q  YH+PR W+ P +NLLVI EELGG+PS +SL+ ++   +C+
Sbjct: 679 TGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCA 738

Query: 714 FVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            VSE   P + +W+      G     P+V L C  G  IA+I FAS+G P G CGS++ G
Sbjct: 739 EVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQG 797

Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH      I+++ CVG+  C++ +S++  G     CP +LK L VEA C+
Sbjct: 798 ECHAATSYAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 846


>gi|4006924|emb|CAB16852.1| beta-galactosidase like protein [Arabidopsis thaliana]
 gi|7270584|emb|CAB80302.1| beta-galactosidase like protein [Arabidopsis thaliana]
          Length = 853

 Score =  870 bits (2248), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/834 (52%), Positives = 554/834 (66%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 26  VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 85

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 86  EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 145

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK F  +I++LMK ENLF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 146 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+   T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF  FG
Sbjct: 206 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 266 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+AIK+CE+ L+S+DP    +G K +AH+Y   S DC+AFLANYD+ 
Sbjct: 326 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+N VFNTAKV  Q +  +      KN         
Sbjct: 386 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKN--------- 436

Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            F W    E+   +  + +F    L EQIN T+DTSDYLWY  S+ +   +     G+  
Sbjct: 437 -FQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELP 495

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L I+S GHA  +FVN +L    +G      F    KI L+ G N + +LS+ VGL N G
Sbjct: 496 TLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVG 555

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
             F+    G+   V L  L  GK DLS  +W YQVG++GE + L   +   S  W   S 
Sbjct: 556 GHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASL 615

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           T+   + L W+KT F APEG  PLAL++  MGKGQ WVNG+SIGRYW+A+   +TG    
Sbjct: 616 TVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSH 672

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y  +KCQ  CGQP Q  YH+PR W+ P +NLLVI EELGG+PS +SL+ ++   
Sbjct: 673 CSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 732

Query: 711 ICSFVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+ VSE   P + +W+      G     P+V L C  G  IA+I FAS+G P G CGS+
Sbjct: 733 VCAEVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSY 791

Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH      I+++ CVG+  C++ +S++  G     CP +LK L VEA C+
Sbjct: 792 QQGECHAATSYAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 843


>gi|30690633|ref|NP_849506.1| beta-galactosidase 3 [Arabidopsis thaliana]
 gi|332661247|gb|AEE86647.1| beta-galactosidase 3 [Arabidopsis thaliana]
          Length = 855

 Score =  870 bits (2247), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/830 (52%), Positives = 552/830 (66%), Gaps = 23/830 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 29  VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89  EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK F  +I++LMK ENLF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 208

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+   T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF  FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 268

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+AIK+CE+ L+S+DP    +G K +AH+Y   S DC+AFLANYD+ 
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTE 388

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+N VFNTAKV  Q +  +      KN         
Sbjct: 389 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKNFQWE----- 443

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
             S+ E+   +  + +F    L EQIN T+DTSDYLWY  S+ +   +     G+   L 
Sbjct: 444 --SYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLI 501

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I+S GHA  +FVN +L    +G      F    KI L+ G N + +LS+ VGL N G  F
Sbjct: 502 IQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHF 561

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
           +    G+   V L  L  GK DLS  +W YQVG++GE + L   +   S  W   S T+ 
Sbjct: 562 ESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQ 621

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+KT F APEG  PLAL++  MGKGQ WVNG+SIGRYW+A+   +TG    C Y
Sbjct: 622 KPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSY 678

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G+Y  +KCQ  CGQP Q  YH+PR W+ P +NLLVI EELGG+PS +SL+ ++   +C+
Sbjct: 679 TGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCA 738

Query: 714 FVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            VSE   P + +W+      G     P+V L C  G  IA+I FAS+G P G CGS++ G
Sbjct: 739 EVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQG 797

Query: 771 ACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH      + + CVG+  C++ +S++  G     CP +LK L VEA C+
Sbjct: 798 ECHAATSYAILERCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 845


>gi|326534200|dbj|BAJ89450.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 763

 Score =  869 bits (2246), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 425/768 (55%), Positives = 532/768 (69%), Gaps = 18/768 (2%)

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QY FEGR DLVRFVK   +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT N PF
Sbjct: 1   QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K EM+RF  K++  MK   L+ASQGGPIIL+Q+ENEYGN+  +YG  G+ Y++WAA  AV
Sbjct: 61  KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            L+T VPWVMCQQ DAP+P+INTCNGFYCD FTP+ PS+P +WTEN+SGWFLSFG AVP+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG +RQP
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHLR++HKAIK+CE  LI++DP++  LG   EAH+Y KS + CAAFLAN D  SD  V
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVY-KSGSLCAAFLANIDDQSDKTV 299

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNELLLAS 419
           TFNG  Y LPAWSVSILPDCKNVV NTA++ SQ      RN G    A   +  E  LA+
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359

Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVFLN 475
           S++S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V  G+    G +  L 
Sbjct: 360 SSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQSNLP 419

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + SLGH   VF+N KL     G+   +   +   + L  G N +D+LS  VGL NYGA+F
Sbjct: 420 VNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGAFF 479

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           D+ GAG+   + +    G  DLSS EW YQ+G+ GE + L   S A S  W   ++ P N
Sbjct: 480 DLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEA-SPEWVSDNSYPTN 538

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
             L WYK+ F AP G  P+A++   MGKG+AWVNGQSIGRYW   +AP + C   C+YRG
Sbjct: 539 NPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSDCVNSCNYRG 598

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           SY A+KC K CGQP+Q LYH+PR+++ PG N +V+ E+ GG+PSKIS  TK  + +C+ V
Sbjct: 599 SYSATKCLKKCGQPSQILYHVPRSFLQPGSNDIVLFEQFGGNPSKISFTTKQTESVCAHV 658

Query: 716 SEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGAC- 772
           SE  P  +DSW      +  S P +RL C + G  I++I FAS+G P G CGS+  G C 
Sbjct: 659 SEDHPDQIDSWVSSQQKLQRSGPALRLECPKEGQVISSIKFASFGTPSGTCGSYSHGECS 718

Query: 773 HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
               L + Q+ACVG   CS+PVS+   G     C G+ K+L VEA CS
Sbjct: 719 SSQALAVAQEACVGVSSCSVPVSAKNFG---DPCRGVTKSLVVEAACS 763


>gi|34148077|gb|AAQ62586.1| putative beta-galactosidase [Glycine max]
          Length = 909

 Score =  869 bits (2246), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/857 (50%), Positives = 579/857 (67%), Gaps = 48/857 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+++GKRR L S  IHYPR+TPE+WP+LI KSKEGG +VIETYVFWN HEP+
Sbjct: 46  NVSYDHRALILNGKRRFLISAGIHYPRATPEMWPDLIAKSKEGGADVIETYVFWNGHEPV 105

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY FEGR+DLV+FV+     GL+  LRIGPYACAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 106 RGQYNFEGRYDLVKFVRLAASHGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 165

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFKEEMKRF++K+++LM++E LF+ QGGPIIL Q+ENEYGN+E +YG GG+ Y+KWAA  
Sbjct: 166 PFKEEMKRFVSKVVNLMREERLFSWQGGPIILLQIENEYGNIENSYGKGGKEYMKWAAKM 225

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A++L   VPWVMC+Q+DAP  II+TCN +YCDGF PNS +KP MWTEN+ GW+  +G  +
Sbjct: 226 ALSLGAGVPWVMCRQQDAPYDIIDTCNAYYCDGFKPNSHNKPTMWTENWDGWYTQWGERL 285

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAFAVARFF+ GG+FQNYYMYFGGTNFGRTAGGPL  TSYDYDAPIDEYG +R
Sbjct: 286 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTAGGPLQITSYDYDAPIDEYGLLR 345

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYH-------------KSSND 349
           +PKWGHL++LH A+KLCE  L+++D PT+ KLG K EAH+Y              +SS+ 
Sbjct: 346 EPKWGHLKDLHAALKLCEPALVATDSPTYIKLGPKQEAHVYQANVHLEGLNLSMFESSSI 405

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN--------- 400
           C+AFLAN D   +A VTF G  Y +P WSVS+LPDC+N VFNTAKV +Q +         
Sbjct: 406 CSAFLANIDEWKEATVTFRGQRYTIPPWSVSVLPDCRNTVFNTAKVRAQTSVKLVESYLP 465

Query: 401 --NGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
             +   P  Q ++ N+    S ++   +E + I    SF    + E +N TKD SDYLWY
Sbjct: 466 TVSNIFPAQQLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWY 525

Query: 459 TASIHVMPG-----QGKEVF--LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
           +  ++V        +  +V   L I+ +     VF+N +L+    GN       + + ++
Sbjct: 526 STRVYVSDSDILFWEENDVHPKLTIDGVRDILRVFINGQLI----GNVVGHWIKVVQTLQ 581

Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEG 570
              G N L +L+  VGLQNYGA+ +  GAG+   I I   +NG  DLS   W YQVG++G
Sbjct: 582 FLPGYNDLTLLTQTVGLQNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQG 641

Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
           E++        NS  W + +   +  +  WYKT F  P G  P+AL+  SMGKGQAWVNG
Sbjct: 642 EFLKFYSEENENSE-WVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNG 700

Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
           Q IGRYW+  ++P +GC + CDYRG+Y++ KC  +CG+P QTLYH+PR+W+    NLLVI
Sbjct: 701 QHIGRYWTR-VSPKSGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVI 759

Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPV------DSWKPNLGVVSSSPQVRLACE 744
            EE GG+P +IS+   + + IC+ VSE++ PP+      D     +   +  P++ L C+
Sbjct: 760 LEETGGNPFEISVKLHSSRIICAQVSESNYPPLQKLVNADLIGEEVSANNMIPELHLHCQ 819

Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSA 803
           +G  I+++ FAS+G P G+C +F  G CH    + IV +AC G+  CSI +S +  GV  
Sbjct: 820 QGHTISSVAFASFGTPGGSCQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAFGVD- 878

Query: 804 GACPGLLKALAVEAHCS 820
             CPG++K L+VEA C+
Sbjct: 879 -PCPGVVKTLSVEARCT 894


>gi|224094887|ref|XP_002310279.1| predicted protein [Populus trichocarpa]
 gi|222853182|gb|EEE90729.1| predicted protein [Populus trichocarpa]
          Length = 847

 Score =  869 bits (2246), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/834 (51%), Positives = 559/834 (67%), Gaps = 31/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 25  IQCSVTYDRKAIMINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y+FEGR+D+VRF+KT+Q AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85  EPTPGNYHFEGRYDIVRFMKTIQRAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ LMK ENLF SQGGPIIL+Q+ENEYG     +G  G  Y+ WA
Sbjct: 145 DNEPFKRAMQGFTEKIVGLMKAENLFESQGGPIILSQIENEYGVQSKLFGAAGYNYMTWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A+   T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP +WTE +SGWF  FG
Sbjct: 205 ANMAIQTGTGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPTIWTEAWSGWFSEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVA+F + GG+F NYYM+ GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GTIHQRPVQDLAFAVAKFIQKGGSFINYYMFHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH++IK+CE  L+S DP   +LG   + H+Y   S DCAAFLANYD+ 
Sbjct: 325 LIRQPKYGHLKELHRSIKMCERALVSVDPIVTQLGTYQQVHVYSTESGDCAAFLANYDTK 384

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV             Q +  E+L  + 
Sbjct: 385 SAARVLFNNMHYNLPPWSISILPDCRNVVFNTAKV-----------GVQTSQMEMLPTNG 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            FSW  Y+E +  +  + +F    L EQIN T+D SDYLWY  S+ +   +     G+  
Sbjct: 434 IFSWESYDEDISSLDDSSTFTTAGLLEQINVTRDASDYLWYMTSVDIGSSESFLHGGELP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L I+S GHA  +F+N +L    +G  +   F    K+ L  G N + +LS+ VGL N G
Sbjct: 494 TLIIQSTGHAVHIFINGQLSGSAFGTRENRRFTYTGKVNLRPGTNRIALLSVAVGLPNVG 553

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             ++    G+   V L  L  GK DLS  +W YQVG++GE + L       S  W Q S 
Sbjct: 554 GHYESWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLLSPDSVTSVEWMQSSL 613

Query: 592 LPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                + L W+K  F APEG  PLAL++  MGKGQ W+NGQSIGRYW+AY   ++G    
Sbjct: 614 AAQRPQPLTWHKAYFNAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAY---ASGNCNG 670

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G++  +KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGDPS+ISL+ ++   
Sbjct: 671 CSYAGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTNNLLVVFEELGGDPSRISLVKRSLAS 730

Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+ VSE   P + +W+  + G      SP+V L C  G  I +I FAS+G P G CGS+
Sbjct: 731 VCAEVSEFH-PTIKNWQIESYGRAEEFHSPKVHLRCSGGQSITSIKFASFGTPLGTCGSY 789

Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + GACH      I++K C+G+  C++ +S++  G     CP ++K L+VEA C+
Sbjct: 790 QQGACHASTSYAILEKKCIGKQRCAVTISNSNFG--QDPCPNVMKKLSVEAVCA 841


>gi|183238710|gb|ACC60981.1| beta-galactosidase 1 precursor [Petunia x hybrida]
          Length = 842

 Score =  869 bits (2245), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/829 (50%), Positives = 562/829 (67%), Gaps = 26/829 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YDH+A++++G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 29  ASVSYDHKAIIVNGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +G+YYFE R+DLV+F+K V +AGL+++LR+GPYACAEWN+GGFPVWL ++PGI FRT N
Sbjct: 89  EQGKYYFEERYDLVKFIKLVHQAGLYVNLRVGPYACAEWNFGGFPVWLKYVPGISFRTDN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+++MK E L+ SQGGPIIL+Q+ENEYG +E  +G  G+ Y +WAA 
Sbjct: 149 EPFKAAMQKFTTKIVNMMKAERLYESQGGPIILSQIENEYGPLEVRFGEQGKSYAEWAAK 208

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A++L T VPW+MC+Q+DAPDP+INTCNGFYCD F PN   KP +WTE ++ WF  FG  
Sbjct: 209 MALDLGTGVPWLMCKQDDAPDPVINTCNGFYCDYFYPNKAYKPKIWTEAWTAWFTEFGSP 268

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAF VA F +TGG+F NYYMY GGTNFGRTAGGP VATSYDYDAP+DE+G +
Sbjct: 269 VPYRPVEDLAFGVANFIQTGGSFINYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEFGLL 328

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+AIKLCE  L+S DPT   LG   +AH++  +S  CAAFLAN D +S 
Sbjct: 329 RQPKWGHLKDLHRAIKLCEPALVSGDPTVTALGNYQKAHVFRSTSGACAAFLANNDPNSF 388

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILPDCK+ V+NTA+V             Q  + ++  A+  +
Sbjct: 389 ATVAFGNKHYNLPPWSISILPDCKHTVYNTARV-----------GAQSALMKMTPANEGY 437

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           SW  Y ++     + +F    L EQ+NTT+D SDYLWY   + + P +     G   +L 
Sbjct: 438 SWQSYNDQTAFYDDNAFTVVGLLEQLNTTRDVSDYLWYMTDVKIDPSEGFLRSGNWPWLT 497

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S G A  VFVN +L    YG+        +K + L  G+N + +LS+ VGL N G  F
Sbjct: 498 VSSAGDALHVFVNGQLAGTVYGSLKKQKITFSKAVNLRAGVNKISLLSIAVGLPNIGPHF 557

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +    G+   V L  L  GKRDL+  +W Y+VG++GE + L  +S ++S  W +GS +  
Sbjct: 558 ETWNTGVLGPVSLSGLDEGKRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWVEGSLVAQ 617

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQSIGRYW  Y A  +G    C+Y 
Sbjct: 618 RQPLTWYKTTFNAPAGNEPLALDMNSMGKGQVWINGQSIGRYWPGYKA--SGTCDACNYA 675

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G ++  KC  +CG  +Q  YH+PR+W+HP  NLLV+ EE GGDP+ ISL+ +    +C+ 
Sbjct: 676 GPFNEKKCLSNCGDASQRWYHVPRSWLHPTGNLLVVFEEWGGDPNGISLVKRELASVCAD 735

Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           ++E  P  V+      G V     P+  L+C  G  I +I FAS+G P+G CGSF  G+C
Sbjct: 736 INEWQPQLVNWQLQASGKVDKPLRPKAHLSCTSGQKITSIKFASFGTPQGVCGSFSEGSC 795

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H        +K C+GQ  C++PV+    G     CP ++K L+VEA CS
Sbjct: 796 HAHHSYDAFEKYCIGQESCTVPVTPEIFG--GDPCPSVMKKLSVEAVCS 842


>gi|359474925|ref|XP_002263382.2| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
 gi|297744764|emb|CBI38026.3| unnamed protein product [Vitis vinifera]
          Length = 846

 Score =  869 bits (2245), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/831 (51%), Positives = 558/831 (67%), Gaps = 31/831 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYD +AL+I+G+RR+L SGSIHYPRSTP++W  LI+K+K+GGL+ I+TYVFWN HEP 
Sbjct: 26  SVTYDRKALIINGQRRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDAIDTYVFWNLHEPS 85

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G+Y FEGR+DLVRF+K +Q+AGL++HLRIGPY CAEWN+GGFPVWL F+PG+ FRT N 
Sbjct: 86  PGKYNFEGRYDLVRFIKLIQKAGLYVHLRIGPYICAEWNFGGFPVWLKFVPGVSFRTDNE 145

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+RF  KI+ +MK E LF SQGGPII++Q+ENEYG+   A+G  G  Y+ WAA  
Sbjct: 146 PFKMAMQRFTQKIVQMMKNEKLFESQGGPIIISQIENEYGHESRAFGAPGYAYLTWAAKM 205

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV ++T VPWVMC+++DAPDP+INTCNGFYCD F+PN P+KP +WTE +SGWF  F   +
Sbjct: 206 AVAMDTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPNKPTLWTEAWSGWFTEFAGPI 265

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             RPVEDL+FAV RF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR
Sbjct: 266 QQRPVEDLSFAVTRFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 325

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPK+GHL+ELHKAIKLCE  L+S+DP    LG   +A +++  S  CAAFL+NY+ +S A
Sbjct: 326 QPKYGHLKELHKAIKLCERALLSADPAETSLGTYAKAQVFYSESGGCAAFLSNYNPTSAA 385

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            VTFN   Y L  WS+SILPDCKNVVFNTA V  Q +       Q    N  LL+   F+
Sbjct: 386 RVTFNSMHYNLAPWSISILPDCKNVVFNTATVGVQTSQ-----MQMLPTNSELLSWETFN 440

Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
             E+      + +     L EQ+N T+DTSDYLWY+  I +   +     G+   L ++S
Sbjct: 441 --EDISSADDDSTITVVGLLEQLNVTRDTSDYLWYSTRIDISSSESFLHGGQHPTLIVQS 498

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GHA  VF+N  L    +G  +   F     + L  G N + +LS+ VGL N G  F+  
Sbjct: 499 TGHAMHVFINGHLSGSAFGTREDRRFTFTGDVNLQTGSNIISVLSIAVGLPNNGPHFETW 558

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             G+   V+L  L  GK+DLS  +W YQVG++GE + L   ++ ++  W +GS     + 
Sbjct: 559 STGVLGPVVLHGLDEGKKDLSWQKWSYQVGLKGEAMNLVSPNVISNIDWMKGSLFAQKQQ 618

Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK  F AP+G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   + G    C Y G+
Sbjct: 619 PLTWYKAYFDAPDGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---AKGNCSGCSYSGT 675

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           +  +KCQ  CGQP Q  YH+PR+W+ P +NLLV+ EELGGD SKIS + ++   +C+ VS
Sbjct: 676 FRTTKCQFGCGQPTQRWYHVPRSWLKPTQNLLVLFEELGGDASKISFMKRSVTTVCAEVS 735

Query: 717 EADPPPVDSW------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           E   P + +W      +P      S P+V L C  G  I+AI FAS+G P G CG+F+ G
Sbjct: 736 EHH-PNIKNWHIESQERPE---EMSKPKVHLHCASGQSISAIKFASFGTPSGTCGNFQKG 791

Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH      +++K C+GQ +CS+ VSS+     A  CP + K L+VEA C+
Sbjct: 792 TCHAPTSQAVLEKKCIGQQKCSVAVSSSNF---ANPCPNMFKKLSVEAVCA 839


>gi|312283357|dbj|BAJ34544.1| unnamed protein product [Thellungiella halophila]
          Length = 856

 Score =  868 bits (2244), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 434/834 (52%), Positives = 554/834 (66%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +AL+I+G+RR+L SGSIHYPRSTP++W  LI+K+K+GG++VIETYVFWN H
Sbjct: 29  VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGIDVIETYVFWNLH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR DLVRFVK + +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89  EPSPGKYDFEGRNDLVRFVKAIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK F  +I++LMK ENLF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQILGAEGHNYMTWA 208

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+   T VPWVMC+++DAPDP+I+TCNGFYCD F PN P KP +WTE +SGWF  FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVISTCNGFYCDSFAPNKPYKPTIWTEAWSGWFTEFG 268

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+AIK+CE+ L+S+DP    LG K +AH+Y   S DC+AFLANYD+ 
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSTDPVVTSLGNKQQAHVYSSESGDCSAFLANYDTE 388

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+N VFNTAKV            Q   +  L  ++ 
Sbjct: 389 SAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKV----------GVQTSQMEMLPTSTG 438

Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           +F W    E+   +  + +F    L EQIN T+DTSDYLWY  S+ +   +     G+  
Sbjct: 439 SFQWQSYLEDLSSLDDSSTFTTQGLLEQINVTRDTSDYLWYMTSVDIGETESFLHGGELP 498

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L I+S GHA  +FVN +L    +G      F    KI L+ G N + +LS+ VGL N G
Sbjct: 499 TLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYKGKINLHSGTNRIALLSVAVGLPNVG 558

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
             F+    G+   V L  L  GKRDLS  +W YQVG++GE + L   +   S  W   S 
Sbjct: 559 GHFESWNTGILGPVALHGLSQGKRDLSWQKWTYQVGLKGEAMNLAYPTNTPSFGWMDASL 618

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           T+   + L W+KT F APEG  PLAL++  MGKGQ WVNG+SIGRYW+A+   +TG    
Sbjct: 619 TVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCGH 675

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y  +KC   CGQP Q  YH+PR+W+ P +NLLVI EELGG+PS +SL+ ++   
Sbjct: 676 CSYTGTYKPNKCNSGCGQPTQKWYHVPRSWLKPSQNLLVIFEELGGNPSTVSLVKRSVSG 735

Query: 711 ICSFVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+ VSE   P + +W+      G     P+V L C  G  I+AI FAS+G P G CGS+
Sbjct: 736 VCAEVSEYH-PNIKNWQIESYGKGQTFRRPKVHLKCSPGQAISAIKFASFGTPLGTCGSY 794

Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH      I+++ CVG+  C++ +S++  G     CP +LK L VEA C+
Sbjct: 795 QQGDCHAATSYAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 846


>gi|115450935|ref|NP_001049068.1| Os03g0165400 [Oryza sativa Japonica Group]
 gi|122247496|sp|Q10RB4.1|BGAL5_ORYSJ RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
           Precursor
 gi|108706354|gb|ABF94149.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113547539|dbj|BAF10982.1| Os03g0165400 [Oryza sativa Japonica Group]
 gi|215717073|dbj|BAG95436.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 841

 Score =  867 bits (2241), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/833 (51%), Positives = 558/833 (66%), Gaps = 38/833 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A+++DG+RR+L SGSIHYPRSTPE+W  LI K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87  GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG     +G  G+ Y+ WAA  A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF  FG  + 
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG  R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL+ELH+A+KLCE+ L+S+DPT   LG+  EAH++ +SS+ CAAFLANY+S+S A 
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFLANYNSNSYAK 385

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V FN   Y LP WS+SILPDCKNVVFNTA V  Q N           +      +S+  W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQMWADGASSMMW 435

Query: 425 --YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E+V  ++         L EQ+N T+DTSDYLWY  S+ V P +     G  + L +
Sbjct: 436 EKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTV 495

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VF+N +L    YG  +      +    L  G N + +LS+  GL N G  ++
Sbjct: 496 QSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYE 555

Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   ++I  L  G RDL+   W YQVG++GE + L+ +  + S  W QGS +  N
Sbjct: 556 TWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQN 615

Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           +  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   + G  K C Y 
Sbjct: 616 QQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---AEGDCKGCHYT 672

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           GSY A KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGD SKI+L  +T   +C+ 
Sbjct: 673 GSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCAD 732

Query: 715 VSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
           VSE   P + +W+      P       + +V L C  G  I+AI FAS+G P G CG+F+
Sbjct: 733 VSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQ 787

Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G CH ++   +++K C+G   C + +S +  G     CP ++K +AVEA CS
Sbjct: 788 QGECHSINSNSVLEKKCIGLQRCVVAISPSNFG--GDPCPEVMKRVAVEAVCS 838


>gi|350537913|ref|NP_001234317.1| TBG6 protein precursor [Solanum lycopersicum]
 gi|7939625|gb|AAF70825.1|AF154424_1 putative beta-galactosidase [Solanum lycopersicum]
          Length = 845

 Score =  867 bits (2241), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/834 (51%), Positives = 561/834 (67%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A+VI+G+RR+L SGSIHYPRSTPE+W +LI K+KEGGL+V+ETYVFWN H
Sbjct: 24  VHCDVTYDRKAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FR 
Sbjct: 84  EPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRA 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK +  KI++LMK  NLF SQGGPIIL+Q+ENEYG      G  G  Y  WA
Sbjct: 144 DNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L+T VPWVMC++EDAPDP+INTCNGFYCD F PN P KP +WTE +SGWF  FG
Sbjct: 204 ANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPAIWTEAWSGWFSEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVA+F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+A+K+CE+ ++S+DP    LG   +A++Y   +  CAAFL+N D  
Sbjct: 324 LIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSETGGCAAFLSNNDWK 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV            Q   +  L   S 
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSKMEMLPTNSE 433

Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E +    + S +R   L EQIN T+DTSDYLWY  S+ +   +     G+  
Sbjct: 434 MLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDIGSTESFLHGGELP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L +E+ GHA  VF+N +L    +G      F+   K+ L  G N + +LS+ VGL N G
Sbjct: 494 TLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRIALLSVAVGLPNIG 553

Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   + I  L +GK DLS  +W YQVG++GE + L   +  ++  W QGS 
Sbjct: 554 GHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVSTNGISAVDWMQGSL 613

Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           +   +  L W+K  F  PEG  PLAL+++SMGKGQ W+NGQSIGRYW+AY   +TG    
Sbjct: 614 IAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRYWTAY---ATGDCNG 670

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G +   KCQ  CG+P Q  YH+PR+W+ P +NLLV+ EELGGDP++ISL+ ++  +
Sbjct: 671 CQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGGDPTRISLVKRSVTN 730

Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +CS V+E   P + +W+  N G       P+VR+ C  G  I++I FAS+G P G CGSF
Sbjct: 731 VCSNVAEYH-PNIKNWQIENYGKTEEFHLPKVRIHCAPGQSISSIKFASFGTPLGTCGSF 789

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH  D   +V+K C+G+  C++ +S++  G     CP +LK L+VEAHC+
Sbjct: 790 KQGTCHAPDSHAVVEKKCLGRQTCAVTISNSNFG--EDPCPNVLKRLSVEAHCT 841


>gi|61162206|dbj|BAD91084.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 852

 Score =  867 bits (2240), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/831 (52%), Positives = 562/831 (67%), Gaps = 31/831 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             VTYD +A++I+G+RR+L SGSIHYPRSTPE+W  LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 28  TTVTYDKKAILINGQRRLLISGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYVFWNGHEP 87

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G YYFEGR+DLVRF+KTVQ+AGLFLHLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 88  SPGNYYFEGRYDLVRFIKTVQKAGLFLHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 147

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK E LFASQGGPIIL+Q+ENEYG    A G  G+ Y+ WAA 
Sbjct: 148 GPFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPERKALGAPGQNYINWAAK 207

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV L+T VPWVMC+++DAPDP+IN CNGFYCDGFTPN P KP MWTE +SGWFL FG  
Sbjct: 208 MAVGLDTGVPWVMCKEDDAPDPMINACNGFYCDGFTPNKPYKPTMWTEAWSGWFLEFGGT 267

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +  RPV+DLAFAVARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG I
Sbjct: 268 IHHRPVQDLAFAVARFIQRGGSYVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLI 327

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELHKAIKLCE  L+SS+PT   LG   +A++++     CAAFL+N+  S +
Sbjct: 328 RQPKYGHLKELHKAIKLCEHSLLSSEPTVTSLGTYHQAYVFNSGPRRCAAFLSNFH-SVE 386

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A VTFN   Y LP WSVSILPDC+N V+NTAKV            Q  +V  +   S  F
Sbjct: 387 ARVTFNNKHYDLPPWSVSILPDCRNEVYNTAKV----------GVQTSHVQMIPTNSRLF 436

Query: 423 SW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
           SW  Y+E +     RS +    L EQIN T+DTSDYLWY  ++ +       GK+  L +
Sbjct: 437 SWQTYDEDISSVHERSSIPAIGLLEQINVTRDTSDYLWYMTNVDISSSDLSGGKKPTLTV 496

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VFVN +     +G  +   F     + L+ GIN + +LS+ VGL N G  ++
Sbjct: 497 QSAGHALHVFVNGQFSGSAFGTREQRQFTFADPVNLHAGINRIALLSIAVGLPNVGLHYE 556

Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
               G+   + +D L NGK+DL+  +W  +VG++GE + L   + A+S  W ++      
Sbjct: 557 SWKTGIQGPVFLDGLGNGKKDLTLHKWFNKVGLKGEAMNLVSPNGASSVGWIRRSLATQT 616

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            ++L WYK  F AP G  PLAL++  MGKGQ W+NGQSIGRYW AY   + G    C Y 
Sbjct: 617 KQTLKWYKAYFNAPGGNEPLALDMRRMGKGQVWINGQSIGRYWMAY---AKGDCSSCSYI 673

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G++  +KCQ HCG+P Q  YH+PR+W+ P +NL+V+ EELGGDPSKI+L+ ++   +C  
Sbjct: 674 GTFRPTKCQLHCGRPTQRWYHVPRSWLKPTQNLVVVFEELGGDPSKITLVRRSVAGVCGD 733

Query: 715 VSEADPPP----VDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           + E  P      VD  + +  +  +  QV L C  G  I++I FAS+G P G CGSF+ G
Sbjct: 734 LHENHPNAENFDVDGNEDSKTLHQA--QVHLHCAPGQSISSIKFASFGTPSGTCGSFQQG 791

Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH  +   +V+K C+G+  CS+ VS++        CP +LK L+VEA CS
Sbjct: 792 TCHATNSHAVVEKNCIGRESCSVAVSNSTF--ETDPCPNVLKRLSVEAVCS 840


>gi|357454655|ref|XP_003597608.1| Beta-galactosidase [Medicago truncatula]
 gi|124360385|gb|ABN08398.1| D-galactoside/L-rhamnose binding SUEL lectin; Galactose-binding
           like [Medicago truncatula]
 gi|355486656|gb|AES67859.1| Beta-galactosidase [Medicago truncatula]
          Length = 841

 Score =  867 bits (2239), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 422/827 (51%), Positives = 564/827 (68%), Gaps = 20/827 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YD +A+ I+G+ R+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 26  ASVSYDSKAITINGQSRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 85

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFEG +DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 86  SPGKYYFEGNYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGISFRTDN 145

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK +M++F  KI+D+MK + LF SQGGPII++Q+ENEYG +E+  G  G+ Y KWAAD
Sbjct: 146 EPFKFQMQKFTEKIVDMMKADRLFESQGGPIIMSQIENEYGPMEYEIGAPGKSYTKWAAD 205

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV L T VPW+MC+Q+DAPDP+INTCNGFYCD F+PN   KP MWTE ++GWF  FG  
Sbjct: 206 MAVGLGTGVPWIMCKQDDAPDPVINTCNGFYCDYFSPNKDYKPKMWTEAWTGWFTEFGGP 265

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 266 VPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLL 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           +QPKWGHL++LH+AIKL E  LIS DPT  ++G   EAH++   S  CAAFL NY+  + 
Sbjct: 326 QQPKWGHLKDLHRAIKLSEPALISGDPTVTRIGNYQEAHVFKSKSGACAAFLGNYNPKAF 385

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILPDCKN V+NTA+V SQ        AQ K     +    ++
Sbjct: 386 ATVAFGNMHYNLPPWSISILPDCKNTVYNTARVGSQS-------AQMKMTRVPIHGGLSW 438

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIE 477
             + E+   + + SF    L EQ+NTT+D +DYLWY+  + + P +     GK+  L + 
Sbjct: 439 QVFTEQTASTDDSSFTMTGLLEQLNTTRDLTDYLWYSTDVVIDPNEGFLRSGKDPVLTVL 498

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GHA  VF+N +L    YG+ +F     ++ ++L  G+N + +LS+ VGL N G  F+ 
Sbjct: 499 SAGHALHVFINSQLSGTIYGSLEFPKLTFSQNVKLIPGVNKISLLSVAVGLPNVGPHFET 558

Query: 538 AGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             AG+   I ++ L  G+RDLS  +W Y+VG+ GE + L  +  ++S  W QGS +   +
Sbjct: 559 WNAGVLGPITLNGLDEGRRDLSWQKWSYKVGLHGEALSLHSLGGSSSVEWVQGSLVSRMQ 618

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYKTTF AP+G  P AL++ SMGKGQ W+NGQ++GRYW AY A  +G    CDY G+
Sbjct: 619 PLTWYKTTFDAPDGIAPFALDMGSMGKGQVWLNGQNLGRYWPAYKA--SGTCDNCDYAGT 676

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           Y+ +KC+ +CG+ +Q  YH+P +W+ P  NLLV+ EELGGDP+ I L+ +    +C+ + 
Sbjct: 677 YNENKCRSNCGEASQRWYHVPHSWLIPTGNLLVVFEELGGDPNGIFLVRRDIDSVCADIY 736

Query: 717 EADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
           E  P  +       G  +    P+  L+C  G  I++I FAS+G P G+CG+F  G+CH 
Sbjct: 737 EWQPNLISYQMQTSGKTNKPVRPKAHLSCGPGQKISSIKFASFGTPVGSCGNFHEGSCHA 796

Query: 775 -DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                  +K CVGQ  C + VS    G     CP +LK L+VEA C+
Sbjct: 797 HKSYNTFEKNCVGQNSCKVTVSPENFG--GDPCPNVLKKLSVEAICT 841


>gi|114217397|dbj|BAF31234.1| beta-D-galactosidase [Persea americana]
          Length = 849

 Score =  867 bits (2239), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/834 (51%), Positives = 556/834 (66%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+R++L SGSIHYPRSTP++W  L++K+K+GGL+VI+TYVFWN H
Sbjct: 26  IQCSVTYDRKAIIINGQRKILISGSIHYPRSTPDMWEGLMQKAKDGGLDVIQTYVFWNVH 85

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRFVKTVQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 86  EPSPGNYNFEGRYDLVRFVKTVQKAGLYMHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 145

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK E+LF SQGGPIIL+Q+ENEYG+   A G  G  Y+ WA
Sbjct: 146 DNEPFKMAMQGFTEKIVQMMKSESLFESQGGPIILSQIENEYGSESKALGAPGHAYMTWA 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP MWTE +SGWF  FG
Sbjct: 206 AKMAVGLRTGVPWVMCKEDDAPDPVINTCNGFYCDAFTPNKPYKPTMWTEAWSGWFTEFG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             V  RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 266 GTVHERPVEDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+AIKLCE  LIS+DP    LG   ++H++   +  CAAFL+NY+ +
Sbjct: 326 LIRQPKYGHLKELHRAIKLCEPALISADPIVTSLGPYQQSHVFSSGTGGCAAFLSNYNPN 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV  Q +       + K          
Sbjct: 386 SVARVMFNNMHYSLPPWSISILPDCRNVVFNTAKVGVQTSQMHMSAGETK---------- 435

Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y+E +   G+ S +    L EQ+N T+DTSDYLWY  S+ + P +     G+  
Sbjct: 436 LLSWEMYDEDIASLGDNSMITAVGLLEQLNVTRDTSDYLWYMTSVDISPSESSLRGGRPP 495

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  V++N +L    +G+ +   F     + +  GIN + +LS+ V L N G
Sbjct: 496 VLTVQSAGHALHVYINGQLSGSAHGSRENRRFTFTGDVNMRAGINRIALLSIAVELPNVG 555

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             ++    G+   V+L  L  GKRDL+  +W YQVG++GE + L   S  +   W Q S 
Sbjct: 556 LHYESTNTGVLGPVVLHGLDQGKRDLTWQKWSYQVGLKGEAMNLVAPSGISYVEWMQASF 615

Query: 592 LPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                + L WYK  F AP G  PLAL+L SMGKGQ W+NG+SIGRYW+   A + G    
Sbjct: 616 ATQKLQPLTWYKAYFNAPGGDEPLALDLGSMGKGQVWINGESIGRYWT---AAANGDCNH 672

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G+Y A KCQ  CGQP Q  YH+PR+W+ P +NLLVI EE+GGD S ISL+ ++   
Sbjct: 673 CSYAGTYRAPKCQTGCGQPTQRWYHVPRSWLQPTKNLLVIFEEIGGDASGISLVKRSVSS 732

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +C+ VSE   P + +W       S     P+V L C  G  I+AI FAS+G P G CGSF
Sbjct: 733 VCADVSEWH-PTIKNWHIESYGRSEELHRPKVHLRCAMGQSISAIKFASFGTPLGTCGSF 791

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH  +   I++K C+GQ  C++ +S    G     CP ++K +AVEA C+
Sbjct: 792 QQGPCHSPNSHAILEKKCIGQQRCAVTISMNNFG--GDPCPNVMKRVAVEAICT 843


>gi|356526021|ref|XP_003531618.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
          Length = 843

 Score =  866 bits (2238), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/828 (51%), Positives = 560/828 (67%), Gaps = 20/828 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V+YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HE
Sbjct: 27  SASVSYDHKAIIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHE 86

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYF G +DLVRF+K VQ+AGL+++LRIGPY CAEWN+GGFPVWL +IPGI FRT 
Sbjct: 87  PSPGKYYFGGNYDLVRFIKLVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYIPGISFRTD 146

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK +M++F  KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y +WAA
Sbjct: 147 NGPFKFQMEKFTKKIVDMMKAERLFESQGGPIILSQIENEYGPMEYEIGAPGRSYTQWAA 206

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L T VPW+MC+Q+DAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF  FG 
Sbjct: 207 HMAVGLGTGVPWIMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGG 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP EDLAF++ARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 267 AVPHRPAEDLAFSIARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            RQPKWGHL++LH+AIKLCE  L+S D T Q+LG   EAH++   S  CAAFLANY+  S
Sbjct: 327 ARQPKWGHLKDLHRAIKLCEPALVSGDSTVQRLGNYEEAHVFRSKSGACAAFLANYNPQS 386

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V F    Y LP WS+SILP+CK+ V+NTA+V SQ           K     +    +
Sbjct: 387 YATVAFGNQHYNLPPWSISILPNCKHTVYNTARVGSQSTT-------MKMTRVPIHGGLS 439

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
           +  + E+   + + SF    L EQIN T+D SDYLWY+  + +   +     GK   L +
Sbjct: 440 WKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINSNEGFLRNGKNPVLTV 499

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VF+N +L    YG+ +      ++ + L  G+N + +LS+ VGL N G  F+
Sbjct: 500 LSAGHALHVFINNQLSGTAYGSLEAPKLTFSESVRLRAGVNKISLLSVAVGLPNVGPHFE 559

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   + L  L  G+RDL+  +W Y+VG++GE + L  +S ++S  W QG  +   
Sbjct: 560 RWNAGVLGPITLSGLNEGRRDLTWQKWSYKVGLKGEALNLHSLSGSSSVEWLQGFLVSRR 619

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQS+GRYW AY A  +G    C+Y G
Sbjct: 620 QPLTWYKTTFDAPAGVAPLALDMGSMGKGQVWINGQSLGRYWPAYKA--SGSCGYCNYAG 677

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           +Y+  KC  +CG+ +Q  YH+P +W+ P  NLLV+ EELGGDP+ I L+ +    +C+ +
Sbjct: 678 TYNEKKCGSNCGEASQRWYHVPHSWLKPSGNLLVVFEELGGDPNGIFLVRRDIDSVCADI 737

Query: 716 SEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
            E  P  V       G V S   P+  L+C  G  I++I FAS+G P G+CGS+R G+CH
Sbjct: 738 YEWQPNLVSYEMQASGKVRSPVRPKAHLSCGPGQKISSIKFASFGTPVGSCGSYREGSCH 797

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                    K CVGQ  C++ VS    G     CP ++K L+VEA C+
Sbjct: 798 AHKSYDAFLKNCVGQSWCTVTVSPEIFG--GDPCPRVMKKLSVEAICT 843


>gi|57232107|gb|AAW47739.1| beta-galactosidase [Prunus persica]
          Length = 853

 Score =  866 bits (2238), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/831 (51%), Positives = 559/831 (67%), Gaps = 25/831 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD RA+VI+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+V+ETYVFWN H
Sbjct: 24  VQCTVTYDRRAIVINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVVETYVFWNVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y F+GR+DLVRF+KT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84  EPSPGNYNFKGRYDLVRFLKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ LMK E LF SQGGPIIL+Q+ENEYG     +G  G  Y+ WA
Sbjct: 144 DNEPFKRAMQGFTEKIVGLMKSEKLFESQGGPIILSQIENEYGAQSKLFGAAGHNYMTWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F PN P KP +WTE +SGWF  FG
Sbjct: 204 ANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFAPNKPYKPTIWTEAWSGWFSEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLA+AVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 264 GPIHQRPVQDLAYAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+AIK+CE  L+S+DP    LG   +A++Y   S DC+AFL+N+DS 
Sbjct: 324 LIRQPKYGHLKELHRAIKMCERALVSADPIITSLGNFQQAYVYTSESGDCSAFLSNHDSK 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV  Q        +Q   +   +   S
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKVGVQT-------SQMGMLPTNIQMLS 436

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
             S+ E+   +  + +   P L EQIN T+D++DYLWY  S+ +   +     G+   L 
Sbjct: 437 WESYDEDITSLDDSSTITAPGLLEQINVTRDSTDYLWYKTSVDIGSSESFLRGGELPTLI 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           ++S GHA  +F+N +L    +G  +   F    K+ L+ G N + +LS+ VGL N G  F
Sbjct: 497 VQSTGHAVHIFINGQLSGSSFGTRESRRFTYTGKVNLHAGTNRIALLSVAVGLPNVGGHF 556

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +    G+   V L  L  GK DLS  +W YQVG++GE + L   +  +S  W +GS    
Sbjct: 557 EAWNTGILGPVALHGLDQGKWDLSWQKWTYQVGLKGEAMNLVSPNSISSVDWMRGSLAAQ 616

Query: 595 NKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
            +  L W+KT F APEG  PLAL++  MGKGQ W+NGQSIGRYW+A+   + G    C Y
Sbjct: 617 KQQPLTWHKTLFNAPEGDEPLALDMEGMGKGQIWINGQSIGRYWTAF---ANGNCNGCSY 673

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G +   KCQ  CGQP Q +YH+PR+W+ P +NLLVI EE GGDPS+ISL+ ++   +C+
Sbjct: 674 AGGFRPPKCQVGCGQPTQRVYHVPRSWLKPMQNLLVIFEEFGGDPSRISLVKRSVSSVCA 733

Query: 714 FVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            V+E   P + +W   + G      SP+V L C  G  I++I FAS+G P G CGS++ G
Sbjct: 734 EVAEYH-PTIKNWHIESYGKAEDFHSPKVHLRCNPGQAISSIKFASFGTPLGTCGSYQEG 792

Query: 771 ACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH      ++QK C+G+  C++ +S++  G     CP +LK L+VEA C+
Sbjct: 793 TCHAATSYSVLQKKCIGKQRCAVTISNSNFG---DPCPKVLKRLSVEAVCA 840


>gi|308550954|gb|ADO34791.1| beta-galactosidase STBG6 [Solanum lycopersicum]
          Length = 845

 Score =  866 bits (2238), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/834 (51%), Positives = 559/834 (67%), Gaps = 30/834 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD  A+VI+G+RR+L SGSIHYPRSTPE+W +LI K+KEGGL+V+ETYVFWN H
Sbjct: 24  VHCDVTYDREAIVINGQRRLLFSGSIHYPRSTPEMWEDLINKAKEGGLDVVETYVFWNVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FR 
Sbjct: 84  EPSPGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRA 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK +  KI++LMK  NLF SQGGPIIL+Q+ENEYG      G  G  Y  WA
Sbjct: 144 DNEPFKNAMKGYAEKIVNLMKSHNLFESQGGPIILSQIENEYGPQAKVLGAPGHQYSTWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L+T VPWVMC++EDAPDP+INTCNGFYCD F PN P KP  WTE +SGWF  FG
Sbjct: 204 ANMAVGLDTGVPWVMCKEEDAPDPVINTCNGFYCDNFFPNKPYKPATWTEAWSGWFSEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVA+F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPLHQRPVQDLAFAVAQFIQRGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+A+K+CE+ ++S+DP    LG   +A++Y   +  CAAFL+N D  
Sbjct: 324 LIRQPKYGHLKELHRAVKMCEKSIVSADPAITSLGNLQQAYVYSSETGGCAAFLSNNDWK 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV            Q   +  L   S 
Sbjct: 384 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSKMEMLPTNSE 433

Query: 421 AFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E +    + S +R   L EQIN T+DTSDYLWY  S+ +   +     G+  
Sbjct: 434 MLSWETYSEDISALDDSSSIRSFGLLEQINVTRDTSDYLWYITSVDIGSTESFLHGGELP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L +E+ GHA  VF+N +L    +G      F+   K+ L  G N + +LS+ VGL N G
Sbjct: 494 TLIVETTGHAMHVFINGQLSGSAFGTRKNRRFVFKGKVNLRAGSNRIALLSVAVGLPNIG 553

Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   + I  L +GK DLS  +W YQVG++GE + L   +  ++  W QGS 
Sbjct: 554 GHFETWSTGVLGPVAIQGLDHGKWDLSWAKWTYQVGLKGEAMNLVSTNGISAVDWMQGSL 613

Query: 592 LPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           +   +  L W+K  F  PEG  PLAL+++SMGKGQ W+NGQSIGRYW+AY   +TG    
Sbjct: 614 IAQKQQPLTWHKAYFNTPEGDEPLALDMSSMGKGQVWINGQSIGRYWTAY---ATGDCNG 670

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y G +   KCQ  CG+P Q  YH+PR+W+ P +NLLV+ EELGGDP++ISL+ ++  +
Sbjct: 671 CQYSGVFRPPKCQLGCGEPTQKWYHVPRSWLKPTQNLLVLFEELGGDPTRISLVKRSVTN 730

Query: 711 ICSFVSEADPPPVDSWK-PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           +CS V+E   P + +W+  N G       P+VR+ C  G  I++I FAS+G P G CGSF
Sbjct: 731 VCSNVAEYH-PNIKNWQIENYGKTEEFHLPKVRIHCAPGQSISSIKFASFGTPLGTCGSF 789

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           + G CH  D   +V+K C+G+  C++ +S++  G     CP +LK L+VEAHC+
Sbjct: 790 KQGTCHAPDSHAVVEKKCLGRQTCAVTISNSNFG--EDPCPNVLKRLSVEAHCT 841


>gi|302789848|ref|XP_002976692.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
 gi|300155730|gb|EFJ22361.1| hypothetical protein SELMODRAFT_268001 [Selaginella moellendorffii]
          Length = 802

 Score =  866 bits (2237), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 446/822 (54%), Positives = 555/822 (67%), Gaps = 44/822 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHR+L+++GKRR+L SGS+HYPR+TPE+WP +I+K+KEGGL+VIETYVFW+ HEP 
Sbjct: 19  NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQYYFEGR+DLV+FVK VQ+AGL ++LRIGPY CAEWN GGFP+WL  IP I FRT N 
Sbjct: 79  PGQYYFEGRYDLVKFVKLVQQAGLLVNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+ M+ FL KI+++MK+ENLFASQGGPIILAQVENEYGNV+  YG  G  Y+ WAA+ 
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A   NT VPW+MC Q   P+ II+TCNG YCDG+ P    KP MWTE+Y+GWF  +G+ +
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPTLYKKPTMWTESYTGWFTYYGWPL 258

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVED+AFAVARFFE GG+F NYYMYFGGTNFGRT+GGP VA+SYDYDAP+DEYG   
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMYFGGTNFGRTSGGPYVASSYDYDAPLDEYGMQH 318

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
            PKWGHL++LH+ +KL EE ++SS+  H +LG   EAH+Y    N C AFLAN DS +D 
Sbjct: 319 LPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVY-SYGNGCVAFLANVDSMNDT 377

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V F    Y LPAWSVSI+ DCK V FN+AKV S           Q  V  +  + S+ S
Sbjct: 378 VVEFRNVSYSLPAWSVSIVLDCKTVAFNSAKVKS-----------QSAVVSMNPSKSSLS 426

Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           W  ++E VGISG+ SF    L EQ+ TTKDTSDYLWYT       G     +L+IES+  
Sbjct: 427 WTSFDEPVGISGS-SFKAKQLLEQMETTKDTSDYLWYTTRYATGTGS---TWLSIESMRD 482

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
              +FVN +  +  + +       +   I+L  G NT+ +LS  VGLQN+GA+ +   AG
Sbjct: 483 VVHIFVNGQFQSSWHTSKSVLYNSVEAPIKLAPGSNTIALLSATVGLQNFGAFIETWSAG 542

Query: 542 LF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
           L  S+IL  L  G ++LS  EW YQVG++GE + L  +  + S  W   ST    K L W
Sbjct: 543 LSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAVST---KKPLTW 599

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           Y T F AP G  P+AL+LASMGKGQAWVNGQSIGRYW AY A  + C + CDYRGSYD +
Sbjct: 600 YMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGSYDQN 659

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
           KC   CGQ +Q  YH+PR+W+ P  NLLV+ EE GGDPS I  +T++   IC+ V E+ P
Sbjct: 660 KCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICARVYESHP 719

Query: 721 PPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPI 779
             V  W P    V               I+ I FAS G PEG+CGSF+ G+CH  D+   
Sbjct: 720 ASVKLWCPGEKQV---------------ISQIRFASLGNPEGSCGSFKEGSCHTNDLSNT 764

Query: 780 VQKACVGQIECSIPVSSAYLGVSAGACPGLL-KALAVEAHCS 820
           V+KACVGQ  CS+         +  ACPG+  K LAVEA CS
Sbjct: 765 VEKACVGQRSCSLAPD-----FTTSACPGVREKFLAVEALCS 801


>gi|350539595|ref|NP_001234465.1| beta-galactosidase precursor [Solanum lycopersicum]
 gi|1352077|sp|P48980.1|BGAL_SOLLC RecName: Full=Beta-galactosidase; AltName: Full=Acid
           beta-galactosidase; Short=Lactase; AltName:
           Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
 gi|6649906|gb|AAF21626.1|AF023847_1 beta-galactosidase precursor [Solanum lycopersicum]
 gi|971485|emb|CAA58734.1| putative beta-galactosidase/galactanase [Solanum lycopersicum]
 gi|4138139|emb|CAA10174.1| ss-galactosidase [Solanum lycopersicum]
          Length = 835

 Score =  866 bits (2237), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/829 (51%), Positives = 562/829 (67%), Gaps = 26/829 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YDH+A++++G+R++L SGSIHYPRSTPE+WP+LI+K+KEGG++VI+TYVFWN HEP
Sbjct: 22  ASVSYDHKAIIVNGQRKILISGSIHYPRSTPEMWPDLIQKAKEGGVDVIQTYVFWNGHEP 81

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFE R+DLV+F+K VQEAGL++HLRIGPYACAEWN+GGFPVWL ++PGI FRT N
Sbjct: 82  EEGKYYFEERYDLVKFIKVVQEAGLYVHLRIGPYACAEWNFGGFPVWLKYVPGISFRTNN 141

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+D+MK E L+ +QGGPIIL+Q+ENEYG +EW  G  G++Y +WAA 
Sbjct: 142 EPFKAAMQKFTTKIVDMMKAEKLYETQGGPIILSQIENEYGPMEWELGEPGKVYSEWAAK 201

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV+L T VPW+MC+Q+D PDPIINTCNGFYCD FTPN  +KP MWTE ++ WF  FG  
Sbjct: 202 MAVDLGTGVPWIMCKQDDVPDPIINTCNGFYCDYFTPNKANKPKMWTEAWTAWFTEFGGP 261

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RP ED+AFAVARF +TGG+F NYYMY GGTNFGRT+GGP +ATSYDYDAP+DE+G +
Sbjct: 262 VPYRPAEDMAFAVARFIQTGGSFINYYMYHGGTNFGRTSGGPFIATSYDYDAPLDEFGSL 321

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+AIKLCE  L+S DPT   LG   EA ++   S  CAAFLANY+  S 
Sbjct: 322 RQPKWGHLKDLHRAIKLCEPALVSVDPTVTSLGNYQEARVFKSESGACAAFLANYNQHSF 381

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILPDCKN V+NTA+V +Q        AQ K    +   S  F
Sbjct: 382 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------AQMK----MTPVSRGF 430

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           SW  + E      + +F    L EQIN T+D SDYLWY   I + P +     G   +L 
Sbjct: 431 SWESFNEDAASHEDDTFTVVGLLEQINITRDVSDYLWYMTDIEIDPTEGFLNSGNWPWLT 490

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VFVN +L    YG+ +      +  I L  G+N + +LS+ VGL N G  F
Sbjct: 491 VFSAGHALHVFVNGQLAGTVYGSLENPKLTFSNGINLRAGVNKISLLSIAVGLPNVGPHF 550

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V L  L  G RDL+  +W Y+VG++GE + L  +S + S  W +GS +  
Sbjct: 551 ETWNAGVLGPVSLNGLNEGTRDLTWQKWFYKVGLKGEALSLHSLSGSPSVEWVEGSLVAQ 610

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP+G  PLAL++ +MGKGQ W+NGQS+GR+W AY   S+G    C+Y 
Sbjct: 611 KQPLSWYKTTFNAPDGNEPLALDMNTMGKGQVWINGQSLGRHWPAY--KSSGSCSVCNYT 668

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G +D  KC  +CG+ +Q  YH+PR+W++P  NLLV+ EE GGDP  I+L+ +    +C+ 
Sbjct: 669 GWFDEKKCLTNCGEGSQRWYHVPRSWLYPTGNLLVVFEEWGGDPYGITLVKREIGSVCAD 728

Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + E  P  ++  +   G       P+  L C  G  I++I FAS+G PEG CG+F+ G+C
Sbjct: 729 IYEWQPQLLNWQRLVSGKFDRPLRPKAHLKCAPGQKISSIKFASFGTPEGVCGNFQQGSC 788

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H        +K CVG+  CS+ V+    G     C  +LK L+VEA CS
Sbjct: 789 HAPRSYDAFKKNCVGKESCSVQVTPENFG--GDPCRNVLKKLSVEAICS 835


>gi|316995681|emb|CAA07236.2| beta-galactosidase precursor [Cicer arietinum]
          Length = 839

 Score =  865 bits (2235), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/829 (51%), Positives = 569/829 (68%), Gaps = 20/829 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
             A+V+YD++A+ I+G+R++L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 22  FEASVSYDYKAITINGQRKILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGH 81

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLV+F++ VQ+AGL++HLRIGPYACAEWN+GGFPVWL +IPGI FRT
Sbjct: 82  EPSPGKYYFEGNYDLVKFIRLVQQAGLYVHLRIGPYACAEWNFGGFPVWLKYIPGISFRT 141

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M++F  KI+++MK E L+ SQGGPIIL+Q+ENEYG +E+  G  G+ Y +WA
Sbjct: 142 DNGPFKFQMQKFTTKIVNIMKAERLYESQGGPIILSQIENEYGPMEYELGAPGKAYAQWA 201

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+ L T VPWVMC+Q+DAPDP+INTCNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 202 AHMAIGLGTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFTGFG 261

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP RP EDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 262 GTVPHRPAEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL++LH+AIKLCE  L+S+DPT  +LG   EAH++   S  CAAFLANY+  
Sbjct: 322 LLRQPKWGHLKDLHRAIKLCEPALVSADPTVTRLGNYQEAHVFKSKSGACAAFLANYNPH 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S + V F    Y LP WS+SILP+CK+ V+NTA++ SQ        AQ K     +    
Sbjct: 382 SYSTVAFGNQHYNLPPWSISILPNCKHTVYNTARLGSQS-------AQMKMTRVPIHGGL 434

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           ++  + E+   + + SF    L EQIN T+D SDYLWY+  + + P +     GK   L 
Sbjct: 435 SWKAFNEETTTTDDSSFTVTGLLEQINATRDLSDYLWYSTDVVINPDEGYFRNGKNPVLT 494

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA  VF+N +L    YG+ DF     ++ + L  G+N + +LS+ VGL N G  F
Sbjct: 495 VLSAGHALHVFINGQLSGTVYGSLDFPKLTFSESVNLRAGVNKISLLSVAVGLPNVGPHF 554

Query: 536 DVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   I ++ L  G+RDL+  +W Y+VG++GE + L  +S ++S  W QG  +  
Sbjct: 555 ETWNAGVLGPITLNGLNEGRRDLTWQKWSYKVGLKGEDLSLHSLSGSSSVDWLQGYLVSR 614

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQS+GRYW AY A  TG    C+Y 
Sbjct: 615 RQPLTWYKTTFDAPAGVAPLALDMNSMGKGQVWLNGQSLGRYWPAYKA--TGSCDYCNYA 672

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G+Y+  KC  +CG+ +Q  YH+P +W+ P  NLLV+ EELGGDP+ + L+ +    +C+ 
Sbjct: 673 GTYNEKKCGTNCGEASQRWYHVPHSWLKPTGNLLVMFEELGGDPNGVFLVRRDIDSVCAD 732

Query: 715 VSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           + E  P  V       G VS   SP+  L+C  G  I++I FAS+G P G+CG++R G+C
Sbjct: 733 IYEWQPNLVSYQMQASGKVSRPVSPKAHLSCGPGQKISSIKFASFGTPVGSCGNYREGSC 792

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H        Q+ CVGQ  C++ VS    G     CP ++K L+VEA C+
Sbjct: 793 HAHKSYDAFQRNCVGQSSCTVTVSPEIFG--GDPCPNVMKKLSVEAICT 839


>gi|302782774|ref|XP_002973160.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
 gi|300158913|gb|EFJ25534.1| hypothetical protein SELMODRAFT_413650 [Selaginella moellendorffii]
          Length = 805

 Score =  864 bits (2232), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 449/824 (54%), Positives = 556/824 (67%), Gaps = 45/824 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHR+L+++GKRR+L SGS+HYPR+TPE+WP +I+K+KEGGL+VIETYVFW+ HEP 
Sbjct: 19  NVSYDHRSLILNGKRRILLSGSVHYPRATPEMWPGIIQKAKEGGLDVIETYVFWDRHEPS 78

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQYYFEGR+DLV+FVK VQ+AGL ++LRIGPY CAEWN GGFP+WL  IP I FRT N 
Sbjct: 79  PGQYYFEGRYDLVKFVKLVQQAGLLMNLRIGPYVCAEWNLGGFPIWLRDIPHIVFRTDNE 138

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+ M+ FL KI+++MK+ENLFASQGGPIILAQVENEYGNV+  YG  G  Y+ WAA+ 
Sbjct: 139 PFKKYMQSFLTKIVNMMKEENLFASQGGPIILAQVENEYGNVDSHYGEAGVRYINWAAEM 198

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A   NT VPW+MC Q   P+ II+TCNG YCDG+ P    KP MWTE+Y+GWF  +G+ +
Sbjct: 199 AQAQNTGVPWIMCAQSKVPEYIIDTCNGMYCDGWNPILYKKPTMWTESYTGWFTYYGWPI 258

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYM--YFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           P RPVED+AFAVARFFE GG+F NYYM  YFGGTNFGRT+GGP VA+SYDYDAP+DEYG 
Sbjct: 259 PHRPVEDIAFAVARFFERGGSFHNYYMVWYFGGTNFGRTSGGPYVASSYDYDAPLDEYGM 318

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
              PKWGHL++LH+ +KL EE ++SS+  H +LG   EAH+Y    N C AFLAN DS +
Sbjct: 319 QHLPKWGHLKDLHETLKLGEEVILSSEGQHSELGPNQEAHVY-SYGNGCVAFLANVDSMN 377

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  V F    Y LPAWSVSIL DCK V FN+AKV S           Q  V  +  + S 
Sbjct: 378 DTVVEFRNVSYSLPAWSVSILLDCKTVAFNSAKVKS-----------QSAVVSMSPSKST 426

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
            SW  ++E VGISG+ SF    L EQ+ TTKDTSDYLWYT S+    G G   +L+IES+
Sbjct: 427 LSWTSFDEPVGISGS-SFKAKQLLEQMETTKDTSDYLWYTTSVEAT-GTGS-TWLSIESM 483

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
                +FVN +  +  + +       +   I L  G NT+ +LS  VGLQN+GA+ +   
Sbjct: 484 RDVVHIFVNGQFQSSWHTSKSVLYNSVEAPITLAPGSNTIALLSATVGLQNFGAFIETWS 543

Query: 540 AGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
           AGL  S+IL  L  G ++LS  EW YQVG++GE + L  +  + S  W   ST    K L
Sbjct: 544 AGLSGSLILKGLPGGDQNLSKQEWTYQVGLKGEDLKLFTVEGSRSVNWSAVST---EKPL 600

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WY T F AP G  P+AL+LASMGKGQAWVNGQSIGRYW AY A  + C + CDYRGSYD
Sbjct: 601 TWYMTEFDAPPGDDPVALDLASMGKGQAWVNGQSIGRYWPAYKAADSVCPESCDYRGSYD 660

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
            +KC   CGQ +Q  YH+PR+W+ P  NLLV+ EE GGDPS I  +T++   IC+ V E+
Sbjct: 661 QNKCLTGCGQSSQRWYHVPRSWMKPRGNLLVLFEETGGDPSSIDFVTRSTNVICARVYES 720

Query: 719 DPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVL 777
            P  V  W P    V               I+ I FAS G PEG+CGSF+ G+CH  D+ 
Sbjct: 721 HPASVKLWCPGEKQV---------------ISQIRFASLGNPEGSCGSFKEGSCHTNDLS 765

Query: 778 PIVQKACVGQIECSIPVSSAYLGVSAGACPGLL-KALAVEAHCS 820
             V+KACVGQ  CS+         +  ACPG+  K LAVEA CS
Sbjct: 766 NTVEKACVGQRSCSLAPD-----FTISACPGVREKFLAVEALCS 804


>gi|449491392|ref|XP_004158882.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
          Length = 854

 Score =  863 bits (2231), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/833 (51%), Positives = 557/833 (66%), Gaps = 28/833 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+RRVL SGSIHYPRSTPE+W  LI+K+KEGGL+V+ETYVFWN H
Sbjct: 25  VQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DL RF+KT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85  EPSPGNYNFEGRYDLARFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ LMK ENLF SQGGPIIL+Q+ENEYG     +G  G+ Y+ WA
Sbjct: 145 DNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF  FG
Sbjct: 205 AKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 265 GPIHQRPVQDLAFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+A+K+CE+ L+S+DP    LG+  +A++Y   S +CAAFL+NYD+ 
Sbjct: 325 LIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTD 384

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV            Q   +  L   S 
Sbjct: 385 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQLEMLPTNSP 434

Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
              W  Y E V    + + +    L EQIN TKDTSDYLWY  S+ +   +     G+  
Sbjct: 435 MLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSVDIGSTESFLHGGELP 494

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  +F+N +L    +G+ +   F    K+    G NT+ +LS+ VGL N G
Sbjct: 495 TLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVG 554

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
             F+    G+   V L  L  GK DLS  +W Y+VG++GE + L   +  +S  W +GS 
Sbjct: 555 GHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSL 614

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                + L W+K+ F APEG  PLA+++  MGKGQ W+NG SIGRYW+AY   +TG   K
Sbjct: 615 AAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAY---ATGNCDK 671

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C+Y G++   KCQ+ CGQP Q  YH+PR W+ P +NLLV+ EELGG+P+ ISL+ ++   
Sbjct: 672 CNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVTG 731

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
           +C+ VSE  P   +    + G       P+V L C  G+ I +I FAS+G P G CGS++
Sbjct: 732 VCADVSEYHPTLKNWHIESYGKSEDLHRPKVHLKCSAGYSITSIKFASFGTPLGTCGSYQ 791

Query: 769 PGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G CH  +   I++K C+G+  C++ +S+   G     CP +LK L+VE  C+
Sbjct: 792 QGTCHAPMSYDILEKRCIGKQRCAVTISNTNFG--QDPCPNVLKRLSVEVVCA 842


>gi|356564794|ref|XP_003550633.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 839

 Score =  863 bits (2231), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 417/829 (50%), Positives = 559/829 (67%), Gaps = 27/829 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+V++G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 27  VTASVTYDHKAIVVNGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 86

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 87  EPSPGKYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 146

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK+E LF +QGGPII++Q+ENEYG VEW  G  G+ Y KW 
Sbjct: 147 DNEPFKAAMQKFTEKIVSIMKEEKLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWF 206

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           +  AV L+T VPW+MC+Q+D PDP+I+TCNG+YC+ FTPN   KP MWTEN++GW+  FG
Sbjct: 207 SQMAVGLDTGVPWIMCKQQDTPDPLIDTCNGYYCENFTPNKKYKPKMWTENWTGWYTEFG 266

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP ED+AF+VARF + GG+F NYYMY GGTNF RT+ G  +ATSYDYD PIDEYG
Sbjct: 267 GAVPRRPAEDMAFSVARFVQNGGSFVNYYMYHGGTNFDRTSSGLFIATSYDYDGPIDEYG 326

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PKWGHLR+LHKAIKLCE  L+S DPT    G  LE H++ K+S  CAAFLANYD+ 
Sbjct: 327 LLNEPKWGHLRDLHKAIKLCEPALVSVDPTVTWPGNNLEVHVF-KTSGACAAFLANYDTK 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A+V F    Y LP WS+SILPDCK  VFNTA++             Q ++ ++   +S
Sbjct: 386 SSASVKFGNGQYDLPPWSISILPDCKTAVFNTARL-----------GAQSSLMKMTAVNS 434

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AF W    EE    + + S     L EQIN T+D++DYLWY   +++   +     G+  
Sbjct: 435 AFDWQSYNEEPASSNEDDSLTAYALWEQINVTRDSTDYLWYMTDVNIDANEGFIKNGQSP 494

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GH   V +N +L    YG  D      +  ++L  G N + +LS+ VGL N G
Sbjct: 495 VLTVMSAGHVLHVLINDQLSGTVYGGLDSHKLTFSDSVKLRVGNNKISLLSIAVGLPNVG 554

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  L  G RDLS  +W Y++G++GE + L+ +S ++S  W QGS 
Sbjct: 555 PHFETWNAGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGEALNLNTVSGSSSVEWVQGSL 614

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + L WYKTTF  P G  PLAL++ SMGKGQAW+NG+SIGR+W  Y+A   G    C
Sbjct: 615 LAKQQPLAWYKTTFSTPAGNDPLALDMISMGKGQAWINGRSIGRHWPGYIA--RGNCGDC 672

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
            Y G+Y   KC+ +CG+P+Q  YHIPR+W++P  N LV+ EE GGDP+ I+L+ +T   +
Sbjct: 673 YYAGTYTDKKCRTNCGEPSQRWYHIPRSWLNPSGNYLVVFEEWGGDPTGITLVKRTTASV 732

Query: 712 CSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           C+ + +  P   +    + G V   P+  L C  G +I+ I FASYG+P+G CG+FR G+
Sbjct: 733 CADIYQGQPTLKNRQMLDSGKV-VRPKAHLWCPPGKNISQIKFASYGLPQGTCGNFREGS 791

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           CH        QK C+G+  C + V+    G     CPG+ K L++EA C
Sbjct: 792 CHAHKSYDAPQKNCIGKQSCLVTVAPEVFG--GDPCPGIAKKLSLEALC 838


>gi|449460229|ref|XP_004147848.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
 gi|449476862|ref|XP_004154857.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
          Length = 844

 Score =  862 bits (2228), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/832 (51%), Positives = 548/832 (65%), Gaps = 35/832 (4%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +L++K+K+GGL+V++TYVFWN HEP
Sbjct: 27  TTVTYDKKAILINGQRRILISGSIHYPRSTPEMWDDLMQKAKDGGLDVVDTYVFWNVHEP 86

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR+DLVRF+KT Q  GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 87  SPGNYDFEGRYDLVRFIKTAQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDN 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK E LFASQGGPIIL+Q+ENEYG    A G  G  Y+ WAA 
Sbjct: 147 GPFKMAMQGFTQKIVQMMKSEKLFASQGGPIILSQIENEYGPQSKALGAAGHAYMNWAAK 206

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV LNT VPWVMC+++DAPDP+IN+CNGFYCD F+PN P KP +WTE +SGWF  FG  
Sbjct: 207 MAVGLNTGVPWVMCKEDDAPDPVINSCNGFYCDYFSPNKPYKPTLWTEAWSGWFTEFGGP 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           V  RPV+DLAFAVARF + GG+  NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG +
Sbjct: 267 VYGRPVQDLAFAVARFVQKGGSLFNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGML 326

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ LH+AIKLCE  L+SSDPT   LGA  +AH++      CAAFLANY ++S 
Sbjct: 327 RQPKYGHLKNLHRAIKLCEHALVSSDPTVTSLGAYEQAHVFSSGPGRCAAFLANYHTNSA 386

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LPAWS+SILPDCK VVFNTA+V      G H  AQ     ++L   S  
Sbjct: 387 ATVVFNNMRYALPAWSISILPDCKRVVFNTAQV------GVH-IAQ----TQMLPTISKL 435

Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW    E+   + G+       L EQIN T+DTSDYLWY  S+ +   +     G++  L
Sbjct: 436 SWETYNEDTYSLGGSSRMTVAGLLEQINVTRDTSDYLWYMTSVGISSSEAFLRGGQKPTL 495

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
           ++ S GHA  VF+N +     YG+ +   F     I L  G+N + +LS+ VGL N G  
Sbjct: 496 SVRSAGHAVHVFINGQFSGSAYGSREHPAFTYTGPINLRAGMNKIALLSIAVGLPNVGLH 555

Query: 535 FDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+    G+   I I  L  GK+DL+  +W YQVG++GE + L   + A S  W +GS L 
Sbjct: 556 FEKWQTGILGPISISGLNGGKKDLTWQKWSYQVGLKGEAMNLVSPTEATSVDWIKGSLLQ 615

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK +F AP G  PLAL+L SMGKGQAW+NGQSIGRYW AY   + G   +C Y
Sbjct: 616 GQRPLTWYKASFNAPRGNEPLALDLRSMGKGQAWINGQSIGRYWMAY---AKGGCSRCTY 672

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G+Y    C+  CGQP Q  YH+PR+W+ P  N+LV+ EELGGD SKISL+ ++   +C 
Sbjct: 673 AGTYRPPTCENGCGQPTQRWYHVPRSWLKPTNNVLVLFEELGGDASKISLMRRSVTGLCG 732

Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQ---VRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
              E         K +  ++ S+ +   + L C  G  I+AI FAS+G P G CGS++ G
Sbjct: 733 EAVEYHA------KNDSYIIESNEELDSLHLQCNPGQVISAIKFASFGTPSGTCGSYQKG 786

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
            CH  D   I++K C+G   CS+  +    GV    CP  LK L VE  C I
Sbjct: 787 TCHAPDSHAIIEKKCIGLKSCSVSTTRDNFGVD--PCPNELKQLLVEVDCGI 836


>gi|18403090|ref|NP_565755.1| beta galactosidase 9 [Arabidopsis thaliana]
 gi|75265632|sp|Q9SCV3.1|BGAL9_ARATH RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
           Precursor
 gi|6686890|emb|CAB64745.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|20197062|gb|AAC04500.2| putative beta-galactosidase [Arabidopsis thaliana]
 gi|330253650|gb|AEC08744.1| beta galactosidase 9 [Arabidopsis thaliana]
          Length = 887

 Score =  862 bits (2228), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 440/851 (51%), Positives = 565/851 (66%), Gaps = 44/851 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+I GKRR+L S  IHYPR+TPE+W +LI KSKEGG +V++TYVFWN HEP+
Sbjct: 37  NVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKEGGADVVQTYVFWNGHEPV 96

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEGR+DLV+FVK +  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 97  KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNE 156

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+EM++F+ KI+DLM++  LF  QGGPII+ Q+ENEYG+VE +YG  G+ YVKWAA  
Sbjct: 157 PFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMC+Q DAP+ II+ CNG+YCDGF PNS +KP++WTE++ GW+  +G ++
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRTKPVLWTEDWDGWYTKWGGSL 276

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP EDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP   TSYDYDAP+DEYG   
Sbjct: 277 PHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRS 336

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND----CAAFLANYD 358
           +PKWGHL++LH AIKLCE  L+++D P ++KLG+K EAHIYH         CAAFLAN D
Sbjct: 337 EPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHIYHGDGETGGKVCAAFLANID 396

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---------Q 409
               A+V FNG  Y LP WSVSILPDC++V FNTAKV +Q +      A+         Q
Sbjct: 397 EHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQ 456

Query: 410 KNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
           K V +  ++  + SW   +E +GI G  +F    L E +N TKD SDYLW+   I V   
Sbjct: 457 KVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSED 516

Query: 468 Q-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
                   G    ++I+S+     VFVNK+L     G+   A     + +   +G N L 
Sbjct: 517 DISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWVKA----VQPVRFIQGNNDLL 572

Query: 521 ILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
           +L+  VGLQNYGA+ +  GAG      L   KNG  DLS   W YQVG++GE    DKI 
Sbjct: 573 LLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSKSSWTYQVGLKGE---ADKIY 629

Query: 580 LANSSFWKQGSTLPVNKS---LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
               +   + STL  + S    +WYKT F  P G  P+ LNL SMG+GQAWVNGQ IGRY
Sbjct: 630 TVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRY 689

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W+  ++   GC + CDYRG+Y++ KC  +CG+P QT YH+PR+W+ P  NLLV+ EE GG
Sbjct: 690 WNI-ISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGG 748

Query: 697 DPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACERGWHIA 750
           +P KIS+ T T   +C  VSE+  PP+  W         + + S +P+V L CE G  I+
Sbjct: 749 NPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDYINGTMSINSVAPEVHLHCEDGHVIS 808

Query: 751 AINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGL 809
           +I FASYG P G+C  F  G CH  + L IV +AC G+  C I VS+      +  C G 
Sbjct: 809 SIEFASYGTPRGSCDGFSIGKCHASNSLSIVSEACKGRNSCFIEVSNT--AFISDPCSGT 866

Query: 810 LKALAVEAHCS 820
           LK LAV + CS
Sbjct: 867 LKTLAVMSRCS 877


>gi|449464526|ref|XP_004149980.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
          Length = 854

 Score =  862 bits (2227), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/833 (51%), Positives = 557/833 (66%), Gaps = 28/833 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+RRVL SGSIHYPRSTPE+W  LI+K+KEGGL+V+ETYVFWN H
Sbjct: 25  VQCSVTYDRKAILINGQRRVLFSGSIHYPRSTPEMWEGLIQKAKEGGLDVVETYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRF+KT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85  EPSPGNYNFEGRYDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ LMK ENLF SQGGPIIL+Q+ENEYG     +G  G+ Y+ WA
Sbjct: 145 DNEPFKRAMQGFTEKIVGLMKSENLFESQGGPIILSQIENEYGVQSKLFGAAGQNYMTWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF  FG
Sbjct: 205 AKMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDAFSPNRPYKPTMWTEAWSGWFNEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVA F + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 265 GPIHQRPVQDLAFAVALFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELH+A+K+CE+ L+S+DP    LG+  +A++Y   S +CAAFL+NYD+ 
Sbjct: 325 LIRQPKYGHLKELHRAVKMCEKALVSADPIVTSLGSSQQAYVYTSESGNCAAFLSNYDTD 384

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTAKV            Q   +  L   S 
Sbjct: 385 SAARVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQLEMLPTNSP 434

Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
              W  Y E V    + + +    L EQIN TKDTSDYLWY  S+ +   +     G+  
Sbjct: 435 MLLWESYNEDVSAEDDSTTMTASGLLEQINVTKDTSDYLWYITSVDIGSTESFLHGGELP 494

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  +F+N +L    +G+ +   F    K+    G NT+ +LS+ VGL N G
Sbjct: 495 TLIVQSTGHAVHIFINGRLSGSAFGSRENRRFTYTGKVNFRAGRNTIALLSVAVGLPNVG 554

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
             F+    G+   V L  L  GK DLS  +W Y+VG++GE + L   +  +S  W +GS 
Sbjct: 555 GHFETWNTGILGPVALHGLDQGKLDLSWAKWTYKVGLKGEAMNLVSPNGISSVEWMEGSL 614

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                + L W+K+ F APEG  PLA+++  MGKGQ W+NG SIGRYW+AY   +TG   K
Sbjct: 615 AAQAPQPLTWHKSNFDAPEGDEPLAIDMRGMGKGQIWINGVSIGRYWTAY---ATGNCDK 671

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C+Y G++   KCQ+ CGQP Q  YH+PR W+ P +NLLV+ EELGG+P+ ISL+ ++   
Sbjct: 672 CNYAGTFRPPKCQQGCGQPTQRWYHVPRAWLKPKDNLLVVFEELGGNPTSISLVKRSVTG 731

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
           +C+ VSE  P   +    + G       P+V L C  G+ I +I FAS+G P G CGS++
Sbjct: 732 VCADVSEYHPTLKNWHIESYGKSEDLHRPKVHLKCSAGYSITSIKFASFGTPLGTCGSYQ 791

Query: 769 PGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G CH  +   I++K C+G+  C++ +S+   G     CP +LK L+VE  C+
Sbjct: 792 QGTCHAPMSYDILEKRCIGKQRCAVTISNTNFG--QDPCPNVLKRLSVEVVCA 842


>gi|449433177|ref|XP_004134374.1| PREDICTED: beta-galactosidase 9-like [Cucumis sativus]
          Length = 890

 Score =  862 bits (2226), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/856 (49%), Positives = 572/856 (66%), Gaps = 47/856 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+IDGKRR+L S  +HYPR++PE+WP++I KSKEGG +VI++YVFWN HEP 
Sbjct: 32  NVSYDHRALIIDGKRRMLISAGVHYPRASPEMWPDIIEKSKEGGADVIQSYVFWNGHEPT 91

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY F+GR+DLV+F++ V  +GL+LHLRIGPY CAEWN+GGFP+WL  +PGI+FRT N 
Sbjct: 92  KGQYNFDGRYDLVKFIRLVGSSGLYLHLRIGPYVCAEWNFGGFPLWLRDVPGIEFRTDNA 151

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFKEEM+RF+ KI+DL++ E LF  QGGP+I+ QVENEYGN+E +YG  G+ Y+KW  + 
Sbjct: 152 PFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVENEYGNIESSYGKRGQEYIKWVGNM 211

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMCQQ+DAP  IIN+CNG+YCDGF  NSPSKPI WTEN++GWF S+G   
Sbjct: 212 ALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKANSPSKPIFWTENWNGWFTSWGERS 271

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAF+VARFF+  G+FQNYYMYFGGTNFGRTAGGP   TSYDYD+PIDEYG IR
Sbjct: 272 PHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGRTAGGPFYITSYDYDSPIDEYGLIR 331

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSN-------------D 349
           +PKWGHL++LH A+KLCE  L+S+D P + KLG K EAH+YH  S              +
Sbjct: 332 EPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQEAHVYHMKSQTDDLTLSKLGTLRN 391

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHP 405
           C+AFLAN D      V FNG  Y LP WSVSILPDC+NVVFNTAKV +Q +        P
Sbjct: 392 CSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDCQNVVFNTAKVAAQTSIKILELYAP 451

Query: 406 FA-------QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
            +          + NEL + ++++   +E +GI  +++F    + E +N TKD SDYLWY
Sbjct: 452 LSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSDQNFTVKGILEHLNVTKDRSDYLWY 511

Query: 459 TASIHVMPGQGK-------EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
              IHV     +          + I+S+     VFVN KL     G   +  F+  + ++
Sbjct: 512 MTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFVNGKLTGSAIGQ--WVKFV--QPVQ 567

Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEG 570
             EG N L +LS  +GLQN GA+ +  GAG+   I L   KNG  DLS   W YQVG++G
Sbjct: 568 FLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIKLTGFKNGDIDLSKSLWTYQVGLKG 627

Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
           E++    +     + W + S   +  +  WYK  F +P+G  P+A+NL SMGKGQAWVNG
Sbjct: 628 EFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFSSPDGTDPVAINLGSMGKGQAWVNG 687

Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
             IGRYWS  ++P  GC +KCDYRG+Y++ KC  +CG+P Q+ YHIPR+W+    NLLV+
Sbjct: 688 HHIGRYWSV-VSPKDGCPRKCDYRGAYNSGKCATNCGRPTQSWYHIPRSWLKESSNLLVL 746

Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPV----DSWKPNLGVVS--SSPQVRLACE 744
            EE GG+P +I +   +   IC  VSE+  P +    + +  +   +S  ++P++ L C+
Sbjct: 747 FEETGGNPLEIVVKLYSTGVICGQVSESHYPSLRKLSNDYISDGETLSNRANPEMFLHCD 806

Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
            G  I+++ FASYG P+G+C  F  G CH  + L +V +AC+G+  C++ +S++  G   
Sbjct: 807 DGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLSVVSQACLGKNSCTVEISNSAFG--G 864

Query: 804 GACPGLLKALAVEAHC 819
             C  ++K LAVEA C
Sbjct: 865 DPCHSIVKTLAVEARC 880


>gi|414864995|tpg|DAA43552.1| TPA: hypothetical protein ZEAMMB73_935084 [Zea mays]
          Length = 845

 Score =  861 bits (2225), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/828 (52%), Positives = 550/828 (66%), Gaps = 28/828 (3%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYD +A++IDG+RR+L SGSIHYPRSTP++W  LI+K+K+GGL+VI+TYVFWN HEP  G
Sbjct: 30  TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 89

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
            YYFE R+DLVRFVKTVQ+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90  NYYFEERYDLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 149

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG     +G  G+ Y+ WAA  AV
Sbjct: 150 KTAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAV 209

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            L+T VPWVMC++EDAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF  FG  +  
Sbjct: 210 GLDTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 269

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR+P
Sbjct: 270 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREP 329

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           K  HL+ELH+A+KLCE+ L+S DPT   LG   EAH++ +S + CAAFLANY+S+S A V
Sbjct: 330 KHSHLKELHRAVKLCEQALVSVDPTITTLGTMQEAHVF-RSPSGCAAFLANYNSNSHAKV 388

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWY 425
            FN   Y LP WS+SILPDCKNVVFN+A V  Q          Q  +      S  +  Y
Sbjct: 389 VFNNEQYSLPPWSISILPDCKNVVFNSATVGVQ--------TSQMQMWGDGATSMMWERY 440

Query: 426 EEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP------GQGKEVFLNIES 478
           +E+V  ++         L EQ+N T+D+SDYLWY  S+ + P      G GK   L+++S
Sbjct: 441 DEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQS 500

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GHA  VFVN +L    YG  +      N  + L  G N + +LS+  GL N G  ++  
Sbjct: 501 AGHALHVFVNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETW 560

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             G+   V+L  L  G RDL+   W YQVG++GE + L+ +  + S  W QGS +   + 
Sbjct: 561 NTGVGGPVVLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQ 620

Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   + G  K C Y G+
Sbjct: 621 PLAWYKAYFETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---ADGDCKGCSYTGT 677

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL-GGDPSKISLLTKTGQHICSFV 715
           + A KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EEL GGD SKI+L  ++   +C+ V
Sbjct: 678 FRAPKCQAGCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADV 737

Query: 716 SEADPPPVDSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           SE D P +  W+  + G       +V L C  G  I+AI FAS+G P G CG+F+ G CH
Sbjct: 738 SE-DHPNIKKWQIESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCH 796

Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                 +++K C+G   C + +S    G     CP + K +AVEA CS
Sbjct: 797 SASSHAVLEKRCIGLQRCVVAISPDNFG--GDPCPSVTKRVAVEAVCS 842


>gi|20514290|gb|AAM22973.1|AF499737_1 beta-galactosidase [Oryza sativa Japonica Group]
 gi|21070357|gb|AAM34271.1|AF508799_1 beta-galactosidase [Oryza sativa Japonica Group]
          Length = 843

 Score =  860 bits (2223), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/835 (51%), Positives = 557/835 (66%), Gaps = 40/835 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A+++DG+RR+L SGSIHYPRSTPE+W  LI K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87  GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG     +G  G+ Y+ WAA  A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF  FG  + 
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG  R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL+ELH+A+KLCE+ L+S+DPT   LG+  EAH++ +SS+ CAAFLANY+S+S A 
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFLANYNSNSYAK 385

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V FN   Y LP WS+SILPDCKNVVFNTA V  Q N           +      +S+  W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQMWADGASSMMW 435

Query: 425 --YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E+V  ++         L EQ+N T+DTSDYLWY   + V P +     G  + L +
Sbjct: 436 EKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITRVEVDPSEKFLQGGTPLSLTV 495

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VF+N +L    YG  +      +    L  G N + +LS+  GL N G  ++
Sbjct: 496 QSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYE 555

Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIY--QVGVEGEYIGLDKISLANSSFWKQGSTLP 593
               G+   ++I  L  G RDL+   W Y  QVG++GE + L+ +  + S  W QGS + 
Sbjct: 556 TWNTGVVGPVVIHGLDEGSRDLTWQTWSYQFQVGLKGEQMNLNSLEGSGSVEWMQGSLVA 615

Query: 594 VNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
            N+  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   + G  K C 
Sbjct: 616 QNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---AEGDCKGCH 672

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y GSY A KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGD SKI+L  +T   +C
Sbjct: 673 YTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVC 732

Query: 713 SFVSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
           + VSE   P + +W+      P       + +V L C  G  I+AI FAS+G P G CG+
Sbjct: 733 ADVSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFGTPLGTCGT 787

Query: 767 FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           F+ G CH ++   +++K C+G   C + +S +  G     CP ++K +AVEA CS
Sbjct: 788 FQQGECHSINSNSVLEKKCIGLQRCVVAISPSNFG--GDPCPEVMKRVAVEAVCS 840


>gi|449457508|ref|XP_004146490.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
 gi|449500002|ref|XP_004160975.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
          Length = 846

 Score =  860 bits (2223), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/831 (51%), Positives = 558/831 (67%), Gaps = 30/831 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + + VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI K+K GGL+V+ETYVFWN H
Sbjct: 23  IHSTVTYDRKAILINGQRRILFSGSIHYPRSTPEMWEDLILKAKNGGLDVVETYVFWNVH 82

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGRFDLVRF+KT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 83  EPYPGIYNFEGRFDLVRFIKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N  FK  M+ F  KI+ LMK ENLF SQGGPIILAQ+ENEYG     +G  G  Y+ WA
Sbjct: 143 DNEAFKNAMQGFTEKIVALMKSENLFESQGGPIILAQIENEYGTESKLFGEAGYNYMTWA 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L T VPWVMC++ DAPDP+INTCNGFYCD F+PN P KP MWTE ++GWF  FG
Sbjct: 203 ANMAVGLQTGVPWVMCKEADAPDPVINTCNGFYCDTFSPNKPYKPTMWTEAWTGWFSEFG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVARF + GG+  NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 263 GPLHQRPVQDLAFAVARFIQRGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPK+GHL+ELH+AIK+CE  L+S+DP    LG   +AH+Y   S  CAAFL+NYD+ 
Sbjct: 323 LLRQPKYGHLKELHRAIKMCEPALVSADPIVTSLGDYQQAHVYSSESGGCAAFLSNYDTK 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDCKN VFNTAKV            Q   +  L   S+
Sbjct: 383 SFARVLFNNRHYNLPPWSISILPDCKNAVFNTAKV----------GVQTAQMGMLPAEST 432

Query: 421 AFSW--YEEKVGISGNRSFV-RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E +    +RS +  P L EQIN T+DTSDYLWY  S+ +   +     G+  
Sbjct: 433 TLSWESYFEDISALDDRSMMTSPGLLEQINVTRDTSDYLWYITSVDISSSEPFLHGGELP 492

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  VF+N +L     G+     F  + K+ L+ G N + +LS+ VGL N G
Sbjct: 493 TLLVQSTGHAVHVFINGQLSGSVSGSRKSRRFTYSGKVNLHAGTNKIGLLSVAVGLPNVG 552

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS- 590
             F+    G+   V+L  L+ GK DLSS +W Y+VG++GE + L   S  +   W Q S 
Sbjct: 553 GHFETWNTGILGPVVLYGLRQGKWDLSSQKWTYKVGLKGEAMNLISPSGFSPVEWMQASL 612

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                + L W+K  F APEG+ PLAL++  MGKGQ W+NGQSIGRYW+AY   + G   +
Sbjct: 613 AAQTPQPLTWHKAYFDAPEGEEPLALDMEGMGKGQIWINGQSIGRYWTAY---ARGNCSR 669

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C+Y  ++   KCQ  CGQP Q  YH+PR+W+ P +NLLV+ EE+GG+PS+IS++ +    
Sbjct: 670 CNYATAFRPPKCQLGCGQPTQRWYHVPRSWLRPEQNLLVVFEEVGGNPSRISIVKRLVTS 729

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           +C+ VSE   P   +W      +  +P+V L+C+ G +I++I FAS+G P G CGS++ G
Sbjct: 730 VCADVSEFH-PTFKNWHITAKFI--TPKVHLSCDPGQYISSIKFASFGTPLGTCGSYQQG 786

Query: 771 ACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH      I++K CVG+  C++ VS++        CP ++K L+VEA C+
Sbjct: 787 TCHAPSSSGILEKKCVGKQRCAVTVSNSNF---EDPCPNMMKRLSVEAVCN 834


>gi|225433463|ref|XP_002263385.1| PREDICTED: beta-galactosidase 9-like [Vitis vinifera]
          Length = 882

 Score =  860 bits (2223), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/853 (51%), Positives = 570/853 (66%), Gaps = 44/853 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+IDGKRR+L S  IHYPR+TPE+WP+LI KSKEGG +VI+TYVFWN HEP+
Sbjct: 28  NVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNGHEPV 87

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           R QY FEGR+D+V+FVK V  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 88  RRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+EM+RF+ KI+DLM++E LF+ QGGPII+ Q+ENEYGNVE ++G  G+ YVKWAA  
Sbjct: 148 PFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKWAARM 207

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+  VPWVMCQQ DAPD IIN CNGFYCD F PNS +KP +WTE+++GWF S+G   
Sbjct: 208 ALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSANKPKLWTEDWNGWFASWGGRT 267

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVED+AFAVARFF+ GG+F NYYMYFGGTNFGR++GGP   TSYDYDAPIDEYG + 
Sbjct: 268 PKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEYGLLS 327

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYH--------KSSN--DCAA 352
           QPKWGHL+ELH AIKLCE  L++ D P + KLG   EAH+Y         +S N   C+A
Sbjct: 328 QPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEAHVYRVKESLYSTQSGNGSSCSA 387

Query: 353 FLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQ 408
           FLAN D    A+VTF G +Y LP WSVSILPDC+  VFNTAKV +Q +      D P  +
Sbjct: 388 FLANIDEHKTASVTFLGQIYKLPPWSVSILPDCRTTVFNTAKVGAQTSIKTVEFDLPLVR 447

Query: 409 QKNVNELLLASSAFSW-------YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTAS 461
             +V + L+  +  S+        +E + +    +F    + E +N TKD SDYLW    
Sbjct: 448 NISVTQPLMVQNKISYVPKTWMTLKEPISVWSENNFTIQGVLEHLNVTKDHSDYLWRITR 507

Query: 462 IHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
           I+V        +  +V   L+I+S+     +FVN +L+    G+       + + I+L +
Sbjct: 508 INVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVIGHW----VKVVQPIQLLQ 563

Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
           G N L +LS  VGLQNYGA+ +  GAG    V L   KNG+ DLS   W YQVG+ GE+ 
Sbjct: 564 GYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQ 623

Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
            +  I  +  + W   +      +  WYKT F AP G+ P+AL+L SMGKGQAWVNG  I
Sbjct: 624 KIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHI 683

Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
           GRYW+  +AP  GC  KCDYRG Y  SKC  +CG P Q  YHIPR+W+    NLLV+ EE
Sbjct: 684 GRYWTR-VAPKDGC-GKCDYRGHYHTSKCATNCGNPTQIWYHIPRSWLQASNNLLVLFEE 741

Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWH 748
            GG P +IS+ +++ Q IC+ VSE+  P + +W P+  +  +S     P++ L C+ G  
Sbjct: 742 TGGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPSDFIDQNSKNKMTPEMHLQCDDGHT 801

Query: 749 IAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACP 807
           I++I FASYG P+G+C  F  G CH  + L +V KAC G+  C I + ++  G     C 
Sbjct: 802 ISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFG--GDPCR 859

Query: 808 GLLKALAVEAHCS 820
           G++K LAVEA C+
Sbjct: 860 GIVKTLAVEAKCA 872


>gi|449459196|ref|XP_004147332.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
 gi|449497145|ref|XP_004160325.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
          Length = 844

 Score =  860 bits (2222), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/829 (51%), Positives = 538/829 (64%), Gaps = 12/829 (1%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L+ANVTYD R+L+IDG R++L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN H
Sbjct: 18  LAANVTYDRRSLIIDGHRKLLISASIHYPRSVPAMWPSLIQNAKEGGVDVIETYVFWNGH 77

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           E     Y+F+GRFDLV+F+  V  AGL+L LRIGP+  AEWN+GG PVWLH+IP   FRT
Sbjct: 78  ELSPDNYHFDGRFDLVKFINIVHNAGLYLILRIGPFVAAEWNFGGVPVWLHYIPNTVFRT 137

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N  FK  M++F   I+ LMK+E LFASQGGPIIL+QVENEYG++E  YG GG+ Y  WA
Sbjct: 138 DNASFKFYMQKFTTYIVSLMKKEKLFASQGGPIILSQVENEYGDIERVYGEGGKPYAMWA 197

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV+ N  VPW+MCQQ DAPDP+INTCN FYCD FTPNSP+KP MWTEN+ GWF +FG
Sbjct: 198 AQMAVSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFKTFG 257

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 258 ARDPHRPPEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 317

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKWGHL+ELH+AIKL E  L++S+PT+  LG  LEA +Y  SS  CAAF+AN D  
Sbjct: 318 LPRLPKWGHLKELHRAIKLTERVLLNSEPTYVSLGPSLEADVYTDSSGACAAFIANIDEK 377

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLAS 419
            D  V F    Y LPAWSVSILPDCKNVVFNTA + SQ    +  P   Q + +      
Sbjct: 378 DDKTVQFRNISYHLPAWSVSILPDCKNVVFNTAMIRSQTAMVEMVPEELQPSADATNKDL 437

Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
            A  W  + E+ GI G   FV+  L + +NTTKDT+DYLWYT SI V   +    G +  
Sbjct: 438 KALKWEVFVEQPGIWGKADFVKNVLVDHLNTTKDTTDYLWYTTSIFVNENEKFLKGSQPV 497

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L +ES GHA   F+NKKL     GN     F   + I L  G N + +LSM VGLQN G 
Sbjct: 498 LVVESKGHALHAFINKKLQVSATGNGSDITFKFKQAISLKAGKNEIALLSMTVGLQNAGP 557

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           +++  GAGL  V++    NG  DLSS  W Y++G++GE++G+ K     +  W      P
Sbjct: 558 FYEWVGAGLSKVVIEGFNNGPVDLSSYAWSYKIGLQGEHLGIYKPDGIKNVKWLSSREPP 617

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK     P G  P+ L++  MGKG AW+NG+ IGRYW    +    C +KCDY
Sbjct: 618 KQQPLTWYKVILDPPSGNEPVGLDMVHMGKGLAWLNGEEIGRYWPTKSSIHDVCVQKCDY 677

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
           RG +   KC   CG+P Q  YH+PR+W  P  N+LVI EE GGDP++I L  +    IC+
Sbjct: 678 RGKFRPDKCLTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTQIRLSKRKVLGICA 737

Query: 714 FVSEADPPPVDSWKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
            + E   P ++SW     V   S   V L C     IA I FAS+G P+G+CGS+  G C
Sbjct: 738 HLGEGH-PSIESWSEAENVERKSKATVDLKCPDNGRIAKIKFASFGTPQGSCGSYSIGDC 796

Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H  + + +V+K C+ + EC I +     G + G CP   K LAVEA CS
Sbjct: 797 HDPNSISLVEKVCLNRNECRIELGEE--GFNKGLCPTASKKLAVEAMCS 843


>gi|357483611|ref|XP_003612092.1| Beta-galactosidase [Medicago truncatula]
 gi|355513427|gb|AES95050.1| Beta-galactosidase [Medicago truncatula]
          Length = 843

 Score =  860 bits (2221), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/832 (52%), Positives = 551/832 (66%), Gaps = 31/832 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ++VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+VIETYVFWN HEP
Sbjct: 24  SDVTYDRKAIIINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR DLVRF++TV +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FR  N
Sbjct: 84  SPGNYNFEGRNDLVRFIQTVHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRQDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK+ M+ F  KI+ +MK E L+ SQGGPIIL+Q+ENEYG      G  G  Y+ WAA 
Sbjct: 144 EPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKMLGPVGYNYMSWAAK 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV + T VPW+MC+++DAPDP+INTCNGFYCD FTPN P KP MWTE +SGWF  FG  
Sbjct: 204 MAVEMGTGVPWIMCKEDDAPDPVINTCNGFYCDKFTPNKPYKPTMWTEAWSGWFSEFGGP 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +  RPV+DLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG I
Sbjct: 264 IHKRPVQDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLI 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELHKAIK+CE+ LIS+DP    LG   +A++Y   S DC+AFL+NYDS S 
Sbjct: 324 RQPKYGHLKELHKAIKMCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSS 383

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LP WSVSILPDC+N VFNTAKV            Q   +  L   S  F
Sbjct: 384 ARVMFNNMHYNLPPWSVSILPDCRNAVFNTAKV----------GVQTSQMQMLPTNSERF 433

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           SW  +EE    S   +     L EQIN T+DTSDYLWY  S+ V   +     GK   L 
Sbjct: 434 SWESFEEDTSSSSATTITASGLLEQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLI 493

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           ++S GHA  VF+N +L    YG  +   F     + L  G NT+ +LS+ VGL N G  F
Sbjct: 494 VQSTGHAVHVFINGRLSGSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHF 553

Query: 536 DVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLP 593
           +    G+   ++I  L  GK DLS  +W YQVG++GE + L      +S  W Q +  + 
Sbjct: 554 ETWNTGILGPVVIHGLDKGKLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQ 613

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
            N+ L W+KT F APEG+ PLAL++  MGKGQ W+NG SIGRYW+A    +TG    C+Y
Sbjct: 614 RNQPLTWHKTFFDAPEGEEPLALDMDGMGKGQIWINGISIGRYWTAI---ATGSCNDCNY 670

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            GS+   KCQ  CGQP Q  YH+PR+W+    NLLV+ EELGGDPSKISL  ++   +C+
Sbjct: 671 AGSFRPPKCQLGCGQPTQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVCA 730

Query: 714 FVSEADPP----PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
            VSE  P      +DS+  +       P+V L C  G  I++I FAS+G P G CGS+  
Sbjct: 731 DVSEYHPNLKNWHIDSYGKSENF--RPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQ 788

Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           GACH      I+++ C+G+  C + VS++  G     CP +LK L+VEA C+
Sbjct: 789 GACHSSSSYDILEQKCIGKPRCIVTVSNSNFG--RDPCPNVLKRLSVEAVCA 838


>gi|222624250|gb|EEE58382.1| hypothetical protein OsJ_09539 [Oryza sativa Japonica Group]
          Length = 851

 Score =  858 bits (2218), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/843 (51%), Positives = 558/843 (66%), Gaps = 48/843 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A+++DG+RR+L SGSIHYPRSTPE+W  LI K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87  GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ----------VENEYGNVEWAYGVGGE 174
           FK  M+ F  KI+ +MK ENLFASQGGPIIL+Q          +ENEYG     +G  G+
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206

Query: 175 LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSG 234
            Y+ WAA  AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SG
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDA 294
           WF  FG  +  RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326

Query: 295 PIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFL 354
           P+DEYG  R+PK+GHL+ELH+A+KLCE+ L+S+DPT   LG+  EAH++ +SS+ CAAFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFL 385

Query: 355 ANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE 414
           ANY+S+S A V FN   Y LP WS+SILPDCKNVVFNTA V  Q N           +  
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQM 435

Query: 415 LLLASSAFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--- 468
               +S+  W  Y+E+V  ++         L EQ+N T+DTSDYLWY  S+ V P +   
Sbjct: 436 WADGASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFL 495

Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
             G  + L ++S GHA  VF+N +L    YG  +      +    L  G N + +LS+  
Sbjct: 496 QGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVAC 555

Query: 527 GLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
           GL N G  ++    G+   ++I  L  G RDL+   W YQVG++GE + L+ +  + S  
Sbjct: 556 GLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVE 615

Query: 586 WKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
           W QGS +  N+  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   +
Sbjct: 616 WMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---A 672

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
            G  K C Y GSY A KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGD SKI+L 
Sbjct: 673 EGDCKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALA 732

Query: 705 TKTGQHICSFVSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYG 758
            +T   +C+ VSE   P + +W+      P       + +V L C  G  I+AI FAS+G
Sbjct: 733 KRTVSGVCADVSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFG 787

Query: 759 IPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
            P G CG+F+ G CH ++   +++K C+G   C + +S +  G     CP ++K +AVEA
Sbjct: 788 TPLGTCGTFQQGECHSINSNSVLEKKCIGLQRCVVAISPSNFG--GDPCPEVMKRVAVEA 845

Query: 818 HCS 820
            CS
Sbjct: 846 VCS 848


>gi|356540789|ref|XP_003538867.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
          Length = 853

 Score =  858 bits (2217), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/836 (51%), Positives = 552/836 (66%), Gaps = 32/836 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+VIETY+FWN H
Sbjct: 28  VHCSVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYIFWNVH 87

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP RG Y FEGR+DLVRFVKT+Q+AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 88  EPSRGNYNFEGRYDLVRFVKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 147

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+ F  KI+ +MK E L+ SQGGPIIL+Q+ENEYG      G  G+ YV WA
Sbjct: 148 DNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGPAGQNYVNWA 207

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV   T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP +WTE +SGWF  FG
Sbjct: 208 AKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFG 267

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 268 GPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 327

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELHKAIK+CE  L+S+DP    +G   +AH+Y   S DCAAFL+N+D+ 
Sbjct: 328 LIRQPKYGHLKELHKAIKMCERALVSADPAVTSMGNFQQAHVYTTKSGDCAAFLSNFDTK 387

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S   V FN   Y LP WS+SILPDC+NVVFNTAKV            Q   +  L   + 
Sbjct: 388 SSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQMQMLPTNTH 437

Query: 421 AFSW--YEEKVGISGNRSFV---RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
            FSW  ++E +    + S +      L EQIN T+DTSDYLWY  S+ +   +     GK
Sbjct: 438 MFSWESFDEDISSLDDGSAITITTSGLLEQINVTRDTSDYLWYITSVDIGSSESFLRGGK 497

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
              L ++S GHA  VF+N +L    YG  +   F     + L  G N + +LS+ VGL N
Sbjct: 498 LPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFRYTGTVNLRAGTNRIALLSVAVGLPN 557

Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
            G  F+    G+   V+L  L  GK DLS  +W YQVG++GE + L   +  +S  W Q 
Sbjct: 558 VGGHFETWNTGILGPVVLRGLNQGKLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQS 617

Query: 590 STLP-VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
           + +   N+ L W+KT F AP+G  PLAL++  MGKGQ W+NG SIGRYW+   AP+ G  
Sbjct: 618 ALVSEKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYWT---APAAGIC 674

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
             C Y G++   KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGDPSKISL+ ++ 
Sbjct: 675 NGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPNHNLLVVFEELGGDPSKISLVKRSV 734

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCG 765
             IC+ VSE   P + +W  +    S     P+V L C     I++I FAS+G P G CG
Sbjct: 735 SSICADVSEYH-PNIRNWHIDSYGKSEEFHPPKVHLHCSPSQAISSIKFASFGTPLGTCG 793

Query: 766 SFRPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           ++  G CH       ++K C+G+  C++ VS++  G     CP +LK L+VEA CS
Sbjct: 794 NYEKGVCHSPTSYATLEKKCIGKPRCTVTVSNSNFG--QDPCPNVLKRLSVEAVCS 847


>gi|115437888|ref|NP_001043405.1| Os01g0580200 [Oryza sativa Japonica Group]
 gi|75272679|sp|Q8W0A1.1|BGAL2_ORYSJ RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
           Precursor
 gi|18461259|dbj|BAB84455.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|113532936|dbj|BAF05319.1| Os01g0580200 [Oryza sativa Japonica Group]
 gi|215736924|dbj|BAG95853.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 827

 Score =  858 bits (2217), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/824 (51%), Positives = 549/824 (66%), Gaps = 34/824 (4%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYD +A+V++G+RR+L SGSIHYPRSTPE+WP+LI K+K+GGL+V++TYVFWN HEP  G
Sbjct: 27  TYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSPG 86

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYFEGR+DLV F+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 87  QYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 146

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K EM++F  KI+++MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ AV
Sbjct: 147 KAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 206

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            LNTSVPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP 
Sbjct: 207 ALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPH 266

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+P
Sbjct: 267 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 326

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHL++LHKAIKLCE  L++ DP    LG   ++ ++  S+  CAAFL N D  S A V
Sbjct: 327 KWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLENKDKVSYARV 386

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
            FNG  Y LP WS+SILPDCK  VFNTA+V SQ        +Q K     +  +  F+W 
Sbjct: 387 AFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK-----MEWAGGFAWQ 434

Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
            Y E++   G        L EQIN T+D +DYLWYT  + V   +     G+ + L + S
Sbjct: 435 SYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGENLKLTVMS 494

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GHA  +F+N +L    YG+ D         ++L  G NT+  LS+ VGL N G  F+  
Sbjct: 495 AGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETW 554

Query: 539 GAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
            AG+   + +D L  G+RDL+  +W YQVG++GE + L  +S +++  W +    PV K 
Sbjct: 555 NAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGE----PVQKQ 610

Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK  F AP+G  PLAL+++SMGKGQ W+NGQ IGRYW  Y A  +G    CDYRG 
Sbjct: 611 PLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA--SGNCGTCDYRGE 668

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           YD +KCQ +CG  +Q  YH+PR+W+ P  NLLVI EE GGDP+ IS++ ++   +C+ VS
Sbjct: 669 YDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVS 728

Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-D 775
           E   P + +W           +V L C+ G  I  I FAS+G P+G+CGS+  G CH   
Sbjct: 729 EWQ-PSMKNWHTK---DYEKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHK 784

Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
              I  K CVGQ  C + V     G     CPG +K   VEA C
Sbjct: 785 SYDIFWKNCVGQERCGVSVVPEIFG--GDPCPGTMKRAVVEAIC 826


>gi|84579373|dbj|BAE72075.1| pear beta-galactosidase3 [Pyrus communis]
          Length = 894

 Score =  858 bits (2216), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/856 (50%), Positives = 570/856 (66%), Gaps = 47/856 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+IDGKRR+L S  IHYPR+TPE+WP+LI KSKEGG++VI+TY FW+ HEP+
Sbjct: 35  NVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPV 94

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY FEGR+D+V+F   V  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 95  RGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 154

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            FKEEM+RF+ K++DLM++E L + QGGPII+ Q+ENEYGN+E  +G  G+ Y+KWAA+ 
Sbjct: 155 LFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMLQIENEYGNIEGQFGQKGKEYIKWAAEM 214

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMC+Q DAP  II+ CNG+YCDG+ PNS +KP MWTE++ GW+ S+G  +
Sbjct: 215 ALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPNSYNKPTMWTEDWDGWYASWGGRL 274

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP   TSYDYDAPIDEYG + 
Sbjct: 275 PHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSN-------------D 349
           +PKWGHL++LH AIKLCE  L+++D P + KLG K EAH+Y  +S+              
Sbjct: 335 EPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQEAHVYRMNSHTEGLNITSYGSQIS 394

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHP 405
           C+AFLAN D    A+VTF G  Y LP WSVSILPDC+NVV+NTAKV +Q +      D P
Sbjct: 395 CSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTVEFDLP 454

Query: 406 F-----AQQKNV--NELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
                 +QQ+ +  N+ L  + ++   +E VG+    +F    + E +N TKD SDYLW+
Sbjct: 455 LYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNFTVQGILEHLNVTKDQSDYLWH 514

Query: 459 TASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
              I V                ++I+S+     VFVN +L     G+       + + ++
Sbjct: 515 ITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQLTGSVIGHW----VKVEQPVK 570

Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEG 570
             +G N L +L+  VGLQNYGA+ +  GAG    I L   KNG  D S   W YQVG++G
Sbjct: 571 FLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTGFKNGDIDFSKLLWTYQVGLKG 630

Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
           E++ +  I     + W + S      + IWYKT F +P G  P+AL+L SMGKGQAWVNG
Sbjct: 631 EFLKIYTIEENEKASWAELSPDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNG 690

Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
             IGRYW+  +AP  GC + CDYRG+YD+ KC  +CG+P QTLYH+PR+W+    NLLVI
Sbjct: 691 HHIGRYWT-LVAPEDGCPEICDYRGAYDSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVI 749

Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACE 744
            EE GG+P  IS+  ++   +C+ VSE+  PPV  W         + V   +P++ L C+
Sbjct: 750 LEETGGNPFDISIKLRSAGVLCAQVSESHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQ 809

Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
            G+ I++I FASYG P+G+C  F  G CH  +   IV K+C+G+  CS+ +S+   G   
Sbjct: 810 DGFTISSIEFASYGTPQGSCQKFSMGNCHATNSSSIVSKSCLGKNSCSVEISNISFG--G 867

Query: 804 GACPGLLKALAVEAHC 819
             C G++K LAVEA C
Sbjct: 868 DPCRGVVKTLAVEARC 883


>gi|414881557|tpg|DAA58688.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
          Length = 830

 Score =  857 bits (2215), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/828 (51%), Positives = 552/828 (66%), Gaps = 42/828 (5%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R 
Sbjct: 30  TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90  QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K EM+ F  KI+D+MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP 
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+P
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREP 329

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHL+ELHKAIKLCE  L++ DP    LG   +A ++  S++ C AFL N D  S A V
Sbjct: 330 KWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARV 389

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
           +FNG  Y LP WS+SILPDCK  V+NTA V SQ        +Q K     +  +  F+W 
Sbjct: 390 SFNGMHYDLPPWSISILPDCKTTVYNTASVGSQ-------ISQMK-----MEWAGGFTWQ 437

Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
            Y E +   G+ SF    L EQIN T+D +DYLWYT  + +   +     GK   L + S
Sbjct: 438 SYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTTYVDIAQDEQFLSNGKNPMLTVMS 497

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GHA  +FVN +L    YG+ +      +  ++L  G NT+  LS+ VGL N G  F+  
Sbjct: 498 AGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSGSNTISCLSIAVGLPNVGEHFETW 557

Query: 539 GAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
            AG+   + +D L  G+RDL+  +W Y+VG++GE + L  +S ++S  W +    PV K 
Sbjct: 558 NAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEALSLHSLSGSSSVEWGE----PVQKQ 613

Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK  F AP+G  PLAL+++SMGKGQ W+NGQ IGRYW  Y A  +G    CDYRG 
Sbjct: 614 PLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA--SGTCGICDYRGE 671

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           YD  KCQ +CG  +Q  YH+PR+W++P  NLLVI EE GGDP+ IS++ +    IC+ VS
Sbjct: 672 YDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRIAGSICADVS 731

Query: 717 EADPPPVDSWKPNLGVVSS----SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           E        W+P++    +      +V L C+ G  +  I FAS+G P+G+CGS+  G C
Sbjct: 732 E--------WQPSMANWRTKGYEKAKVHLQCDHGRKMTHIKFASFGTPQGSCGSYSEGGC 783

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           H      I  K+C+GQ  C + V     G     CPG +K   VEA C
Sbjct: 784 HAHKSYDIFWKSCIGQERCGVSVVPDAFG--GDPCPGTMKRAVVEAIC 829


>gi|357113908|ref|XP_003558743.1| PREDICTED: beta-galactosidase 5-like [Brachypodium distachyon]
          Length = 839

 Score =  857 bits (2215), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/828 (51%), Positives = 551/828 (66%), Gaps = 30/828 (3%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A++IDG+RR+L SGSIHYPRSTPE+W  L +K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDKKAVLIDGQRRILFSGSIHYPRSTPEMWEGLFQKAKDGGLDVIQTYVFWNGHEPTP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLV+F+KT Q+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87  GNYNFEGRYDLVKFIKTAQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+ +MK E LFASQGGPIIL+Q+ENEYG    ++G  G+ Y  WAA  A
Sbjct: 147 FKTAMQGFTEKIVGMMKSEELFASQGGPIILSQIENEYGPEGKSFGAAGKSYSNWAAKMA 206

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V L+T VPWVMC+Q+DAPDP+IN CNGFYCD F+PN P KP MWTE ++GWF  FG  + 
Sbjct: 207 VGLDTGVPWVMCKQDDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWTGWFTEFGGTIR 266

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDL+FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG  R+
Sbjct: 267 KRPVEDLSFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL+ELH+A+KLCE  L+S DP    LG+  EAH++ +S + CAAFLANY+S+S AN
Sbjct: 327 PKYGHLKELHRAVKLCEPALVSVDPAVTTLGSMQEAHVF-RSPSSCAAFLANYNSNSHAN 385

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V FN   Y LP WS+SILPDCK VVFNTA V            Q   +       S+  W
Sbjct: 386 VVFNNEHYSLPPWSISILPDCKTVVFNTATV----------GVQTSQMQMWADGESSMMW 435

Query: 425 --YEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E+VG ++         L EQ+N T+D+SDYLWY  S+ V P +     G+ + L +
Sbjct: 436 ERYDEEVGSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDVSPSEKFLQGGEPLSLTV 495

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  +F+N +L     G  +   F       L  G N + +LS+  GL N G  ++
Sbjct: 496 QSAGHALHIFINGQLQGSASGTREAKKFSYKGNANLRAGTNKIALLSIACGLPNVGVHYE 555

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   V+L  L  G RDL+   W YQVG++GE + L+ +  A+S  W QGS L   
Sbjct: 556 TWNTGIVGPVVLHGLDVGSRDLTWQTWSYQVGLKGEQMNLNSLEGASSVEWMQGSLL-AQ 614

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
             L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRY ++Y   ++G  K C Y G
Sbjct: 615 APLSWYRAYFDTPTGDEPLALDMGSMGKGQIWINGQSIGRYSTSY---ASGDCKACSYAG 671

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           SY A KCQ  CGQP Q  YH+P++W+ P  NLLV+ EELGGD SKISL+ ++   +C+ V
Sbjct: 672 SYRAPKCQAGCGQPTQRWYHVPKSWLQPSRNLLVVFEELGGDSSKISLVKRSVSSVCADV 731

Query: 716 SEADPPPVDSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           SE     + +W+  N G V    P+V L C  G  I+AI FAS+G P G CG+F+ G CH
Sbjct: 732 SEYH-TNIKNWQIENAGEVEFHRPKVHLRCAPGQTISAIKFASFGTPLGTCGNFQQGDCH 790

Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                 +++K C+GQ  C++ +S    G     CP  +K +AVEA CS
Sbjct: 791 STKSHAVLEKNCIGQQRCAVTISPDNFG--GDPCPKEMKKVAVEAVCS 836


>gi|218192153|gb|EEC74580.1| hypothetical protein OsI_10152 [Oryza sativa Indica Group]
          Length = 851

 Score =  857 bits (2215), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/843 (51%), Positives = 558/843 (66%), Gaps = 48/843 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A+++DG+RR+L SGSIHYPRSTPE+W  LI K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87  GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ----------VENEYGNVEWAYGVGGE 174
           FK  M+ F  KI+ +MK ENLFASQGGPIIL+Q          +ENEYG     +G  G+
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQASAKLCFPCHIENEYGPEGKEFGAAGK 206

Query: 175 LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSG 234
            Y+ WAA  AV L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SG
Sbjct: 207 AYINWAAKMAVGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSG 266

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDA 294
           WF  FG  +  RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDA
Sbjct: 267 WFTEFGGTIRQRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDA 326

Query: 295 PIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFL 354
           P+DEYG  R+PK+GHL+ELH+A+KLCE+ L+S+DPT   LG+  EAH++ +SS+ CAAFL
Sbjct: 327 PLDEYGLAREPKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFL 385

Query: 355 ANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE 414
           ANY+S+S A V FN   Y LP WS+SILPDCKNVVFNTA V  Q N           +  
Sbjct: 386 ANYNSNSYAKVIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQM 435

Query: 415 LLLASSAFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--- 468
               +S+  W  Y+E+V  ++         L EQ+N T+DTSDYLWY  S+ V P +   
Sbjct: 436 WADGASSMMWEKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFL 495

Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
             G  + L ++S GHA  VF+N +L    YG  +      +    L  G N + +LS+  
Sbjct: 496 QGGTPLSLTVQSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVAC 555

Query: 527 GLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
           GL N G  ++    G+   ++I  L  G RDL+   W YQVG++GE + L+ +  + S  
Sbjct: 556 GLPNVGVHYETWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVE 615

Query: 586 WKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
           W QGS +  N+  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   +
Sbjct: 616 WMQGSLVAQNQQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---A 672

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
            G  K C Y GSY A KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGD SKI+L 
Sbjct: 673 EGDCKGCHYTGSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALA 732

Query: 705 TKTGQHICSFVSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYG 758
            +T   +C+ VSE   P + +W+      P       + +V L C  G  I+AI FAS+G
Sbjct: 733 KRTVSGVCADVSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFG 787

Query: 759 IPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
            P G CG+F+ G CH ++   ++++ C+G   C + +S +  G     CP ++K +AVEA
Sbjct: 788 TPLGTCGTFQQGECHSINSNSVLERKCIGLERCVVAISPSNFG--GDPCPEVMKRVAVEA 845

Query: 818 HCS 820
            CS
Sbjct: 846 VCS 848


>gi|255554022|ref|XP_002518051.1| beta-galactosidase, putative [Ricinus communis]
 gi|223542647|gb|EEF44184.1| beta-galactosidase, putative [Ricinus communis]
          Length = 897

 Score =  857 bits (2214), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 429/858 (50%), Positives = 564/858 (65%), Gaps = 48/858 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+IDG RR+L SG IHYPR+TP++WP+LI KSKEGG++VI+TYVFWN HEP+
Sbjct: 39  NVSYDHRALIIDGHRRMLISGGIHYPRATPQMWPDLIAKSKEGGVDVIQTYVFWNGHEPV 98

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEG++DLV+FVK V  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGI FRT N+
Sbjct: 99  KGQYIFEGQYDLVKFVKLVGVSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIVFRTDNS 158

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PF EEM++F+ KI+DLM++E LF+ QGGPII+ Q+ENEYGN+E ++G GG+ YVKWAA  
Sbjct: 159 PFMEEMQQFVKKIVDLMREEMLFSWQGGPIIMLQIENEYGNIEHSFGPGGKEYVKWAARM 218

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMC+Q DAP  II+ CN +YCDG+ PNS  KPI+WTE++ GW+ ++G ++
Sbjct: 219 ALGLGAGVPWVMCRQTDAPGSIIDACNEYYCDGYKPNSNKKPILWTEDWDGWYTTWGGSL 278

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAFAVARFF+ GG+FQNYYMYFGGTNF RTAGGP   TSYDYDAPIDEYG + 
Sbjct: 279 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFARTAGGPFYITSYDYDAPIDEYGLLS 338

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQ-KLGAKLEAHIY-------------HKSSND 349
           +PKWGHL++LH AIKLCE  L+++D     KLG+K EAH+Y             H S + 
Sbjct: 339 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGSKQEAHVYRANVHAEGQNLTQHGSQSK 398

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA-- 407
           C+AFLAN D      V F G  Y LP WSVS+LPDC+N VFNTAKV +Q +      A  
Sbjct: 399 CSAFLANIDEHKAVTVRFLGQSYTLPPWSVSVLPDCRNAVFNTAKVAAQTSIKSMELALP 458

Query: 408 ---------QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
                    Q    NE    SS++   +E + +    +F    + E +N TKD SDYLWY
Sbjct: 459 QFSGISAPKQLMAQNEGSYMSSSWMTVKEPISVWSGNNFTVEGILEHLNVTKDHSDYLWY 518

Query: 459 TASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
              I+V                + I+S+     VF+N +L     G        + + ++
Sbjct: 519 FTRIYVSDDDIAFWEENNVHPAIKIDSMRDVLRVFINGQLTGSVIGRW----IKVVQPVQ 574

Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEG 570
             +G N L +LS  VGLQNYGA+ +  GAG      L   ++G  DLS+ EW YQVG++G
Sbjct: 575 FQKGYNELVLLSQTVGLQNYGAFLERDGAGFRGHTKLTGFRDGDIDLSNLEWTYQVGLQG 634

Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
           E   +        + W   +   +  +  WYKT F AP G  P+AL+L SMGKGQAWVN 
Sbjct: 635 ENQKIYTTENNEKAEWTDLTLDDIPSTFTWYKTYFDAPSGADPVALDLGSMGKGQAWVND 694

Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
             IGRYW+  +AP  GC +KCDYRG+Y++ KC+ +CG+P Q  YHIPR+W+ P  NLLVI
Sbjct: 695 HHIGRYWT-LVAPEEGC-QKCDYRGAYNSEKCRTNCGKPTQIWYHIPRSWLQPSNNLLVI 752

Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACE 744
            EE GG+P +IS+  ++   +C+ VSE   PP+  W        N+     +P+++L C+
Sbjct: 753 FEETGGNPFEISIKLRSASVVCAQVSETHYPPLQRWIHTDFIYGNVSGKDMTPEIQLRCQ 812

Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSA 803
            G+ I++I FASYG P+G+C  F  G CH  + L +V KAC G+  C+I +S+A  G   
Sbjct: 813 DGYVISSIEFASYGTPQGSCQKFSRGNCHAPNSLSVVSKACQGRDTCNIAISNAVFG--G 870

Query: 804 GACPGLLKALAVEAHCSI 821
             C G++K LAVEA CS+
Sbjct: 871 DPCRGIVKTLAVEAKCSL 888


>gi|242036825|ref|XP_002465807.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
 gi|241919661|gb|EER92805.1| hypothetical protein SORBIDRAFT_01g046160 [Sorghum bicolor]
          Length = 842

 Score =  856 bits (2211), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/829 (51%), Positives = 551/829 (66%), Gaps = 31/829 (3%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYD +A++IDG+RR+L SGSIHYPRSTP++W  LI+K+K+GGL+VI+TYVFWN HEP  G
Sbjct: 28  TYDKKAVLIDGQRRILFSGSIHYPRSTPDMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPG 87

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
            YYFE R+DLVRF+KTVQ+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N PF
Sbjct: 88  NYYFEERYDLVRFIKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPF 147

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K  M+ F  KI+ +MK E LFASQGGPIIL+Q+ENEYG      G  G+ Y+ WAA  A+
Sbjct: 148 KTAMQGFTEKIVGMMKSEKLFASQGGPIILSQIENEYGPEGKELGAAGQAYINWAAKMAI 207

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            L T VPWVMC++EDAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF  FG  +  
Sbjct: 208 GLGTGVPWVMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQ 267

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +R+P
Sbjct: 268 RPVEDLAFAVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREP 327

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           K  HL+ELH+A+KLCE+ L+S DP    LG   EAH++ +S + CAAFLANY+S+S A V
Sbjct: 328 KHSHLKELHRAVKLCEQALVSVDPAITTLGTMQEAHVF-RSPSGCAAFLANYNSNSYAKV 386

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
            FN   Y LP WS+SILPDCKNVVFN+A V            Q   +      +S+  W 
Sbjct: 387 VFNNEQYSLPPWSISILPDCKNVVFNSATV----------GVQTSQMQMWGDGASSMMWE 436

Query: 425 -YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP------GQGKEVFLNI 476
            Y+E+V  ++         L EQ+N T+D+SDYLWY  S+ + P      G GK + L++
Sbjct: 437 RYDEEVDSLAAAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPLSLSV 496

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VFVN +L    YG  +      N    L  G N + +LS+  GL N G  ++
Sbjct: 497 LSAGHALHVFVNGELQGSAYGTREDRRIKYNGNANLRAGTNKIALLSVACGLPNVGVHYE 556

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   V L  L  G RDL+   W YQVG++GE + L+ +  + S  W QGS +  N
Sbjct: 557 TWNTGVGGPVGLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSTSVEWMQGSLIAQN 616

Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           +  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   + G  K+C Y 
Sbjct: 617 QQPLSWYRAYFETPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---ADGDCKECSYT 673

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G++ A KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGD SKI+L+ ++   +C+ 
Sbjct: 674 GTFRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALVKRSVSSVCAD 733

Query: 715 VSEADPPPVDSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           VSE D P + +W+  + G       +V L C  G  I+AI FAS+G P G CG+F+ G C
Sbjct: 734 VSE-DHPNIKNWQIESYGEREYHRAKVHLRCSPGQSISAIKFASFGTPMGTCGNFQQGDC 792

Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           H  +   +++K C+G   C++ +S    G     CP + K +AVEA CS
Sbjct: 793 HSANSHTVLEKKCIGLQRCAVAISPESFG--GDPCPRVTKRVAVEAVCS 839


>gi|255538780|ref|XP_002510455.1| beta-galactosidase, putative [Ricinus communis]
 gi|223551156|gb|EEF52642.1| beta-galactosidase, putative [Ricinus communis]
          Length = 846

 Score =  855 bits (2210), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 426/836 (50%), Positives = 556/836 (66%), Gaps = 35/836 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFW+ H
Sbjct: 24  VQCTVTYDKKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWDVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           E   G Y F+GR+DLVRF+KTVQ+ GL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84  ETSPGNYNFDGRYDLVRFIKTVQKVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG    A G  G  Y+ WA
Sbjct: 144 DNEPFKAAMQGFTQKIVQMMKNENLFASQGGPIILSQIENEYGPESRALGAAGRSYINWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP +WTE +SGWF  FG
Sbjct: 204 AKMAVGLDTGVPWVMCKEDDAPDPMINTCNGFYCDAFAPNKPYKPTLWTEAWSGWFTEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPVEDLAFAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GPIHQRPVEDLAFAVARFIQKGGSYFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IR+PK+GHL+ LHKAIKLCE  L+SSDP+   LG   +AH++  S   CAAFLANY++ 
Sbjct: 324 LIREPKYGHLKALHKAIKLCEHALVSSDPSITSLGTYQQAHVF-SSGRSCAAFLANYNAK 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V FN   Y LP WS+SILPDC+NVVFNTA+V           AQ   +  L   S 
Sbjct: 383 SAARVMFNNMHYDLPPWSISILPDCRNVVFNTARV----------GAQTLRMQMLPTGSE 432

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            FSW  Y+E++  ++ +       L EQIN T+DTSDYLWY  S+ + P +     G++ 
Sbjct: 433 LFSWETYDEEISSLTDSSRITALGLLEQINVTRDTSDYLWYLTSVDISPSEAFLRNGQKP 492

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GH   VF+N +     +G  +         + L  G N + +LS+ VGL N G
Sbjct: 493 SLTVQSAGHGLHVFINGQFSGSAFGTRENRQLTFTGPVNLRAGTNRIALLSIAVGLPNVG 552

Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             ++    G+   +L++ L  GK+DL+  +W YQVG++GE + L   +  +S  W +GS 
Sbjct: 553 LHYETWKTGVQGPVLLNGLNQGKKDLTWQKWSYQVGLKGEAMNLVSPNGVSSVDWIEGSL 612

Query: 592 LPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                ++L W+K  F AP G  PLAL++ SMGKGQ W+NGQSIGRYW AY   + G    
Sbjct: 613 ASSQGQALKWHKAYFDAPRGNEPLALDMRSMGKGQVWINGQSIGRYWMAY---AKGDCNS 669

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           C Y  ++  SKCQ  CG+P Q  YH+PR+W+ P +NLLV+ EELGGD SKISL+ ++ + 
Sbjct: 670 CSYIWTFRPSKCQLGCGEPTQRWYHVPRSWLKPTKNLLVVFEELGGDASKISLVKRSIEG 729

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCG 765
           +C+   E  P   +    N G    S      ++ L C  G  IAAI FAS+G P G CG
Sbjct: 730 VCADAYEHHPATKNY---NTGGNDESSKLHQAKIHLRCAPGQFIAAIKFASFGTPSGTCG 786

Query: 766 SFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           SF+ G CH  +   +++K C+GQ  C + +S++  G  A  CP +LK L+VEA CS
Sbjct: 787 SFQQGTCHAPNTHSVIEKKCIGQESCMVTISNSNFG--ADPCPNVLKKLSVEAVCS 840


>gi|332105893|gb|AEE01408.1| beta-galactosidase STBG2 [Solanum lycopersicum]
          Length = 892

 Score =  855 bits (2208), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/854 (50%), Positives = 570/854 (66%), Gaps = 45/854 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD+RAL+I GKRR+L S  IHYPR+TPE+WP LI +SKEGG +VIETY FWN HEP 
Sbjct: 36  NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY FEGR+D+V+F K V   GLFL +RIGPYACAEWN+GGFP+WL  IPGI+FRT N 
Sbjct: 96  RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFKEEM+R++ KI+DLM  E+LF+ QGGPIIL Q+ENEYGNVE  +G  G+LY+KWAA+ 
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESTFGPKGKLYMKWAAEM 215

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV L   VPWVMC+Q DAP+ II+TCN +YCDGFTPNS  KP +WTEN++GWF  +G  +
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGERL 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+RP ED+AFA+ARFF+ GG+ QNYYMYFGGTNFGRTAGGP   TSYDYDAP+DEYG +R
Sbjct: 276 PYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGLLR 335

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND-----------CA 351
           QPKWGHL++LH AIKLCE  L+++D P + KLG K EAH+Y  +SN+           CA
Sbjct: 336 QPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGICA 395

Query: 352 AFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----------- 400
           AF+AN D    A V F G  + LP WSVSILPDC+N  FNTAKV +Q +           
Sbjct: 396 AFIANIDEHESATVKFYGQEFTLPPWSVSILPDCRNTAFNTAKVGAQTSIKTVGSDSVSV 455

Query: 401 NGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
             +  F Q    ++L   S ++   +E +G+ G+++F    + E +N TKD SDYLWY  
Sbjct: 456 GNNSLFLQVITKSKLESFSQSWMTLKEPLGVWGDKNFTSKGILEHLNVTKDQSDYLWYLT 515

Query: 461 SIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN 513
            I++        +  +V   ++I+S+     +FVN +L     G        + + ++L 
Sbjct: 516 RIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----IKVVQPVKLV 571

Query: 514 EGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEY 572
           +G N + +LS  VGLQNYGA+ +  GAG    I L   K+G  +L++  W YQVG+ GE+
Sbjct: 572 QGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWTYQVGLRGEF 631

Query: 573 IGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQS 632
           + +  ++   S+ W +  T        WYKT F AP G  P+AL+ +SMGKGQAWVNG  
Sbjct: 632 LEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGKGQAWVNGHH 691

Query: 633 IGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
           +GRYW+  +AP+ GC + CDYRG+Y + KC+ +CG+  Q  YHIPR+W+    N+LVI E
Sbjct: 692 VGRYWT-LVAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKTLNNVLVIFE 750

Query: 693 ELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN-----LGVVSSSPQVRLACERGW 747
           E+   P  IS+ T++ + IC+ VSE   PP+  W  +     L ++  +P++ L C+ G 
Sbjct: 751 EIDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRKLSLMDKTPEMHLQCDEGH 810

Query: 748 HIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGAC 806
            I++I FASYG P G+C  F  G CH  + L +V +AC+G+  CSI +S+   GV    C
Sbjct: 811 TISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVSQACIGRTSCSIGISN---GVFGDPC 867

Query: 807 PGLLKALAVEAHCS 820
             ++K+LAV+A CS
Sbjct: 868 RHVVKSLAVQAKCS 881


>gi|61162194|dbj|BAD91079.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 903

 Score =  854 bits (2206), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/856 (50%), Positives = 569/856 (66%), Gaps = 46/856 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+IDGKRR+L S  IHYPR+TPE+WP+LI KSKEGG++VI+TY FW+ HEP+
Sbjct: 35  NVSYDHRALIIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGVDVIQTYAFWSGHEPV 94

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY FEGR+D+V+F   V  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 95  RGQYNFEGRYDIVKFANLVGASGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTNNA 154

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            FKEEM+RF+ K++DLM++E L + QGGPII+ Q+ENEYGN+E  +G  G+ Y+KWAA+ 
Sbjct: 155 LFKEEMQRFVKKMVDLMQEEELLSWQGGPIIMMQIENEYGNIEGQFGQKGKEYIKWAAEM 214

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMC+Q DAP  II+ CNG+YCDG+ PNS +KP +WTE++ GW+ S+G  +
Sbjct: 215 ALGLGAGVPWVMCKQVDAPGSIIDACNGYYCDGYKPNSYNKPTLWTEDWDGWYASWGGRL 274

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP   TSYDYDAPIDEYG + 
Sbjct: 275 PHRPVEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSN-------------D 349
           +PKWGHL++LH AIKLCE  L+++D P + KLG K EAH+Y  +S+              
Sbjct: 335 EPKWGHLKDLHAAIKLCEPALVAADSPNYIKLGPKQEAHVYRVNSHTEGLNITSYGSQIS 394

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHP 405
           C+AFLAN D    A+VTF G  Y LP WSVSILPDC+NVV+NTAKV +Q +      D P
Sbjct: 395 CSAFLANIDEHKAASVTFLGQKYNLPPWSVSILPDCRNVVYNTAKVGAQTSIKTVEFDLP 454

Query: 406 F-----AQQKNV--NELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
                 +QQ+ +  N+ L  + ++   +E VG+    +F    + E +N TKD SDYLW+
Sbjct: 455 LYSGISSQQQFITKNDDLFITKSWMTVKEPVGVWSENNFTVQGILEHLNVTKDQSDYLWH 514

Query: 459 TASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
              I V                ++I+S+     VFVN +L       H      + + ++
Sbjct: 515 ITRIFVSEDDISFWEKNNISAAVSIDSMRDVLRVFVNGQLTEGSVIGHWVK---VEQPVK 571

Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEG 570
             +G N L +L+  VGLQNYGA+ +  GAG    I L   KNG  DLS   W YQVG++G
Sbjct: 572 FLKGYNDLVLLTQTVGLQNYGAFLEKDGAGFRGQIKLTGFKNGDIDLSKLLWTYQVGLKG 631

Query: 571 EYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNG 630
           E+  +  I     + W + S      + IWYKT F +P G  P+AL+L SMGKGQAWVNG
Sbjct: 632 EFFKIYTIEENEKAGWAELSPDDDPSTFIWYKTYFDSPAGTDPVALDLGSMGKGQAWVNG 691

Query: 631 QSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI 690
             IGRYW+  +AP  GC + CDYRG+Y++ KC  +CG+P QTLYH+PR+W+    NLLVI
Sbjct: 692 HHIGRYWT-LVAPEDGCPEICDYRGAYNSDKCSFNCGKPTQTLYHVPRSWLQSSSNLLVI 750

Query: 691 HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACE 744
            EE GG+P  IS+  ++   +C+ VSE+  PPV  W         + V   +P++ L C+
Sbjct: 751 LEETGGNPFDISIKLRSAGVLCAQVSESHYPPVQKWFNPDSVDEKITVNDLTPEMHLQCQ 810

Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
            G+ I++I FASYG P+G+C  F  G CH  +   IV K+C+G+  CS+ +S+   G   
Sbjct: 811 DGFTISSIEFASYGTPQGSCQKFSMGNCHATNSSSIVSKSCLGKNSCSVEISNNSFG--G 868

Query: 804 GACPGLLKALAVEAHC 819
             C G++K LAVEA C
Sbjct: 869 DPCRGIVKTLAVEARC 884


>gi|356496697|ref|XP_003517202.1| PREDICTED: beta-galactosidase 3-like [Glycine max]
          Length = 849

 Score =  854 bits (2206), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/836 (51%), Positives = 551/836 (65%), Gaps = 32/836 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+RR+L SGSIHYPRSTP++W +LI K+KEGGL+VIETYVFWN H
Sbjct: 28  VHCSVTYDRKAILINGQRRILFSGSIHYPRSTPDMWEDLIYKAKEGGLDVIETYVFWNVH 87

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP RG Y FEGR+DLVRFVKT+Q+AGL+ +LRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 88  EPSRGNYNFEGRYDLVRFVKTIQKAGLYANLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 147

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+ F  KI+ +MK E L+ SQGGPIIL+Q+ENEYG      G  G+ YV WA
Sbjct: 148 DNEPFKKAMQGFTEKIVGMMKSERLYESQGGPIILSQIENEYGAQSKLLGSAGQNYVNWA 207

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV   T VPWVMC+++DAPDP+INTCNGFYCD FTPN P KP +WTE +SGWF  FG
Sbjct: 208 AKMAVETGTGVPWVMCKEDDAPDPVINTCNGFYCDYFTPNKPYKPSIWTEAWSGWFSEFG 267

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 268 GPNHERPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYG 327

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELHKAIK+CE  L+S+DP    LG   +AH+Y   S DCAAFL+N+D+ 
Sbjct: 328 LIRQPKYGHLKELHKAIKMCERALVSTDPAVTSLGNFQQAHVYSAKSGDCAAFLSNFDTK 387

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S   V FN   Y LP WS+SILPDC+NVVFNTAKV            Q   +  L   + 
Sbjct: 388 SSVRVMFNNMHYNLPPWSISILPDCRNVVFNTAKV----------GVQTSQMQMLPTNTR 437

Query: 421 AFSW--YEEKVGISGNRSFVRPD---LAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
            FSW  ++E +    + S +      L EQIN T+DTSDYLWY  S+ +   +     GK
Sbjct: 438 MFSWESFDEDISSLDDGSSITTTTSGLLEQINVTRDTSDYLWYITSVDIGSSESFLRGGK 497

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
              L ++S GHA  VF+N +L    YG  +   F     + L  G N + +LS+ VGL N
Sbjct: 498 LPTLIVQSTGHAVHVFINGQLSGSAYGTREDRRFTYTGTVNLRAGTNRIALLSVAVGLPN 557

Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
            G  F+    G+   V+L     GK DLS  +W YQVG++GE + L   +  +S  W Q 
Sbjct: 558 VGGHFETWNTGILGPVVLRGFDQGKLDLSWQKWTYQVGLKGEAMNLASPNGISSVEWMQS 617

Query: 590 STLP-VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
           + +   N+ L W+KT F AP+G  PLAL++  MGKGQ W+NG SIGRYW+A  A   G  
Sbjct: 618 ALVSDKNQPLTWHKTYFDAPDGDEPLALDMEGMGKGQIWINGLSIGRYWTALAA---GNC 674

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
             C Y G++   KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGDPSKISL+ ++ 
Sbjct: 675 NGCSYAGTFRPPKCQVGCGQPTQRWYHVPRSWLKPDHNLLVVFEELGGDPSKISLVKRSV 734

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCG 765
             +C+ VSE   P + +W  +    S     P+V L C  G  I++I FAS+G P G CG
Sbjct: 735 SSVCADVSEYH-PNIRNWHIDSYGKSEEFHPPKVHLHCSPGQTISSIKFASFGTPLGTCG 793

Query: 766 SFRPGACHMDVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           ++  G CH       ++K C+G+  C++ VS++  G     CP +LK L+VEA C+
Sbjct: 794 NYEKGVCHSSTSHATLEKKCIGKPRCTVTVSNSNFG--QDPCPNVLKRLSVEAVCA 847


>gi|61162208|dbj|BAD91085.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 848

 Score =  853 bits (2205), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 421/828 (50%), Positives = 551/828 (66%), Gaps = 24/828 (2%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV YD +ALVIDG+RR+L SGSIHYPRSTPE+W  LI+K+K+GGL+ I+TYVFWN HEP 
Sbjct: 30  NVVYDRKALVIDGQRRLLFSGSIHYPRSTPEMWEGLIQKAKDGGLDAIDTYVFWNLHEPS 89

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G Y FEGR DLVRF+KTV +AGL++HLRIGPY C+EWN+GGFPVWL F+PGI FRT N 
Sbjct: 90  PGNYNFEGRNDLVRFIKTVHKAGLYVHLRIGPYICSEWNFGGFPVWLKFVPGISFRTDNE 149

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M++F  K++ LMK E LF SQGGPIIL+Q+ENEY     A+G  G  Y+ WAA  
Sbjct: 150 PFKSAMQKFTQKVVQLMKNEKLFESQGGPIILSQIENEYEPESKAFGASGYAYMTWAAKM 209

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV + T VPWVMC+++DAPDP+INTCNGFYCD F+PN P KP MWTE +SGWF  FG  +
Sbjct: 210 AVGMGTGVPWVMCKEDDAPDPVINTCNGFYCDYFSPNKPYKPTMWTEAWSGWFTEFGGPI 269

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             RPVEDL FAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR
Sbjct: 270 YQRPVEDLTFAVARFIQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 329

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           +PK+GHL+ELHKA+KLCE  L+++DPT   LG+  +AH++   S   A FL+N+++ S  
Sbjct: 330 RPKYGHLKELHKAVKLCELALLNADPTVTTLGSYEQAHVFSSKSGSGAVFLSNFNTKSAT 389

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            VTFN   + LP WS+SILPDCKNV FNTA+V  Q +       Q    N  L +   F+
Sbjct: 390 KVTFNNMNFHLPPWSISILPDCKNVAFNTARVGVQTSQ-----TQLLRTNSELHSWGIFN 444

Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP-----GQGKEVFLNIES 478
             E+   ++G+ +     L +Q+N T+D+SDYLWYT S+ + P     G G+   L ++S
Sbjct: 445 --EDVSSVAGDTTITVTGLLDQLNITRDSSDYLWYTTSVDIDPSESFLGGGQHPSLTVQS 502

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            G A  VF+N +L     G  +   F     + L+ G+N + +LS+ VGL N G  F+  
Sbjct: 503 AGDAMHVFINDQLSGSASGTREHRRFTFTGNVNLHAGLNKISLLSIAVGLANNGPHFETR 562

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             G+   V L  L +G RDLS  +W YQVG++GE   LD  +  ++  W  GS +   + 
Sbjct: 563 NTGVLGPVALHGLDHGTRDLSWQKWSYQVGLKGEATNLDSPNSISAVDWMTGSLVAQKQQ 622

Query: 598 -LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+ Y    + C+  C Y G+
Sbjct: 623 PLTWYKAYFDEPNGDEPLALDMGSMGKGQVWINGQSIGRYWTIY--ADSDCS-ACTYSGT 679

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           +   KCQ  C  P Q  YH+PR+W+ P +NLLV+ EE+GGD SK++L+ K+   +C+ VS
Sbjct: 680 FRPKKCQFGCQHPTQQWYHVPRSWLKPSKNLLVVFEEIGGDVSKVALVKKSVTSVCAEVS 739

Query: 717 EADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           E + P + +W         V   P++ L C  G  I+AI F+S+G P G+CG F+ G CH
Sbjct: 740 E-NHPRITNWHTESHGQTEVQQKPEISLHCTDGHSISAIKFSSFGTPSGSCGKFQHGTCH 798

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             +   ++QK C+G+ +CS+ +S+   G  A  CP  LK L+VEA CS
Sbjct: 799 APNSNAVLQKECLGKQKCSVTISNTNFG--ADPCPSKLKKLSVEAVCS 844


>gi|2961390|emb|CAA18137.1| beta-galactosidase like protein [Arabidopsis thaliana]
          Length = 853

 Score =  853 bits (2205), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/845 (51%), Positives = 549/845 (64%), Gaps = 55/845 (6%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +AL+I+G+RR+L SGSIHYPRSTP++W +LI+K+K+GG++VIETYVFWN H
Sbjct: 29  VQCGVTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGIDVIETYVFWNLH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR DLVRFVKT+ +AGL+ HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 89  EPSPGKYDFEGRNDLVRFVKTIHKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK F  +I++LMK ENLF SQGGPIIL+Q+ENEYG      G  G  Y+ WA
Sbjct: 149 DNEPFKRAMKGFTERIVELMKSENLFESQGGPIILSQIENEYGRQGQLLGAEGHNYMTWA 208

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+   T VPWVMC+++DAPDP+INTCNGFYCD F PN P KP++WTE +SGWF  FG
Sbjct: 209 AKMAIATETGVPWVMCKEDDAPDPVINTCNGFYCDSFAPNKPYKPLIWTEAWSGWFTEFG 268

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG
Sbjct: 269 GPMHHRPVQDLAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYG 328

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE--------AHIYHKSSNDCAA 352
            IRQPK+GHL+ELH+AIK+CE+ L+S+DP    +G K +        AH+Y   S DC+A
Sbjct: 329 LIRQPKYGHLKELHRAIKMCEKALVSADPVVTSIGNKQQVWIYYERFAHVYSAESGDCSA 388

Query: 353 FLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNV 412
           FLANYD+ S A V FN   Y LP WS+SILPDC+N VFNTAKV                 
Sbjct: 389 FLANYDTESAARVLFNNVHYNLPPWSISILPDCRNAVFNTAKV----------------- 431

Query: 413 NELLLASSAFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ- 468
                  S F W    E+   +  + +F    L EQIN T+DTSDYLWY  S+ +   + 
Sbjct: 432 -------SNFQWESYLEDLSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSES 484

Query: 469 ----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
               G+   L I+S GHA  +FVN +L    +G      F    KI L+ G N + +LS+
Sbjct: 485 FLHGGELPTLIIQSTGHAVHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSV 544

Query: 525 MVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS 583
            VGL N G  F+    G+   V L  L  GK DLS  +W YQVG++GE + L   +   S
Sbjct: 545 AVGLPNVGGHFESWNTGILGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPS 604

Query: 584 SFWKQGS-TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
             W   S T+   + L W+KT F APEG  PLAL++  MGKGQ WVNG+SIGRYW+A+  
Sbjct: 605 IGWMDASLTVQKPQPLTWHKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF-- 662

Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKIS 702
            +TG    C Y G+Y  +KCQ  CGQP Q  YH+PR W+ P +NLLVI EELGG+PS +S
Sbjct: 663 -ATGDCSHCSYTGTYKPNKCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVS 721

Query: 703 LLTKTGQHICSFVSEADPPPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGI 759
           L+ ++   +C+ VSE   P + +W+      G     P+V L C  G  IA+I FAS+G 
Sbjct: 722 LVKRSVSGVCAEVSEYH-PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGT 780

Query: 760 PEGNCGSFRPGACHM----DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAV 815
           P G CGS++ G CH      +L    + CVG+  C++ +S++  G     CP +LK L V
Sbjct: 781 PLGTCGSYQQGECHAATSYAILERYMQKCVGKARCAVTISNSNFG--KDPCPNVLKRLTV 838

Query: 816 EAHCS 820
           EA C+
Sbjct: 839 EAVCA 843


>gi|224129140|ref|XP_002328900.1| predicted protein [Populus trichocarpa]
 gi|222839330|gb|EEE77667.1| predicted protein [Populus trichocarpa]
          Length = 891

 Score =  852 bits (2202), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/855 (50%), Positives = 566/855 (66%), Gaps = 48/855 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRAL+IDG+RR+L S  IHYPR+TPE+WP+LI KSKEGG +V++TYVFW  HEP+
Sbjct: 35  NVTYDHRALIIDGRRRILNSAGIHYPRATPEMWPDLIAKSKEGGADVVQTYVFWGGHEPV 94

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQYYFEGR+DLV+FVK V E+GL+LHLRIGPY CAEWN+GGFPVWL  +PG+ FRT N 
Sbjct: 95  KGQYYFEGRYDLVKFVKLVGESGLYLHLRIGPYVCAEWNFGGFPVWLRDVPGVVFRTDNA 154

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFKEEM++F+ KI+DLM++E L + QGGPII+ Q+ENEYGN+E ++G GG+ Y+KWAA  
Sbjct: 155 PFKEEMQKFVTKIVDLMREEMLLSWQGGPIIMFQIENEYGNIEHSFGQGGKEYMKWAAGM 214

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+  VPWVMC+Q DAP+ II+ CNG+YCDGF PNSP KPI WTE++ GW+ ++G  +
Sbjct: 215 ALALDAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSPKKPIFWTEDWDGWYTTWGGRL 274

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAFAVARFF+ GG+FQNYYMYFGGTNFGRT+GGP   TSYDYDAPIDEYG + 
Sbjct: 275 PHRPVEDLAFAVARFFQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPIDEYGLLS 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPT-HQKLGAKLEAHIY-------------HKSSND 349
           +PKWGHL++LH AIKLCE  L+++D   + KLG K EAH+Y             + S + 
Sbjct: 335 EPKWGHLKDLHAAIKLCEPALVAADSAQYIKLGPKQEAHVYGGSLSIQGMNFSQYGSQSK 394

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ 409
           C+AFLAN D    A V F G  + LP WSVSILPDC+N VFNTAKV +Q +     F   
Sbjct: 395 CSAFLANIDERQAATVRFLGQSFTLPPWSVSILPDCRNTVFNTAKVAAQTHIKTVEFVLP 454

Query: 410 KNVNELLL--------ASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
            + + LL         +  + SW   +E + +    +F    + E +N TKD SDYLWY 
Sbjct: 455 LSNSSLLPQFIVQNEDSPQSTSWLIAKEPITLWSEENFTVKGILEHLNVTKDESDYLWYF 514

Query: 460 ASIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
             I+V        +  +V   ++I+S+     VF+N +L     G+   A     + ++ 
Sbjct: 515 TRIYVSDDDIAFWEKNKVSPAVSIDSMRDVLRVFINGQLTGSVVGHWVKA----VQPVQF 570

Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGE 571
            +G N L +LS  VGLQNYGA+ +  GAG    I L   KNG  DLS+  W YQVG++GE
Sbjct: 571 QKGYNELVLLSQTVGLQNYGAFLERDGAGFKGQIKLTGFKNGDIDLSNLSWTYQVGLKGE 630

Query: 572 YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQ 631
           ++ +          W + +      +  WYKT F AP G  P+AL+L SMGKGQAWVNG 
Sbjct: 631 FLKVYSTGDNEKFEWSELAVDATPSTFTWYKTFFDAPSGVDPVALDLGSMGKGQAWVNGH 690

Query: 632 SIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIH 691
            IGRYW+  ++P  GC   CDYRG+Y + KC+ +CG P QT YH+PR W+    NLLV+ 
Sbjct: 691 HIGRYWTV-VSPKDGC-GSCDYRGAYSSGKCRTNCGNPTQTWYHVPRAWLEASNNLLVVF 748

Query: 692 EELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACER 745
           EE GG+P +IS+  ++ + IC+ VSE+  PP+  W        N+     +P++ L C+ 
Sbjct: 749 EETGGNPFEISVKLRSAKVICAQVSESHYPPLRKWSRADLTGGNISRNDMTPEMHLKCQD 808

Query: 746 GWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAG 804
           G  +++I FASYG P G+C  F  G CH  +   +V +AC G+ +C I +S+A  G    
Sbjct: 809 GHIMSSIEFASYGTPNGSCQKFSRGNCHASNSSSVVTEACQGKNKCDIAISNAVFG---D 865

Query: 805 ACPGLLKALAVEAHC 819
            C G++K LAVEA C
Sbjct: 866 PCRGVIKTLAVEARC 880


>gi|357153898|ref|XP_003576603.1| PREDICTED: beta-galactosidase 15-like [Brachypodium distachyon]
          Length = 908

 Score =  851 bits (2199), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 433/855 (50%), Positives = 559/855 (65%), Gaps = 45/855 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRA+ + G+RR+L S  +HYPR+TPE+WP +I K KEGG +VIETY+FWN HEP 
Sbjct: 51  NVSYDHRAVRVGGERRMLVSAGVHYPRATPEMWPSIIAKCKEGGADVIETYIFWNGHEPA 110

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQYYFE RFDLVRF+K V   GLFL LRIGPYACAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 111 KGQYYFEERFDLVRFIKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 170

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P+K EM+ F+ KI+D+MK E L++ QGGPIIL Q+ENEYGN++  YG  G+ Y++WAA  
Sbjct: 171 PYKAEMQTFVTKIVDMMKDEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 230

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+T +PWVMC+Q DAP+ I++TCN FYCDGF PNS +KP +WTE++ GW+  +G  +
Sbjct: 231 ALGLDTGIPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGPL 290

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL  TSYDYDAPI+EYG +R
Sbjct: 291 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPINEYGMLR 350

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKS-----------SNDC 350
           QPKWGHL++LH AIKLCE  LI+ D  P + KLG+  EAHIY  +           +  C
Sbjct: 351 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYVKLGSMQEAHIYSSAKVHTNGSTAGNAQIC 410

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-----NGDHP 405
           +AFLAN D     +V   G  Y LP WSVSILPDC+NV FNTA+V +Q +     +G   
Sbjct: 411 SAFLANIDEHKYVSVWIFGKSYNLPPWSVSILPDCENVAFNTARVGAQTSVFTFESGSPS 470

Query: 406 FAQQKNVNELL------LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
            + ++  + LL        SS +   +E +G  G+ SF    + E +N TKD SDYLWYT
Sbjct: 471 HSSRREPSVLLPGVRGSYLSSTWWTSKETIGTWGDGSFATQGILEHLNVTKDISDYLWYT 530

Query: 460 ASIHVM-------PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
            S+++          +G    L I+ +   A VFVN KL     G+       + + I+ 
Sbjct: 531 TSVNISDEDVAFWSSKGVLPSLIIDQIRDVARVFVNGKLAGSQVGHW----VSLKQPIQF 586

Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGE 571
             G+N L +LS +VGLQNYGA+ +  GAG    V L  L NG  DL++  W YQVG++GE
Sbjct: 587 VRGLNELTLLSEIVGLQNYGAFLEKDGAGFKGQVKLTGLSNGDTDLTNSAWTYQVGLKGE 646

Query: 572 YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQ 631
           +  +        + W    T  +     WYKT   APEG  P+A++L SMGKGQAWVNG+
Sbjct: 647 FSMIYTPEKQECAEWSAMQTDNIQSPFTWYKTMVDAPEGTDPVAIDLGSMGKGQAWVNGR 706

Query: 632 SIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIH 691
            IGRYWS  +AP +GC   C+Y G+Y  +KCQ +CG P Q+ YHIPR W+    NLLV+ 
Sbjct: 707 LIGRYWS-LVAPESGCPSSCNYPGAYSETKCQSNCGMPTQSWYHIPREWLQESNNLLVLF 765

Query: 692 EELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK----PNLGVVSSSPQVRLACERGW 747
           EE GGDPSKISL     + ICS +SE   PP+ +W       + V S +P++ L C+ G+
Sbjct: 766 EETGGDPSKISLEVHYTKTICSRISENYYPPLSAWSWLDTGRVSVDSVAPELLLRCDDGY 825

Query: 748 HIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGAC 806
            I+ I FASYG P G C +F  G CH    L  V +ACVG+ +C+I VS+   G     C
Sbjct: 826 EISRITFASYGTPSGGCQNFSKGKCHAASTLDFVTEACVGKNKCAISVSNDVFG---DPC 882

Query: 807 PGLLKALAVEAHCSI 821
            G+LK LAVEA CS+
Sbjct: 883 RGVLKDLAVEAECSL 897


>gi|115488372|ref|NP_001066673.1| Os12g0429200 [Oryza sativa Japonica Group]
 gi|122234131|sp|Q0INM3.1|BGL15_ORYSJ RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
           Precursor
 gi|113649180|dbj|BAF29692.1| Os12g0429200 [Oryza sativa Japonica Group]
          Length = 919

 Score =  848 bits (2192), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 438/857 (51%), Positives = 562/857 (65%), Gaps = 52/857 (6%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRA++I GKRR+L S  +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP 
Sbjct: 63  NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQYYFE RFDLV+F K V   GLFL LRIGPYACAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK EM+ F+ KI+ LMK+E L++ QGGPIIL Q+ENEYGN++  YG  G+ Y++WAA  
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+T +PWVMC+Q DAP+ II+TCN FYCDGF PNS +KP +WTE++ GW+  +G A+
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL  TSYDYDAPIDEYG +R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-----------SSNDC 350
           QPKWGHL++LH AIKLCE  LI+ D  P + KLG+  EAH+Y             ++  C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPF 406
           +AFLAN D    A+V   G  Y LP WSVSILPDC+NV FNTA++ +Q +        P 
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482

Query: 407 AQQKNVNELLLASS-----AFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
              ++   +L  +S     + +W+  +E +G  G  +F    + E +N TKD SDYLWYT
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542

Query: 460 ASIHVMPG-------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
             +++          +G    L I+ +   A VFVN KL     G+       + + I+L
Sbjct: 543 TRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQL 598

Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGE 571
            EG+N L +LS +VGLQNYGA+ +  GAG    V L  L +G  DL++  W YQVG++GE
Sbjct: 599 VEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGE 658

Query: 572 YIGL---DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
           +  +   +K   A  S  ++ S  P      WYKT F  P+G  P+A++L SMGKGQAWV
Sbjct: 659 FSMIYAPEKQGCAGWSRMQKDSVQP----FTWYKTMFSTPKGTDPVAIDLGSMGKGQAWV 714

Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLL 688
           NG  IGRYWS  +AP +GC+  C Y G+Y+  KCQ +CG P Q  YHIPR W+   +NLL
Sbjct: 715 NGHLIGRYWS-LVAPESGCSSSCYYPGAYNERKCQSNCGMPTQNWYHIPREWLKESDNLL 773

Query: 689 VIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW----KPNLGVVSSSPQVRLACE 744
           V+ EE GGDPS ISL     + +CS +SE   PP+ +W         V +++P++RL C+
Sbjct: 774 VLFEETGGDPSLISLEAHYAKTVCSRISENYYPPLSAWSHLSSGRASVNAATPELRLQCD 833

Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSA 803
            G  I+ I FASYG P G C +F  G CH    L +V +ACVG  +C+I VS+   G   
Sbjct: 834 DGHVISEITFASYGTPSGGCLNFSKGNCHASSTLDLVTEACVGNTKCAISVSNDVFG--- 890

Query: 804 GACPGLLKALAVEAHCS 820
             C G+LK LAVEA CS
Sbjct: 891 DPCRGVLKDLAVEAKCS 907


>gi|222618730|gb|EEE54862.1| hypothetical protein OsJ_02342 [Oryza sativa Japonica Group]
          Length = 839

 Score =  848 bits (2192), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/836 (50%), Positives = 549/836 (65%), Gaps = 46/836 (5%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPE------------VWPELIRKSKEGGLEVIET 53
           TYD +A+V++G+RR+L SGSIHYPRSTPE            +WP+LI K+K+GGL+V++T
Sbjct: 27  TYDRKAVVVNGQRRILISGSIHYPRSTPEARRTRFPFLLLTMWPDLIEKAKDGGLDVVQT 86

Query: 54  YVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI 113
           YVFWN HEP  GQYYFEGR+DLV F+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++
Sbjct: 87  YVFWNGHEPSPGQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYV 146

Query: 114 PGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG 173
           PGI FRT N PFK EM++F  KI+++MK E LF  QGGPIIL+Q+ENE+G +EW  G   
Sbjct: 147 PGISFRTDNEPFKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPA 206

Query: 174 ELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
           + Y  WAA+ AV LNTSVPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++
Sbjct: 207 KAYASWAANMAVALNTSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWT 266

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
            W+  FG  VP RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYD
Sbjct: 267 AWYTGFGIPVPHRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYD 326

Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAF 353
           APIDEYG +R+PKWGHL++LHKAIKLCE  L++ DP    LG   ++ ++  S+  CAAF
Sbjct: 327 APIDEYGLLREPKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAF 386

Query: 354 LANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVN 413
           L N D  S A V FNG  Y LP WS+SILPDCK  VFNTA+V SQ        +Q K   
Sbjct: 387 LENKDKVSYARVAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK--- 436

Query: 414 ELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ--- 468
             +  +  F+W  Y E++   G        L EQIN T+D +DYLWYT  + V   +   
Sbjct: 437 --MEWAGGFAWQSYNEEINSFGEDPLTTVGLLEQINVTRDNTDYLWYTTYVDVAQDEQFL 494

Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
             G+ + L + S GHA  +F+N +L    YG+ D         ++L  G NT+  LS+ V
Sbjct: 495 SNGENLKLTVMSAGHALHIFINGQLKGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAV 554

Query: 527 GLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
           GL N G  F+   AG+   + +D L  G+RDL+  +W YQVG++GE + L  +S +++  
Sbjct: 555 GLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVE 614

Query: 586 WKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
           W +    PV K  L WYK  F AP+G  PLAL+++SMGKGQ W+NGQ IGRYW  Y A  
Sbjct: 615 WGE----PVQKQPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA-- 668

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
           +G    CDYRG YD +KCQ +CG  +Q  YH+PR+W+ P  NLLVI EE GGDP+ IS++
Sbjct: 669 SGNCGTCDYRGEYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMV 728

Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNC 764
            ++   +C+ VSE   P + +W           +V L C+ G  I  I FAS+G P+G+C
Sbjct: 729 KRSIGSVCADVSEWQ-PSMKNWHTK---DYEKAKVHLQCDNGQKITEIKFASFGTPQGSC 784

Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           GS+  G CH      I  K CVGQ  C + V     G     CPG +K   VEA C
Sbjct: 785 GSYTEGGCHAHKSYDIFWKNCVGQERCGVSVVPEIFG--GDPCPGTMKRAVVEAIC 838


>gi|297826725|ref|XP_002881245.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327084|gb|EFH57504.1| hypothetical protein ARALYDRAFT_902346 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 887

 Score =  847 bits (2187), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/848 (50%), Positives = 551/848 (64%), Gaps = 38/848 (4%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+I  KRR+L S  IHYPR+TPE+W +LI KSKEGG +VI+TYVFW+ HEP+
Sbjct: 37  NVSYDHRALIIADKRRMLVSAGIHYPRATPEMWSDLIEKSKEGGADVIQTYVFWSGHEPV 96

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEGR+DLV+FVK +  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGIQFRT N 
Sbjct: 97  KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIQFRTDNE 156

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+EM++F+ KI+DLM+   LF  QGGPII+ Q+ENEYG+VE +YG  G+ YVKWAA  
Sbjct: 157 PFKKEMQKFVTKIVDLMRDAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMC+Q DAP+ II+ CNG+YCDGF PNS  KPI+WTE++ GW+  +G ++
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSQMKPILWTEDWDGWYTKWGGSL 276

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP EDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP   TSYDYDAP+DEYG   
Sbjct: 277 PHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRS 336

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND----CAAFLANYD 358
           +PKWGHL++LH AIKLCE  L+++D P ++KLG+  EAHIY          CAAFLAN D
Sbjct: 337 EPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSNQEAHIYRGDGETGGKVCAAFLANID 396

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---------Q 409
               A+V FNG  Y LP WSVSILPDC++V FNTAKV +Q +      A+         Q
Sbjct: 397 EHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGSKSILQ 456

Query: 410 KNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
           K V +  ++  + SW   +E +GI G  +F    L E +N TKD SDYLW+   I V   
Sbjct: 457 KVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRITVSED 516

Query: 468 Q-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
                   G    ++I+S+     VFVNK+L     G+   A     + +   +G N L 
Sbjct: 517 DISFWKKNGANPTVSIDSMRDVLRVFVNKQLSGSVVGHWVKA----VQPVRFMQGNNDLL 572

Query: 521 ILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
           +L+  VGLQNYGA+ +  GAG      L   KNG  DL+   W YQVG++GE   +  + 
Sbjct: 573 LLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDMDLAKSSWTYQVGLKGEAEKIYTVE 632

Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
               + W    T       +WYKT F  P G  P+ L+L SMGKGQAWVNG  IGRYW+ 
Sbjct: 633 HNEKAEWSTLETDASPSIFMWYKTYFDTPAGTDPVVLDLESMGKGQAWVNGHHIGRYWNI 692

Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
            ++   GC + CDYRG+Y + KC  +CG+P QT YH+PR+W+ P  NLLV+ EE GG+P 
Sbjct: 693 -ISQKDGCERTCDYRGAYYSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPF 751

Query: 700 KISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACERGWHIAAIN 753
            IS+ T T   +C  V E+  PP+  W         + + S +P+V L CE G  I++I 
Sbjct: 752 NISVKTVTAGILCGQVLESHYPPLRKWSTPDYINGTMSINSVAPEVYLHCEDGHVISSIE 811

Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
           FASYG P G+C  F  G CH  + L IV +AC G+  C I VS+      +  C G LK 
Sbjct: 812 FASYGTPRGSCDRFSIGKCHASNSLSIVSEACKGRTSCFIEVSNT--AFRSDPCSGTLKT 869

Query: 813 LAVEAHCS 820
           LAV A CS
Sbjct: 870 LAVMARCS 877


>gi|357518749|ref|XP_003629663.1| Beta-galactosidase [Medicago truncatula]
 gi|355523685|gb|AET04139.1| Beta-galactosidase [Medicago truncatula]
          Length = 912

 Score =  845 bits (2184), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 442/863 (51%), Positives = 571/863 (66%), Gaps = 56/863 (6%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRAL+IDG RR+L S  IHYPR+TPE+WP+LI K+KEGG++VIETYVFWN H+P+
Sbjct: 49  NVTYDHRALIIDGHRRMLISAGIHYPRATPEMWPDLIAKAKEGGVDVIETYVFWNGHQPV 108

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEGR+DLV+F K V   GL+  LRIGPYACAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 109 KGQYNFEGRYDLVKFAKLVASNGLYFFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTNNA 168

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQV------ENEYGNVEWAYGVGGELYV 177
           PFKEEMKRF++K+++LM++E LF+ QGGPIIL QV      ENEYGN+E +YG  G+ YV
Sbjct: 169 PFKEEMKRFVSKVVNLMREEMLFSWQGGPIILLQVRREYGIENEYGNLESSYGNEGKEYV 228

Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
           KWAA  A++L   VPWVMC+Q DAP  II+TCN +YCDGF PNS +KPI WTEN+ GW+ 
Sbjct: 229 KWAASMALSLGAGVPWVMCKQPDAPYDIIDTCNAYYCDGFKPNSRNKPIFWTENWDGWYT 288

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
            +G  +P RPVEDLAFAVARFF+ GG+ QNYYMYFGGTNFGRTAGGPL  TSYDYDAPID
Sbjct: 289 QWGERLPHRPVEDLAFAVARFFQRGGSLQNYYMYFGGTNFGRTAGGPLQITSYDYDAPID 348

Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKS---------- 346
           EYG + +PKWGHL++LH A+KLCE  L+++D PT+ KLG+K EAH+Y ++          
Sbjct: 349 EYGLLNEPKWGHLKDLHAALKLCEPALVAADSPTYIKLGSKQEAHVYQENVHREGLNLSI 408

Query: 347 ---SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN--- 400
              SN C+AFLAN D    A VTF G  Y LP WSVSILPDC++ +FNTAKV +Q +   
Sbjct: 409 SQISNKCSAFLANIDERKAATVTFRGQTYTLPPWSVSILPDCRSAIFNTAKVGAQTSVKL 468

Query: 401 -NGDHPFA-----QQKNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDT 452
              + P        Q++++   ++  + SW   +E + I  N SF    + E +N TKD 
Sbjct: 469 VGSNLPLTSNLLLSQQSIDHNGISHISKSWMTTKEPINIWINSSFTAEGIWEHLNVTKDQ 528

Query: 453 SDYLWYTASIHVMPGQ-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFL 505
           SDYLWY+  I+V  G             L I+S+     VFVN +L+    G+   A   
Sbjct: 529 SDYLWYSTRIYVSDGDILFWKENAAHPKLAIDSVRDILRVFVNGQLIGNVVGHWVKA--- 585

Query: 506 INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIY 564
             + ++   G N L +L+  VGLQNYGA+ +  GAG+   I I   +NG  DLS   W Y
Sbjct: 586 -VQTLQFQPGYNDLTLLTQTVGLQNYGAFIEKDGAGIRGTIKITGFENGHIDLSKPLWTY 644

Query: 565 QVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKG 624
           QVG++GE++        N+  W + +   +  +  WYKT F  P G  P+AL+L SMGKG
Sbjct: 645 QVGLQGEFLKFYNEESENAG-WVELTPDAIPSTFTWYKTYFDVPGGNDPVALDLESMGKG 703

Query: 625 QAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG 684
           QAWVNG  IGRYW+  ++P TGC + CDYRG+YD+ KC  +CG+P QTLYH+PR+W+   
Sbjct: 704 QAWVNGHHIGRYWTR-VSPKTGC-QVCDYRGAYDSDKCTTNCGKPTQTLYHVPRSWLKAS 761

Query: 685 ENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW--KPNLGV--VSSS---P 737
            N LVI EE GG+P  IS+   +   +C+ VS++  PP+        LG   VSS+   P
Sbjct: 762 NNFLVILEETGGNPLGISVKLHSASIVCAQVSQSYYPPMQKLLNASLLGQQEVSSNDMIP 821

Query: 738 QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSS 796
           ++ L C  G  I++I FAS+G P G+C SF  G CH      IV KAC+G+  CSI +SS
Sbjct: 822 EMNLRCRDGNIISSITFASFGTPGGSCQSFSRGNCHAPSSKSIVSKACLGKRSCSIKISS 881

Query: 797 AYLGVSAGACPGLLKALAVEAHC 819
              G     C  ++K L+VEA C
Sbjct: 882 DVFG--GDPCQDVVKTLSVEARC 902


>gi|308550956|gb|ADO34792.1| beta-galactosidase STBG7 [Solanum lycopersicum]
          Length = 870

 Score =  845 bits (2184), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/835 (50%), Positives = 537/835 (64%), Gaps = 27/835 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYD R+L+I+G+R++L S SIHYPRS P +WP L+R +KEGG++VIETYVFWN HEP 
Sbjct: 45  SVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVFWNGHEPS 104

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G YYF GRFDLV+F K +Q+AG+++ LRIGP+  AEWN+GG PVWLH++PG  FRT + 
Sbjct: 105 PGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGTTFRTDSE 164

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M++F+   ++LMK+E LFASQGGPIIL+QVENEYG  E AYG GG+ Y  WAA  
Sbjct: 165 PFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRYALWAAKM 224

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A++ NT VPW+MCQQ DAPDP+I+TCN FYCD F P SP+KP +WTEN+ GWF +FG   
Sbjct: 225 ALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWFKTFGARD 284

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED+A++VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG  R
Sbjct: 285 PHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 344

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
            PKWGHL+ELHK IK CE  L+++DPT   LG   EA +Y  +S  CAAFLAN D  +D 
Sbjct: 345 FPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLANMDDKNDK 404

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD------HPFAQQKNVNELLL 417
            V F    Y LPAWSVSILPDCKNV FNTAKV  Q +  +      HP A     +   +
Sbjct: 405 VVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSPKRD---I 461

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN-- 475
            S  +  ++E  G+ G   F +    + INTTKD +DYLWYT SI V     +E FL   
Sbjct: 462 KSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFV---HAEEDFLRNR 518

Query: 476 ------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
                 +ES GHA  VF+NKKL A   GN     F     I L  G N + +LSM VGLQ
Sbjct: 519 GTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEIALLSMTVGLQ 578

Query: 530 NYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
             GA+++  GAG  SV +   K G  DL++  W Y++G++GE++ + K     S  W   
Sbjct: 579 TAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQKSYNLKSKIWAPT 638

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           S  P  + L WYK    AP G  P+AL++  MGKG AW+NGQ IGRYW    +    C  
Sbjct: 639 SQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKYENCVT 698

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
           +CDYRG ++  KC   CGQP Q  YH+PR+W  P  N+L+I EE+GGDPS+I    +   
Sbjct: 699 QCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRFSMRKVS 758

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGS 766
             C  +S  D P  D        + S    P + L C    +I+++ FAS+G P G CGS
Sbjct: 759 GACGHLS-VDHPSFDVENLQGSEIESDKNRPTLSLKCPTNTNISSVKFASFGNPNGTCGS 817

Query: 767 FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +  G CH  +   +V+K C+ Q EC++ +SSA   +    CP  +K LAVE +CS
Sbjct: 818 YMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQ--LCPSTVKKLAVEVNCS 870


>gi|359480881|ref|XP_003632537.1| PREDICTED: beta-galactosidase 3-like [Vitis vinifera]
 gi|296082595|emb|CBI21600.3| unnamed protein product [Vitis vinifera]
          Length = 847

 Score =  845 bits (2182), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/831 (50%), Positives = 542/831 (65%), Gaps = 13/831 (1%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L+ANVTYD R+L+IDG+R++L S SIHYPRS P +WP L++ +KEGG++VIETYVFWN H
Sbjct: 19  LAANVTYDRRSLIIDGQRKLLISASIHYPRSVPGMWPGLVKTAKEGGIDVIETYVFWNGH 78

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           E     YYF GR+DL++FVK VQ+A ++L LR+GP+  AEWN+GG PVWLH++PG  FRT
Sbjct: 79  ELSPDNYYFGGRYDLLKFVKIVQQARMYLILRVGPFVAAEWNFGGVPVWLHYVPGTVFRT 138

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            + PFK  M++F+  I+++MK+E LFASQGGPIILAQVENEYG+ E  YG GG+ Y  WA
Sbjct: 139 NSEPFKYHMQKFMTLIVNIMKKEKLFASQGGPIILAQVENEYGDTERIYGDGGKPYAMWA 198

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A++ N  VPW+MCQQ DAPDP+INTCN FYCD FTPNSP+KP MWTEN+ GWF +FG
Sbjct: 199 ANMALSQNIGVPWIMCQQYDAPDPVINTCNSFYCDQFTPNSPNKPKMWTENWPGWFKTFG 258

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRT+GGP + TSYDY+APIDEYG
Sbjct: 259 APDPHRPHEDIAFSVARFFQKGGSLQNYYMYHGGTNFGRTSGGPFITTSYDYNAPIDEYG 318

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKWGHL+ELH+AIK CE  L+  +P +  LG   E  +Y  SS  CAAF++N D  
Sbjct: 319 LARLPKWGHLKELHRAIKSCEHVLLYGEPINLSLGPSQEVDVYTDSSGGCAAFISNVDEK 378

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLAS 419
            D  + F    Y +PAWSVSILPDCKNVVFNTAKV SQ +  +  P   Q ++       
Sbjct: 379 EDKIIVFQNVSYHVPAWSVSILPDCKNVVFNTAKVGSQTSQVEMVPEELQPSLVPSNKDL 438

Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG--KEV--- 472
               W  + EK GI G   FV+    + INTTKDT+DYLWYT S+ V   +   KE+   
Sbjct: 439 KGLQWETFVEKAGIWGEADFVKNGFVDHINTTKDTTDYLWYTVSLTVGESENFLKEISQP 498

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L +ES GHA   FVN+KL     GN   + F     I L  G N + +LSM VGLQN G
Sbjct: 499 VLLVESKGHALHAFVNQKLQGSASGNGSHSPFKFECPISLKAGKNDIALLSMTVGLQNAG 558

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            +++  GAGL SV +  L NG  DLS+  W Y++G++GE++ + K    NS  W      
Sbjct: 559 PFYEWVGAGLTSVKIKGLNNGIMDLSTYTWTYKIGLQGEHLLIYKPEGLNSVKWLSTPEP 618

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
           P  + L WYK     P G  P+ L++  MGKG AW+NG+ IGRYW    +    C ++CD
Sbjct: 619 PKQQPLTWYKAVVDPPSGNEPIGLDMVHMGKGLAWLNGEEIGRYWPRKSSIHDKCVQECD 678

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           YRG +  +KC   CG+P Q  YH+PR+W  P  N+LVI EE GGDP+KI    +    +C
Sbjct: 679 YRGKFMPNKCSTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTKIRFSRRKTTGVC 738

Query: 713 SFVSEADPP-PVDSWKPNLGVVSSSP-QVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           + VSE  P   ++SW  +    + +   + L C    HI+++ FASYG P G CGS+  G
Sbjct: 739 ALVSEDHPTYELESWHKDANENNKNKATIHLKCPENTHISSVKFASYGTPTGKCGSYSQG 798

Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH  +   +V+K C+ + +C+I ++      S   CP   K LAVEA CS
Sbjct: 799 DCHDPNSASVVEKLCIRKNDCAIELAEK--NFSKDLCPSTTKKLAVEAVCS 847


>gi|302799737|ref|XP_002981627.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
 gi|300150793|gb|EFJ17442.1| hypothetical protein SELMODRAFT_421090 [Selaginella moellendorffii]
          Length = 874

 Score =  845 bits (2182), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/862 (49%), Positives = 573/862 (66%), Gaps = 62/862 (7%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N++YDHRA++I G+RR+L SG IHYPR++P++WP LIR +KEGGL++I+TYVFW+ HE
Sbjct: 20  ATNISYDHRAIIIGGQRRILISGCIHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G Y F+GR+DL+RF+K V +AGL+++LRIGPY CAEWN+GGFP WL  +PGIQFRT 
Sbjct: 80  PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTH 139

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  F+++M+ F+ KI+D++K E LFASQGGP++ +Q+ENEYGNV+ +YG+ G+ Y+ WAA
Sbjct: 140 NRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGINGKTYMLWAA 199

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A +L T VPW+MC+Q DAPD IINTCNG+YCDG+ PNS  KP MWTEN+SGW+ S+G 
Sbjct: 200 RMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQSWGE 259

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYM------------------YFGGTNFGRTAGG 283
           A P+R VED+AFAVARFF+ GG  QNYYM                  YFGGTNFGRT+GG
Sbjct: 260 AAPYRTVEDVAFAVARFFQRGGVAQNYYMVRTLHDLEQRLLMPERCQYFGGTNFGRTSGG 319

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG---AKLEA 340
           P + TSYDYDAP+DE+G +RQPKWGHL+ELH A+KLCE  L S+DP +  LG     ++A
Sbjct: 320 PFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPVYYTLGRMQEMVQA 379

Query: 341 HIYHKSSND---------CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
           H+Y   S +         CAAFLAN D+SS A+V F G VY LP WSVSILPDC+NVVFN
Sbjct: 380 HVYSDGSLEANFSNLATPCAAFLANIDTSS-ASVKFGGKVYNLPPWSVSILPDCRNVVFN 438

Query: 392 TAKVISQRNNGDHPFAQQKNVNEL--------LLASSAFSWYEEKVGISGNRSFVRPDLA 443
           TA+V +Q +       Q+ ++ E         L+   A+ W++E VG SG    +   L 
Sbjct: 439 TAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHALL 498

Query: 444 EQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDF 501
           EQI+TT D++DY+WY+    ++  +  G +  L I S+     +FVN +           
Sbjct: 499 EQISTTNDSTDYMWYSTRFEILDQELKGGDPVLVITSMRDMVHIFVNGEFAGSTSTLKSG 558

Query: 502 ANFL-INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSS 559
             +  + + I L  G+N L ILS  VGLQNYGA  +  GAG+   I I  L  G R+L+S
Sbjct: 559 GLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSIWIQGLSTGTRNLTS 618

Query: 560 GEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLA 619
             W++QVG+ GE+   D I+      W   ++LP  + L+WYK  F  P+G  P+A++L 
Sbjct: 619 ALWLHQVGLNGEH---DAIT------WSSTTSLPFFQPLVWYKANFNIPDGDDPVAIHLG 669

Query: 620 SMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRT 679
           SMGKGQAWVNG S+GR+W    APSTGC+ +CDYRG+Y +SKC   CG P+Q  YH+PR 
Sbjct: 670 SMGKGQAWVNGHSLGRFWPVITAPSTGCSDRCDYRGTYYSSKCLSSCGLPSQEWYHVPRE 729

Query: 680 WVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQV 739
           W+   +N LV+ EE+GG+ S +S  ++    +C+ VSE   PPV  +       SS P++
Sbjct: 730 WLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQF-------SSLPEL 782

Query: 740 RLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAY 798
            L+C  G  I++I FAS+G P+G CG+F+ G+CH ++   IV+KAC+G+  CS  +    
Sbjct: 783 GLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKACIGRQSCSFEIFWKN 842

Query: 799 LGVSAGACPGLLKALAVEAHCS 820
            G     CPG  K LAVEA C+
Sbjct: 843 FGTD--PCPGKAKTLAVEAACT 862


>gi|350537729|ref|NP_001234307.1| beta-galactosidase, chloroplastic precursor [Solanum lycopersicum]
 gi|7939621|gb|AAF70823.1|AF154422_1 beta-galactosidase [Solanum lycopersicum]
          Length = 870

 Score =  845 bits (2182), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/835 (50%), Positives = 537/835 (64%), Gaps = 27/835 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYD R+L+I+G+R++L S SIHYPRS P +WP L+R +KEGG++VIETYVFWN HEP 
Sbjct: 45  SVTYDRRSLIINGQRKLLISASIHYPRSVPAMWPGLVRLAKEGGVDVIETYVFWNGHEPS 104

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G YYF GRFDLV+F K +Q+AG+++ LRIGP+  AEWN+GG PVWLH++PG  FRT + 
Sbjct: 105 PGNYYFGGRFDLVKFCKIIQQAGMYMILRIGPFVAAEWNFGGLPVWLHYVPGTTFRTDSE 164

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M++F+   ++LMK+E LFASQGGPIIL+QVENEYG  E AYG GG+ Y  WAA  
Sbjct: 165 PFKYHMQKFMTYTVNLMKRERLFASQGGPIILSQVENEYGYYENAYGEGGKRYALWAAKM 224

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A++ NT VPW+MCQQ DAPDP+I+TCN FYCD F P SP+KP +WTEN+ GWF +FG   
Sbjct: 225 ALSQNTGVPWIMCQQYDAPDPVIDTCNSFYCDQFKPISPNKPKIWTENWPGWFKTFGARD 284

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED+A++VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG  R
Sbjct: 285 PHRPAEDVAYSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLPR 344

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
            PKWGHL+ELHK IK CE  L+++DPT   LG   EA +Y  +S  CAAFLAN D  +D 
Sbjct: 345 FPKWGHLKELHKVIKSCEHALLNNDPTLLSLGPLQEADVYEDASGACAAFLANMDDKNDK 404

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD------HPFAQQKNVNELLL 417
            V F    Y LPAWSVSILPDCKNV FNTAKV  Q +  +      HP A     +   +
Sbjct: 405 VVQFRHVSYHLPAWSVSILPDCKNVAFNTAKVGCQTSIVNMAPIDLHPTASSPKRD---I 461

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN-- 475
            S  +  ++E  G+ G   F +    + INTTKD +DYLWYT SI V     +E FL   
Sbjct: 462 KSLQWEVFKETAGVWGVADFTKNGFVDHINTTKDATDYLWYTTSIFV---HAEEDFLRNR 518

Query: 476 ------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
                 +ES GHA  VF+NKKL A   GN     F     I L  G N + +LSM VGLQ
Sbjct: 519 GTAMLFVESKGHAMHVFINKKLQASASGNGTVPQFKFGTPIALKAGKNEISLLSMTVGLQ 578

Query: 530 NYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
             GA+++  GAG  SV +   K G  DL++  W Y++G++GE++ + K     S  W   
Sbjct: 579 TAGAFYEWIGAGPTSVKVAGFKTGTMDLTASAWTYKIGLQGEHLRIQKSYNLKSKIWAPT 638

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           S  P  + L WYK    AP G  P+AL++  MGKG AW+NGQ IGRYW    +    C  
Sbjct: 639 SQPPKQQPLTWYKAVVDAPPGNEPVALDMIHMGKGMAWLNGQEIGRYWPRRTSKYENCVT 698

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
           +CDYRG ++  KC   CGQP Q  YH+PR+W  P  N+L+I EE+GGDPS+I    +   
Sbjct: 699 QCDYRGKFNPDKCVTGCGQPTQRWYHVPRSWFKPSGNVLIIFEEIGGDPSQIRFSMRKVS 758

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSS---PQVRLACERGWHIAAINFASYGIPEGNCGS 766
             C  +S  D P  D        + +    P + L C    +I+++ FAS+G P G CGS
Sbjct: 759 GACGHLS-VDHPSFDVENLQGSEIENDKNRPTLSLKCPTNTNISSVKFASFGNPNGTCGS 817

Query: 767 FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +  G CH  +   +V+K C+ Q EC++ +SSA   +    CP  +K LAVE +CS
Sbjct: 818 YMLGDCHDQNSAALVEKVCLNQNECALEMSSANFNMQ--LCPSTVKKLAVEVNCS 870


>gi|357130338|ref|XP_003566806.1| PREDICTED: beta-galactosidase 2-like [Brachypodium distachyon]
          Length = 831

 Score =  844 bits (2181), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/824 (51%), Positives = 548/824 (66%), Gaps = 29/824 (3%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP  
Sbjct: 29  VTYDRKAVVVNGQRRILLSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSP 88

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY+FEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFP+WL ++PGI FRT N P
Sbjct: 89  GQYHFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPIWLKYVPGISFRTDNEP 148

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK EM++F  KI+ +MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ A
Sbjct: 149 FKAEMQKFTTKIVQMMKSERLFEWQGGPIILSQIENEFGPLEWDQGEPAKDYASWAANMA 208

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           + LNT VPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP
Sbjct: 209 MALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 268

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDLA+ VA+F + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAP+DEYG +R+
Sbjct: 269 HRPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFERTAGGPFIATSYDYDAPLDEYGLLRE 328

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PKWGHL+ELH+AIKLCE  L+++DP    LG   +A ++  S+  CAAFL N    S A 
Sbjct: 329 PKWGHLKELHRAIKLCEPALVAADPILSSLGNAQKASVFRSSTGACAAFLENKHKLSYAR 388

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V+FNG  Y LP WS+SILPDCK  VFNTA+V SQ        +Q K   E     +  S+
Sbjct: 389 VSFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK--MEWAGGLTWQSY 439

Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESL 479
            EE    S   SF    L EQIN T+D +DYLWYT  + V   +     GK   L + S 
Sbjct: 440 NEEINSFSELESFTTVGLLEQINMTRDNTDYLWYTTYVDVAKDEQFLTSGKNPKLTVMSA 499

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
           GHA  VF+N +L    YG+ +        K++L  G NT+  LS+ VGL N G  F+   
Sbjct: 500 GHALHVFINGQLSGTVYGSVENPKLTYTGKVKLWSGSNTISCLSIAVGLPNVGEHFETWN 559

Query: 540 AGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS- 597
           AG+   + +D L  GKRDL+  +W YQVG++GE + L  +S ++S  W +    PV K  
Sbjct: 560 AGILGPVTLDGLNEGKRDLTWQKWTYQVGLKGEAMSLHSLSGSSSVEWGE----PVQKQP 615

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L WYK  F AP+G  PLAL++ SMGKGQ W+NGQ IGRYW  Y A  +G    CDYRG Y
Sbjct: 616 LTWYKAFFNAPDGDEPLALDMNSMGKGQIWINGQGIGRYWPGYKA--SGTCGHCDYRGEY 673

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
           + +KCQ +CG P+Q  YH+PR W++P  NLLVI EE GGDP+ IS++ +T   +C+ VSE
Sbjct: 674 NETKCQTNCGDPSQRWYHVPRPWLNPTGNLLVIFEEWGGDPTGISMVKRTTGSVCADVSE 733

Query: 718 ADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMD-V 776
              P + +W+          +V L C+ G  I  I FAS+G P+G+CG++  G CH    
Sbjct: 734 WQ-PSIKNWRTK---DYEKAEVHLQCDHGRKITEIKFASFGTPQGSCGNYSEGGCHAHRS 789

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             I +K C+ Q  C + V     G     CPG +K   VE  CS
Sbjct: 790 YDIFKKNCINQEWCGVSVVPEAFG--GDPCPGTMKRAVVEVTCS 831


>gi|61162196|dbj|BAD91080.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 851

 Score =  842 bits (2176), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/827 (51%), Positives = 539/827 (65%), Gaps = 13/827 (1%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YD R+L+IDG+R++L S +IHYPRS PE+WP+L++ +KEGG++VIETYVFWN HEP 
Sbjct: 28  NVSYDSRSLIIDGQRKLLISAAIHYPRSVPEMWPKLVQTAKEGGVDVIETYVFWNGHEPS 87

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G YYF GR+DLV+FVK V++AG+ L LRIGP+  AEW +GG PVWLH++PG  FRT N 
Sbjct: 88  PGNYYFGGRYDLVKFVKIVEQAGMHLILRIGPFVAAEWYFGGIPVWLHYVPGTVFRTENK 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M++F   I+DLMKQE  FASQGGPIILAQVENEYG  E  YG GG+ Y  WAA  
Sbjct: 148 PFKYHMQKFTTFIVDLMKQEKFFASQGGPIILAQVENEYGYYEKDYGEGGKQYAMWAASM 207

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+ N  VPW+MCQQ DAP+ +INTCN FYCD FTP   +KP +WTEN+ GWF +FG   
Sbjct: 208 AVSQNIGVPWIMCQQFDAPESVINTCNSFYCDQFTPIYQNKPKIWTENWPGWFKTFGGWN 267

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED+AF+VARFF+ GG+  NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG  R
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 327

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
            PKWGHL++LH+AIKLCE  +++S PT+  LG  LEA ++  SS  CAAF+AN D  +D 
Sbjct: 328 LPKWGHLKQLHRAIKLCEHIMLNSQPTNVSLGPSLEADVFTNSSGACAAFIANMDDKNDK 387

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLASSAF 422
            V F    Y LPAWSVSILPDCKNVVFNTAKV SQ +  +  P + Q +V     +    
Sbjct: 388 TVEFRNMSYHLPAWSVSILPDCKNVVFNTAKVGSQSSVVEMLPESLQLSVGSADKSLKDL 447

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
            W  + EK GI G   FV+  L + INTTK T+DYLWYT SI V   +     G    L 
Sbjct: 448 KWDVFVEKAGIWGEADFVKSGLVDHINTTKFTTDYLWYTTSILVGENEEFLKKGSSPVLL 507

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           IES GHA   FVN++L A   GN     F +   I L EG N + +LSM VGLQN G+++
Sbjct: 508 IESKGHAVHAFVNQELQASAAGNGTHFPFKLKAPISLKEGKNDIALLSMTVGLQNAGSFY 567

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           +  GAGL SV +    NG  DLS+  W Y++G+EGE+ GLDK     +  W   S  P  
Sbjct: 568 EWVGAGLTSVKIQGFNNGTIDLSAYNWTYKIGLEGEHQGLDKEEGFGNVNWISASEPPKE 627

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK     P G  P+ L++  MGKG AW+NG+ IGRYW     P  GC K+C+YRG
Sbjct: 628 QPLTWYKVIVDPPPGDDPVGLDMIHMGKGLAWLNGEEIGRYWPRK-GPLHGCVKECNYRG 686

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
            +D  KC   CG+P Q  YH+PR+W     N+LVI EE GGDPSKI    +    +C+ V
Sbjct: 687 KFDPDKCNTGCGEPTQRWYHVPRSWFKQSGNVLVIFEEKGGDPSKIEFSRRKITGVCALV 746

Query: 716 SEADPP-PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH- 773
           +E  P   ++SW    G   +   + L C    HI+++ FAS+G P G C S+  G CH 
Sbjct: 747 AENYPSIDLESWNDGSGSNKTVATIHLGCPEDTHISSVKFASFGNPTGACRSYTQGDCHD 806

Query: 774 MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            + + +V+K C+ +  C I ++      + G+C    K LAVE  C+
Sbjct: 807 PNSISVVEKVCLNKNRCDIELTGE--NFNKGSCLSEPKKLAVEVQCN 851


>gi|302759477|ref|XP_002963161.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
 gi|300168429|gb|EFJ35032.1| hypothetical protein SELMODRAFT_404798 [Selaginella moellendorffii]
          Length = 874

 Score =  842 bits (2175), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/862 (49%), Positives = 572/862 (66%), Gaps = 62/862 (7%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N++YDHRA++I G+RR+L SG +HYPR++P++WP LIR +KEGGL++I+TYVFW+ HE
Sbjct: 20  ATNISYDHRAIIIGGQRRILISGCLHYPRASPQMWPALIRNAKEGGLDMIDTYVFWDGHE 79

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G Y F+GR+DL+RF+K V +AGL+++LRIGPY CAEWN+GGFP WL  +PGIQFRT 
Sbjct: 80  PSPGIYNFQGRYDLIRFLKLVHQAGLYVNLRIGPYVCAEWNFGGFPAWLLKLPGIQFRTH 139

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  F+++M+ F+ KI+D++K E LFASQGGP++ +Q+ENEYGNV+ +YG  G+ Y+ WAA
Sbjct: 140 NRAFEDKMEEFVRKIVDMVKSEQLFASQGGPVLFSQIENEYGNVQGSYGTNGKTYMLWAA 199

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A +L T VPW+MC+Q DAPD IINTCNG+YCDG+ PNS  KP MWTEN+SGW+  +G 
Sbjct: 200 RMAKDLETGVPWIMCKQPDAPDYIINTCNGYYCDGWKPNSRDKPAMWTENWSGWYQLWGE 259

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYM------------------YFGGTNFGRTAGG 283
           A P+R VED+AFAVARFF+ GG  QNYYM                  YFGGTNFGRT+GG
Sbjct: 260 AAPYRTVEDVAFAVARFFQRGGVAQNYYMVRMLHDLEQHLLMPERCQYFGGTNFGRTSGG 319

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG---AKLEA 340
           P + TSYDYDAP+DE+G +RQPKWGHL+ELH A+KLCE  L S+DP +  LG     ++A
Sbjct: 320 PFITTSYDYDAPLDEFGMLRQPKWGHLKELHAALKLCETALTSNDPLYYTLGRMQEMVQA 379

Query: 341 HIYHKSSND---------CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
           H+Y   S +         CAAFLAN D+SS A+V F GNVY LP WSVSILPDC+NVVFN
Sbjct: 380 HVYSDGSLEANFSNLATPCAAFLANIDTSS-ASVKFGGNVYNLPPWSVSILPDCRNVVFN 438

Query: 392 TAKVISQRNNGDHPFAQQKNVNEL--------LLASSAFSWYEEKVGISGNRSFVRPDLA 443
           TA+V +Q +       Q+ ++ E         L+   A+ W++E VG SG    +   L 
Sbjct: 439 TAQVSAQTSVTKMVAVQKPSLIEEVSGSYTPGLVEQLAWEWFQEPVGGSGINKILAHALL 498

Query: 444 EQINTTKDTSDYLWYTASIHVMPGQ--GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDF 501
           EQI+TT D++DYLWY+    +   +  G +  L I S+     +FVN +           
Sbjct: 499 EQISTTNDSTDYLWYSTRFEISDQELKGGDPVLVITSMRDMVHIFVNGEFAGSTSTLKSG 558

Query: 502 ANFL-INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSS 559
             +  + + I L  G+N L ILS  VGLQNYGA  +  GAG+   + I  L  G R+L+S
Sbjct: 559 GLYARVQQPIHLKAGVNHLAILSATVGLQNYGAHLETHGAGITGSVWIQGLSTGTRNLTS 618

Query: 560 GEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLA 619
             W++QVG+ GE+   D I+      W   ++LP  + L+WYK  F  P+G  P+A++L 
Sbjct: 619 ALWLHQVGLNGEH---DAIT------WSSTTSLPFFQPLVWYKANFNIPDGDDPVAIHLG 669

Query: 620 SMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRT 679
           SMGKGQAWVNG S+GR+W A  APSTGC+ +CDYRG+Y +SKC   CG P+Q  YH+PR 
Sbjct: 670 SMGKGQAWVNGHSLGRFWPAITAPSTGCSDRCDYRGTYYSSKCLSGCGLPSQEWYHVPRE 729

Query: 680 WVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQV 739
           W+   +N LV+ EE+GG+ S +S  ++    +C+ VSE   PPV  +       SS P++
Sbjct: 730 WLVNEKNTLVLLEEIGGNVSGVSFASRVVDRVCAQVSEYSLPPVAQF-------SSLPEL 782

Query: 740 RLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAY 798
            L+C  G  I++I FAS+G P+G CG+F+ G+CH ++   IV+KAC+G+  CS  +    
Sbjct: 783 GLSCSPGQFISSIFFASFGNPKGRCGAFQKGSCHALESETIVEKACIGRQSCSFEIFWKN 842

Query: 799 LGVSAGACPGLLKALAVEAHCS 820
            G     CPG  K LAVEA C+
Sbjct: 843 FGTD--PCPGKAKTLAVEAACT 862


>gi|242084926|ref|XP_002442888.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
 gi|241943581|gb|EES16726.1| hypothetical protein SORBIDRAFT_08g004410 [Sorghum bicolor]
          Length = 923

 Score =  842 bits (2174), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/854 (50%), Positives = 560/854 (65%), Gaps = 47/854 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRAL++ GKRR+L S  +HYPR+TPE+WP LI K+KEGG++VIETY+FWN HEP 
Sbjct: 68  NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKAKEGGVDVIETYIFWNGHEPA 127

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQYYFEGRFD+VRF K V   GLFL LRIGPYACAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 128 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 187

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P+K EM+ F+ KI+D+MK+E L++ QGGPIIL Q+ENEYGN++  YG  G+ Y++WAA  
Sbjct: 188 PYKAEMQNFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGKYGQAGKRYMQWAAQM 247

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+T VPWVMC+Q DAP+ I++TCN FYCDGF PNS +KP +WTE++ GW+  +G A+
Sbjct: 248 ALALDTGVPWVMCRQTDAPEQILDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGEAL 307

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP +D AFAVARF++ GG+FQNYYMYFGGTNF RTAGGPL  TSYDYDAPIDEYG +R
Sbjct: 308 PHRPAQDSAFAVARFYQRGGSFQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 367

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-----------SSNDC 350
           QPKWGHL++LH AIKLCE  L + D  P + KLG   EAH+Y             ++  C
Sbjct: 368 QPKWGHLKDLHAAIKLCEPALTAVDGSPRYIKLGPMQEAHVYSSENVHTNGSISGNAQFC 427

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-----NGDHP 405
           +AFLAN D    A+V   G  Y LP WSVSILPDC+ V FNTA+V +Q +     +G   
Sbjct: 428 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVESGSPS 487

Query: 406 FAQQKNVNELLLASSAFS--WY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTAS 461
           ++ +     L L     S  W+  +E VGI     F    + E +N TKD SDYL YT  
Sbjct: 488 YSSRHKPRILSLGGPYLSSTWWASKEPVGIWSEDIFAAQGILEHLNVTKDISDYLSYTTR 547

Query: 462 IHVMP-------GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
           +++          +G    L I+ +     +FVN KL     G+       +N+ ++L +
Sbjct: 548 VNISDEDVLYWNSEGLLPSLTIDQIRDVVRIFVNGKLAGSQVGHW----VSLNQPLQLVQ 603

Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
           G+N L +LS +VGLQNYGA+ +  GAG    V L  L NG  DL++  W YQ+G++GE+ 
Sbjct: 604 GLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLKGEFS 663

Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
            +       S+ W             W+KTTF APEG GP+A++L SMGKGQAWVNG  I
Sbjct: 664 RIYSPEKQGSAGWSSMQNDDTLSPFTWFKTTFDAPEGNGPVAIDLGSMGKGQAWVNGHLI 723

Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
           GRYWS  +AP +GC   C+Y G+Y  SKC+ +CG   Q+ YHIPR W+   +NLLV+ EE
Sbjct: 724 GRYWS-LVAPESGCPSSCNYAGNYGDSKCRSNCGIATQSWYHIPREWLQESDNLLVLFEE 782

Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACERGW 747
            GGDPS+ISL     + ICS +SE   PP+ +W      +P++  V  +P++RL C+ G 
Sbjct: 783 TGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGRPSVNTV--APELRLQCDEGH 840

Query: 748 HIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGAC 806
            I+ I FASYG P G+C +F  G CH    L +V +AC G+  C+I V++   G     C
Sbjct: 841 VISKITFASYGTPTGDCQNFSVGNCHASTTLDLVAEACEGKNRCAISVTNDVFG---DPC 897

Query: 807 PGLLKALAVEAHCS 820
             ++K LAV A CS
Sbjct: 898 RKVVKDLAVVAECS 911


>gi|108706355|gb|ABF94150.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
          Length = 819

 Score =  842 bits (2174), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/785 (53%), Positives = 531/785 (67%), Gaps = 35/785 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A+++DG+RR+L SGSIHYPRSTPE+W  LI K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDKKAVLVDGQRRILFSGSIHYPRSTPEMWDGLIEKAKDGGLDVIQTYVFWNGHEPTP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DLVRF+KTVQ+AG+F+HLRIGPY C EWN+GGFPVWL ++PGI FRT N P
Sbjct: 87  GNYNFEGRYDLVRFIKTVQKAGMFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+ +MK ENLFASQGGPIIL+Q+ENEYG     +G  G+ Y+ WAA  A
Sbjct: 147 FKNAMQGFTEKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGKAYINWAAKMA 206

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V L+T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF  FG  + 
Sbjct: 207 VGLDTGVPWVMCKEDDAPDPVINACNGFYCDTFSPNKPYKPTMWTEAWSGWFTEFGGTIR 266

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDLAF VARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG  R+
Sbjct: 267 QRPVEDLAFGVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLARE 326

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL+ELH+A+KLCE+ L+S+DPT   LG+  EAH++ +SS+ CAAFLANY+S+S A 
Sbjct: 327 PKFGHLKELHRAVKLCEQPLVSADPTVTTLGSMQEAHVF-RSSSGCAAFLANYNSNSYAK 385

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V FN   Y LP WS+SILPDCKNVVFNTA V  Q N           +      +S+  W
Sbjct: 386 VIFNNENYSLPPWSISILPDCKNVVFNTATVGVQTN----------QMQMWADGASSMMW 435

Query: 425 --YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E+V  ++         L EQ+N T+DTSDYLWY  S+ V P +     G  + L +
Sbjct: 436 EKYDEEVDSLAAAPLLTSTGLLEQLNVTRDTSDYLWYITSVEVDPSEKFLQGGTPLSLTV 495

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VF+N +L    YG  +      +    L  G N + +LS+  GL N G  ++
Sbjct: 496 QSAGHALHVFINGQLQGSAYGTREDRKISYSGNANLRAGTNKVALLSVACGLPNVGVHYE 555

Query: 537 VAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   ++I  L  G RDL+   W YQVG++GE + L+ +  + S  W QGS +  N
Sbjct: 556 TWNTGVVGPVVIHGLDEGSRDLTWQTWSYQVGLKGEQMNLNSLEGSGSVEWMQGSLVAQN 615

Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           +  L WY+  F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   + G  K C Y 
Sbjct: 616 QQPLAWYRAYFDTPSGDEPLALDMGSMGKGQIWINGQSIGRYWTAY---AEGDCKGCHYT 672

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           GSY A KCQ  CGQP Q  YH+PR+W+ P  NLLV+ EELGGD SKI+L  +T   +C+ 
Sbjct: 673 GSYRAPKCQAGCGQPTQRWYHVPRSWLQPTRNLLVVFEELGGDSSKIALAKRTVSGVCAD 732

Query: 715 VSEADPPPVDSWK------PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
           VSE   P + +W+      P       + +V L C  G  I+AI FAS+G P G CG+F+
Sbjct: 733 VSEYH-PNIKNWQIESYGEPEF----HTAKVHLKCAPGQTISAIKFASFGTPLGTCGTFQ 787

Query: 769 PGACH 773
            G CH
Sbjct: 788 QGECH 792


>gi|356518796|ref|XP_003528063.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
          Length = 898

 Score =  841 bits (2172), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/830 (50%), Positives = 540/830 (65%), Gaps = 16/830 (1%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SANV+YD R+L+ID +R++L S SIHYPRS P +WP L++ +KEGG++VIETYVFWN HE
Sbjct: 74  SANVSYDGRSLIIDAQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGHE 133

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              G YYF GRFDLV+F +TVQ+AG++L LRIGP+  AEWN+GG PVWLH++PG  FRT 
Sbjct: 134 LSPGNYYFGGRFDLVKFAQTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTY 193

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PF   M++F   I++LMKQE LFASQGGPIILAQ+ENEYG  E  Y   G+ Y  WAA
Sbjct: 194 NQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILAQIENEYGYYENFYKEDGKKYALWAA 253

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+ NT VPW+MCQQ DAPDP+I+TCN FYCD FTP SP++P +WTEN+ GWF +FG 
Sbjct: 254 KMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGWFKTFGG 313

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P RP ED+AF+VARFF+ GG+  NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG 
Sbjct: 314 RDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYGL 373

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R PKWGHL+ELH+AIKLCE  L++    +  LG  +EA +Y  SS  CAAF++N D  +
Sbjct: 374 PRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFISNVDDKN 433

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  V F    + LPAWSVSILPDCKNVVFNTAKV SQ +         +  ++++   ++
Sbjct: 434 DKTVEFRNASFHLPAWSVSILPDCKNVVFNTAKVTSQTSVVAMVPESLQQSDKVV---NS 490

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           F W   +EK GI G   FV+    + INTTKDT+DYLW+T SI V   +     G +  L
Sbjct: 491 FKWDIVKEKPGIWGKADFVKNGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKGNKPVL 550

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            IES GHA   FVN++    G GN   A F     I L  G N + +L + VGLQ  G +
Sbjct: 551 LIESTGHALHAFVNQEYEGTGSGNGTHAPFTFKNPISLRAGKNEIALLCLTVGLQTAGPF 610

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +D  GAGL SV +  L NG  DLSS  W Y++GV+GEY+ L + +  N+  W   S  P 
Sbjct: 611 YDFVGAGLTSVKIKGLNNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNNVNWTSTSEPPK 670

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-PSTGCTKKCDY 653
            + L WYK    AP G  P+ L++  MGKG AW+NG+ IGRYW       S  C K+CDY
Sbjct: 671 MQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSEDCVKECDY 730

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
           RG ++  KC   CG+P Q  YH+PR+W  P  N+LV+ EE GGDP KI  + +     C+
Sbjct: 731 RGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSGACA 790

Query: 714 FVSEADPPP--VDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
            V+E  P    V   +  +    + P  RLAC     I+A+ FAS+G P G CGS+  G 
Sbjct: 791 LVAEDYPSVALVSQGEDKIQSNKNIPFARLACPGNTRISAVKFASFGSPSGTCGSYLKGD 850

Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH  +   IV+KAC+ + +C I ++       +  CPGL + LAVEA CS
Sbjct: 851 CHDPNSSTIVEKACLNKNDCVIKLTEE--NFKSNLCPGLSRKLAVEAVCS 898


>gi|242053381|ref|XP_002455836.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
 gi|241927811|gb|EES00956.1| hypothetical protein SORBIDRAFT_03g025990 [Sorghum bicolor]
          Length = 785

 Score =  840 bits (2171), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/807 (52%), Positives = 539/807 (66%), Gaps = 34/807 (4%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SGS+HYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP RGQYYFEGR+DLV F+K V
Sbjct: 2   SGSVHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRGQYYFEGRYDLVHFIKLV 61

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
           ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PFK EM++F  KI+D+MK 
Sbjct: 62  KQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKAEMQKFTTKIVDMMKS 121

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAP 202
           E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ AV LNTSVPWVMC+++DAP
Sbjct: 122 EGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALNTSVPWVMCKEDDAP 181

Query: 203 DPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETG 262
           DPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP RPVEDLA+ VA+F + G
Sbjct: 182 DPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPHRPVEDLAYGVAKFIQKG 241

Query: 263 GTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           G+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+PKWGHL+ELHKAIKLCE 
Sbjct: 242 GSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPKWGHLKELHKAIKLCEP 301

Query: 323 YLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSIL 382
            L++ DP    LG   +A ++  S++ C AFL N D  S A V+FNG  Y LP WS+SIL
Sbjct: 302 ALVAGDPIVTSLGNAQQASVFRSSTDACVAFLENKDKVSYARVSFNGMHYNLPPWSISIL 361

Query: 383 PDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRP 440
           PDCK  V+NTA+V SQ        +Q K     +  +  F+W  Y E +   G+ SFV  
Sbjct: 362 PDCKTTVYNTARVGSQ-------ISQMK-----MEWAGGFTWQSYNEDINSLGDESFVTV 409

Query: 441 DLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFG 495
            L EQIN T+D +DYLWYT  + V   +     GK   L + S GHA  +FVN +L    
Sbjct: 410 GLLEQINVTRDNTDYLWYTTYVDVAQDEQFLSNGKNPVLTVMSAGHALHIFVNGQLTGTV 469

Query: 496 YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGK 554
           YG+ D         ++L  G NT+  LS+ VGL N G  F+   AG+   + +D L  G+
Sbjct: 470 YGSVDDPKLTYRGNVKLWPGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGR 529

Query: 555 RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGP 613
           RDL+  +W Y+VG++GE + L  +S ++S  W +    P+ K  L WYK  F AP+G  P
Sbjct: 530 RDLTWQKWTYKVGLKGEDLSLHSLSGSSSVEWGE----PMQKQPLTWYKAFFNAPDGDEP 585

Query: 614 LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTL 673
           LAL+++SMGKGQ W+NGQ IGRYW  Y A  +G    CDYRG YD  KCQ +CG  +Q  
Sbjct: 586 LALDMSSMGKGQIWINGQGIGRYWPGYKA--SGTCGICDYRGEYDEKKCQTNCGDSSQRW 643

Query: 674 YHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVV 733
           YH+PR+W++P  NLLVI EE GGDP+ IS++ +T   IC+ VSE   P + +W+      
Sbjct: 644 YHVPRSWLNPTGNLLVIFEEWGGDPTGISMVKRTTGSICADVSEWQ-PSMTNWRTK---D 699

Query: 734 SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSI 792
               ++ L C+ G  +  I FAS+G P+G+CGS+  G CH      I  K C+GQ  C +
Sbjct: 700 YEKAKIHLQCDHGRKMTDIKFASFGTPQGSCGSYSEGGCHAHKSYDIFWKNCIGQERCGV 759

Query: 793 PVSSAYLGVSAGACPGLLKALAVEAHC 819
            V     G     CPG +K   VEA C
Sbjct: 760 SVVPNVFG--GDPCPGTMKRAVVEAIC 784


>gi|334184642|ref|NP_001189660.1| beta galactosidase 9 [Arabidopsis thaliana]
 gi|330253651|gb|AEC08745.1| beta galactosidase 9 [Arabidopsis thaliana]
          Length = 859

 Score =  837 bits (2162), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 421/803 (52%), Positives = 540/803 (67%), Gaps = 41/803 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+I GKRR+L S  IHYPR+TPE+W +LI KSKEGG +V++TYVFWN HEP+
Sbjct: 37  NVSYDHRALIIAGKRRMLVSAGIHYPRATPEMWSDLIAKSKEGGADVVQTYVFWNGHEPV 96

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEGR+DLV+FVK +  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 97  KGQYNFEGRYDLVKFVKLIGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNE 156

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+EM++F+ KI+DLM++  LF  QGGPII+ Q+ENEYG+VE +YG  G+ YVKWAA  
Sbjct: 157 PFKKEMQKFVTKIVDLMREAKLFCWQGGPIIMLQIENEYGDVEKSYGQKGKDYVKWAASM 216

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L   VPWVMC+Q DAP+ II+ CNG+YCDGF PNS +KP++WTE++ GW+  +G ++
Sbjct: 217 ALGLGAGVPWVMCKQTDAPENIIDACNGYYCDGFKPNSRTKPVLWTEDWDGWYTKWGGSL 276

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP EDLAFAVARF++ GG+FQNYYMYFGGTNFGRT+GGP   TSYDYDAP+DEYG   
Sbjct: 277 PHRPAEDLAFAVARFYQRGGSFQNYYMYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRS 336

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND----CAAFLANYD 358
           +PKWGHL++LH AIKLCE  L+++D P ++KLG+K EAHIYH         CAAFLAN D
Sbjct: 337 EPKWGHLKDLHAAIKLCEPALVAADAPQYRKLGSKQEAHIYHGDGETGGKVCAAFLANID 396

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---------Q 409
               A+V FNG  Y LP WSVSILPDC++V FNTAKV +Q +      A+         Q
Sbjct: 397 EHKSAHVKFNGQSYTLPPWSVSILPDCRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQ 456

Query: 410 KNVNELLLASSAFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
           K V +  ++  + SW   +E +GI G  +F    L E +N TKD SDYLW+   I V   
Sbjct: 457 KVVRQDNVSYISKSWMALKEPIGIWGENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSED 516

Query: 468 Q-------GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
                   G    ++I+S+     VFVNK+L     G+   A     + +   +G N L 
Sbjct: 517 DISFWKKNGPNSTVSIDSMRDVLRVFVNKQLAGSIVGHWVKA----VQPVRFIQGNNDLL 572

Query: 521 ILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
           +L+  VGLQNYGA+ +  GAG      L   KNG  DLS   W YQVG++GE    DKI 
Sbjct: 573 LLTQTVGLQNYGAFLEKDGAGFRGKAKLTGFKNGDLDLSKSSWTYQVGLKGE---ADKIY 629

Query: 580 LANSSFWKQGSTLPVNKS---LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
               +   + STL  + S    +WYKT F  P G  P+ LNL SMG+GQAWVNGQ IGRY
Sbjct: 630 TVEHNEKAEWSTLETDASPSIFMWYKTYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRY 689

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W+  ++   GC + CDYRG+Y++ KC  +CG+P QT YH+PR+W+ P  NLLV+ EE GG
Sbjct: 690 WNI-ISQKDGCDRTCDYRGAYNSDKCTTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGG 748

Query: 697 DPSKISLLTKTGQHICSFVSEADPPPVDSWKP------NLGVVSSSPQVRLACERGWHIA 750
           +P KIS+ T T   +C  VSE+  PP+  W         + + S +P+V L CE G  I+
Sbjct: 749 NPFKISVKTVTAGILCGQVSESHYPPLRKWSTPDYINGTMSINSVAPEVHLHCEDGHVIS 808

Query: 751 AINFASYGIPEGNCGSFRPGACH 773
           +I FASYG P G+C  F  G CH
Sbjct: 809 SIEFASYGTPRGSCDGFSIGKCH 831


>gi|414864994|tpg|DAA43551.1| TPA: beta-galactosidase [Zea mays]
          Length = 897

 Score =  837 bits (2162), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/880 (49%), Positives = 551/880 (62%), Gaps = 80/880 (9%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEV------------------------------ 35
           TYD +A++IDG+RR+L SGSIHYPRSTP+V                              
Sbjct: 30  TYDKKAVLIDGQRRILFSGSIHYPRSTPDVISCILQNLSFFFSPLLPRGGGEFMAVVSCV 89

Query: 36  ----------------------WPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
                                 W  LI+K+K+GGL+VI+TYVFWN HEP  G YYFE R+
Sbjct: 90  LDAMLSKANCFPTLAVPLYSTMWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERY 149

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DLVRFVKTVQ+AGLF+HLRIGPY C EWN+GGFPVWL ++PGI FRT N PFK  M+ F 
Sbjct: 150 DLVRFVKTVQKAGLFVHLRIGPYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFT 209

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            KI+ +MK ENLFASQGGPIIL+Q+ENEYG     +G  G+ Y+ WAA  AV L+T VPW
Sbjct: 210 EKIVGMMKSENLFASQGGPIILSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPW 269

Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAF 253
           VMC++EDAPDP+IN CNGFYCD F+PN P KP MWTE +SGWF  FG  +  RPVEDLAF
Sbjct: 270 VMCKEEDAPDPVINACNGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAF 329

Query: 254 AVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
           AVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR+PK  HL+EL
Sbjct: 330 AVARFVQKGGSFINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKEL 389

Query: 314 HKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYF 373
           H+A+KLCE+ L+S DPT   LG   EAH++ +S + CAAFLANY+S+S A V FN   Y 
Sbjct: 390 HRAVKLCEQALVSVDPTITTLGTMQEAHVF-RSPSGCAAFLANYNSNSHAKVVFNNEQYS 448

Query: 374 LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKV-GIS 432
           LP WS+SILPDCKNVVFN+A V  Q +        Q  +      S  +  Y+E+V  ++
Sbjct: 449 LPPWSISILPDCKNVVFNSATVGVQTS--------QMQMWGDGATSMMWERYDEEVDSLA 500

Query: 433 GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP------GQGKEVFLNIESLGHAALVF 486
                    L EQ+N T+D+SDYLWY  S+ + P      G GK   L+++S GHA  VF
Sbjct: 501 AAPLLTTTGLLEQLNVTRDSSDYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVF 560

Query: 487 VNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-V 545
           VN +L    YG  +      N  + L  G N + +LS+  GL N G  ++    G+   V
Sbjct: 561 VNGQLQGSSYGTREDRRIKYNGNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPV 620

Query: 546 ILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTT 604
           +L  L  G RDL+   W YQVG++GE + L+ +  + S  W QGS +   +  L WYK  
Sbjct: 621 VLHGLNEGSRDLTWQTWSYQVGLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAY 680

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F  P G  PLAL++ SMGKGQ W+NGQSIGRYW+AY   + G  K C Y G++ A KCQ 
Sbjct: 681 FETPSGDEPLALDMGSMGKGQVWINGQSIGRYWTAY---ADGDCKGCSYTGTFRAPKCQA 737

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEEL-GGDPSKISLLTKTGQHICSFVSEADPPPV 723
            CGQP Q  YH+PR+W+ P  NLLV+ EEL GGD SKI+L  ++   +C+ VSE D P +
Sbjct: 738 GCGQPTQRWYHVPRSWLQPSRNLLVVLEELGGGDSSKIALAKRSVSSVCADVSE-DHPNI 796

Query: 724 DSWK-PNLGVVS-SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIV 780
             W+  + G       +V L C  G  I+AI FAS+G P G CG+F+ G CH      ++
Sbjct: 797 KKWQIESYGEREHRRAKVHLRCAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVL 856

Query: 781 QKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +K C+G   C + +S    G     CP + K +AVEA CS
Sbjct: 857 EKRCIGLQRCVVAISPDNFG--GDPCPSVTKRVAVEAVCS 894


>gi|33521214|gb|AAQ21369.1| beta-galactosidase [Sandersonia aurantiaca]
          Length = 826

 Score =  837 bits (2162), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 416/825 (50%), Positives = 554/825 (67%), Gaps = 33/825 (4%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV YD RA+ I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP 
Sbjct: 25  NVWYDSRAITINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPS 84

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G+YYFEG +DLVRF+K VQ+ GL+LHLRIGPY CAEWN+GGFPVWL ++PGI FRT N 
Sbjct: 85  PGKYYFEGNYDLVRFIKLVQQGGLYLHLRIGPYVCAEWNFGGFPVWLKYVPGIHFRTDNE 144

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK EM++F + I+++MK E LF  QGGPIIL+Q+ENE+G +E+  G   + Y  WAA  
Sbjct: 145 PFKAEMEKFTSHIVNMMKAEKLFHWQGGPIILSQIENEFGPLEYDQGAPAKAYAAWAAKM 204

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+L T VPWVMC+++DAPDP+INT NGFY DGF PN   KP+MWTEN++GWF  +G  V
Sbjct: 205 AVDLETGVPWVMCKEDDAPDPVINTWNGFYADGFYPNKRYKPMMWTENWTGWFTGYGVPV 264

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVEDLAF+VA+F + GG++ NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +R
Sbjct: 265 PHRPVEDLAFSVAKFVQKGGSYVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGMLR 324

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPK+GHL +LHKAIKLCE  L+S  P    LG   E++++  +S  CAAFLANYD+   A
Sbjct: 325 QPKYGHLTDLHKAIKLCEPALVSGYPVVTSLGNNQESNVFRSNSGACAAFLANYDTKYYA 384

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            VTFNG  Y LP WS+SILPDCK  VFNTA+V +Q                 +     FS
Sbjct: 385 TVTFNGMRYNLPPWSISILPDCKTTVFNTARVGAQ------------TTQMQMTTVGGFS 432

Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
           W  Y E      + SF +  L EQI+ T+D++DYLWYT  +++   +     G+   L  
Sbjct: 433 WVSYNEDPNSIDDGSFTKLGLVEQISMTRDSTDYLWYTTYVNIDQNEQFLKNGQYPVLTA 492

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GH+  VF+N +L+   YG+ +         ++L  G N +  LS+ VGL N G  F+
Sbjct: 493 QSAGHSLHVFINGQLIGTAYGSVEDPRLTYTGNVKLFAGSNKISFLSIAVGLPNVGEHFE 552

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               GL   V L  L  GKRDL+  +W Y++G++GE + L  +S +++  W   S     
Sbjct: 553 TWNTGLLGPVTLNGLNEGKRDLTWQKWTYKIGLKGEALSLHTLSGSSNVEWGDASR---K 609

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK  F AP G  PLAL++++MGKGQ W+NGQSIGRYW AY A   G   KCDY G
Sbjct: 610 QPLAWYKGFFNAPGGSEPLALDMSTMGKGQVWINGQSIGRYWPAYKA--RGSCPKCDYEG 667

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           +Y+ +KCQ +CG  +Q  YH+PR+W++P  NL+V+ EE GG+P+ ISL+ ++ +  C++V
Sbjct: 668 TYEETKCQSNCGDSSQRWYHVPRSWLNPTGNLIVVFEEWGGEPTGISLVKRSMRSACAYV 727

Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
           S+   P +++W        +  +V L+C+ G  +  I FASYG P+G C S+  G CH  
Sbjct: 728 SQGQ-PSMNNWHTKY----AESKVHLSCDPGLKMTQIKFASYGTPQGACESYSEGRCHAH 782

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
               I QK C+GQ  CS+ V     G     CPG++K++AV+A C
Sbjct: 783 KSYDIFQKNCIGQQVCSVTVVPEVFG--GDPCPGIMKSVAVQASC 825


>gi|356508931|ref|XP_003523206.1| PREDICTED: beta-galactosidase 10-like [Glycine max]
          Length = 843

 Score =  837 bits (2161), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 431/838 (51%), Positives = 539/838 (64%), Gaps = 30/838 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           LS NV+YD R+L+IDG+R++L S SIHYPRS P +WP L++ +KEGG++VIETYVFWN H
Sbjct: 18  LSGNVSYDGRSLLIDGQRKLLISASIHYPRSVPAMWPGLVQTAKEGGVDVIETYVFWNGH 77

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           E   G YYF GRFDLV+F KTVQ+AG++L LRIGP+  AEWN+GG PVWLH++PG  FRT
Sbjct: 78  ELSPGNYYFGGRFDLVKFAKTVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRT 137

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PF   M++F   I++LMKQE LFASQGGPIIL+Q+ENEYG  E  Y   G+ Y  WA
Sbjct: 138 YNQPFMYHMQKFTTYIVNLMKQEKLFASQGGPIILSQIENEYGYYENFYKEDGKKYALWA 197

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV+ NT VPW+MCQQ DAPDP+I+TCN FYCD FTP SP++P +WTEN+ GWF +FG
Sbjct: 198 AKMAVSQNTGVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPNRPKIWTENWPGWFKTFG 257

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P RP ED+AF+VARFF+ GG+  NYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 258 GRDPHRPAEDVAFSVARFFQKGGSVHNYYMYHGGTNFGRTAGGPFITTSYDYDAPVDEYG 317

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKWGHL+ELH+AIKLCE  L++    +  LG  +EA +Y  SS  CAAF++N D  
Sbjct: 318 LPRLPKWGHLKELHRAIKLCEHVLLNGKSVNISLGPSVEADVYTDSSGACAAFISNVDDK 377

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-NGDHPFAQQ---KNVNELL 416
           +D  V F    Y LPAWSVSILPDCKNVVFNTAKV SQ N     P + Q   K VN L 
Sbjct: 378 NDKTVEFRNASYHLPAWSVSILPDCKNVVFNTAKVTSQTNVVAMIPESLQQSDKGVNSL- 436

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
                +   +EK GI G   FV+    + INTTKDT+DYLW+T SI V   +     G +
Sbjct: 437 ----KWDIVKEKPGIWGKADFVKSGFVDLINTTKDTTDYLWHTTSIFVSENEEFLKKGSK 492

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L IES GHA   FVN++    G GN   + F     I L  G N + +L + VGLQ  
Sbjct: 493 PVLLIESTGHALHAFVNQEYQGTGTGNGTHSPFSFKNPISLRAGKNEIALLCLTVGLQTA 552

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           G ++D  GAGL SV +  LKNG  DLSS  W Y++GV+GEY+ L + +  N   W   S 
Sbjct: 553 GPFYDFIGAGLTSVKIKGLKNGTIDLSSYAWTYKIGVQGEYLRLYQGNGLNKVNWTSTSE 612

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-PSTGCTKK 650
               + L WYK    AP G  P+ L++  MGKG AW+NG+ IGRYW       S  C K+
Sbjct: 613 PQKMQPLTWYKAIVDAPPGDEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSEFKSEDCVKE 672

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDYRG ++  KC   CG+P Q  YH+PR+W  P  N+LV+ EE GGDP KI  + +    
Sbjct: 673 CDYRGKFNPDKCDTGCGEPTQRWYHVPRSWFKPSGNILVLFEEKGGDPEKIKFVRRKVSG 732

Query: 711 ICSFVSEADPPPV-------DSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
            C+ V+E D P V       D  + N  V    P   L C     I+A+ FAS+G P G+
Sbjct: 733 ACALVAE-DYPSVGLLSQGEDKIQNNKNV----PFAHLTCPSNTRISAVKFASFGTPSGS 787

Query: 764 CGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CGS+  G CH  +   IV+KAC+ + +C I ++          CPGL + LAVEA CS
Sbjct: 788 CGSYLKGDCHDPNSSTIVEKACLNKNDCVIKLTEE--NFKTNLCPGLSRKLAVEAVCS 843


>gi|218188525|gb|EEC70952.1| hypothetical protein OsI_02561 [Oryza sativa Indica Group]
          Length = 822

 Score =  835 bits (2158), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 418/825 (50%), Positives = 544/825 (65%), Gaps = 36/825 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +TYD +A+V++G+RR+L SGSIHYPRSTPE+WP+LI K+K+GGL+V++TYVFWN HEP  
Sbjct: 23  LTYDRKAVVVNGQRRILISGSIHYPRSTPEMWPDLIEKAKDGGLDVVQTYVFWNGHEPSP 82

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQYYFEGR+DLV F+K V++AGL+++LRIGPY CAEWN+GGFPVWL ++PGI FRT N P
Sbjct: 83  GQYYFEGRYDLVHFIKLVKQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 142

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK EM++F  KI+++MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ A
Sbjct: 143 FKAEMQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMA 202

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V LNT VPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP
Sbjct: 203 VALNTGVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVP 262

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RPVEDLA+ VA+F + GG+F NYYM+ GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+
Sbjct: 263 HRPVEDLAYGVAKFIQKGGSFVNYYMFHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLRE 322

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PKWGHL++LHKAIKLCE  L++ DP    LG   ++ ++  S+  CAAFL N D  S A 
Sbjct: 323 PKWGHLKQLHKAIKLCEPALVAGDPIVTSLGNAQKSSVFRSSTGACAAFLDNKDKVSYAR 382

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V FNG  Y LP WS+SILPDCK  VFNTA+V SQ        +Q K     +  +  F+W
Sbjct: 383 VAFNGMHYDLPPWSISILPDCKTTVFNTARVGSQ-------ISQMK-----MEWAGGFAW 430

Query: 425 --YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHA 482
             Y E++   G   F    L EQIN T+D +DYLWYT  + V   Q  +   N E+    
Sbjct: 431 QSYNEEINSFGEDPFTTVGLLEQINVTRDNTDYLWYTTYVDV--AQDDQFLSNGENPKLT 488

Query: 483 ALVFVNKKLVAFG-----YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
            + F+   ++        YG+ D         ++L  G NT+  LS+ VGL N G  F+ 
Sbjct: 489 VMCFLILNILFNLLAGTVYGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFET 548

Query: 538 AGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             AG+   + +D L  G+RDL+  +W YQVG++GE + L  +S +++  W +    PV K
Sbjct: 549 WNAGILGPVTLDGLNEGRRDLTWQKWTYQVGLKGESMSLHSLSGSSTVEWGE----PVQK 604

Query: 597 S-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
             L WYK  F AP+G  PLAL+++SMGKGQ W+NGQ IGRYW  Y A  +G    CDYRG
Sbjct: 605 QPLTWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGIGRYWPGYKA--SGNCGTCDYRG 662

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
            YD +KCQ +CG  +Q  YH+PR+W+ P  NLLVI EE GGDP+ IS++ ++   +C+ V
Sbjct: 663 EYDETKCQTNCGDSSQRWYHVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADV 722

Query: 716 SEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
           SE   P + +W           +V L C+ G  I  I FAS+G P+G+CGS+  G CH  
Sbjct: 723 SEWQ-PSMKNWHTK---DYEKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYSEGGCHAH 778

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
               I  K CVGQ  C + V     G     CPG +K   VEA C
Sbjct: 779 KSYDIFWKNCVGQERCGVSVVPEIFG--GDPCPGTMKRAVVEAIC 821


>gi|414878434|tpg|DAA55565.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
          Length = 918

 Score =  835 bits (2158), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/855 (50%), Positives = 556/855 (65%), Gaps = 48/855 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRAL++ GKRR+L S  +HYPR+TPE+WP LI K KEGG++ IETYVFWN HEP 
Sbjct: 62  NVTYDHRALILGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGVDAIETYVFWNGHEPA 121

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQYYFEGRFD+VRF K V   GLFL LRIGPYACAEWN+GGFPVWL  +PGI+FRT N 
Sbjct: 122 KGQYYFEGRFDIVRFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDVPGIEFRTDNE 181

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P+K EM+ F+ KI+D+MK+E L++ QGGPIIL Q+ENEYGN++  YG  G+ Y+ WAA  
Sbjct: 182 PYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGNIQGHYGQAGKRYMLWAAQM 241

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+T VPWVMC+Q DAP+ I+NTCN FYCDGF PNS +KP +WTE++ GW+  +G ++
Sbjct: 242 ALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGESL 301

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP +D AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL  TSYDYDAPIDEYG +R
Sbjct: 302 PHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGPLQITSYDYDAPIDEYGILR 361

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-----------SSNDC 350
           QPKWGHL++LH AIKLCE  L + D  P + KLG   EAH+Y             +S  C
Sbjct: 362 QPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHVYSSENVHTNGSISGNSQFC 421

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-----NGDHP 405
           +AFLAN D    A+V   G  Y LP WSVSILPDC+ V FNTA+V +Q +     +G   
Sbjct: 422 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFNTARVGTQTSFFNVESGSPS 481

Query: 406 FAQQKNVNELLLASSAF---SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
           ++ +     L L    +   +W  ++E VGI G   F    + E +N TKD SDYL YT 
Sbjct: 482 YSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQGILEHLNVTKDISDYLSYTT 541

Query: 461 SIHVMP-------GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN 513
            +++          +G    L I+ +   A VFVN KL     G+       +N+ ++L 
Sbjct: 542 RVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGSKVGHW----VSLNQPLQLV 597

Query: 514 EGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEY 572
           +G+N L +LS +VGLQNYGA+ +  GAG    V L  L NG  DL++  W YQ+G++GE+
Sbjct: 598 QGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNGDIDLTNSLWTYQIGLKGEF 657

Query: 573 IGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQS 632
             +       S+ W             W+KT F APEG GP+ ++L SMGKGQAWVNG  
Sbjct: 658 SRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGPVTIDLGSMGKGQAWVNGHL 717

Query: 633 IGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
           IGRYWS  +AP +GC   C+Y G+Y  SKC+ +CG   Q+ YHIPR W+    NLLV+ E
Sbjct: 718 IGRYWS-LVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSWYHIPREWLQESGNLLVLFE 776

Query: 693 ELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------KPNLGVVSSSPQVRLACERG 746
           E GGDPS+ISL     + ICS +SE   PP+ +W      +P++  V  +P++RL C+ G
Sbjct: 777 ETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGRPSVNTV--APELRLQCDDG 834

Query: 747 WHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGA 805
             I+ I FASYG P G C +F  G CH    L +V +AC G+  C+I V++   G     
Sbjct: 835 HVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVEACEGKNRCAISVTNEVFG---DP 891

Query: 806 CPGLLKALAVEAHCS 820
           C  ++K LAVEA CS
Sbjct: 892 CRKVVKDLAVEAECS 906


>gi|449435860|ref|XP_004135712.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
          Length = 723

 Score =  827 bits (2137), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 406/714 (56%), Positives = 500/714 (70%), Gaps = 23/714 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+VTYDH+ALVIDGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN HEP
Sbjct: 24  ASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQYYFE R++LVRFVK VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 84  SPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F AKI+ +MK E L+ SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA 
Sbjct: 144 GPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN   KP MWTE ++GWF  FG  
Sbjct: 204 MALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKPKMWTEAWTGWFTEFGGP 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLA+AVARF +  G+  NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLI 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHLR+LHKAIKLCE  L+S DPT   LG+K EAH+Y+  S +CAAFLANYD S+ 
Sbjct: 324 RQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTRSGECAAFLANYDPSTS 383

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             VTF  + Y LP WSVSILPDCK VVFNTAKV       + P    K     +   S+F
Sbjct: 384 VRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKV-------NAPSYWPK-----MTPISSF 431

Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW+   EE      + +     L EQI+ T+D +DYLWY   I +   +     G+   L
Sbjct: 432 SWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIRIDSNEGFLKSGQWPLL 491

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VF+N +L    YG  D      +K + L  G+N L +LS+ VGL N G  
Sbjct: 492 TIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVH 551

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+   AG+   V L  L  G RD+S  +W Y+VG++GE + L  +S ++S  W  GS + 
Sbjct: 552 FETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMTGSLVS 611

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYKTTF AP G  PLAL++ SMGKGQ W+NG+SIGR+W AY A   G   KC Y
Sbjct: 612 QKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHWPAYTA--RGSCGKCYY 669

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G +   KC   CG+P+Q  YH+PR W+ P  N+LVI EE GG+P  ISL+ ++
Sbjct: 670 GGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISLVKRS 723


>gi|224096113|ref|XP_002310540.1| predicted protein [Populus trichocarpa]
 gi|222853443|gb|EEE90990.1| predicted protein [Populus trichocarpa]
          Length = 827

 Score =  826 bits (2134), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 419/833 (50%), Positives = 531/833 (63%), Gaps = 35/833 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            + NV+YD R+L+I+G+R++L S +IHYPRS P +WPEL++ +KEGG++VIETYVFWN H
Sbjct: 17  FAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVH 76

Query: 61  EPIR-GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFR 119
           +P    +Y+F+GRFDLV+F+  VQEAG++L LRIGP+  AEWN+GG PVWLH++ G  FR
Sbjct: 77  QPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFR 136

Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ--VENEYGNVEWAYGVGGELYV 177
           T N  FK  M+ F   I+ LMK+E LFASQGGPIIL+Q  VENEYG  E AYG GG+ Y 
Sbjct: 137 TDNYNFKYYMEEFTTYIVKLMKKEKLFASQGGPIILSQAKVENEYGYYEGAYGEGGKRYA 196

Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
            WAA  AV+ NT VPW+MCQQ DAP  +INTCN FYCD F P  P KP +WTEN+ GWF 
Sbjct: 197 AWAAQMAVSQNTGVPWIMCQQFDAPPSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQ 256

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
           +FG   P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRTAGGP + TSYDY+APID
Sbjct: 257 TFGAPNPHRPAEDVAFSVARFFQKGGSVQNYYMYHGGTNFGRTAGGPFITTSYDYEAPID 316

Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           EYG  R PKWGHL+ELHKAIKLCE  L++S P +  LG   EA +Y  +S  C AFLAN 
Sbjct: 317 EYGLPRLPKWGHLKELHKAIKLCEHVLLNSKPVNLSLGPSQEADVYADASGGCVAFLANI 376

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
           D  +D  V F    Y LPAWSVSILPDCKNVV+NTAK              QK+      
Sbjct: 377 DDKNDKTVDFQNVSYKLPAWSVSILPDCKNVVYNTAK--------------QKD------ 416

Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGK 470
            S A  W  + EK GI G   F++    + INTTKDT+DYLWYT SI V        +G+
Sbjct: 417 GSKALKWEVFVEKAGIWGEPDFMKNGFVDHINTTKDTTDYLWYTTSIVVGENEEFLKEGR 476

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
              L IES+GHA   FVN++L     GN   + F     I L  G N + +LSM VGL N
Sbjct: 477 HPVLLIESMGHALHAFVNQELQGSASGNGSHSPFKFKNPISLKAGNNEIALLSMTVGLPN 536

Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
            G++++  GAGL SV +    NG  DLS   WIY++G++GE +G+ K    NS  W   S
Sbjct: 537 AGSFYEWVGAGLTSVRIEGFNNGTVDLSHFNWIYKIGLQGEKLGIYKPEGVNSVSWVATS 596

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
             P  + L WYK     P G  P+ L++  MGKG AW+NG+ IGRYW    +    C  +
Sbjct: 597 EPPKKQPLTWYKVVLDPPAGNEPVGLDMLHMGKGLAWLNGEEIGRYWPRKSSVHEKCVTE 656

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDYRG +   KC   CGQP Q  YH+PR+W  P  NLLVI EE GGDP KI+   +    
Sbjct: 657 CDYRGKFMPDKCFTGCGQPTQRWYHVPRSWFKPSGNLLVIFEEKGGDPEKITFSRRKMSS 716

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQ--VRLACERGWHIAAINFASYGIPEGNCGSFR 768
           IC+ ++E  P          G  +S+ +  V L C +   I+A+ FAS+G P G CGS+ 
Sbjct: 717 ICALIAEDYPSADRKSLQEAGSKNSNSKASVHLGCPQNAVISAVKFASFGTPTGKCGSYS 776

Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G CH  + + +V+KAC+ + EC+I ++      + G CP   + LAVEA CS
Sbjct: 777 EGECHDPNSISVVEKACLNKTECTIELTEE--NFNKGLCPDFTRRLAVEAVCS 827


>gi|224128630|ref|XP_002329051.1| predicted protein [Populus trichocarpa]
 gi|222839722|gb|EEE78045.1| predicted protein [Populus trichocarpa]
          Length = 830

 Score =  825 bits (2132), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 417/832 (50%), Positives = 550/832 (66%), Gaps = 34/832 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+V+YD +A+ I+G+RR+L SGSIHYPRS+PE+WP+LI+K+KEGGL+VI+TYVFWN H
Sbjct: 21  VTASVSYDSKAITINGQRRILISGSIHYPRSSPEMWPDLIQKAKEGGLDVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFEG +DLV+FVK V+EAGL+++LRIGPY CAEWN+G            QF+ 
Sbjct: 81  EPSPGKYYFEGNYDLVKFVKLVKEAGLYVNLRIGPYICAEWNFGH-----------QFQN 129

Query: 121 TNNPFKEE---MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYV 177
              PF+ E   M++F  KI+++MK E LF SQGGPIIL+Q+ENEYG +E+  G  G+ Y 
Sbjct: 130 GQWPFQGEAAQMRKFTTKIVNMMKAERLFESQGGPIILSQIENEYGPMEYELGSPGQAYT 189

Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
           KWAA  AV L T VPWVMC+Q+DAPDPIINTCNGFYCD F+PN   KP MWTE ++GWF 
Sbjct: 190 KWAAQMAVGLRTGVPWVMCKQDDAPDPIINTCNGFYCDYFSPNKAYKPKMWTEAWTGWFT 249

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
            FG  VP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+D
Sbjct: 250 QFGGPVPHRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLD 309

Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           EYG +RQPKWGHL++LH+AIKLCE  L+S D T   LG   EAH+++  +  CAAFLANY
Sbjct: 310 EYGLLRQPKWGHLKDLHRAIKLCEPALVSGDATVIPLGNYQEAHVFNYKAGGCAAFLANY 369

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
              S A V+F    Y LP WS+SILPDCKN V+NTA+V +Q        A  K     + 
Sbjct: 370 HQRSFAKVSFRNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ATIKMTPVPMH 422

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
              ++  Y E+   SG+ +F    L EQINTT+D SDYLWY   +H+ P +     GK  
Sbjct: 423 GGLSWQTYNEEPSSSGDNTFTMVGLLEQINTTRDVSDYLWYMTDVHIDPSEGFLKSGKYP 482

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA  VF+N +L    YG+ DF     ++ + L  G+N + +LS+ VGL N G
Sbjct: 483 VLTVLSAGHALHVFINGQLSGTAYGSLDFPKLTFSQGVSLRAGVNKISLLSIAVGLPNVG 542

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  L  G+ DLS  +W Y++G+ GE + L  IS ++S  W +GS 
Sbjct: 543 PHFETWNAGILGPVTLNGLNEGRMDLSWQKWSYKIGLHGEALSLHSISGSSSVEWAEGSL 602

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYKTTF AP G  PLAL++ SMGKGQ W+NGQ +GR+W AY A  +G   +C
Sbjct: 603 VAQKQPLSWYKTTFNAPAGNSPLALDMGSMGKGQIWINGQHVGRHWPAYKA--SGTCGEC 660

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
            Y G+Y+ +KC  +CG+ +Q  YH+P++W+ P  NLLV+ EE GGDP+ +SL+ +    +
Sbjct: 661 TYIGTYNENKCSTNCGEASQRWYHVPQSWLKPTGNLLVVFEEWGGDPNGVSLVRREVDSV 720

Query: 712 CSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           C+ + E  P  ++      G V+    P+  L+C  G  I +I FAS+G PEG CGS+  
Sbjct: 721 CADIYEWQPTLMNYQMQASGKVNKPLRPKAHLSCGPGQKIRSIKFASFGTPEGVCGSYNQ 780

Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G+CH           CVGQ  CS+ V+    G     CP ++K LA EA CS
Sbjct: 781 GSCHAFHSYDAFNNLCVGQNSCSVTVAPEMFG--GDPCPSVMKKLAAEAICS 830


>gi|449489943|ref|XP_004158465.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
          Length = 1225

 Score =  825 bits (2132), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 406/713 (56%), Positives = 499/713 (69%), Gaps = 23/713 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+VTYDH+ALVIDGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN HEP
Sbjct: 24  ASVTYDHKALVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQYYFE R++LVRFVK VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 84  SPGQYYFEDRYELVRFVKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F AKI+ +MK E L+ SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA 
Sbjct: 144 GPFKAAMQKFTAKIVSMMKGEKLYHSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN   KP MWTE ++GWF  FG  
Sbjct: 204 MALGLDTGVPWVMCKQEDAPDPMIDTCNGFYCENFEPNKAYKPKMWTEAWTGWFTEFGGP 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLA+AVARF +  G+  NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG I
Sbjct: 264 VPYRPVEDLAYAVARFIQNRGSLINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLI 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHLR+LHKAIKLCE  L+S DPT   LG+K EAH+Y+  S +CAAFLANYD S+ 
Sbjct: 324 RQPKWGHLRDLHKAIKLCEPALVSVDPTVSSLGSKQEAHVYNTRSGECAAFLANYDPSTS 383

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             VTF  + Y LP WSVSILPDCK VVFNTAKV       + P    K     +   S+F
Sbjct: 384 VRVTFGNHPYDLPPWSVSILPDCKTVVFNTAKV-------NAPSYWPK-----MTPISSF 431

Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW+   EE      + +     L EQI+ T+D +DYLWY   I +   +     G+   L
Sbjct: 432 SWHSYNEETASAYADDTTTMAGLVEQISITRDATDYLWYMTDIRIDSNEGFLKSGQWPLL 491

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VF+N +L    YG  D      +K + L  G+N L +LS+ VGL N G  
Sbjct: 492 TIFSAGHALHVFINGQLSGTVYGGLDNPKLTFSKYVNLRPGVNKLSMLSVAVGLPNVGVH 551

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+   AG+   V L  L  G RD+S  +W Y+VG++GE + L  +S ++S  W  GS + 
Sbjct: 552 FETWNAGILGPVTLKGLNEGTRDMSGYKWSYKVGLKGEALNLHTVSGSSSVEWMTGSLVS 611

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYKTTF AP G  PLAL++ SMGKGQ W+NG+SIGR+W AY A   G   KC Y
Sbjct: 612 QKQPLTWYKTTFNAPGGNEPLALDMGSMGKGQVWINGESIGRHWPAYTA--RGSCGKCYY 669

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
            G +   KC   CG+P+Q  YH+PR W+ P  N+LVI EE GG+P  ISL+ +
Sbjct: 670 GGIFTEKKCHFSCGEPSQRWYHVPRAWLKPSGNILVIFEEWGGNPDGISLVKR 722



 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 247/515 (47%), Positives = 323/515 (62%), Gaps = 27/515 (5%)

Query: 206  INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
            I+TCNGFYC+ F PN   KP +WTEN+SGW+ +FG   P+RP ED+AF+VARF + GG+ 
Sbjct: 723  IDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPTPYRPPEDVAFSVARFIQNGGSL 782

Query: 266  QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
             NYYMY GGTNFGRT+G   V TSYD+DAPIDEYG +R+PKWGHLR+LHKAIKLCE  L+
Sbjct: 783  VNYYMYHGGTNFGRTSG-LFVTTSYDFDAPIDEYGLLREPKWGHLRDLHKAIKLCEPALV 841

Query: 326  SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
            S+DPT   LG   EA ++  SS  CAAFLANYD+S+   V F  + Y LP WS+SILPDC
Sbjct: 842  SADPTSTWLGKDQEARVFKSSSGACAAFLANYDTSAFVRVNFWNHPYDLPPWSISILPDC 901

Query: 386  KNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS----SAFSWY---EEKVGISGNRSFV 438
            K V FNTA+V  +R+        +  +  LL+A     S+F W    EE        +  
Sbjct: 902  KTVTFNTARV--RRD-------PKLFIPNLLMAKMTPISSFWWLSYKEEPASAYAKDTTT 952

Query: 439  RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVA 493
            +  L EQ++ T DT+DYLWY   I +   +     G+   L + S GH   VF+N +L  
Sbjct: 953  KDGLVEQVSVTWDTTDYLWYMTDIRIDSTEGFLKSGQWPLLTVNSAGHILHVFINGQLSG 1012

Query: 494  FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKN 552
              YG+ +      +K + L +G+N L +LS+ VGL N G  FD   AG+   V L  L  
Sbjct: 1013 SVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNE 1072

Query: 553  GKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKG 612
            G RD+S  +W Y+VG+ GE + L  +  +NS  W +GS     + L WYKTTF  P G  
Sbjct: 1073 GTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQ--KQPLTWYKTTFNTPAGNE 1130

Query: 613  PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT 672
            PLAL+++SM KGQ WVNG+SIGRY+  Y+A  +G   KC Y G +   KC  +CG P+Q 
Sbjct: 1131 PLALDMSSMSKGQIWVNGRSIGRYFPGYIA--SGKCNKCSYTGFFTEKKCLWNCGGPSQK 1188

Query: 673  LYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
             YHIPR W+ P  NLL+I EE+GG+P  ISL+ +T
Sbjct: 1189 WYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 1223


>gi|255546099|ref|XP_002514109.1| beta-galactosidase, putative [Ricinus communis]
 gi|223546565|gb|EEF48063.1| beta-galactosidase, putative [Ricinus communis]
          Length = 827

 Score =  824 bits (2129), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 411/831 (49%), Positives = 543/831 (65%), Gaps = 35/831 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A V YDH+A+ I+ +RR+L SGSIHYPRSTPE+WP LI+K+KEGG+EVI+TYVFWN H
Sbjct: 21  VTATVWYDHKAITINNQRRILISGSIHYPRSTPEMWPGLIQKAKEGGIEVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYF+ R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFP+WL ++PGI+FRT
Sbjct: 81  EPSPGQYYFQDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPMWLKYVPGIEFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F+  I+++MK++ LF +QGGPIIL+Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 DNGPFKAAMQKFVTLIVNMMKEQKLFQTQGGPIILSQIENEYGPVEWTIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A  LNT VPW+MC+QEDAPDP I+TCNGFYC+G+ PN+ +KP +WTEN++GW+  +G
Sbjct: 201 AAMATGLNTGVPWIMCKQEDAPDPTIDTCNGFYCEGYKPNNYNKPKVWTENWTGWYTEWG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            +VP+RP ED AF+VARF    G+F NYYMY GGTNF RTA G  +ATSYDYDAP+DEYG
Sbjct: 261 ASVPYRPPEDTAFSVARFIAASGSFVNYYMYHGGTNFDRTA-GLFMATSYDYDAPLDEYG 319

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
               PKWGHLR+LH+AIK  E  L+S+DPT   LG   EAH++ +S   CAAFLANYD+ 
Sbjct: 320 LTHDPKWGHLRDLHRAIKQSERALVSADPTVISLGKNQEAHVF-QSKMGCAAFLANYDTQ 378

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
             A V F    Y LP WS+S+LPDCK VV+NTAK+ +Q                ++  +S
Sbjct: 379 YSARVNFWNKPYSLPRWSISVLPDCKTVVYNTAKISAQSTQ-----------KWMMPVAS 427

Query: 421 AFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
            FSW     E  VG S   +F +  L EQ   T D +DYLWY   + +   +     GK 
Sbjct: 428 GFSWQSHIDEVPVGYSAG-TFTKVGLWEQKYLTGDKTDYLWYMTDVTINSNEGFLRSGKN 486

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
            FL + S GH   VF+N  L    YG+ +      ++ ++L  G+N + +LS  VGL N 
Sbjct: 487 PFLTVASAGHVLHVFINGHLAGSAYGSLENPKLTFSQNVKLVGGVNKIALLSATVGLANV 546

Query: 532 GAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           G  +D    G+   V L  L  G  D++  +W Y++G++GE + L   S   +  W QG+
Sbjct: 547 GVHYDTWNVGVLGPVTLQGLNQGTLDMTKWKWSYKIGLKGEDLKL--FSGGANVGWAQGA 604

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            L     L WYKT   AP G  P+AL + SMGKGQ ++NG+SIGR+W AY A   G  K 
Sbjct: 605 QLAKKTPLTWYKTFINAPPGNDPVALYMGSMGKGQMYINGRSIGRHWPAYTA--KGNCKD 662

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDY G YD  KC+  CGQP Q  YH+PR+W+ P  NLLV+ EE+GGDP+ ISL+ +    
Sbjct: 663 CDYAGYYDDQKCRSGCGQPPQQWYHVPRSWLKPTGNLLVVFEEMGGDPTGISLVKRVVGS 722

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           +C+ + + D P + SW  N+ V   +P+  L C  G   + I FASYG P+G CG++R G
Sbjct: 723 VCADIDD-DQPEMKSWTENIPV---TPKAHLWCPPGQKFSKIVFASYGWPQGRCGAYRQG 778

Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CH +      QK C+G+  C I V+ A  G     CPG  K L+V+  CS
Sbjct: 779 KCHALKSWDPFQKYCIGKGACDIDVAPATFG--GDPCPGSAKRLSVQLQCS 827


>gi|168008096|ref|XP_001756743.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691981|gb|EDQ78340.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 836

 Score =  817 bits (2111), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 422/834 (50%), Positives = 543/834 (65%), Gaps = 35/834 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++  V+YDHRAL +DG+RR+L SGSIHYPRSTP +WP LI K+KEGGL+VI+TYVFWN H
Sbjct: 24  VAVTVSYDHRALKLDGQRRMLVSGSIHYPRSTPLMWPGLIAKAKEGGLDVIQTYVFWNGH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP RG Y + GR++L +F++ V EAG++++LRIGPY CAEWN GGFP WL FIPGI+FRT
Sbjct: 84  EPTRGVYNYAGRYNLPKFIRLVYEAGMYVNLRIGPYVCAEWNSGGFPAWLRFIPGIEFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK E +RF+  ++  +K+E LFA QGGPII+AQ+ENEYGN++ +YG  G+ Y+ W 
Sbjct: 144 DNEPFKNETQRFVNHLVRKLKREKLFAWQGGPIIMAQIENEYGNIDASYGEAGQRYLNWI 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV  NTSVPW+MCQQ +AP  +INTCNGFYCDG+ PNS  KP  WTEN++GWF S+G
Sbjct: 204 ANMAVATNTSVPWIMCQQPEAPQLVINTCNGFYCDGWRPNSEDKPAFWTENWTGWFQSWG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P RPV+D+AF+VARFFE GG+F NYYMY GGTNF RT G   V TSYDYDAPIDEY 
Sbjct: 264 GGAPTRPVQDIAFSVARFFEKGGSFMNYYMYHGGTNFERT-GVESVTTSYDYDAPIDEYD 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
            +RQPKWGHL++LH A+KLCE  L+  D  PT   LG   EAH+Y  SS  CAAFLA++D
Sbjct: 323 -VRQPKWGHLKDLHAALKLCEPALVEVDTVPTGISLGPNQEAHVYQSSSGTCAAFLASWD 381

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
            ++D+ VTF G  Y LPAWSVSILPDCK+VVFNTAKV             Q  +  +  A
Sbjct: 382 -TNDSLVTFQGQPYDLPAWSVSILPDCKSVVFNTAKV-----------GAQSVIMTMQGA 429

Query: 419 SSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEV---- 472
               +W  Y E +G  G+  F    L EQI TTKDT+DYLWY  ++ V     + +    
Sbjct: 430 VPVTNWVSYHEPLGPWGS-VFSTNGLLEQIATTKDTTDYLWYMTNVQVAESDVRNISAQA 488

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + SL  AA  FVN     F  G          + I L  G N + +LSM +GLQ YG
Sbjct: 489 TLVMSSLRDAAHTFVN----GFYTGTSHQQFMHARQPISLRPGSNNITVLSMTMGLQGYG 544

Query: 533 AWFDVAGAGL-FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
            + +   AG+ + V + DL +G  +L    W YQVG++GE   L +++ + ++ W   S 
Sbjct: 545 PFLENEKAGIQYGVRIEDLPSGTIELGGSTWTYQVGLQGESKQLFEVNGSLTAEWNTISE 604

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +     L W KT F  P G G +AL+L+SMGKG  WVNG ++GRYWS++ A   GC   C
Sbjct: 605 VSDQNFLFWIKTRFDMPAGNGSIALDLSSMGKGVVWVNGVNLGRYWSSFTAQRDGCDASC 664

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           DYRGSY  SKC   C QP+Q  YHIPR W+ P  N +V+ EE GG+P  IS+ T+  Q I
Sbjct: 665 DYRGSYTQSKCLTKCNQPSQNWYHIPRQWLLPKNNFIVLFEEKGGNPKDISIATRMPQQI 724

Query: 712 CSFVSEADPPP--VDSW--KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           CS +S++ P P  + SW  + NL        + L C  G  I+ I FASYG P G+C  F
Sbjct: 725 CSHISQSHPFPFSLTSWTKRDNLTSTLLRAPLTLECAEGQQISRICFASYGTPSGDCEGF 784

Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
              +CH +    ++ KACVG+ +CS+P+ S+  G     CPGL K+LA  A CS
Sbjct: 785 VLSSCHANTSYDVLTKACVGRQKCSVPIVSSIFG--DDPCPGLSKSLAATAECS 836


>gi|350537549|ref|NP_001234298.1| beta-galactosidase precursor [Solanum lycopersicum]
 gi|7939617|gb|AAF70821.1|AF154420_1 beta-galactosidase [Solanum lycopersicum]
          Length = 892

 Score =  817 bits (2110), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/863 (48%), Positives = 563/863 (65%), Gaps = 63/863 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD+RAL+I GKRR+L S  IHYPR+TPE+WP LI +SKEGG +VIETY FWN HEP 
Sbjct: 36  NVTYDNRALIIGGKRRMLISAGIHYPRATPEMWPTLIARSKEGGADVIETYTFWNGHEPT 95

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY FEGR+D+V+F K V   GLFL +RIGPYACAEWN+GGFP+WL  IPGI+FRT N 
Sbjct: 96  RGQYNFEGRYDIVKFAKLVGSHGLFLFIRIGPYACAEWNFGGFPIWLRDIPGIEFRTDNA 155

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFKEEM+R++ KI+DLM  E+LF+ QGGPIIL Q+ENEYGNVE ++G  G+LY+KWAA+ 
Sbjct: 156 PFKEEMERYVKKIVDLMISESLFSWQGGPIILLQIENEYGNVESSFGPKGKLYMKWAAEM 215

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV L   VPWVMC+Q DAP+ II+TCN +YCDGFTPNS  KP +WTEN++GWF  +G  +
Sbjct: 216 AVGLGAGVPWVMCRQTDAPEYIIDTCNAYYCDGFTPNSEKKPKIWTENWNGWFADWGERL 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+RP ED+AFA+ARFF+ GG+ QNYYMYFGGTNFGRTAGGP   TSYDYDAP+DEYG +R
Sbjct: 276 PYRPSEDIAFAIARFFQRGGSLQNYYMYFGGTNFGRTAGGPTQITSYDYDAPLDEYGLLR 335

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSND-----------CA 351
           QPKWGHL++LH AIKLCE  L+++D P + KLG K EAH+Y  +SN+           CA
Sbjct: 336 QPKWGHLKDLHAAIKLCEPALVAADSPQYIKLGPKQEAHVYRGTSNNIGQYMSLNEGICA 395

Query: 352 AFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV-ISQRNNGDHPFAQQK 410
           AF+AN D    A V F G  + LP WSV        V    A++ +S +    H   Q K
Sbjct: 396 AFIANIDEHESATVKFYGQEFTLPPWSV--------VFCQIAEIQLSTQLRWGHKL-QSK 446

Query: 411 NVNELLL---------------ASSAF--SWY--EEKVGISGNRSFVRPDLAEQINTTKD 451
              ++L                +S +F  SW   +E +G+ G+++F    + E +N TKD
Sbjct: 447 QWAQILFQLGIILCFYKLSLKASSESFSQSWMTLKEPLGVWGDKNFTSKGILEHLNVTKD 506

Query: 452 TSDYLWYTASIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGYGNHDFANF 504
            SDYLWY   I++        +  +V   ++I+S+     +FVN +L     G       
Sbjct: 507 QSDYLWYLTRIYISDDDISFWEENDVSPTIDIDSMRDFVRIFVNGQLAGSVKGKW----I 562

Query: 505 LINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI-LIDLKNGKRDLSSGEWI 563
            + + ++L +G N + +LS  VGLQNYGA+ +  GAG    I L   K+G  +L++  W 
Sbjct: 563 KVVQPVKLVQGYNDILLLSETVGLQNYGAFLEKDGAGFKGQIKLTGCKSGDINLTTSLWT 622

Query: 564 YQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGK 623
           YQVG+ GE++ +  ++   S+ W +  T        WYKT F AP G  P+AL+ +SMGK
Sbjct: 623 YQVGLRGEFLEVYDVNSTESAGWTEFPTGTTPSVFSWYKTKFDAPGGTDPVALDFSSMGK 682

Query: 624 GQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHP 683
           GQAWVNG  +GRYW+  +AP+ GC + CDYRG+Y + KC+ +CG+  Q  YHIPR+W+  
Sbjct: 683 GQAWVNGHHVGRYWT-LVAPNNGCGRTCDYRGAYHSDKCRTNCGEITQAWYHIPRSWLKT 741

Query: 684 GENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN-----LGVVSSSPQ 738
             N+LVI EE    P  IS+ T++ + IC+ VSE   PP+  W  +     L ++  +P+
Sbjct: 742 LNNVLVIFEETDKTPFDISISTRSTETICAQVSEKHYPPLHKWSHSEFDRKLSLMDKTPE 801

Query: 739 VRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSA 797
           + L C+ G  I++I FASYG P G+C  F  G CH  + L +V +AC+G+  CSI +S+ 
Sbjct: 802 MHLQCDEGHTISSIEFASYGSPNGSCQKFSQGKCHAANSLSVVSQACIGRTSCSIGISN- 860

Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
             GV    C  ++K+LAV+A CS
Sbjct: 861 --GVFGDPCRHVVKSLAVQAKCS 881


>gi|302814772|ref|XP_002989069.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
 gi|300143170|gb|EFJ09863.1| hypothetical protein SELMODRAFT_269483 [Selaginella moellendorffii]
          Length = 722

 Score =  816 bits (2109), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/714 (56%), Positives = 500/714 (70%), Gaps = 20/714 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           LS  V YDHR L+I+G+ R+L S SIHYPR+ P++W +LI  +K GG++VIETYVFW+ H
Sbjct: 20  LSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGH 79

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           +P R  Y FEGRFDLV FVK V EAGL+ +LRIGPY CAEWN GGFPVWL  +PGI+FRT
Sbjct: 80  QPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVPGIEFRT 139

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK EM+ F+ KI+ +MK + LFA QGGPIILAQ+ENEYGN++ AYG  G+ Y++WA
Sbjct: 140 NNQPFKAEMQAFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMEWA 199

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A  L T VPW+MCQQ DAPD I++TCNGFYCD + PN+  KP MWTEN+SGWF  +G
Sbjct: 200 ANMAQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWG 259

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A P RPVED+AFAVARFF+ GG+FQNYYMYFGGTNFGR++GGP V TSYDYDAPIDE+G
Sbjct: 260 EASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFG 319

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDS 359
            IRQPKWGHL++LH AIKLCE  L S+DPT+  LG   EAH+Y   SS  CAAFLAN DS
Sbjct: 320 VIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDS 379

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
           SSDA V FN   Y LPAWSVSILPDCK V  NTAKV             Q  +  +  + 
Sbjct: 380 SSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKV-----------HVQTAMPTMKPSI 428

Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK--EVFLN 475
           +  +W  Y E VG+  +   V   L EQINTTKDTSDYLWYT S+ +        +  L+
Sbjct: 429 TGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAASGKALLS 488

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           +ES+     VFVN KL              + + IEL  G N+L IL   VGLQNYG + 
Sbjct: 489 LESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFI 548

Query: 536 DVAGAGL-FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +  GAG+  SVI+  L +G+ DL++ EWI+QVG++GE + +   S +    W   S +P 
Sbjct: 549 ETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWS--SAVPQ 606

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST-GCTKKCDY 653
            ++L+WYK  F +P G  P+AL+L SMGKGQAW+NGQSIGR+W +  AP T GC + CDY
Sbjct: 607 GQALVWYKAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFWPSLRAPDTAGCPQTCDY 666

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           RGSY +SKC+  CGQP+Q  YH+PR+W+    NL+V+ EE GG PS +S +T+T
Sbjct: 667 RGSYSSSKCRSGCGQPSQRWYHVPRSWLQDSGNLVVLFEEEGGKPSGVSFVTRT 720


>gi|293332101|ref|NP_001168664.1| uncharacterized protein LOC100382452 [Zea mays]
 gi|223950023|gb|ACN29095.1| unknown [Zea mays]
          Length = 815

 Score =  815 bits (2106), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/799 (51%), Positives = 524/799 (65%), Gaps = 28/799 (3%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +W  LI+K+K+GGL+VI+TYVFWN HEP  G YYFE R+DLVRFVKTVQ+AGLF+HLRIG
Sbjct: 29  MWEGLIQKAKDGGLDVIQTYVFWNGHEPTPGNYYFEERYDLVRFVKTVQKAGLFVHLRIG 88

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           PY C EWN+GGFPVWL ++PGI FRT N PFK  M+ F  KI+ +MK ENLFASQGGPII
Sbjct: 89  PYICGEWNFGGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPII 148

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
           L+Q+ENEYG     +G  G+ Y+ WAA  AV L+T VPWVMC++EDAPDP+IN CNGFYC
Sbjct: 149 LSQIENEYGPEGKEFGAAGQAYINWAAKMAVGLDTGVPWVMCKEEDAPDPVINACNGFYC 208

Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
           D F+PN P KP MWTE +SGWF  FG  +  RPVEDLAFAVARF + GG+F NYYMY GG
Sbjct: 209 DAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYYMYHGG 268

Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
           TNFGRTAGGP + TSYDYDAPIDEYG IR+PK  HL+ELH+A+KLCE+ L+S DPT   L
Sbjct: 269 TNFGRTAGGPFITTSYDYDAPIDEYGLIREPKHSHLKELHRAVKLCEQALVSVDPTITTL 328

Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK 394
           G   EAH++ +S + CAAFLANY+S+S A V FN   Y LP WS+SILPDCKNVVFN+A 
Sbjct: 329 GTMQEAHVF-RSPSGCAAFLANYNSNSHAKVVFNNEQYSLPPWSISILPDCKNVVFNSAT 387

Query: 395 VISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKV-GISGNRSFVRPDLAEQINTTKDTS 453
           V  Q +        Q  +      S  +  Y+E+V  ++         L EQ+N T+D+S
Sbjct: 388 VGVQTS--------QMQMWGDGATSMMWERYDEEVDSLAAAPLLTTTGLLEQLNVTRDSS 439

Query: 454 DYLWYTASIHVMP------GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLIN 507
           DYLWY  S+ + P      G GK   L+++S GHA  VFVN +L    YG  +      N
Sbjct: 440 DYLWYITSVDISPSENFLQGGGKPPSLSVQSAGHALHVFVNGQLQGSSYGTREDRRIKYN 499

Query: 508 KKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQV 566
             + L  G N + +LS+  GL N G  ++    G+   V+L  L  G RDL+   W YQV
Sbjct: 500 GNVNLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLNEGSRDLTWQTWSYQV 559

Query: 567 GVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQ 625
           G++GE + L+ +  + S  W QGS +   +  L WYK  F  P G  PLAL++ SMGKGQ
Sbjct: 560 GLKGEQMNLNSVEGSGSVEWMQGSLIAQKQQPLAWYKAYFETPSGDEPLALDMGSMGKGQ 619

Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGE 685
            W+NGQSIGRYW+AY   + G  K C Y G++ A KCQ  CGQP Q  YH+PR+W+ P  
Sbjct: 620 VWINGQSIGRYWTAY---ADGDCKGCSYTGTFRAPKCQAGCGQPTQRWYHVPRSWLQPSR 676

Query: 686 NLLVIHEEL-GGDPSKISLLTKTGQHICSFVSEADPPPVDSWK-PNLGVVS-SSPQVRLA 742
           NLLV+ EEL GGD SKI+L  ++   +C+ VSE D P +  W+  + G       +V L 
Sbjct: 677 NLLVVLEELGGGDSSKIALAKRSVSSVCADVSE-DHPNIKKWQIESYGEREHRRAKVHLR 735

Query: 743 CERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGV 801
           C  G  I+AI FAS+G P G CG+F+ G CH      +++K C+G   C + +S    G 
Sbjct: 736 CAHGQSISAIRFASFGTPVGTCGNFQQGGCHSASSHAVLEKRCIGLQRCVVAISPDNFG- 794

Query: 802 SAGACPGLLKALAVEAHCS 820
               CP + K +AVEA CS
Sbjct: 795 -GDPCPSVTKRVAVEAVCS 812


>gi|449464712|ref|XP_004150073.1| PREDICTED: beta-galactosidase 3-like [Cucumis sativus]
          Length = 848

 Score =  815 bits (2104), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 411/830 (49%), Positives = 536/830 (64%), Gaps = 29/830 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD +AL+I+G+R++L SGSIHYPRS P++W  LI K+K GGL+V++TYVFWN HEP 
Sbjct: 29  NVTYDGKALIINGQRKILFSGSIHYPRSVPDMWESLIEKAKMGGLDVVDTYVFWNLHEPS 88

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G Y FEGR DLV+F+K V++AGL++HLRIGPY C EWN+GGFP WL F+PGI FRT N 
Sbjct: 89  PGIYDFEGRNDLVKFIKLVEKAGLYVHLRIGPYICGEWNFGGFPAWLKFVPGISFRTDNE 148

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M +F  KI+ +MK E LF SQGGPIIL+Q+ENEY   +  +G  G  Y+ WAA  
Sbjct: 149 PFKLAMAKFTKKIVQMMKDERLFQSQGGPIILSQIENEYETEDKVFGEAGFAYMNWAAKM 208

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV ++T VPWVMC+Q+DAPDP+INTCNGFYCD F+PN P KP  WTE ++ WF +FG   
Sbjct: 209 AVQMDTGVPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPN 268

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             RPVEDLAF VARF + GG+  NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IR
Sbjct: 269 HKRPVEDLAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIR 328

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPK+GHL+ LH A+KLCE+ L++ +P    L    +A ++  SS DCAAFL+NY S++ A
Sbjct: 329 QPKFGHLKRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTA 388

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            VTFNG  Y LP WS+SILPDCK+V++NTA+V  Q N           ++ L     +FS
Sbjct: 389 RVTFNGRHYTLPPWSISILPDCKSVIYNTAQVQVQTN----------QLSFLPTKVESFS 438

Query: 424 W--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           W  Y E +  I  + S     L EQ+  TKD SDYLWYT S++V P +     GK   L 
Sbjct: 439 WETYNENISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLT 498

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
             S GH   VF+N KL    +G HD + F    +I L  G+N + +LS+  GL N G  +
Sbjct: 499 ATSKGHGMHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHY 558

Query: 536 DVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +    G+   + I  L  GK DLS  +W Y+VG++GE + L   S   +  W + S    
Sbjct: 559 EEREMGVLGPVAIHGLDKGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQE 618

Query: 595 N-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           N + L WYK  F APEG  PLAL++ SM KGQ W+NGQ++GRYW+  +  +  CT  C Y
Sbjct: 619 NAQPLTWYKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWT--ITANGNCT-DCSY 675

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
            G+Y   KCQ  CGQP Q  YH+PR+W+ P +NL+V+ EE+GG+PS+ISL+ ++   IC+
Sbjct: 676 SGTYRPRKCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICT 735

Query: 714 FVSEADPPPVD-SWKPNLGVVSSSP--QVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
             S+  P   +     N G ++     ++ L C  G  I+AI FAS+G P G CGS + G
Sbjct: 736 EASQYRPVIKNVHMHQNNGELNEQNVLKINLHCAAGQFISAIKFASFGTPSGACGSHKQG 795

Query: 771 ACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            CH      ++QK CVG+  C   + ++  G     CP L K L+ E  C
Sbjct: 796 TCHSPKSDYVLQKLCVGRQRCLATIPTSIFG--EDPCPNLRKKLSAEVVC 843


>gi|449489867|ref|XP_004158444.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
           sativus]
          Length = 725

 Score =  813 bits (2101), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/714 (55%), Positives = 498/714 (69%), Gaps = 23/714 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+VTYDH+A++I+G+RR+L SGSIHYPRS P++WP+LI+K+K+GGL+VIETYVFWN HEP
Sbjct: 24  ASVTYDHKAIIINGRRRILISGSIHYPRSIPQMWPDLIQKAKDGGLDVIETYVFWNGHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQY FE R+DLVRFVK V +AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 84  SPGQYNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+ LMK E L+ SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA 
Sbjct: 144 GPFKAAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ LNT VPWVMC+Q+DAPDP+I+TCNGFYC+ F PN   KP MWTE ++GWF  FG  
Sbjct: 204 MALGLNTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGGP 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P+RPVED+A++VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +
Sbjct: 264 APYRPVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLL 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R+PKW HLR+LHKAIKLCE  L+S DPT   LG+  EAH++   S  CAAFLANYD+SS 
Sbjct: 324 REPKWSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASSS 383

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A VTF  N Y LP WSVSILPDCK+V+FNTAKV         P +Q K     +   S+F
Sbjct: 384 ATVTFGNNQYDLPPWSVSILPDCKSVIFNTAKV-------GAPTSQPK-----MTPVSSF 431

Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW    EE        +     L EQI+ T+D++DYLWY   I + P +     G+   L
Sbjct: 432 SWLSYNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLL 491

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + S GHA  VF+N +L    YG  +      +K + L  GIN L ILS+ VGL N G  
Sbjct: 492 TVFSAGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLH 551

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++    G+   V L  L    RD+S  +W Y++G++GE + L  +S ++S  W  GS + 
Sbjct: 552 YETWNTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSLVA 611

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYKTTF +P+G  PLAL+++SMGKGQ W+NGQSIGR+W AY A   G   KC+Y
Sbjct: 612 QKQPLTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTA--KGSCGKCNY 669

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G ++  KC   CG+P+Q  YH+PR W+    N+LVI EE GG+P  ISL+ ++
Sbjct: 670 GGIFNEKKCHSXCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRS 723


>gi|3641865|emb|CAA09457.1| beta-galactosidase [Cicer arietinum]
          Length = 723

 Score =  813 bits (2099), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/715 (55%), Positives = 496/715 (69%), Gaps = 22/715 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+ +VIDG+RR+L SGSIHYPRSTPE+WP L +K+KEGGL+VI+TYVFWN H
Sbjct: 21  VTASVTYDHKTIVIDGQRRILISGSIHYPRSTPEMWPALFQKAKEGGLDVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFE RFDLV+F+K  Q+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81  EPSPGKYYFEDRFDLVKFIKLAQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK ENLF +QGGPII++Q+ENEYG VEW  G  G+ Y  WA
Sbjct: 141 DNEPFKAAMQKFTTKIVSMMKAENLFQNQGGPIIMSQIENEYGPVEWNIGAPGKAYTNWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPW MC+QEDAPDP+I+TCNG+YC+ FTPN   KP MWTEN+SGW+  FG
Sbjct: 201 AQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNKNYKPKMWTENWSGWYTDFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A+ +RPVEDLA++VARF +  G+F NYYMY GGTNFGRT+ G  +ATSYDYDAPIDEYG
Sbjct: 261 NAICYRPVEDLAYSVARFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
              +PKW HLR+LHKAIK CE  L+S DPT   LG KLEAH+Y   ++ CAAFLANYD+ 
Sbjct: 321 LTNEPKWSHLRDLHKAIKQCEPALVSVDPTITSLGNKLEAHVYSTGTSVCAAFLANYDTK 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A VTF    Y LP WSVSILPDCK  VFNTAKV +Q        + QK    ++  +S
Sbjct: 381 SAATVTFGNGKYDLPPWSVSILPDCKTDVFNTAKVGAQ--------SSQKT---MISTNS 429

Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            F W    EE    S + S     L EQIN T+D+SDYLWY   +++ P +     G+  
Sbjct: 430 TFDWQSYIEEPAFSSEDDSITAEALWEQINVTRDSSDYLWYLTDVNISPNEDFIKNGQYP 489

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            LN+ S GH   VFVN +L    YG  D      +  + L  G N + +LS+ VGL N G
Sbjct: 490 ILNVMSAGHVLHVFVNGQLSGTVYGVLDNPKLTFSNSVNLTVGNNKISLLSVAVGLPNVG 549

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  L  G RDLS  +W Y+VG++GE + L  I+  +S  W QGS 
Sbjct: 550 LHFETWNVGVLGPVTLKGLNEGTRDLSWQKWSYKVGLKGESLSLHTITGGSSVDWTQGSL 609

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + L WYK TF AP G  PL L+++SMGKG+ WVN QSIGR+W  Y+A   G    C
Sbjct: 610 LAKKQPLTWYKATFNAPAGNDPLGLDMSSMGKGEIWVNDQSIGRHWPGYIA--HGSCGDC 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           DY G++  +KC+ +CG P QT YHIPR+W++P  N+LV+ EE GGDPS ISLL +
Sbjct: 668 DYAGTFTNTKCRTNCGNPTQTWYHIPRSWLNPTGNVLVVLEEWGGDPSGISLLKR 722


>gi|357449771|ref|XP_003595162.1| Beta-galactosidase [Medicago truncatula]
 gi|124360798|gb|ABN08770.1| Galactose-binding like [Medicago truncatula]
 gi|355484210|gb|AES65413.1| Beta-galactosidase [Medicago truncatula]
          Length = 726

 Score =  812 bits (2097), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/716 (54%), Positives = 502/716 (70%), Gaps = 22/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+VI+GKRR+L SGSIHYPRSTP++WP+LI+K+K+GG++VIETYVFWN H
Sbjct: 24  VTASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIETYVFWNGH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP +G+YYFE RFDLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 84  EPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVAFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK ENLF SQGGPIIL+Q+ENEYG VEW  G  G+ Y KW 
Sbjct: 144 DNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWF 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           +  AV LNT VPWVMC+QEDAPDPII+TCNG+YC+ F+PN   KP MWTEN++GW+  FG
Sbjct: 204 SQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMWTENWTGWYTDFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP EDLAF+VARF +  G++ NYYMY GGTNFGRT+ G  +ATSYDYDAPIDEYG
Sbjct: 264 TAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            I +PKWGHLR+LHKAIK CE  L+S DPT    G  LE H+Y  S   CAAFLANYD+ 
Sbjct: 324 LISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSFGACAAFLANYDTG 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F    Y LP WS+SILPDCK  VFNTAKV + R +             +  A+S
Sbjct: 384 SWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVH-----------RSMTPANS 432

Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AF+W  Y E+   SG   S+    L EQ++ T D SDYLWY   +++ P +     G+  
Sbjct: 433 AFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNISPNEGFIKNGQNP 492

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L   S GH   VF+N +     YG+ D      +  ++L  G N + +LS+ VGL N G
Sbjct: 493 VLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKISLLSVAVGLSNVG 552

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             ++    G+   V L  L  G RDLS  +W Y++G++GE + L   S ++S  W QGS 
Sbjct: 553 VHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKIGLKGESLNLHTTSGSSSVKWTQGSF 612

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + L WYKTTF AP G  PLAL+++SMGKG+ WVNGQSIGR+W AY+A   G    C
Sbjct: 613 LSKKQPLTWYKTTFNAPAGNDPLALDMSSMGKGEIWVNGQSIGRHWPAYIA--RGNCGSC 670

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G++   KC+ +CGQP Q  YHIPR+W++P  N+LV+ EE GGDP+ ISL+ +T
Sbjct: 671 NYAGTFTDKKCRTNCGQPTQKWYHIPRSWLNPSGNVLVVLEEWGGDPTGISLVKRT 726


>gi|12583687|dbj|BAB21492.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 731

 Score =  811 bits (2095), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/714 (54%), Positives = 503/714 (70%), Gaps = 23/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23  SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 83  PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN   KP MWTE ++GW+  FG 
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 263 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +R+PKWGHLR+LHKAIK CE  L+S DP+  KLG+  EAH++ KS +DCAAFLANYD+  
Sbjct: 323 LREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 381

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V+F G  Y LP WS+SILPDCK  V++TAKV SQ +             ++    S 
Sbjct: 382 SVKVSFGGGQYDLPPWSISILPDCKTEVYSTAKVGSQSSQ-----------VQMTPVHSG 430

Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
           F W     E        +     L EQIN T+DT+DYLWY   I +   +     GK   
Sbjct: 431 FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 490

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GHA  VF+N +L    YG+ +      ++ + L  GIN L +LS+ VGL N G 
Sbjct: 491 LTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 550

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   + L  L +G  D+S  +W Y+ G++GE +GL  ++ ++S  W +G ++
Sbjct: 551 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 610

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYK TF AP G  PLAL++ SMGKGQ W+NGQS+GR+W  Y+A   G    C 
Sbjct: 611 AKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 668

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           Y G+YD  KC+ HCG+P+Q  YHIPR+W+ P  NLLV+ EE GGDPS+ISL+ +
Sbjct: 669 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPNGNLLVVFEEWGGDPSRISLVER 722


>gi|84579369|dbj|BAE72073.1| pear beta-galactosidase1 [Pyrus communis]
          Length = 731

 Score =  811 bits (2094), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/714 (54%), Positives = 501/714 (70%), Gaps = 23/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23  SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 83  PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN   KP MWTE ++GW+  FG 
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 263 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R+PKWGHLR+LHKAIK CE  L+S DP+  KLG+  EAH++ KS +DCAAFLANYD+  
Sbjct: 323 PREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 381

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V+F G  Y LP WS+SILPDCK  V+NTAKV SQ +             ++    S 
Sbjct: 382 SVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-----------VQMTPVHSG 430

Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
           F W     E        +     L EQIN T+DT+DYLWY   I +   +     GK   
Sbjct: 431 FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 490

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GHA  VF+N +L    YG+ +      ++ + L  GIN L +LS+ VGL N G 
Sbjct: 491 LTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 550

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   + L  L +G  D+S  +W Y+ G++GE +GL  ++ ++S  W +G ++
Sbjct: 551 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 610

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYK TF AP G  PLAL++ SMGKGQ W+NGQS+GR+W  Y+A   G    C 
Sbjct: 611 AKKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 668

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           Y G+YD  KC+ HCG+P+Q  YHIPR+W+ P  NLLV+ EE GGDPS ISL+ +
Sbjct: 669 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSGISLVER 722


>gi|51507377|emb|CAH18936.1| beta-galactosidase [Pyrus communis]
          Length = 724

 Score =  810 bits (2092), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/714 (54%), Positives = 501/714 (70%), Gaps = 23/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 16  SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 75

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 76  PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 135

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 136 NEPFKAAMQKFTEKIVSMMKAEKLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 195

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN   KP MWTE ++GW+  FG 
Sbjct: 196 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 255

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 256 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 315

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R+PKWGHLR+LHKAIK CE  L+S DP+  KLG+  EAH++ KS +DCAAFLANYD+  
Sbjct: 316 PREPKWGHLRDLHKAIKPCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 374

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V+F G  Y LP WS+SILPDCK  V+NTAKV SQ +             ++    S 
Sbjct: 375 SVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-----------VQMTPVHSG 423

Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
           F W     E        +     L EQIN T+DT+DYLWY   I +   +     GK   
Sbjct: 424 FPWQSFIEETTSSDETDTTYMDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 483

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GHA  VF+N +L    YG+ +      ++ + L  GIN L +LS+ VGL N G 
Sbjct: 484 LTISSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 543

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   + L  L +G  D+S  +W Y+ G++GE +GL  ++ ++S  W +G ++
Sbjct: 544 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 603

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L W+K TF AP G  PLAL++ SMGKGQ W+NGQS+GR+W  Y+A   G    C 
Sbjct: 604 AKKQPLTWHKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 661

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           Y G+YD  KC+ HCG+P+Q  YHIPR+W+ P  NLLV+ EE GGDPS ISL+ +
Sbjct: 662 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSGISLVER 715


>gi|1352078|sp|P48981.1|BGAL_MALDO RecName: Full=Beta-galactosidase; AltName: Full=Acid
           beta-galactosidase; Short=Lactase; AltName:
           Full=Exo-(1-->4)-beta-D-galactanase; Flags: Precursor
 gi|507278|gb|AAA62324.1| b-galactosidase-related protein; putative [Malus x domestica]
          Length = 731

 Score =  808 bits (2086), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/714 (54%), Positives = 500/714 (70%), Gaps = 23/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23  SASVSYDHKAIIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G YYFE R+DLV+F+K VQ+ GLF++LRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 83  PSPGNYYFEERYDLVKFIKLVQQEGLFVNLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN   KP MWTE ++GW+  FG 
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDYKPKMWTEVWTGWYTEFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RP ED+AF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 263 AVPTRPAEDVAFSVARFIQSGGSFLNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R+PKWGHLR+LHKAIK CE  L+S DP+  KLG+  EAH++ KS +DCAAFLANYD+  
Sbjct: 323 PREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF-KSESDCAAFLANYDAKY 381

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V+F G  Y LP WS+SILPDCK  V+NTAKV SQ +             ++    S 
Sbjct: 382 SVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-----------VQMTPVHSG 430

Query: 422 FSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
           F W     E        +     L EQIN T+DT+DYLWY   I +   +     GK   
Sbjct: 431 FPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWYMTDITIGSDEAFLKNGKSPL 490

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GHA  VF+N +L    YG+ +      ++ + L  GIN L +LS+ VGL N G 
Sbjct: 491 LTIFSAGHALNVFINGQLSGTVYGSLENPKLSFSQNVNLRSGINKLALLSISVGLPNVGT 550

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+   AG+   + L  L +G  D+S  +W Y+ G++GE +GL  ++ ++S  W +G ++
Sbjct: 551 HFETWNAGVLGPITLKGLNSGTWDMSGWKWTYKTGLKGEALGLHTVTGSSSVEWVEGPSM 610

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYK TF AP G  PLAL++ SMGKGQ W+NGQS+GR+W  Y+A   G    C 
Sbjct: 611 AEKQPLTWYKATFNAPPGDAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--RGSCGDCS 668

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           Y G+YD  KC+ HCG+P+Q  YHIPR+W+ P  NLLV+ EE GGDPS+ISL+ +
Sbjct: 669 YAGTYDDKKCRTHCGEPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSRISLVER 722


>gi|20384648|gb|AAK31801.1| beta-galactosidase [Citrus sinensis]
          Length = 737

 Score =  808 bits (2086), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/713 (53%), Positives = 501/713 (70%), Gaps = 17/713 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YDH+A++I+G++R+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 35  VKASVSYDHKAVIINGQKRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 94

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP +G YYF+ R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFPVWL ++PGI+FRT
Sbjct: 95  EPTQGNYYFQDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPVWLKYVPGIEFRT 154

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M +F  KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW  G  G+ Y KWA
Sbjct: 155 DNGPFKAAMHKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFGPVEWDIGAPGKAYAKWA 214

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV LNT VPWVMC+Q+DAPDP+INTCNGFYC+ F PN   KP MWTE ++GWF  FG
Sbjct: 215 AQMAVGLNTGVPWVMCKQDDAPDPVINTCNGFYCEKFVPNQNYKPKMWTEAWTGWFTEFG 274

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP EDL F+VARF ++GG+F NYYMY GGTNFGRT+GG  VATSYDYDAPIDEYG
Sbjct: 275 SAVPTRPAEDLVFSVARFIQSGGSFINYYMYHGGTNFGRTSGG-FVATSYDYDAPIDEYG 333

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PKWGHLR LHKAIKLCE  L+S DPT + LG   EAH+++  S  CAAFLANYD++
Sbjct: 334 LLNEPKWGHLRGLHKAIKLCEPALVSVDPTVKSLGENQEAHVFNSISGKCAAFLANYDTT 393

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
             A V+F    Y LP WS+S+LPDCK  VFNTA+V  Q        + QK    ++ A S
Sbjct: 394 FSAKVSFGNAQYDLPPWSISVLPDCKTAVFNTARVGVQ--------SSQKKFVPVINAFS 445

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
             S+ EE    + + +F +  L EQ+  T D SDYLWY   +++   +     G++  L 
Sbjct: 446 WQSYIEETASSTDDNTFTKDGLWEQVYLTADASDYLWYMTDVNIGSNEGFLKNGQDPLLT 505

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I S GHA  VF+N +L    YG+ +      +K ++L  G+N + +LS  VGL N G  F
Sbjct: 506 IWSAGHALQVFINGQLSGTVYGSLENPKLTFSKNVKLRAGVNKISLLSTSVGLPNVGTHF 565

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V L  L  G RD+S  +W Y++G++GE + L  +S ++S  W QG++L  
Sbjct: 566 EKWNAGVLGPVTLKGLNEGTRDISKQKWTYKIGLKGEALSLHTVSGSSSVEWAQGASLAQ 625

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + + WYKTTF  P G  PLAL++ +MGKG  W+NGQSIGR+W  Y+    G    C+Y 
Sbjct: 626 KQPMTWYKTTFNVPPGNDPLALDMGAMGKGMVWINGQSIGRHWPGYIG--NGNCGGCNYA 683

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G+Y   KC+ +CG+P+Q  YH+PR+ + P  NLLV+ EE GG+P  ISLL +T
Sbjct: 684 GTYTEKKCRTYCGKPSQRWYHVPRSRLKPSGNLLVVFEEWGGEPHWISLLKRT 736


>gi|3641863|emb|CAA06309.1| beta-galactosidase [Cicer arietinum]
          Length = 730

 Score =  805 bits (2079), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/716 (54%), Positives = 499/716 (69%), Gaps = 21/716 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+VI+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GG++VI+TYVFWN H
Sbjct: 27  VTASVTYDHKAIVINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIQTYVFWNGH 86

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G YYFE RFDLV+FVK VQ+AGL+++LRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 87  EPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGVAFRT 146

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F AKI+ +MK ENLF SQGGPII++Q+ENEYG VEW  G  G+ Y KW 
Sbjct: 147 DNEPFKAAMQKFTAKIVSMMKAENLFESQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWF 206

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           +  A+ L+T VPW+MC+QEDAPDPII+TCNG+YC+ FTPN   KP MWTEN+SGW+  FG
Sbjct: 207 SQMAIGLDTGVPWIMCKQEDAPDPIIDTCNGYYCENFTPNKNYKPKMWTENWSGWYTDFG 266

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP +D+AF+VARF +  G++ NYYMY GGTNFGRT+ G  +ATSYDYDAPIDEYG
Sbjct: 267 SAVPYRPAQDVAFSVARFIQNRGSYVNYYMYHGGTNFGRTSAGLFIATSYDYDAPIDEYG 326

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PKWGHLR LHKAIK CE  L+S DPT    G  LE H+Y  S+  CAAFLANYD++
Sbjct: 327 LLSEPKWGHLRNLHKAIKQCEPILVSVDPTVSWPGKNLEVHVYKTSTGACAAFLANYDTT 386

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A VTF    Y LP WS+SILPDCK  VFNTAKV      G  P   +K    +   SS
Sbjct: 387 SPAKVTFGNGQYDLPPWSISILPDCKTAVFNTAKV------GTVPSFHRK----MTPVSS 436

Query: 421 AFSW--YEEKVGISG-NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AF W  Y E    SG + S     L EQI  T+D+SDYLWY   +++ P +     G+  
Sbjct: 437 AFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNGQYP 496

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L   S GH   VFVN +     YG  +      +  ++L  G N + +LS+ VGL N G
Sbjct: 497 VLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSNVG 556

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             ++    G+   V L  L  G RDLS  +W Y++G++GE + L  +  ++S  W +GS+
Sbjct: 557 LHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTKGSS 616

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + L WYK TF AP G  PLAL+++SMGKG+ WVNG+SIGR+W AY+A   G    C
Sbjct: 617 LVKKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAYIA--RGSCGGC 674

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G++   KC+  CGQP Q  YHIPR+WV+P  N LV+ EE GGDPS ISL+ +T
Sbjct: 675 NYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISLVKRT 730


>gi|186461094|gb|ACC78255.1| beta-galactosidase [Carica papaya]
          Length = 721

 Score =  804 bits (2077), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/716 (53%), Positives = 502/716 (70%), Gaps = 23/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A V+YDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+ +KEGGL+VI+TYVFWN H
Sbjct: 19  VEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGH 78

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G YYFE R+DLV+F+K V +AGL++HLRIGPY C EWN+GGFPVWL ++PGIQFRT
Sbjct: 79  EPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRIGPYICGEWNFGGFPVWLKYVPGIQFRT 138

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M++F  KI+++MK E LF  QGGPII++Q+ENEYG +EW  G  G+ Y KWA
Sbjct: 139 DNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWA 198

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPW+MC+QEDAPDPII+TCNGFYC+ F PN+  KP M+TE ++GW+  FG
Sbjct: 199 AQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPKMFTEAWTGWYTEFG 258

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP+RP ED+A++VARF +  G+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 259 GPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 318

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PKWGHLR+LHK IKLCE  L+S DP    LG+  EAH++  +   CAAFLANYD  
Sbjct: 319 LRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFW-TKTSCAAFLANYDLK 377

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               VTF    Y LP WSVSILPDCK VVFNTAKV+S           Q ++ +++  +S
Sbjct: 378 YSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVS-----------QGSLAKMIAVNS 426

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AFSW    EE    + +  F +  L EQI+ T+D +DYLWY   + + P +     G++ 
Sbjct: 427 AFSWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYLWYMTDVTIGPDEAFLKNGQDP 486

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA  VFVN +L    YG  +      + K++L  G+N + +LS+ VGL N G
Sbjct: 487 ILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLLSIAVGLPNVG 546

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  + +G  D+S  +W Y++G++GE + L  +S ++S  W +GS 
Sbjct: 547 LHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSGSSSVEWVEGSL 606

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + LIWYKTTF AP G  PLAL++ SMGKGQ W+NGQSIGR+W  Y A   G    C
Sbjct: 607 LAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGYKA--RGSCGAC 664

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G YD  KC  +CG+ +Q  YH+PR+W++P  NLLV+ EE GGDP+KISL+ + 
Sbjct: 665 NYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKISLVKRV 720


>gi|302824860|ref|XP_002994069.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
 gi|300138075|gb|EFJ04856.1| hypothetical protein SELMODRAFT_187747 [Selaginella moellendorffii]
          Length = 741

 Score =  803 bits (2073), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 400/731 (54%), Positives = 498/731 (68%), Gaps = 37/731 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           LS  V YDHR L+I+G+ R+L S SIHYPR+ P++W +LI  +K GG++VIETYVFW+ H
Sbjct: 22  LSDTVAYDHRGLIINGQHRMLISASIHYPRAAPQMWSQLISNAKAGGIDVIETYVFWDGH 81

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           +P R  Y FEGRFDLV FVK V EAGL+ +LRIGPY CAEWN GGFPVWL  + GI+FRT
Sbjct: 82  QPTRDTYNFEGRFDLVSFVKLVHEAGLYANLRIGPYVCAEWNLGGFPVWLKDVAGIEFRT 141

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK EM+ F+ KI+ +MK + LFA QGGPIILAQ+ENEYGN++ AYG  G+ Y+ WA
Sbjct: 142 NNQPFKAEMQTFVEKIVAMMKHDKLFAPQGGPIILAQIENEYGNIDAAYGAAGKEYMVWA 201

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ +  L T VPW+MCQQ DAPD I++TCNGFYCD + PN+  KP MWTEN+SGWF  +G
Sbjct: 202 ANMSQGLGTGVPWIMCQQSDAPDYILDTCNGFYCDAWAPNNKKKPKMWTENWSGWFQKWG 261

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A P RPVED+AFAVARFF+ GG+FQNYYMYFGGTNFGR++GGP V TSYDYDAPIDE+G
Sbjct: 262 EASPHRPVEDVAFAVARFFQRGGSFQNYYMYFGGTNFGRSSGGPYVTTSYDYDAPIDEFG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDS 359
            IRQPKWGHL++LH AIKLCE  L S+DPT+  LG   EAH+Y   SS  CAAFLAN DS
Sbjct: 322 VIRQPKWGHLKQLHAAIKLCEAALGSNDPTYISLGQLQEAHVYGSTSSGACAAFLANIDS 381

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
           SSDA V FN   Y LPAWSVSILPDCK V  NTAKV             Q  +  +  + 
Sbjct: 382 SSDATVKFNSRTYLLPAWSVSILPDCKTVSHNTAKV-----------DVQTAMPTMKPSI 430

Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK--EVFLN 475
           +  +W  Y E VG+  +   V   L EQINTTKDTSDYLWYT S+ +        +  L 
Sbjct: 431 TGLAWESYPEPVGVWSDSGIVASALLEQINTTKDTSDYLWYTTSLDISQADAASGKALLY 490

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           +ES+     VFVN KL              + + IEL  G N+L IL   VGLQNYG + 
Sbjct: 491 LESMRDVVHVFVNGKLAGSASTKGTQLYAAVEQPIELASGHNSLAILCATVGLQNYGPFI 550

Query: 536 DVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +  GAG+  SVI+  L +G+ DL++ EWI+QVG++GE + +   S +    W   S +P 
Sbjct: 551 ETWGAGINGSVIVKGLPSGQIDLTAEEWIHQVGLKGESLAIFTESGSQRVRWS--SAVPQ 608

Query: 595 NKSLIWYKTTFL-----------------APEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
            ++L+WYK  F                  +P G  P+AL+L SMGKGQAW+NGQSIGR+W
Sbjct: 609 GQALVWYKVIFQHHGITCIVWIAMQAHFDSPSGNDPVALDLESMGKGQAWINGQSIGRFW 668

Query: 638 SAYLAPST-GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
            +  AP T GC + CDYRGSY +SKC+  CGQP+Q  YH+PR+W+  G NL+V+ EE GG
Sbjct: 669 PSLRAPDTAGCPQTCDYRGSYSSSKCRSGCGQPSQRWYHVPRSWLQDGGNLVVLFEEEGG 728

Query: 697 DPSKISLLTKT 707
            PS +S +T+T
Sbjct: 729 KPSGVSFVTRT 739


>gi|318136780|gb|ADV41669.1| beta-D-galactosidase [Actinidia deliciosa var. deliciosa]
          Length = 728

 Score =  802 bits (2071), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/716 (54%), Positives = 498/716 (69%), Gaps = 22/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYD +A+ I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VI+TYVFWN H
Sbjct: 25  VTASVTYDGKAIKINGQRRILFSGSIHYPRSTPEMWPGLIQKAKEGGLDVIQTYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFEGR+DLVRF+K  Q+AGL++HLRIG Y CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85  EPSPGQYYFEGRYDLVRFIKLAQQAGLYVHLRIGLYVCAEWNFGGFPVWLKYVPGIAFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI++LMK E LF SQGGPII++Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 145 DNGPFKAAMQKFTEKIVNLMKSEKLFESQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ AV L+T VPW+MC+QEDAPDPII+TCNGFYC+GFTPN   KP MWTE ++GW+  FG
Sbjct: 205 AEMAVGLDTGVPWIMCKQEDAPDPIIDTCNGFYCEGFTPNKNYKPKMWTEAWTGWYTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPVEDLA++VARF +  G+F NYYMY GGTNFGRTA G  VATSYDYDAPIDEYG
Sbjct: 265 GPIHNRPVEDLAYSVARFIQNNGSFVNYYMYHGGTNFGRTAAGLFVATSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PKWGHLR+LHKAIKLCE  L+S+ PT    G  LE H++ KS + CAAFLANYD S
Sbjct: 325 LPREPKWGHLRDLHKAIKLCEPSLVSAYPTVTWPGKNLEVHVF-KSKSSCAAFLANYDPS 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A VTF    Y LP WS+SILPDCKN VFNTA+V S+ +           +    ++  
Sbjct: 384 SPAKVTFQNMQYDLPPWSISILPDCKNAVFNTARVSSKSS----------QMKMTPVSGG 433

Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AFSW    EE V    + +  +  L EQI+ T+D SDYLWY   +++ P +     G+  
Sbjct: 434 AFSWQSYIEETVSADDSDTIAKNGLWEQISITRDGSDYLWYLTDVNIHPNEGFLKNGQSP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA  VF+N +L    YG+ +      +  ++L  GIN + +LS  VGL N G
Sbjct: 494 VLTVMSAGHALHVFINGQLAGTVYGSLENPKLTFSNNVKLRAGINKISLLSAAVGLPNVG 553

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  L  G RDL+  +W Y+VG++GE + L  +S ++S  W QGS 
Sbjct: 554 LHFETWNTGVLGPVTLKGLNEGTRDLTKQKWSYKVGLKGEDLSLHTLSGSSSVEWVQGSL 613

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + L WYK TF APEG  PLAL++ +MGKGQ W+NG+SIGR+W  Y A  +G    C
Sbjct: 614 LAQKQPLTWYKATFNAPEGNDPLALDMNTMGKGQIWINGESIGRHWPEYKA--SGNCGGC 671

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G Y   KC  +CG+ +Q  YH+PR+W+ P  N LV+ EELGGDP+ IS + +T
Sbjct: 672 SYAGIYTEKKCLSNCGEASQRWYHVPRSWLKPSGNFLVVFEELGGDPTGISFVRRT 727


>gi|3869280|gb|AAC77377.1| beta-galactosidase precursor [Carica papaya]
          Length = 721

 Score =  801 bits (2070), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/716 (53%), Positives = 501/716 (69%), Gaps = 23/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A V+YDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+ +KEGGL+VI+TYVFWN H
Sbjct: 19  VEATVSYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQNAKEGGLDVIQTYVFWNGH 78

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G YYFE R+DLV+F+K V +AGL++HLRI PY C EWN+GGFPVWL ++PGIQFRT
Sbjct: 79  EPSPGNYYFEDRYDLVKFIKLVHQAGLYVHLRISPYICGEWNFGGFPVWLKYVPGIQFRT 138

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M++F  KI+++MK E LF  QGGPII++Q+ENEYG +EW  G  G+ Y KWA
Sbjct: 139 DNGPFKAQMQKFTEKIVNMMKAEKLFEPQGGPIIMSQIENEYGPIEWEIGAPGKAYTKWA 198

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPW+MC+QEDAPDPII+TCNGFYC+ F PN+  KP M+TE ++GW+  FG
Sbjct: 199 AQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMPNANYKPKMFTEAWTGWYTEFG 258

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP+RP ED+A++VARF +  G+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 259 GPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 318

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PKWGHLR+LHK IKLCE  L+S DP    LG+  EAH++  +   CAAFLANYD  
Sbjct: 319 LRREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQEAHVFW-TKTSCAAFLANYDLK 377

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               VTF    Y LP WSVSILPDCK VVFNTAKV+S           Q ++ +++  +S
Sbjct: 378 YSVRVTFQNLPYDLPPWSVSILPDCKTVVFNTAKVVS-----------QGSLAKMIAVNS 426

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AFSW    EE    + +  F +  L EQI+ T+D +DYLWY   + + P +     G++ 
Sbjct: 427 AFSWQSYNEETPSANYDAVFTKDGLWEQISVTRDATDYLWYMTDVTIGPDEAFLKNGQDP 486

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA  VFVN +L    YG  +      + K++L  G+N + +LS+ VGL N G
Sbjct: 487 ILTVMSAGHALHVFVNGQLSGTVYGQLENPKLAFSGKVKLRAGVNKVSLLSIAVGLPNVG 546

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  + +G  D+S  +W Y++G++GE + L  +S ++S  W +GS 
Sbjct: 547 LHFETWNAGVLGPVTLKGVNSGTWDMSKWKWSYKIGLKGEALSLHTVSGSSSVEWVEGSL 606

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + LIWYKTTF AP G  PLAL++ SMGKGQ W+NGQSIGR+W  Y A   G    C
Sbjct: 607 LAQRQPLIWYKTTFNAPVGNDPLALDMNSMGKGQIWINGQSIGRHWPGYKA--RGSCGAC 664

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G YD  KC  +CG+ +Q  YH+PR+W++P  NLLV+ EE GGDP+KISL+ + 
Sbjct: 665 NYAGIYDEKKCHSNCGKASQRWYHVPRSWLNPTANLLVVFEEWGGDPTKISLVKRV 720


>gi|356556286|ref|XP_003546457.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 721

 Score =  801 bits (2069), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/716 (54%), Positives = 499/716 (69%), Gaps = 24/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+V+DGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21  VTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFE RFDLV+FVK VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81  EPSPGQYYFEDRFDLVKFVKLVQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F AKI+ LMK+  LF SQGGPII++Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+QEDAPDP+I+TCNG+YC+ F PN  +KP MWTEN++GW+  FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGYYCENFKPNKNTKPKMWTENWTGWYTDFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG  +ATSYDYDAP+DEYG
Sbjct: 261 GAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
              +PK+ HLR LHKAIK CE  L+++DP  Q LG  LEAH++  +   CAAF+ANYD+ 
Sbjct: 321 LQNEPKYEHLRNLHKAIKQCEPALVATDPKVQSLGYNLEAHVF-STPGACAAFIANYDTK 379

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A  TF    Y LP WS+SILPDCK VV+NTAKV      G+    +   VN      S
Sbjct: 380 SYAKATFGNGQYDLPPWSISILPDCKTVVYNTAKV------GNSWLKKMTPVN------S 427

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AF+W    EE    S   S     L EQ+N T+D+SDYLWY   +++   +     G+  
Sbjct: 428 AFAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVYINANEGFLKNGQSP 487

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L   S GH   VF+N +L    +G         +  ++L  G N L +LS+ VGL N G
Sbjct: 488 VLTAMSAGHVLHVFINDQLAGTVWGGLANPKLTFSDNVKLRVGNNKLSLLSVAVGLPNVG 547

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  L  G RDLSS +W Y+VG++GE + L   S ++S  W +GS 
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSSQKWSYKVGLKGESLSLHTESGSSSVEWIRGSL 607

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYKTTF AP G  PLAL+L SMGKG+ WVNG+SIGR+W  Y+A   G    C
Sbjct: 608 VAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIA--HGSCNAC 665

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G Y  +KC+ +CGQP+Q  YH+PR+W+  G N LV+ EE GGDP+ I+L+ +T
Sbjct: 666 NYAGFYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 721


>gi|357139090|ref|XP_003571118.1| PREDICTED: beta-galactosidase 4-like [Brachypodium distachyon]
          Length = 787

 Score =  801 bits (2069), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/714 (55%), Positives = 499/714 (69%), Gaps = 26/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDHR+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+V++TYVFWN HE
Sbjct: 91  NAAVSYDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVVQTYVFWNGHE 150

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQYYF  R+DL+RFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 151 PVKGQYYFSDRYDLIRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 210

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF+ KI+ +MK E LF  QGGPII++QVENE+G +E A GVG + Y  WAA
Sbjct: 211 NGPFKAEMQRFVEKIVSMMKSERLFEWQGGPIIMSQVENEFGPMESAGGVGAKPYANWAA 270

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV  NT VPWVMC+QEDAPDP+INTCNGFYCD FTPN  +KP MWTE ++GWF SFG 
Sbjct: 271 KMAVATNTGVPWVMCKQEDAPDPVINTCNGFYCDYFTPNKKNKPAMWTEAWTGWFTSFGG 330

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RPVED+AFAVARF + GG+F NYYMY GGTNFGRTAGGP VATSYDYDAPIDE+G 
Sbjct: 331 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVATSYDYDAPIDEFGL 390

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIK  E  L+S DPT Q LG   +A+++   +  CAAFL+NY  +S
Sbjct: 391 LRQPKWGHLRDLHKAIKQAEPTLVSGDPTIQSLGNYEKAYVFKSKNGACAAFLSNYHMNS 450

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V FNG  Y LPAWS+SILPDCK VVFNTA V            ++  +   +     
Sbjct: 451 AVKVRFNGRHYDLPAWSISILPDCKTVVFNTATV------------KEPTLLPKMHPVVR 498

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVFLN 475
           F+W  Y E      + +F +  L EQ++ T D SDYLWYT  +++ PG+    G+   L 
Sbjct: 499 FTWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDYLWYTTFVNIGPGELSKNGQWPQLT 558

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GH+  VFVN K     YG  +      +  +++ +G N + ILS  VGL N G  F
Sbjct: 559 VYSAGHSMQVFVNGKSYGSVYGGFENPKLTYDGHVKMWQGSNKISILSSAVGLPNVGDHF 618

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLP 593
           +    G+   V L  L  GKRDLS  +W YQVG++GE +G+  +S +++  W   GS  P
Sbjct: 619 ERWNVGVLGPVTLSGLSEGKRDLSHQKWTYQVGLKGESLGIHTVSGSSAVEWGGPGSKQP 678

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
               L W+K  F AP G  P+AL++ SMGKGQ WVNG  +GRYWS Y APS GC   C Y
Sbjct: 679 ----LTWHKALFNAPSGSDPVALDMGSMGKGQMWVNGHHVGRYWS-YKAPSRGC-GGCSY 732

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y   KC+  CG+ +Q  YH+PR+W+ PG NLLV+ EE GGD + ++L T+T
Sbjct: 733 AGTYREDKCRSSCGELSQRWYHVPRSWLKPGGNLLVVLEEYGGDVAGVTLATRT 786


>gi|7682677|gb|AAF67341.1| beta galactosidase [Vigna radiata]
          Length = 721

 Score =  800 bits (2066), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/716 (54%), Positives = 496/716 (69%), Gaps = 24/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+VIDGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21  VTASVTYDHKAIVIDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFE R+DLVRFVK  Q+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 81  EPSPGKYYFEDRYDLVRFVKLAQQAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F AKI+ LMK+E LF SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKEERLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN  +KP MWTEN++GW+  FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A P RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG  +ATSYDYDAP+DEYG
Sbjct: 261 GASPIRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
              +PKWGHLR LHKAIK  E  L+S+DP    LG  LEAH++  +   CAAF+ANYD+ 
Sbjct: 321 LQNEPKWGHLRALHKAIKQSEPALVSTDPKVTSLGYNLEAHVF-STPGACAAFIANYDTK 379

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A  TF    Y LP WS+SILPDCK VV+NTA+V     NG         V ++   +S
Sbjct: 380 SSAKATFGSGQYDLPPWSISILPDCKTVVYNTARV----GNG--------WVKKMTPVNS 427

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            F+W    EE    S + S     L EQ+N T+D+SDYLWY   +++   +     G+  
Sbjct: 428 GFAWQSYNEEPASSSQDDSIAAEALWEQVNVTRDSSDYLWYMTDVYINGNEGFLKNGRSP 487

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GH   VF+N +L    YG         +  + L  G N L +LS+ VGL N G
Sbjct: 488 VLTVMSAGHLLHVFINGQLSGTVYGGLGNPKLTFSDNVNLRVGNNKLSLLSVAVGLPNVG 547

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  L  G RDLS  +W Y+VG++GE + L   S ++S  W QGS 
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGEALNLHTESGSSSVEWIQGSL 607

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK TF AP G  PLAL+L SMGKG+ WVNG+SIGR+W  Y+A   G    C
Sbjct: 608 VAKKQPLTWYKATFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIA--HGSCNAC 665

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G Y   KC+ +CG+P+Q  YH+PR+W++ G N LV+ EE GGDP+ I+L+ +T
Sbjct: 666 NYAGYYTDQKCRTNCGKPSQRWYHVPRSWLNSGGNSLVVFEEWGGDPNGIALVKRT 721


>gi|255550373|ref|XP_002516237.1| beta-galactosidase, putative [Ricinus communis]
 gi|223544723|gb|EEF46239.1| beta-galactosidase, putative [Ricinus communis]
          Length = 825

 Score =  799 bits (2063), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 409/835 (48%), Positives = 535/835 (64%), Gaps = 48/835 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A +++D RA+ IDGKRRVL SGSIHYPRSTP++WP+LI+KSKEGGL+ IETYVFWN HE
Sbjct: 22  AAVISHDGRAITIDGKRRVLLSGSIHYPRSTPQMWPDLIKKSKEGGLDAIETYVFWNVHE 81

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P R QY F G  DLVRF+K VQ+ GL+  LRIGPY CAEWNYGGFPVWLH +PGI+ RT 
Sbjct: 82  PSRRQYDFGGNLDLVRFIKAVQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGIELRTA 141

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N+ F  EM+ F + I+D+MKQE LFASQGGPII+AQVENEYGNV  +YG  G+ Y+ W A
Sbjct: 142 NSIFMNEMQNFTSLIVDMMKQEQLFASQGGPIIIAQVENEYGNVMSSYGAAGKAYIDWCA 201

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A +LN  VPW+MCQQ DAPDP+INTCNG+YCD FTP++P+ P MWTEN++GWF S+G 
Sbjct: 202 NMAESLNIGVPWIMCQQSDAPDPMINTCNGWYCDQFTPSNPNSPKMWTENWTGWFKSWGG 261

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P R  ED+AFAVARFF+TGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DE+G 
Sbjct: 262 KDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEFGN 321

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           + QPKWGHL++LH  +   EE L S   +       + A IY  +  + + FL+N + +S
Sbjct: 322 LNQPKWGHLKQLHDVLHSMEEILTSGTVSSVDYDNSVTATIY-ATDKESSCFLSNANETS 380

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DA + F G  Y +PAWSVSILPDC NV +NTAKV +Q +       ++ N  E    S  
Sbjct: 381 DATIEFKGTTYTIPAWSVSILPDCANVGYNTAKVKTQTS----VMVKRDNKAEDEPTSLN 436

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLN 475
           +SW  E V    + G        + +Q     D SDYLWY  S+ +        K++ + 
Sbjct: 437 WSWRPENVDKTVLLGQGHIHAKQIVDQKAVANDASDYLWYMTSVDLKKDDLIWSKDMSIR 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I   GH    +VN + +   +  +  +N++  K ++L  G N + +LS  VGL NYGA +
Sbjct: 497 INGSGHILHAYVNGEYLGSQWSEYSVSNYVFEKSVKLKHGRNLITLLSATVGLANYGANY 556

Query: 536 DVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGL-DKISLANS---SFWK 587
           D+  AG+   + +  + G     +DLS+  W Y+VG+    +GL DK+ L++S   S W+
Sbjct: 557 DLIQAGILGPVELVGRKGDETIIKDLSNNRWSYKVGL----LGLEDKLYLSDSKHASKWQ 612

Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
           +   LP NK L WYKTTF AP G  P+ L+L  +GKG AW+NG SIGRYW ++LA   GC
Sbjct: 613 E-QELPTNKMLTWYKTTFKAPLGTDPVVLDLQGLGKGMAWINGNSIGRYWPSFLAEDDGC 671

Query: 648 -TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
            T  CDYRG YD +KC  +CG+P Q  YH+PR+++   EN LV+ EE GG+PS+++  T 
Sbjct: 672 STDLCDYRGPYDNNKCVSNCGKPTQRWYHVPRSFLQDNENTLVLFEEFGGNPSQVNFQTV 731

Query: 707 TGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
                C    E +                   V ++C  G  I+A+ FAS+G P+G CGS
Sbjct: 732 VTGVACVSGDEGEV------------------VEISC-NGQSISAVQFASFGDPQGTCGS 772

Query: 767 FRPGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
              G+C    D L IVQKACVG   CS+ VS    G +  +C   +  LAVE  C
Sbjct: 773 SVKGSCEGTEDALLIVQKACVGNESCSLEVSHKLFGST--SCDNGVNRLAVEVLC 825


>gi|212274513|ref|NP_001130532.1| uncharacterized protein LOC100191631 precursor [Zea mays]
 gi|194689400|gb|ACF78784.1| unknown [Zea mays]
 gi|224030521|gb|ACN34336.1| unknown [Zea mays]
 gi|413922054|gb|AFW61986.1| beta-galactosidase isoform 1 [Zea mays]
 gi|413922055|gb|AFW61987.1| beta-galactosidase isoform 2 [Zea mays]
 gi|413954366|gb|AFW87015.1| beta-galactosidase isoform 1 [Zea mays]
 gi|413954367|gb|AFW87016.1| beta-galactosidase isoform 2 [Zea mays]
          Length = 722

 Score =  796 bits (2057), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/714 (54%), Positives = 493/714 (69%), Gaps = 25/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25  NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQYYF  R+DLVRFVK  ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 85  PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + Y  WAA
Sbjct: 145 NGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV     VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG 
Sbjct: 205 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 264

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG 
Sbjct: 265 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 324

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIK  E  L+S DPT Q LG   +A+++  S   CAAFL+NY +S+
Sbjct: 325 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSA 384

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V FNG  Y LPAWS+S+LPDCK  VFNTA V         P A  +     +  +  
Sbjct: 385 AARVVFNGRRYDLPAWSISVLPDCKAAVFNTATV-------SEPSAPAR-----MSPAGG 432

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           FSW  Y E       R+F +  L EQ++ T D SDYLWYT  +++   +     G+   L
Sbjct: 433 FSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQL 492

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GH+  VFVN +     YG +D      +  +++ +G N + ILS  VGL N G  
Sbjct: 493 TIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTH 552

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++    G+   V L  L  GKRDLS  +W YQ+G+ GE +G+  ++ ++S  W   +   
Sbjct: 553 YETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG-- 610

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+K  F AP G  P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+GC   C Y
Sbjct: 611 -KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGC-GGCSY 667

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y  +KCQ  CG  +Q  YH+PR+W++P  NLLV+ EE GGD S + L+T+T
Sbjct: 668 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTRT 721


>gi|54111247|dbj|BAC10578.2| beta-galactosidase [Capsicum annuum]
          Length = 724

 Score =  796 bits (2057), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/716 (53%), Positives = 496/716 (69%), Gaps = 22/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + ANV+YD RA+VI+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21  VKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR+DLV+F+K VQ AGL+++LRIGPY CAEWN+GG PVWL ++ G++FRT
Sbjct: 81  EPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F+ KI+ +MK E LF  QGGPII+AQ+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 DNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GWF  FG
Sbjct: 201 AQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWFTKFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +P RP ED+AF+VARF +  G++ NYYMY GGTNFGRT+ G  +ATSYDYDAPIDEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PK+GHLRELHKAIK CE  L+SS PT   LG+  EAH+Y   S  CAAFL+NYD+ 
Sbjct: 321 LLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSGACAAFLSNYDAK 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               V+F    Y LP WS+SILPDCK VV+NTAKV SQ ++            ++  A  
Sbjct: 381 YSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSS-----------IKMTPAGG 429

Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E    + +   +R + L EQ N T+D+SDYLWY   I++   +     GK+ 
Sbjct: 430 GLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDINIASNEGFLKSGKDP 489

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
           +L + S GH   VFVN KL    YG  D      +  ++LN GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKISLLSVSVGLPNVG 549

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             +D   AG+   V L  L  G RDL+  +W Y+VG++GE + L  +S ++S  W QGS 
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTLSGSSSVEWVQGSL 609

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK TF AP G  PLAL++ASMGKGQ W+NG+ +GR+W  Y A   G   KC
Sbjct: 610 VARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWPGYAA--QGDCSKC 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G+++  KCQ +CGQP+Q  YH+PR+W+    NLLV+ EE GGDP+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPTGISLVRRS 723


>gi|356564721|ref|XP_003550597.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
          Length = 831

 Score =  795 bits (2053), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 411/833 (49%), Positives = 537/833 (64%), Gaps = 41/833 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L  NV++D RA+ IDGKRRVL SGSIHYPRSTPE+WPELI+K+KEGGL+ IETYVFWN H
Sbjct: 26  LHTNVSHDGRAIKIDGKRRVLISGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNAH 85

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP R  Y F G  D++RF+KT+QE+GL+  LRIGPY CAEWNYGG PVW+H +P ++ RT
Sbjct: 86  EPSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRT 145

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N+ F  EM+ F   I+D++K+E LFASQGGPIIL Q+ENEYGNV   YG  G+ Y+ W 
Sbjct: 146 ANSVFMNEMQNFTTLIVDMLKKEKLFASQGGPIILTQIENEYGNVISQYGDAGKAYMNWC 205

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A +L   VPW+MCQ+ DAP P+INTCNG+YCD F PNS + P MWTEN+ GWF ++G
Sbjct: 206 ANMAESLKVGVPWIMCQESDAPQPMINTCNGWYCDNFEPNSFNSPKMWTENWIGWFKNWG 265

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P R  ED+AFAVARFF+TGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG
Sbjct: 266 GRDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYG 325

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            I QPKWGHL+ELH A+K  EE L S + +   LG  ++  IY  ++   + FL+N +++
Sbjct: 326 NIAQPKWGHLKELHSALKAMEEALTSGNVSETDLGNSVKVTIY-ATNGSSSCFLSNTNTT 384

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +DA +TF GN Y +PAWSVSILPDC++  +NTAKV  Q +       ++ +  E   A  
Sbjct: 385 ADATLTFRGNNYTVPAWSVSILPDCQHEEYNTAKVKEQTS----VMTKENSKAEKEAAIL 440

Query: 421 AFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLN 475
            + W  E +   + G  +     L +Q +   D SDYLWY   +HV    P   + + L 
Sbjct: 441 KWVWRSENIDKALHGKSNVSAHRLLDQKDAANDASDYLWYMTKLHVKHDDPVWSENMTLR 500

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I   GH    FVN + +   +  +   N     KI+L  G NT+ +LS+ VGLQNYGA+F
Sbjct: 501 INGSGHVIHAFVNGEYIDSHWATYGIHNDKFEPKIKLKHGTNTISLLSVTVGLQNYGAFF 560

Query: 536 DVAGAGLFSVI-LIDLKNGK---RDLSSGEWIYQVGVEG--EYIGLDKISLANSSFWKQG 589
           D   AGL   I L+ +K  +   ++LSS +W Y++G+ G    +  D    A  S W + 
Sbjct: 561 DTWHAGLVGPIELVSVKGEETIIKNLSSHKWSYKIGLHGWDHKLFSDDSPFAAQSKW-ES 619

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
             LP N+ L WYKTTF AP G  P+ ++L  MGKG AWVNG++IGR W +Y A   GC+ 
Sbjct: 620 EKLPTNRMLTWYKTTFKAPLGTDPVVVDLQGMGKGYAWVNGKNIGRIWPSYNAEEDGCSD 679

Query: 650 K-CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
           + CDYRG Y  SKC  +CG+P Q  YH+PR+++  G N LV+  ELGG+PS ++  T   
Sbjct: 680 EPCDYRGEYSDSKCVTNCGKPTQRWYHVPRSYLKDGANTLVLFAELGGNPSLVNFQTVVV 739

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
            ++C+   E                  +  + L+C+ G  I+AI FAS+G P+G CG+F 
Sbjct: 740 GNVCANAYE------------------NKTLELSCQ-GRKISAIKFASFGDPKGVCGAFT 780

Query: 769 PGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G+C    + LPIVQKACVG+  CSI +S    G  A AC  L K LAVEA C
Sbjct: 781 NGSCESKSNALPIVQKACVGKEACSIDLSEKTFG--ATACGNLAKRLAVEAVC 831


>gi|13936236|gb|AAK40304.1| beta-galactosidase [Capsicum annuum]
          Length = 724

 Score =  795 bits (2053), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/716 (53%), Positives = 495/716 (69%), Gaps = 22/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + ANV+YD RA+VI+GKR++L SGSIHYPRSTP++WP+LI K+K+GGL+VIETYVFWN H
Sbjct: 21  VKANVSYDDRAIVINGKRKILISGSIHYPRSTPQMWPDLIEKAKDGGLDVIETYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR+DLV+F+K VQ AGL+++LRIGPY CAEWN+GG PVWL ++ G++FRT
Sbjct: 81  EPSPGKYNFEGRYDLVKFIKLVQGAGLYVNLRIGPYICAEWNFGGLPVWLKYVSGMEFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F+ KI+ +MK E LF  QGGPII+AQ+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 DNQPFKVAMQGFVQKIVSMMKSEKLFEPQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GWF  FG
Sbjct: 201 AQMAVGLKTDVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWFTKFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +P RP ED+AF+VARF +  G++ NYYMY GGTNFGRT+ G  +ATSYDYDAPIDEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSYFNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PK+GHLRELHKAIK CE  L+SS PT   LG+  EAH+Y   S  CAAFL+NYD+ 
Sbjct: 321 LLNEPKYGHLRELHKAIKQCEPALVSSYPTVTSLGSNQEAHVYRSKSGACAAFLSNYDAK 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               V+F    Y LP WS+SILPDCK VV+NTAKV SQ ++            ++  A  
Sbjct: 381 YSVRVSFQNLPYDLPPWSISILPDCKTVVYNTAKVSSQGSS-----------IKMTPAGG 429

Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y E    + +   +R + L EQ N T+D+SDYLWY   +++   +     GK+ 
Sbjct: 430 GLSWQSYNEDTPTADDSDTLRANGLWEQRNVTRDSSDYLWYMTDVNIASNEGFLKSGKDP 489

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
           +L + S GH   VFVN KL    YG  D      +  ++LN GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLAGTVYGALDNPKLTYSGNVKLNAGINKISLLSVSVGLPNVG 549

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             +D   AG+   V L  L  G RDL+  +W Y+VG++GE + L  +S ++S  W QGS 
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRDLAKQKWSYKVGLKGESLSLHTLSGSSSVEWVQGSL 609

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK TF AP G  PLAL++ASMGKGQ W+NG+ +GR+W  Y A   G   KC
Sbjct: 610 VARTQPLTWYKATFSAPGGNEPLALDMASMGKGQIWINGEGVGRHWPGYAA--QGDCSKC 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G+++  KCQ +CGQP+Q  YH+PR+W+    NLLV+ EE GGDP+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKTSGNLLVVFEEWGGDPTGISLVRRS 723


>gi|193850557|gb|ACF22882.1| beta-galactosidase [Glycine max]
          Length = 721

 Score =  794 bits (2050), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/716 (54%), Positives = 494/716 (68%), Gaps = 24/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+V+DGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21  VTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFE RFDLV+FVK  Q+AGL++HLRIGPY CAEWN GGFPVWL ++PGI FRT
Sbjct: 81  EPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F AKI+ LMK+  LF SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN  +KP MWTEN++GW+  FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG  +ATSYDYDAP+DEYG
Sbjct: 261 GAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
              +PK+ HLR LHKAIK  E  L+++DP  Q LG  LEAH++  +   CAAF+ANYD+ 
Sbjct: 321 LENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVF-SAPGACAAFIANYDTK 379

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A   F    Y LP WS+SILPDCK VV+NTAKV      G     +   VN      S
Sbjct: 380 SYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKV------GYGWLKKMTPVN------S 427

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AF+W    EE    S   S     L EQ+N T+D+SDYLWY   ++V   +     G+  
Sbjct: 428 AFAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSP 487

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GH   VF+N +L    +G         +  ++L  G N L +LS+ VGL N G
Sbjct: 488 LLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVG 547

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  L  G RDLS  +W Y+VG++GE + L   S ++S  W QGS 
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSL 607

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYKTTF AP G  PLAL+L SMGKG+ WVNG+SIGR+W  Y+A   G    C
Sbjct: 608 VAKKQPLTWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGRSIGRHWPGYIA--HGSCNAC 665

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G Y  +KC+ +CGQP+Q  YH+PR+W+  G N LV+ EE GGDP+ I+L+ +T
Sbjct: 666 NYAGYYTDTKCRTNCGQPSQRWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 721


>gi|18148449|dbj|BAB83260.1| beta-D-galactosidase [Persea americana]
          Length = 766

 Score =  793 bits (2047), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/735 (53%), Positives = 508/735 (69%), Gaps = 25/735 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + +VTYD +A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFW+ HE
Sbjct: 34  TCSVTYDRKAIVINGQRRILISGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWDGHE 93

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFEGR+DLV+F+K V++AGL+++LRIGPY CAEWN GGFPVWL +IPGI FRT 
Sbjct: 94  PSPGKYYFEGRYDLVKFIKLVKQAGLYVNLRIGPYICAEWNLGGFPVWLKYIPGISFRTD 153

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M  F  KI+++MK E+LF  QGGPII++Q+ENEYG VEW  G  G++Y +WAA
Sbjct: 154 NEPFKRYMAGFTKKIVEMMKAESLFEPQGGPIIMSQIENEYGPVEWEIGAIGKVYTRWAA 213

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AVNLNT VPW+MC+Q++ PDPIINTCNGFYCD F PN   KPIMWTE ++GWF +FG 
Sbjct: 214 SMAVNLNTGVPWIMCKQDEVPDPIINTCNGFYCDWFKPNKDYKPIMWTELWTGWFTAFGG 273

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP+RPVED+A+AV +F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 274 PVPYRPVEDVAYAVVKFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGL 333

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R+PKWGHLR+LH+AIK+CE  L+S+DPT  K+G   EAH++   S  C+AFL N D ++
Sbjct: 334 KREPKWGHLRDLHRAIKMCEPALVSNDPTVTKIGDSQEAHVFKFESGACSAFLENKDETN 393

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS-S 420
              VTF G  Y LP WS+SILPDC NVV+NT +V             Q ++  +L AS +
Sbjct: 394 FVKVTFQGMQYELPPWSISILPDCVNVVYNTGRV-----------GTQTSMMTMLSASNN 442

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
            FSW  Y E        S     L+EQI+ TKD++DYL YT  + +   +     G+   
Sbjct: 443 EFSWASYNEDTASYNEESMTIEGLSEQISITKDSTDYLRYTTDVTIGQNEGFLKNGEYPV 502

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S GHA  VFVN +L    YG+ +      + K++L  G N + +LS  VGL N G 
Sbjct: 503 LTVNSAGHALQVFVNGQLSGTAYGSVNDPRLTFSGKVKLWAGNNKISLLSSAVGLPNVGT 562

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            F+    G+   V L  L  GKRDLS  +W Y+VGV GE + L   + ++S  W  GS+ 
Sbjct: 563 HFETWNYGVLGPVTLNGLNEGKRDLSLQKWSYKVGVIGEALQLHSPTGSSSVEW--GSST 620

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              +   WYKTTF AP G  PLAL++ +MGKGQ W+NGQSIGRYW AY A   G    C 
Sbjct: 621 SKIQPFTWYKTTFNAPGGNDPLALDMNTMGKGQIWINGQSIGRYWPAYKA--NGKCSACH 678

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           Y G YD  KC  +CG+ +Q  YHIPR+W++P  NLLV+ EE GGDP+ I+L+ +T    C
Sbjct: 679 YTGWYDEKKCGFNCGEASQRWYHIPRSWLNPTGNLLVVFEEWGGDPTGITLVRRTIGSAC 738

Query: 713 SFVSEADPPPVDSWK 727
           ++++E   P V +WK
Sbjct: 739 AYINEWH-PTVKNWK 752


>gi|242093394|ref|XP_002437187.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
 gi|241915410|gb|EER88554.1| hypothetical protein SORBIDRAFT_10g022620 [Sorghum bicolor]
          Length = 725

 Score =  791 bits (2044), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/714 (54%), Positives = 494/714 (69%), Gaps = 25/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP+L++K+K+GGL+V++TYVFWN HE
Sbjct: 28  NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVVQTYVFWNGHE 87

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +GQYYF  R+DLVRFVK  ++AGLF+HLRIGPY CAEWN+GGFPVWL ++PG+ FRT 
Sbjct: 88  PQQGQYYFGDRYDLVRFVKLAKQAGLFVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 147

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + Y  WAA
Sbjct: 148 NAPFKAAMQAFVEKIVSMMKAEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAA 207

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV     VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG 
Sbjct: 208 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 267

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG 
Sbjct: 268 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 327

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIK  E  L+S DPT Q +G   +A++Y  SS  CAAFL+NY +++
Sbjct: 328 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQTIGNYEKAYVYKSSSGACAAFLSNYHTNA 387

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V FNG  Y LPAWS+S+LPDC+  VFNTA V S       P A  +     +  +  
Sbjct: 388 AARVVFNGRRYDLPAWSISVLPDCRTAVFNTATVSS-------PSAPAR-----MTPAGG 435

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           FSW  Y E      +R+F +  L EQ++ T D SDYLWYT  +++   +     G+   L
Sbjct: 436 FSWQSYSEATNSLDDRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQL 495

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VFVN +     YG +D      +  +++ +G N + ILS  VGL N G  
Sbjct: 496 TIYSAGHALQVFVNGQSYGAAYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTH 555

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++    G+   V L  L  GKRDLS+ +W YQ+G+ GE +G+  ++ ++S  W   +   
Sbjct: 556 YEAWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVHSVAGSSSVEWGSAAG-- 613

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+K  F AP G  P+AL+++SMGKGQAWVNG  IGRYWS Y A    C   C Y
Sbjct: 614 -KQPLTWHKAYFNAPSGNAPVALDMSSMGKGQAWVNGHHIGRYWS-YKATGGSC-GGCSY 670

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y  +KCQ  CG  +Q  YH+PR+W++P  NLLV+ EE GGD S + L+T+T
Sbjct: 671 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLVTRT 724


>gi|68161828|emb|CAJ09953.1| beta-galactosidase [Mangifera indica]
          Length = 827

 Score =  791 bits (2042), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/840 (48%), Positives = 536/840 (63%), Gaps = 60/840 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV++D RA++IDG+RRVL SGSIHYPRSTPE+WP+LIRK+KEGGL+ IETYVFWN HEP 
Sbjct: 24  NVSHDGRAIIIDGQRRVLLSGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNAHEPA 83

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQ-FRTTN 122
           R QY F G  DL+RF+KT+Q+ GL+  LRIGPY CAEWNYGGFPVWLH +PG+Q FRT N
Sbjct: 84  RRQYDFSGHLDLIRFIKTIQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGVQEFRTVN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             F  EM+ F   I+D++KQE LFASQGGPII+AQ+ENEYGN+   YG  G++Y+ W A 
Sbjct: 144 EVFMNEMQNFTTLIVDMVKQEKLFASQGGPIIIAQIENEYGNMISNYGDAGKVYIDWCAK 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A +L+  VPW+MCQ+ DAP P+INTCNG+YCD FTPN P+ P MWTEN++GWF S+G  
Sbjct: 204 MAESLDIGVPWIMCQESDAPQPMINTCNGWYCDSFTPNDPNSPKMWTENWTGWFKSWGGK 263

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P R  EDLAF+VARFF+TGGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP+DE+G +
Sbjct: 264 DPHRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRTSGGPYLTTSYDYDAPLDEFGNL 323

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
            QPKWGHL+ELH  +K  E+ L   + +    G  + A +Y  +    + F  N +++ D
Sbjct: 324 NQPKWGHLKELHTVLKAMEKTLTHGNVSTTDFGNSVTATVY-ATEEGSSCFFGNANTTGD 382

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A +TF G+ Y +PAWSVSILPDCK   +NTAKV +Q +       ++ N  E   +S  +
Sbjct: 383 ATITFQGSDYVVPAWSVSILPDCKTEAYNTAKVNTQTS----VIVKKPNQAENEPSSLKW 438

Query: 423 SWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
            W  E +    + G  SF    L +Q     D SDYLWY  S+ + P        + L +
Sbjct: 439 VWRPEAIDEPVVQGKGSFSASFLIDQ-KVINDASDYLWYMTSVDLKPDDIIWSDNMTLRV 497

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            + G     FVN + V   +  +     +  ++++LN G N + +LS+ VGLQNYG  FD
Sbjct: 498 NTTGIVLHAFVNGEHVGSQWTKYGVFKDVFQQQVKLNPGKNQISLLSVTVGLQNYGPMFD 557

Query: 537 VAGAGLFS-VILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST- 591
           +  AG+   V LI  K  +   +DLS  +W Y+VG+ G         L ++ F+ + ST 
Sbjct: 558 MVQAGITGPVELIGQKGDETVIKDLSCHKWTYEVGLTG---------LEDNKFYSKASTN 608

Query: 592 ---------LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
                    +P N  + WYKTTF AP G  P+ L+L  MGKG AWVNG ++GRYW +YLA
Sbjct: 609 ETCGWSAENVPSNSKMTWYKTTFKAPLGNDPVVLDLQGMGKGFAWVNGYNLGRYWPSYLA 668

Query: 643 PSTGCTKK-CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKI 701
            + GC+   CDYRG YD +KC  +CGQP+Q  YH+PR+++  GEN LV+ EE GG+P ++
Sbjct: 669 EADGCSSDPCDYRGQYDNNKCVTNCGQPSQRWYHVPRSFLQDGENTLVLFEEFGGNPWQV 728

Query: 702 SLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPE 761
           +  T     +C                  G       + L+C  G  I+AI FAS+G P+
Sbjct: 729 NFQTLVVGSVC------------------GNAHEKKTLELSC-NGRPISAIKFASFGDPQ 769

Query: 762 GNCGSFRPGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G CGSF+ G C    D+LP++Q+ CVG+  CSI +S   LG +   C  ++K LAVEA C
Sbjct: 770 GTCGSFQAGTCQTEQDILPVLQQECVGKETCSIDISEDKLGKT--NCGSVVKKLAVEAVC 827


>gi|3860420|emb|CAA09467.1| exo galactanase [Lupinus angustifolius]
          Length = 730

 Score =  791 bits (2042), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/714 (54%), Positives = 493/714 (69%), Gaps = 21/714 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 31  VTASVTYDHKAIMINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 90

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFE RFDLV F+K VQ+AGLF+HLRIGP+ CAEWN+GGFPVWL ++PGI FRT
Sbjct: 91  EPSPGKYYFEDRFDLVGFIKLVQQAGLFVHLRIGPFICAEWNFGGFPVWLKYVPGIAFRT 150

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFKE M++F  KI+++MK E LF SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 151 DNEPFKEAMQKFTEKIVNIMKAEKLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 210

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+QEDAPDPII+TCNGFYC+ FTPN   KP +WTEN++GW+ +FG
Sbjct: 211 AQMAVGLDTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNYKPKLWTENWTGWYTAFG 270

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A P+RP ED+AF+VARF +  G+  NYYMY GGTNFGRT+ G  VATSYDYDAPIDEYG
Sbjct: 271 GATPYRPAEDIAFSVARFIQNRGSLFNYYMYHGGTNFGRTSNGLFVATSYDYDAPIDEYG 330

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PKWGHLRELH+AIK CE  L+S DPT    G  LE H+Y K+ + CAAFLANY++ 
Sbjct: 331 LLNEPKWGHLRELHRAIKQCESALVSVDPTVSWPGKNLEVHLY-KTESACAAFLANYNTD 389

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               V F    Y LP WS+SILPDCK  VFNTAKV S R +            ++   +S
Sbjct: 390 YSTQVKFGNGQYDLPPWSISILPDCKTEVFNTAKVNSPRLH-----------RKMTPVNS 438

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG---QGKEVFL 474
           AF+W    EE    S N       L EQ+  T+D+SDYLWY   +++ P     GK   L
Sbjct: 439 AFAWQSYNEEPASSSENDPVTGYALWEQVGVTRDSSDYLWYLTDVNIGPNDIKDGKWPVL 498

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
              S GH   VF+N +     YG+ D      ++ + L  G N + +LS+ VGL N G  
Sbjct: 499 TAMSAGHVLNVFINGQYAGTAYGSLDDPRLTFSQSVNLRVGNNKISLLSVSVGLANVGTH 558

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+    G+   V L  L +G  DLS  +W Y++G++GE + L   + +NS  W QGS + 
Sbjct: 559 FETWNTGVLGPVTLTGLSSGTWDLSKQKWSYKIGLKGESLSLHTEAGSNSVEWVQGSLVA 618

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYKTTF AP G  PLAL+L SMGKG+ WVNGQSIGR+W    A   G    C+Y
Sbjct: 619 KKQPLAWYKTTFSAPAGNDPLALDLGSMGKGEVWVNGQSIGRHWPGNKA--RGNCGNCNY 676

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y  +KC  +CGQP+Q  YH+PR+W+  G N LV+ EE GGDP+ I+L+ +T
Sbjct: 677 AGTYTDTKCLANCGQPSQRWYHVPRSWLRSGGNYLVVLEEWGGDPNGIALVERT 730


>gi|357124047|ref|XP_003563718.1| PREDICTED: beta-galactosidase 9-like isoform 1 [Brachypodium
           distachyon]
          Length = 719

 Score =  790 bits (2040), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/714 (53%), Positives = 494/714 (69%), Gaps = 26/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDH+A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23  NAAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQYYF  R+DLVRFVK  ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 83  PVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + Y  WAA
Sbjct: 143 NGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV     VPWVMC+Q+DAPDP+INTCNGFYCD FTPNS  KP MWTE +SGWF +FG 
Sbjct: 203 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG 
Sbjct: 263 AVPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIK  E  ++S DPT Q +G   +A+++  S+  CAAFL+NY +SS
Sbjct: 323 LRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSS 382

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V +NG  Y LPAWS+SILPDCK  V+NTA V         P A  K     +  +  
Sbjct: 383 PAKVVYNGRRYELPAWSISILPDCKTAVYNTATV-------KEPSAPAK-----MNPAGG 430

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           FSW  Y E      + +F +  L EQ++ T D SD+LWYT  +++   +     G+   L
Sbjct: 431 FSWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWPQL 490

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GH   VFVN +    GYG +D      +K +++ +G N + ILS  VGL N G  
Sbjct: 491 TINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQGTH 550

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++    G+   V L  L  GKRDLS+ +W YQ+G++GE +G+  I+ ++S  W   +   
Sbjct: 551 YENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSANGA- 609

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+K  F AP G  P+AL++ SMGKGQ WVNG++ GRYWS     ++G    C Y
Sbjct: 610 --QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWS---YKASGSCGSCSY 664

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y  +KCQ +CG  +Q  YH+PR+W++P  NLLV+ EE GGD S + L+T+T
Sbjct: 665 TGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTRT 718


>gi|326497687|dbj|BAK05933.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 716

 Score =  789 bits (2038), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/710 (54%), Positives = 498/710 (70%), Gaps = 26/710 (3%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           +YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP RG
Sbjct: 24  SYDHRAVVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPARG 83

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QY+F  R+DLVRFVK  ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 84  QYHFADRYDLVRFVKLARQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGPF 143

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K EM+RF+ KI+ +MK E LF  QGGPIILAQVENEYG +E A G G + Y  WAA+ AV
Sbjct: 144 KAEMQRFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESAMGAGAKPYANWAANMAV 203

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
             +  VPWVMC+Q+DAPDP+INTCNGFYCD FTPNS SKP MWTE ++GWF +FG  VP 
Sbjct: 204 ATDAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNSKPTMWTEAWTGWFTAFGGPVPH 263

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVED+AFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG IRQP
Sbjct: 264 RPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLIRQP 323

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHLR+LHKAIK  E  L+S DPT Q++G   +A+++  S+  CAAFL+NY +SS A +
Sbjct: 324 KWGHLRDLHKAIKQAEPALVSGDPTIQRIGNYEKAYVFKSSTGACAAFLSNYHTSSAARI 383

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
            +NG  Y LPAWS+SILPDCK  VFNTA V         P A  K     +  +  F+W 
Sbjct: 384 VYNGRRYDLPAWSISILPDCKTAVFNTATV-------KEPTAPAK-----MNPAGGFAWQ 431

Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
            Y E      + +F +  L EQ++ T D SDYLWYT  +++   +     G+   L I S
Sbjct: 432 SYSEDTNALDSSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSSEQFLKTGQWPQLTINS 491

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH+  VFVN +     YG ++      +K +++ +G N + ILS  +GL N G  ++  
Sbjct: 492 AGHSVQVFVNGQSFGVAYGGYNSPKLTYSKPVKMWQGSNKISILSSAMGLPNQGTHYEAW 551

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             G+   V L  L  GKRDLS+ +W YQ+G++GE +G++ IS ++S    + S+    + 
Sbjct: 552 NVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVNSISGSSSV---EWSSASGAQP 608

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L W+K  F AP G  P+AL++ SMGKGQ WVNG + GRYWS Y A  +G    C Y G++
Sbjct: 609 LTWHKAYFAAPAGSAPVALDMGSMGKGQIWVNGNNAGRYWS-YRA--SGSCGGCSYAGTF 665

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
             +KCQ +CG  +Q  YH+PR+W+ P  NLLV+ EE GGD S ++L+T+T
Sbjct: 666 SEAKCQTNCGDISQRWYHVPRSWLKPSGNLLVVLEEFGGDLSGVTLMTRT 715


>gi|357124049|ref|XP_003563719.1| PREDICTED: beta-galactosidase 9-like isoform 2 [Brachypodium
           distachyon]
          Length = 721

 Score =  788 bits (2036), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/716 (53%), Positives = 496/716 (69%), Gaps = 28/716 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDH+A+VI+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 23  NAAVSYDHKAIVINGQRRILMSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQYYF  R+DLVRFVK  ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 83  PVQGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + Y  WAA
Sbjct: 143 NGPFKAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGGGAKPYANWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV     VPWVMC+Q+DAPDP+INTCNGFYCD FTPNS  KP MWTE +SGWF +FG 
Sbjct: 203 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNSNGKPNMWTEAWSGWFTAFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG 
Sbjct: 263 AVPHRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIK  E  ++S DPT Q +G   +A+++  S+  CAAFL+NY +SS
Sbjct: 323 LRQPKWGHLRDLHKAIKQAEPAMVSGDPTIQSIGNYEKAYVFKSSTGACAAFLSNYHTSS 382

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V +NG  Y LPAWS+SILPDCK  V+NTA V            +QK   + L  + A
Sbjct: 383 PAKVVYNGRRYELPAWSISILPDCKTAVYNTATV------------RQKWKEKKLWMNPA 430

Query: 422 --FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             FSW  Y E      + +F +  L EQ++ T D SD+LWYT  +++   +     G+  
Sbjct: 431 GGFSWQSYSEDTNSLDDSAFTKDGLVEQLSMTWDKSDFLWYTTYVNIDSSEQFLKSGQWP 490

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L I S GH   VFVN +    GYG +D      +K +++ +G N + ILS  VGL N G
Sbjct: 491 QLTINSAGHTLQVFVNGQSYGAGYGGYDSPKLSYSKYVKMWQGSNKISILSSAVGLANQG 550

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             ++    G+   V L  L  GKRDLS+ +W YQ+G++GE +G+  I+ ++S  W   + 
Sbjct: 551 THYENWNVGVLGPVTLSGLNQGKRDLSNQKWTYQIGLKGESLGVHSITGSSSVEWGSANG 610

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
               + L W+K  F AP G  P+AL++ SMGKGQ WVNG++ GRYWS     ++G    C
Sbjct: 611 A---QPLTWHKAYFSAPAGGAPVALDMGSMGKGQIWVNGRNAGRYWS---YKASGSCGSC 664

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G+Y  +KCQ +CG  +Q  YH+PR+W++P  NLLV+ EE GGD S + L+T+T
Sbjct: 665 SYTGTYSETKCQTNCGDISQRWYHVPRSWLNPSGNLLVVLEEFGGDLSGVKLMTRT 720


>gi|15219534|ref|NP_175127.1| beta-galactosidase 5 [Arabidopsis thaliana]
 gi|75192251|sp|Q9MAJ7.1|BGAL5_ARATH RecName: Full=Beta-galactosidase 5; Short=Lactase 5; Flags:
           Precursor
 gi|7767665|gb|AAF69162.1|AC007915_14 F27F5.20 [Arabidopsis thaliana]
 gi|17979002|gb|AAL47461.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
 gi|20334754|gb|AAM16238.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
 gi|332193961|gb|AEE32082.1| beta-galactosidase 5 [Arabidopsis thaliana]
          Length = 732

 Score =  788 bits (2035), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/713 (53%), Positives = 488/713 (68%), Gaps = 19/713 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29  SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89  SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK+   FASQGGPIIL+Q+ENE+       G  G  YV WAA 
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV LNT VPWVMC+++DAPDPIINTCNGFYCD FTPN P KP MWTE +SGWF  FG  
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 VPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           ++PK+ HL++LH+AIK CE  L+SSDP   KLG   EAH++      C AFL NY  ++ 
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LPAWS+SILPDC+NVVFNTA V ++ ++      Q      +L + +  
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMVPSGSILYSVAR- 442

Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E +   GNR  +    L EQ+N T+DT+DYLWYT S+ +   +     GK   L +
Sbjct: 443 --YDEDIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VFVN       +G  +   F  + ++ L  G N + +LS+ VGL N G  F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFE 560

Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+  SV+L  L  G +DLS  +W YQ G+ GE + L   +  +S  W +GS    N
Sbjct: 561 TWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN 620

Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           K  L WYK  F AP G  PLAL+L SMGKGQAW+NGQSIGRYW A+   + G    C+Y 
Sbjct: 621 KQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGDCGSCNYA 677

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G+Y  +KCQ  CG+P Q  YH+PR+W+ P  NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730


>gi|224053294|ref|XP_002297749.1| predicted protein [Populus trichocarpa]
 gi|222845007|gb|EEE82554.1| predicted protein [Populus trichocarpa]
          Length = 823

 Score =  787 bits (2032), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 403/831 (48%), Positives = 535/831 (64%), Gaps = 42/831 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           ++ VTYD RA++IDGK R+L SGSIHYPRST ++WP+L++KS+EGGL+ IETYVFW+ HE
Sbjct: 22  ASKVTYDGRAIIIDGKHRLLVSGSIHYPRSTAQMWPDLVKKSREGGLDAIETYVFWDSHE 81

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P R +Y F G  DL+RF+KT+Q+ GL+  LRIGPY CAEWNYGGFPVWLH +PG+Q RT 
Sbjct: 82  PARREYDFSGNLDLIRFLKTIQDEGLYAVLRIGPYVCAEWNYGGFPVWLHNMPGVQMRTA 141

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N+ F  EM+ F   I++++KQENLFASQGGP+ILAQ+ENEYGNV  +YG  G+ Y++W A
Sbjct: 142 NDVFMNEMRNFTTLIVNMVKQENLFASQGGPVILAQIENEYGNVMSSYGDEGKAYIEWCA 201

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A +L+  VPW+MCQQ DAP+P+INTCNG+YCD FTPN P+ P MWTEN++GWF S+G 
Sbjct: 202 NMAQSLHIGVPWLMCQQSDAPEPMINTCNGWYCDQFTPNRPTSPKMWTENWTGWFKSWGG 261

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P R  EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG 
Sbjct: 262 KDPHRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 321

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           + QPKWGHL+ELH  +   E+ L   + +    G  +   IY  +    + FL N DS +
Sbjct: 322 LNQPKWGHLKELHDVLHSMEDTLTRGNISSVDFGNSVSGTIY-STEKGSSCFLTNTDSRN 380

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  + F G  Y +PAWSVSILPDC++VV+NTAKV +Q +       ++KNV E   A+  
Sbjct: 381 DTTINFQGLDYEVPAWSVSILPDCQDVVYNTAKVSAQTS----VMVKKKNVAEDEPAALT 436

Query: 422 FSWYEE---KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLN 475
           +SW  E   K  + G        + +Q +   D SDYL+Y  S+ +    P  G  + L 
Sbjct: 437 WSWRPETNDKSILFGKGEVSVNQILDQKDAANDLSDYLFYMTSVSLKEDDPIWGDNMTLR 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I   G    VFVN + +   +  +   +++  ++I+LN+G NT+ +LS  VG  NYGA F
Sbjct: 497 ITGSGQVLHVFVNGEFIGSQWAKYGVFDYVFEQQIKLNKGKNTITLLSATVGFANYGANF 556

Query: 536 DVAGAGLFS-VILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           D+  AG+   V L+   + +   +DLSS +W Y+VG+EG    L     ++SS W+Q   
Sbjct: 557 DLTQAGVRGPVELVGYHDDEIIIKDLSSHKWSYKVGLEGLRQNLYS---SDSSKWQQ-DN 612

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
            P NK   WYK TF AP G  P+ ++L  +GKG AWVNG SIGRYW +++A        C
Sbjct: 613 YPTNKMFTWYKATFKAPLGTDPVVVDLLGLGKGLAWVNGNSIGRYWPSFIAEDGCSLDPC 672

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWV-HPGENLLVIHEELGGDPSKISLLTKTGQH 710
           DYRGSYD +KC  +CG+P Q  YH+PR+++ + G+N LV+ EE GGDPS ++  T     
Sbjct: 673 DYRGSYDNNKCVTNCGKPTQRWYHVPRSFLNNEGDNTLVLFEEFGGDPSSVNFQTTAIGS 732

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            C    E                    ++ L+C+ G  I+AI FAS+G P G CGSF  G
Sbjct: 733 ACVNAEE------------------KKKIELSCQ-GRPISAIKFASFGNPLGTCGSFSKG 773

Query: 771 ACHM--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            C    D L IVQKACVGQ  C+I VS    G S      ++K L+VEA C
Sbjct: 774 TCEASNDALSIVQKACVGQESCTIDVSEDTFG-STTCGDDVIKTLSVEAIC 823


>gi|448278449|gb|AGE44111.1| beta-galactosidase 101 [Malus x domestica]
          Length = 725

 Score =  786 bits (2030), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/711 (53%), Positives = 497/711 (69%), Gaps = 17/711 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V YDH+A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K GGL+VI+TYVFWN HE
Sbjct: 23  SASVGYDHKAIIINGQRRILISGSIHYPRSTPEMWPDLIQKAKAGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFP+WL ++PGI FRT 
Sbjct: 83  PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIWLKYVPGIAFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+++MK E LF ++GGPIIL+Q+ENEYG VEW  G  G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVNMMKAEKLFQTEGGPIILSQIENEYGPVEWEIGAPGKAYTKWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV LNT VPW+MC+QEDAPDP+I+TCNG+YC+ F PN   KP MWTE ++GW+  FG 
Sbjct: 203 QMAVGLNTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKMWTEVWTGWYTEFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           A+P RPVEDLAF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 263 AIPTRPVEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           ++QPKWGHL++LHKAIK CE  L++ DP+  KLG   EAH+++  S  CAAFLANYD+  
Sbjct: 323 LQQPKWGHLKDLHKAIKSCEYALVAVDPSVTKLGNNQEAHVFNTKSG-CAAFLANYDTKY 381

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V+F    Y LP WS+SILPDCK  VFNTAKV  + +       Q K V   L     
Sbjct: 382 PVRVSFGQGQYDLPPWSISILPDCKTAVFNTAKVTWKTSQ-----VQMKPVYSRLPWQ-- 434

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
            S+ EE      + +     L EQI  T+D +DYLWY   I +   +     GK   L I
Sbjct: 435 -SFIEETTTSDESGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEAFLNNGKFPLLTI 493

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S  HA  VF+N +L    YG+ +      ++ ++L  GIN L +LS+ VGL N G  F+
Sbjct: 494 FSACHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFE 553

Query: 537 VAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   I L  L  G  D+S  +W Y++G++GE +GL  ++ ++S  W +G ++   
Sbjct: 554 TWNAGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGEALGLHTVTGSSSVDWAEGPSMAKK 613

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK TF AP G  PLAL++ SMGKGQ W+NGQS+GR+W  Y+A   G    C+Y G
Sbjct: 614 QPLTWYKATFNAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--QGSCGTCNYAG 671

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           ++   KC+ +CG+P+Q  YHIPR+W+ P  NLLV+ EE GGDP  +SL+ +
Sbjct: 672 TFYDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPQWMSLVER 722


>gi|16604400|gb|AAL24206.1| At1g45130/F27F5_20 [Arabidopsis thaliana]
          Length = 732

 Score =  786 bits (2029), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/713 (53%), Positives = 487/713 (68%), Gaps = 19/713 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29  SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89  SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK+   FASQGGPIIL+Q+ENE+       G  G  YV WAA 
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV LNT VPWVMC+++DAPDPIINTCNGFYCD FTPN P KP MWTE +SGWF  FG  
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 VPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           ++PK+ HL++LH+AIK CE  L+SSDP   KLG   EAH++      C AFL NY  ++ 
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LPAWS+SILPDC+NVVFNTA V ++ ++      Q      +L + +  
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMVPSGSILYSVAR- 442

Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E +   GNR  +    L EQ+N T+DT+DYLWYT S+ +   +     GK   L +
Sbjct: 443 --YDEDIATYGNRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VFVN       +G  +   F  + ++ L  G N + +LS+ VGL N G  F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFE 560

Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+  SV+L  L  G +DLS  +W YQ G+ GE + L   +  +S  W +GS    N
Sbjct: 561 TWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN 620

Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           K  L WYK  F  P G  PLAL+L SMGKGQAW+NGQSIGRYW A+   + G    C+Y 
Sbjct: 621 KQPLTWYKAYFDVPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGDCGSCNYA 677

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G+Y  +KCQ  CG+P Q  YH+PR+W+ P  NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730


>gi|3299896|gb|AAC25984.1| beta-galactosidase [Solanum lycopersicum]
          Length = 724

 Score =  785 bits (2028), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/716 (52%), Positives = 497/716 (69%), Gaps = 22/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD RA++I+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21  VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR+DLVRF+K VQ AGL+++LRIGPY CAEWN+GGFPVWL ++PG++FRT
Sbjct: 81  EPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F+ KI+++MK ENLF SQGGPII+AQ+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 NNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GW+  FG
Sbjct: 201 AQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +P RP ED+AF+VARF +  G+F NYYMY GGTNFGRT+ G  +ATSYDYDAP+DEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PK+GHLR+LHKAIKL E  L+SS      LG+  EAH+Y   S  CAAFL+NYDS 
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               VTF    Y LP WS+SILPDCK  V+NTA+V SQ ++            ++  A  
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSS-----------IKMTPAGG 429

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW    EE      + +     L EQ N T+D+SDYLWY  ++++   +     GK+ 
Sbjct: 430 GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKDP 489

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
           +L + S GH   VFVN KL    YG  D      +  ++L  GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVG 549

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             +D   AG+   V L  L  G R+L+  +W Y+VG++GE + L  +S ++S  W +GS 
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRGSL 609

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK TF AP G  PLAL++ASMGKGQ W+NG+ +GR+W  Y+A   G   KC
Sbjct: 610 MAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIA--QGDCSKC 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G+++  KCQ +CGQP+Q  YH+PR+W+ P  NLLV+ EE GG+P+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRRS 723


>gi|359484258|ref|XP_002276918.2| PREDICTED: beta-galactosidase 7-like [Vitis vinifera]
 gi|297738528|emb|CBI27773.3| unnamed protein product [Vitis vinifera]
          Length = 835

 Score =  785 bits (2027), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/831 (47%), Positives = 542/831 (65%), Gaps = 45/831 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  V+YD RAL+IDGKRRVLQSGSIHYPRSTPE+WP+LIRK+K GGL+ IETYVFWN HE
Sbjct: 37  AVEVSYDGRALIIDGKRRVLQSGSIHYPRSTPEMWPDLIRKAKAGGLDAIETYVFWNVHE 96

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+R +Y F G  DL+RF++T+Q  GL+  LRIGPY CAEW YGGFP+WLH +PGI+FRT 
Sbjct: 97  PLRREYDFSGNLDLIRFIQTIQAEGLYAVLRIGPYVCAEWTYGGFPMWLHNMPGIEFRTA 156

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  F  EM+ F   I+D+ KQE LFASQGGPII+AQ+ENEYGN+   YG  G++YV W A
Sbjct: 157 NKVFMNEMQNFTTLIVDMAKQEKLFASQGGPIIIAQIENEYGNIMAPYGDAGKVYVDWCA 216

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A +L+  VPW+MCQQ DAP P+INTCNG+YCD FTPN+P+ P MWTEN++GWF ++G 
Sbjct: 217 AMANSLDIGVPWIMCQQSDAPQPMINTCNGWYCDSFTPNNPNSPKMWTENWTGWFKNWGG 276

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P R  EDL+++VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDYDAP+DE+G 
Sbjct: 277 KDPHRTAEDLSYSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYDAPLDEFGN 336

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           + QPKWGHL++LH  +K  EE L   + T   +G  +E  +Y  +    + F +N ++++
Sbjct: 337 LNQPKWGHLKDLHTVLKSMEETLTEGNITTIDMGNSVEVTVY-ATQKVSSCFFSNSNTTN 395

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DA  T+ G  Y +PAWSVSILPDCK  V+NTAKV +Q +       + KN  E   AS  
Sbjct: 396 DATFTYGGTEYTVPAWSVSILPDCKKEVYNTAKVNAQTS----VMVKNKNEAEDQPASLK 451

Query: 422 FSWYEEKV---GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLN 475
           +SW  E +    + G        L +Q  TT D SDYLWY  S+ +          + L 
Sbjct: 452 WSWRPEMIDDTAVLGKGQVSANRLIDQ-KTTNDRSDYLWYMNSVDLSEDDLVWTDNMTLR 510

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + + GH    +VN + +   +  +   N++  +K++L  G N + +LS  +G QNYGA++
Sbjct: 511 VNATGHILHAYVNGEYLGSQWATNGIFNYVFEEKVKLKPGKNLIALLSATIGFQNYGAFY 570

Query: 536 DVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSF-WKQGS 590
           D+  +G+   + I  + G     +DLSS +W Y+VG+ G  +   K+    S + W++G+
Sbjct: 571 DLVQSGISGPVEIVGRKGDETIIKDLSSHKWSYKVGMHGMAM---KLYDPESPYKWEEGN 627

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            +P+N++L WYKTTF AP G   + ++L  +GKG+AWVNGQS+GRYW + +A   GC   
Sbjct: 628 -VPLNRNLTWYKTTFKAPLGTDAVVVDLQGLGKGEAWVNGQSLGRYWPSSIAED-GCNAT 685

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDYRG Y  +KC ++CG P Q  YH+PR+++   EN LV+ EE GG+PS ++  T T   
Sbjct: 686 CDYRGPYTNTKCVRNCGNPTQRWYHVPRSFLTADENTLVLFEEFGGNPSLVNFQTVTIGT 745

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            C           ++++ N+        + LAC+    I+ I FAS+G P+G+CGSF  G
Sbjct: 746 ACG----------NAYENNV--------LELACQNR-PISDIKFASFGDPQGSCGSFSKG 786

Query: 771 AC--HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           +C  + D L I++KACVG+  CS+ VS    G +  +C  + K LAVEA C
Sbjct: 787 SCEGNKDALDIIKKACVGKESCSLDVSEKAFGST--SCGSIPKRLAVEAVC 835


>gi|195617466|gb|ACG30563.1| beta-galactosidase precursor [Zea mays]
          Length = 723

 Score =  785 bits (2027), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/714 (53%), Positives = 492/714 (68%), Gaps = 24/714 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25  NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQYYF  R+DLVRFVK  ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 85  PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + Y  WAA
Sbjct: 145 NGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV     VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG 
Sbjct: 205 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 264

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG 
Sbjct: 265 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGL 324

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LHKAIK  E  L+S DPT Q LG   +A+++  S   CAAFL+NY +S+
Sbjct: 325 LRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSA 384

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            A V FNG  Y LPAWS+S+LPDCK  VFNTA V         P A  +     +  +  
Sbjct: 385 AARVVFNGRRYDLPAWSISVLPDCKAAVFNTATV-------SEPSAPAR-----MSPAGG 432

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           FSW  Y E       R+F +  L EQ++ T D SDYLWYT  +++   +     G+   L
Sbjct: 433 FSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQL 492

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + S GH+  VFVN +     YG +D      +  +++ +G N + ILS  VGL N G  
Sbjct: 493 TVYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTH 552

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++    G+   V L  L  GKRDLS+ +W YQ+G+ GE +G+  ++ ++S  W   +   
Sbjct: 553 YETWNVGVLGPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG-- 610

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L W+K  F AP G  P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+G    C Y
Sbjct: 611 -KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGGCGGCSY 668

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y  +KCQ  CG  +Q  YH+PR+W++P  NLLV+ EE GGD   + L+T+T
Sbjct: 669 AGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTRT 722


>gi|297846860|ref|XP_002891311.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337153|gb|EFH67570.1| hypothetical protein ARALYDRAFT_473836 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 732

 Score =  784 bits (2025), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/713 (53%), Positives = 488/713 (68%), Gaps = 19/713 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29  SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89  SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK+   FASQGGPIIL+Q+ENE+       G  G  YV WAA 
Sbjct: 149 GPFKAAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPELKGLGPAGHSYVNWAAK 208

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV LNT VPWVMC+++DAPDPIIN+CNGFYCD FTPN P KP MWTE +SGWF  FG  
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINSCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +P RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 IPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           ++PK+ HL++LH+AIK CE  L+SSDP   KLG   EAH++      C AFL NY  ++ 
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LPAWS+SILPDC+NVVFNTA V ++ ++      Q      +L + +  
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMMPSGSILYSVAR- 442

Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E +   G+R  +    L EQ+N T+DT+DYLWYT S+ +   +     GK   L +
Sbjct: 443 --YDEDIATYGDRGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VFVN       +G  +   F  + ++ L  G N + +LS+ VGL N G  F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANRIALLSVAVGLPNVGPHFE 560

Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+  SV+L  L  G +DLS  +W YQ G+ GE + L   +  +S  W +GS    N
Sbjct: 561 TWATGIVGSVVLHGLDEGNKDLSWQKWTYQAGLRGEAMKLVSPTEDSSVDWIKGSLAKQN 620

Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           K  L WYK  F AP G  PLAL+L SMGKGQAW+NGQSIGRYW A+   + G    C+Y 
Sbjct: 621 KQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGNCGSCNYA 677

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G+Y  +KCQ  CG+P Q  YH+PR+W+ P  NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPRGNLLVLFEELGGDISKVSVVKRS 730


>gi|7682680|gb|AAF67342.1| beta galactosidase [Vigna radiata]
          Length = 739

 Score =  784 bits (2025), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/715 (53%), Positives = 491/715 (68%), Gaps = 22/715 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LIRK+K GGL+ I+TYVFWN H
Sbjct: 24  IHCSVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIRKAKGGGLDAIDTYVFWNVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLVRF+KTVQ  GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84  EPSPGIYNFEGRYDLVRFIKTVQRVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+ENEYG+     G  G  Y  WA
Sbjct: 144 DNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGSESKQLGGAGYAYTNWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV LNT VPWVMC+Q+DAPDP+IN CNGFYCD F+PN P KP +WTE++SGWF  FG
Sbjct: 204 AKMAVGLNTGVPWVMCKQDDAPDPVINACNGFYCDYFSPNKPYKPTLWTESWSGWFTEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPV+DLAFAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GPIYQRPVQDLAFAVARFIQKGGSYINYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IR+PK+GHL +LHKAIK CE  L+SSDPT   LGA  +AH++   +  CAAFLANY S+
Sbjct: 324 LIREPKYGHLMDLHKAIKQCERALVSSDPTVTSLGAYEQAHVFSSKNGACAAFLANYHSN 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A VTFN   Y LP WS+SILPDCK  VFNTA+V            Q   +  L   S 
Sbjct: 384 SAARVTFNNRKYDLPPWSISILPDCKTDVFNTARV----------RFQTTKIQMLPSNSK 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            FSW  Y+E V  +S +       L EQ+N T+DTSDYLWY  S+ +   +     G + 
Sbjct: 434 LFSWETYDEDVSSLSESSKITASGLLEQLNATRDTSDYLWYITSVDISSSESFLRGGNKP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            +++ S GHA  VF+N + +   +G  +  +   N  + L  G N + +LS+ VGL N G
Sbjct: 494 SISVHSAGHAVHVFINGQFLGSAFGTSEDRSCTFNGPVNLRAGTNKIALLSVAVGLPNVG 553

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
             F+   AG+  V+L  L +G++DL+  +W YQ+G++GE + L   +  +S  W + S  
Sbjct: 554 FHFETWKAGITGVLLYGLDHGQKDLTWQKWSYQIGLKGEAMNLVSPNGVSSVDWVRDSLD 613

Query: 593 PVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
             ++S L W+K  F AP+G  PLAL+L+SMGKGQ W+NGQSIGRYW  Y   + G    C
Sbjct: 614 VRSQSQLKWHKAYFNAPDGVEPLALDLSSMGKGQVWINGQSIGRYWMVY---AKGACNSC 670

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           +Y G+Y  +KCQ  CGQP Q  YH+PR+W+ P  NL+V+ EELGG+P KISL  +
Sbjct: 671 NYAGTYRPAKCQLGCGQPTQQWYHVPRSWLKPTNNLIVLLEELGGNPWKISLQKR 725


>gi|14970843|emb|CAC44502.1| beta-galactosidase [Fragaria x ananassa]
          Length = 722

 Score =  783 bits (2023), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/714 (53%), Positives = 491/714 (68%), Gaps = 25/714 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V YDHRA++++GKRR+L SGSIHYPRSTPE+WP+L++K+K+GGL+V++TYVFWN HEP
Sbjct: 25  ASVGYDHRAIIVNGKRRILISGSIHYPRSTPEMWPDLLQKAKDGGLDVLQTYVFWNGHEP 84

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFE R+DLV+F+K  Q+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 85  SPGKYYFEDRYDLVKFIKLAQQHGLYVHLRIGPYICAEWNFGGFPVWLKYVPGIAFRTDN 144

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PF   M++F  KI+ +MK E LF +QGGPIIL+Q+ENEYG VEW  G  G+ Y +WAA 
Sbjct: 145 RPFMAAMEKFTQKIVYMMKAERLFQTQGGPIILSQIENEYGPVEWEIGAPGKSYTQWAAK 204

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV LNT VPWVMC+QEDAPDPII+TCNGFYC+ FTPN   KP MWTE ++GW+  FG A
Sbjct: 205 MAVGLNTGVPWVMCKQEDAPDPIIDTCNGFYCENFTPNKNYKPKMWTEIWTGWYTEFGGA 264

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RP +DLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG  
Sbjct: 265 VPTRPAQDLAFSVARFIQNGGSFANYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLP 324

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R+PK+ HL+ +HKAIK+ E  L+++D    KLG   EAH+Y +S + CAAFLANYD+   
Sbjct: 325 REPKYSHLKYMHKAIKMAEPALLATDAAVSKLGNNQEAHVY-QSRSGCAAFLANYDTKYP 383

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             VTF    Y LP WS+SILPDCK  VFNTA+V      G  P  +   V  L       
Sbjct: 384 VRVTFWNKQYNLPPWSISILPDCKTEVFNTARV------GQSPPTKMTPVAHL------- 430

Query: 423 SW--YEEKVGISG-NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW  Y E V  S  + +F    L EQI+ T D +DYLWY   I + P +     GK   L
Sbjct: 431 SWQAYIEDVATSADDNAFTSVGLREQISLTWDNTDYLWYMTDITIGPNEQFLRTGKYPTL 490

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            ++S GHA  VF+N +L    YG   F     N+ ++L  GIN L +LS+ VGL N G  
Sbjct: 491 KVDSAGHALHVFINGQLSGSAYGTLAFPKLEFNQGVKLRAGINKLALLSVSVGLANVGLH 550

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+    G+   V L  + +G  D++  +W Y++G+ GE + L  +S ++S  W QGS L 
Sbjct: 551 FETWNTGVLGPVTLAGVNSGTWDMTRWQWTYKIGMRGEDMSLHTVSGSSSVEWVQGSLLA 610

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK    AP G  PLAL++ SMGKGQ W+NGQSIGR+W AY A   G    C Y
Sbjct: 611 QYRPLTWYKAILNAPPGNAPLALDMGSMGKGQMWINGQSIGRHWPAYKA--HGSCGACYY 668

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y  +KC+ +CGQP+Q  YH+PR+W+    NLLV+ EE GGDP+KISL+ ++
Sbjct: 669 AGTYTENKCRTNCGQPSQRWYHVPRSWLKSSGNLLVVFEEWGGDPTKISLVARS 722


>gi|297816572|ref|XP_002876169.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
 gi|297322007|gb|EFH52428.1| AT3g52840/F8J2_10 [Arabidopsis lyrata subsp. lyrata]
          Length = 728

 Score =  783 bits (2021), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/717 (53%), Positives = 501/717 (69%), Gaps = 25/717 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27  AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PGI FRT N
Sbjct: 87  SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGIVFRTDN 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+RF  KI+D+MK+E LF +QGGPIIL+Q+ENEYG +EW  G  G+ Y KW A+
Sbjct: 147 EPFKIAMQRFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMEWEMGAAGKAYSKWTAE 206

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L+T VPW+MC+QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF  FG A
Sbjct: 207 MALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +P RPVED+AF+VARF + GG+F NYYMY+GGTNF RTA G  +ATSYDYDAP+DEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFLNYYMYYGGTNFDRTA-GVFIATSYDYDAPLDEYGLL 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R+PK+ HL+ELHK IKLCE  L+S DPT   LG K E H++ KS   CAAFL+NYD+SS 
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEVHVF-KSKTSCAAFLSNYDTSSA 384

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A + F G  Y LP WSVSILPDCK   +NTAK+ +       P    K    ++  S+ F
Sbjct: 385 ARIMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MVPTSTKF 433

Query: 423 SW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW  Y E    S +  +FV+  L EQI+ T+D +DY WY   I +   +     G +  L
Sbjct: 434 SWESYNEGSPSSNDDGTFVKDGLVEQISMTRDKTDYFWYLTDITIGSDESFLKTGDDPLL 493

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VFVN  L    YG    +    ++KI+L+ GIN L +LS  VGL N G  
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQKIKLSVGINKLALLSTAVGLPNAGVH 553

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS-SFWKQGSTL 592
           ++    G+   V L  + +G  D+S  +W Y++G+ GE +    I+ +++  +W +GS +
Sbjct: 554 YETWNTGVLGPVTLKGVNSGTWDMSKWKWSYKIGIRGEAMSFHTIAGSSAVKWWIKGSFV 613

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYK++F  P+G  PLAL++ +MGKGQ WVNG +IGR+W AY A   G   +C+
Sbjct: 614 VKKEPLTWYKSSFDTPKGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTA--RGNCGRCN 671

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
           Y G Y+  KC  HCG+P+Q  YH+PR+W+ P  NLLVI EE GGDPS ISL+ +T +
Sbjct: 672 YAGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 728


>gi|350538173|ref|NP_001234842.1| ss-galactosidase precursor [Solanum lycopersicum]
 gi|4138141|emb|CAA10175.1| ss-galactosidase [Solanum lycopersicum]
          Length = 724

 Score =  782 bits (2020), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/716 (52%), Positives = 496/716 (69%), Gaps = 22/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD RA++I+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21  VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
            P  G+Y FEGR+DLVRF+K VQ AGL+++LRIGPY CAEWN+GGFPVWL ++PG++FRT
Sbjct: 81  GPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F+ KI+++MK ENLF SQGGPII+AQ+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 NNQPFKVAMRGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPW+MC+QEDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GW+  FG
Sbjct: 201 AQMAVGLKTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +P RP ED+AF+VARF +  G+F NYYMY GGTNFGRT+ G  +ATSYDYDAP+DEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PK+GHLR+LHKAIKL E  L+SS      LG+  EAH+Y   S  CAAFL+NYDS 
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               VTF    Y LP WS+SILPDCK  V+NTA+V SQ ++            ++  A  
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSS-----------IKMTPAGG 429

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW    EE      + +     L EQ N T+D+SDYLWY  ++++   +     GK+ 
Sbjct: 430 GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLKNGKDP 489

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
           +L + S GH   VFVN KL    YG  D      +  ++L  GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVG 549

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             +D   AG+   V L  L  G R+L+  +W Y+VG++GE + L  +S ++S  W +GS 
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRGSL 609

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK TF AP G  PLAL++ASMGKGQ W+NG+ +GR+W  Y+A   G   KC
Sbjct: 610 VAQKQPLTWYKATFNAPGGNDPLALDMASMGKGQIWINGEGVGRHWPGYIA--QGDCSKC 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G+++  KCQ +CGQP+Q  YH+PR+W+ P  NLLV+ EE GG+P+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWYHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRRS 723


>gi|380450408|gb|AFD54987.1| beta-galactosidase [Momordica charantia]
          Length = 719

 Score =  782 bits (2020), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/712 (53%), Positives = 493/712 (69%), Gaps = 20/712 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYD +A++I+GKRR+L SGSIHYPRSTP++WP LI+ +K+GGL++IETYVFWN HEP
Sbjct: 20  ATVTYDQKAIIINGKRRILVSGSIHYPRSTPQMWPSLIQNAKDGGLDIIETYVFWNGHEP 79

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL  +PGI FRT N
Sbjct: 80  TQGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPIWLKHVPGIVFRTEN 139

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+ +MK E L+ SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA 
Sbjct: 140 EPFKAAMQKFTEKIVGMMKSEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 199

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN  +KP +WTE +SGW+ +FG A
Sbjct: 200 MALGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNRENKPKIWTEVWSGWYTAFGGA 259

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RP EDLAF+VARF + GG+  NYYMY GGTNFGR++ G  +A SYD+DAPIDEYG  
Sbjct: 260 VPYRPAEDLAFSVARFVQNGGSLFNYYMYHGGTNFGRSS-GLFIANSYDFDAPIDEYGLK 318

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R+PKW HLR+LHKAIKLCE  L+S+DP    LG  LEA ++  SS  CAAFLANYD S+ 
Sbjct: 319 REPKWEHLRDLHKAIKLCEPALVSADPNVTWLGKNLEARVFKSSSGACAAFLANYDISTS 378

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           + V+F    Y LP WS+SIL DCK+ +FNTA++           AQ   +  +L++S  +
Sbjct: 379 SKVSFWNTQYDLPPWSISILSDCKSAIFNTARI----------GAQSAPMKMMLVSSFWW 428

Query: 423 SWYEEKVGIS-GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E+V       +  +  L EQ+N T D++DYLWY   I + P +     G+   LNI
Sbjct: 429 LSYKEEVASGYATDTTTKDGLVEQVNFTWDSTDYLWYMTDIQIDPNEAFIKSGQWPLLNI 488

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH   VFVN +L    YG+ +      +K + L  G+N L +LS+ VGL N G  F+
Sbjct: 489 SSAGHVLHVFVNGQLSGTVYGSLENPKVAFSKYVNLKAGVNKLSMLSVTVGLPNVGLHFE 548

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   V L  L  G RD+S  +W ++VG++GE + L  I  +NS  W +GS L   
Sbjct: 549 SWNAGVLGPVTLKGLNEGIRDMSGYKWSHKVGLKGENMNLHTIGGSNSVQWAKGSGLVQK 608

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYKT F  P G  PLAL+++SMGKGQ W+NG+SIGRYW AY A  +G   KC Y G
Sbjct: 609 QPLTWYKTNFNTPAGNEPLALDMSSMGKGQIWINGRSIGRYWPAYAA--SGSCGKCSYAG 666

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            +   KC  +CGQP+Q  YH+PR W+    N LV+ EELGG+P  ISL+ ++
Sbjct: 667 IFTEKKCLSNCGQPSQKWYHVPREWLESKGNFLVVFEELGGNPGGISLVKRS 718


>gi|6686882|emb|CAB64741.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 732

 Score =  782 bits (2020), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/713 (53%), Positives = 486/713 (68%), Gaps = 19/713 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ++VTYD +A+VI+G RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN HEP
Sbjct: 29  SSVTYDKKAIVINGHRRILLSGSIHYPRSTPEMWEDLIKKAKDGGLDVIDTYVFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y FEGR+DLVRF+KT+QE GL++HLRIGPY CAEWN+GGFPVWL ++ GI FRT N
Sbjct: 89  SPGTYNFEGRYDLVRFIKTIQEVGLYVHLRIGPYVCAEWNFGGFPVWLKYVDGISFRTDN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +MK+   FASQGGPIIL+Q+ENE+       G  G  YV WAA 
Sbjct: 149 GPFKSAMQGFTEKIVQMMKEHRFFASQGGPIILSQIENEFEPDLKGLGPAGHSYVNWAAK 208

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV LNT VPWVMC+++DAPDPIINTCNGFYCD FTPN P KP MWTE +SGWF  FG  
Sbjct: 209 MAVGLNTGVPWVMCKEDDAPDPIINTCNGFYCDYFTPNKPYKPTMWTEAWSGWFTEFGGT 268

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP RPVEDLAF VARF + GG++ NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG +
Sbjct: 269 VPKRPVEDLAFGVARFIQKGGSYINYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLV 328

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           ++PK+ HL++LH+AIK CE  L+SSDP   KLG   EAH++      C AFL NY  ++ 
Sbjct: 329 QEPKYSHLKQLHQAIKQCEAALVSSDPHVTKLGNYEEAHVFTAGKGSCVAFLTNYHMNAP 388

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V FN   Y LPAWS+SILPDC+NVVFNTA V ++ ++      Q      +L + +  
Sbjct: 389 AKVVFNNRHYTLPAWSISILPDCRNVVFNTATVAAKTSH-----VQMVPSGSILYSVAR- 442

Query: 423 SWYEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
             Y+E +   GN   +    L EQ+N T+DT+DYLWYT S+ +   +     GK   L +
Sbjct: 443 --YDEDIATYGNPGTITARGLLEQVNVTRDTTDYLWYTTSVDIKASESFLRGGKWPTLTV 500

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S GHA  VFVN       +G  +   F  + ++ L  G N + +LS+ VGL N G  F+
Sbjct: 501 DSAGHAVHVFVNGHFYGSAFGTRENRKFSFSSQVNLRGGANKIALLSVAVGLPNVGPHFE 560

Query: 537 VAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+  SV L  L  G +DLS  +W YQ G+ GE + L   +  +S  W +GS    N
Sbjct: 561 TWATGIVGSVALHGLDEGNKDLSWQKWTYQAGLRGESMNLVSPTEDSSVDWIKGSLAKQN 620

Query: 596 KS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           K  L WYK  F AP G  PLAL+L SMGKGQAW+NGQSIGRYW A+   + G    C+Y 
Sbjct: 621 KQPLTWYKAYFDAPRGNEPLALDLKSMGKGQAWINGQSIGRYWMAF---AKGDCGSCNYA 677

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G+Y  +KCQ  CG+P Q  YH+PR+W+ P  NLLV+ EELGGD SK+S++ ++
Sbjct: 678 GTYRQNKCQSGCGEPTQRWYHVPRSWLKPKGNLLVLFEELGGDISKVSVVKRS 730


>gi|30687121|ref|NP_849553.1| beta-galactosidase 12 [Arabidopsis thaliana]
 gi|75265630|sp|Q9SCV0.1|BGL12_ARATH RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
           Precursor
 gi|6686896|emb|CAB64748.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|332659762|gb|AEE85162.1| beta-galactosidase 12 [Arabidopsis thaliana]
          Length = 728

 Score =  782 bits (2019), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/717 (51%), Positives = 508/717 (70%), Gaps = 21/717 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25  VKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 85  EPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW  G  G+ Y KW 
Sbjct: 145 DNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A  L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS +KP MWTEN++GWF  FG
Sbjct: 205 AEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G  +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PK+ HL+ LHK IKLCE  L+S+DPT   LG K EAH++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF-KSKSSCAAFLSNYNTS 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F G+ Y LP WSVSILPDCK   +NTAKV   R +  H         +++  ++
Sbjct: 383 SAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV-QVRTSSIH--------MKMVPTNT 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
            FSW  Y E++   + N +F +  L EQI+ T+D +DY WY   I + P +    G++  
Sbjct: 434 PFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 493

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GHA  VFVN +L    YG+ +      ++KI+L+ G+N L +LS   GL N G 
Sbjct: 494 LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGV 553

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            ++    G+   V L  + +G  D++  +W Y++G +GE + +  ++ +++  WK+GS +
Sbjct: 554 HYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLAGSSTVEWKEGSLV 613

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYK+TF +P G  PLAL++ +MGKGQ W+NGQ+IGR+W AY A   G  ++C 
Sbjct: 614 AKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWPAYTA--RGKCERCS 671

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
           Y G++   KC  +CG+ +Q  YH+PR+W+ P  NL+++ EE GG+P+ ISL+ +T +
Sbjct: 672 YAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPNGISLVKRTAK 728


>gi|297799386|ref|XP_002867577.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297313413|gb|EFH43836.1| beta-galactosidase 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 728

 Score =  781 bits (2018), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/717 (51%), Positives = 506/717 (70%), Gaps = 21/717 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25  VKAMVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++P + FRT
Sbjct: 85  EPSPGQYYFEDRYDLVKFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPDMVFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW  G  G+ Y KW 
Sbjct: 145 DNEPFKAAMQKFTEKIVGMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A  L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS  KP MWTEN++GWF  FG
Sbjct: 205 AKMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDKKPKMWTENWTGWFTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G  +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PK+ HL+ LHK IKLCE  L+S+DPT   LG K EA ++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAQVF-KSQSSCAAFLSNYNTS 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V+F G+ Y LP WSVSILPDCK   +NTAKV   R +  H         +++  ++
Sbjct: 383 SAARVSFGGSTYDLPPWSVSILPDCKTEYYNTAKV-QVRTSSIH--------MKMVPTNT 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
            FSW  Y E++   + N +F +  L EQI+ T+D +DY WY   I + P +    G++  
Sbjct: 434 LFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 493

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           LNI S GHA  VFVN +L    YG+ +      ++KI+L+ G+N L +LS+  GL N G 
Sbjct: 494 LNIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSIAAGLPNVGV 553

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            ++    G+   V L  + +G  D+S  +W Y++G +GE + +  ++ +++  WKQGS +
Sbjct: 554 HYETWNTGVLGPVTLKGVNSGTWDMSQWKWSYKIGTKGEALSIHTVTGSSTVEWKQGSLV 613

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYK+TF  P G  PLAL++ +MGKGQ W+NGQ+IGR+W AY A   G  ++C 
Sbjct: 614 ATKQPLTWYKSTFDTPAGNEPLALDMNTMGKGQTWINGQNIGRHWPAYTA--RGKCERCS 671

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
           Y G++  +KC  +CG+ +Q  YH+PR+W+ P  NL+V+ EE GG+P+ ISL+ +  +
Sbjct: 672 YAGTFTENKCLSNCGEASQRWYHVPRSWLKPTNNLVVVLEEWGGEPNGISLVKRRAK 728


>gi|356509960|ref|XP_003523710.1| PREDICTED: beta-galactosidase 3-like isoform 1 [Glycine max]
          Length = 736

 Score =  781 bits (2016), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/716 (53%), Positives = 489/716 (68%), Gaps = 25/716 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD ++L+I+G+RR+L SGSIHYPRSTPE+W +LI K+K GGL+VI+TYVFW+ HEP 
Sbjct: 29  NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G Y FEGR+DLVRF+KTVQ+ GL+ +LRIGPY CAEWN+GG PVWL ++PG+ FRT N 
Sbjct: 89  PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+ENEYG    + G  G  YV WAA  
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYGPE--SRGAAGRAYVNWAASM 206

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV L T VPWVMC++ DAPDP+IN+CNGFYCD F+PN P KP MWTE +SGWF  FG  +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             RPVEDL+FAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG IR
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPK+ HL+ELHKAIK CE  L+S DPT   LG  L+AH++   +  CAAFLANY++ S A
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            VTFN   Y LP WS+SILPDCK  VFNTAKV            Q   V  L +    FS
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAKV----------RVQPSQVKMLPVKPKLFS 436

Query: 424 W--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           W  Y+E +  ++ +     P L EQ+N T+DTSDYLWY  S+ +   +     G++  +N
Sbjct: 437 WESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSIN 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           ++S GHA  VFVN +     +G  +  +   N  ++L  G N + +LS+ VGLQN G  +
Sbjct: 497 VQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHY 556

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V+L  L  G++DL+  +W Y+VG+ GE + L   +  +S  W Q S    
Sbjct: 557 ETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQ 616

Query: 595 NKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           ++S L WYK  F AP GK PLAL+L SMGKGQ W+NGQSIGRYW AY   + G    C Y
Sbjct: 617 SRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAY---AKGDCNSCTY 673

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            G++   KCQ  CGQP Q  YH+PR+W+ P +NL+V+ EELGG+P KISL+ +   
Sbjct: 674 SGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKRVAH 729


>gi|334305536|gb|AEG76892.1| putative beta-galactosidase [Linum usitatissimum]
 gi|334305538|gb|AEG76893.1| putative beta-galactosidase [Linum usitatissimum]
          Length = 731

 Score =  780 bits (2015), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/716 (52%), Positives = 494/716 (68%), Gaps = 24/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A VTYD +A++++G+RR+L +GSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 27  VTATVTYDGKAIIVNGQRRILIAGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 86

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G YYFE RFDLV+FVK VQ+AGL+++LRIGPYACAEWN+GGFPVWL ++PG+ FRT
Sbjct: 87  EPSPGNYYFEDRFDLVKFVKVVQQAGLYVNLRIGPYACAEWNFGGFPVWLKYVPGMSFRT 146

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+++MKQE LF  QGGPIIL+Q+ENEYG +EW     G+ Y +WA
Sbjct: 147 DNEPFKAAMQKFTEKIVNMMKQEQLFEPQGGPIILSQIENEYGPIEWELKAPGKAYAQWA 206

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV LNT VPW+ C+QEDAPDP+I+TCN +YC+ FTPN   KP MWTE ++ WF S+G
Sbjct: 207 AQMAVGLNTGVPWIACKQEDAPDPLIDTCNAYYCEKFTPNKSYKPKMWTEAWTAWFTSWG 266

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             V +RP ED AF+V +F ++GG++ NYYMY GGTNFGRTAGGP VATSYDYDAP+DEYG
Sbjct: 267 NPVLYRPAEDQAFSVLKFIQSGGSYANYYMYHGGTNFGRTAGGPFVATSYDYDAPLDEYG 326

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
               PK+ HL+ +HKAIK  E+ L+S+D T   LG   EAH+Y  SS+ CAAFLANYD S
Sbjct: 327 LTNDPKYTHLKHMHKAIKQSEKALVSADATVTSLGTNQEAHVY-SSSSGCAAFLANYDVS 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               V F    Y LPAWS+SILPDCK  V+NTAKV++ R            V++ +    
Sbjct: 386 YSVKVNFGSGQYDLPAWSISILPDCKTEVYNTAKVLAPR------------VHKKMTPLG 433

Query: 421 AFSW--YEEKVGISGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            F+W  Y ++V           D L EQ+  TKD+SDYLWY   + +   +     GK+ 
Sbjct: 434 GFTWDSYIDEVASGFASDTTTEDGLWEQLYMTKDSSDYLWYMQDVKIGSDEAFLTNGKDP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
           FLN++S GH   VFVN KL+   YG++D      ++ ++LN G+N + +LS  VGL N G
Sbjct: 494 FLNVQSAGHFLNVFVNGKLIGSAYGSNDNPKLTFSQSVKLNVGVNKIALLSASVGLANVG 553

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  L  G  D++  +W Y+VGV+GE + L+ ++ ++S  W +GS 
Sbjct: 554 LHFENYNVGVLGPVTLTGLNQGTVDMTKWKWSYKVGVQGEKLQLNTVAGSSSVEWVKGSM 613

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           L   + L WYK+TF APEG  P+AL++ SMGKGQ W+NGQ IGRYW AY A   G    C
Sbjct: 614 LAKKQPLTWYKSTFNAPEGNDPVALDMISMGKGQIWINGQGIGRYWPAYTA--QGNCGGC 671

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G +   KC   CGQP Q  YH+PR+W+ P  NLLV+ EE GGDP+ IS++ +T
Sbjct: 672 SYGGYFTEKKCLTGCGQPTQRWYHVPRSWLKPTGNLLVVFEEWGGDPTGISMVKRT 727


>gi|308550950|gb|ADO34789.1| beta-galactosidase STBG4 [Solanum lycopersicum]
          Length = 724

 Score =  780 bits (2015), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/716 (52%), Positives = 496/716 (69%), Gaps = 22/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD RA++I+GKR++L SGSIHYPRSTP++WP+LI+K+K+GGL+VIETYVFWN H
Sbjct: 21  VKASVSYDDRAIIINGKRKILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIETYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+Y FEGR+DLVRF+K VQ AGL+++LRIGPY CAEWN+GGFPVWL ++PG++FRT
Sbjct: 81  EPSPGKYNFEGRYDLVRFIKMVQRAGLYVNLRIGPYVCAEWNFGGFPVWLKYVPGMEFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F+ KI+++MK ENLF SQGGPII+AQ+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 NNQPFKVAMQGFVQKIVNMMKSENLFESQGGPIIMAQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPW+MC++EDAPDP+I+TCNGFYC+GF PN P KP MWTE ++GW+  FG
Sbjct: 201 AQMAVGLKTGVPWIMCKREDAPDPVIDTCNGFYCEGFRPNKPYKPKMWTEVWTGWYTKFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +P RP ED+AF+VARF +  G+F NYYMY GGTNFGRT+ G  +ATSYDYDAP+DEYG
Sbjct: 261 GPIPQRPAEDIAFSVARFVQNNGSFFNYYMYHGGTNFGRTSSGLFIATSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + +PK+GHLR+LHKAIKL E  L+SS      LG+  EAH+Y   S  CAAFL+NYDS 
Sbjct: 321 LLNEPKYGHLRDLHKAIKLSEPALVSSYAAVTSLGSNQEAHVYRSKSGACAAFLSNYDSR 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
               VTF    Y LP WS+SILPDCK  V+NTA+V SQ ++            ++  A  
Sbjct: 381 YSVKVTFQNRPYNLPPWSISILPDCKTAVYNTAQVNSQSSS-----------IKMTPAGG 429

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW    EE      + +     L EQ N T+D+SDYLWY  ++++   +     GK+ 
Sbjct: 430 GLSWQSYNEETPTADDSDTLTANGLWEQKNVTRDSSDYLWYMTNVNIASNEGFLRNGKDP 489

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
           +L + S GH   VFVN KL    YG  D      +  ++L  GIN + +LS+ VGL N G
Sbjct: 490 YLTVMSAGHVLHVFVNGKLSGTVYGTLDNPKLTYSGNVKLRAGINKISLLSVSVGLPNVG 549

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             +D   AG+   V L  L  G R+L+  +W Y+VG++GE + L  +S ++S  W +GS 
Sbjct: 550 VHYDTWNAGVLGPVTLSGLNEGSRNLAKQKWSYKVGLKGESLSLHSLSGSSSVEWVRGSL 609

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK TF AP G  PLAL +ASMGKGQ W+NG+ +GR+W  Y+A   G   KC
Sbjct: 610 VAQKQPLTWYKATFNAPGGNDPLALGMASMGKGQIWINGEGVGRHWPGYIA--QGDCSKC 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            Y G+++  KCQ +CGQP+Q  +H+PR+W+ P  NLLV+ EE GG+P+ ISL+ ++
Sbjct: 668 SYAGTFNEKKCQTNCGQPSQRWHHVPRSWLKPSGNLLVVFEEWGGNPTGISLVRRS 723


>gi|255550411|ref|XP_002516256.1| beta-galactosidase, putative [Ricinus communis]
 gi|223544742|gb|EEF46258.1| beta-galactosidase, putative [Ricinus communis]
          Length = 848

 Score =  780 bits (2014), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/829 (48%), Positives = 523/829 (63%), Gaps = 41/829 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V++D RA+ IDGKRRVL SGSIHYPRST E+WP+LI+KSKEGGL+ IETYVFWN HEP R
Sbjct: 47  VSHDGRAITIDGKRRVLISGSIHYPRSTAEMWPDLIKKSKEGGLDAIETYVFWNSHEPSR 106

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DLVRF+KT+Q  GL+  LRIGPY CAEWNYGGFP+WLH +PG + RT N+ 
Sbjct: 107 RQYDFSGNLDLVRFIKTIQAEGLYAVLRIGPYVCAEWNYGGFPMWLHNLPGCELRTANSV 166

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F  EM+ F + I+D+MK ENLFASQGGPIILAQVENEYGNV  AYG  G+ Y+ W ++ A
Sbjct: 167 FMNEMQNFTSLIVDMMKDENLFASQGGPIILAQVENEYGNVMSAYGAAGKTYIDWCSNMA 226

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +L+  VPW+MCQQ DAP P+INTCNG+YCD FTPN+ + P MWTEN++GWF S+G   P
Sbjct: 227 ESLDIGVPWIMCQQSDAPQPMINTCNGWYCDQFTPNNANSPKMWTENWTGWFKSWGGKDP 286

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  ED+AFAVARFF+TGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG + Q
Sbjct: 287 HRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGNLNQ 346

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PKWGHL++LH  +   E  L   + +       + A IY  +  + A F  N + +SDA 
Sbjct: 347 PKWGHLKQLHDILHSMEYTLTHGNISTIDYDNSVTATIY-ATDKESACFFGNANETSDAT 405

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           + F G  Y +PAWSVSILPDC+NV +NTAKV +Q         +QKN  E   +S  +SW
Sbjct: 406 IVFKGTEYNVPAWSVSILPDCENVGYNTAKVKTQT----AIMVKQKNEAEDQPSSLKWSW 461

Query: 425 YEEK---VGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
             E      + G        L +Q     D SDYLWY  S+H+    P    ++ L +  
Sbjct: 462 IPENTHTTSLLGKGHAHARQLIDQKAAANDASDYLWYMTSLHIKKDDPVWSSDMSLRVNG 521

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    +VN K +   +  +   +++  K ++L  G N + +LS  VGLQNYG  FD+ 
Sbjct: 522 SGHVLHAYVNGKHLGSQFAKYGVFSYVFEKSLKLRPGKNVISLLSATVGLQNYGPMFDLV 581

Query: 539 GAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
             G+   + I    G     +DLSS +W Y VG+ G +  L   +  ++S W +   LP 
Sbjct: 582 QTGIPGPVEIIGHRGDEKVVKDLSSHKWSYSVGLNGFHNELYSSNSRHASRWVE-QDLPT 640

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC-TKKCDY 653
           NK +IWYKTTF AP GK P+ L+L  MGKG AWVNG +IGRYW ++LA   GC T+ CDY
Sbjct: 641 NKMMIWYKTTFKAPLGKDPVVLDLQGMGKGFAWVNGNNIGRYWPSFLAEEDGCSTEVCDY 700

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
           RG+YD +KC  +CG+P Q  YH+PR++ +  EN LV+ EE GG+P+ ++  T T   +  
Sbjct: 701 RGAYDNNKCVTNCGKPTQRWYHVPRSFFNDYENTLVLFEEFGGNPAGVNFQTVTVGKVSG 760

Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
              E +                   + L+C  G  I+AI FAS+G P+G  G++  G C 
Sbjct: 761 SAGEGE------------------TIELSC-NGKSISAIEFASFGDPQGTSGAYVKGTCE 801

Query: 774 --MDVLPIVQKACVGQIECSIPVSSAYLG-VSAGACPGLLKALAVEAHC 819
              D   IVQKACVG+  C +  S    G  S G+   ++  LAV+A C
Sbjct: 802 GSNDAFSIVQKACVGKETCKLEASKDVFGPTSCGS--DVVNTLAVQATC 848


>gi|449452747|ref|XP_004144120.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
          Length = 782

 Score =  779 bits (2012), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/715 (53%), Positives = 489/715 (68%), Gaps = 24/715 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S +VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HE
Sbjct: 81  SRSVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHE 140

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT 
Sbjct: 141 PSPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTD 200

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F+ KI+D+MK E LF +QGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA
Sbjct: 201 NAPFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAA 260

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN   KP +WTEN+SGW+ +FG 
Sbjct: 261 QMAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGG 320

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+RP ED+AF+VARF + GG+  NYYMY GGTNFGRT+ G  V TSYD+DAPIDEYG 
Sbjct: 321 PTPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-GLFVTTSYDFDAPIDEYGL 379

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +R+PKWGHLR+LHKAIKLCE  L+S+DPT   LG   EA ++  SS  CAAFLANYD+S+
Sbjct: 380 LREPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEARVFKSSSGACAAFLANYDTSA 439

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V F  + Y LP WS+SILPDCK V FNT  +              K+    +   S+
Sbjct: 440 FVRVNFWNHPYDLPPWSISILPDCKTVTFNTGSLQ----------IGVKSYEAKMTPISS 489

Query: 422 FSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
           F W    EE        +  +  L EQ++ T DT+DYLWY  SI +   +     G+   
Sbjct: 490 FWWLSYKEEPASAYAQDTTTKDGLVEQVSVTWDTTDYLWYILSIRIDSTEGFLKSGQWPL 549

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S GH   VF+N +L    YG+ +      +K + L +G+N L +LS+ VGL N G 
Sbjct: 550 LTVNSAGHILHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGL 609

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            FD   AG+   V L  L  G RD+S  +W Y+VG+ GE + L  +  +NS  W +GS  
Sbjct: 610 HFDTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQ 669

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L WYKTTF  P G  PLAL+++SM KGQ WVNG+SIGRY+  Y+A   G   KC 
Sbjct: 670 --KQPLTWYKTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIA--RGKCNKCS 725

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           Y G +   KC  +CG P+Q  YHIPR W+ P  NLL+I EE+GG+P  ISL+ +T
Sbjct: 726 YTGFFTEKKCLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 780


>gi|4538943|emb|CAB39679.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|7269465|emb|CAB79469.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 729

 Score =  779 bits (2011), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/718 (51%), Positives = 508/718 (70%), Gaps = 22/718 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25  VKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 85  EPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW  G  G+ Y KW 
Sbjct: 145 DNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A  L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS +KP MWTEN++GWF  FG
Sbjct: 205 AEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G  +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PK+ HL+ LHK IKLCE  L+S+DPT   LG K EAH++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF-KSKSSCAAFLSNYNTS 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F G+ Y LP WSVSILPDCK   +NTAKV   R +  H         +++  ++
Sbjct: 383 SAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV-QVRTSSIH--------MKMVPTNT 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
            FSW  Y E++   + N +F +  L EQI+ T+D +DY WY   I + P +    G++  
Sbjct: 434 PFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 493

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GHA  VFVN +L    YG+ +      ++KI+L+ G+N L +LS   GL N G 
Sbjct: 494 LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGV 553

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIY-QVGVEGEYIGLDKISLANSSFWKQGST 591
            ++    G+   V L  + +G  D++  +W Y Q+G +GE + +  ++ +++  WK+GS 
Sbjct: 554 HYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKQIGTKGEALSVHTLAGSSTVEWKEGSL 613

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK+TF +P G  PLAL++ +MGKGQ W+NGQ+IGR+W AY A   G  ++C
Sbjct: 614 VAKKQPLTWYKSTFDSPTGNEPLALDMNTMGKGQMWINGQNIGRHWPAYTA--RGKCERC 671

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            Y G++   KC  +CG+ +Q  YH+PR+W+ P  NL+++ EE GG+P+ ISL+ +T +
Sbjct: 672 SYAGTFTEKKCLSNCGEASQRWYHVPRSWLKPTNNLVIVLEEWGGEPNGISLVKRTAK 729


>gi|61162199|dbj|BAD91081.1| beta-D-galactosidase [Pyrus pyrifolia]
          Length = 725

 Score =  778 bits (2008), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/712 (52%), Positives = 494/712 (69%), Gaps = 17/712 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V YDH+A++I+G+RR+L SGSIHYPRSTP +WP+LI+K+K GGL+VI+TYVFWN HE
Sbjct: 23  SASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFP+WL ++PGI FRT 
Sbjct: 83  PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIWLKYVPGIAFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+++MK E LF +QGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPW+MC+QEDAPDP+I+TCNG+YC+ F PN   KP MWTE ++GW+  FG 
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKMWTEVWTGWYTEFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           A+P RP EDLAF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 263 AIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           ++QPKWGHLR+LHKAIK CE  L++ DP+  KLG   EAH+++ S + CAAFLAN+D+  
Sbjct: 323 LQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFN-SKSGCAAFLANHDTKY 381

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V+F    Y LP WS+SILPDCK  VFNTAKV  + +       Q K V   L   S 
Sbjct: 382 SVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASE-----VQMKPVYSRLPWQSF 436

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
                E        +     L EQI  T+D +DYLWY   I +   +     GK   L I
Sbjct: 437 IE---ETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEAFLKNGKFPLLTI 493

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VF+N +L    YG+ +      ++ ++L  GIN L +LS+ VGL N G  F+
Sbjct: 494 FSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFE 553

Query: 537 VAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   I L  L  G  D+S  +W Y++G++GE +GL  ++ ++S  W +G ++   
Sbjct: 554 TWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESLGLHTVTGSSSVDWAEGPSMAQK 613

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK TF AP G  PLAL++ SMGKGQ W+NGQS+GR+W  Y+A   G    C Y G
Sbjct: 614 QPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--QGSCGNCYYAG 671

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +++  KC+ +CG+P+Q  YHIPR+W+ P  NLLV+ EE GGDPS +SL+ + 
Sbjct: 672 TFNDKKCRTYCGKPSQRWYHIPRSWLTPTGNLLVVFEEWGGDPSWMSLVERV 723


>gi|357438127|ref|XP_003589339.1| Beta-galactosidase [Medicago truncatula]
 gi|355478387|gb|AES59590.1| Beta-galactosidase [Medicago truncatula]
          Length = 745

 Score =  777 bits (2007), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/717 (53%), Positives = 493/717 (68%), Gaps = 26/717 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25  IHCTVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKDGGLDVIDTYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR+DLV+F+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 85  EPSPGNYNFEGRYDLVQFIKTVQKKGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+ENEYG    A G  G  Y  WA
Sbjct: 145 DNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGASGHAYSNWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+++DAPDP+IN CNGFYCD F+PN P KP +WTE++SGWF  FG
Sbjct: 205 AKMAVGLGTGVPWVMCKEDDAPDPVINACNGFYCDDFSPNKPYKPKLWTESWSGWFSEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            + P RPVEDLAFAVARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 265 GSNPQRPVEDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 324

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +R+PK+GHL++LHKAIK CE  L+SSDPT   LGA  +AH++  S   CAAFLANY S+
Sbjct: 325 LLREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVF-SSGTTCAAFLANYHSN 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A VTFN   Y LP WS+SILPDC+  VFNTA++            Q   +  L   S 
Sbjct: 384 SAARVTFNNRHYDLPPWSISILPDCRTDVFNTARM----------RFQPSQIQMLPSNSK 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV------MPGQGKE 471
             SW  Y+E V  ++ +       L EQI+ T+DTSDYLWY  S+ +      + G+ K 
Sbjct: 434 LLSWETYDEDVSSLAESSRITASRLLEQIDATRDTSDYLWYITSVDISSSESFLRGRNKP 493

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             +++ S G A  VF+N K     +G  +  +F  N  I+L  G N + +LS+ VGL N 
Sbjct: 494 S-ISVHSSGDAVHVFINGKFSGSAFGTREDRSFTFNGPIDLRAGTNKIALLSVAVGLPNG 552

Query: 532 GAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           G  F+   +G+   V+L DL +G++DL+  +W YQVG++GE + L   +  +S  W   S
Sbjct: 553 GIHFESWKSGITGPVLLHDLDHGQKDLTGQKWSYQVGLKGEAMNLVSPNGVSSVDWVSES 612

Query: 591 TLPVNK-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
               N+  L W+K  F AP G  PLAL+++SMGKGQ W+NGQSIGRYW  Y   + G   
Sbjct: 613 LASQNQPQLKWHKAHFNAPNGVEPLALDMSSMGKGQVWINGQSIGRYWMVY---AKGNCN 669

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
            C+Y G+Y  +KCQ  CGQP Q  YH+PR+W+ P  NL+V+ EELGG+P KISL+ +
Sbjct: 670 SCNYAGTYRQAKCQVGCGQPTQRWYHVPRSWLKPKNNLMVVFEELGGNPWKISLVKR 726


>gi|3860321|emb|CAA10128.1| beta-galactosidase [Cicer arietinum]
          Length = 745

 Score =  777 bits (2006), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/717 (53%), Positives = 490/717 (68%), Gaps = 23/717 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +VTYD +A++I+G+RR+L SGSIHYPRSTPE+W +LI+K+K GGL+VI+TYVFWN H
Sbjct: 24  IHCSVTYDRKAIIINGQRRILISGSIHYPRSTPEMWEDLIQKAKVGGLDVIDTYVFWNVH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP    Y FEGR+DLVRF+KTVQ+ GL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84  EPSPSNYNFEGRYDLVRFIKTVQKVGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+ENEYG    A G  G  Y  WA
Sbjct: 144 DNGPFKAAMQGFTQKIVQMMKNEKLFQSQGGPIILSQIENEYGPQGRALGAVGHAYSNWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L T VPWVMC+++DAPDP+IN+CNGFYCD F+PN P KP +WTE++SGWF  FG
Sbjct: 204 AKMAVGLGTGVPWVMCKEDDAPDPVINSCNGFYCDDFSPNKPYKPKLWTESWSGWFSEFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP RP +DLAFAVARF + GG+F NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG
Sbjct: 264 GPVPQRPAQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +R+PK+GHL++LHKAIK CE  L+SSDPT   LGA  +AH++   +  CAAFLANY S+
Sbjct: 324 LLREPKYGHLKDLHKAIKQCEHALVSSDPTVTSLGAYEQAHVFSSGTQTCAAFLANYHSN 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A VTFN   Y LP WS+SILPDCK  VFNTA+V            Q   +  L   S 
Sbjct: 384 SAARVTFNNRHYDLPPWSISILPDCKTDVFNTARV----------RFQNSKIQMLPSNSK 433

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW  Y+E V  ++ +       L EQIN T+DTSDYLWY  S+ + P +     G + 
Sbjct: 434 LLSWETYDEDVSSLAESSRITASGLLEQINATRDTSDYLWYITSVDISPSESFLRGGNKP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            +++ S G A  VF+N K     +G  +  +   N  I L+ G N + +LS+ VGL N G
Sbjct: 494 SISVHSSGDAVHVFINGKFSGSAFGTREQRSCTFNGPINLHAGTNKIALLSVAVGLPNGG 553

Query: 533 AWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   IL+  L +G++DL+  +W YQVG++GE + L   +  +S  W + S 
Sbjct: 554 IHFESWKTGITGPILLHGLDHGQKDLTWQKWSYQVGLKGEAMNLVSPNGVSSVDWVRESL 613

Query: 592 LPVNK-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
              N+  L W+K  F AP+G   LAL+++ MGKGQ W+NGQSIGRYW  Y   + G    
Sbjct: 614 ASQNQPQLKWHKAYFNAPDGNEALALDMSGMGKGQVWINGQSIGRYWLVY---AKGNCNS 670

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           C+Y G+Y  +KCQ  CGQP Q  YH+PR+W+ P  NL+V+ EELGG+P KISL+ +T
Sbjct: 671 CNYAGTYRQAKCQLGCGQPTQRWYHVPRSWLKPTNNLMVVFEELGGNPWKISLVKRT 727


>gi|224077880|ref|XP_002305449.1| predicted protein [Populus trichocarpa]
 gi|222848413|gb|EEE85960.1| predicted protein [Populus trichocarpa]
          Length = 731

 Score =  776 bits (2004), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/716 (52%), Positives = 486/716 (67%), Gaps = 23/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  NVTYD +AL+I+G+R+VL SGSIHYPRSTPE+W  LI+K+K+GGL+VI+TYVFWN H
Sbjct: 24  IQCNVTYDKKALIINGQRKVLFSGSIHYPRSTPEMWEGLIQKAKDGGLDVIDTYVFWNLH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y F+GR+DLVRF+K V EAGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT
Sbjct: 84  EPSPGNYNFDGRYDLVRFIKLVHEAGLYVHLRIGPYICAEWNFGGFPVWLKYVPGISFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK ENLF SQGGPIIL+Q+ENEY     A+G  G  Y+ WA
Sbjct: 144 DNEPFKSAMQKFTQKIVQMMKDENLFESQGGPIILSQIENEYEPESKAFGSPGHAYMTWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A++++T VPWVMC++ DAPDP+INTCNGFYCD F+PN P KP MWTE ++GWF  FG
Sbjct: 204 AHMAISMDTGVPWVMCKEFDAPDPVINTCNGFYCDYFSPNKPYKPTMWTEAWTGWFTDFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                RP EDLAFAVARF + GG+  NYYMY GGTNFGRT+GGP + TSYDYDAPIDEYG
Sbjct: 264 GPNHQRPAEDLAFAVARFIQKGGSLVNYYMYHGGTNFGRTSGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL+ELHKAIKLCE+ L+++D T   LG+  +AH++   S  CAAFL+NY++ 
Sbjct: 324 LIRQPKYGHLKELHKAIKLCEKALLAADSTVTSLGSYEQAHVFSSDSGGCAAFLSNYNTK 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
             A V FN   Y LP WS+SILPDCKNVVFNTA          H   Q   V+ L   S 
Sbjct: 384 QAARVKFNNIQYSLPPWSISILPDCKNVVFNTA----------HVGVQTSQVHMLPTDSE 433

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
             SW    E+   +  ++      L EQ+N T+DTSDYLWYT S+H+   +     G+  
Sbjct: 434 LLSWETFNEDISSVDDDKMITVAGLLEQLNITRDTSDYLWYTTSVHISSSESFLRGGRLP 493

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L ++S GHA  VF+N +L    +G  +   F   + ++ + G N + +LS+ VGL N G
Sbjct: 494 VLTVQSAGHALHVFINGELSGSAHGTREQRRFTFTEDMKFHAGKNRISLLSVAVGLPNNG 553

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  L  G+RDL+  +W Y+VG++GE + L      +   W QGS 
Sbjct: 554 PRFETWNTGILGPVTLHGLDEGQRDLTWQKWSYKVGLKGEDMNLRSRKSVSLVDWIQGSL 613

Query: 592 LP-VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           +    + L WYK  F +P+G  PLAL++ SMGKGQ W+NG SIGRYW+ Y   + G    
Sbjct: 614 MVGKQQPLTWYKAYFNSPKGDDPLALDMGSMGKGQVWINGHSIGRYWTLY---AEGNCSG 670

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           C Y  ++  ++CQ  CGQP Q  YH+PR+W+    NLLV+ EE+GGD S+ISL+ +
Sbjct: 671 CSYSATFRPARCQLGCGQPTQKWYHVPRSWLKSTRNLLVLFEEIGGDASRISLVKR 726


>gi|84579371|dbj|BAE72074.1| pear beta-galactosidase2 [Pyrus communis]
          Length = 725

 Score =  776 bits (2003), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/712 (52%), Positives = 493/712 (69%), Gaps = 17/712 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           SA+V YDH+A++I+G+RR+L SGSIHYPRSTP +WP+LI+K+K GGL+VI+TYVFWN HE
Sbjct: 23  SASVGYDHKAIIINGQRRILISGSIHYPRSTPGMWPDLIQKAKAGGLDVIQTYVFWNGHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYFE R+DLV+F+K VQ+AGLF++LRIGPY CAEWN+GGFP+WL ++PGI FRT 
Sbjct: 83  PSPGKYYFEDRYDLVKFIKLVQQAGLFVNLRIGPYVCAEWNFGGFPIWLKYVPGIAFRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M++F  KI+++MK E LF +QGGPIIL+Q+ENE+G VEW  G  G+ Y KWAA
Sbjct: 143 NEPFKAAMQKFTEKIVNMMKAEKLFQTQGGPIILSQIENEFGPVEWEIGAPGKAYTKWAA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV L+T VPW+MC+QEDAPDP+I+TCNG+YC+ F PN   KP MWTE ++GW+  FG 
Sbjct: 203 QMAVGLDTGVPWIMCKQEDAPDPVIDTCNGYYCENFKPNKVYKPKMWTEVWTGWYTEFGG 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           A+P RP EDLAF+VARF ++GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG 
Sbjct: 263 AIPTRPAEDLAFSVARFIQSGGSFFNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEYGL 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           ++QPKWGHLR+LHKAIK CE  L++ DP+  KLG   EAH+++ S + CAAFLANYD+  
Sbjct: 323 LQQPKWGHLRDLHKAIKSCEHALVAVDPSVTKLGNNQEAHVFN-SKSGCAAFLANYDTKY 381

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V+F    Y LP WS+SILPDCK  VFNTAKV  + +       Q K V   L   S 
Sbjct: 382 SVRVSFGHGQYDLPPWSISILPDCKTAVFNTAKVAWKASE-----VQMKPVYSRLPWQSF 436

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
                E        +     L EQI  T+D +DYLWY   I +   +     GK   L I
Sbjct: 437 IE---ETTTSDETGTTTLDGLYEQIYMTRDATDYLWYMTDITIGSDEAFLKNGKFPLLTI 493

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA  VF+N +L    YG+ +      ++ ++L  GIN L +LS+ VGL N G  F+
Sbjct: 494 FSAGHALHVFINGQLSGTVYGSLENPKLTFSQNVKLRPGINKLALLSISVGLPNVGTHFE 553

Query: 537 VAGAGLFSVI-LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   I L  L  G  D+S  +W Y++G++GE +GL  ++ ++S  W +G ++   
Sbjct: 554 TWNTGVLGPISLKGLNTGTWDMSRWKWTYKIGMKGESLGLHTVTGSSSVDWAEGPSMAQK 613

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK TF AP G  PLAL++ SMGKGQ W+NGQS+GR+W  Y+A   G    C Y G
Sbjct: 614 QPLTWYKATFDAPPGHAPLALDMGSMGKGQIWINGQSVGRHWPGYIA--QGSCGNCYYAG 671

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +++  KC+ +CG+P+Q   HIPR+W+ P  NLLV+ EE GGDPS +SL+ + 
Sbjct: 672 TFNDKKCRTYCGKPSQRWCHIPRSWLTPTGNLLVVFEEWGGDPSWMSLVERV 723


>gi|356529081|ref|XP_003533125.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 832

 Score =  776 bits (2003), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/828 (47%), Positives = 518/828 (62%), Gaps = 37/828 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD RA+ IDGKR+VL SGSIHYPRST E+WP LI K+KEGGL+VIETYVFWN HEP  
Sbjct: 22  VSYDSRAITIDGKRKVLFSGSIHYPRSTAEMWPSLINKAKEGGLDVIETYVFWNAHEPQP 81

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DLV+F+KT+Q+ GL+  LRIGPY CAEWNYGGFPVWLH +P ++FRT N  
Sbjct: 82  RQYDFSGNLDLVKFIKTIQKEGLYAMLRIGPYVCAEWNYGGFPVWLHNMPNMEFRTNNTA 141

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +  EM+ F   I+D M+ ENLFASQGGPIILAQ+ENEYGN+   YG  G+ YV+W A  A
Sbjct: 142 YMNEMQTFTTLIVDKMRHENLFASQGGPIILAQIENEYGNIMSEYGENGKQYVQWCAQLA 201

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +    VPWVMCQQ DAPDPIINTCNG+YCD F+PNS SKP MWTEN++GWF ++G  +P
Sbjct: 202 ESYKIGVPWVMCQQSDAPDPIINTCNGWYCDQFSPNSKSKPKMWTENWTGWFKNWGGPIP 261

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R   D+A+AVARFF+ GGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG   Q
Sbjct: 262 HRTARDVAYAVARFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNKNQ 321

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PKWGHL++LH+ +K  E+ L      H   G  L A +Y+ S    A FL N +SS+DA 
Sbjct: 322 PKWGHLKQLHELLKSMEDVLTQGTTNHTDYGNLLTATVYNYSGKS-ACFLGNANSSNDAT 380

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR-------NNGDHPFAQQKNVNELLL 417
           + F    Y +PAWSVSILP+C N V+NTAK+ +Q        N  D+       +N   +
Sbjct: 381 IMFQSTQYIVPAWSVSILPNCVNEVYNTAKINAQTSIMVMKDNKSDNEEEPHSTLNWQWM 440

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
                   + +V  S +R   +  L +Q   T DTSDYLWY  S+ +         + + 
Sbjct: 441 HEPHVQMKDGQVLGSVSRKAAQ--LLDQKVVTNDTSDYLWYITSVDISENDPIWSKIRVS 498

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           + GH   VFVN     + YG +   +F    KI+L +G N + +LS  VGL NYGA F  
Sbjct: 499 TNGHVLHVFVNGAQAGYQYGQNGKYSFTYEAKIKLKKGTNEISLLSGTVGLPNYGAHFSN 558

Query: 538 AGAGLFS-VILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
              G+   V L+ L+N     +D+++  W Y+VG+ GE + L      N+  W     LP
Sbjct: 559 VSVGVCGPVQLVALQNNTEVVKDITNNTWNYKVGLHGEIVKL--YCPENNKGWNTNG-LP 615

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
            N+  +WYKT F +P+G  P+ ++L  + KGQAWVNG +IGRYW+ YLA   GCT  C+Y
Sbjct: 616 TNRVFVWYKTLFKSPKGTDPVVVDLKGLKKGQAWVNGNNIGRYWTRYLADDNGCTATCNY 675

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWV-HPGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           RG Y + KC   CG+P Q  YH+PR+++    +N LV+ EE GG P+++   T   + IC
Sbjct: 676 RGPYSSDKCITKCGRPTQRWYHVPRSFLRQDNQNTLVLFEEFGGHPNEVKFATVMVEKIC 735

Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           +          +S++ N+        + L+C     I+ I FAS+G+PEG CGSF+   C
Sbjct: 736 A----------NSYEGNV--------LELSCREEQVISKIKFASFGVPEGECGSFKKSQC 777

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
              + L I+ K+C+G+  CS+ VS   LG +    P     LA+EA C
Sbjct: 778 ESPNALSILSKSCLGKQSCSVQVSQRMLGPTGCRMPQNQNKLAIEAVC 825


>gi|115468642|ref|NP_001057920.1| Os06g0573600 [Oryza sativa Japonica Group]
 gi|75112285|sp|Q5Z7L0.1|BGAL9_ORYSJ RecName: Full=Beta-galactosidase 9; Short=Lactase 9; Flags:
           Precursor
 gi|54291174|dbj|BAD61846.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|113595960|dbj|BAF19834.1| Os06g0573600 [Oryza sativa Japonica Group]
          Length = 715

 Score =  775 bits (2002), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/710 (53%), Positives = 489/710 (68%), Gaps = 26/710 (3%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYDHR+L I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP++G
Sbjct: 23  TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYF  R+DLVRFVK V++AGL+++LRIGPY CAEWNYGGFPVWL ++PGI FRT N PF
Sbjct: 83  QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + YV WAA  AV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
             N  VPW+MC+Q+DAPDP+INTCNGFYCD FTPNS +KP MWTE +SGWF +FG  VP 
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG +RQP
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHL  LHKAIK  E  L++ DPT Q +G   +A+++  SS DCAAFL+N+ +S+ A V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
            FNG  Y LPAWS+S+LPDC+  V+NTA V +  +               +  +  F+W 
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAK------------MNPAGGFTWQ 430

Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
            Y E        +F +  L EQ++ T D SDYLWYT  +++  G+     G+   L + S
Sbjct: 431 SYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYS 490

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH+  VFVN +     YG +D      +  +++ +G N + ILS  VGL N G  ++  
Sbjct: 491 AGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETW 550

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             G+   V L  L  GKRDLS  +W YQ+G++GE +G+  +S ++S  W   +     + 
Sbjct: 551 NIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAG---KQP 607

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           + W++  F AP G  P+AL+L SMGKGQAWVNG  IGRYWS     ++G    C Y G+Y
Sbjct: 608 VTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWS---YKASGNCGGCSYAGTY 664

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
              KCQ +CG  +Q  YH+PR+W++P  NL+V+ EE GGD S ++L+T+T
Sbjct: 665 SEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTRT 714


>gi|356509962|ref|XP_003523711.1| PREDICTED: beta-galactosidase 3-like isoform 2 [Glycine max]
          Length = 729

 Score =  775 bits (2001), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/716 (53%), Positives = 487/716 (68%), Gaps = 32/716 (4%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD ++L+I+G+RR+L SGSIHYPRSTPE+W +LI K+K GGL+VI+TYVFW+ HEP 
Sbjct: 29  NVTYDRKSLLINGQRRILISGSIHYPRSTPEMWEDLIWKAKHGGLDVIDTYVFWDVHEPS 88

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G Y FEGR+DLVRF+KTVQ+ GL+ +LRIGPY CAEWN+GG PVWL ++PG+ FRT N 
Sbjct: 89  PGNYDFEGRYDLVRFIKTVQKVGLYANLRIGPYVCAEWNFGGIPVWLKYVPGVSFRTDNE 148

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+ENEYG    + G  G  YV WAA  
Sbjct: 149 PFKAAMQGFTQKIVQMMKSEKLFQSQGGPIILSQIENEYGPE--SRGAAGRAYVNWAASM 206

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV L T VPWVMC++ DAPDP+IN+CNGFYCD F+PN P KP MWTE +SGWF  FG  +
Sbjct: 207 AVGLGTGVPWVMCKENDAPDPVINSCNGFYCDDFSPNKPYKPSMWTETWSGWFTEFGGPI 266

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             RPVEDL+FAVARF + GG++ NYYMY GGTNFGR+AGGP + TSYDYDAPIDEYG IR
Sbjct: 267 HQRPVEDLSFAVARFIQKGGSYVNYYMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIR 326

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPK+ HL+ELHKAIK CE  L+S DPT   LG  L+AH++   +  CAAFLANY++ S A
Sbjct: 327 QPKYSHLKELHKAIKRCEHALVSLDPTVLSLGTLLQAHVFSSGTGTCAAFLANYNAQSAA 386

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            VTFN   Y LP WS+SILPDCK  VFNTAK                 V  L +    FS
Sbjct: 387 TVTFNNRHYDLPPWSISILPDCKIDVFNTAK-----------------VKMLPVKPKLFS 429

Query: 424 W--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           W  Y+E +  ++ +     P L EQ+N T+DTSDYLWY  S+ +   +     G++  +N
Sbjct: 430 WESYDEDLSSLAESSRITAPGLLEQLNVTRDTSDYLWYITSVDISSSESFLRGGQKPSIN 489

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           ++S GHA  VFVN +     +G  +  +   N  ++L  G N + +LS+ VGLQN G  +
Sbjct: 490 VQSAGHAVHVFVNGQFSGSAFGTREQRSCTYNGPVDLRAGANKIALLSVTVGLQNVGRHY 549

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           +   AG+   V+L  L  G++DL+  +W Y+VG+ GE + L   +  +S  W Q S    
Sbjct: 550 ETWEAGITGPVLLHGLDQGQKDLTWNKWSYKVGLRGEAMNLVSPNGVSSVDWVQESQATQ 609

Query: 595 NKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           ++S L WYK  F AP GK PLAL+L SMGKGQ W+NGQSIGRYW AY   + G    C Y
Sbjct: 610 SRSQLKWYKAYFDAPGGKEPLALDLESMGKGQVWINGQSIGRYWMAY---AKGDCNSCTY 666

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            G++   KCQ  CGQP Q  YH+PR+W+ P +NL+V+ EELGG+P KISL+ +   
Sbjct: 667 SGTFRPVKCQLGCGQPTQRWYHVPRSWLKPTKNLIVVFEELGGNPWKISLVKRVAH 722


>gi|125555810|gb|EAZ01416.1| hypothetical protein OsI_23450 [Oryza sativa Indica Group]
          Length = 717

 Score =  775 bits (2000), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/710 (53%), Positives = 489/710 (68%), Gaps = 26/710 (3%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYDHR+L I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP++G
Sbjct: 25  TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 84

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYF  R+DLVRFVK V++AGL+++LRIGPY CAEWNYGGFPVWL ++PGI FRT N PF
Sbjct: 85  QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 144

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + YV WAA  AV
Sbjct: 145 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 204

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
             N  VPW+MC+Q+DAPDP+INTCNGFYCD FTPNS +KP MWTE +SGWF +FG  VP 
Sbjct: 205 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 264

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG +RQP
Sbjct: 265 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 324

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHL  LHKAIK  E  L++ DPT Q +G   +A+++  SS DCAAFL+N+ +S+ A V
Sbjct: 325 KWGHLTNLHKAIKQAEPALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 384

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
            FNG  Y LPAWS+S+LPDC+  V+NTA V +  +               +  +  F+W 
Sbjct: 385 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAK------------MNPAGGFTWQ 432

Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
            Y E        +F +  L EQ++ T D SDYLWYT  +++  G+     G+   L + S
Sbjct: 433 SYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYS 492

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH+  VFVN +     YG +D      +  +++ +G N + ILS  VGL N G  ++  
Sbjct: 493 AGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETW 552

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             G+   V L  L  GKRDLS  +W YQ+G++GE +G+  +S ++S  W   +     + 
Sbjct: 553 NIGVLGPVTLSGLNEGKRDLSKQKWTYQIGLKGEKLGVHSVSGSSSVEWGGAAG---KQP 609

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           + W++  F AP G  P+AL+L SMGKGQAWVNG  IGRYWS     ++G    C Y G+Y
Sbjct: 610 VTWHRAYFNAPAGGAPVALDLGSMGKGQAWVNGHLIGRYWS---YKASGNCGGCSYAGTY 666

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
              KCQ +CG  +Q  YH+PR+W++P  NL+V+ EE GGD S ++L+T+T
Sbjct: 667 SEKKCQANCGDASQRWYHVPRSWLNPSGNLVVLLEEFGGDLSGVTLMTRT 716


>gi|186510990|ref|NP_190852.2| beta-galactosidase 2 [Arabidopsis thaliana]
 gi|332278160|sp|Q9LFA6.2|BGAL2_ARATH RecName: Full=Beta-galactosidase 2; Short=Lactase 2; Flags:
           Precursor
 gi|13605857|gb|AAK32914.1|AF367327_1 AT3g52840/F8J2_10 [Arabidopsis thaliana]
 gi|6686876|emb|CAB64738.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|23308221|gb|AAN18080.1| At3g52840/F8J2_10 [Arabidopsis thaliana]
 gi|332645478|gb|AEE78999.1| beta-galactosidase 2 [Arabidopsis thaliana]
          Length = 727

 Score =  774 bits (1998), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/716 (52%), Positives = 493/716 (68%), Gaps = 24/716 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27  AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PG+ FRT N
Sbjct: 87  SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDN 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+D+MK+E LF +QGGPIIL+Q+ENEYG ++W  G  G+ Y KW A+
Sbjct: 147 EPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAE 206

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L+T VPW+MC+QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF  FG A
Sbjct: 207 MALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +P RPVED+AF+VARF + GG+F NYYMY+GGTNF RTA G  +ATSYDYDAPIDEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-GVFIATSYDYDAPIDEYGLL 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R+PK+ HL+ELHK IKLCE  L+S DPT   LG K E H++ KS   CAAFL+NYD+SS 
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKTSCAAFLSNYDTSSA 384

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F G  Y LP WSVSILPDCK   +NTAK+ +       P    K    ++  S+ F
Sbjct: 385 ARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MIPTSTKF 433

Query: 423 SWYEEKVGISGNR---SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW     G   +    +FV+  L EQI+ T+D +DY WY   I +   +     G    L
Sbjct: 434 SWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLL 493

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VFVN  L    YG    +    ++ I+L+ GIN L +LS  VGL N G  
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVH 553

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++    G+   V L  + +G  D+S  +W Y++G+ GE + L  ++ +++  W     + 
Sbjct: 554 YETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVV 613

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK++F  P G  PLAL++ +MGKGQ WVNG +IGR+W AY A   G   +C+Y
Sbjct: 614 KKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTA--RGNCGRCNY 671

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            G Y+  KC  HCG+P+Q  YH+PR+W+ P  NLLVI EE GGDPS ISL+ +T +
Sbjct: 672 AGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 727


>gi|356545784|ref|XP_003541315.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 826

 Score =  774 bits (1998), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/832 (48%), Positives = 535/832 (64%), Gaps = 41/832 (4%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  V++D RA++IDGKRRVL SGSIHYPRSTPE+WPELI+K+KEGGL+ IETYVFWN HE
Sbjct: 22  AVEVSHDGRAIIIDGKRRVLLSGSIHYPRSTPEMWPELIQKAKEGGLDAIETYVFWNAHE 81

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P R  Y F G  D++RF+KT+QE+GL+  LRIGPY CAEWNYGG PVW+H +P ++ RT 
Sbjct: 82  PSRRVYDFSGNNDIIRFLKTIQESGLYGVLRIGPYVCAEWNYGGIPVWVHNLPDVEIRTA 141

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N+ +  EM+ F   I+D++K+E LFASQGGPIIL Q+ENEYGNV   YG  G+ Y+ W A
Sbjct: 142 NSVYMNEMQNFTTLIVDMVKKEKLFASQGGPIILTQIENEYGNVISHYGDAGKAYMNWCA 201

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A +LN  VPW+MCQ+ DAP  +INTCNGFYCD F PN+PS P MWTEN+ GWF ++G 
Sbjct: 202 NMAESLNVGVPWIMCQESDAPQSMINTCNGFYCDNFEPNNPSSPKMWTENWVGWFKNWGG 261

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P R  ED+AFAVARFF+TGGTFQNYYMY GGTNF RTAGGP + TSYDYDAP+DEYG 
Sbjct: 262 RDPHRTAEDVAFAVARFFQTGGTFQNYYMYHGGTNFDRTAGGPYITTSYDYDAPLDEYGN 321

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           I QPKWGHL+ELH  +K  EE L S + +    G  ++A IY  ++   + FL++ ++++
Sbjct: 322 IAQPKWGHLKELHNVLKSMEETLTSGNVSETDFGNSVKATIY-ATNGSSSCFLSSTNTTT 380

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DA +TF G  Y +PAWSVSILPDC++  +NTAKV  Q +       ++ +  E    +  
Sbjct: 381 DATLTFRGKNYTVPAWSVSILPDCEHEEYNTAKVNVQTS----VMVKENSKAEEEATALK 436

Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNI 476
           + W  E +   + G  +     L +Q +   D SDYLWY   +HV    P  G+ + L I
Sbjct: 437 WVWRSENIDNALHGKSNVSANRLLDQKDAANDASDYLWYMTKLHVKHDDPVWGENMTLRI 496

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH    FVN + +   +  +   N     KI+L  G NT+ +LS+ VGLQNYGA+FD
Sbjct: 497 NSSGHVIHAFVNGEHIGSHWATYGIHNDKFEPKIKLKHGTNTISLLSVTVGLQNYGAFFD 556

Query: 537 VAGAGLFSVI-LIDLKNGK---RDLSSGEWIYQVGVEG--EYIGLDKISLANSSFWKQGS 590
              AGL   I L+ +K  +   ++LSS +W Y+VG+ G    +  D    A  + W +  
Sbjct: 557 TWHAGLVEPIELVSVKGDETIIKNLSSNKWSYKVGLHGWDHKLFSDDSPFAAPNKW-ESE 615

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            LP ++ L WYKTTF AP G  P+ ++L  MGKG AWVNGQ+IGR W +Y A   GC+ +
Sbjct: 616 KLPTDRMLTWYKTTFNAPLGTDPVVVDLQGMGKGYAWVNGQNIGRIWPSYNAEEDGCSDE 675

Query: 651 -CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            CDYRG Y  SKC  +CG+P Q  YH+PR+++  G N LV+  ELGG+PS+++  T    
Sbjct: 676 PCDYRGEYTDSKCVTNCGKPTQRWYHVPRSYLKDGANNLVLFAELGGNPSQVNFQTVVVG 735

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
            +C+   E                  +  + L+C+ G  I+AI FAS+G PEG CG+F  
Sbjct: 736 TVCANAYE------------------NKTLELSCQ-GRKISAIKFASFGDPEGVCGAFTN 776

Query: 770 GACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G+C    + L IVQKACVG+  CS  VS    G +  AC  + K LAVEA C
Sbjct: 777 GSCESKSNALSIVQKACVGKQACSFDVSEKTFGPT--ACGNVAKRLAVEAVC 826


>gi|75134155|sp|Q6Z6K4.1|BGAL4_ORYSJ RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
           Precursor
 gi|46805855|dbj|BAD17189.1| putative beta-galactosidase precursor [Oryza sativa Japonica Group]
          Length = 729

 Score =  773 bits (1997), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/713 (53%), Positives = 489/713 (68%), Gaps = 26/713 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YD R+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 35  NAAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHE 94

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQYYF  R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT 
Sbjct: 95  PVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 154

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM++F+ KI+ +MK E LF  QGGPII++QVENE+G +E   G G + Y  WAA
Sbjct: 155 NGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAA 214

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV  NT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN   KP MWTE ++GWF SFG 
Sbjct: 215 KMAVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGG 274

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G 
Sbjct: 275 GVPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LH+AIK  E  L+S+DPT + +G+  +A+++   +  CAAFL+NY  ++
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V FNG  Y LPAWS+SILPDCK  VFNTA V            ++  +   +     
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVR 442

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
           F+W  Y E      + +F +  L EQ++ T D SDYLWYT  +++       G+   L +
Sbjct: 443 FAWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTV 502

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH+  VFVN K     YG +D      N ++++ +G N + ILS  VGL N G  F+
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
               G+   V L  L  G +DLS  +W YQVG++GE +GL  ++ +++  W   G   P 
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQP- 621

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L W+K  F AP G  P+AL++ SMGKGQ WVNG  +GRYWS     S GC   C Y 
Sbjct: 622 ---LTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYA 675

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G+Y   KC+ +CG  +Q  YH+PR+W+ PG NLLV+ EE GGD + +SL T+T
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATRT 728


>gi|152013361|sp|A2X2H7.1|BGAL4_ORYSI RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
           Precursor
 gi|125538642|gb|EAY85037.1| hypothetical protein OsI_06394 [Oryza sativa Indica Group]
          Length = 729

 Score =  773 bits (1996), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/713 (53%), Positives = 489/713 (68%), Gaps = 26/713 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YD R+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 35  NAAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHE 94

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQYYF  R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT 
Sbjct: 95  PVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 154

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM++F+ KI+ +MK E LF  QGGPII++QVENE+G +E   G G + Y  WAA
Sbjct: 155 NGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAA 214

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV  NT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN   KP MWTE ++GWF SFG 
Sbjct: 215 KMAVRTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGG 274

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G 
Sbjct: 275 GVPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LH+AIK  E  L+S+DPT + +G+  +A+++   +  CAAFL+NY  ++
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V FNG  Y LPAWS+SILPDCK  VFNTA V            ++  +   +     
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVR 442

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
           F+W  Y E      + +F +  L EQ++ T D SDYLWYT  +++       G+   L +
Sbjct: 443 FAWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTV 502

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH+  VFVN K     YG +D      N ++++ +G N + ILS  VGL N G  F+
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
               G+   V L  L  G +DLS  +W YQVG++GE +GL  ++ +++  W   G   P 
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQP- 621

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L W+K  F AP G  P+AL++ SMGKGQ WVNG  +GRYWS     S GC   C Y 
Sbjct: 622 ---LTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYA 675

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G+Y   KC+ +CG  +Q  YH+PR+W+ PG NLLV+ EE GGD + +SL T+T
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATRT 728


>gi|449527779|ref|XP_004170887.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
           sativus]
          Length = 716

 Score =  773 bits (1996), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/712 (53%), Positives = 488/712 (68%), Gaps = 26/712 (3%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A++I+ +RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP  
Sbjct: 22  VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPSE 81

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+YYFE R+DLV F+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT N P
Sbjct: 82  GKYYFEERYDLVGFIKLVQKAGLYVHLRIGPYVCAEWNYGGFPIWLKFVPGIAFRTDNEP 141

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F+ KI+D+MK E L+ +QGGPIIL+Q+ENEYG VEW  G  G+ Y KW A  A
Sbjct: 142 FKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIENEYGPVEWQIGAPGKSYTKWFAQMA 201

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V+L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN   KP +WTEN+SGW+ +FG   P
Sbjct: 202 VDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPTP 261

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
           +RP ED+AF+VARF +  G+  NYY+Y GGTNFGRT+ G  +ATSYD+DAPIDEYG IR+
Sbjct: 262 YRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRTS-GLFIATSYDFDAPIDEYGLIRE 320

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PKWGHLR+LHKAIK CE  L+S+DPT   LG   EA ++ KSS+ CAAFLANYD+S+   
Sbjct: 321 PKWGHLRDLHKAIKSCEPALVSADPTITWLGKNQEARVF-KSSSACAAFLANYDTSASVK 379

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           V F  N Y LP WS+SILPDC  V FNTA+V              K+    ++  S+F W
Sbjct: 380 VNFWNNPYDLPPWSISILPDCXTVTFNTAQV------------GVKSYQAKMMPISSFGW 427

Query: 425 Y---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
               EE        +  +  L EQ++ T DT+DYLWY   I +   +     GK   L++
Sbjct: 428 LSYKEEPASAYAKDTTTKAGLVEQVSITWDTTDYLWYMQDISIDSTEGFLKSGKWPLLSV 487

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH   VF+N +L    YG+ +      +K ++L +G+N L +LS+ VGL N G  FD
Sbjct: 488 NSAGHLLHVFINGQLSGSVYGSLEDPAITFSKNVDLKQGVNKLSMLSVTVGLPNVGLHFD 547

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG+   V L  L  G RD+S  +W Y+VG+ GE + L     +NS  W +GS L   
Sbjct: 548 TWNAGVLGPVTLEGLNEGTRDMSKYKWSYKVGLSGESLNLYSDKGSNSVQWTKGS-LTQK 606

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYKTTF  P G  PL L+++SM KGQ W+NGQSIGRY+  Y+A   G   KC Y G
Sbjct: 607 QPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWINGQSIGRYFPGYIA--NGKCDKCSYAG 664

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            +   KC  +CG+P+Q  YHIPR W+ P +NLLVI EE+GG P  ISL+ +T
Sbjct: 665 LFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEEIGGSPDGISLVKRT 716


>gi|168045683|ref|XP_001775306.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673387|gb|EDQ59911.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 831

 Score =  772 bits (1993), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/834 (48%), Positives = 539/834 (64%), Gaps = 39/834 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++  V+YD RAL +DG RR+L SGSIHYPRSTP +WP LI K+K+GGL+VI+TYVFW+ H
Sbjct: 21  VAVTVSYDQRALKLDGNRRMLVSGSIHYPRSTPTMWPGLIAKAKKGGLDVIQTYVFWSGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP +G Y F GR+DL +F++ V EAG++++LRIGPY CAEWN+GGFP WL F+PGI+FRT
Sbjct: 81  EPTQGVYNFAGRYDLPKFLRLVHEAGMYVNLRIGPYVCAEWNFGGFPGWLRFLPGIEFRT 140

Query: 121 TNNPFKEEMKR-FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
            N  FK  +   F + +I +  +   F  Q   +I AQ+ENEYG+++  YG  G+ Y+ W
Sbjct: 141 DNESFKVHLSHSFTSSLISVYSRS--FNIQ--LVICAQIENEYGSIDAVYGEAGQKYLNW 196

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
            A+ AV  N SVPW+MC Q DAP  +I+TCNGFYCDGF PNS  KP +WTEN++GWF S+
Sbjct: 197 IANMAVATNISVPWIMCNQPDAPPSVIDTCNGFYCDGFRPNSEGKPALWTENWTGWFQSW 256

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G   P RPV+D+AFAVARFF+ GG+F +YYMY GGTNF R+A    V T+YDYDAPIDEY
Sbjct: 257 GEGAPTRPVQDIAFAVARFFQKGGSFMHYYMYHGGTNFERSAMEG-VTTNYDYDAPIDEY 315

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           G +RQPKWGHL++LH A+KLCE  L+  D  P+   LG   EAH+Y+ S+  CAAFLA++
Sbjct: 316 GDVRQPKWGHLKDLHAALKLCELCLVGVDTVPSEISLGPYQEAHVYNSSTGACAAFLASW 375

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
             + D+ V F G  Y LPAWSVSILPDCK+VVFNTAKV             Q     +  
Sbjct: 376 -GTDDSTVLFQGQSYDLPAWSVSILPDCKSVVFNTAKV-----------GVQSMTMTMQS 423

Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKE 471
           A    +W  Y E +   G+ +F   +L EQI TTKDT+DYLWYT ++ V     P    +
Sbjct: 424 AIPVTNWVSYREPLEPWGS-TFSTNELVEQIATTKDTTDYLWYTTNVEVAESDAPNGLAQ 482

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L +  L  AA +FVNK L     G         ++ I L  GIN++ +LSM  GLQ  
Sbjct: 483 ATLVMSYLRDAAHIFVNKWLT----GTKSAHGSEASQSISLRPGINSVKVLSMTTGLQGT 538

Query: 532 GAWFDVAGAGL-FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           G + +   AG+ F + +  L +G   +    W YQVG++GE   L + + + S+ W   +
Sbjct: 539 GPFLEKEKAGIQFGIRVEGLPSGAIIMQRNTWTYQVGLQGENNRLFESNGSLSAVWSTST 598

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            +    SL W+KTTF  PE  G +AL+L+SMGKGQ WVNG ++GRYWS+ +A + GC   
Sbjct: 599 DVSNQMSLSWFKTTFDMPERNGTVALDLSSMGKGQVWVNGINLGRYWSSCIAHTDGCVDN 658

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDYRGS+  SKC   CGQP+Q+ YH+PR W+   +NLLV+ EE  G+P  I++  +  QH
Sbjct: 659 CDYRGSHSESKCLTKCGQPSQSWYHVPREWLLSKQNLLVLFEEQEGNPEAITIAPRIPQH 718

Query: 711 ICSFVSEADPPPVD-SWKPNLGVVSSSPQV---RLACERGWHIAAINFASYGIPEGNCGS 766
           ICS +SE+ P P+  S     G  +S+P +    L C  G HI+ I+FASYG P G+CG 
Sbjct: 719 ICSRMSESHPFPIPLSSSTKRGSQTSTPPIAPLALECADGQHISRISFASYGTPSGDCGD 778

Query: 767 FRPGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           F+  +CH +    ++ KACVG+ +C +P+ S+  G     CPG++K+LA  A C
Sbjct: 779 FKLSSCHANSSKDVLSKACVGRQKCLVPIVSSICG--GDPCPGMIKSLAATAEC 830


>gi|7529708|emb|CAB86888.1| beta-galactosidase precursor-like protein [Arabidopsis thaliana]
          Length = 727

 Score =  771 bits (1990), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/716 (52%), Positives = 492/716 (68%), Gaps = 24/716 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27  AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PG+ FRT N
Sbjct: 87  SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDN 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+D+MK+E LF +QGGPIIL+Q+ENEYG ++W  G  G+ Y KW A+
Sbjct: 147 EPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAE 206

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L+T VPW+M +QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF  FG A
Sbjct: 207 MALGLSTGVPWIMSKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +P RPVED+AF+VARF + GG+F NYYMY+GGTNF RTA G  +ATSYDYDAPIDEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFMNYYMYYGGTNFDRTA-GVFIATSYDYDAPIDEYGLL 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R+PK+ HL+ELHK IKLCE  L+S DPT   LG K E H++ KS   CAAFL+NYD+SS 
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKTSCAAFLSNYDTSSA 384

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F G  Y LP WSVSILPDCK   +NTAK+ +       P    K    ++  S+ F
Sbjct: 385 ARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MIPTSTKF 433

Query: 423 SWYEEKVGISGNR---SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW     G   +    +FV+  L EQI+ T+D +DY WY   I +   +     G    L
Sbjct: 434 SWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLL 493

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VFVN  L    YG    +    ++ I+L+ GIN L +LS  VGL N G  
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVH 553

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           ++    G+   V L  + +G  D+S  +W Y++G+ GE + L  ++ +++  W     + 
Sbjct: 554 YETWNTGILGPVTLKGVNSGTWDMSKWKWSYKIGLRGEAMSLHTLAGSSAVKWWIKGFVV 613

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK++F  P G  PLAL++ +MGKGQ WVNG +IGR+W AY A   G   +C+Y
Sbjct: 614 KKQPLTWYKSSFDTPRGNEPLALDMNTMGKGQVWVNGHNIGRHWPAYTA--RGNCGRCNY 671

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            G Y+  KC  HCG+P+Q  YH+PR+W+ P  NLLVI EE GGDPS ISL+ +T +
Sbjct: 672 AGIYNEKKCLSHCGEPSQRWYHVPRSWLKPFGNLLVIFEEWGGDPSGISLVKRTAK 727


>gi|108707234|gb|ABF95029.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|108707235|gb|ABF95030.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
          Length = 702

 Score =  771 bits (1990), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/709 (54%), Positives = 482/709 (67%), Gaps = 24/709 (3%)

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
           M+RF  K++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++WAA  AV+L+
Sbjct: 1   MQRFTEKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAAGKAYMRWAAGMAVSLD 60

Query: 189 TSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPV 248
           T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS SKP MWTEN+SGWFLSFG AVP+RP 
Sbjct: 61  TGVPWVMCQQSDAPDPLINTCNGFYCDQFTPNSKSKPKMWTENWSGWFLSFGGAVPYRPA 120

Query: 249 EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWG 308
           EDLAFAVARF++ GGTFQNYYMY GGTNFGR+ GGP +ATSYDYDAPIDEYG +RQPKWG
Sbjct: 121 EDLAFAVARFYQRGGTFQNYYMYHGGTNFGRSTGGPFIATSYDYDAPIDEYGMVRQPKWG 180

Query: 309 HLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTF 367
           HLR++HKAIKLCE  LI+++P++  LG   EA +Y  + N  CAAFLAN D+ SD  V F
Sbjct: 181 HLRDVHKAIKLCEPALIAAEPSYSSLGQNTEATVYQTADNSICAAFLANVDAQSDKTVKF 240

Query: 368 NGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNELLLASSA 421
           NGN Y LPAWSVSILPDCKNVV NTA++ SQ      R+ G        ++    LA++ 
Sbjct: 241 NGNTYKLPAWSVSILPDCKNVVLNTAQINSQVTTSEMRSLGSSIQDTDDSLITPELATAG 300

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN------ 475
           +S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V   +G E +LN      
Sbjct: 301 WSYAIEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVV---KGDEPYLNGSQSNL 357

Query: 476 -IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + SLGH   +++N KL     G+   +   +   + L  G N +D+LS  VGL NYGA+
Sbjct: 358 LVNSLGHVLQIYINGKLAGSAKGSASSSLISLQTPVTLVPGKNKIDLLSTTVGLSNYGAF 417

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           FD+ GAG+   + +   NG  +LSS +W YQ+G+ GE + L   S A S  W   +  P 
Sbjct: 418 FDLVGAGVTGPVKLSGPNGALNLSSTDWTYQIGLRGEDLHLYNPSEA-SPEWVSDNAYPT 476

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           N+ LIWYKT F AP G  P+A++   MGKG+AWVNGQSIGRYW   LAP +GC   C+YR
Sbjct: 477 NQPLIWYKTKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYR 536

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           G+Y ++KC K CGQP+QTLYH+PR+++ PG N LV+ E+ GGDPS IS  T+    IC+ 
Sbjct: 537 GAYSSNKCLKKCGQPSQTLYHVPRSFLQPGSNDLVLFEQFGGDPSMISFTTRQTSSICAH 596

Query: 715 VSEADPPPVDSW-KPNLGVVSSSPQVRLACER-GWHIAAINFASYGIPEGNCGSFRPGAC 772
           VSE  P  +DSW  P     +  P +RL C R G  I+ I FAS+G P G CG++  G C
Sbjct: 597 VSEMHPAQIDSWISPQQTSQTQGPALRLECPREGQVISNIKFASFGTPSGTCGNYNHGEC 656

Query: 773 -HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
                L +VQ+ACVG   CS+PVSS   G     C G+ K+L VEA CS
Sbjct: 657 SSSQALAVVQEACVGMTNCSVPVSSNNFG---DPCSGVTKSLVVEAACS 702


>gi|356502277|ref|XP_003519946.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 835

 Score =  770 bits (1989), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/831 (47%), Positives = 515/831 (61%), Gaps = 47/831 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V+YD RA+ IDGKR++L SGSIHYPRST E+WP LI KSKEGGL+VIETYVFWN HEP 
Sbjct: 26  DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHEPH 85

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F G  DLVRF+KT+Q  GL+  LRIGPY CAEWNYGGFPVWLH IP I+FRT N 
Sbjct: 86  PGQYDFSGNLDLVRFIKTIQNQGLYAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTNNA 145

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            F++EMK+F   I+D+M+ E LFASQGGPIILAQ+ENEYGN+  +YG  G+ YV+W A  
Sbjct: 146 IFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCAQL 205

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A +    VPW+MCQQ DAPDP+INTCNGFYCD + PNS +KP MWTE+++GWF+ +G   
Sbjct: 206 AQSYQIGVPWIMCQQSDAPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWFMHWGGPT 265

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P R  ED+AFAV RFF+ GGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP++EYG + 
Sbjct: 266 PHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYGDLN 325

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHL+ LH+ +K  E  L      +   G ++ A I+   +     FL N   S DA
Sbjct: 326 QPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIF-SYAGQSVCFLGNAHPSMDA 384

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
           N+ F    Y +PAWSVSILPDC   V+NTAKV     N         N N   L    + 
Sbjct: 385 NINFQNTQYTIPAWSVSILPDCYTEVYNTAKV-----NAQTSIMTINNENSYAL---DWQ 436

Query: 424 WYEE------KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVF 473
           W  E      K G + G+ +   P L +Q     DTSDYLWY  S+ V  G      ++ 
Sbjct: 437 WMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPILSHDLK 495

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           + + + GH   VFVN   +   Y  +    F     I+L  G N + ++S  VGL NYGA
Sbjct: 496 IRVNTKGHVLHVFVNGAHIGSQYATYGKYTFTFEADIKLKLGKNEISLVSGTVGLPNYGA 555

Query: 534 WFDVAGAGLFSVILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           +FD    G+  V L+   +G    +D+S+  W Y+VG+ GE + L   S +   ++  G 
Sbjct: 556 YFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPSRSTEEWFTNG- 614

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            L  +K  +WYKTTF  P G   + L+L  +GKGQAWVNG +IGRYW +YLA   GC+  
Sbjct: 615 -LQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSST 673

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISLLTKTGQ 709
           CDYRG+Y ++KC  +CG P Q  YH+P +++  G +N LV+ EE GG+P ++ + T T  
Sbjct: 674 CDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVTIA 733

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
             C+   E                    ++ LAC+    I+ I FAS+G+PEG CGSF+ 
Sbjct: 734 KACAKAYEGH------------------ELELACKENQVISEIKFASFGVPEGECGSFKK 775

Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G C   D L IV++ C+G+ +CSI V+   LG +    P     LA++A C
Sbjct: 776 GHCESSDTLSIVKRLCLGKQQCSIQVNEKMLGPTGCRVPE--NRLAIDALC 824


>gi|414865884|tpg|DAA44441.1| TPA: hypothetical protein ZEAMMB73_968467 [Zea mays]
          Length = 641

 Score =  767 bits (1980), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/617 (60%), Positives = 457/617 (74%), Gaps = 18/617 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYDHRALVIDG RRVL SGSIHYPRSTP++WP LI+K+K+GGL+VIETYVFW+ HE
Sbjct: 27  AANVTYDHRALVIDGVRRVLVSGSIHYPRSTPDMWPGLIQKAKDGGLDVIETYVFWDIHE 86

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQY FEGR DL  FVKTV +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT 
Sbjct: 87  PVRGQYDFEGRKDLAAFVKTVADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRTD 146

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM+RF AK++D MK   L+ASQGGPIIL+Q+ENEYGN++ AYG  G+ Y++WAA
Sbjct: 147 NEPFKAEMQRFTAKVVDTMKGAGLYASQGGPIILSQIENEYGNIDSAYGAPGKAYMRWAA 206

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG 
Sbjct: 207 GMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCDQFTPNSAAKPKMWTENWSGWFLSFGG 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
           AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGTN  R++GGP +ATSYDYDAPIDEYG 
Sbjct: 267 AVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGTNLDRSSGGPFIATSYDYDAPIDEYGL 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR++HKAIKLCE  LI++DP++  LG  +EA +Y K  + CAAFLAN D  S
Sbjct: 327 VRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLGPNVEAAVY-KVGSVCAAFLANIDGQS 385

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE------L 415
           D  VTFNG +Y LPAWSVSILPDCKNVV NTA++ SQ    +  + +  NV         
Sbjct: 386 DKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQINSQTTGSEMRYLESSNVASDGSFVTP 445

Query: 416 LLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
            LA S +S+  E VGI+ + +  +  L EQINTT D SD+LWY+ SI V   +G E +LN
Sbjct: 446 ELAVSDWSYAIEPVGITKDNALTKAGLMEQINTTADASDFLWYSTSITV---KGDEPYLN 502

Query: 476 -------IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
                  + SLGH   V++N K+     G+   +     K IEL  G N +D+LS  VGL
Sbjct: 503 GSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSSLISWQKPIELVPGKNKIDLLSATVGL 562

Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
            NYGA+FD+ GAG+   + +   NG  DLSS EW YQ+G+ GE + L   S A S  W  
Sbjct: 563 SNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEWTYQIGLRGEDLHLYDPSEA-SPEWVS 621

Query: 589 GSTLPVNKSLIWYKTTF 605
            +  P+N  LIWYK + 
Sbjct: 622 ANAYPINHPLIWYKVSM 638


>gi|356502275|ref|XP_003519945.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 835

 Score =  766 bits (1977), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 395/831 (47%), Positives = 514/831 (61%), Gaps = 47/831 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V+YD RA+ IDGKR++L SGSIHYPRST E+WP LI KSKEGGL+VIETYVFWN HEP 
Sbjct: 26  DVSYDGRAITIDGKRKILFSGSIHYPRSTAEMWPSLIEKSKEGGLDVIETYVFWNVHEPH 85

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F G  DLVRF+KT+Q  GL   LRIGPY CAEWNYGGFPVWLH IP I+FRT N 
Sbjct: 86  PGQYDFSGNLDLVRFIKTIQNQGLHAVLRIGPYVCAEWNYGGFPVWLHNIPNIEFRTNNA 145

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            F++EMK+F   I+D+M+ E LFASQGGPIILAQ+ENEYGN+  +YG  G+ YV+W A  
Sbjct: 146 IFEDEMKKFTTLIVDMMRHEKLFASQGGPIILAQIENEYGNIMGSYGQNGKEYVQWCAQL 205

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A +    VPW+MCQQ D PDP+INTCNGFYCD + PNS +KP MWTE+++GWF+ +G   
Sbjct: 206 AQSYQIGVPWIMCQQSDTPDPLINTCNGFYCDQWHPNSNNKPKMWTEDWTGWFMHWGGPT 265

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P R  ED+AFAV RFF+ GGTFQNYYMY GGTNFGRT+GGP + TSYDYDAP++EYG + 
Sbjct: 266 PHRTAEDVAFAVGRFFQYGGTFQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLNEYGDLN 325

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHL+ LH+ +K  E  L      +   G ++ A I+   +     FL N   S DA
Sbjct: 326 QPKWGHLKRLHEVLKSVETTLTMGSSRNIDYGNQMTATIF-SYAGQSVCFLGNAHPSMDA 384

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
           N+ F    Y +PAWSVSILPDC   V+NTAKV     N         N N   L    + 
Sbjct: 385 NINFQNTQYTIPAWSVSILPDCYTEVYNTAKV-----NAQTSIMTINNENSYAL---DWQ 436

Query: 424 WYEE------KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVF 473
           W  E      K G + G+ +   P L +Q     DTSDYLWY  S+ V  G      ++ 
Sbjct: 437 WMPETHLEQMKDGKVLGSVAITAPRLLDQ-KVANDTSDYLWYITSVDVKQGDPILSHDLK 495

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           + + + GH   VFVN   +   Y  +    F     I+L  G N + ++S  VGL NYGA
Sbjct: 496 IRVNTKGHVLHVFVNGAHIGSQYATYGKYPFTFEADIKLKLGKNEISLVSGTVGLPNYGA 555

Query: 534 WFDVAGAGLFSVILIDLKNGK---RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           +FD    G+  V L+   +G    +D+S+  W Y+VG+ GE + L   S ++  ++  G 
Sbjct: 556 YFDNIHVGVTGVQLVSQNDGSEVTKDISTNVWHYKVGMHGENVKLYSPSRSSEEWFTNG- 614

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            L  +K  +WYKTTF  P G   + L+L  +GKGQAWVNG +IGRYW +YLA   GC+  
Sbjct: 615 -LQAHKIFMWYKTTFRTPVGTDSVVLDLKGLGKGQAWVNGNNIGRYWVSYLAGEDGCSST 673

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISLLTKTGQ 709
           CDYRG+Y ++KC  +CG P Q  YH+P +++  G +N LV+ EE GG+P ++ + T T  
Sbjct: 674 CDYRGTYRSNKCTTNCGNPTQRWYHVPDSFLRDGLDNTLVVFEEQGGNPFQVKIATVTIA 733

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
             C+   E                    ++ LAC+    I+ I FAS+G+PEG CGSF+ 
Sbjct: 734 KACAKAYEGH------------------ELELACKENQVISEIRFASFGVPEGECGSFKK 775

Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G C   D L IV++ C+G+ +CSI V+   LG +    P     LA++A C
Sbjct: 776 GHCESSDTLSIVKRLCLGKQQCSIHVNEKMLGPTGCRVPE--NRLAIDALC 824


>gi|357450109|ref|XP_003595331.1| Beta-galactosidase [Medicago truncatula]
 gi|355484379|gb|AES65582.1| Beta-galactosidase [Medicago truncatula]
          Length = 830

 Score =  765 bits (1976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/836 (48%), Positives = 538/836 (64%), Gaps = 45/836 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  V++D RA+ IDGKRRVL SGSIHYPRSTP++WP+LI+K+KEGGL+ IETYVFWN HE
Sbjct: 24  AVEVSHDGRAIKIDGKRRVLISGSIHYPRSTPQMWPDLIKKAKEGGLDAIETYVFWNAHE 83

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           PIR +Y F G  DL+RF+KT+Q+ GLF  LRIGPY CAEWNYGG PVW++ +PG++ RT 
Sbjct: 84  PIRREYDFSGNNDLIRFLKTIQDEGLFAVLRIGPYVCAEWNYGGIPVWVYNLPGVEIRTA 143

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  F  EM+ F   I+D++++E LFASQGGPIIL+Q+ENEYGNV  AYG  G+ Y+ W A
Sbjct: 144 NKVFMNEMQNFTTLIVDMVRKEKLFASQGGPIILSQIENEYGNVMSAYGDEGKAYINWCA 203

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A + N  VPW+MCQQ DAP P+INTCNG+YC  F PN+P+ P MWTEN+ GWF ++G 
Sbjct: 204 NMADSFNIGVPWIMCQQPDAPQPMINTCNGWYCHDFEPNNPNSPKMWTENWVGWFKNWGG 263

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P R  ED+A++VARFFETGGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG 
Sbjct: 264 KDPHRTAEDIAYSVARFFETGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 323

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAA-FLANYDSS 360
           I QPKWGHL+ELH  +K  E  L + + +   LG+ ++A +Y  ++ND ++ FL N +++
Sbjct: 324 IAQPKWGHLKELHLVLKSMENSLTNGNVSKIDLGSYVKATVY--ATNDSSSCFLTNTNTT 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +DA VTF GN Y +PAWSVSILPDC+   +NTAKV  Q +       +++N  E    + 
Sbjct: 382 TDATVTFKGNTYNVPAWSVSILPDCQTEEYNTAKVNVQTS----IMVKRENKAEDEPEAL 437

Query: 421 AFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLN 475
            + W  E V   + G  S  +  + +Q     D+SDYLWY   + +    P       L 
Sbjct: 438 KWVWRAENVHNSLIGKSSVSKNTIVDQKIAANDSSDYLWYMTRLDINQKDPVWTNNTILR 497

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I   GH    FVN + +   +  +   N      I+L  G N + +LS+ VGLQNYG  +
Sbjct: 498 INGTGHVIHAFVNGEHIGSHWATYGIHNDQFETNIKLKHGRNDISLLSVTVGLQNYGKEY 557

Query: 536 DVAGAGLFSVI-LIDLKNGK---RDLSSGEWIYQVGVEG---EYIGLDKISLANSSFWKQ 588
           D    GL S I LI  K  +   +DLSS +W Y+VG+ G   ++   D    A+SS W +
Sbjct: 558 DKWQDGLVSPIELIGTKGDETIIKDLSSHKWTYKVGLHGWENKFFSQDTF-FASSSKW-E 615

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
            + LP+NK L WYKTTF AP    P+ ++L  MGKG AWVNG S+GRYW +Y A   GC+
Sbjct: 616 SNELPINKMLTWYKTTFKAPLESDPIVVDLQGMGKGYAWVNGHSLGRYWPSYNADEDGCS 675

Query: 649 KK-CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
              CDYRG Y+ +KC  +CG+P+Q  YH+PR ++  G N LV+ EE+GG+PS+I+  T  
Sbjct: 676 DDPCDYRGEYNDTKCVSNCGKPSQRWYHVPRDFIEDGVNTLVLFEEIGGNPSQINFQTVI 735

Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
               C+   E                  +  + L+C  G  I+ I FAS+G P+G CG+F
Sbjct: 736 VGSACANAYE------------------NKTLELSC-HGRSISDIKFASFGNPQGTCGAF 776

Query: 768 RPGAC--HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
             G+C  + + L +VQKACVG+  CSI VS    G  A  C  ++K LAVEA C+I
Sbjct: 777 TKGSCESNNEALSLVQKACVGKESCSIDVSEKTFG--ATNCGNMVKRLAVEAVCAI 830


>gi|242064502|ref|XP_002453540.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
 gi|241933371|gb|EES06516.1| hypothetical protein SORBIDRAFT_04g007660 [Sorghum bicolor]
          Length = 740

 Score =  763 bits (1971), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/714 (53%), Positives = 486/714 (68%), Gaps = 34/714 (4%)

Query: 7   YDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQ 66
           YDHR+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HEP++GQ
Sbjct: 47  YDHRSLVINGRRRILISGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHEPVQGQ 106

Query: 67  YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
           Y+F  R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI+FRT N PFK
Sbjct: 107 YHFADRYDLVRFVKLVRQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGPFK 166

Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
             M++F+ KI+ +MK E LF  QGGPII+AQVENE+G +E   G G + Y  WAA  AV 
Sbjct: 167 AAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGAKPYAHWAAQMAVG 226

Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
            NT VPWVMC+Q+DAPDP+INTCNGFYCD FTPN   KP MWTE ++GWF  FG A+P R
Sbjct: 227 TNTGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNRKYKPTMWTEAWTGWFTKFGGALPHR 286

Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPK 306
           PVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G +RQPK
Sbjct: 287 PVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPK 346

Query: 307 WGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVT 366
           WGHLR+LH+AIK  E  LIS DPT Q +G   +A+I+   +  CAAFL+NY   +   + 
Sbjct: 347 WGHLRDLHRAIKQAEPALISGDPTIQSIGNYEKAYIFKSKNGACAAFLSNYHMKTAVKIR 406

Query: 367 FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW-- 424
           F+G  Y LPAWS+SILPDCK  VFNTA V         P    K +N +L     F+W  
Sbjct: 407 FDGRHYDLPAWSISILPDCKTAVFNTATV-------KEPTLLPK-MNPVL----HFAWQS 454

Query: 425 YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF--------LNI 476
           Y E      + +F R  L EQ++ T D SDYLWYT  + +    G E F        L +
Sbjct: 455 YSEDTNSLDDSAFTRNGLVEQLSLTWDKSDYLWYTTHVSI---GGNEQFLKSGQWPQLTV 511

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH+  VFVN +     YG +D      N  +++ +G N + ILS  VGL N G  F+
Sbjct: 512 YSAGHSMQVFVNGRSYGSVYGGYDNPKLTFNGHVKMWQGSNKISILSSAVGLPNNGNHFE 571

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK-QGSTLPV 594
           +   G+   V L  L  GKRDLS  +W YQVG++GE +GL  ++ +++  W   G   P 
Sbjct: 572 LWNVGVLGPVTLSGLNEGKRDLSHQKWTYQVGLKGESLGLHTVTGSSAVEWAGPGGKQP- 630

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L W+K  F AP G  P+AL++ SMGKGQ WVNG   GRYWS Y A S  C ++C Y 
Sbjct: 631 ---LTWHKALFNAPAGSDPVALDMGSMGKGQIWVNGHHAGRYWS-YRAYSGSC-RRCSYA 685

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL-GGDPSKISLLTKT 707
           G+Y   +C  +CG  +Q  YH+PR+W+ P  NLLV+ EE  GGD + ++L T+T
Sbjct: 686 GTYREDQCLSNCGDISQRWYHVPRSWLKPSGNLLVVLEEYGGGDLAGVTLATRT 739


>gi|357464797|ref|XP_003602680.1| Beta-galactosidase [Medicago truncatula]
 gi|355491728|gb|AES72931.1| Beta-galactosidase [Medicago truncatula]
          Length = 781

 Score =  763 bits (1969), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/727 (53%), Positives = 480/727 (66%), Gaps = 15/727 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + +NV+YD R+L+IDG+R++L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN H
Sbjct: 23  VGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTAKEGGIDVIETYVFWNGH 82

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           E   G YYF GRFDLV+F K VQ+AG++L LRIGP+  AEWN+GG PVWLH+IPG  FRT
Sbjct: 83  ELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNFGGVPVWLHYIPGTVFRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PF   M++F   I++LMK+E LFASQGGPIIL+Q+ENEYG  E  Y   G+ Y  WA
Sbjct: 143 YNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYENYYKEDGKKYALWA 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV+ NTSVPW+MCQQ DAPDP+I+TCN FYCD FTP SP +P MWTEN+ GWF +FG
Sbjct: 203 AKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKMWTENWPGWFKTFG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P RPVED+AF+VARFF+ GG+  NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 263 GRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKWGHL+ELHKAIKLCE  L+     +  LG  +EA IY  SS  CAAF++N D  
Sbjct: 323 LPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSSGACAAFISNVDDK 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQQKNVNELL 416
           +D  V F    Y LPAWSVSILPDCKNVVFNTAKV S  N      +H   QQ +  +  
Sbjct: 383 NDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPEH--LQQSDKGQKT 440

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
           L    F   +E  GI G   FV+    + INTTKDT+DYLW+T SI +   +     G +
Sbjct: 441 LKWDVF---KENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILIDANEEFLKKGSK 497

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L IES GH    FVN+K    G GN   + F     I L  G N + ILS+ VGLQ  
Sbjct: 498 PALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIAILSLTVGLQTA 557

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           G ++D  GAG+ SV +I L N   DLSS  W Y++GV GE++ + +    NS  W   S 
Sbjct: 558 GPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGEGMNSVKWTSTSE 617

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-PSTGCTKK 650
            P  ++L WYK    AP G  P+ L++  MGKG AW+NG+ IGRYW          C ++
Sbjct: 618 PPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWPRISEFKKEDCVQE 677

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDYRG ++  KC   CG+P+Q  YH+PR+W  P  N+LVI EE GGDP+KI+ +      
Sbjct: 678 CDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGDPTKITFVRHCHNP 737

Query: 711 ICSFVSE 717
             S V E
Sbjct: 738 YSSIVVE 744


>gi|449452767|ref|XP_004144130.1| PREDICTED: beta-galactosidase 15-like [Cucumis sativus]
          Length = 827

 Score =  763 bits (1969), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/831 (46%), Positives = 526/831 (63%), Gaps = 39/831 (4%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V+Y +R + IDG+ ++  SGSIHYPRSTP++WP+LI+KSKEGGL+ IETYVFWN HE
Sbjct: 23  STQVSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQ-FRT 120
           P+R QY F    DLVRF+KT+Q  GL+  LRIGPY CAEWNYGGFPVWLH +PGI+  RT
Sbjct: 83  PVRRQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
           TN  F  EM+ F   I+D+MKQENLFASQGGPIILAQ+ENEYGNV  +YG  G+ YV W 
Sbjct: 143 TNPVFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWC 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A + N  VPW+MCQQ+DAP+P INTCNG+YCD FTPN+   P MWTEN++GWF S+G
Sbjct: 203 ANMADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P R  EDLAF+VARFF+ GGTFQNYYMY GGTNF R AGGP + T+YDY+AP+DEYG
Sbjct: 263 GRDPVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + QPK+GHL++LH A+K  E+ L+S + T   L   +    Y       + F +N + +
Sbjct: 323 NLNQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINET 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +DA V + G  + +PAWSVSILPDC+  V+NTAKV +Q +       + +N  E+L    
Sbjct: 382 TDALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVL---- 437

Query: 421 AFSWYEEKVGIS---GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFL 474
            + W  E +  +   G        L +Q +   D SDYLWY  S+++    P    E+ L
Sbjct: 438 EWMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSNEMTL 497

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I   GH    FVN + +   + ++D  N++  ++++L  G N + +LS  +GL+NYGA 
Sbjct: 498 RINVSGHIVHAFVNGEHIGSQWASYDVYNYIFEQEVKLKPGKNIISLLSATIGLKNYGAQ 557

Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           +D+  +G+   + +  ++G     +DLS+ +W Y+VG+ G    L       ++ W+ G+
Sbjct: 558 YDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQSGN 617

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            LPVN+ + WYKTTF  P G  P+ L+L  +GKG AWVNG SIGRYW +++A      + 
Sbjct: 618 -LPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEP 676

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDYRGSY  +KC + CG+P Q  YH+PR+W++ G+N LV+ EE GG+PS ++  T   + 
Sbjct: 677 CDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEK 736

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            C    E                     + L+C+ G  I  I FAS+G P G+CG+F  G
Sbjct: 737 ACGHAYE------------------KKSLELSCQ-GKEITGIKFASFGDPTGSCGNFSKG 777

Query: 771 ACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           +C    D + IV+  C+G+  C I +S    G +  A  G++K LAVEA C
Sbjct: 778 SCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCAL-GVVKRLAVEAVC 827


>gi|125581329|gb|EAZ22260.1| hypothetical protein OsJ_05915 [Oryza sativa Japonica Group]
          Length = 754

 Score =  763 bits (1969), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/703 (52%), Positives = 481/703 (68%), Gaps = 26/703 (3%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YD R+LVI+G+RR+L SGSIHYPRSTPE+WP LI+K+K+GGL+VI+TYVFWN HE
Sbjct: 35  NAAVSYDRRSLVINGRRRILLSGSIHYPRSTPEMWPGLIQKAKDGGLDVIQTYVFWNGHE 94

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQYYF  R+DLVRFVK V++AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT 
Sbjct: 95  PVQGQYYFSDRYDLVRFVKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVSFRTD 154

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK EM++F+ KI+ +MK E LF  QGGPII++QVENE+G +E   G G + Y  WAA
Sbjct: 155 NGPFKAEMQKFVEKIVSMMKSEGLFEWQGGPIIMSQVENEFGPMESVGGSGAKPYANWAA 214

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV  NT VPWVMC+Q+DAPDP+INTCNGFYCD F+PN   KP MWTE ++GWF SFG 
Sbjct: 215 KMAVGTNTGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGG 274

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G 
Sbjct: 275 GVPHRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGL 334

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPKWGHLR+LH+AIK  E  L+S+DPT + +G+  +A+++   +  CAAFL+NY  ++
Sbjct: 335 LRQPKWGHLRDLHRAIKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNT 394

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V FNG  Y LPAWS+SILPDCK  VFNTA V            ++  +   +     
Sbjct: 395 AVKVRFNGQQYNLPAWSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVR 442

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNI 476
           F+W  Y E      + +F +  L EQ++ T D SDYLWYT  +++       G+   L +
Sbjct: 443 FAWQSYSEDTNSLSDSAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTV 502

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH+  VFVN K     YG +D      N ++++ +G N + ILS  VGL N G  F+
Sbjct: 503 YSAGHSMQVFVNGKSYGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFE 562

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPV 594
               G+   V L  L  G +DLS  +W YQVG++GE +GL  ++ +++  W   G   P 
Sbjct: 563 NWNVGVLGPVTLSSLNGGTKDLSHQKWTYQVGLKGETLGLQTVTGSSAVEWGGPGGYQP- 621

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L W+K  F AP G  P+AL++ SMGKGQ WVNG  +GRYWS     S GC   C Y 
Sbjct: 622 ---LTWHKAFFNAPAGNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYA 675

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
           G+Y   KC+ +CG  +Q  YH+PR+W+ PG NLLV+ EE G +
Sbjct: 676 GTYHEDKCRSNCGDLSQRWYHVPRSWLKPGGNLLVVLEEYGAN 718


>gi|449529387|ref|XP_004171681.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Cucumis
           sativus]
          Length = 827

 Score =  762 bits (1968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/831 (46%), Positives = 526/831 (63%), Gaps = 39/831 (4%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V+Y +R + IDG+ ++  SGSIHYPRSTP++WP+LI+KSKEGGL+ IETYVFWN HE
Sbjct: 23  STQVSYTNRGITIDGQPKIFLSGSIHYPRSTPQMWPDLIKKSKEGGLDTIETYVFWNAHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQ-FRT 120
           P+R QY F    DLVRF+KT+Q  GL+  LRIGPY CAEWNYGGFPVWLH +PGI+  RT
Sbjct: 83  PVRRQYDFSANLDLVRFIKTIQNEGLYAVLRIGPYVCAEWNYGGFPVWLHNLPGIEELRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
           TN  F  EM+ F   I+D+MKQENLFASQGGPIILAQ+ENEYGNV  +YG  G+ YV W 
Sbjct: 143 TNPVFMNEMQNFTTLIVDMMKQENLFASQGGPIILAQIENEYGNVMTSYGDAGKAYVNWC 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A + N  VPW+MCQQ+DAP+P INTCNG+YCD FTPN+   P MWTEN++GWF S+G
Sbjct: 203 ANMADSQNVGVPWIMCQQDDAPEPTINTCNGWYCDQFTPNNAKSPKMWTENWTGWFKSWG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P R  EDLAF+VARFF+ GGTFQNYYMY GGTNF R AGGP + T+YDY+AP+DEYG
Sbjct: 263 GRDPVRTPEDLAFSVARFFQLGGTFQNYYMYHGGTNFDRMAGGPYITTTYDYNAPLDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + QPK+GHL++LH A+K  E+ L+S + T   L   +    Y       + F +N + +
Sbjct: 323 NLNQPKFGHLKQLHAALKSIEKALVSGNVTTTDLTDSVSITEYATDKGK-SCFFSNINET 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +DA V + G  + +PAWSVSILPDC+  V+NTAKV +Q +       + +N  E+L    
Sbjct: 382 TDALVNYLGKDFNVPAWSVSILPDCQEEVYNTAKVNTQTSVMVKKENKAENEPEVL---- 437

Query: 421 AFSWYEEKVGIS---GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFL 474
            + W  E +  +   G        L +Q +   D SDYLWY  S+++    P    E+ L
Sbjct: 438 EWMWRPENIDNTARLGKGQVTANKLIDQKDAANDASDYLWYMTSVNLKKKDPIWSNEMTL 497

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I   GH    FVN + +   + ++D  N++  ++++L  G N + +LS  +GL+NYGA 
Sbjct: 498 RINVSGHIVHAFVNGEHIGSQWASYDVYNYIXEQEVKLKPGKNIISLLSATIGLKNYGAQ 557

Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           +D+  +G+   + +  ++G     +DLS+ +W Y+VG+ G    L       ++ W+ G+
Sbjct: 558 YDLIQSGIVGPVQLIGRHGDETIIKDLSNHKWSYEVGLHGFENRLFSPESRFATKWQSGN 617

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            LPVN+ + WYKTTF  P G  P+ L+L  +GKG AWVNG SIGRYW +++A      + 
Sbjct: 618 -LPVNRMMTWYKTTFKPPLGTDPVTLDLQGLGKGMAWVNGHSIGRYWPSFIAEDGCSDEP 676

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           CDYRGSY  +KC + CG+P Q  YH+PR+W++ G+N LV+ EE GG+PS ++  T   + 
Sbjct: 677 CDYRGSYTNTKCVRDCGKPTQQWYHVPRSWLNEGDNTLVLFEEFGGNPSLVNFKTIAMEK 736

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            C    E                     + L+C+ G  I  I FAS+G P G+CG+F  G
Sbjct: 737 ACGHAYE------------------KKSLELSCQ-GKEITGIKFASFGDPTGSCGNFSKG 777

Query: 771 ACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           +C    D + IV+  C+G+  C I +S    G +  A  G++K LAVEA C
Sbjct: 778 SCEGKNDAMKIVEDLCIGKESCVIDISEDTFGATNCAL-GVVKRLAVEAVC 827


>gi|255543793|ref|XP_002512959.1| beta-galactosidase, putative [Ricinus communis]
 gi|223547970|gb|EEF49462.1| beta-galactosidase, putative [Ricinus communis]
          Length = 732

 Score =  761 bits (1966), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/714 (52%), Positives = 487/714 (68%), Gaps = 18/714 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  NVTYD +AL+I+G++R+L SGSIHYPRSTP++W  LI+K+K+GGL+VI+TYVFWN H
Sbjct: 24  IECNVTYDKKALIINGQKRILFSGSIHYPRSTPQMWEGLIQKAKDGGLDVIDTYVFWNLH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y FEGR DLV+F+K V +AGL++HLRIGPY C EWN+GGFPVWL +IPG+ FRT
Sbjct: 84  EPSPGNYNFEGRNDLVQFIKLVHKAGLYVHLRIGPYICGEWNFGGFPVWLKYIPGMIFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK +M++F  KI+ +MK E L+ SQGGPIIL+Q+ENEY   + A+G  G  Y+ WA
Sbjct: 144 DNEPFKLQMQKFTQKIVQMMKDEQLYESQGGPIILSQIENEYEPEDKAFGAAGHAYMTWA 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV+LNT VPWVMC++ DAPDP++NTCNGFYCD F+PN   KP MWTE ++GWF  FG
Sbjct: 204 AHMAVSLNTGVPWVMCKEFDAPDPVVNTCNGFYCDYFSPNKAYKPTMWTEAWTGWFTDFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             +  RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 264 GPIHQRPVEDLAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPK+GHL++LHKAIKLCE  L+SSDP    LG+  +AH++  +S DCAAFLANY+  
Sbjct: 324 LIRQPKYGHLKDLHKAIKLCERALLSSDPVVTTLGSYEQAHVFSSNSGDCAAFLANYNPK 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           + A VTFN   Y LP WSVSILPDCKNVVFNTA+V  Q +       + + ++   L+  
Sbjct: 384 ATAKVTFNNMHYNLPPWSVSILPDCKNVVFNTAEVGVQPSKIQMLPTEARFLSWEALSED 443

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
             S  ++K+G           L EQIN T+D SDYLWYT  +H+   +     G+   L 
Sbjct: 444 ISSVDDDKIGTVAG-------LLEQINVTRDASDYLWYTTGVHISSSETFLDGGQPPILK 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAW 534
           + S GH   VFVN +L    YG         + ++ +L+ G N + +LS+ VGL N G  
Sbjct: 497 VISAGHGIHVFVNGQLSGSVYGTRGNRRISFSGELKQLHAGRNRISLLSVAVGLPNNGPR 556

Query: 535 FDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+    G+   ++I  L  G RDL+  +W Y+VG++GE + L   +   S  W Q S + 
Sbjct: 557 FETWNTGVLGPVVIHGLDQGHRDLTWQKWSYKVGLKGEDLNLGSPNSIPSINWMQESAMV 616

Query: 594 VNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
             +  L W++  F AP G  PLAL+++SM KGQ W+NG SIGRYW+ Y   + G    C 
Sbjct: 617 AERQPLTWHRAFFDAPRGDDPLALDMSSMVKGQVWINGNSIGRYWTVY---ADGNCTACS 673

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           Y G++  S CQ  CGQP Q  YHIPR+ + P ENLLV+ EE+GGD SKI L+ +
Sbjct: 674 YSGTFRPSTCQFGCGQPTQKWYHIPRSLLKPTENLLVVFEEIGGDVSKIYLVKR 727


>gi|449485873|ref|XP_004157296.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
           sativus]
          Length = 813

 Score =  759 bits (1960), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/829 (46%), Positives = 531/829 (64%), Gaps = 40/829 (4%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YD  A++I+G+RRV+ SGS+HYPRST  +WP+LI+K+K+GGL+ IETY+FW+ HEP 
Sbjct: 11  NVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 70

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           R +Y F GR D ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQFRT N 
Sbjct: 71  RRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQFRTDNQ 130

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +K EM+ F  KI+++ KQ NLFASQGGPIILAQ+ENEYGNV   YG  G+ Y+ W A  
Sbjct: 131 VYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCAQM 190

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNSPSKPIMWTENYSGWFLSFGYA 242
           A +LN  +PW+MCQQ DAP PIINTCNGFYCD  F+PN+P  P M+TEN+ GWF  +G  
Sbjct: 191 AESLNIGIPWIMCQQSDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGWFKKWGDK 250

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P+R  ED+AFAVARFF++GG F NYYMY GGTNFGRTAGGP + TSYDY+AP+DEYG +
Sbjct: 251 DPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDEYGNL 310

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDSSS 361
            QPKWGHL++LH +IK+ E+ L +S  + QKL + +      + +S +   FL+N D+ +
Sbjct: 311 NQPKWGHLKQLHASIKMGEKILTNSTRSDQKLXSFVTLTKFSNPTSGERFCFLSNTDNKN 370

Query: 362 DANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           DA +    +  YF+PAWSVSIL  C   VFNTAK+ SQ +     F + +N  E    ++
Sbjct: 371 DATIDLQADGKYFVPAWSVSILDGCNKEVFNTAKINSQTS----MFVKVQNKKE----NA 422

Query: 421 AFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLN 475
            FSW       +  + G  +F    L EQ  TT D SDYLWY  +I        + V L 
Sbjct: 423 QFSWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSLQNVTLQ 482

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + + GH    FVN++ +   + ++   +F+  K I +  G NT+ +LS  VGL+NY A++
Sbjct: 483 VNTKGHMLHAFVNRRYIGSQWRSNG-QSFVFXKPILIKPGTNTITLLSATVGLKNYDAFY 541

Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           D    G+    + LI   N K DLSS  W Y+VG+ GE   L     +  + W   +   
Sbjct: 542 DTVPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTNWSTINQKS 601

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           + + +  YKT F  P G  P+ L++  MGKGQAWVNGQSIGR+W +++A +  C+  CDY
Sbjct: 602 IGRRMTLYKTNFKTPSGIDPVTLDMQGMGKGQAWVNGQSIGRFWPSFIAGNDSCSTTCDY 661

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
           RG+Y+ SKC ++CG P+Q  YHIPR+++    N LV+ EE+GG+P ++S+ T T   IC 
Sbjct: 662 RGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITIGTICG 721

Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
             +E                     + L+C+ G  I+ I FASYG PEG CGSF+ G+ H
Sbjct: 722 NANEGS------------------TLELSCQGGHIISEIQFASYGNPEGKCGSFKQGSWH 763

Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
            ++   +V+K C+G   CSI VS+   G+  G    +   LA++A CSI
Sbjct: 764 VINSAILVEKLCIGMESCSIDVSAKSFGL--GDVTNISARLAIQALCSI 810


>gi|15241969|ref|NP_200498.1| beta-galactosidase 4 [Arabidopsis thaliana]
 gi|75265636|sp|Q9SCV8.1|BGAL4_ARATH RecName: Full=Beta-galactosidase 4; Short=Lactase 4; Flags:
           Precursor
 gi|6686880|emb|CAB64740.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|8809655|dbj|BAA97206.1| beta-galactosidase [Arabidopsis thaliana]
 gi|332009434|gb|AED96817.1| beta-galactosidase 4 [Arabidopsis thaliana]
          Length = 724

 Score =  758 bits (1957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/716 (52%), Positives = 495/716 (69%), Gaps = 25/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD +A++I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VIETYVFWN H
Sbjct: 25  VKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYF  R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT
Sbjct: 85  EPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK+F  KI+ +MK E LF +QGGPIILAQ+ENEYG VEW  G  G+ Y KW 
Sbjct: 145 DNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWV 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+  FG
Sbjct: 205 AQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RPVED+A++VARF + GG+  NYYMY GGTNF RTA G  +A+SYDYDAP+DEYG
Sbjct: 265 GAVPYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PK+ HL+ LHKAIKL E  L+S+D T   LGAK EA+++  S + CAAFL+N D +
Sbjct: 324 LPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKDEN 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F G  Y LP WSVSILPDCK  V+NTAKV       + P   +     ++   +
Sbjct: 383 SAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKV-------NAPSVHR----NMVPTGT 431

Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            FSW  + E    +    +F R  L EQI+ T D SDY WY   I +  G+     G   
Sbjct: 432 KFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSP 491

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA  VFVN +L    YG  D      ++KI+L+ G+N + +LS+ VGL N G
Sbjct: 492 LLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVG 551

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  + +G  D+S  +W Y++GV+GE + L   + ++   W QGS 
Sbjct: 552 THFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSF 611

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK+TF  P G  PLAL++ +MGKGQ W+NG++IGR+W AY A   G   +C
Sbjct: 612 VAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCGRC 669

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G++DA KC  +CG+ +Q  YH+PR+W+   +NL+V+ EELGGDP+ ISL+ +T
Sbjct: 670 NYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 724


>gi|15451018|gb|AAK96780.1| beta-galactosidase [Arabidopsis thaliana]
 gi|17978799|gb|AAL47393.1| beta-galactosidase [Arabidopsis thaliana]
          Length = 724

 Score =  758 bits (1957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/716 (52%), Positives = 495/716 (69%), Gaps = 25/716 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD +A++I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VIETYVFWN H
Sbjct: 25  VKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYF  R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT
Sbjct: 85  EPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  MK+F  KI+ +MK E LF +QGGPIILAQ+ENEYG VEW  G  G+ Y KW 
Sbjct: 145 DNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWV 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+  FG
Sbjct: 205 AQMALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RPVED+A++VARF + GG+  NYYMY GGTNF RTA G  +A+SYDYDAP+DEYG
Sbjct: 265 GAVPYRPVEDIAYSVARFIQKGGSLINYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PK+ HL+ LHKAIKL E  L+S+D T   LGAK EA+++  S + CAAFL+N D +
Sbjct: 324 LPREPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKDEN 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F G  Y LP WSVSILPDCK  V+NTAKV       + P   +     ++   +
Sbjct: 383 SAARVLFRGFPYDLPPWSVSILPDCKTEVYNTAKV-------NAPSVHR----NMVPTGT 431

Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            FSW  + E    +    +F R  L EQI+ T D SDY WY   I +  G+     G   
Sbjct: 432 KFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSP 491

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA  VFVN +L    YG  D      ++KI+L+ G+N + +LS+ VGL N G
Sbjct: 492 LLTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVG 551

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  + +G  D+S  +W Y++GV+GE + L   + ++   W QGS 
Sbjct: 552 THFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSF 611

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           +   + L WYK+TF  P G  PLAL++ +MGKGQ W+NG++IGR+W AY A   G   +C
Sbjct: 612 VAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCGRC 669

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Y G++DA KC  +CG+ +Q  YH+PR+W+   +NL+V+ EELGGDP+ ISL+ +T
Sbjct: 670 NYAGTFDAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 724


>gi|449529435|ref|XP_004171705.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
          Length = 826

 Score =  757 bits (1954), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/829 (45%), Positives = 531/829 (64%), Gaps = 36/829 (4%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  NV+YD  A++I+G+RR++ SGSIHYPRST E+WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 23  IGNNVSYDSNAIIINGERRIIFSGSIHYPRSTEEMWPDLIQKAKDGGLDAIETYIFWDRH 82

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP R +Y F G  + +++ + +QEAGL++ +RIGPY CAEWNYGGFP+WLH +PGIQ RT
Sbjct: 83  EPHRRKYDFSGHLNFIKYFQLIQEAGLYVVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N  +K EM+ F  KI+++ KQ NLFASQGGPIILAQ+ENEYGNV   YG  G+ Y+ W 
Sbjct: 143 NNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGEAGKTYINWC 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A +LN  +PW+MCQQ DAP PIINTCNGFYCD FTPN+P+ P M+TEN+ GWF  +G
Sbjct: 203 AQMAESLNIGIPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPNSPKMFTENWVGWFKKWG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P R  ED+AF+VARFF++GG   NYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG
Sbjct: 263 DKDPHRTAEDVAFSVARFFQSGGILNNYYMYHGGTNFGRTSGGPFITTSYDYDAPLDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDS 359
            + QPKWGHL++LH +IKL E+ L +S  + Q  G+ +      +  + +   FL+N D 
Sbjct: 323 NLNQPKWGHLKQLHASIKLGEKILTNSTRSDQDFGSSVTFTKFSNLETGEKFCFLSNADE 382

Query: 360 SSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           ++DA V   G+  YFLPAWSVSIL  C   +FNTAKV SQ +     F +++N  E   A
Sbjct: 383 NNDAIVDMLGDRKYFLPAWSVSILDGCNKEIFNTAKVSSQTS----LFFKKQNEKE--NA 436

Query: 419 SSAFSWYEE--KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLN 475
             +++W  E  +  + G  +F    L EQ   T D+SDYLWY  +++       + + L 
Sbjct: 437 KLSWNWASEPMRDTLQGYGTFKANLLLEQKGATIDSSDYLWYMTNVNSNTTSSLQNLTLQ 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + + GH    F+N++ +   +G++   +F+  K I+L  G NT+ +LS  VGL+NY A++
Sbjct: 497 VNTKGHVLHAFINRRYIGSQWGSNG-QSFVFEKPIQLKLGTNTITLLSATVGLKNYDAFY 555

Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           D    G+    + LI   N   DLSS  W Y+VG+ GE   L     +N + W   +   
Sbjct: 556 DTVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGLNGERKQLYNPMFSNRTKWSTLNKKS 615

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           + + + W+K TF  P G  P+ L++  MGKGQAWVNG+SIGR+W +++A +  C++ CDY
Sbjct: 616 IGRRMTWFKATFKTPSGTDPVVLDMQGMGKGQAWVNGRSIGRFWPSFIASNDSCSETCDY 675

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
           +GSY+ +KC ++CG  +Q  YHIPR++++   N L++ EE+GG+P  +S+ T T   IC 
Sbjct: 676 KGSYNPNKCVRNCGNSSQRWYHIPRSFMNDSINTLILFEEIGGNPQMVSVQTITIGTICG 735

Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
             +E                     + L+C+ G  I+ I FASYG PEG CGSF+ G   
Sbjct: 736 NANEGS------------------TLELSCQGGHVISEIQFASYGHPEGKCGSFQSGLWD 777

Query: 774 M--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +      IV+KAC+G   CSI +S     +S  A P     LAV+A CS
Sbjct: 778 VTKSTTIIVEKACIGMKNCSIDISPNLFKLSKVAYP--YAKLAVQALCS 824


>gi|449436000|ref|XP_004135782.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
          Length = 838

 Score =  756 bits (1952), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/830 (46%), Positives = 532/830 (64%), Gaps = 42/830 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YD  A++I+G+RRV+ SGS+HYPRST  +WP+LI+K+K+GGL+ IETY+FW+ HEP 
Sbjct: 36  NVSYDSNAIIINGERRVILSGSMHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRHEPQ 95

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           R +Y F GR D ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQFRT N 
Sbjct: 96  RRKYDFTGRLDFIKFFQLVQDAGLYVVMRIGPYVCAEWNYGGFPLWLHNLPGIQFRTDNQ 155

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +K EM+ F  KI+++ KQ NLFASQGGPIILAQ+ENEYGNV   YG  G+ Y+ W A  
Sbjct: 156 VYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKSYINWCAQM 215

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNSPSKPIMWTENYSGWFLSFGYA 242
           A +LN  +PW+MCQQ DAP PIINTCNGFYCD  F+PN+P  P M+TEN+ GWF  +G  
Sbjct: 216 AESLNIGIPWIMCQQNDAPQPIINTCNGFYCDYDFSPNNPKSPKMFTENWVGWFKKWGDK 275

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P+R  ED+AFAVARFF++GG F NYYMY GGTNFGRTAGGP + TSYDY+AP+DEYG +
Sbjct: 276 DPYRSPEDVAFAVARFFQSGGVFNNYYMYHGGTNFGRTAGGPFITTSYDYNAPLDEYGNL 335

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDSSS 361
            QPKWGHL++LH +IK+ E+ L +S  + QK+ + +      + +S +   FL+N D+ +
Sbjct: 336 NQPKWGHLKQLHASIKMGEKILTNSTRSDQKISSFVTLTKFSNPTSGERFCFLSNTDNKN 395

Query: 362 DANVTFNGN-VYF--LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           DA +    +  YF  +PAWSVSIL  C   VFNTAK+ SQ +     F + +N  E    
Sbjct: 396 DATIDLQADGKYFVPVPAWSVSILDGCNKEVFNTAKINSQTS----MFVKVQNKKE---- 447

Query: 419 SSAFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVF 473
           ++ FSW       +  + G  +F    L EQ  TT D SDYLWY  +I        + V 
Sbjct: 448 NAQFSWVWAPEPMRDTLQGKGTFKANLLLEQKGTTVDFSDYLWYMTNIDSNATSSLQNVT 507

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + + GH    FVN++ +   + ++   +F+  K I +  G NT+ +LS  VGL+NY A
Sbjct: 508 LQVNTKGHMLHAFVNRRYIGSQWRSNG-QSFVFEKPILIKPGTNTITLLSATVGLKNYDA 566

Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           ++D    G+    + LI   N K DLSS  W Y+VG+ GE   L     +  + W   + 
Sbjct: 567 FYDTVPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEMKQLYNPVFSQRTNWSTINQ 626

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
             + + + WYKT+F  P G   + L++  MGKGQAWVNGQSIGR+W +++A +  C+  C
Sbjct: 627 KSIGRRMTWYKTSFKTPSGIDRVTLDMQGMGKGQAWVNGQSIGRFWPSFIASNDSCSTTC 686

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           DYRG+Y+ SKC ++CG P+Q  YHIPR+++    N LV+ EE+GG+P ++S+ T T   I
Sbjct: 687 DYRGAYNPSKCVENCGNPSQRWYHIPRSFLSDDTNTLVLFEEIGGNPQQVSVQTITIGTI 746

Query: 712 CSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           C   +E                     + L+C+ G  I+ I FASYG PEG CGSF+ G+
Sbjct: 747 CGNANEGS------------------TLELSCQGGHIISEIQFASYGNPEGKCGSFKQGS 788

Query: 772 CH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            H ++   +V+K C+G+  CSI VS+   G+  G    L   LA++A CS
Sbjct: 789 WHVINSAILVEKLCIGRESCSIDVSAKSFGL--GDVTNLSARLAIQALCS 836


>gi|15242897|ref|NP_201186.1| beta-galactosidase 10 [Arabidopsis thaliana]
 gi|75171772|sp|Q9FN08.1|BGL10_ARATH RecName: Full=Beta-galactosidase 10; Short=Lactase 10; Flags:
           Precursor
 gi|10177669|dbj|BAB11029.1| beta-galactosidase [Arabidopsis thaliana]
 gi|20260438|gb|AAM13117.1| unknown protein [Arabidopsis thaliana]
 gi|34098797|gb|AAQ56781.1| At5g63810 [Arabidopsis thaliana]
 gi|332010417|gb|AED97800.1| beta-galactosidase 10 [Arabidopsis thaliana]
          Length = 741

 Score =  755 bits (1950), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/712 (51%), Positives = 474/712 (66%), Gaps = 17/712 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANV+YDHR+L I  +R+++ S +IHYPRS P +WP L++ +KEGG   IE+YVFWN HE
Sbjct: 29  AANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHE 88

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYF GR+++V+F+K VQ+AG+ + LRIGP+  AEWNYGG PVWLH++PG  FR  
Sbjct: 89  PSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRAD 148

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N P+K  M+ F   I++L+KQE LFA QGGPIIL+QVENEYG  E  YG GG+ Y +W+A
Sbjct: 149 NEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSA 208

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+ N  VPW+MCQQ DAP  +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG 
Sbjct: 209 SMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGG 268

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P RP ED+A++VARFF  GG+  NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG 
Sbjct: 269 RDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGL 328

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R PKWGHL++LHKAI L E  LIS +  +  LG  LEA +Y  SS  CAAFL+N D  +
Sbjct: 329 PRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKN 388

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  V F    Y LPAWSVSILPDCK  VFNTAKV S+        ++ + + E L +SS 
Sbjct: 389 DKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKS-------SKVEMLPEDLKSSSG 441

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
             W  + EK GI G   FV+ +L + INTTKDT+DYLWYT SI V   +     G    L
Sbjct: 442 LKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVL 501

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            IES GH   VF+NK+ +    GN     F + K + L  G N +D+LSM VGL N G++
Sbjct: 502 FIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSF 561

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           ++  GAGL SV +     G  +L++ +W Y++GVEGE++ L K   + +  W   +  P 
Sbjct: 562 YEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPK 621

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL---APSTGCTKKC 651
            + L WYK     P G  P+ L++ SMGKG AW+NG+ IGRYW       +P+  C K+C
Sbjct: 622 KQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKEC 681

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
           DYRG +   KC   CG+P+Q  YH+PR+W     N LVI EE GG+P KI L
Sbjct: 682 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKL 733


>gi|356522906|ref|XP_003530083.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 846

 Score =  755 bits (1950), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 398/839 (47%), Positives = 513/839 (61%), Gaps = 51/839 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD RAL IDGKRR+L SGSIHYPRSTPE+WP LIRK+KEGGL+VIETYVFWN HEP R
Sbjct: 28  VSYDERALTIDGKRRILFSGSIHYPRSTPEMWPYLIRKAKEGGLDVIETYVFWNAHEPQR 87

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F    DLVRF++T+Q+ GL+  +RIGPY  +EWNYGG PVWLH IP ++FRT N  
Sbjct: 88  RQYDFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGLPVWLHNIPNMEFRTHNRA 147

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F EEMK F  KI+D+M+ E LFA QGGPII+AQ+ENEYGNV  AYG  G  Y+KW A  A
Sbjct: 148 FMEEMKTFTRKIVDMMQDETLFAVQGGPIIIAQIENEYGNVMHAYGNNGTQYLKWCAQLA 207

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +  T VPWVM QQ +AP  +I++C+G+YCD F PN   KP +WTEN++G + ++G   P
Sbjct: 208 DSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPKIWTENWTGGYKNWGTQNP 267

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RP ED+A+AVARFF+ GGTFQNYYMY GGTNF RTAGGP V TSYDYDAP+DEYG + Q
Sbjct: 268 HRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYVTTSYDYDAPLDEYGNLNQ 327

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
           PKWGHLR+LH  +K  E  L      H   G  + A +Y +   + C  F+ N   S DA
Sbjct: 328 PKWGHLRQLHNLLKSKENILTQGSSQHTDYGNMVTATVYTYDGKSTC--FIGNAHQSKDA 385

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            + F  N Y +PAWSVSILP+C +  +NTAKV +Q           K  NE L  +  + 
Sbjct: 386 TINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTT------IMVKKDNEDLEYALRWQ 439

Query: 424 WYEE-----KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM----PGQGKEVF 473
           W +E     K G I+G      P L +Q   T D SDYLWY  SI +     P   KE  
Sbjct: 440 WRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLWYITSIDIKGDDDPSWTKEFR 499

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + + GH   VFVN K V   +  +    F+   KI+L  G N + +LS  VGL NYG 
Sbjct: 500 LRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIKLTTGKNEISLLSTTVGLPNYGP 559

Query: 534 WFDVAGAGLFSVILIDLKNGK---------RDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
           +FD    G+   + +    G          +DLS  +W Y+VG+ GE+      S  NS 
Sbjct: 560 FFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQWSYKVGLHGEHEM--HYSYENSL 617

Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
                  +P ++ L+WYKTTF +P G  P+ ++L+ +GKG AWVNG SIGRYWS+YLA  
Sbjct: 618 KTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGKGHAWVNGNSIGRYWSSYLADE 677

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISL 703
            GC+ KCDYRG Y ++KC   C QP+Q  YH+PR+++    +N LV+ EELGG P  ++ 
Sbjct: 678 NGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRDDDQNTLVLFEELGGQPYYVNF 737

Query: 704 LTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
           LT T   +C+   E +                   + LAC +   I+ I FAS+G+P+G 
Sbjct: 738 LTVTVGKVCANAYEGNT------------------LELACNKNQVISEIKFASFGLPKGE 779

Query: 764 CGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
           CGSF+ G C   + L  ++  C+G+ +CSI VS   LG +        + LAVEA C I
Sbjct: 780 CGSFQKGNCESSEALSAIKAQCIGKDKCSIQVSERALGPTRCRV-AEDRRLAVEAVCDI 837


>gi|255563853|ref|XP_002522927.1| beta-galactosidase, putative [Ricinus communis]
 gi|223537854|gb|EEF39470.1| beta-galactosidase, putative [Ricinus communis]
          Length = 803

 Score =  755 bits (1949), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/828 (48%), Positives = 508/828 (61%), Gaps = 63/828 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           N+TYD R+L+IDG+R++L S +IHYPRS P +WPEL++ +KEGG++VIETYVFWN HEP 
Sbjct: 28  NITYDSRSLIIDGQRKLLISAAIHYPRSVPGMWPELVQTAKEGGVDVIETYVFWNGHEPS 87

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
              YYFE R+DLV+FVK VQ+AG++L LRIGP+  AEWN+GG PVWLH++PG  FRT N 
Sbjct: 88  PSNYYFEKRYDLVKFVKIVQQAGMYLILRIGPFVAAEWNFGGVPVWLHYVPGTVFRTDNY 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            FK  M++F+  I++LMK+E LFASQGGPIILAQVENEYG  E AYG GG+ Y  WAA  
Sbjct: 148 NFKYHMQKFMTYIVNLMKKEKLFASQGGPIILAQVENEYGFYESAYGEGGKRYAMWAAQM 207

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+ N  VPW+MCQQ DAP+ +INTCN FYCD F P  P KP +WTEN+ GWF +FG   
Sbjct: 208 AVSQNIGVPWIMCQQFDAPNSVINTCNSFYCDQFKPIFPDKPKIWTENWPGWFQTFGAPN 267

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED+AF+VARFF+ GG+ QNYYMY GGTNFGRT+GGP + TSYDY+APIDEYG  R
Sbjct: 268 PHRPAEDIAFSVARFFQKGGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLAR 327

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
            PKW HL+ELHKAIKLCE  L++S P +  LG   EA +Y + S  CAAFLAN D  +D 
Sbjct: 328 LPKWAHLKELHKAIKLCELTLLNSVPVNLSLGPSQEADVYAEESGACAAFLANMDEKNDK 387

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V F    Y LPAWSVSILPDCKNVVFNTAKV SQ +  +      ++ ++    + A  
Sbjct: 388 TVVFRNMSYHLPAWSVSILPDCKNVVFNTAKVNSQTSIVEMVPDDLRSSDK---GTKALK 444

Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV------MPGQGKEVFLN 475
           W  + E  GI G    V+    + INTTKDT+DYLWYT SI V      +   G+ V L 
Sbjct: 445 WETFVENAGIWGTSDLVKNGFVDHINTTKDTTDYLWYTTSIFVGENEEFLKKGGRPVLL- 503

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           IES GHA   FVN++L     GN   + F   K + L  G N + +LSM VGLQN G+++
Sbjct: 504 IESKGHALHAFVNQELQGTASGNGTHSPFKFKKPVSLVAGKNDIALLSMTVGLQNAGSFY 563

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           +  GAGL SV +    NG  DLS+  W Y++G++GE +G+       +  W   S  P +
Sbjct: 564 EWVGAGLTSVKMKGFNNGTIDLSTFNWTYKIGLQGEKLGMYNGIAVETVNWVATSKPPKD 623

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK    A +      LN         W     +   W                  
Sbjct: 624 QPLTWYKRQIHARQ-----MLNW-------MWRINSEMILVW------------------ 653

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
                           T YH+PR+W  P  N+LVI EE GGDP+KI+   +    +C+ V
Sbjct: 654 ----------------TRYHVPRSWFKPSGNILVIFEEKGGDPTKITFSRRKISGVCALV 697

Query: 716 SEADPPPVDSWKPNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           +E  P        N G  SS+    V L C +   I+AI FAS+G P G CGS+  G CH
Sbjct: 698 AEDYPMANLESLENAGSGSSNYKASVHLKCPKSSIISAIKFASFGSPAGACGSYSEGECH 757

Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
               + +V+K C+ + +C + V+      S G CPG +K LAVEA CS
Sbjct: 758 DPKSISVVEKVCLNKNQCVVEVTEE--NFSKGLCPGKMKKLAVEAVCS 803


>gi|6686892|emb|CAB64746.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 741

 Score =  753 bits (1945), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/712 (51%), Positives = 473/712 (66%), Gaps = 17/712 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANV+YDHR+L I  +R+++ S +IHYPRS P +WP L++ +KEGG   IE+YVFWN HE
Sbjct: 29  AANVSYDHRSLTIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHE 88

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G+YYF GR+++V+F+K VQ+AG+ + LRIGP+  AEWNYGG PVWLH++PG  FR  
Sbjct: 89  PSPGKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRAD 148

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N P+K  M+ F   I++L+KQE LFA QGGPIIL+QVENEYG  E  YG GG+ Y +W+A
Sbjct: 149 NEPWKHYMESFTTYIVNLLKQEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSA 208

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+ N  VPW+MCQQ DAP  +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG 
Sbjct: 209 SMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGG 268

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P RP ED+A++VARFF  GG+  NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG 
Sbjct: 269 RDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGL 328

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R PKWGHL++LHKAI L E  LIS +  +  LG  LEA +Y  SS  CAAFL+N D  +
Sbjct: 329 PRLPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKN 388

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  V F    Y LPAWSVSILPDCK  VFNTAKV S+        ++ + + E L +SS 
Sbjct: 389 DKAVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKS-------SKVEMLPEDLKSSSG 441

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
             W  + EK GI G   FV+ +L + INTTKDT+DYLWYT SI V   +     G    L
Sbjct: 442 LKWEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVL 501

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            IES GH   VF+NK+ +    GN     F + K + L  G   +D+LSM VGL N G++
Sbjct: 502 FIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGETNIDLLSMTVGLANAGSF 561

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           ++  GAGL SV +     G  +L++ +W Y++GVEGE++ L K   + +  W   +  P 
Sbjct: 562 YEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPK 621

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL---APSTGCTKKC 651
            + L WYK     P G  P+ L++ SMGKG AW+NG+ IGRYW       +P+  C K+C
Sbjct: 622 KQPLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKEC 681

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
           DYRG +   KC   CG+P+Q  YH+PR+W     N LVI EE GG+P KI L
Sbjct: 682 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKL 733


>gi|356522904|ref|XP_003530082.1| PREDICTED: beta-galactosidase-like [Glycine max]
          Length = 923

 Score =  753 bits (1943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/839 (47%), Positives = 512/839 (61%), Gaps = 51/839 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD RAL IDGKRR+L S SIHYPRSTPE+WP LIRK+KEGGL+VIETYVFWN HEP R
Sbjct: 28  VSYDERALTIDGKRRILFSASIHYPRSTPEMWPYLIRKAKEGGLDVIETYVFWNAHEPQR 87

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F    DLVRF++T+Q+ GL+  +RIGPY  +EWNYGG PVWLH IP ++FRT N  
Sbjct: 88  RQYEFSENLDLVRFIRTIQKEGLYAMIRIGPYISSEWNYGGLPVWLHNIPNMEFRTHNRA 147

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F EEMK F  KI+D+M+ E LFA QGGPII+AQ+ENEYGNV  AYG  G  Y+KW A  A
Sbjct: 148 FMEEMKTFTTKIVDMMQDETLFAVQGGPIIIAQIENEYGNVMHAYGNNGTQYLKWCAQLA 207

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +  T VPWVM QQ +AP  +I++C+G+YCD F PN   KP +WTEN++G + ++G   P
Sbjct: 208 DSFETGVPWVMSQQSNAPQFMIDSCDGYYCDQFQPNDNHKPKIWTENWTGGYKNWGTQNP 267

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            RP ED+A+AVARFF+ GGTFQNYYMY GGTNF RTAGGP V TSYDYDAP+DEYG + Q
Sbjct: 268 HRPAEDVAYAVARFFQFGGTFQNYYMYHGGTNFKRTAGGPYVTTSYDYDAPLDEYGNLNQ 327

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
           PKWGHLR+LH  +K  E  L      +   G  + A +Y +   + C  F+ N   S DA
Sbjct: 328 PKWGHLRQLHNLLKSKENILTQGSSQNTDYGNMVTATVYTYDGKSTC--FIGNAHQSKDA 385

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            + F  N Y +PAWSVSILP+C +  +NTAKV +Q           K  NE L  +  + 
Sbjct: 386 TINFRNNEYTIPAWSVSILPNCSSEAYNTAKVNTQTT------IMVKKDNEDLEYALRWQ 439

Query: 424 WYEE-----KVG-ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM----PGQGKEVF 473
           W +E     K G I+G      P L +Q   T D SDYLWY  SI +     P   KE  
Sbjct: 440 WRQEPFVQMKDGQITGIIDLTAPKLLDQKVVTNDFSDYLWYITSIDIKGDDDPSWTKEFR 499

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + + GH   VFVN K V   +  +    F+   KI+L  G N + +LS  VGL NYG 
Sbjct: 500 LRVHTSGHVLHVFVNGKHVGTQHAKNGQFKFVHESKIKLTTGKNEISLLSTTVGLPNYGP 559

Query: 534 WFDVAGAGLFSVILIDLKNGK---------RDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
           +FD    G+   + +    G          +DLS  +W Y+VG+ GE+      S  NS 
Sbjct: 560 FFDNIEVGVLGPVQLVAAVGDYDYDDDEIVKDLSKNQWSYKVGLHGEHEM--HYSYENSL 617

Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
                  +P ++ L+WYKTTF +P G  P+ ++L+ +GKG AWVNG SIGRYWS+YLA  
Sbjct: 618 KTWYTDAVPTDRILVWYKTTFKSPIGDDPVVVDLSGLGKGHAWVNGNSIGRYWSSYLADE 677

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISL 703
            GC+ KCDYRG Y ++KC   C QP+Q  YH+PR+++    +N LV+ EELGG P  ++ 
Sbjct: 678 NGCSPKCDYRGPYTSNKCLSMCAQPSQRWYHVPRSFLRDNDQNTLVLFEELGGQPYYVNF 737

Query: 704 LTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
           LT T   +C+   E +                   + LAC +   I+ I FAS+G+P+G 
Sbjct: 738 LTVTVGKVCANAYEGN------------------TLELACNKNQVISEIKFASFGLPKGE 779

Query: 764 CGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
           CGSF+ G C   + L  ++  C+G+ +CSI VS   LG +        + LAVEA C I
Sbjct: 780 CGSFQKGNCESSEALSAIKAQCIGKDKCSIQVSERTLGPTRCRV-AEDRRLAVEAVCDI 837


>gi|297793967|ref|XP_002864868.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
 gi|297310703|gb|EFH41127.1| beta-galactosidase 10 [Arabidopsis lyrata subsp. lyrata]
          Length = 740

 Score =  753 bits (1943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/712 (51%), Positives = 475/712 (66%), Gaps = 17/712 (2%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANV+YDHR+L I  +R+++ S +IHYPRS P +WP L++ +KEGG   IE+YVFWN HE
Sbjct: 28  AANVSYDHRSLSIGNRRQLIISAAIHYPRSVPAMWPSLVQTAKEGGCNAIESYVFWNGHE 87

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P   +YYF GR+++V+F+K VQ+AG+ + LRIGP+  AEWNYGG PVWLH++PG  FR  
Sbjct: 88  PSPRKYYFGGRYNIVKFIKIVQQAGMHMILRIGPFVAAEWNYGGVPVWLHYVPGTVFRAD 147

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N P+K  M+ F   I++L+K+E LFA QGGPIIL+QVENEYG  E  YG GG+ Y +W+A
Sbjct: 148 NEPWKHYMESFTTYIVNLLKKEKLFAPQGGPIILSQVENEYGYYEKDYGEGGKRYAQWSA 207

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV+ N  VPW+MCQQ DAP  +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG 
Sbjct: 208 SMAVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGG 267

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P RP ED+A++VARFF  GG+  NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG 
Sbjct: 268 RDPHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGL 327

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R PKWGHL++LHKAI L E  LI+ +  +  LG  LEA +Y  SS  CAAFL+N D  +
Sbjct: 328 PRLPKWGHLKDLHKAIMLSENLLINGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKN 387

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  V F    Y LPAWSVSILPDCKN VFNTAKV S+       F++ + + E L +SS 
Sbjct: 388 DKTVMFRNTSYHLPAWSVSILPDCKNEVFNTAKVTSK-------FSKVEMLPEDLRSSSG 440

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
             W  + EK GI G   FV+ +L + INTTKDT+DYLWYT SI V   +     G    L
Sbjct: 441 LKWEVFSEKPGIWGEADFVKNELVDHINTTKDTTDYLWYTTSITVSTNEEFLKKGSPPVL 500

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            IES GH   VF+NK+ +    GN     F + K + L  G N +D+LSM VGL N G++
Sbjct: 501 FIESKGHTLHVFINKEYLGTATGNGTHVPFKLKKSVALKAGENNIDLLSMTVGLSNAGSF 560

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           ++  GAGL SV +     G  +L++ +W Y++GV+G ++ L K   + +  W   +  P 
Sbjct: 561 YEWVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVQGVHLELFKPGDSGAVKWTVTTKPPK 620

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW---SAYLAPSTGCTKKC 651
            + L WYK     P G  P+ L++ SMGKG AW+NG+ IGRYW   +    P+  C K+C
Sbjct: 621 KQPLTWYKVVIDPPSGSEPVGLDMMSMGKGMAWLNGEEIGRYWPRIARKSTPNDECVKEC 680

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
           DYRG +   KC   CG+P+Q  YH+PR+W     N LVI EE GGDP KI+L
Sbjct: 681 DYRGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGDPMKITL 732


>gi|1352075|sp|P49676.1|BGAL_BRAOL RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
 gi|669059|emb|CAA59162.1| beta-galactosidase [Brassica oleracea]
          Length = 828

 Score =  749 bits (1935), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/834 (45%), Positives = 522/834 (62%), Gaps = 45/834 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V++D RA+ IDG+RR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HE
Sbjct: 24  STIVSHDERAITIDGQRRILLSGSIHYPRSTSDMWPDLISKAKDGGLDTIETYVFWNAHE 83

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P R QY F G  DLVRF+KT+Q AGL+  LRIGPY CAEWNYGGFPVWLH +P ++FRT 
Sbjct: 84  PSRRQYDFSGNLDLVRFIKTIQSAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPDMKFRTI 143

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  F  EM+ F  KI+++MK+E+LFASQGGPIILAQ+ENEYGNV  +YG  G+ Y+ W A
Sbjct: 144 NPGFMNEMQNFTTKIVNMMKEESLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCA 203

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A +L+  VPW+MCQQ  AP P+I TCNGFYCD + P++PS P MWTEN++GWF ++G 
Sbjct: 204 NMANSLDIGVPWIMCQQPHAPQPMIETCNGFYCDQYKPSNPSSPKMWTENWTGWFKNWGG 263

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+R  EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDYDAP+DEYG 
Sbjct: 264 KHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYDAPLDEYGN 323

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           + QPKWGHL++LH  +K  E+ L   + +   LG  + A +Y  ++   + F+ N ++++
Sbjct: 324 LNQPKWGHLKQLHTLLKSMEKPLTYGNISTIDLGNSVTATVY-STNEKSSCFIGNVNATA 382

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ-----RNNGDHPFAQQKNVNELL 416
           DA V F G  Y +PAWSVS+LPDC    +NTA+V +Q      ++ D P        E L
Sbjct: 383 DALVNFKGKDYNVPAWSVSVLPDCDKEAYNTARVNTQTSIITEDSCDEP--------EKL 434

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVF 473
             +    +  +K  + G+   +   L +Q + T D SDYLWY   +H+    P   + + 
Sbjct: 435 KWTWRPEFTTQKTILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRVHLDKKDPIWSRNMS 494

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + S  H    +VN K V       +  ++   KK+ L  G N L +LS+ VGLQNYG 
Sbjct: 495 LRVHSNAHVLHAYVNGKYVGNQIVRDNKFDYRFEKKVNLVHGTNHLALLSVSVGLQNYGP 554

Query: 534 WFDVAGAGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
           +F+    G+   V L+  K     ++DLS  +W Y++G+ G    L  +  A     K  
Sbjct: 555 FFESGPTGINGPVKLVGYKGDETIEKDLSKHQWDYKIGLNGFNHKLFSMKSAGHHHRKWS 614

Query: 590 S-TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
           +  LP ++ L WYK  F AP GK P+ ++L  +GKG+ W+NGQSIGRYW ++ +   GCT
Sbjct: 615 TEKLPADRMLSWYKANFKAPLGKDPVIVDLNGLGKGEVWINGQSIGRYWPSFNSSDEGCT 674

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKT 707
           ++CDYRG Y + KC   CG+P Q  YH+PR++++  G N + + EE+GGDPS +   T  
Sbjct: 675 EECDYRGEYGSDKCAFMCGKPTQRWYHVPRSFLNDKGHNTITLFEEMGGDPSMVKFKTVV 734

Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
              +C+   E +                  +V L+C     I+A+ FAS+G P G CGSF
Sbjct: 735 TGRVCAKAHEHN------------------KVELSCNNR-PISAVKFASFGNPSGQCGSF 775

Query: 768 RPGACH--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G+C    D + +V K CVG++ C++ VSS   G S   C    K L VE  C
Sbjct: 776 AAGSCEGAKDAVKVVAKECVGKLNCTMNVSSHKFG-SNLDCGDSPKRLFVEVEC 828


>gi|297793199|ref|XP_002864484.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297310319|gb|EFH40743.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 726

 Score =  749 bits (1934), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/718 (51%), Positives = 491/718 (68%), Gaps = 27/718 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A+V+YD +A++I+G+RR+L SGSIHYPRSTPE+WP LI+K+KEGGL+VIETYVFWN H
Sbjct: 25  VKASVSYDRKAVIINGQRRILLSGSIHYPRSTPEMWPGLIQKAKEGGLDVIETYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYF  R+DLV+F+K V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT
Sbjct: 85  EPSPGQYYFGDRYDLVKFIKLVHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILA--QVENEYGNVEWAYGVGGELYVK 178
            N PFK  MK+F  KI+ +MK E LF +QGGPIILA  Q+ENEYG VEW  G  G+ Y K
Sbjct: 145 DNEPFKAAMKKFTEKIVWMMKAEKLFQTQGGPIILAQGQIENEYGPVEWEIGAPGKAYTK 204

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           W A  A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+  
Sbjct: 205 WVAQMALGLSTGVPWIMCKQEDAPSPIIDTCNGYYCEDFKPNSSNKPKMWTENWTGWYTE 264

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
           FG AVP+RPVED+A++VARF + GG+F NYYMY GGTNF RTA G  +A+SYDYDAP+DE
Sbjct: 265 FGGAVPYRPVEDIAYSVARFIQKGGSFVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDE 323

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
           YG  R+PK+ HL+ LHK IKL E  L+S+D T   LGAK EA+++  S + CAAFL+N D
Sbjct: 324 YGLPREPKYSHLKALHKVIKLSEPALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKD 382

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
            SS A V F G  Y LP WSVSILPDCK   +NTAKV       + P   +     ++  
Sbjct: 383 ESSAARVMFRGFPYVLPPWSVSILPDCKTEFYNTAKV-------NAPSVHR----NMVPT 431

Query: 419 SSAFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
            + FSW  + E    +    +F R  L EQI+ T D SDY WY   I +  G+     G 
Sbjct: 432 GARFSWGSFNEATPTANEAGTFARNGLVEQISMTWDKSDYFWYLTDITIGSGETFLKTGD 491

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
                + S GHA  VFVN +L    YG  D       +KI+L+ G+N L +LS+ VGL N
Sbjct: 492 FPLFTVMSAGHALHVFVNGQLSGTAYGGLDHPKLTFTQKIKLHAGVNKLALLSVAVGLPN 551

Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
            G  F+    G+   V L  + +G  D+S  +W Y++GV+GE + L   + ++   W QG
Sbjct: 552 VGTHFEQWNKGVLGPVTLKGVNSGTWDMSKWKWSYKIGVKGEALSLHTDTESSGVRWTQG 611

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           S +   + L WYK+TF  P G  PLAL++ +MGKGQ W+NG++IGR+W AY A   G   
Sbjct: 612 SFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCG 669

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +C+Y G+++A KC  +CG+ +Q  YH+PR+W+   +NL+V+ EE GGDP+ ISL+ +T
Sbjct: 670 RCNYAGTFNAKKCLSNCGEASQRWYHVPRSWLK-SQNLIVVFEEWGGDPNGISLVKRT 726


>gi|2209358|gb|AAB61470.1| beta-D-galactosidase [Mangifera indica]
          Length = 663

 Score =  749 bits (1933), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/644 (55%), Positives = 460/644 (71%), Gaps = 22/644 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A V+YDH+A++IDG+RR+L SGSIHYPRSTP++WP+LI+K+K+G ++VI+TYVFWN H
Sbjct: 30  VEATVSYDHKAIIIDGQRRILISGSIHYPRSTPQMWPDLIQKAKDG-VDVIQTYVFWNGH 88

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PGI+FRT
Sbjct: 89  EPSPGKYYFEDRYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIEFRT 148

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK E LF +QGGPIIL+Q+ENE+G VEW  G  G+ Y KWA
Sbjct: 149 DNEPFKAAMQKFTEKIVSMMKAEKLFETQGGPIILSQIENEFGPVEWEIGAPGKAYTKWA 208

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+Q+DAPDP+INTCNGFYC+ F PN  +KP MWTEN++GWF +FG
Sbjct: 209 AQMAVGLDTGVPWVMCKQDDAPDPVINTCNGFYCENFVPNQKNKPKMWTENWTGWFTAFG 268

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG
Sbjct: 269 GPTPQRPAEDVAFSVARFIQNGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYG 328

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +R+PKWGHLR+LHKAIKLCE  L+S+DPT   LG   E H+++  S  CAAFLANYD++
Sbjct: 329 LLREPKWGHLRDLHKAIKLCESALVSTDPTVTSLGNNQEVHVFNPKSGSCAAFLANYDTT 388

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F    Y LP WS+SILPDCK  VFNTA++ +Q +       Q   V       S
Sbjct: 389 SSAKVNFKIMQYELPPWSISILPDCKTAVFNTARLGAQSS-----LKQMTPV-------S 436

Query: 421 AFSW---YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
            FSW    EE    S +++F    L EQ+N T+D SDYLWY  +I++   +     G++ 
Sbjct: 437 TFSWQSYIEESASSSDDKTFTTDGLWEQLNVTRDASDYLWYMTNINIDSNEGFLKNGQDP 496

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L I S GHA  VF+N +L    YG  D      ++ +++  G+N L +LS+ VGLQN G
Sbjct: 497 LLTIWSAGHALHVFINGQLSGTVYGGVDNPKLTFSQNVKMRVGVNQLSLLSISVGLQNVG 556

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+    G+   V L  L  G RDLS  +W Y++G++GE + L  +S ++S  W +GS+
Sbjct: 557 THFEQWNTGVLGPVTLRGLNEGTRDLSKQQWSYKIGLKGEDLSLHTVSGSSSVEWVEGSS 616

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
           L   + L WYKTTF AP G  PLAL++++MGKG  W+N QSIGR
Sbjct: 617 LAQKQPLTWYKTTFNAPAGNEPLALDMSTMGKGLIWINSQSIGR 660


>gi|413926109|gb|AFW66041.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
          Length = 785

 Score =  746 bits (1927), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/762 (49%), Positives = 487/762 (63%), Gaps = 76/762 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YDHR+LVI+G+RR+L SGSIHYPRS PE+WP LI+K+K+GGL+V++TYVFWN HEP +
Sbjct: 40  VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQYYF  R+DLVRFVK V++AGL++HLR+GPY CAEWN+GGFPVWL ++PGI+FRT N P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F+ KI+ +MK E LF  QGGPII+AQVENE+G +E   G GG+ Y  WAA  A
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V  N  VPWVMC+Q+DAPDP+INTCNGFYCD FTPN+  KP MWTE ++GWF  FG A P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY----- 299
            RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+     
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339

Query: 300 --------------------------------------------GFIRQPKWGHLRELHK 315
                                                       G +RQPKWGHLR +H+
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399

Query: 316 AIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
           AIK  E  L+S DPT + +G   +A+++   +  CAAFL+NY   S   + F+G  Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459

Query: 376 AWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISG 433
           AWS+SILPDCK  VFNTA V            +   + ++      F+W  Y E      
Sbjct: 460 AWSISILPDCKTAVFNTATV-----------KEPTLLPKMSPVMHRFAWQSYSEDTNSLD 508

Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVN 488
           + +F R  L EQ++ T D SDYLWYT  +++   +     G+   L++ S GH+  VFVN
Sbjct: 509 DSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVN 568

Query: 489 KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VIL 547
            +     YG +D      +  +++ +G N + ILS  VGL N G  F++   G+   V L
Sbjct: 569 GRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTL 628

Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVNKSLIWYKTTF 605
             L  GKRDLS   WIYQVG++GE +GL  ++ +++  W    G T P    L W+K  F
Sbjct: 629 SGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGGGTQP----LTWHKALF 684

Query: 606 LAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKH 665
            AP G  P+AL++ SMGKGQ WVNG+  GRYWS Y A S GC  +C Y G+Y   +C  +
Sbjct: 685 NAPAGSDPVALDMGSMGKGQVWVNGRHAGRYWS-YRAHSRGC-GRCSYAGTYREDQCTSN 742

Query: 666 CGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           CG  +Q  YH+PR+W+ P  NLLV+ EE GGD + +SL T+T
Sbjct: 743 CGDLSQRWYHVPRSWLKPSGNLLVVLEEYGGDLAGVSLATRT 784


>gi|449476344|ref|XP_004154711.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
          Length = 803

 Score =  746 bits (1926), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/835 (45%), Positives = 526/835 (62%), Gaps = 49/835 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  NV+YD  A++I+G+RRV+ SGSIHYPRST  +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 1   MGDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRH 60

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP R +Y F G  + ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQ RT
Sbjct: 61  EPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 120

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N  +K EM  F  KI+++ KQ NLFASQGGPIILAQ+ENEYGNV   YG  G+ Y+ W 
Sbjct: 121 DNQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWC 180

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A +LN  VPW+MCQQ DAP PIINTCNGFYCD F+PN+P  P M+TEN+ GWF  +G
Sbjct: 181 AQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWG 240

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P+R  ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEYG
Sbjct: 241 DKDPYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYG 300

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYDS 359
            + QPKWGHL++LH +IKL E+ L +   +++  G+ +      + ++ +   FL+N D 
Sbjct: 301 NLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTLTKFSNPTTKERFCFLSNTDD 360

Query: 360 SSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           ++DA +    +  YF+PAWSVSI+  CK  VFNTAK+ SQ +     F + +N  E +  
Sbjct: 361 TNDATIDLQADGKYFVPAWSVSIIDGCKKEVFNTAKINSQTS----MFVKVQNEKENVKL 416

Query: 419 SSAFSWYEEKVG--ISGNRSFVRPDLAEQINTTKDTSDYLWY--------TASIHVMPGQ 468
           S  + W  E +   + G  +F    L EQ  TT D+SDYLWY        T+SIH     
Sbjct: 417 S--WVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTSSIH----- 469

Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
              V L + + GH    FVN + +   +GN+   +F+  K I L  G N + +LS  VGL
Sbjct: 470 --NVTLQVNTKGHVLHAFVNTRYIGSQWGNNG-QSFVFEKPILLKAGTNIITLLSATVGL 526

Query: 529 QNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
           +NY A++D    G+    + LI   N   +LSS  W Y+VG+ GE   L     +  + W
Sbjct: 527 KNYDAFYDTLPTGIDGGPIYLIGDGNVTTNLSSNLWSYKVGLNGEIKQLYNPVFSQETSW 586

Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
              +   + + + WYKT+F  P G  P+ L++  MGKG+AW+NGQSIGR+W +++A +  
Sbjct: 587 NTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWPSFIAGNDN 646

Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           C++ CDYRG+YD SKC  +CG P+Q  YHIPR+++    N LV+ EE+GG P ++S+ T 
Sbjct: 647 CSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSPQQVSVQTI 706

Query: 707 TGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
           T   IC   +E                     + L+C+  + I+ I FASYG P+G CGS
Sbjct: 707 TIGTICGNANEGS------------------TLELSCQGEYIISEIQFASYGNPKGKCGS 748

Query: 767 FRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           F+ G+  + +   +++K C     CS+ VS+   G+  G    L   L V+A CS
Sbjct: 749 FKQGSWDVTNSALLLEKTCKDMKSCSVDVSAKLFGL--GDAVNLSARLVVQALCS 801


>gi|357437609|ref|XP_003589080.1| Beta-galactosidase [Medicago truncatula]
 gi|355478128|gb|AES59331.1| Beta-galactosidase [Medicago truncatula]
          Length = 718

 Score =  746 bits (1925), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/714 (52%), Positives = 484/714 (67%), Gaps = 23/714 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+V+YDH+ALVIDG+RR+L SGSIHYPRSTPE+WP+L +K+K+GGL+VI+TYVFWN H
Sbjct: 21  VTASVSYDHKALVIDGQRRILISGSIHYPRSTPEMWPDLFQKAKDGGLDVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G Y  + R D V+  K  Q+A L +HLR+ P       + GFPVWL ++PG+ FRT
Sbjct: 81  EPSPGNYTLKDRLDWVKLSKLAQQAVLNVHLRMVP------TFVGFPVWLKYVPGMAFRT 134

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK E+LF +QGGPII++Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 135 DNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKAYTKWA 194

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPW MC+QEDAPDP+I+TCNG+YC+ FTPN   KP MWTEN+SGW+  FG
Sbjct: 195 AQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGWYTDFG 254

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A+  RP EDLA++VA F +  G+F NYYMY GGTNFGRT+ G  +ATSYDYDAPIDEYG
Sbjct: 255 GAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 314

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK-LEAHIYHKSSNDCAAFLANYDS 359
              +PKW HL+ LHKAIK CE  LIS DPT   LG K LEAH+Y+ +++ CAAFLANYD+
Sbjct: 315 LPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFLANYDT 374

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
            S A VTF    Y LP WSVSILPDCK VVFNTA V     NG H F ++    E     
Sbjct: 375 KSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATV-----NG-HSFHKRMTPVETTFDW 428

Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
            ++S  EE    S + S +   L EQIN T+D+SDYLWY   +++ P +     G+   L
Sbjct: 429 QSYS--EEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNGQFPTL 486

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GH   VFVN +L    YG  D      ++ + L  G N + +LS+ VGL N G  
Sbjct: 487 TINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLPNVGLH 546

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+    G+   V L  L  G RDLS  +W Y+VG++GE + L  I+ ++S  W QGS+L 
Sbjct: 547 FETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQGSSLA 606

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYKTTF AP G  P+AL+++SMGKG+ W+N QSIGR+W AY+A   G   +C+Y
Sbjct: 607 KKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIA--HGNCDECNY 664

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G++   KC+ +CG+P Q  YHIPR+W+    N+LV+ EE GGDP+ ISL+ +T
Sbjct: 665 AGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISLVKRT 718


>gi|326517964|dbj|BAK07234.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 616

 Score =  746 bits (1925), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/618 (58%), Positives = 442/618 (71%), Gaps = 12/618 (1%)

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QY FEGR DLVRFVK   +AGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+ RT N PF
Sbjct: 1   QYDFEGRNDLVRFVKAAADAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKLRTDNEPF 60

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K EM+RF  K++  MK   L+ASQGGPIIL+Q+ENEYGN+  +YG  G+ Y++WAA  AV
Sbjct: 61  KTEMQRFTEKVVATMKGAGLYASQGGPIILSQIENEYGNIAASYGAAGKSYIRWAAGMAV 120

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            L+T VPWVMCQQ DAP+P+INTCNGFYCD FTP+ PS+P +WTEN+SGWFLSFG AVP+
Sbjct: 121 ALDTGVPWVMCQQTDAPEPLINTCNGFYCDQFTPSLPSRPKLWTENWSGWFLSFGGAVPY 180

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RP EDLAFAVARF++ GGT QNYYMY GGTNFGR++GGP ++TSYDYDAPIDEYG +RQP
Sbjct: 181 RPTEDLAFAVARFYQRGGTLQNYYMYHGGTNFGRSSGGPFISTSYDYDAPIDEYGLVRQP 240

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHLR++HKAIK+CE  LI++DP++  LG   EAH+Y KS + CAAFLAN D  SD  V
Sbjct: 241 KWGHLRDVHKAIKMCEPALIATDPSYMSLGQNAEAHVY-KSGSLCAAFLANIDDQSDKTV 299

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ------RNNGDHPFAQQKNVNELLLAS 419
           TFNG  Y LPAWSVSILPDCKNVV NTA++ SQ      RN G    A   +  E  LA+
Sbjct: 300 TFNGKAYKLPAWSVSILPDCKNVVLNTAQINSQVASTQMRNLGFSTQASDGSSVEAELAA 359

Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVFLN 475
           S++S+  E VGI+   +  +P L EQINTT D SD+LWY+ SI V  G+    G +  L 
Sbjct: 360 SSWSYAVEPVGITKENALTKPGLMEQINTTADASDFLWYSTSIVVAGGEPYLNGSQSNLL 419

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + SLGH   VF+N KL     G+   +   +   + L  G N +D+LS  VGL NYGA+F
Sbjct: 420 VNSLGHVLQVFINGKLAGSSKGSASSSLISLTTPVTLVTGKNKIDLLSATVGLTNYGAFF 479

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           D+ GAG+   + +    G  DLSS EW YQ+G+ GE + L   S A S  W   ++ P N
Sbjct: 480 DLVGAGITGPVKLTGPKGTLDLSSAEWTYQIGLRGEDLHLYNPSEA-SPEWVSDNSYPTN 538

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
             L WYK+ F AP G  P+A++   MGKG+AWVNGQSIGRYW   +AP +GC   C+YRG
Sbjct: 539 NPLTWYKSKFTAPAGDDPVAIDFTGMGKGEAWVNGQSIGRYWPTNIAPQSGCVNSCNYRG 598

Query: 656 SYDASKCQKHCGQPAQTL 673
           SY A+KC K CGQP+Q L
Sbjct: 599 SYSATKCLKKCGQPSQIL 616


>gi|357484445|ref|XP_003612510.1| Beta-galactosidase [Medicago truncatula]
 gi|355513845|gb|AES95468.1| Beta-galactosidase [Medicago truncatula]
          Length = 828

 Score =  744 bits (1920), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/831 (45%), Positives = 517/831 (62%), Gaps = 43/831 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V YD  AL+I+G+RR++ SG+IHYPRST ++WP+L++K+K+GGL+ IETY+FW+ HE +R
Sbjct: 25  VKYDSNALIINGERRLIFSGAIHYPRSTVDMWPDLVQKAKDGGLDAIETYIFWDRHEQVR 84

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y F G  D V+F KT+QEAGL+  +RIGPY+CAEWNYGGFPVWLH IPGI+ RT N  
Sbjct: 85  GRYNFSGNLDFVKFFKTIQEAGLYGIIRIGPYSCAEWNYGGFPVWLHQIPGIEMRTDNAA 144

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K EM+ F+ KII++ K+ NLFASQGGPIILAQ+ENEYG++ W +   G+ Y+KWAA  A
Sbjct: 145 YKNEMQIFVTKIINVAKEANLFASQGGPIILAQIENEYGDIMWNFKEPGKAYIKWAAQMA 204

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           +  N  VPW MCQQ DAP PIINTCNG+YC  F PN+P  P M+TEN+ GWF  +G   P
Sbjct: 205 LAQNIGVPWFMCQQNDAPQPIINTCNGYYCHNFKPNNPKSPKMFTENWIGWFQKWGERAP 264

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  ED A+AVARFF+ GG F NYYMY GGTNFGRT+GGP + TSYDYDAPI+EYG + Q
Sbjct: 265 HRTAEDSAYAVARFFQNGGVFNNYYMYHGGTNFGRTSGGPYIITSYDYDAPINEYGNLNQ 324

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQK-LGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           PK+GHL+ LH+AIKL E+ L +    + K LG  +    Y  S      FL+N   ++D 
Sbjct: 325 PKYGHLKFLHEAIKLGEKVLTNYTSRNDKDLGNGITLTTYTNSVGARFCFLSNDKDNTDG 384

Query: 364 NVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           NV   N   YF+PAWSV+IL  C   VFNTAKV SQ +        +K ++        +
Sbjct: 385 NVDLQNDGKYFVPAWSVTILDGCNKEVFNTAKVNSQTS------IMEKKIDNSSTNKLTW 438

Query: 423 SWYEE--KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLNIESL 479
           +W  E  K  ++G  S     L EQ   T D SDYLWY  S+ +          L++E+ 
Sbjct: 439 AWIMEPKKDTMNGRGSIKAHQLLEQKELTLDASDYLWYMTSVDINDTSNWSNANLHVETS 498

Query: 480 GHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
           GH    +VNK+ +  GYG+  F  NF   K++ L  G N + +LS  VGL NYGA FD  
Sbjct: 499 GHTLHGYVNKRYI--GYGHSQFGNNFTYEKQVSLKNGTNIITLLSATVGLANYGARFDEI 556

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             G+    V L+   +   DLS+G W ++VG+ GE      +   +   W   S+ P  K
Sbjct: 557 KTGISDGPVKLVGQNSVTIDLSTGNWSFKVGLNGEKRRFYDLQPRSGVAWNT-SSYPTGK 615

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYKT F +P G  P+ ++L  +GKG AWVNG+SIGRYW++++  + GC+  CDYRG+
Sbjct: 616 PLTWYKTQFKSPLGPNPIVVDLQGLGKGHAWVNGKSIGRYWTSWITSTAGCSDTCDYRGN 675

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           Y   KC   C  P+Q  YH+PR++++   N L++ EE+GG+P  +S LT+T + IC+ V 
Sbjct: 676 YKKEKCNTGCASPSQRWYHVPRSFLNDDMNTLILFEEIGGNPQNVSFLTETTKTICANVY 735

Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MD 775
           E                    ++ L+C+ G  I +INFAS+G P+G CGSF+ G+   ++
Sbjct: 736 EGG------------------KLELSCQNGQVITSINFASFGNPQGQCGSFKKGSWESLN 777

Query: 776 VLPIVQKACVGQIECSIPVSSAYLGV-------SAGACPGLLKALAVEAHC 819
              +++ +C+G+  C   V+    GV       S  +    +  LAV+A C
Sbjct: 778 SQSMMETSCIGKTGCGFTVTRDMFGVNLDPLSASKASVKDGIPRLAVQATC 828


>gi|356558952|ref|XP_003547766.1| PREDICTED: beta-galactosidase 7-like [Glycine max]
          Length = 826

 Score =  741 bits (1914), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 378/832 (45%), Positives = 512/832 (61%), Gaps = 42/832 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            +  VTYD R+L+I+G+RRV+ SG++HYPRST ++WP++I+K+K+GGL+ IE+YVFW+ H
Sbjct: 24  FATEVTYDARSLIINGERRVIFSGAVHYPRSTVQMWPDIIQKAKDGGLDAIESYVFWDRH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R +Y F G  D ++F + +QEAGL+  LRIGPY CAEWN+GGFP+WLH +PGI+ RT
Sbjct: 84  EPVRREYDFSGNLDFIKFFQIIQEAGLYAILRIGPYVCAEWNFGGFPLWLHNMPGIELRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N  +K EM+ F  KI+++ K+  LFASQGGPIILAQ+ENEYGN+   YG  G+ Y+KW 
Sbjct: 144 DNPIYKNEMQIFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNIMTDYGEAGKTYIKWC 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+  N  VPW+MCQQ DAP P+INTCNG YCD F PN+P  P M+TEN+ GWF  +G
Sbjct: 204 AQMALAQNIGVPWIMCQQHDAPQPMINTCNGHYCDSFQPNNPKSPKMFTENWIGWFQKWG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP R  ED AF+VARFF+ GG   NYYMY GGTNFGRTAGGP + TSY+YDAP+DEYG
Sbjct: 264 ERVPHRSAEDSAFSVARFFQNGGILNNYYMYHGGTNFGRTAGGPYMTTSYEYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            + QPKWGHL++LH AIKL E+ + +   T +  G ++    Y  ++ +   FL+N + S
Sbjct: 324 NLNQPKWGHLKQLHAAIKLGEKIITNGTRTDKDFGNEVTLTTYTHTNGERFCFLSNTNDS 383

Query: 361 SDANVTF--NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
            DANV    +GN YFLPAWSV+IL  C   VFNTAKV SQ +           V +   A
Sbjct: 384 KDANVDLQQDGN-YFLPAWSVTILDGCNKEVFNTAKVNSQTS---------IMVKKSDDA 433

Query: 419 SSAFSWY----EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-GKEVF 473
           S+  +W     ++K  + G  +F    L EQ   T D SDYLWY  S+ +          
Sbjct: 434 SNKLTWAWIPEKKKDTMHGKGNFKVNQLLEQKELTFDVSDYLWYMTSVDINDTSIWSNAT 493

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + + GH    +VN + V + +      NF   K + L +G+N + +LS  VGL NYGA
Sbjct: 494 LRVNTRGHTLRAYVNGRHVGYKFSQWG-GNFTYEKYVSLKKGLNVITLLSATVGLPNYGA 552

Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
            FD    G+    V LI   N   DLS+  W Y++G+ GE   L          W+  S 
Sbjct: 553 KFDKIKTGIAGGPVQLIGNNNETIDLSTNLWSYKIGLNGEKKRLYDPQPRIGVSWRTNSP 612

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
            P+ +SL WYK  F+AP G  P+ ++L  +GKG+AWVNGQSIGRYW++++  + GC+  C
Sbjct: 613 YPIGRSLTWYKADFVAPSGNDPVVVDLLGLGKGEAWVNGQSIGRYWTSWITATNGCSDTC 672

Query: 652 DYRGSY-DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
           DYRG Y  A KC  +CG P+Q  YH+PR+++   +N LV+ EE+GG+P  +S  T     
Sbjct: 673 DYRGKYVPAQKCNTNCGNPSQRWYHVPRSFLKNDKNTLVLFEEIGGNPQNVSFQTVITGT 732

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           IC+ V E                     + L+C+ G  I+ I F+S+G P GNCGSF+ G
Sbjct: 733 ICAQVQEG------------------ALLELSCQGGKTISQIQFSSFGNPTGNCGSFKKG 774

Query: 771 ACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGA--CPGLLKALAVEAHC 819
                D   +V+ ACVG+  C   V+    GV+ G       +  LAV+A C
Sbjct: 775 TWEATDGQSVVEAACVGRNSCGFMVTKEAFGVAIGPMNVDERVARLAVQATC 826


>gi|79517234|ref|NP_568399.4| beta-galactosidase 7 [Arabidopsis thaliana]
 gi|152013363|sp|Q9SCV5.2|BGAL7_ARATH RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
           Precursor
 gi|332005497|gb|AED92880.1| beta-galactosidase 7 [Arabidopsis thaliana]
          Length = 826

 Score =  741 bits (1913), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/829 (45%), Positives = 517/829 (62%), Gaps = 38/829 (4%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V++D RA+ I+GKRR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HE
Sbjct: 25  STIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P R +Y F G  D+VRF+KT+Q+AGL+  LRIGPY CAEWNYGGFPVWLH +P ++FRT 
Sbjct: 85  PKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTV 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  F  EM+ F  KI+ +MK+E LFASQGGPIILAQ+ENEYGNV  +YG  G+ Y+ W A
Sbjct: 145 NPSFMNEMQNFTTKIVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A +L+  VPW+MCQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G 
Sbjct: 205 NMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGG 264

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+R  EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY AP+DE+G 
Sbjct: 265 KHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGN 324

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           + QPKWGHL++LH  +K  E+ L   + +   LG  ++A IY  +    + F+ N ++++
Sbjct: 325 LNQPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATA 383

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DA V F G  Y +PAWSVS+LPDC    +NTAKV +Q +      ++ + +       SA
Sbjct: 384 DALVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPESA 443

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIES 478
                +K+ + G+   +   L +Q + T D SDYLWY   +H+    P   + + L + S
Sbjct: 444 -----QKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHS 498

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDV 537
             H    +VN K V   +      ++   +K+  L  G N + +LS+ VGLQNYG +F+ 
Sbjct: 499 NAHVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFES 558

Query: 538 AGAGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
              G+   V L+  K     ++DLS  +W Y++G+ G    L  I       W     LP
Sbjct: 559 GPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWAN-EKLP 617

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L WYK  F AP GK P+ ++L  +GKG+AW+NGQSIGRYW ++ +   GC  +CDY
Sbjct: 618 TGRMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDY 677

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           RG+Y + KC   CG+P Q  YH+PR++++  G N + + EE+GG+PS ++  T     +C
Sbjct: 678 RGAYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVC 737

Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           +   E +                  +V L+C     I+A+ FAS+G P G+CGSF  G C
Sbjct: 738 ARAHEHN------------------KVELSCHNR-PISAVKFASFGNPLGHCGSFAVGTC 778

Query: 773 H--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
               D    V K CVG++ C++ VSS   G S   C    K LAVE  C
Sbjct: 779 QGDKDAAKTVAKECVGKLNCTVNVSSDTFG-STLDCGDSPKKLAVELEC 826


>gi|225441062|ref|XP_002284027.1| PREDICTED: beta-galactosidase-like [Vitis vinifera]
          Length = 833

 Score =  739 bits (1909), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/835 (46%), Positives = 515/835 (61%), Gaps = 51/835 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T D R ++I+G+R++L SGS+HYPRSTPE+WP+LI+KSK+GGL  I+TYVFW+ HEP R
Sbjct: 30  ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DLVRF+K +Q  GL+  LRIGPY CAEW YGGFPVWLH  P IQ RT N  
Sbjct: 90  RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +  EM+ F   I+D+MK+E LFASQGGPII++Q+ENEYGNV  AY   G  Y+ W A  A
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
             L+T VPW+MCQQ++AP P+INTCNG+YCD FTPN+P+ P MWTEN+SGW+ ++G + P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP++EYG   Q
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
           PKWGHLR+LH  +   E+ L   D  +        A IY ++  + C  F  N ++  D 
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSNADRDV 387

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            + + G  Y +PAWSVSILPDC N V+NTAKV SQ +     F ++ +  E    S  ++
Sbjct: 388 TINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYST----FVKKGSEAENEPNSLQWT 443

Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIESLG 480
           W  E +       F   +L +Q    +DTSDYL+Y  ++ +    P  GK++ L++ + G
Sbjct: 444 WRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTTVDISNDDPIWGKDLTLSVNTSG 503

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           H    FVN + + + Y       F   + + L  G N + +LS  VGL NYG  FD+   
Sbjct: 504 HILHAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMVNQ 563

Query: 541 GLFSVILIDLKNGKRDL-----SSGEWIYQVGVEGEYIGLDKISLANSSF--WKQGSTLP 593
           G+   + I   NG  D+     ++ +W Y+ G+ GE     KI L  + +  WK    LP
Sbjct: 564 GIHGPVQIIASNGSADIIKDLSNNNQWAYKAGLNGE---DKKIFLGRARYNQWKS-DNLP 619

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           VN+S +WYK TF AP G+ P+ ++L  +GKG+AWVNG S+GRYW +Y+A   GC+ +CDY
Sbjct: 620 VNRSFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDY 679

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
           RG Y A KC  +CG P+Q  YH+PR+++   +N LV+ EE GG+PS ++  T T  + C+
Sbjct: 680 RGPYKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNACA 739

Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS------- 766
              E                     + L+C+ G  I+ I FAS+G P+G CG        
Sbjct: 740 NAREG------------------YTLELSCQ-GRAISGIKFASFGDPQGTCGKPFATGSQ 780

Query: 767 -FRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            F  G C   D L I+QK CVG+  CSI VS   LG     C    K LAVEA C
Sbjct: 781 VFEKGTCEAADSLSIIQKLCVGKYSCSIDVSEQILG--PAGCTADTKRLAVEAIC 833


>gi|357484129|ref|XP_003612351.1| Beta-galactosidase [Medicago truncatula]
 gi|355513686|gb|AES95309.1| Beta-galactosidase [Medicago truncatula]
          Length = 806

 Score =  738 bits (1906), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/835 (46%), Positives = 514/835 (61%), Gaps = 50/835 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            +  VTYD  AL+I+G+RR++ SG+IHYPRST E+WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 6   FATEVTYDSNALIINGERRLIFSGAIHYPRSTVEMWPDLIQKAKDGGLDAIETYIFWDRH 65

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R +Y F G  D V+F + +Q+AGL+  +RIGPYACAEWN+GGFP WLH +PGI+ RT
Sbjct: 66  EPVRREYNFSGNLDFVKFFQLIQKAGLYAIMRIGPYACAEWNFGGFPSWLHNMPGIELRT 125

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N+ +K EM+ F  +I++++K+  LFASQGGPIILAQ+ENEYG++ W Y   G+ YV+WA
Sbjct: 126 NNSVYKNEMQNFTTEIVNVVKEAKLFASQGGPIILAQIENEYGDIMWNYKDAGKAYVQWA 185

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+  N  VPW+MCQQ+DAP PIINTCNG+YC  F PN+P  P ++TEN+ GWF  +G
Sbjct: 186 AQMALAQNIGVPWIMCQQQDAPQPIINTCNGYYCHNFQPNNPKSPKIFTENWIGWFQKWG 245

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP R  ED AF+VARFF+ GG   NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 246 ERVPHRSAEDSAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGGPYITTSYDYDAPIDEYG 305

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLIS-SDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
            + QPKWGHL+ LH AIKL E  L + S    + LG  L    Y  SS     FL+N ++
Sbjct: 306 NLNQPKWGHLKNLHAAIKLGENVLTNYSARKDEDLGNGLTLTTYTNSSGARFCFLSNNNN 365

Query: 360 SS-DANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
           +   A V   N  VY +PAWSVSI+  C   VFNTAKV SQ +        +K+ N   +
Sbjct: 366 TDLGARVDLKNDGVYIVPAWSVSIINGCNQEVFNTAKVNSQTS-----MMVKKSDN---V 417

Query: 418 ASSAFSWYEEKV-----GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-GKE 471
           +S+  +W E KV      I GN S     L EQ   T D SDYLWY  S  +        
Sbjct: 418 SSTNLTW-EWKVEPKRDTIHGNGSLKAQKLLEQKELTLDASDYLWYMTSADINDTSIWSN 476

Query: 472 VFLNIESLGHAALVFVNKKLVAF---GYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
             L + + GH+   +VN++ V +    YGN     F   K++ L  G N + +LS  VGL
Sbjct: 477 ATLRVNTSGHSLHGYVNQRYVGYQFSQYGNQ----FTYEKQVSLKNGTNIITLLSATVGL 532

Query: 529 QNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
            NYGAWFD    G+    V LI   N   DLS+  W Y++G+ GE   L       S  W
Sbjct: 533 ANYGAWFDDKKTGISGGPVELIGKNNVTMDLSTNLWSYKIGLNGERRHLYDAQQNVSVAW 592

Query: 587 KQGST-LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
              S+ +P+ K LIWY+  F +P G  P+ ++L  +GKG AWVNG SIGRYWS++++PS 
Sbjct: 593 HTNSSYIPIGKPLIWYRAKFKSPFGTNPIVVDLQGLGKGHAWVNGHSIGRYWSSWISPSD 652

Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
           GC+  CDYRG+Y   KC  +CG P+Q  YH+PR++++   N LV+ EE+GG+P  +   T
Sbjct: 653 GCSDTCDYRGNYVPVKCNTNCGSPSQRWYHVPRSFLNHDMNTLVLFEEIGGNPQSVQFQT 712

Query: 706 KTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCG 765
            T   IC+ V E                    Q  L+C+ G  ++ I FASYG PEG CG
Sbjct: 713 VTTGTICANVYEG------------------AQFELSCQSGQVMSQIQFASYGNPEGQCG 754

Query: 766 SFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           SF+ G     +   +V+ +CVG+  C   V+    GV+  +    +  LAV+  C
Sbjct: 755 SFKKGNFDAANSQSVVEASCVGKNNCGFNVTKEMFGVTNVSS---IPRLAVQVTC 806


>gi|267026|sp|Q00662.1|BGAL_DIACA RecName: Full=Putative beta-galactosidase; Short=Lactase; AltName:
           Full=SR12 protein; Flags: Precursor
 gi|18328|emb|CAA40459.1| CARSR12 [Dianthus caryophyllus]
          Length = 731

 Score =  738 bits (1905), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/714 (50%), Positives = 483/714 (67%), Gaps = 22/714 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            NV YD+RA+ I+ +RR+L SGSIHYPRSTPE+WP++I K+K+  L+VI+TYVFWN HEP
Sbjct: 29  GNVWYDYRAIKINDQRRILLSGSIHYPRSTPEMWPDIIEKAKDSQLDVIQTYVFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFEGR+DLV+F+K + +AGLF+HLRIGP+ACAEWN+GGFPVWL ++PGI+FRT N
Sbjct: 89  SEGKYYFEGRYDLVKFIKLIHQAGLFVHLRIGPFACAEWNFGGFPVWLKYVPGIEFRTDN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFKE+M+ F  KI+D+MK E LF  QGGPIIL Q+ENEYG VEW  G  G+ Y  WAA 
Sbjct: 149 GPFKEKMQVFTTKIVDMMKAEKLFHWQGGPIILNQIENEYGPVEWEIGAPGKAYTHWAAQ 208

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A +LN  VPW+MC+Q+ D PD +I+TCNGFYC+GF P   SKP MWTEN++GW+  +G 
Sbjct: 209 MAQSLNAGVPWIMCKQDSDVPDNVIDTCNGFYCEGFVPKDKSKPKMWTENWTGWYTEYGK 268

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            VP+RP ED+AF+VARF + GG+F NYYM+ GGTNF  TA G  V+TSYDYDAP+DEYG 
Sbjct: 269 PVPYRPAEDVAFSVARFIQNGGSFMNYYMFHGGTNFETTA-GRFVSTSYDYDAPLDEYGL 327

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
            R+PK+ HL+ LHKAIK+CE  L+SSD     LG+  EAH+Y  +S  CAAFLANYD   
Sbjct: 328 PREPKYTHLKNLHKAIKMCEPALVSSDAKVTNLGSNQEAHVYSSNSGSCAAFLANYDPKW 387

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              VTF+G  + LPAWS+SILPDCK  V+NTA+V       + P  +  +    ++++  
Sbjct: 388 SVKVTFSGMEFELPAWSISILPDCKKEVYNTARV-------NEPSPKLHSKMTPVISNLN 440

Query: 422 FSWYEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPG------QGKEVFL 474
           +  Y ++V  + +  +F    L EQIN T D SDYLWY   + V+ G      +G E +L
Sbjct: 441 WQSYSDEVPTADSPGTFREKKLYEQINMTWDKSDYLWYMTDV-VLDGNEGFLKKGDEPWL 499

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + S GH   VFVN +L    YG+        ++K+++  G+N + +LS +VGL N G  
Sbjct: 500 TVNSAGHVLHVFVNGQLQGHAYGSLAKPQLTFSQKVKMTAGVNRISLLSAVVGLANVGWH 559

Query: 535 FDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           F+    G+   V L  L  G RDL+   W Y++G +GE     ++  +  S   Q     
Sbjct: 560 FERYNQGVLGPVTLSGLNEGTRDLTWQYWSYKIGTKGEE---QQVYNSGGSSHVQWGPPA 616

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
             + L+WYKTTF AP G  PLAL+L SMGKGQAW+NGQSIGR+WS  +A  + C   C+Y
Sbjct: 617 WKQPLVWYKTTFDAPGGNDPLALDLGSMGKGQAWINGQSIGRHWSNNIAKGS-CNDNCNY 675

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            G+Y  +KC   CG+ +Q  YH+PR+W+ P  NLLV+ EE GGD   +SL+ +T
Sbjct: 676 AGTYTETKCLSDCGKSSQKWYHVPRSWLQPRGNLLVVFEEWGGDTKWVSLVKRT 729


>gi|297808143|ref|XP_002871955.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
 gi|297317792|gb|EFH48214.1| beta-galactosidase 7 [Arabidopsis lyrata subsp. lyrata]
          Length = 826

 Score =  737 bits (1902), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/829 (44%), Positives = 516/829 (62%), Gaps = 38/829 (4%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V++D RA+ I+GKRR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HE
Sbjct: 25  STIVSHDERAITINGKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P R +Y F G  D+VRF+KT+Q+AGL+  LRIGPY CAEWNYGGFPVWLH +P ++FRT 
Sbjct: 85  PKRREYDFSGNLDVVRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTV 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  F  EM+ F  KI+++MK+E LFASQGGPIILAQ+ENEYGNV  +YG  G+ Y+ W A
Sbjct: 145 NPSFMNEMQNFTTKIVEMMKEEKLFASQGGPIILAQIENEYGNVISSYGAAGKAYIDWCA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A +L+  VPW+MCQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G 
Sbjct: 205 NMANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGG 264

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P+R  EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY APIDE+G 
Sbjct: 265 KHPYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPIDEFGN 324

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           + QPKWGHL++LH+ +K  E+ L   + +   LG  ++A IY  +    + F+ N ++++
Sbjct: 325 LNQPKWGHLKQLHRVLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATA 383

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           +A V F G  Y +PAWSVS+LP+C    +NTAKV +Q +      ++ + +       SA
Sbjct: 384 NALVNFKGKDYHVPAWSVSVLPECDKEAYNTAKVNTQTSIMTEDSSKPEKLEWTWRPESA 443

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIES 478
                +K+ +  +   +   L +Q + T D SDYLWY   +H+    P   + + L + S
Sbjct: 444 -----QKMILKSSGDLIAKGLVDQKDVTNDASDYLWYMTRVHLDKKDPLWSRNMTLRVHS 498

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDV 537
             H    +VN K V   +      ++   KK+  L  G N + +LS+ VGLQNYGA+F+ 
Sbjct: 499 NAHVLHAYVNGKYVGNQFVKDGKFDYRFEKKVNHLVHGTNHISLLSVSVGLQNYGAFFES 558

Query: 538 AGAGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
              G+   V L+  K     ++DLS  +W Y++G+ G    L          W      P
Sbjct: 559 GPTGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNNKLFSTKSVGHIKWAN-EMFP 617

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
            ++ L WYK  F AP GK P+ ++   +GKG+AW+NGQSIGRYW ++ +   GC  +CDY
Sbjct: 618 TSRMLTWYKAKFKAPLGKEPVIVDFNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDY 677

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHIC 712
           RG Y + KC   CG+P Q  YH+PR+++   G N + + EE+GG+PS ++  T     +C
Sbjct: 678 RGEYGSDKCAFMCGEPTQRWYHVPRSFLKASGHNTITLFEEMGGNPSMVNFKTVVVGTVC 737

Query: 713 SFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
           +   E +                  +V L+C     I+A+ FAS+G P G+CG+F  G C
Sbjct: 738 ARAHEHN------------------KVELSCHNH-PISAVKFASFGNPVGHCGTFAVGTC 778

Query: 773 H--MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
               D +  V K CVG++ C+I VSS   G S   C    K LAVE  C
Sbjct: 779 QGDKDAVKTVAKECVGKLNCTINVSSDTFG-STLDCGDSPKKLAVELEC 826


>gi|297740029|emb|CBI30211.3| unnamed protein product [Vitis vinifera]
          Length = 829

 Score =  736 bits (1901), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/832 (46%), Positives = 514/832 (61%), Gaps = 49/832 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T D R ++I+G+R++L SGS+HYPRSTPE+WP+LI+KSK+GGL  I+TYVFW+ HEP R
Sbjct: 30  ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 89

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DLVRF+K +Q  GL+  LRIGPY CAEW YGGFPVWLH  P IQ RT N  
Sbjct: 90  RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 149

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +  EM+ F   I+D+MK+E LFASQGGPII++Q+ENEYGNV  AY   G  Y+ W A  A
Sbjct: 150 YMSEMQTFTTMIVDMMKKEQLFASQGGPIIISQIENEYGNVMRAYHDAGVQYINWCAQMA 209

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
             L+T VPW+MCQQ++AP P+INTCNG+YCD FTPN+P+ P MWTEN+SGW+ ++G + P
Sbjct: 210 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 269

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP++EYG   Q
Sbjct: 270 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 329

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
           PKWGHLR+LH  +   E+ L   D  +        A IY ++  + C  F  N ++  D 
Sbjct: 330 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSNADRDV 387

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            + + G  Y +PAWSVSILPDC N V+NTAKV SQ +     F ++ +  E    S  ++
Sbjct: 388 TINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYST----FVKKGSEAENEPNSLQWT 443

Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAA 483
           W  E +       F   +L +Q    +DTSDYL+Y  + +  P  GK++ L++ + GH  
Sbjct: 444 WRGETIQYITPGRFTASELLDQKTVAEDTSDYLYYMTT-NDDPIWGKDLTLSVNTSGHIL 502

Query: 484 LVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF 543
             FVN + + + Y       F   + + L  G N + +LS  VGL NYG  FD+   G+ 
Sbjct: 503 HAFVNGEHIGYQYALLGQFEFQFRRSVTLQLGKNEITLLSATVGLTNYGPDFDMVNQGIH 562

Query: 544 SVILIDLKNGKRDL-----SSGEWIYQVGVEGEYIGLDKISLANSSF--WKQGSTLPVNK 596
             + I   NG  D+     ++ +W Y+ G+ GE     KI L  + +  WK    LPVN+
Sbjct: 563 GPVQIIASNGSADIIKDLSNNNQWAYKAGLNGE---DKKIFLGRARYNQWKS-DNLPVNR 618

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
           S +WYK TF AP G+ P+ ++L  +GKG+AWVNG S+GRYW +Y+A   GC+ +CDYRG 
Sbjct: 619 SFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYRGP 678

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           Y A KC  +CG P+Q  YH+PR+++   +N LV+ EE GG+PS ++  T T  + C+   
Sbjct: 679 YKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFGGNPSSVTFQTVTVGNACANAR 738

Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS--------FR 768
           E                     + L+C+ G  I+ I FAS+G P+G CG         F 
Sbjct: 739 EG------------------YTLELSCQ-GRAISGIKFASFGDPQGTCGKPFATGSQVFE 779

Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G C   D L I+QK CVG+  CSI VS   LG     C    K LAVEA C
Sbjct: 780 KGTCEAADSLSIIQKLCVGKYSCSIDVSEQILG--PAGCTADTKRLAVEAIC 829


>gi|449442765|ref|XP_004139151.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
           sativus]
          Length = 803

 Score =  735 bits (1898), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/843 (44%), Positives = 522/843 (61%), Gaps = 65/843 (7%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  NV+YD  A++I+G+RRV+ SGSIHYPRST  +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 1   MGDNVSYDSNAIIINGERRVIFSGSIHYPRSTDAMWPDLIQKAKDGGLDAIETYIFWDRH 60

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP R +Y F G  + ++F + VQ+AGL++ +RIGPY CAEWNYGGFP+WLH +PGIQ RT
Sbjct: 61  EPQRQKYDFSGHLNFIKFFQLVQDAGLYIVMRIGPYVCAEWNYGGFPLWLHNMPGIQLRT 120

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N  +K EM  F  KI+++ KQ NLFASQGGPIILAQ+ENEYGNV   YG  G+ Y+ W 
Sbjct: 121 DNQVYKNEMLTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPYGNAGKAYINWC 180

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A + N  VPW+MCQQ DAP PIINTCNGFYCD F+PN+P  P M+TEN+ GWF  +G
Sbjct: 181 AQMAESFNIGVPWIMCQQSDAPQPIINTCNGFYCDSFSPNNPKSPKMFTENWVGWFKKWG 240

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P+R  ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEYG
Sbjct: 241 DKDPYRSAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEYG 300

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY----------HKSSNDC 350
            + QPKWGHL++LH +IKL E+ L +   +++  G+ +    +          + ++ + 
Sbjct: 301 NLNQPKWGHLKQLHSSIKLGEKILTNGTHSNKTFGSFVTFKTFGSFVTLTKFSNPTTKER 360

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK 410
             FL+N   +           YF+PAWSVSI+  CK  VFNTAK+ SQ +     F + +
Sbjct: 361 FCFLSNTXKADGK--------YFVPAWSVSIIDGCKKEVFNTAKINSQTS----IFVKVQ 408

Query: 411 NVNELLLASSAFSWYEEKVG--ISGNRSFVRPDLAEQINTTKDTSDYLWY--------TA 460
           N  E +  S  + W  E +   + G  +F    L EQ  TT D+SDYLWY        T+
Sbjct: 409 NEKENVKLS--WVWAPEAMSDTLQGKGTFKENLLLEQKGTTIDSSDYLWYMTNVETNGTS 466

Query: 461 SIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
           SIH        V L + + GH    FVN + +   +GN+   +F+  K I L  G N + 
Sbjct: 467 SIH-------NVTLQVNTKGHVLHAFVNTRYIGSQWGNNG-QSFVFEKPILLKAGTNIIT 518

Query: 521 ILSMMVGLQNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
           +LS  VGL+NY A++D    G+    + LI   N K DLSS  W Y+VG+ GE   L   
Sbjct: 519 LLSATVGLKNYDAFYDTLPTGIDGGPIYLIGDGNVKIDLSSNLWSYKVGLNGEIKQLYNP 578

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
             +  + W   +   + + + WYKT+F  P G  P+ L++  MGKG+AW+NGQSIGR+W 
Sbjct: 579 VFSQETSWNTLNKNSIGRRMTWYKTSFKTPSGIDPVTLDMQGMGKGEAWINGQSIGRFWP 638

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
           +++A +  C++ CDYRG+YD SKC  +CG P+Q  YHIPR+++    N LV+ EE+GG P
Sbjct: 639 SFIAGNDNCSETCDYRGAYDPSKCVGNCGNPSQRWYHIPRSFLSNNTNTLVLFEEIGGSP 698

Query: 699 SKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYG 758
            ++S+ T T   IC   +E                     + L+C+  + I+ I FASYG
Sbjct: 699 QQVSVQTITIGTICGNANEGS------------------TLELSCQGEYIISEIQFASYG 740

Query: 759 IPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
            P+G CGSF+ G+  + +   +++K C G   CS+ VS+   G+  G    L   L V+A
Sbjct: 741 NPKGKCGSFKQGSWDVTNSALLLEKTCKGMKSCSVDVSAKLFGL--GDAVNLSARLVVQA 798

Query: 818 HCS 820
            CS
Sbjct: 799 LCS 801


>gi|224066807|ref|XP_002302225.1| predicted protein [Populus trichocarpa]
 gi|222843951|gb|EEE81498.1| predicted protein [Populus trichocarpa]
          Length = 798

 Score =  730 bits (1884), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/830 (46%), Positives = 525/830 (63%), Gaps = 49/830 (5%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           +NVTYD R+LVI+GK +++ SGSIHYPRSTP++WP LI K++ GGL+ I+TYVFWN HEP
Sbjct: 6   SNVTYDSRSLVINGKHKIIFSGSIHYPRSTPQMWPYLISKARAGGLDAIDTYVFWNLHEP 65

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQY F GR DLVRF+K V   GL++ LRIGP+  +EW YGG P WLH +PGI FR+ N
Sbjct: 66  QQGQYDFSGRKDLVRFIKEVHAQGLYVCLRIGPFIESEWTYGGLPFWLHDVPGIVFRSDN 125

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+R+   I+ ++K E L+ASQGGPIIL+Q+ENEYGNVE A+   G  YVKWAA 
Sbjct: 126 KPFKYHMERYAKMIVKMLKAEKLYASQGGPIILSQIENEYGNVEAAFHEKGPPYVKWAAK 185

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L+T VPWVMC+Q+DAPDP+IN CNG  C + F+ PNSP KP +WTEN++  + ++G
Sbjct: 186 MAVGLHTGVPWVMCKQDDAPDPVINACNGLRCGETFSGPNSPRKPAIWTENWTSVYQTYG 245

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+AF  A F   GG+F NYYMY GGTNFGRTA    V TSY   AP+DEYG
Sbjct: 246 KETRSRSAEDIAFHAALFIAKGGSFVNYYMYHGGTNFGRTA-AEYVPTSYYDQAPLDEYG 304

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPK GHL+ELH AIKLC + L+S    +  LG   EA  + ++S++CAAFL N+D  
Sbjct: 305 LLRQPKHGHLKELHAAIKLCRKPLLSRKWINFSLGQLQEAFAFERNSDECAAFLVNHDGR 364

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S+A V F G+ Y LP  S+SILP CK V FNTA+V +Q        A +++  +     S
Sbjct: 365 SNATVHFKGSSYKLPPKSISILPHCKTVAFNTAQVSTQYGTR---LATRRHKFD-----S 416

Query: 421 AFSWYEEKVGI-SGNRSFVRPD-LAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
              W E K  I S ++S +R + L E +NTTKD+SDYLWYT   H        V L + S
Sbjct: 417 IEQWKEYKEYIPSFDKSSLRANTLLEHMNTTKDSSDYLWYTFRFHQNSSNAHSV-LTVNS 475

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
           LGH    FVN + +   +G+HD  +F + + + L  G N + +LS+M GL + GA+ +  
Sbjct: 476 LGHNLHAFVNGEFIGSAHGSHDNKSFTLQRSLPLKRGTNYVSLLSVMTGLPDAGAYLERR 535

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
            AGL  V  I  ++   D ++  W Y+VG+ GE I L + + +  ++W + ++   ++ L
Sbjct: 536 VAGLRRVT-IQRQHELHDFTTYLWGYKVGLSGENIQLHRNNASVKAYWSRYAS--SSRPL 592

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYK+ F AP G  P+ALNLASMGKG+AWVNG+SIGRYW ++L                 
Sbjct: 593 TWYKSIFDAPAGNDPVALNLASMGKGEAWVNGRSIGRYWVSFLDSD-------------- 638

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
                   G P QT  HIPR+++ P  NLLVI EE  G+P  ISL T +   +C  VS +
Sbjct: 639 --------GNPYQTWNHIPRSFLKPSGNLLVILEEERGNPLGISLGTMSITKVCGHVSIS 690

Query: 719 DPPPVDSWKPNLGVVSS-------SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
            PPPV SW+    +  +        P+V+L C RG  I+++ F+S+G P G+C ++  G+
Sbjct: 691 HPPPVISWQGENQINGTRKRKYGRRPKVQLRCPRGRKISSVLFSSFGTPSGDCETYAIGS 750

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH  +    V+KAC+G+  CSIPVSS         CPG+ K+L V+A C+
Sbjct: 751 CHASNSRATVEKACLGKERCSIPVSSK--NFKGDPCPGIAKSLLVDAKCA 798


>gi|255561536|ref|XP_002521778.1| beta-galactosidase, putative [Ricinus communis]
 gi|223538991|gb|EEF40588.1| beta-galactosidase, putative [Ricinus communis]
          Length = 828

 Score =  728 bits (1879), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/826 (47%), Positives = 509/826 (61%), Gaps = 31/826 (3%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYD R+L++DG+R++L SGSIHYPRSTPE+W  LI K+KEGGL+VI+TYVFWN HEP 
Sbjct: 23  DVTYDGRSLIVDGQRKLLFSGSIHYPRSTPEMWQSLIAKAKEGGLDVIDTYVFWNLHEPQ 82

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F GR D+VRF+K VQ  GL++ LRIGP+   EW+YGG P WLH IPGI FR+ N 
Sbjct: 83  PGQYDFSGRRDIVRFIKEVQAQGLYVCLRIGPFIQGEWSYGGLPFWLHDIPGIVFRSDNE 142

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK +M+ F  KI+ +M+ E L+ SQGGPIIL+Q+ENEYG VE AY   G  YVKWAA  
Sbjct: 143 PFKVQMQGFTTKIVTMMQSEKLYVSQGGPIILSQIENEYGTVEEAYHEKGPAYVKWAAQM 202

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
           AV LNT VPWVMC+Q DAPDP+IN CNG  C + F  PNSP+KP +WTEN++  ++  G 
Sbjct: 203 AVGLNTGVPWVMCKQNDAPDPVINACNGLRCAETFVGPNSPNKPAIWTENWTTRYVITGE 262

Query: 242 AVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            +  R VED+AF V +F     G+F NYYMY GGTNFGRTA    V TSY   APIDEYG
Sbjct: 263 NIRIRSVEDIAFQVTQFIVAKKGSFVNYYMYHGGTNFGRTASA-FVPTSYYDQAPIDEYG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPKWGHL+E+H AIKLC   L+S       LG + +A ++   S +CAAFL N D++
Sbjct: 322 LIRQPKWGHLKEMHAAIKLCLTPLLSGGQVTISLGQQQQAFVFTGLSGECAAFLLNNDTA 381

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           + A+V F    Y LP  S+SILPDCK V FNTAKV +Q         +    ++LL    
Sbjct: 382 NTASVQFRNASYDLPPNSISILPDCKTVAFNTAKVSTQYT------TRSMTRSKLLDGED 435

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  Y+E +      S     + EQ++TTKD SDYLWYT           +  LN+ SLG
Sbjct: 436 KWVQYQEAIVNFDETSVKSEAILEQMSTTKDASDYLWYTFRFQ-QESSDTQAVLNVRSLG 494

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           H    FVN + V +  G+H    F +   + L+EG+N + +LS+MVG+ + GA+ +   A
Sbjct: 495 HVLHAFVNGQAVGYAQGSHKNPQFTLQSTVSLSEGVNNVSLLSVMVGMPDSGAYMERRAA 554

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
           GL  V  I  K G ++ ++  W YQVG+ GE + +     ++   W   S   +N  L W
Sbjct: 555 GLRKV-KIQEKEGNKEFTNYSWGYQVGLLGEKLQIFTDQGSSQVQWANFSKNALNP-LTW 612

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKT F AP    P+ALNL SMGKG+AWVNGQSIGRYW +Y A          Y  +    
Sbjct: 613 YKTLFDAPLEDAPVALNLGSMGKGEAWVNGQSIGRYWPSYRASDGSSQIWYAYFNTGAIF 672

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
           +  +         Y++PR+++ P  NLLV+ EE GG+P +IS+ T +   ICS V+ +  
Sbjct: 673 RAVR---------YNVPRSFLKPKGNLLVVLEESGGNPLQISVDTASISKICSHVTASHL 723

Query: 721 PPVDSWKP-----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCG-SFRPGACHM 774
           P V SW       N   + + P+V+L C     I+ I FASYG PEG CG ++  G CH 
Sbjct: 724 PLVSSWSKRTNTDNNNSLQARPRVKLDCPSNTKISNILFASYGTPEGTCGDAYAVGMCHS 783

Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
                IVQKAC+GQ+ CSIPVSS Y G     C    K+L V A C
Sbjct: 784 SSSEAIVQKACLGQMRCSIPVSSKYFG--GDPCSANEKSLLVVAEC 827


>gi|6686886|emb|CAB64743.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 788

 Score =  727 bits (1877), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/815 (45%), Positives = 507/815 (62%), Gaps = 38/815 (4%)

Query: 16  GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
           GKRR+L SGSIHYPRST ++WP+LI K+K+GGL+ IETYVFWN HEP R +Y F G  D+
Sbjct: 1   GKRRILLSGSIHYPRSTADMWPDLINKAKDGGLDAIETYVFWNAHEPKRREYDFSGNLDV 60

Query: 76  VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
           VRF+KT+Q+AGL+  LRIGPY CAEWNYGGFPVWLH +P ++FRT N  F  EM+ F  K
Sbjct: 61  VRFIKTIQDAGLYSVLRIGPYVCAEWNYGGFPVWLHNMPNMKFRTVNPSFMNEMQNFTTK 120

Query: 136 IIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVM 195
           I+ +MK+E LFASQGGPIILAQ+ENEYGNV  +YG  G+ Y+ W A+ A +L+  VPW+M
Sbjct: 121 IVKMMKEEKLFASQGGPIILAQIENEYGNVISSYGAEGKAYIDWCANMANSLDIGVPWLM 180

Query: 196 CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAV 255
           CQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G   P+R  EDLAF+V
Sbjct: 181 CQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKHPYRTAEDLAFSV 240

Query: 256 ARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
           ARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY AP+DE+G + QPKWGHL++LH 
Sbjct: 241 ARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLNQPKWGHLKQLHT 300

Query: 316 AIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
            +K  E+ L   + +   LG  ++A IY  +    + F+ N ++++DA V F G  Y +P
Sbjct: 301 VLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATADALVNFKGKDYHVP 359

Query: 376 AWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNR 435
           AWSVS+LPDC    +NTAKV +Q +      ++ + +       SA     +K+ + G+ 
Sbjct: 360 AWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPESA-----QKMILKGSG 414

Query: 436 SFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIESLGHAALVFVNKKLV 492
             +   L +Q + T D SDYLWY   +H+    P   + + L + S  H    +VN K V
Sbjct: 415 DLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHSNAHVLHAYVNGKYV 474

Query: 493 AFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDL 550
              +      ++   +K+  L  G N + +LS+ VGLQNYG +F+    G+   V L+  
Sbjct: 475 GNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGPTGINGPVSLVGY 534

Query: 551 KNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLA 607
           K     ++DLS  +W Y++G+ G    L  I       W     LP  + L WYK  F A
Sbjct: 535 KGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWAN-EKLPTGRMLTWYKAKFKA 593

Query: 608 PEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCG 667
           P GK P+ ++L  +GKG+AW+NGQSIGRYW ++ +   GC  +CDYRG+Y + KC   CG
Sbjct: 594 PLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDECDYRGAYGSDKCAFMCG 653

Query: 668 QPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW 726
           +P Q  YH+PR++++  G N + + EE+GG+PS ++  T     +C+   E +       
Sbjct: 654 KPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCARAHEHN------- 706

Query: 727 KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH--MDVLPIVQKAC 784
                      +V L+C     I+A+ FAS+G P G+CGSF  G C    D    V K C
Sbjct: 707 -----------KVELSCHNR-PISAVKFASFGNPLGHCGSFAVGTCQGDKDAAKTVAKEC 754

Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           VG++ C++ VSS   G S   C    K LAVE  C
Sbjct: 755 VGKLNCTVNVSSDTFG-STLDCGDSPKKLAVELEC 788


>gi|224103199|ref|XP_002312963.1| predicted protein [Populus trichocarpa]
 gi|222849371|gb|EEE86918.1| predicted protein [Populus trichocarpa]
          Length = 835

 Score =  723 bits (1865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/832 (45%), Positives = 507/832 (60%), Gaps = 52/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+I+GKR +L SGSIHYPRSTP++WPELI K+K GGL VI+TYVFWN HEP +
Sbjct: 31  VTYDERSLIINGKRELLFSGSIHYPRSTPDMWPELILKAKRGGLNVIQTYVFWNIHEPEQ 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ FEG +DLV+F+KT+ E G+F  LR+GP+  AEWN+GG P WL  IP I FR+ N P
Sbjct: 91  GKFNFEGPYDLVKFIKTIGENGMFATLRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F+ KIID+MK+E LFASQGGPIIL+Q+ENEY  V+ AY   G  Y++WA + A
Sbjct: 151 FKHHMEKFVTKIIDMMKEEKLFASQGGPIILSQIENEYNTVQLAYKNLGVSYIQWAGNMA 210

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           + LNT VPWVMC+Q+DAP P+INTCNG +C D FT PN P+KP +WTEN++  F  FG  
Sbjct: 211 LGLNTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNKPNKPSLWTENWTAQFRVFGDP 270

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED AF+VAR+F   G+  NYYMY GGTNF RTA    V T Y  +AP+DEYG  
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
           R+PKWGHL++LH+A+ LC++ L+  +P  QKL A +EA  Y +     CAAFLA+ +S  
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWGNPNVQKLSADVEARFYEQPGTKVCAAFLASNNSKE 389

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V F G  Y+LPA S+SILPDCK VV+NT  V+SQ N+ +  F + +  N+L      
Sbjct: 390 AETVKFRGQEYYLPARSISILPDCKTVVYNTMTVVSQHNSRN--FVKSRKTNKL-----E 442

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEV-----FLNI 476
           ++ Y E +          P   E  N TKD +DY+W+T +I+V      E       L +
Sbjct: 443 WNMYSETIPAQLQVDSSLP--KELYNLTKDKTDYVWFTTTINVDRRDMNERKRINPVLRV 500

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            SLGHA + FVN + +   +G+    +F++   ++L  GIN + +L  +VGL + GA+ +
Sbjct: 501 ASLGHAMVAFVNGEFIGSAHGSQIEKSFVLQHSVDLKPGINFVTLLGTLVGLPDSGAYME 560

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPV 594
              AG   V ++ L  G  DL+S  W +QVG+ GE   L          W   Q +  PV
Sbjct: 561 HRYAGPRGVSILGLNTGTLDLTSNGWGHQVGLSGETAKLFTKEGGGKVTWTKVQKAGPPV 620

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
                WYKT F APEGK P+A+ +  M KG  W+NG+SIGRYW  Y++P           
Sbjct: 621 T----WYKTHFDAPEGKSPVAVRMTGMNKGMIWINGKSIGRYWMTYVSP----------- 665

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                       G+P Q+ YHIPR+++ P +NL+VI EE   +P KI +LT     ICS+
Sbjct: 666 -----------LGEPTQSEYHIPRSYLKPTDNLMVIFEEEEANPEKIEILTVNRDTICSY 714

Query: 715 VSEADPPPVDSWKPNLG-----VVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           V+E  PP V SW+         V ++ P   L C     I A+ FAS+G P G CG +  
Sbjct: 715 VTEYHPPSVKSWERKNNKFTPVVDNAKPAAHLKCPNQKKIIAVQFASFGDPLGTCGDYAV 774

Query: 770 GACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G CH  V   +V++ C+G+  C IP+           CPG+ K LAV+  CS
Sbjct: 775 GTCHSLVSKQVVEEHCLGKTSCDIPIDKGLFAGKKDDCPGISKTLAVQVKCS 826


>gi|238009208|gb|ACR35639.1| unknown [Zea mays]
          Length = 677

 Score =  721 bits (1862), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/680 (54%), Positives = 464/680 (68%), Gaps = 23/680 (3%)

Query: 156 AQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD 215
           A++ENEYGN++ AYG  G+ Y++WAA  AV+L+T VPWVMCQQ DAPDP+INTCNGFYCD
Sbjct: 6   AKIENEYGNIDSAYGAPGKAYMRWAAGMAVSLDTGVPWVMCQQADAPDPLINTCNGFYCD 65

Query: 216 GFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGT 275
            FTPNS +KP MWTEN+SGWFLSFG AVP+RPVEDLAFAVARF++ GGTFQNYYMY GGT
Sbjct: 66  QFTPNSAAKPKMWTENWSGWFLSFGGAVPYRPVEDLAFAVARFYQRGGTFQNYYMYHGGT 125

Query: 276 NFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
           N  R++GGP +ATSYDYDAPIDEYG +RQPKWGHLR++HKAIKLCE  LI++DP++  LG
Sbjct: 126 NLDRSSGGPFIATSYDYDAPIDEYGLVRQPKWGHLRDVHKAIKLCEPALIATDPSYTSLG 185

Query: 336 AKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV 395
             +EA +Y K  + CAAFLAN D  SD  VTFNG +Y LPAWSVSILPDCKNVV NTA++
Sbjct: 186 PNVEAAVY-KVGSVCAAFLANIDGQSDKTVTFNGKMYRLPAWSVSILPDCKNVVLNTAQI 244

Query: 396 ISQRNNGDHPFAQQKNVNE------LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
            SQ    +  + +  NV          LA S +S+  E VGI+ + +  +  L EQINTT
Sbjct: 245 NSQTTGSEMRYLESSNVASDGSFVTPELAVSDWSYAIEPVGITKDNALTKAGLMEQINTT 304

Query: 450 KDTSDYLWYTASIHVMPGQGKEVFLN-------IESLGHAALVFVNKKLVAFGYGNHDFA 502
            D SD+LWY+ SI V   +G E +LN       + SLGH   V++N K+     G+   +
Sbjct: 305 ADASDFLWYSTSITV---KGDEPYLNGSQSNLAVNSLGHVLQVYINGKIAGSAQGSASSS 361

Query: 503 NFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW 562
                K IEL  G N +D+LS  VGL NYGA+FD+ GAG+   + +   NG  DLSS EW
Sbjct: 362 LISWQKPIELVPGKNKIDLLSATVGLSNYGAFFDLVGAGITGPVKLSGLNGALDLSSAEW 421

Query: 563 IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMG 622
            YQ+G+ GE + L   S A S  W   +  P+N  LIWYKT F  P G  P+A++   MG
Sbjct: 422 TYQIGLRGEDLHLYDPSEA-SPEWVSANAYPINHPLIWYKTKFTPPAGDDPVAIDFTGMG 480

Query: 623 KGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH 682
           KG+AWVNGQSIGRYW   LAP +GC   C+YRG+Y +SKC K CGQP+QTLYH+PR+++ 
Sbjct: 481 KGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSFLQ 540

Query: 683 PGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLA 742
           PG N LV+ E  GGDPSKIS + +    +C+ VSEA P  +DSW     +    P +RL 
Sbjct: 541 PGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQPMQRYGPALRLE 600

Query: 743 CER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAYLG 800
           C + G  I+++ FAS+G P G CGS+  G C     L IVQ+AC+G   CS+PVSS Y G
Sbjct: 601 CPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSSNYFG 660

Query: 801 VSAGACPGLLKALAVEAHCS 820
                C G+ K+LAVEA CS
Sbjct: 661 ---NPCTGVTKSLAVEAACS 677


>gi|224068510|ref|XP_002326135.1| predicted protein [Populus trichocarpa]
 gi|222833328|gb|EEE71805.1| predicted protein [Populus trichocarpa]
          Length = 824

 Score =  718 bits (1853), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/807 (47%), Positives = 490/807 (60%), Gaps = 40/807 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V YD  A++I+G+R+++ SGSIHYPRST E+W +LI+K+KEGGL+ IETY+FWN HE  R
Sbjct: 30  VEYDSSAVIINGQRKIILSGSIHYPRSTVEMWSDLIQKAKEGGLDTIETYIFWNAHERRR 89

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            +Y F G  D V+F + VQEAGL+  LRIGPYACAEWNYGGFPVWLH IP I+FRT N  
Sbjct: 90  REYNFTGNLDFVKFFQKVQEAGLYGILRIGPYACAEWNYGGFPVWLHNIPEIKFRTDNEI 149

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK EM+ F  KI+++ K+  LFASQGGPIILAQ+ENEYGNV   YG  G+ YV+W A  A
Sbjct: 150 FKNEMQTFTTKIVNMAKEAKLFASQGGPIILAQIENEYGNVMGPYGEAGKSYVQWCAQMA 209

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V  N  VPW+MCQQ DAP  +INTCNGFYCD FTPNSP  P MWTEN++GW+  +G   P
Sbjct: 210 VAQNIGVPWIMCQQSDAPSSVINTCNGFYCDTFTPNSPKSPKMWTENWTGWYKKWGQKDP 269

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  EDLAF+VARFF+  G  QNYYMY+GGTNFGRT+GGP +ATSYDYDAP+DEYG + Q
Sbjct: 270 HRTAEDLAFSVARFFQYNGVLQNYYMYYGGTNFGRTSGGPFIATSYDYDAPLDEYGNLNQ 329

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCA--AFLANYDSSS- 361
           PKWGHL+ LH A+KL E+ L +S     K          + S+ D     FL+N      
Sbjct: 330 PKWGHLKNLHAALKLGEKILTNSTVKTTKYSDGWVELTTYTSNIDGERLCFLSNTKMDGL 389

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D ++  +G  YF+PAWSVSIL DC    +NTAKV  Q +       ++ + N+  L  S 
Sbjct: 390 DVDLQQDGK-YFVPAWSVSILQDCNKETYNTAKVNVQTS----LIVKKLHENDTPLKLS- 443

Query: 422 FSWYEE--KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
           + W  E  K  + G   F    L EQ   T D SDYLWY  S+       K V L ++  
Sbjct: 444 WEWAPEPTKAPLHGQGGFKATQLLEQKAATYDESDYLWYMTSVDNNGTASKNVTLRVKYS 503

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
           G     FVN K +   +G      F   K   L  G N + +LS  VGLQNYG +FD   
Sbjct: 504 GQFLHAFVNGKEIGSQHG----YTFTFEKPALLKPGTNIISLLSATVGLQNYGEFFDEGP 559

Query: 540 AGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
            G+    V LID  N   DLSS EW Y+VG+ GE  G      +  + W  G+ L V ++
Sbjct: 560 EGIAGGPVELIDSGNTTTDLSSNEWSYKVGLNGEG-GRFYDPTSGRAKWVSGN-LRVGRA 617

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           + WYKTTF AP G  P+ ++L  MGKG AWVNG S+GR+W    A   GC  KCDYRG Y
Sbjct: 618 MTWYKTTFQAPSGTEPVVVDLQGMGKGHAWVNGNSLGRFWPILTADPNGCDGKCDYRGQY 677

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
              KC  +CG P Q  YH+PR++++ G N L++ EE+GG+PS +S      + IC    E
Sbjct: 678 KEGKCLSNCGNPTQRWYHVPRSFLNNGSNTLILFEEIGGNPSDVSFQITATETICGNTYE 737

Query: 718 ADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAA-INFASYGIPEG-NCGSFRPGACHMD 775
                                + L+C  G  I + I +AS+G P+G +CGSF+ G+    
Sbjct: 738 G------------------TTLELSCNGGRRIISDIQYASFGDPQGSSCGSFQRGSVEAS 779

Query: 776 -VLPIVQKACVGQIECSIPVSSAYLGV 801
                V+KAC+G+  CSI VS A  GV
Sbjct: 780 RSFSAVEKACMGKESCSINVSKATFGV 806


>gi|358348424|ref|XP_003638247.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
 gi|355504182|gb|AES85385.1| hypothetical protein MTR_122s1070, partial [Medicago truncatula]
          Length = 771

 Score =  715 bits (1846), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 379/740 (51%), Positives = 462/740 (62%), Gaps = 49/740 (6%)

Query: 21  LQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVK 80
           L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN HE   G YYF GRFDLV+F K
Sbjct: 1   LISASIHYPRSVP-MWPALIQTAKEGGIDVIETYVFWNGHELSPGNYYFGGRFDLVQFAK 59

Query: 81  TVQEAGLFLHLRIGPYACAEWNYGG---------------------------------FP 107
            VQ+AG++L LRIGP+  AEWN+GG                                  P
Sbjct: 60  VVQDAGMYLILRIGPFVAAEWNFGGEKNGVLICEDGEERGYRERADKNNQGNSRVLCGVP 119

Query: 108 VWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEW 167
           VWLH+IPG  FRT N PF   M++F   I++LMK+E LFASQGGPIIL+Q+ENEYG  E 
Sbjct: 120 VWLHYIPGTVFRTYNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYEN 179

Query: 168 AYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIM 227
            Y   G+ Y  WAA  AV+ NTSVPW+MCQQ DAPDP+I+TCN FYCD FTP SP +P M
Sbjct: 180 YYKEDGKKYALWAAKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKM 239

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
           WTEN+ GWF +FG   P RPVED+AF+VARFF+ GG+  NYYMY GGTNFGRTAGGP + 
Sbjct: 240 WTENWPGWFKTFGGRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFIT 299

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSS 347
           TSYDYDAPIDEYG  R PKWGHL+ELHKAIKLCE  L+     +  LG  +EA IY  SS
Sbjct: 300 TSYDYDAPIDEYGLPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSS 359

Query: 348 NDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGD 403
             CAAF++N D  +D  V F    Y LPAWSVSILPDCKNVVFNTAKV S  N      +
Sbjct: 360 GACAAFISNVDDKNDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPE 419

Query: 404 HPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIH 463
           H   QQ +  +  L    F   +E  GI G   FV+    + INTTKDT+DYLW+T SI 
Sbjct: 420 H--LQQSDKGQKTLKWDVF---KENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSIL 474

Query: 464 VMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINT 518
           +   +     G +  L IES GH    FVN+K    G GN   + F     I L  G N 
Sbjct: 475 IDANEEFLKKGSKPALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNE 534

Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
           + ILS+ VGLQ  G ++D  GAG+ SV +I L N   DLSS  W Y++GV GE++ + + 
Sbjct: 535 IAILSLTVGLQTAGPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQG 594

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
              NS  W   S  P  ++L WYK    AP G  P+ L++  MGKG AW+NG+ IGRYW 
Sbjct: 595 EGMNSVKWTSTSEPPKGQALTWYKAIVDAPSGDEPVGLDMLYMGKGLAWLNGEEIGRYWP 654

Query: 639 AYLA-PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
                    C ++CDYRG ++  KC   CG+P+Q  YH+PR+W  P  N+LVI EE GGD
Sbjct: 655 RISEFKKEDCVQECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVIFEEKGGD 714

Query: 698 PSKISLLTKTGQHICSFVSE 717
           P+KI+ +        S V E
Sbjct: 715 PTKITFVRHCHNPYSSIVVE 734


>gi|225459613|ref|XP_002284529.1| PREDICTED: beta-galactosidase 16-like [Vitis vinifera]
          Length = 813

 Score =  715 bits (1845), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/827 (46%), Positives = 501/827 (60%), Gaps = 46/827 (5%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +VTYD R+L+I+G+RR+L SGSIHYPRSTPE+WP LI K+KEGG++VIETY FWN HEP
Sbjct: 22  GSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEP 81

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQY F GR D+V+F K VQ  GL+  LRIGP+  +EWNYGG P WLH +PGI +R+ N
Sbjct: 82  KQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDN 141

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI++LMK ENL+ASQGGPIIL+Q+ENEY NVE A+   G  YV+WAA 
Sbjct: 142 EPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAK 201

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV+L T VPWVMC+Q+DAPDP+IN CNG  C + F  PN P+KP +WTEN++  +  +G
Sbjct: 202 MAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYG 261

Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
                R  EDLAF VA F  +  G+F NYYMY GGTNFGRT+   ++   YD  AP+DEY
Sbjct: 262 EDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEY 320

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G IRQPKWGHL+ELH  IKLC + L+     +  LG   EA+++ + S  CAAFL N D 
Sbjct: 321 GLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 380

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
             +  V F    Y L A S+SILPDCK + FNTAKV +Q N         ++V       
Sbjct: 381 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNT--------RSVQTRATFG 432

Query: 420 SAFSWYEEKVGIS--GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
           S   W E + GI   G        L E + TTKD SDYLWYT    +      +  L ++
Sbjct: 433 STKQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-IQNSSNAQPVLRVD 491

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           SL H    FVN K +A  +G+H   +F +  K+ LN G+N + +LS+MVGL + G + + 
Sbjct: 492 SLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEH 551

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             AG+  V + D  + K D S   W YQVG+ GE   +     +    W  G        
Sbjct: 552 KVAGIRRVEIQDGGDSK-DFSKHPWGYQVGLMGEKSQIYTSPGSQKVQW-HGLGSHGRGP 609

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L WYKT F AP G  P+ L   SMGKG+AWVNGQSIGRYW +YL PS             
Sbjct: 610 LTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS------------- 656

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
                    G+P+QT Y++PR +++P  NLLV+ EE  GDP KIS+ T +  ++C  V++
Sbjct: 657 ---------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSVTNVCGHVTD 707

Query: 718 ADPPPVDSWKP----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           + PPP+ SW      N       P+V+L C    +I+ I FAS+G P G C S+  G+CH
Sbjct: 708 SHPPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGGCESYAIGSCH 767

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             + L + +KAC+G+  CSIP S    G     CPG  KAL V A C
Sbjct: 768 SPNSLAVAEKACLGKNMCSIPHSLKSFG--DDPCPGTPKALLVAAQC 812


>gi|302141788|emb|CBI18991.3| unnamed protein product [Vitis vinifera]
          Length = 821

 Score =  714 bits (1843), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/827 (46%), Positives = 501/827 (60%), Gaps = 46/827 (5%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +VTYD R+L+I+G+RR+L SGSIHYPRSTPE+WP LI K+KEGG++VIETY FWN HEP
Sbjct: 30  GSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEP 89

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQY F GR D+V+F K VQ  GL+  LRIGP+  +EWNYGG P WLH +PGI +R+ N
Sbjct: 90  KQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDN 149

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI++LMK ENL+ASQGGPIIL+Q+ENEY NVE A+   G  YV+WAA 
Sbjct: 150 EPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAK 209

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV+L T VPWVMC+Q+DAPDP+IN CNG  C + F  PN P+KP +WTEN++  +  +G
Sbjct: 210 MAVDLQTGVPWVMCKQDDAPDPVINACNGMKCGETFAGPNKPNKPAIWTENWTSVYEVYG 269

Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
                R  EDLAF VA F  +  G+F NYYMY GGTNFGRT+   ++   YD  AP+DEY
Sbjct: 270 EDKRGRAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEY 328

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G IRQPKWGHL+ELH  IKLC + L+     +  LG   EA+++ + S  CAAFL N D 
Sbjct: 329 GLIRQPKWGHLKELHAVIKLCSDTLLHGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDK 388

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
             +  V F    Y L A S+SILPDCK + FNTAKV +Q N         ++V       
Sbjct: 389 RRNVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNT--------RSVQTRATFG 440

Query: 420 SAFSWYEEKVGIS--GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
           S   W E + GI   G        L E + TTKD SDYLWYT    +      +  L ++
Sbjct: 441 STKQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-IQNSSNAQPVLRVD 499

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           SL H    FVN K +A  +G+H   +F +  K+ LN G+N + +LS+MVGL + G + + 
Sbjct: 500 SLAHVLHAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEH 559

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             AG+  V + D  + K D S   W YQVG+ GE   +     +    W  G        
Sbjct: 560 KVAGIRRVEIQDGGDSK-DFSKHPWGYQVGLMGEKSQIYTSPGSQKVQW-HGLGSHGRGP 617

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L WYKT F AP G  P+ L   SMGKG+AWVNGQSIGRYW +YL PS             
Sbjct: 618 LTWYKTLFDAPPGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS------------- 664

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
                    G+P+QT Y++PR +++P  NLLV+ EE  GDP KIS+ T +  ++C  V++
Sbjct: 665 ---------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSVTNVCGHVTD 715

Query: 718 ADPPPVDSWKP----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           + PPP+ SW      N       P+V+L C    +I+ I FAS+G P G C S+  G+CH
Sbjct: 716 SHPPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGGCESYAIGSCH 775

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             + L + +KAC+G+  CSIP S    G     CPG  KAL V A C
Sbjct: 776 SPNSLAVAEKACLGKNMCSIPHSLKSFG--DDPCPGTPKALLVAAQC 820


>gi|449436074|ref|XP_004135819.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
          Length = 643

 Score =  711 bits (1835), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/650 (53%), Positives = 440/650 (67%), Gaps = 23/650 (3%)

Query: 67  YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
           Y FE R+DLVRFVK V +AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PFK
Sbjct: 6   YNFEDRYDLVRFVKLVHQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGIAFRTDNGPFK 65

Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
             M++F  KI+ LMK E L+ SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA  A+ 
Sbjct: 66  AAMQKFTEKIVGLMKGEKLYESQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQMALG 125

Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
           L+T VPWVMC+Q+DAPDP+I+TCNGFYC+ F PN   KP MWTE ++GWF  FG   P+R
Sbjct: 126 LDTGVPWVMCKQDDAPDPVIDTCNGFYCENFKPNKVYKPKMWTEAWTGWFTEFGGPAPYR 185

Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPK 306
           PVED+A++VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R+PK
Sbjct: 186 PVEDMAYSVARFIQNGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLREPK 245

Query: 307 WGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVT 366
           W HLR+LHKAIKLCE  L+S DPT   LG+  EAH++   S  CAAFLANYD+SS A VT
Sbjct: 246 WSHLRDLHKAIKLCEPALVSVDPTVSYLGSNQEAHVFKTRSGSCAAFLANYDASSSATVT 305

Query: 367 FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWY- 425
           F  N Y LP WSVSILPDCK+V+FNTAKV         P +Q K     +   S+FSW  
Sbjct: 306 FGNNQYDLPPWSVSILPDCKSVIFNTAKV-------GAPTSQPK-----MTPVSSFSWLS 353

Query: 426 --EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
             EE        +     L EQI+ T+D++DYLWY   I + P +     G+   L + S
Sbjct: 354 YNEETASAYTEDTTTMAGLVEQISVTRDSTDYLWYMTDIRIDPNEGFLKSGQWPLLTVFS 413

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GHA  VF+N +L    YG  +      +K + L  GIN L ILS+ VGL N G  ++  
Sbjct: 414 AGHALHVFINGQLSGTTYGGSENYKLTFSKYVNLRAGINKLSILSVAVGLPNGGLHYETW 473

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             G+   V L  L    RD+S  +W Y++G++GE + L  +S ++S  W  GS +   + 
Sbjct: 474 NTGVLGPVTLKGLNEDTRDMSGYKWSYKIGLKGEALNLHSVSGSSSVEWVTGSLVAQKQP 533

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L WYKTTF +P+G  PLAL+++SMGKGQ W+NGQSIGR+W AY A   G   KC+Y G +
Sbjct: 534 LTWYKTTFDSPKGNEPLALDMSSMGKGQIWINGQSIGRHWPAYTA--KGSCGKCNYGGIF 591

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +  KC  +CG+P+Q  YH+PR W+    N+LVI EE GG+P  ISL+ ++
Sbjct: 592 NEKKCHSNCGEPSQRWYHVPRAWLKSSGNVLVIFEEWGGNPEGISLVKRS 641


>gi|224080622|ref|XP_002306183.1| predicted protein [Populus trichocarpa]
 gi|222849147|gb|EEE86694.1| predicted protein [Populus trichocarpa]
          Length = 838

 Score =  711 bits (1834), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/833 (44%), Positives = 502/833 (60%), Gaps = 53/833 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+I+GKR +L SGSIHYPRSTPE+WPELI+K+K GGL VI+TYVFWN HEP +
Sbjct: 31  VTYDGRSLIINGKRELLFSGSIHYPRSTPEMWPELIQKAKRGGLNVIQTYVFWNIHEPEQ 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ FEG +DLV+F+KT+ E G+   +R+GP+  AEWN+GG P WL  IP I FR+ N P
Sbjct: 91  GKFNFEGSYDLVKFIKTIGENGMSATIRLGPFIQAEWNHGGLPYWLREIPDIIFRSDNAP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+RF+  II+ +K+E LFASQGGPIILAQ+ENEY  V+ AY   G  YV+WA + A
Sbjct: 151 FKLHMERFVTMIINKLKEEKLFASQGGPIILAQIENEYNTVQLAYRNLGVSYVQWAGNMA 210

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           + L T VPWVMC+Q+DAP P+INTCNG +C D FT PNSP KP +WTEN++  F  FG  
Sbjct: 211 LGLKTGVPWVMCKQKDAPGPVINTCNGRHCGDTFTGPNSPDKPSLWTENWTAQFRVFGDP 270

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED AF+VAR+F   G+  NYYMY GGTNF RTA    V T Y  +AP+DEYG  
Sbjct: 271 PSQRSAEDTAFSVARWFSKNGSLVNYYMYHGGTNFDRTAAS-FVTTRYYDEAPLDEYGLQ 329

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           R+PKWGHL++LH+A+ LC++ L+   P  Q+L A +EA  + +  +NDCAAFLAN ++  
Sbjct: 330 REPKWGHLKDLHRALNLCKKALLWGTPNVQRLSADVEARFFEQPRTNDCAAFLANNNTKD 389

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              VTF G  Y+LPA S+SILPDCK VV+NT  V+SQ N+ +  F + +  +  L     
Sbjct: 390 PETVTFRGKKYYLPAKSISILPDCKTVVYNTMTVVSQHNSRN--FVKSRKTDGKL----- 442

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEV--FLNI 476
             W      I  N         E  N TKD +DY W+T +I+V        K++   L +
Sbjct: 443 -EWKMFSETIPSNLLVDSRIPRELYNLTKDKTDYAWFTTTINVDRNDLSARKDINPVLRV 501

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            SLGHA + F+N + +   +G+    +F++   ++L  GIN + +L  +VGL + GA+ +
Sbjct: 502 ASLGHAMVAFINGEFIGSAHGSQIEKSFVLQHSVKLKPGINFVTLLGSLVGLPDSGAYME 561

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
              AG   V ++ L  G  DLSS  W +QV + GE   +          W +     VNK
Sbjct: 562 HRYAGPRGVSILGLNTGTLDLSSNGWGHQVALSGETAKVFTKEGGRKVTWTK-----VNK 616

Query: 597 S---LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
               + WYKT F APEGK P+A+ +  M KG  W+NG+SIGRYW  Y++P          
Sbjct: 617 DGPPVTWYKTRFDAPEGKSPVAVRMTGMKKGMIWINGKSIGRYWMNYISP---------- 666

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
                        G+P Q+ YHIPR+++ P  NL+VI EE G  P KI +LT     ICS
Sbjct: 667 ------------LGEPTQSEYHIPRSYLKPTNNLMVILEEEGASPEKIEILTVNRDTICS 714

Query: 714 FVSEADPPPVDSWKPNLGVVS-----SSPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
           +V+E  PP V SW+      +     + P  RL C     I A+ FAS+G P G CG+F 
Sbjct: 715 YVTEYHPPNVRSWERKNKKFTPVADDAKPAARLKCPNKKKIVAVQFASFGDPSGTCGNFA 774

Query: 769 PGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G C   +   +V++ C+G+  C IP+           CP L K LAV+  CS
Sbjct: 775 VGTCDSPISKQVVEQHCLGKTSCDIPMDKGLFNGKKDNCPNLTKNLAVQVKCS 827


>gi|255575455|ref|XP_002528629.1| beta-galactosidase, putative [Ricinus communis]
 gi|223531918|gb|EEF33732.1| beta-galactosidase, putative [Ricinus communis]
          Length = 822

 Score =  709 bits (1830), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/846 (46%), Positives = 516/846 (60%), Gaps = 61/846 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD+RA+ IDG R+++ SGSIHYPRSTPE+WP+LIRK+KEGGL  IETYVFWN HEP +
Sbjct: 7   VTYDNRAIKIDGARKLILSGSIHYPRSTPEMWPQLIRKAKEGGLNTIETYVFWNAHEPHQ 66

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DL+RF+KT+++ GL+  LRIGPY CAEWNYGGFPVWLH +PGIQ RT N  
Sbjct: 67  RQYDFSGNLDLIRFIKTIRDEGLYAILRIGPYVCAEWNYGGFPVWLHNLPGIQIRTNNEV 126

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K EM+ F   I+++MK   LFASQGGPIIL+Q+ENEYGNV+ +YG  G+ YVKW A+ A
Sbjct: 127 YKNEMEIFTTLIVNMMKDGKLFASQGGPIILSQIENEYGNVQSSYGDEGKEYVKWCANLA 186

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +    VPW+MCQQ DAP P+I++CNGFYCD +  N+ S P +WTEN++GWF  +G   P
Sbjct: 187 ESFKVGVPWIMCQQSDAPSPMIDSCNGFYCDQYYSNNKSLPKIWTENWTGWFQDWGQKNP 246

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  ED+AFAVARFF+ GG+  NYYMY GGTNFG T GGP +  SYDYDAP+DEYG +RQ
Sbjct: 247 HRSAEDVAFAVARFFQLGGSVMNYYMYHGGTNFGTTGGGPYITASYDYDAPLDEYGNLRQ 306

Query: 305 PKWGHLRELHKAIKLCEEYLI------SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD 358
           PKWGHLR+LH  +   E+ L       S+ P +  +   + A+   +S      F ++ D
Sbjct: 307 PKWGHLRDLHSVLNSMEQTLTYGESKNSNYPDNNNIFITIFAYQGKRS-----CFFSSID 361

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
              D  ++F G  YFLPAWSVSILPDC   V+NTA V  Q +  ++      +  E    
Sbjct: 362 -YKDQTISFEGTDYFLPAWSVSILPDCFTEVYNTATVNVQTSIMENKANAADSFRE--PN 418

Query: 419 SSAFSWYEEKV-GIS------GNRSFVRPDLAEQINTTKDTSDYLW-YTASIHVMP---- 466
           S  + W  EK+ G+S      GN + V  +L +Q   T  TSDYLW  T   H M     
Sbjct: 419 SLQWKWRPEKIRGLSLQGDFVGN-TLVANELMDQKAVTNGTSDYLWIMTNYDHNMNDSLW 477

Query: 467 GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFA--NFLINKKIELNEGINTLDILSM 524
           G GK++ L + + GH    FVN K V     + +    +F+   KI+L  GIN + ++S+
Sbjct: 478 GAGKDIILQVHTNGHVVHAFVNGKHVGSQSASIESGRFDFVFESKIKLKRGINRISLVSV 537

Query: 525 MVGLQNYGAWFDVAGAGLFSVILI-------DLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
            VGLQNYGA FD A  G+   I I       +  +   D+SS  W+Y+ G+ GE  G   
Sbjct: 538 SVGLQNYGANFDTAPTGINGPITIIGRSKLGNQPDVTVDISSNRWVYKTGLHGEDQGFQA 597

Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
           +   +   +     L +N+  +WYKT+F AP G+ P+ ++L  +GKG AWVNG++IGR+W
Sbjct: 598 VRPRHRRQFYTKHVL-INQPFVWYKTSFNAPLGQDPVVVDLLGLGKGTAWVNGRNIGRFW 656

Query: 638 SAYLAPSTG-CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
              LAP  G C   C Y G+Y+  +C   CG+P Q  YHIPR W+ P +N LV+ EELGG
Sbjct: 657 PKALAPDDGTCNAPCSYIGTYEPKQCVTGCGEPTQRYYHIPRDWLKPEDNKLVLFEELGG 716

Query: 697 DPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFAS 756
            P  +S+ T T   +C    E                     V L+C+ G   + I FAS
Sbjct: 717 TPDFVSVQTVTVGKVCVHGYEGH------------------TVELSCQHGRKFSKITFAS 758

Query: 757 YGIPEGNCGSFRPG---ACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKAL 813
           +G+P+G CGSF P     CH DV  IV+KACVG+  CSI +S   L  +   C   +  L
Sbjct: 759 FGLPQGKCGSFTPSNNHDCHADVSTIVEKACVGKERCSIDISEKAL--APIHCDARIYRL 816

Query: 814 AVEAHC 819
           AVEA C
Sbjct: 817 AVEAVC 822


>gi|183238712|gb|ACC60982.1| beta-galactosidase 2 precursor [Petunia x hybrida]
          Length = 830

 Score =  709 bits (1829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/832 (44%), Positives = 505/832 (60%), Gaps = 50/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+++++G+R +L SGSIHYPR  PE+WPE+IRK+KEGGL VI+TYVFWN HEP++
Sbjct: 28  VTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPEIIRKAKEGGLNVIQTYVFWNIHEPVQ 87

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQ+ FEG +DLV+F+K + E GL++ LRIGPY  AEWN GGFP WL  +P I FR+ N P
Sbjct: 88  GQFNFEGNYDLVKFIKAIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSYNEP 147

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F   MK++   +IDL+K+E LFA QGGPII+AQ+ENEY NV+ AY   G+ Y++WAA+ A
Sbjct: 148 FIHHMKKYSEMVIDLVKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYIEWAANMA 207

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
            +L   VPW+MC+Q+DAP  +INTCNG +C D FT PN P+KP +WTEN++  + +FG  
Sbjct: 208 TSLYNGVPWIMCKQKDAPPQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTFGDP 267

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF+VARFF   GT  NYYMY+GGTN+GRT+    V T Y  +AP+DE+G  
Sbjct: 268 PSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRTSSS-FVTTRYYDEAPLDEFGLY 326

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           R+PKW HLR+LH+A++L    L+   PT QK+   LE  ++ K  S DCAAFL N  ++ 
Sbjct: 327 REPKWSHLRDLHRALRLSRRALLWGTPTVQKINQDLEITVFEKPGSTDCAAFLTNNHTTQ 386

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
            + + F G  Y+LP  SVSILPDCK VV+NT  ++SQ N+ +   +++         S  
Sbjct: 387 PSTIKFRGKDYYLPEKSVSILPDCKTVVYNTQTIVSQHNSRNFITSEK---------SKN 437

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI----HVMPGQGKEV-FL 474
             W  Y+EKV    +      +  E  + TKDTSDY WY+ SI    H +P +   +  L
Sbjct: 438 LKWEMYQEKVPTIADLPLKNREPLELYSLTKDTSDYAWYSTSITLERHDLPMRPDILPVL 497

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S+GHA   FVN + V FG+GN+   +F+  K I L  G NT+ IL+  VG  N GA+
Sbjct: 498 QIASMGHALAAFVNGEYVGFGHGNNIEKSFVFQKPIILKPGTNTITILAETVGFPNSGAY 557

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
            +   AG   V +  L  G  D++   W ++VGV GE   L     A    W    T P 
Sbjct: 558 MEKRFAGPRGVTIQGLMAGTLDITQNNWGHEVGVFGEKQELFTEEGAKKVQWTP-VTGPP 616

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
             ++ WYKT F APEG  P+AL +  M KG  WVNG+S+GRYW+++L+P           
Sbjct: 617 KGAVTWYKTYFDAPEGNNPVALKMDKMEKGMMWVNGKSLGRYWTSFLSP----------- 665

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                       GQP Q  YHIPR ++ P  NLLVI EE GG P+ I + T     ICS 
Sbjct: 666 -----------LGQPTQAEYHIPRAYLKPTNNLLVIFEETGGHPTNIEVQTVNRDTICSI 714

Query: 715 VSEADPPPVDSWKPN----LGVVSS-SPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           ++E  PP V SW+ +    + VV        L C     I  + FASYG P+G CG+   
Sbjct: 715 ITEYHPPHVKSWERSGTDFVAVVEDLKSGAHLTCPDNKIIEKVEFASYGNPDGACGNLFN 774

Query: 770 GACH-MDVLPIVQKACVGQIECSIPVSSA-YLGVSAGACPGLLKALAVEAHC 819
           G C+  + L +V++ C+G+  C+IP+    Y   S   CP + K LAV+  C
Sbjct: 775 GNCNSANSLKVVEQHCLGKNTCTIPIEREIYDEPSKDPCPNIFKTLAVQVKC 826


>gi|413957070|gb|AFW89719.1| hypothetical protein ZEAMMB73_400203 [Zea mays]
          Length = 809

 Score =  706 bits (1822), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/769 (48%), Positives = 477/769 (62%), Gaps = 82/769 (10%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTP--------------------------EVWPE 38
           VTYD +A++IDG+RR+L SGSIHYPRSTP                          E+W  
Sbjct: 27  VTYDKKAVLIDGQRRILFSGSIHYPRSTPDVTAFYKISSPPTIPWRGLWLRIYGSEMWEG 86

Query: 39  LIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYAC 98
           LI+K+K+GGL+VI+TYVFWN HEP  G                    G+F   R   Y  
Sbjct: 87  LIQKAKDGGLDVIQTYVFWNGHEPTPGN----------------DSDGIFF--RFEQYYF 128

Query: 99  AEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ- 157
            E    GFPVWL ++PGI FRT N PFK  M+ F  KI+ +MK ENLFASQGGPIIL+Q 
Sbjct: 129 EE---SGFPVWLKYVPGISFRTDNEPFKTAMQGFTEKIVGMMKSENLFASQGGPIILSQA 185

Query: 158 --------VENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTC 209
                   +ENEYG     +G  G+ Y+ WAA  AV L T VPWVMC++EDAPDP+IN C
Sbjct: 186 SIIFSLDLIENEYGPEGREFGAAGQAYINWAAKMAVGLGTGVPWVMCKEEDAPDPVINAC 245

Query: 210 NGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYY 269
           NGFYCD F+PN P KP MWTE +SGWF  FG  +  RPVEDLAFAVARF + GG+F NYY
Sbjct: 246 NGFYCDAFSPNKPYKPTMWTEAWSGWFTEFGGTIRQRPVEDLAFAVARFVQKGGSFINYY 305

Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
           MY GGTNFGRTAGGP + TSYDYDAPIDEYG +R+PK  HL+ELH+A+KLCE+ L+S DP
Sbjct: 306 MYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLVREPKHSHLKELHRAVKLCEQALVSVDP 365

Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
               LG   EA ++ +S + CAAFLANY+S+S A V FN   Y LP WS+SILPDCKNVV
Sbjct: 366 AITTLGTMQEARVF-QSPSGCAAFLANYNSNSYAKVVFNNEQYSLPPWSISILPDCKNVV 424

Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKV-GISGNRSFVRPDLAEQI 446
           FN+A V            Q   +      +S+ +W  Y+E+V  ++         L EQ+
Sbjct: 425 FNSATV----------GVQTSQMQMWGDGASSMTWERYDEEVDSLAAAPLLTTTGLLEQL 474

Query: 447 NTTKDTSDYLWYTASIHV------MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHD 500
           N T+D+SDYLWY  S+ +      + G GK + L+++S GHA  VFVN +L    YG  +
Sbjct: 475 NVTRDSSDYLWYITSVDISSSENFLQGGGKPLSLSVQSAGHALHVFVNGQLQGSAYGTRE 534

Query: 501 FANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSS 559
                 N    L  G N + +LS+  GL N G  ++    G+   V+L  L  G RDL+ 
Sbjct: 535 DRRIKYNGNASLRAGTNKIALLSVACGLPNVGVHYETWNTGVGGPVVLHGLDEGSRDLTW 594

Query: 560 GEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNL 618
             W YQVG++GE + L+ I  ++S  W QGS +  N+  L WY+  F  P G  PLAL++
Sbjct: 595 QTWSYQVGLKGEQMNLNSIEGSSSVEWMQGSLIAQNQQPLAWYRAYFETPSGDEPLALDM 654

Query: 619 ASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPR 678
            SMGKGQ W+NGQSIGRYW+AY   + G  K+C Y G++ A KCQ  CGQP Q  YH+P+
Sbjct: 655 GSMGKGQIWINGQSIGRYWTAY---ADGDCKECSYTGTFRAPKCQSGCGQPTQRWYHVPK 711

Query: 679 TWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK 727
           +W+ P  NLLV+ EELGGD SKI+L+ ++   +C+ VSE D P + +W+
Sbjct: 712 SWLQPTRNLLVVFEELGGDSSKIALVKRSVSSVCADVSE-DHPNIKNWQ 759


>gi|302141787|emb|CBI18990.3| unnamed protein product [Vitis vinifera]
          Length = 817

 Score =  704 bits (1816), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/825 (46%), Positives = 506/825 (61%), Gaps = 44/825 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+I+G+R++L SGSIHYPRSTPE+WP LI ++K+GG++VIETYVFWN HEP  
Sbjct: 28  VTYDGRSLIINGQRKILFSGSIHYPRSTPEMWPSLISQAKQGGIDVIETYVFWNQHEPKP 87

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F GR D+VRF++ VQ  GL+  LRIGP+  AEWNYGGFP WLH +PGI +RT N P
Sbjct: 88  GQYDFSGRRDIVRFIREVQAQGLYACLRIGPFIQAEWNYGGFPFWLHDVPGIVYRTDNEP 147

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+++MK ENL+ASQGGPIIL Q+ENEY  VE  +G  G+ YV WAA+ A
Sbjct: 148 FKFYMRNFTTKIVEIMKSENLYASQGGPIILQQIENEYKTVEANFGEAGKRYVLWAANMA 207

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L T VPWVMC+Q+DAPDP+IN+CNG  C + F  PNSP+KP +WTEN++  +  FG  
Sbjct: 208 VGLETGVPWVMCKQDDAPDPVINSCNGRLCGETFAGPNSPNKPAIWTENWTSSYPLFGED 267

Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
              RPVED+AF VA F  +  G+F NYYMY GGTNFGRTA    V T+Y  +AP+DEYG 
Sbjct: 268 ARPRPVEDIAFHVALFVAKMNGSFINYYMYHGGTNFGRTASA-YVQTAYYDEAPLDEYGL 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKL-EAHIYHKSSNDCAAFLANYDSS 360
           I+QP WGHL+ELH A+KLC E L+    ++  LG KL EA+++   S  CAAFL N DS 
Sbjct: 327 IQQPTWGHLKELHAAVKLCSETLLQGAQSNLSLGTKLQEAYVFRGQSGKCAAFLVNNDSR 386

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +D  V F    Y LP  S+SILPDCKN  FNTAK          P            ++ 
Sbjct: 387 TDVTVVFQNTSYELPRKSISILPDCKNEAFNTAKA------SFRPGLISIQTVTKFNSTE 440

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  Y+E +    + S     L E +NTTKD SDYLWYT   +  P  G+ V L+  S  
Sbjct: 441 QWEEYKESILNFDDTSSRANTLLEHMNTTKDASDYLWYTFRYNNDPSNGQSV-LSTNSRA 499

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   F+N +     +G+    +F ++  +    GIN + +LS+MVGL + GA+ +   A
Sbjct: 500 HALHAFINGRHTGSQHGSSSNLSFSLDNTVSFRAGINNVSLLSVMVGLPDSGAYLERRVA 559

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPVNKSLI 599
           GL  V  I      +D ++  W YQVG+ GE + +     +    W K GS+   +  L 
Sbjct: 560 GLRRV-RIQSNGSLKDFTNNPWGYQVGLLGEKLQIYTDVGSQKVQWSKFGSS--TSGLLT 616

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKT F AP G  P+ALNL SM KG+ WVNGQSIGRYW ++L PS               
Sbjct: 617 WYKTVFDAPAGNEPVALNLVSMRKGEVWVNGQSIGRYWVSFLTPS--------------- 661

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
                  G+P+Q  YHIPR+++ P  NLLV+ EE  G P  IS+   +   IC  VSE+ 
Sbjct: 662 -------GKPSQIWYHIPRSFLKPTGNLLVLLEEETGHPVGISIGKVSIPKICGHVSESH 714

Query: 720 PPPVDS---WKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MD 775
            PPV S   +K +       P+V+L C    +I+ I FAS+G P G+C S+  G+CH  +
Sbjct: 715 LPPVISRVIYKKHENHHGRRPKVQLRCPSNRNISRILFASFGTPSGDCQSYAVGSCHSSN 774

Query: 776 VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
               V+KAC+G+  CS+P+S    G     CPG  KAL V+  C+
Sbjct: 775 SRSNVEKACLGKGMCSVPLSYKRFG--GDPCPGTPKALLVDVQCT 817


>gi|297851602|ref|XP_002893682.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
 gi|297339524|gb|EFH69941.1| Beta-galactosidase 15 precursor [Arabidopsis lyrata subsp. lyrata]
          Length = 780

 Score =  703 bits (1815), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/828 (44%), Positives = 506/828 (61%), Gaps = 83/828 (10%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V++D RA+ IDG RRVL SGSIHYPRST E+WP+LI+K KEGGL+ IETYVFWN HEP R
Sbjct: 23  VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGGLDAIETYVFWNAHEPTR 82

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DL+RF+KT+Q+ G++  LRIGPY CAEWNYGGFPVWLH +PG++FRTTN  
Sbjct: 83  RQYDFSGNLDLIRFLKTIQDEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 142

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F  EM+ F   I++++K+E LFASQGGPIILAQ+ENEYGNV  +YG  G+ Y+KW A+ A
Sbjct: 143 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIKWCANMA 202

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +L+  VPW+MCQQ+DAP P++NTCNG+YCD FTPN+P+ P MWTEN++GW+ ++G   P
Sbjct: 203 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFTPNNPNTPKMWTENWTGWYKNWGGKDP 262

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  ED+AFAVARFF+ GGTFQNYYMY GGTNF RTAGGP + T+YDYDAP+DE+G + Q
Sbjct: 263 HRTTEDVAFAVARFFQRGGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQ 322

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL++LH  +   E+ L   + +    G  + A +Y K+    + F+ N + +SDA 
Sbjct: 323 PKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVY-KTEEGSSCFIGNVNETSDAK 381

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           + F G  Y +PAWSVSILPDCK   +NTAK+ +Q +       ++ N  E   ++  +SW
Sbjct: 382 INFQGTFYDVPAWSVSILPDCKTETYNTAKINTQTS----VMVKKANEAENEPSTLKWSW 437

Query: 425 YEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
             E +    + G        L +Q   + D SDYLWY  ++++    P  GK + L I S
Sbjct: 438 RPENIDNVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNIKEQDPVWGKNMSLRINS 497

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
             H    FVN + +      +   +++  +  + N G N + +LS+ VGL NYGA+F+  
Sbjct: 498 TAHVLHAFVNGQHIGNYRAENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAFFENV 557

Query: 539 GAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
            AG+   + I  +NG     +DLS+ +W Y+ G+ G           N  F  +      
Sbjct: 558 PAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSG---------FENQLFSSESP---- 604

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
                   +T+ AP G  P+ ++L  +GKG AW+NG +IGRYW A+LA   GC+ +    
Sbjct: 605 --------STWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLADIDGCSAE---- 652

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQHICS 713
                              YH+PR++++  G+N LV+ EE+GG+PS ++  T    ++C+
Sbjct: 653 -------------------YHVPRSFLNSDGDNTLVLFEEIGGNPSLVNFQTIGVGNVCA 693

Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
            V E +                   + L+C  G  I++I FAS+G P GNCGSF  G C 
Sbjct: 694 NVYEKN------------------VLELSC-NGKPISSIKFASFGNPGGNCGSFEKGTCE 734

Query: 774 M--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
              D   I+ + CVG+ +CSI VS    G  A  C GL K LAVEA C
Sbjct: 735 ASNDAAAILTQECVGKEKCSIDVSEKKFG--AADCGGLAKRLAVEAIC 780


>gi|30699255|ref|NP_177866.2| beta-galactosidase 16 [Arabidopsis thaliana]
 gi|152013367|sp|Q8GX69.2|BGL16_ARATH RecName: Full=Beta-galactosidase 16; Short=Lactase 16; Flags:
           Precursor
 gi|332197854|gb|AEE35975.1| beta-galactosidase 16 [Arabidopsis thaliana]
          Length = 815

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/831 (46%), Positives = 504/831 (60%), Gaps = 51/831 (6%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 23  ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 82

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQ+ F G  D+V+F+K V+  GL++ LRIGP+   EW+YGG P WLH + GI FRT N
Sbjct: 83  QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 142

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  MKR+   I+ LMK ENL+ASQGGPIIL+Q+ENEYG V  A+   G+ YVKW A 
Sbjct: 143 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 202

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L+T VPWVMC+Q+DAPDP++N CNG  C + F  PNSP+KP +WTEN++ ++ ++G
Sbjct: 203 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+AF VA F    G+F NYYMY GGTNFGR A    V TSY   AP+DEYG
Sbjct: 263 EEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL+ELH A+KLCEE L+S   T   LG    A ++ K +N CAA L N D  
Sbjct: 322 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            ++ V F  + Y L   SVS+LPDCKNV FNTAKV +Q N       + +   + L +  
Sbjct: 381 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 434

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  + E V      S     L E +NTT+DTSDYLW T        +G    L +  LG
Sbjct: 435 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 492

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   FVN + +   +G      FL+ K + LN G N L +LS+MVGL N GA  +    
Sbjct: 493 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 552

Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
           G  SV    + NG+  L  ++  W YQVG++GE   +     +    WKQ      ++ L
Sbjct: 553 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 608

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYK +F  PEG+ P+ALNL SMGKG+AWVNGQSIGRYW ++                  
Sbjct: 609 TWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSF------------------ 650

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSE 717
                 + G P+Q  YHIPR+++ P  NLLVI  EE  G+P  I++ T +   +C  VS 
Sbjct: 651 ----HTYKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSN 706

Query: 718 ADPPPVDS------WKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            +P PV S       + NL       P+V+L C  G  I+ I FAS+G P G+CGS+  G
Sbjct: 707 TNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIG 766

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +CH  + L +VQKAC+ +  CS+PV S   G    +CP  +K+L V A CS
Sbjct: 767 SCHSPNSLAVVQKACLKKSRCSVPVWSKTFG--GDSCPHTVKSLLVRAQCS 815


>gi|225428017|ref|XP_002278545.1| PREDICTED: beta-galactosidase 13 [Vitis vinifera]
 gi|297744615|emb|CBI37877.3| unnamed protein product [Vitis vinifera]
          Length = 833

 Score =  702 bits (1811), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/832 (43%), Positives = 503/832 (60%), Gaps = 45/832 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  VTYD R+L+++G+R +L SGSIHYPRSTPE+WP++++K+K GGL +I+TYVFWN HE
Sbjct: 29  AKTVTYDGRSLIVNGRRELLFSGSIHYPRSTPEMWPDILQKAKHGGLNLIQTYVFWNIHE 88

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+ GQ+ FEG +DLV+F+K + + GL+  LRIGP+  AEWN+GGFP WL  +P I FR+ 
Sbjct: 89  PVEGQFNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSY 148

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+++   II++MK+  LFA QGGPIILAQ+ENEY +++ AY   G  YV+WA 
Sbjct: 149 NEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYRELGVQYVQWAG 208

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
             AV L   VPW+MC+Q+DAPDP+INTCNG +C D FT PN P+KP +WTEN++  +  F
Sbjct: 209 KMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVF 268

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  EDLAF+VARF    GT  NYYMY GGTNFGRT G   V T Y  +AP+DEY
Sbjct: 269 GDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEY 327

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYD 358
           G  R+PKWGHL++LH A++LC++ L +  P  +KLG   E   Y K  ++ CAAFL N  
Sbjct: 328 GLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNH 387

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           S   A +TF G  YFLP  S+SILPDCK VV+NT +V++Q N  +  F + K  N+ L  
Sbjct: 388 SREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARN--FVKSKIANKNL-- 443

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV-F 473
              +   +E + +  +   +     E  N  KD SDY W+  SI +    +P +   +  
Sbjct: 444 --KWEMSQEPIPVMTDMKILTKSPMELYNFLKDRSDYAWFVTSIELSNYDLPMKKDIIPV 501

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I +LGHA L FVN   +   +G++   NF+  K ++   G N + +L M VGL N GA
Sbjct: 502 LQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKFKAGTNYIALLCMTVGLPNSGA 561

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           + +   AG+ SV ++ L  G  D+++  W  QVGV GE++       ++   W       
Sbjct: 562 YMEHRYAGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQGGSHRVQWTAAKG-- 619

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
              ++ WYKT F  PEG  P+ L + SM KG AWVNG++IGRYW +YL+P          
Sbjct: 620 KGPAMTWYKTYFDMPEGNDPVILRMTSMAKGMAWVNGKNIGRYWLSYLSP---------- 669

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
                         +P+Q+ YH+PR W+ P +NLLVI EE GG+P +I +       ICS
Sbjct: 670 ------------LEKPSQSEYHVPRAWLKPSDNLLVIFEETGGNPEEIEVELVNRDTICS 717

Query: 714 FVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
            V+E  PP V SW+ +   + +      P+  L C     I  ++FAS+G P G CG F 
Sbjct: 718 IVTEYHPPHVKSWQRHDSKIRAVVDEVKPKGHLKCPNYKVIVKVDFASFGNPLGACGDFE 777

Query: 769 PGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G C   +   +V++ C+G+  C IP+ +     ++GAC  + K LAV+  C
Sbjct: 778 MGNCTAPNSKKVVEQHCMGKTTCEIPMEAGIFDGNSGACSDITKTLAVQVRC 829


>gi|449435864|ref|XP_004135714.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Cucumis
           sativus]
          Length = 712

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/713 (49%), Positives = 466/713 (65%), Gaps = 32/713 (4%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD +A++I+ +RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP  
Sbjct: 22  VTYDEKAIIINDQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPSE 81

Query: 65  GQYYFEG-RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           G+  +E   ++ + ++     A     L   P       + GFP+WL F+PGI FRT N 
Sbjct: 82  GKVTWEDFLYEQILYINCFHVA-----LFXFPPYFXFQKFSGFPIWLKFVPGIAFRTDNE 136

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M++F+ KI+D+MK E L+ +QGGPIIL+Q+ENEYG VEW  G  G+ Y KW A  
Sbjct: 137 PFKAAMQKFVTKIVDMMKLEKLYHTQGGPIILSQIENEYGPVEWQIGAPGKSYTKWFAQM 196

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN   KP +WTEN+SGW+ +FG   
Sbjct: 197 AVDLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGPT 256

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+RP ED+AF+VARF +  G+  NYY+Y GGTNFGRT+ G  +ATSYD+DAPIDEYG IR
Sbjct: 257 PYRPPEDVAFSVARFIQNNGSLVNYYVYHGGTNFGRTS-GLFIATSYDFDAPIDEYGLIR 315

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           +PKWGHLR+LHKAIKLCE  L+S+DPT   LG   EA ++ KSS+ CAAFLANYD+S+  
Sbjct: 316 EPKWGHLRDLHKAIKLCEPALVSADPTSTWLGKNQEARVF-KSSSACAAFLANYDTSASV 374

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V F  N Y LP WS+SILPDCK V FNTA++              K+    ++  S+F 
Sbjct: 375 KVNFWNNPYDLPPWSISILPDCKTVTFNTAQI------------GVKSYEAKMMPISSFG 422

Query: 424 WY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLN 475
           W    EE        +  +  L EQ++ T DT+DYLWY   I +   +     GK   L+
Sbjct: 423 WLSYKEEPASAYAKDTTTKDGLVEQVSVTWDTTDYLWYMQDISIDSTEGFLKSGKWPLLS 482

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GH   VF+N +L    YG+ +      +K + L +G+N L +LS+ VGL N G  F
Sbjct: 483 VNSAGHLLHVFINGQLSGSVYGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHF 542

Query: 536 DVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
           D   AG+   V L  L  G RD+S  +W Y+VG+ GE + L     +NS  W +GS L  
Sbjct: 543 DTWNAGVLGPVTLKGLNEGTRDMSKYKWSYKVGLSGESLNLYSDKGSNSVQWTKGS-LTQ 601

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
            + L WYKTTF  P G  PL L+++SM KGQ WVNG+SIGRY+  Y+A   G   KC Y 
Sbjct: 602 KQPLTWYKTTFKTPAGNEPLGLDMSSMSKGQIWVNGRSIGRYFPGYIA--NGKCDKCSYA 659

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           G +   KC  +CG+P+Q  YHIPR W+ P +NLLVI EE+GG P  ISL+ +T
Sbjct: 660 GLFTEKKCLGNCGEPSQKWYHIPRDWLSPSDNLLVIFEEIGGSPDGISLVKRT 712


>gi|449517114|ref|XP_004165591.1| PREDICTED: beta-galactosidase 9-like, partial [Cucumis sativus]
          Length = 763

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/760 (47%), Positives = 486/760 (63%), Gaps = 48/760 (6%)

Query: 101 WNY-GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
           W+Y  GFP+WL  +PGI+FRT N PFKEEM+RF+ KI+DL++ E LF  QGGP+I+ QVE
Sbjct: 1   WDYCRGFPLWLRDVPGIEFRTDNAPFKEEMQRFVKKIVDLLRDEKLFCWQGGPVIMLQVE 60

Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
           NEYGN+E +YG  G+ Y+KW  + A+ L   VPWVMCQQ+DAP  IIN+CNG+YCDGF  
Sbjct: 61  NEYGNIESSYGKRGQEYIKWVGNMALGLGAEVPWVMCQQKDAPSTIINSCNGYYCDGFKA 120

Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
           NSPSKPI WTEN++GWF S+G   P RPVEDLAF+VARFF+  G+FQNYYMYFGGTNFGR
Sbjct: 121 NSPSKPIFWTENWNGWFTSWGERSPHRPVEDLAFSVARFFQREGSFQNYYMYFGGTNFGR 180

Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKL 338
           TAGGP   TSYDYD+PIDEYG IR+PKWGHL++LH A+KLCE  L+S+D P + KLG K 
Sbjct: 181 TAGGPFYITSYDYDSPIDEYGLIREPKWGHLKDLHTALKLCEPALVSADSPQYIKLGPKQ 240

Query: 339 EAHIYHKSSN-------------DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
           EAH+YH  S              +C+AFLAN D      V FNG  Y LP WSVSILPDC
Sbjct: 241 EAHVYHMKSQTDDLTLSKLGTLRNCSAFLANIDERKAVAVKFNGQTYNLPPWSVSILPDC 300

Query: 386 KNVVFNTAKVISQRN----NGDHPFA-------QQKNVNELLLASSAFSWYEEKVGISGN 434
           +NVVFNTAKV +Q +        P +          + NEL + ++++   +E +GI  +
Sbjct: 301 QNVVFNTAKVAAQTSIKILELYAPLSANVSLKLHATDQNELSIIANSWMTVKEPIGIWSD 360

Query: 435 RSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK-------EVFLNIESLGHAALVFV 487
           ++F    + E +N TKD SDYLWY   IHV     +          + I+S+     VFV
Sbjct: 361 QNFTVKGILEHLNVTKDRSDYLWYMTRIHVSNDDIRFWKERNITPTITIDSVRDVFRVFV 420

Query: 488 NKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVI- 546
           N KL     G   +  F+  + ++  EG N L +LS  +GLQN GA+ +  GAG+   I 
Sbjct: 421 NGKLTGSAIGQ--WVKFV--QPVQFLEGYNDLLLLSQAMGLQNSGAFIEKDGAGIRGRIK 476

Query: 547 LIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFL 606
           L   KNG  DLS   W YQVG++GE++    +     + W + S   +  +  WYK  F 
Sbjct: 477 LTGFKNGDIDLSKSLWTYQVGLKGEFLNFYSLEENEKADWTELSVDAIPSTFTWYKAYFS 536

Query: 607 APEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHC 666
           +P+G  P+A+NL SMGKGQAWVNG  IGRYWS  ++P  GC +KCDYRG+Y++ KC  +C
Sbjct: 537 SPDGTDPVAINLGSMGKGQAWVNGHHIGRYWSV-VSPKDGCPRKCDYRGAYNSGKCATNC 595

Query: 667 GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPV--- 723
           G+P Q+ YHIPR+W+    NLLV+ EE GG+P +I +   +   IC  VSE+  P +   
Sbjct: 596 GRPTQSWYHIPRSWLKESSNLLVLFEETGGNPLEIVVKLYSTGVICGQVSESHYPSLRKL 655

Query: 724 -DSWKPNLGVVS--SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPI 779
            + +  +   +S  ++P++ L C+ G  I+++ FASYG P+G+C  F  G CH  + L +
Sbjct: 656 SNDYISDGETLSNRANPEMFLHCDDGHVISSVEFASYGTPQGSCNKFSRGPCHATNSLSV 715

Query: 780 VQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           V +AC+G+  C++ +S++  G     C  ++K LAVEA C
Sbjct: 716 VSQACLGKNSCTVEISNSAFG--GDPCHSIVKTLAVEARC 753


>gi|218184335|gb|EEC66762.1| hypothetical protein OsI_33138 [Oryza sativa Indica Group]
          Length = 828

 Score =  699 bits (1805), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/836 (44%), Positives = 504/836 (60%), Gaps = 59/836 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL  IETYVFWN HEP R
Sbjct: 31  VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            ++ FEG +D+VRF K +Q AG++  LRIGPY C EWNYGG PVWL  IPGI+FR  N P
Sbjct: 91  REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------NVEWAYGVGGELYV 177
           F+ EM+ F   I+  MK  N+FA QGGPIILAQ+ENEYG       N++ A+      Y+
Sbjct: 151 FENEMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHE-----YI 205

Query: 178 KWAADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
            W AD A   N  VPW+MCQQ+ D P  ++NTCNGFYC  +  N  S P MWTEN++GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
             +      RP ED+AFAVA FF+  G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325

Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
           DEYG +RQPK+GHL+ELH  +   E+ L+  D      G  +    Y  ++   A F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATS-ACFINN 384

Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
                D NVT +G  +FLPAWSVSILPDCK V FN+AK+ +Q          + ++ E  
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPDCKTVAFNSAKIKTQTT----VMVNKTSMVEQQ 440

Query: 417 LASSAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF 473
                +SW  E +         +F + +L EQI TT D SDYLWY  S+    G+G  V 
Sbjct: 441 TEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLE-HKGEGSYV- 498

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + + GH    FVN KLV   Y  ++   F +   ++L++G N + +LS  VGL+NYG 
Sbjct: 499 LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGG 558

Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG 589
            F++  AG+    V LID      DLS+  W Y+ G+ GEY  I LDK     + +    
Sbjct: 559 SFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDK---PGNKWRSHN 615

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           ST+P+N+   WYKTTF AP G+  + ++L  + KG AWVNG S+GRYW +Y+A       
Sbjct: 616 STIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCH 675

Query: 650 KCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLL 704
            CDYRG +    +A KC   CG+P+Q LYH+PR+++H GE N L++ EE GGDPS++++ 
Sbjct: 676 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLHKGEPNTLILFEEAGGDPSEVAVR 735

Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGN 763
           T     +C+     D                   V L+C   G  I++++ AS+G+  G 
Sbjct: 736 TVVEGSVCASAELGD------------------TVTLSCGAHGRTISSVDVASFGVARGR 777

Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           CGS+  G            ACVG+  C++ V+ A+   +AG   G+   L V+A C
Sbjct: 778 CGSYDGGCDSKVAYDAFAAACVGKESCTVLVTDAF--ANAGCVSGV---LTVQATC 828


>gi|224082320|ref|XP_002306647.1| predicted protein [Populus trichocarpa]
 gi|222856096|gb|EEE93643.1| predicted protein [Populus trichocarpa]
          Length = 764

 Score =  699 bits (1804), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/820 (45%), Positives = 496/820 (60%), Gaps = 59/820 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD R+L+I+G+ ++L SGSIHYPRSTP++W  LI K+K GG++VI+TYVFWN HEP 
Sbjct: 1   NVTYDGRSLIINGQHKILFSGSIHYPRSTPDMWSSLISKAKAGGIDVIQTYVFWNLHEPQ 60

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQ+YF GR DLVRFVK +Q  GL+  LRIGP+  +EW YGG P WLH IPG+ +R+ N 
Sbjct: 61  QGQFYFNGRADLVRFVKEIQAQGLYACLRIGPFIESEWTYGGLPFWLHDIPGMVYRSDNQ 120

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  MKRF+++I+ +MK E L+ASQGGPIIL+QVENEY NVE A+   G  YV+WAA  
Sbjct: 121 PFKYHMKRFVSRIVSMMKSEKLYASQGGPIILSQVENEYKNVEAAFHEKGPSYVRWAALM 180

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
           AVNL T VPWVMC+Q+DAPDP+IN+CNG  C + F  PNSP+KP +WTE+++ ++  +G 
Sbjct: 181 AVNLQTGVPWVMCKQDDAPDPVINSCNGMRCGETFAGPNSPNKPSIWTEDWTSFYQVYGE 240

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  +D+AF VA F    G++ NYYMY GGTNFGRTA    + + YD  AP+DEYG 
Sbjct: 241 ETYMRSAQDIAFHVALFIAKTGSYVNYYMYHGGTNFGRTASAFTITSYYD-QAPLDEYGL 299

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IRQPKWGHL+ELH AIK C + L+        LG   +A+++  +S  CAAFL N D   
Sbjct: 300 IRQPKWGHLKELHAAIKSCSKLLLHGAHKTFSLGPLQQAYVFQGNSGQCAAFLVNNDGKQ 359

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           +  V F  N Y LP  S+SILPDCK + FNTAKV +Q         +    N+   +   
Sbjct: 360 EVEVLFQSNSYKLPQKSISILPDCKTMTFNTAKVNAQYT------TRSMKPNQKFNSVGK 413

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
           +  Y E +      S     L E ++TTKDTSDYLWYT          + VF N +S GH
Sbjct: 414 WEEYNEPIPEFDKTSLRANRLLEHMSTTKDTSDYLWYTFRFQQNLPNAQSVF-NAQSHGH 472

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
               +VN     FG+G+H   +F +   + L  G N++ +LS  VGL + GA+ +   AG
Sbjct: 473 VLHAYVNGVHAGFGHGSHQNTSFSLQTTVRLKNGTNSVALLSATVGLPDSGAYLERRVAG 532

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
           L  V + +     +D ++  W YQVG+ GE + +   + +N   W +   L  N+ L+WY
Sbjct: 533 LRRVRIQN-----KDFTTYTWGYQVGLLGERLQIYTENGSNKVKWNK---LGTNRPLMWY 584

Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
           KT F AP G  P+ALNL SMGKG+AWVNGQSIGRYW ++                     
Sbjct: 585 KTLFDAPAGNDPVALNLGSMGKGEAWVNGQSIGRYWVSFHTSQ----------------- 627

Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
                G P+QT Y+IPR ++ P  NLLV+ EE  G P  I++ T +   +C + SE    
Sbjct: 628 -----GSPSQTWYNIPRAFLKPTGNLLVLLEEEKGYPPGITVDTVSVTKVCGYASE---- 678

Query: 722 PVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPI-V 780
                       S    V+L+C    +I++I FAS+G P GNC S+  G CH       V
Sbjct: 679 ------------SHLSAVQLSCPLKRNISSIIFASFGTPSGNCESYAIGNCHSSSSKANV 726

Query: 781 QKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +KAC+G+  CSIP S+ + G     CPG+ K L VEA C+
Sbjct: 727 EKACIGKRSCSIPQSNHFFG--GDPCPGIPKVLLVEAKCT 764


>gi|297842521|ref|XP_002889142.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334983|gb|EFH65401.1| hypothetical protein ARALYDRAFT_476906 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 818

 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/835 (45%), Positives = 501/835 (60%), Gaps = 54/835 (6%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++VI+TYVFWN HE
Sbjct: 22  AANVTYDGRSLIIDGQHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVIDTYVFWNIHE 81

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +GQ+ F GR D+V+F+K V+  GL++ LRIGP+   EW+YGG P WLH + GI FRT 
Sbjct: 82  PQQGQFDFSGRRDIVKFIKEVKAHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTD 141

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  MKR+   I+ LMK ENL+ASQGGPIIL+Q+ENEYG V  A+   G+ YVKWAA
Sbjct: 142 NEPFKYHMKRYAQMIVKLMKSENLYASQGGPIILSQIENEYGMVARAFRQDGKSYVKWAA 201

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
             AV L+T VPWVMC+Q+DAPDP++N CNG  C + F  PNSP+KP +WTEN++ ++ ++
Sbjct: 202 KLAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTY 261

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  ED+AF VA F    G+F NYYMY GGTNFGR A    V TSY   AP+DEY
Sbjct: 262 GEEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEY 320

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +RQPKWGHL+ELH A+KLCEE L+S   T   LG    A ++ K +N CAA L N D 
Sbjct: 321 GLLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAALLVNQD- 379

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
             D  V F  + Y L   S+S+LPDCKNV FNTAKV +Q N       + +   + L + 
Sbjct: 380 KCDCTVQFRNSSYRLSPKSISVLPDCKNVAFNTAKVNAQYN------TRTRKPRQNLSSP 433

Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
             +  + E V      S     L E +NTT+DTSDYLW T        +G    L +  L
Sbjct: 434 HMWEKFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFE--QSEGAPSVLKVNHL 491

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
           GH    FVN++ +   +G     +FL+ K + LN G N + +LS+MVGL N GA  +   
Sbjct: 492 GHVLHAFVNERFIGSMHGTFKAHSFLLEKNMSLNNGTNNMALLSVMVGLPNSGAHLERRV 551

Query: 540 AGLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
            G  SV   ++ NG   L  ++  W YQVG++GE   +     A    WKQ      ++ 
Sbjct: 552 VGSRSV---NIWNGSYQLFFNNYSWGYQVGLKGEKYHVYTEDGAKKVQWKQYRD-SKSQP 607

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           L WYK +F  PEG+ P+ALNL SMGKG+AWVNGQSIGRYW ++                 
Sbjct: 608 LTWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSFYTSK------------- 654

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVS 716
                    G P+Q  YHIPR+++ P  NLLVI  EE  G P  I++ T +   +C  VS
Sbjct: 655 ---------GNPSQIWYHIPRSFLKPNSNLLVILEEEREGYPLGITIDTVSVTEVCGHVS 705

Query: 717 EADPPPVDSWKPN----------LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS 766
              P PV S +                   P+V+L C  G  I+ + FA++G P G+CGS
Sbjct: 706 NTHPHPVISPRKKGHNRNEQRHLKYRYDRKPKVQLQCPTGRKISKVLFATFGNPNGSCGS 765

Query: 767 FRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +  G+CH  + L +VQKAC+ +  CS+PV S   G     CP  +K+L V A CS
Sbjct: 766 YSVGSCHSPNSLAVVQKACLRKSRCSVPVWSKTFG--GDLCPQTVKSLLVRAQCS 818


>gi|125556152|gb|EAZ01758.1| hypothetical protein OsI_23787 [Oryza sativa Indica Group]
          Length = 828

 Score =  696 bits (1795), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/831 (44%), Positives = 493/831 (59%), Gaps = 49/831 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31  VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G +D+VRF K +Q AGL+  LRIGPY C EWNYGG P WL  IPG+QFR  N P
Sbjct: 91  RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   I++ MK  N+FA QGGPIILAQ+ENEYGN+  +         Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG 
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +        K+    Y   S   A F+ N + + 
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTS-ACFINNRNDNM 389

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +G  + LPAWSVSILPDCK V FN+AK+ +Q          + N+ E    S  
Sbjct: 390 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----VMVNKANMVEKEPESLK 445

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  SI+        +F+N  +
Sbjct: 446 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGEASYTLFVN--T 503

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +    +L++G N + +LS  +GL+NYG  F+  
Sbjct: 504 TGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEKM 563

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK      ++     T+P+
Sbjct: 564 PAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDK---PGCTWDNNNGTVPI 620

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           NK   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A   G    CDYR
Sbjct: 621 NKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDYR 680

Query: 655 GSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTGQ 709
           G +    D  KC   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS +S  T    
Sbjct: 681 GVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTLILFEEAGGDPSHVSFRTVAAG 740

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSFR 768
            +C+     D                   + L+C +    I+AIN  S+G+  G CG+++
Sbjct: 741 SVCASAEVGD------------------TITLSCGQHSKTISAINMTSFGVARGQCGAYK 782

Query: 769 PGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G           +AC+G+  C++ +++A   V+   C  L   L V+A C
Sbjct: 783 GGCESKAAYKAFTEACLGKESCTVQITNA---VTGSGC--LSNVLTVQASC 828


>gi|330689960|gb|AEC33272.1| beta-galactosidase [Ziziphus jujuba]
          Length = 730

 Score =  694 bits (1792), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/729 (49%), Positives = 459/729 (62%), Gaps = 27/729 (3%)

Query: 104 GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG 163
           GGFPVWL ++PGI FRT N PFK  M+ F  KI+ ++K ENLFASQGGPIIL+Q+ENEYG
Sbjct: 1   GGFPVWLKYVPGISFRTDNGPFKTAMQGFTQKIVQMLKSENLFASQGGPIILSQIENEYG 60

Query: 164 NVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPS 223
               A G  G  Y+ WAA  AV LNT VPWVMC+++DAPDP+IN CNGFYCDGF+PN P 
Sbjct: 61  PESKALGAAGRSYINWAAKMAVGLNTGVPWVMCKEDDAPDPVINACNGFYCDGFSPNKPY 120

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
           KPI+WTE +SGWF  FG  V  RPV+DLAFAVARF + GG++ NYYMY GGTNFGRTAGG
Sbjct: 121 KPILWTEAWSGWFTEFGGTVHQRPVQDLAFAVARFIQKGGSYFNYYMYHGGTNFGRTAGG 180

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
           P V TSYDYDAPIDEYG  R+PK+ HL+ELHKAIKL E+ L+S+ PT   LG   +A+IY
Sbjct: 181 PFVTTSYDYDAPIDEYGLTREPKYSHLKELHKAIKLSEDALVSAGPTITSLGTYEQAYIY 240

Query: 344 HKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD 403
           +     CAAFLANY+S S A V FN   Y LP WS+SILPDC+NV +NTA V        
Sbjct: 241 NSGPRKCAAFLANYNSKSAARVLFNNRHYNLPPWSISILPDCRNVAYNTALV-------- 292

Query: 404 HPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRP-DLAEQINTTKDTSDYLWYTA 460
               Q  +V+ L   +S  SW  Y+E +     R+ +    L EQIN T+DTSDYLWY  
Sbjct: 293 --GVQTSHVHMLPTGTSLLSWETYDEVISSLDERARMTAVGLLEQINVTRDTSDYLWYMT 350

Query: 461 SIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEG 515
           S+ +   +     G++  LN++S GHA  VF+N +     +G  +   F     + L  G
Sbjct: 351 SVDISSSESFLRGGQKPTLNVQSAGHAVRVFINGQFSGSAFGTREHRQFTFTGPVNLRAG 410

Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIG 574
            N + +LS+ VGL N G  +++   G+   + ++ L NGKRDL+  +W YQVG++GE + 
Sbjct: 411 SNKISLLSIAVGLPNVGFHYELWETGVLGPVFLNGLDNGKRDLTWQKWSYQVGLKGEAMN 470

Query: 575 LDKISLANSSFWKQGSTLPVN-KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
           L     A+S+ W +GS    + + L WYK  F AP G  PLAL+L SMGKGQ  +NGQSI
Sbjct: 471 LVTPEGASSADWVRGSLAARSVQPLTWYKAYFNAPNGNEPLALDLRSMGKGQVRINGQSI 530

Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
           GRYW+AY   + G  + C Y G             P Q  YH+PR+W+ P +NLLVI EE
Sbjct: 531 GRYWTAY---AKGDCEACSYTGHSGRQNVNLVVASPTQRWYHVPRSWLKPKQNLLVIFEE 587

Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVD-SWKPNLGVVSSSPQVRLACERGWHIAAI 752
           LGGD SKI+LL ++  ++C+   E  P     S     G       V L C  G  I+AI
Sbjct: 588 LGGDASKIALLRRSLTNVCANAFENHPSMAKYSTSSQDGSKVKEATVNLQCGPGQSISAI 647

Query: 753 NFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLK 811
            FAS+G P G CGSF  G CH  +   I++K CVGQ  CS+ +S++  G  A  CP +LK
Sbjct: 648 EFASFGTPSGTCGSFHIGTCHAPNSRSIIEKKCVGQKSCSVTISNSIFG--ADPCPNVLK 705

Query: 812 ALAVEAHCS 820
            L VEA CS
Sbjct: 706 RLTVEAVCS 714


>gi|222612650|gb|EEE50782.1| hypothetical protein OsJ_31141 [Oryza sativa Japonica Group]
          Length = 828

 Score =  693 bits (1788), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/836 (44%), Positives = 503/836 (60%), Gaps = 59/836 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL  IETYVFWN HEP R
Sbjct: 31  VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            ++ FEG +D+VRF K +Q AG++  LRIGPY C EWNYGG PVWL  IPGI+FR  N P
Sbjct: 91  REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------NVEWAYGVGGELYV 177
           F+  M+ F   I+  MK  N+FA QGGPIILAQ+ENEYG       N++ A+      Y+
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHE-----YI 205

Query: 178 KWAADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
            W AD A   N  VPW+MCQQ+ D P  ++NTCNGFYC  +  N  S P MWTEN++GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
             +      RP ED+AFAVA FF+  G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325

Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
           DEYG +RQPK+GHL+ELH  +   E+ L+  D      G  +    Y  ++   A F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATS-ACFINN 384

Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
                D NVT +G  +FLPAWSVSILP+CK V FN+AK+ +Q          + ++ E  
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTT----VMVNKTSMVEQQ 440

Query: 417 LASSAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF 473
                +SW  E +         +F + +L EQI TT D SDYLWY  S+    G+G  V 
Sbjct: 441 TEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLE-HKGEGSYV- 498

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + + GH    FVN KLV   Y  ++   F +   ++L++G N + +LS  VGL+NYG 
Sbjct: 499 LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSPVKLHDGKNYISLLSGTVGLRNYGG 558

Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG 589
            F++  AG+    V LID      DLS+  W Y+ G+ GEY  I LDK     + +    
Sbjct: 559 SFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDK---PGNKWRSHN 615

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           ST+P+N+   WYKTTF AP G+  + ++L  + KG AWVNG S+GRYW +Y+A       
Sbjct: 616 STIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCH 675

Query: 650 KCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLL 704
            CDYRG +    +A KC   CG+P+Q LYH+PR++++ GE N L++ EE GGDPS++++ 
Sbjct: 676 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVR 735

Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGN 763
           T     +C+     D                   V L+C   G  I++++ AS+G+  G 
Sbjct: 736 TVVEGSVCASAEVGD------------------TVTLSCGAHGRTISSVDVASFGVARGR 777

Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           CGS+  G            ACVG+  C++ V+ A+   +AG   G+   L V+A C
Sbjct: 778 CGSYDGGCESKVAYDAFAAACVGKESCTVLVTDAF--ANAGCVSGV---LTVQATC 828


>gi|357453875|ref|XP_003597218.1| Beta-galactosidase [Medicago truncatula]
 gi|355486266|gb|AES67469.1| Beta-galactosidase [Medicago truncatula]
          Length = 2260

 Score =  691 bits (1783), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/491 (66%), Positives = 385/491 (78%), Gaps = 3/491 (0%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN HEP
Sbjct: 20  TNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLHEP 79

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           ++GQY F+GR DLV+FVK V EAGL++HLRIGPY C+EWNYGGFP+WLHFIPGI+FRT N
Sbjct: 80  VKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCSEWNYGGFPLWLHFIPGIKFRTDN 139

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF  KI+DLMKQE L+ASQGGPIIL+Q+ENEYG+++ AYG  G+ Y+ WAA 
Sbjct: 140 EPFKVEMKRFTTKIVDLMKQEKLYASQGGPIILSQIENEYGDIDSAYGSAGKSYINWAAK 199

Query: 183 TAVNLNTSVPWVMCQQEDAPDPI-INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A +L+T VPWVMCQQ DAPDPI INTCNGFYCD FTPNS +KP +WTEN+S W+L FG 
Sbjct: 200 MATSLDTGVPWVMCQQADAPDPIVINTCNGFYCDQFTPNSKTKPKLWTENWSAWYLLFGG 259

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P RPVEDLAFAVARFF+ GGTFQNYYMY GGTNF R+ GGP +ATSYD+DAPIDEYG 
Sbjct: 260 GFPHRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFDRSTGGPFIATSYDFDAPIDEYGV 319

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IRQPKWGHL+++HKAIKLCEE LI+++P    LG  LEA +Y K+ + CAAFLAN D+ S
Sbjct: 320 IRQPKWGHLKDVHKAIKLCEEALIAAEPKITYLGPNLEAAVY-KTGSVCAAFLANVDAKS 378

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNELLLASS 420
           D  V F+GN Y LPAWSVSILPDCKNVV NTAK+ S     +      K +++    + S
Sbjct: 379 DKTVNFSGNSYHLPAWSVSILPDCKNVVLNTAKINSASTISNFVTESLKEDISSSETSRS 438

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +SW  E VGIS +    +  L EQIN T D SDYLWY+ S+ +    G +  L+IESLG
Sbjct: 439 KWSWINEPVGISKDDILSKTGLLEQINITADRSDYLWYSLSVDLKDDPGSQTVLHIESLG 498

Query: 481 HAALVFVNKKL 491
           HA   F+N KL
Sbjct: 499 HALHAFINGKL 509



 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 152/319 (47%), Positives = 203/319 (63%), Gaps = 16/319 (5%)

Query: 510  IELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKR--DLSSGEWIYQV 566
            I +  G N +D+LS+ VGLQNYGA+FD  GAG+   VIL  LKNG +  DLSS +W YQV
Sbjct: 1950 ITVLSGKNKIDLLSLTVGLQNYGAFFDTWGAGITGPVILKGLKNGNKTLDLSSRKWTYQV 2009

Query: 567  GVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQA 626
            G++GE +GL   S  +S  W   +T P  + LIWYKT F AP G  P+ ++   MGKG+A
Sbjct: 2010 GLKGEDLGL---SSGSSGAWNSKTTFPKKQPLIWYKTNFDAPSGSNPVVIDFTGMGKGEA 2066

Query: 627  WVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGEN 686
            WVNGQSIGRYW  Y+A +  CT  C+YRG +  +KC  +CG+P+QTLYH+P++++ P  N
Sbjct: 2067 WVNGQSIGRYWPTYVASNVDCTDSCNYRGPFTQTKCHMNCGKPSQTLYHVPQSFLKPNGN 2126

Query: 687  LLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSPQVRLAC 743
             LV+ EE GGDP++IS  TK    +C+ VS++ PP +D W  +    G V   P + L C
Sbjct: 2127 TLVLFEESGGDPTQISFATKQIGSVCAHVSDSHPPQIDLWNQDTESGGKV--GPALLLNC 2184

Query: 744  -ERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGV 801
                  I++I FASYG P G CG+F  G C  +  L IV+KAC+G   CSI VS+   G 
Sbjct: 2185 PNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKTLSIVKKACIGSRSCSIGVSTDTFG- 2243

Query: 802  SAGACPGLLKALAVEAHCS 820
                C G+ K+LAVEA C+
Sbjct: 2244 --DPCKGVPKSLAVEATCA 2260


>gi|26451843|dbj|BAC43014.1| unknown protein [Arabidopsis thaliana]
 gi|29029060|gb|AAO64909.1| At1g77410 [Arabidopsis thaliana]
          Length = 820

 Score =  690 bits (1780), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/812 (46%), Positives = 494/812 (60%), Gaps = 49/812 (6%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 23  ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 82

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQ+ F G  D+V+F+K V+  GL++ LRIGP+   EW+YGG P WLH + GI FRT N
Sbjct: 83  QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 142

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  MKR+   I+ LMK ENL+ASQGGPIIL+Q+ENEYG V  A+   G+ YVKW A 
Sbjct: 143 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 202

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L+T VPWVMC+Q+DAPDP++N CNG  C + F  PNSP+KP +WTEN++ ++ ++G
Sbjct: 203 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+AF VA F    G+F NYYMY GGTNFGR A    V TSY   AP+DEYG
Sbjct: 263 EEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL+ELH A+KLCEE L+S   T   LG    A ++ K +N CAA L N D  
Sbjct: 322 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            ++ V F  + Y L   SVS+LPDCKNV FNTAKV +Q N       + +   + L +  
Sbjct: 381 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 434

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  + E V      S     L E +NTT+DTSDYLW T        +G    L +  LG
Sbjct: 435 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 492

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   FVN + +   +G      FL+ K + LN G N L +LS+MVGL N GA  +    
Sbjct: 493 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 552

Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
           G  SV    + NG+  L  ++  W YQVG++GE   +     +    WKQ      ++ L
Sbjct: 553 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 608

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYK +F  PEG+ P+ALNL SMGKG+AWVNGQSIGRYW ++                  
Sbjct: 609 TWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIGRYWVSF------------------ 650

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSE 717
                 + G P+Q  YHIPR+++ P  NLLVI  EE  G+P  I++ T +   +C  VS 
Sbjct: 651 ----HTYKGNPSQIWYHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSN 706

Query: 718 ADPPPVDS------WKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            +P PV S       + NL       P+V+L C  G  I+ I FAS+G P G+CGS+  G
Sbjct: 707 TNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIG 766

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGV 801
           +CH  + L +VQKAC+ +  CS+PV S   GV
Sbjct: 767 SCHSPNSLAVVQKACLKKSRCSVPVWSKTFGV 798


>gi|115437264|ref|NP_001043252.1| Os01g0533400 [Oryza sativa Japonica Group]
 gi|75158475|sp|Q8RUV9.1|BGAL1_ORYSJ RecName: Full=Beta-galactosidase 1; Short=Lactase 1; Flags:
           Precursor
 gi|20146357|dbj|BAB89138.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|20161405|dbj|BAB90329.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|113532783|dbj|BAF05166.1| Os01g0533400 [Oryza sativa Japonica Group]
 gi|215767421|dbj|BAG99649.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 827

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/834 (43%), Positives = 497/834 (59%), Gaps = 52/834 (6%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +V+YD R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETY+FWN HEP
Sbjct: 29  TSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            R QY FEG +D+VRF K +Q AG++  LRIGPY C EWNYGG P WL  IPG+QFR  N
Sbjct: 89  HRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWA 180
            PF+ EM+ F   I++ MK   +FA QGGPIILAQ+ENEYGN+  +         Y+ W 
Sbjct: 149 EPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWC 208

Query: 181 ADTAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           AD A   N  VPW+MCQQ +D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++
Sbjct: 209 ADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAW 268

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
                 R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEY
Sbjct: 269 DKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEY 328

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +RQPK+GHL+ELH  +K  E+ L+  +      G  +    Y   S+  A F+ N   
Sbjct: 329 GNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSS-ACFINNRFD 387

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
             D NVT +G  + LPAWSVSILPDCK V FN+AK+ +Q +       ++ N  E    S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTS----VMVKKPNTAEQEQES 443

Query: 420 SAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
             +SW  E +         +F + +L EQI T+ D SDYLWY  S++   G+G    L +
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLN-HKGEGS-YKLYV 501

Query: 477 ESLGHAALVFVNKKLVAFGY-GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
            + GH    FVN KL+   +  + DF  F +   ++L++G N + +LS  VGL+NYG  F
Sbjct: 502 NTTGHELYAFVNGKLIGKNHSADGDFV-FQLESPVKLHDGKNYISLLSATVGLKNYGPSF 560

Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWK-QGS 590
           +    G+    V LID      DLS+  W Y+ G+  EY  I LDK        W     
Sbjct: 561 EKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYK----WNGNNG 616

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           T+P+N+   WYK TF AP G+  + ++L  + KG AWVNG ++GRYW +Y A       +
Sbjct: 617 TIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHR 676

Query: 651 CDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLT 705
           CDYRG++    D ++C   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS ++L T
Sbjct: 677 CDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVALRT 736

Query: 706 KTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCG 765
                +C+     D                   V L+C  G  +++++ AS+G+  G CG
Sbjct: 737 VVPGAVCTSGEAGDA------------------VTLSCGGGHAVSSVDVASFGVGRGRCG 778

Query: 766 SFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            +  G            ACVG+  C++ ++ A+ G  AG   G+   L V+A C
Sbjct: 779 GYEGGCESKAAYEAFTAACVGKESCTVEITGAFAG--AGCLSGV---LTVQATC 827


>gi|75169194|sp|Q9C6W4.1|BGL15_ARATH RecName: Full=Beta-galactosidase 15; Short=Lactase 15; Flags:
           Precursor
 gi|12597826|gb|AAG60136.1|AC074360_1 hypothetical protein [Arabidopsis thaliana]
          Length = 779

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/832 (43%), Positives = 503/832 (60%), Gaps = 91/832 (10%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V++D RA+ IDG RRVL SGSIHYPRST E+WP+LI+K KEG L+ IETYVFWN HEP R
Sbjct: 22  VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGSLDAIETYVFWNAHEPTR 81

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DL+RF+KT+Q  G++  LRIGPY CAEWNYGGFPVWLH +PG++FRTTN  
Sbjct: 82  RQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 141

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F  EM+ F   I++++K+E LFASQGGPIILAQ+ENEYGNV  +YG  G+ Y++W A+ A
Sbjct: 142 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIQWCANMA 201

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +L+  VPW+MCQQ+DAP P++NTCNG+YCD F+PN+P+ P MWTEN++GW+ ++G   P
Sbjct: 202 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNTPKMWTENWTGWYKNWGGKDP 261

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  ED+AFAVARFF+  GTFQNYYMY GGTNF RTAGGP + T+YDYDAP+DE+G + Q
Sbjct: 262 HRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQ 321

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL++LH  +   E+ L   + +    G  + A +Y ++    + F+ N + +SDA 
Sbjct: 322 PKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVY-QTEEGSSCFIGNVNETSDAK 380

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           + F G  Y +PAWSVSILPDCK   +NTAK+ +Q +       ++ N  E   ++  +SW
Sbjct: 381 INFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTS----VMVKKANEAENEPSTLKWSW 436

Query: 425 YEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
             E +    + G        L +Q   + D SDYLWY  ++++    P  GK + L I S
Sbjct: 437 RPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNLKEQDPVLGKNMSLRINS 496

Query: 479 LGHAALVFVNKKLVAFGYGNHDFAN----FLINKKIELNEGINTLDILSMMVGLQNYGAW 534
             H    FVN + +    GN+   N    ++  +  + N G N + +LS+ VGL NYGA+
Sbjct: 497 TAHVLHAFVNGQHI----GNYRVENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAF 552

Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           F+   AG+   + I  +NG     +DLS+ +W Y+ G+ G           N  F  +  
Sbjct: 553 FENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSG---------FENQLFSSES- 602

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                       +T+ AP G  P+ ++L  +GKG AW+NG +IGRYW A+L+   GC+ +
Sbjct: 603 -----------PSTWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLSDIDGCSAE 651

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQ 709
                                  YH+PR++++  G+N LV+ EE+GG+PS ++  T    
Sbjct: 652 -----------------------YHVPRSFLNSEGDNTLVLFEEIGGNPSLVNFQTIGVG 688

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
            +C+ V E +                   + L+C  G  I+AI FAS+G P G+CGSF  
Sbjct: 689 SVCANVYEKN------------------VLELSC-NGKPISAIKFASFGNPGGDCGSFEK 729

Query: 770 GACHM--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G C    +   I+ + CVG+ +CSI VS    G  A  C  L K LAVEA C
Sbjct: 730 GTCEASNNAAAILTQECVGKEKCSIDVSEDKFG--AAECGALAKRLAVEAIC 779


>gi|183604889|gb|ACC64531.1| beta-galactosidase 6 [Oryza sativa Indica Group]
          Length = 811

 Score =  686 bits (1771), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/826 (45%), Positives = 502/826 (60%), Gaps = 47/826 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L   +TYD RALV+ G RR+  SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25  LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EPI+GQY FEGR+DLV+F++ +Q  GL++ LRIGP+  AEW YGGFP WLH +P I FR+
Sbjct: 85  EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+ F+ KI+ +MK E L+  QGGPII++Q+ENEY  +E A+G  G  YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLS 238
           A  AV L T VPW+MC+Q DAPDP+INTCNG  C + F  PNSP+KP +WTEN++  +  
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPI 264

Query: 239 FGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
           +G     R  ED+AFAVA +     G+F +YYMY GGTNFGR A    V TSY   AP+D
Sbjct: 265 YGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLD 323

Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           EYG I QP WGHLRELH A+K   E L+    ++  LG + EAH++ ++   C AFL N+
Sbjct: 324 EYGLIWQPTWGHLRELHCAVKQSSEPLLFGSYSNFSLGQQQEAHVF-ETDFKCVAFLVNF 382

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN-VNELL 416
           D  +   V F      L   S+S+L DC+NVVF TAKV +Q  +      Q  N +N   
Sbjct: 383 DQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWK 442

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LN 475
                      K   +GN+ F      EQ+ TTKD +DYLWY  S       G ++  L 
Sbjct: 443 AFIEPVPQDLSKSTYTGNQLF------EQLPTTKDETDYLWYIVSYKNRASDGNQIARLY 496

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
           ++SL H    FVN + V   +G+HD   N ++N  + L EG NT+ +LS+MVG  + GA+
Sbjct: 497 VKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAY 556

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
            +    G+ +V +   +     L++  W YQVG+ GE   +      NS  W   + L +
Sbjct: 557 MERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMDINNL-I 615

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L WYKTTF  P G   + LNL SMGKG+ WVNG+SIGRYW ++ APS          
Sbjct: 616 YHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS---------- 665

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                       GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++ T +   +C  
Sbjct: 666 ------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGN 713

Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
           V E   PP+ S           P+VR+ C+ G  I++I FASYG P G+C SFR G+CH 
Sbjct: 714 VDEFSVPPLQS-------RGKVPKVRIWCQGGKRISSIEFASYGNPVGDCRSFRIGSCHA 766

Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           +    +V+++C+G+  CSIPV +A  G     CPG+ K+L V A C
Sbjct: 767 ESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 810


>gi|356541034|ref|XP_003538988.1| PREDICTED: beta-galactosidase 13-like, partial [Glycine max]
          Length = 806

 Score =  685 bits (1768), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/831 (41%), Positives = 497/831 (59%), Gaps = 50/831 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+I+G+R +L SGSIHYPRSTPE W  ++ K+++GG+ V++TYVFWN HE  +
Sbjct: 9   VTYDGRSLIINGRRELLFSGSIHYPRSTPEEWAGILDKARQGGINVVQTYVFWNIHETEK 68

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y  E ++D ++F+K +Q+ G+++ LR+GP+  AEWN+GG P WL  +P I FR+ N P
Sbjct: 69  GKYSIEPQYDYIKFIKLIQKKGMYVTLRVGPFIQAEWNHGGLPYWLREVPEIIFRSNNEP 128

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK+ MK++++ +I  +K  NLFA QGGPIILAQ+ENEY +++ A+   G+ YV+WAA  A
Sbjct: 129 FKKHMKKYVSTVIKTVKDANLFAPQGGPIILAQIENEYNHIQRAFREEGDNYVQWAAKMA 188

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V+L+  VPW+MC+Q DAPDP+IN CNG +C D F+ PN P KP +WTEN++  +  FG  
Sbjct: 189 VSLDIGVPWIMCKQTDAPDPVINACNGRHCGDTFSGPNKPYKPAIWTENWTAQYRVFGDP 248

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF+VARFF   G+  NYYMY GGTNFGRT+      T Y  +AP+DEYG  
Sbjct: 249 PSQRSAEDIAFSVARFFSKNGSLVNYYMYHGGTNFGRTSSA-FTTTRYYDEAPLDEYGMQ 307

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           R+PKW HLR++H+A+ LC+  L +   T  K+    E  ++ K  SN CAAF+ N  +  
Sbjct: 308 REPKWSHLRDVHRALSLCKRALFNGASTVTKMSQHHEVIVFEKPGSNLCAAFITNNHTKV 367

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              ++F G  Y++P  S+SILPDCK VVFNT  + SQ ++ +   +         +A++ 
Sbjct: 368 PTTISFRGTDYYMPPRSISILPDCKTVVFNTQCIASQHSSRNFKRS---------MAAND 418

Query: 422 FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE-----VFL 474
             W  Y E +  +        +  E  +  KDTSDY WYT S+ + P    +       L
Sbjct: 419 HKWEVYSETIPTTKQIPTHEKNPIELYSLLKDTSDYAWYTTSVELRPEDLPKKNDIPTIL 478

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I SLGH+ L FVN + +   +G+H+   F   K + L  G+N + IL+  VGL + GA+
Sbjct: 479 RIMSLGHSLLAFVNGEFIGSNHGSHEEKGFEFQKPVTLKVGVNQIAILASTVGLPDSGAY 538

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
            +   AG  S+ ++ L +GK DL+S  W ++VG++GE +G+     +    WK+      
Sbjct: 539 MEHRFAGPKSIFILGLNSGKMDLTSNGWGHEVGIKGEKLGIFTEEGSKKVQWKEAKG--P 596

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
             ++ WYKT F  PEG  P+A+ +  MGKG  W+NG+SIGR+W +YL+P           
Sbjct: 597 GPAVSWYKTNFATPEGTDPVAIRMTGMGKGMVWINGKSIGRHWMSYLSP----------- 645

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                       GQP Q+ YHIPRT+ +P +NLLV+ EE   +P K+ +LT     ICSF
Sbjct: 646 -----------LGQPTQSEYHIPRTYFNPKDNLLVVFEEEIANPEKVEILTVNRDTICSF 694

Query: 715 VSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           V+E  PP V SW     K    V    P   L C     I A+ FAS+G P G CG+F  
Sbjct: 695 VTENHPPNVKSWAIKSEKFQAVVNDLVPSASLKCPHQRTIKAVEFASFGDPAGACGAFAL 754

Query: 770 GACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G C+   +  IV+K C+G+  C +P+          ACP + KALA++  C
Sbjct: 755 GKCNAPAIKQIVEKQCLGKASCLVPIDKDAFTKGQDACPNVTKALAIQVRC 805


>gi|357455519|ref|XP_003598040.1| Beta-galactosidase [Medicago truncatula]
 gi|355487088|gb|AES68291.1| Beta-galactosidase [Medicago truncatula]
          Length = 812

 Score =  684 bits (1766), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/831 (42%), Positives = 490/831 (58%), Gaps = 69/831 (8%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  V YD  A++++G+R+++ SG+IHYPRST ++WP+LI K+K+G L+ IETY+FW+ HE
Sbjct: 23  ATTVEYDSSAIILNGERKLIISGAIHYPRSTSQMWPDLIMKAKDGDLDAIETYIFWDLHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+R +Y F G  D ++F+K  QE GL++ LRIGPY CAEWNYGGFP+WLH +PGIQ RT 
Sbjct: 83  PVRRKYDFSGNLDFIKFLKIAQEQGLYVVLRIGPYVCAEWNYGGFPMWLHNMPGIQLRTD 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  FKEEMK F  KI+ + K+  LFA QGGPIILAQ+ENEYG+V   YG  G  Y+KW A
Sbjct: 143 NAVFKEEMKIFTTKIVTMCKEAGLFAPQGGPIILAQIENEYGDVISHYGEAGNSYIKWCA 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           + A+  N  VPW+MC+Q++AP  II+TCNG+YCD F PN+P  P ++TEN+ GWF  +G 
Sbjct: 203 EMALAQNIGVPWIMCKQKNAPATIIDTCNGYYCDTFKPNNPKSPKIFTENWVGWFQKWGE 262

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P R  ED AF+VARFF+ GG  QNYY+Y GGTNFGRTAGGP + T+YDYDAP+DEYG 
Sbjct: 263 RRPHRTAEDSAFSVARFFQNGGALQNYYLYHGGTNFGRTAGGPFIITTYDYDAPLDEYGN 322

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSS 360
           + +PK+GHL+ LH AIKL E+ L +   T +  G  L    Y +K +     FL+N  +S
Sbjct: 323 LIEPKYGHLKRLHAAIKLGEKVLTNGTATWESHGDSLWMTTYTNKGTGQKFCFLSNSHTS 382

Query: 361 SDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
            DA V    +  Y++PAWS+S+L DC   V+NTAK  +Q N         K +++ L  S
Sbjct: 383 KDAEVDLQQDGKYYVPAWSMSLLQDCNKEVYNTAKTEAQTN------IYMKQLDQKLGNS 436

Query: 420 SAFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP----GQGKEVF 473
             +SW  + +     G  +F    L +Q + T   SDYLWY   + V      G+ K   
Sbjct: 437 PEWSWTSDPMEDTFQGKGTFTASQLLDQKSVTVGASDYLWYMTEVVVNDTNTWGKAK--- 493

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           + + + GH   +F+N  L    +G      F+    I LN+G N + +LS+ VG  NYGA
Sbjct: 494 VQVNTTGHILYLFINGFLTGTQHGTVSQPGFIHEGNISLNQGTNIISLLSVTVGHANYGA 553

Query: 534 WFDVAGAGL----FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
           +FD+   G+      +  I+  N   DLS   W Y+VG+ G               WK  
Sbjct: 554 FFDMQETGIVGGPVKLFSIENPNNVLDLSKSTWSYKVGINGMTKKFYDPKTTIGVQWKT- 612

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           + + +   + WYKTTF  P+G  P+ L+L  + KG+AWVNGQSIGRYW A LA + GC+ 
Sbjct: 613 NNVSIGVPMTWYKTTFKTPDGTNPVVLDLIGLQKGEAWVNGQSIGRYWPAMLAENKGCSD 672

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
            CDYRG Y+A KC   CG+P+Q  YH+PR++++   N LV+ EE+G D +  +       
Sbjct: 673 TCDYRGEYNADKCLSGCGEPSQRFYHVPRSFLNNDVNTLVLFEEMGFDATPFN------- 725

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
                                               G  ++ I FASYG PEG+CGSF+ 
Sbjct: 726 ------------------------------------GKTMSEIQFASYGDPEGSCGSFKI 749

Query: 770 GACHMDV-LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           G         +V+KAC+G+  CSI V+S+   +  G   G    LAV+  C
Sbjct: 750 GEWESRYSKTVVEKACIGKQSCSINVTSSTFRLKKGGTNG---QLAVQLSC 797


>gi|125574401|gb|EAZ15685.1| hypothetical protein OsJ_31098 [Oryza sativa Japonica Group]
          Length = 824

 Score =  684 bits (1765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/832 (43%), Positives = 488/832 (58%), Gaps = 51/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 27  VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY FEG +D++RF K +Q AGL+  LRIGPY C EWNYGG P WL  IP +QFR  N P
Sbjct: 87  RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   II+ MK  N+FA QGGPIILAQ+ENEYGNV  +         Y+ W AD
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG 
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +         +    Y   S   A F+ N + + 
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 385

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q         ++ N+ E    S  
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPESLK 441

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  S+         +F+N  +
Sbjct: 442 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 499

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +   ++L++G N + +LS  +GL+NYG  F+  
Sbjct: 500 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 559

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK        W   + T+P
Sbjct: 560 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 615

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           +N+   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A   G    CDY
Sbjct: 616 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 675

Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
           RG +    D  KC   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS++   +   
Sbjct: 676 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 735

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
             +C      D                   + L+C +    I+ I+  S+G+  G CG++
Sbjct: 736 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 777

Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G           +AC+G+  C++ + +A  G       GL   L V+A C
Sbjct: 778 EGGCESKAAYKAFTEACLGKESCTVQIINALTGSG-----GLSGVLTVQASC 824


>gi|115481546|ref|NP_001064366.1| Os10g0330600 [Oryza sativa Japonica Group]
 gi|122249227|sp|Q7G3T8.1|BGL13_ORYSJ RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
           Precursor
 gi|110288895|gb|AAP53027.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113638975|dbj|BAF26280.1| Os10g0330600 [Oryza sativa Japonica Group]
          Length = 828

 Score =  682 bits (1761), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/832 (43%), Positives = 490/832 (58%), Gaps = 51/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31  VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY FEG +D++RF K +Q AGL+  LRIGPY C EWNYGG P WL  IP +QFR  N P
Sbjct: 91  RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   II+ MK  N+FA QGGPIILAQ+ENEYGNV  +         Y+ W AD
Sbjct: 151 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 210

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG 
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +         +    Y   S   A F+ N + + 
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 389

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q         ++ N+ E    S  
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPESLK 445

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  S+         +F+N  +
Sbjct: 446 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 503

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +   ++L++G N + +LS  +GL+NYG  F+  
Sbjct: 504 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 563

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK        W   + T+P
Sbjct: 564 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 619

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           +N+   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A   G    CDY
Sbjct: 620 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 679

Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
           RG +    D  KC   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS++   +   
Sbjct: 680 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 739

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
             +C      D                   + L+C +    I+ I+  S+G+  G CG++
Sbjct: 740 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 781

Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G           +AC+G+  C++ + +A  G  +G   G+   L V+A C
Sbjct: 782 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 828


>gi|16905220|gb|AAL31090.1|AC091749_19 putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|22655745|gb|AAN04162.1| Putative beta-galactosidase [Oryza sativa Japonica Group]
          Length = 824

 Score =  682 bits (1760), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/832 (43%), Positives = 490/832 (58%), Gaps = 51/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 27  VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY FEG +D++RF K +Q AGL+  LRIGPY C EWNYGG P WL  IP +QFR  N P
Sbjct: 87  RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   II+ MK  N+FA QGGPIILAQ+ENEYGNV  +         Y+ W AD
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG 
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +         +    Y   S   A F+ N + + 
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDANYSDNVTVTKYTLGSTS-ACFINNRNDNK 385

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q         ++ N+ E    S  
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPESLK 441

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  S+         +F+N  +
Sbjct: 442 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 499

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +   ++L++G N + +LS  +GL+NYG  F+  
Sbjct: 500 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 559

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK        W   + T+P
Sbjct: 560 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 615

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           +N+   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A   G    CDY
Sbjct: 616 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 675

Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
           RG +    D  KC   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS++   +   
Sbjct: 676 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 735

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
             +C      D                   + L+C +    I+ I+  S+G+  G CG++
Sbjct: 736 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 777

Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G           +AC+G+  C++ + +A  G  +G   G+   L V+A C
Sbjct: 778 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 824


>gi|320170852|gb|EFW47751.1| beta-galactosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 851

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/839 (42%), Positives = 498/839 (59%), Gaps = 54/839 (6%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYD RAL++DG+RR+L +G IHYPRSTPE+WPEL  ++K  GL+VI+TY+FW+ ++
Sbjct: 47  AMNVTYDSRALLLDGQRRLLIAGCIHYPRSTPEMWPELFARAKANGLDVIQTYLFWDVNQ 106

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G++    RFD VRF+K  Q+AGL ++ RIGPY CAEWNYGGFP WL  I GI FR  
Sbjct: 107 PTPGEFVMTDRFDYVRFIKLAQQAGLMVNFRIGPYVCAEWNYGGFPAWLRQISGIVFRDN 166

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           + P+ + +  ++ K + ++K   L A+ GGP+IL Q+ENEYGN+E +Y  GG  YV+W  
Sbjct: 167 DKPWLDVVGPYITKTVQVLKDNKLLAADGGPVILLQIENEYGNIEDSYA-GGPAYVQWCG 225

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A +LN    W+MCQQ+DAP   I TCNGFYCD + P+   +P+MWTEN+ GWF ++G 
Sbjct: 226 QLAASLNAGAQWIMCQQDDAPANTIATCNGFYCDNYVPHK-GQPMMWTENWPGWFQTWGQ 284

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P RP +D+AFA ARF+  GGT+ +YYMY GGTNFGRTAGGP + TSYDYD  +DEYG 
Sbjct: 285 PSPHRPAQDVAFAAARFYAKGGTYMSYYMYHGGTNFGRTAGGPGITTSYDYDVALDEYGM 344

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             +PK+ HL  LH  +   E  ++S + P    LG  LEAH+++ SS  C AFL+N DSS
Sbjct: 345 PSEPKYSHLGSLHAVLHANEHIIMSMNVPAPISLGKNLEAHVFNSSSG-CVAFLSNIDSS 403

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----------------NGDH 404
            DA V FNG  + LPAWSVSIL +C   ++NTA V +  N                  DH
Sbjct: 404 VDAEVQFNGRTFELPAWSVSILHNCAFAIYNTAAVSAPLNARRMTPLVVHEDAVSDAADH 463

Query: 405 PFAQQKNV-NELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIH 463
             +  K    E + A S F+ Y E +G     +       EQINTT DT+DYLWYT + +
Sbjct: 464 RRSLSKGEGQERVGAFSTFASYAETIGRRAEEAVYFTSPQEQINTTNDTTDYLWYTTTYN 523

Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
                 +   L+I ++     V+VN++ V   +         +NK + L  G N +D+LS
Sbjct: 524 SASATSQ--VLSISNVNDVVYVYVNRQFVTMSWSGS------VNKAVPLMAGTNVIDVLS 575

Query: 524 MMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANS 583
              GLQNYG + +    G+   +    K G  DL+   W +QVG+ GE +G+     A++
Sbjct: 576 TTFGLQNYGTFLEQVTRGIQGTV----KLGSTDLTQNGWWHQVGLLGEELGIFLPQNASN 631

Query: 584 SFWKQGSTLPVNKSLIWYKTTFLAPE-GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
             W   +T   N+ L WY+++F  P+  + PLAL++  MGKG  WVNG ++GRYW + +A
Sbjct: 632 VPWATPAT--TNRGLTWYRSSFDLPQSSQAPLALDMTGMGKGFVWVNGHNLGRYWPSRIA 689

Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKIS 702
            S  C   CDYRG+YD S+C++ C  P+Q  YH+PR W+ P  NL+V+ EE+GG+P+ IS
Sbjct: 690 DSMAC-DDCDYRGAYDDSRCRQGCNIPSQRYYHVPREWLQPTNNLIVMLEEIGGNPALIS 748

Query: 703 LLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEG 762
           L+ +     C  V E  P        +L VV       L C     I  + FAS+G P G
Sbjct: 749 LVEREEDISCGAVGEDYP------ADDLSVV-------LGCGLHQTIRRVEFASFGTPVG 795

Query: 763 NCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            C  F  G+C+  +   IV+  C+G+  C +PV+  + G     CP   K L V+  C+
Sbjct: 796 TCRQFSLGSCNAANSTAIVESLCLGRQACHVPVAINHFG---DPCPDTTKRLFVQVSCA 851


>gi|356509519|ref|XP_003523495.1| PREDICTED: beta-galactosidase 13-like [Glycine max]
          Length = 844

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/837 (42%), Positives = 501/837 (59%), Gaps = 53/837 (6%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYD ++L I+G+R +L SGS+HY RSTP++WP+++ K++ GGL VI+TYVFWN HE
Sbjct: 43  ARNVTYDGKSLFINGRREILFSGSVHYTRSTPDMWPDILDKARRGGLNVIQTYVFWNAHE 102

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G++ F+G +DLV+F++ VQ  G+F+ LR+GP+  AEWN+GG P WL  +PGI FR+ 
Sbjct: 103 PEPGKFNFQGNYDLVKFIRLVQAKGMFVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSD 162

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N P+K  MK F++KII +MK E LFA QGGPIILAQ+ENEY +++ AY   G+ YV+WAA
Sbjct: 163 NEPYKFHMKAFVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAA 222

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
           + AV  +  VPW+MC+Q DAPDP+IN CNG +C D F  PN P KP +WTEN++  +   
Sbjct: 223 NMAVATDIGVPWLMCKQRDAPDPVINACNGRHCGDTFAGPNKPYKPAIWTENWTAQYRVH 282

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD-APIDE 298
           G     R  ED+AF+VARFF   G   NYYMY GGTNFGRT+   + +T+  YD AP+DE
Sbjct: 283 GDPPSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTSS--VFSTTRYYDEAPLDE 340

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANY 357
           YG  R+PKW HLR++HKA+ LC   ++   P+ QKL    E   + +  +N CAAF+ N 
Sbjct: 341 YGLPREPKWSHLRDVHKALLLCRRAILGGVPSVQKLNHFHEVRTFERVGTNMCAAFITNN 400

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
            +   A + F G  YFLP  S+SILPDCK VVFNT +++SQ N+ ++         E   
Sbjct: 401 HTMEPATINFRGTNYFLPPHSISILPDCKTVVFNTQQIVSQHNSRNY---------ERSP 451

Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GK 470
           A++ F W  + E +  +       P  AE  +  KDT+DY WYT S  +         G 
Sbjct: 452 AANNFHWEMFNEAIPTAKKMPINLPVPAELYSLLKDTTDYAWYTTSFELSQEDMSMKPGV 511

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
              L + SLGH+ + FVN  +V   +G H+  +F     + L  G N + +LS  VGL +
Sbjct: 512 LPVLRVMSLGHSMVAFVNGDIVGTAHGTHEEKSFEFQTPVLLRVGTNYISLLSSTVGLPD 571

Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
            GA+ +   AG  S+ ++ L  G  DL+   W ++VG++GE   +     + S  WK   
Sbjct: 572 SGAYMEHRYAGPKSINILGLNRGTLDLTRNGWGHRVGLKGEGKKVFSEEGSTSVKWKPLG 631

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
            +P  ++L WY+T F  PEG GP+A+ ++ M KG  WVNG +IGRYW +YL+P       
Sbjct: 632 AVP--RALSWYRTRFGTPEGTGPVAIRMSGMAKGMVWVNGNNIGRYWMSYLSP------- 682

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
                           G+P Q+ YHIPR++++P +NLLVI EE    P+++ +L      
Sbjct: 683 ---------------LGKPTQSEYHIPRSFLNPQDNLLVIFEEEARVPAQVEILNVNRDT 727

Query: 711 ICSFVSEADPPPVDSWKPNLG-----VVSSSPQVRLACERGWHIAAINFASYGIPEGNCG 765
           ICS V E DP  V+SW    G     V S      +AC  G  I A+ FAS+G P G CG
Sbjct: 728 ICSVVGERDPANVNSWVSRRGNFHPVVKSVGAAASMACATGKRIVAVEFASFGNPSGYCG 787

Query: 766 SFRPGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSA-GACPGLLKALAVEAHCS 820
            F  G+C+      IV++ C+GQ  C++ +  A    +   ACP L+K LAV+  C+
Sbjct: 788 DFAMGSCNAAASKQIVERECLGQEACTLALDRAVFNNNGVDACPDLVKQLAVQVRCA 844


>gi|156106159|gb|ABU49386.1| beta-galactosidase 15 [Oryza sativa Indica Group]
          Length = 828

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/832 (43%), Positives = 491/832 (59%), Gaps = 51/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31  VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G +D+VRF K +Q AGL+  LRIGPY C EWNYGG P WL  IPG+QFR  N P
Sbjct: 91  RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   I++ MK  N+FA QGGPIILAQ+ENEYGN+  +         Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG 
Sbjct: 271 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 330

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +         +    Y   S   A F+ N + + 
Sbjct: 331 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTS-ACFINNRNDNK 389

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q         ++ N+ E    +  
Sbjct: 390 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPENLK 445

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  S+         +F+N  +
Sbjct: 446 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 503

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +   ++L++G N + +LS  +GL+NYG  F+  
Sbjct: 504 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 563

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK        W   + T+P
Sbjct: 564 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 619

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           +N+   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A   G    CDY
Sbjct: 620 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 679

Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
           RG +    D  KC   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS++   +   
Sbjct: 680 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 739

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
             +C      D                   + L+C +    I+ I+  S+G+  G CG++
Sbjct: 740 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 781

Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G           +AC+G+  C++ + +A  G  +G   G+   L V+A C
Sbjct: 782 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 828


>gi|218184317|gb|EEC66744.1| hypothetical protein OsI_33101 [Oryza sativa Indica Group]
          Length = 824

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/832 (43%), Positives = 490/832 (58%), Gaps = 51/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 27  VAYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY FEG +D++RF K +Q AGL+  LRIGPY C EWNYGG P WL  IP +QFR  N P
Sbjct: 87  RQYNFEGNYDIIRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPQMQFRMHNAP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   II+ MK  N+FA QGGPIILAQ+ENEYGNV  +         Y+ W AD
Sbjct: 147 FENEMENFTTLIINKMKDANMFAGQGGPIILAQIENEYGNVMGQLNNNQSASEYIHWCAD 206

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 207 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG 
Sbjct: 267 PDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 326

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +         +    Y   S   A F+ N + + 
Sbjct: 327 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDNVTVTKYTLGSTS-ACFINNRNDNK 385

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +GN + LPAWSVSILPDCK V FN+AK+ +Q         ++ N+ E    +  
Sbjct: 386 DLNVTLDGNTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----IMVKKANMVEKEPENLK 441

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  S+         +F+N  +
Sbjct: 442 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSLDHKGEASYTLFVN--T 499

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +   ++L++G N + +LS  +GL+NYG  F+  
Sbjct: 500 TGHELYAFVNGMLVGKNHSPNGHFVFQLESAVKLHDGKNYISLLSATIGLKNYGPLFEKM 559

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS-TLP 593
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK        W   + T+P
Sbjct: 560 PAGIVGGPVKLIDNNGTGIDLSNSSWSYKAGLAGEYRQIHLDKPGYR----WDNNNGTVP 615

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
           +N+   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A   G    CDY
Sbjct: 616 INRPFTWYKTTFQAPAGQDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDY 675

Query: 654 RGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTG 708
           RG +    D  KC   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS++   +   
Sbjct: 676 RGVFQAEGDGQKCLTGCGEPSQRYYHVPRSFLKNGEPNTLILFEEAGGDPSQVIFHSVVA 735

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSF 767
             +C      D                   + L+C +    I+ I+  S+G+  G CG++
Sbjct: 736 GSVCVSAEVGDA------------------ITLSCGQHSKTISTIDVTSFGVARGQCGAY 777

Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G           +AC+G+  C++ + +A  G  +G   G+   L V+A C
Sbjct: 778 EGGCESKAAYKAFTEACLGKESCTVQIINALTG--SGCLSGV---LTVQASC 824


>gi|357473809|ref|XP_003607189.1| Beta-galactosidase [Medicago truncatula]
 gi|355508244|gb|AES89386.1| Beta-galactosidase [Medicago truncatula]
          Length = 825

 Score =  680 bits (1755), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/834 (42%), Positives = 503/834 (60%), Gaps = 50/834 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  +TYD R+L++DGK  +  SGSIHYPRSTP++WP+++ K++ GGL +I+TYVFWN HE
Sbjct: 25  AQTITYDGRSLLLDGKGELFFSGSIHYPRSTPDMWPDILDKARRGGLNLIQTYVFWNGHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P + +  FEGR+DLV+F+K VQE G+++ LRIGP+  AEWN+GG P WL  +P I FR+ 
Sbjct: 85  PEKDKVNFEGRYDLVKFLKLVQEKGMYVTLRIGPFIQAEWNHGGLPYWLREVPDIIFRSN 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK+ MK +++ +I+ MK+E LFA QGGPIILAQ+ENEY +++ AY   G+ YV+WAA
Sbjct: 145 NEPFKKYMKEYVSIVINRMKEEKLFAPQGGPIILAQIENEYNHIQLAYEADGDNYVQWAA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
             AV+L   VPWVMC+Q+DAPDP+IN CNG +C D FT PN P KP +WTEN++  +  F
Sbjct: 205 KMAVSLYNGVPWVMCKQKDAPDPVINACNGRHCGDTFTGPNKPYKPFIWTENWTAQYRVF 264

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  ED+AF+VARFF   G+  NYYMY GGTNFGRT       T Y  +AP+DE+
Sbjct: 265 GDPPSQRSAEDIAFSVARFFSKHGSLVNYYMYHGGTNFGRTTSA-FTTTRYYDEAPLDEF 323

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKS-SNDCAAFLANYD 358
           G  R+PKW HLR+ HKA+ LC++ L++  PT QK+    E  +Y K  SN CAAF+ N  
Sbjct: 324 GLQREPKWSHLRDAHKAVNLCKKSLLNGVPTTQKISQYHEVIVYEKKESNLCAAFITNNH 383

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           + +   ++F G+ YFLP  S+SILPDCK VVFNT  + SQ ++    F + K  N+    
Sbjct: 384 TQTAKTLSFRGSDYFLPPRSISILPDCKTVVFNTQNIASQHSS--RHFEKSKTGND---- 437

Query: 419 SSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG---QGKEV- 472
              F W  + E +  +      +   AE  +  KD +DY WYT S+ + P    +  +V 
Sbjct: 438 ---FKWEVFSEPIPSAKELPSKQKLPAELYSLLKDKTDYGWYTTSVELGPEDIPKKSDVA 494

Query: 473 -FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L I SLGH+   FVN + +   +G+H+   F   K +    G+N + IL+ +VGL + 
Sbjct: 495 PVLRILSLGHSLQAFVNGEYIGSKHGSHEEKGFEFQKPVNFKVGVNQIAILANLVGLPDS 554

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           GA+ +   AG  ++ ++ L +G  DL+S  W +QVG++GE   +     +    WK G  
Sbjct: 555 GAYMEHRYAGPKTITILGLMSGTIDLTSNGWGHQVGLQGENDSIFTEKGSKKVEWKDGKG 614

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
                ++ WYKT F  PEG  P+A+ +  M KG  WVNG+SIGR+W +YL+P        
Sbjct: 615 --KGSTISWYKTNFDTPEGTNPVAIGMEGMAKGMIWVNGESIGRHWMSYLSP-------- 664

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
                          G+P Q+ YHIPR+++ P +NLLVI EE    P KI++LT     I
Sbjct: 665 --------------LGKPTQSEYHIPRSFLKPKDNLLVIFEEEAISPDKIAILTVNRDTI 710

Query: 712 CSFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGS 766
           CSF++E  PP + S+      +       +P+  + C     I A+ FAS+G P G CGS
Sbjct: 711 CSFITENHPPNIRSFASKNQKLERVGENLTPEAFITCPDQKKITAVEFASFGDPSGFCGS 770

Query: 767 FRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           F  G C+      IV++ C+G+  CS+P+  A        CP ++K LA++  C
Sbjct: 771 FIMGKCNAPSSKKIVEQLCLGKPTCSVPMVKATFTGGNDGCPDVVKTLAIQVKC 824


>gi|357133576|ref|XP_003568400.1| PREDICTED: beta-galactosidase 7-like [Brachypodium distachyon]
          Length = 821

 Score =  680 bits (1754), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/824 (44%), Positives = 499/824 (60%), Gaps = 51/824 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD RAL+++G RR+L SG +HY RSTPE+WP++I K+++GG++VI+TYVFWN HEP++
Sbjct: 39  VTYDGRALLLNGTRRMLFSGEMHYTRSTPEMWPKIIAKARKGGIDVIQTYVFWNVHEPVQ 98

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y FEGR+++V+F++ +Q  GL++ LRIGP+  AEW YGGFP WLH +P I FRT N P
Sbjct: 99  GKYNFEGRYNIVKFIREIQAQGLYVSLRIGPFIEAEWKYGGFPFWLHEVPNITFRTDNEP 158

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK+ M+ F+  ++++MK E L+  QGGPII++Q+ENEY  VE A+G GG  YV+WAA  A
Sbjct: 159 FKQHMQGFVTHMVNMMKNEGLYYPQGGPIIISQIENEYQMVEPAFGPGGPRYVQWAASLA 218

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L T VPW+MC+Q DAPDPIINTCNG  C + F  PNSP+KP +WTEN++  +  +G  
Sbjct: 219 VGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPNKPALWTENWTTRYPIYGND 278

Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
              R   D+ FAVA F    GG+F +YYMY GGTNFGR A    V TSY   AP+DEYG 
Sbjct: 279 TKLRSTGDITFAVALFIARKGGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGL 337

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           I QP WGHL+ELH A+KL  E L+    ++  LG   EAH++ ++   C AFL N+D   
Sbjct: 338 IWQPTWGHLKELHAAVKLSSEPLLYGTYSNFSLGEDQEAHVF-ETKLKCVAFLVNFDKHQ 396

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              V F      L   S+SIL DC+ VVF T KV +Q  +      Q  N          
Sbjct: 397 RPTVIFRNISLQLAPKSISILSDCRTVVFETGKVNAQHGSRTAEVVQSLN--------DT 448

Query: 422 FSWYEEKVGISGNRS---FVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKE-VFLNIE 477
            +W   K  I  + S   +    L E ++TTKD +DYLWY AS    P      V LN+E
Sbjct: 449 HTWKAFKESIPQDISKAAYTGKQLFEHLSTTKDETDYLWYIASYEYRPSDDSHLVLLNVE 508

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLI-NKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           S  H    FVN + V   +G+H    ++I N  I L EG NT+ +L++MVG  + GA  +
Sbjct: 509 SQAHILHAFVNGEFVGSVHGSHGARGYIILNMTISLKEGQNTISLLNVMVGSPDSGAHME 568

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
               G+  V +   ++    L++  W YQVG+ GE   +     ++S  W   + L    
Sbjct: 569 RRSFGIHKVSIQQGQHALHLLNNELWGYQVGLFGEGNRIYTQEGSHSVEWTDVNNL-TYL 627

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WY+TTF  P G   + LNL SMGKG+ W+NG+SIGRYW ++  PS            
Sbjct: 628 PLTWYQTTFATPMGNDAVTLNLTSMGKGEVWINGESIGRYWVSFKTPS------------ 675

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
                     GQP+Q+LYHIP+ ++   +NLLV+ EE+GG+P +I++ T +   +CS V+
Sbjct: 676 ----------GQPSQSLYHIPQHFLKNTDNLLVLVEEMGGNPLQITVNTVSITTVCSSVN 725

Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV 776
           E   PPV S           P+VRL C++G HI+A+ FASYG P G+C +F  G+CH + 
Sbjct: 726 ELSAPPVQS-------QGKDPEVRLRCQKGKHISAVEFASYGNPAGDCRTFTIGSCHAES 778

Query: 777 L-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
              +V++AC+G+  CSIPV     G     CPG+ K+L V AHC
Sbjct: 779 SESVVKQACIGKRSCSIPVGPGSFG--GDPCPGIQKSLLVVAHC 820


>gi|224135691|ref|XP_002327281.1| predicted protein [Populus trichocarpa]
 gi|222835651|gb|EEE74086.1| predicted protein [Populus trichocarpa]
          Length = 788

 Score =  678 bits (1749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/828 (45%), Positives = 491/828 (59%), Gaps = 77/828 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYD R+L+IDG+R+++ SGSIHYPRSTPE+WP LI K+KEGGL+ IETYVFWN HEP 
Sbjct: 25  DVTYDGRSLIIDGQRKIVFSGSIHYPRSTPEMWPSLIAKAKEGGLDAIETYVFWNVHEPQ 84

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G Y F G  D+VRF+K VQ  GL+  LRIGP+  +EW+YGG P WLH IPGI FR+ N 
Sbjct: 85  PGHYDFSGGHDIVRFIKEVQAQGLYACLRIGPFIQSEWSYGGLPFWLHDIPGIVFRSDNE 144

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+ F AK++ +M+ ENL+ASQGGPIIL+Q+ENEYG V+ AYG  G  YV+WAA  
Sbjct: 145 PFKVYMQNFTAKVVSMMQSENLYASQGGPIILSQIENEYGTVQKAYGQEGLAYVQWAAQM 204

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG--FTPNSPSKPIMWTENYSGWFLSFGY 241
           A  L T VPWVMC+Q +AP  +IN+CNG  C      PNSP+KP +WTEN++        
Sbjct: 205 AEGLQTGVPWVMCKQNNAPGHVINSCNGMKCGQTFVGPNSPNKPSIWTENWT-------- 256

Query: 242 AVPFRPVEDLAFAVARFFET-GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
               +  ED+AF V  F     G+F NYYMY GGTNFGRTA    V TSY   AP+DEYG
Sbjct: 257 ---TQSAEDIAFHVTLFIAAKKGSFVNYYMYHGGTNFGRTASA-FVTTSYYDQAPLDEYG 312

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
              QPKWGHL+ELH AIKLC   L+S    +  LG + +A+I++  S +CAAFL N DSS
Sbjct: 313 LTTQPKWGHLKELHAAIKLCSTPLLSGVQVNLYLGPQQQAYIFNAVSGECAAFLINNDSS 372

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           + A+V F    Y LP  S+SILPDCKNV    +   + R  G           E+L A+ 
Sbjct: 373 NAASVPFRNASYDLPPMSISILPDCKNV----STQYTTRTMGR---------GEVLDAAD 419

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  + E +    + S     L EQ+NTTKD+SDYLWYT           +  L++ SLG
Sbjct: 420 VWQEFTEAIPNFDSTSTRSETLLEQMNTTKDSSDYLWYTFRFQ-HESSDTQAILDVSSLG 478

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   FVN + V    G+     F     + L++GIN + +LS+MVG+ + GA+ +   A
Sbjct: 479 HALHAFVNGQAVGSVQGSRKNPRFKFETSVSLSKGINNVSLLSVMVGMPDSGAFLENRAA 538

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
           GL +V++ D K    D ++  W YQ+G++GE + +     ++   WK+ S       L W
Sbjct: 539 GLRTVMIRD-KQDNNDFTNYSWGYQIGLQGETLQIYTEQGSSQVQWKKFSN--AGNPLTW 595

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKT   AP G  P+ LNLASMGKG+AWVNGQSIGRYW     PS                
Sbjct: 596 YKTQVDAPPGDVPVGLNLASMGKGEAWVNGQSIGRYW-----PS---------------- 634

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
                        YH+PR+++ P  NLLV+ EE GG+P ++SL T T   +C  V+ +  
Sbjct: 635 -------------YHVPRSFLKPTGNLLVLQEEEGGNPLQVSLDTVTISQVCGHVTASHL 681

Query: 721 PPVDSW-------KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNC-GSFRPGAC 772
            PV SW       K    V    P+V LAC     I+ I+FASYG P GNC  S   G C
Sbjct: 682 APVSSWIEHNQRYKNPAKVSGRRPKVLLACPSKSKISRISFASYGTPLGNCRNSMAVGTC 741

Query: 773 H-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           H  +   +V++AC+G+++CSIPVS    G     CP   K+L V A C
Sbjct: 742 HSQNSKAVVEEACLGKMKCSIPVSVRQFG--GDPCPAKAKSLMVVAEC 787


>gi|449464182|ref|XP_004149808.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
          Length = 801

 Score =  677 bits (1747), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/830 (44%), Positives = 506/830 (60%), Gaps = 56/830 (6%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           + TYD R+L+++G+ ++L SGSIHYPRSTP++WP LI K+KEGG++VI+TYVFWN HEP 
Sbjct: 15  SATYDGRSLIVNGEHKLLFSGSIHYPRSTPDMWPSLIAKAKEGGIDVIQTYVFWNLHEPQ 74

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +G Y F GR D+VRFVK +Q  GL+  LRIGP+  AEW+YGG P WLH + GI +R+ N 
Sbjct: 75  QGTYEFSGRRDIVRFVKEIQAQGLYACLRIGPFIEAEWSYGGLPFWLHDVLGIVYRSDNE 134

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK  M+ F  KI+++MK E L+ASQGGPIIL+Q+ENEY  VE A+G  G  YV+WAA  
Sbjct: 135 PFKLHMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYTLVEAAFGEKGPPYVQWAAKM 194

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
           AV+L T VPW MC+Q DAPDP+INTCNG  C + FT PNSP+KP +WTEN++ ++ ++G 
Sbjct: 195 AVSLQTGVPWSMCKQNDAPDPVINTCNGMRCGETFTGPNSPNKPSIWTENWTSFYQTYGE 254

Query: 242 AVPFRPVEDLAFAVARFFET-GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
               R  E++AF VA F     GT+ NYYMY GGTNFGR+A   ++   YD  +P+DEYG
Sbjct: 255 EPYIRSAEEIAFHVALFIAAKNGTYVNYYMYHGGTNFGRSASAFMITGYYD-QSPLDEYG 313

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PKWGHL+ELH A+KLC   L++   ++  LG  +EA ++   SN+CAAFL N   +
Sbjct: 314 LTREPKWGHLKELHAAVKLCSTPLLTGTKSNFSLGQSVEAIVFKTESNECAAFLVN-RGA 372

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            D+NV F    Y LP  S+SILPDCKNV FNT +V  Q N       Q+ ++ E      
Sbjct: 373 IDSNVLFQNVTYELPLGSISILPDCKNVAFNTRRVSVQHNTRSMMAVQKFDLLE------ 426

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  ++E +    +      +L E + TTKD SDYLWYT  +       ++  L ++S  
Sbjct: 427 -WEEFKEPIPNIDDTELRANELLEHMGTTKDRSDYLWYTFRVQQDSPDSQQT-LEVDSRA 484

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   FVN       +G +    F + K I L  GIN + +LS+MVGL + GA+ +   A
Sbjct: 485 HALHAFVNGDYAGSAHGIYKEKGFSLAKNITLRNGINNISLLSVMVGLPDSGAFLETRVA 544

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGE--YIGLDKISLANSSFWKQGSTLPVNKSL 598
           GL  V +        D S   W Y+VG+ GE   I LD  S +N  + + G++   ++ L
Sbjct: 545 GLRRVGI-----QGEDFSEQHWGYKVGLSGEQSQIFLDTGS-SNVQWSRLGNS---SQPL 595

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYKT F AP G  P+ALNL SMGKG  WVNG+ IGRYW ++L P               
Sbjct: 596 TWYKTQFDAPPGDDPIALNLGSMGKGAVWVNGRGIGRYWVSFLTPK-------------- 641

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
                   G+P+Q  Y++PR+++ P +N LVI EE  G+P +ISL +      C  VSE+
Sbjct: 642 --------GEPSQKWYNVPRSFLKPTDNQLVILEEETGNPVEISLDSVLITKTCGQVSES 693

Query: 719 DPPPVDSW----KPNLGVV---SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
             P V SW    K  +  V   +  P+V+L+C     I+ I FAS+G P G+C S+  G 
Sbjct: 694 HYPLVASWMGAKKQKVRRVKNRTRRPKVQLSCPSKKKISNILFASFGTPSGDCQSYAIGL 753

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CH  +   IV+ AC+G+ +CSIP+S+  L      CP + K L V+A C+
Sbjct: 754 CHSPNSRAIVEHACLGRAKCSIPISN--LNFRGDPCPHVTKTLLVDAQCT 801


>gi|242081931|ref|XP_002445734.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
 gi|241942084|gb|EES15229.1| hypothetical protein SORBIDRAFT_07g024870 [Sorghum bicolor]
          Length = 844

 Score =  675 bits (1741), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/831 (42%), Positives = 494/831 (59%), Gaps = 48/831 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           ++YD R+L++DG+R +  SGSIHYPRS P++WPELI K+KEGGL  IETYVFWN HEP +
Sbjct: 38  ISYDRRSLMVDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 97

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQ+ FEGR+D+V+F K +QE  +F  +R+GP+  AEWN+GG P WL  IP I FRT N P
Sbjct: 98  GQFNFEGRYDMVKFFKLIQEHDMFAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 157

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K  M+ F+  +I  +K  NLFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A
Sbjct: 158 YKMHMETFVKIVIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKEEGTKYIHWAAQMA 217

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK--PIMWTENYSGWFLSFGYA 242
           +  N  +PW+MC+Q  AP  +I TCNG  C    P   +K  P++WTEN++  +  FG  
Sbjct: 218 IGTNIGIPWIMCKQTKAPGDVIPTCNGRNCGDTWPGPMNKTMPLLWTENWTAQYRVFGDP 277

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AFAVARFF  GGT  NYYMY GGTNFGRTA   ++   YD +AP+DE+G  
Sbjct: 278 PSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTAAAFVMPKYYD-EAPLDEFGLY 336

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
           ++PKWGHLR+LH A+KLC++ L+   P+ +KLG +LEA ++       C AFL+N+++  
Sbjct: 337 KEPKWGHLRDLHLALKLCKKALLWGKPSTEKLGKQLEARVFEIPEQKVCVAFLSNHNTKD 396

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  +TF G  YF+P  S+SIL DCK VVF T  V +Q N     FA Q N N +      
Sbjct: 397 DVTLTFRGQPYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHFADQTNQNNVWQM--- 453

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG-----QGKEVFLNI 476
             + EEKV             A+  N TKD +DY+WYT+S  + P      +  +  + +
Sbjct: 454 --FDEEKVPKYKQAKIRTRKAADLYNLTKDKTDYVWYTSSFKLEPDDMPIRRDIKTVVEV 511

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GHA++ FVN K    G+G      F + K +EL +G+N + +L+  +G+ + GA+ +
Sbjct: 512 NSHGHASVAFVNNKFAGCGHGTKMNKAFTLEKPMELKKGVNHVAVLASSMGMMDSGAYLE 571

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
              AG+  V +  L  G  DL++  W + VG+ GE   +       S  WK       +K
Sbjct: 572 HRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGEQKEIYTEKGMASVTWKPAVN---DK 628

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK  F  P G+ P+ L++++MGKG  +VNGQ IGRYW +Y                
Sbjct: 629 PLTWYKRHFDMPSGEDPIVLDMSTMGKGMMYVNGQGIGRYWMSY---------------- 672

Query: 657 YDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
                  KH  G+P+Q LYHIPR+++ P +N+LV+ EE  G P  I +LT    +IC+++
Sbjct: 673 -------KHALGRPSQQLYHIPRSFLRPKDNVLVLFEEEFGRPDAIMILTVKRDNICTYI 725

Query: 716 SEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           SE +P  + SW+     ++++      +  L C     I  + FASYG P G CG++  G
Sbjct: 726 SERNPAHIKSWERKDSQITATADDLKARATLTCPPKKLIQQVVFASYGNPVGICGNYTIG 785

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +CH      +V+K+C+G+  C++PVS+   G     CPG    LAV+A CS
Sbjct: 786 SCHTPRAKEVVEKSCLGKRTCTLPVSADVYGGDVN-CPGTTATLAVQAKCS 835


>gi|357130214|ref|XP_003566745.1| PREDICTED: beta-galactosidase 13-like [Brachypodium distachyon]
          Length = 829

 Score =  674 bits (1740), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/832 (43%), Positives = 491/832 (59%), Gaps = 49/832 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y+ RALVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP  
Sbjct: 30  VAYNDRALVIDGQRRIVLSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPRP 89

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G +D+VRF K +Q AG++  LRIGPY C EWNYGG P WL  IPG+QFR  N P
Sbjct: 90  RQYNFAGNYDIVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRMHNQP 149

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   I++ +K  N+FA QGGPIIL+Q+ENEYGN+            Y+ W A 
Sbjct: 150 FEHEMETFTTLIVNKLKDANMFAGQGGPIILSQIENEYGNIMANLTDAQSASEYIHWCAA 209

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  +INTCNGFYC  + P     P +WTEN++GWF ++  
Sbjct: 210 MANKQNVGVPWIMCQQDADVPPNVINTCNGFYCHDWFPKRTDIPKIWTENWTGWFKAWDK 269

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  +D+AFAVA FF+  G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+DEYG 
Sbjct: 270 PDFHRSAQDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLDEYGN 329

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IR+PK+GHL++LH  +K  E+ L+  D +    G  +    Y    +    F++N     
Sbjct: 330 IREPKYGHLKDLHAVLKSMEKILVHGDFSDINYGRNVTVTKYTLDGSS-VCFISNQFDDR 388

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           DAN T +G  + +PAWSVS+LPDCK V +NTAK+ +Q +       ++ N  E    +  
Sbjct: 389 DANATIDGTTHVVPAWSVSVLPDCKAVAYNTAKIKAQTS----VMVKKPNTVEQEPENLK 444

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         SF + +L EQI T+ D SDYLWY  S     G+ K   L++ +
Sbjct: 445 WSWMPEHLKPFMTDEKGSFRKNELLEQITTSTDQSDYLWYRTSFE-HKGEAK-YKLSVNT 502

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN KL    +  +    F +   ++L++G N L +LS  +GL+NYGA F++ 
Sbjct: 503 TGHQIYAFVNGKLAGRQHSPNGAFIFQLESPVKLHDGKNYLSLLSATMGLKNYGALFELM 562

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
            AG+    V L+D      DLS+  W Y+ G+ GE+  I LDK       +     T+P+
Sbjct: 563 PAGIVGGPVKLVDNNGSTIDLSNSSWSYKAGLAGEHRQIHLDKPGY---KWHGDNGTIPI 619

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           N++  WYK TF AP G+  +  +L  + KG AWVNG ++GRYW +Y+A   G    CDYR
Sbjct: 620 NRAFTWYKATFQAPAGEEAVVADLMGLNKGVAWVNGNNLGRYWPSYVAAEMGGCHHCDYR 679

Query: 655 GSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTGQ 709
           G++    D  KC   C +PAQ  YH+PR ++  GE N +V+ EE GGDPS++   T    
Sbjct: 680 GAFKAEGDGLKCLTGCNEPAQRFYHVPRVFLRAGEPNTVVLFEEAGGDPSRVGFHTVAVG 739

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC--ERGWHIAAINFASYGIPEGNCGSF 767
            +C   +E                     V L+C   +G  I++++ ASYG+  G CG++
Sbjct: 740 PVCVEAAE-----------------KGDNVTLSCGQHKGRTISSVDLASYGVTRGQCGAY 782

Query: 768 RPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           + G           +ACVG+  C++  + A+ G  AG   G+   L V+A C
Sbjct: 783 QGGCESKAAYEAFAEACVGKESCTVQHTDAFSG--AGCQSGV---LTVQATC 829


>gi|224142776|ref|XP_002324727.1| predicted protein [Populus trichocarpa]
 gi|222866161|gb|EEF03292.1| predicted protein [Populus trichocarpa]
          Length = 749

 Score =  673 bits (1736), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/756 (45%), Positives = 466/756 (61%), Gaps = 43/756 (5%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +WPEL +K+KEGG++ IETY+FW+ HEP+R QYYF G  D+V+F K  QEAGL + LRIG
Sbjct: 1   MWPELFQKAKEGGIDAIETYIFWDRHEPVRRQYYFSGNQDIVKFCKLAQEAGLHVILRIG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           PY CAEW+YGGFP+WLH IPGI+ RT N  +K EM+ F  KI+D+ K+  LFA QGGPII
Sbjct: 61  PYVCAEWSYGGFPMWLHNIPGIELRTDNEIYKNEMQIFTTKIVDVCKEAKLFAPQGGPII 120

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
           LAQ+ENEYGNV   YG  G  YV W A  AV  N  VPW+MCQQ +AP P+INTCNGFYC
Sbjct: 121 LAQIENEYGNVMGPYGDAGRRYVNWCAQMAVGQNVGVPWIMCQQSNAPQPMINTCNGFYC 180

Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
           D F PN+P  P MWTEN+SGWF  +G   P+R  EDLAF+VARF + GG   +YYMY GG
Sbjct: 181 DQFKPNNPKSPKMWTENWSGWFKLWGGRDPYRTAEDLAFSVARFIQNGGVLNSYYMYHGG 240

Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
           TNFGRTAGGP + TSYDY+AP+DEYG + QPKWGHL++LH+AIK  E  L +   T +  
Sbjct: 241 TNFGRTAGGPYITTSYDYNAPLDEYGNLNQPKWGHLKQLHEAIKQGERILTNGTVTSKNF 300

Query: 335 GAKLEAHIY-HKSSNDCAAFLANYDSSSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNT 392
              ++   Y ++ + +   FL+N +   +ANV    +  Y LPAWSV+IL DC   ++NT
Sbjct: 301 WGGVDQTTYTNQGTGERFCFLSNTN-MEEANVDLGQDGKYSLPAWSVTILQDCNKEIYNT 359

Query: 393 AKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEE--KVGISGNRSFVRPDLAEQINTTK 450
           AKV +Q +       ++    +L     +++W  E  K  + G   F   +L EQ  TT 
Sbjct: 360 AKVNTQTSIMVKKLHEEDKPVQL-----SWTWAPEPMKGVLQGKGRFRATELLEQKETTV 414

Query: 451 DTSDYLWYTASIHVMPGQGKE---VFLNIESLGHAALVFVNKKLVAFGYGNH-------- 499
           DT+DYLWY  S+++     K+   V L + + GH    +VNKK +   +           
Sbjct: 415 DTTDYLWYMTSVNLNETTLKKWTNVTLRVGTRGHTLHAYVNKKEIGTQFSKQANAQQSVK 474

Query: 500 -DFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGK--RD 556
            D  +FL  K + L  G NT+ +LS  VGL NYG ++D    G+    +  + NGK   D
Sbjct: 475 GDDYSFLFEKPVTLTSGTNTISLLSATVGLANYGQYYDKKPVGIAEGPVQLVANGKPFMD 534

Query: 557 LSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLAL 616
           L+S +W Y++G+ GE    +  +  ++S +     LP  +++ WYKTTF +P G  P+ +
Sbjct: 535 LTSYQWSYKIGLSGEAKRYNDPNSPHASKFTASDNLPTGRAMTWYKTTFASPSGTEPVVV 594

Query: 617 NLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHI 676
           +L  MGKG AWVNG+S+GR+W   +A + GC   CDYRGSY+  KC  +CG P+Q  YHI
Sbjct: 595 DLLGMGKGHAWVNGKSLGRFWPTQIADAKGCPDTCDYRGSYNGDKCVTNCGNPSQRWYHI 654

Query: 677 PRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS 735
           PR++++  G+N L++ EE+GG+P+ +S      + IC    E                  
Sbjct: 655 PRSYLNKDGQNTLILFEEVGGNPTNVSFQIVAVETICGNAYEGS---------------- 698

Query: 736 SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
              + L+CE G  I+ I FASYG PEG CG+F  G+
Sbjct: 699 --TLELSCEGGRTISDIQFASYGDPEGTCGAFMKGS 732


>gi|357467507|ref|XP_003604038.1| Beta-galactosidase [Medicago truncatula]
 gi|355493086|gb|AES74289.1| Beta-galactosidase [Medicago truncatula]
          Length = 847

 Score =  673 bits (1736), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/842 (42%), Positives = 500/842 (59%), Gaps = 54/842 (6%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD ++L ++G+R +L SGSIHY RSTP+ WP+++ K++ GGL VI+TYVFWN HEP 
Sbjct: 34  NVTYDGKSLFVNGRRELLFSGSIHYTRSTPDAWPDILDKARHGGLNVIQTYVFWNAHEPE 93

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +G++ FEG  DLV+F++ VQ  G+++ LR+GP+  AEWN+GG P WL  +PGI FR+ N 
Sbjct: 94  QGKFNFEGNNDLVKFIRLVQSKGMYVTLRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNE 153

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P+K+ MK +++KII +MK E LFA QGGPIILAQ+ENEY +++ AY   G+ YV+WAA+ 
Sbjct: 154 PYKKYMKAYVSKIIQMMKDEKLFAPQGGPIILAQIENEYNHIQLAYEEKGDSYVQWAANM 213

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
           AV L+  VPW+MC+Q+DAPDP+IN CNG +C D F+ PN P KP +WTEN++  +  FG 
Sbjct: 214 AVALDIGVPWIMCKQKDAPDPVINACNGRHCGDTFSGPNKPYKPSLWTENWTAQYRVFGD 273

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            V  R  ED+AF+VARFF   G   NYYMY GGTNFGRT       T Y  +AP+DEYG 
Sbjct: 274 PVSQRSAEDIAFSVARFFSKNGNLVNYYMYHGGTNFGRTTSA-FTTTRYYDEAPLDEYGM 332

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSS 360
            RQPKW HLR+ HKA+ LC + ++   PT QKL    E  I+ K  ++ C+AF+ N  ++
Sbjct: 333 ERQPKWSHLRDAHKALLLCRKAILGGVPTVQKLNDYHEVRIFEKPGTSTCSAFITNNHTN 392

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQ----RNNGDHPFAQ----QKNV 412
             A ++F G+ YFLPA S+S+LPDCK VV+NT  V++Q    +    H   +    Q N 
Sbjct: 393 QAATISFRGSNYFLPAHSISVLPDCKTVVYNTQNVMNQLVYYKLISSHLIIKLIVSQHNK 452

Query: 413 NELLLASSA--FSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ 468
              + ++ A    W  + E +  S      +    E     KDT+DY WYT S  + P  
Sbjct: 453 RNFVKSAVANNLKWELFLEAIPSSKKLESNQKIPLELYTLLKDTTDYGWYTTSFELGPED 512

Query: 469 --GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
              K   L I SLGH    FVN + +   +G H+  +F   +      G N + IL+  V
Sbjct: 513 LPKKSAILRIMSLGHTLSAFVNGQYIGTDHGTHEEKSFEFEQPANFKVGTNYISILATTV 572

Query: 527 GLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
           GL + GA+ +   AG  S+ ++ L  GK +L+   W ++VG+ GE + +     +    W
Sbjct: 573 GLPDSGAYMEHRYAGPKSISILGLNKGKLELTKNGWGHRVGLRGEQLKVFTEEGSKKVQW 632

Query: 587 K--QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
               G T    ++L W KT F  PEG+GP+A+ +  MGKG  WVNG+SIGR+W ++L+P 
Sbjct: 633 DPVTGET----RALSWLKTRFATPEGRGPVAIRMTGMGKGMIWVNGKSIGRHWMSFLSP- 687

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
                                 GQP+Q  YHIPR +++  +NLLV+ EE  G P KI ++
Sbjct: 688 ---------------------LGQPSQEEYHIPRDYLNAKDNLLVVLEEEKGSPEKIEIM 726

Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGI 759
                 ICS+++E  P  V+SW    G       +S PQ  L C  G  I A+ FAS+G 
Sbjct: 727 IVDRDTICSYITENSPANVNSWGSKNGEFRSVGKNSGPQASLKCPSGKKIVAVEFASFGN 786

Query: 760 PEGNCGSFRPGACHMDVLP-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
           P G CG F  G C+      +V+KAC+G+ EC + V+ A    +   C G +  LA++A 
Sbjct: 787 PSGYCGDFALGNCNGGAAKGVVEKACLGKEECLVEVNRA--NFNGQGCAGSVNTLAIQAK 844

Query: 819 CS 820
           CS
Sbjct: 845 CS 846


>gi|449433325|ref|XP_004134448.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 7-like [Cucumis
           sativus]
          Length = 803

 Score =  672 bits (1735), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/833 (44%), Positives = 504/833 (60%), Gaps = 61/833 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYD R+L I+G+R+++ SG+IHYPRS+P +WP L++K+K GGL  IETYVFWN HEP 
Sbjct: 15  SVTYDGRSLKINGERKIIISGAIHYPRSSPGMWPMLMKKAKNGGLNAIETYVFWNAHEPQ 74

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RGQY F G  DLV+F+K VQ+  L+  LRIGPY CAEWNYGGFPVWLH +PGI+FRT N 
Sbjct: 75  RGQYDFSGNNDLVQFIKAVQKERLYAILRIGPYVCAEWNYGGFPVWLHNLPGIKFRTNNQ 134

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +K     F     +L K  N+F       +   +ENE+GNVE +YG  G+ YVKW A+ 
Sbjct: 135 VYKVTFXFFFL-TKNLKKINNMF-------LKNXIENEFGNVEGSYGQEGKEYVKWCAEL 186

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A + N S PW+MCQQ DAP PI+  CN   CD F PN+ + P MWTE+++GWF  +G   
Sbjct: 187 AQSYNLSEPWIMCQQGDAPQPIV--CN---CDQFKPNNKNSPKMWTESWAGWFKGWGERD 241

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+R  EDLAFAVARFF+ GG+  NYYMY GGTNFGR+AGGP + TSYDY+AP+DEYG + 
Sbjct: 242 PYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNFGRSAGGPYITTSYDYNAPLDEYGNMN 301

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSD 362
           QPKWGHL++LH+ I+  E+ L   D  H   G    A  Y +K  + C  F  N   +SD
Sbjct: 302 QPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHSTTATSYTYKGKSSC--FFGN-PENSD 358

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLASSA 421
             +TF    Y +P WSV++LPDCK  V+NTAKV +Q    +  P    K+   L      
Sbjct: 359 REITFQERKYTVPGWSVTVLPDCKTEVYNTAKVNTQTTIREMVPSLVGKHKKPL-----K 413

Query: 422 FSWYEEKV-------GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKE 471
           + W  EK+        ISG+ +     L +Q   T D+SDYLWY    H+    P  GK 
Sbjct: 414 WQWRNEKIEHLTHEGDISGS-AITANSLIDQKMVTNDSSDYLWYLTGFHLNGNDPLFGKR 472

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE-LNEGINTLDILSMMVGLQN 530
           V L +++ GH    FVN K +   +G +   +F + KK+  L  G N + +LS  VGL N
Sbjct: 473 VTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFTLEKKVRNLRHGFNQIALLSATVGLPN 532

Query: 531 YGAWFDVAGAGLFSVILIDLKNGK--RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
           YGA+++    G++  + + + +GK  RDLS+ EWIY+VG++GE              W  
Sbjct: 533 YGAYYENVEVGIYGPVEL-IADGKTIRDLSTNEWIYKVGLDGEKYEFFDPDHKFRKPW-L 590

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
            + LP+N++  WYKT+F  P+G+  + ++L  MGKGQAWVNG+SIGRYW +YLA   GC+
Sbjct: 591 SNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMGKGQAWVNGKSIGRYWPSYLATENGCS 650

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG-ENLLVIHEELGGDPSKISLLTKT 707
             CDYRG+Y  SKC  +CG+P Q  YHIPR++++ G EN L++ EE GG P  I + T  
Sbjct: 651 SSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMNDGKENTLILFEEFGGMPLNIEIKTTR 710

Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
            + +C+             K +LG      ++ L C     +  I F  +G P+GNC +F
Sbjct: 711 VKKVCA-------------KVDLG-----SKLELTCHDR-TVKRIIFVGFGNPKGNCNNF 751

Query: 768 RPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G+CH  +   +++K C+ + +CSI V+   LG++    P     LAV+  C
Sbjct: 752 HKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKLGLTGCKNPK-DNWLAVQVSC 803


>gi|414878435|tpg|DAA55566.1| TPA: hypothetical protein ZEAMMB73_938277 [Zea mays]
          Length = 774

 Score =  672 bits (1733), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/754 (47%), Positives = 471/754 (62%), Gaps = 48/754 (6%)

Query: 105 GFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN 164
           GFPVWL  +PGI+FRT N P+K EM+ F+ KI+D+MK+E L++ QGGPIIL Q+ENEYGN
Sbjct: 19  GFPVWLRDVPGIEFRTDNEPYKAEMQIFVTKIVDIMKEEKLYSWQGGPIILQQIENEYGN 78

Query: 165 VEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK 224
           ++  YG  G+ Y+ WAA  A+ L+T VPWVMC+Q DAP+ I+NTCN FYCDGF PNS +K
Sbjct: 79  IQGHYGQAGKRYMLWAAQMALALDTGVPWVMCRQTDAPEQILNTCNAFYCDGFKPNSYNK 138

Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
           P +WTE++ GW+  +G ++P RP +D AFAVARF++ GG+ QNYYMYFGGTNF RTAGGP
Sbjct: 139 PTIWTEDWDGWYADWGESLPHRPAQDSAFAVARFYQRGGSLQNYYMYFGGTNFERTAGGP 198

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHI 342
           L  TSYDYDAPIDEYG +RQPKWGHL++LH AIKLCE  L + D  P + KLG   EAH+
Sbjct: 199 LQITSYDYDAPIDEYGILRQPKWGHLKDLHAAIKLCESALTAVDGSPHYVKLGPMQEAHV 258

Query: 343 YHK-----------SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
           Y             +S  C+AFLAN D    A+V   G  Y LP WSVSILPDC+ V FN
Sbjct: 259 YSSENVHTNGSISGNSQFCSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCETVAFN 318

Query: 392 TAKVISQRN-----NGDHPFAQQKNVNELLLASSAF---SW--YEEKVGISGNRSFVRPD 441
           TA+V +Q +     +G   ++ +     L L    +   +W  ++E VGI G   F    
Sbjct: 319 TARVGTQTSFFNVESGSPSYSSRHKPRILSLIGVPYLSTTWWTFKEPVGIWGEGIFTAQG 378

Query: 442 LAEQINTTKDTSDYLWYTASIHVMP-------GQGKEVFLNIESLGHAALVFVNKKLVAF 494
           + E +N TKD SDYL YT  +++          +G    L I+ +   A VFVN KL   
Sbjct: 379 ILEHLNVTKDISDYLSYTTRVNISEEDVLYWNSKGFLPSLTIDQIRDVARVFVNGKLAGS 438

Query: 495 GYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNG 553
             G+       +N+ ++L +G+N L +LS +VGLQNYGA+ +  GAG    V L  L NG
Sbjct: 439 KVGHW----VSLNQPLQLVQGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVKLTGLSNG 494

Query: 554 KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGP 613
             DL++  W YQ+G++GE+  +       S+ W             W+KT F APEG GP
Sbjct: 495 DIDLTNSLWTYQIGLKGEFSRIYSPEYQGSAEWSSMQNDDTVSPFTWFKTMFDAPEGNGP 554

Query: 614 LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTL 673
           + ++L SMGKGQAWVNG  IGRYWS  +AP +GC   C+Y G+Y  SKC+ +CG   Q+ 
Sbjct: 555 VTIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCPSSCNYAGTYSDSKCRSNCGIATQSW 613

Query: 674 YHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW------K 727
           YHIPR W+    NLLV+ EE GGDPS+ISL     + ICS +SE   PP+ +W      +
Sbjct: 614 YHIPREWLQESGNLLVLFEETGGDPSQISLEVHYTKTICSKISETYYPPLSAWSRAANGR 673

Query: 728 PNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVG 786
           P++  V  +P++RL C+ G  I+ I FASYG P G C +F  G CH    L +V +AC G
Sbjct: 674 PSVNTV--APELRLQCDDGHVISKITFASYGTPTGGCQNFSVGNCHASTTLDLVVEACEG 731

Query: 787 QIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +  C+I V++   G     C  ++K LAVEA CS
Sbjct: 732 KNRCAISVTNEVFG---DPCRKVVKDLAVEAECS 762


>gi|356532710|ref|XP_003534914.1| PREDICTED: beta-galactosidase 1-like [Glycine max]
          Length = 650

 Score =  671 bits (1731), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/610 (54%), Positives = 418/610 (68%), Gaps = 22/610 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+V+DGKRR+L SGSIHYPRSTP++WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 21  VTASVTYDHKAIVVDGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDVIQTYVFWNGH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFE RFDLV+FVK  Q+AGL++HLRIGPY CAEWN GGFPVWL ++PGI FRT
Sbjct: 81  EPSPGQYYFEDRFDLVKFVKLAQQAGLYVHLRIGPYICAEWNLGGFPVWLKYVPGIAFRT 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F AKI+ LMK+  LF SQGGPIIL+Q+ENEYG VEW  G  G+ Y KWA
Sbjct: 141 DNEPFKAAMQKFTAKIVSLMKENRLFQSQGGPIILSQIENEYGPVEWEIGAPGKAYTKWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV L+T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN  +KP MWTEN++GW+  FG
Sbjct: 201 AQMAVGLDTGVPWVMCKQEDAPDPVIDTCNGFYCENFKPNKNTKPKMWTENWTGWYTDFG 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP RP EDLAF+VARF + GG+F NYYMY GGTNFGRT+GG  +ATSYDYDAP+DEYG
Sbjct: 261 GAVPRRPAEDLAFSVARFIQNGGSFVNYYMYHGGTNFGRTSGGLFIATSYDYDAPLDEYG 320

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
              +PK+ HLR LHKAIK  E  L+++DP  Q LG  LEAH++  +   CAAF+ANYD+ 
Sbjct: 321 LENEPKYEHLRALHKAIKQSEPALVATDPKVQSLGYNLEAHVF-SAPGACAAFIANYDTK 379

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A   F    Y LP WS+SILPDCK VV+NTAKV      G     +   VN      S
Sbjct: 380 SYAKAKFGNGQYDLPPWSISILPDCKTVVYNTAKV------GYGWLKKMTPVN------S 427

Query: 421 AFSWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AF+W    EE    S   S     L EQ+N T+D+SDYLWY   ++V   +     G+  
Sbjct: 428 AFAWQSYNEEPASSSQADSIAAYALWEQVNVTRDSSDYLWYMTDVNVNANEGFLKNGQSP 487

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GH   VF+N +L    +G         +  ++L  G N L +LS+ VGL N G
Sbjct: 488 LLTVMSAGHVLHVFINGQLAGTVWGGLGNPKLTFSDNVKLRAGNNKLSLLSVAVGLPNVG 547

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
             F+   AG+   V L  L  G RDLS  +W Y+VG++GE + L   S ++S  W QGS 
Sbjct: 548 VHFETWNAGVLGPVTLKGLNEGTRDLSRQKWSYKVGLKGESLSLHTESGSSSVEWIQGSL 607

Query: 592 LPVNKSLIWY 601
           +   + L WY
Sbjct: 608 VAKKQPLTWY 617



 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 27/36 (75%)

Query: 672 TLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           T YH+PR+W+  G N LV+ EE GGDP+ I+L+ +T
Sbjct: 615 TWYHVPRSWLSSGGNSLVVFEEWGGDPNGIALVKRT 650


>gi|326520505|dbj|BAK07511.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 830

 Score =  671 bits (1730), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/836 (43%), Positives = 499/836 (59%), Gaps = 52/836 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V YD RALVIDG+RR+L SGSIHYPRSTPE+WP+LIRK+KEGGL+ IETYVFWN HEP R
Sbjct: 26  VGYDDRALVIDGERRLLISGSIHYPRSTPEMWPDLIRKAKEGGLDAIETYVFWNGHEPRR 85

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY FEG +D+VRF K VQ+AG++  LRIGPY C EWNYGG P WL  I G+QFR  N+P
Sbjct: 86  RQYNFEGSYDIVRFFKEVQDAGMYAILRIGPYICGEWNYGGLPAWLRDISGMQFRMHNHP 145

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F++EM+ F   I+D +K+  +FA QGGPIIL+Q+ENEYGN+  +         Y+ W A 
Sbjct: 146 FEQEMETFTTLIVDKLKEAKMFAGQGGPIILSQIENEYGNIMGKLNNNESASEYIHWCAA 205

Query: 183 TAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ +D P  +INT NGFYC  + P     P +WTEN++GWF ++  
Sbjct: 206 MANKQNVGVPWIMCQQDDDVPSNVINTWNGFYCHDWFPKRTDIPKIWTENWTGWFKAWDK 265

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AF+VA FF+T G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG 
Sbjct: 266 PDFHRSAEDIAFSVAMFFQTRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGN 325

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IRQPK+GHL++LH  +K  E+ L+  D     +G        +   N  A F++N     
Sbjct: 326 IRQPKYGHLKDLHNVLKSMEKILLHGDYKDTTMGNTNVTVTKYTLDNSSACFISNKFDDK 385

Query: 362 DANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           + NVT  NG  + +PAWSVSILPDCK V +N+AK+ +Q +        ++   E +    
Sbjct: 386 EVNVTLDNGATHTVPAWSVSILPDCKTVAYNSAKIKTQTS-----VMVKRPGAETVTDGL 440

Query: 421 AFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LNI 476
           A+SW  E +         +F + +L EQI T+ D SDYLWY  S      +G+  + L++
Sbjct: 441 AWSWMPENLQPFMTDEKGNFRKNELLEQIATSGDQSDYLWYRTSFE---HKGESNYKLHV 497

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            + GH    FVN KLV   Y  +    F +   ++L+ G N + +LS  +GL+NYGA F+
Sbjct: 498 NTTGHELYAFVNGKLVGRHYSPNGGFAFQMETPVKLHSGKNYISLLSATIGLKNYGALFE 557

Query: 537 VAGAGLFS--VILIDLKNGKR--DLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG- 589
           +  AG+    V L+D        DLS+  W Y+ G+ GEY    LDK +  + S W  G 
Sbjct: 558 MMPAGIVGGPVKLVDTVTNTTAYDLSNSSWSYKAGLAGEYRETHLDKAN--DRSQWSGGL 615

Query: 590 -STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
             T+PV++   WYK TF AP G+ P+  +L  +GKG  WVNG ++GRYW +Y+A      
Sbjct: 616 NGTIPVHRPFTWYKATFEAPAGEEPVVADLLGLGKGVVWVNGNNLGRYWPSYVAADMDGC 675

Query: 649 KKCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISL 703
           ++CDYRG++    D  KC   C +P+Q  YH+PR+++  GE N +V+ EE GGDP+++S 
Sbjct: 676 QRCDYRGTFKAEGDGQKCLTGCNEPSQRFYHVPRSFIKAGEPNTMVLFEEAGGDPTRVSF 735

Query: 704 LTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGN 763
            T      C+  +                     +V LAC  G  I++++ AS G+  G 
Sbjct: 736 HTVAVGAACAEAA-----------------EVGDEVALACSHGRTISSVDVASLGVARGK 778

Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           CG+++ G      L     ACVG+  C++  +  +    +G   G+   L V+A C
Sbjct: 779 CGAYQGGCESKAALAAFTAACVGKESCTVRHTEDFR-AGSGCDSGV---LTVQATC 830


>gi|293332691|ref|NP_001168270.1| beta-galactosidase precursor [Zea mays]
 gi|223947135|gb|ACN27651.1| unknown [Zea mays]
 gi|414880417|tpg|DAA57548.1| TPA: beta-galactosidase [Zea mays]
          Length = 822

 Score =  671 bits (1730), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/836 (43%), Positives = 493/836 (58%), Gaps = 50/836 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  VTY+ RALVIDG+RR++ SGSIHYPRSTP++WP+LI K+KEGGL  IETYVFWN HE
Sbjct: 20  ATTVTYNDRALVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHE 79

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P R QY FEG +D++RF K +Q AG+   LRIGPY C EWNYGG P WL  IPG+QFR  
Sbjct: 80  PRRRQYNFEGSYDIIRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLH 139

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKW 179
           N PF+ EM+ F   I++ MK  N+FA QGGPIILAQ+ENEYGN+  +         Y+ W
Sbjct: 140 NAPFEREMETFTTLIVNKMKDVNMFAGQGGPIILAQIENEYGNIMGQLKNNQSASQYIHW 199

Query: 180 AADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
            AD A      VPW+MCQQ+ D P  +INTCNGFYC  + PN    P +WTEN++GWF +
Sbjct: 200 CADMANKQEVGVPWIMCQQDNDVPHNVINTCNGFYCHDWFPNRTGIPKIWTENWTGWFKA 259

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
           +      R  ED+AFAVA FF+  G+  NYYMY GGTNFGRT+GGP + TSYDYDAP+DE
Sbjct: 260 WDKPDFHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDE 319

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANY 357
           YG IRQPK+GHL++LH  I+  E+ L+         G  +    Y +  S+ C  F+ N 
Sbjct: 320 YGNIRQPKYGHLKDLHDLIRSMEKILVHGKYNDTSYGKNVTVTKYMYGGSSVC--FINNQ 377

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
               D  VT  G  + +PAWSVSILP+CK V +NTAK+ +Q +       ++ N  E   
Sbjct: 378 FVDRDMKVTLGGETHLVPAWSVSILPNCKTVAYNTAKIKTQTS----VMVKKANSVEKEP 433

Query: 418 ASSAFSWYEEKVG--ISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
            +  +SW  E +   ++ +R SF +  L EQI T+ D SDYLWY  S+    G+G    L
Sbjct: 434 ETMRWSWMPENLKPFMTDHRGSFRQSQLLEQIATSTDQSDYLWYRTSLE-HKGEGSYT-L 491

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + + GH    FVN +LV   +       F +   ++L+ G N + +LS  VGL+NYG  
Sbjct: 492 YVNTSGHEMYAFVNGRLVGQNHSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGPS 551

Query: 535 FDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF-WK-QGS 590
           F++  AG+    V L+       DL+   W Y+ G+ GE   L +I L    + W+    
Sbjct: 552 FELVPAGIAGGPVKLVGTNGTAIDLTKSSWSYKSGLAGE---LRQIHLDKPGYKWQSHNG 608

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           T+PVN+   WYKTTF AP G+  + ++L  + KG AWVNG S+GRYW +Y A        
Sbjct: 609 TIPVNRPFTWYKTTFEAPAGEEAVVVDLLGLNKGVAWVNGNSLGRYWPSYTAAEMPGCHV 668

Query: 651 CDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLT 705
           CDYRG +    D  +C   CG+PAQ  YH+PR+++  GE N L++ EE GGDP++ +  T
Sbjct: 669 CDYRGKFIAEGDGIRCLTGCGEPAQRFYHVPRSFLRAGEPNTLILFEEAGGDPTRAAFHT 728

Query: 706 KTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNC 764
                +C                 +  V     V L+C   G  +A+++ AS+G+  G+C
Sbjct: 729 VAVGPVC-----------------VAAVELGDDVTLSCGGHGRVVASVDVASFGVARGSC 771

Query: 765 GSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G+++ G      L     ACVG+  C++  ++A+ G  AG   G   AL V+A CS
Sbjct: 772 GAYKGGCESKAALKAFTDACVGRESCTVKYTAAFAG--AGCQSG---ALTVQATCS 822


>gi|413925747|gb|AFW65679.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
          Length = 846

 Score =  670 bits (1728), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/832 (42%), Positives = 496/832 (59%), Gaps = 50/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+IDG+R +  SGSIHYPRS P++WPELI K+KEGGL  IETY+FWN HEP +
Sbjct: 41  VSYDRRSLIIDGRREIFFSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYIFWNIHEPEK 100

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQ+ FEGR+D+VRF K +QE  ++  +R+GP+  AEWN+GG P WL  IP I FRT N P
Sbjct: 101 GQFDFEGRYDIVRFFKLIQEHNMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 160

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K  M+ F+  II  +K  NLFASQGGPIILAQ+ENEY ++E A+   G  Y+KWAA+ A
Sbjct: 161 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHLEAAFKNDGTKYIKWAANMA 220

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
           ++ N  +PW+MC+Q  AP  +I TCNG  C G T   P + S P++WTEN++  +  FG 
Sbjct: 221 ISTNVGIPWIMCKQTKAPSDVIPTCNGRNC-GDTWPGPMNKSMPLLWTENWTAQYRVFGD 279

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVARFF  GGT  NYYMY GGTNFGRT+   ++   YD +AP+DE+G 
Sbjct: 280 PPSQRSAEDIAFAVARFFSVGGTMTNYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 338

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
            ++PKWGHLR+LH A+KLC++ L+    + +KLG + EA ++       C AFL+N+++ 
Sbjct: 339 YKEPKWGHLRDLHLALKLCKKALLWGKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTK 398

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            D  +TF G  YF+P  S+SIL DCK VVF T  V +Q N     FA Q   N +     
Sbjct: 399 DDVTLTFRGQSYFVPRHSISILADCKTVVFGTQHVNAQHNQRTFHFADQTTQNNVWQM-- 456

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLN 475
              + EEKV              +  N TKD +DY+WYT+S  +    MP  +  +  L 
Sbjct: 457 ---FDEEKVPKYKQSKIRLRKAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLE 513

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           + S GHA++ FVN K V  G+G      F + K ++L +G+N + +L+  +G+ + GA+ 
Sbjct: 514 VNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYL 573

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           +   AG+  V +  L  G  DL++  W + VG+ GE   +       S  WK       +
Sbjct: 574 EHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAVN---D 630

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK  F  P G+ P+ L++++MGKG  +VNGQ IGRYW +Y               
Sbjct: 631 RPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISY--------------- 675

Query: 656 SYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                   KH  G+P+Q LYHIPR+++   +N+LV+ EE  G P  I +LT    +IC+F
Sbjct: 676 --------KHALGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVKRDNICTF 727

Query: 715 VSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           +SE +P  + SW+     ++ +     P+  L C     I  + FASYG P G CG++  
Sbjct: 728 ISERNPAHIKSWERKDSQITVTAADLKPRATLTCSPKKLIQQVVFASYGNPMGICGNYTI 787

Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G+CH      +V+KAC+G+  C++PVS+   G     CPG    LAV+A CS
Sbjct: 788 GSCHTPRAKELVEKACLGKRICTLPVSADVYGGDVN-CPGTTATLAVQAKCS 838


>gi|22328945|ref|NP_194344.2| beta-galactosidase 12 [Arabidopsis thaliana]
 gi|20466292|gb|AAM20463.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|23198118|gb|AAN15586.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|332659763|gb|AEE85163.1| beta-galactosidase 12 [Arabidopsis thaliana]
          Length = 636

 Score =  669 bits (1725), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/610 (52%), Positives = 432/610 (70%), Gaps = 21/610 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + A VTYD +A++I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN H
Sbjct: 25  VKAIVTYDRKAVIINGQRRILLSGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  GQYYFE R+DLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 85  EPSPGQYYFEDRYDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGMVFRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK+E LF +QGGPIIL+Q+ENEYG +EW  G  G+ Y KW 
Sbjct: 145 DNEPFKAAMQKFTEKIVRMMKEEKLFETQGGPIILSQIENEYGPIEWEIGAPGKAYTKWV 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A+ A  L+T VPW+MC+Q+DAP+ IINTCNGFYC+ F PNS +KP MWTEN++GWF  FG
Sbjct: 205 AEMAQGLSTGVPWIMCKQDDAPNSIINTCNGFYCENFKPNSDNKPKMWTENWTGWFTEFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP ED+A +VARF + GG+F NYYMY GGTNF RTA G  +ATSYDYDAP+DEYG
Sbjct: 265 GAVPYRPAEDIALSVARFIQNGGSFINYYMYHGGTNFDRTA-GEFIATSYDYDAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R+PK+ HL+ LHK IKLCE  L+S+DPT   LG K EAH++ KS + CAAFL+NY++S
Sbjct: 324 LPREPKYSHLKRLHKVIKLCEPALVSADPTVTSLGDKQEAHVF-KSKSSCAAFLSNYNTS 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F G+ Y LP WSVSILPDCK   +NTAKV   R +  H         +++  ++
Sbjct: 383 SAARVLFGGSTYDLPPWSVSILPDCKTEYYNTAKV---RTSSIH--------MKMVPTNT 431

Query: 421 AFSW--YEEKV-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ----GKEVF 473
            FSW  Y E++   + N +F +  L EQI+ T+D +DY WY   I + P +    G++  
Sbjct: 432 PFSWGSYNEEIPSANDNGTFSQDGLVEQISITRDKTDYFWYLTDITISPDEKFLTGEDPL 491

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GHA  VFVN +L    YG+ +      ++KI+L+ G+N L +LS   GL N G 
Sbjct: 492 LTIGSAGHALHVFVNGQLAGTAYGSLEKPKLTFSQKIKLHAGVNKLALLSTAAGLPNVGV 551

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            ++    G+   V L  + +G  D++  +W Y++G +GE + +  ++ +++  WK+GS +
Sbjct: 552 HYETWNTGVLGPVTLNGVNSGTWDMTKWKWSYKIGTKGEALSVHTLAGSSTVEWKEGSLV 611

Query: 593 PVNKSLIWYK 602
              + L WYK
Sbjct: 612 AKKQPLTWYK 621


>gi|357142911|ref|XP_003572734.1| PREDICTED: beta-galactosidase 1-like [Brachypodium distachyon]
          Length = 831

 Score =  669 bits (1725), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/833 (43%), Positives = 485/833 (58%), Gaps = 51/833 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD RALVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+K+GGL  IETYVFWN HEP  
Sbjct: 33  VSYDERALVIDGQRRIILSGSIHYPRSTPEMWPDLIQKAKDGGLNTIETYVFWNGHEPRP 92

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY FEG +D++RF K VQ+AG++  LRIGPY C EWNYGG P WL  IP +QFR  N P
Sbjct: 93  RQYNFEGNYDIMRFFKEVQKAGMYAILRIGPYICGEWNYGGLPAWLRDIPDMQFRLHNEP 152

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVE--WAYGVGGELYVKWAAD 182
           F+ EM+ F   I++ MK  N+FA QGGPIIL Q+ENEYGNV+           Y+ W AD
Sbjct: 153 FEREMETFTTLIVNKMKDANMFAGQGGPIILTQIENEYGNVQSNLPDQESATKYIHWCAD 212

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ  D P  +I TCNGFYC  F P   + P +WTEN++GWF ++  
Sbjct: 213 MANKQNVGVPWIMCQQSNDVPPNVIETCNGFYCHDFKPKGSNMPKIWTENWTGWFKAWDK 272

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               RP ED+A+AVA FF+  G+ QNYYMY GGTNFGRT+GGP + T+YDYDAP+DEYG 
Sbjct: 273 PDYHRPAEDVAYAVAMFFQNRGSVQNYYMYHGGTNFGRTSGGPYITTTYDYDAPLDEYGN 332

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IRQPK+GHL+ LH  +   E++L+        L  K++A  Y       A F++N   + 
Sbjct: 333 IRQPKYGHLKALHTVLTSMEKHLVYGQQNETNLDDKVKATKYTLDDGSSACFISNSHDNK 392

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVTF G+ Y +PAWSVS+LPDCK V +NTAKV +Q +       ++++  +  L  S 
Sbjct: 393 DVNVTFEGSAYQVPAWSVSVLPDCKTVAYNTAKVKTQTS----VMVKKESAAKGGLKWSW 448

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LNIESLG 480
              +          SF   +L EQI T  D SDYLWY  S+   P   KE F L + + G
Sbjct: 449 LPEFLRPSFTDSYGSFKSNELLEQIVTGADESDYLWYKTSLTRGP---KEQFTLYVNTTG 505

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           H    FVN +L  + +  +    F     + L  G N + +LS  VGL+NYGA F++  A
Sbjct: 506 HELYAFVNGELAGYKHAVNGPYLFQFEAPVTLKPGKNYISLLSATVGLKNYGASFELMPA 565

Query: 541 GLFS--VILIDLKNGKRDLSSGEWIYQVGVEGE--YIGLDKISLANSSFWKQGSTLPVNK 596
           G+    V L+       DLS+  W Y+ G+ GE   I LDK  L  S F      +P N+
Sbjct: 566 GIVGGPVKLVSAHGNTIDLSNNTWTYKTGLFGEQKQIHLDKPGLRWSPF-----AVPTNR 620

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
              WYK TF AP G   + ++L  + KG  +VNG ++GRYW +Y+A       +CDYRG 
Sbjct: 621 PFTWYKATFQAPAGTEAVVVDLVGLNKGVVYVNGHNLGRYWPSYVAGDMDGCHRCDYRGE 680

Query: 657 Y----DASKCQKHCGQPAQTLYHIPRTWV---HPGENLLVIHEELGGDPSKISLLTKTGQ 709
           Y    +  KC   CG+  Q  YH+PR+++   H   N +V+ EE GGDP+K++  T    
Sbjct: 681 YVTWNNQEKCLTGCGEVGQRFYHVPRSFLNAAHGAPNTVVLFEEAGGDPAKVNFRTVAVG 740

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
            +C+   + D                   V LAC  G  I++++ AS+G+  G CG++  
Sbjct: 741 PVCADAEKGD------------------AVTLACAHGRTISSVDTASFGVSGGQCGAYEG 782

Query: 770 GA-CHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G+ C     L  +  ACVG+  C++  + A+    +  C G    L V+A CS
Sbjct: 783 GSGCESKPALEAITAACVGKKWCTVSYTDAF---DSADCKG-SGVLTVQATCS 831


>gi|22329897|ref|NP_683341.1| beta-galactosidase 15 [Arabidopsis thaliana]
 gi|332193266|gb|AEE31387.1| beta-galactosidase 15 [Arabidopsis thaliana]
          Length = 786

 Score =  667 bits (1722), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/831 (42%), Positives = 490/831 (58%), Gaps = 105/831 (12%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V++D RA+ IDG RRVL SGSIHYPRST E+WP+LI+K KEG L+ IETYVFWN HEP R
Sbjct: 45  VSHDGRAITIDGHRRVLLSGSIHYPRSTTEMWPDLIKKGKEGSLDAIETYVFWNAHEPTR 104

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DL+RF+KT+Q  G++  LRIGPY CAEWNYGGFPVWLH +PG++FRTTN  
Sbjct: 105 RQYDFSGNLDLIRFLKTIQNEGMYGVLRIGPYVCAEWNYGGFPVWLHNMPGMEFRTTNTA 164

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F  EM+ F   I++++K+E LFASQGGPIILAQ+ENEYGNV  +YG  G+ Y++W A+ A
Sbjct: 165 FMNEMQNFTTMIVEMVKKEKLFASQGGPIILAQIENEYGNVIGSYGEAGKAYIQWCANMA 224

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            +L+  VPW+MCQQ+DAP P++NTCNG+YCD F+PN+P+ P MWTEN++GW+ ++G   P
Sbjct: 225 NSLDVGVPWIMCQQDDAPQPMLNTCNGYYCDNFSPNNPNTPKMWTENWTGWYKNWGGKDP 284

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  ED+AFAVARFF+  GTFQNYYMY GGTNF RTAGGP + T+YDYDAP+DE+G + Q
Sbjct: 285 HRTTEDVAFAVARFFQKEGTFQNYYMYHGGTNFDRTAGGPYITTTYDYDAPLDEFGNLNQ 344

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDAN 364
           PK+GHL++LH  +   E+ L   + +    G  + A +Y ++    + F+ N + +SDA 
Sbjct: 345 PKYGHLKQLHDVLHAMEKTLTYGNISTVDFGNLVTATVY-QTEEGSSCFIGNVNETSDAK 403

Query: 365 VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW 424
           + F G  Y +PAWSVSILPDCK   +NTAK+ +Q +       ++ N  E   ++  +SW
Sbjct: 404 INFQGTSYDVPAWSVSILPDCKTETYNTAKINTQTS----VMVKKANEAENEPSTLKWSW 459

Query: 425 YEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM---PGQGKEVFLNIES 478
             E +    + G        L +Q   + D SDYLWY  ++++    P  GK + L I S
Sbjct: 460 RPENIDSVLLKGKGESTMRQLFDQKVVSNDESDYLWYMTTVNLKEQDPVLGKNMSLRINS 519

Query: 479 LGHAALVFVNKKLVAFGYGNHDFAN----FLINKKIELNEGINTLDILSMMVGLQNYGAW 534
             H    FVN + +    GN+   N    ++  +  + N G N + +LS+ VGL NYGA+
Sbjct: 520 TAHVLHAFVNGQHI----GNYRVENGKFHYVFEQDAKFNPGANVITLLSITVGLPNYGAF 575

Query: 535 FDVAGAGLFSVILIDLKNGK----RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           F+   AG+   + I  +NG     +DLS+ +W Y+ G+ G           N  F  +  
Sbjct: 576 FENFSAGITGPVFIIGRNGDETIVKDLSTHKWSYKTGLSG---------FENQLFSSES- 625

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                       +T+ AP G  P+ ++L  +GKG AW+NG +IGRYW A+L+        
Sbjct: 626 -----------PSTWSAPLGSEPVVVDLLGLGKGTAWINGNNIGRYWPAFLSDI------ 668

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQH 710
                                            G+N LV+ EE+GG+PS ++  T     
Sbjct: 669 --------------------------------DGDNTLVLFEEIGGNPSLVNFQTIGVGS 696

Query: 711 ICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           +C+ V E +                   + L+C  G  I+AI FAS+G P G+CGSF  G
Sbjct: 697 VCANVYEKNV------------------LELSC-NGKPISAIKFASFGNPGGDCGSFEKG 737

Query: 771 ACHM--DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            C    +   I+ + CVG+ +CSI VS    G  A  C  L K LAVEA C
Sbjct: 738 TCEASNNAAAILTQECVGKEKCSIDVSEDKFG--AAECGALAKRLAVEAIC 786


>gi|11079481|gb|AAG29193.1|AC078898_3 beta-galactosidase, putative [Arabidopsis thaliana]
          Length = 780

 Score =  667 bits (1721), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/831 (45%), Positives = 490/831 (58%), Gaps = 73/831 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 10  ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 69

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQ+ F G  D+V+F+K V+  GL++ LRIGP+   EW+YGG P WLH + GI FRT N
Sbjct: 70  QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 129

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  MKR+   I+ LMK ENL+ASQGGPIIL+Q+ENEYG V  A+   G+ YVKW A 
Sbjct: 130 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 189

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L+T VPWVMC+Q+DAPDP++N CNG  C + F  PNSP+KP +WTEN++       
Sbjct: 190 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSL----- 244

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                   ED+AF VA F    G+F NYYMY GGTNFGR A    V TSY   AP+DEYG
Sbjct: 245 ------SAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 297

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL+ELH A+KLCEE L+S   T   LG    A ++ K +N CAA L N D  
Sbjct: 298 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 356

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            ++ V F  + Y L   SVS+LPDCKNV FNTAKV +Q N       + +   + L +  
Sbjct: 357 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 410

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  + E V      S     L E +NTT+DTSDYLW T        +G    L +  LG
Sbjct: 411 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 468

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   FVN + +   +G      FL+ K + LN G N L +LS+MVGL N GA  +    
Sbjct: 469 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 528

Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
           G  SV    + NG+  L  ++  W YQVG++GE   +     +    WKQ      ++ L
Sbjct: 529 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 584

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYK +F  PEG+ P+ALNL SMGKG+AWVNGQSI  +  +Y                  
Sbjct: 585 TWYKASFDTPEGEDPVALNLGSMGKGEAWVNGQSIAMF--SYFR---------------- 626

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSE 717
                          YHIPR+++ P  NLLVI  EE  G+P  I++ T +   +C  VS 
Sbjct: 627 ---------------YHIPRSFLKPNSNLLVILEEEREGNPLGITIDTVSVTEVCGHVSN 671

Query: 718 ADPPPVDS------WKPNLGV-VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
            +P PV S       + NL       P+V+L C  G  I+ I FAS+G P G+CGS+  G
Sbjct: 672 TNPHPVISPRKKGLNRKNLTYRYDRKPKVQLQCPTGRKISKILFASFGTPNGSCGSYSIG 731

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +CH  + L +VQKAC+ +  CS+PV S   G    +CP  +K+L V A CS
Sbjct: 732 SCHSPNSLAVVQKACLKKSRCSVPVWSKTFG--GDSCPHTVKSLLVRAQCS 780


>gi|242057631|ref|XP_002457961.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
 gi|241929936|gb|EES03081.1| hypothetical protein SORBIDRAFT_03g023500 [Sorghum bicolor]
          Length = 830

 Score =  666 bits (1719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/836 (43%), Positives = 491/836 (58%), Gaps = 58/836 (6%)

Query: 7   YDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQ 66
           Y+ RA+VIDG+RR++ SGSIHYPRSTP++WP+LI K+KEGGL  IETYVFWN HEP R Q
Sbjct: 30  YNDRAVVIDGQRRIILSGSIHYPRSTPQMWPDLINKAKEGGLNTIETYVFWNGHEPRRRQ 89

Query: 67  YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
           Y FEG +D+VRF K +Q AG+   LRIGPY C EWNYGG P WL  IPG+QFR  N+PF+
Sbjct: 90  YNFEGNYDIVRFFKEIQNAGMHAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNDPFE 149

Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTA 184
            EM+ F   I++ MK  N+FA QGGPIILAQ+ENEYGN+  +         Y+ W AD A
Sbjct: 150 REMETFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGKLENNQSASQYIHWCADMA 209

Query: 185 VNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
                 VPW+MCQQ+ D P  +INTCNGFYC  + PN    P +WTEN++GWF ++    
Sbjct: 210 NKQKIGVPWIMCQQDNDVPHNVINTCNGFYCYDWFPNRTGIPKIWTENWTGWFKAWDKPD 269

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+AFAVA FF+  G+  NYYMY GGTNFGRT+GGP + TSYDYDAP+DEYG IR
Sbjct: 270 FHRSAEDIAFAVAMFFQKRGSVHNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIR 329

Query: 304 QPKWGHLRELHKAIKLCEEYLIS---SDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
           QPK+GHL++LH  +K  E+ L+     D +H K    +  + Y  SS     F++N    
Sbjct: 330 QPKYGHLKDLHNLLKSMEKILVHGEYKDTSHGK-NVTVTKYTYGGSS---VCFISNQFDD 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            D NVT  G  + +PAWSVSILPDCK V +NTAK+ +Q +       ++ N  E    + 
Sbjct: 386 RDVNVTLAG-THLVPAWSVSILPDCKTVAYNTAKIKTQTS----VMVKKANSVEKEPEAL 440

Query: 421 AFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
            +SW  E +       + SF +  L EQI T+ D SDYLWY  S+    G+G    L + 
Sbjct: 441 RWSWMPENLKPFMTDDHGSFRQSRLLEQIATSTDQSDYLWYRTSLE-HKGEGSYT-LYVN 498

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           + GH    FVN KLV     ++    F +   ++L+ G N + +LS  VGL+NYG  F++
Sbjct: 499 TTGHKIYAFVNGKLVGQNQSSNGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKNYGPLFEL 558

Query: 538 AGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLP 593
             AG+    V L+   +   DL+   W Y+ G+ GE+  I LDK      S    GS +P
Sbjct: 559 VPAGIAGGPVKLVGANDTAIDLTHSSWSYKSGLAGEHRQIHLDKPGYKWRSHNGSGS-IP 617

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST-GCTKKCD 652
           VN+   WYKTTF AP G   + ++L  + KG AWVNG S+GRYW +Y A    GC   CD
Sbjct: 618 VNRPFTWYKTTFAAPAGDEAVVVDLLGLNKGAAWVNGNSLGRYWPSYTAAEMGGCHGACD 677

Query: 653 YRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKT 707
           YRG +    D  +C   CG+P+Q  YH+PR+++  GE N LV+ EE GGDP++ +  T  
Sbjct: 678 YRGKFKAEGDGIRCLTGCGEPSQRFYHVPRSFLRAGEPNTLVLFEEAGGDPARAAFHTVA 737

Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWH---IAAINFASYGIPEGNC 764
             H+C   +E                     V L+C  G     +A+++ AS+G+  G C
Sbjct: 738 VGHVCVAAAEV-----------------GDDVTLSCGGGLGGGVVASVDVASFGVTRGGC 780

Query: 765 GSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA-LAVEAHC 819
           G ++ G      L   + ACVG+  C++  + A+ G      PG     L V+A C
Sbjct: 781 GDYQGGCESKAALKAFRDACVGRESCTVKYTPAFAG------PGCQSGKLTVQATC 830


>gi|45758292|gb|AAS76480.1| beta-galactosidase [Gossypium hirsutum]
          Length = 843

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/832 (42%), Positives = 496/832 (59%), Gaps = 53/832 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+I+GKR +L SG+IHYPRSTP++WP+LI+K+K+GG+  IETYVFWN HEP+ 
Sbjct: 49  VTYDARSLIINGKRELLFSGAIHYPRSTPDMWPDLIKKAKQGGINAIETYVFWNGHEPVE 108

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY FEG FDLV+F+K + E  L+  +R+GP+  AEWN+GG P WL  +PGI FR+ N P
Sbjct: 109 GQYNFEGEFDLVKFIKLIHEHKLYAVVRVGPFIQAEWNHGGLPYWLREVPGIIFRSDNEP 168

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK+ MKRF+  I+D +KQE LFA QGGPIILAQ+ENEY  ++ A+   G+ YV+WA   A
Sbjct: 169 FKKHMKRFVTLIVDKLKQEKLFAPQGGPIILAQIENEYNTIQRAFREKGDSYVQWAGKLA 228

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG--FTPNSPSKPIMWTENYSGWFLSFGYA 242
           ++LN +VPW+MC+Q DAPDPIINTCNG +C    + PN  +KP +WTEN++  +  FG  
Sbjct: 229 LSLNANVPWIMCKQRDAPDPIINTCNGRHCGDTFYGPNKRNKPALWTENWTAQYRVFGDP 288

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  EDLA++VARFF   G+  NYYM++GGTNFGRT+      T Y  + P+DE+G  
Sbjct: 289 PSQRSAEDLAYSVARFFSKNGSMVNYYMHYGGTNFGRTSAS-FTTTRYYDEGPLDEFGLQ 347

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           R+PKWGHL+++H+A+ LC+  L    PT  KLG   +A ++ +  ++ CAAFLAN ++  
Sbjct: 348 REPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAAFLANNNTRL 407

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
             +V F G    LPA S+S+LPDCK VVFNT  V +Q N+        +N     +A+  
Sbjct: 408 AQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNS--------RNFVRSEIANKN 459

Query: 422 FSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFL 474
           F+W    E   +     F  P   E  + TKDT+DY WYT S+ +    +P  +     L
Sbjct: 460 FNWEMCREVPPVGLGFKFDVP--RELFHLTKDTTDYAWYTTSLLLGRRDLPMKKNVRPVL 517

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            + SLGH    +VN +     +G+    +F++ + + L EG N + +L  +VGL + GA+
Sbjct: 518 RVASLGHGIHAYVNGEYAGSAHGSKVEKSFVLQRAVSLKEGENHIALLGYLVGLPDSGAY 577

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
            +   AG  S+ ++ L  G  D+S   W +QVG++GE   L     + S  W +      
Sbjct: 578 MEKRFAGPRSITILGLNTGTLDISQNGWGHQVGIDGEKKKLFTEEGSKSVQWTKPDQ--- 634

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L WYK  F APEG  P+A+ +  MGKG  WVNG+SIGRYW+ YL+P           
Sbjct: 635 GGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSP----------- 683

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                        +P Q+ YHIPR ++ P +NL+V+ EE GG+P  + ++T     ICS 
Sbjct: 684 -----------LKKPTQSEYHIPRAYLKP-KNLIVLLEEEGGNPKDVHIVTVNRDTICSA 731

Query: 715 VSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
           VSE  PP    ++   G + +      P+  L C     I A+ FASYG P G CG++  
Sbjct: 732 VSEIHPPSPRLFETKNGSLQAKVNDLKPRAELKCPGKKQIVAVEFASYGDPFGACGAYFI 791

Query: 770 GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G C   +   +V+K C+G+  C IP+ S        AC  L K LAV+  C+
Sbjct: 792 GNCTAPESKQVVEKYCLGKPSCQIPLDSIPFSNQNDACTHLRKTLAVQLKCA 843


>gi|75141878|sp|Q7XFK2.1|BGL14_ORYSJ RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
           Precursor
 gi|15451595|gb|AAK98719.1|AC090483_9 Putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|31431327|gb|AAP53122.1| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
          Length = 808

 Score =  664 bits (1712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/836 (43%), Positives = 488/836 (58%), Gaps = 79/836 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL  IETYVFWN HEP R
Sbjct: 31  VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            ++ FEG +D+VRF K +Q AG++  LRIGPY C EWNYGG PVWL  IPGI+FR  N P
Sbjct: 91  REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPVWLRDIPGIKFRLHNKP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------NVEWAYGVGGELYV 177
           F+  M+ F   I+  MK  N+FA QGGPIILAQ+ENEYG       N++ A+      Y+
Sbjct: 151 FENGMEAFTTLIVKKMKDANMFAGQGGPIILAQIENEYGYTMLQPENIQSAHE-----YI 205

Query: 178 KWAADTAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
            W AD A   N  VPW+MCQQ+ D P  ++NTCNGFYC  +  N  S P MWTEN++GW+
Sbjct: 206 HWCADMANKQNVGVPWIMCQQDNDVPPNVVNTCNGFYCHEWFSNRTSIPKMWTENWTGWY 265

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
             +      RP ED+AFAVA FF+  G+ QNYYMY GGTNFGRTAGGP + TSYDYDAP+
Sbjct: 266 RDWDQPEFRRPTEDIAFAVAMFFQMRGSLQNYYMYHGGTNFGRTAGGPYITTSYDYDAPL 325

Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
           DEYG +RQPK+GHL+ELH  +   E+ L+  D      G  +    Y  ++   A F+ N
Sbjct: 326 DEYGNLRQPKYGHLKELHSVLMSMEKILLHGDYIDTNYGDNVTVTKYTLNATS-ACFINN 384

Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
                D NVT +G  +FLPAWSVSILP+CK V FN+AK+ +Q          + ++ E  
Sbjct: 385 RFDDRDVNVTLDGTTHFLPAWSVSILPNCKTVAFNSAKIKTQTT----VMVNKTSMVEQQ 440

Query: 417 LASSAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF 473
                +SW  E +         +F + +L EQI TT D SDYLWY  S+    G+G  V 
Sbjct: 441 TEHFKWSWMPENLRPFMTDEKGNFRKNELLEQIVTTTDQSDYLWYRTSLE-HKGEGSYV- 498

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L + + GH    FVN KLV   Y  ++   F +                       NYG 
Sbjct: 499 LYVNTTGHELYAFVNGKLVGQQYSPNENFTFQLKSP--------------------NYGG 538

Query: 534 WFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQG 589
            F++  AG+    V LID      DLS+  W Y+ G+ GEY  I LDK     + +    
Sbjct: 539 SFELLPAGIVGGPVKLIDSSGSAIDLSNNSWSYKAGLAGEYRKIYLDK---PGNKWRSHN 595

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           ST+P+N+   WYKTTF AP G+  + ++L  + KG AWVNG S+GRYW +Y+A       
Sbjct: 596 STIPINRPFTWYKTTFQAPAGEDSVVVDLHGLNKGVAWVNGNSLGRYWPSYVAADMPGCH 655

Query: 650 KCDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLL 704
            CDYRG +    +A KC   CG+P+Q LYH+PR++++ GE N L++ EE GGDPS++++ 
Sbjct: 656 HCDYRGVFKAEVEAQKCLTGCGEPSQQLYHVPRSFLNKGEPNTLILFEEAGGDPSEVAVR 715

Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGN 763
           T     +C+     D                   V L+C   G  I++++ AS+G+  G 
Sbjct: 716 TVVEGSVCASAEVGD------------------TVTLSCGAHGRTISSVDVASFGVARGR 757

Query: 764 CGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           CGS+  G            ACVG+  C++ V+ A+   +AG   G+   L V+A C
Sbjct: 758 CGSYDGGCESKVAYDAFAAACVGKESCTVLVTDAF--ANAGCVSGV---LTVQATC 808


>gi|115477689|ref|NP_001062440.1| Os08g0549200 [Oryza sativa Japonica Group]
 gi|75136208|sp|Q6ZJJ0.1|BGL11_ORYSJ RecName: Full=Beta-galactosidase 11; AltName: Full=Lactase 115;
           Flags: Precursor
 gi|42407808|dbj|BAD08952.1| putative glycosyl hydrolase family 35 (beta-galactosidase) [Oryza
           sativa Japonica Group]
 gi|113624409|dbj|BAF24354.1| Os08g0549200 [Oryza sativa Japonica Group]
          Length = 848

 Score =  659 bits (1701), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/842 (42%), Positives = 491/842 (58%), Gaps = 63/842 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +TYD R+L+IDG R +  SGSIHYPRS P+ WP+LI K+KEGGL VIE+YVFWN HEP +
Sbjct: 33  ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DL++F K +QE  ++  +RIGP+  AEWN+GG P WL  IP I FRT N P
Sbjct: 93  GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK+ MK+F+  I++ +K+  LFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
           +  NT VPW+MC+Q  AP  +I TCNG +C G T   P    KP++WTEN++  +  FG 
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYRVFGD 271

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AF+VARFF  GGT  NYYMY GGTNFGR     ++   YD +AP+DE+G 
Sbjct: 272 PPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRNGAAFVMPRYYD-EAPLDEFGL 330

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSS 360
            ++PKWGHLR+LH A++ C++ L+  +P+ Q LG   EA ++  K  N C AFL+N+++ 
Sbjct: 331 YKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTK 390

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
            D  VTF G  YF+   S+SIL DCK VVF+T  V SQ N     FA    Q NV E+  
Sbjct: 391 EDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM-- 448

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV- 472
                 + EEK+      S       EQ N TKD +DYLWYT S  +    +P + KEV 
Sbjct: 449 ------YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYR-KEVK 501

Query: 473 -FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L + S GHA + FVN   V  G+G      F + K ++L  G+N + ILS  +GL + 
Sbjct: 502 PVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMDS 561

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           G++ +   AG+++V +  L  G  DL++  W + VG++GE   +       +  WK G  
Sbjct: 562 GSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWKPGKD 621

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
              N+ L WY+  F  P G  P+ ++L  MGKG  +VNG+ +GRYW +Y           
Sbjct: 622 ---NQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY----------- 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
                          G+P+Q LYH+PR+ + P  N L+  EE GG P  I +LT    +I
Sbjct: 668 -----------HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKRDNI 716

Query: 712 CSFVSEADPPPVD-SWKPN-----------LGVVSSSPQVRLACERGWHIAAINFASYGI 759
           C+F++E +P  V  SW+              G     P   L+C     I ++ FASYG 
Sbjct: 717 CTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSVVFASYGN 776

Query: 760 PEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
           P G CG++  G+CH      +V+KAC+G+  CS+ VSS   G     CPG    LAV+A 
Sbjct: 777 PLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDV-HCPGTTGTLAVQAK 835

Query: 819 CS 820
           CS
Sbjct: 836 CS 837


>gi|222635782|gb|EEE65914.1| hypothetical protein OsJ_21762 [Oryza sativa Japonica Group]
          Length = 579

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/569 (55%), Positives = 398/569 (69%), Gaps = 20/569 (3%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYDHR+L I+G+RR+L SGSIHYPRSTPE+WP+LI+K+K+GGL+VI+TYVFWN HEP++G
Sbjct: 23  TYDHRSLTINGQRRILISGSIHYPRSTPEMWPDLIQKAKDGGLDVIQTYVFWNGHEPVQG 82

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYF  R+DLVRFVK V++AGL+++LRIGPY CAEWNYGGFPVWL ++PGI FRT N PF
Sbjct: 83  QYYFSDRYDLVRFVKLVKQAGLYVNLRIGPYVCAEWNYGGFPVWLKYVPGISFRTDNGPF 142

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + YV WAA  AV
Sbjct: 143 KAAMQTFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGSGAKSYVDWAAKMAV 202

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
             N  VPW+MC+Q+DAPDP+INTCNGFYCD FTPNS +KP MWTE +SGWF +FG  VP 
Sbjct: 203 ATNAGVPWIMCKQDDAPDPVINTCNGFYCDDFTPNSKNKPSMWTEAWSGWFTAFGGTVPQ 262

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
           RPVEDLAFAVARF + GG+F NYYMY GGTNF RTAGGP +ATSYDYDAPIDEYG +RQP
Sbjct: 263 RPVEDLAFAVARFIQKGGSFINYYMYHGGTNFDRTAGGPFIATSYDYDAPIDEYGLLRQP 322

Query: 306 KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANV 365
           KWGHL  LHKAIK  E  L++ DPT Q +G   +A+++  SS DCAAFL+N+ +S+ A V
Sbjct: 323 KWGHLTNLHKAIKQAETALVAGDPTVQNIGNYEKAYVFRSSSGDCAAFLSNFHTSAAARV 382

Query: 366 TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW- 424
            FNG  Y LPAWS+S+LPDC+  V+NTA V +  +               +  +  F+W 
Sbjct: 383 AFNGRRYDLPAWSISVLPDCRTAVYNTATVTAASSPAK------------MNPAGGFTWQ 430

Query: 425 -YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIES 478
            Y E        +F +  L EQ++ T D SDYLWYT  +++  G+     G+   L + S
Sbjct: 431 SYGEATNSLDETAFTKDGLVEQLSMTWDKSDYLWYTTYVNIDSGEQFLKSGQWPQLTVYS 490

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH+  VFVN +     YG +D      +  +++ +G N + ILS  VGL N G  ++  
Sbjct: 491 AGHSVQVFVNGQYFGNAYGGYDGPKLTYSGYVKMWQGSNKISILSSAVGLPNVGTHYETW 550

Query: 539 GAGLFS-VILIDLKNGKRDLSSGEWIYQV 566
             G+   V L  L  GKRDLS  +W YQV
Sbjct: 551 NIGVLGPVTLSGLNEGKRDLSKQKWTYQV 579


>gi|219887949|gb|ACL54349.1| unknown [Zea mays]
 gi|414870186|tpg|DAA48743.1| TPA: beta-galactosidase [Zea mays]
          Length = 850

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/837 (41%), Positives = 498/837 (59%), Gaps = 58/837 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+ DG R +  SGSIHYPRS P++WPELI K+KEGGL  IETYVFWN HEP +
Sbjct: 43  VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ FEG+ D+VRF + +QE  ++  +R+GP+  AEWN+GG P WL  IP I FRT N P
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K  M+ F+  II  +K  NLFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
           ++ N  +PW+MC+Q  AP  +I TCNG  C G T   P + S P++WTEN++  +  FG 
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNC-GDTWPGPTNKSMPLLWTENWTAQYRVFGD 281

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVARFF  GGT  NYYMY GGTNFGRT+   ++   YD +AP+DE+G 
Sbjct: 282 PPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 340

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
            ++PKWGHLR+LH+A+KLC++ L+   P+ +KLG +LEA ++       C AFL+N+++ 
Sbjct: 341 YKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTK 400

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
            DA +TF G  YF+P  S+S+L DC+ VVF T  V +Q N     FA    Q NV E+  
Sbjct: 401 DDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQTAQNNVWEMFD 460

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EV 472
             +   + + K+ +            +  N TKD +DY+WYT+S  +    MP +   + 
Sbjct: 461 GENVPKYKQAKIRLR--------KAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKT 512

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA++ FVN K V  G+G      F + K ++L +G+N + +L+  +G+ + G
Sbjct: 513 VLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSG 572

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
           A+ +   AG+  V +  L  G  DL++  W + VG+ GE   +       S  WK     
Sbjct: 573 AYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWKPAMN- 631

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
             ++ L WYK  F  P G+ P+ L++++MGKG  +VNGQ IGRYW +Y            
Sbjct: 632 --DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY------------ 677

Query: 653 YRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
                      KH  G+P+Q LYH+PR+++   +N+LV+ EE  G P  I +LT    +I
Sbjct: 678 -----------KHALGRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDAIMILTVKRDNI 726

Query: 712 CSFVSEADPPPVDSWKPNLGVVSSS-------PQVRLACERGWHIAAINFASYGIPEGNC 764
           C+F+SE +P  + SW+     +++         +  LAC     I  + FASYG P G C
Sbjct: 727 CTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLIQQVVFASYGNPAGIC 786

Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G++  G+CH      +V+KAC+G+  C++PV++   G  A  C G    LAV+A CS
Sbjct: 787 GNYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDAN-CSGTTATLAVQAKCS 842


>gi|222640983|gb|EEE69115.1| hypothetical protein OsJ_28192 [Oryza sativa Japonica Group]
          Length = 848

 Score =  659 bits (1699), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/842 (42%), Positives = 490/842 (58%), Gaps = 63/842 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +TYD R+L+IDG R +  SGSIHYPRS P+ WP+LI K+KEGGL VIE+YVFWN HEP +
Sbjct: 33  ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+DL++F K +QE  ++  +RIGP+  AEWN+GG P WL  IP I FRT N P
Sbjct: 93  GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHGGLPYWLREIPDIIFRTNNEP 152

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK+ MK+F+  I++ +K+  LFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A
Sbjct: 153 FKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMA 212

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
           +  NT VPW+MC+Q  AP  +I TCNG +C G T   P    KP++WTEN++  +  FG 
Sbjct: 213 IATNTGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYRVFGD 271

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AF+VARFF  GGT  NYYMY GGTNFGR     ++   YD +AP DE+G 
Sbjct: 272 PPSQRSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRNGAAFVMPRYYD-EAPFDEFGL 330

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSS 360
            ++PKWGHLR+LH A++ C++ L+  +P+ Q LG   EA ++  K  N C AFL+N+++ 
Sbjct: 331 YKEPKWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTK 390

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
            D  VTF G  YF+   S+SIL DCK VVF+T  V SQ N     FA    Q NV E+  
Sbjct: 391 EDGTVTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM-- 448

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV- 472
                 + EEK+      S       EQ N TKD +DYLWYT S  +    +P + KEV 
Sbjct: 449 ------YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFRLETDDLPYR-KEVK 501

Query: 473 -FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L + S GHA + FVN   V  G+G      F + K ++L  G+N + ILS  +GL + 
Sbjct: 502 PVLEVSSHGHAIVAFVNDAFVGCGHGTKINKAFTMEKAMDLKVGVNHVAILSSTLGLMDS 561

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           G++ +   AG+++V +  L  G  DL++  W + VG++GE   +       +  WK G  
Sbjct: 562 GSYLEHRMAGVYTVTIRGLNTGTLDLTTNGWGHVVGLDGERRRVHSEQGMGAVAWKPGKD 621

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
              N+ L WY+  F  P G  P+ ++L  MGKG  +VNG+ +GRYW +Y           
Sbjct: 622 ---NQPLTWYRRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY----------- 667

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
                          G+P+Q LYH+PR+ + P  N L+  EE GG P  I +LT    +I
Sbjct: 668 -----------HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKRDNI 716

Query: 712 CSFVSEADPPPVD-SWKPN-----------LGVVSSSPQVRLACERGWHIAAINFASYGI 759
           C+F++E +P  V  SW+              G     P   L+C     I ++ FASYG 
Sbjct: 717 CTFMTEKNPAHVRWSWESKDSQPKAVAGAGAGAGGFKPTAVLSCPTKKTIQSVVFASYGN 776

Query: 760 PEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
           P G CG++  G+CH      +V+KAC+G+  CS+ VSS   G     CPG    LAV+A 
Sbjct: 777 PLGICGNYTVGSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDV-HCPGTTGTLAVQAK 835

Query: 819 CS 820
           CS
Sbjct: 836 CS 837


>gi|357449773|ref|XP_003595163.1| Beta-galactosidase [Medicago truncatula]
 gi|355484211|gb|AES65414.1| Beta-galactosidase [Medicago truncatula]
          Length = 607

 Score =  658 bits (1697), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/575 (55%), Positives = 403/575 (70%), Gaps = 20/575 (3%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A+VTYDH+A+VI+GKRR+L SGSIHYPRSTP++WP+LI+K+K+GG++VIETYVFWN H
Sbjct: 24  VTASVTYDHKAIVINGKRRILISGSIHYPRSTPQMWPDLIQKAKDGGVDVIETYVFWNGH 83

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP +G+YYFE RFDLV+F+K VQ+AGL++HLRIGPY CAEWN+GGFPVWL ++PG+ FRT
Sbjct: 84  EPSQGKYYFEDRFDLVKFIKVVQQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGVAFRT 143

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK  M++F  KI+ +MK ENLF SQGGPIIL+Q+ENEYG VEW  G  G+ Y KW 
Sbjct: 144 DNEPFKAAMQKFTTKIVSIMKSENLFQSQGGPIILSQIENEYGPVEWEIGAPGKSYTKWF 203

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           +  AV LNT VPWVMC+QEDAPDPII+TCNG+YC+ F+PN   KP MWTEN++GW+  FG
Sbjct: 204 SQMAVGLNTGVPWVMCKQEDAPDPIIDTCNGYYCENFSPNKNYKPKMWTENWTGWYTDFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            AVP+RP EDLAF+VARF +  G++ NYYMY GGTNFGRT+ G  +ATSYDYDAPIDEYG
Sbjct: 264 TAVPYRPAEDLAFSVARFVQNRGSYVNYYMYHGGTNFGRTSSGLFIATSYDYDAPIDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            I +PKWGHLR+LHKAIK CE  L+S DPT    G  LE H+Y  S   CAAFLANYD+ 
Sbjct: 324 LISEPKWGHLRDLHKAIKQCESALVSVDPTVSWPGKNLEVHLYKTSFGACAAFLANYDTG 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S A V F    Y LP WS+SILPDCK  VFNTAKV + R +             +  A+S
Sbjct: 384 SWAKVAFGNGHYDLPPWSISILPDCKTEVFNTAKVRAPRVH-----------RSMTPANS 432

Query: 421 AFSW--YEEKVGISGNR-SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEV 472
           AF+W  Y E+   SG   S+    L EQ++ T D SDYLWY   +++ P +     G+  
Sbjct: 433 AFNWQSYNEQPAFSGESGSWTANGLLEQLSQTWDKSDYLWYMTDVNISPNEGFIKNGQNP 492

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L   S GH   VF+N +     YG+ D      +  ++L  G N + +LS+ VGL N G
Sbjct: 493 VLTAMSAGHVLHVFINGQFWGTAYGSLDNPKLTFSNSVKLRVGNNKISLLSVAVGLSNVG 552

Query: 533 AWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQV 566
             ++    G+   V L  L  G RDLS  +W Y+V
Sbjct: 553 VHYEKWNVGVLGPVTLKGLNEGTRDLSKQKWSYKV 587


>gi|357142200|ref|XP_003572492.1| PREDICTED: beta-galactosidase 11-like [Brachypodium distachyon]
          Length = 823

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/838 (42%), Positives = 492/838 (58%), Gaps = 59/838 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T+D R+L++DG+R +  SGSIHYPRS P +WP+LI ++KEGGL VIE+YVFWN HEP  
Sbjct: 15  ITFDRRSLMVDGRRDLFFSGSIHYPRSPPHMWPDLIARAKEGGLNVIESYVFWNGHEPEM 74

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+D+++F K VQE  +F  +RIGP+  AEWN+GG P WL  +P I FRT N P
Sbjct: 75  GVYNFEGRYDMIKFFKLVQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIIFRTNNEP 134

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK+ M++F+  I++ +K   LFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A
Sbjct: 135 FKKHMQKFVTMIVNKLKDAKLFASQGGPIILAQIENEYQHLEAAFKENGTTYIHWAAKMA 194

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
            +LN  VPW+MC+Q  AP  +I TCNG +C G T   P   +KP++WTEN++  +  FG 
Sbjct: 195 SDLNIGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPTDKNKPLLWTENWTAQYRVFGD 253

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVARF+  GGT  NYYMY GGTNFGRT    ++   YD +AP+DE+G 
Sbjct: 254 PPSQRSAEDIAFAVARFYSVGGTMVNYYMYHGGTNFGRTGASFVMPRYYD-EAPLDEFGL 312

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
            ++PKWGHLR+LH A++LC++ ++  +P++Q LG   EA ++       C AFL+N+++ 
Sbjct: 313 YKEPKWGHLRDLHHALRLCKKAILWGNPSNQPLGKLYEARLFEIPEQKICVAFLSNHNTK 372

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK---NVNELLL 417
            D  VTF G  YF+P  SVSIL DCK VVF+T  V SQ N     F+ Q    NV E+  
Sbjct: 373 EDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNSQHNQRTFHFSDQTVQGNVWEMYT 432

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEVF 473
            S     Y+       N    +P   E  N TKD +DY+WYT S  +    +P + K+++
Sbjct: 433 ESDKVPTYK-----FTNIRTQKP--LEAYNLTKDKTDYVWYTTSFKLEAEDLPFR-KDIW 484

Query: 474 --LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L + S GHA + FVN K V  G+G      F + K IE+  GIN + ILS  +G+Q+ 
Sbjct: 485 PVLEVSSHGHAMVAFVNGKYVGAGHGTKINKAFTMEKPIEVRTGINHVSILSTTLGMQDS 544

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           G + +   AG+  V +  L  G  DL+S  W + VG+EGE          +   W     
Sbjct: 545 GVYLEHRQAGIDGVTIQGLNTGTLDLTSNGWGHLVGLEGERRNAHTEKGGDGVQWVPAV- 603

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
              ++ L WY+  F  P G  P+ ++++ MGKG  +VNG+ +GRYWS+Y           
Sbjct: 604 --FDRPLTWYRRRFDIPTGDDPVVIDMSPMGKGVLYVNGEGLGRYWSSY----------- 650

Query: 652 DYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQ 709
                       KH  G+P+Q LYH+PR ++ P  N++ I  EE GG P  I +LT    
Sbjct: 651 ------------KHALGRPSQYLYHVPRCFLKPTGNVMTIFEEEGGGQPDGIMILTVKRD 698

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSS------SPQVRLACERGWHIAAINFASYGIPEGN 763
           +ICSF+SE +P  V SW+     + S       PQ  L+C     I  + FASYG P G 
Sbjct: 699 NICSFISEKNPAHVKSWERKDSHLKSVADADLKPQAVLSCPEKKLIQQVVFASYGNPLGI 758

Query: 764 CGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CG++  G CH      IV+KACVG+  C + VS    G     CPG    LAV+A CS
Sbjct: 759 CGNYTVGNCHAPKAKEIVEKACVGKKSCVLQVSHEVYGADLN-CPGSTGTLAVQAKCS 815


>gi|326520333|dbj|BAK07425.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 841

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 367/838 (43%), Positives = 491/838 (58%), Gaps = 62/838 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +TYD R+L+IDG+R +  SGSIHYPRS    WP+LI ++KEGGL VIE+YVFWN HEP  
Sbjct: 36  ITYDRRSLMIDGRREIFFSGSIHYPRSPFHEWPDLIARAKEGGLNVIESYVFWNIHEPEM 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y FEGR+D+++F K +QE  +F  +RIGP+  AEWN+GG P WL  +P I FRT N P
Sbjct: 96  GVYNFEGRYDMIKFFKLIQEHEMFAMVRIGPFVQAEWNHGGLPYWLREVPDIVFRTDNEP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+ M++F+  +++ +K   LFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A
Sbjct: 156 YKKLMQKFVTLVVNKLKDAKLFASQGGPIILAQIENEYQHMEAAFKENGTRYIDWAAKMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
           ++ +T VPW+MC+Q  AP  +I TCNG +C G T   P   +KP++WTEN++  +  FG 
Sbjct: 216 ISTSTGVPWIMCKQTKAPAEVIPTCNGRHC-GDTWPGPTDKNKPLLWTENWTAQYRVFGD 274

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVARFF  GG+  NYYMY GGTNFGRT G   V   Y  +AP+DE+G 
Sbjct: 275 PPSQRSAEDIAFAVARFFSVGGSMVNYYMYHGGTNFGRT-GASFVMPRYYDEAPLDEFGM 333

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
            ++PKWGHLR+LH A++LC++ L+  +P+ Q LG   EA ++       C AFL+N+++ 
Sbjct: 334 YKEPKWGHLRDLHHALRLCKKALLRGNPSTQPLGKLYEARLFEIPEQKVCVAFLSNHNTK 393

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
            D  VTF G  YF+P  SVSIL DCK VVF+T  V +Q N           Q NV E+  
Sbjct: 394 EDGTVTFRGQQYFVPRRSVSILADCKTVVFSTQHVNAQHNQRTFHLTDQTLQNNVWEMYT 453

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEV 472
                  Y+     + +RS  +P   E  N TKD +DYLWYT S  +    +P  Q  + 
Sbjct: 454 EGDKVPTYK----FTTDRS-EKP--LEAYNMTKDKTDYLWYTTSFKLEAEDLPFRQDIKP 506

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L   S GHA + FVN KLV   +G      F + K IE+  GIN + ILS  +GLQ+ G
Sbjct: 507 VLEASSHGHAMVAFVNGKLVGAAHGTKMNKAFSLEKPIEVRAGINHVSILSSTLGLQDSG 566

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGS 590
           A+ +   AG+ SV +  L  G  DLSS  W + VG++GE     +DK        WK   
Sbjct: 567 AYLEHRQAGVHSVTIQGLNTGTLDLSSNGWGHIVGLDGERKQAHMDK---GGEVQWKPAV 623

Query: 591 -TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
             LP    L WY+  F  P G+ P+ ++L  MGKG  +VNG+ +GRYWS+Y         
Sbjct: 624 FDLP----LTWYRRRFDMPSGEDPVVIDLNPMGKGILFVNGEGLGRYWSSY--------- 670

Query: 650 KCDYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
                         KH  G+P+Q LYH+PR ++ P  N+L I EE GG P  I +LT   
Sbjct: 671 --------------KHALGRPSQYLYHVPRCFLKPTGNVLTIFEEEGGRPDAIMILTVKR 716

Query: 709 QHICSFVSEADPPPVDSWK---PNLGVVSSS--PQVRLACERGWHIAAINFASYGIPEGN 763
            +ICSF+SE +P  V SW+     L VV+    P+  L C     I  + FASYG P G 
Sbjct: 717 DNICSFISEKNPGHVRSWERKDSQLTVVADDLKPRAVLTCPEKKTIQQVVFASYGNPLGI 776

Query: 764 CGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           CG++  G CH      +V+KACVG+  C + VS    G     CPG    LAV+A CS
Sbjct: 777 CGNYTVGNCHTPKAKEVVEKACVGKKSCVLAVSHEVYGGDLN-CPGTTATLAVQAKCS 833


>gi|218188392|gb|EEC70819.1| hypothetical protein OsI_02284 [Oryza sativa Indica Group]
          Length = 837

 Score =  653 bits (1685), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/734 (46%), Positives = 455/734 (61%), Gaps = 29/734 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +V+YD R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETY+FWN HEP
Sbjct: 29  TSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            R QY FEG +D+VRF K +Q AG++  LRIGPY C EWNYGG P WL  IPG+QFR  N
Sbjct: 89  HRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWA 180
            PF+ EM+ F   I++ MK   +FA QGGPIILAQ+ENEYGN+  +         Y+ W 
Sbjct: 149 EPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWC 208

Query: 181 ADTAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           AD A   N  VPW+MCQQ +D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++
Sbjct: 209 ADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAW 268

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
                 R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEY
Sbjct: 269 DKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEY 328

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +RQPK+GHL+ELH  +K  E+ L+  +      G  +    Y   S+  A F+ N   
Sbjct: 329 GNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSS-ACFINNRFD 387

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
             D NVT +G  + LPAWSVSILPDCK V FN+AK+ +Q +       ++ N  E    S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTS----VMVKKPNTAEQEQES 443

Query: 420 SAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
             +SW  E +         +F + +L EQI T+ D SDYLWY  S++   G+G    L +
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLN-HKGEGS-YKLYV 501

Query: 477 ESLGHAALVFVNKKLVAFGY-GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
            + GH    FVN KL+   +  + DF  F +   ++L++G N + +LS  VGL+NYG  F
Sbjct: 502 NTTGHELYAFVNGKLIGKNHSADGDFV-FQLESPVKLHDGKNYISLLSATVGLKNYGPSF 560

Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWK-QGS 590
           +    G+    V LID      DLS+  W Y+ G+  EY  I LDK        W     
Sbjct: 561 EKMPTGIVGGPVKLIDSNGTAIDLSNSSWSYKAGLASEYRQIHLDKPGYK----WNGNNG 616

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           T+P+N+   WYK TF AP G+  + ++L  + KG AWVNG ++GRYW +Y A       +
Sbjct: 617 TIPINRPFTWYKATFEAPSGEDAVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHR 676

Query: 651 CDYRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLT 705
           CDYRG++    D ++C   CG+P+Q  YH+PR+++  GE N L++ EE GGDPS ++L T
Sbjct: 677 CDYRGAFQAEGDGTRCLTGCGEPSQRYYHVPRSFLAAGEPNTLLLFEEAGGDPSGVALRT 736

Query: 706 KTGQHICSFVSEAD 719
                +C+     D
Sbjct: 737 VVPGPVCTSGEAGD 750


>gi|290782382|gb|ADD62393.1| beta-galactosidase 3 [Prunus persica]
          Length = 683

 Score =  653 bits (1685), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/688 (49%), Positives = 443/688 (64%), Gaps = 31/688 (4%)

Query: 146 FASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPI 205
           FASQGGPIIL+Q+ENEYG    A G  G  Y+ WAA  AV L+T VPWVMC+++DAPDP+
Sbjct: 2   FASQGGPIILSQIENEYGPESKALGAAGHAYINWAAKMAVALDTGVPWVMCKEDDAPDPM 61

Query: 206 INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
           IN CNGFYCDGF+PN P KP MWTE +SGWF  FG  +  RPV+DLAF+VARF + GG++
Sbjct: 62  INACNGFYCDGFSPNKPYKPTMWTEAWSGWFTEFGGTIHHRPVQDLAFSVARFIQKGGSY 121

Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
            NYYMY GGTNFGRTAGGP + TSYDYD PIDEYG IRQPK+GHL+ELHKAIKLCE  L+
Sbjct: 122 INYYMYHGGTNFGRTAGGPFITTSYDYDVPIDEYGLIRQPKYGHLKELHKAIKLCEHALV 181

Query: 326 SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
           SSDPT   LGA  +A++++     CAAFL+N+ S+  A +TFN   Y LPAWS+SILPDC
Sbjct: 182 SSDPTVTSLGAYQQAYVFNSGPRRCAAFLSNFHSTG-ARMTFNNMHYDLPAWSISILPDC 240

Query: 386 KNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRP-DL 442
           +NVVFNTAKV            Q   V  +   S  FSW  Y+E V     RS +    L
Sbjct: 241 RNVVFNTAKV----------GVQTSRVQMIPTNSRLFSWQTYDEDVSSLHERSSIAAGGL 290

Query: 443 AEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNIESLGHAALVFVNKKLVAFGYGNH 499
            EQIN T+DTSDYLWY  ++ +   +   GK+  L ++S GHA  VFVN +     +G  
Sbjct: 291 LEQINVTRDTSDYLWYMTNVDISSSELRGGKKPTLTVQSAGHALHVFVNGQFSGSAFGTR 350

Query: 500 DFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLS 558
           +   F   K + L  GIN + +LS+ VGL N G  ++    G+   + +D L  G++DL+
Sbjct: 351 EHRQFTFAKPVHLRAGINKIALLSIAVGLPNVGLHYESWKTGILGPVFLDGLGQGRKDLT 410

Query: 559 SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK-SLIWYKTTFLAPEGKGPLALN 617
             +W  +VG++GE + L   +  +S  W +GS     K +L WYK  F AP G  PLAL+
Sbjct: 411 MQKWFNKVGLKGEAMDLVSPNGGSSVDWIRGSLATQTKQTLKWYKAYFNAPGGDEPLALD 470

Query: 618 LASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIP 677
           + SMGKGQ W+NGQSIG+YW AY   + G    C Y G++  +KCQ  CGQP Q  YH+P
Sbjct: 471 MRSMGKGQVWINGQSIGKYWMAY---ANGDCSLCSYIGTFRPTKCQLGCGQPTQRWYHVP 527

Query: 678 RTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPP----VDSWKPNLGVV 733
           R+W+ P +NL+V+ EELGGDPSKI+L+ ++   +C+ + E  P      +DS + +  + 
Sbjct: 528 RSWLKPTQNLVVVFEELGGDPSKITLVKRSVAGVCADLQEHHPNAEKLDIDSHEESKTLH 587

Query: 734 SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSI 792
            +  QV L C  G  I++I FAS+G P G CGSF+ G CH  +   IV+K C+G+  C +
Sbjct: 588 QA--QVHLQCVPGQSISSIKFASFGTPTGTCGSFQQGTCHATNSHAIVEKNCIGRESCLV 645

Query: 793 PVSSAYLGVSAGACPGLLKALAVEAHCS 820
            VS++  G     CP +LK L+VEA CS
Sbjct: 646 TVSNSIFGTD--PCPNVLKRLSVEAVCS 671


>gi|147843477|emb|CAN82062.1| hypothetical protein VITISV_016430 [Vitis vinifera]
          Length = 773

 Score =  653 bits (1684), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/832 (43%), Positives = 480/832 (57%), Gaps = 101/832 (12%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T D R ++I+G+R++L SGS+HYPRSTPE+WP+LI+KSK+GGL  I+TYVFW+ HEP R
Sbjct: 26  ITSDARGIMINGERKILISGSVHYPRSTPEMWPDLIQKSKDGGLNTIDTYVFWDLHEPQR 85

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G  DLVRF+K +Q  GL+  LRIGPY CAEW YGGFPVWLH  P IQ RT N  
Sbjct: 86  RQYDFTGNKDLVRFIKAIQAQGLYAVLRIGPYVCAEWTYGGFPVWLHNQPSIQLRTNNTV 145

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +                                +ENEYGNV  AY   G  Y+ W A  A
Sbjct: 146 Y-------------------------------MIENEYGNVMRAYHDAGVQYINWCAQMA 174

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
             L+T VPW+MCQQ++AP P+INTCNG+YCD FTPN+P+ P MWTEN+SGW+ ++G + P
Sbjct: 175 AALDTGVPWIMCQQDNAPQPMINTCNGYYCDQFTPNNPNSPKMWTENWSGWYKNWGGSDP 234

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  EDLAF+VARF++ GGTFQNYYMY GGTNFGRTAGGP + TSYDYDAP++EYG   Q
Sbjct: 235 HRTAEDLAFSVARFYQLGGTFQNYYMYHGGTNFGRTAGGPYITTSYDYDAPLNEYGNKNQ 294

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDA 363
           PKWGHLR+LH  +   E+ L   D  +        A IY ++  + C  F  N ++  D 
Sbjct: 295 PKWGHLRDLHLLLLSMEKALTYGDVKNVDYETLTSATIYSYQGKSSC--FFGNSNADRDV 352

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            + + G  Y +PAWSVSILPDC N V+NTAKV SQ +     F ++ +  E    S  ++
Sbjct: 353 TINYGGVNYTIPAWSVSILPDCSNEVYNTAKVNSQYST----FVKKGSEAENEPNSLQWT 408

Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAA 483
           W  E +       ++ P   +  N      D +W           GK++ L++ + GH  
Sbjct: 409 WRGETI------QYITPGSVDISN-----DDPIW-----------GKDLTLSVNTSGHIL 446

Query: 484 LVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF 543
             FVN + + + Y       F   + I L  G N + +LS+ VGL NYG  FD+   G+ 
Sbjct: 447 HAFVNGEHIGYQYALLGQFEFQFRRSITLQLGKNEITLLSVTVGLTNYGPDFDMVNQGIH 506

Query: 544 SVILIDLKNGKRDL-----SSGEWIYQVGVEGEYIGLDKISLANSSF--WKQGSTLPVNK 596
             + I   NG  D+     ++ +W Y+ G+ GE     KI L  + +  WK    LPVN+
Sbjct: 507 GPVQIIASNGSADIIKDLSNNNQWAYKAGLNGE---DKKIFLGRARYNQWKS-DNLPVNR 562

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
           S +WYK TF AP G+ P+ ++L  +GKG+AWVNG S+GRYW +Y+A   GC+ +CDYRG 
Sbjct: 563 SFVWYKATFDAPPGEDPVVVDLMGLGKGEAWVNGHSLGRYWPSYIARGEGCSPECDYRGP 622

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
           Y A KC  +CG P+Q  YH+PR+++   +N LV+ EE  G+PS ++  T T  + C+   
Sbjct: 623 YKAEKCNTNCGNPSQRWYHVPRSFLASTDNRLVLFEEFXGNPSSVTFQTVTVGNACANAR 682

Query: 717 EADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGS--------FR 768
           E                     + L+C+ G  I+ I FAS+G P+G CG         F 
Sbjct: 683 EG------------------YTLELSCQ-GRAISXIKFASFGDPQGTCGKPFATGSQVFE 723

Query: 769 PGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G C   D L I+QK CVG+  CSI VS   LG     C    K LAVEA C
Sbjct: 724 KGTCEAADSLSIIQKLCVGKYSCSIDVSEQILG--PAGCTADTKRLAVEAIC 773


>gi|357464799|ref|XP_003602681.1| Beta-galactosidase [Medicago truncatula]
 gi|355491729|gb|AES72932.1| Beta-galactosidase [Medicago truncatula]
          Length = 628

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/609 (55%), Positives = 410/609 (67%), Gaps = 14/609 (2%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           + +NV+YD R+L+IDG+R++L S SIHYPRS P +WP LI+ +KEGG++VIETYVFWN H
Sbjct: 23  VGSNVSYDGRSLIIDGQRKLLISASIHYPRSVPAMWPALIQTAKEGGIDVIETYVFWNGH 82

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           E   G YYF GRFDLV+F K VQ+AG++L LRIGP+  AEWN+GG PVWLH+IPG  FRT
Sbjct: 83  ELSPGNYYFGGRFDLVQFAKVVQDAGMYLILRIGPFVAAEWNFGGVPVWLHYIPGTVFRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PF   M++F   I++LMK+E LFASQGGPIIL+Q+ENEYG  E  Y   G+ Y  WA
Sbjct: 143 YNQPFMHHMEKFTTYIVNLMKKEKLFASQGGPIILSQIENEYGYYENYYKEDGKKYALWA 202

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV+ NTSVPW+MCQQ DAPDP+I+TCN FYCD FTP SP +P MWTEN+ GWF +FG
Sbjct: 203 AKMAVSQNTSVPWIMCQQWDAPDPVIDTCNSFYCDQFTPTSPKRPKMWTENWPGWFKTFG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
              P RPVED+AF+VARFF+ GG+  NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG
Sbjct: 263 GRDPHRPVEDVAFSVARFFQKGGSLNNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             R PKWGHL+ELHKAIKLCE  L+     +  LG  +EA IY  SS  CAAF++N D  
Sbjct: 323 LPRLPKWGHLKELHKAIKLCEHVLLYGKSVNISLGPSVEADIYTDSSGACAAFISNVDDK 382

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQQKNVNELL 416
           +D  V F    Y LPAWSVSILPDCKNVVFNTAKV S  N      +H   QQ +  +  
Sbjct: 383 NDKKVVFRNASYHLPAWSVSILPDCKNVVFNTAKVSSPTNIVAMIPEH--LQQSDKGQKT 440

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKE 471
           L    F   +E  GI G   FV+    + INTTKDT+DYLW+T SI +   +     G +
Sbjct: 441 LKWDVF---KENPGIWGKADFVKNGFVDHINTTKDTTDYLWHTTSILIDANEEFLKKGSK 497

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
             L IES GH    FVN+K    G GN   + F     I L  G N + ILS+ VGLQ  
Sbjct: 498 PALLIESKGHTLHAFVNQKYQGTGTGNGSHSAFTFKNPISLRAGKNEIAILSLTVGLQTA 557

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST 591
           G ++D  GAG+ SV +I L N   DLSS  W Y++GV GE++ + +    NS  W   S 
Sbjct: 558 GPFYDFIGAGVTSVKIIGLNNRTIDLSSNAWAYKIGVLGEHLSIYQGEGMNSVKWTSTSE 617

Query: 592 LPVNKSLIW 600
            P  ++L W
Sbjct: 618 PPKGQALTW 626


>gi|75116245|sp|Q67VU7.1|BGL10_ORYSJ RecName: Full=Putative beta-galactosidase 10; Short=Lactase 10;
           Flags: Precursor
 gi|51535501|dbj|BAD37397.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|51535704|dbj|BAD37722.1| putative beta-galactosidase [Oryza sativa Japonica Group]
          Length = 809

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/831 (42%), Positives = 475/831 (57%), Gaps = 68/831 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31  VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G +D+VRF K +Q AGL+  LRIGPY C EWNYGG P WL  IPG+QFR  N P
Sbjct: 91  RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   I++ MK  N+FA QGGPIILAQ+ENEYGN+  +         Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+                     GGP + TSYDYDAP+DEYG 
Sbjct: 271 PDFHRSAEDIAFAVAMFFQ-------------------KRGGPYITTSYDYDAPLDEYGN 311

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +        K+    Y   S   A F+ N + + 
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTS-ACFINNRNDNM 370

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +G  + LPAWSVSILPDCK V FN+AK+ +Q          +  + E    S  
Sbjct: 371 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----VMVNKAKMVEKEPESLK 426

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  SI+        +F+N  +
Sbjct: 427 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGEASYTLFVN--T 484

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +    +L++G N + +LS  +GL+NYG  F+  
Sbjct: 485 TGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEKM 544

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK      ++     T+P+
Sbjct: 545 PAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDK---PGCTWDNNNGTVPI 601

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
           NK   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A   G    CDYR
Sbjct: 602 NKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMGGCHHCDYR 661

Query: 655 GSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKTGQ 709
           G +    D  KC   CG+P+Q  YH+PR+++  GE N +++ EE GGDPS +S  T    
Sbjct: 662 GVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTVAAG 721

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGSFR 768
            +C+     D                   + L+C +    I+AIN  S+G+  G CG+++
Sbjct: 722 SVCASAEVGD------------------TITLSCGQHSKTISAINVTSFGVARGQCGAYK 763

Query: 769 PGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G           +AC+G+  C++ +++A   V+   C  L   L V+A C
Sbjct: 764 GGCESKAAYKAFTEACLGKESCTVQITNA---VTGSGC--LSNVLTVQASC 809


>gi|356518798|ref|XP_003528064.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
          Length = 717

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 335/705 (47%), Positives = 451/705 (63%), Gaps = 37/705 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDG+R++L SGSIHYPRSTP++WP+LI K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHEPQP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y F GR+DLV F+K +Q  GL++ LRIGP+  +EW YGGFP WLH +PGI +RT N P
Sbjct: 87  GMYDFSGRYDLVGFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPGIVYRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+++MK+E L+ASQGGPIIL+Q+ENEY N++ A+G  G  YV+WAA  A
Sbjct: 147 FKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWAAKMA 206

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L+T VPW+MC+Q DAPDP+INTCNG  C + FT PNSP+KP +WTEN++ ++  +G  
Sbjct: 207 VGLDTGVPWIMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQVYGGL 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF V  F    G++ NYYMY GGTNFGRT G   V T Y   AP+DEYG +
Sbjct: 267 PYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRT-GSAYVITGYYDQAPLDEYGLL 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+ IK C   L+     +  LG  LE +++ +   +C AFL N D  + 
Sbjct: 326 RQPKWGHLKQLHEVIKSCSTTLLQGVQRNFTLGQLLEVYVFEEEKGECVAFLINNDRDNK 385

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F  + Y L   S+SILPDC+NV F+TA V +  N      + ++N       SS  
Sbjct: 386 ATVQFRNSSYELLPKSISILPDCQNVTFSTANVNTTSNR--RIISPKQNF------SSVD 437

Query: 423 SWYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            W + +  IS   N S     L EQ+NTTKD SDYLWYT          K   L+++S  
Sbjct: 438 DWQQFQDVISNFDNTSLKSDSLLEQMNTTKDKSDYLWYTLRFEYNLSCSKPT-LSVQSAA 496

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           H A  FVN   +   +GNHD  +F +   + +N+G N L ILS+MVGL + GA+ +   A
Sbjct: 497 HVAHAFVNNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSVMVGLPDSGAFLERRFA 556

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
           GL SV L   +    +L++  W YQVG+ GE + + K    + + W Q   + + ++L W
Sbjct: 557 GLISVELQCSEQESLNLTNSTWGYQVGLMGEQLQVYKEQNNSDTGWSQLGNV-MEQTLFW 615

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKTTF  PEG  P+ L+L+SMGKG+AWVNG+SIGRYW  +                +D+ 
Sbjct: 616 YKTTFDTPEGDDPVVLDLSSMGKGEAWVNGESIGRYWILF----------------HDSK 659

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                 G P+Q+LYH+PR+++    N+LV+ EE GG+P  ISL T
Sbjct: 660 ------GNPSQSLYHVPRSFLKDSGNVLVLLEEGGGNPLGISLDT 698


>gi|296082606|emb|CBI21611.3| unnamed protein product [Vitis vinifera]
          Length = 729

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/709 (48%), Positives = 443/709 (62%), Gaps = 34/709 (4%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYD R+L+IDG R++L SGSIHYPRSTP++W  LI K+KEGG++VI+TYVFWN HEP
Sbjct: 24  AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQY F GR+DL +F+K +Q  GL+  LRIGP+  +EW+YGG P WLH + GI +RT N
Sbjct: 84  QPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI++LMK E L+ASQGGPIIL+Q+ENEY N+E A+   G  YV+WAA 
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L T VPWVMC+Q DAPDP+INTCNG  C   FT PNSP+KP MWTEN++ ++  FG
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+AF VA F    G++ NYYMY GGTNFGR A    + TSY   AP+DEYG
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGR-ASSAYIKTSYYDQAPLDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPKWGHL+ELH AI LC   L++   ++  LG   EA+++ +    C AFL N D  
Sbjct: 323 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 382

Query: 361 SDANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
           +++ V F N ++  LP  S+SILPDCKNV+FNTAKV S      +   Q+ + + +    
Sbjct: 383 NNSTVLFQNVSIELLPK-SISILPDCKNVIFNTAKVCSSSRQSAYKI-QELSRSCIQSFD 440

Query: 420 SAFSWYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLNI 476
           +   W E K  I    + S     + E +N TKD SDYLWYT      P     E  L+I
Sbjct: 441 AVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT--FRFQPNSSCTEPLLHI 498

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           ESL HA   FVN   V   +G+HD   F     I LN  +N + ILS+MVG  + GA+ +
Sbjct: 499 ESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSGAYLE 558

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
              AGL  V +   + G  D ++  W YQVG+ GE + + K    ++  W++ + +  N+
Sbjct: 559 SRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRK-TEISTNQ 617

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYK  F  P G  P+ALNL++MGKG+AWVNGQSIGRYW                  S
Sbjct: 618 PLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-----------------S 660

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
           +  SK     G P+QTLYH+PR ++   ENLLV+ EE  GDP  ISL T
Sbjct: 661 FHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 704


>gi|30679742|ref|NP_179264.2| beta-galactosidase 13 [Arabidopsis thaliana]
 gi|75265629|sp|Q9SCU9.1|BGL13_ARATH RecName: Full=Beta-galactosidase 13; Short=Lactase 13; Flags:
           Precursor
 gi|6686898|emb|CAB64749.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|330251438|gb|AEC06532.1| beta-galactosidase 13 [Arabidopsis thaliana]
          Length = 848

 Score =  642 bits (1657), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 348/834 (41%), Positives = 497/834 (59%), Gaps = 54/834 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD  +L+I+G R +L SGSIHYPRSTPE+WP +I+++K+GGL  I+TYVFWN HEP +
Sbjct: 44  VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ F GR DLV+F+K +++ GL++ LR+GP+  AEW +GG P WL  +PGI FRT N P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNEP 163

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FKE  +R++  ++D+MK+E LFASQGGPIIL Q+ENEY  V+ AY   G  Y+KWA+   
Sbjct: 164 FKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
            +++  +PWVMC+Q DAPDP+IN CNG +C D F  PN  +KP +WTEN++  F  FG  
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGDP 283

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R VED+A++VARFF   GT  NYYMY GGTNFGRT+   +    YD DAP+DE+G  
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEFGLE 342

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
           R+PK+GHL+ LH A+ LC++ L+   P  +K   + E   Y +     CAAFLAN ++ +
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTEA 402

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              + F G  Y +P  S+SILPDCK VV+NT ++IS   + +  F + K  N+    +  
Sbjct: 403 AEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRN--FMKSKKANK----NFD 456

Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFL 474
           F  + E V   I G+ SF+  +L      TKD SDY WYT S  +        +G +  L
Sbjct: 457 FKVFTESVPSKIKGD-SFIPVEL---YGLTKDESDYGWYTTSFKIDDNDLSKKKGGKPNL 512

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I SLGHA  V++N + +  G+G+H+  +F+  K + L EG N L +L ++ G  + G++
Sbjct: 513 RIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDSGSY 572

Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
            +    G  SV ++ L +G  DL+   +W  +VG+EGE +G+          W++ S   
Sbjct: 573 MEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKKVKWEKASG-- 630

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
               + WY+T F APE +   A+ +  MGKG  WVNG+ +GRYW ++L+P          
Sbjct: 631 KEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP---------- 680

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
                        GQP Q  YHIPR+++ P +NLLVI  EE    P  I  +      +C
Sbjct: 681 ------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIVNRDTVC 728

Query: 713 SFVSEADPPPVDSW-KPNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
           S++ E   P V  W + N  V + +  V L     C     I+A+ FAS+G P G CG+F
Sbjct: 729 SYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKKISAVEFASFGNPNGTCGNF 788

Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
             G+C+  V   +V+K C+G+ EC IPV+ S +      +CP + K LAV+  C
Sbjct: 789 TLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFEQDKKDSCPKVEKKLAVQVKC 842


>gi|4581116|gb|AAD24606.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 832

 Score =  642 bits (1655), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/835 (41%), Positives = 498/835 (59%), Gaps = 54/835 (6%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           ++TYD  +L+I+G R +L SGSIHYPRSTPE+WP +I+++K+GGL  I+TYVFWN HEP 
Sbjct: 27  SITYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPE 86

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +G++ F GR DLV+F+K +++ GL++ LR+GP+  AEW +GG P WL  +PGI FRT N 
Sbjct: 87  QGKFNFSGRADLVKFIKLIEKNGLYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNE 146

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFKE  +R++  ++D+MK+E LFASQGGPIIL Q+ENEY  V+ AY   G  Y+KWA+  
Sbjct: 147 PFKEHTERYVKVVLDMMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKL 206

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGY 241
             +++  +PWVMC+Q DAPDP+IN CNG +C D F  PN  +KP +WTEN++  F  FG 
Sbjct: 207 VHSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKDNKPSLWTENWTTQFRVFGD 266

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R VED+A++VARFF   GT  NYYMY GGTNFGRT+   +    YD DAP+DE+G 
Sbjct: 267 PPAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEFGL 325

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
            R+PK+GHL+ LH A+ LC++ L+   P  +K   + E   Y +     CAAFLAN ++ 
Sbjct: 326 EREPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTE 385

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           +   + F G  Y +P  S+SILPDCK VV+NT ++IS   + +  F + K  N+    + 
Sbjct: 386 AAEKIKFRGKEYLIPHRSISILPDCKTVVYNTGEIISHHTSRN--FMKSKKANK----NF 439

Query: 421 AFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVF 473
            F  + E V   I G+ SF+  +L      TKD SDY WYT S  +        +G +  
Sbjct: 440 DFKVFTESVPSKIKGD-SFIPVEL---YGLTKDESDYGWYTTSFKIDDNDLSKKKGGKPN 495

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I SLGHA  V++N + +  G+G+H+  +F+  K + L EG N L +L ++ G  + G+
Sbjct: 496 LRIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPVTLKEGENHLTMLGVLTGFPDSGS 555

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
           + +    G  SV ++ L +G  DL+   +W  +VG+EGE +G+          W++ S  
Sbjct: 556 YMEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGERLGIHAEEGLKKVKWEKASG- 614

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
                + WY+T F APE +   A+ +  MGKG  WVNG+ +GRYW ++L+P         
Sbjct: 615 -KEPGMTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP--------- 664

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHI 711
                         GQP Q  YHIPR+++ P +NLLVI  EE    P  I  +      +
Sbjct: 665 -------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIVNRDTV 711

Query: 712 CSFVSEADPPPVDSW-KPNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGS 766
           CS++ E   P V  W + N  V + +  V L     C     I+A+ FAS+G P G CG+
Sbjct: 712 CSYIGENYTPSVRHWTRKNDQVQAITDDVHLTANLKCSGTKKISAVEFASFGNPNGTCGN 771

Query: 767 FRPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
           F  G+C+  V   +V+K C+G+ EC IPV+ S +      +CP + K LAV+  C
Sbjct: 772 FTLGSCNAPVSKKVVEKYCLGKAECVIPVNKSTFEQDKKDSCPKVEKKLAVQVKC 826


>gi|297798422|ref|XP_002867095.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
 gi|297312931|gb|EFH43354.1| beta-galactosidase 11 [Arabidopsis lyrata subsp. lyrata]
          Length = 844

 Score =  641 bits (1653), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/834 (41%), Positives = 492/834 (58%), Gaps = 54/834 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD  +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL  I+TYVFWN HEP +
Sbjct: 40  VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 99

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ F GR DLV+F+K +++ G+++ LR+GP+  AEW +GG P WL  +PGI FRT N P
Sbjct: 100 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNKP 159

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FKE  +R++  I+D MK+E LFASQGGPIIL Q+ENEY  V+ AY   G  Y+KWA+   
Sbjct: 160 FKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWASKLV 219

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
            ++   +PWVMC+Q DAPDP+IN CNG +C D F  PN  +KP +WTEN++  F  FG  
Sbjct: 220 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVFGDP 279

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R VED+A++VARFF   G+  NYYMY GGTNFGRT+   +    YD DAP+DEYG  
Sbjct: 280 PTQRSVEDIAYSVARFFSKNGSHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEYGLE 338

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           R+PK+GHL+ LH A+ LC++ L+   P  +K G   E   Y +  +  CAAFLAN ++ +
Sbjct: 339 REPKYGHLKHLHSALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 398

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              + F G  Y +   S+SILPDCK VV+NTA+++SQ  + +  F + K  N+       
Sbjct: 399 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRN--FMKSKKANKKF----D 452

Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-----HVMPGQGKEVFL 474
           F  + E +   + GN S++  +L      TKD +DY WYT S      H+   +G + F+
Sbjct: 453 FKVFTETLPSKLEGN-SYIPVEL---YGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFV 508

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I SLGHA  +++N + +  G+G+H+  +F+  K++ L  G N L +L ++ G  + G++
Sbjct: 509 RIASLGHALHIWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLIMLGVLTGFPDSGSY 568

Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
            +    G   V ++ L +G  DL+ S +W  ++G+EGE +G+          WK+ +   
Sbjct: 569 MEHRYTGPRGVSILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGKA 628

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
               L WY+  F APE     A+ +  MGKG  WVNG+ +GRYW ++L+P          
Sbjct: 629 --PGLTWYQAYFDAPESLNAAAIRMNGMGKGLIWVNGEGVGRYWQSFLSP---------- 676

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
                        GQP Q  YHIPR+++ P +NLLVI  EE    P  +  +      +C
Sbjct: 677 ------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFVIVNRDTVC 724

Query: 713 SFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
           S+V E   P V  W      V +     S    L C     IAA+ FAS+G P G CG+F
Sbjct: 725 SYVGENYTPSVRHWTRKQDQVQAITDNVSLTATLKCSGTKKIAAVEFASFGNPIGVCGNF 784

Query: 768 RPGACHMDVLP-IVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
             G C+  V   +++K C+G+ EC IPV+ S +      +C  + K LAV+  C
Sbjct: 785 TLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVAKTLAVQVKC 838


>gi|225438369|ref|XP_002274012.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
          Length = 758

 Score =  640 bits (1652), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/713 (48%), Positives = 442/713 (61%), Gaps = 49/713 (6%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYD R+L+IDG R++L SGSIHYPRSTP++W  LI K+KEGG++VI+TYVFWN HEP
Sbjct: 60  AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 119

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQY F GR+DL +F+K +Q  GL+  LRIGP+  +EW+YGG P WLH + GI +RT N
Sbjct: 120 QPGQYDFNGRYDLAKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 179

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI++LMK E L+ASQGGPIIL+Q+ENEY N+E A+   G  YV+WAA 
Sbjct: 180 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 239

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L T VPWVMC+Q DAPDP+INTCNG  C   FT PNSP+KP MWTEN++ ++  FG
Sbjct: 240 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 299

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+AF VA F    G++ NYYMY GGTNFGR A    + TSY   AP+DEYG
Sbjct: 300 GETYLRSAEDIAFHVALFIARNGSYVNYYMYHGGTNFGR-ASSAYIKTSYYDQAPLDEYG 358

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPKWGHL+ELH AI LC   L++   ++  LG   EA+++ +    C AFL N D  
Sbjct: 359 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 418

Query: 361 SDANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
           +++ V F N ++  LP  S+SILPDCKNV+FNTAK+ +  N              +  +S
Sbjct: 419 NNSTVLFQNVSIELLPK-SISILPDCKNVIFNTAKINTGYN------------ERIATSS 465

Query: 420 SAFS----WYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEV 472
            +F     W E K  I    + S     + E +N TKD SDYLWYT      P     E 
Sbjct: 466 QSFDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT--FRFQPNSSCTEP 523

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L+IESL HA   FVN   V   +G+HD   F     I LN  +N + ILS+MVG  + G
Sbjct: 524 LLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSG 583

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
           A+ +   AGL  V +   + G  D ++  W YQVG+ GE + + K    ++  W++ + +
Sbjct: 584 AYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRK-TEI 642

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
             N+ L WYK  F  P G  P+ALNL++MGKG+AWVNGQSIGRYW               
Sbjct: 643 STNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-------------- 688

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
              S+  SK     G P+QTLYH+PR ++   ENLLV+ EE  GDP  ISL T
Sbjct: 689 ---SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 733


>gi|227053532|gb|ACP18874.1| beta-galactosidase pBG(b) [Carica papaya]
          Length = 514

 Score =  640 bits (1650), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 306/497 (61%), Positives = 370/497 (74%), Gaps = 17/497 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+V+YDH+A+ I+GKRR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 19  ASVSYDHKAITINGKRRILLSGSIHYPRSTPEMWPDLIQKAKEGGLDVIQTYVFWNGHEP 78

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYF G +DLVRF+K V++AGL++HLRIGPY CAEWN+GGFPVWL +IPGI FRT N
Sbjct: 79  SPGKYYFGGNYDLVRFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYIPGIAFRTNN 138

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+RF  KI+D+MK E LF SQGGPIIL+Q+ENEYG +E+  G  G  Y +WAA 
Sbjct: 139 GPFKAYMQRFTKKIVDMMKAEGLFESQGGPIILSQIENEYGPMEYELGAAGRAYSQWAAQ 198

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV L T VPWVMC+Q+DAPDPIIN+CNGFYCD F+PN   KP MWTE ++GWF  FG A
Sbjct: 199 MAVGLGTGVPWVMCKQDDAPDPIINSCNGFYCDYFSPNKAYKPKMWTEAWTGWFTEFGGA 258

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           VP+RPVEDLAF+VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG +
Sbjct: 259 VPYRPVEDLAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLV 318

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL++LH+AIKLCE  L+S DP+   LG   EAH++      CAAFLANY+  S 
Sbjct: 319 RQPKWGHLKDLHRAIKLCEPALVSGDPSVMPLGRFQEAHVFKSKYGHCAAFLANYNPRSF 378

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F    Y LP WS+SILPDCKN V+NTA+V +Q        A+ K V   +    AF
Sbjct: 379 AKVAFGNMHYNLPPWSISILPDCKNTVYNTARVGAQS-------ARMKMVP--VPIHGAF 429

Query: 423 SWY---EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW    EE    +G RSF    L EQINTT+D SDYLWY+  + + P +     GK   L
Sbjct: 430 SWQAYNEEAPSSNGERSFTTVGLVEQINTTRDVSDYLWYSTDVKIDPDEGFLKTGKYPTL 489

Query: 475 NIESLGHAALVFVNKKL 491
            + S GHA  VFVN +L
Sbjct: 490 TVLSAGHALHVFVNDQL 506


>gi|357464801|ref|XP_003602682.1| Beta-galactosidase [Medicago truncatula]
 gi|355491730|gb|AES72933.1| Beta-galactosidase [Medicago truncatula]
          Length = 719

 Score =  640 bits (1650), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 333/706 (47%), Positives = 445/706 (63%), Gaps = 37/706 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+I+G+R +L SGSIHYPRSTP++WP LI K+K+GGL+VI+TYVFWN HEP  
Sbjct: 27  VTYDGRSLIINGQRNILFSGSIHYPRSTPQMWPGLIAKAKQGGLDVIQTYVFWNLHEPQP 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y F GR DLV F+K +   GL++ LRIGP+  +EWNYGGFP WLH +PGI +RT N P
Sbjct: 87  GKYDFSGRNDLVGFIKEIHAQGLYVSLRIGPFIESEWNYGGFPFWLHDVPGIVYRTDNEP 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M+ F  KI+++MK+E L+ASQGGPIIL+Q+ENEYGN++ A+G  G  YV+WAA  A
Sbjct: 147 FKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYGNIQKAFGTAGSQYVEWAAKMA 206

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V LNT VPWVMC+Q DAPDP+INTCNG  C + FT PNSP+KP MWTEN++ ++  +G  
Sbjct: 207 VGLNTGVPWVMCKQPDAPDPVINTCNGMRCGETFTGPNSPNKPAMWTENWTSFYQVYGGV 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF V  F    G+F NYYMY GGTNFGRT+   ++   YD  AP+DEYG  
Sbjct: 267 PYIRSAEDIAFHVTLFVARNGSFVNYYMYHGGTNFGRTSSAYMITGYYD-QAPLDEYGLF 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPKWGHL+ELH AIK C   L+     +  LG   E +++ + +  CAAFL N D  + 
Sbjct: 326 RQPKWGHLKELHAAIKSCSTTLLQGVQRNFSLGELQEGYVFEEENGKCAAFLINNDKGNT 385

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V FN + Y L   S+SILPDC+NV FNTA + +  N        ++ +      SS  
Sbjct: 386 VTVQFNNSSYKLLPKSISILPDCQNVAFNTAHLNTTSN--------RRIITSRQNFSSVD 437

Query: 423 SW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            W  +++ +    + S     L EQ+NTTKD SDYLWYT  +         + L+++S  
Sbjct: 438 DWKQFQDVIPNFDDTSLRSDSLLEQMNTTKDKSDYLWYTLRLENNLSCNDPI-LHVQSSA 496

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           H A  FVN   +   +GNHD  +F +   I LNE  N + ILS MVGL + GA+ +   A
Sbjct: 497 HVAYAFVNNTYIGGEHGNHDVKSFTLELPITLNERTNNISILSGMVGLPDSGAFLEKRFA 556

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK-SLI 599
           GL +V L   +    +L++  W YQVG+ GE + +     +    W Q   + +++ +L 
Sbjct: 557 GLNNVELQCSEQESLNLNNSTWGYQVGLLGEQLKVYTEQNSTDIKWTQLGNITIDEVTLT 616

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKTTF  P+G  P+AL+L+SM KG+AWVNGQSIGRYW  +L                  
Sbjct: 617 WYKTTFDTPKGDDPIALDLSSMAKGEAWVNGQSIGRYWILFLDSK--------------- 661

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                  G P+Q+LYH+PR+++   EN LV+ +E GG+P  ISL T
Sbjct: 662 -------GNPSQSLYHVPRSFLKDSENSLVLLDEGGGNPLDISLNT 700


>gi|449454199|ref|XP_004144843.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
 gi|449506996|ref|XP_004162905.1| PREDICTED: beta-galactosidase 13-like [Cucumis sativus]
          Length = 766

 Score =  639 bits (1649), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 349/803 (43%), Positives = 474/803 (59%), Gaps = 55/803 (6%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +W +++ K++ GGL VI+TYVFWN HEP+ GQ+ FEG +DLV+F+K + E  +++ LR+G
Sbjct: 1   MWSDILDKARRGGLNVIQTYVFWNIHEPVEGQFNFEGNYDLVKFIKLIGEKQMYVTLRVG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           P+  AEWN+GG P WL   P I FR+ N+ FK  MK+++A I+D+MK+  LFASQGGPI+
Sbjct: 61  PFIQAEWNHGGLPYWLREKPNIIFRSYNSQFKHYMKKYVAMIVDMMKENKLFASQGGPIV 120

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
           LAQ+ENEY +V+ AY   G  YV+WAA+ AV L   VPW+MC+Q+DAPDP+INTCNG +C
Sbjct: 121 LAQIENEYNHVQLAYDELGVQYVQWAANMAVGLGVGVPWIMCKQKDAPDPVINTCNGRHC 180

Query: 215 -DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYF 272
            D FT PN P KP +WTEN++  +  FG     R  ED+AF+VARFF   G+  NYYMY 
Sbjct: 181 GDTFTGPNKPYKPALWTENWTAQYRVFGDPPSQRAAEDIAFSVARFFSKNGSLVNYYMYH 240

Query: 273 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ 332
           GGTNFGRT+      T Y  +AP+DE+G  R+PKWGHLR++HKA+ LC++ L+   P  Q
Sbjct: 241 GGTNFGRTS-AVFTTTRYYDEAPLDEFGLQREPKWGHLRDVHKALNLCKKPLLWGTPGIQ 299

Query: 333 KLGAKLEAHIYHK-SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
            +G  LEA  Y K  +N CAAFLAN D+ S   + F G  + LP  S+SILPDCK VVFN
Sbjct: 300 VIGKGLEARFYEKPGTNICAAFLANNDTKSAQTINFRGREFLLPPRSISILPDCKTVVFN 359

Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSW-YEEKVGISGNRSFVRPDLAEQINTTK 450
           T  ++SQ N  +  F   KN N+L    S  S    E+V ++           E  +  K
Sbjct: 360 TETIVSQHNARN--FIPSKNANKLKWKMSPESIPTVEQVPVNNKIPL------ELYSLLK 411

Query: 451 DTSDYLWYTASIHVMPGQGKEV-----FLNIESLGHAALVFVNKKLVAFGYGNHDFANFL 505
           DT+DY WYT SI +      +       L I SLGHA LVFVN + +   +G+H+  NF+
Sbjct: 412 DTTDYGWYTTSIELDKEDVSKRPDILPVLRIASLGHAMLVFVNGEYIGTAHGSHEEKNFV 471

Query: 506 INKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQ 565
               +    G+N + +L ++VGL + GA+ +   AG  S+ ++ L  G  D+S   W +Q
Sbjct: 472 FQGSVPFKAGVNNIALLGILVGLPDSGAYMEHRFAGPRSITILGLNTGTLDISKNGWGHQ 531

Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKG 624
           V ++GE +   K+     S     S +   KS L WYKT F APEG  P+A+ +  MGKG
Sbjct: 532 VALQGEKV---KVFTQGGSHRVDWSEIKEEKSALTWYKTYFDAPEGNDPVAIRMNGMGKG 588

Query: 625 QAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPG 684
           Q WVNG+SIGRYW +YL+P    T                      Q+ YHIPR+++ P 
Sbjct: 589 QIWVNGKSIGRYWMSYLSPLKLST----------------------QSEYHIPRSFIKPS 626

Query: 685 ENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW----KPNLGVVSSSPQ-V 739
           ENLLVI EE    P K+ +L      ICSF+++  PP V SW    K    VV       
Sbjct: 627 ENLLVILEEENVTPEKVEILLVNRDTICSFITQYHPPNVKSWERKDKQFRAVVDDVKTGA 686

Query: 740 RLACERGWHIAAINFASYGIPEGNCGSFRPGACH--MDVLPIVQKACVGQIECSIPVSSA 797
            L C     I  I FAS+G P G CG+F  G CH   D   +V++ C+G+  CS+P+ + 
Sbjct: 687 HLRCPHDKKITNIEFASFGDPSGVCGNFEHGKCHSSSDTKKLVEQHCLGKENCSVPMDA- 745

Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
                   C    K LA++A CS
Sbjct: 746 -FDNFKNECDS--KTLAIQAKCS 765


>gi|10862896|emb|CAC13966.1| putative beta-galactosidase [Nicotiana tabacum]
          Length = 715

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 330/710 (46%), Positives = 442/710 (62%), Gaps = 40/710 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  VTYD R+++++G+R +L SGSIHYPR  PE+WP++IRK+KEGGL +I+TYVFWN HE
Sbjct: 25  TKGVTYDGRSMIVNGERELLFSGSIHYPRMPPEMWPDIIRKAKEGGLNLIQTYVFWNIHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P++GQ+ FEG +D+V+F+KT+ E GL++ LRIGPY  AEWN GGFP WL  +P I FR+ 
Sbjct: 85  PVQGQFNFEGNYDVVKFIKTIGEQGLYVTLRIGPYIEAEWNQGGFPYWLREVPNITFRSY 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PF   MK++   +IDLMK+E LFA QGGPII+AQ+ENEY NV+ AY   G+ YV+WAA
Sbjct: 145 NEPFIHHMKKYSEMVIDLMKKEKLFAPQGGPIIMAQIENEYNNVQLAYRDNGKKYVEWAA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
           + A  L   VPW+MC+Q+DAP  +INTCNG +C D FT PN P+KP +WTEN++  + +F
Sbjct: 205 NMATGLYNGVPWIMCKQKDAPAQVINTCNGRHCADTFTGPNGPNKPSLWTENWTAQYRTF 264

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  ED+AF+VARFF   GT  NYYMY+GGTN+GRT G   V T Y  +AP+DE+
Sbjct: 265 GDPPSQRAAEDIAFSVARFFAKNGTLTNYYMYYGGTNYGRT-GSSFVTTRYYDEAPLDEF 323

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G  R+PKW HLR+LH+A++L    L+   P+ QK+   LE  +Y K   DCAAFL N  +
Sbjct: 324 GLYREPKWSHLRDLHRALRLSRRALLWGTPSVQKINQHLEITVYEKPGTDCAAFLTNNHT 383

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH-PFAQQKNVNELLLA 418
           +  A + F G  Y+LP  SVSILPDCK +  NT  ++SQ N+ +  P  + KN+      
Sbjct: 384 TLPATIKFRGREYYLPEKSVSILPDCKLLSTNTQTIVSQHNSRNFLPSEKAKNLK----- 438

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI----HVMPGQGKEV-F 473
              +  Y+EKV    + S    +  E  + TKDTSDY WY+ SI    H +P +   +  
Sbjct: 439 ---WEMYQEKVPTISDLSLKNREPLELYSLTKDTSDYAWYSTSINFDRHDLPMRPDILPV 495

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S+GHA   FVN + V FG+GN+   +F+  K + L  G NT+ IL+  VG  N GA
Sbjct: 496 LQIASMGHALSAFVNGEFVGFGHGNNIEKSFVFQKPVILKPGTNTISILAETVGFPNSGA 555

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           + +   AG   + +  L  G  D++   W ++VGV GE   L     A    W   +  P
Sbjct: 556 YMEKRFAGPRGITVQGLMAGTLDITQNNWGHEVGVFGEKEQLFTEEGAKKVKWTPVNG-P 614

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
              ++ WYKT F APEG  P+AL +  M KG  WVNG S+GRYWS++L+P          
Sbjct: 615 TKGAVTWYKTYFDAPEGNNPVALKMDKMQKGMMWVNGNSLGRYWSSFLSP---------- 664

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
                        GQP Q  YHIPR ++ P  NLLVI EE GG P  I +
Sbjct: 665 ------------LGQPTQFEYHIPRAFLKPTNNLLVIFEETGGHPETIEV 702


>gi|320170654|gb|EFW47553.1| beta-D-galactosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 830

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 352/847 (41%), Positives = 475/847 (56%), Gaps = 68/847 (8%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYD RAL+IDG+RR+L SGSIHYPRSTP++WPEL  ++K  G++VI+TY+FWN + 
Sbjct: 24  AMNVTYDSRALLIDGRRRLLVSGSIHYPRSTPDMWPELFARAKANGIDVIQTYLFWNTNV 83

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G++    RFD VRFV+  QEAGL+++ RIGP+ CAEW YGG P WL  IP I FR  
Sbjct: 84  PTPGEFVMSDRFDYVRFVQLAQEAGLYVNFRIGPFVCAEWTYGGLPAWLRQIPDIMFRDY 143

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           + P+ +    ++ K + ++K   L A QGGPIIL Q+ENEYG  E  Y  GG  YV+W  
Sbjct: 144 DQPWLQVAGEYITKTVQILKDNRLLAGQGGPIILLQIENEYGGTESRYA-GGPQYVEWCG 202

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             A NL  +  W+MC Q DAP  II TCN FYCD F P+ P +P MWTEN+ GWF  +G 
Sbjct: 203 QLAANLTDAAQWIMCSQPDAPANIIATCNAFYCDDFVPH-PGQPSMWTENWPGWFQKWGD 261

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
             P RP +D+A+AV R++  GG++ NYYMY GGTNF RTAGGP + T+YDYDA +DEYG 
Sbjct: 262 PTPHRPAQDVAYAVTRYYIKGGSYMNYYMYHGGTNFERTAGGPFITTNYDYDASLDEYGM 321

Query: 302 IRQPKWGHLRELHKAIKLCEEYLIS-SDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
             +PK+ HL  +H  +   E  +++   P    LG  LEAHIY+ SS  C AFL+N ++ 
Sbjct: 322 PNEPKYSHLGSMHAVLHDNEAIMMAVPAPKPISLGTNLEAHIYN-SSVGCVAFLSNNNNK 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA------------- 407
           +D  V FNG  Y LPAWSVS+L  C   ++NTA V        H  A             
Sbjct: 381 TDVEVQFNGRTYELPAWSVSVLHGCVTAIYNTA-VCRAHQRAPHDAACCARESRRVCDRL 439

Query: 408 -----------QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                      Q   +  L L        +       N++ +     EQI+ T D +DYL
Sbjct: 440 PPLRPKARAPCQSGRIRHLCLVVLTSIGPQAPATKYWNKTPL-----EQIDQTLDHTDYL 494

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
           WY+ S   +        L++  +   A V+VN K V   +  +      ++  + L  G 
Sbjct: 495 WYSTSY--VSSSATYAQLSLPQITDVAYVYVNGKFVTVSWSGN------VSATVSLVAGP 546

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
           NT+DILS+ +GL N G        GL   + +    G  +L+   W +Q GV GE   + 
Sbjct: 547 NTIDILSLTMGLDNGGDILSEYNCGLLGGVYL----GSVNLTENGWWHQTGVVGERNAIF 602

Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAP-EGKGPLALNLASMGKGQAWVNGQSIGR 635
                    W   + L  N  L WYK++F  P + + PLAL+L  MGKG  WVNG ++GR
Sbjct: 603 LPENLKKVAWTTPAVL--NTGLTWYKSSFDVPRDSQAPLALDLTGMGKGYVWVNGHNLGR 660

Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
           YW   LA +  C   CDYRG+YDA  C++ C  P+QT YH+PR W+    N+LV+ EE+G
Sbjct: 661 YWPTILATNWPC-DVCDYRGTYDAPHCKQGCNMPSQTHYHVPREWLQAENNVLVLLEEMG 719

Query: 696 GDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFA 755
           G+PSKI+L+ +     C  V E  P        +L VV       L C     IA ++FA
Sbjct: 720 GNPSKIALVEREEYVSCGVVGEDYP------ADDLAVV-------LGCGTHQTIAGVDFA 766

Query: 756 SYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLL-KAL 813
           SYG P G+C S++ G+CH  +   IV   C G+  CSIPVS+A  G     CP +  K L
Sbjct: 767 SYGTPMGSCRSYQQGSCHASNSTEIVLSLCHGKQACSIPVSAAMFG---NPCPDVTNKRL 823

Query: 814 AVEAHCS 820
           AV+  C+
Sbjct: 824 AVQVACA 830


>gi|125536446|gb|EAY82934.1| hypothetical protein OsI_38151 [Oryza sativa Indica Group]
          Length = 705

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 329/634 (51%), Positives = 421/634 (66%), Gaps = 43/634 (6%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRA++I GKRR+L S  +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP 
Sbjct: 63  NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKFKEGGADVIETYVFWNGHEPA 122

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQYYFE RFDLV+F K V   GLFL LRIGPYACAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 123 KGQYYFEERFDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPGIEFRTDNE 182

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK EM+ F+ KI+ LMK+E L++ QGGPIIL Q+ENEYGN++  YG  G+ Y++WAA  
Sbjct: 183 PFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQM 242

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+T +PWVMC+Q DAP+ II+TCN FYCDGF PNS +KP +WTE++ GW+  +G A+
Sbjct: 243 AIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGAL 302

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL  TSYDYDAPIDEYG +R
Sbjct: 303 PHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILR 362

Query: 304 QPKWGHLRELHKAIKLCEEYLIS--SDPTHQKLGAKLEAHIYHK-----------SSNDC 350
           QPKWGHL++LH AIKLCE  LI+    P + KLG+  EAH+Y             ++  C
Sbjct: 363 QPKWGHLKDLHTAIKLCEPALIAVVGSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQIC 422

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN----NGDHPF 406
           +AFLAN D    A+V   G  Y LP WSVSILPDC+NV FNTA++ +Q +        P 
Sbjct: 423 SAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVFTVESGSPS 482

Query: 407 AQQKNVNELLLASS-----AFSWY--EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYT 459
              ++   +L  +S     + +W+  +E +G  G  +F    + E +N TKD SDYLWYT
Sbjct: 483 RSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYT 542

Query: 460 ASIHVMPG-------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL 512
             +++          +G    L I+ +   A VFVN KL     G+       + + I+L
Sbjct: 543 TRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----VSLKQPIQL 598

Query: 513 NEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGE 571
            EG+N L +LS +VGLQNYGA+ +  GAG    V L  L +G  DL++  W YQVG++GE
Sbjct: 599 VEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGE 658

Query: 572 YIGL---DKISLANSSFWKQGSTLPVNKSLIWYK 602
           +  +   +K   A  S  ++ S  P      WYK
Sbjct: 659 FSMIYAPEKQGCAGWSRMQKDSVQP----FTWYK 688


>gi|18418558|ref|NP_567973.1| beta-galactosidase 11 [Arabidopsis thaliana]
 gi|75202765|sp|Q9SCV1.1|BGL11_ARATH RecName: Full=Beta-galactosidase 11; Short=Lactase 11; Flags:
           Precursor
 gi|6686894|emb|CAB64747.1| putative beta-galactosidase [Arabidopsis thaliana]
 gi|332661046|gb|AEE86446.1| beta-galactosidase 11 [Arabidopsis thaliana]
          Length = 845

 Score =  638 bits (1646), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 343/837 (40%), Positives = 492/837 (58%), Gaps = 54/837 (6%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  VTYD  +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL  I+TYVFWN HE
Sbjct: 38  NKEVTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHE 97

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +G++ F GR DLV+F+K +Q+ G+++ LR+GP+  AEW +GG P WL  +PGI FRT 
Sbjct: 98  PQQGKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTD 157

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  FKE  +R++  I+D MK+E LFASQGGPIIL Q+ENEY  V+ AY   G  Y+KWA+
Sbjct: 158 NKQFKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWAS 217

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
           +   ++   +PWVMC+Q DAPDP+IN CNG +C D F  PN  +KP +WTEN++  F  F
Sbjct: 218 NLVDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVF 277

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R VED+A++VARFF   GT  NYYMY GGTNFGRT+   +    YD DAP+DEY
Sbjct: 278 GDPPTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEY 336

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYD 358
           G  ++PK+GHL+ LH A+ LC++ L+   P  +K G   E   Y +  +  CAAFLAN +
Sbjct: 337 GLEKEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNN 396

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           + +   + F G  Y +   S+SILPDCK VV+NTA+++SQ  + +  F + K  N+    
Sbjct: 397 TEAAETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRN--FMKSKKANKKF-- 452

Query: 419 SSAFSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-----HVMPGQGKE 471
              F  + E +   + GN S++  +L      TKD +DY WYT S      H+   +G +
Sbjct: 453 --DFKVFTETLPSKLEGN-SYIPVEL---YGLTKDKTDYGWYTTSFKVHKNHLPTKKGVK 506

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNY 531
            F+ I SLGHA   ++N + +  G+G+H+  +F+  K++ L  G N L +L ++ G  + 
Sbjct: 507 TFVRIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDS 566

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           G++ +    G   + ++ L +G  DL+ S +W  ++G+EGE +G+          WK+ +
Sbjct: 567 GSYMEHRYTGPRGISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFT 626

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
                  L WY+T F APE      + +  MGKG  WVNG+ +GRYW ++L+P       
Sbjct: 627 GKA--PGLTWYQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSP------- 677

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQ 709
                           GQP Q  YHIPR+++ P +NLLVI  EE    P  +        
Sbjct: 678 ---------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMDFAIVNRD 722

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNC 764
            +CS+V E   P V  W      V +     S    L C     IAA+ FAS+G P G C
Sbjct: 723 TVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGTKKIAAVEFASFGNPIGVC 782

Query: 765 GSFRPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
           G+F  G C+  V   +++K C+G+ EC IPV+ S +      +C  ++K LAV+  C
Sbjct: 783 GNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVVKMLAVQVKC 839


>gi|297836382|ref|XP_002886073.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
 gi|297331913|gb|EFH62332.1| beta-galactosidase 13 [Arabidopsis lyrata subsp. lyrata]
          Length = 848

 Score =  637 bits (1644), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 348/834 (41%), Positives = 493/834 (59%), Gaps = 54/834 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD  +L+I+G R +L SGSIHYPRSTPE+WP +I+++K+GGL  I+TYVFWN HEP +
Sbjct: 44  VTYDGTSLIINGNRELLYSGSIHYPRSTPEMWPNIIKRAKQGGLNTIQTYVFWNVHEPEQ 103

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ F GR DLV+F+K +++ G+++ LR+GP+  AEW +GG P WL  +PGI FRT N P
Sbjct: 104 GKFNFSGRADLVKFIKLIEKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTDNTP 163

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FKE  +R++  I+D MK+E LFASQGGPIIL Q+ENEY  V+ AY   G  Y+KWA+   
Sbjct: 164 FKEHTERYVKVILDKMKEEKLFASQGGPIILGQIENEYSAVQRAYKEDGLNYIKWASKLV 223

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
            +++  +PWVMC+Q DAPDP+IN CNG +C D F  PN  +KP +WTEN++  F  +G  
Sbjct: 224 HSMDLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNKENKPSLWTENWTTQFRVYGDP 283

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R VED+A++VARFF   GT  NYYMY GGTNFGRT+   +    YD DAP+DEYG  
Sbjct: 284 PAQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEYGLE 342

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
           R+PK+GHL+ LH A+ LC++ L+   P  +K   + E   Y +     CAAFLAN ++ S
Sbjct: 343 REPKYGHLKHLHNALNLCKKALLWGQPRVEKPSNETEIRYYEQPGTKVCAAFLANNNTES 402

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              + F G  Y +P  S+SILPDCK VV+NT ++IS   + +  F + K  N+    +  
Sbjct: 403 AEKIKFKGKEYIIPHRSISILPDCKTVVYNTGEIISHHTSRN--FMKSKKANK----NFD 456

Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQGKEVFL 474
           F  + E V   I G+ S++  +L      TKD +DY WYT S  +        +G +  L
Sbjct: 457 FKVFTETVPSKIKGD-SYIPVEL---YGLTKDETDYGWYTTSFKIDDNDLSKKKGSKPTL 512

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I SLGHA  V++N + +  G+G+H+  +F+  K I L EG N L +L ++ G  + G++
Sbjct: 513 RIASLGHALHVWLNGEYLGNGHGSHEEKSFVFQKPISLKEGENHLTMLGVLTGFPDSGSY 572

Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
            +    G  SV ++ L +G  DL+   +W  +VG+EGE +G+          W++ S   
Sbjct: 573 MEHRYTGPRSVSILGLGSGTLDLTEENKWGNKVGMEGEKLGIHAEEGLKKVKWQKFSG-- 630

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
               L WY+T F APE +   A+ +  MGKG  WVNG+ +GRYW ++L+P          
Sbjct: 631 KEPGLTWYQTYFDAPESQSAAAIRMNGMGKGLIWVNGEGVGRYWMSFLSP---------- 680

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
                        GQP Q  YHIPR+++ P +NLLVI  EE    P  I  +      +C
Sbjct: 681 ------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELIDFVIINRDTVC 728

Query: 713 SFVSEADPPPVDSW-KPNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
           S + E   P V  W + N  V + +  V L     C     I+ + FAS+G P G CG+F
Sbjct: 729 SHIGENYTPSVRHWTRKNDQVQAITDDVHLTASLKCSGTKKISEVEFASFGNPNGTCGNF 788

Query: 768 RPGACHMDV-LPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHC 819
             G C+  V   +V+K C+G+ EC IPV+ S +      +CP + K LAV+  C
Sbjct: 789 TLGTCNAPVSKKVVEKYCLGKAECVIPVNKSTFQQDKKDSCPKVEKKLAVQVKC 842


>gi|152013366|sp|Q9SCU8.2|BGL14_ARATH RecName: Full=Beta-galactosidase 14; Short=Lactase 14; Flags:
           Precursor
          Length = 887

 Score =  637 bits (1644), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 347/833 (41%), Positives = 487/833 (58%), Gaps = 54/833 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD  +L+I+GKR +L SGS+HYPRSTP +WP +I K++ GGL  I+TYVFWN HEP +
Sbjct: 41  VTYDGTSLIINGKRELLFSGSVHYPRSTPHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y F+GRFDLV+F+K + E GL++ LR+GP+  AEWN+GG P WL  +P + FRT N P
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FKE  +R++ KI+ +MK+E LFASQGGPIIL Q+ENEY  V+ AY   GE Y+KWAA+  
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
            ++N  +PWVMC+Q DAP  +IN CNG +C D F  PN   KP +WTEN++  F  FG  
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R VED+AF+VAR+F   G+  NYYMY GGTNFGRT+    V T Y  DAP+DE+G  
Sbjct: 281 PTQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLE 339

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           + PK+GHL+ +H+A++LC++ L       Q LG   E   Y +  +  CAAFL+N ++  
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              + F G  Y LP+ S+SILPDCK VV+NTA++++Q +  D  F + +  ++ L     
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRD--FVKSEKTSKGL----K 453

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQ-GKEVFLNI 476
           F  + E +    +   + P   E    TKD +DY WYT S+ +     P Q G +  L +
Sbjct: 454 FEMFSENIPSLLDGDSLIP--GELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRV 511

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            SLGHA +V+VN +     +G H+  +F   K +    G N + IL ++ GL + G++ +
Sbjct: 512 ASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYME 571

Query: 537 VAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG  ++ +I LK+G RDL+ + EW +  G+EGE   +     +    W++       
Sbjct: 572 HRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGK---R 628

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           K L WYKT F  PEG   +A+ + +MGKG  WVNG  +GRYW ++L+P            
Sbjct: 629 KPLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP------------ 676

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
                      G+P QT YHIPR+++     +N+LVI  EE G     I  +      IC
Sbjct: 677 ----------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTIC 726

Query: 713 SFVSEADPPPVDSWK-PNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
           S V E  P  V SWK     +VS S  +RL     C     +  + FAS+G P G CG+F
Sbjct: 727 SNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNF 786

Query: 768 RPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G C       +V+K C+G+  CSI V+    G     CP ++K LAV+  C
Sbjct: 787 TMGKCSASKSKEVVEKECLGRNYCSIVVARETFG--DKGCPEIVKTLAVQVKC 837


>gi|222424809|dbj|BAH20357.1| AT5G56870 [Arabidopsis thaliana]
          Length = 620

 Score =  637 bits (1644), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 319/635 (50%), Positives = 425/635 (66%), Gaps = 25/635 (3%)

Query: 82  VQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMK 141
           V +AGL+++LRIGPY CAEWN+GGFPVWL F+PG+ FRT N PFK  MK+F  KI+ +MK
Sbjct: 2   VHQAGLYVNLRIGPYVCAEWNFGGFPVWLKFVPGMAFRTDNEPFKAAMKKFTEKIVWMMK 61

Query: 142 QENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDA 201
            E LF +QGGPIILAQ+ENEYG VEW  G  G+ Y KW A  A+ L+T VPW+MC+QEDA
Sbjct: 62  AEKLFQTQGGPIILAQIENEYGPVEWEIGAPGKAYTKWVAQMALGLSTGVPWIMCKQEDA 121

Query: 202 PDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFET 261
           P PII+TCNG+YC+ F PNS +KP MWTEN++GW+ +FG AVP+RPVED+A++VARF + 
Sbjct: 122 PGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTNFGGAVPYRPVEDIAYSVARFIQK 181

Query: 262 GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCE 321
           GG+  NYYMY GGTNF RTA G  +A+SYDYDAP+DEYG  R+PK+ HL+ LHKAIKL E
Sbjct: 182 GGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLPREPKYSHLKALHKAIKLSE 240

Query: 322 EYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSI 381
             L+S+D T   LGAK EA+++  S + CAAFL+N D +S A V F G  Y LP WSVSI
Sbjct: 241 PALLSADATVTSLGAKQEAYVFW-SKSSCAAFLSNKDENSAARVLFRGFPYDLPPWSVSI 299

Query: 382 LPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNR-SFV 438
           LPDCK  V+NTAKV       + P   +     ++   + FSW  + E    +    +F 
Sbjct: 300 LPDCKTEVYNTAKV-------NAPSVHR----NMVPTGTKFSWGSFNEATPTANEAGTFA 348

Query: 439 RPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVA 493
           R  L EQI+ T D SDY WY   I +  G+     G    L + S GHA  VFVN +L  
Sbjct: 349 RNGLVEQISMTWDKSDYFWYITDITIGSGETFLKTGDSPLLTVMSAGHALHVFVNGQLSG 408

Query: 494 FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKN 552
             YG  D      ++KI+L+ G+N + +LS+ VGL N G  F+    G+   V L  + +
Sbjct: 409 TAYGGLDHPKLTFSQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGVLGPVTLKGVNS 468

Query: 553 GKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKG 612
           G  D+S  +W Y++GV+GE + L   + ++   W QGS +   + L WYK+TF  P G  
Sbjct: 469 GTWDMSKWKWSYKIGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNE 528

Query: 613 PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT 672
           PLAL++ +MGKGQ W+NG++IGR+W AY A   G   +C+Y G++DA KC  +CG+ +Q 
Sbjct: 529 PLALDMNTMGKGQVWINGRNIGRHWPAYKA--QGSCGRCNYAGTFDAKKCLSNCGEASQR 586

Query: 673 LYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            YH+PR+W+   +NL+V+ EELGGDP+ ISL+ +T
Sbjct: 587 WYHVPRSWLK-SQNLIVVFEELGGDPNGISLVKRT 620


>gi|147819335|emb|CAN64508.1| hypothetical protein VITISV_004610 [Vitis vinifera]
          Length = 766

 Score =  637 bits (1643), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 360/825 (43%), Positives = 470/825 (56%), Gaps = 89/825 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +VTYD R+L+I+G+RR+L SGSIHYPRSTPE+WP LI K+KEGG++VIETY FWN HEP
Sbjct: 22  GSVTYDGRSLIINGQRRLLFSGSIHYPRSTPEMWPSLISKAKEGGIDVIETYAFWNQHEP 81

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQY F GR D+V+F K VQ  GL+  LRIGP+  +EWNYGG P WLH +PGI +R+ N
Sbjct: 82  KQGQYDFSGRLDIVKFFKEVQAQGLYACLRIGPFIESEWNYGGLPFWLHDVPGIIYRSDN 141

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI++LMK ENL+ASQGGPIIL+Q+ENEY NVE A+   G  YV+WAA 
Sbjct: 142 EPFKFYMQNFTTKIVNLMKSENLYASQGGPIILSQIENEYKNVEAAFHEKGPPYVRWAAK 201

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV+L T++ +                                  + E+  G        
Sbjct: 202 MAVDLQTAMRY----------------------------------YGEDKRG-------- 219

Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
              R  EDLAF VA F  +  G+F NYYMY GGTNFGRT+   ++   YD  AP+DEYG 
Sbjct: 220 ---RAAEDLAFQVALFIAKKNGSFINYYMYHGGTNFGRTSSSYVLTAYYD-QAPLDEYGL 275

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           IRQPKWGHL+ELH  IKLC + L+     +  LG   EA+++ + S  CAAFL N D   
Sbjct: 276 IRQPKWGHLKELHAVIKLCSDTLLXGVQYNYSLGQLQEAYLFKRPSGQCAAFLVNNDKRR 335

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           +  V F    Y L A S+SILPDCK + FNTAKV +Q N         ++V       S 
Sbjct: 336 NVTVLFQNTNYELAANSISILPDCKKIAFNTAKVSTQFNT--------RSVQTRATFGST 387

Query: 422 FSWYEEKVGIS--GNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
             W E + GI   G        L E + TTKD SDYLWYT    +      +  L ++SL
Sbjct: 388 KQWSEYREGIPSFGGTPLKASMLLEHMGTTKDASDYLWYTLRF-IHNSSNAQPVLRVDSL 446

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
            H  L FVN K +A  +G+H   +F +  K+ LN G+N + +LS+MVGL + G + +   
Sbjct: 447 AHVLLAFVNGKYIASAHGSHQNGSFSLVNKVPLNSGLNRISLLSVMVGLPDAGPYLEHKV 506

Query: 540 AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           AG+  V + D     +D S   W YQVG+ GE + +     +    W  G        L 
Sbjct: 507 AGIRRVEIQD-GGXSKDFSKHPWGYQVGLMGEKLQIYTSPGSQKVQW-YGLGSHGRGPLT 564

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKT F AP G  P+ L   SMGKG+AWVNGQSIGRYW +YL PS               
Sbjct: 565 WYKTLFDAPRGNDPVVLFFGSMGKGEAWVNGQSIGRYWVSYLTPS--------------- 609

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
                  G+P+QT Y++PR +++P  NLLV+ EE  GDP KIS+ T +  ++C  V+++ 
Sbjct: 610 -------GEPSQTWYNVPRAFLNPKGNLLVVQEEESGDPLKISIGTVSVTNVCGHVTDSH 662

Query: 720 PPPVDSWKP----NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
           PPP+ SW      N       P+V+L C    +I+ I FAS+G P G C S+  G+CH  
Sbjct: 663 PPPIISWTTSDDGNESHHGKIPKVQLRCPPSSNISKITFASFGTPVGGCESYAIGSCHSP 722

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           + L + +KAC+G+  CSIP S    G     CPG  KAL V A C
Sbjct: 723 NSLAVAEKACLGKNXCSIPHSLKSFG--DDPCPGTPKALLVAAQC 765


>gi|413926110|gb|AFW66042.1| hypothetical protein ZEAMMB73_706783 [Zea mays]
          Length = 700

 Score =  637 bits (1642), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 323/658 (49%), Positives = 417/658 (63%), Gaps = 74/658 (11%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YDHR+LVI+G+RR+L SGSIHYPRS PE+WP LI+K+K+GGL+V++TYVFWN HEP +
Sbjct: 40  VSYDHRSLVINGRRRILISGSIHYPRSAPEMWPGLIQKAKDGGLDVVQTYVFWNGHEPAQ 99

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQYYF  R+DLVRFVK V++AGL++HLR+GPY CAEWN+GGFPVWL ++PGI+FRT N P
Sbjct: 100 GQYYFADRYDLVRFVKLVRQAGLYVHLRVGPYVCAEWNFGGFPVWLKYVPGIRFRTDNGP 159

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F+ KI+ +MK E LF  QGGPII+AQVENE+G +E   G GG+ Y  WAA  A
Sbjct: 160 FKAAMQKFVEKIVSMMKSEGLFEWQGGPIIMAQVENEFGPMESVVGSGGKPYAHWAAQMA 219

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           V  N  VPWVMC+Q+DAPDP+INTCNGFYCD FTPN+  KP MWTE ++GWF  FG A P
Sbjct: 220 VGTNAGVPWVMCKQDDAPDPVINTCNGFYCDYFTPNNKHKPTMWTEAWTGWFTKFGGAAP 279

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY----- 299
            RPVEDLAFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+     
Sbjct: 280 HRPVEDLAFAVARFVQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGMQWL 339

Query: 300 --------------------------------------------GFIRQPKWGHLRELHK 315
                                                       G +RQPKWGHLR +H+
Sbjct: 340 LPSLINLNSHRLPRDICRKSSQCGFYLSVVHTWNFWGGGWVYIAGLLRQPKWGHLRNMHR 399

Query: 316 AIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
           AIK  E  L+S DPT + +G   +A+++   +  CAAFL+NY   S   + F+G  Y LP
Sbjct: 400 AIKQAEPALVSGDPTIRSIGNYEKAYVFKSKNGACAAFLSNYHVKSAVRIRFDGRHYDLP 459

Query: 376 AWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISG 433
           AWS+SILPDCK  VFNTA V            +   + ++      F+W  Y E      
Sbjct: 460 AWSISILPDCKTAVFNTATV-----------KEPTLLPKMSPVMHRFAWQSYSEDTNSLD 508

Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVN 488
           + +F R  L EQ++ T D SDYLWYT  +++   +     G+   L++ S GH+  VFVN
Sbjct: 509 DSAFARDGLIEQLSLTWDKSDYLWYTTHVNIGSNERFLKSGQWPQLSVYSAGHSMQVFVN 568

Query: 489 KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VIL 547
            +     YG +D      +  +++ +G N + ILS  VGL N G  F++   G+   V L
Sbjct: 569 GRSYGSVYGGYDNPKLTFSGYVKMWQGSNKISILSSAVGLPNNGDHFELWNVGVLGPVTL 628

Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVNKSLIWYKT 603
             L  GKRDLS   WIYQVG++GE +GL  ++ +++  W    G T P    L W+K 
Sbjct: 629 SGLNEGKRDLSHQRWIYQVGLKGESLGLHTVTGSSAVEWAGPGGGTQP----LTWHKV 682


>gi|356527530|ref|XP_003532362.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
          Length = 673

 Score =  637 bits (1642), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 337/715 (47%), Positives = 450/715 (62%), Gaps = 50/715 (6%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYD R+L+IDG+R++L SGSIHYPRSTP++WP LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 2   AEVTYDGRSLIIDGQRKILFSGSIHYPRSTPQMWPALISKAKEGGLDVIQTYVFWNLHEP 61

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQY F GR+DLVRF+K +Q  GL++ LRIGPY  +EW YGGFP WLH +P I +RT N
Sbjct: 62  QFGQYDFSGRYDLVRFIKEIQVQGLYVCLRIGPYIESEWTYGGFPFWLHDVPAIVYRTDN 121

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+ +M+ E L+ASQGGPIIL+Q+ENEY NVE A+G  G  YV+WAA+
Sbjct: 122 QPFKLYMQNFTTKIVSMMQSEGLYASQGGPIILSQIENEYQNVEKAFGEDGSRYVQWAAE 181

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L T VPW+MC+Q DAPDP+INTCNG  C + FT PNSP+KP  WTEN++ ++  +G
Sbjct: 182 MAVGLKTGVPWLMCKQTDAPDPLINTCNGMRCGETFTGPNSPNKPAFWTENWTSFYQVYG 241

Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
                R  ED+AF V  F     G++ NYYMY GGTN GRT+   ++ + YD  AP+DEY
Sbjct: 242 GEPYIRSAEDIAFHVTLFIARKNGSYVNYYMYHGGTNLGRTSSSYVITSYYD-QAPLDEY 300

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +RQPKWGHL+ELH AIK C   L+    ++  LG   E +++ +    C AFL N D 
Sbjct: 301 GLLRQPKWGHLKELHAAIKSCSTTLLEGKQSNFSLGQLQEGYVFEEEGK-CVAFLVNNDH 359

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
                V F    Y LP+ S+SILPDC+NV FNTA V ++ N        ++  + +   S
Sbjct: 360 VKMFTVQFRNRSYELPSKSISILPDCQNVTFNTATVNTKSN--------RRMTSTIQTFS 411

Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
           SA  W  +++ +      + +   L EQ+N TKD SDYLWYT S         E  L  +
Sbjct: 412 SADKWEQFQDVIPNFDQTTLISNSLLEQMNVTKDKSDYLWYTLS---------ESKLTAQ 462

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  H    F +   +   +G+HD  +F     ++LNEG N + ILS+MVGL + GA+ + 
Sbjct: 463 SAAHVTHAFADGTYLGGAHGSHDVKSFTTQVPLKLNEGTNNISILSVMVGLPDAGAFLER 522

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ-GSTLPVNK 596
             AGL + + I       DL++  W YQVG+ GE + + +    +S  W   G+T   N+
Sbjct: 523 RFAGL-TAVEIQCSEESYDLTNSTWGYQVGLLGEQLEIYEEKSNSSIQWSPLGNT--CNQ 579

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
           +L WYKT F +P+G  P+ALNL SMGKGQAWVNG+SIGRYW ++                
Sbjct: 580 TLTWYKTAFDSPKGDEPVALNLESMGKGQAWVNGESIGRYWISF---------------- 623

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHI 711
           +D+       GQP+QTLYH+PR+++    N LV+ EE GG+P  ISL T +  +I
Sbjct: 624 HDSK------GQPSQTLYHVPRSFLKDIGNSLVLFEEEGGNPLHISLDTISSTNI 672


>gi|356507439|ref|XP_003522474.1| PREDICTED: beta-galactosidase 6-like [Glycine max]
          Length = 717

 Score =  635 bits (1638), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 329/706 (46%), Positives = 445/706 (63%), Gaps = 33/706 (4%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  VTYD R+L+IDG+R++L SG IHYPRSTP++WP+LI K+K+GGL+VI+TYVFWN HE
Sbjct: 24  AEEVTYDGRSLIIDGQRKILFSGLIHYPRSTPQMWPDLIAKAKQGGLDVIQTYVFWNLHE 83

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G Y F GR+DLV F+K +Q  GL++ LRIGP+  +EW YGGFP WLH +PGI +RT 
Sbjct: 84  PQPGMYDFRGRYDLVGFIKEIQAQGLYVCLRIGPFIQSEWKYGGFPFWLHDVPGIVYRTD 143

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  FK  M+ F  KI+++MK+E L+ASQGGPIIL+Q+ENEY N++ A+G  G  YV+WAA
Sbjct: 144 NESFKFYMQNFTTKIVNMMKEEGLYASQGGPIILSQIENEYQNIQKAFGTAGSQYVQWAA 203

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
             AV LNT VPWVMC+Q DAPDP+INTCNG  C + FT PNSP+KP +WTEN++ ++  +
Sbjct: 204 KMAVGLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFTGPNSPNKPALWTENWTSFYQVY 263

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  ED+AF V  F    G++ NYYMY GGTNFGRTA   ++   YD  AP+DEY
Sbjct: 264 GGLPYIRSAEDIAFHVTLFIARNGSYVNYYMYHGGTNFGRTASAYVITGYYD-QAPLDEY 322

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +RQPKWGHL++LH+ IK C   L+     +  LG   E +++ +   +C AFL N D 
Sbjct: 323 GLLRQPKWGHLKQLHEVIKSCSTTLLQGVQRNFSLGQLQEGYVFEEEKGECVAFLKNNDR 382

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
            +   V F    Y L   S+SILPDC+NV FNTA V +  N      + ++N + L    
Sbjct: 383 DNKVTVQFRNRSYELLPRSISILPDCQNVAFNTANVNTTSNR--RIISPKQNFSSL---- 436

Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESL 479
             +  +++ +    N S     L EQ+NTTKD SDYLWYT          K   L+++S 
Sbjct: 437 DDWKQFQDVIPYFDNTSLRSDSLLEQMNTTKDKSDYLWYTLRFEYNLSCRKPT-LSVQSA 495

Query: 480 GHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAG 539
            H A  F+N   +   +GNHD  +F +   + +N+G N L ILS MVGL + GA+ +   
Sbjct: 496 AHVAHAFINNTYIGGEHGNHDVKSFTLELPVTVNQGTNNLSILSAMVGLPDSGAFLERRF 555

Query: 540 AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           AGL SV L   +    +L++  W YQVG+ GE + + K    +   W Q   + + + LI
Sbjct: 556 AGLISVELQCSEQESLNLTNSTWGYQVGLLGEQLQVYKKQNNSDIGWSQLGNI-MEQLLI 614

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           WYKTTF  PEG  P+ L+L+SMGKG+AWVN QSIGRYW  +                +D+
Sbjct: 615 WYKTTFDTPEGDDPVVLDLSSMGKGEAWVNEQSIGRYWILF----------------HDS 658

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                  G P+Q+LYH+PR+++    N+LV+ EE GG+P  ISL T
Sbjct: 659 K------GNPSQSLYHVPRSFLKDTGNVLVLVEEGGGNPLGISLDT 698


>gi|449529068|ref|XP_004171523.1| PREDICTED: beta-galactosidase 16-like [Cucumis sativus]
          Length = 756

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 352/799 (44%), Positives = 479/799 (59%), Gaps = 56/799 (7%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +WP LI K+KEGG++VI+TYVFWN HEP +G Y F GR D+VRFVK +Q  GL+  LRIG
Sbjct: 1   MWPSLIAKAKEGGIDVIQTYVFWNLHEPQQGTYEFSGRRDIVRFVKEIQAQGLYACLRIG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           P+  AEW+YGG P WLH + GI +R+ N PFK  M+ F  KI+++MK E L+ASQGGPII
Sbjct: 61  PFIEAEWSYGGLPFWLHDVLGIVYRSDNEPFKLHMQNFTTKIVNMMKSEGLYASQGGPII 120

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
           L+Q+ENEY  VE A+G  G  YV+WAA  AV+L T VPW MC+Q DAPDP+INTCNG  C
Sbjct: 121 LSQIENEYTLVEAAFGEKGPPYVQWAAKMAVSLQTGVPWSMCKQNDAPDPVINTCNGMRC 180

Query: 215 -DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFET-GGTFQNYYMY 271
            + FT PNSP+KP +WTEN++ ++ ++G     R  E++AF VA F     GT+ NYYMY
Sbjct: 181 GETFTGPNSPNKPSIWTENWTSFYQTYGEEPYIRSAEEIAFHVALFIAAKNGTYVNYYMY 240

Query: 272 FGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTH 331
            GGTNFGR+A   ++   YD  +P+DEYG  R+PKWGHL+ELH A+KLC   L++   ++
Sbjct: 241 HGGTNFGRSASAFMITGYYD-QSPLDEYGLTREPKWGHLKELHAAVKLCSTPLLTGTKSN 299

Query: 332 QKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
             LG  +EA ++   SN+CAAFL N   + D+NV F    Y LP  S+SILPDCKNV FN
Sbjct: 300 FSLGQSVEAIVFKTESNECAAFLVN-RGAIDSNVLFQNVTYELPLGSISILPDCKNVAFN 358

Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKD 451
           T +V  Q N       Q+ ++ E       +  ++E +    +      +L E + TTKD
Sbjct: 359 TRRVSVQHNTRSMMAVQKFDLLE-------WEEFKEPIPNIDDTELRANELLEHMGTTKD 411

Query: 452 TSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIE 511
            SDYLWYT  +       ++  L ++S  HA   FVN       +G +    F + K I 
Sbjct: 412 RSDYLWYTFRVQQDSPDSQQT-LEVDSRAHALHAFVNGDYAGSAHGIYKEKGFSLAKNIT 470

Query: 512 LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGE 571
           L  GIN + +LS+MVGL + GA+ +   AGL  V +        D S   W Y+VG+ GE
Sbjct: 471 LRNGINNISLLSVMVGLPDSGAFLETRVAGLRRVGI-----QGEDFSEQHWGYKVGLSGE 525

Query: 572 --YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
              I LD  S +N  + + G++   ++ L WYKT F AP G  P+ALNL SMGKG  WVN
Sbjct: 526 QSQIFLDTGS-SNVQWSRLGNS---SQPLTWYKTQFDAPPGDDPIALNLGSMGKGAVWVN 581

Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
           G+ IGRYW ++L P                       G+P+Q  Y++PR+++ P +N LV
Sbjct: 582 GRGIGRYWVSFLTPK----------------------GEPSQKWYNVPRSFLKPTDNQLV 619

Query: 690 IHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW----KPNLGVV---SSSPQVRLA 742
           I EE  G+P +ISL +      C  VSE+  P V SW    K  +  V   +  P+V+L+
Sbjct: 620 ILEEETGNPVEISLDSVLITKTCGQVSESHYPLVASWMGAKKQKVRRVKNRTRRPKVQLS 679

Query: 743 CERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGV 801
           C     I+ I FAS+G P G+C S+  G CH  +   IV+ AC+G+ +CSIP+S+  L  
Sbjct: 680 CPSKKKISNILFASFGTPSGDCQSYAIGLCHSPNSRAIVEHACLGRAKCSIPISN--LNF 737

Query: 802 SAGACPGLLKALAVEAHCS 820
               CP + K L V+A C+
Sbjct: 738 RGDPCPHVTKTLLVDAQCT 756


>gi|125597922|gb|EAZ37702.1| hypothetical protein OsJ_22044 [Oryza sativa Japonica Group]
          Length = 811

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 347/833 (41%), Positives = 473/833 (56%), Gaps = 70/833 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTY+ R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETYVFWN HEP R
Sbjct: 31  VTYNDRSLVIDGERRIIISGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYVFWNGHEPHR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            QY F G +D+VRF K +Q AGL+  LRIGPY C EWNYGG P WL  IPG+QFR  N P
Sbjct: 91  RQYNFVGNYDIVRFFKEIQNAGLYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNAP 150

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAAD 182
           F+ EM+ F   I++ MK  N+FA QGGPIILAQ+ENEYGN+  +         Y+ W AD
Sbjct: 151 FENEMEIFTTLIVNKMKDANMFAGQGGPIILAQIENEYGNIMGQLNNNQSASEYIHWCAD 210

Query: 183 TAVNLNTSVPWVMCQQE-DAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A   N  VPW+MCQQ+ D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++  
Sbjct: 211 MANKQNVGVPWIMCQQDSDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAWDK 270

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVA FF+                     GGP + TSYDYDAP+DEYG 
Sbjct: 271 PDFHRSAEDIAFAVAMFFQ-------------------KRGGPYITTSYDYDAPLDEYGN 311

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           +RQPK+GHL++LH  IK  E+ L+  +        K+    Y   S   A F+ N + + 
Sbjct: 312 LRQPKYGHLKDLHSVIKSIEKILVHGEYVDTNYSDKVTVTKYTLDSTS-ACFINNRNDNM 370

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D NVT +G  + LPAWSVSILPDCK V FN+AK+ +Q          +  + E    S  
Sbjct: 371 DVNVTLDGTTHLLPAWSVSILPDCKTVAFNSAKIKAQTT----VMVNKAKMVEKEPESLK 426

Query: 422 FSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           +SW  E +         S+ + +L EQI T+ D SDYLWY  SI+        +F+N  +
Sbjct: 427 WSWMRENLTPFMTDEKGSYRKNELLEQIVTSTDQSDYLWYRTSINHKGEASYTLFVN--T 484

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +  +    F +    +L++G N + +LS  +GL+NYG  F+  
Sbjct: 485 TGHELYAFVNGMLVGQNHSPNGHFVFQLESPAKLHDGKNYISLLSATIGLKNYGPLFEKM 544

Query: 539 GAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEY--IGLDKISLANSSFWKQGSTLPV 594
            AG+    V LID      DLS+  W Y+ G+ GEY  I LDK      ++     T+P+
Sbjct: 545 PAGIVGGPVKLIDNNGKGIDLSNSSWSYKAGLAGEYRQIHLDK---PGCTWDNNNGTVPI 601

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST--GCTKKCD 652
           NK   WYKTTF AP G+  + ++L  + KG AWVNG ++GRYW +Y A  +         
Sbjct: 602 NKPFTWYKTTFQAPAGEDTVVVDLLGLNKGVAWVNGNNLGRYWPSYTAARSMRRLPTTAH 661

Query: 653 YRGSY----DASKCQKHCGQPAQTLYHIPRTWVHPGE-NLLVIHEELGGDPSKISLLTKT 707
           YRG +    D  KC   CG+P+Q  YH+PR+++  GE N +++ EE GGDPS +S  T  
Sbjct: 662 YRGVFQAEGDGQKCLTGCGEPSQRFYHVPRSFLKNGEPNTVILFEEAGGDPSHVSFRTVA 721

Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLAC-ERGWHIAAINFASYGIPEGNCGS 766
              +C+     D                   + L+C +    I+AIN  S+G+  G CG+
Sbjct: 722 AGSVCASAEVGD------------------TITLSCGQHSKTISAINVTSFGVARGQCGA 763

Query: 767 FRPGACHMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           ++ G           +AC+G+  C++ +++A   V+   C  L   L V+A C
Sbjct: 764 YKGGCESKAAYKAFTEACLGKESCTVQITNA---VTGSGC--LSNVLTVQASC 811


>gi|6686900|emb|CAB64750.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 887

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 345/833 (41%), Positives = 485/833 (58%), Gaps = 54/833 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD  +L+I+GKR +  SGS+HYPRSTP++WP +I K++ GGL  I+TYVFWN HEP +
Sbjct: 41  VTYDGTSLIINGKRELFFSGSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEQ 100

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y F+GRFDLV+F+K + E GL++ LR+GP+  AEWN+GG P WL  +P + FRT N P
Sbjct: 101 GKYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEP 160

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FKE  +R++ KI+ +MK+E LFASQGGPIIL Q+ENEY  V+ AY   GE Y+KWAA+  
Sbjct: 161 FKEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLV 220

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
            ++N  +PWVMC+Q DAP  +IN CNG +C D F  PN   KP +WTEN++  F  FG  
Sbjct: 221 ESMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDP 280

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF+VAR+F   G+  NYYMY GGTNFGRT+    V T Y  DAP+DE+G  
Sbjct: 281 PTQRTAEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLE 339

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           + PK+GHL+ +H+A++LC++ L       Q LG   E   Y +  +  CAAFL+N ++  
Sbjct: 340 KAPKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRD 399

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              + F G  Y LP+ S+SILPDCK VV+NTA++++Q +  D  F + +  ++ L     
Sbjct: 400 TNTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRD--FVKSEKTSKGL----K 453

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQ-GKEVFLNI 476
           F  + E +    +   + P   E    TKD +DY WYT S+ +     P Q G +  L +
Sbjct: 454 FEMFSENIPSLLDGDSLIP--GELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRV 511

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            SLGHA +V+VN +     +G H+  +F   K +    G N + IL ++ GL + G++ +
Sbjct: 512 ASLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYME 571

Query: 537 VAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
              AG  ++ +I LK+G RDL+ + EW +  G+EGE   +     +    W++       
Sbjct: 572 HRFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGE---R 628

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           K L WYKT F  PEG   +A+ +  MGKG  WVNG  +GRYW ++L+P            
Sbjct: 629 KPLTWYKTYFETPEGVNAVAIRMKGMGKGLIWVNGIGVGRYWMSFLSP------------ 676

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHIC 712
                      G+P QT YHIPR+++     +N+LVI  EE G     I  +      IC
Sbjct: 677 ----------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTIC 726

Query: 713 SFVSEADPPPVDSWK-PNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSF 767
           S V E  P  V SWK     +VS S  +RL     C     +  + FAS+G P G CG+F
Sbjct: 727 SNVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNF 786

Query: 768 RPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             G C       +V+K C+G+  CSI V+    G     CP ++K LAV+  C
Sbjct: 787 TMGKCSASKSKEVVEKECLGRNYCSIVVARETFG--DKGCPEIVKTLAVQVKC 837


>gi|108862584|gb|ABA97655.2| Beta-galactosidase precursor, putative, expressed [Oryza sativa
           Japonica Group]
          Length = 713

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 330/642 (51%), Positives = 422/642 (65%), Gaps = 51/642 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRA++I GKRR+L S  +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP 
Sbjct: 63  NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122

Query: 64  RGQYYFEGRFDLVRFVKT--------VQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPG 115
           +GQYYFE RFDLV+F K         V   GLFL LRIGPYACAEWN+GGFPVWL  IPG
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLVAAEGLFLFLRIGPYACAEWNFGGFPVWLRDIPG 182

Query: 116 IQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL 175
           I+FRT N PFK EM+ F+ KI+ LMK+E L++ QGGPIIL Q+ENEYGN++  YG  G+ 
Sbjct: 183 IEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSWQGGPIILQQIENEYGNIQGNYGQAGKR 242

Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGW 235
           Y++WAA  A+ L+T +PWVMC+Q DAP+ II+TCN FYCDGF PNS +KP +WTE++ GW
Sbjct: 243 YMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDTCNAFYCDGFKPNSYNKPTIWTEDWDGW 302

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
           +  +G A+P RP ED AFAVARF++ GG+ QNYYMYFGGTNF RTAGGPL  TSYDYDAP
Sbjct: 303 YADWGGALPHRPAEDSAFAVARFYQRGGSLQNYYMYFGGTNFARTAGGPLQITSYDYDAP 362

Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHK-------- 345
           IDEYG +RQPKWGHL++LH AIKLCE  LI+ D  P + KLG+  EAH+Y          
Sbjct: 363 IDEYGILRQPKWGHLKDLHTAIKLCEPALIAVDGSPQYIKLGSMQEAHVYSTGEVHTNGS 422

Query: 346 ---SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRN-- 400
              ++  C+AFLAN D    A+V   G  Y LP WSVSILPDC+NV FNTA++ +Q +  
Sbjct: 423 MAGNAQICSAFLANIDEHKYASVWIFGKSYSLPPWSVSILPDCENVAFNTARIGAQTSVF 482

Query: 401 --NGDHPFAQQKNVNELLLASS-----AFSWY--EEKVGISGNRSFVRPDLAEQINTTKD 451
                 P    ++   +L  +S     + +W+  +E +G  G  +F    + E +N TKD
Sbjct: 483 TVESGSPSRSSRHKPSILSLTSGGPYLSSTWWTSKETIGTWGGNNFAVQGILEHLNVTKD 542

Query: 452 TSDYLWYTASIHVMPG-------QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANF 504
            SDYLWYT  +++          +G    L I+ +   A VFVN KL     G+      
Sbjct: 543 ISDYLWYTTRVNISDADVAFWSSKGVLPSLTIDKIRDVARVFVNGKLAGSQVGHW----V 598

Query: 505 LINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWI 563
            + + I+L EG+N L +LS +VGLQNYGA+ +  GAG    V L  L +G  DL++  W 
Sbjct: 599 SLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEKDGAGFRGQVTLTGLSDGDVDLTNSLWT 658

Query: 564 YQVGVEGEYIGL---DKISLANSSFWKQGSTLPVNKSLIWYK 602
           YQVG++GE+  +   +K   A  S  ++ S  P      WYK
Sbjct: 659 YQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQP----FTWYK 696


>gi|356507642|ref|XP_003522573.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
          Length = 696

 Score =  633 bits (1633), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 342/707 (48%), Positives = 441/707 (62%), Gaps = 47/707 (6%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYD R+L+IDG+ ++L SGSIHYPRSTP++WP LI K+KEGGL+VI+TYVFWN HEP 
Sbjct: 26  NVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHEPQ 85

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY F G  ++VRF+K +Q  GL++ LRIGPY  +E  YGG P+WLH IPGI FR+ N 
Sbjct: 86  QGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIVFRSDNE 145

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            FK  M+RF AKI++LMK  NLFASQGGPIIL+Q+ENEYGNVE A+   G  Y++WAA  
Sbjct: 146 QFKFHMQRFTAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAAQM 205

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFG 240
           AV L T VPWVMC+Q++APDP+INTCNG  C G T   PNSP+KP +WTEN++ ++  FG
Sbjct: 206 AVGLQTGVPWVMCKQDNAPDPVINTCNGMQC-GKTFKGPNSPNKPSLWTENWTSFYQVFG 264

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+A+ VA F    G++ NYYMY GGTNF R A   +V   YD +AP+DEYG
Sbjct: 265 EVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASAFVVTAYYD-EAPLDEYG 323

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +R+PKWGHL+ELH+AIK C   L+    T   LG +  A+++ +SS +CAAFL N +  
Sbjct: 324 LVREPKWGHLKELHEAIKSCSNSLLYGTQTSFSLGTQQNAYVFRRSSIECAAFLENTEDR 383

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
           S   + F    Y LP  S+SILPDCKNV FNTAKV +Q           + +   L  +S
Sbjct: 384 S-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVRAQ---------NARAMKSQLQFNS 433

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           A  W  Y E +    + S     L +QI+T KDTSDYLWYT  ++      + + L+  S
Sbjct: 434 AEKWKVYREAIPSFADTSLRANTLLDQISTAKDTSDYLWYTFRLYDNSANAQSI-LSAYS 492

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            GH    FVN  LV   +G+H   +F++  K+ L  G+N +  LS  VGL N GA+ +  
Sbjct: 493 HGHVLHAFVNGNLVGSKHGSHKNVSFVMENKLNLISGMNNISFLSATVGLPNSGAYLEGR 552

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
            AGL S     LK   RD ++  W YQVG+ GE + +   S ++   W+  S L   K L
Sbjct: 553 VAGLRS-----LKVQGRDFTNQAWGYQVGLLGEKLQIYTASGSSKVKWE--SFLSSTKPL 605

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYKTTF AP G  P+ LNL SMGKG  WVNGQ IGRYW ++  P               
Sbjct: 606 TWYKTTFDAPVGNDPVVLNLGSMGKGYTWVNGQGIGRYWVSFHTPQ-------------- 651

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                   G P+Q  YHIPR+ +    NLLV+ EE  G+P  I+L T
Sbjct: 652 --------GTPSQKWYHIPRSLLKSTGNLLVLLEEETGNPLGITLDT 690


>gi|222631666|gb|EEE63798.1| hypothetical protein OsJ_18622 [Oryza sativa Japonica Group]
          Length = 765

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 354/826 (42%), Positives = 473/826 (57%), Gaps = 93/826 (11%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L   +TYD RALV+ G RR+  SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25  LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EPI+GQY FEGR+DLV+F++ +Q  GL++ LRIGP+  AEW YGGFP WLH +P I FR+
Sbjct: 85  EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+ F+ KI+ +MK E L+  QGGPII++Q+ENEY  +E A+G  G  YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLS 238
           A  AV L T VPW+MC+Q DAPDP+INTCNG  C + F  PNSP+KP +WTEN++  +  
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPI 264

Query: 239 FGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
           +G     R  ED+AFAVA F     G+F +YYMY GGTNFGR A    V TSY   AP+D
Sbjct: 265 YGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLD 323

Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           EY F                                                C AFL N+
Sbjct: 324 EYDF-----------------------------------------------KCVAFLVNF 336

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN-VNELL 416
           D  +   V F      L   S+S+L DC+NVVF TAKV +Q  +      Q  N +N   
Sbjct: 337 DQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWK 396

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEV-FLN 475
                      K   +GN+ F      EQ+ TTKD +DYLWY  S       G ++  L 
Sbjct: 397 AFIEPVPQDLSKSTYTGNQLF------EQLTTTKDETDYLWYIVSYKNRASDGNQIAHLY 450

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
           ++SL H    FVN + V   +G+HD   N ++N  + L EG NT+ +LS+MVG  + GA+
Sbjct: 451 VKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAY 510

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
            +    G+ +V +   +     L++  W YQVG+ GE   +      NS  W   + L +
Sbjct: 511 MERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSVRWMDINNL-I 569

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L WYKTTF  P G   + LNL SMGKG+ WVNG+SIGRYW ++ APS          
Sbjct: 570 YHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS---------- 619

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                       GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++ T +   +C  
Sbjct: 620 ------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGN 667

Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
           V E   PP+ S           P+VR+ C+ G  I++I FASYG P G+C SFR G+CH 
Sbjct: 668 VDEFSVPPLQS-------RGKVPKVRIWCQGGNRISSIEFASYGNPVGDCRSFRIGSCHA 720

Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           +    +V+++C+G+  CSIPV +A  G     CPG+ K+L V A C
Sbjct: 721 ESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 764


>gi|255558624|ref|XP_002520337.1| beta-galactosidase, putative [Ricinus communis]
 gi|223540556|gb|EEF42123.1| beta-galactosidase, putative [Ricinus communis]
          Length = 771

 Score =  631 bits (1628), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 360/828 (43%), Positives = 471/828 (56%), Gaps = 102/828 (12%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + NVTYD R+L+I+G+ R+L SGSIHYPRSTPE                           
Sbjct: 37  AGNVTYDGRSLIINGEHRILFSGSIHYPRSTPE--------------------------- 69

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
                Y F+GR DLV+F+  VQ  GL+  LRIGP+   EW YGG P WLH + GI FR+ 
Sbjct: 70  -----YDFDGRKDLVKFLLEVQAQGLYAALRIGPFIEGEWTYGGLPFWLHDVSGIVFRSD 124

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK+ M+RF+ KI+++MK   L+ASQGGPII++Q+ENEY NVE A+   G  YV WAA
Sbjct: 125 NEPFKKHMQRFVTKIVNMMKYNQLYASQGGPIIISQIENEYQNVETAFHEKGSRYVHWAA 184

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
           + AV LNT VPWVMC+Q DAPDP+INTCNG  C + F  PNSP+KP MWTEN++ ++  F
Sbjct: 185 NMAVRLNTGVPWVMCKQTDAPDPVINTCNGMRCGETFAGPNSPNKPSMWTENWTSFYQVF 244

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  ED+AF VA F    G++ NYYMY GGTNFGRT G   V TSY   AP+DEY
Sbjct: 245 GGEPYIRTAEDIAFHVALFIARNGSYVNYYMYHGGTNFGRT-GSAFVTTSYYDQAPLDEY 303

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK--LGAKLEAHIYHKSSNDCAAFLANY 357
           G IRQPKWGHL++LH  IK C + LI    THQ   LG   EA+++ + S DC AFL N 
Sbjct: 304 GLIRQPKWGHLKDLHAKIKSCSKTLIRG--THQTFPLGRLQEAYVFREKSGDCVAFLVNN 361

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
           D   D  V F    Y LP  S+SILPDCK++ FNTAKV +Q        +Q+        
Sbjct: 362 DGRRDVTVRFQNRSYELPHKSISILPDCKSITFNTAKVNTQYATRSATLSQE-------- 413

Query: 418 ASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLN 475
            SS   W  Y+E V    + S     L + ++TTKDTSDYLWYT        + +   L 
Sbjct: 414 FSSVGKWEEYKETVATFDSTSLRAKTLLDHLSTTKDTSDYLWYTFRFQNHFSRPQST-LR 472

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
             S GH    +VN       +G+H+  +F +   + L  G N + +LS+ VGL + GA+ 
Sbjct: 473 AYSRGHVLHAYVNGVYAGSAHGSHESTSFTLENSVRLKNGTNNVALLSVTVGLPDSGAYL 532

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ--GSTLP 593
           +   AGL  V + +     +D ++  W YQVG+ GE + +   +  N   W +  G+T P
Sbjct: 533 ERRVAGLHRVRIQN-----KDFTTYSWGYQVGLLGEKLQIYTDNGLNKVSWNEFRGTTQP 587

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
               L WYKT F AP G  P+ALNL SMGKG+AWVNGQSIGRYW                
Sbjct: 588 ----LTWYKTQFDAPAGSDPIALNLHSMGKGEAWVNGQSIGRYWV--------------- 628

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
             S+  SK     G P+QT YHIP+++V P  NLLV+ EE  G P  I++ + +   +C 
Sbjct: 629 --SFSTSK-----GNPSQTRYHIPQSFVKPTGNLLVLLEEEKGYPPGITVDSISISKVCG 681

Query: 714 FVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
            VSE                S    V+L+C    +I+ I F+S+G PEGNC  +  G CH
Sbjct: 682 HVSE----------------SHKSVVQLSCPPNRNISRILFSSFGTPEGNCNQYAIGKCH 725

Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             +   IV+KAC+G+ +C I  S+ + G     CPG+ K L V+A C+
Sbjct: 726 SSNSRAIVEKACIGKTKCIILRSNRFFG--GDPCPGIRKGLLVDAKCT 771


>gi|224083510|ref|XP_002307056.1| predicted protein [Populus trichocarpa]
 gi|222856505|gb|EEE94052.1| predicted protein [Populus trichocarpa]
          Length = 715

 Score =  631 bits (1627), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 334/728 (45%), Positives = 455/728 (62%), Gaps = 45/728 (6%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP L+ K++EGG++VI+TYVFWN HEP
Sbjct: 23  GDVTYDGRSLIIDGQRKILFSGSIHYPRSTPEMWPSLVAKAREGGVDVIQTYVFWNLHEP 82

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+Y F GR DLVRF+K +Q  GL++ LRIGP+  +EW YGGFP WLH +P I +R+ N
Sbjct: 83  RPGEYDFSGRNDLVRFIKEIQAQGLYVCLRIGPFIESEWTYGGFPFWLHDVPDIVYRSDN 142

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI+++MK E L+ASQGGPIIL+Q+ENEY NVE A+   G  YV WAA 
Sbjct: 143 EPFKFYMQNFTTKIVNMMKSEGLYASQGGPIILSQIENEYQNVEAAFRDKGPPYVIWAAK 202

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSF 239
            AV L T VPWVMC+Q DAPDP+INTCNG  C G T   PNSP+KP +WTEN++ ++  +
Sbjct: 203 MAVELQTGVPWVMCKQTDAPDPVINTCNGMRC-GETFGGPNSPTKPSLWTENWTSFYQVY 261

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  ED+AF V  F    G++ NYYM+ GGTNFGRTA   ++ + YD  AP+DEY
Sbjct: 262 GGEPYIRSAEDIAFHVTLFIAKNGSYINYYMFHGGTNFGRTASAYVITSYYD-QAPLDEY 320

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G IRQPKWGHL+ELH AIK C   ++    ++  LG   +A+I+ +    CAAFL N D 
Sbjct: 321 GLIRQPKWGHLKELHAAIKSCSSTILEGVQSNFSLGQLQQAYIFEEEGAGCAAFLVNNDQ 380

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
            ++A V F    + L   S+S+LPDC+N++FNTAKV ++ N         +  ++L   +
Sbjct: 381 KNNATVEFRNITFELLPKSISVLPDCENIIFNTAKVNAKGNE------ITRTSSQLFDDA 434

Query: 420 SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEVFLNIES 478
             +  Y + +    + +     L E +NTTKD SDYLWYT S   +P     E  L++ES
Sbjct: 435 DRWEAYTDVIPNFADTNLKSDTLLEHMNTTKDKSDYLWYTFSF--LPNSSCTEPILHVES 492

Query: 479 LGHAALVFVNKKLVAFGYGNHDFAN-FLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           L H A  FVN K     +G+ D    F +   I LN+ +NT+ ILS MVGLQ+ GA+ + 
Sbjct: 493 LAHVASAFVNNKYAGSAHGSKDAKGPFTMEAPIVLNDQMNTISILSTMVGLQDSGAFLER 552

Query: 538 AGAGLFSVILIDLKNGKRDL----SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
             AGL  V   +++  ++++    ++ EW YQ G+ GE + +      ++  W +  +  
Sbjct: 553 RYAGLTRV---EIRCAQQEIYNFTNNYEWGYQAGLSGESLNIYMREHLDNIEWSEVVS-A 608

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
            ++ L W+K  F AP G  P+ LNL++MGKG+AWVNGQSIGRYW ++L            
Sbjct: 609 TDQPLSWFKIEFDAPTGNDPVVLNLSTMGKGEAWVNGQSIGRYWLSFLTSK--------- 659

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
                        GQP+QTLYHIPR +++   NLLV+ EE GGDP  ISL T +   +  
Sbjct: 660 -------------GQPSQTLYHIPRAFLNSSGNLLVLLEESGGDPLHISLDTVSRTGLQE 706

Query: 714 FVSEADPP 721
             S   PP
Sbjct: 707 HASRYHPP 714


>gi|218196839|gb|EEC79266.1| hypothetical protein OsI_20049 [Oryza sativa Indica Group]
          Length = 761

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 353/826 (42%), Positives = 473/826 (57%), Gaps = 93/826 (11%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L   +TYD RALV+ G RR+  SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 21  LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EPI+GQY FEGR+DLV+F++ +Q  GL++ LRIGP+  AEW YGGFP WLH +P I FR+
Sbjct: 81  EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+ F+ KI+ +MK E L+  QGGPII++Q+ENEY  +E A+G  G  YV+WA
Sbjct: 141 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 200

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLS 238
           A  AV L T VPW+MC+Q DAPDP+INTCNG  C + F  PNSP+KP +WTEN++  +  
Sbjct: 201 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRYPI 260

Query: 239 FGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPID 297
           +G     R  ED+AFAVA +     G+F +YYMY GGTNFGR A    V TSY   AP+D
Sbjct: 261 YGNDTKLRDPEDIAFAVALYIARKKGSFVSYYMYHGGTNFGRFAAS-YVTTSYYDGAPLD 319

Query: 298 EYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           EY F                                                C AFL N+
Sbjct: 320 EYDF-----------------------------------------------KCVAFLVNF 332

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN-VNELL 416
           D  +   V F      L   S+S+L DC+NVVF TAKV +Q  +      Q  N +N   
Sbjct: 333 DQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAVQSLNDINNWK 392

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LN 475
                      K   +GN+ F      EQ+ TTKD +DYLWY  S       G ++  L 
Sbjct: 393 AFIEPVPQDLSKSTYTGNQLF------EQLTTTKDETDYLWYIVSYKNRASDGNQIARLY 446

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFA-NFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
           ++SL H    FVN + V   +G+HD   N ++N  + L EG NT+ +LS+MVG  + GA+
Sbjct: 447 VKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSVMVGSPDSGAY 506

Query: 535 FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPV 594
            +    G+ +V +   +     L++  W YQVG+ GE   +      NS  W   + L +
Sbjct: 507 MERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGPNSVRWMDINNL-I 565

Query: 595 NKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYR 654
              L WYKTTF  P G   + LNL SMGKG+ WVNG+SIGRYW ++ APS          
Sbjct: 566 YHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS---------- 615

Query: 655 GSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
                       GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++ T +   +C  
Sbjct: 616 ------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVNTMSVTTVCGN 663

Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM 774
           V E   PP+ S           P+VR+ C+ G  I++I FASYG P G+C SFR G+CH 
Sbjct: 664 VDEFSVPPLQS-------RGKVPKVRIWCQGGKRISSIEFASYGNPVGDCRSFRIGSCHA 716

Query: 775 DVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           +    +V+++C+G+  CSIPV +A  G     CPG+ K+L V A C
Sbjct: 717 ESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 760


>gi|24417238|gb|AAN60229.1| unknown [Arabidopsis thaliana]
          Length = 569

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 306/554 (55%), Positives = 388/554 (70%), Gaps = 21/554 (3%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYDH+AL+I+G+RR+L SGSIHYPRSTPE+WP+LI+K+KEGGL+VI+TYVFWN HEP
Sbjct: 27  AVVTYDHKALIINGQRRILISGSIHYPRSTPEMWPDLIKKAKEGGLDVIQTYVFWNGHEP 86

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G YYF+ R+DLV+F K V +AGL+L LRIGPY CAEWN+GGFPVWL ++PG+ FRT N
Sbjct: 87  SPGNYYFQDRYDLVKFTKLVHQAGLYLDLRIGPYVCAEWNFGGFPVWLKYVPGMVFRTDN 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F  KI+D+MK+E LF +QGGPIIL+Q+ENEYG ++W  G  G+ Y KW A+
Sbjct: 147 EPFKIAMQKFTKKIVDMMKEEKLFETQGGPIILSQIENEYGPMQWEMGAAGKAYSKWTAE 206

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            A+ L+T VPW+MC+QEDAP PII+TCNGFYC+GF PNS +KP +WTEN++GWF  FG A
Sbjct: 207 MALGLSTGVPWIMCKQEDAPYPIIDTCNGFYCEGFKPNSDNKPKLWTENWTGWFTEFGGA 266

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
           +P RPVED+AF+VARF + GG+F NYYMY GGTNF RTA G  +ATSYDYDAPIDEYG +
Sbjct: 267 IPNRPVEDIAFSVARFIQNGGSFMNYYMYXGGTNFDRTA-GVFIATSYDYDAPIDEYGLL 325

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           R+PK+ HL+ELHK IKLCE  L+S DPT   LG K E H++ KS   CAAFL+NYD+SS 
Sbjct: 326 REPKYSHLKELHKVIKLCEPALVSVDPTITSLGDKQEIHVF-KSKTSCAAFLSNYDTSSA 384

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           A V F G  Y LP WSVSILPDCK   +NTAK+ +       P    K    ++  S+ F
Sbjct: 385 ARVMFRGFPYDLPPWSVSILPDCKTEYYNTAKIRA-------PTILMK----MIPTSTKF 433

Query: 423 SWYEEKVGISGNR---SFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFL 474
           SW     G   +    +FV+  L EQI+ T+D +DY WY   I +   +     G    L
Sbjct: 434 SWESYNEGSPSSNEAGTFVKDGLVEQISMTRDKTDYFWYFTDITIGSDESFLKTGDNPLL 493

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I S GHA  VFVN  L    YG    +    ++ I+L+ GIN L +LS  VGL N G  
Sbjct: 494 TIFSAGHALHVFVNGLLAGTSYGALSNSKLTFSQNIKLSVGINKLALLSTAVGLPNAGVH 553

Query: 535 FDVAGAGLFSVILI 548
           ++    G+   + +
Sbjct: 554 YETWNTGILGPVTL 567


>gi|356518551|ref|XP_003527942.1| PREDICTED: beta-galactosidase 16-like [Glycine max]
          Length = 697

 Score =  628 bits (1620), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 340/710 (47%), Positives = 444/710 (62%), Gaps = 51/710 (7%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            NVTYD R+L+IDG+ ++L SGSIHYPRSTP++WP LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 26  GNVTYDGRSLIIDGQHKILFSGSIHYPRSTPQMWPNLIAKAKEGGLDVIQTYVFWNLHEP 85

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQY F G  ++VRF+K +Q  GL++ LRIGPY  +E  YGG P+WLH IPGI FR+ N
Sbjct: 86  QQGQYDFRGMRNIVRFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDIPGIVFRSDN 145

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             FK  M++F AKI++LMK  NLFASQGGPIIL+Q+ENEYGNVE A+   G  Y++WAA 
Sbjct: 146 EQFKFHMQKFSAKIVNLMKSANLFASQGGPIILSQIENEYGNVEGAFHEKGLSYIRWAAQ 205

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSF 239
            AV L T VPWVMC+Q++APDP+INTCNG  C G T   PNSP+KP +WTEN++ ++  F
Sbjct: 206 MAVGLQTGVPWVMCKQDNAPDPVINTCNGMQC-GKTFKGPNSPNKPSLWTENWTSFYQVF 264

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  ED+A+ VA F    G++ NYYMY GGTNF R A   ++   YD +AP+DEY
Sbjct: 265 GEVPYIRSAEDIAYNVALFIAKRGSYVNYYMYHGGTNFDRIASAFVITAYYD-EAPLDEY 323

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +R+PKWGHL+ELH AIK C   ++    T   LG +  A+++ +SS +CAAFL N + 
Sbjct: 324 GLVREPKWGHLKELHAAIKSCSNSILHGTQTSFSLGTQQNAYVFKRSSIECAAFLENTED 383

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
            S   + F    Y LP  S+SILPDCKNV FNTAKV  Q           + +   L  +
Sbjct: 384 QS-VTIQFQNIPYQLPPNSISILPDCKNVAFNTAKVSIQN---------ARAMKSQLEFN 433

Query: 420 SAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIE 477
           SA +W  Y+E +   G+ S     L +QI+TTKDTSDYLWYT  ++      + + L+  
Sbjct: 434 SAETWKVYKEAIPSFGDTSLRANTLLDQISTTKDTSDYLWYTFRLYDNSPNAQSI-LSAY 492

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S GH    FVN  LV   +G+H   +F++  K+ L  G+N +  LS  VGL N GA+ + 
Sbjct: 493 SHGHVLHAFVNGNLVGSIHGSHKNLSFVMENKLNLINGMNNISFLSATVGLPNSGAYLER 552

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVN 595
             AGL S     LK   RD ++  W YQ+G+ GE + +   S ++   W+  Q ST P  
Sbjct: 553 RVAGLRS-----LKVQGRDFTNQAWGYQIGLLGEKLQIYTASGSSKVQWESFQSSTKP-- 605

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
             L WYKTTF AP G  P+ LNL SMGKG  W+NGQ IGRYW ++  P            
Sbjct: 606 --LTWYKTTFDAPVGNDPVVLNLGSMGKGYTWINGQGIGRYWVSFHTPQ----------- 652

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                      G P+Q  YHIPR+ +    NLLV+ EE  G+P  I+L T
Sbjct: 653 -----------GTPSQKWYHIPRSLLKSTGNLLVLLEEETGNPLGITLDT 691


>gi|297724143|ref|NP_001174435.1| Os05g0428100 [Oryza sativa Japonica Group]
 gi|75137607|sp|Q75HQ3.1|BGAL7_ORYSJ RecName: Full=Beta-galactosidase 7; Short=Lactase 7; Flags:
           Precursor
 gi|46391137|gb|AAS90664.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|53981746|gb|AAV25023.1| putative beta-galactosidase [Oryza sativa Japonica Group]
 gi|255676388|dbj|BAH93163.1| Os05g0428100 [Oryza sativa Japonica Group]
          Length = 775

 Score =  625 bits (1611), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 354/836 (42%), Positives = 473/836 (56%), Gaps = 103/836 (12%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L   +TYD RALV+ G RR+  SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25  LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EPI+GQY FEGR+DLV+F++ +Q  GL++ LRIGP+  AEW YGGFP WLH +P I FR+
Sbjct: 85  EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+ F+ KI+ +MK E L+  QGGPII++Q+ENEY  +E A+G  G  YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGW--- 235
           A  AV L T VPW+MC+Q DAPDP+INTCNG  C + F  PNSP+KP +WTEN++     
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPVINTCNGLICGETFVGPNSPNKPALWTENWTSRSNG 264

Query: 236 -------FLSFGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
                  +  +G     R  ED+AFAVA F     G+F +YYMY GGTNFGR A    V 
Sbjct: 265 QNNSAFSYPIYGNDTKLRAPEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFAAS-YVT 323

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSS 347
           TSY   AP+DEY F                                              
Sbjct: 324 TSYYDGAPLDEYDF---------------------------------------------- 337

Query: 348 NDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA 407
             C AFL N+D  +   V F      L   S+S+L DC+NVVF TAKV +Q  +      
Sbjct: 338 -KCVAFLVNFDQHNTPKVEFRNISLELAPKSISVLSDCRNVVFETAKVNAQHGSRTANAV 396

Query: 408 QQKN-VNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP 466
           Q  N +N              K   +GN+ F      EQ+ TTKD +DYLWY  S     
Sbjct: 397 QSLNDINNWKAFIEPVPQDLSKSTYTGNQLF------EQLTTTKDETDYLWYIVSYKNRA 450

Query: 467 GQGKEV-FLNIESLGHAALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSM 524
             G ++  L ++SL H    FVN + V   +G+HD   N ++N  + L EG NT+ +LS+
Sbjct: 451 SDGNQIAHLYVKSLAHILHAFVNNEYVGSVHGSHDGPRNIVLNTHMSLKEGDNTISLLSV 510

Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
           MVG  + GA+ +    G+ +V +   +     L++  W YQVG+ GE   +      NS 
Sbjct: 511 MVGSPDSGAYMERRTFGIQTVGIQQGQQPMHLLNNDLWGYQVGLFGEKDSIYTQEGTNSV 570

Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPS 644
            W   + L +   L WYKTTF  P G   + LNL SMGKG+ WVNG+SIGRYW ++ APS
Sbjct: 571 RWMDINNL-IYHPLTWYKTTFSTPPGNDAVTLNLTSMGKGEVWVNGESIGRYWVSFKAPS 629

Query: 645 TGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
                                 GQP+Q+LYHIPR ++ P +NLLV+ EE+GGDP +I++ 
Sbjct: 630 ----------------------GQPSQSLYHIPRGFLTPKDNLLVLVEEMGGDPLQITVN 667

Query: 705 TKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNC 764
           T +   +C  V E   PP+ S           P+VR+ C+ G  I++I FASYG P G+C
Sbjct: 668 TMSVTTVCGNVDEFSVPPLQS-------RGKVPKVRIWCQGGNRISSIEFASYGNPVGDC 720

Query: 765 GSFRPGACHMDVL-PIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            SFR G+CH +    +V+++C+G+  CSIPV +A  G     CPG+ K+L V A C
Sbjct: 721 RSFRIGSCHAESSESVVKQSCIGRRGCSIPVMAAKFG--GDPCPGIQKSLLVVADC 774


>gi|30697899|ref|NP_568978.2| beta-galactosidase 6 [Arabidopsis thaliana]
 gi|75170268|sp|Q9FFN4.1|BGAL6_ARATH RecName: Full=Beta-galactosidase 6; Short=Lactase 6; Flags:
           Precursor
 gi|10177061|dbj|BAB10473.1| beta-galactosidase [Arabidopsis thaliana]
 gi|332010416|gb|AED97799.1| beta-galactosidase 6 [Arabidopsis thaliana]
          Length = 718

 Score =  624 bits (1608), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 329/705 (46%), Positives = 433/705 (61%), Gaps = 36/705 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP  
Sbjct: 32  VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHEPKL 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F GR DLV+F+K ++  GL++ LRIGP+  AEWNYGG P WL  +PG+ +RT N P
Sbjct: 92  GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 151

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F AKI+DLMK E L+ASQGGPIIL+Q+ENEY NVE A+   G  Y+KWA   A
Sbjct: 152 FKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAGQMA 211

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L T VPW+MC+  DAPDP+INTCNG  C + F  PNSP+KP MWTE+++ +F  +G  
Sbjct: 212 VGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVYGKE 271

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF  A F    G++ NYYMY GGTNFGRT+    +   YD  AP+DEYG +
Sbjct: 272 PYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 330

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELH AIK     L+    T   LG   +A+++  ++N C AFL N D+ + 
Sbjct: 331 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDAKA- 389

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           + + F  N Y L   S+ IL +CKN+++ TAKV  + N       Q  NV +       +
Sbjct: 390 SQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQVFNVPD------NW 443

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
           + + E +      S     L E  N TKD +DYLWYT+S  +  P     ++   ES GH
Sbjct: 444 NLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDSPCTNPSIY--TESSGH 501

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
              VFVN  L   G+G+ D     +   + L  G N + ILS MVGL + GA+ +    G
Sbjct: 502 VVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYG 561

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
           L  V +        DLS  +W Y VG+ GE + L +    N   W      L  N+ L W
Sbjct: 562 LTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAW 621

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKTTF  P G GP+ L+++SMGKG+ WVNG+SIGRYW ++L P+                
Sbjct: 622 YKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPA---------------- 665

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                 GQP+Q++YHIPR ++ P  NLLV+ EE GGDP  ISL T
Sbjct: 666 ------GQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNT 704


>gi|6686884|emb|CAB64742.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 718

 Score =  624 bits (1608), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 329/705 (46%), Positives = 433/705 (61%), Gaps = 36/705 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP  
Sbjct: 32  VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHEPKL 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F GR DLV+F+K ++  GL++ LRIGP+  AEWNYGG P WL  +PG+ +RT N P
Sbjct: 92  GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 151

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F AKI+DLMK E L+ASQGGPIIL+Q+ENEY NVE A+   G  Y+KWA   A
Sbjct: 152 FKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAGQMA 211

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L T VPW+MC+  DAPDP+INTCNG  C + F  PNSP+KP MWTE+++ +F  +G  
Sbjct: 212 VGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVYGKE 271

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF  A F    G++ NYYMY GGTNFGRT+    +   YD  AP+DEYG +
Sbjct: 272 PYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 330

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELH AIK     L+    T   LG   +A+++  ++N C AFL N D+ + 
Sbjct: 331 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDAKA- 389

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           + + F  N Y L   S+ IL +CKN+++ TAKV  + N       Q  NV +       +
Sbjct: 390 SQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQVFNVPD------NW 443

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
           + + E +  S         L E  N TKD +DYLWYT+S  +  P     ++   ES GH
Sbjct: 444 NLFRETIPASQAHLLKTNALLEHTNLTKDKTDYLWYTSSFKLDSPCTNPSIY--TESSGH 501

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
              VFVN  L   G+G+ D     +   + L  G N + ILS MVGL + GA+ +    G
Sbjct: 502 VVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYG 561

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
           L  V +        DLS  +W Y VG+ GE + L +    N   W      L  N+ L W
Sbjct: 562 LTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAW 621

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKTTF  P G GP+ L+++SMGKG+ WVNG+SIGRYW ++L P+                
Sbjct: 622 YKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPA---------------- 665

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                 GQP+Q++YHIPR ++ P  NLLV+ EE GGDP  ISL T
Sbjct: 666 ------GQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNT 704


>gi|110739416|dbj|BAF01618.1| beta-galactosidase like protein [Arabidopsis thaliana]
          Length = 718

 Score =  623 bits (1607), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 329/705 (46%), Positives = 433/705 (61%), Gaps = 36/705 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP  
Sbjct: 32  VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKAKEGGIDVIQTYVFWNLHEPKL 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F GR DLV+F+K ++  GL++ LRIGP+  AEWNYGG P WL  +PG+ +RT N P
Sbjct: 92  GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 151

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F AKI+DLMK E L+ASQGGPIIL+Q+ENEY NVE A+   G  Y+KWA   A
Sbjct: 152 FKFHMQKFTAKIVDLMKSEGLYASQGGPIILSQIENEYANVEGAFHEKGASYIKWAGQMA 211

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L T VPW+MC+  DAPDP+INTCNG  C + F  PNSP+KP MWTE+++ +F  +G  
Sbjct: 212 VGLKTGVPWIMCKSPDAPDPVINTCNGMKCGETFPGPNSPNKPKMWTEDWTSFFQVYGKE 271

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF  A F    G++ NYYMY GGTNFGRT+    +   YD  AP+DEYG +
Sbjct: 272 PYIRSAEDIAFHAALFVAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 330

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELH AIK     L+    T   LG   +A+++  ++N C AFL N D+ + 
Sbjct: 331 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDANNGCVAFLVNNDAKA- 389

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           + + F  N Y L   S+ IL +CKN+++ TAKV  + N       Q  NV +       +
Sbjct: 390 SQIQFRNNAYSLSPKSIGILQNCKNLIYETAKVNVKMNTRVTTPVQVFNVPD------NW 443

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
           + + E +      S     L E  N TKD +DYLWYT+S  +  P     ++   ES GH
Sbjct: 444 NLFRETIPAFPGTSLKTNALLEHTNLTKDKTDYLWYTSSFKLDSPCTNPSIY--TESSGH 501

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
              VFVN  L   G+G+ D     +   + L  G N + ILS MVGL + GA+ +    G
Sbjct: 502 VVHVFVNNALAGSGHGSRDIRVVKLQAPVSLINGQNNISILSGMVGLPDSGAYMERRSYG 561

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
           L  V +        DLS  +W Y VG+ GE + L +    N   W      L  N+ L W
Sbjct: 562 LTKVQISCGGTKPIDLSRSQWGYSVGLLGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAW 621

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKTTF  P G GP+ L+++SMGKG+ WVNG+SIGRYW ++L P+                
Sbjct: 622 YKTTFDGPNGDGPVGLHMSSMGKGEIWVNGESIGRYWVSFLTPA---------------- 665

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                 GQP+Q++YHIPR ++ P  NLLV+ EE GGDP  ISL T
Sbjct: 666 ------GQPSQSIYHIPRAFLKPSGNLLVVFEEEGGDPLGISLNT 704


>gi|297793965|ref|XP_002864867.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
 gi|297310702|gb|EFH41126.1| beta-galactosidase 6 [Arabidopsis lyrata subsp. lyrata]
          Length = 716

 Score =  620 bits (1600), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 328/705 (46%), Positives = 430/705 (60%), Gaps = 36/705 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDG+R++L SGSIHYPRSTPE+WP LI+K+KEGG++VI+TYVFWN HEP  
Sbjct: 30  VTYDGRSLIIDGQRKLLFSGSIHYPRSTPEMWPSLIKKTKEGGIDVIQTYVFWNLHEPKL 89

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F GR DLV+F+K ++  GL++ LRIGP+  AEWNYGG P WL  +PG+ +RT N P
Sbjct: 90  GQYDFSGRNDLVKFIKEIRSQGLYVCLRIGPFIEAEWNYGGLPFWLRDVPGMVYRTDNEP 149

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK  M++F  KI++LMK E L+ASQGGPIIL+Q+ENEY NVE A+   G  Y+KWA   A
Sbjct: 150 FKFHMQKFTTKIVNLMKSEGLYASQGGPIILSQIENEYANVEAAFHEKGASYIKWAGQMA 209

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L T VPW+MC+  DAPDP+INTCNG  C + F  PNSP+KP MWTE+++ +F  +G  
Sbjct: 210 VGLKTGVPWIMCKSPDAPDPVINTCNGMRCGETFPGPNSPNKPKMWTEDWTSFFQVYGTE 269

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R  ED+AF    F    G++ NYYMY GGTNFGRT+    +   YD  AP+DEYG +
Sbjct: 270 PYIRSAEDIAFHAVLFIAKNGSYINYYMYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLL 328

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
           RQPK+GHL+ELH AIK     L+    T   LG   +A+++  +S+ C AFL N D+   
Sbjct: 329 RQPKYGHLKELHAAIKSSANPLLQGKQTILSLGPMQQAYVFEDASSGCVAFLVNNDAKV- 387

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
           + + F  + Y L   S+ IL +CKN+++ TAKV  ++N       Q  NV E       +
Sbjct: 388 SQIQFRKSSYSLSPKSIGILQNCKNLIYETAKVNVEKNKRVTTPVQVFNVPE------KW 441

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-MPGQGKEVFLNIESLGH 481
             + E +      S     L E  N TKD +DYLWYT+S     P     ++  IES GH
Sbjct: 442 EGFRETIPAFSGTSLKANALLEHTNLTKDKTDYLWYTSSFKPDSPCTNPSIY--IESSGH 499

Query: 482 AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
              VFVN  L   G+G+ D     +     L  G N++ ILS MVGL + GA+ +    G
Sbjct: 500 VVHVFVNNALAGSGHGSRDIKVVKLQVPASLTNGQNSISILSGMVGLPDSGAYMERKSYG 559

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNKSLIW 600
           L  V +        DLS  +W Y VG+ GE + L +    N   W   +  L  N+ LIW
Sbjct: 560 LTKVQISCGGTKPIDLSGSQWGYSVGLLGEKVRLQQWRNLNRVKWSMNNAGLIKNRPLIW 619

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKT F  P G GP+ LN++SMGKG+ WVNG+SIGRYW ++L PS                
Sbjct: 620 YKTIFDGPNGDGPVGLNMSSMGKGEIWVNGESIGRYWVSFLTPS---------------- 663

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                 G P+Q++YHIPR ++ P  NLLV+ EE GGDP  ISL T
Sbjct: 664 ------GHPSQSIYHIPREFLKPSGNLLVVFEEEGGDPLGISLNT 702


>gi|414888321|tpg|DAA64335.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
          Length = 837

 Score =  619 bits (1596), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 327/828 (39%), Positives = 469/828 (56%), Gaps = 46/828 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDGKR +  SG+IHYPRS PEVWP+LI ++KEGGL  IETY+FWN HEP  
Sbjct: 36  VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y FEGRFDL++++K +QE  ++  +RIGP+  AEWN+GG P WL  I  I FR  N+P
Sbjct: 96  GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+EM++F+  I+  +K   LFASQGGPIIL Q+ENEYGN++  +   G+ Y++WAA  A
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++  T VPW+MC+Q  AP  +I TCNG +C D +T    +KP++WTEN++  F ++G  V
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GG+  NYYMY GGTNFGRT G   V T Y  +AP+DEYG  +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
           +PK+GHLR+LH  I+  ++  +    + + LG   EAHI+     N C +FL+N ++  D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  +++P+ SVSIL  CKNVV+NT +V  Q N   +      + +E+   ++ +
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSY------HTSEVTSKNNQW 448

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
             Y EK+    +      +  EQ N TKD SDYLWYT S  +    +P +      L ++
Sbjct: 449 EMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVK 508

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  H+ + F N   V    G+     F+  K ++L  G+N + +LS  +G+++ G     
Sbjct: 509 SSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAE 568

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             +G+   ++  L  G  DL    W ++  +EGE   +          WK        ++
Sbjct: 569 VKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWKPAEN---GRA 625

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
             WYK  F  P+G  P+ L+++SM KG  +VNG+ +GRYW +Y                 
Sbjct: 626 ATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY----------------- 668

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
                +   G P+Q LYHIPR ++   +NLLV+ EE  G P  I + T T   IC F+SE
Sbjct: 669 -----RTLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRDDICLFISE 723

Query: 718 ADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
            +P  + +W     K  L     S +  L C     I  + FAS+G PEG CG+F  G C
Sbjct: 724 HNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEGMCGNFTVGTC 783

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           H  +   IV+K C+G+  C +PV     G     C      L V+  C
Sbjct: 784 HTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTATLGVQVRC 830


>gi|357463559|ref|XP_003602061.1| Beta-galactosidase [Medicago truncatula]
 gi|355491109|gb|AES72312.1| Beta-galactosidase [Medicago truncatula]
          Length = 694

 Score =  617 bits (1592), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 335/709 (47%), Positives = 438/709 (61%), Gaps = 49/709 (6%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYD  +LVI+G  ++L SGSIHYPRSTP++WP+LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 24  ANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDVIQTYVFWNLHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQY F GRFDLV F+K +Q  GL++ LRIGPY  +E  YGG P+WLH +PGI FRT N
Sbjct: 84  QQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDVPGIVFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
           + FK  M+RF  KI+++MK  NLFASQGGPIIL+Q+ENEYG+++  +   G  Y+ WAA 
Sbjct: 144 DQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFRANGLPYIHWAAQ 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNSPSKPIMWTENYSGWFLSFG 240
            AV L T VPW+MC+Q+DAPDP+IN CNG  C  +   PNSP+KP +WTEN++ +  +FG
Sbjct: 204 MAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLWTENWTSFLQAFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A   R   D+A+ VA F    G++ NYYMY GGTNF R A   ++   YD +AP+DEYG
Sbjct: 264 GAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASAFIITAYYD-EAPLDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL+ELH +IK C + L+    T   LG++ +A+++ +SS +CAAFL N    
Sbjct: 323 LVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQAYVF-RSSTECAAFLEN-SGP 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            D  + F    Y LP  S+SILP CKNVVFNT KV  Q N         + +   L  +S
Sbjct: 381 RDVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNN--------VRAMKPRLQFNS 432

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           A +W  Y E +    + S     L +QI+T KDTSDY+WYT   +      K V L+I S
Sbjct: 433 AENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYTFRFNNKSPNAKSV-LSIYS 491

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
            G     F+N  L    +G+ +     + K + L  G+N + ILS  VGL N GA+ +  
Sbjct: 492 QGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNNISILSATVGLPNSGAFLESR 551

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK--QGSTLPVNK 596
            AGL  V +       RD SS  W YQVG+ GE + +  +S ++   WK  Q ST P   
Sbjct: 552 VAGLRKVEV-----QGRDFSSYSWGYQVGLLGEKLQIFTVSGSSKVQWKSFQSSTKP--- 603

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WY+TTF AP G  P+ +NL SMGKG AWVNGQ IGRYW ++  P             
Sbjct: 604 -LTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRYWVSFHKPD------------ 650

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                     G P+Q  YHIPR+++    NLLVI EE  G+P  I+L T
Sbjct: 651 ----------GTPSQQWYHIPRSFLKSTGNLLVILEEETGNPLGITLDT 689


>gi|242090613|ref|XP_002441139.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
 gi|241946424|gb|EES19569.1| hypothetical protein SORBIDRAFT_09g021140 [Sorghum bicolor]
          Length = 784

 Score =  612 bits (1579), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 350/820 (42%), Positives = 467/820 (56%), Gaps = 81/820 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V+ D RALV+DG RR+L +G +HY RSTPE+WP+LI K+KEGGL++I+TYVFWN HEP+
Sbjct: 41  QVSLDARALVVDGTRRLLFAGEMHYTRSTPEMWPKLIAKAKEGGLDMIQTYVFWNVHEPV 100

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +GQY FEGR+DLVRF+K +Q  GL++ LRIGP+  +EW YGGFP WLH +P I FR+ N 
Sbjct: 101 QGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRSDNE 160

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+ M+RF+  I+++MK E L+  QGGPII +Q+ENEY  VE A+G  G+ YV WAA  
Sbjct: 161 PFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEHAFGSSGQRYVSWAAAM 220

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+  T VPW MC+Q DAPDP++    G +      + P        N S  +L +G   
Sbjct: 221 AVDRQTGVPWTMCKQNDAPDPVV----GIHSHTIPLDFP--------NASRNYLIYGNDT 268

Query: 244 PFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
             R  ED+AFAV  F     G++ +YYMY GGTNFGR A    V TSY   AP+DEYG I
Sbjct: 269 KLRSPEDIAFAVVYFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDAAPLDEYGLI 327

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSD 362
            QP WGHLRELH A+K   E L+    ++  LG + EAHI+   S  C AFL N+D    
Sbjct: 328 WQPTWGHLRELHAAVKQSSEPLLFGTYSYLSLGQEQEAHIFETES-QCVAFLVNFDRHHI 386

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ-KNVNELLLASSA 421
           + V F      L   S+SIL DCK VVF TAKV +Q  +      Q   ++N        
Sbjct: 387 SEVVFRNISLELAPKSISILSDCKRVVFETAKVTAQHGSRTAEEVQSFSDINTWTAFKEP 446

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
                 K   SGNR F      E ++TTKD +DYLWY   +          F NI  LG 
Sbjct: 447 IPQDVSKAMYSGNRLF------EHLSTTKDDTDYLWYIVGL----------FHNI--LGR 488

Query: 482 AALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
                         +G+H   AN ++N  I L EG NT+ +LS MVG  + GA  +    
Sbjct: 489 I-------------HGSHGGPANIILNTNISLKEGPNTISLLSAMVGSPDSGAHMERRVF 535

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
           GL  V +   +  +  L++  W YQVG+ GE   +     + S  W     L  +  L W
Sbjct: 536 GLQKVSIQQGQEPENLLNNELWGYQVGLFGERNSIYTQEGSKSVEWTTIYNLAYSP-LTW 594

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YKTTF  P G   + LNL  MGKG+ WVNG+SIGRYW ++ APS                
Sbjct: 595 YKTTFSTPAGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS---------------- 638

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
                 G P+Q+LYHIPR +++P +N+LV+ EE+GG+P +I++ T +   +C  V+E   
Sbjct: 639 ------GNPSQSLYHIPRQFLNPQDNILVLFEEMGGNPQQITVNTVSVTRVCVNVNELS- 691

Query: 721 PPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPI 779
                  P+L   +  P V L C+ G  I+AI FASYG P G+C   R G+CH      +
Sbjct: 692 ------APSLQYKNKEPAVDLRCQEGKQISAIEFASYGNPIGDCKKIRFGSCHAGSSESV 745

Query: 780 VQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           V++AC+G+  CSIP++    G     CPG+ K+L V A+C
Sbjct: 746 VKQACLGKSGCSIPITPIKFG--GDPCPGIKKSLLVVANC 783


>gi|357154419|ref|XP_003576777.1| PREDICTED: beta-galactosidase 12-like [Brachypodium distachyon]
          Length = 835

 Score =  611 bits (1576), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 327/828 (39%), Positives = 468/828 (56%), Gaps = 46/828 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+IDGKR +  SG+IHYPRS PE+WP+L+ ++K+GGL  IETYVFWN HEP  
Sbjct: 33  VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWPKLLDRAKDGGLNTIETYVFWNAHEPEP 92

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y FEGR DL++F+K +Q+  ++  +RIGP+  AEWN+GG P WL  IP I FR  N P
Sbjct: 93  GKYNFEGRCDLIKFLKLIQDNDMYAVIRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 152

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+EM++F+  I+  +K  ++FASQGGPIILAQ+ENEYGN++  +   G+ Y++WAA+ A
Sbjct: 153 YKKEMEKFVRFIVQKLKDADMFASQGGPIILAQIENEYGNIKKDHITDGDKYLEWAAEMA 212

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++ N  +PW+MC+Q  AP  +I TCNG +C D +T    +KP +WTEN++  F +FG   
Sbjct: 213 LSTNIGIPWIMCKQTTAPGVVIPTCNGRHCGDTWTLRDKNKPRLWTENWTAQFRAFGDQA 272

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A++V RFF  GGT  NYYMY+GGTNFGRT G   V T Y  +APIDEYG  +
Sbjct: 273 AVRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEAPIDEYGLNK 331

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
           +PK+GHLR+LHK IK   +  +    + + LG   EAH Y     N C AF++N ++  D
Sbjct: 332 EPKFGHLRDLHKLIKSYHKAFLVGKQSFELLGHGYEAHNYELPEENLCLAFISNNNTGED 391

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  Y++P+ SVSIL DC +VV+NT +V  Q +      A +   N      + +
Sbjct: 392 GTVMFRGKKYYIPSRSVSILADCNHVVYNTKRVFVQHSERSFHTADESTKN------NVW 445

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
             Y E +      S    +  EQ N TKD SDYLWYT S  +    +P  +     + ++
Sbjct: 446 EMYSEPIPRYKVTSVRTKEPLEQYNLTKDKSDYLWYTTSFRLEADDLPFRRDIRPVVQVK 505

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  HA + FVN      G G+     FL  K I+L  GIN L +LS  +G+++ G     
Sbjct: 506 SSAHAMMGFVNDAFAGSGRGSKKDKGFLFEKPIDLRIGINHLALLSSSMGMKDSGGELVE 565

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
              G+   ++  L  G  DL    W +++ ++GE   +       +  WK         +
Sbjct: 566 VKGGIQDCMIQGLNTGTLDLQGNGWGHKINLDGEDKEIYTEKGMGTVKWKPAEN---GHA 622

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
           + WY+  F  P+G  P+ L+++SM KG  +VNG+ +GRYW++Y                 
Sbjct: 623 VTWYRRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWTSY----------------- 665

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
                +   G P+Q+LYHIPR ++   +NLLV+ EE  G P  I + T     IC  +SE
Sbjct: 666 -----KTIAGLPSQSLYHIPRPFLKSKKNLLVVFEEEIGKPEGILIQTVRRDDICFLMSE 720

Query: 718 ADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
            +P  V +W  + G +       S +  L C     I  + FAS+G PEG CG+F  G C
Sbjct: 721 HNPAQVKTWDADGGQIKLIAEDHSSRGILTCPHKKTIEEVVFASFGNPEGACGNFTAGTC 780

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           H  +    V K C+G+  C +P+     G     CP     LAV+  C
Sbjct: 781 HTPNAKEFVAKECLGKKSCVLPLIHTLYGADIN-CPTTTATLAVQVRC 827


>gi|222642000|gb|EEE70132.1| hypothetical protein OsJ_30164 [Oryza sativa Japonica Group]
          Length = 838

 Score =  610 bits (1573), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 328/831 (39%), Positives = 468/831 (56%), Gaps = 48/831 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+IDGKR +  SG+IHYPRS PE+W +L++ +K GGL  IETYVFWN HEP  
Sbjct: 36  VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+YYFEGRFDL+RF+  +++  ++  +RIGP+  AEWN+GG P WL  I  I FR  N P
Sbjct: 96  GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK EM++F+  I+  +K   +FA QGGPIIL+Q+ENEYGN++    V G+ Y++WAA+ A
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++    VPWVMC+Q  AP  +I TCNG +C D +T    +KP +WTEN++  F +FG  +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GGT  NYYMY GGTNFGRT G   V T Y  +AP+DEYG  +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
           +PK+GHLR+LH  IK   +  +    + + LG   EAH Y    +  C +FL+N ++  D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  +++P+ SVSIL DCK VV+NT +V  Q +        + + N +      +
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNV------W 448

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
             Y E +              EQ N TKDTSDYLWYT S  +    +P  +     + I+
Sbjct: 449 EMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 508

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  HA + F N   V  G G+    +F+  K ++L  GIN + +LS  +G+++ G     
Sbjct: 509 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 568

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNK 596
              G+   ++  L  G  DL    W ++  +EGE   +          WK     LP+  
Sbjct: 569 VKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT- 627

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
              WYK  F  P+G  P+ ++++SM KG  +VNG+ IGRYW++++  +            
Sbjct: 628 ---WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA------------ 672

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
                     G P+Q++YHIPR ++ P  NLL+I EE  G P  I + T     IC F+S
Sbjct: 673 ----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFIS 722

Query: 717 EADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           E +P  + +W+ + G +      +S +  L C     I  + FAS+G PEG CG+F  G 
Sbjct: 723 EHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAGT 782

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
           CH  D   IV+K C+G+  C +PV +   G     CP     LAV+  C +
Sbjct: 783 CHTPDAKAIVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQVRCKV 832


>gi|413949218|gb|AFW81867.1| hypothetical protein ZEAMMB73_495459 [Zea mays]
          Length = 759

 Score =  607 bits (1566), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 354/832 (42%), Positives = 476/832 (57%), Gaps = 102/832 (12%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +   VTY+ RALV+DG RR+L +G +HYPRSTPE+WP+LI K+KEGGL+VI+TYVFWN H
Sbjct: 14  VRGEVTYEQRALVLDGARRMLFAGEMHYPRSTPEMWPKLIAKAKEGGLDVIQTYVFWNVH 73

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EPI+GQY FEGR+DLVRF+K +Q  GL++ LRIGP+  +EW YGGFP WLH +P I FR+
Sbjct: 74  EPIQGQYNFEGRYDLVRFIKEIQAQGLYVSLRIGPFIESEWKYGGFPFWLHDVPNITFRS 133

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+RF+  I+++MK E L+  QGGPII +Q+ENEY  VE A+G  G+ YV WA
Sbjct: 134 DNEPFKQHMQRFVTDIVNMMKHEGLYYPQGGPIITSQIENEYQMVEPAFGSSGQRYVSWA 193

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  AV+L T VPW MC+Q DAPDP++             +S + P+ + +N S  +L +G
Sbjct: 194 AAMAVDLQTGVPWTMCKQNDAPDPVVGI-----------HSYTIPVNF-QNDSRNYLIYG 241

Query: 241 YAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
                R  +D+ FAVA F     G++ +YYMY GGTNFGR A    V TSY   AP+DEY
Sbjct: 242 NDTKLRSPQDITFAVALFIARKNGSYVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEY 300

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G I QP WGHLRELH A+K   E L+    ++  +G + EAHI+ ++   C AFL N+D 
Sbjct: 301 GLIWQPTWGHLRELHAAVKQSSEPLLFGTYSNLSIGQEQEAHIF-ETETQCVAFLVNFDQ 359

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
              + V F      L   S+SIL DCK VVF TAKV +Q  +        +   E+   S
Sbjct: 360 HHISEVVFRNISLELAPKSISILLDCKQVVFETAKVNAQHGS--------RTAEEVQSFS 411

Query: 420 SAFSW--YEE-------KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK 470
              +W  ++E       K   SGNR F      E ++TTKD +DYLWY   +        
Sbjct: 412 DISTWKAFKEPIPQDVSKSAYSGNRLF------EHLSTTKDATDYLWYIVGL-------- 457

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSMMVGLQ 529
             FLNI  LG               +G+H   AN + +  I L EG NT+ +LS MVG  
Sbjct: 458 --FLNI--LGRI-------------HGSHGGPANIIFSTNISLQEGPNTISLLSAMVGSP 500

Query: 530 NYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF--WK 587
           + GA  +    G+  V +   +  +  L++  W YQVG+ GE    + I   +S    W 
Sbjct: 501 DSGAHMERRVFGIRKVSIQQGQEPENLLNNELWGYQVGLFGE---RNNIYTQDSKITEWT 557

Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
               L  +  L WYKTTF  P G   + LNL  MGKG+ WVNG+SIGRYW ++ APS   
Sbjct: 558 TIDNLTYSP-LTWYKTTFSTPVGNDAVTLNLTGMGKGEVWVNGESIGRYWVSFKAPS--- 613

Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
                              G P+Q+LYHIPR +++P +N LV+ EE+GG+P  I++ T +
Sbjct: 614 -------------------GNPSQSLYHIPREFLNPQDNTLVLFEEMGGNPQLITVNTMS 654

Query: 708 GQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSF 767
              +C  V+E          P+L      P V L C  G HI+AI FASYG P G+C  F
Sbjct: 655 VSRVCGNVNELS-------APSLQYKDKEPAVDLWCPEGKHISAIEFASYGGPTGDCKKF 707

Query: 768 RPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAH 818
             G CH      +V++AC+G+  CS+PV+    G     CPG+ K+L V A+
Sbjct: 708 GFGRCHAGSSESVVKQACLGKSGCSVPVTPIKFG--GDPCPGIQKSLLVVAN 757


>gi|152013365|sp|Q0IZZ8.2|BGL12_ORYSJ RecName: Full=Beta-galactosidase 12; Short=Lactase 12; Flags:
           Precursor
          Length = 911

 Score =  605 bits (1560), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 327/826 (39%), Positives = 466/826 (56%), Gaps = 48/826 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+IDGKR +  SG+IHYPRS PE+W +L++ +K GGL  IETYVFWN HEP  
Sbjct: 36  VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+YYFEGRFDL+RF+  +++  ++  +RIGP+  AEWN+GG P WL  I  I FR  N P
Sbjct: 96  GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK EM++F+  I+  +K   +FA QGGPIIL+Q+ENEYGN++    V G+ Y++WAA+ A
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++    VPWVMC+Q  AP  +I TCNG +C D +T    +KP +WTEN++  F +FG  +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GGT  NYYMY GGTNFGRT G   V T Y  +AP+DEYG  +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
           +PK+GHLR+LH  IK   +  +    + + LG   EAH Y    +  C +FL+N ++  D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  +++P+ SVSIL DCK VV+NT +V  Q +        + + N      + +
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKN------NVW 448

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
             Y E +              EQ N TKDTSDYLWYT S  +    +P  +     + I+
Sbjct: 449 EMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 508

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  HA + F N   V  G G+    +F+  K ++L  GIN + +LS  +G+++ G     
Sbjct: 509 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 568

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNK 596
              G+   ++  L  G  DL    W ++  +EGE   +          WK     LP+  
Sbjct: 569 VKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT- 627

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
              WYK  F  P+G  P+ ++++SM KG  +VNG+ IGRYW++++  +            
Sbjct: 628 ---WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA------------ 672

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
                     G P+Q++YHIPR ++ P  NLL+I EE  G P  I + T     IC F+S
Sbjct: 673 ----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFIS 722

Query: 717 EADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           E +P  + +W+ + G +      +S +  L C     I  + FAS+G PEG CG+F  G 
Sbjct: 723 EHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAGT 782

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVE 816
           CH  D   IV+K C+G+  C +PV +   G     CP     LAV+
Sbjct: 783 CHTPDAKAIVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQ 827


>gi|357437611|ref|XP_003589081.1| Beta-galactosidase [Medicago truncatula]
 gi|355478129|gb|AES59332.1| Beta-galactosidase [Medicago truncatula]
          Length = 589

 Score =  604 bits (1557), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 308/599 (51%), Positives = 396/599 (66%), Gaps = 17/599 (2%)

Query: 116 IQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL 175
           + FRT N PFK  M++F  KI+ +MK E+LF +QGGPII++Q+ENEYG VEW  G  G+ 
Sbjct: 1   MAFRTDNEPFKAAMQKFTTKIVTMMKAESLFQTQGGPIIMSQIENEYGPVEWEIGAPGKA 60

Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGW 235
           Y KWAA  AV L+T VPW MC+QEDAPDP+I+TCNG+YC+ FTPN   KP MWTEN+SGW
Sbjct: 61  YTKWAAQMAVGLDTGVPWDMCKQEDAPDPVIDTCNGYYCENFTPNENFKPKMWTENWSGW 120

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
           +  FG A+  RP EDLA++VA F +  G+F NYYMY GGTNFGRT+ G  +ATSYDYDAP
Sbjct: 121 YTDFGGAISHRPTEDLAYSVATFIQNRGSFVNYYMYHGGTNFGRTSSGLFIATSYDYDAP 180

Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK-LEAHIYHKSSNDCAAFL 354
           IDEYG   +PKW HL+ LHKAIK CE  LIS DPT   LG K LEAH+Y+ +++ CAAFL
Sbjct: 181 IDEYGLPNEPKWSHLKNLHKAIKQCEPALISVDPTVTWLGNKNLEAHVYYVNTSICAAFL 240

Query: 355 ANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNE 414
           ANYD+ S A VTF    Y LP WSVSILPDCK VVFNTA V     NG H F ++    E
Sbjct: 241 ANYDTKSAATVTFGNGQYDLPPWSVSILPDCKTVVFNTATV-----NG-HSFHKRMTPVE 294

Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----G 469
                 ++S  EE    S + S +   L EQIN T+D+SDYLWY   +++ P +     G
Sbjct: 295 TTFDWQSYS--EEPAYSSDDDSIIANALWEQINVTRDSSDYLWYLTDVNISPSESFIKNG 352

Query: 470 KEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
           +   L I S GH   VFVN +L    YG  D      ++ + L  G N + +LS+ VGL 
Sbjct: 353 QFPTLTINSAGHVLHVFVNGQLSGTVYGGLDNPKVTFSESVNLKVGNNKISLLSVAVGLP 412

Query: 530 NYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
           N G  F+    G+   V L  L  G RDLS  +W Y+VG++GE + L  I+ ++S  W Q
Sbjct: 413 NVGLHFETWNVGVLGPVRLKGLDEGTRDLSWQKWSYKVGLKGESLSLHTITGSSSIDWTQ 472

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
           GS+L   + L WYKTTF AP G  P+AL+++SMGKG+ W+N QSIGR+W AY+A   G  
Sbjct: 473 GSSLAKKQPLTWYKTTFDAPSGNDPVALDMSSMGKGEIWINDQSIGRHWPAYIA--HGNC 530

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            +C+Y G++   KC+ +CG+P Q  YHIPR+W+    N+LV+ EE GGDP+ ISL+ +T
Sbjct: 531 DECNYAGTFTNPKCRTNCGEPTQKWYHIPRSWLSSSGNVLVVLEEWGGDPTGISLVKRT 589


>gi|414888322|tpg|DAA64336.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
          Length = 822

 Score =  601 bits (1550), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 321/827 (38%), Positives = 461/827 (55%), Gaps = 59/827 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDGKR +  SG+IHYPRS PEVWP+LI ++KEGGL  IETY+FWN HEP  
Sbjct: 36  VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y FEGRFDL++++K +QE  ++  +RIGP+  AEWN+GG P WL  I  I FR  N+P
Sbjct: 96  GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+EM++F+  I+  +K   LFASQGGPIIL Q+ENEYGN++  +   G+ Y++WAA  A
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++  T VPW+MC+Q  AP  +I TCNG +C D +T    +KP++WTEN++  F ++G  V
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GG+  NYYMY GGTNFGRT G   V T Y  +AP+DEYG  +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
           +PK+GHLR+LH  I+  ++  +    + + LG   EAHI+     N C +FL+N ++  D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  +++P+ SVSIL  CKNVV+NT +V  Q N   +      + +E+   ++ +
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSY------HTSEVTSKNNQW 448

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
             Y EK+    +      +  EQ N TKD SDYLWYT S  +    +P +      L ++
Sbjct: 449 EMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVK 508

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  H+ + F N   V    G+     F+  K ++L  G+N + +LS  +G+++ G     
Sbjct: 509 SSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMKDSGGELAE 568

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
             +G+   ++  L  G  DL    W ++  +EGE   +          WK        ++
Sbjct: 569 VKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWKPAEN---GRA 625

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
             WYK  F  P+G  P+ L+++SM KG  +VNG+ +GRYW +Y                 
Sbjct: 626 ATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY----------------- 668

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
                +   G P+Q LYHIPR ++   +NLLV+ EE  G P  I + T T   IC F+SE
Sbjct: 669 -----RTLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVTRDDICLFISE 723

Query: 718 ADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
            +P  + +W     K  L     S +  L C     I  + FAS+G PEG CG+F     
Sbjct: 724 HNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEGMCGNF----- 778

Query: 773 HMDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
                      C+G+  C +PV     G     C      L V+  C
Sbjct: 779 ---------TECLGKPSCMLPVDHTVYGADIN-CQSTTATLGVQVRC 815


>gi|449451942|ref|XP_004143719.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
          Length = 613

 Score =  598 bits (1543), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 298/617 (48%), Positives = 403/617 (65%), Gaps = 11/617 (1%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +WP+LI+K+K+GGL+ IETY+FW+ HEP R +Y F GR D ++F + +Q+AGL++ +RIG
Sbjct: 1   MWPDLIQKAKDGGLDAIETYIFWDRHEPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           PY CAEWNYGGFPVWLH +PGIQ RT N  +K EM+ F  KI+++ KQ NLFASQGGPII
Sbjct: 61  PYVCAEWNYGGFPVWLHNMPGIQLRTNNQVYKNEMQTFTTKIVNMCKQANLFASQGGPII 120

Query: 155 LAQVENEYGNVEW-AYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFY 213
           LAQ+ENEYGNV   AYG  G+ Y+ W A  A +LN  VPW+MCQQ DAP P+INTCNGFY
Sbjct: 121 LAQIENEYGNVMTPAYGDAGKAYINWCAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFY 180

Query: 214 CDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFG 273
           CD FTPN+P  P M+TEN+ GWF  +G   P+R  ED+AF+VARFF++GG F NYYMY G
Sbjct: 181 CDNFTPNNPKSPKMFTENWVGWFKKWGDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHG 240

Query: 274 GTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK 333
           GTNFGRT+GGP + TSYDY+AP+DEYG + QPKWGHL++LH +IKL E+ L +S  ++Q 
Sbjct: 241 GTNFGRTSGGPFITTSYDYNAPLDEYGNLNQPKWGHLKQLHASIKLGEKILTNSTRSNQN 300

Query: 334 LGAKLE-AHIYHKSSNDCAAFLANYDSSSDANVTFNGN-VYFLPAWSVSILPDCKNVVFN 391
            G+ +      + ++ +   FL+N D  +DA +    +  YF+PAWSVSIL  C   V+N
Sbjct: 301 FGSSVTLTKFSNPTTGERFCFLSNTDGKNDATIDLQEDGKYFVPAWSVSILDGCNKEVYN 360

Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKD 451
           TAKV SQ +     F +++N  E    S A++    K  + GN  F    L EQ   T D
Sbjct: 361 TAKVNSQTS----MFVKEQNEKENAQLSWAWAPEPMKDTLQGNGKFAANLLLEQKRVTVD 416

Query: 452 TSDYLWYTASIHVMPGQG-KEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKI 510
            SDY WY   +        + V L + + GH    FVNK+ +   +G++   +F+  K I
Sbjct: 417 FSDYFWYMTKVDTNGTSSLQNVTLQVNTKGHVLHAFVNKRYIGSKWGSNG-QSFVFEKPI 475

Query: 511 ELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGV 568
            L  GINT+ +LS  VGL+NY A++D+   G+    + LI   N   DLSS  W Y+VG+
Sbjct: 476 LLKSGINTITLLSATVGLKNYDAFYDMVPTGIDGGPIYLIGDGNVTTDLSSNLWSYKVGL 535

Query: 569 EGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
            GE   +     +  + W   +   + + + WYKT+F  P G  P+ L++  MGKGQAWV
Sbjct: 536 NGEMKQIYNPVFSQRTNWIPLNQKSIGRRMTWYKTSFKTPAGIDPVVLDMQGMGKGQAWV 595

Query: 629 NGQSIGRYWSAYLAPST 645
           NGQSIGR+W +++   T
Sbjct: 596 NGQSIGRFWPSFIXKFT 612


>gi|22329242|ref|NP_195571.2| beta-galactosidase 14 [Arabidopsis thaliana]
 gi|332661551|gb|AEE86951.1| beta-galactosidase 14 [Arabidopsis thaliana]
          Length = 988

 Score =  594 bits (1532), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 327/803 (40%), Positives = 461/803 (57%), Gaps = 54/803 (6%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +WP +I K++ GGL  I+TYVFWN HEP +G+Y F+GRFDLV+F+K + E GL++ LR+G
Sbjct: 1   MWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVTLRLG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           P+  AEWN+GG P WL  +P + FRT N PFKE  +R++ KI+ +MK+E LFASQGGPII
Sbjct: 61  PFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQGGPII 120

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
           L Q+ENEY  V+ AY   GE Y+KWAA+   ++N  +PWVMC+Q DAP  +IN CNG +C
Sbjct: 121 LGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACNGRHC 180

Query: 215 -DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYF 272
            D F  PN   KP +WTEN++  F  FG     R VED+AF+VAR+F   G+  NYYMY 
Sbjct: 181 GDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNYYMYH 240

Query: 273 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ 332
           GGTNFGRT+    V T Y  DAP+DE+G  + PK+GHL+ +H+A++LC++ L       Q
Sbjct: 241 GGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQLRAQ 299

Query: 333 KLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFN 391
            LG   E   Y +     CAAFL+N ++     + F G  Y LP+ S+SILPDCK VV+N
Sbjct: 300 TLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKTVVYN 359

Query: 392 TAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKD 451
           TA++++Q +  D  F + +  ++ L     F  + E +    +   + P   E    TKD
Sbjct: 360 TAQIVAQHSWRD--FVKSEKTSKGL----KFEMFSENIPSLLDGDSLIP--GELYYLTKD 411

Query: 452 TSDYLWYTASIHV----MPGQ-GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLI 506
            +DY WYT S+ +     P Q G +  L + SLGHA +V+VN +     +G H+  +F  
Sbjct: 412 KTDYAWYTTSVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEF 471

Query: 507 NKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQ 565
            K +    G N + IL ++ GL + G++ +   AG  ++ +I LK+G RDL+ + EW + 
Sbjct: 472 AKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHL 531

Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQ 625
            G+EGE   +     +    W++       K L WYKT F  PEG   +A+ + +MGKG 
Sbjct: 532 AGLEGEKKEVYTEEGSKKVKWEKDGK---RKPLTWYKTYFETPEGVNAVAIRMKAMGKGL 588

Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV--HP 683
            WVNG  +GRYW ++L+P                       G+P QT YHIPR+++    
Sbjct: 589 IWVNGIGVGRYWMSFLSP----------------------LGEPTQTEYHIPRSFMKGEK 626

Query: 684 GENLLVI-HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK-PNLGVVSSSPQVRL 741
            +N+LVI  EE G     I  +      ICS V E  P  V SWK     +VS S  +RL
Sbjct: 627 KKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRL 686

Query: 742 A----CERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSS 796
                C     +  + FAS+G P G CG+F  G C       +V+K C+G+  CSI V+ 
Sbjct: 687 KAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVAR 746

Query: 797 AYLGVSAGACPGLLKALAVEAHC 819
              G     CP ++K LAV+  C
Sbjct: 747 ETFGDK--GCPEIVKTLAVQVKC 767


>gi|57283676|emb|CAG30724.1| putative beta-galactosidase precursor [Hordeum vulgare]
          Length = 833

 Score =  593 bits (1529), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 324/825 (39%), Positives = 460/825 (55%), Gaps = 44/825 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+IDGKR +  SG+IHYPRS P++W +L++ +K+GGL  IETYVFWN HEP  
Sbjct: 35  VSYDERSLLIDGKRDLFFSGAIHYPRSPPDMWHKLLKTAKDGGLNTIETYVFWNAHEPEP 94

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y FEGR DL++F+K +Q   ++  +RIGP+  AEWN+GG P WL  IP I FR  N P
Sbjct: 95  GKYNFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEP 154

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+EM++F+  I+  +K   +FASQGGP+ILAQ+ENEYGN++  + V G+ Y++WAA  A
Sbjct: 155 YKKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMA 214

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++ NT VPW+MC+Q  AP  +I TCNG +C D +T    +KP +WTEN++  F +FG  +
Sbjct: 215 ISTNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQL 274

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYM-YFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
             R  ED+A++V RFF  GGT  NYYM Y+GGTNFGRT G   V T Y  + P+DE    
Sbjct: 275 ALRSAEDIAYSVLRFFAKGGTLVNYYMQYYGGTNFGRT-GASYVLTGYYDEGPVDEC-MP 332

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSS 361
           + PK+GHLR+LH  IK      +    + + L    EAH +       C AF++N ++  
Sbjct: 333 KAPKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGE 392

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  V F G+ Y++P+ SVSIL DCK+VV+NT +V  Q +      AQ+      L  S+A
Sbjct: 393 DGTVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQK------LAKSNA 446

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGK-EVFLNIESLG 480
           +  Y E +      S    +  EQ N TKD SDYL +      +P +G     + ++S  
Sbjct: 447 WEMYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLCFRLEADDLPFRGDIRPVVQVKSTS 506

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA + FVN      G G+     F+    I L  GIN L +LS  +G+++ G        
Sbjct: 507 HALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEVKG 566

Query: 541 GLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIW 600
           G+    +  L  G  DL    W ++V +EGE   +       +  W   +T    +++ W
Sbjct: 567 GIQDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPATT---GRAVTW 623

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YK  F  P+G+ P+ L++ SMGKG  +VNG+ +GRYW +Y                    
Sbjct: 624 YKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVG---------------- 667

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
                 G P+Q +YHIPR ++ P  NLLVI EE  G P  I + T     IC F+SE +P
Sbjct: 668 ------GVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRRDDICVFISEHNP 721

Query: 721 PPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM- 774
             + +W  + G +       S +  L C     I  + FAS+G PEG+C +F  G CH  
Sbjct: 722 AQIKTWDKDGGQIKLIAEDHSTRGILKCPPKKTIQEVVFASFGNPEGSCANFTAGTCHTP 781

Query: 775 DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           +   IV K C+G+  C +PV     G     CP     LAV+  C
Sbjct: 782 NAKDIVAKECLGKKSCVLPVLHTVYGADIN-CPTTTATLAVQVRC 825


>gi|147843186|emb|CAN82672.1| hypothetical protein VITISV_014349 [Vitis vinifera]
          Length = 710

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 326/713 (45%), Positives = 422/713 (59%), Gaps = 76/713 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A VTYD R+L+IDG R++L SGSIHYPRSTP++W  LI K+KEGG++VI+TYVFWN HEP
Sbjct: 24  AQVTYDGRSLIIDGHRKILFSGSIHYPRSTPQMWASLIAKAKEGGVDVIQTYVFWNRHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQY F GR+DL +F+K +Q  GL+  LRIGP+  +EW+YGG P WLH + GI +RT N
Sbjct: 84  QPGQYDFNGRYDLXKFIKEIQAQGLYACLRIGPFIESEWSYGGLPFWLHDVHGIVYRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI++LMK E L+ASQGGPIIL+Q+ENEY N+E A+   G  YV+WAA 
Sbjct: 144 EPFKFYMQNFTTKIVNLMKSEGLYASQGGPIILSQIENEYQNIEAAFNEKGPSYVRWAAK 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L T VPWVMC+Q DAPDP+INTCNG  C   FT PNSP+KP MWTEN++ ++  FG
Sbjct: 204 MAVELQTGVPWVMCKQSDAPDPVINTCNGMRCGQTFTGPNSPNKPSMWTENWTSFYEVFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+AF VA F    G++ NYYM                              
Sbjct: 264 GETYLRSAEDIAFHVALFIARNGSYVNYYM----------------------------VS 295

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            IRQPKWGHL+ELH AI LC   L++   ++  LG   EA+++ +    C AFL N D  
Sbjct: 296 LIRQPKWGHLKELHAAITLCSTPLLNGVQSNISLGQLQEAYVFQEEMGGCVAFLVNNDEG 355

Query: 361 SDANVTF-NGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
           +++ V F N ++  LP  S+SILPDCKNV+FNTAK+ +  N              +  +S
Sbjct: 356 NNSTVLFQNVSIELLPK-SISILPDCKNVIFNTAKINTGYN------------ERITTSS 402

Query: 420 SAFS----WYEEKVGISG--NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-KEV 472
            +F     W E K  I    + S     + E +N TKD SDYLWYT      P     E 
Sbjct: 403 QSFDAVDRWEEYKDAIPNFLDTSLKSNMILEHMNMTKDESDYLWYT--FRFQPNSSCTEP 460

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L+IESL HA   FVN   V   +G+HD   F     I LN  +N + ILS+MVG  + G
Sbjct: 461 LLHIESLAHAVHAFVNNIYVGATHGSHDMKGFTFKSPISLNNEMNNISILSVMVGFPDSG 520

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
           A+ +   AGL  V +   + G  D ++  W YQVG+ GE + + K    ++  W++ + +
Sbjct: 521 AYLESRFAGLTRVEIQCTEKGIYDFANYTWGYQVGLSGEKLHIYKEENLSNVEWRK-TEI 579

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
             N+ L WYK  F  P G  P+ALNL++MGKG+AWVNGQSIGRYW               
Sbjct: 580 STNQPLTWYKIVFNTPSGDDPVALNLSTMGKGEAWVNGQSIGRYWV-------------- 625

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
              S+  SK     G P+QTLYH+PR ++   ENLLV+ EE  GDP  ISL T
Sbjct: 626 ---SFHNSK-----GDPSQTLYHVPRAFLKTSENLLVLLEEANGDPLHISLET 670


>gi|357520325|ref|XP_003630451.1| Beta-galactosidase [Medicago truncatula]
 gi|355524473|gb|AET04927.1| Beta-galactosidase [Medicago truncatula]
          Length = 706

 Score =  592 bits (1525), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 329/720 (45%), Positives = 431/720 (59%), Gaps = 59/720 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYD  +LVI+G  ++L SGSIHYPRSTP++WP+LI K+KEGGL+VI+TYVFWN HEP
Sbjct: 24  ANVTYDRTSLVINGHHKILFSGSIHYPRSTPQMWPDLISKAKEGGLDVIQTYVFWNLHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQY F GRFDLV F+K +Q  GL++ LRIGPY  +E  YGG P+WLH +PGI FRT N
Sbjct: 84  QQGQYEFNGRFDLVGFIKEIQAQGLYVTLRIGPYIESECTYGGLPLWLHDVPGIVFRTDN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
           + FK  M+RF  KI+++MK  NLFASQGGPIIL+Q+ENEYG+++  +   G  Y+ WAA 
Sbjct: 144 DQFKFHMQRFTTKIVNMMKSANLFASQGGPIILSQIENEYGSIQSKFRANGLPYIHWAAQ 203

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNSPSKPIMWTENYSGWFLSFG 240
            AV L T VPW+MC+Q+DAPDP+IN CNG  C  +   PNSP+KP +WTEN++ +  +FG
Sbjct: 204 MAVGLQTGVPWMMCKQDDAPDPVINACNGMQCGRNFKGPNSPNKPSLWTENWTSFLQAFG 263

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
            A   R   D+A+ VA F    G++ NYYMY GGTNF R A   ++   YD +AP+DEYG
Sbjct: 264 GAPYMRSASDIAYNVALFIAKKGSYVNYYMYHGGTNFDRLASAFIITAYYD-EAPLDEYG 322

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN---- 356
            +RQPKWGHL+ELH +IK C + L+    T   LG++ +  I ++SS      + +    
Sbjct: 323 LVRQPKWGHLKELHASIKSCSQPLLDGTQTTFSLGSEQQV-IKNESSWTYFPLMFSEVPQ 381

Query: 357 -------YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ 409
                       D  + F    Y LP  S+SILP CKNVVFNT KV  Q N         
Sbjct: 382 NVLLSWKISGPRDVTIQFQNISYELPGKSISILPGCKNVVFNTGKVSIQNN--------V 433

Query: 410 KNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
           + +   L  +SA +W  Y E +    + S     L +QI+T KDTSDY+WYT   +    
Sbjct: 434 RAMKPRLQFNSAENWKVYTEAIPNFAHTSKRADTLLDQISTAKDTSDYMWYTFRFNNKSP 493

Query: 468 QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
             K V L+I S G     F+N  L    +G+ +     + K + L  G+N + ILS  VG
Sbjct: 494 NAKSV-LSIYSQGDVLHSFINGVLTGSAHGSRNNTQVTMKKNVNLINGMNNISILSATVG 552

Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
           L N GA+ +   AGL  V +       RD SS  W YQVG+ GE + +  +S ++   WK
Sbjct: 553 LPNSGAFLESRVAGLRKVEV-----QGRDFSSYSWGYQVGLLGEKLQIFTVSGSSKVQWK 607

Query: 588 --QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
             Q ST P    L WY+TTF AP G  P+ +NL SMGKG AWVNGQ IGRYW ++  P  
Sbjct: 608 SFQSSTKP----LTWYQTTFHAPAGNDPVVVNLGSMGKGLAWVNGQGIGRYWVSFHKPD- 662

Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                                G P+Q  YHIPR+++    NLLVI EE  G+P  I+L T
Sbjct: 663 ---------------------GTPSQQWYHIPRSFLKSTGNLLVILEEETGNPLGITLDT 701


>gi|242045426|ref|XP_002460584.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
 gi|241923961|gb|EER97105.1| hypothetical protein SORBIDRAFT_02g031260 [Sorghum bicolor]
          Length = 803

 Score =  587 bits (1513), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 312/828 (37%), Positives = 460/828 (55%), Gaps = 80/828 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDGKR +  SG+IHYPRS PEVWP+L+ ++KEGGL  IETY+FWN HEP  
Sbjct: 36  VTYDARSLLIDGKRDLFFSGAIHYPRSPPEVWPKLLDRAKEGGLNTIETYIFWNAHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y FEGR DLV+F+K +QE G++  +RIGP+  AEWN+GG P WL  I  I FR  N+P
Sbjct: 96  GKYNFEGRLDLVKFLKMIQEHGMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+EM+++   ++  +K   LFASQGGP+IL Q+ENEYGN++  + + G+ Y++WAA  A
Sbjct: 156 YKKEMEKWTRFVVQKLKDAELFASQGGPVILTQIENEYGNIKKDHKIEGDKYLEWAAQMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++  T VPW+MC+Q  AP  +I TCNG +C D +T    +KP++WTEN++  F ++G  +
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQL 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GG+  NYYMY GGTNFGRT+   ++   YD +AP+DEYG  +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSMVNYYMYHGGTNFGRTSASYVLTGYYD-EAPLDEYGMYK 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
           +PK+GHLR+LH  I+  ++  +S   + + LG   EA I+     N C +FL+N ++  D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLSGKHSSEILGHGYEAQIFELPEENLCLSFLSNNNTGED 394

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  +++P+ SVSIL  CK+VV+NT +V  Q +   +      + +E+   ++ +
Sbjct: 395 GTVIFRGVKHYVPSRSVSILAGCKDVVYNTKRVFVQHSERSY------HTSEVTSKNNQW 448

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
             Y E V    +      +  EQ N TKD SDYLWYT S  +    +P +G     L ++
Sbjct: 449 EMYSEMVPKYKDTKIRTKEPLEQYNQTKDASDYLWYTTSFRLESDDLPFRGDIRPVLQVK 508

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  H+ + F N   V    GN     F+  K ++L  G+N + +LS  +G+++ G     
Sbjct: 509 SSAHSMIGFANDAFVGSARGNKQVKGFMFEKPVDLKAGVNHVVLLSSTMGMKDSGGELAE 568

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
              G+   ++  L  G  DL    W                                   
Sbjct: 569 VKGGIQECLIQGLNTGTLDLQVNGW----------------------------------- 593

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSY 657
              +K  F  P+G  P+ L+++SM KG  +VNG+ IGRYW ++                 
Sbjct: 594 --GHKRYFDEPDGDDPIVLDMSSMSKGMIFVNGEGIGRYWVSF----------------- 634

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
                +   G P+Q +YHIPR ++ P +NLLV+ EE  G P  I + T T   IC  +SE
Sbjct: 635 -----RTLAGTPSQAVYHIPRPFLKPKDNLLVVFEEEMGKPDGILVQTVTRDDICLLISE 689

Query: 718 ADPPPVDSWKPN---LGVVSSSPQVR--LACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
            +P  + +W  +   + +++    VR  L C     I  + FAS+G P+G CG+F  G C
Sbjct: 690 HNPGQIKTWDTDGVKIKLIAEDHSVRGTLMCPPEKIIQEVVFASFGNPDGMCGNFTVGTC 749

Query: 773 HM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           H  +   IV+K C+G+  C +PV     G     C      L V+  C
Sbjct: 750 HTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTGTLGVQVRC 796


>gi|238481152|ref|NP_001154292.1| beta-galactosidase 14 [Arabidopsis thaliana]
 gi|332661552|gb|AEE86952.1| beta-galactosidase 14 [Arabidopsis thaliana]
          Length = 1052

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 324/803 (40%), Positives = 456/803 (56%), Gaps = 50/803 (6%)

Query: 31  STPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLH 90
           S   +WP +I K++ GGL  I+TYVFWN HEP +G+Y F+GRFDLV+F+K + E GL++ 
Sbjct: 65  SRKHMWPSIIDKARIGGLNTIQTYVFWNVHEPEQGKYDFKGRFDLVKFIKLIHEKGLYVT 124

Query: 91  LRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQG 150
           LR+GP+  AEWN+GG P WL  +P + FRT N PFKE  +R++ KI+ +MK+E LFASQG
Sbjct: 125 LRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPFKEHTERYVRKILGMMKEEKLFASQG 184

Query: 151 GPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN 210
           GPIIL Q+ENEY  V+ AY   GE Y+KWAA+   ++N  +PWVMC+Q DAP  +IN CN
Sbjct: 185 GPIILGQIENEYNAVQLAYKENGEKYIKWAANLVESMNLGIPWVMCKQNDAPGNLINACN 244

Query: 211 GFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNY 268
           G +C D F  PN   KP +WTEN++  F  FG     R VED+AF+VAR+F   G+  NY
Sbjct: 245 GRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPPTQRTVEDIAFSVARYFSKNGSHVNY 304

Query: 269 YMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD 328
           YMY GGTNFGRT+    V T Y  DAP+DE+G  + PK+GHL+ +H+A++LC++ L    
Sbjct: 305 YMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEKAPKYGHLKHVHRALRLCKKALFWGQ 363

Query: 329 PTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKN 387
              Q LG   E   Y +     CAAFL+N ++     + F G  Y LP+ S+SILPDCK 
Sbjct: 364 LRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDTNTIKFKGQDYVLPSRSISILPDCKT 423

Query: 388 VVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQIN 447
           VV+NTA++++Q +  D  F + +  ++ L     F  + E +    +   + P   E   
Sbjct: 424 VVYNTAQIVAQHSWRD--FVKSEKTSKGL----KFEMFSENIPSLLDGDSLIP--GELYY 475

Query: 448 TTKDTSDYLWYTASIHVMPGQ-GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLI 506
            TKD +DY          P Q G +  L + SLGHA +V+VN +     +G H+  +F  
Sbjct: 476 LTKDKTDYACVKIDEDDFPDQKGLKTILRVASLGHALIVYVNGEYAGKAHGRHEMKSFEF 535

Query: 507 NKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQ 565
            K +    G N + IL ++ GL + G++ +   AG  ++ +I LK+G RDL+ + EW + 
Sbjct: 536 AKPVNFKTGDNRISILGVLTGLPDSGSYMEHRFAGPRAISIIGLKSGTRDLTENNEWGHL 595

Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQ 625
            G+EGE   +     +    W++       K L WYKT F  PEG   +A+ + +MGKG 
Sbjct: 596 AGLEGEKKEVYTEEGSKKVKWEKDGK---RKPLTWYKTYFETPEGVNAVAIRMKAMGKGL 652

Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV--HP 683
            WVNG  +GRYW ++L+P                       G+P QT YHIPR+++    
Sbjct: 653 IWVNGIGVGRYWMSFLSP----------------------LGEPTQTEYHIPRSFMKGEK 690

Query: 684 GENLLVI-HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK-PNLGVVSSSPQVRL 741
            +N+LVI  EE G     I  +      ICS V E  P  V SWK     +VS S  +RL
Sbjct: 691 KKNMLVILEEEPGVKLESIDFVLVNRDTICSNVGEDYPVSVKSWKREGPKIVSRSKDMRL 750

Query: 742 A----CERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSS 796
                C     +  + FAS+G P G CG+F  G C       +V+K C+G+  CSI V+ 
Sbjct: 751 KAVMRCPPEKQMVEVQFASFGDPTGTCGNFTMGKCSASKSKEVVEKECLGRNYCSIVVAR 810

Query: 797 AYLGVSAGACPGLLKALAVEAHC 819
              G     CP ++K LAV+  C
Sbjct: 811 ETFGDK--GCPEIVKTLAVQVKC 831


>gi|222616997|gb|EEE53129.1| hypothetical protein OsJ_35927 [Oryza sativa Japonica Group]
          Length = 740

 Score =  583 bits (1503), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 316/669 (47%), Positives = 410/669 (61%), Gaps = 78/669 (11%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NVTYDHRA++I GKRR+L S  +HYPR+TPE+WP LI K KEGG +VIETYVFWN HEP 
Sbjct: 63  NVTYDHRAVLIGGKRRMLVSAGLHYPRATPEMWPSLIAKCKEGGADVIETYVFWNGHEPA 122

Query: 64  RGQYYFEGRFDLVRFVK----------------TVQEAG-------------------LF 88
           +GQYYFE RFDLV+F K                  +E G                    +
Sbjct: 123 KGQYYFEERFDLVKFAKIDLVKFAKLMWPSLIAKCKEGGADVIETYVFWNGHEPAKGQYY 182

Query: 89  LHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFAS 148
              R  P    +    GFPVWL  IPGI+FRT N PFK EM+ F+ KI+ LMK+E L++ 
Sbjct: 183 FEERFDPVKFEKHVIFGFPVWLRDIPGIEFRTDNEPFKAEMQTFVTKIVTLMKEEKLYSW 242

Query: 149 QGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINT 208
           QGGPIIL Q+ENEYGN++  YG  G+ Y++WAA  A+ L+T +PWVMC+Q DAP+ II+T
Sbjct: 243 QGGPIILQQIENEYGNIQGNYGQAGKRYMQWAAQMAIGLDTGIPWVMCRQTDAPEEIIDT 302

Query: 209 CNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNY 268
           CN FYCDGF PNS +KP +WTE++ GW+  +G A+P RP ED AFAVARF++ GG+ QNY
Sbjct: 303 CNAFYCDGFKPNSYNKPTIWTEDWDGWYADWGGALPHRPAEDSAFAVARFYQRGGSLQNY 362

Query: 269 YMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD 328
           YMYFGGTNF RTAGGPL  TSYDYDAPIDEYG +RQPKWGHL++LH AIKLCE  LI+ D
Sbjct: 363 YMYFGGTNFARTAGGPLQITSYDYDAPIDEYGILRQPKWGHLKDLHTAIKLCEPALIAVD 422

Query: 329 --PTHQKLGAKLEAHIYHK-----------SSNDCAAFLANYDSSSDANVTFNGNVYFLP 375
             P + KLG+  EAH+Y             ++  C+AFLAN D    A+V   G  Y LP
Sbjct: 423 GSPQYIKLGSMQEAHVYSTGEVHTNGSMAGNAQICSAFLANIDEHKYASVWIFGKSYSLP 482

Query: 376 AWSVSILPDCKNVVFNTAKVISQRN----NGDHPFAQQKNVNELLLASS-----AFSWY- 425
            WSVSILPDC+NV FNTA++ +Q +        P    ++   +L  +S     + +W+ 
Sbjct: 483 PWSVSILPDCENVAFNTARIGAQTSVFTVESGSPSRSSRHKPSILSLTSGGPYLSSTWWT 542

Query: 426 -EEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG-------QGKEVFLNIE 477
            +E +G  G  +F    + E +N TKD SDYLWYT  +++          +G    L I+
Sbjct: 543 SKETIGTWGGNNFAVQGILEHLNVTKDISDYLWYTTRVNISDADVAFWSSKGVLPSLTID 602

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
            +   A VFVN KL     G+       + + I+L EG+N L +LS +VGLQNYGA+ + 
Sbjct: 603 KIRDVARVFVNGKLAGSQVGHW----VSLKQPIQLVEGLNELTLLSEIVGLQNYGAFLEK 658

Query: 538 AGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGL---DKISLANSSFWKQGSTLP 593
            GAG    V L  L +G  DL++  W YQVG++GE+  +   +K   A  S  ++ S  P
Sbjct: 659 DGAGFRGQVTLTGLSDGDVDLTNSLWTYQVGLKGEFSMIYAPEKQGCAGWSRMQKDSVQP 718

Query: 594 VNKSLIWYK 602
                 WYK
Sbjct: 719 ----FTWYK 723


>gi|222424922|dbj|BAH20412.1| AT3G13750 [Arabidopsis thaliana]
          Length = 625

 Score =  583 bits (1502), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 303/636 (47%), Positives = 396/636 (62%), Gaps = 20/636 (3%)

Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAF 253
           V+C+Q+DAPDPIIN CNGFYCD F+PN   KP MWTE ++GWF  FG  VP+RP ED+AF
Sbjct: 1   VLCKQDDAPDPIINACNGFYCDYFSPNKAYKPKMWTEAWTGWFTKFGGPVPYRPAEDMAF 60

Query: 254 AVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
           +VARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEYG  RQPKWGHL++L
Sbjct: 61  SVARFIQKGGSFINYYMYHGGTNFGRTAGGPFIATSYDYDAPLDEYGLERQPKWGHLKDL 120

Query: 314 HKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYF 373
           H+AIKLCE  L+S +PT   LG   EAH+Y   S  C+AFLANY+  S A V+F  N Y 
Sbjct: 121 HRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSKSGACSAFLANYNPKSYAKVSFGNNHYN 180

Query: 374 LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISG 433
           LP WS+SILPDCKN V+NTA+V +Q        ++ K V   +    ++  Y E      
Sbjct: 181 LPPWSISILPDCKNTVYNTARVGAQT-------SRMKMVRVPVHGGLSWQAYNEDPSTYI 233

Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVN 488
           + SF    L EQINTT+DTSDYLWY   + V   +     G    L + S GHA  VF+N
Sbjct: 234 DESFTMVGLVEQINTTRDTSDYLWYMTDVKVDANEGFLRNGDLPTLTVLSAGHAMHVFIN 293

Query: 489 KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VIL 547
            +L    YG+ D       K + L  G N + ILS+ VGL N G  F+   AG+   V L
Sbjct: 294 GQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAILSIAVGLPNVGPHFETWNAGVLGPVSL 353

Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLA 607
             L  G+RDLS  +W Y+VG++GE + L  +S ++S  W +G+ +   + L WYKTTF A
Sbjct: 354 NGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSA 413

Query: 608 PEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCG 667
           P G  PLA+++ SMGKGQ W+NGQS+GR+W AY A   G   +C Y G++   KC ++CG
Sbjct: 414 PAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYTGTFREDKCLRNCG 471

Query: 668 QPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK 727
           + +Q  YH+PR+W+ P  NLLV+ EE GGDP+ I+L+ +    +C+ + E     V+   
Sbjct: 472 EASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQSTLVNYQL 531

Query: 728 PNLGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKAC 784
              G V+    P+  L C  G  I  + FAS+G PEG CGS+R G+CH         K C
Sbjct: 532 HASGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLC 591

Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           VGQ  CS+ V+    G     CP ++K LAVEA C+
Sbjct: 592 VGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 625


>gi|57283683|emb|CAG30731.1| beta-galactosidase precursor [Triticum monococcum]
          Length = 839

 Score =  582 bits (1501), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 315/830 (37%), Positives = 458/830 (55%), Gaps = 46/830 (5%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             VTYD  +L+IDG+R +  SG+IHYPRS  ++WP+L++ +KEGGL  IETYVFWN HEP
Sbjct: 36  TTVTYDKYSLMIDGRRELFFSGAIHYPRSPTQMWPKLLKTAKEGGLNTIETYVFWNAHEP 95

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G++ FEGR D+++F+K +Q  G++  +RIGP+   EWN+G  P WL  IP I FR  N
Sbjct: 96  EPGKFNFEGRNDMIKFLKLIQSFGMYAIVRIGPFIQGEWNHGALPYWLREIPHIIFRANN 155

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            P+K EM++F+  I+ ++K ENLFASQGG +ILAQ+ENEYGN++  +   G+ Y++WAA+
Sbjct: 156 EPYKREMEKFVRFIVQMLKDENLFASQGGNVILAQIENEYGNIKKDHITEGDKYLEWAAE 215

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGY 241
            A++ N  VPW+MC+Q  AP  +I TCNG +C D +     +KP +WTEN++  F +FG 
Sbjct: 216 MAISTNIGVPWIMCKQSTAPGVVIPTCNGRHCGDTWIMKDENKPHLWTENWTAQFRAFGN 275

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
            +  R  ED+A++V RFF  GGT  NYYMY+GGTNFGRT G   V T Y  + PIDEYG 
Sbjct: 276 DLAQRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPIDEYGM 334

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
            + PK+GHLR+LH  IK      +    + + LG   EA  +       C AF++N ++ 
Sbjct: 335 PKAPKYGHLRDLHNVIKSYSRAFLEGKQSFELLGQGYEARNFEIPEEKLCLAFISNNNTG 394

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            D  V F G+ Y++P+ SVSIL DCK+VV+NT +V  Q +      A++   N      +
Sbjct: 395 EDGTVIFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHKAEKATKN------N 448

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLN 475
            +  + E +      +    +  EQ N TKD SDYLWYT S  +    +P +G     + 
Sbjct: 449 VWEMFSELIPRYKQTTIRNKEPLEQYNQTKDQSDYLWYTTSFRLEADDLPIRGDIRPVIA 508

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           ++S  HA + FVN      G+G+     F     I L  G+N L +LS  +G+++ G   
Sbjct: 509 VKSTAHAMVGFVNDAFAGNGHGSKKEKFFTFETPISLRLGVNHLALLSSSMGMKDSGGEL 568

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
                G+    +  L  G  DL    W ++  +EGE   +       +  W    +    
Sbjct: 569 VELKGGIQDCTIQGLNTGTLDLQINGWGHKAKLEGEVKEIYTEKGMGAVKWVPAVS---G 625

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           +++ WYK  F  P+G  P+ L++ SM KG  +VNG+ +GRYW++Y  P            
Sbjct: 626 QAVTWYKRYFDEPDGDDPVVLDMTSMCKGMIFVNGEGMGRYWTSYKTPGKVA-------- 677

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
                         +Q +YHIPRT++    NLLV+ EE  G P  I + T     IC F+
Sbjct: 678 --------------SQAVYHIPRTFLKSKNNLLVVFEEELGKPEGILIQTVRRDDICVFI 723

Query: 716 SEADPPPVDSWKPNLG---VVSSSPQVR--LACERGWHIAAINFASYGIPEGNCGSFRPG 770
           SE +P  +  W  + G   +++     R  L C     I  + FAS+G P G+C +F  G
Sbjct: 724 SEHNPAQIKPWDEHGGQIKLIAEDHNTRGFLNCPPKKIIQEVVFASFGNPVGSCANFTVG 783

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            CH  +   IV+K C+G+  C +PV   + G     CP     LAV+  C
Sbjct: 784 TCHTPNAKEIVEKECLGKKGCVLPVLHTFYGADIN-CPTTTATLAVQVRC 832


>gi|222618606|gb|EEE54738.1| hypothetical protein OsJ_02090 [Oryza sativa Japonica Group]
          Length = 713

 Score =  579 bits (1493), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 303/666 (45%), Positives = 400/666 (60%), Gaps = 54/666 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +V+YD R+LVIDG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL+ IETY+FWN HEP
Sbjct: 29  TSVSYDDRSLVIDGQRRIILSGSIHYPRSTPEMWPDLIKKAKEGGLDAIETYIFWNGHEP 88

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            R QY FEG +D+VRF K +Q AG++  LRIGPY C EWNYGG P WL  IPG+QFR  N
Sbjct: 89  HRRQYNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHN 148

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWA 180
            PF+ EM+ F   I++ MK   +FA QGGPIILAQ+ENEYGN+  +         Y+ W 
Sbjct: 149 EPFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWC 208

Query: 181 ADTAVNLNTSVPWVMCQQ-EDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           AD A   N  VPW+MCQQ +D P  ++NTCNGFYC  + PN    P +WTEN++GWF ++
Sbjct: 209 ADMANKQNVGVPWIMCQQDDDVPHNVVNTCNGFYCHDWFPNRTGIPKIWTENWTGWFKAW 268

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
                 R  ED+AFAVA FF+  G+ QNYYMY GGTNFGRT+GGP + TSYDYDAP+DEY
Sbjct: 269 DKPDFHRSAEDIAFAVAMFFQKRGSLQNYYMYHGGTNFGRTSGGPYITTSYDYDAPLDEY 328

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +RQPK+GHL+ELH  +K  E+ L+  +      G  +    Y   S+  A F+ N   
Sbjct: 329 GNLRQPKYGHLKELHSVLKSMEKTLVHGEYFDTNYGDNITVTKYTLDSSS-ACFINNRFD 387

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLAS 419
             D NVT +G  + LPAWSVSILPDCK V FN+AK+ +Q +       ++ N  E    S
Sbjct: 388 DKDVNVTLDGATHLLPAWSVSILPDCKTVAFNSAKIKTQTS----VMVKKPNTAEQEQES 443

Query: 420 SAFSWYEEKVG---ISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
             +SW  E +         +F + +L EQI T+ D SDYLWY  S++   G+G    L +
Sbjct: 444 LKWSWMPENLSPFMTDEKGNFRKNELLEQIVTSTDQSDYLWYRTSLN-HKGEGS-YKLYV 501

Query: 477 ESLGHAALVFVNKKLVAFGY-GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
            + GH    FVN KL+   +  + DF  F +   ++L++G N + +LS  VGL+NYG  F
Sbjct: 502 NTTGHELYAFVNGKLIGKNHSADGDFV-FQLESPVKLHDGKNYISLLSATVGLKNYGPSF 560

Query: 536 DVAGAGLFS--VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
           +    G+    V LID      DLS+  W                               
Sbjct: 561 EKMPTGIVGGPVKLIDSNGTAIDLSNSSWS------------------------------ 590

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
                  YK TF AP G+ P+ ++L  + KG AWVNG ++GRYW +Y A       +CDY
Sbjct: 591 -------YKATFEAPSGEDPVVVDLLGLNKGVAWVNGNNLGRYWPSYTAAEMAGCHRCDY 643

Query: 654 RGSYDA 659
           RG++ A
Sbjct: 644 RGAFQA 649


>gi|326496501|dbj|BAJ94712.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 672

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 304/643 (47%), Positives = 415/643 (64%), Gaps = 17/643 (2%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD RALV++G RR+L SG +HY RSTPE+WP+LI  +K+GGL+VI+TYVFWN HEP++
Sbjct: 40  VTYDGRALVVNGTRRMLFSGEMHYTRSTPEMWPKLIANAKKGGLDVIQTYVFWNVHEPVQ 99

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F+GR+DLV+F++ +Q  GL++ LRIGP+  AEW YGGFP WLH +P I FRT N P
Sbjct: 100 GQYNFQGRYDLVKFIREIQTQGLYVSLRIGPFIEAEWKYGGFPFWLHDVPNITFRTDNEP 159

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK+ M+RF+ +I+++MK E L+  QGGPII++Q+ENEY  VE A+G GG  YV+WAA+ A
Sbjct: 160 FKQHMQRFVTQIVNMMKHEGLYYPQGGPIIISQIENEYQMVEPAFGSGGPRYVRWAAEMA 219

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
           V L T VPW+MC+Q DAPDPIINTCNG  C + F  PNSP+KP +WTEN++  +  +G  
Sbjct: 220 VGLQTGVPWMMCKQNDAPDPIINTCNGLICGETFVGPNSPTKPALWTENWTTRYPIYGND 279

Query: 243 VPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
              R  ED+AFAVA F     G+F +YYMY GGTNFGR A    V TSY   AP+DEYG 
Sbjct: 280 TKLRSTEDIAFAVALFIARKKGSFVSYYMYHGGTNFGRFASS-YVTTSYYDGAPLDEYGL 338

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSS 361
           I +P WGHLRELH A+KL  E L+    ++  LG + EAHI+ ++   C AFL N+D   
Sbjct: 339 IWRPTWGHLRELHAAVKLSSEALLFGRYSNFSLGPEQEAHIF-ETELKCVAFLVNFDKHQ 397

Query: 362 DANVTFNGNVYF-LPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
              V F  N+YF L   S+S+L +C+ VVF TA+V +Q        ++   V E L    
Sbjct: 398 TPTVVFR-NIYFQLAPKSISVLSECRTVVFETARVNAQYG------SRTAEVVESLNDIH 450

Query: 421 AFSWYEEKVGISGNRS-FVRPDLAEQINTTKDTSDYLWYTASIHVMPG-QGKEVFLNIES 478
            +  ++E +    +++ +    L E ++ TKD +DYLWY  S   +P   G+ V LN+ES
Sbjct: 451 TWKAFKEPIPEDISKAVYTGNQLFEHLSMTKDETDYLWYIVSYEYIPSDDGQLVLLNVES 510

Query: 479 LGHAALVFVNKKLVAFGYGNHDF-ANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
             H    FVN +     +G+HD   N ++N  I LNEG NT+ +LS+MVG  + GA  + 
Sbjct: 511 RAHVLHAFVNTEYAGSVHGSHDGPGNIILNTNISLNEGQNTISLLSVMVGSPDSGAHMER 570

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS 597
              G+  V +   +     L++  W YQVG+ GE   +     ++S+ W + + L  +  
Sbjct: 571 RSFGIHKVSIQQGQQPLHLLNNELWAYQVGLYGEANRIYTQEESSSAEWTEINNLTYHP- 629

Query: 598 LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
             WYKTTF  P G   +ALNL SMGKG+ WVNG+S+GRYW ++
Sbjct: 630 FTWYKTTFATPVGNDVVALNLTSMGKGEVWVNGESLGRYWVSF 672


>gi|12323389|gb|AAG51670.1|AC010704_14 putative beta-galactosidase, 3' partial; 3669-1 [Arabidopsis
           thaliana]
          Length = 636

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 305/628 (48%), Positives = 396/628 (63%), Gaps = 18/628 (2%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           ANVTYD R+L+IDG+ ++L SGSIHY RSTP++WP LI K+K GG++V++TYVFWN HEP
Sbjct: 23  ANVTYDGRSLIIDGEHKILFSGSIHYTRSTPQMWPSLIAKAKSGGIDVVDTYVFWNVHEP 82

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            +GQ+ F G  D+V+F+K V+  GL++ LRIGP+   EW+YGG P WLH + GI FRT N
Sbjct: 83  QQGQFDFSGSRDIVKFIKEVKNHGLYVCLRIGPFIQGEWSYGGLPFWLHNVQGIVFRTDN 142

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  MKR+   I+ LMK ENL+ASQGGPIIL+Q+ENEYG V  A+   G+ YVKW A 
Sbjct: 143 EPFKYHMKRYAKMIVKLMKSENLYASQGGPIILSQIENEYGMVGRAFRQEGKSYVKWTAK 202

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L+T VPWVMC+Q+DAPDP++N CNG  C + F  PNSP+KP +WTEN++ ++ ++G
Sbjct: 203 LAVELDTGVPWVMCKQDDAPDPLVNACNGRQCGETFKGPNSPNKPAIWTENWTSFYQTYG 262

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                R  ED+AF VA F    G+F NYYMY GGTNFGR A    V TSY   AP+DEYG
Sbjct: 263 EEPLIRSAEDIAFHVALFIAKNGSFVNYYMYHGGTNFGRNA-SQFVITSYYDQAPLDEYG 321

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +RQPKWGHL+ELH A+KLCEE L+S   T   LG    A ++ K +N CAA L N D  
Sbjct: 322 LLRQPKWGHLKELHAAVKLCEEPLLSGLQTTISLGKLQTAFVFGKKANLCAAILVNQD-K 380

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
            ++ V F  + Y L   SVS+LPDCKNV FNTAKV +Q N       + +   + L +  
Sbjct: 381 CESTVQFRNSSYRLSPKSVSVLPDCKNVAFNTAKVNAQYN------TRTRKARQNLSSPQ 434

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            +  + E V      S     L E +NTT+DTSDYLW T        +G    L +  LG
Sbjct: 435 MWEEFTETVPSFSETSIRSESLLEHMNTTQDTSDYLWQTTRFQ--QSEGAPSVLKVNHLG 492

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           HA   FVN + +   +G      FL+ K + LN G N L +LS+MVGL N GA  +    
Sbjct: 493 HALHAFVNGRFIGSMHGTFKAHRFLLEKNMSLNNGTNNLALLSVMVGLPNSGAHLERRVV 552

Query: 541 GLFSVILIDLKNGKRDL--SSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
           G  SV    + NG+  L  ++  W YQVG++GE   +     +    WKQ      ++ L
Sbjct: 553 GSRSV---KIWNGRYQLYFNNYSWGYQVGLKGEKFHVYTEDGSAKVQWKQYRD-SKSQPL 608

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQA 626
            WYK +F  PEG+ P+ALNL SMGKG+A
Sbjct: 609 TWYKASFDTPEGEDPVALNLGSMGKGEA 636


>gi|147768425|emb|CAN73625.1| hypothetical protein VITISV_026637 [Vitis vinifera]
          Length = 767

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 321/832 (38%), Positives = 445/832 (53%), Gaps = 111/832 (13%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  VTYD R+L+++G+R +L SGSIHYPRSTPE                           
Sbjct: 29  AKTVTYDGRSLIVNGRRELLFSGSIHYPRSTPE--------------------------- 61

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
                + FEG +DLV+F+K + + GL+  LRIGP+  AEWN+GGFP WL  +P I FR+ 
Sbjct: 62  -----FNFEGNYDLVKFIKLIGDYGLYATLRIGPFIEAEWNHGGFPYWLREVPDIIFRSY 116

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+++   II++MK+  LFA QGGPIILAQ+ENEY +++ AY   G  YV+WA 
Sbjct: 117 NEPFKYHMEKYSRMIIEMMKEAKLFAPQGGPIILAQIENEYNSIQLAYKELGVQYVQWAG 176

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
             AV L   VPW+MC+Q+DAPDP+INTCNG +C D FT PN P+KP +WTEN++  +  F
Sbjct: 177 KMAVGLGAGVPWIMCKQKDAPDPVINTCNGRHCGDTFTGPNRPNKPSLWTENWTAQYRVF 236

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G     R  EDLAF+VARF    GT  NYYMY GGTNFGRT G   V T Y  +AP+DEY
Sbjct: 237 GDPPSQRAAEDLAFSVARFISKNGTLANYYMYHGGTNFGRT-GSSFVTTRYYDEAPLDEY 295

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYD 358
           G  R+PKWGHL++LH A++LC++ L +  P  +KLG   E   Y K  ++ CAAFL N  
Sbjct: 296 GLQREPKWGHLKDLHSALRLCKKALFTGSPGVEKLGKDKEVRFYEKPGTHICAAFLTNNH 355

Query: 359 SSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLA 418
           S   A +TF G  YFLP  S+SILPDCK VV+NT +V++Q N  +  F + K  N+ L  
Sbjct: 356 SREAATLTFRGEEYFLPPHSISILPDCKTVVYNTQRVVAQHNARN--FVKSKIANKNL-- 411

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGKEV-F 473
              +   +E + +  +   +     E     KD SDY W+  SI +    +P +   +  
Sbjct: 412 --KWEMSQEPIPVMTDMKILTKSPMELYXFLKDRSDYAWFVTSIELSNYDLPMKKDIIPV 469

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I +LGHA L FVN   +   +G++   NF+  K ++  +G N L   ++         
Sbjct: 470 LQISNLGHAMLAFVNGNFIGSAHGSNVEKNFVFRKPVKF-QGRNKLHCPAV--------- 519

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
            +D    G+ SV ++ L  G  D+++  W  QVGV GE++       ++   W       
Sbjct: 520 -YDSGTTGIHSVQILGLNTGTLDITNNGWGQQVGVNGEHVKAYTQGGSHRVQWTAAKG-- 576

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
              ++ WYKT F  PEG  P+ L + SM KG    NG                       
Sbjct: 577 KGPAMTWYKTYFDMPEGNDPVILRMTSMAKG----NGLE--------------------- 611

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICS 713
                               YH+PR W+ P +NLLVI EE GG+P +I         ICS
Sbjct: 612 --------------------YHVPRAWLKPSDNLLVIFEETGGNPEEIEXELVNRDTICS 651

Query: 714 FVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASYGIPEGNCGSFR 768
            V+E  PP V SW+ +   + +      P+  L C     I  ++FAS+G P G CG F 
Sbjct: 652 IVTEYHPPHVKSWQRHDSKIRAVVDEVKPKGHLKCPNYKVIVKVDFASFGNPLGACGDFE 711

Query: 769 PGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G C   +   +V++ C G+  C IP+ +     ++GAC  + K LAV+  C
Sbjct: 712 MGNCTAPNSKKVVEQHCXGKTTCEIPMEAGIFXGNSGACSDITKTLAVQVRC 763


>gi|449519864|ref|XP_004166954.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 3-like, partial
           [Cucumis sativus]
          Length = 635

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 298/643 (46%), Positives = 393/643 (61%), Gaps = 29/643 (4%)

Query: 191 VPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVED 250
           VPWVMC+Q+DAPDP+INTCNGFYCD F+PN P KP  WTE ++ WF +FG     RPVED
Sbjct: 3   VPWVMCKQDDAPDPMINTCNGFYCDYFSPNKPYKPNFWTEAWTAWFNNFGGPNHKRPVED 62

Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHL 310
           LAF VARF + GG+  NYYMY GGTNFGRTAGGP + TSYDYDAPIDEYG IRQPK+GHL
Sbjct: 63  LAFGVARFIQKGGSLVNYYMYHGGTNFGRTAGGPFITTSYDYDAPIDEYGLIRQPKFGHL 122

Query: 311 RELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGN 370
           + LH A+KLCE+ L++ +P    L    +A ++  SS DCAAFL+NY S++ A VTFNG 
Sbjct: 123 KRLHDAVKLCEKALLTGEPHDYTLATYQKAKVFSSSSGDCAAFLSNYHSNNTARVTFNGR 182

Query: 371 VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEK 428
            Y LP WS+SILPDCK+V++NTA+V  Q N           ++ L     +FSW  Y E 
Sbjct: 183 HYTLPPWSISILPDCKSVIYNTAQVQVQTN----------QLSFLPTKVESFSWETYNEN 232

Query: 429 V-GISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHA 482
           +  I  + S     L EQ+  TKD SDYLWYT S++V P +     GK   L   S GH 
Sbjct: 233 ISSIEEDSSMSYDGLLEQLTITKDNSDYLWYTTSVNVDPNESYLRGGKFPTLTATSKGHG 292

Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
             VF+N KL    +G HD + F    +I L  G+N + +LS+  GL N G  ++    G+
Sbjct: 293 MHVFINGKLAGSSFGTHDNSKFTFTGRINLQAGVNKVSLLSIAGGLPNNGPHYEEREMGV 352

Query: 543 FSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN-KSLIW 600
              + I  L  GK DLS  +W Y+VG++GE + L   S   +  W + S    N + L W
Sbjct: 353 LGPVAIHGLDXGKMDLSRQKWSYKVGLKGENMNLGSPSSVQAVDWAKDSLKQENAQPLTW 412

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           YK  F APEG  PLAL++ SM KGQ W+NGQ++GRYW+  +  +  CT  C Y G+Y   
Sbjct: 413 YKAYFDAPEGDEPLALDMGSMQKGQVWINGQNVGRYWT--ITANGNCT-DCSYSGTYRPR 469

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
           KCQ  CGQP Q  YH+PR+W+ P +NL+V+ EE+GG+PS+ISL+ ++   IC+  S+  P
Sbjct: 470 KCQFGCGQPTQQWYHVPRSWLMPTKNLIVVFEEVGGNPSRISLVKRSVTSICTEASQYRP 529

Query: 721 PPVD-SWKPNLGVVSSSP--QVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVL 777
              +     N G ++     ++ L C  G  I+AI FAS+G P G CGS + G CH    
Sbjct: 530 VIKNVHMHQNNGELNEQNVLKINLHCAAGQFISAIKFASFGTPSGACGSHKQGTCHSPKS 589

Query: 778 P-IVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             ++QK CVG+  C   + ++  G     CP L K L+ E  C
Sbjct: 590 DYVLQKLCVGRQRCLATIPTSIFG--EDPCPNLRKKLSAEVVC 630


>gi|414870185|tpg|DAA48742.1| TPA: hypothetical protein ZEAMMB73_126543 [Zea mays]
          Length = 706

 Score =  559 bits (1441), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 285/648 (43%), Positives = 403/648 (62%), Gaps = 25/648 (3%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+ DG R +  SGSIHYPRS P++WPELI K+KEGGL  IETYVFWN HEP +
Sbjct: 43  VSYDRRSLMFDGHREIFLSGSIHYPRSPPDMWPELIAKAKEGGLNTIETYVFWNIHEPEK 102

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ FEG+ D+VRF + +QE  ++  +R+GP+  AEWN+GG P WL  IP I FRT N P
Sbjct: 103 GEFNFEGQNDVVRFFQLIQEHDMYAMVRLGPFIQAEWNHGGLPYWLREIPDIVFRTNNEP 162

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K  M+ F+  II  +K  NLFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A
Sbjct: 163 YKMHMETFVKIIIKRLKDANLFASQGGPIILAQIENEYQHMEAAFKDEGTKYINWAAKMA 222

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGY 241
           ++ N  +PW+MC+Q  AP  +I TCNG  C G T   P + S P++WTEN++  +  FG 
Sbjct: 223 ISTNIGIPWIMCKQTKAPSDVIPTCNGRNC-GDTWPGPTNKSMPLLWTENWTAQYRVFGD 281

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGF 301
               R  ED+AFAVARFF  GGT  NYYMY GGTNFGRT+   ++   YD +AP+DE+G 
Sbjct: 282 PPSQRSAEDIAFAVARFFSVGGTLANYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGL 340

Query: 302 IRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSS 360
            ++PKWGHLR+LH+A+KLC++ L+   P+ +KLG +LEA ++       C AFL+N+++ 
Sbjct: 341 YKEPKWGHLRDLHQALKLCKKALLWGTPSTEKLGKQLEARVFEMPEQKVCVAFLSNHNTK 400

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLL 417
            DA +TF G  YF+P  S+S+L DC+ VVF T  V +Q N     FA    Q NV E+  
Sbjct: 401 DDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQTAQNNVWEMFD 460

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EV 472
             +   + + K+ +            +  N TKD +DY+WYT+S  +    MP +   + 
Sbjct: 461 GENVPKYKQAKIRLR--------KAGDLYNLTKDKTDYVWYTSSFKLEADDMPIRSDIKT 512

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            L + S GHA++ FVN K V  G+G      F + K ++L +G+N + +L+  +G+ + G
Sbjct: 513 VLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASSMGMTDSG 572

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
           A+ +   AG+  V +  L  G  DL++  W + VG+ GE   +       S  WK     
Sbjct: 573 AYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGMGSVTWKPAMN- 631

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
             ++ L WYK  F  P G+ P+ L++++MGKG  +VNGQ IGRYW +Y
Sbjct: 632 --DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY 677


>gi|218202538|gb|EEC84965.1| hypothetical protein OsI_32205 [Oryza sativa Indica Group]
          Length = 807

 Score =  557 bits (1436), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 311/831 (37%), Positives = 445/831 (53%), Gaps = 79/831 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+IDGKR +  SG+IHYPRS PE+W +L++ +K GGL  IETYVFWN HEP  
Sbjct: 36  VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+YYFEGRFDL+RF+  +++  ++  +RIGP+  AEWN+GG P WL  I  I FR  N P
Sbjct: 96  GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK                               +ENEYGN++    V G+ Y++WAA+ A
Sbjct: 156 FK-------------------------------IENEYGNIKKDRKVEGDKYLEWAAEMA 184

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++    VPWVMC+Q  AP  +I TCNG +C D +T    +KP +WTEN++  F +FG  +
Sbjct: 185 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 244

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GGT  NYYMY GGTNFGRT G   V T Y  +AP+DEYG  +
Sbjct: 245 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 303

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
           +PK+GHLR+LH  IK   +  +    + + LG   EAH Y    +  C +FL+N ++  D
Sbjct: 304 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 363

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  +++P+ SVSIL DCK VV+NT +V  Q +        + + N +      +
Sbjct: 364 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNV------W 417

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIE 477
             Y E +              EQ N TKDTSDYLWYT S  +    +P  +     + I+
Sbjct: 418 EMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQIK 477

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           S  HA + F N   V  G G+    +F+  K ++L  GIN + +LS  +G+++ G     
Sbjct: 478 STAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELVE 537

Query: 538 AGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVNK 596
              G+   ++  L  G  DL      ++  +EGE   +          WK     LP+  
Sbjct: 538 VKGGIQDCVVQGLNTGTLDLQGNGRGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT- 596

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
              WYK  F  P+G  P+ ++++SM KG  +VNG+ IGRYW++++  +            
Sbjct: 597 ---WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA------------ 641

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVS 716
                     G P+Q++YHIPR ++ P  NLL+I EE  G P  I + T     IC F+S
Sbjct: 642 ----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFIS 691

Query: 717 EADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           E +P  + +W+ + G +      +S +  L C     I  + FAS+G PEG CG+F  G 
Sbjct: 692 EHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPQRTIQEVVFASFGNPEGACGNFTAGT 751

Query: 772 CHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
           CH  D   +V+K C+G+  C +PV +   G     CP     LAV+  C +
Sbjct: 752 CHTPDAKAVVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQVRCKV 801


>gi|4467146|emb|CAB37515.1| galactosidase like protein [Arabidopsis thaliana]
 gi|7270842|emb|CAB80523.1| galactosidase like protein [Arabidopsis thaliana]
          Length = 1036

 Score =  555 bits (1430), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 310/772 (40%), Positives = 437/772 (56%), Gaps = 54/772 (6%)

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QY F+GRFDLV+F+K + E GL++ LR+GP+  AEWN+GG P WL  +P + FRT N PF
Sbjct: 80  QYDFKGRFDLVKFIKLIHEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPDVYFRTNNEPF 139

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           KE  +R++ KI+ +MK+E LFASQGGPIIL Q+ENEY  V+ AY   GE Y+KWAA+   
Sbjct: 140 KEHTERYVRKILGMMKEEKLFASQGGPIILGQIENEYNAVQLAYKENGEKYIKWAANLVE 199

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAV 243
           ++N  +PWVMC+Q DAP  +IN CNG +C D F  PN   KP +WTEN++  F  FG   
Sbjct: 200 SMNLGIPWVMCKQNDAPGNLINACNGRHCGDTFPGPNRHDKPSLWTENWTTQFRVFGDPP 259

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R VED+AF+VAR+F   G+  NYYMY GGTNFGRT+    V T Y  DAP+DE+G  +
Sbjct: 260 TQRTVEDIAFSVARYFSKNGSHVNYYMYHGGTNFGRTSAH-FVTTRYYDDAPLDEFGLEK 318

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
            PK+GHL+ +H+A++LC++ L       Q LG   E   Y +     CAAFL+N ++   
Sbjct: 319 APKYGHLKHVHRALRLCKKALFWGQLRAQTLGPDTEVRYYEQPGTKVCAAFLSNNNTRDT 378

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             + F G  Y LP+ S+SILPDCK VV+NTA++++Q +  D  F + +  ++ L     F
Sbjct: 379 NTIKFKGQDYVLPSRSISILPDCKTVVYNTAQIVAQHSWRD--FVKSEKTSKGL----KF 432

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQ-GKEVFLNIE 477
             + E +    +   + P   E    TKD +DY WYT S+ +     P Q G +  L + 
Sbjct: 433 EMFSENIPSLLDGDSLIP--GELYYLTKDKTDYAWYTTSVKIDEDDFPDQKGLKTILRVA 490

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDV 537
           SLGHA +V+VN +     +G H+  +F   K +    G N + IL ++ GL + G++ + 
Sbjct: 491 SLGHALIVYVNGEYAGKAHGRHEMKSFEFAKPVNFKTGDNRISILGVLTGLPDSGSYMEH 550

Query: 538 AGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             AG  ++ +I LK+G RDL+ + EW +  G+EGE   +     +    W++       K
Sbjct: 551 RFAGPRAISIIGLKSGTRDLTENNEWGHLAGLEGEKKEVYTEEGSKKVKWEKDGK---RK 607

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGS 656
            L WYKT F  PEG   +A+ + +MGKG  WVNG  +GRYW ++L+P             
Sbjct: 608 PLTWYKTYFETPEGVNAVAIRMKAMGKGLIWVNGIGVGRYWMSFLSP------------- 654

Query: 657 YDASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHICS 713
                     G+P QT YHIPR+++     +N+LVI  EE G     I  +      ICS
Sbjct: 655 ---------LGEPTQTEYHIPRSFMKGEKKKNMLVILEEEPGVKLESIDFVLVNRDTICS 705

Query: 714 FVSEADPPPVDSWK-PNLGVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSFR 768
            V E  P  V SWK     +VS S  +RL     C     +  + FAS+G P G CG+F 
Sbjct: 706 NVGEDYPVSVKSWKREGPKIVSRSKDMRLKAVMRCPPEKQMVEVQFASFGDPTGTCGNFT 765

Query: 769 PGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            G C       +V+K C+G+  CSI V+    G     CP ++K LAV+  C
Sbjct: 766 MGKCSASKSKEVVEKECLGRNYCSIVVARETFG--DKGCPEIVKTLAVQVKC 815


>gi|2924512|emb|CAA17766.1| beta-galactosidase-like protein [Arabidopsis thaliana]
 gi|7270452|emb|CAB80218.1| beta-galactosidase-like protein [Arabidopsis thaliana]
          Length = 831

 Score =  544 bits (1402), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 313/844 (37%), Positives = 458/844 (54%), Gaps = 101/844 (11%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD  +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL  I+TYVFWN HEP +
Sbjct: 54  VTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHEPQQ 113

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ F GR DLV+F+K +Q+ G+++ LR+GP+  AEW +G    + H      +R     
Sbjct: 114 GKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGYITRYDHKNIAGAYR----- 168

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
                                           ++ENEY  V+ AY   G  Y+KWA++  
Sbjct: 169 --------------------------------KIENEYSAVQRAYKQDGLNYIKWASNLV 196

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYA 242
            ++   +PWVMC+Q DAPDP+IN CNG +C D F  PN  +KP +WTEN++  F  FG  
Sbjct: 197 DSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVFGDP 256

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
              R VED+A++VARFF   GT  NYYMY GGTNFGRT+   +    YD DAP+DEYG  
Sbjct: 257 PTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAHYVTTRYYD-DAPLDEYGLE 315

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSS 361
           ++PK+GHL+ LH A+ LC++ L+   P  +K G   E   Y +  +  CAAFLAN ++ +
Sbjct: 316 KEPKYGHLKHLHNALNLCKKPLLWGQPKTEKPGKDTEIRYYEQPGTKTCAAFLANNNTEA 375

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
              + F G  Y +   S+SILPDCK VV+NTA+++SQ  + +  F + K  N+       
Sbjct: 376 AETIKFKGREYVIAPRSISILPDCKTVVYNTAQIVSQHTSRN--FMKSKKANKKF----D 429

Query: 422 FSWYEEKV--GISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-----HVMPGQGKEVFL 474
           F  + E +   + GN S++  +L      TKD +DY WYT S      H+   +G + F+
Sbjct: 430 FKVFTETLPSKLEGN-SYIPVEL---YGLTKDKTDYGWYTTSFKVHKNHLPTKKGVKTFV 485

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAW 534
            I SLGHA   ++N + +  G+G+H+  +F+  K++ L  G N L +L ++ G  + G++
Sbjct: 486 RIASLGHALHAWLNGEYLGSGHGSHEEKSFVFQKQVTLKAGENHLVMLGVLTGFPDSGSY 545

Query: 535 FDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
            +    G   + ++ L +G  DL+ S +W  ++G+EGE +G+          WK+ +   
Sbjct: 546 MEHRYTGPRGISILGLTSGTLDLTESSKWGNKIGMEGEKLGIHTEEGLKKVEWKKFTGKA 605

Query: 594 VNKSLIWY----------KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAP 643
               L WY          +T F APE      + +  MGKG  WVNG+ +GRYW ++L+P
Sbjct: 606 --PGLTWYQKFSKECETLQTYFDAPESVSAATIRMHGMGKGLIWVNGEGVGRYWQSFLSP 663

Query: 644 STGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVI-HEELGGDPSKIS 702
                                  GQP Q  YHIPR+++ P +NLLVI  EE    P  + 
Sbjct: 664 ----------------------LGQPTQIEYHIPRSFLKPKKNLLVIFEEEPNVKPELMD 701

Query: 703 LLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS-----SPQVRLACERGWHIAAINFASY 757
                   +CS+V E   P V  W      V +     S    L C     IAA+ FAS+
Sbjct: 702 FAIVNRDTVCSYVGENYTPSVRHWTRKKDQVQAITDNVSLTATLKCSGTKKIAAVEFASF 761

Query: 758 GIPEGNCGSFRPGACHMDVLP-IVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAV 815
           G P G CG+F  G C+  V   +++K C+G+ EC IPV+ S +      +C  ++K LAV
Sbjct: 762 GNPIGVCGNFTLGTCNAPVSKQVIEKHCLGKAECVIPVNKSTFQQDKKDSCKNVVKMLAV 821

Query: 816 EAHC 819
           +  C
Sbjct: 822 QVKC 825


>gi|326500386|dbj|BAK06282.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 846

 Score =  534 bits (1375), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 294/767 (38%), Positives = 418/767 (54%), Gaps = 46/767 (5%)

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           Q  FEGR DL++F+K +Q   ++  +RIGP+  AEWN+GG P WL  IP I FR  N P+
Sbjct: 105 QVQFEGRNDLIKFLKLIQSHDMYALVRIGPFIQAEWNHGGLPYWLREIPHIIFRANNEPY 164

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K+EM++F+  I+  +K   +FASQGGP+ILAQ+ENEYGN++  + V G+ Y++WAA  A+
Sbjct: 165 KKEMEKFVRFIVQKLKDAEMFASQGGPVILAQIENEYGNIKKDHIVEGDKYLEWAAQMAI 224

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
           + NT VPW+MC+Q  AP  +I TCNG +C D +T    +KP +WTEN++  F +FG  + 
Sbjct: 225 STNTGVPWIMCKQSTAPGEVIPTCNGRHCGDTWTLKDKNKPRLWTENWTAQFRAFGDQLA 284

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQ 304
            R  ED+A++V RFF  GGT  NYYMY+GGTNFGRT G   V T Y  + P+DEYG  + 
Sbjct: 285 LRSAEDIAYSVLRFFAKGGTLVNYYMYYGGTNFGRT-GASYVLTGYYDEGPVDEYGMPKA 343

Query: 305 PKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDA 363
           PK+GHLR+LH  IK      +    + + L    EAH +       C AF++N ++  D 
Sbjct: 344 PKYGHLRDLHNLIKSYSRAFLEGKQSFELLAHGYEAHNFEIPEEKLCLAFISNNNTGEDG 403

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V F G+ Y++P+ SVSIL DCK+VV+NT +V  Q +      AQ+      L  S+A+ 
Sbjct: 404 TVNFRGDKYYIPSRSVSILADCKHVVYNTKRVFVQHSERSFHTAQK------LAKSNAWE 457

Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIES 478
            Y E +      S    +  EQ N TKD SDYLWYT S  +    +P +G     + ++S
Sbjct: 458 MYSEPIPRYKLTSIRNKEPMEQYNLTKDDSDYLWYTTSFRLEADDLPFRGDIRPVVQVKS 517

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
             HA + FVN      G G+     F+    I L  GIN L +LS  +G+++ G      
Sbjct: 518 TSHALMGFVNDAFAGNGRGSKKEKGFMFETPINLRIGINHLALLSSSMGMKDSGGELVEV 577

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
             G+    +  L  G  DL    W ++V +EGE   +       +  W   +T    +++
Sbjct: 578 KGGIQDCTIQGLNTGTLDLQVNGWGHKVKLEGEVKEIYTEKGMGAVKWVPATT---GRAV 634

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            WYK  F  P+G+ P+ L++ SMGKG  +VNG+ +GRYW +Y                  
Sbjct: 635 TWYKRYFDEPDGEDPVVLDMTSMGKGMIFVNGEGMGRYWPSYRTVG-------------- 680

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEA 718
                   G P+Q +YHIPR ++ P  NLLVI EE  G P  I + T     IC F+SE 
Sbjct: 681 --------GVPSQAMYHIPRPFLKPKNNLLVIFEEELGKPEGILIQTVRRDDICVFISEH 732

Query: 719 DPPPVDSWKPNLG---VVSSSPQVR--LACERGWHIAAINFASYGIPEGNCGSFRPGACH 773
           +P  + +W  + G   V++     R  L C     I  + FAS+G PEG+C +F  G+CH
Sbjct: 733 NPAQIKTWDKDGGQIKVIAEDHSTRGILKCPPKKTIQEVVFASFGNPEGSCANFTAGSCH 792

Query: 774 M-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             +   IV K C+G+  C +PV     G     CP     LAV+  C
Sbjct: 793 TPNAKDIVAKECLGKKSCVLPVLHTVYGADIN-CPTTTATLAVQVRC 838


>gi|449445172|ref|XP_004140347.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
          Length = 493

 Score =  531 bits (1367), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 251/467 (53%), Positives = 328/467 (70%), Gaps = 7/467 (1%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L  NV+YD  A++I+G+RR++ SGSIHYPRST  +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 18  LGDNVSYDSNAIIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRH 77

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP R +Y F GR D ++F + +Q+AGL++ +RIGPY CAEWNYGGFPVWLH +PGIQ RT
Sbjct: 78  EPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWLHNMPGIQLRT 137

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEW-AYGVGGELYVKW 179
            N  +K EM+ F  KI+++ KQ NLFASQGGPIILAQ+ENEYGNV   AYG  G+ Y+ W
Sbjct: 138 NNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAYGDAGKAYINW 197

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
            A  A +LN  VPW+MCQQ DAP PIINTCNGFYCD FTPN+P  P M+TEN+ GWF  +
Sbjct: 198 CAQMAESLNIGVPWIMCQQSDAPQPIINTCNGFYCDNFTPNNPKSPKMFTENWVGWFKKW 257

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G   P+R  ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEY
Sbjct: 258 GDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEY 317

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE-AHIYHKSSNDCAAFLANYD 358
           G + QPKWGHL++LH +IKL E+ L +   T+Q  G+ +     ++ ++ +   FL+N D
Sbjct: 318 GNLNQPKWGHLKQLHASIKLGEKILTNGTHTNQNFGSSVTLTKFFNPTTGERFCFLSNTD 377

Query: 359 SSSDANVTFNGN-VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
             +DA +    +  YF+PAWSVSIL  C   V+NTAKV SQ +     F +++N  E   
Sbjct: 378 GKNDATIDLQADGKYFVPAWSVSILDGCNKEVYNTAKVNSQTS----MFVKEQNEKENAQ 433

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
            S A++    K  + GN  F      EQ   T D SDY WY  ++  
Sbjct: 434 LSWAWAPEPMKDTLQGNGKFAANLFLEQKRVTADFSDYFWYMTNVDT 480


>gi|110741385|dbj|BAF02242.1| putative galactosidase [Arabidopsis thaliana]
          Length = 592

 Score =  530 bits (1364), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 280/603 (46%), Positives = 369/603 (61%), Gaps = 20/603 (3%)

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV 286
           MWTE ++GWF  FG  VP+RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGGP +
Sbjct: 1   MWTEAWTGWFTKFGGPVPYRPAEDMAFSVARFIQKGGSFINYYMYHGGTNFGRTAGGPFI 60

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKS 346
           ATSYDYDAP+DEYG  RQPKWGHL++LH+AIKLCE  L+S +PT   LG   EAH+Y   
Sbjct: 61  ATSYDYDAPLDEYGLERQPKWGHLKDLHRAIKLCEPALVSGEPTRMPLGNYQEAHVYKSK 120

Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF 406
           S  C+AFLANY+  S A V+F  N Y LP WS+SILPDCKN V+NTA+V +Q        
Sbjct: 121 SGACSAFLANYNPKSYAKVSFGNNHYNLPPWSISILPDCKNTVYNTARVGAQT------- 173

Query: 407 AQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMP 466
           ++ K V   +    ++  Y E      + SF    L EQINTT+DTSDYLWY   + V  
Sbjct: 174 SRMKMVRVPVHGGLSWQAYNEDPSTYIDESFTMVGLVEQINTTRDTSDYLWYMTDVKVDA 233

Query: 467 GQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDI 521
            +     G    L + S GHA  VF+N +L    YG+ D       K + L  G N + I
Sbjct: 234 NEGFLRNGDLPTLTVLSAGHAMHVFINGQLSGSAYGSLDSPKLTFRKGVNLRAGFNKIAI 293

Query: 522 LSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISL 580
           LS+ VGL N G  F+   AG+   V L  L  G+RDLS  +W Y+VG++GE + L  +S 
Sbjct: 294 LSIAVGLPNVGPHFETWNAGVLGPVSLNGLNGGRRDLSWQKWTYKVGLKGESLSLHSLSG 353

Query: 581 ANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
           ++S  W +G+ +   + L WYKTTF AP G  PLA+++ SMGKGQ W+NGQS+GR+W AY
Sbjct: 354 SSSVEWAEGAFVAQKQPLTWYKTTFSAPAGDSPLAVDMGSMGKGQIWINGQSLGRHWPAY 413

Query: 641 LAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
            A   G   +C Y G++   KC ++CG+ +Q  YH+PR+W+ P  NLLV+ EE GGDP+ 
Sbjct: 414 KA--VGSCSECSYTGTFREDKCLRNCGEASQRWYHVPRSWLKPSGNLLVVFEEWGGDPNG 471

Query: 701 ISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS--SPQVRLACERGWHIAAINFASYG 758
           I+L+ +    +C+ + E     V+      G V+    P+  L C  G  I  + FAS+G
Sbjct: 472 ITLVRREVDSVCADIYEWQSTLVNYQLHASGKVNKPLHPKAHLQCGPGQKITTVKFASFG 531

Query: 759 IPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEA 817
            PEG CGS+R G+CH         K CVGQ  CS+ V+    G     CP ++K LAVEA
Sbjct: 532 TPEGTCGSYRQGSCHAHHSYDAFNKLCVGQNWCSVTVAPEMFG--GDPCPNVMKKLAVEA 589

Query: 818 HCS 820
            C+
Sbjct: 590 VCA 592


>gi|449526237|ref|XP_004170120.1| PREDICTED: beta-galactosidase 7-like, partial [Cucumis sativus]
          Length = 706

 Score =  523 bits (1346), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 291/686 (42%), Positives = 405/686 (59%), Gaps = 54/686 (7%)

Query: 158 VENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF 217
           +ENE+GNVE +YG  G+ YVKW A+ A + N S PW+MCQQ DAP PIINTCNGFYCD F
Sbjct: 1   IENEFGNVEGSYGQEGKEYVKWCAELAQSYNLSEPWIMCQQGDAPQPIINTCNGFYCDQF 60

Query: 218 TPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF 277
            PN+ + P MWTE+++GWF  +G   P+R  EDLAFAVARFF+ GG+  NYYMY GGTNF
Sbjct: 61  KPNNKNSPKMWTESWAGWFKGWGERDPYRTAEDLAFAVARFFQYGGSLHNYYMYHGGTNF 120

Query: 278 GRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK 337
           GR+AGGP + TSYDY+AP+DEYG + QPKWGHL++LH+ I+  E+ L   D  H   G  
Sbjct: 121 GRSAGGPYITTSYDYNAPLDEYGNMNQPKWGHLKQLHELIRSMEKVLTYGDVKHIDTGHS 180

Query: 338 LEAHIY-HKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVI 396
             A  Y +K  + C  F  N   +SD  +TF    Y +P WSV++LPDCK  V+NTAKV 
Sbjct: 181 TTATSYTYKGKSSC--FFGN-PENSDREITFQERKYTVPGWSVTVLPDCKTEVYNTAKVN 237

Query: 397 SQRNNGDH-PFAQQKNVNELLLASSAFSWYEEKV-------GISGNRSFVRPDLAEQINT 448
           +Q    +  P    K+   L      + W  EK+        ISG+ +     L +Q   
Sbjct: 238 TQTTIREMVPSLVGKHKKPL-----KWQWRNEKIEHLTHEGDISGS-AITANSLIDQKMV 291

Query: 449 TKDTSDYLWYTASIHVM---PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFL 505
           T D+SDYLWY    H+    P  GK V L +++ GH    FVN K +   +G +   +F 
Sbjct: 292 TNDSSDYLWYLTGFHLNGNDPLFGKRVTLRVKTRGHILHAFVNNKHIGTQFGPYGKYSFT 351

Query: 506 INKKIE-LNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGK--RDLSSGEW 562
           + KK+  L  G N + +LS  VGL NYGA+++    G++  + + + +GK  RDLS+ EW
Sbjct: 352 LEKKVRNLRHGFNQIALLSATVGLPNYGAYYENVEVGIYGPVEL-IADGKTIRDLSTNEW 410

Query: 563 IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMG 622
           IY+VG++GE              W   + LP+N++  WYKT+F  P+G+  + ++L  MG
Sbjct: 411 IYKVGLDGEKYEFFDPDHKFRKPW-LSNNLPLNQNFTWYKTSFSTPKGREGVVVDLMGMG 469

Query: 623 KGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH 682
           KGQAWVNG+SIGRYW +YLA   GC+  CDYRG+Y  SKC  +CG+P Q  YHIPR++++
Sbjct: 470 KGQAWVNGKSIGRYWPSYLATENGCSSSCDYRGAYYGSKCATNCGKPTQRWYHIPRSYMN 529

Query: 683 PG-ENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRL 741
            G EN L++ EE GG P  I + T   + +C+             K +LG      ++ L
Sbjct: 530 DGKENTLILFEEFGGMPLNIEIKTTRVKKVCA-------------KVDLG-----SKLEL 571

Query: 742 ACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLG 800
            C     +  I F  +G P+GNC +F  G+CH  +   +++K C+ + +CSI V+   LG
Sbjct: 572 TCHDR-TVKRIIFVGFGNPKGNCNNFHKGSCHSSEAFSVIEKECLWKRKCSIEVTKDKLG 630

Query: 801 VSAGACPGLLKALAVE------AHCS 820
           ++    P     LAV+      +HCS
Sbjct: 631 LTGCKNPK-DNWLAVQPFWHHKSHCS 655


>gi|110737487|dbj|BAF00686.1| beta-galactosidase [Arabidopsis thaliana]
          Length = 532

 Score =  521 bits (1343), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 264/530 (49%), Positives = 336/530 (63%), Gaps = 17/530 (3%)

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           AV+ N  VPW+MCQQ DAP  +I+TCNGFYCD FTPN+P KP +WTEN+ GWF +FG   
Sbjct: 2   AVSQNIGVPWMMCQQWDAPPTVISTCNGFYCDQFTPNTPDKPKIWTENWPGWFKTFGGRD 61

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED+A++VARFF  GG+  NYYMY GGTNFGRT+GGP + TSYDY+APIDEYG  R
Sbjct: 62  PHRPAEDVAYSVARFFGKGGSVHNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPR 121

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
            PKWGHL++LHKAI L E  LIS +  +  LG  LEA +Y  SS  CAAFL+N D  +D 
Sbjct: 122 LPKWGHLKDLHKAIMLSENLLISGEHQNFTLGHSLEADVYTDSSGTCAAFLSNLDDKNDK 181

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V F    Y LPAWSVSILPDCK  VFNTAKV S+        ++ + + E L +SS   
Sbjct: 182 AVMFRNTSYHLPAWSVSILPDCKTEVFNTAKVTSKS-------SKVEMLPEDLKSSSGLK 234

Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
           W  + EK GI G   FV+ +L + INTTKDT+DYLWYT SI V   +     G    L I
Sbjct: 235 WEVFSEKPGIWGAADFVKNELVDHINTTKDTTDYLWYTTSITVSENEAFLKKGSSPVLFI 294

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           ES GH   VF+NK+ +    GN     F + K + L  G N +D+LSM VGL N G++++
Sbjct: 295 ESKGHTLHVFINKEYLGTATGNGTHVPFKLKKPVALKAGENNIDLLSMTVGLANAGSFYE 354

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
             GAGL SV +     G  +L++ +W Y++GVEGE++ L K   + +  W   +  P  +
Sbjct: 355 WVGAGLTSVSIKGFNKGTLNLTNSKWSYKLGVEGEHLELFKPGNSGAVKWTVTTKPPKKQ 414

Query: 597 SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL---APSTGCTKKCDY 653
            L WYK     P G  P+ L++ SMGKG AW+NG+ IGRYW       +P+  C K+CDY
Sbjct: 415 PLTWYKVVIEPPSGSEPVGLDMISMGKGMAWLNGEEIGRYWPRIARKNSPNDECVKECDY 474

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
           RG +   KC   CG+P+Q  YH+PR+W     N LVI EE GG+P KI L
Sbjct: 475 RGKFMPDKCLTGCGEPSQRWYHVPRSWFKSSGNELVIFEEKGGNPMKIKL 524


>gi|298205211|emb|CBI17270.3| unnamed protein product [Vitis vinifera]
          Length = 1064

 Score =  515 bits (1327), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 233/340 (68%), Positives = 280/340 (82%), Gaps = 1/340 (0%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHRAL+IDGKRR+L S  IHYPR+TPE+WP+LI KSKEGG +VI+TYVFWN HEP+
Sbjct: 28  NVSYDHRALLIDGKRRMLVSAGIHYPRATPEMWPDLIAKSKEGGADVIQTYVFWNGHEPV 87

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           R QY FEGR+D+V+FVK V  +GL+LHLRIGPY CAEWN+GGFPVWL  IPGI+FRT N 
Sbjct: 88  RRQYNFEGRYDIVKFVKLVGSSGLYLHLRIGPYVCAEWNFGGFPVWLRDIPGIEFRTDNA 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PFK+EM+RF+ KI+DLM++E LF+ QGGPII+ Q+ENEYGNVE ++G  G+ YVKWAA  
Sbjct: 148 PFKDEMQRFVKKIVDLMQKEMLFSWQGGPIIMLQIENEYGNVESSFGQRGKDYVKWAARM 207

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+  VPWVMCQQ DAPD IIN CNGFYCD F PNS +KP +WTE+++GWF S+G   
Sbjct: 208 ALELDAGVPWVMCQQADAPDIIINACNGFYCDAFWPNSANKPKLWTEDWNGWFASWGGRT 267

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RPVED+AFAVARFF+ GG+F NYYMYFGGTNFGR++GGP   TSYDYDAPIDEYG + 
Sbjct: 268 PKRPVEDIAFAVARFFQRGGSFHNYYMYFGGTNFGRSSGGPFYVTSYDYDAPIDEYGLLS 327

Query: 304 QPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHI 342
           QPKWGHL+ELH AIKLCE  L++ D P + KLG   E  +
Sbjct: 328 QPKWGHLKELHAAIKLCEPALVAVDSPQYIKLGPMQEVGV 367



 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 206/511 (40%), Positives = 286/511 (55%), Gaps = 41/511 (8%)

Query: 334  LGAKLEAHIYH--------KSSN--DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILP 383
            +  K  AH+Y         +S N   C+AFLAN D    A+VTF G +Y LP WSVSILP
Sbjct: 561  MDTKQTAHVYRVKESLYSTQSGNGSSCSAFLANIDEHKTASVTFLGQIYKLPPWSVSILP 620

Query: 384  DCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLA 443
            DC+  VFNTAKV +Q +            N++      +   +E + +    +F    + 
Sbjct: 621  DCRTTVFNTAKVGAQTS---------IKTNKISYVPKTWMTLKEPISVWSENNFTIQGVL 671

Query: 444  EQINTTKDTSDYLWYTASIHVMPG-----QGKEV--FLNIESLGHAALVFVNKKLVAFGY 496
            E +N TKD SDYLW    I+V        +  +V   L+I+S+     +FVN +L+    
Sbjct: 672  EHLNVTKDHSDYLWRITRINVSAEDISFWEENQVSPTLSIDSMRDILHIFVNGQLIGSVI 731

Query: 497  GNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SVILIDLKNGKR 555
            G+       + + I+L +G N L +LS  VGLQNYGA+ +  GAG    V L   KNG+ 
Sbjct: 732  GHW----VKVVQPIQLLQGYNDLVLLSQTVGLQNYGAFLEKDGAGFKGQVKLTGFKNGEI 787

Query: 556  DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLA 615
            DLS   W YQVG+ GE+  +  I  +  + W   +      +  WYKT F AP G+ P+A
Sbjct: 788  DLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTDLTPDASPSTFTWYKTFFDAPNGENPVA 847

Query: 616  LNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYH 675
            L+L SMGKGQAWVNG  IGRYW+  +AP  GC  KCDYRG Y  SKC  +CG P Q  YH
Sbjct: 848  LDLGSMGKGQAWVNGHHIGRYWTR-VAPKDGC-GKCDYRGHYHTSKCATNCGNPTQIWYH 905

Query: 676  IPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSS 735
            IPR+W+    NLLV+ EE GG P +IS+ +++ Q IC+ VSE+  P + +W P+  +  +
Sbjct: 906  IPRSWLQASNNLLVLFEETGGKPFEISVKSRSTQTICAEVSESHYPSLQNWSPSDFIDQN 965

Query: 736  S-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIE 789
            S     P++ L C+ G  I++I FASYG P+G+C  F  G CH  + L +V KAC G+  
Sbjct: 966  SKNKMTPEMHLQCDDGHTISSIEFASYGTPQGSCQMFSQGQCHAPNSLALVSKACQGKGS 1025

Query: 790  CSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            C I + ++  G     C G++K LAVEA C+
Sbjct: 1026 CVIRILNSAFG--GDPCRGIVKTLAVEAKCA 1054


>gi|33521216|gb|AAQ21370.1| beta-galactosidase [Sandersonia aurantiaca]
          Length = 568

 Score =  513 bits (1320), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 268/595 (45%), Positives = 363/595 (61%), Gaps = 47/595 (7%)

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P RP ED+AFAVARF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG +R
Sbjct: 1   PHRPAEDIAFAVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYGLLR 60

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           +PKWGHLR+LH+AIKLCE  L+S DPT   +G   ++H++   +  CAAFL+NYDS S A
Sbjct: 61  EPKWGHLRDLHRAIKLCEPALVSGDPTVTSIGHYQQSHVFRSKAGACAAFLSNYDSGSYA 120

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V FNG  Y +P WS+SILPDCK  VFNTA++ +Q +     +A +            FS
Sbjct: 121 RVVFNGIHYDIPPWSISILPDCKTTVFNTARIGAQTSQLKMEWAGK------------FS 168

Query: 424 W--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNI 476
           W  Y E      +RSF +  L EQI+ T+D +DYLWYT  +++   +     G    L +
Sbjct: 169 WESYNEDTNSFDDRSFTKVGLVEQISMTRDNTDYLWYTTYVNIGENEGFLKNGHYPVLTV 228

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
            S GH+  +++N +L    YG  +         ++L  G N + ILS+ VGL N G  F+
Sbjct: 229 NSAGHSMHIYINGQLTGTIYGALENPKLTYTGSVKLWAGSNKISILSVAVGLPNIGGHFE 288

Query: 537 VAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
               G+   V L  L  GKRDLS  +WIYQ+G++GE + L  +S ++S  W   S     
Sbjct: 289 TWNTGVLGPVTLSGLNEGKRDLSWQKWIYQIGLKGEALNLHTLSGSSSVEWGGPSQ---K 345

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           +SL WYKT+F AP G  PLAL++ SMGKGQ W+NGQS+GRYW AY A  +G    CDYRG
Sbjct: 346 QSLTWYKTSFNAPAGNDPLALDMGSMGKGQVWINGQSVGRYWPAYKA--SGSCGGCDYRG 403

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
           +Y+  KCQ +CG+  Q  YH+PR+W++P  NLLV+ EE GGDPS IS++ +  + +C+ +
Sbjct: 404 TYNEKKCQSNCGESTQRWYHVPRSWLNPTGNLLVVFEEWGGDPSGISMVRRKVESVCAEI 463

Query: 716 SEADPPPVDSWKPNLGVVSSS----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGA 771
           +E        W+PN+  V +      +  L+C  G  +  I FAS+G P+G CG+F  G 
Sbjct: 464 AE--------WQPNMDNVHTGNYGRSKAHLSCAPGQKMTNIKFASFGTPQGTCGAFSEGT 515

Query: 772 CH-------MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
           CH        +   ++Q  C+GQ  C++ V+    G     CPG +K LAVEA C
Sbjct: 516 CHAHKSYDAFEKESLLQN-CIGQQSCAVLVAPEVFG--GDPCPGTMKKLAVEAIC 567


>gi|110739914|dbj|BAF01862.1| beta-galactosidase like protein [Arabidopsis thaliana]
          Length = 578

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 275/584 (47%), Positives = 357/584 (61%), Gaps = 30/584 (5%)

Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHL 310
           LAF VARF + GG+F NYYMY GGTNFGRTAGGP V TSYDYDAPIDEYG IRQPK+GHL
Sbjct: 1   LAFGVARFIQKGGSFVNYYMYHGGTNFGRTAGGPFVTTSYDYDAPIDEYGLIRQPKYGHL 60

Query: 311 RELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGN 370
           +ELH+AIK+CE+ L+S+DP    +G K +AH+Y   S DC+AFLANYD+ S A V FN  
Sbjct: 61  KELHRAIKMCEKALVSADPVVTSIGNKQQAHVYSAESGDCSAFLANYDTESAARVLFNNV 120

Query: 371 VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW---YEE 427
            Y LP WS+SILPDC+N VFNTAKV  Q +  +      KN          F W    E+
Sbjct: 121 HYNLPPWSISILPDCRNAVFNTAKVGVQTSQMEMLPTDTKN----------FQWESYLED 170

Query: 428 KVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHA 482
              +  + +F    L EQIN T+DTSDYLWY  S+ +   +     G+   L I+S GHA
Sbjct: 171 LSSLDDSSTFTTHGLLEQINVTRDTSDYLWYMTSVDIGDSESFLHGGELPTLIIQSTGHA 230

Query: 483 ALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGL 542
             +FVN +L    +G      F    KI L+ G N + +LS+ VGL N G  F+    G+
Sbjct: 231 VHIFVNGQLSGSAFGTRQNRRFTYQGKINLHSGTNRIALLSVAVGLPNVGGHFESWNTGI 290

Query: 543 FS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVNKSLIW 600
              V L  L  GK DLS  +W YQVG++GE + L   +   S  W   S T+   + L W
Sbjct: 291 LGPVALHGLSQGKMDLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTW 350

Query: 601 YKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDAS 660
           +KT F APEG  PLAL++  MGKGQ WVNG+SIGRYW+A+   +TG    C Y G+Y  +
Sbjct: 351 HKTYFDAPEGNEPLALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSYTGTYKPN 407

Query: 661 KCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADP 720
           KCQ  CGQP Q  YH+PR W+ P +NLLVI EELGG+PS +SL+ ++   +C+ VSE   
Sbjct: 408 KCQTGCGQPTQRWYHVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH- 466

Query: 721 PPVDSWKPN---LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV- 776
           P + +W+      G     P+V L C  G  IA+I FAS+G P G CGS++ G CH    
Sbjct: 467 PNIKNWQIESYGKGQTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATS 526

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
             I+++ CVG+  C++ +S++  G     CP +LK L VEA C+
Sbjct: 527 YAILERKCVGKARCAVTISNSNFG--KDPCPNVLKRLTVEAVCA 568


>gi|359477955|ref|XP_003632046.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 10-like [Vitis
           vinifera]
          Length = 563

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 270/546 (49%), Positives = 346/546 (63%), Gaps = 24/546 (4%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +W  L++ +KEGG++VIETYVF N HE     YYF G +DL++FVK VQ+AG++L L IG
Sbjct: 1   MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           P+   EWN+GG P+WLH++P   F+T + PFK  M++F+  I+++MK++ LFASQGGPII
Sbjct: 61  PFVATEWNFGGVPIWLHYVPRTIFQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 120

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC 214
           L QVENEYG+ +  Y  GG+ YV WAA+  ++ N  VPW+MCQ   + DP+INTCN FYC
Sbjct: 121 LTQVENEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQXYASSDPMINTCNSFYC 180

Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
           D FTPNSPSK  MWTEN+  WF +FG +   R  ED+AF+VA FF       NYYMY GG
Sbjct: 181 DQFTPNSPSKAQMWTENWPRWFKTFGASNSHRLHEDIAFSVALFFFPKSX--NYYMYHGG 238

Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
           TNFG T+GGP + T+Y+Y+APIDEYG  R PK GHL+EL +AIK CE  L+  +P +  L
Sbjct: 239 TNFGCTSGGPFITTTYNYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLLYGEPINLXL 298

Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK 394
           G   E  +Y  S    AAF++N D   D  + F    Y +PAWSVSILPDCKNVVFNTAK
Sbjct: 299 GPSQEVDVYADSLGGYAAFISNVDEKEDKMIVFQNXSYHVPAWSVSILPDCKNVVFNTAK 358

Query: 395 VISQRNNGDHPFAQQKNVNELLLAS--------SAFSW--YEEKVGISGNRSFVRPDLAE 444
           V+SQ        +Q + V E L  S            W  + EK GI G   FV+    +
Sbjct: 359 VVSQ-------ISQVEMVLEDLQPSLVPSNKDLKGLXWKTFVEKAGIWGEADFVKNGFVD 411

Query: 445 QINTTKDTSDYLWYTASIHVMPGQG--KEV---FLNIESLGHAALVFVNKKLVAFGYGNH 499
            INTTKDT+D LWYT SI V   +   KE+    L +ES GHA   FVN+KL     GN 
Sbjct: 412 HINTTKDTTDXLWYTVSITVGESENFLKEISQPILLVESKGHALHAFVNQKLQGSASGNG 471

Query: 500 DFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSS 559
             + F     I L  G N + +LSM VGLQN   +++  GA L SV +  L NG  DLS+
Sbjct: 472 SHSPFKFECPISLKAGKNEIVVLSMTVGLQNEIPFYEWVGARLTSVKIKGLNNGIMDLST 531

Query: 560 GEWIYQ 565
             WIY+
Sbjct: 532 YPWIYK 537


>gi|281205901|gb|EFA80090.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
           PN500]
          Length = 727

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 284/711 (39%), Positives = 403/711 (56%), Gaps = 42/711 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           NV+YDHR+L+I+G+R++L S SIHYPR+TP +W  ++  +K  G+++IETY FWN HEP 
Sbjct: 42  NVSYDHRSLIINGERKLLLSASIHYPRATPSMWRPVLEATKAAGIDLIETYTFWNLHEPT 101

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G Y FEG  ++  F+    E GL++ +R GPY CAEWNYGGFP WL  I GI FR  N 
Sbjct: 102 PGTYNFEGNANVTAFLDICAELGLYVTVRFGPYVCAEWNYGGFPFWLKEIDGIVFRDYNQ 161

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           PF ++M  ++  I++ ++    +AS GGPIILAQVENEYG +E AYG  G  Y  WAA  
Sbjct: 162 PFMDQMSNWMTYIVNYLRP--YYASNGGPIILAQVENEYGWLEAAYGASGTKYALWAAQF 219

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYC----DGFTPNSPSKPIMWTENYSGWFLSF 239
           A +L+  +PW+MC Q+D    +INTCNGFYC    D      P++P  WTEN+ GWF ++
Sbjct: 220 ANSLDIGIPWIMCSQDDIAT-VINTCNGFYCHDWIDVHWTAYPNQPAFWTENWPGWFQNW 278

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
              VP RPV+D+ ++VAR+   GG+  NYYM+FGGT FGR  GGP + TSYDYD  IDEY
Sbjct: 279 EGGVPHRPVQDVLYSVARWIAYGGSMMNYYMWFGGTTFGRWTGGPFITTSYDYDGAIDEY 338

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ-KLGAKLE-AHIYHKSSNDCAAFLANY 357
           G+  +PK+    E H  I   E  ++S +P     LG  +E +H Y   + +  +FLAN+
Sbjct: 339 GYPYEPKYSQSLEFHTIIHAYEHIILSMNPPKPILLGENVEISHFYSVETGESFSFLANF 398

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVI-SQRNNGDHPFAQQKNVNELL 416
            ++    V +NG  + +  WSV +L +  ++   +A  I S       P    +N+ +  
Sbjct: 399 GATGVQTVQWNGITFKVQPWSVQLLYNNVSIFDTSATPIGSPVPKQFTPIKSFENIGQ-- 456

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
                   + E   ++       P   EQ++ T+D +DYLWY   I V     +    NI
Sbjct: 457 --------WSESFDLTFTNYSETP--MEQLSLTRDQTDYLWYVTKIEVNRVGAQLSLPNI 506

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
             + H   VFV+ + +A G G     N  +N  I +  G +TL +L   VGL NY    +
Sbjct: 507 SDMVH---VFVDNQYIATGRGP---TNITLNSTIGV--GGHTLQVLHTKVGLVNYAEHME 558

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
              AG+F  + +D      D+SS  W  +  V+GE + L   + + S  W   + +  N 
Sbjct: 559 ATVAGIFEPVTLD----SVDISSNGWSMKPFVQGETLQLYNPNHSGSVQW---TNVTGNP 611

Query: 597 SLIWYKTTF-LAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
            L WYK  F L       LAL++  M KG  +VNG +IGRYW   LA + GC   C Y+G
Sbjct: 612 PLTWYKFNFNLELSSNMSLALDMLGMTKGMIFVNGYNIGRYW---LALAYGC-NPCTYQG 667

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
            Y  S CQ  CG+P+Q  YH+P  W+  GEN +VI EE+ G+P  I+L+ +
Sbjct: 668 GYSPSMCQLGCGEPSQQYYHVPTDWLMNGENEIVIFEEVYGNPEAITLVQR 718


>gi|297789001|ref|XP_002862517.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297308086|gb|EFH38775.1| hypothetical protein ARALYDRAFT_333310 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 534

 Score =  496 bits (1276), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 265/539 (49%), Positives = 349/539 (64%), Gaps = 31/539 (5%)

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G +RQPKWGHLR+LHKAIKLCE+ LI++DPT   LG+ LEA +Y  +S  CAAFLAN  +
Sbjct: 9   GLLRQPKWGHLRDLHKAIKLCEDALIATDPTISSLGSNLEAAVYKTASGSCAAFLANVGT 68

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHP--FAQQK---NVNE 414
            SDA V+FNG  Y LPAWSVSILPDCKNV FNTAK+    N+   P  FA+Q    +   
Sbjct: 69  KSDATVSFNGESYHLPAWSVSILPDCKNVAFNTAKI----NSATEPTAFARQSLKPDGGS 124

Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-----MPGQG 469
                S +S+ +E +GIS   +F++P L EQINTT D SDYLWY+  + +        +G
Sbjct: 125 SAELGSEWSYIKEPIGISKADAFLKPGLLEQINTTADKSDYLWYSLRMDIKGDETFLDEG 184

Query: 470 KEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
            +  L+IESLG     F+N KL   G+G    +   ++  I L  G NT+D+LS+ VGL 
Sbjct: 185 SKAVLHIESLGQVVYAFINGKLAGSGHGKQKIS---LDIPINLVAGKNTVDLLSVTVGLA 241

Query: 530 NYGAWFDVAGAGLFS-VILIDLKNGKR-DLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
           NYGA+FD+ GAG+   V L   K G   DL+S +W YQVG++GE  GL  +   +SS W 
Sbjct: 242 NYGAFFDLVGAGITGPVTLKSAKGGSSIDLASQQWTYQVGLKGEDTGLGAV---DSSEWV 298

Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
             S LP  + LIWYKTTF AP G  P+A++     KG AWVNGQSIGRYW   +A + GC
Sbjct: 299 SKSPLPTKQPLIWYKTTFDAPSGSEPVAIDFTGTVKGIAWVNGQSIGRYWPTSIAGNGGC 358

Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK- 706
           T  CDYRGSY A+KC K+CG+P+QTLYH+PR+W+ P  N LV+ EE+GGDP++IS  TK 
Sbjct: 359 TDSCDYRGSYRANKCLKNCGKPSQTLYHVPRSWLKPSGNTLVLFEEMGGDPTQISFGTKQ 418

Query: 707 TGQHICSFVSEADPPPVDSWKPNLGVVS---SSPQVRLACERGWH-IAAINFASYGIPEG 762
           TG ++C  VS++ PPPVD+W  +  + +   + P + L C      I++I FAS+G P+G
Sbjct: 419 TGSNLCLTVSQSHPPPVDTWTSDSKISNRNRTRPVLSLQCPVSTQVISSIKFASFGTPKG 478

Query: 763 NCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CGSF  G+C+    L +VQKAC+G   C+I VS+   G     C G++K+LAVEA CS
Sbjct: 479 TCGSFTSGSCNSSRSLSLVQKACIGSRSCNIEVSTRVFGE---PCRGVVKSLAVEASCS 534


>gi|15027869|gb|AAK76465.1| putative beta-galactosidase [Arabidopsis thaliana]
          Length = 621

 Score =  487 bits (1253), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 262/647 (40%), Positives = 373/647 (57%), Gaps = 38/647 (5%)

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A +L+  VPW+MCQQ +AP P++ TCNGFYCD + P +PS P MWTEN++GWF ++G   
Sbjct: 2   ANSLDIGVPWLMCQQPNAPQPMLETCNGFYCDQYEPTNPSTPKMWTENWTGWFKNWGGKH 61

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+R  EDLAF+VARFF+TGGTFQNYYMY GGTNFGR AGGP + TSYDY AP+DE+G + 
Sbjct: 62  PYRTAEDLAFSVARFFQTGGTFQNYYMYHGGTNFGRVAGGPYITTSYDYHAPLDEFGNLN 121

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHL++LH  +K  E+ L   + +   LG  ++A IY  +    + F+ N ++++DA
Sbjct: 122 QPKWGHLKQLHTVLKSMEKSLTYGNISRIDLGNSIKATIY-TTKEGSSCFIGNVNATADA 180

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFS 423
            V F G  Y +PAWSVS+LPDC    +NTAKV +Q +      ++ + +       SA  
Sbjct: 181 LVNFKGKDYHVPAWSVSVLPDCDKEAYNTAKVNTQTSIMTEDSSKPERLEWTWRPESA-- 238

Query: 424 WYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV---MPGQGKEVFLNIESLG 480
              +K+ + G+   +   L +Q + T D SDYLWY   +H+    P   + + L + S  
Sbjct: 239 ---QKMILKGSGDLIAKGLVDQKDVTNDASDYLWYMTRLHLDKKDPLWSRNMTLRVHSNA 295

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKI-ELNEGINTLDILSMMVGLQNYGAWFDVAG 539
           H    +VN K V   +      ++   +K+  L  G N + +LS+ VGLQNYG +F+   
Sbjct: 296 HVLHAYVNGKYVGNQFVKDGKFDYRFERKVNHLVHGTNHISLLSVSVGLQNYGPFFESGP 355

Query: 540 AGLFS-VILIDLKNG---KRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
            G+   V L+  K     ++DLS  +W Y++G+ G    L  I       W     LP  
Sbjct: 356 TGINGPVSLVGYKGEETIEKDLSQHQWDYKIGLNGYNDKLFSIKSVGHQKWAN-EKLPTG 414

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           + L WYK  F AP GK P+ ++L  +GKG+AW+NGQSIGRYW ++ +   GC  KCDYRG
Sbjct: 415 RMLTWYKAKFKAPLGKEPVIVDLNGLGKGEAWINGQSIGRYWPSFNSSDDGCKDKCDYRG 474

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVH-PGENLLVIHEELGGDPSKISLLTKTGQHICSF 714
           +Y + KC   CG+P Q  YH+PR++++  G N + + EE+GG+PS ++  T     +C+ 
Sbjct: 475 AYGSDKCAFMCGKPTQRWYHVPRSFLNASGHNTITLFEEMGGNPSMVNFKTVVVGTVCAR 534

Query: 715 VSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH- 773
             E +                  +V L+C     I+A+ FAS+G P G+CGSF  G C  
Sbjct: 535 AHEHN------------------KVELSCHNR-PISAVKFASFGNPLGHCGSFAVGTCQG 575

Query: 774 -MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
             D    V K CVG++ C++ VSS   G S   C    K LAVE  C
Sbjct: 576 DKDAAKTVAKECVGKLNCTVNVSSDTFG-STLDCGDSPKKLAVELEC 621


>gi|115445061|ref|NP_001046310.1| Os02g0219200 [Oryza sativa Japonica Group]
 gi|113535841|dbj|BAF08224.1| Os02g0219200, partial [Oryza sativa Japonica Group]
          Length = 500

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 250/518 (48%), Positives = 328/518 (63%), Gaps = 26/518 (5%)

Query: 197 QQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVA 256
           +Q+DAPDP+INTCNGFYCD F+PN   KP MWTE ++GWF SFG  VP RPVEDLAFAVA
Sbjct: 1   KQDDAPDPVINTCNGFYCDYFSPNKNYKPSMWTEAWTGWFTSFGGGVPHRPVEDLAFAVA 60

Query: 257 RFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
           RF + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDE+G +RQPKWGHLR+LH+A
Sbjct: 61  RFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEFGLLRQPKWGHLRDLHRA 120

Query: 317 IKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPA 376
           IK  E  L+S+DPT + +G+  +A+++   +  CAAFL+NY  ++   V FNG  Y LPA
Sbjct: 121 IKQAEPVLVSADPTIESIGSYEKAYVFKAKNGACAAFLSNYHMNTAVKVRFNGQQYNLPA 180

Query: 377 WSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGN 434
           WS+SILPDCK  VFNTA V            ++  +   +     F+W  Y E      +
Sbjct: 181 WSISILPDCKTAVFNTATV------------KEPTLMPKMNPVVRFAWQSYSEDTNSLSD 228

Query: 435 RSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ---GKEVFLNIESLGHAALVFVNKKL 491
            +F +  L EQ++ T D SDYLWYT  +++       G+   L + S GH+  VFVN K 
Sbjct: 229 SAFTKDGLVEQLSMTWDKSDYLWYTTYVNIGTNDLRSGQSPQLTVYSAGHSMQVFVNGKS 288

Query: 492 VAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDL 550
               YG +D      N ++++ +G N + ILS  VGL N G  F+    G+   V L  L
Sbjct: 289 YGSVYGGYDNPKLTYNGRVKMWQGSNKISILSSAVGLPNVGNHFENWNVGVLGPVTLSSL 348

Query: 551 KNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW-KQGSTLPVNKSLIWYKTTFLAPE 609
             G +DLS  +W YQVG++GE +GL  ++ +++  W   G   P    L W+K  F AP 
Sbjct: 349 NGGTKDLSHQKWTYQVGLKGETLGLHTVTGSSAVEWGGPGGYQP----LTWHKAFFNAPA 404

Query: 610 GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQP 669
           G  P+AL++ SMGKGQ WVNG  +GRYWS     S GC   C Y G+Y   KC+ +CG  
Sbjct: 405 GNDPVALDMGSMGKGQLWVNGHHVGRYWS--YKASGGC-GGCSYAGTYHEDKCRSNCGDL 461

Query: 670 AQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           +Q  YH+PR+W+ PG NLLV+ EE GGD + +SL T+T
Sbjct: 462 SQRWYHVPRSWLKPGGNLLVVLEEYGGDLAGVSLATRT 499


>gi|414888319|tpg|DAA64333.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
 gi|414888320|tpg|DAA64334.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
          Length = 592

 Score =  477 bits (1228), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 236/532 (44%), Positives = 337/532 (63%), Gaps = 14/532 (2%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+IDGKR +  SG+IHYPRS PEVWP+LI ++KEGGL  IETY+FWN HEP  
Sbjct: 36  VTYDGRSLMIDGKRDLFFSGAIHYPRSPPEVWPKLIERAKEGGLNTIETYIFWNAHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+Y FEGRFDL++++K +QE  ++  +RIGP+  AEWN+GG P WL  I  I FR  N+P
Sbjct: 96  GKYNFEGRFDLIKYLKMIQEHDMYAIVRIGPFIQAEWNHGGLPYWLREIDHIIFRANNDP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+EM++F+  I+  +K   LFASQGGPIIL Q+ENEYGN++  +   G+ Y++WAA  A
Sbjct: 156 YKKEMEKFVRFIVQKLKDAELFASQGGPIILTQIENEYGNIKKDHATDGDKYLEWAAQMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++  T VPW+MC+Q  AP  +I TCNG +C D +T    +KP++WTEN++  F ++G  V
Sbjct: 216 LSTQTGVPWIMCKQSSAPGEVIPTCNGRHCGDTWTLRDKNKPMLWTENWTQQFRAYGDQV 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GG+  NYYMY GGTNFGRT G   V T Y  +AP+DEYG  +
Sbjct: 276 AMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMYK 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSD 362
           +PK+GHLR+LH  I+  ++  +    + + LG   EAHI+     N C +FL+N ++  D
Sbjct: 335 EPKFGHLRDLHNVIRSYQKAFLLGKHSSEILGHGYEAHIFELPEENLCLSFLSNNNTGED 394

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V F G  +++P+ SVSIL  CKNVV+NT +V  Q N   +      + +E+   ++ +
Sbjct: 395 GTVIFRGEKHYVPSRSVSILAGCKNVVYNTKRVFVQHNERSY------HTSEVTSKNNQW 448

Query: 423 SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MPGQGK-EVFLNIE 477
             Y EK+    +      +  EQ N TKD SDYLWYT S  +    +P +      L ++
Sbjct: 449 EMYSEKIPKYRDTKVRMKEPLEQFNQTKDASDYLWYTTSFRLESDDLPFRNDIRPVLQVK 508

Query: 478 SLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQ 529
           S  H+ + F N   V    G+     F+  K ++L  G+N + +LS  +G++
Sbjct: 509 SSAHSMMGFANDAFVGCARGSKQVKGFMFEKPVDLKVGVNHVVLLSSTMGMK 560


>gi|330804272|ref|XP_003290121.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
 gi|325079786|gb|EGC33370.1| hypothetical protein DICPUDRAFT_48969 [Dictyostelium purpureum]
          Length = 735

 Score =  471 bits (1211), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 279/719 (38%), Positives = 402/719 (55%), Gaps = 42/719 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP- 62
           N+TYDHR+L+I+G+R++L SGS+HYPR++   W E+++ SK  G+++IETY+FWN H+P 
Sbjct: 41  NITYDHRSLIINGERKLLVSGSVHYPRASVSKWNEILKSSKLAGVDIIETYIFWNVHQPN 100

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
              ++Y E   ++  F+   +E  LF++LRIGPY CAEWNYGGFP+WL  I GI FR  N
Sbjct: 101 TPNEFYLEDNANITLFLDLCKENELFVNLRIGPYVCAEWNYGGFPIWLKNIEGIVFRDYN 160

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PF + M  ++  ++D  K ++ FA  GGPII+AQ+ENEYG +E  YG  G  Y  WA +
Sbjct: 161 QPFMDAMSTWVTMVVD--KLQDYFAPNGGPIIIAQIENEYGWLENEYGASGREYALWAIN 218

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLS 238
            A +LN  +PW+MC QED  D  INTCNGFYC  +        P +P  WTEN+ GWF +
Sbjct: 219 FAKSLNIGIPWIMCAQEDI-DSAINTCNGFYCHDWIDRHWNAFPDQPAFWTENWVGWFEN 277

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDE 298
           +G AVP RPV+D+ F+ ARF   GG+  NYYM+FGGTNFGR+ GGP + TSY+YDAP+DE
Sbjct: 278 WGQAVPKRPVQDMLFSSARFIAYGGSLFNYYMWFGGTNFGRSVGGPWIITSYEYDAPLDE 337

Query: 299 YGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           +GF  +PK+    + H  I   E  ++  D PT   L    EAH Y +       FL N+
Sbjct: 338 FGFPNEPKYSMSTQFHFVIHKYESIIMGMDPPTPVPLSNISEAHPYGED----LVFLTNF 393

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLL 417
               D  + + G  Y L  WSV I+    +VVF+T+ V  +         Q K+V   + 
Sbjct: 394 GLVIDY-IQWQGTNYTLQPWSVVIVY-SGSVVFDTSYVPDEYIKPSTR-DQFKDVPNAIN 450

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDL-AEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNI 476
             S  S+ E       N   +  +   EQIN T DT+DYLWYT +I +     +   L I
Sbjct: 451 YDSILSFSEWGQSDIINDCIINNESPLEQINLTNDTTDYLWYTTNITL----NETTTLTI 506

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNYGAWF 535
           E++     VF+N      G+    +           N  IN  L IL+M +GL+NY A  
Sbjct: 507 ENMYDFCHVFLNGAYQGNGWSPVAYITLE-----PTNGNINYQLQILTMTMGLENYAAHM 561

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
           +    GL   I +    G+ ++++ +W  + G+ GE + +     ++   W Q       
Sbjct: 562 ESYSRGLLGSISL----GQTNITNNQWSMKPGILGEKLQIYNEYSSSKVNW-QPYNPSAT 616

Query: 596 KSLIWYKTT-----FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
           +S+ WY+         +        LN+ SM KG  +VNG +IGRY+    A  + CT K
Sbjct: 617 QSMTWYQFNISLDGLSSDPSSNAYVLNMTSMNKGFVYVNGFNIGRYF-LMEATQSNCTLK 675

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGEN----LLVIHEELGGDPSKISLLT 705
            DY G Y  S  +  C +P+Q+LYHIP  W+   ++     +++ EE+ GDP+KI LL+
Sbjct: 676 QDYIGIYTPSNNRIDCNEPSQSLYHIPLDWLFLQQDKQYATVILFEEVNGDPTKIQLLS 734


>gi|14517399|gb|AAK62590.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
 gi|25090389|gb|AAN72290.1| At2g32810/F24L7.5 [Arabidopsis thaliana]
          Length = 585

 Score =  463 bits (1192), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 267/586 (45%), Positives = 347/586 (59%), Gaps = 46/586 (7%)

Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD- 328
           MYFGGTNFGRT+GGP   TSYDYDAP+DEYG   +PKWGHL++LH AIKLCE  L+++D 
Sbjct: 1   MYFGGTNFGRTSGGPFYITSYDYDAPLDEYGLRSEPKWGHLKDLHAAIKLCEPALVAADA 60

Query: 329 PTHQKLGAKLEAHIYHKSSND----CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPD 384
           P ++KLG+K EAHIYH         CAAFLAN D    A+V FNG  Y LP WSVSILPD
Sbjct: 61  PQYRKLGSKQEAHIYHGDGETGGKVCAAFLANIDEHKSAHVKFNGQSYTLPPWSVSILPD 120

Query: 385 CKNVVFNTAKVISQRNNGDHPFAQ---------QKNVNELLLASSAFSWY--EEKVGISG 433
           C++V FNTAKV +Q +      A+         QK V +  ++  + SW   +E +GI G
Sbjct: 121 CRHVAFNTAKVGAQTSVKTVESARPSLGSMSILQKVVRQDNVSYISKSWMALKEPIGIWG 180

Query: 434 NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-------GKEVFLNIESLGHAALVF 486
             +F    L E +N TKD SDYLW+   I V           G    ++I+S+     VF
Sbjct: 181 ENNFTFQGLLEHLNVTKDRSDYLWHKTRISVSEDDISFWKKNGPNSTVSIDSMRDVLRVF 240

Query: 487 VNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF-SV 545
           VNK+L     G+   A     + +   +G N L +L+  VGLQNYGA+ +  GAG     
Sbjct: 241 VNKQLAGSIVGHWVKAV----QPVRFIQGNNDLLLLTQTVGLQNYGAFLEKDGAGFRGKA 296

Query: 546 ILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS---LIWYK 602
            L   KNG  DLS   W YQVG++GE    DKI     +   + STL  + S    +WYK
Sbjct: 297 KLTGFKNGDLDLSKSSWTYQVGLKGE---ADKIYTVEHNEKAEWSTLETDASPSIFMWYK 353

Query: 603 TTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKC 662
           T F  P G  P+ LNL SMG+GQAWVNGQ IGRYW+  ++   GC + CDYRG+Y++ KC
Sbjct: 354 TYFDPPAGTDPVVLNLESMGRGQAWVNGQHIGRYWN-IISQKDGCDRTCDYRGAYNSDKC 412

Query: 663 QKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPP 722
             +CG+P QT YH+PR+W+ P  NLLV+ EE GG+P KIS+ T T   +C  VSE+  PP
Sbjct: 413 TTNCGKPTQTRYHVPRSWLKPSSNLLVLFEETGGNPFKISVKTVTAGILCGQVSESHYPP 472

Query: 723 VDSWKP------NLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-D 775
           +  W         + + S +P+V L CE G  I++I FASYG P G+C  F  G CH  +
Sbjct: 473 LRKWSTPDYINGTMSINSVAPEVHLHCEDGHVISSIEFASYGTPRGSCDGFSIGKCHASN 532

Query: 776 VLPIVQKACVGQIECSIPVS-SAYLGVSAGACPGLLKALAVEAHCS 820
            L IV +AC G+  C I VS +A++   +  C G LK LAV + CS
Sbjct: 533 SLSIVSEACKGRNSCFIEVSNTAFI---SDPCSGTLKTLAVMSRCS 575


>gi|413922056|gb|AFW61988.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
          Length = 326

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 207/299 (69%), Positives = 248/299 (82%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25  NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+RGQYYF  R+DLVRFVK  ++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT 
Sbjct: 85  PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTD 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PFK  M+ F+ KI+ +MK E LF  QGGPIILAQVENEYG +E   G G + Y  WAA
Sbjct: 145 NGPFKAAMQAFVEKIVSMMKSEGLFEWQGGPIILAQVENEYGPMESVMGAGAKPYANWAA 204

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
             AV     VPWVMC+Q+DAPDP+INTCNGFYCD F+PNS SKP MWTE ++GWF +FG 
Sbjct: 205 KMAVATGAGVPWVMCKQDDAPDPVINTCNGFYCDYFSPNSNSKPTMWTEAWTGWFTAFGG 264

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
           AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG
Sbjct: 265 AVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYG 323


>gi|226532830|ref|NP_001140495.1| uncharacterized protein LOC100272556 precursor [Zea mays]
 gi|194699714|gb|ACF83941.1| unknown [Zea mays]
 gi|195659509|gb|ACG49222.1| hypothetical protein [Zea mays]
 gi|414881558|tpg|DAA58689.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
          Length = 346

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 202/295 (68%), Positives = 246/295 (83%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R 
Sbjct: 30  TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90  QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K EM+ F  KI+D+MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP 
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
           RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324


>gi|328872959|gb|EGG21326.1| glycoside hydrolase family 35 protein [Dictyostelium fasciculatum]
          Length = 759

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 272/727 (37%), Positives = 406/727 (55%), Gaps = 69/727 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V YD R+L I+G+R+++ SGSIHYPRSTP +WP LI+KSK+ G+ +IETYVFWN H+P  
Sbjct: 46  VEYDQRSLKINGERKLMISGSIHYPRSTPSMWPSLIKKSKDAGINMIETYVFWNLHQPNN 105

Query: 65  GQYY-FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            Q Y FEG  ++  F+   Q+ GL++HLRIGPY CAEWNYGG P WL  IPGI FR  N 
Sbjct: 106 SQEYNFEGNANITHFLDLCQQEGLYVHLRIGPYVCAEWNYGGIPSWLRNIPGIVFRDYNQ 165

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P+  EM  ++  I++ +K    FAS GGPIILAQVENEYG +E  YG  G+LY +WA   
Sbjct: 166 PWMTEMASWMTFIVNYLKP--YFASNGGPIILAQVENEYGWLENEYGDSGKLYAEWAISF 223

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLSF 239
           A +LN  +PW MCQQ D  D  INTCNGFYC  +        P++P  +TEN++GW   +
Sbjct: 224 AKSLNIGIPWTMCQQNDIDDA-INTCNGFYCHDWIQYHFQVYPNQPAFFTENWAGWIQYY 282

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
              VP RP EDL ++VAR+F  GG+  NYYM+ GGT F R +    +  SYDYDA +DEY
Sbjct: 283 SEGVPHRPTEDLLYSVARWFSRGGSLMNYYMWHGGTTFARYS-STFLTNSYDYDAALDEY 341

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK-------LEAHIYHKSSN---D 349
           G+  +PK+  L +LH  +      L+SS    + +          +E   Y+ + N   +
Sbjct: 342 GYEAEPKYSALAQLHSVLSQYSYILLSSGEVARPVNISNITTCNTIEIIQYNTTINGTLE 401

Query: 350 CAAFLANYDSSSDANV--TFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA 407
              F+ N+  SS A V   +NG    +  WSV IL + + V+      + Q+ +    F 
Sbjct: 402 TITFVTNFGVSSSAPVQLNWNGQTITVNPWSVLILYNNQTVI--DTSYVKQQYSAQKEFY 459

Query: 408 QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDL-AEQINTTKDTSDYLWYTASIHVMP 466
           Q K V  +L++S     + E +G+    + V  +L +EQ++ T D +DYL          
Sbjct: 460 QSKRVKNVLVSS-----WTEPIGVGNYSNVVTANLPSEQLDLTLDQTDYL---------- 504

Query: 467 GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMV 526
                   N + + +   ++++ +  ++  G+   A+F+++ K  +  G + L ILS+ +
Sbjct: 505 -------CNADDMIY---IYIDGEYQSWSRGSP--AHFVLDTKFGI--GTHKLSILSLTM 550

Query: 527 GLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFW 586
           GL +YG+ F+    GL   + +    G +D+++  W  +  + GE  G+   S  + + W
Sbjct: 551 GLISYGSHFESYKRGLNGTVTL----GTQDITNNGWSMRPYLVGEMQGIQ--SNPHLTSW 604

Query: 587 KQGSTLPVNKSLIWYKTTFLAP---EGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAP 643
              + L +N+ L WYK   +     +     AL++  M KG   VNG SIGRYW   L  
Sbjct: 605 SINNELSINQPLTWYKLNLIIQSEIQDTSSFALDMIGMNKGFIIVNGNSIGRYW---LTL 661

Query: 644 STGCTKKCDYRGS-YDASKCQKHCGQPAQTLYHIPRTWVH--PGE-NLLVIHEELGGDPS 699
             GC   C+Y G  Y    C+  CG+P++  YH+P  +++  P + N +++ EEL GDP+
Sbjct: 662 GWGCGSGCNYTGDGYQGYLCRTGCGEPSERYYHVPNDYLYLEPNQLNEIIVFEELSGDPN 721

Query: 700 KISLLTK 706
            I L+ +
Sbjct: 722 SIQLVQR 728


>gi|449436076|ref|XP_004135820.1| PREDICTED: beta-galactosidase-like [Cucumis sativus]
          Length = 486

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 207/323 (64%), Positives = 261/323 (80%), Gaps = 3/323 (0%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            +VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP
Sbjct: 20  GSVTYDHKAIIINGRRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEP 79

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT N
Sbjct: 80  SPGKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPIWLKFVPGIAFRTDN 139

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F+ KI+D+MK E LF +QGGPIIL+Q+ENEYG VEW  G  G+ Y KWAA 
Sbjct: 140 APFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQIENEYGPVEWEIGAPGKSYTKWAAQ 199

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            AV L T VPWVMC+QEDAPDP+I+TCNGFYC+ F PN   KP +WTEN+SGW+ +FG  
Sbjct: 200 MAVGLKTGVPWVMCKQEDAPDPLIDTCNGFYCENFKPNQIYKPKIWTENWSGWYTAFGGP 259

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P+RP ED+AF+VARF + GG+  NYYMY GGTNFGRT+ G  V TSYD+DAPIDEYG +
Sbjct: 260 TPYRPPEDVAFSVARFIQNGGSLVNYYMYHGGTNFGRTS-GLFVTTSYDFDAPIDEYGLL 318

Query: 303 RQPKWG--HLRELHKAIKLCEEY 323
           R+P  G   L+ L++  +   +Y
Sbjct: 319 REPILGPVTLKGLNEGTRDMSKY 341



 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/166 (48%), Positives = 104/166 (62%), Gaps = 4/166 (2%)

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
           L  V L  L  G RD+S  +W Y+VG+ GE + L  +  +NS  W +GS     + L WY
Sbjct: 323 LGPVTLKGLNEGTRDMSKYKWSYKVGLRGEILNLYSVKGSNSVQWMKGSFQ--KQPLTWY 380

Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
           KTTF  P G  PLAL+++SM KGQ WVNG+SIGRY+  Y+A   G   KC Y G +   K
Sbjct: 381 KTTFNTPAGNEPLALDMSSMSKGQIWVNGRSIGRYFPGYIA--RGKCNKCSYTGFFTEKK 438

Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           C  +CG P+Q  YHIPR W+ P  NLL+I EE+GG+P  ISL+ +T
Sbjct: 439 CLWNCGGPSQKWYHIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 484


>gi|218201568|gb|EEC83995.1| hypothetical protein OsI_30162 [Oryza sativa Indica Group]
          Length = 1078

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 272/712 (38%), Positives = 376/712 (52%), Gaps = 90/712 (12%)

Query: 129  MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
            MK+F+  I++ +K+  LFASQGGPIILAQ+ENEY ++E A+   G  Y+ WAA  A+  N
Sbjct: 426  MKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWAAKMAIATN 485

Query: 189  TSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGYAVPF 245
            T VPW+MC+Q  AP  +I TCNG +C G T   P    KP++WTEN++  +  FG     
Sbjct: 486  TGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYRVFGDPPSQ 544

Query: 246  RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQP 305
            R  ED+AF+VARFF  GGT  NYYMY GGTNFGR     ++   YD +AP+DE+G  ++P
Sbjct: 545  RSAEDIAFSVARFFSVGGTMANYYMYHGGTNFGRNGAAFVMPRYYD-EAPLDEFGLYKEP 603

Query: 306  KWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYH-KSSNDCAAFLANYDSSSDAN 364
            KWGHLR+LH A++ C++ L+  +P+ Q LG   EA ++  K  N C AFL+N+++  D  
Sbjct: 604  KWGHLRDLHHALRHCKKALLWGNPSVQPLGKLYEARVFEMKEKNVCVAFLSNHNTKEDGT 663

Query: 365  VTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ---QKNVNELLLASSA 421
            VTF G  YF+   S+SIL DCK VVF+T  V SQ N     FA    Q NV E+      
Sbjct: 664  VTFRGQKYFVARRSISILADCKTVVFSTQHVNSQHNQRTFHFADQTVQDNVWEM------ 717

Query: 422  FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGH 481
              + EEK+      S       EQ N TKD +DYLWYT S            L  + L +
Sbjct: 718  --YSEEKIPRYSKTSIRTQRPLEQYNQTKDKTDYLWYTTSFR----------LETDDLPY 765

Query: 482  AALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG 541
                 V   L   G G     +F + K ++L  G+N + ILS  +GL + G++ +   AG
Sbjct: 766  RKE--VKPVLEGAGTGRRSTRSFTMEKAMDLKVGVNHVAILSSTLGLMDSGSYLEHRMAG 823

Query: 542  LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
            +++V +  L  G  DL++  W       G   G D                  N+ L WY
Sbjct: 824  VYTVTIRGLNTGTLDLTTNGW-------GHVPGKD------------------NQPLTWY 858

Query: 602  KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
            +  F  P G  P+ ++L  MGKG  +VNG+ +GRYW +Y                     
Sbjct: 859  RRRFDPPSGTDPVVIDLTPMGKGFLFVNGEGLGRYWVSY--------------------- 897

Query: 662  CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
                 G+P+Q LYH+PR+ + P  N L+  EE GG P  I +LT    +IC+F++E +P 
Sbjct: 898  -HHALGKPSQYLYHVPRSLLRPKGNTLMFFEEEGGKPDAIMILTVKRDNICTFMTEKNPA 956

Query: 722  PVD-SWKPN-----------LGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRP 769
             V  SW+              G     P   L+C     I ++ FASYG P G CG++  
Sbjct: 957  HVRWSWESKDSQPKAVAGAGAGAGGLKPTAVLSCPTKKTIQSVVFASYGNPLGICGNYTV 1016

Query: 770  GACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            G+CH      +V+KAC+G+  CS+ VSS   G     CPG    LAV+A CS
Sbjct: 1017 GSCHAPRTKEVVEKACIGRKTCSLVVSSEVYGGDV-HCPGTTGTLAVQAKCS 1067



 Score =  357 bits (915), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 190/427 (44%), Positives = 250/427 (58%), Gaps = 75/427 (17%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +TYD R+L+IDG R +  SGSIHYPRS P+ WP+LI K+KEGGL VIE+YVFWN HEP +
Sbjct: 33  ITYDRRSLIIDGHREIFFSGSIHYPRSPPDTWPDLISKAKEGGLNVIESYVFWNGHEPEQ 92

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF----IPGIQFRT 120
           G Y FEGR+DL++F K +QE  ++  +RIGP+  AEWN+G      H     IP I FRT
Sbjct: 93  GVYNFEGRYDLIKFFKLIQEKEMYAIVRIGPFVQAEWNHG---FVCHIGSGEIPDIIFRT 149

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ MK+F+  I++ +K+  LFASQGGPIILAQ+ENEY ++E A+   G  Y+ WA
Sbjct: 150 NNEPFKKYMKQFVTLIVNKLKEAKLFASQGGPIILAQIENEYQHLEVAFKEAGTKYINWA 209

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFL 237
           A  A+  NT VPW+MC+Q  AP  +I TCNG +C G T   P    KP++WTEN++  + 
Sbjct: 210 AKMAIATNTGVPWIMCKQTKAPGEVIPTCNGRHC-GDTWPGPADKKKPLLWTENWTAQYR 268

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYM--------------------------- 270
            FG     R  ED+AF+VARFF  GGT  NYYM                           
Sbjct: 269 VFGDPPSQRSAEDIAFSVARFFSVGGTMANYYMVVLNSNSNLFLTKKRDEISDRTDTGGF 328

Query: 271 -------YFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEY 323
                  Y GGTNFGR     ++   YD +AP+DE+G  ++PKWGHLR+LH A++ C++ 
Sbjct: 329 TCVNNQQYHGGTNFGRNGAAFVMPRYYD-EAPLDEFGLYKEPKWGHLRDLHHALRHCKKA 387

Query: 324 LISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILP 383
           L+  +P+ Q LG KL                              G  YF+   S+SIL 
Sbjct: 388 LLWGNPSVQPLG-KLT----------------------------RGQKYFVARRSISILA 418

Query: 384 DCKNVVF 390
           DCK V +
Sbjct: 419 DCKTVKY 425


>gi|238009746|gb|ACR35908.1| unknown [Zea mays]
          Length = 346

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 201/295 (68%), Positives = 245/295 (83%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R 
Sbjct: 30  TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI  RT N PF
Sbjct: 90  QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISLRTDNEPF 149

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K EM+ F  KI+D+MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ AV
Sbjct: 150 KAEMQNFTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 209

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP 
Sbjct: 210 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 269

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
           RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 270 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 324


>gi|66808929|ref|XP_638187.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
           AX4]
 gi|74853739|sp|Q54MV6.1|BGAL2_DICDI RecName: Full=Probable beta-galactosidase 2; Short=Lactase 2;
           Flags: Precursor
 gi|60466604|gb|EAL64656.1| glycoside hydrolase family 35 protein [Dictyostelium discoideum
           AX4]
          Length = 761

 Score =  457 bits (1176), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 271/741 (36%), Positives = 420/741 (56%), Gaps = 62/741 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+L+I+G+R++L SGSIHYPR++ E+WP ++++SK+ G+++I+TY+FWN H+P  
Sbjct: 40  VTYDGRSLIINGERKLLFSGSIHYPRTSEEMWPIILKQSKDAGIDIIDTYIFWNIHQPNS 99

Query: 65  -GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
             +YYF+G  ++ +F+   +E  L+++LRIGPY CAEW YGGFP+WL  IP I +R  N 
Sbjct: 100 PSEYYFDGNANITKFLDLCKEFDLYVNLRIGPYVCAEWTYGGFPIWLKEIPNIVYRDYNQ 159

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +  EM  ++  ++  +  +N FA  GGPIILAQVENEYG +E  YG+ G  Y KW+ D 
Sbjct: 160 QWMNEMSIWMEFVVKYL--DNYFAPNGGPIILAQVENEYGWLEQEYGINGTEYAKWSIDF 217

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLSF 239
           A +LN  +PW+MCQQ D  +  INTCNG+YC  +  +     P++P  WTEN+ GWF ++
Sbjct: 218 AKSLNIGIPWIMCQQNDI-ESAINTCNGYYCHDWISSHWEQFPNQPSFWTENWIGWFENW 276

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G A P RPV+D+ ++ ARF   GG+  NYYM+FGGTNFGRT+GGP + TSYDYDAP+DE+
Sbjct: 277 GQAKPKRPVQDILYSNARFIAYGGSLINYYMWFGGTNFGRTSGGPWIITSYDYDAPLDEF 336

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK--LGAKLEAHIYHKSSNDCAAFLANY 357
           G   +PK+    + H+ +   E  L+++ P      L   +E H Y  +     +F+ NY
Sbjct: 337 GQPNEPKFSLSSKFHQVLHAIESDLLNNQPPKSPTFLSQFIEVHQYGIN----LSFITNY 392

Query: 358 DSSSDANVT-FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
            +S+   +  +    Y +  WSV I+ + + ++F+T+ +       ++     K +N+ +
Sbjct: 393 GTSTTPKIIQWMNQTYTIQPWSVLIIYNNE-ILFDTSFIPPNTLFNNNTINNFKPINQNI 451

Query: 417 LAS----SAF---SWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG 469
           + S    S F   S      G   + + V P   EQ+  TKDTSDY WY+ ++       
Sbjct: 452 IQSIFQISDFNLNSGGGGGDGDGNSVNSVSP--IEQLLITKDTSDYCWYSTNVTTTSLSY 509

Query: 470 KE---VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINT----LDIL 522
            E   +FL I        +F++ +     Y    F+  L   +++LN   N+    L IL
Sbjct: 510 NEKGNIFLTITEFYDYVHIFIDNE-----YQGSAFSPSLC--QLQLNPINNSTTFQLQIL 562

Query: 523 SMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLAN 582
           SM +GL+NY +  +    G+   ILI    G ++L++ +W+ + G+ GE I +   +  N
Sbjct: 563 SMTIGLENYASHMENYTRGILGSILI----GSQNLTNNQWLMKSGLIGENIKI--FNNDN 616

Query: 583 SSFWKQGSTLP----VNKSLIWYKTTF----LAPEGKGPL-ALNLASMGKGQAWVNGQSI 633
           +  W+   +      + K L WYK       L  +    + AL+++SM KG  WVNG SI
Sbjct: 617 TINWQTSPSSSSSSLIQKPLTWYKLNISLVGLPIDISSTVYALDMSSMNKGMIWVNGYSI 676

Query: 634 GRYWSAYLAPST---GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGE----- 685
           GRYW      S       +   Y G YD S  +  C +P+Q++Y +P  W+         
Sbjct: 677 GRYWLIEATQSICNQSAIENYSYIGEYDPSNYRIDCNKPSQSIYSVPIDWLFNNNYNNQY 736

Query: 686 NLLVIHEELGGDPSKISLLTK 706
             ++I EEL G+P++I LL+ 
Sbjct: 737 ATIIIIEELNGNPNEIQLLSN 757


>gi|449468694|ref|XP_004152056.1| PREDICTED: beta-galactosidase 7-like [Cucumis sativus]
          Length = 338

 Score =  457 bits (1175), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 201/321 (62%), Positives = 255/321 (79%), Gaps = 1/321 (0%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  NV+YD  AL+I+G+RR++ SGSIHYPRST  +WP+LI+K+K+GGL+ IETY+FW+ H
Sbjct: 18  IGDNVSYDSNALIINGERRIIFSGSIHYPRSTEAMWPDLIQKAKDGGLDAIETYIFWDRH 77

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP R +Y F GR D ++F + +Q+AGL++ +RIGPY CAEWNYGGFPVWLH +PGIQ RT
Sbjct: 78  EPQRRKYDFSGRLDFIKFFQLIQDAGLYVVMRIGPYVCAEWNYGGFPVWLHNMPGIQLRT 137

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEW-AYGVGGELYVKW 179
            N  +K EM+ F  KI+++ KQ NLFASQGGPIILAQ+ENEYGNV   AYG  G+ Y+ W
Sbjct: 138 NNQVYKNEMQTFTTKIVNMCKQANLFASQGGPIILAQIENEYGNVMTPAYGDAGKAYINW 197

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
            A  A +LN  VPW+MCQQ DAP P+INTCNGFYCD FTPN+P  P M+TEN+ GWF  +
Sbjct: 198 CAQMAESLNIGVPWIMCQQSDAPQPMINTCNGFYCDNFTPNNPKSPKMFTENWVGWFKKW 257

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G   P+R  ED+AF+VARFF++GG F NYYMY GGTNFGRT+GGP + TSYDY+AP+DEY
Sbjct: 258 GDKDPYRTAEDVAFSVARFFQSGGVFNNYYMYHGGTNFGRTSGGPFITTSYDYNAPLDEY 317

Query: 300 GFIRQPKWGHLRELHKAIKLC 320
           G + QPKWGHL++LH +I +C
Sbjct: 318 GNLNQPKWGHLKQLHASIXIC 338


>gi|413954365|gb|AFW87014.1| beta-galactosidase [Zea mays]
          Length = 473

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 237/489 (48%), Positives = 309/489 (63%), Gaps = 25/489 (5%)

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV 286
           MWTE ++GWF +FG AVP RPVED+AFAVARF + GG+F NYYMY GGTNF RT+GGP +
Sbjct: 1   MWTEAWTGWFTAFGGAVPHRPVEDMAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFI 60

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKS 346
           ATSYDYDAPIDEYG +RQPKWGHLR+LHKAIK  E  L+S DPT Q LG   +A+++  S
Sbjct: 61  ATSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSS 120

Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF 406
              CAAFL+NY +S+ A V FNG  Y LPAWS+S+LPDCK  VFNTA V         P 
Sbjct: 121 GGACAAFLSNYHTSAAARVVFNGRRYDLPAWSISVLPDCKAAVFNTATV-------SEPS 173

Query: 407 AQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
           A  +     +  +  FSW  Y E       R+F +  L EQ++ T D SDYLWYT  +++
Sbjct: 174 APAR-----MSPAGGFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNI 228

Query: 465 MPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
              +     G+   L I S GH+  VFVN +     YG +D      +  +++ +G N +
Sbjct: 229 NSNEQFLKSGQWPQLTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKI 288

Query: 520 DILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
            ILS  VGL N G  ++    G+   V L  L  GKRDLS  +W YQ+G+ GE +G+  +
Sbjct: 289 SILSAAVGLPNQGTHYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSV 348

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
           + ++S  W   +     + L W+K  F AP G  P+AL++ SMGKGQAWVNG+ IGRYWS
Sbjct: 349 AGSSSVEWGSAAG---KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS 405

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
            Y A S+GC   C Y G+Y  +KCQ  CG  +Q  YH+PR+W++P  NLLV+ EE GGD 
Sbjct: 406 -YKASSSGC-GGCSYAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDL 463

Query: 699 SKISLLTKT 707
           S + L+T+T
Sbjct: 464 SGVKLVTRT 472


>gi|255550371|ref|XP_002516236.1| beta-galactosidase, putative [Ricinus communis]
 gi|223544722|gb|EEF46238.1| beta-galactosidase, putative [Ricinus communis]
          Length = 775

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 259/628 (41%), Positives = 359/628 (57%), Gaps = 43/628 (6%)

Query: 206 INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
           INTCNG+YCD F PN+P  P M+TEN+SGW+  +G    +R  ED+AF+VARF + GG F
Sbjct: 164 INTCNGYYCDTFKPNNPKSPKMFTENWSGWYKLWGGKTSYRTAEDMAFSVARFVQAGGVF 223

Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
            NYYMY+GGTNFGRTAGGP +  SYDYD+P+DEYG + QPKWGHL++LH +IKL E+ + 
Sbjct: 224 NNYYMYYGGTNFGRTAGGPYITASYDYDSPLDEYGNLNQPKWGHLKQLHASIKLGEKIIT 283

Query: 326 SSDPTHQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDANVTF--NGNVYFLPAWSVSIL 382
           +   T +   A ++   Y + ++ +   FL+N +  +DA++    +GN Y +PAWSVSIL
Sbjct: 284 NGTVTIKNFQAGVDLTAYTNNATRERFCFLSNIN-IADAHIDLQQDGN-YTIPAWSVSIL 341

Query: 383 PDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEE--KVGISGNRSFVRP 440
            +C   +FNTAKV +Q +       +      L     ++ W  E  K  + G   F   
Sbjct: 342 QNCSKEIFNTAKVNTQTSLMVKKLYENDKPTNL-----SWVWAPEPMKDTLLGKGRFRTS 396

Query: 441 DLAEQINTTKDTSDYLWYTASIHVMPG--QGKEVFLNIESLGHAALVFVNKKLVAFGYGN 498
            L +Q  TT D SDYLWY  S  +     Q   V L + S GH    +VNKKL+  G   
Sbjct: 397 QLLDQKETTVDASDYLWYMTSFDMNKNTLQWTNVTLRVTSRGHVLHAYVNKKLIV-GSQL 455

Query: 499 HDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGK--RD 556
                F   K + L  G N + +LS  VGL NYG++FD    G+    +  + NGK   D
Sbjct: 456 VIQGEFTFEKPVTLKPGNNVISLLSATVGLANYGSFFDKTPVGIVDGPVQLMANGKPVMD 515

Query: 557 LSSGEWIYQVGVEGEYIGL-DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLA 615
           LSS  W Y++G+ GE     D  S  N   W   + +   + + WYKTTF +P G  P+ 
Sbjct: 516 LSSNLWSYKIGLNGEAKRFYDPTSRHNK--WSAANGVSTARPMTWYKTTFSSPSGTDPVV 573

Query: 616 LNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYH 675
           ++L  MGKG AW NG+S+GRYW + +A + GC+  CDYRG Y+A KC ++CG P Q  YH
Sbjct: 574 VDLQGMGKGHAWANGKSLGRYWPSQIANANGCSGTCDYRGPYNAGKCTRNCGIPTQRWYH 633

Query: 676 IPRTWVHP-GENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVS 734
           +PR++++  G+N L++ EE+GGDPS IS    T + IC    E                 
Sbjct: 634 VPRSFLNSNGKNTLILFEEVGGDPSGISFQIVTTETICGNAYEGS--------------- 678

Query: 735 SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIP 793
               + L+C+ G  I+ I FASYG P+G C SF+ G+   M+ + +VQK CVG+  CSI 
Sbjct: 679 ---TLELSCQGGRTISEIQFASYGNPQGTCSSFKKGSFDAMNSVQMVQKECVGKDSCSII 735

Query: 794 VSSAYLGVSAGACPGLL-KALAVEAHCS 820
            S     V+     G+  K LAV+AHCS
Sbjct: 736 ASDETFMVNEPQ--GISNKRLAVQAHCS 761



 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 78/133 (58%), Positives = 103/133 (77%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  V YD  AL+I+G+R+++ SG+IHYPRSTPE+WPELI K+K+GGL+ IETYVFW+ HE
Sbjct: 22  ATTVEYDSNALIINGERKIIFSGAIHYPRSTPEMWPELINKAKDGGLDAIETYVFWDRHE 81

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+R QY F G  D+V+F + +QEAGL++ LRIGPY CAEWNYGGFP+WLH  PG++ RT 
Sbjct: 82  PVRRQYDFSGNLDIVKFFRVIQEAGLYVILRIGPYVCAEWNYGGFPMWLHNTPGVELRTD 141

Query: 122 NNPFKEEMKRFLA 134
           N  +K  +  F  
Sbjct: 142 NEIYKVPLLIFFV 154


>gi|414881559|tpg|DAA58690.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
          Length = 342

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 200/295 (67%), Positives = 243/295 (82%), Gaps = 4/295 (1%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           TYD +A+V++G+RR+L SGSIHYPRS PE+WP+LI+K+K+GGL+V++TYVFWN HEP R 
Sbjct: 30  TYDRKAVVVNGQRRILMSGSIHYPRSVPEMWPDLIQKAKDGGLDVVQTYVFWNGHEPSRR 89

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QYYFEGR+DLV F+K V++AGL++HLRIGPY CAEWN+GGFPVWL ++PGI FRT N PF
Sbjct: 90  QYYFEGRYDLVHFIKLVKQAGLYVHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPF 149

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
           K     F  KI+D+MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ AV
Sbjct: 150 KN----FTTKIVDMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAV 205

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
            LNTSVPWVMC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP 
Sbjct: 206 ALNTSVPWVMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTSWYTGFGIPVPH 265

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
           RPVEDLA+ VA+F + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAPIDEYG
Sbjct: 266 RPVEDLAYGVAKFIQKGGSFVNYYMYHGGTNFGRTAGGPFIATSYDYDAPIDEYG 320


>gi|3850659|emb|CAA10064.1| beta galactosidase [Carica papaya]
          Length = 347

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 209/358 (58%), Positives = 253/358 (70%), Gaps = 15/358 (4%)

Query: 104 GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG 163
           GGFPVWL ++PGI FRT N PFK  M++F  KI+ +MK E LF +QGGPIIL+Q+ENE+G
Sbjct: 1   GGFPVWLKYVPGIAFRTDNEPFKAAMQKFTEKIVSMMKAEKLFQTQGGPIILSQIENEFG 60

Query: 164 NVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPS 223
            VEW  G  G+ Y KWAA  AV L+T VPW+MC+QEDAPDP+I+TCNGFYC+ F PN   
Sbjct: 61  PVEWEIGAPGKAYTKWAAQMAVGLDTGVPWIMCKQEDAPDPVIDTCNGFYCENFKPNKDY 120

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
           KP MWTE ++GW+  FG AVP RP ED+AF+VARF + GG+F NYYMY GGTNFGRTAGG
Sbjct: 121 KPKMWTEVWTGWYTEFGGAVPTRPAEDVAFSVARFIQGGGSFLNYYMYHGGTNFGRTAGG 180

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
           P +ATSYDYDAP+DEYG  R+PKWGHLR+LHKAIK CE  L+S DP+  KLG+  EAH++
Sbjct: 181 PFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSCESALVSVDPSVTKLGSNQEAHVF 240

Query: 344 HKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD 403
            KS +DCAAFLANYD+     V+F G  Y LP WS+SILPDCK  V+NTAKV SQ +   
Sbjct: 241 -KSESDCAAFLANYDAKYSVKVSFGGGQYDLPPWSISILPDCKTEVYNTAKVGSQSSQ-- 297

Query: 404 HPFAQQKNVNELLLASSAFSWYE---EKVGISGNRSFVRPDLAEQINTTKDTSDYLWY 458
                     ++    S F W     E        +     L EQIN T+DT+DYLWY
Sbjct: 298 ---------VQMTPVHSGFPWQSFIEETTSSDETDTTTLDGLYEQINITRDTTDYLWY 346


>gi|328873276|gb|EGG21643.1| hypothetical protein DFA_01529 [Dictyostelium fasciculatum]
          Length = 827

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 262/725 (36%), Positives = 389/725 (53%), Gaps = 43/725 (5%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD+RA++I+G+R++L S SIHYPRST  +WP++++++K  G+  IETY+FWN H+P  
Sbjct: 32  VSYDNRAIIINGERKLLYSASIHYPRSTRTMWPDILKRTKAAGINTIETYIFWNLHQPTP 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
             Y FEG  D+  F+   +E G  + +R GPY CAEWN GG P WL  +PGI +RT N P
Sbjct: 92  DTYDFEGSSDVKHFLDLCKEEGFHVIVRFGPYVCAEWNNGGLPSWLKAVPGIVYRTHNEP 151

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG-VGGELYVKWAADT 183
           F  EMK+++  I+  +   + +A  GGPII+AQ+ENEYG +E+ Y   GG  YV WA   
Sbjct: 152 FMREMKKWMDYIVHYLS--DYYAPNGGPIIMAQIENEYGWLEYEYREQGGPEYVDWAVKL 209

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
           A + NT +PW+MCQQ    D +INTCNGFYC  +        P +P  +TE ++GW   F
Sbjct: 210 AKSYNTGIPWIMCQQNTRSD-VINTCNGFYCHDWLQYHQRTFPDQPAFFTELWTGWPQYF 268

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
               P RP  D+ ++ ARF+  GG   NYYM+ GGT FGR    P + TSYDYDAP+DEY
Sbjct: 269 EEGFPTRPTVDVLYSAARFYSRGGGMVNYYMWHGGTTFGRFT-SPFLTTSYDYDAPLDEY 327

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKSSNDCAAFLANY 357
           GF ++PK+  L +LH  ++     ++     P            I +K   +   FL N+
Sbjct: 328 GFPQEPKYSMLTKLHVTLEKYSSVILHDPNVPPPYVFPDNTVEMIEYKKDAESVVFLVNW 387

Query: 358 DSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF---------AQ 408
           D +    V  NG    +  WSV I  +   +VF+T ++ +     + PF         A 
Sbjct: 388 DDTFAKQVDMNGKNVKINQWSVQIYYN-NELVFDTFEIPANLTRPNPPFKPIAKTSLDAT 446

Query: 409 QKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ 468
               +   L +   SW E    ++ N S   P    Q+  T D SDY+WY   I +   +
Sbjct: 447 AAATSRTGLVNLVSSWNEPFSFLTYNASSQTP--TAQLKLTGDNSDYIWYETEIDLT--K 502

Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
             E+    +S    + VFV+ + + +  G+   A F  N K  +  G +TL IL   +G+
Sbjct: 503 TDEILYLYKSYDF-SYVFVDGQFLYWHRGSPIQAYF--NGKFPV--GKHTLQILCAAMGV 557

Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
            +YGA  +    GL   I +    G ++++   W  +  + GE +GL   +  ++  W  
Sbjct: 558 PSYGAHIEQHERGLTGDIFL----GSKNITDNGWKMRPFLSGELLGLH--ASPSTVKWSP 611

Query: 589 GSTLPVNKSLIWYKTTFLAPEGK-GP-LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
            S       + WYK     P  + GP  AL+L SM KG  +VNG SIGRYW A       
Sbjct: 612 VSKGTAGSGVTWYKFNVKTPSFEDGPAFALDLKSMWKGLVFVNGNSIGRYWVA----KGW 667

Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV-HPGENLLVIHEELGGDPSKISLLT 705
           C +KC+  G YD   C+++CG+ +Q  YH+P+ ++    +N ++I EEL GDP  I L+ 
Sbjct: 668 CEEKCNQTGLYDNYGCRENCGESSQRYYHVPKDFLKESSDNEVIIFEELQGDPYSIELVQ 727

Query: 706 KTGQH 710
           +  ++
Sbjct: 728 RNTEY 732


>gi|217075793|gb|ACJ86256.1| unknown [Medicago truncatula]
          Length = 268

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 192/251 (76%), Positives = 218/251 (86%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
              NV YDHRALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GGL+VIETYVFWN H
Sbjct: 18  FCTNVDYDHRALVIDGKRRVLISGSIHYPRSTPQMWPDLIQKSKDGGLDVIETYVFWNLH 77

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP++GQY F+GR DLV+FVK V EAGL++HLRIGPY CAEWNYGGFP+WLHFIPGI+FRT
Sbjct: 78  EPVKGQYDFDGRKDLVKFVKAVAEAGLYVHLRIGPYVCAEWNYGGFPLWLHFIPGIKFRT 137

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK EMKRF AKI+DLMKQE L+ASQGGPIIL+Q+ENEYGN++  YG  G+ Y+ WA
Sbjct: 138 DNEPFKAEMKRFTAKIVDLMKQEKLYASQGGPIILSQIENEYGNIDSHYGSAGKSYINWA 197

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A +L+T VPWVMCQQ DAPDPIINTCNGFYCD FTPNS +KP MWTEN+SGWFLSFG
Sbjct: 198 AKMATSLDTGVPWVMCQQGDAPDPIINTCNGFYCDQFTPNSNTKPKMWTENWSGWFLSFG 257

Query: 241 YAVPFRPVEDL 251
            AVP RPVE L
Sbjct: 258 GAVPHRPVEIL 268


>gi|281209972|gb|EFA84140.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
           PN500]
          Length = 707

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 244/627 (38%), Positives = 364/627 (58%), Gaps = 36/627 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +S  VTYD R+L+I+G+R++  SGS+HYPRSTP +W +++  SK  G+ +I+TYVFW+ H
Sbjct: 104 VSYKVTYDGRSLLINGERKLFVSGSVHYPRSTPTIWKKVLALSKNSGINMIDTYVFWDLH 163

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP RG Y FEG  +L  F+   Q+ GLF++LRIGPY CAEWNYGG P+WL  IPGI+ R 
Sbjct: 164 EPQRGVYNFEGNANLKHFLDLCQQNGLFVNLRIGPYICAEWNYGGLPIWLKDIPGIKMRD 223

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N  + EE++R++  I+D +     FA QGGPI+LAQ+ENEY  V+W Y   G  +  W 
Sbjct: 224 FNTQYMEEVERWMKFIVDYL--HGYFAPQGGPIVLAQIENEYNWVQWRYQESGRKFAHWC 281

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT----PNSPSKPIMWTENYSGWF 236
           AD A  L+  +PW+MCQQ+D P  +INTCNG+YC  +      N   +P ++TEN+SGWF
Sbjct: 282 ADLANRLDIGIPWIMCQQDDIPT-VINTCNGYYCHEWINFHWNNFKDQPPLFTENWSGWF 340

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
            ++  AV  RPV DL ++ AR+F +GG   NYYM+ GGTNFGR + GP++A SYDYDAP+
Sbjct: 341 NNWVNAVRHRPVADLLYSAARWFASGGALMNYYMWHGGTNFGRKS-GPMIALSYDYDAPL 399

Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLAN 356
           +EYG  R PK+   R+ +K I   E+ L+S  P      A   + I++++ N+ A+F+ N
Sbjct: 400 NEYGNPRNPKYSQTRDFNKLILSLEDILLSQYPPTPIFLANNISVIHYRNGNNSASFIIN 459

Query: 357 YDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELL 416
            + + ++ V F G  YF  A+SV IL +  + VF++++  + RN  D     + N+    
Sbjct: 460 SNENGNSKVMFEGRSYFSYAYSVQILKNYVS-VFDSSQ--NPRNYTDTVVESEPNIP--- 513

Query: 417 LASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI-HVMPGQGKEVFLN 475
            A+S  S + E+       S     L EQ+N TKD +DY+WYT  I H   G+  +V +N
Sbjct: 514 FANSIISKHVERFDFE--ESLYDNRLMEQLNLTKDETDYIWYTTMINHDQDGEILKV-IN 570

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
              + H   VFV+   V         ++ L    + L  G +TL +L   +G+Q+Y    
Sbjct: 571 KTDIVH---VFVDSYYVG-----TIMSDSLAITGVPL--GPSTLQLLHTKMGIQHYELHM 620

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS---LANSSFWKQGSTL 592
           +   AG+   +      G  ++++  W  +  V  E +  D I    +  S   ++ + +
Sbjct: 621 ENTKAGILGPVYY----GDIEITNQMWGSKPFVSSEKVITDPIQSKFVRWSPLDRKPNEV 676

Query: 593 PVNKSLIWYK-TTFLAPEGKGPLALNL 618
             +  L WYK   F+  E K P +L L
Sbjct: 677 FYSVPLTWYKFIFFIDSEAKLPTSLAL 703


>gi|115480419|ref|NP_001063803.1| Os09g0539200 [Oryza sativa Japonica Group]
 gi|113632036|dbj|BAF25717.1| Os09g0539200 [Oryza sativa Japonica Group]
          Length = 446

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 197/393 (50%), Positives = 267/393 (67%), Gaps = 3/393 (0%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L+IDGKR +  SG+IHYPRS PE+W +L++ +K GGL  IETYVFWN HEP  
Sbjct: 36  VSYDERSLMIDGKRDLFFSGAIHYPRSPPEMWDKLVKTAKMGGLNTIETYVFWNGHEPEP 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G+YYFEGRFDL+RF+  +++  ++  +RIGP+  AEWN+GG P WL  I  I FR  N P
Sbjct: 96  GKYYFEGRFDLIRFLNVIKDNDMYAIVRIGPFIQAEWNHGGLPYWLREIGHIIFRANNEP 155

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FK EM++F+  I+  +K   +FA QGGPIIL+Q+ENEYGN++    V G+ Y++WAA+ A
Sbjct: 156 FKREMEKFVRFIVQKLKDAEMFAPQGGPIILSQIENEYGNIKKDRKVEGDKYLEWAAEMA 215

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           ++    VPWVMC+Q  AP  +I TCNG +C D +T    +KP +WTEN++  F +FG  +
Sbjct: 216 ISTGIGVPWVMCKQSIAPGEVIPTCNGRHCGDTWTLLDKNKPRLWTENWTAQFRTFGDQL 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
             R  ED+A+AV RFF  GGT  NYYMY GGTNFGRT G   V T Y  +AP+DEYG  +
Sbjct: 276 AQRSAEDIAYAVLRFFAKGGTLVNYYMYHGGTNFGRT-GASYVLTGYYDEAPMDEYGMCK 334

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSD 362
           +PK+GHLR+LH  IK   +  +    + + LG   EAH Y    +  C +FL+N ++  D
Sbjct: 335 EPKFGHLRDLHNVIKSYHKAFLWGKQSFEILGHGYEAHNYELPEDKLCLSFLSNNNTGED 394

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV 395
             V F G  +++P+ SVSIL DCK VV+NT +V
Sbjct: 395 GTVVFRGEKFYVPSRSVSILADCKTVVYNTKRV 427


>gi|195615772|gb|ACG29716.1| beta-galactosidase precursor [Zea mays]
          Length = 450

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 217/465 (46%), Positives = 288/465 (61%), Gaps = 24/465 (5%)

Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHL 310
           +AFAVARF + GG+F NYYMY GGTNF RT+GGP +ATSYDYDAPIDEYG +RQPKWGHL
Sbjct: 1   MAFAVARFIQKGGSFVNYYMYHGGTNFDRTSGGPFIATSYDYDAPIDEYGLLRQPKWGHL 60

Query: 311 RELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGN 370
           R+LHKAIK  E  L+S DPT Q LG   +A+++  S   CAAFL+NY +S+ A V FNG 
Sbjct: 61  RDLHKAIKQAEPALVSGDPTIQSLGNYEKAYVFKSSGGACAAFLSNYHTSAAARVVFNGR 120

Query: 371 VYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEK 428
            Y LPAWS+S+LPDCK  VFNTA V         P A  +     +  +  FSW  Y E 
Sbjct: 121 RYDLPAWSISVLPDCKAAVFNTATV-------SEPSAPAR-----MSPAGGFSWQSYSEA 168

Query: 429 VGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAA 483
                 R+F +  L EQ++ T D SDYLWYT  +++   +     G+   L + S GH+ 
Sbjct: 169 TNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQLTVYSAGHSL 228

Query: 484 LVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLF 543
            VFVN +     YG +D      +  +++ +G N + ILS  VGL N G  ++    G+ 
Sbjct: 229 QVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGTHYETWNVGVL 288

Query: 544 S-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYK 602
             V L  L  GKRDLS+ +W YQ+G+ GE +G+  ++ ++S  W   +     + L W+K
Sbjct: 289 GPVTLSGLNEGKRDLSNQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG---KQPLTWHK 345

Query: 603 TTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKC 662
             F AP G  P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+G    C Y G+Y  +KC
Sbjct: 346 AYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGGCGGCSYAGTYSETKC 404

Query: 663 QKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           Q  CG  +Q  YH+PR+W++P  NLLV+ EE GGD   + L+T+T
Sbjct: 405 QTGCGDVSQRYYHVPRSWLNPSGNLLVLLEEFGGDLPGVKLVTRT 449


>gi|414881560|tpg|DAA58691.1| TPA: hypothetical protein ZEAMMB73_223728 [Zea mays]
          Length = 655

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 230/547 (42%), Positives = 311/547 (56%), Gaps = 34/547 (6%)

Query: 283 GPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
           G  V   Y  D  +   G +R+PKWGHL+ELHKAIKLCE  L++ DP    LG   +A +
Sbjct: 132 GADVQMPYRLDHILVADGLLREPKWGHLKELHKAIKLCEPALVAGDPIVTSLGNAQQASV 191

Query: 343 YHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNG 402
           +  S++ C AFL N D  S A V+FNG  Y LP WS+SILPDCK  V+NTA V SQ    
Sbjct: 192 FRSSTDACVAFLENKDKVSYARVSFNGMHYDLPPWSISILPDCKTTVYNTASVGSQ---- 247

Query: 403 DHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
               +Q K     +  +  F+W  Y E +   G+ SF    L EQIN T+D +DYLWYT 
Sbjct: 248 ---ISQMK-----MEWAGGFTWQSYNEDINSLGDESFATVGLLEQINVTRDNTDYLWYTT 299

Query: 461 SIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEG 515
            + +   +     GK   L + S GHA  +FVN +L    YG+ +      +  ++L  G
Sbjct: 300 YVDIAQDEQFLSNGKNPMLTVMSAGHALHIFVNGQLTGTVYGSVEDPKLTYSGNVKLWSG 359

Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIG 574
            NT+  LS+ VGL N G  F+   AG+   + +D L  G+RDL+  +W Y+VG++GE + 
Sbjct: 360 SNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGRRDLTWQKWTYKVGLKGEALS 419

Query: 575 LDKISLANSSFWKQGSTLPVNKS-LIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
           L  +S ++S  W +    PV K  L WYK  F AP+G  PLAL+++SMGKGQ W+NGQ I
Sbjct: 420 LHSLSGSSSVEWGE----PVQKQPLSWYKAFFNAPDGDEPLALDMSSMGKGQIWINGQGI 475

Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
           GRYW  Y A  +G    CDYRG YD  KCQ +CG  +Q  YH+PR+W++P  NLLVI EE
Sbjct: 476 GRYWPGYKA--SGTCGICDYRGEYDEKKCQTNCGDSSQRWYHVPRSWLNPTGNLLVIFEE 533

Query: 694 LGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAIN 753
            GGDP+ IS++ +    IC+ VSE   P + +W+          +V L C+ G  +  I 
Sbjct: 534 WGGDPTGISMVKRIAGSICADVSEWQ-PSMANWRTK---GYEKAKVHLQCDHGRKMTHIK 589

Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
           FAS+G P+G+CGS+  G CH      I  K+C+GQ  C + V     G     CPG +K 
Sbjct: 590 FASFGTPQGSCGSYSEGGCHAHKSYDIFWKSCIGQERCGVSVVPDAFG--GDPCPGTMKR 647

Query: 813 LAVEAHC 819
             VEA C
Sbjct: 648 AVVEAIC 654


>gi|34481809|emb|CAD44190.1| putative beta-galactosidase [Mangifera indica]
 gi|34481811|emb|CAD44191.1| putative beta-galactosidase [Mangifera indica]
          Length = 286

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 180/286 (62%), Positives = 217/286 (75%)

Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
           EWN+GGFPVWL F+PGI FRT N PFK  M+ F  KI+ +MK E LF SQGGPIIL+Q+E
Sbjct: 1   EWNFGGFPVWLKFVPGISFRTDNEPFKRAMQNFTQKIVQMMKDEKLFESQGGPIILSQIE 60

Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
           NEY      +G  GE Y+ WAA  A  LNT VPWVMC++ DAPDP+INTCNGFYCD F+P
Sbjct: 61  NEYEPERMKFGSAGEAYMNWAAQMATGLNTGVPWVMCKEYDAPDPVINTCNGFYCDKFSP 120

Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
           N P KP +WTE ++GWF  FG  +  RPVEDLAFAVARF + GG+F NYYMY GGTNFGR
Sbjct: 121 NKPFKPKLWTEAWTGWFTEFGGPIYQRPVEDLAFAVARFIQAGGSFVNYYMYHGGTNFGR 180

Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
           TAGGP + TSYDYDAPIDEYG IR+PK+ HL+ELH+A+KLCE  L+ +DP    LG   +
Sbjct: 181 TAGGPFITTSYDYDAPIDEYGLIRRPKYDHLKELHQAVKLCETALLYADPYVMSLGNYEQ 240

Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
           AH++  +S  CAAFL+N++S S A VTFN   ++LP WS+SILPDC
Sbjct: 241 AHVFSSTSGGCAAFLSNFNSKSSARVTFNRKHFYLPPWSISILPDC 286


>gi|34481839|emb|CAD44519.1| putative beta-galactosidase [Carica papaya]
          Length = 285

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 179/286 (62%), Positives = 218/286 (76%), Gaps = 1/286 (0%)

Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
           EWN+GGFPVWL ++PGIQFRT N PFK +M++F  KI+++MK E LF  Q GPII++Q+E
Sbjct: 1   EWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEPQEGPIIMSQIE 60

Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
           NEYG +EW  G  G+ Y KWAA  AV L T VPW+MC+QEDAPDPII+TCNGFYC+ F P
Sbjct: 61  NEYGPIEWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDTCNGFYCENFMP 120

Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
           N+  KP M+TE ++GW+  FG  VP+RP ED+A++VARF +  G+F NYYMY GGTNFGR
Sbjct: 121 NANYKPKMFTEAWTGWYTEFGGPVPYRPAEDMAYSVARFIQNRGSFINYYMYHGGTNFGR 180

Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
           TAGGP +ATSYDYDAP+DEYG  R+PKWGHLR+LHK IKLCE  L+S DP    LG+  E
Sbjct: 181 TAGGPFIATSYDYDAPLDEYGLGREPKWGHLRDLHKTIKLCEPSLVSVDPKVTSLGSNQE 240

Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
           AH++   ++ CAAFLANYD      VTF    Y LP WSVSILPDC
Sbjct: 241 AHVFWTKTS-CAAFLANYDLKYSVRVTFQNLPYDLPPWSVSILPDC 285


>gi|359476803|ref|XP_003631891.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 11-like [Vitis
           vinifera]
          Length = 722

 Score =  390 bits (1003), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 263/729 (36%), Positives = 366/729 (50%), Gaps = 124/729 (17%)

Query: 110 LHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAY 169
           L+ I    F   + P ++ MKRF   IID+M +E   ASQGGPIILA V++       A+
Sbjct: 99  LNVIHTYAFWNLHEPVQDHMKRFTRMIIDMMSKEKXIASQGGPIILALVDSAI-----AF 153

Query: 170 GVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIM 227
              G   V WA   AV L T +P VMC+Q+DAPDP+INTC G  C D FT PN P+K  +
Sbjct: 154 KEMGTRCVHWAGTMAVGLKTGIPXVMCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSV 213

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
            + +  G +  FG     R  EDLAF+   F    GT  NYYMY+  TNFGRT       
Sbjct: 214 -SNHXLGMYRVFGDPPSQRAAEDLAFSX--FISKNGTLANYYMYYSVTNFGRTTSS-FAT 269

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-S 346
           T Y  +AP+DEYG  R+ KWGHLR+LH A++L ++ L+    + QKLG  LEA IY K  
Sbjct: 270 TCYYDEAPLDEYGLPRETKWGHLRDLHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPG 329

Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPF 406
           SN CA FL N  + +    T  G+ Y+LP  S+S LPDCK VVFNT  V+SQ       +
Sbjct: 330 SNICATFLLNNITRTPTTTTLRGSKYYLPQHSISNLPDCKTVVFNTQTVVSQ-------Y 382

Query: 407 AQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
           +  KN+ +  ++  A   YEE    +  +S V     E +  TKDT+DYLWYT +I +  
Sbjct: 383 SVNKNL-QWXMSQDALPTYEE--CPTKTKSPV-----ELMTMTKDTTDYLWYTTNIELAR 434

Query: 465 --MPGQGKEVFL--NIESLGHAALVFVNKKLVAF-----GYGNHDFANFLINKKIELNEG 515
             +P + K+V     + +LGH    F+N + + F      +G++   +F+ NK I L  G
Sbjct: 435 TGLPFR-KDVLRVPQVSNLGHVMHAFLNGEYMEFYLTGTRHGSNVEKSFVFNKPITLKAG 493

Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGL 575
           +N +  L   VGL + G++ +   AG+ +V +  L     DL    W             
Sbjct: 494 LNQIAPLGATVGLPDSGSYMEHRLAGVHNVAIQGLNTRTIDLPKNGW------------- 540

Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
                                    +K  F APEG  P+AL L++M KG AW+NG+SI  
Sbjct: 541 ------------------------GHKAYFDAPEGDVPVALELSTMAKGMAWINGKSIDX 576

Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
           YW +YL+P                       G+P+Q++YH+PR ++   +NLLV+ EE G
Sbjct: 577 YWVSYLSP----------------------LGKPSQSVYHVPRAFLKTSDNLLVLFEETG 614

Query: 696 GDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFA 755
            +P  I +LT     IC ++SE  P  V SWK                       A +  
Sbjct: 615 RNPDGIEILTLNRDTICCYISEHHPTHVRSWKRE---------------------ASDIQ 653

Query: 756 SYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYL---GVSAGACPGLLK 811
            +G P G C  F PG C   +   +V+K C+G+  CSIPV    +   G+S     G+ K
Sbjct: 654 IFGDPTGTCXEFIPGNCAAPNSXKVVEKHCLGKSSCSIPVEQEIVSKDGISISGS-GITK 712

Query: 812 ALAVEAHCS 820
           ALAV+  C+
Sbjct: 713 ALAVQVLCA 721



 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 37/60 (61%), Positives = 49/60 (81%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R L+++GKR +L SGSIHYPRS PE+WP++I K++ GGL VI TY FWN HEP++
Sbjct: 56  VSYDGRPLIVNGKRELLFSGSIHYPRSIPEMWPDIIXKARHGGLNVIHTYAFWNLHEPVQ 115


>gi|357483613|ref|XP_003612093.1| Beta-galactosidase [Medicago truncatula]
 gi|355513428|gb|AES95051.1| Beta-galactosidase [Medicago truncatula]
          Length = 504

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 223/516 (43%), Positives = 295/516 (57%), Gaps = 31/516 (6%)

Query: 319 LCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWS 378
           +CE+ LIS+DP    LG   +A++Y   S DC+AFL+NYDS S A V FN   Y LP WS
Sbjct: 1   MCEKALISTDPVVTSLGNFQQAYVYTTESGDCSAFLSNYDSKSSARVMFNNMHYNLPPWS 60

Query: 379 VSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRS 436
           VSILPDC+N VFNTAKV            Q   +  L   S  FSW  +EE    S   +
Sbjct: 61  VSILPDCRNAVFNTAKV----------GVQTSQMQMLPTNSERFSWESFEEDTSSSSATT 110

Query: 437 FVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVFLNIESLGHAALVFVNKKL 491
                L EQIN T+DTSDYLWY  S+ V   +     GK   L ++S GHA  VF+N +L
Sbjct: 111 ITASGLLEQINVTRDTSDYLWYITSVDVGSSESFLHGGKLPSLIVQSTGHAVHVFINGRL 170

Query: 492 VAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-L 550
               YG  +   F     + L  G NT+ +LS+ VGL N G  F+    G+   ++I  L
Sbjct: 171 SGSAYGTREDRRFRYTGDVNLRAGTNTIALLSVAVGLPNVGGHFETWNTGILGPVVIHGL 230

Query: 551 KNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVNKSLIWYKTTFLAPE 609
             GK DLS  +W YQVG++GE + L      +S  W Q +  +  N+ L W+KT F APE
Sbjct: 231 DKGKLDLSWQKWTYQVGLKGEAMNLASPDGISSVEWMQSAVVVQRNQPLTWHKTFFDAPE 290

Query: 610 GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQP 669
           G+ PLAL++  MGKGQ W+NG SIGRYW+A    +TG    C+Y GS+   KCQ  CGQP
Sbjct: 291 GEEPLALDMDGMGKGQIWINGISIGRYWTAI---ATGSCNDCNYAGSFRPPKCQLGCGQP 347

Query: 670 AQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP----PVDS 725
            Q  YH+PR+W+    NLLV+ EELGGDPSKISL  ++   +C+ VSE  P      +DS
Sbjct: 348 TQRWYHVPRSWLKQNHNLLVVFEELGGDPSKISLAKRSVSSVCADVSEYHPNLKNWHIDS 407

Query: 726 WKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKAC 784
           +  +       P+V L C  G  I++I FAS+G P G CGS+  GACH      I+++ C
Sbjct: 408 YGKSENF--RPPKVHLHCNPGQAISSIKFASFGTPLGTCGSYEQGACHSSSSYDILEQKC 465

Query: 785 VGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           +G+  C + VS++  G     CP +LK L+VEA C+
Sbjct: 466 IGKPRCIVTVSNSNFG--RDPCPNVLKRLSVEAVCA 499


>gi|19386854|dbj|BAB86232.1| putative beta-D-galactosidase [Oryza sativa Japonica Group]
          Length = 774

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 173/289 (59%), Positives = 213/289 (73%), Gaps = 20/289 (6%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +++VTYDHR+L+I G+RR+L S SIHYPRS PE+WP+L+ ++K+GG + +ETYVFWN HE
Sbjct: 35  NSSVTYDHRSLIISGRRRLLISTSIHYPRSVPEMWPKLVAEAKDGGADCVETYVFWNGHE 94

Query: 62  PIRGQ--------------------YYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEW 101
           P +GQ                    YYFE RFDLVRF K V++AGL++ LRIGP+  AEW
Sbjct: 95  PAQGQVRAASPKFVMDLACSIRDKPYYFEERFDLVRFAKIVKDAGLYMILRIGPFVAAEW 154

Query: 102 NYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENE 161
            +GG PVWLH+ PG  FRT N PFK  MKRF   I+D+MK+E  FASQGG IILAQVENE
Sbjct: 155 TFGGVPVWLHYAPGTVFRTNNEPFKSHMKRFTTYIVDMMKKEQFFASQGGHIILAQVENE 214

Query: 162 YGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS 221
           YG++E AYG G + Y  WAA  A+  NT VPW+MCQQ DAPDP+INTCN FYCD F PNS
Sbjct: 215 YGDMEQAYGAGAKPYAMWAASMALAQNTGVPWIMCQQYDAPDPVINTCNSFYCDQFKPNS 274

Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYM 270
           P+KP  WTEN+ GWF +FG + P RP ED+AF+VARFF  GG+ QNYY+
Sbjct: 275 PTKPKFWTENWPGWFQTFGESNPHRPPEDVAFSVARFFGKGGSLQNYYV 323



 Score =  352 bits (902), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 204/497 (41%), Positives = 269/497 (54%), Gaps = 62/497 (12%)

Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR 399
           A +Y   S  C AFL+N DS  D  VTF    Y LPAWSVSILPDCKNV FNTAKV SQ 
Sbjct: 324 ADVYTDQSGGCVAFLSNVDSEKDKVVTFQSRSYDLPAWSVSILPDCKNVAFNTAKVRSQT 383

Query: 400 NNGDHPFAQQKNVNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLW 457
              D        V   L +S    W  + EK GI GN   VR    + INTTKD++DYLW
Sbjct: 384 LMMDM-------VPANLESSKVDGWSIFREKYGIWGNIDLVRNGFVDHINTTKDSTDYLW 436

Query: 458 YTASIHVMPGQ--GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEG 515
           YT S  V      G    L+IES GHA   F+N +L+   YGN   +NF +   + L  G
Sbjct: 437 YTTSFDVDGSHLAGGNHVLHIESKGHAVQAFLNNELIGSAYGNGSKSNFSVEMPVNLRAG 496

Query: 516 INTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGL 575
            N L +LSM VGLQN G  ++ AGAG+ SV +  ++N   DLSS +W Y+V V+      
Sbjct: 497 KNKLSLLSMTVGLQNGGPMYEWAGAGITSVKISGMENRIIDLSSNKWEYKVNVD------ 550

Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
                                           P+G  P+ L++ SMGKG AW+NG +IGR
Sbjct: 551 -------------------------------VPQGDDPVGLDMQSMGKGLAWLNGNAIGR 579

Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
           YW      S  CT  CDYRG++  +KC++ CGQP Q  YH+PR+W HP  N LVI EE G
Sbjct: 580 YWPRISPVSDRCTSSCDYRGTFSPNKCRRGCGQPTQRWYHVPRSWFHPSGNTLVIFEEKG 639

Query: 696 GDPSKISLLTKTGQHICSFVSEADPP-PVDSWKPNL-GVVSSSPQVRLACERGWHIAAIN 753
           GDP+KI+   +T   +CSFVSE  P   ++SW  N       + +V+L+C +G  I+++ 
Sbjct: 640 GDPTKITFSRRTVASVCSFVSEHYPSIDLESWDRNTQNDGRDAAKVQLSCPKGKSISSVK 699

Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQK---------ACVGQIECSIPVSSAYLGVSA 803
           F S+G P G C S++ G+CH  + + +V+K         AC+    C++ +S    G   
Sbjct: 700 FVSFGNPSGTCRSYQQGSCHHPNSISVVEKGTLGWAHRRACLNMNGCTVSLSDE--GFGE 757

Query: 804 GACPGLLKALAVEAHCS 820
             CPG+ K LA+EA CS
Sbjct: 758 DLCPGVTKTLAIEADCS 774


>gi|373853838|ref|ZP_09596637.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
 gi|372473365|gb|EHP33376.1| glycoside hydrolase family 35 [Opitutaceae bacterium TAV5]
          Length = 744

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 260/786 (33%), Positives = 377/786 (47%), Gaps = 127/786 (16%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V++DHRAL++DG+R ++ SG++HYPRSTP +WP ++R  ++ GL  +ETY+FWN HE  
Sbjct: 2   TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG   F GR DLVRF +  Q  GL + LRIGPY CAE NYGG P WL  +P I+ RT N 
Sbjct: 62  RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            FK E  R++  + ++++   L A  GGP+ILAQ+ENEY N+   YG  G  Y++W+ + 
Sbjct: 122 AFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179

Query: 184 AVNLNTSVPWVMC--------QQEDA---PDPIINTCNGFYCDGFT----PNSPSKPIMW 228
           A +L   +PWV C         ++DA       + T N F             P +P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT 288
           TEN++GW+ ++G  +P R  E+LA+A ARFF  GG+  NY+++ GGTNFGR  G  L+ T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGRD-GMYLLTT 298

Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSN 348
           +Y++  P+DEYG +   K  HL  L+KA+  C + +++S+      G +     +  SS 
Sbjct: 299 AYEFGGPLDEYG-LPTTKARHLARLNKALAACADKILASERPRAITGERNGLLKFQYSSG 357

Query: 349 DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQ 408
               F  +  + +   V  NG V +    S  + P     V  T K    R      FA 
Sbjct: 358 --LTFWCDDVARTVRIVGKNGEVLYDS--SARVAP-----VRRTWKASGVR------FAP 402

Query: 409 QKNVNELLLASSAFSW-YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV--- 464
                E L A    +W  E +  ++  +        EQ+  TKD +DY WY  +I V   
Sbjct: 403 WGWRAEPLPA----AWPAEAQSAVTARKPL------EQLLLTKDETDYCWYETAIVVEGS 452

Query: 465 -------------------------------MPGQGKEV------FLNIESLGHAALVFV 487
                                          + G   EV       L +  +     VF+
Sbjct: 453 GDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVFI 512

Query: 488 NKKLVAFG-------YGNHDFANF-----LINKKIELNEGINTLDILSMMVGLQNYGAW- 534
           +   VA          G  D   F     L  K + +  G + L +L   +GL   G W 
Sbjct: 513 DGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIK-GDWM 571

Query: 535 -----FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
                  +   GL++ +     NGK+    GEW +Q G+ GE  G    +  +   WK  
Sbjct: 572 IGYENMALEKKGLWAPVFW---NGKK--LEGEWRHQPGLLGERCGFADPAAGSLLAWKTA 626

Query: 590 STLP---VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW----SAYLA 642
                    + L W++TTF  P+G GP AL+L  MGKG AW+NG  IGRYW    +  + 
Sbjct: 627 KAATGRGARRPLRWWRTTFTRPKGHGPWALDLGGMGKGMAWINGHCIGRYWLLADTDPMG 686

Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHP--GENLLVIHEELGGDPSK 700
           P     K     GS  A+        P Q  YH+P  W+    G + LV+ EELGGDP+ 
Sbjct: 687 PWMAWMK-----GSLTAAPSSG----PTQRYYHVPDDWLRTDGGPDTLVLFEELGGDPAT 737

Query: 701 ISLLTK 706
           + L+ +
Sbjct: 738 VRLVRR 743


>gi|323371174|gb|ADX59436.1| beta-galactosidase [Coffea arabica]
          Length = 338

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 183/339 (53%), Positives = 230/339 (67%), Gaps = 28/339 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             V+YD R+L+I+G+R++L SGSIHYPRSTP++WP LI K+K GGL+VIETYVFWN HEP
Sbjct: 26  GQVSYDGRSLIIEGQRKLLFSGSIHYPRSTPDMWPSLISKAKHGGLDVIETYVFWNLHEP 85

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQY F+GR ++VRF++ +Q  GL+  +RIGP+  AEW YGG P WLH +PGI +R+ N
Sbjct: 86  RHGQYDFKGRHNIVRFIREIQAHGLYAFIRIGPFIEAEWTYGGLPFWLHDVPGIVYRSDN 145

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M+ F  KI++L K E L+A QGGPIIL Q+ENEY N E A+   G  YV+WAA 
Sbjct: 146 EPFKYHMQNFTTKIVNLFKSEGLYAPQGGPIILQQIENEYKNAERAFHEKGPPYVQWAAA 205

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFG 240
            AV L T VPWVMC+Q+DAPDP+INTCNG  C + F  PNSP+KP +WT+N++       
Sbjct: 206 MAVGLQTGVPWVMCKQDDAPDPVINTCNGRTCGETFVGPNSPNKPAIWTDNWTSL----- 260

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
                                 G+F NYYMY GGTNFGRT G   V TSY  +APIDEYG
Sbjct: 261 --------------------KNGSFVNYYMYHGGTNFGRT-GSAFVLTSYYDEAPIDEYG 299

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
            IRQPKWGHL++LH  IK C + L+    +   LG + E
Sbjct: 300 LIRQPKWGHLKQLHSVIKSCSQTLLHGVISVSPLGQQQE 338


>gi|188501572|gb|ACD54699.1| beta-D-galactosidase [Adineta vaga]
          Length = 735

 Score =  370 bits (951), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 251/739 (33%), Positives = 375/739 (50%), Gaps = 76/739 (10%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V+YDHRA+ I+G R +L SG IHYPRSTP +WP L+ K+KE GL  I+TYVFWN HE  
Sbjct: 33  HVSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNMHEQK 92

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG Y F GR +L  F++    AGLF++LR+GPY CAEW+YG  PVWL+ IP I FR++N+
Sbjct: 93  RGTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSND 152

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +K EMKRFL+ II  +  +   A  GGPIILAQ+ENEYG  + A       YV W    
Sbjct: 153 AWKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGGNDRA-------YVDWCGSL 203

Query: 184 AVN--LNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNS----PSKPIMWTENYSGW 235
             N   +T +PW+MC    A +  I TCNG  C  DG+        P++P+++TEN+ GW
Sbjct: 204 VSNDFASTQIPWIMCNGL-AANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GW 261

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
           F  +G  +  R  EDLA++VA +F  GG +  YYM+ GG ++GRT GG  + T+Y  D  
Sbjct: 262 FQGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDDVI 320

Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT----------HQKLGAKLEAHIYHK 345
           +   G   +PK+ HL  L + +    + L+S D               +G +   + Y  
Sbjct: 321 LRADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSARLPIPYWDGKQWSVGTQQMVYSYPP 380

Query: 346 SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHP 405
           S      F+ N  ++    V FN     +   SV I  + +++++N+A V          
Sbjct: 381 S----IQFVIN-QAAFSLFVLFNKQNISIAGQSVQIYDNNEHLLWNSADV-------SGI 428

Query: 406 FAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVM 465
           F     +  +++    +  Y E   +S     V     EQ+N T D + YLWY  ++ + 
Sbjct: 429 FRNNTFLVPIVVGPLDWQVYSEPF-LSDLPVIVASTPLEQLNLTNDETIYLWYRRNVSLS 487

Query: 466 PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN------EGINTL 519
               + +        ++ + F++++ V + + +H  A   IN  I LN            
Sbjct: 488 QPSAQTIVQVQTRRANSLIFFMDRQFVGY-FDDHSHAQGTINVNITLNLSQFLPNQQYLF 546

Query: 520 DILSMMVGLQNYGAWFDVAGAGLFSV--ILIDLKNGKRDLSSGE---WIYQVGVEGEYIG 574
           +ILS+ +G+ N+       G G F    I+ ++  G + L   E   W +Q G+ GE   
Sbjct: 547 EILSVSLGIDNFN-----IGPGSFEYKGIVGNVSLGGQSLVGDEASIWEHQKGLFGEAYQ 601

Query: 575 LDKISLANSSFWKQGSTLPVNKSLIWYKTTF----LAPE--GKGPLALNLASMGKGQAWV 628
           +     + +  W    T  +NKS+ W++T F    L  E     P+ L+   + +G A+V
Sbjct: 602 IYTEQGSKTVEWNPRWTTAINKSVTWFQTRFDLNHLVREDLNANPVLLDAFGLNRGHAFV 661

Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLL 688
           NG  IG YW   L   T   K C         + Q +C QP+Q  YHIP  W+ P  NLL
Sbjct: 662 NGNDIGLYW---LIEGTCQNKLC------CCLQNQTNCQQPSQRYYHIPSDWLKPTNNLL 712

Query: 689 VIHEELGG-DPSKISLLTK 706
            + EE+G   P  + L+ +
Sbjct: 713 TVFEEIGASSPKSVGLVQR 731


>gi|62869849|gb|AAY18075.1| beta-galactosidase, partial [Carica papaya]
          Length = 263

 Score =  368 bits (944), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 168/264 (63%), Positives = 202/264 (76%), Gaps = 1/264 (0%)

Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
           T N PFK  M++F  KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW  G  G+ Y KW
Sbjct: 1   TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           AA  AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN   KP MWTE ++GW+  F
Sbjct: 61  AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G AVP RP EDLAF++AR  + GG+F NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARLIQKGGSFVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G  R+PKWGHLR+LHKAIK  E  L+S++P+   LG   EAH++ KS + CAAFLANYD+
Sbjct: 181 GLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNSQEAHVF-KSKSGCAAFLANYDT 239

Query: 360 SSDANVTFNGNVYFLPAWSVSILP 383
            S A V+F    Y LP WS+SILP
Sbjct: 240 KSSAKVSFGNGQYELPPWSISILP 263


>gi|217070894|gb|ACJ83807.1| unknown [Medicago truncatula]
          Length = 283

 Score =  365 bits (938), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 190/282 (67%), Positives = 216/282 (76%), Gaps = 7/282 (2%)

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A +L+T VPW+MCQQ +APDPIINTCN FYCD FTPNS +KP MWTEN+SGWFL+FG AV
Sbjct: 2   ATSLDTGVPWIMCQQANAPDPIINTCNSFYCDQFTPNSDNKPKMWTENWSGWFLAFGGAV 61

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+RPVEDLAFAVARFF+ GGTFQNYYMY GGTNFGRT GGP ++TSYDYDAPIDEYG IR
Sbjct: 62  PYRPVEDLAFAVARFFQRGGTFQNYYMYHGGTNFGRTTGGPFISTSYDYDAPIDEYGDIR 121

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDA 363
           QPKWGHL++LHKAIKLCEE LI+SDPT    G  LE  +Y K+   C+AFLAN    SDA
Sbjct: 122 QPKWGHLKDLHKAIKLCEEALIASDPTITSPGPNLETAVY-KTGAVCSAFLANI-GMSDA 179

Query: 364 NVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ---KNVNELLLASS 420
            VTFNGN Y LP WSVSILPDCKNVV NTAKV +        FA +   + V+ L  +SS
Sbjct: 180 TVTFNGNSYHLPGWSVSILPDCKNVVLNTAKVNTASMISS--FATESLKEKVDSLDSSSS 237

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI 462
            +SW  E VGIS   +F +  L EQINTT D SDYLWY+ SI
Sbjct: 238 GWSWISEPVGISTPDAFTKSGLLEQINTTADRSDYLWYSLSI 279


>gi|62869847|gb|AAY18074.1| beta-galactosidase [Carica papaya]
          Length = 263

 Score =  364 bits (935), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 168/264 (63%), Positives = 200/264 (75%), Gaps = 1/264 (0%)

Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
           T N PFK  M++F  KI+ +MK E LF SQGGPIIL+Q+ENE+G VEW  G  G+ Y KW
Sbjct: 1   TDNEPFKAAMQKFTEKIVSMMKAEQLFQSQGGPIILSQIENEFGPVEWEIGAPGKAYTKW 60

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           AA  AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+ FTPN   KP MWTE ++GW+  F
Sbjct: 61  AARMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCENFTPNKNYKPKMWTEVWTGWYTEF 120

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEY 299
           G AVP RP EDLAF++ARF + GG+  NYYMY GGTNFGRTAGGP +ATSYDYDAP+DEY
Sbjct: 121 GGAVPTRPAEDLAFSIARFIQKGGSSVNYYMYHGGTNFGRTAGGPFMATSYDYDAPLDEY 180

Query: 300 GFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDS 359
           G  R+PKWGHLR LHKAIK  E  L+S++P+   LG   EAH + KS + CAAFLANYD+
Sbjct: 181 GLPREPKWGHLRNLHKAIKSSESALVSAEPSVTSLGNSQEAHAF-KSKSGCAAFLANYDT 239

Query: 360 SSDANVTFNGNVYFLPAWSVSILP 383
            S A V+F    Y LP WS+SILP
Sbjct: 240 KSSAKVSFGNGQYELPPWSISILP 263


>gi|56550179|emb|CAE51355.1| putative beta-galactosidase [Musa acuminata]
          Length = 281

 Score =  362 bits (930), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 172/286 (60%), Positives = 206/286 (72%), Gaps = 5/286 (1%)

Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
           EWN+GGFPVWL ++PGI FRT N PFK  M +F  KI+ +MK E LF SQGGPIIL+Q+E
Sbjct: 1   EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60

Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
           NEYG VE+  G   + Y+ WAA  AV LNT VPWVMC+Q+DAPDP+IN CNGFYCD F+P
Sbjct: 61  NEYGPVEYYGGTAAKNYLSWAAQMAVGLNTRVPWVMCKQDDAPDPVINACNGFYCDYFSP 120

Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
           N P KP MWTE ++GWF  F   V     +  A  V R +    T   +     GTNFGR
Sbjct: 121 NKPYKPTMWTEAWTGWFTGFRGPVLTDCEDCFAVQVIRRWILVTTIVPW-----GTNFGR 175

Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
           TAGGP ++TSYDYDAPIDEYG +RQPKWGHLR+LHKAIK+CE  L+S DPT  KLG   E
Sbjct: 176 TAGGPFISTSYDYDAPIDEYGLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKLGNYQE 235

Query: 340 AHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
           AH+Y   S  CAAFL+N++  S A+VTFNG  Y +P+WS+SILPDC
Sbjct: 236 AHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 281


>gi|84468366|dbj|BAE71266.1| putative beta-galactosidase [Trifolium pratense]
          Length = 425

 Score =  361 bits (926), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 188/423 (44%), Positives = 249/423 (58%), Gaps = 11/423 (2%)

Query: 292 YDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCA 351
           YDAP+DEYG  R PKWGHL++LHKAIKLCE  L+     +  LG  +EA +Y  SS  CA
Sbjct: 1   YDAPVDEYGLPRLPKWGHLKDLHKAIKLCEHVLLYGKSVNVSLGPSVEADVYTDSSGACA 60

Query: 352 AFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKN 411
           AF+AN D  +D  V F    Y +PAWSVSILPDCKNVV+NTAKV +Q N         + 
Sbjct: 61  AFIANVDDKNDKTVEFRNASYHIPAWSVSILPDCKNVVYNTAKVTTQTNK---IAMIPEK 117

Query: 412 VNELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----- 464
           + +       F W  ++E  GI G   FV     + INTTKDT+DYLW+T SI +     
Sbjct: 118 LQQSDKGQKTFKWDVWKENPGIWGKPDFVINGFVDHINTTKDTTDYLWHTTSISIDENEE 177

Query: 465 MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
           +  +G +  L IES GHA   FVN+K     YGN   + F     I L  G N + +LS+
Sbjct: 178 LLKKGSKPVLVIESKGHALHAFVNQKYQGTAYGNGSHSAFTFKNPISLKAGKNEIALLSL 237

Query: 525 MVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSS 584
            VGLQ  G ++D  GAG+ SV +  L N   DLSS  W Y++GV+GE++ + + +  NS 
Sbjct: 238 TVGLQTAGPFYDFVGAGVTSVKIKGLNNKTIDLSSNAWTYKIGVQGEHLKIYQGNGLNSV 297

Query: 585 FWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA-P 643
            W   S  P  ++L WYK    AP G  P+ L++  MGKG AW+NG+ IGRYW       
Sbjct: 298 SWTSTSEPPKGQTLTWYKAIVDAPPGDEPVGLDMLYMGKGFAWLNGEGIGRYWPRISEFK 357

Query: 644 STGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
              C ++CDYRG ++  KC   CG+P+Q  YH+PR+W  P  N+LV  EE GGDP+KI+ 
Sbjct: 358 KEDCVEECDYRGKFNPDKCDTGCGEPSQKWYHVPRSWFKPSGNVLVFFEEKGGDPTKITF 417

Query: 704 LTK 706
           + +
Sbjct: 418 VRR 420


>gi|188501582|gb|ACD54708.1| beta-D-galactosidase-like protein [Adineta vaga]
          Length = 735

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 252/740 (34%), Positives = 371/740 (50%), Gaps = 80/740 (10%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YDHRA+ I+G R +L SG IHYPRSTP +WP L+ K+KE GL  I+TYVFWN HE  R
Sbjct: 34  VSYDHRAITINGNRTLLFSGVIHYPRSTPAMWPYLMSKAKEQGLNTIQTYVFWNIHEQKR 93

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y F GR +L  F++    AGLF++LR+GPY CAEW+YG  PVWL+ IP I FR++N+ 
Sbjct: 94  GTYDFSGRANLSLFLQEAANAGLFVNLRLGPYVCAEWDYGALPVWLNNIPNIAFRSSNDA 153

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K EMKRFL+ II  +  +   A  GGPIILAQ+ENEYG  + A       YV W     
Sbjct: 154 WKSEMKRFLSDII--VYVDGFLAKNGGPIILAQIENEYGGNDRA-------YVDWCGSLV 204

Query: 185 VN--LNTSVPWVMCQQEDAPDPIINTCNGFYC--DGFTPNS----PSKPIMWTENYSGWF 236
            N   +T +PW+MC    A +  I TCNG  C  DG+        P++P+++TEN+ GWF
Sbjct: 205 SNDFASTQIPWIMCNGL-AANSTIETCNGCNCFDDGWMDRHRRTYPNQPLLFTENW-GWF 262

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPI 296
             +G  +  R  EDLA++VA +F  GG +  YYM+ GG ++GRT GG  + T+Y  D  +
Sbjct: 263 QGWGEGLGIRTPEDLAYSVAEWFANGGAYHAYYMWHGGNHYGRT-GGSGLTTAYSDDVIL 321

Query: 297 DEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL----------GAKLEAHIYHKS 346
              G   +PK+ HL  L + +    + L+S D     +          G +   + Y  S
Sbjct: 322 RADGTPNEPKFTHLNRLQRLLASQAQVLLSQDSNRLSIPYWNGKQWTVGTQQMVYSYPPS 381

Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV--ISQRNNGDH 404
                 F+ N  ++    V FN     +   SV I    +++++N+A V  IS+ N    
Sbjct: 382 ----VQFVIN-QAAFSLFVLFNKQNISIAGQSVQIYDYNEHLLWNSADVSGISRNNTFLV 436

Query: 405 PFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
           P         +++    +  Y E    S     V     EQ+N T D + YLWY  ++ +
Sbjct: 437 P---------IVVGPLDWQVYSEPF-TSDLPVIVASTPLEQLNLTNDETIYLWYRRNVSL 486

Query: 465 MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELN------EGINT 518
                + +        ++ L F++++ V + + +H      IN  I LN           
Sbjct: 487 SQPSVQTIVQVQTRRANSLLFFMDRQFVGY-FDDHSHTQGTINVNITLNLSQFLPNQQYI 545

Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSV--ILIDLKNGKRDLSSGE---WIYQVGVEGEYI 573
            +ILS+ +G+ N+       G G F    I+ ++  G + L   E   W +Q G+ GE  
Sbjct: 546 FEILSVSLGIDNFN-----IGPGSFEYKGIVGNVSLGGQSLVGDEASIWEHQKGLFGEAH 600

Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTF----LAPE--GKGPLALNLASMGKGQAW 627
            +     + +  W    T  +NK + W++T F    LA E     P+ L+     +G A+
Sbjct: 601 QIYTEQGSKTVEWNPKWTTVINKPVTWFQTRFDLNHLAREDLNANPILLDAFGFNRGHAF 660

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG  IG YW   L   T     C         + Q +C QP+Q  YHI   W+ P  NL
Sbjct: 661 VNGNDIGLYW---LIEGTCQNNLC------CCLQNQTNCQQPSQRYYHISSDWLKPTNNL 711

Query: 688 LVIHEELGG-DPSKISLLTK 706
           L + EE+G   P  + L+ +
Sbjct: 712 LTVFEEIGASSPKSVGLVQR 731


>gi|391229102|ref|ZP_10265308.1| beta-galactosidase [Opitutaceae bacterium TAV1]
 gi|391218763|gb|EIP97183.1| beta-galactosidase [Opitutaceae bacterium TAV1]
          Length = 743

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 262/791 (33%), Positives = 373/791 (47%), Gaps = 138/791 (17%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V++DHRAL++DG+R ++ SG++HYPRSTP +WP ++R  ++ GL  +ETY+FWN HE  
Sbjct: 2   TVSFDHRALLLDGRRTLVLSGAVHYPRSTPAMWPRILRHMRQSGLNTVETYIFWNLHERR 61

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG   F GR DLVRF +  Q  GL + LRIGPY CAE NYGG P WL  +P I+ RT N 
Sbjct: 62  RGVLDFSGRLDLVRFCRLAQAEGLNVILRIGPYICAETNYGGLPGWLRDVPDIRMRTDNE 121

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            FK E  R++  + ++++   L A  GGP+ILAQ+ENEY N+   YG  G  Y++W+ + 
Sbjct: 122 AFKREKARWVRLVAEVIRP--LCAPNGGPVILAQIENEYDNIAATYGEDGRRYLRWSVEL 179

Query: 184 AVNLNTSVPWVMC--------QQEDA---PDPIINTCNGFYCDGFT----PNSPSKPIMW 228
           A +L   +PWV C         ++DA       + T N F             P +P +W
Sbjct: 180 AQSLGLGIPWVTCAAGRAAEAGEKDAVASAGDSLETLNAFRAHEIIGQHFREHPEQPALW 239

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT 288
           TEN++GW+ ++G  +P R  E+LA+A ARFF  GG+  NY+++ GGTNFGR  G  L+ T
Sbjct: 240 TENWAGWYQTWGGVLPKREPEELAYATARFFAAGGSGVNYFLWHGGTNFGRD-GMYLLTT 298

Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP-THQKLGAKLEAHIYHKSS 347
           +Y++  P+DEYG          R          E L S  P   +K    +E   YH  S
Sbjct: 299 AYEFGGPLDEYGLPTTKARHLARLNAALAACAGELLASERPGVVEKSSGVVE---YHYDS 355

Query: 348 NDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFA 407
                F+ +  + +   V  +G V +    SV + P     V    K    R      FA
Sbjct: 356 G--LVFVCDDTARAVRIVKKSGEVLYDS--SVRVAP-----VRRAWKSSGVR------FA 400

Query: 408 QQKNVNELLLASSAFSW-YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
                 E L A    +W  E +  ++  +        EQ+  TKD +DY WY  +I V  
Sbjct: 401 PWGWRAEPLPA----AWPAEAQSAVTARKPL------EQLLPTKDETDYCWYETAIVVEG 450

Query: 465 --------------------------------MPGQGKEV------FLNIESLGHAALVF 486
                                           + G   EV       L +  +     VF
Sbjct: 451 SGDVLVAGRDGSPAGLERGALARVGRRGRRPSIAGLASEVPANTVNTLRLTRVADIVHVF 510

Query: 487 VNKKLVAFG-------YGNHDFANF-----LINKKIELNEGINTLDILSMMVGLQNYGAW 534
           ++   VA          G  D   F     L  K + +  G + L +L   +GL   G W
Sbjct: 511 IDGTFVATTPTPLRERRGKMDAGLFTQTFELDLKALRITPGKHRLSLLCCALGLIK-GDW 569

Query: 535 ------FDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK- 587
                   +   GL++ +     NGK+    GEW +Q G+ GE  G    +  +   WK 
Sbjct: 570 MIGYENMALEKKGLWAPVFW---NGKK--LEGEWRHQPGLLGERCGFADPAAGSLLAWKT 624

Query: 588 ------QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL 641
                 +G+  P+N    W++TTF  P+G GP AL+L  MGKG  W+NG  IGRYW   L
Sbjct: 625 AKAATGRGARRPLN----WWRTTFTRPKGHGPWALDLGGMGKGFCWINGHCIGRYW---L 677

Query: 642 APSTGCTKKCDYRGSYDA----SKCQKHCGQPAQTLYHIPRTWVHP--GENLLVIHEELG 695
            P T      D  G + A    S      G P Q  YH+P  W+    G + LV+ EELG
Sbjct: 678 LPDT------DPMGPWMAWMKGSLTAAPSGGPTQRYYHVPDDWLRTDGGPDTLVLFEELG 731

Query: 696 GDPSKISLLTK 706
           GDP+ + L+ +
Sbjct: 732 GDPATVRLVRR 742


>gi|414590082|tpg|DAA40653.1| TPA: hypothetical protein ZEAMMB73_851266 [Zea mays]
          Length = 580

 Score =  348 bits (894), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 202/606 (33%), Positives = 306/606 (50%), Gaps = 45/606 (7%)

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
           ++WTEN++  F ++G  V  R  ED+A+AV RFF  GG+  NYYMY GGTNFGRT G   
Sbjct: 1   MLWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASY 59

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
           V T Y  +AP+DEYG  ++PK+GHLR+LH  I+  ++  +    + + LG   EAHI+  
Sbjct: 60  VLTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFEL 119

Query: 346 SSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH 404
                C +FL+N ++  D  V F G+ +++P+ SVSIL  CKNVV+NT +V  Q +    
Sbjct: 120 PEEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSE--- 176

Query: 405 PFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
              +  + +++   ++ +  + E +    +      +  EQ N TKD +DYLWYT S  +
Sbjct: 177 ---RSFHTSDVTSKNNQWEMFSETIPKYRDTKVRTKEPLEQYNQTKDDTDYLWYTTSFRL 233

Query: 465 ----MPGQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
               +P +      L ++S  HA + F N   V    GN     F+  K ++L  G+N +
Sbjct: 234 ESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVNHV 293

Query: 520 DILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
            +LS  +G+++ G        G+   ++  L  G  DL    W ++  +EGEY  +    
Sbjct: 294 VLLSSTMGMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYSEK 353

Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
                 WK       +++  WYK  F  P+G  P+ L+++SM KG  +VNG+ +GRYW +
Sbjct: 354 GLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYWVS 410

Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
           Y                      +   G P+Q +YHIPR ++   +NLLVI EE  G P 
Sbjct: 411 Y----------------------RTLAGTPSQAVYHIPRPFLKSKDNLLVIFEEEMGKPD 448

Query: 700 KISLLTKTGQHICSFVSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINF 754
            I + T T   IC F+SE +P  + +W     K  L     S +  L C     I  + F
Sbjct: 449 GILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCPPEKTIQEVVF 508

Query: 755 ASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKAL 813
           AS+G P+G CG+F  G CH  +   IV+K C+G+  C +PV     G     C      L
Sbjct: 509 ASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTATL 567

Query: 814 AVEAHC 819
            V+  C
Sbjct: 568 GVQVRC 573


>gi|293331757|ref|NP_001169479.1| uncharacterized protein LOC100383352 [Zea mays]
 gi|224029591|gb|ACN33871.1| unknown [Zea mays]
          Length = 580

 Score =  348 bits (892), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 207/608 (34%), Positives = 305/608 (50%), Gaps = 49/608 (8%)

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
           ++WTEN++  F ++G  V  R  ED+A+AV RFF  GG+  NYYMY GGTNFGRT G   
Sbjct: 1   MLWTENWTQQFRAYGDQVAMRSAEDIAYAVLRFFAKGGSLVNYYMYHGGTNFGRT-GASY 59

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
           V T Y  +AP+DEYG  ++PK+GHLR+LH  I+  ++  +    + + LG   EAHI+  
Sbjct: 60  VLTGYYDEAPMDEYGMYKEPKFGHLRDLHNVIRSYQKAFLWGQHSSEILGHGYEAHIFEL 119

Query: 346 SSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD- 403
                C +FL+N ++  D  V F G+ +++P+ SVSIL  CKNVV+NT +V  Q +    
Sbjct: 120 PEEKLCLSFLSNNNTGEDGTVIFRGDKHYVPSRSVSILAGCKNVVYNTKRVFVQHSERSF 179

Query: 404 HPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVR-PDLAEQINTTKDTSDYLWYTASI 462
           H        N+  ++S     Y +        + VR  +  EQ N TKD +DYLWYT S 
Sbjct: 180 HTSDVTSKNNQWEMSSETIPKYRD--------TKVRTKEPLEQYNQTKDDTDYLWYTTSF 231

Query: 463 HV----MPGQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN 517
            +    +P +      L ++S  HA + F N   V    GN     F+  K ++L  G+N
Sbjct: 232 RLESDDLPFRNDIRPVLQVKSSAHAMMGFANDAFVGCARGNKQVKGFMFEKPVDLKVGVN 291

Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
            + +LS  +G+++ G        G+   ++  L  G  DL    W ++  +EGEY  +  
Sbjct: 292 HVVLLSSTMGMKDSGGELAEVKGGIQECLIQGLNTGTLDLQVNGWGHKAALEGEYKEIYS 351

Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
                   WK       +++  WYK  F  P+G  P+ L+++SM KG  +VNG+ +GRYW
Sbjct: 352 EKGLGKVQWKPAEN---DRAATWYKRYFDEPDGDDPVVLDMSSMSKGMIFVNGEGVGRYW 408

Query: 638 SAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
            +Y                      +   G P+Q +YHIPR ++   +NLLVI EE  G 
Sbjct: 409 VSY----------------------RTLAGTPSQAVYHIPRPFLKSKDNLLVIFEEEMGK 446

Query: 698 PSKISLLTKTGQHICSFVSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAI 752
           P  I + T T   IC F+SE +P  + +W     K  L     S +  L C     I  +
Sbjct: 447 PDGILVQTVTRDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLTCPPEKTIQEV 506

Query: 753 NFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLK 811
            FAS+G P+G CG+F  G CH  +   IV+K C+G+  C +PV     G     C     
Sbjct: 507 VFASFGNPDGMCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTA 565

Query: 812 ALAVEAHC 819
            L V+  C
Sbjct: 566 TLGVQVRC 573


>gi|227204157|dbj|BAH56930.1| AT4G35010 [Arabidopsis thaliana]
          Length = 377

 Score =  344 bits (882), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 163/295 (55%), Positives = 212/295 (71%), Gaps = 3/295 (1%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  VTYD  +L+IDGKR +L SGSIHYPRSTPE+WP +I+++K+GGL  I+TYVFWN HE
Sbjct: 38  NKEVTYDGTSLIIDGKRELLYSGSIHYPRSTPEMWPSIIKRAKQGGLNTIQTYVFWNVHE 97

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P +G++ F GR DLV+F+K +Q+ G+++ LR+GP+  AEW +GG P WL  +PGI FRT 
Sbjct: 98  PQQGKFNFSGRADLVKFIKLIQKNGMYVTLRLGPFIQAEWTHGGLPYWLREVPGIFFRTD 157

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N  FKE  +R++  I+D MK+E LFASQGGPIIL Q+ENEY  V+ AY   G  Y+KWA+
Sbjct: 158 NKQFKEHTERYVRMILDKMKEERLFASQGGPIILGQIENEYSAVQRAYKQDGLNYIKWAS 217

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSF 239
           +   ++   +PWVMC+Q DAPDP+IN CNG +C D F  PN  +KP +WTEN++  F  F
Sbjct: 218 NLVDSMKLGIPWVMCKQNDAPDPMINACNGRHCGDTFPGPNRENKPSLWTENWTTQFRVF 277

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDA 294
           G     R VED+A++VARFF   GT  NYYMY GGTNFGRT+    V T Y  DA
Sbjct: 278 GDPPTQRSVEDIAYSVARFFSKNGTHVNYYMYHGGTNFGRTSAH-YVTTRYYEDA 331


>gi|56550181|emb|CAE51356.1| putative beta-galactosidase [Musa AAB Group]
          Length = 282

 Score =  342 bits (876), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 169/291 (58%), Positives = 203/291 (69%), Gaps = 14/291 (4%)

Query: 100 EWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVE 159
           EWN+GGFPVWL ++PGI FRT N PFK  M +F  KI+ +MK E LF SQGGPIIL+Q+E
Sbjct: 1   EWNFGGFPVWLKYVPGINFRTDNGPFKAAMAKFTEKIVAMMKSEGLFESQGGPIILSQIE 60

Query: 160 NEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP 219
           NEYG VE+  G   + Y+ WAA  AV LNT VPWVMC+Q+DAPDP+IN  NGFYCD F+P
Sbjct: 61  NEYGPVEYYGGAAAKNYLSWAAQMAVGLNTGVPWVMCKQDDAPDPVINAGNGFYCDYFSP 120

Query: 220 NSPSKPIMWTENYSGWFLSFGYAVPFRPVED-----LAFAVARFFETGGTFQNYYMYFGG 274
           NS        + + G  L   + VP             F V  + E G  F+NYYMY GG
Sbjct: 121 NS-------LKTFFGG-LKLDWLVPVSGSSSSQTVRTGFCVQVYTE-GWIFRNYYMYHGG 171

Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
           TNFGRTAGG  ++TSYDYDAPIDEY  +RQPKWGHLR+LHKAIK+CE  L+S DPT  KL
Sbjct: 172 TNFGRTAGGLFISTSYDYDAPIDEYVLLRQPKWGHLRDLHKAIKMCEPALVSGDPTVTKL 231

Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
           G   EAH+Y   S  CAAFL+N++  S A+VTFNG  Y +P+WS+SILPDC
Sbjct: 232 GNYQEAHVYRSKSGSCAAFLSNFNPHSYASVTFNGMKYNIPSWSISILPDC 282


>gi|356503083|ref|XP_003520341.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase 15-like [Glycine
           max]
          Length = 482

 Score =  341 bits (874), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 150/319 (47%), Positives = 207/319 (64%), Gaps = 5/319 (1%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            +  V+YD  + +I+ ++ ++ SG +HYP ST ++WP + ++ K GGL+ IE+Y+FW+ H
Sbjct: 5   FATEVSYDAHSHIINEEKHIIFSGVVHYPXSTVDLWPAIFKRXKYGGLDAIESYIFWDRH 64

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP+R +Y   G  D + F+K +QEA L+  LRIGPY C  WN+GGF +WLH +P I+ R 
Sbjct: 65  EPVRREYDCSGNLDFIDFLKLIQEAELYFILRIGPYVCEXWNFGGFSLWLHNMPEIELRI 124

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N   K EM+ F  KI+++ K+  LFA  GGPIIL  +ENEYGN+   Y    + Y+KW 
Sbjct: 125 DNPIXKNEMQIFTTKIVNMAKEAKLFAPXGGPIILTPIENEYGNIMTDYREARKPYIKWC 184

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           A  A+  N  VPW+MC   DAP P+INTCNG YCD F PN+P    M+       F  +G
Sbjct: 185 AQMALTQNIGVPWIMCXXRDAPQPMINTCNGHYCDSFXPNNPKSSKMFRX-----FQKWG 239

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYG 300
             VP +  E+  F+VARFF++GG   NYYMY GGTNFG   GGP +  SY+YDAP+DEYG
Sbjct: 240 ERVPHKSAEESTFSVARFFQSGGILNNYYMYHGGTNFGHMVGGPYMTASYEYDAPLDEYG 299

Query: 301 FIRQPKWGHLRELHKAIKL 319
            + +PKW H ++LHK +  
Sbjct: 300 NLNKPKWEHFKQLHKELTF 318



 Score = 53.1 bits (126), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 1/68 (1%)

Query: 733 VSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECS 791
           V+   Q+  +C+ G  I+ I FAS+G PEGNCGSF+ G     D   +V+ AC+G+  C 
Sbjct: 415 VNEGAQLDPSCQIGKTISQIQFASFGNPEGNCGSFKGGTWEATDSQSVVEVACIGRNSCG 474

Query: 792 IPVSSAYL 799
             V+  ++
Sbjct: 475 FTVTKRHI 482



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 30/43 (69%)

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
           F AP G  P+ ++L   GK QAWVNG+SIG YWS+++  + GC
Sbjct: 363 FEAPFGIDPMVMDLQDSGKRQAWVNGKSIGCYWSSWITNTNGC 405


>gi|255563859|ref|XP_002522930.1| beta-galactosidase, putative [Ricinus communis]
 gi|223537857|gb|EEF39473.1| beta-galactosidase, putative [Ricinus communis]
          Length = 450

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 199/494 (40%), Positives = 270/494 (54%), Gaps = 62/494 (12%)

Query: 158 VENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF 217
           +ENEYGN+E A+   G  YV WAA  AV+L T VPW+MC+Q DAPDP+INTCNG  C G 
Sbjct: 1   IENEYGNIEAAFHEKGSSYVHWAAKMAVDLQTGVPWIMCKQIDAPDPVINTCNGMKC-GE 59

Query: 218 T---PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
           T   PNSP+KP +WTEN++ ++  +G     R  +D+AF VA F    G++ NYYMY GG
Sbjct: 60  TFGGPNSPNKPSLWTENWTSFYQVYGGEPYIRSAQDIAFHVALFIAKNGSYVNYYMYHGG 119

Query: 275 TNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL 334
           TNFGRTA   ++   YD  AP+DEYG IRQPKWGHL+ELH  IK C   L+    T+  +
Sbjct: 120 TNFGRTAAAYVITGYYD-QAPLDEYGLIRQPKWGHLKELHAVIKSCSTTLLEGVQTNLSV 178

Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAK 394
           G   +A+++      C AFL N D S +A V F    + L   S+SILPDC N++FNTAK
Sbjct: 179 GQLQQAYMFEAQGGGCVAFLVNND-SVNATVGFRNKSFELLPKSISILPDCDNIIFNTAK 237

Query: 395 VISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGN--RSFVRPD-LAEQINTTKD 451
           V +  N              +  +S   + +E+ + +  N   S ++ D L E +NTTKD
Sbjct: 238 VNAGSN------------RRITTSSKKLNTWEKYIDVIPNYSDSTIKSDTLLEHMNTTKD 285

Query: 452 TSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHD-FANFLINKKI 510
            SDYLWYT S        K + L++ESL H A  FVN K     +G+ +    F++   I
Sbjct: 286 KSDYLWYTFSFQPNLSCTKPL-LHVESLAHVAYAFVNNKYSGSAHGSKNGKVPFIMEVPI 344

Query: 511 ELNEG--INTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGV 568
            L++    N + ILS++VGL                                     VG+
Sbjct: 345 VLDDDGLSNNISILSVLVGL------------------------------------SVGL 368

Query: 569 EGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
            GE + L          W +   + + + L W+K  F  P+G  P+ LNLA+M KG+AWV
Sbjct: 369 LGETLQLYGKEHLEMVKWSKAD-ISIAQPLTWFKLEFDTPKGNDPVVLNLATMSKGEAWV 427

Query: 629 NGQSIGRYWSAYLA 642
           NGQSIGRYW ++L 
Sbjct: 428 NGQSIGRYWISFLT 441


>gi|348687417|gb|EGZ27231.1| hypothetical protein PHYSODRAFT_553859 [Phytophthora sojae]
          Length = 825

 Score =  331 bits (849), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 236/760 (31%), Positives = 370/760 (48%), Gaps = 107/760 (14%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V+Y  R   IDG+R +L  GSIHYPRS+   W  L+R +K  GL  IE YVFWN HE  
Sbjct: 86  SVSYSARGFEIDGRRTLLLGGSIHYPRSSEGEWETLLRAAKRDGLNHIEMYVFWNLHEQE 145

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG + F G  +  RF +   E GLFLH+R GPY CAEW+ GG P+WL++IPG++ R++N 
Sbjct: 146 RGVFNFAGNANATRFYELAAEVGLFLHVRFGPYVCAEWSNGGLPLWLNWIPGMKVRSSNA 205

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P++ EM+RF+  +++L +     A  GGPII+AQ+ENE+        +    YV+W  D 
Sbjct: 206 PWQWEMERFVTYMVELSRP--FLAKNGGPIIMAQIENEFA-------MHDPEYVEWCGDL 256

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSF 239
              L+TS+PWVMC    A + I+ +CNG  C  F        PS P++WTE+  GWF ++
Sbjct: 257 VKRLDTSIPWVMCYANAAENTIL-SCNGNDCVDFAVKHVKERPSDPLVWTED-EGWFQTW 314

Query: 240 GYAV--PF----RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
                 P     R  ED+A+AVAR+F  GG   NYYMY GG NFGR A    V T Y   
Sbjct: 315 AKDKKNPLPNDQRTAEDMAYAVARWFAVGGAAHNYYMYHGGNNFGRAASAG-VTTKYADG 373

Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD----------PTHQKLG--AKLEAH 341
             +   G   +PK  HLR+LH+A+  C + L+ +D          PTH +    + L+  
Sbjct: 374 VNLHSDGLSNEPKRSHLRKLHEALIDCNDILMRNDRQLLHPHELAPTHGETAEASSLQQR 433

Query: 342 IYHKSSNDCAAFLANYDSSSDANVT--FNGNVYFLPAWSVSILPDCKNVVFNTAKVISQR 399
            +   + D    +A  ++ +D  VT  F  N Y L   S+ I+ D   ++FNTA V    
Sbjct: 434 AFIYGAEDGPNQVAFLENQADKKVTVVFRDNKYELAPTSMMIIKDGA-LLFNTADV---- 488

Query: 400 NNGDHPFAQQKNVNELLLASSA--FSWYEEKVG-ISGNRSFVRPDLAEQINTTKDTSDYL 456
                P    +    ++ A++    +W E  V  ++  R  V     EQ+  T D SDYL
Sbjct: 489 -RKSFPGTVHRAYTPIVQAATLQWETWSELNVSSLTPRRRVVAERPVEQLRLTADRSDYL 547

Query: 457 WYTASIHVMPGQ------GKEVFLNIESLGHAALV------FVNKKLVAFGYGN--HDFA 502
            Y  +  V P             + + S   ++++       + ++ +A+  GN   +F 
Sbjct: 548 TYETTFTVDPADTPIDIDSDASTVKVTSCEASSIIAFVDGWLIGERNLAYPGGNCSKEF- 606

Query: 503 NFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG-E 561
            F +   I++    ++L ++S+ +G+ + G+       G   V       G+++L+ G +
Sbjct: 607 RFSLPTNIDVTRQ-HSLKLVSVSLGIYSLGSNHTKGLTGKVRV-------GRKNLAKGHQ 658

Query: 562 WIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN--KSLIWYKTTFLAPEGKGP------ 613
           W     + GE + + +    +S  W     +  +  + + WY T+F  P  + P      
Sbjct: 659 WEMYPTLVGEQLEIYRPEWLSSVPWTPVPRVVASGRQLMSWYWTSFSYPAFELPAEADPV 718

Query: 614 -----LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQ 668
                + L+   + +G+A++NG  +GRYW                             G+
Sbjct: 719 SEPFSILLDCIGLTRGRAYINGHDLGRYW------------------------LVNDEGE 754

Query: 669 PAQTLYHIPRTW-VHPGENLLVIHEELGGDPSKISLLTKT 707
             Q  YH+PR W V    N+LV+ +ELGG  + + L++ +
Sbjct: 755 FVQRYYHVPRDWLVKDQANVLVVFDELGGSVADVRLVSSS 794


>gi|320129049|gb|ADW19770.1| beta-galactosidase [Fragaria chiloensis]
          Length = 219

 Score =  329 bits (843), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 144/219 (65%), Positives = 179/219 (81%)

Query: 34  EVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRI 93
           E+WP+LI+++K+GGL+VI+TYVFWN HEP  G+YYFE  +DLV+F+K VQ+AGL++HLRI
Sbjct: 1   EMWPDLIQRAKDGGLDVIQTYVFWNGHEPSPGKYYFEDNYDLVKFIKLVQQAGLYVHLRI 60

Query: 94  GPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPI 153
           GPY CAEWN+GGFPVWL +IPGIQFRT N PFK++M+RF  KI+++MK E LF S GGPI
Sbjct: 61  GPYVCAEWNFGGFPVWLKYIPGIQFRTDNGPFKDQMQRFTTKIVNMMKAERLFESHGGPI 120

Query: 154 ILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFY 213
           IL+Q+ENEYG +E+  G  G+ Y  WAA  AV L T VPWVMC+Q+DAPDP+IN CNGFY
Sbjct: 121 ILSQIENEYGPMEYEIGAPGKAYTDWAAQMAVGLGTGVPWVMCKQDDAPDPVINACNGFY 180

Query: 214 CDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLA 252
           CD F+PN   KP MWTE ++GWF  FG AVP+RP EDLA
Sbjct: 181 CDYFSPNKAYKPKMWTEAWTGWFTEFGGAVPYRPAEDLA 219


>gi|325183103|emb|CCA17560.1| betagalactosidase putative [Albugo laibachii Nc14]
          Length = 811

 Score =  325 bits (832), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 249/752 (33%), Positives = 362/752 (48%), Gaps = 96/752 (12%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V Y  R  VIDGK  +L  GSIHY RSTP+ W  L+ K+KE GL +++ Y+FWN+HEP 
Sbjct: 98  DVKYTKRGFVIDGKASILLGGSIHYARSTPDTWDSLLAKAKEDGLNLVQLYIFWNFHEPR 157

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG +YF  R +L  F + V   GLF+HLR GPY CAEWN GG P+WL  IPG++ R+ + 
Sbjct: 158 RGSFYFADRGNLTHFFERVVAHGLFVHLRFGPYVCAEWNRGGLPLWLDRIPGMKVRSNSE 217

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +++EM R +  +I+L +    F+  GGPII+AQ+ENEY   +         YV W +  
Sbjct: 218 SWRQEMNRIILIMINLARP--YFSVNGGPIIMAQIENEYNGHD-------PTYVAWLSQL 268

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTEN---YSGWF 236
              L   +PW MC    A +  I+TCN   C  F   +    PS+P++WTEN   Y  W 
Sbjct: 269 VRKLGIGIPWTMCNGASAVN-TISTCNDNDCFQFAEKNAKVFPSQPLVWTENEAWYEKWA 327

Query: 237 ---LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
              ++       R  E +A+ VAR+F  GG   NYYMY GG NFGRTA    V T Y   
Sbjct: 328 TKNIAQDGQNDQRSPEQVAYVVARWFAVGGAMHNYYMYHGGNNFGRTASAG-VTTMYADG 386

Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP--THQK-LGAK------LEAHIYH 344
           A +   G   +PK  HLR+LH  +  C + L+S++    H K LG +        A+IY 
Sbjct: 387 AILHHDGLDNEPKRSHLRKLHHTLIRCNKALLSNERQLNHAKPLGPEGKNAYTQRAYIYG 446

Query: 345 KSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDH 404
             S     FL N  +   A   +    Y LP  ++ IL D  NV++NT+ V     +G  
Sbjct: 447 NCS-----FLENTHAIHRACFRYQLKEYCLPPQTIVIL-DHNNVLYNTSDV-----SGTL 495

Query: 405 PFAQQKNVNELL--LASSAFSWYEEKVGISGNRSFVRPDL-AEQINTTKDTSDYLWYTAS 461
                ++ + L+    S    W E  V     R  +  D   EQ+  T+DT+DYL Y   
Sbjct: 496 GSRSTRSFSPLIRFRKSDWKIWSEWDVNPHNVRDQIVNDSPLEQLLVTQDTTDYLMYQNE 555

Query: 462 IH---VMPGQGK---EVFLNIESLGHAALVFVNKKLVA---FGYGNHDFANFLINKKIEL 512
           +      P + K    +   I    ++ LVF+N + +      Y   D +N        L
Sbjct: 556 VRWGSNGPTKNKMKSSILKFISCDANSFLVFINGEFIGEQHLAYPGDDCSNIFRFDLGPL 615

Query: 513 NE-GIN-TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG---EWIYQVG 567
            + G N TL ILS+ +G+ + G   +    G+ S + ID    +R L  G    W+   G
Sbjct: 616 GKYGANLTLSILSISLGIHSLG---EKHQKGIVSDVQID----ERSLVYGPHERWVMFSG 668

Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNK-SLIWYKTTFLAPE----GKGPLALNLASMG 622
           + GE + L     +NS  W+  +     K +  WY T F+  +     +  + L+   M 
Sbjct: 669 LIGELLKLYDPMWSNSVPWRNLNVQTDRKRTSKWYMTKFVLKQLDWDTETSVLLDCKGMN 728

Query: 623 KGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVH 682
           +G+ ++NG  +GRYW           ++ D  G+Y             Q  Y IP  W+H
Sbjct: 729 RGRIYLNGHDLGRYW---------LIRRSD--GAY------------VQRYYTIPVAWLH 765

Query: 683 PG--ENLLVIHEELGGDP-SKISLLTKTGQHI 711
                N LVI EEL  +    + ++T T + I
Sbjct: 766 AANKSNYLVIFEELRNETIESMRIVTSTMRRI 797


>gi|300121971|emb|CBK22545.2| unnamed protein product [Blastocystis hominis]
          Length = 721

 Score =  323 bits (828), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 226/739 (30%), Positives = 356/739 (48%), Gaps = 96/739 (12%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD R+  +DGKR +  +GS+HYPR+TPE+W  ++ ++ E GL +I+ Y FWN HEP++
Sbjct: 35  VTYDERSFFLDGKRSIFLAGSVHYPRATPEMWDTILDQAVEDGLNLIQIYTFWNLHEPVK 94

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY +EG  D+  F++   + GLF+++RIGPY CAEW+ GG PVW++++ G++ R  N+ 
Sbjct: 95  GQYNWEGIADIRLFLQKCADRGLFVNMRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDV 154

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +K+EM  ++  + D  +  + FA +GGPII +Q+ENE     W    G   Y+ W  + A
Sbjct: 155 WKKEMGDWMKVLTDYTR--DFFADRGGPIIFSQIENEL----WG---GAREYIDWCGEFA 205

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSP-------SKPIMWTENYSGWFL 237
            +L  +VPW+MC   D  +  IN CNG  C  +  +          +P  WTEN  GWF 
Sbjct: 206 ESLELNVPWMMCNG-DTSEKTINACNGNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQ 263

Query: 238 SFGYAVP---------FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT 288
             G A            R  ED  F V +F + GG++ NYYM+FGG ++G+ AG  +   
Sbjct: 264 IHGAASAERDDYEGWDARSAEDYTFNVLKFMDRGGSYHNYYMWFGGNHYGKWAGNGMT-N 322

Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP---THQKLGAKLEAHIYHK 345
            Y     I       +PK  H  ++H+ +    E L++        + L         ++
Sbjct: 323 WYTNGVMIHSDTLPNEPKHSHTAKMHRMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYR 382

Query: 346 SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHP 405
             +   +F+ N   S+D  V +   VY LPAWS+ +L +  NV+F T  V         P
Sbjct: 383 YGDRLVSFVENNKGSADK-VIYRDIVYELPAWSMIVLDEYDNVLFETNNV--------KP 433

Query: 406 FAQQK--NVNELLLASSAFSWYEEKVGI---SGNRSFVRPDLAEQINTTKDTSDYLWYTA 460
             + +  +  E L     F ++ E V        R  V P   EQ+N T+D +++L+Y  
Sbjct: 434 VNKHRVYHCEEKL----EFEYWNEPVSTLSQEAPRVVVSPKANEQLNMTRDLTEFLYYET 489

Query: 461 SIHVMPGQGKEVFLNIESLGHAALV-FVNKKLVAFG--YGNHDFANFLINKKIELNEGIN 517
            +        E  L+I      A V +V+   V     + +HD     +N  ++  +G +
Sbjct: 490 EVEFPQ---DECTLSIGGTDANAFVAYVDDHFVGSDDEHTHHD-GWHTMNINMKSGKGKH 545

Query: 518 TLDILSMMVGLQN------YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGE 571
            L +LS  +G+ N        +W      G+   I    K    D+ + EW +  G+ GE
Sbjct: 546 KLVLLSESLGVSNGMDSNLDPSWASSRLKGICGWI----KLCGNDIFNQEWKHYPGLVGE 601

Query: 572 YIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEG--KG-PLALNLASMGKGQAWV 628
              +       +  WK  S +    +L WY++TF  P+G  +G  + L    M +GQA+V
Sbjct: 602 AKQVFTDEGMKTVTWK--SDVENADNLAWYRSTFKTPQGLKRGIEVLLRPEGMNRGQAYV 659

Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV--HPGEN 686
           NG +IGRYW                         +   G+  Q  YHIP+ W+     EN
Sbjct: 660 NGHNIGRYW-----------------------MIKDGNGEYTQGYYHIPKDWLKGEGEEN 696

Query: 687 LLVIHEELGGDPSKISLLT 705
           +LV+ E LG     +++ T
Sbjct: 697 VLVLGETLGASDPSVTICT 715


>gi|281202334|gb|EFA76539.1| glycoside hydrolase family 35 protein [Polysphondylium pallidum
           PN500]
          Length = 611

 Score =  320 bits (820), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 206/591 (34%), Positives = 316/591 (53%), Gaps = 47/591 (7%)

Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
           E   RF+ K +     E  FA+ GGPII++QVENEYG V+  YG  G  Y +W+A  A +
Sbjct: 2   ESWMRFITKYL-----ERHFAANGGPIIMSQVENEYGWVQERYGESGTKYAQWSARLAQS 56

Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYC----DGFTPNSPSKPIMWTENYSGWFLSFGYA 242
           LN  VPW+MCQQ+D  D +INTCNGFYC    +G     P++P  +TEN+ GWF  +  +
Sbjct: 57  LNVGVPWIMCQQDDI-DSVINTCNGFYCHDWIEGHWARYPNQPAFFTENWPGWFQQWKQS 115

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFI 302
            P RPVED+ +AV  +F  GG+  NYYM+ GGTNFGRT+  P+V  SYDYDA +DEYG  
Sbjct: 116 TPHRPVEDVLYAVGNWFARGGSLMNYYMWHGGTNFGRTS-SPMVVNSYDYDAALDEYGNP 174

Query: 303 RQPKWGHLRELHKAIKLCEEYLISSD--PTHQKLGAKLEAHIYHKS-SNDCAAFLANYDS 359
            +PK+ H  + +  ++      +++   P  + LG    + IYH +   +  +FL N   
Sbjct: 175 SEPKYSHAAKFNNLLQKYSHIFLNAPEIPRSEYLGGS--SSIYHYTFGGESLSFLINNHE 232

Query: 360 SSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKV--ISQRNNGDHPFAQQKNVNELLL 417
           S+  ++ +NG  + +  WSV +L +  + VF++A    +S+       F+   + N   +
Sbjct: 233 SALNDIVWNGQNHIIKPWSVHLLYN-NHTVFDSAATPEVSKLAMTSKRFSPVNSFNNAYI 291

Query: 418 ASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVF-LNI 476
           +     W EE + ++ +    +P   EQ++ T D +DYLWY   I++   +G EVF  N+
Sbjct: 292 S----QWVEE-IDMTDSTWSSKP--LEQLSLTHDKTDYLWYVTEINLQV-RGAEVFTTNV 343

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
             + HA   +++ K  +  +  + F     N K ++  G + L IL+  +G+Q+Y    +
Sbjct: 344 SDVLHA---YIDGKYQSTIWSANPF-----NIKSDIPLGWHKLQILNSKLGVQHYTVDME 395

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNK 596
               GL   I +    G  D+++  W  +  V GE + +   +      W   S   V +
Sbjct: 396 KVTGGLLGNIWV----GGTDITNNGWSMKPYVNGERLAIYNPNNIFKVDWSSFSG--VQQ 449

Query: 597 SLIWYKTTFLAPEGKGP-LALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
            L WYK  FL         +LN++ M KG  W+NG+ + RYW   +    GC   C Y+G
Sbjct: 450 PLTWYKINFLHELSPNKHYSLNMSGMNKGMIWLNGKHVARYW---ITKGWGC-NGCSYQG 505

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
            Y    C  +CG+P+Q  YH+P+ W+  G NLLVI EE+GG+P  I L  K
Sbjct: 506 GYTDQLCSTNCGEPSQINYHLPQDWLIEGANLLVIFEEVGGNPKSIKLEEK 556


>gi|359496328|ref|XP_003635211.1| PREDICTED: beta-galactosidase 6-like [Vitis vinifera]
 gi|296080974|emb|CBI18606.3| unnamed protein product [Vitis vinifera]
          Length = 198

 Score =  320 bits (819), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 147/200 (73%), Positives = 171/200 (85%), Gaps = 3/200 (1%)

Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
           MGKGQAWVNGQSIGRYW AYLAPSTGCT  CDYRG+YDASKC ++CGQPAQTLYHIPRTW
Sbjct: 1   MGKGQAWVNGQSIGRYWPAYLAPSTGCTTNCDYRGAYDASKCLRNCGQPAQTLYHIPRTW 60

Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVR 740
           VH G+NLLV+HEELGGDPSKISLLT+TGQ +C+ VSEADPPP DSW+PNL  +S S QVR
Sbjct: 61  VHSGKNLLVLHEELGGDPSKISLLTRTGQEVCAHVSEADPPPADSWQPNLEFMSQSSQVR 120

Query: 741 LACERGWHIAAINFASYGIPEGNCGSFRPGACHMDVLPIVQKACVGQIECSIPVSSAYLG 800
           L CE+GWHI+ INFAS+G P G+CG+F PG CH +VL +VQ+AC+GQ  C+IPVS+A LG
Sbjct: 121 LTCEQGWHISMINFASFGTPRGHCGTFNPGNCHANVLSVVQQACIGQEGCAIPVSTARLG 180

Query: 801 VSAGACPGLLKALAVEAHCS 820
                CPG+LK+LA+EA CS
Sbjct: 181 ---DPCPGVLKSLAIEALCS 197


>gi|217075721|gb|ACJ86220.1| unknown [Medicago truncatula]
          Length = 208

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 135/185 (72%), Positives = 159/185 (85%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           +NVTYDH+ALVIDGKRRVL SGSIHYPRSTP++WP+LI+KSK+GG++VIETYVFWN HEP
Sbjct: 24  SNVTYDHKALVIDGKRRVLMSGSIHYPRSTPQMWPDLIQKSKDGGIDVIETYVFWNLHEP 83

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           +RGQY FEGR DLV FVK V  AGL++HLRIGPY CAEWNYGGFP+WLHFI GI+FRT N
Sbjct: 84  VRGQYNFEGRGDLVGFVKVVAAAGLYVHLRIGPYVCAEWNYGGFPLWLHFIAGIKFRTNN 143

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK EMKRF AKI+D+MKQENL+ASQGGPIIL+Q+ENEYGN++       + Y+ WAA 
Sbjct: 144 EPFKAEMKRFTAKIVDMMKQENLYASQGGPIILSQIENEYGNIDTHDARAAKSYIDWAAS 203

Query: 183 TAVNL 187
            A +L
Sbjct: 204 MATSL 208


>gi|302144233|emb|CBI23471.3| unnamed protein product [Vitis vinifera]
          Length = 315

 Score =  296 bits (759), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 137/154 (88%), Positives = 145/154 (94%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            VTYDHRALVIDGKRRVLQSGSIHYPRS PEVWPE+IRKSKEGGL+VIETYVFWN HEP+
Sbjct: 159 TVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEGGLDVIETYVFWNNHEPV 218

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG+YYFEGRFDLVRFVKTVQEAGL +HLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN+
Sbjct: 219 RGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTND 278

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ 157
            FK EMKRFLAKI+ LMK+ NLFA QGGPIILAQ
Sbjct: 279 LFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 312


>gi|68161830|emb|CAJ09952.1| beta-galactosidase [Mangifera indica]
          Length = 362

 Score =  296 bits (758), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 158/375 (42%), Positives = 222/375 (59%), Gaps = 23/375 (6%)

Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
           T   LG   E H+++  S  CAAFLANYD++S A V F    Y LP WS+SILPDCK  V
Sbjct: 1   TVTSLGNNQEVHVFNPKSGSCAAFLANYDTTSSAKVNFQNMQYELPPWSISILPDCKTAV 60

Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSW---YEEKVGISGNRSFVRPDLAEQI 446
           FNTA++ +Q +       Q   V       S FSW    EE    S +++F    L EQ+
Sbjct: 61  FNTARLGAQSS-----LKQMTPV-------STFSWQSYIEESASSSDDKTFTTDGLWEQL 108

Query: 447 NTTKDTSDYLWYTASIHVMPGQG-----KEVFLNIESLGHAALVFVNKKLVAFGYGNHDF 501
           N T+D SDYLWY  +I++   +G     ++  L I S GHA  VF+N +L    YG  D 
Sbjct: 109 NVTRDASDYLWYMTNINIDSNEGFLKNGQDPLLTIWSAGHALHVFINGQLSGTVYGGVDN 168

Query: 502 ANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSG 560
                ++ +++  G+N L +LS+ VGLQN G  F+    G+   V L  L  G RDLS  
Sbjct: 169 PKLTFSQNVKMRVGVNQLSLLSISVGLQNVGTHFEQWNTGVLGPVTLRGLNEGTRDLSKQ 228

Query: 561 EWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLAS 620
           +W Y++G++GE + L  +S ++S  W +GS+L   + L WYKTTF AP G  PLAL++++
Sbjct: 229 QWSYKIGLKGEDLSLHTVSGSSSVEWVEGSSLAQKQPLTWYKTTFNAPAGNEPLALDMST 288

Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
           MGKG  W+N QSIGR+W  Y+A   G   +C+Y G+Y   KC  +CGQP+Q  YH+PR+W
Sbjct: 289 MGKGLIWINSQSIGRHWPGYIAH--GSCGECNYAGTYTDKKCHTNCGQPSQRWYHVPRSW 346

Query: 681 VHPGENLLVIHEELG 695
           ++P  NLLV+ + +G
Sbjct: 347 LNPTGNLLVVLKRVG 361


>gi|359496728|ref|XP_002268994.2| PREDICTED: beta-galactosidase 6-like, partial [Vitis vinifera]
          Length = 177

 Score =  295 bits (755), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 137/154 (88%), Positives = 145/154 (94%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            VTYDHRALVIDGKRRVLQSGSIHYPRS PEVWPE+IRKSKEGGL+VIETYVFWN HEP+
Sbjct: 24  TVTYDHRALVIDGKRRVLQSGSIHYPRSMPEVWPEIIRKSKEGGLDVIETYVFWNNHEPV 83

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG+YYFEGRFDLVRFVKTVQEAGL +HLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN+
Sbjct: 84  RGEYYFEGRFDLVRFVKTVQEAGLLVHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTND 143

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ 157
            FK EMKRFLAKI+ LMK+ NLFA QGGPIILAQ
Sbjct: 144 LFKNEMKRFLAKIVSLMKEANLFAPQGGPIILAQ 177


>gi|3388167|gb|AAC28739.1| beta-galactosidase [Carica papaya]
          Length = 203

 Score =  294 bits (752), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 130/204 (63%), Positives = 162/204 (79%), Gaps = 1/204 (0%)

Query: 29  PRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLF 88
           PRSTPE+WP+LI+ +KEGGL+VI+TYVFWN HEP  G YYFE R+D V+F+K V +AGL+
Sbjct: 1   PRSTPEMWPDLIQNAKEGGLDVIQTYVFWNGHEPSPGNYYFEDRYDPVKFIKLVHQAGLY 60

Query: 89  LHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFAS 148
           +HLRIGPY C EWN+GGFPVWL ++PGIQFRT N PFK +M++F  KI+++MK E LF  
Sbjct: 61  VHLRIGPYICGEWNFGGFPVWLKYVPGIQFRTDNGPFKAQMQKFTEKIVNMMKAEKLFEP 120

Query: 149 QGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINT 208
           QGGP I++Q+E EYG + W  G  G+ Y KWAA  AV L T VPW+MC+QEDAPDPII+T
Sbjct: 121 QGGP-IMSQIEIEYGPIGWEIGAPGKAYTKWAAQMAVGLGTGVPWIMCKQEDAPDPIIDT 179

Query: 209 CNGFYCDGFTPNSPSKPIMWTENY 232
           CNGFYC+ F PN+  KP MWTE +
Sbjct: 180 CNGFYCENFMPNANYKPKMWTEAW 203


>gi|356544613|ref|XP_003540743.1| PREDICTED: beta-galactosidase 8-like [Glycine max]
          Length = 288

 Score =  293 bits (750), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 157/285 (55%), Positives = 192/285 (67%), Gaps = 5/285 (1%)

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAP 295
           F+SFG  VP RPVEDLAFAVARF++ GGTFQNYYM+ GGTNFGRT GGP ++TSYD+D P
Sbjct: 6   FVSFGDVVPHRPVEDLAFAVARFYQRGGTFQNYYMFHGGTNFGRTTGGPFISTSYDFDTP 65

Query: 296 IDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLA 355
           IDEYG IRQPKW HL+ +HKAIKLCE+ L+++ PT   LG  +EA +Y+  +   AAFLA
Sbjct: 66  IDEYGIIRQPKWDHLKNVHKAIKLCEKALLATGPTITYLGPNIEAAVYNIGAV-SAAFLA 124

Query: 356 NYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK-NVNE 414
           N  + +DA V+FNGN Y LPAW VS LPDCK+VV NTAK+ S            K  V  
Sbjct: 125 NI-AKTDAKVSFNGNSYHLPAWYVSTLPDCKSVVLNTAKINSASMISSFTTESLKEEVGS 183

Query: 415 LLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFL 474
           L  + S +SW  E +GIS   SF +  L EQINTT D SDYLWY++SI +      E  L
Sbjct: 184 LDDSGSGWSWISEPIGISKAHSFSKFWLLEQINTTADRSDYLWYSSSIDL--DAATETVL 241

Query: 475 NIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTL 519
           +IESLGHA   FVN KL   G GNH+  +  ++  I L  G NT+
Sbjct: 242 HIESLGHALHAFVNGKLAGSGTGNHEKVSVKVDIPITLVYGKNTI 286


>gi|452819191|gb|EME26260.1| beta-galactosidase [Galdieria sulphuraria]
          Length = 652

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 183/516 (35%), Positives = 273/516 (52%), Gaps = 42/516 (8%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A VT+D RA+VIDGKR +L  GS HYP+   E WP+ +  +K+ GL  +E Y+FWN HE
Sbjct: 3   TAQVTFDKRAVVIDGKRTILYCGSYHYPKIHYEHWPQALELAKDCGLNCLEVYIFWNVHE 62

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
             +G Y+FE   ++ RF++  QE GL + LR+GPY CAE +YGGFP WL  IPGI+FRT 
Sbjct: 63  KKKGVYHFEREGNIFRFLQLAQERGLKVILRMGPYICAETSYGGFPYWLREIPGIEFRTY 122

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           N PF +EMKR+L  I  ++K+  L+  +GGPIIL Q+ENEY  V   YG  G+ Y+ W  
Sbjct: 123 NEPFMKEMKRWLTDINRMLKENKLYHQKGGPIILVQIENEYDIVSSIYGAAGQKYLHWCY 182

Query: 182 DTAVNLNTSVPWVMCQQED-----APDPIINTCNGFY----CDGFTPNSPSKPIMWTENY 232
           +  +    +  W+  +  +     + D  I T N FY     D      P +P++WTE +
Sbjct: 183 E--LYKEGASEWLTSKDSEYFRVASIDKSIETINDFYGHRRIDSLKALKPHQPLLWTEFW 240

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDY 292
            GW+  +  A   RPV+D+ +A ARF   GG+  NYYM+ GGT+FG  A      T YD+
Sbjct: 241 IGWYNIWRGAQRQRPVDDVIYAAARFIAQGGSGMNYYMFHGGTHFGNLAMYG-QTTGYDF 299

Query: 293 DAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTHQKLGAKLEAHIYHK-SSNDC 350
           DAP+D YG   + K+  L++L+  +   E  L+S D P  QKL   +  + +    S D 
Sbjct: 300 DAPVDSYGRPTE-KFERLKQLNHCLSNLEYILLSQDEPEVQKLTPNVNVYRWKDIESGDE 358

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK 410
            +F+ N D  S + V        L   SV I         N  +V     N  +    QK
Sbjct: 359 CSFVCN-DQRSQSYVIVAERAVCLKPLSVKIY-------LNHEEVFDSSQNSYN--VSQK 408

Query: 411 NVNELLLASSAFSWYEEKVGISGNR-------SFVRPDLAEQINTTKDTSDYLWYTASIH 463
           + + L    +   W   ++ I            F  P + + ++ T+D +DY+WYT    
Sbjct: 409 SYHRLDYVCN--EWKTMQIPIPSKEKKDKEHFEFSFPHIPDMLHITQDETDYMWYTGVGT 466

Query: 464 VM-PGQGK------EVFLNIESLGHAALVFVNKKLV 492
           +  P +G+      ++ + +E+  +   VF+N+K V
Sbjct: 467 IYCPFKGENTPHCLKIHMELEAADYVH-VFLNRKYV 501


>gi|301123859|ref|XP_002909656.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
 gi|262100418|gb|EEY58470.1| beta-galactosidase, putative [Phytophthora infestans T30-4]
          Length = 706

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 199/581 (34%), Positives = 294/581 (50%), Gaps = 73/581 (12%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTY  R   IDGK+ +L  GSIHYPRS+P  W +L+R++K  GL  IE YVFWN HE  
Sbjct: 84  SVTYSPRGFEIDGKQTLLLGGSIHYPRSSPGEWEQLLREAKRDGLNHIEMYVFWNLHEQE 143

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           RG + F G  ++ RF +   E GLFLH+R GPY CAEWN GG P+WL++IPG++ R++N 
Sbjct: 144 RGVFNFAGNANITRFYELAAEVGLFLHVRFGPYVCAEWNNGGLPLWLNWIPGMEVRSSNA 203

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
           P++ EM+RF+  +++L +     A  GGPII+AQ+ENE+    W        Y+ W  + 
Sbjct: 204 PWQREMERFIRYMVELSRP--FLAKNGGPIIMAQIENEFA---WH----DPEYIAWCGNL 254

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSF 239
              L+TS+PWVMC    A + I+ +CN   C  F        PS P++WTE+  GWF ++
Sbjct: 255 VKQLDTSIPWVMCYANAAENTIL-SCNDDDCVDFAVKHVKERPSDPLVWTED-EGWFQTW 312

Query: 240 --GYAVPF----RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYD 293
                 P     R  ED+A+AVAR+F  GG   NYYMY GG N+GR A    V T Y   
Sbjct: 313 QKDKKNPLPNDQRSPEDVAYAVARWFAVGGAAHNYYMYHGGNNYGRAASAG-VTTMYADG 371

Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAK---LEAHIYHKSSNDC 350
             +   G   +PK  HLR+LH+A+  C + L+ +D   Q L  +   L      K+S+  
Sbjct: 372 VNLHSDGLSNEPKRTHLRKLHEALIECNDVLLRND--RQVLNPRELPLVDEQTVKASSQQ 429

Query: 351 AAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK 410
            AF+   ++  + +                       ++F+TA V         P  Q +
Sbjct: 430 RAFVYGPEAEPNQDGA---------------------ILFDTADV-----RKSFPGRQHR 463

Query: 411 NVNELLLASSAF--SWYEEKVGISGNRSFVRPDLA-EQINTTKDTSDYLWYTASIHVMPG 467
               L+ AS+    +W E  V  +  R  V  D   EQ+  T D SDYL Y  +    P 
Sbjct: 464 TYTPLVKASALAWKAWSELNVSSTTPRRRVVADQPIEQLRLTADQSDYLTYETTF--TPK 521

Query: 468 QGKEV--------FLNIESLGHAALV---FVNKKLVAFGYGN--HDFANFLINKKIELNE 514
           Q  +V          + E+    ALV    + ++ +A+  GN   +F+ F +   IE+  
Sbjct: 522 QLSDVDDDMWTVKVTSCEASSIIALVDGWLIGERNLAYPGGNCSKEFS-FHLPASIEVGR 580

Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKR 555
             + L ++S+ +G+ + G+       G   +   DL  G+R
Sbjct: 581 Q-HDLKLVSVSLGIYSLGSNHSKGVTGSVRIGHKDLARGQR 620


>gi|217075791|gb|ACJ86255.1| unknown [Medicago truncatula]
          Length = 267

 Score =  285 bits (730), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 146/271 (53%), Positives = 184/271 (67%), Gaps = 4/271 (1%)

Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
           MY GGTNF R+ GGP +ATSYDYDAPIDEYG IRQ KWGHL++++KAIKLCEE LI++DP
Sbjct: 1   MYHGGTNFDRSTGGPFIATSYDYDAPIDEYGIIRQQKWGHLKDVYKAIKLCEEALITTDP 60

Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
               LG  LEA +Y K+ + CAAFLAN D+ +D  V F+GN Y LPAWSVS+LPDCKNVV
Sbjct: 61  KISSLGQNLEAAVY-KTGSVCAAFLANVDTKNDKTVNFSGNSYHLPAWSVSMLPDCKNVV 119

Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
            NTAK+ S     +  F  + +++ L  +SS +SW  E VGIS +    +  L EQINTT
Sbjct: 120 LNTAKINSASAISN--FVTE-DISSLETSSSKWSWINEPVGISKDDILSKTGLLEQINTT 176

Query: 450 KDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKK 509
            D SDYLWY+ S+ +    G +  L+IESLGH    F+N KL     GN D +   ++  
Sbjct: 177 ADRSDYLWYSLSLDLADDPGSQTVLHIESLGHTLHAFINGKLAGNQAGNSDKSKLNVDIP 236

Query: 510 IELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           I L  G N +D+LS+ VGLQNYGA+FD  GA
Sbjct: 237 IALVSGKNKIDLLSLTVGLQNYGAFFDTVGA 267


>gi|16649045|gb|AAL24374.1| beta-galactosidase [Arabidopsis thaliana]
 gi|20260008|gb|AAM13351.1| beta-galactosidase [Arabidopsis thaliana]
          Length = 420

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 171/438 (39%), Positives = 233/438 (53%), Gaps = 34/438 (7%)

Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
           MY GGTNFGRT+    +   YD  AP+DEYG +RQPK+GHL+ELH AIK     L+    
Sbjct: 1   MYHGGTNFGRTSSSYFITGYYD-QAPLDEYGLLRQPKYGHLKELHAAIKSSANPLLQGKQ 59

Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
           T   LG   +A+++  ++N C AFL N D+ + + + F  N Y L   S+ IL +CKN++
Sbjct: 60  TILSLGPMQQAYVFEDANNGCVAFLVNNDAKA-SQIQFRNNAYSLSPKSIGILQNCKNLI 118

Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
           + TAKV  + N       Q  NV +       ++ + E +      S     L E  N T
Sbjct: 119 YETAKVNVKMNTRVTTPVQVFNVPD------NWNLFRETIPAFPGTSLKTNALLEHTNLT 172

Query: 450 KDTSDYLWYTASIHV-MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINK 508
           KD +DYLWYT+S  +  P     ++   ES GH   VFVN  L   G+G+ D     +  
Sbjct: 173 KDKTDYLWYTSSFKLDSPCTNPSIY--TESSGHVVHVFVNNALAGSGHGSRDIRVVKLQA 230

Query: 509 KIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGV 568
            + L  G N + ILS MVGL + GA+ +    GL  V +        DLS  +W Y VG+
Sbjct: 231 PVSLINGQNNISILSGMVGLPDSGAYMERRSYGLTKVQISCGGTKPIDLSRSQWGYSVGL 290

Query: 569 EGEYIGLDKISLANSSFWKQGST-LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
            GE + L +    N   W      L  N+ L WYKTTF  P G GP+ L+++SMGKG+ W
Sbjct: 291 LGEKVRLYQWKNLNRVKWSMNKAGLIKNRPLAWYKTTFDGPNGDGPVGLHMSSMGKGEIW 350

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW ++L P+                      GQP+Q++YHIPR ++ P  NL
Sbjct: 351 VNGESIGRYWVSFLTPA----------------------GQPSQSIYHIPRAFLKPSGNL 388

Query: 688 LVIHEELGGDPSKISLLT 705
           LV+ EE GGDP  ISL T
Sbjct: 389 LVVFEEEGGDPLGISLNT 406


>gi|452821358|gb|EME28389.1| beta-galactosidase [Galdieria sulphuraria]
          Length = 1171

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 171/482 (35%), Positives = 246/482 (51%), Gaps = 54/482 (11%)

Query: 19  RVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRF 78
           R+L   SIHYPR  P  W +LI  +KE G+  IETYVFWN HE  +G Y F GR DL  F
Sbjct: 476 RILFPASIHYPRCQPSDWQQLIEFAKEAGINCIETYVFWNQHEKEKGVYDFSGRLDLFGF 535

Query: 79  VKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIID 138
           ++T+ +AGL+  LRIGPY CAE ++GGFP WL  I GI+FRT N PF+ E  R++  +++
Sbjct: 536 IRTIAKAGLYALLRIGPYICAETHFGGFPHWLRDIDGIEFRTQNEPFQRESSRWVRFLVE 595

Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQ 198
            +   N F SQGGPI++ Q ENEY  +   YG  G  Y+KW ++ A +L   VP  MC+ 
Sbjct: 596 KLNSNNCFYSQGGPIVMVQFENEYKLIGQNYGEAGLNYLKWCSELAKDLQLPVPLFMCK- 654

Query: 199 EDAPDPIINTCNGFYCDGFTPNS----PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFA 254
             + + ++ T N FY      N     P++P +WTE ++GW+  +G A   RP +DL +A
Sbjct: 655 -GSIENVLETINDFYGHQEMENHHREYPNQPAIWTECWTGWYDVWGSAHHIRPCKDLFYA 713

Query: 255 VARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELH 314
           V RFF  GG   NYYM+ GGTN+ + A   L  TSYDYDAPIDEYG  +  K+  L+ +H
Sbjct: 714 VLRFFAQGGKGINYYMFHGGTNYDQLAMY-LQTTSYDYDAPIDEYGR-KTKKYFGLQYIH 771

Query: 315 KAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCA------------AFLANYDSSSD 362
           + +   E++  S       L  KLEA I H   ++               F  N   +S 
Sbjct: 772 RQL---EQHFAS-------LALKLEAPIAHSYEDNYVWIFIWEEQGSNCIFFCNDHPTST 821

Query: 363 ANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAF 422
             V +    Y L   SV ++ D   ++  + ++             QK +  + + +  +
Sbjct: 822 KQVQWKEQEYCLAPLSVQMVVDHHRLILKSDQLFVDEE------LIQKELKPISVTTEEW 875

Query: 423 SW--YEEKVGISG----------------NRSFVRPDLAEQINTTKDTSDYLWYTASIHV 464
           +W  Y+E +  +                 N         E +  T   +DY WY A   +
Sbjct: 876 TWQYYKENIPTTDITSSASQSSSISSLSSNTEIETQVPVEMLRYTGTATDYAWYIAHYQI 935

Query: 465 MP 466
            P
Sbjct: 936 DP 937


>gi|414879451|tpg|DAA56582.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
          Length = 249

 Score =  281 bits (719), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 123/205 (60%), Positives = 160/205 (78%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             VTYD RAL++DG RR+L SG +HYPRSTPE+WP+LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 36  GEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAHEP 95

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           ++GQ+ FEGR+DLV+F++ +   GL++ LRIGP+  +EW YGG P WL  IP I FR+ N
Sbjct: 96  VQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYGGLPFWLRGIPNITFRSDN 155

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            PFK  M++F+ KI++LMK E LF  QGGPII++Q+ENEY  VE A+   G  YV WAA 
Sbjct: 156 EPFKRHMQKFVTKIVNLMKDERLFYPQGGPIIISQIENEYKLVEAAFHSKGSSYVHWAAA 215

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIIN 207
            AVNL T VPW+MC+Q+DAPDPI++
Sbjct: 216 MAVNLQTGVPWMMCKQDDAPDPIVS 240


>gi|183604891|gb|ACC64532.1| beta-galactosidase 6 inactive isoform [Oryza sativa Indica Group]
          Length = 244

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 122/205 (59%), Positives = 157/205 (76%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L   +TYD RALV+ G RR+  SG +HY RSTPE+WP+LI K+K GGL+VI+TYVFWN H
Sbjct: 25  LGREITYDGRALVVSGARRMFFSGDMHYARSTPEMWPKLIAKAKNGGLDVIQTYVFWNVH 84

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EPI+GQY FEGR+DLV+F++ +Q  GL++ LRIGP+  AEW YGGFP WLH +P I FR+
Sbjct: 85  EPIQGQYNFEGRYDLVKFIREIQAQGLYVSLRIGPFVEAEWKYGGFPFWLHDVPSITFRS 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            N PFK+ M+ F+ KI+ +MK E L+  QGGPII++Q+ENEY  +E A+G  G  YV+WA
Sbjct: 145 DNEPFKQHMQNFVTKIVTMMKHEGLYYPQGGPIIISQIENEYQMIEPAFGASGPRYVRWA 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPI 205
           A  AV L T VPW+MC+Q DAPDP+
Sbjct: 205 AAMAVGLQTGVPWMMCKQNDAPDPV 229


>gi|212723424|ref|NP_001132807.1| uncharacterized protein LOC100194296 [Zea mays]
 gi|194695440|gb|ACF81804.1| unknown [Zea mays]
          Length = 467

 Score =  275 bits (703), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 165/488 (33%), Positives = 255/488 (52%), Gaps = 52/488 (10%)

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQ 409
           C AFL+N+++  DA +TF G  YF+P  S+S+L DC+ VVF T  V +Q N     FA Q
Sbjct: 7   CVAFLSNHNTKDDATMTFRGRPYFVPRHSISVLADCETVVFGTQHVNAQHNQRTFHFADQ 66

Query: 410 ---KNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV-- 464
               NV E+    +   + + K+ +            +  N TKD +DY+WYT+S  +  
Sbjct: 67  TAQNNVWEMFDGENVPKYKQAKIRLR--------KAGDLYNLTKDKTDYVWYTSSFKLEA 118

Query: 465 --MPGQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDI 521
             MP +   +  L + S GHA++ FVN K V  G+G      F + K ++L +G+N + +
Sbjct: 119 DDMPIRSDIKTVLEVNSHGHASVAFVNNKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAV 178

Query: 522 LSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLA 581
           L+  +G+ + GA+ +   AG+  V +  L  G  DL++  W + VG+ GE   +      
Sbjct: 179 LASSMGMTDSGAYMEHRLAGVDRVQITGLNAGTLDLTNNGWGHIVGLVGERKQIYTDKGM 238

Query: 582 NSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYL 641
            S  WK       ++ L WYK  F  P G+ P+ L++++MGKG  +VNGQ IGRYW +Y 
Sbjct: 239 GSVTWKPAMN---DRPLTWYKRHFDMPSGEDPVVLDMSTMGKGMMFVNGQGIGRYWISY- 294

Query: 642 APSTGCTKKCDYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
                                 KH  G+P+Q LYH+PR+++   +N+LV+ EE  G P  
Sbjct: 295 ----------------------KHALGRPSQQLYHVPRSFLRQKDNMLVLFEEEFGRPDA 332

Query: 701 ISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSS-------PQVRLACERGWHIAAIN 753
           I +LT    +IC+F+SE +P  + SW+     +++         +  LAC     I  + 
Sbjct: 333 IMILTVKRDNICTFISERNPAHIMSWERKDSQITAKANADDLRARAALACPPKKLIQQVV 392

Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
           FASYG P G CG++  G+CH      +V+KAC+G+  C++PV++   G  A  C G    
Sbjct: 393 FASYGNPAGICGNYTVGSCHTPRAKEVVEKACLGKRVCTLPVAADVYGGDAN-CSGTTAT 451

Query: 813 LAVEAHCS 820
           LAV+A CS
Sbjct: 452 LAVQAKCS 459


>gi|351722837|ref|NP_001235722.1| lectin [Glycine max]
 gi|217314871|gb|ACK36970.1| lectin [Glycine max]
          Length = 447

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 159/428 (37%), Positives = 239/428 (55%), Gaps = 31/428 (7%)

Query: 408 QQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPG 467
           Q ++ N+    S ++   +E + I    SF    + E +N TKD SDYLWY+  ++V   
Sbjct: 21  QLRHQNDFYYISKSWMTTKEPLNIWSKSSFTVEGIWEHLNVTKDQSDYLWYSTRVYVSDS 80

Query: 468 -----QGKEVF--LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLD 520
                +  +V   L I+ +     VF+N +L+          +      I ++ G N   
Sbjct: 81  DILFWEENDVHPKLTIDGVRDILRVFINGQLIV--------KDEQFKAVISVSIGKNDCT 132

Query: 521 ILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKIS 579
             S    + NYGA+ +  GAG+   I I   +NG  DLS   W YQVG++GE++      
Sbjct: 133 AGS----INNYGAFLEKDGAGIRGKIKITGFENGDIDLSKSLWTYQVGLQGEFLKFYSEE 188

Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
             NS  W + +   +  +  WYKT F  P G  P+AL+  SMGKGQAWVNGQ IGRYW+ 
Sbjct: 189 NENSE-WVELTPDAIPSTFTWYKTYFDVPGGIDPVALDFKSMGKGQAWVNGQHIGRYWTR 247

Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
            ++P +GC + CDYRG+Y++ KC  +CG+P QTLYH+PR+W+    NLLVI EE GG+P 
Sbjct: 248 -VSPKSGCQQVCDYRGAYNSDKCSTNCGKPTQTLYHVPRSWLKATNNLLVILEETGGNPF 306

Query: 700 KISLLTKTGQHICSFVSEADPPPV------DSWKPNLGVVSSSPQVRLACERGWHIAAIN 753
           +IS+   + + IC+ VSE++ PP+      D     +   +  P++ L C++G  I+++ 
Sbjct: 307 EISVKLHSSRIICAQVSESNYPPLQKLVNADLIGEEVSANNMIPELHLHCQQGHTISSVA 366

Query: 754 FASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKA 812
           FAS+G P G+C +F  G CH    + IV +AC G+  CSI +S +  GV    CPG++K 
Sbjct: 367 FASFGTPGGSCQNFSRGNCHAPSSMSIVSEACQGKRSCSIKISDSAFGVD--PCPGVVKT 424

Query: 813 LAVEAHCS 820
           L+VEA C+
Sbjct: 425 LSVEARCT 432


>gi|452825532|gb|EME32528.1| beta-galactosidase [Galdieria sulphuraria]
          Length = 752

 Score =  273 bits (698), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 227/766 (29%), Positives = 354/766 (46%), Gaps = 91/766 (11%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           ++D RA+ ++GKR +L  GS+ YP+     W   ++ +KE GL  ++ YVFWN HE  RG
Sbjct: 8   SFDSRAITLNGKRTLLLGGSLQYPKIHHTQWNNTLKLAKECGLNFLDIYVFWNVHEKKRG 67

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
            + F    D+ RF++   + GL + LR+GPY CAE +YGGFP WL  IPGIQFRT N+PF
Sbjct: 68  IFTFTEEADIFRFLQMAHQHGLLVMLRLGPYICAETSYGGFPCWLREIPGIQFRTYNDPF 127

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
             E+KR+L  I  L+K++ LF  QGGPI+L Q+ENEY  V       GE Y+ W  +   
Sbjct: 128 MREVKRWLFYITTLLKEKRLFFPQGGPIVLVQLENEYDLVSKIQLSKGEQYLNWYNELYR 187

Query: 186 NLNTSVPWVMCQQEDAPDPI---------------------INTCNGFY----CDGFTPN 220
            L   VP +MC+   +P+ +                     I T N FY           
Sbjct: 188 ELAFDVPLIMCR--SSPEEVGEFCSCSKEPELSTIASVETCIETFNSFYGHKKIADLRRR 245

Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
            P +PI+WTE + GW+  +  A   R  ED+ +A  RF   GG   +YYM+ GGT+F   
Sbjct: 246 KPHQPILWTEFWIGWYDIWTSAPRKRSTEDVIYAALRFIAQGGAGFSYYMFHGGTHFNNL 305

Query: 281 AGGPLVATSYDYDAPIDEYGFIRQPKWGH--LRELHKAIKLCEEYLISSD-PTHQKLGAK 337
           A      TSY +D+PIDEYG   +P +    L+ ++  +     +L+S D P    L  +
Sbjct: 306 AMYS-QTTSYYFDSPIDEYG---RPSFLFYMLKRINHILHQFSSHLLSQDHPQVLHLLPQ 361

Query: 338 LEAHIYHK-SSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVI 396
           + A I+ + SS    +FL N DS   A + F  ++  +   SV++  +   ++F++    
Sbjct: 362 VVAFIWQEHSSQQSLSFLCN-DSEQIAYIMFQQSMMKMNPLSVAVFLE-NELLFDS---- 415

Query: 397 SQRNNGDHPFAQQKNVNELLLAS-SAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDY 455
           S   +   PF   K +          F        +S +  F +  L + ++ T+D +DY
Sbjct: 416 SSGYDWQIPFRDFKPLERAYFRELKTFQLDIPIPPLSSSCDFSQ--LPDMLSVTQDETDY 473

Query: 456 LWYTASIHVMPGQGKE-----VFLNIESLGHAALVFVNKKLVAFGYGNHD---FAN---- 503
           +WY +S   +P   KE     V L IE +     +F+N++ +   +   D   FAN    
Sbjct: 474 MWYISSA-TLPVSSKEFTCEKVLLQIE-MADLIHLFINQQYMGSSWIKIDDERFANGKNG 531

Query: 504 ----------------FLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVIL 547
                           F  N K+ ++  + +L ++     L   GA  +    GLF   +
Sbjct: 532 FRFSIEFENSVYPQPVFSSNSKLYVSILVCSLGLIKGEFQLWK-GATMEKEKKGLFKQPI 590

Query: 548 IDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL----IWYKT 603
           I       +L +             + L  +    S+F K+ +   V+K L     +YK 
Sbjct: 591 IHFVVKHSELETETIPLSFTSSWAMMPLSIMKDHQSAFVKEYNIKNVDKPLSLGPTYYKQ 650

Query: 604 TFLAPEG-----KGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
           T +  +      K  L ++ +SM KG    N    GRY+S  +    G  +    R S  
Sbjct: 651 TVIINKAMIDALKWGLVIDFSSMTKGIFRWNSFCCGRYYSIQVL---GKERDPSLRNS-- 705

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLL 704
               + H  +  Q  YHIP+  V    N L + EE+GG+  ++ +L
Sbjct: 706 -PVQEDHLFKSTQRYYHIPKG-VLQERNELEVFEEIGGNFMQLRIL 749


>gi|116782829|gb|ABK22678.1| unknown [Picea sitchensis]
          Length = 317

 Score =  272 bits (695), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 149/317 (47%), Positives = 196/317 (61%), Gaps = 11/317 (3%)

Query: 510 IELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVE 569
           I L  G N + +LS+MVGL N G  F+   AG+ +V L   K+G RDLS   W YQ+G+ 
Sbjct: 6   ISLIPGTNDIALLSVMVGLPNSGGHFERKIAGISTVTLRGFKDGTRDLSQELWTYQIGLL 65

Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
           GE   +       S  W   ST   N  L WYK     P+G  P+ L+L+SMGKGQAW+N
Sbjct: 66  GEMSTIYSDVGFISVNWTSSST--PNPPLTWYKAVIDVPDGDEPVILDLSSMGKGQAWIN 123

Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
           G+ IGRYW ++LAP   C+K CDYRG+Y   KC  +CGQP+QTLYH+PR+W+ P  NLLV
Sbjct: 124 GEHIGRYWISFLAPLGDCSK-CDYRGNYSLHKCATNCGQPSQTLYHVPRSWLRPTGNLLV 182

Query: 690 IHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSW---KPNLGVVSSS--PQVRLACE 744
           + EE GGDPSK+SLLT++   +C+   E  PP + SW   K N  V+  +  P ++L C 
Sbjct: 183 LFEETGGDPSKVSLLTRSIDSVCAHAFETHPPSIQSWQKTKVNSEVLRENVEPSLQLDCS 242

Query: 745 RGWHIAAINFASYGIPEGNCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSA 803
            G  I++I FAS+G P+G CG+F  G CH ++    V+KAC+GQ  CSI  S    G   
Sbjct: 243 VGRRISSIKFASFGNPKGVCGNFMKGTCHSVESEKAVEKACLGQHGCSITNSPKEFG--G 300

Query: 804 GACPGLLKALAVEAHCS 820
            AC G +K+LAVEA CS
Sbjct: 301 DACVGTVKSLAVEATCS 317


>gi|10047451|gb|AAG12249.1|AF184080_1 beta-galactosidase [Prunus armeniaca]
          Length = 376

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 148/359 (41%), Positives = 213/359 (59%), Gaps = 14/359 (3%)

Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
           GK+  L ++S GHA  VFVN +     +G  +   F   K + L  GIN + +LS+ VGL
Sbjct: 13  GKKPTLTVQSAGHALHVFVNGQFSGSAFGTREQRQFTFAKPVHLRAGINKIALLSIAVGL 72

Query: 529 QNYGAWFDVAGAGLFSVILID-LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
            N G  ++    G+   + +D L  G++DL+  +W  +VG++GE + L   +  +S  W 
Sbjct: 73  PNVGLHYESWKTGILGPVFLDGLGQGRKDLTMQKWFNKVGLKGEAMDLVSPNGGSSVDWI 132

Query: 588 QGSTLPVNK-SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
           +GS     K +L WYK  F AP G  PLAL++ SMGKGQ W+NGQSIGRYW AY   + G
Sbjct: 133 RGSLATQTKQTLKWYKAYFNAPGGDEPLALDMRSMGKGQVWINGQSIGRYWMAY---ANG 189

Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               C Y G++  +KCQ  CGQP Q  YH+PR+W+ P +NL+V+ EELGGDPSKI+L+ +
Sbjct: 190 DCSLCSYIGTFRPTKCQLGCGQPTQRWYHVPRSWLKPTKNLMVMFEELGGDPSKITLVKR 249

Query: 707 TGQHICSFVSEADPPP----VDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEG 762
           +   +C+ + E  P      +DS + +  +  +  QV L C  G  I++I FAS+G P G
Sbjct: 250 SVAGVCADLQEHHPNAEKFDIDSHEESKTLHQA--QVHLQCVPGQSISSIKFASFGTPTG 307

Query: 763 NCGSFRPGACH-MDVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
            CGSF+ G CH  +   IV+K C+G+  C + VS++  G     CP +LK L+VEA CS
Sbjct: 308 TCGSFQQGTCHATNSHAIVEKNCIGRESCLVTVSNSIFGTD--PCPNVLKRLSVEAVCS 364


>gi|297797852|ref|XP_002866810.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312646|gb|EFH43069.1| hypothetical protein ARALYDRAFT_912308 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 448

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 125/266 (46%), Positives = 167/266 (62%), Gaps = 22/266 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           VTYD  +L+I+GKR +L S S+HYPRSTP++WP +I K++ GGL  I+TYVFWN HEP  
Sbjct: 42  VTYDGTSLIINGKRELLFSVSVHYPRSTPDMWPSIIDKARIGGLNTIQTYVFWNVHEPEH 101

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            +Y F+GRFDLV F+K +QE GL++ LR+GP+  AEWN+GG P WL  +P + FRT N P
Sbjct: 102 RKYDFKGRFDLVTFIKLIQEKGLYVTLRLGPFIQAEWNHGGLPYWLREVPEVYFRTDNEP 161

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           FKE  +R++ KI+ +MK+E L ASQ     L   ENE   V+ AY   GE Y+KWAA+  
Sbjct: 162 FKEHTERYVRKILGMMKEEKLLASQRRSHHLG-TENECNAVQLAYKENGERYIKWAANLV 220

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVP 244
            ++   +PWVMC+Q +A D +IN CNG +C                     F   G    
Sbjct: 221 ESMKLGIPWVMCKQNNASDNLINACNGRHC---------------------FEFLGILQL 259

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYM 270
               ED+AF+VAR+F   G+  NYYM
Sbjct: 260 IEQSEDIAFSVARYFSKNGSHVNYYM 285



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/107 (37%), Positives = 53/107 (49%), Gaps = 8/107 (7%)

Query: 674 YHIPRTWV--HPGENLLVI-HEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL 730
           YHIPR+++     +N+LVI  EE G     I  +      ICS+V E  P  V SWK   
Sbjct: 290 YHIPRSFMKEEKKKNMLVILEEEPGVKLEAIDFVLVNRDTICSYVGEDYPVSVKSWKRER 349

Query: 731 -GVVSSSPQVRLA----CERGWHIAAINFASYGIPEGNCGSFRPGAC 772
             + S S  +RL     C     + A+ FAS+G P G CG+F  G C
Sbjct: 350 PKIASRSKDMRLKAVMKCPPEKQMVAVEFASFGDPTGTCGNFTMGKC 396


>gi|413925746|gb|AFW65678.1| hypothetical protein ZEAMMB73_601729 [Zea mays]
          Length = 402

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/380 (37%), Positives = 213/380 (56%), Gaps = 15/380 (3%)

Query: 267 NYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLIS 326
           NYYMY GGTNFGRT+   ++   YD +AP+DE+G  ++PKWGHLR+LH A+KLC++ L+ 
Sbjct: 3   NYYMYHGGTNFGRTSAAFVMPKYYD-EAPLDEFGLYKEPKWGHLRDLHLALKLCKKALLW 61

Query: 327 SDPTHQKLGAKLEAHIYHKSSND-CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDC 385
              + +KLG + EA ++       C AFL+N+++  D  +TF G  YF+P  S+SIL DC
Sbjct: 62  GKTSTEKLGKQFEARVFEIPEQKVCVAFLSNHNTKDDVTLTFRGQSYFVPRHSISILADC 121

Query: 386 KNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQ 445
           K VVF T  V +Q N     FA Q   N +        + EEKV              + 
Sbjct: 122 KTVVFGTQHVNAQHNQRTFHFADQTTQNNVWQM-----FDEEKVPKYKQSKIRLRKAGDL 176

Query: 446 INTTKDTSDYLWYTASIHV----MP-GQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHD 500
            N TKD +DY+WYT+S  +    MP  +  +  L + S GHA++ FVN K V  G+G   
Sbjct: 177 YNLTKDKTDYVWYTSSFKLEADDMPIRRDIKTVLEVNSHGHASVAFVNTKFVGCGHGTKM 236

Query: 501 FANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSG 560
              F + K ++L +G+N + +L+  +G+ + GA+ +   AG+  V +  L  G  DL++ 
Sbjct: 237 NKAFTLEKPMDLKKGVNHVAVLASTMGMMDSGAYLEHRLAGVDRVQIKGLNAGTLDLTNN 296

Query: 561 EWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLAS 620
            W + VG+ GE   +       S  WK       ++ L WYK  F  P G+ P+ L++++
Sbjct: 297 GWGHIVGLVGEQKQIYTDKGMGSVTWKPAVN---DRPLTWYKRHFDMPSGEDPIVLDMST 353

Query: 621 MGKGQAWVNGQSIGRYWSAY 640
           MGKG  +VNGQ IGRYW +Y
Sbjct: 354 MGKGLMFVNGQGIGRYWISY 373


>gi|356554933|ref|XP_003545795.1| PREDICTED: beta-galactosidase 15-like [Glycine max]
          Length = 288

 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 117/188 (62%), Positives = 143/188 (76%), Gaps = 1/188 (0%)

Query: 153 IILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF 212
           ++L  V    G +E  YG GG+ Y KWAA  A++L   VPWVMC+Q+DAP  II+TCN +
Sbjct: 32  LVLGTVSLGVGAIENEYGKGGKEYRKWAAKKALSLGVGVPWVMCRQQDAPYDIIDTCNAY 91

Query: 213 YCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYF 272
           YCDGF PNS +KP MWTEN+ GW+  +G  +P RPVEDLAFAVA FF+ GG+FQNYYMYF
Sbjct: 92  YCDGFKPNSHNKPTMWTENWDGWYTQWGERLPHRPVEDLAFAVACFFQRGGSFQNYYMYF 151

Query: 273 GGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD-PTH 331
           G TNFGRTAGGPL  TSYDY A IDEYG +R+PKWGHL++LH A+KLCE  L+++D PT+
Sbjct: 152 GRTNFGRTAGGPLQITSYDYVASIDEYGQLREPKWGHLKDLHAALKLCEPALVATDSPTY 211

Query: 332 QKLGAKLE 339
            KLG   E
Sbjct: 212 IKLGPNQE 219


>gi|294948459|ref|XP_002785761.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
 gi|239899809|gb|EER17557.1| beta-galactosidase, putative [Perkinsus marinus ATCC 50983]
          Length = 770

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 204/702 (29%), Positives = 316/702 (45%), Gaps = 111/702 (15%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP- 62
           +VTYD RA  IDG R +L  GSIHYPR   + W  ++ +    GL  ++ YVFWNYHEP 
Sbjct: 50  SVTYDSRAFKIDGVRTLLLGGSIHYPRVAVDEWEPMLEEMGRDGLNHVQLYVFWNYHEPR 109

Query: 63  ----------IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF 112
                     +  +Y F GR DL+ F++   +  LF+ LRIGPY CAEW +GG P+WL  
Sbjct: 110 PPRYDQLKDRLEHKYDFSGRGDLLGFIRAAAKKDLFVSLRIGPYVCAEWAFGGLPLWLRD 169

Query: 113 IPGIQFRT--------------------TNNPFKEEMKRFLAKIIDLMKQENLFASQGGP 152
           + G+ FR+                    + +P+++ M  F+ +I  ++K+ NL A+QGGP
Sbjct: 170 VEGMCFRSICGYNGSPGKCKPWEGGKFRSCDPWRKYMADFVMEIGRMVKEANLMAAQGGP 229

Query: 153 IILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF 212
           +IL Q+ENEYG+    +   G  Y+ W  + +  L   VPWVMC    A +  +N CNG 
Sbjct: 230 VILGQLENEYGH----HSDAGRAYIDWVGELSFGLGLDVPWVMCNGISA-NGTLNVCNGD 284

Query: 213 YC-DGFTPNS----PSKPIMWTENYSGWFLSFGYAV--PFRPVEDLAFAVARFFETGGTF 265
            C D +  +     P +P+ WTEN  GWF ++G AV    R  E++A+ +A++   GG+ 
Sbjct: 285 DCADEYKTDHDKRWPDEPLGWTEN-EGWFDTWGGAVGNSKRSAEEMAYVLAKWVAVGGSH 343

Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
            NYYM++GG +  +     L   +Y         G   +PK  HL+ LH+ +      L+
Sbjct: 344 HNYYMWYGGNHLAQWGAASLT-NAYADGVNFHSNGLPNEPKRSHLQRLHEVLGKLNGELM 402

Query: 326 SSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYD-SSSDANVTFNGNVYFLPAWSVSIL-P 383
             +  H  +  +LE  +         AFL     S S   V +    Y +    V ++ P
Sbjct: 403 QVEDRHSVMPVQLENGVEVYEWTAGLAFLHRPACSGSPVEVHYAKATYSIACREVLVVDP 462

Query: 384 DCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLA 443
               V+F TA V       + P    + V   L A   +S  +E++ + G  +    +  
Sbjct: 463 SSSTVLFATASV-------EPPPELVRRVVATLTADR-WSMRKEEL-LHGMATVEGREPV 513

Query: 444 EQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFAN 503
           E +  +   +DY+ Y  ++    G    V L I+S           ++    + + D A+
Sbjct: 514 EHLRVSGLDTDYVTYKTTVTATEGV-TNVSLEIDS-----------RISQVFHVSVDNAS 561

Query: 504 FLINKKIELNEGINT------------------LDILSMMVGLQN---YGAWFDVAGAGL 542
            L    +++N+G NT                  L ILS  +G++N   YGA        L
Sbjct: 562 SLAATVMDVNKG-NTEWTAVAQLHNLTAGRTYDLWILSESLGVENGMLYGA-PAATEPSL 619

Query: 543 FSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG-STLPVNKSL--I 599
              I  D++  ++ +  G W    G++GE  G             QG + LP   SL   
Sbjct: 620 QKGIFGDIRLNEKSIRKGRWSMVKGLDGEVDG------------GQGKAELPCCDSLGPA 667

Query: 600 WYKTTFLAPEGKGP-----LALNLASMGKGQAWVNGQSIGRY 636
           W+   F     +       L L L     G  W+NG  IGR+
Sbjct: 668 WFVAGFTLHSVRSKSISLTLPLGLPQQAGGHIWLNGVDIGRW 709


>gi|449018329|dbj|BAM81731.1| probable beta-galactosidase [Cyanidioschyzon merolae strain 10D]
          Length = 777

 Score =  246 bits (628), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 141/339 (41%), Positives = 194/339 (57%), Gaps = 27/339 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE-- 61
            +TYD R+L I+GK     SG++HY RS P  WP++ R  +  GL  +ETYVFW  HE  
Sbjct: 9   EITYDSRSLRINGKPFFCLSGAVHYVRSHPSAWPQIFRCMRRDGLNTVETYVFWGDHEFE 68

Query: 62  -----PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI--- 113
                    +  F G  DLVRF++  +  GL   LR+GPY CAE NYGGFP WL  +   
Sbjct: 69  PPEMPDAEPRADFSGPRDLVRFLRCAKLHGLNAILRLGPYVCAEVNYGGFPWWLRQVCEK 128

Query: 114 ---PGIQFRTTNNPFKEEMKRFLAKIID-LMKQENLFASQGGPIILAQVENEYGNVEWAY 169
                ++FRT +  +  +++R+L  ++D ++K   +FA QGGP+ILAQ+ENEY  +  +Y
Sbjct: 129 GSSKPVRFRTWDPAYCAQVERWLKYLVDHVLKPARVFAPQGGPVILAQIENEYAMIAESY 188

Query: 170 GVGGELYVKWAADTAVNLNTSVPWVMC----QQEDAPDPIINTCNGFYCDGFTPN----- 220
           G  G+ Y+ W A  A  L   VP VMC    Q+E     +I T N FY      +     
Sbjct: 189 GPDGQQYLDWIASLANQLALGVPLVMCYGASQRESGR--VIETINAFYAHEHVESLRRAQ 246

Query: 221 -SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
            +  +P++WTE ++GW+  +G     R   DLA+AV RF   GG   NYYMYFGGTN+ R
Sbjct: 247 GANPQPLLWTECWTGWYDVWGAPHHRRDAADLAYAVLRFLAAGGAGINYYMYFGGTNWRR 306

Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
                L ATSYDYDAP++EY  +   K  HLR LH++I+
Sbjct: 307 ENTMYLQATSYDYDAPLNEY-VMETTKSRHLRRLHESIQ 344


>gi|449534351|ref|XP_004174126.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
          Length = 154

 Score =  245 bits (626), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 106/154 (68%), Positives = 136/154 (88%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +VTYDH+A++I+G+RR+L SGSIHYPRSTP++WP+LI+K+K+GGL++IETYVFWN HEP 
Sbjct: 1   SVTYDHKAIIINGQRRILISGSIHYPRSTPQMWPDLIQKAKDGGLDIIETYVFWNGHEPS 60

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
             +YYFE R+DLVRF+K VQ+AGL++HLRIGPY CAEWNYGGFP+WL F+PGI FRT N 
Sbjct: 61  PDKYYFEERYDLVRFIKLVQQAGLYVHLRIGPYVCAEWNYGGFPLWLKFVPGIAFRTDNA 120

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQ 157
           PFK  M++F+ KI+D+MK E LF +QGGPIIL+Q
Sbjct: 121 PFKAAMQKFVYKIVDMMKWEKLFHTQGGPIILSQ 154


>gi|343963202|gb|AEM72517.1| beta-galactosidase [Diospyros kaki]
          Length = 172

 Score =  243 bits (621), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 113/172 (65%), Positives = 131/172 (76%)

Query: 104 GGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG 163
           GGFPVWL ++PGI FRT N PFK  M+ F  KI++LMK ENLF SQGGPIIL+Q+ENEYG
Sbjct: 1   GGFPVWLKYVPGISFRTDNEPFKNAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYG 60

Query: 164 NVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPS 223
                 G  G  YV WAA+ AV L T VPWVMC++EDAPDP+INTCNGFYCD F+PN P 
Sbjct: 61  PQGKILGDAGHKYVTWAANMAVGLGTGVPWVMCKEEDAPDPVINTCNGFYCDSFSPNRPY 120

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGT 275
           KP +WTE +SGWF  FG  +  RPV+DLAFAVARF + GG+F NYYMY GGT
Sbjct: 121 KPTIWTEAWSGWFTEFGGPIHERPVQDLAFAVARFIQKGGSFFNYYMYHGGT 172


>gi|183604893|gb|ACC64533.1| beta-galactosidase 11 [Oryza sativa Indica Group]
          Length = 446

 Score =  243 bits (619), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 149/472 (31%), Positives = 225/472 (47%), Gaps = 45/472 (9%)

Query: 362 DANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSA 421
           D  V F G  +++P+ SVSIL DCK VV+NT +V  Q +        + + N +      
Sbjct: 2   DGTVVFRGEKFYVPSRSVSILADCKTVVYNTKRVFVQHSERSFHTTDETSKNNV------ 55

Query: 422 FSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP-GQGKEVFLNI 476
           +  Y E +              EQ N TKDTSDYLWYT S  +    +P  +     + I
Sbjct: 56  WEMYSEAIPKFRKTKVRTKQPLEQYNQTKDTSDYLWYTTSFRLESDDLPFRRDIRPVIQI 115

Query: 477 ESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFD 536
           +S  HA + F N   V  G G+    +F+  K ++L  GIN + +LS  +G+++ G    
Sbjct: 116 KSTAHAMIGFANDAFVGTGRGSKREKSFVFEKPMDLRVGINHIAMLSSSMGMKDSGGELV 175

Query: 537 VAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGST-LPVN 595
               G+   ++  L  G  DL    W ++  +EGE   +          WK     LP+ 
Sbjct: 176 EVKGGIQDCVVQGLNTGTLDLQGNGWGHKARLEGEDKEIYTEKGMAQFQWKPAENDLPIT 235

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
               WYK  F  P+G  P+ ++++SM KG  +VNG+ IGRYW++++  +           
Sbjct: 236 ----WYKRYFDEPDGDDPIVVDMSSMSKGMIYVNGEGIGRYWTSFITLA----------- 280

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFV 715
                      G P+Q++YHIPR ++ P  NLL+I EE  G P  I + T     IC F+
Sbjct: 281 -----------GHPSQSVYHIPRAFLKPKGNLLIIFEEELGKPGGILIQTVRRDDICVFI 329

Query: 716 SEADPPPVDSWKPNLGVV-----SSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPG 770
           SE +P  + +W+ + G +      +S +  L C     I  + FAS+G PEG CG+F  G
Sbjct: 330 SEHNPAQIKTWESDGGQIKLIAEDTSTRGTLNCPPKRTIQEVVFASFGNPEGACGNFTAG 389

Query: 771 ACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCSI 821
            CH  D   IV+K C+G+  C +PV +   G     CP     LAV+  C +
Sbjct: 390 TCHTPDAKAIVEKECLGKESCVLPVVNTVYGADIN-CPATTATLAVQVRCKV 440


>gi|16973314|emb|CAC84109.1| putative galactosidae, partial [Gossypium hirsutum]
          Length = 383

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 148/416 (35%), Positives = 218/416 (52%), Gaps = 44/416 (10%)

Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAA 352
            P+DE+G  R+PKWGHL+++H+A+ LC+  L    PT  KLG   +A ++ +  ++ CAA
Sbjct: 4   GPLDEFGLQREPKWGHLKDVHRALSLCKRALFWGFPTTLKLGPDQQAIVWQQPGTSACAA 63

Query: 353 FLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNV 412
            LAN ++    +V F G    LPA S+S+LPDCK VVFNT  V +Q N+        +N 
Sbjct: 64  LLANNNTRLAQHVNFRGQDIRLPARSISVLPDCKTVVFNTQLVTTQHNS--------RNF 115

Query: 413 NELLLASSAFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHV----MP 466
               +A+  F+W  Y E   +     F  P   E  + TKDT+DY WYT S+ +    +P
Sbjct: 116 VRSEIANKNFNWEMYREVPPVGLGFKFDVP--RELFHLTKDTTDYAWYTTSLLLGRRDLP 173

Query: 467 GQGK-EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMM 525
            +      L + SLGH    +VN +     +G+    +F+  +   L EG N + +L  +
Sbjct: 174 MKKNVRPVLRVASLGHGIHAYVNGEYAGSAHGSKVEKSFVCRELSSLKEGENHIALLGYL 233

Query: 526 VGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
           VGL + GA+ +   AG  S+ ++ L  G  D+S   W +QVG +GE   L     + S  
Sbjct: 234 VGLPDSGAYMEKRFAGPRSITILGLNTGTLDISQNGWGHQVGTDGEKKKLFTEEGSKSVQ 293

Query: 586 WKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
           W +         L WYK  F APEG  P+A+ +  MGKG  WVNG+SIGRYW+ YL+P  
Sbjct: 294 WTKPDQ---GGPLTWYKGYFDAPEGDNPVAIVMTGMGKGMVWVNGRSIGRYWNNYLSP-- 348

Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKI 701
                                 +P Q+ YHIPR ++ P +NL+V+ EE GG+P  +
Sbjct: 349 --------------------LKKPTQSEYHIPRAYLKP-KNLIVLLEEEGGNPKDV 383


>gi|218188529|gb|EEC70956.1| hypothetical protein OsI_02569 [Oryza sativa Indica Group]
          Length = 480

 Score =  238 bits (607), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 133/326 (40%), Positives = 182/326 (55%), Gaps = 16/326 (4%)

Query: 496 YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILID-LKNGK 554
           YG+ D         ++L  G NT+  LS+ VGL N G  F+   AG+   + +D L  G+
Sbjct: 168 YGSVDDPKLTYTGNVKLWAGSNTISCLSIAVGLPNVGEHFETWNAGILGPVTLDGLNEGR 227

Query: 555 RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPL 614
           RDL+  +W YQVG++GE   L  +S +++  W +      N +       F AP+G  PL
Sbjct: 228 RDLTWQKWTYQVGLKGESTTLHSLSGSSTVEWGEPVQNASNMAF------FNAPDGDEPL 281

Query: 615 ALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLY 674
           AL+++SMGKGQ W+NGQ IGRYW  Y A  +G    CDYRG YD +KCQ +CG  +Q  Y
Sbjct: 282 ALDMSSMGKGQIWINGQGIGRYWPGYKA--SGNCGTCDYRGEYDETKCQTNCGDSSQRWY 339

Query: 675 HIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVS 734
           H+PR+W+ P  NLLVI EE GGDP+ IS++ ++   +C+ VSE   P + +W        
Sbjct: 340 HVPRSWLSPTGNLLVIFEEWGGDPTGISMVKRSIGSVCADVSEWQ-PSMKNWHTK---DY 395

Query: 735 SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIP 793
              +V L C+ G  I  I FAS+G P+G+CGS+  G CH      I  K CVGQ  C + 
Sbjct: 396 EKAKVHLQCDNGQKITEIKFASFGTPQGSCGSYTEGGCHAHKSYDIFWKNCVGQERCGVS 455

Query: 794 VSSAYLGVSAGACPGLLKALAVEAHC 819
           V     G     CPG +K   VEA C
Sbjct: 456 VVPEIFG--GDPCPGTMKRAVVEAIC 479



 Score =  204 bits (518), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 88/151 (58%), Positives = 113/151 (74%)

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
           M++F  KI+++MK E LF  QGGPIIL+Q+ENE+G +EW  G   + Y  WAA+ AV LN
Sbjct: 1   MQKFTTKIVEMMKSEGLFEWQGGPIILSQIENEFGPLEWDQGEPAKAYASWAANMAVALN 60

Query: 189 TSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPV 248
           TSVPW+MC+++DAPDPIINTCNGFYCD F+PN P KP MWTE ++ W+  FG  VP RPV
Sbjct: 61  TSVPWIMCKEDDAPDPIINTCNGFYCDWFSPNKPHKPTMWTEAWTAWYTGFGIPVPHRPV 120

Query: 249 EDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
           EDLA+ VA+F + GG+F NYYM+     F +
Sbjct: 121 EDLAYGVAKFIQKGGSFVNYYMFLNLRGFTK 151


>gi|357483853|ref|XP_003612213.1| Beta-galactosidase [Medicago truncatula]
 gi|355513548|gb|AES95171.1| Beta-galactosidase [Medicago truncatula]
          Length = 418

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 132/319 (41%), Positives = 179/319 (56%), Gaps = 39/319 (12%)

Query: 24  GSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQ 83
           GS+HYPR  PE+WP++ +K+K                     Q+ FEG +DL++F+K + 
Sbjct: 11  GSVHYPRCPPEMWPDIFKKAK---------------------QFNFEGNYDLIKFIKMIG 49

Query: 84  EAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQE 143
                 HL +  ++  E      P+WL  IP I FR+ N PF   M++F   II  M+ E
Sbjct: 50  IMICMQHLEL-VHSLKE-----LPIWLREIPNIIFRSDNQPFMYHMEQFTKMIIKKMRDE 103

Query: 144 NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPD 203
             F  +       Q+ENE+  V+ AY   G  YV+W  + AV L+T VPW+MC+Q +A  
Sbjct: 104 KFFPRK-------QIENEHTAVQQAYKEHGMRYVQWEGNMAVGLDTGVPWIMCKQVNALG 156

Query: 204 PIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFET 261
           P++NTCNG YC D F+ PN  S   +   +Y   + +FG     R  ED+A AVARFF  
Sbjct: 157 PVMNTCNGRYCGDTFSGPNKNSHLNIHLRHYR--YRAFGDPPSERTAEDIAIAVARFFSK 214

Query: 262 GGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCE 321
            GT  NYYMY+GGTNFGRT+    V T Y  +API EYG  R+PKWGH R+LH A+KLC+
Sbjct: 215 KGTMANYYMYYGGTNFGRTSSS-FVTTQYYDEAPIVEYGLPREPKWGHFRDLHDALKLCQ 273

Query: 322 EYLISSDPTHQKLGAKLEA 340
           + L+      Q LG  LE 
Sbjct: 274 KALLWGTQPVQMLGKDLEV 292



 Score = 86.7 bits (213), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 47/121 (38%), Positives = 59/121 (48%), Gaps = 5/121 (4%)

Query: 658 DASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSE 717
           D    QK  G     LYH PR  + P  N LV+ EE+GG    I +LT     ICS   E
Sbjct: 289 DLEVGQKQFGSYVSMLYHTPRAILQPKNNFLVVLEEMGGKLDGIEILTVNRDTICSIAGE 348

Query: 718 ADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNCGSFRPGAC 772
             PP V++W    GV+ ++     P   L C     I  ++FASYG P GNCG F  G C
Sbjct: 349 HYPPNVETWSRYKGVIRTNVDTPKPAANLVCLDNKTITQVDFASYGDPVGNCGHFILGKC 408

Query: 773 H 773
           +
Sbjct: 409 N 409


>gi|343963204|gb|AEM72518.1| beta-galactosidase [Diospyros kaki]
          Length = 173

 Score =  236 bits (601), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 104/169 (61%), Positives = 127/169 (75%)

Query: 105 GFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN 164
           GF     ++PGI FRT N PFK  M++F  KI+++MK E LF  QGGPII++Q+ENEYG 
Sbjct: 3   GFSCLAQYVPGIAFRTDNGPFKAAMQKFTEKIVNMMKSEKLFEPQGGPIIMSQIENEYGP 62

Query: 165 VEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK 224
           VEW  G  G+ Y KWAA  AV LNT VPW+MC+QEDAPDP+I+TCNGFYC+GF PN   K
Sbjct: 63  VEWEIGAPGKSYTKWAAQMAVGLNTGVPWIMCKQEDAPDPVIDTCNGFYCEGFRPNKNYK 122

Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFG 273
           P MWTEN++GW+  FG   P+RPVEDLAF+VARF +  G+F NYYMY G
Sbjct: 123 PKMWTENWTGWYTKFGGPAPYRPVEDLAFSVARFIQNNGSFVNYYMYHG 171


>gi|62529271|gb|AAX84941.1| beta-galactosidase [Prunus persica]
          Length = 287

 Score =  236 bits (601), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 130/295 (44%), Positives = 175/295 (59%), Gaps = 15/295 (5%)

Query: 283 GPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
           GP +ATSYDYDAP+DEYG  R+PKWGHLR+LHKAIK  E  L+S++P+   LG   EAH+
Sbjct: 1   GPFMATSYDYDAPLDEYGLPREPKWGHLRDLHKAIKSSESALVSAEPSVTSLGNGQEAHV 60

Query: 343 YHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNG 402
           + KS + CAAFLANYD+ S A V+F    Y LP WS+SILPDCK  V+NTA++ SQ    
Sbjct: 61  F-KSKSGCAAFLANYDTKSSAKVSFGNGQYELPPWSISILPDCKTAVYNTARLGSQ---- 115

Query: 403 DHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASI 462
               + Q  +  +  A    S+ EE      + +     L EQIN T+DT+DYLWY   I
Sbjct: 116 ----SSQMKMTPVKSALPWQSFVEESASSDESDTTTLDGLWEQINVTRDTTDYLWYMTDI 171

Query: 463 HVMPGQ-----GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN 517
            + P +     G+   L I S GHA  VF+N +L    YG  +      ++ ++L  GIN
Sbjct: 172 TISPDEGFIKRGESPLLTIYSAGHALHVFINGQLSGTVYGALENPKLTFSQNVKLRSGIN 231

Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGE 571
            L +LS+ VGL N G  F+   AG+   V L  L +G  D+S  +W Y+ G++GE
Sbjct: 232 KLALLSISVGLPNVGLHFETWNAGVLGPVTLKGLNSGTWDMSRWKWTYKTGLKGE 286


>gi|3021342|emb|CAA06310.1| beta-galactosidase [Cicer arietinum]
          Length = 307

 Score =  233 bits (593), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 127/298 (42%), Positives = 174/298 (58%), Gaps = 11/298 (3%)

Query: 419 SSAFSW--YEEKVGISG-NRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQG-----K 470
           SSAF W  Y E    SG + S     L EQI  T+D+SDYLWY   +++ P +G     +
Sbjct: 12  SSAFDWQSYNEAPASSGIDDSTTANALLEQIKVTRDSSDYLWYMTDVNISPNEGFIKNGQ 71

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
              L   S GH   VFVN +     YG  +      +  ++L  G N + +LS+ VGL N
Sbjct: 72  YPVLTAMSAGHVLHVFVNGQFSGTAYGGLENPKLTFSNSVKLRVGNNKISLLSVAVGLSN 131

Query: 531 YGAWFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG 589
            G  ++    G+   V L  L  G RDLS  +W Y++G++GE + L  +  ++S  W +G
Sbjct: 132 VGLHYETWNVGVLGPVTLKGLNEGTRDLSGQKWSYKIGLKGETLNLHTLIGSSSVQWTKG 191

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           S+L   + L WYK TF AP G  PLAL+++SMGKG+ WVNG+SIGR+W AY+A   G   
Sbjct: 192 SSLVEKQPLTWYKATFDAPAGNDPLALDMSSMGKGEIWVNGESIGRHWPAYIA--RGSCG 249

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
            C+Y G++   KC+  CGQP Q  YHIPR+WV+P  N LV+ EE GGDPS ISL+ +T
Sbjct: 250 GCNYAGTFTDKKCRTSCGQPTQKWYHIPRSWVNPRGNFLVVLEEWGGDPSGISLVKRT 307


>gi|147778844|emb|CAN67049.1| hypothetical protein VITISV_001154 [Vitis vinifera]
          Length = 317

 Score =  229 bits (583), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 128/298 (42%), Positives = 174/298 (58%), Gaps = 23/298 (7%)

Query: 530 NYGAWFDVAGAGLF-SVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
           NYGA+ +  GAG    V L   KNG+ DLS   W YQVG+ GE+  +  I  +  + W  
Sbjct: 26  NYGAFLEKDGAGFKGQVKLTGFKNGEIDLSEYSWTYQVGLRGEFQKIYMIDESEKAEWTD 85

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
            +      +  WYKT F AP G+ P+AL+L SMGKGQAWVNG  IGRYW+  +AP  GC 
Sbjct: 86  LTPDASPSTFTWYKTFFDAPNGENPVALDLGSMGKGQAWVNGHHIGRYWT-RVAPKDGC- 143

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTG 708
            KCDYRG Y  SK            YHIPR+W+    NLLV+ EE GG P +IS+ +++ 
Sbjct: 144 GKCDYRGHYHTSK------------YHIPRSWLQASNNLLVLFEETGGKPFEISVKSRST 191

Query: 709 QHICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGN 763
           Q IC+ VSE+  P + +W P+  +  +S     P++ L C+ G  I++I FASYG P+G+
Sbjct: 192 QTICAEVSESHYPSLQNWSPSDFIDQNSKNKMTPEMHLQCDDGHTISSIEFASYGTPQGS 251

Query: 764 CGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           C  F  G CH  + L +V KAC G+  C I + ++  G     C G++K LAVEA C+
Sbjct: 252 CQMFSQGQCHAPNSLALVSKACQGKGSCVIRILNSAFG--GDPCRGIVKTLAVEAKCA 307


>gi|300122832|emb|CBK23839.2| unnamed protein product [Blastocystis hominis]
          Length = 601

 Score =  228 bits (580), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 185/649 (28%), Positives = 294/649 (45%), Gaps = 88/649 (13%)

Query: 91  LRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQG 150
           +RIGPY CAEW+ GG PVW++++ G++ R  N+ +K+EM  ++  + D  +  + FA +G
Sbjct: 1   MRIGPYVCAEWDNGGIPVWVNYLDGVRLRANNDVWKKEMGDWMKVLTDYTR--DFFADRG 58

Query: 151 GPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN 210
           GPII +Q+ENE     W    G   Y+ W  + A +L  +VPW+MC   D  +  IN CN
Sbjct: 59  GPIIFSQIENEL----WG---GAREYIDWCGEFAESLELNVPWMMCNG-DTSEKTINACN 110

Query: 211 GFYCDGFTPNSP-------SKPIMWTENYSGWFLSFGYAVP---------FRPVEDLAFA 254
           G  C  +  +          +P  WTEN  GWF   G A            R  ED  F 
Sbjct: 111 GNDCSSYLESHGQSGRILVDQPGCWTEN-EGWFQIHGAASAERDDYEGWDARSAEDYTFN 169

Query: 255 VARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELH 314
           V +F + GG++ NYYM+FGG ++G+ AG  +    Y     I       +PK  H  ++H
Sbjct: 170 VLKFMDRGGSYHNYYMWFGGNHYGKWAGNGMT-NWYTNGVMIHSDTLPNEPKHSHTAKMH 228

Query: 315 KAIKLCEEYLISSDP---THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNV 371
           + +    E L++        + L         ++  +   +F+ N   S+D  V +   V
Sbjct: 229 RMLANIAEVLLNDKAQVNNQKHLNCDNCNAFEYRYGDRLVSFVENSKGSADK-VIYRDIV 287

Query: 372 YFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQK--NVNELLLASSAFSWYEEKV 429
           Y LPAWS+ +L +  NV+F T  V         P  + +  +  E L     F ++ E V
Sbjct: 288 YELPAWSMIVLDEYDNVLFETNNV--------KPVNKHRVYHCEEKL----EFEYWNEPV 335

Query: 430 GI---SGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALV- 485
                   R  V P   EQ+N T+D +++L+Y   +        E  L+I      A V 
Sbjct: 336 STLSQEAPRVVVSPKANEQLNMTRDLTEFLYYETEVEFPQ---DECTLSIGGTDANAFVA 392

Query: 486 FVNKKLVAFG--YGNHDFANFLINKKIELNEGINTLDILSMMVGLQN-YGAWFDVAGA-G 541
           +V+   V     + +HD     +N  ++  +G + L +LS  +G+ N   +  D + A  
Sbjct: 393 YVDDHFVGSDDEHTHHD-GWHTMNINMKSGKGKHKLVLLSESLGVSNGMDSNLDPSWASS 451

Query: 542 LFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWY 601
               I   +K    D+ + EW +  G+ GE   +       +  WK  S +    +L WY
Sbjct: 452 RLKGICGWIKLCGNDIFNQEWKHYPGLVGEAKQVFTDEGMKTVTWK--SDVENADNLAWY 509

Query: 602 KTTFLAPEG--KG-PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
           ++TF  P+G  +G  + L    M +GQA+ NG +IGRYW                     
Sbjct: 510 RSTFKTPQGLKRGIEVLLRPEGMNRGQAYANGHNIGRYW--------------------- 548

Query: 659 ASKCQKHCGQPAQTLYHIPRTWV--HPGENLLVIHEELGGDPSKISLLT 705
               +   G+  Q  YHIP+ W+     EN+LV+ E LG     +++ T
Sbjct: 549 --MIKDGNGEYTQGFYHIPKDWLKGEGEENVLVLGETLGASDPSVTICT 595


>gi|62321607|dbj|BAD95183.1| beta-galactosidase like protein [Arabidopsis thaliana]
          Length = 275

 Score =  224 bits (572), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 121/270 (44%), Positives = 166/270 (61%), Gaps = 11/270 (4%)

Query: 556 DLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS-TLPVNKSLIWYKTTFLAPEGKGPL 614
           DLS  +W YQVG++GE + L   +   S  W   S T+   + L W+KT F APEG  PL
Sbjct: 2   DLSWQKWTYQVGLKGEAMNLAFPTNTPSIGWMDASLTVQKPQPLTWHKTYFDAPEGNEPL 61

Query: 615 ALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLY 674
           AL++  MGKGQ WVNG+SIGRYW+A+   +TG    C Y G+Y  +KCQ  CGQP Q  Y
Sbjct: 62  ALDMEGMGKGQIWVNGESIGRYWTAF---ATGDCSHCSYTGTYKPNKCQTGCGQPTQRWY 118

Query: 675 HIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN---LG 731
           H+PR W+ P +NLLVI EELGG+PS +SL+ ++   +C+ VSE   P + +W+      G
Sbjct: 119 HVPRAWLKPSQNLLVIFEELGGNPSTVSLVKRSVSGVCAEVSEYH-PNIKNWQIESYGKG 177

Query: 732 VVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHMDV-LPIVQKACVGQIEC 790
                P+V L C  G  IA+I FAS+G P G CGS++ G CH      I+++ CVG+  C
Sbjct: 178 QTFHRPKVHLKCSPGQAIASIKFASFGTPLGTCGSYQQGECHAATSYAILERKCVGKARC 237

Query: 791 SIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           ++ +S++  G     CP +LK L VEA C+
Sbjct: 238 AVTISNSNFG--KDPCPNVLKRLTVEAVCA 265


>gi|223945899|gb|ACN27033.1| unknown [Zea mays]
          Length = 296

 Score =  223 bits (567), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 122/295 (41%), Positives = 171/295 (57%), Gaps = 13/295 (4%)

Query: 421 AFSW--YEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQ-----GKEVF 473
            FSW  Y E       R+F +  L EQ++ T D SDYLWYT  +++   +     G+   
Sbjct: 6   GFSWQSYSEATNSLDGRAFTKDGLVEQLSMTWDKSDYLWYTTYVNINSNEQFLKSGQWPQ 65

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I S GH+  VFVN +     YG +D      +  +++ +G N + ILS  VGL N G 
Sbjct: 66  LTIYSAGHSLQVFVNGQSYGAVYGGYDSPKLTYSGYVKMWQGSNKISILSAAVGLPNQGT 125

Query: 534 WFDVAGAGLFS-VILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
            ++    G+   V L  L  GKRDLS  +W YQ+G+ GE +G+  ++ ++S  W   +  
Sbjct: 126 HYETWNVGVLGPVTLSGLNEGKRDLSDQKWTYQIGLHGESLGVQSVAGSSSVEWGSAAG- 184

Query: 593 PVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCD 652
              + L W+K  F AP G  P+AL++ SMGKGQAWVNG+ IGRYWS Y A S+GC   C 
Sbjct: 185 --KQPLTWHKAYFSAPSGDAPVALDMGSMGKGQAWVNGRHIGRYWS-YKASSSGC-GGCS 240

Query: 653 YRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           Y G+Y  +KCQ  CG  +Q  YH+PR+W++P  NLLV+ EE GGD S + L+T+T
Sbjct: 241 YAGTYSETKCQTGCGDVSQRYYHVPRSWLNPSGNLLVMLEEFGGDLSGVKLVTRT 295


>gi|297734971|emb|CBI17333.3| unnamed protein product [Vitis vinifera]
          Length = 447

 Score =  221 bits (564), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 156/433 (36%), Positives = 221/433 (51%), Gaps = 36/433 (8%)

Query: 195 MCQQEDAPDPIINTCNGFYC-DGFT-PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLA 252
           MC+Q+DAPDP+INTC G  C D FT PN P+K  + TE      L     +         
Sbjct: 1   MCKQKDAPDPVINTCKGRNCGDTFTGPNRPNKRSVSTEYLETPHLKGQQKI--------- 51

Query: 253 FAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
              + F    GT  NYYMY+  TNFGRT       T Y  +AP+DEYG  R+ KWGHLR+
Sbjct: 52  -LHSLFISKNGTLANYYMYYSVTNFGRTTSS-FATTCYYDEAPLDEYGLPRETKWGHLRD 109

Query: 313 LHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK-SSNDCAAFLANYDSSSDANVTFNGNV 371
           LH A++L ++ L+    + QKLG  LEA IY K  SN CA FL N  + +    T  G+ 
Sbjct: 110 LHAALRLSKKALLWGVTSAQKLGEDLEARIYEKPGSNICATFLLNNITRTPTTTTLRGSK 169

Query: 372 YFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGI 431
           Y+LP  S+S LPDCK VVFNT  V S  N    PF+   ++NE  + + A   YEE    
Sbjct: 170 YYLPQHSISNLPDCKTVVFNTQTVAS--NYLIFPFSMFDSLNEPNMKTDALPTYEE--CP 225

Query: 432 SGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKK- 490
           +  +S V     E +  TKDT+DYLWYT    V+          + +LGH    F+N + 
Sbjct: 226 TKTKSPV-----ELMTMTKDTTDYLWYTTKKDVL------RVPQVSNLGHVMHAFLNGEY 274

Query: 491 -----LVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSV 545
                L    +G++   +F+ NK I L  G+N +  L   VGL + G++ +   AG+ +V
Sbjct: 275 VMEFYLTGTRHGSNVEKSFVFNKPITLKAGLNQIAPLGATVGLPDSGSYMEHRLAGVHNV 334

Query: 546 ILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKS-LIWYKTT 604
            +  L     DL    W ++VG+ G+ + L     + S +    + L  + + L+ ++ T
Sbjct: 335 AIQGLNTRTIDLPKNGWGHKVGLNGDKLHLFTQPPSQSVYHVPRAFLKTSDNLLVLFEET 394

Query: 605 FLAPEGKGPLALN 617
              P+G   L LN
Sbjct: 395 GRNPDGIEILTLN 407



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 38/59 (64%)

Query: 669 PAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWK 727
           P+Q++YH+PR ++   +NLLV+ EE G +P  I +LT     IC ++SE  P  V SWK
Sbjct: 369 PSQSVYHVPRAFLKTSDNLLVLFEETGRNPDGIEILTLNRDTICCYISEHHPTHVRSWK 427


>gi|62319263|dbj|BAD94489.1| beta-galactosidase [Arabidopsis thaliana]
          Length = 172

 Score =  221 bits (562), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 101/159 (63%), Positives = 126/159 (79%), Gaps = 1/159 (0%)

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           A+ L+T VPW+MC+QEDAP PII+TCNG+YC+ F PNS +KP MWTEN++GW+  FG AV
Sbjct: 2   ALGLSTGVPWIMCKQEDAPGPIIDTCNGYYCEDFKPNSINKPKMWTENWTGWYTDFGGAV 61

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIR 303
           P+RPVED+A++VARF + GG+  NYYMY GGTNF RTA G  +A+SYDYDAP+DEYG  R
Sbjct: 62  PYRPVEDIAYSVARFIQKGGSLVNYYMYHGGTNFDRTA-GEFMASSYDYDAPLDEYGLPR 120

Query: 304 QPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
           +PK+ HL+ LHKAIKL E  L+S+D T   LGAK E  I
Sbjct: 121 EPKYSHLKALHKAIKLSEPALLSADATVTSLGAKQEVTI 159


>gi|62321782|dbj|BAD95407.1| galactosidase [Arabidopsis thaliana]
          Length = 270

 Score =  216 bits (549), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 116/274 (42%), Positives = 165/274 (60%), Gaps = 7/274 (2%)

Query: 550 LKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPE 609
           L  G+RDLS  +W Y+VG++GE + L  +S ++S  W +G+ +   + L WYKTTF AP 
Sbjct: 1   LNGGRRDLSWQKWTYKVGLKGESLSLHSLSGSSSVEWAEGAFVAQKQPLTWYKTTFSAPA 60

Query: 610 GKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQP 669
           G  PLA+++ SMGKGQ W+NGQS+GR+W AY A   G   +C Y G++   KC ++CG+ 
Sbjct: 61  GDSPLAVDMGSMGKGQIWINGQSLGRHWPAYKA--VGSCSECSYTGTFREDKCLRNCGEA 118

Query: 670 AQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN 729
           +Q  YH+PR+W+ P  NLLV+ EE GGDP+ I+L+ +    +C+ + E     V+     
Sbjct: 119 SQRWYHVPRSWLKPSGNLLVVFEEWGGDPNGITLVRREVDSVCADIYEWQSTLVNYQLHA 178

Query: 730 LGVVSS--SPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVG 786
            G V+    P+  L C  G  I  + FAS+G PEG CGS+R G+CH         K CVG
Sbjct: 179 SGKVNKPLHPKAHLQCGPGQKITTVKFASFGTPEGTCGSYRQGSCHAHHSYDAFNKLCVG 238

Query: 787 QIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           Q  CS+ V+    G     CP ++K LAVEA C+
Sbjct: 239 QNWCSVTVAPEMFG--GDPCPNVMKKLAVEAVCA 270


>gi|293334807|ref|NP_001170541.1| uncharacterized protein LOC100384558 [Zea mays]
 gi|238005922|gb|ACR33996.1| unknown [Zea mays]
          Length = 345

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 123/357 (34%), Positives = 189/357 (52%), Gaps = 34/357 (9%)

Query: 471 EVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQN 530
           +  L + S GHA++ FVN K V  G+G      F + K ++L +G+N + +L+  +G+ +
Sbjct: 8   KTVLEVNSHGHASVAFVNTKFVGCGHGTKMNKAFTLEKPMDLKKGVNHVAVLASTMGMMD 67

Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
            GA+ +   AG+  V +  L  G  DL++  W + VG+ GE   +       S  WK   
Sbjct: 68  SGAYLEHRLAGVDRVQIKGLNAGTLDLTNNGWGHIVGLVGEQKQIYTDKGMGSVTWKPAV 127

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
               ++ L WYK  F  P G+ P+ L++++MGKG  +VNGQ IGRYW +Y          
Sbjct: 128 N---DRPLTWYKRHFDMPSGEDPIVLDMSTMGKGLMFVNGQGIGRYWISY---------- 174

Query: 651 CDYRGSYDASKCQKHC-GQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQ 709
                        KH  G+P+Q LYHIPR+++   +N+LV+ EE  G P  I +LT    
Sbjct: 175 -------------KHALGRPSQQLYHIPRSFLRQKDNVLVLFEEEFGRPDAIMILTVKRD 221

Query: 710 HICSFVSEADPPPVDSWKPNLGVVSSS-----PQVRLACERGWHIAAINFASYGIPEGNC 764
           +IC+F+SE +P  + SW+     ++ +     P+  L C     I  + FASYG P G C
Sbjct: 222 NICTFISERNPAHIKSWERKDSQITVTAADLKPRATLTCSPKKLIQQVVFASYGNPMGIC 281

Query: 765 GSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           G++  G+CH      +V+KAC+G+  C++PVS+   G     CPG    LAV+A CS
Sbjct: 282 GNYTIGSCHTPRAKELVEKACLGKRICTLPVSADVYGGDVN-CPGTTATLAVQAKCS 337


>gi|125536445|gb|EAY82933.1| hypothetical protein OsI_38150 [Oryza sativa Indica Group]
          Length = 314

 Score =  209 bits (533), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 108/224 (48%), Positives = 141/224 (62%), Gaps = 9/224 (4%)

Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
           +T F  P+G  P+A++L SMGKGQAWVNG  IGRYWS  +AP +GC+  C Y G+Y+  K
Sbjct: 83  ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCSSSCYYPGAYNERK 141

Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
           CQ +CG P Q  YHIPR W+   +NLLV+ EE GGDPS ISL     + +CS +SE   P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKAVCSRISENYYP 201

Query: 722 PVDSW----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DV 776
           P+ +W         V +++P++RL C+ G  I+ I FASYG P G C +F  G CH    
Sbjct: 202 PLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASST 261

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           L +V +ACVG  +C+I VS+   G     C G+LK LAVEA CS
Sbjct: 262 LDLVTEACVGNTKCAISVSNDVFG---DPCRGVLKDLAVEAKCS 302


>gi|222616996|gb|EEE53128.1| hypothetical protein OsJ_35926 [Oryza sativa Japonica Group]
          Length = 314

 Score =  209 bits (533), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 108/224 (48%), Positives = 141/224 (62%), Gaps = 9/224 (4%)

Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
           +T F  P+G  P+A++L SMGKGQAWVNG  IGRYWS  +AP +GC+  C Y G+Y+  K
Sbjct: 83  ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCSSSCYYPGAYNERK 141

Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
           CQ +CG P Q  YHIPR W+   +NLLV+ EE GGDPS ISL     + +CS +SE   P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201

Query: 722 PVDSW----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DV 776
           P+ +W         V +++P++RL C+ G  I+ I FASYG P G C +F  G CH    
Sbjct: 202 PLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASST 261

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           L +V +ACVG  +C+I VS+   G     C G+LK LAVEA CS
Sbjct: 262 LDLVTEACVGNTKCAISVSNDVFG---DPCRGVLKDLAVEAKCS 302


>gi|77554857|gb|ABA97653.1| Galactose binding lectin domain containing protein, expressed
           [Oryza sativa Japonica Group]
          Length = 317

 Score =  209 bits (533), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 108/224 (48%), Positives = 141/224 (62%), Gaps = 9/224 (4%)

Query: 602 KTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASK 661
           +T F  P+G  P+A++L SMGKGQAWVNG  IGRYWS  +AP +GC+  C Y G+Y+  K
Sbjct: 83  ETMFSTPKGTDPVAIDLGSMGKGQAWVNGHLIGRYWS-LVAPESGCSSSCYYPGAYNERK 141

Query: 662 CQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPP 721
           CQ +CG P Q  YHIPR W+   +NLLV+ EE GGDPS ISL     + +CS +SE   P
Sbjct: 142 CQSNCGMPTQNWYHIPREWLKESDNLLVLFEETGGDPSLISLEAHYAKTVCSRISENYYP 201

Query: 722 PVDSW----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DV 776
           P+ +W         V +++P++RL C+ G  I+ I FASYG P G C +F  G CH    
Sbjct: 202 PLSAWSHLSSGRASVNAATPELRLQCDDGHVISEITFASYGTPSGGCLNFSKGNCHASST 261

Query: 777 LPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHCS 820
           L +V +ACVG  +C+I VS+   G     C G+LK LAVEA CS
Sbjct: 262 LDLVTEACVGNTKCAISVSNDVFG---DPCRGVLKDLAVEAKCS 302


>gi|217075719|gb|ACJ86219.1| unknown [Medicago truncatula]
          Length = 200

 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 103/203 (50%), Positives = 135/203 (66%), Gaps = 6/203 (2%)

Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
           MGKG+AWVNGQSIGRYW  Y++P++GCT  C+YRG+Y ASKC K+CG+P+QTLYH+PR W
Sbjct: 1   MGKGEAWVNGQSIGRYWPTYISPNSGCTDSCNYRGTYSASKCLKNCGKPSQTLYHVPRAW 60

Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL-GVVSSSPQV 739
           + P  N  V+ EE GGDP+KIS  TK  + +CS V+E+ PPPVD+W  N        P +
Sbjct: 61  LKPDSNTFVLFEESGGDPTKISFGTKQIESVCSHVTESHPPPVDTWNSNAESERKVGPVL 120

Query: 740 RLACE-RGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVSSA 797
            L C      I++I FAS+G P   CG++  G+C  +  L IVQKAC+G   C+I VS  
Sbjct: 121 SLECPYPNQAISSIKFASFGTPRRTCGNYNHGSCSSNRALSIVQKACIGSSSCNIGVSIN 180

Query: 798 YLGVSAGACPGLLKALAVEAHCS 820
             G     C G+ K+LAVEA C+
Sbjct: 181 TFG---NPCRGVTKSLAVEAACT 200


>gi|1669595|dbj|BAA13685.1| AR782 [Arabidopsis thaliana]
          Length = 206

 Score =  206 bits (525), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 105/208 (50%), Positives = 142/208 (68%), Gaps = 9/208 (4%)

Query: 619 ASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPR 678
           A  GKG AWVNGQSIGRYW   +A + GCT+ CDYRGSY A+KC K+CG+P+QTLYH+PR
Sbjct: 2   AGTGKGIAWVNGQSIGRYWPTSIAGNGGCTESCDYRGSYRANKCLKNCGKPSQTLYHVPR 61

Query: 679 TWVHPGENLLVIHEELGGDPSKISLLTK-TGQHICSFVSEADPPPVDSWKPNLGVVS--- 734
           +W+ P  N+LV+ EE+GGDP++IS  TK TG ++C  VS++ PPPVD+W  +  + +   
Sbjct: 62  SWLKPSGNILVLFEEMGGDPTQISFATKQTGSNLCLTVSQSHPPPVDTWTSDSKISNRNR 121

Query: 735 SSPQVRLACERGWH-IAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSI 792
           + P + L C      I +I FAS+G P+G CGSF  G C+    L +VQKAC+G   C++
Sbjct: 122 TRPVLSLKCPISTQVIFSIKFASFGTPKGTCGSFTQGHCNSSRSLSLVQKACIGLRSCNV 181

Query: 793 PVSSAYLGVSAGACPGLLKALAVEAHCS 820
            VS+   G     C G++K+LAVEA CS
Sbjct: 182 EVSTRVFGE---PCRGVVKSLAVEASCS 206


>gi|298205259|emb|CBI17318.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 126/354 (35%), Positives = 174/354 (49%), Gaps = 63/354 (17%)

Query: 35  VWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIG 94
           +W  L++ +KEGG++VIETYVF N HE     YYF G +DL++FVK VQ+AG++L L IG
Sbjct: 1   MWSGLVKTAKEGGIDVIETYVFQNGHELSPSNYYFGGWYDLLKFVKIVQQAGMYLILHIG 60

Query: 95  PYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPII 154
           P+   EWN+G             F+T + PFK  M++F+  I+++MK++ LFASQGGPII
Sbjct: 61  PFVATEWNFGTI-----------FQTNSKPFKYHMQKFMTLIVNIMKKDKLFASQGGPII 109

Query: 155 LAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPI-INTCNGFY 213
           L Q +NEYG+ +  Y  GG+ YV WAA+  ++ N  VPW+MCQ       I I    G Y
Sbjct: 110 LTQAKNEYGDTKRIYEDGGKPYVMWAANMVLSHNIGVPWIMCQYSYVDIYIYIVKKEGLY 169

Query: 214 CDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYM--- 270
              +        I+ T        +    +  +P   L   +      G      YM   
Sbjct: 170 SLSYQ----YALILSTLVTHSIVTNSHQILQAKPKCGLKIGLDGLKHLGHRILTDYMKIL 225

Query: 271 ----------------YFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELH 314
                           Y GGTNFG T+GGP + T+Y+Y+APIDEYG  R PK        
Sbjct: 226 LFLLLFFFFQKVNYYMYHGGTNFGCTSGGPFITTTYNYNAPIDEYGLARLPK-------- 277

Query: 315 KAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFN 368
                C        P+        E  +Y  S    AAF++N D   D  + F 
Sbjct: 278 -----C--------PSQ-------EVDVYADSLGGYAAFISNVDEKEDKMIVFQ 311


>gi|223942939|gb|ACN25553.1| unknown [Zea mays]
          Length = 199

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 106/202 (52%), Positives = 135/202 (66%), Gaps = 5/202 (2%)

Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
           MGKG+AWVNGQSIGRYW   LAP +GC   C+YRG+Y +SKC K CGQP+QTLYH+PR++
Sbjct: 1   MGKGEAWVNGQSIGRYWPTNLAPQSGCVNSCNYRGAYSSSKCLKKCGQPSQTLYHVPRSF 60

Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVR 740
           + PG N LV+ E  GGDPSKIS + +    +C+ VSEA P  +DSW     +    P +R
Sbjct: 61  LQPGSNDLVLFEHFGGDPSKISFVMRQTGSVCAQVSEAHPAQIDSWSSQQPMQRYGPALR 120

Query: 741 LACER-GWHIAAINFASYGIPEGNCGSFRPGAC-HMDVLPIVQKACVGQIECSIPVSSAY 798
           L C + G  I+++ FAS+G P G CGS+  G C     L IVQ+AC+G   CS+PVSS Y
Sbjct: 121 LECPKEGQVISSVKFASFGTPSGTCGSYSHGECSSTQALSIVQEACIGVSSCSVPVSSNY 180

Query: 799 LGVSAGACPGLLKALAVEAHCS 820
            G     C G+ K+LAVEA CS
Sbjct: 181 FG---NPCTGVTKSLAVEAACS 199


>gi|166092020|gb|ABY82047.1| beta-galactosidase [Hymenaea courbaril var. stilbocarpa]
          Length = 138

 Score =  199 bits (506), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 90/138 (65%), Positives = 107/138 (77%)

Query: 157 QVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG 216
           Q+ENEYG VEW     G+ Y  WAA  AV LNT VPWVMC+Q+DAPDP+I+TCNG+YC+ 
Sbjct: 1   QIENEYGPVEWEIRAPGKAYTAWAAKMAVGLNTGVPWVMCKQDDAPDPVIDTCNGYYCEN 60

Query: 217 FTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTN 276
           FTPN   KP MWTEN+SGW+  +G AVP RPVED+A++V RF + GG+F NYYMY GGTN
Sbjct: 61  FTPNKNYKPKMWTENWSGWYTEYGGAVPKRPVEDIAYSVTRFIQNGGSFVNYYMYHGGTN 120

Query: 277 FGRTAGGPLVATSYDYDA 294
           FGRT  G  +ATSYDYDA
Sbjct: 121 FGRTYSGLFIATSYDYDA 138


>gi|217070908|gb|ACJ83814.1| unknown [Medicago truncatula]
          Length = 200

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 102/205 (49%), Positives = 137/205 (66%), Gaps = 10/205 (4%)

Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
           MGKG+AWVNGQSIGRYW  Y+A + GCT  C+YRG Y +SKC+K+CG+P+QTLYH+PR++
Sbjct: 1   MGKGEAWVNGQSIGRYWPTYVASNAGCTDSCNYRGPYTSSKCRKNCGKPSQTLYHVPRSF 60

Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPNL---GVVSSSP 737
           + P  N LV+ EE GGDP++IS  TK  + +CS VS++ PP +D W  +    G V   P
Sbjct: 61  LKPNGNTLVLFEENGGDPTQISFATKQLESVCSHVSDSHPPQIDLWNQDTESGGKV--GP 118

Query: 738 QVRLAC-ERGWHIAAINFASYGIPEGNCGSFRPGACHMD-VLPIVQKACVGQIECSIPVS 795
            + L+C      I++I FASYG P G CG+F  G C  +  L IV+KAC+G   CS+ VS
Sbjct: 119 ALLLSCPNHNQVISSIKFASYGTPLGTCGNFYRGRCSSNKALSIVKKACIGSRSCSVGVS 178

Query: 796 SAYLGVSAGACPGLLKALAVEAHCS 820
           +   G     C G+ K+LAVEA C+
Sbjct: 179 TDTFG---DPCRGVPKSLAVEATCA 200


>gi|388518087|gb|AFK47105.1| unknown [Lotus japonicus]
          Length = 220

 Score =  188 bits (478), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 95/207 (45%), Positives = 132/207 (63%), Gaps = 10/207 (4%)

Query: 621 MGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTW 680
           MGKGQAWVNG  IGRYW+  ++P +GC + CDYRG+Y++ KC  +CG+P QTLYH+PR+W
Sbjct: 1   MGKGQAWVNGHHIGRYWT-RVSPKSGCEQVCDYRGAYNSDKCTTNCGKPTQTLYHVPRSW 59

Query: 681 VHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPV------DSWKPNLGVVS 734
           +   +NLLVI EE GG+P +IS+   + + +C+ VSE+   P+      D     +   S
Sbjct: 60  LKASDNLLVIFEETGGNPFRISVKLHSARIVCAKVSESHYQPLHKLMNADLIGHEVSANS 119

Query: 735 SSPQVRLACERGWHIAAINFASYGIPEGNCGSFRPGACHM-DVLPIVQKACVGQIECSIP 793
             P++ L C+ G  I++I FASYG PEG+C SF  G CH    + IV KAC G+  CSI 
Sbjct: 120 MIPELHLRCQDGRIISSITFASYGNPEGSCQSFSRGNCHAPSSMAIVSKACQGKRSCSIK 179

Query: 794 VSSAYLGVSAGACPGLLKALAVEAHCS 820
           +S    G     C G++K L+VEA C+
Sbjct: 180 ISDTIFG--GDPCQGVMKTLSVEARCT 204


>gi|256831356|ref|YP_003160083.1| beta-galactosidase [Jonesia denitrificans DSM 20603]
 gi|256684887|gb|ACV07780.1| Beta-galactosidase [Jonesia denitrificans DSM 20603]
          Length = 584

 Score =  185 bits (469), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 126/379 (33%), Positives = 183/379 (48%), Gaps = 33/379 (8%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           R   +DG+   + SG+IHY R  P+ W + IRK++  GL  IETYV WN+H P R +++ 
Sbjct: 9   RDFTLDGEPFQIISGAIHYFRVHPDSWRDRIRKARLMGLNTIETYVAWNFHAPSRDEFHT 68

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
           +G  DL RF+  +QE GL   +R GPY CAEW+ GG P WL   P I  R+++  +  E+
Sbjct: 69  DGARDLGRFLDIIQEEGLRAIVRPGPYICAEWDNGGLPTWLTATPDIVVRSSDPTYLTEV 128

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           +R+L  +  +++   +  + GGPIIL QVENEYG    AYG     Y+    +   NL  
Sbjct: 129 ERYLEHLAPIVEPRQI--NHGGPIILMQVENEYG----AYG-NDRAYLTHLTNVYRNLGF 181

Query: 190 SVPWVMCQQ------EDAPDPIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFL 237
            VP     Q           P ++T   F             +  + P+M +E + GWF 
Sbjct: 182 VVPLTTVDQPMDDMLAHGTLPDLHTTGSFGSRIDERLATLREHQTTGPLMCSEFWIGWFD 241

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSY 290
            +G       V D A A+ R    G +  N YM+ GGTNFG T G        PLV TSY
Sbjct: 242 HWGAHHHTTDVADAANALDRLLGAGASV-NIYMFHGGTNFGFTNGANDKGVYQPLV-TSY 299

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDC 350
           DYDAP+ E G+  +  W     + +   +  E      P  + L A+    + H+     
Sbjct: 300 DYDAPLAEDGYPTEKYWAFREVIARYAPVPAEV-----PAERPLVAERSVPLTHRVGWLD 354

Query: 351 AAFLANYDSSSDANVTFNG 369
                +   + D+  TF+G
Sbjct: 355 VPLDVDEAVTCDSPATFDG 373


>gi|449532986|ref|XP_004173458.1| PREDICTED: beta-galactosidase-like, partial [Cucumis sativus]
          Length = 213

 Score =  182 bits (463), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 97/213 (45%), Positives = 132/213 (61%), Gaps = 5/213 (2%)

Query: 496 YGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFS-VILIDLKNGK 554
           YG+ +      +K + L +G+N L +LS+ VGL N G  FD   AG+   V L  L  G 
Sbjct: 3   YGSLEDPRITFSKYVNLKQGVNKLSMLSVTVGLPNVGLHFDTWNAGVLGPVTLKGLNEGT 62

Query: 555 RDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPL 614
           RD+S  +W Y+VG++GE + L  +  +NS  W +GS     + L WYKTTF  P G  PL
Sbjct: 63  RDMSKYKWSYKVGLKGEILNLYSVKGSNSVQWMKGSF--QKQPLTWYKTTFNTPAGNEPL 120

Query: 615 ALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLY 674
           AL+++SM KGQ WVNG+SIGRY+  Y+A  +G   KC Y G +   KC  +CG P+Q  Y
Sbjct: 121 ALDMSSMSKGQIWVNGRSIGRYFPGYIA--SGKCNKCSYTGFFTEKKCLWNCGGPSQKWY 178

Query: 675 HIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
           HIPR W+ P  NLL+I EE+GG+P  ISL+ +T
Sbjct: 179 HIPRDWLSPNGNLLIILEEIGGNPQGISLVKRT 211


>gi|284030079|ref|YP_003380010.1| beta-galactosidase [Kribbella flavida DSM 17836]
 gi|283809372|gb|ADB31211.1| Beta-galactosidase [Kribbella flavida DSM 17836]
          Length = 582

 Score =  182 bits (462), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 114/308 (37%), Positives = 159/308 (51%), Gaps = 28/308 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   + SG++HY R  P++W + I K++  GL  IETYV WN H P RG +  +G
Sbjct: 11  FLLDGEPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVPWNAHSPRRGVFDTDG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF++ V  AGL+  +R GPY CAEW+ GG P WL   PG+  R     F   +++
Sbjct: 71  MLDLGRFLEQVAAAGLYAIVRPGPYICAEWDNGGLPAWLFQEPGVGVRRYEPRFLAAVEQ 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L +++DL++   L   QGGP++L QVENEYG    A+G   E Y++  A        +V
Sbjct: 131 YLEQVLDLVRP--LQVDQGGPVLLLQVENEYG----AFGNDPE-YLEAVAGMIRKAGITV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
           P V   Q           +G    G               + P+ P+M  E + GWF  +
Sbjct: 184 PLVTVDQPTGEMLAAGGLDGVLRTGSFGSRSAERLATLREHQPTGPLMCMEFWDGWFDHW 243

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDY 292
           G       VED A  +      G +  N YM+ GGTNFG T+G        P V TSYDY
Sbjct: 244 GGPHHTTSVEDAARELDALLAAGASV-NIYMFHGGTNFGLTSGADDKGVFRPTV-TSYDY 301

Query: 293 DAPIDEYG 300
           DAP+DE G
Sbjct: 302 DAPLDEAG 309


>gi|2289790|dbj|BAA21669.1| beta-galactosidase [Bacillus circulans]
          Length = 586

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 179/362 (49%), Gaps = 36/362 (9%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + +TYD  + ++DGK   L SG++HY R+ PE W + + K K  G   +ETYV WN HEP
Sbjct: 2   SQLTYDD-SFLLDGKEIRLLSGAMHYFRTVPEYWEDRLLKLKACGFNTVETYVAWNLHEP 60

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQ+ FEG  D+VRF+KT ++ GL + +R GP+ CAEW +GGFP WL  +P I+ R  N
Sbjct: 61  EEGQFVFEGIADIVRFIKTAEKVGLHVIVRPGPFICAEWEFGGFPYWLLTVPNIKLRCFN 120

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            P+ E++  +   + + ++   L +S GGPII  Q+ENEYG    ++G   + Y+++  D
Sbjct: 121 QPYLEKVDAYFDVLFERLRP--LLSSNGGPIIALQIENEYG----SFG-NDQKYLQYLRD 173

Query: 183 TAVNLNTSVPWVMCQQEDAPDP----------IINTCN-GFYCDG----FTPNSPSKPIM 227
               +   V   +    D P+P          I  T N G   +          P+ P+M
Sbjct: 174 ---GIKKRVGNELLFTSDGPEPSMLSGGMIEGIFETVNFGSRAESAFAQLKQYQPNAPLM 230

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
             E + GWF  +G     R  E +   +    +  G+  N+YM  GGTNFG   G     
Sbjct: 231 CMEFWHGWFDHWGEEHHTRSAESVVETLEEILKQNGSV-NFYMAHGGTNFGFYNGANHNE 289

Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLE 339
               P + TSYDYD  + E G + +  +   +   K + L E  L +  P       K  
Sbjct: 290 TDYQPTI-TSYDYDGLLTESGDVTEKFYAVRKVFEKYVDLPELNLPAPIPKRLFGKVKFT 348

Query: 340 AH 341
            H
Sbjct: 349 EH 350


>gi|320536152|ref|ZP_08036203.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
 gi|320147005|gb|EFW38570.1| glycosyl hydrolase family 35 [Treponema phagedenis F0421]
          Length = 857

 Score =  181 bits (459), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 110/335 (32%), Positives = 172/335 (51%), Gaps = 23/335 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + +D  + +IDGKR+ + S ++HY R     W  +IRK++ GG   IETY+ WNYHE   
Sbjct: 2   IQFDSNSWIIDGKRKFIISAAVHYFRLPRAEWAAVIRKARLGGCNAIETYIAWNYHETAE 61

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            Q+ F G  DL  F     + G+++ +R GPY CAEW++GG P +L+   GI++R +N  
Sbjct: 62  EQWDFSGDKDLAAFFAICHDEGMYVIVRPGPYICAEWDFGGLPYYLNNTDGIEYRCSNAA 121

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +++ ++R+  +I+ ++++  L    GG II+ Q+ENEY     A+G     ++++  +  
Sbjct: 122 YEQAVRRYFERIMPIIRRYQL--GSGGSIIMVQIENEYH----AFGKKDLAHIRFLEELT 175

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLS 238
                +VP V C    A    +   N F+                +P+   E + GW   
Sbjct: 176 RGFGITVPLVSCY--GAGRNTVEMRN-FWSGAERAAAVLRERQSGQPLGIMEFWIGWVEH 232

Query: 239 F-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGP--LVATSYD 291
           + G     +P E +        ++G  F NYYMYFGG+NF    GRT G     +  SYD
Sbjct: 233 WGGEPQKHKPAEAVLSHCFEALKSGFVFFNYYMYFGGSNFGSWGGRTIGAHKIFMTQSYD 292

Query: 292 YDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLIS 326
           YDAP+DE+GF    K+  L  LH  I   E  L +
Sbjct: 293 YDAPLDEFGF-ETEKYRLLAVLHTFIAWLENDLTA 326


>gi|219117911|ref|XP_002179741.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408794|gb|EEC48727.1| beta-galactosidase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 951

 Score =  180 bits (456), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 222/835 (26%), Positives = 345/835 (41%), Gaps = 157/835 (18%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V+YD RA+ I+ KR +L SGS+H  R+T   W   + ++   GL +I  Y+FW  H+  
Sbjct: 149 SVSYDERAIRINDKRVLLLSGSMHPVRATRGTWEHALDEAVYNGLNMITVYIFWGAHQSF 208

Query: 64  RGQYY-----------FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF 112
           R +              E +++L   +++    GLF+H+RIGPYAC E+ YGG P WL  
Sbjct: 209 RDEPLNWSLDGSSIGPKESQWELADALRSAANRGLFIHVRIGPYACGEYTYGGIPEWLPL 268

Query: 113 IPG-IQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENE---------- 161
               ++ R  N P+ + M+ F+A  I  +   NL+A QGGPI++AQ+ENE          
Sbjct: 269 QSSTMRMRRLNRPWLDAMEGFVAATITYLSSFNLWAHQGGPILIAQIENELGSGVDGSAA 328

Query: 162 --------------------------YGNV---EWAYGVGGEL-------YVKWAADTAV 185
                                     YG++     + G+  EL       Y  W  +   
Sbjct: 329 ANYVVLERDEFNDDKHEDSHLLQLDRYGHILENASSRGMDSELRNATVQDYADWCGNLVA 388

Query: 186 NLNTSVPWVMCQQEDAPDPI--INTCNGF-----YCDGFTPNSPSKPIMWTENYSGWFLS 238
            L  +V W MC    A + I   N  NG      Y D        +P +WTE+  G F  
Sbjct: 389 RLAPNVIWTMCNGLSAENTISTFNGNNGIDWLEKYGDSGRIQV-DQPAIWTEDEGG-FQL 446

Query: 239 FGYAVPFRPVE--------DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVATSY 290
           +G   P +P +         +A    ++F  GGT  NYYM++GG N GR++   ++  +Y
Sbjct: 447 WG-DQPSKPSDYFWGRTSRAMATDALQWFARGGTHLNYYMWWGGYNRGRSSAAGIM-NAY 504

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAH------IYH 344
             DA +   G  R PK+ H   LH  I      L+ + PT     A +E        +  
Sbjct: 505 ATDAFLCSSGQRRHPKYDHFLALHLVIADIAAILLHA-PTSLLKNASVEIMDGDDWIVGD 563

Query: 345 KSSNDCAAFLANYDS------SSDANVTFNGN----------VYFLPAWSVSILPDCKNV 388
                    L  +DS       +DAN T              V+ +  +S  I+ D   V
Sbjct: 564 NQRQFLYQVLDTHDSKQVIFLENDANTTEMARLTGAKADDSLVFVMKPYSSQIVIDGI-V 622

Query: 389 VFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNR-SFVRPDLAEQIN 447
            F+++ + ++  +       +  V  LL  +   SW E   G   ++ + V  +  EQ N
Sbjct: 623 AFDSSTISTKAMSFRRTLHYEPAV--LLHLT---SWSEPIAGADTDQNAHVSTEPLEQTN 677

Query: 448 TTKD---TSDYLWY--TASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFA 502
                  +SDY WY     I V+  Q K +++  E    A  VF++   +     NH  A
Sbjct: 678 LNSKASISSDYAWYGTDVKIDVVLSQVK-LYIGTEK-ATALAVFIDGAFIGEA-NNHQHA 734

Query: 503 N--FLINKKIE-LNEGINTLDILSMMVGLQNY-GAWFDVAGA---GLFSVILID--LKNG 553
               +++ +IE L  G + L IL   +G  N  G W  +  A   G+   +LI   L + 
Sbjct: 735 EGPTVLSIEIESLAAGTHRLAILCESLGYHNLIGRWGAITTAKPKGITGNVLIGSPLLSE 794

Query: 554 KRDLSSGE--WIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGK 611
              L  G   W    G+  E     +  L   SF +  +        +W    F +P+  
Sbjct: 795 NISLVDGRQMWWSLPGLSVERKAA-RHGLRRESF-EDAAQAEAGLHPLWSSVLFTSPQFD 852

Query: 612 G---PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQ 668
                L L+L S G+G  W+NG+ +GRYW+      T      DY               
Sbjct: 853 STVHSLFLDLTS-GRGHLWLNGKDLGRYWNI-----TRGNSWNDY--------------- 891

Query: 669 PAQTLYHIPRTWVH-PGE-NLLVIHEELGGDPSKISLLTKTGQ--HICSFVSEAD 719
            +Q  Y +P  ++H  G+ N L++ + LGGD S   LL  + +      F  E D
Sbjct: 892 -SQRYYFLPADFLHLDGQLNELILFDMLGGDHSAARLLLSSIEESETSKFSDEVD 945


>gi|126347898|emb|CAJ89618.1| putative beta-galactosidase [Streptomyces ambofaciens ATCC 23877]
          Length = 615

 Score =  178 bits (452), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 115/339 (33%), Positives = 169/339 (49%), Gaps = 36/339 (10%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A +T+ H A +  G+   + SGS+HY R  PE W + + +    GL  ++TYV WN+HE
Sbjct: 22  TATLTHTHGAFLRRGRPHRVLSGSLHYFRVHPEQWADRLDRLAALGLNTVDTYVPWNFHE 81

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              G+  F+G  DL RFV+  Q AGL + +R GPY CAEW+ GG P WL   PG++ R  
Sbjct: 82  RRPGEARFDGWRDLARFVRLAQRAGLDVMVRPGPYICAEWDNGGLPAWLTGTPGMRLRAG 141

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           + P+ + + R+   ++  + +  L A  GGP++  Q+ENEYG+    YG     YV+W  
Sbjct: 142 HQPYLDAVARWFDALVPRVAE--LQAVHGGPVVAVQIENEYGS----YG-DDHAYVRWVR 194

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPII---NTCNGFYCDG------------FTPNSPSKPI 226
           D  V+   +    +    D P P++    T  G                      P +P 
Sbjct: 195 DALVDRGIT---ELLYTADGPTPLMLDGGTVPGELAAATFGSRAAEAAALLRSRRPGEPF 251

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           +  E ++GWF  +G     R  +  A  V    + GG+  + YM  GGTNFG  AG    
Sbjct: 252 LCAEFWNGWFDHWGEKHHVRSRDGAAQEVEEILDAGGSV-SLYMAHGGTNFGLWAGANHD 310

Query: 284 -----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
                P V TSYD DAP+ E+G +  PK+  LRE   A+
Sbjct: 311 GGVLRPTV-TSYDSDAPVSEHGAL-TPKFHALRERFAAL 347


>gi|326331074|ref|ZP_08197372.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
 gi|325951115|gb|EGD43157.1| beta-galactosidase [Nocardioidaceae bacterium Broad-1]
          Length = 586

 Score =  177 bits (450), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/309 (36%), Positives = 165/309 (53%), Gaps = 30/309 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   + SG++HY R  P+ W + I K++  GL  IETYV WN H P  G +  +G
Sbjct: 11  FLLDGEPFRILSGALHYFRVHPDQWADRIEKARLMGLNTIETYVPWNAHSPRPGVFDTDG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF++ V++AG++  +R GP+ CAEW+ GG P WL   PG+  R     F +E+++
Sbjct: 71  ILDLPRFLRLVKDAGMYAIVRPGPFICAEWDNGGLPPWLFREPGVGIRRHEPRFLDEVEK 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L +++ L++   +    GGP++L QVENEYG    AYG   + Y++  AD        V
Sbjct: 131 YLHQVLALVRPHQV--DLGGPVLLVQVENEYG----AYGDDRD-YLQAVADMIRGAGIDV 183

Query: 192 PWVMCQQE-DAP------DPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLS 238
           P V   Q  DA       D ++ T + F  D          + P+ P+M  E + GWF  
Sbjct: 184 PLVTVDQPVDAMLAAGGLDGVLRTSS-FGSDSANRLRTLRDHQPTGPLMCMEFWDGWFDH 242

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYD 291
           +G      PVE  A  +      G +  N YM+ GGTNFG T+G        P V TSYD
Sbjct: 243 WGGRHHTTPVEQAAEELDALLAAGASV-NVYMFHGGTNFGLTSGANDKGIYRPTV-TSYD 300

Query: 292 YDAPIDEYG 300
           YDAP+DE G
Sbjct: 301 YDAPLDEAG 309


>gi|399022099|ref|ZP_10724178.1| beta-galactosidase [Chryseobacterium sp. CF314]
 gi|398085466|gb|EJL76124.1| beta-galactosidase [Chryseobacterium sp. CF314]
          Length = 618

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 122/335 (36%), Positives = 174/335 (51%), Gaps = 39/335 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++ GK   + SG +HYPR   E W   ++  K  GL  + TYVFWNYHE   G++ F G
Sbjct: 34  FLLSGKPFTIYSGEMHYPRVPSEYWKHRLQMMKSMGLNTVTTYVFWNYHEEEPGKWNFSG 93

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL +F+KT QEAGL++ +R GPY CAEW +GG+P WL     ++ RT N  F ++ + 
Sbjct: 94  EKDLKKFIKTAQEAGLYVIIRPGPYVCAEWEFGGYPWWLQKDKNLEIRTDNKAFLKQCEN 153

Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWA---ADTAVN 186
           +   I +L KQ   L  + GGP+I+ Q ENE+G+ V     +  E + K++    D  V 
Sbjct: 154 Y---INELAKQIIPLQINNGGPVIMVQAENEFGSYVAQRKDISLEQHKKYSHKIKDFLVK 210

Query: 187 LNTSVPWVMCQ-----QEDAPDPIINTCNGFYCDGFTPNSPSK---------PIMWTENY 232
              +VP+         +E + +  + T NG   +G   N   K         P M  E Y
Sbjct: 211 SGITVPFFTSDGSWLFKEGSIEGALPTANG---EGDVDNLRKKINEFNNGKGPYMVAEYY 267

Query: 233 SGWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            GW     +A PF  V  ED+      + + G +F NYYM  GGTNFG T+G        
Sbjct: 268 PGWLDH--WAEPFVKVSTEDVVKQTELYIKNGISF-NYYMIHGGTNFGFTSGANYDKNHD 324

Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
                TSYDYDAPI+E G++  PK+  LR++ + I
Sbjct: 325 IQPDLTSYDYDAPINEAGWV-TPKFNALRDIFQKI 358



 Score = 44.3 bits (103), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 89/390 (22%), Positives = 145/390 (37%), Gaps = 66/390 (16%)

Query: 317 IKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPA 376
           +K+  E ++     + K G     ++ H  +N      ANYD + D         Y  P 
Sbjct: 279 VKVSTEDVVKQTELYIKNGISFNYYMIHGGTNFGFTSGANYDKNHDIQPDLTSYDYDAPI 338

Query: 377 WSVS-ILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNR 435
                + P      FN  + I Q+ N        K +  + +    F+       +   +
Sbjct: 339 NEAGWVTPK-----FNALRDIFQKINRQRLPEVPKPMKVITIPEIKFTKINSLFDVIQQQ 393

Query: 436 SFVRPDLAEQINTTKDTS---DYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLV 492
              +P +  Q  T +D +    Y+ Y    +    +GK   L I+ L   A V+VN++  
Sbjct: 394 ---KPIIHNQPLTFEDLNIGNGYIMYRRKFN-KDQKGK---LEIKGLRDYANVYVNERW- 445

Query: 493 AFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKN 552
               G  +  N   +  IE+  G + L+IL   +G  NYGA       G+ S ++I   N
Sbjct: 446 ---QGELNRVNKKYDLDIEIKAG-DRLEILVENMGRINYGAEIVHNLKGIISPVII---N 498

Query: 553 GKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKG 612
           G     SG W        E   L         + ++      N S +  +  F   E  G
Sbjct: 499 GSE--ISGNW--------EMFPLPFDQFPKHKYQQKDIA---NNSPVISEAEFKLDE-TG 544

Query: 613 PLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQT 672
              L++   GKG  ++NG++IGRYWS                              P QT
Sbjct: 545 DTFLDMRKFGKGIVFINGRNIGRYWSK---------------------------AGPQQT 577

Query: 673 LYHIPRTWVHPGENLLVIHEELGGDPSKIS 702
           LY +P  W+  G+N + I E++    S I+
Sbjct: 578 LY-VPGVWLKKGKNGIQIFEQIFEGSSSIN 606


>gi|62321383|dbj|BAD94714.1| beta-galactosidase [Arabidopsis thaliana]
          Length = 199

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 88/202 (43%), Positives = 134/202 (66%), Gaps = 4/202 (1%)

Query: 507 NKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAG-LFSVILIDLKNGKRDLSSGEWIYQ 565
           ++KI+L+ G+N + +LS+ VGL N G  F+    G L  V L  + +G  D+S  +W Y+
Sbjct: 1   SQKIKLHAGVNKIALLSVAVGLPNVGTHFEQWNKGALGPVTLKGVNSGTWDMSKWKWSYK 60

Query: 566 VGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQ 625
           +GV+GE + L   + ++   W QGS +   + L WYK+TF  P G  PLAL++ +MGKGQ
Sbjct: 61  IGVKGEALSLHTNTESSGVRWTQGSFVAKKQPLTWYKSTFATPAGNEPLALDMNTMGKGQ 120

Query: 626 AWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGE 685
            W+NG++IGR+W AY A   G   +C+Y G++DA KC  +CG+ +Q  YH+PR+W+   +
Sbjct: 121 VWINGRNIGRHWPAYKA--QGSCGRCNYAGTFDAKKCLSNCGEASQRWYHVPRSWLKS-Q 177

Query: 686 NLLVIHEELGGDPSKISLLTKT 707
           NL+V+ EELGGDP+ ISL+ +T
Sbjct: 178 NLIVVFEELGGDPNGISLVKRT 199


>gi|340370414|ref|XP_003383741.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Amphimedon
           queenslandica]
          Length = 689

 Score =  177 bits (448), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 115/337 (34%), Positives = 174/337 (51%), Gaps = 35/337 (10%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           ++ D  +  I GK+  + SGSIHY R  P+ W + ++K K  GL  ++TYV WN HEP+ 
Sbjct: 71  LSLDEDSFYIRGKKTHILSGSIHYFRVVPDYWTDRLKKLKAMGLNTVDTYVSWNLHEPMP 130

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ F G  ++  F+K      L + +R GPY C+EW+ GG P WL   P ++ R+   P
Sbjct: 131 GEFDFSGLLNIHEFIKIAHSLELNVIVRPGPYICSEWDNGGLPAWLLHDPNMKIRSNYKP 190

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG---VGGELYVKWAA 181
           +++ +KRF  K+ +++    L +S GGPII  QVENEY     AYG     G  ++++ A
Sbjct: 191 YQDAVKRFFTKLFEILTP--LQSSYGGPIIAFQVENEYA----AYGPRNATGRHHMQYLA 244

Query: 182 DTAVNLNTSVPWVMCQQED--------APDPIINTCNGFYCD------GFTPNSPSKPIM 227
           +   +L     ++    ++        AP+  + T N F  D            P+KP +
Sbjct: 245 NLMRSLGAVELFITSDGQNDIKASSDMAPNNALLTVN-FQNDPSEALNKLLLVQPNKPPL 303

Query: 228 WTENYSGWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
             E ++GWF  +G     R +    L   +    + GG+F N YM+ GGTNFG   G  +
Sbjct: 304 VMEYWTGWFDHWGRRHLERTLSPSQLIVNIGTILQMGGSF-NLYMFHGGTNFGFMNGANI 362

Query: 286 V-------ATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
                    TSYDYDAP+ E G I + K+  LREL K
Sbjct: 363 EGGEYRPDVTSYDYDAPLSEAGDITK-KYTLLRELLK 398


>gi|443684013|gb|ELT88070.1| hypothetical protein CAPTEDRAFT_181391 [Capitella teleta]
          Length = 655

 Score =  176 bits (447), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 118/339 (34%), Positives = 172/339 (50%), Gaps = 43/339 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A  ++GK+ +L SG++HY R  PE W + + K K  GL  +ETYV WN HE +RG + F 
Sbjct: 10  AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  DL RF++  Q+ GL++ LR GPY C+EW++GG P WL   P ++ RT+  P+ E + 
Sbjct: 70  GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------------NVEWAYGVGGELYV 177
            +LAKI+ L+   +L  S+GGPII  Q+ENEYG             N    YG+   L+ 
Sbjct: 130 AYLAKILPLVN--DLQMSKGGPIIAVQLENEYGSYGDDLDYKLFLKNQFIKYGIEELLF- 186

Query: 178 KWAADTAVNL-NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPN--SPSKPIMWTENYSG 234
              +D    + N  +P V+     A         G+    +  N   P  P+M  E +SG
Sbjct: 187 --TSDNGTGIQNGPIPGVL-----ATTNFQEQEQGYLMFEYLRNIKQPGLPMMVMEFWSG 239

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----------- 283
           WF  +G         +    V ++    G+  N+YM+ GGTNFG  AG            
Sbjct: 240 WFDHWGEQHNLCHHAEF-IDVFKWILLEGSSVNFYMFHGGTNFGFMAGANEDFGATNEGG 298

Query: 284 --PLVA--TSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
             P  A  TSYDYD P+ E G + + K+  +R +   +K
Sbjct: 299 GEPYAADTTSYDYDCPVSESGQLNE-KFYEIRNILSEMK 336


>gi|443689405|gb|ELT91801.1| hypothetical protein CAPTEDRAFT_23316, partial [Capitella teleta]
          Length = 596

 Score =  176 bits (445), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 119/349 (34%), Positives = 178/349 (51%), Gaps = 35/349 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           +  +DG+R  + SGS HY R+ P +W + + + K  GL  + TYV WN+HEP +GQ+   
Sbjct: 8   SFYLDGRRFKIFSGSFHYFRTHPLLWGDRLLRMKAAGLNTVMTYVPWNFHEPRKGQFTLG 67

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN-NPFKEEM 129
           G +DLV F++ VQ+ GL+L +R GPY CAEW +GGFP WL   P +  RT++  P+  E+
Sbjct: 68  GLYDLVSFMEQVQKVGLYLIVRPGPYICAEWEFGGFPSWLLRDPKMNLRTSSYTPYLNEV 127

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD--TAVNL 187
           K++L+++  ++ +       GGPII  QVENE+G    + GV    Y+++     ++ NL
Sbjct: 128 KQYLSQLFAVLTK--FTYKHGGPIIAFQVENEFG----SKGVHDPEYLQFLVTQYSSWNL 181

Query: 188 N----TSVPWVMCQQEDAPDPI----INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           N    TS           PD +    +N       +      P +P+M TE ++GWF  +
Sbjct: 182 NELLFTSDGKKYLSNGTLPDVLATINLNDHAKEDLEELKEFQPERPLMVTEFWAGWFDHW 241

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------------GP 284
           G         +L   +        +  N+YM+ GGTNFG   G               GP
Sbjct: 242 GEEHHHYGTTELERELEAILSLNASV-NFYMFIGGTNFGFWNGANYLSYNKDKEASLLGP 300

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQK 333
            V TSYDYDA + E+G ++ PK+  +R L K   L    L    PT  K
Sbjct: 301 TV-TSYDYDAAVSEWGHVK-PKYNVIRNLLKKYSLTPLDLPDVPPTPMK 347


>gi|224152391|ref|XP_002337230.1| predicted protein [Populus trichocarpa]
 gi|222838524|gb|EEE76889.1| predicted protein [Populus trichocarpa]
          Length = 144

 Score =  175 bits (444), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 75/127 (59%), Positives = 102/127 (80%), Gaps = 1/127 (0%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            + NV+YD R+L+I+G+R++L S +IHYPRS P +WPEL++ +KEGG++VIETYVFWN H
Sbjct: 17  FAGNVSYDSRSLIINGERKLLISAAIHYPRSVPAMWPELVKTAKEGGVDVIETYVFWNVH 76

Query: 61  EPIR-GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFR 119
           +P    +Y+F+GRFDLV+F+  VQEAG++L LRIGP+  AEWN+GG PVWLH++ G  FR
Sbjct: 77  QPTSPSEYHFDGRFDLVKFINIVQEAGMYLILRIGPFVAAEWNFGGIPVWLHYVNGTVFR 136

Query: 120 TTNNPFK 126
           T N  FK
Sbjct: 137 TDNYNFK 143


>gi|365876141|ref|ZP_09415664.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
 gi|442588464|ref|ZP_21007275.1| putative exported beta-galactosidase [Elizabethkingia anophelis
           R26]
 gi|365756153|gb|EHM98069.1| beta-galactosidase [Elizabethkingia anophelis Ag1]
 gi|442561698|gb|ELR78922.1| putative exported beta-galactosidase [Elizabethkingia anophelis
           R26]
          Length = 628

 Score =  174 bits (441), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 116/343 (33%), Positives = 167/343 (48%), Gaps = 44/343 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++GK   + SG +HYPR   E W   ++  K  GL  + TYVFWNYHE   G++ + G
Sbjct: 36  FLLNGKLFSIHSGEMHYPRIPQEYWKHRLQMMKAMGLNAVTTYVFWNYHEENPGKWNWSG 95

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL +F+KT QE GL++ +R GPY CAEW +GG+P WL  I G++ R  NN F  E ++
Sbjct: 96  EKDLKKFIKTAQEVGLYVIIRPGPYVCAEWEFGGYPWWLQNIKGLKIREDNNLFLAETQK 155

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNT- 189
           ++ ++ + +K  +L  + GGP+I+ Q ENE+G+ V     +    +  + A     L   
Sbjct: 156 YITQLYNQVK--DLQITNGGPVIMVQAENEFGSFVAQRKDIPLASHRTYNAKIVKQLKDA 213

Query: 190 --SVP-------WVM-----------CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
             SVP       W+               ED  + +    N +       N+   P M  
Sbjct: 214 GFSVPMFTSDGSWLFEGGSVVGALPTANGEDNIENLKKIVNQY-------NNNQGPYMVA 266

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-- 287
           E Y GW   +    P      +A    ++ +   +F NYYM  GGTNFG T G       
Sbjct: 267 EFYPGWLAHWAEKFPRVDAGTVARQTDKYLKNDVSF-NYYMVHGGTNFGFTNGANYDKNH 325

Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLREL---HKAIKLCE 321
                 TSYDYDAPI E G+ R PK+  LR +   H   KL E
Sbjct: 326 DIQPDLTSYDYDAPITEAGW-RTPKYDSLRAVISKHTKAKLPE 367


>gi|323358527|ref|YP_004224923.1| beta-galactosidase [Microbacterium testaceum StLB037]
 gi|323274898|dbj|BAJ75043.1| beta-galactosidase [Microbacterium testaceum StLB037]
          Length = 574

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 113/312 (36%), Positives = 155/312 (49%), Gaps = 36/312 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   + SG++HY R  PE W + IR +K  GL  IETYV WN HEP+RG++   G
Sbjct: 11  FLLDGRPHQVISGTLHYFRIHPEHWADRIRTAKAMGLNTIETYVAWNAHEPVRGEWDATG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+  +   GL   +R GPY CAEW+ GG PVWL   PGI  R +   F E +  
Sbjct: 71  WNDLGRFLDLIAAEGLHAIVRPGPYICAEWHNGGLPVWLTSTPGIGIRRSEPQFVEAVSE 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L ++ +++    +   +GG ++L Q+ENEYG    AYG   E Y++       +   +V
Sbjct: 131 YLRRVYEIVAPRQI--DRGGNVVLVQIENEYG----AYGSDKE-YLRELVRVTKDAGITV 183

Query: 192 PWVMCQQ------EDAPDPIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFLSF 239
           P     Q      E    P ++    F             + P+ P+M +E + GWF  +
Sbjct: 184 PLTTVDQPMPWMLEAGSLPELHLTGSFGSRSAERLATLREHQPTGPLMCSEFWDGWFDWW 243

Query: 240 G----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVAT 288
           G       P     DL   +A      G   N YM  GGTNFG T G        P+V T
Sbjct: 244 GSIHHTTDPAASAHDLDVLLA-----AGASVNIYMVHGGTNFGTTNGANDKGRFDPIV-T 297

Query: 289 SYDYDAPIDEYG 300
           SYDYDAPIDE G
Sbjct: 298 SYDYDAPIDESG 309


>gi|125526285|gb|EAY74399.1| hypothetical protein OsI_02287 [Oryza sativa Indica Group]
          Length = 255

 Score =  174 bits (440), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 88/199 (44%), Positives = 109/199 (54%), Gaps = 48/199 (24%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V+YD R+LVIDG+RR++ SGSIHYPRSTPE                             
Sbjct: 29  SVSYDDRSLVIDGQRRIILSGSIHYPRSTPE----------------------------- 59

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
                             +Q AG++  LRIGPY C EWNYGG P WL  IPG+QFR  N 
Sbjct: 60  -----------------EIQNAGMYAILRIGPYICGEWNYGGLPAWLRDIPGMQFRLHNE 102

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAA 181
           PF+ EM+ F   I++ MK   +FA QGGPIILAQ+ENEYGN+  +         Y+ W A
Sbjct: 103 PFENEMETFTTLIVNKMKDSKMFAEQGGPIILAQIENEYGNIMGKLNNNQSASEYIHWCA 162

Query: 182 DTAVNLNTSVPWVMCQQED 200
           D A   N  VPW+MCQQ+D
Sbjct: 163 DMANKQNVGVPWIMCQQDD 181


>gi|255550369|ref|XP_002516235.1| beta-galactosidase, putative [Ricinus communis]
 gi|223544721|gb|EEF46237.1| beta-galactosidase, putative [Ricinus communis]
          Length = 451

 Score =  173 bits (439), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 137/474 (28%), Positives = 204/474 (43%), Gaps = 76/474 (16%)

Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
           MY GGTNF R +GGP++ TSYDYDAP+DEYG + QPKWGHLR+LH  I L   +L  S  
Sbjct: 38  MYHGGTNFRRMSGGPMIVTSYDYDAPLDEYGNLNQPKWGHLRDLHVRILL---HLSQSRG 94

Query: 330 THQKLGAKLEAHIY-HKSSNDCAAFLANYDSSSDANVTFNGN-VYFLPAWSVSILPDCKN 387
                   L    Y + ++ +   FL+N  ++ DAN+    + ++F+PAW          
Sbjct: 95  LGFATVYALNLTTYINNATGERFCFLSNTKTNEDANIDLQQDGIFFVPAW---------- 144

Query: 388 VVFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQIN 447
           + + +++V  Q+ N     A     + L   +  F ++   V           D+  +  
Sbjct: 145 IYYYSSRV--QQGNFQQCKATSDETDYLRYITRYFDFFTVSV----------KDVHSRCQ 192

Query: 448 TTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLIN 507
              +T ++          P    +    ++ + H+     +        G  ++  F   
Sbjct: 193 QCNNTEEHDLACDFFGTSPACSCQSAARLQQVFHSIYNLTS--------GKQNYGEFF-- 242

Query: 508 KKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVG 567
              E  EGI     LS          W    G G  +  L D  +G RD+          
Sbjct: 243 --DEGPEGIAGAADLSS-------NQWAYKIGLGGEAKRLYDPNSGHRDV---------- 283

Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
                             ++  + LPV +++ WYKTTF  P G  PL LNL  MGKG AW
Sbjct: 284 ------------------FRTSAILPVGRAMTWYKTTFHVPSGTDPLVLNLQGMGKGHAW 325

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG S+GR+W    A  TG +  CDYRG YD  KC  +CG P Q   HI  T++  G  +
Sbjct: 326 VNGHSLGRFWPMQSADPTGYSGSCDYRGKYDKDKCLTNCGNPTQRWKHIA-TFMPNGRII 384

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEA-DPPPVDSWKPNLGVVSSSPQVR 740
            VI     G+P       + G    ++ + A +   V     +LGV  S+  V+
Sbjct: 385 SVIQFASFGNPEGTCGSLQKGDFEAAYTAFAVEKACVGKESCSLGVSESTLGVK 438



 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 97/282 (34%), Positives = 128/282 (45%), Gaps = 73/282 (25%)

Query: 521 ILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISL 580
           I ++  G QNYG +FD    G+          G  DLSS +W Y++G+ GE   L   + 
Sbjct: 228 IYNLTSGKQNYGEFFDEGPEGI---------AGAADLSSNQWAYKIGLGGEAKRLYDPNS 278

Query: 581 ANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
            +   ++  + LPV +++ WYKTTF  P G  PL LNL  MGKG AWVNG S+GR+W   
Sbjct: 279 GHRDVFRTSAILPVGRAMTWYKTTFHVPSGTDPLVLNLQGMGKGHAWVNGHSLGRFWPMQ 338

Query: 641 LAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
            A  TG +  CDYRG YD  KC  +CG P Q        W                    
Sbjct: 339 SADPTGYSGSCDYRGKYDKDKCLTNCGNPTQR-------W-------------------- 371

Query: 701 ISLLTKTGQHICSFVSEADPPPVDSWKPNLGVVSSSPQVRLACERGWHIAAINFASYGIP 760
                   +HI +F+                     P  R+       I+ I FAS+G P
Sbjct: 372 --------KHIATFM---------------------PNGRI-------ISVIQFASFGNP 395

Query: 761 EGNCGSFRPGACHMDVLPI-VQKACVGQIECSIPVSSAYLGV 801
           EG CGS + G          V+KACVG+  CS+ VS + LGV
Sbjct: 396 EGTCGSLQKGDFEAAYTAFAVEKACVGKESCSLGVSESTLGV 437



 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 16/26 (61%), Positives = 21/26 (80%)

Query: 139 LMKQENLFASQGGPIILAQVENEYGN 164
           + K+  LFAS GGPI+ AQ+EN+YGN
Sbjct: 1   MAKEAKLFASSGGPIVFAQIENDYGN 26


>gi|289768016|ref|ZP_06527394.1| beta-galactosidase [Streptomyces lividans TK24]
 gi|289698215|gb|EFD65644.1| beta-galactosidase [Streptomyces lividans TK24]
          Length = 595

 Score =  172 bits (436), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 115/351 (32%), Positives = 174/351 (49%), Gaps = 37/351 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + ++Y    L+ +G+   L +GS+HY R  P  W + +R+    GL  ++TYV WN+HE 
Sbjct: 4   STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G   F+G  DL RF++  QE GL + +R GPY CAEW+ GG P WL   PG++ RT++
Sbjct: 64  TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            P+ E + R+   ++  + +  L A +GGP++  Q+ENEYG+    YG     YV+   D
Sbjct: 124 GPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGS----YG-DDRAYVRHIRD 176

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------GFTPN---------SPSKPIM 227
             V    +    +    D P P++        +      G  P+          P++P  
Sbjct: 177 ALVARGIT---ELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFF 233

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
             E ++GWF  +G     RP    A  +    + GG+  + YM  GGTNFG  AG     
Sbjct: 234 CAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEG 292

Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAI-KLCEEYLISSDP 329
               P V TSYD DAPI E G +  PK+  LR+   A+  +     + +DP
Sbjct: 293 GTIRPTV-TSYDSDAPIAENGAL-TPKFFALRDRLTALGTVAARRPLPADP 341


>gi|380694789|ref|ZP_09859648.1| beta-galactosidase [Bacteroides faecis MAJ27]
          Length = 781

 Score =  172 bits (436), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 113/320 (35%), Positives = 166/320 (51%), Gaps = 19/320 (5%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++G+  V+++  IHYPR   E W   I+ SK  G+  I  YVFWN+HEP  G+Y F
Sbjct: 33  KTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMSKALGMNTICLYVFWNFHEPEEGKYDF 92

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+  F +  QE G+++ +R GPY CAEW  GG P WL     I+ R  +  + E +
Sbjct: 93  TGQKDIAAFCRMAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKEDIKLREQDPYYMERV 152

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
           K F+ ++   +   +L  S+GG II+ QVENEYG+  ++  Y       VK A  T V L
Sbjct: 153 KLFMNEVGKQLA--DLQISKGGNIIMVQVENEYGSFGIDKPYIAAIRDMVKQAGFTGVPL 210

Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
                W    + +A D ++ T N   G   D          P+ P+M +E +SGWF  +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTVNFGTGANIDQQFERLKELRPNTPLMCSEFWSGWFDHWG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
                R  E+L   +    +   +F + YM  GGT+FG   G          TSYDYDAP
Sbjct: 270 AKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328

Query: 296 IDEYGFIRQPKWGHLRELHK 315
           I+E G +  PK+  +R+L K
Sbjct: 329 INESGKV-TPKFLEVRDLLK 347



 Score = 44.3 bits (103), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 59/236 (25%), Positives = 95/236 (40%), Gaps = 53/236 (22%)

Query: 464 VMPGQGKEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDIL 522
            +P   +E  L I      A VF+N KKL        +    L   K E     + LDIL
Sbjct: 410 TLPASKEEQTLIITEAHDWAQVFLNGKKLATLSRLKGEGTVILPPMKEE-----SRLDIL 464

Query: 523 SMMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSS-GEW-IYQVGVEGEYIGLDKIS 579
              +G  N+G   +D  G        ++L++   +++S  +W +Y + V+  +       
Sbjct: 465 VEAMGRMNFGKGIYDWKGI----TEKVELQSNDGNITSLKDWQVYNIPVDYSFA------ 514

Query: 580 LANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSA 639
             N  + K+ +T    K   +Y+ TF   +  G   LN+ +  KG  W+NG ++GRYW  
Sbjct: 515 -QNKKYEKRDNT---EKYPAYYRGTFTL-DKVGDTFLNMMNWSKGMVWINGHAVGRYWEI 569

Query: 640 YLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                                        P QTLY +P  W+  G+N +VI +  G
Sbjct: 570 ----------------------------GPQQTLY-VPGCWLKEGDNEVVILDMAG 596


>gi|21224660|ref|NP_630439.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
 gi|3367753|emb|CAA20078.1| beta-galactosidase [Streptomyces coelicolor A3(2)]
          Length = 595

 Score =  172 bits (435), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 113/338 (33%), Positives = 169/338 (50%), Gaps = 36/338 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + ++Y    L+ +G+   L +GS+HY R  P  W + +R+    GL  ++TYV WN+HE 
Sbjct: 4   STLSYTDGTLLRNGRPHRLLAGSLHYFRVHPGHWADRLRRLAALGLNAVDTYVPWNFHER 63

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G   F+G  DL RF++  QE GL + +R GPY CAEW+ GG P WL   PG++ RT++
Sbjct: 64  TAGDIRFDGPRDLARFIRLAQEEGLDVVVRPGPYICAEWDNGGLPAWLTGTPGMRLRTSH 123

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            P+ E + R+   ++  + +  L A +GGP++  Q+ENEYG+    YG     YV+   D
Sbjct: 124 GPYLEAVDRWFDALVPRIAE--LQAGRGGPVVAVQIENEYGS----YG-DDRAYVRHIRD 176

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------GFTPN---------SPSKPIM 227
             V    +    +    D P P++        +      G  P+          P++P  
Sbjct: 177 ALVARGIT---ELLYTADGPTPLMQDGGALPGELAAATFGSRPDRAAALLRSRRPAEPFF 233

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
             E ++GWF  +G     RP    A  +    + GG+  + YM  GGTNFG  AG     
Sbjct: 234 CAEFWNGWFDHWGDKHHVRPAPSAAEDLGGILDEGGSV-SLYMAHGGTNFGLWAGANHEG 292

Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
               P V TSYD DAPI E G +  PK+  LR+   A+
Sbjct: 293 GTIRPTV-TSYDSDAPIAENGAL-TPKFFALRDRLTAL 328


>gi|410865123|ref|YP_006979734.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
 gi|410821764|gb|AFV88379.1| Beta-galactosidase [Propionibacterium acidipropionici ATCC 4875]
          Length = 591

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 116/343 (33%), Positives = 162/343 (47%), Gaps = 28/343 (8%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           T      ++DG+   + SG+IHY R  P+ W + I K++  GL  IETYV WN HEP+ G
Sbjct: 5   TIGEHDFLLDGRPHRILSGAIHYFRIHPDQWADRIHKARLMGLNTIETYVAWNAHEPVEG 64

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           Q+ +EG  DL  F+K V + G+   +R  PY CAEW+ GG P WL        R     F
Sbjct: 65  QWSWEGGLDLAAFLKAVADEGMHAIVRPAPYICAEWDNGGLPAWLFGEKAAGVRRDEPVF 124

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
              ++ +L ++ +++  E L    GGP+IL Q+ENEYG    AYG   E Y++   D   
Sbjct: 125 MAAVQAYLRRVYEVI--EPLQIHHGGPVILVQIENEYG----AYGSDPE-YLRKLVDITS 177

Query: 186 NLNTSVPWVMCQQEDAPDPIINTCNGFYCDG-FTPNSPSK-----------PIMWTENYS 233
           +   +VP     Q +       +  G    G F   SP +           P+M  E ++
Sbjct: 178 SAGITVPLTTVDQPEDGMLAAGSLPGLLRTGSFGSRSPERLATLRRHQPTGPLMCMEYWN 237

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLV 286
           GWF  +G        E  A  +     +G +  N YM  GGTNFG T G        P+V
Sbjct: 238 GWFDDWGTPHHTTDAEASAADLDALLGSGASV-NLYMLCGGTNFGLTNGANDKGTYEPIV 296

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
            TSYDYDAP+DE G      W     + +  +L  E    S P
Sbjct: 297 -TSYDYDAPLDEAGHPTAKYWAFREVIGRYTELPGEVPPGSSP 338


>gi|334338180|ref|YP_004543332.1| glycoside hydrolase family protein [Isoptericola variabilis 225]
 gi|334108548|gb|AEG45438.1| glycoside hydrolase family 35 [Isoptericola variabilis 225]
          Length = 603

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 151/311 (48%), Gaps = 29/311 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   + SG++HY R  P+ W + IRK++  GL  +ETYV WN H P RG +   G
Sbjct: 11  FLLDGRSLQIVSGALHYFRVHPDQWADRIRKARLLGLNTVETYVAWNVHSPERGVFDTSG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
           R DL RF+  V   GL   +R GPY CAEW  GG P WL   P +  R     F E +  
Sbjct: 71  RRDLARFLDLVAAEGLHAIVRPGPYICAEWTGGGLPAWLFADPEVGVRRAEPRFLEAIGE 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG----VGGELYVKWAADTAVNL 187
           + A ++ ++ +  +  ++GGP+++ QVENEYG    AYG    V  E Y++  AD     
Sbjct: 131 YYAALLPIVAERQV--TRGGPVLMVQVENEYG----AYGDDPPVERERYLRALADMIRAQ 184

Query: 188 NTSVPWVMCQQED------APDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGW 235
              VP     Q +         P + T   F             + P+ P+M  E + GW
Sbjct: 185 GIDVPLFTSDQANDHHLSRGSLPELLTTANFGSRATERLAILRKHQPTGPLMCMEFWDGW 244

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATS 289
           F S G      P E  A  +      G +  N YM  GGTNFG T+G         + TS
Sbjct: 245 FDSAGLHHHTTPPEANARDLDDLLAAGASV-NLYMLHGGTNFGLTSGANDKGVYRPITTS 303

Query: 290 YDYDAPIDEYG 300
           YDYDAP+ E+G
Sbjct: 304 YDYDAPLSEHG 314


>gi|413922057|gb|AFW61989.1| hypothetical protein ZEAMMB73_453254 [Zea mays]
          Length = 139

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 73/103 (70%), Positives = 94/103 (91%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A V+YDHRA+VI+G+RR+L SGSIHYPRSTPE+WP L++K+K+GGL+V++TYVFWN HE
Sbjct: 25  NAAVSYDHRAVVINGQRRILISGSIHYPRSTPEMWPGLLQKAKDGGLDVVQTYVFWNGHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYG 104
           P+RGQYYF  R+DLVRFVK  ++AGL++HLRIGPY CAEWN+G
Sbjct: 85  PVRGQYYFGDRYDLVRFVKLAKQAGLYVHLRIGPYVCAEWNFG 127


>gi|156382804|ref|XP_001632742.1| predicted protein [Nematostella vectensis]
 gi|156219802|gb|EDO40679.1| predicted protein [Nematostella vectensis]
          Length = 612

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 115/329 (34%), Positives = 162/329 (49%), Gaps = 25/329 (7%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  +  + R   +DGK   + SG++HY R  P+ W + I K K  GL  +ETYV WN HE
Sbjct: 39  SKGLVANGRHFTMDGKPFTILSGAMHYFRIPPQYWEDRIVKLKAMGLNTVETYVSWNLHE 98

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
            I+G + F+   D+V F+KT Q+  L++ +R GPY CAEW+ GG P WL   P I  R+ 
Sbjct: 99  EIQGDFNFKDGLDIVEFIKTAQKHDLYVIMRPGPYICAEWDLGGLPSWLLHNPNIYLRSL 158

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVE----WAYGVGGELYV 177
           +  F +   RF  ++I  +       S GGPII  Q+ENEY + +    +   +  E+ +
Sbjct: 159 DPIFMKATLRFFDELIPRLIDYQY--SNGGPIIAWQIENEYLSYDNSSAYMRKLQQEMVI 216

Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDP-IINTCN-----GFYCDGFTPNSPSKPIMWTEN 231
           +   +      +   W M  ++    P ++ T N          G     P+ P+M TE 
Sbjct: 217 RGVKELL--FTSDGIWQMQIEKKYSLPGVLKTVNFQRNETNILKGLRKLQPNMPLMVTEF 274

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
           +SGWF  +G       VE  A       +   +  NYYM  GGTNFG   G         
Sbjct: 275 WSGWFDHWGEDKHVLTVEKAAERTKNILKMESSI-NYYMLHGGTNFGFMNGANAENGKYK 333

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
           P + TSYDYDAPI E G I  PK+  LRE
Sbjct: 334 PTI-TSYDYDAPISESGDI-TPKYRELRE 360


>gi|269794634|ref|YP_003314089.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
 gi|269096819|gb|ACZ21255.1| beta-galactosidase [Sanguibacter keddieii DSM 10542]
          Length = 586

 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 103/308 (33%), Positives = 153/308 (49%), Gaps = 26/308 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   + SG++HY R  P++W + I K++  GL  IETYV WN H P RG++  +G
Sbjct: 8   FLLDGKPFRILSGALHYFRVHPDLWADRIHKARLMGLNTIETYVPWNAHAPQRGEFRTDG 67

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF++ V+  G+   +R GPY CAEW+ GG P WL   P +  R     + E +  
Sbjct: 68  ALDLERFLRLVEAEGMLAIVRPGPYICAEWDNGGLPGWLFRDPAVGVRRDEPLYMEAVSE 127

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L  ++DL+    +   +GGP++L QVENEYG    AYG    +Y++       +   +V
Sbjct: 128 YLGTVLDLVAPFQV--DRGGPVVLVQVENEYG----AYG-SDHVYLEKLMALTRSHGITV 180

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
           P     Q         + +G +  G               + P+ P+M  E + GWF  +
Sbjct: 181 PLTSIDQPSGTMLADGSIDGLHRTGSFGSRSAERLATLREHQPTGPLMCAEFWDGWFDHW 240

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
           G        +D A  +      G +  N YM+ GGTNFG T+G           TSYDYD
Sbjct: 241 GAHHHTTSAQDAARELDELLAAGASV-NIYMFHGGTNFGFTSGANDKGVYQPTTTSYDYD 299

Query: 294 APIDEYGF 301
           AP+ E G+
Sbjct: 300 APLAEDGY 307


>gi|383112460|ref|ZP_09933253.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
 gi|313693132|gb|EFS29967.1| hypothetical protein BSGG_0667 [Bacteroides sp. D2]
          Length = 782

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 113/318 (35%), Positives = 162/318 (50%), Gaps = 19/318 (5%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++GK  V+++  IHYPR   E W   I+  K  G+  I  YVFWN+HEP  G+Y F
Sbjct: 33  KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+  F +  QE G+++ +R GPY CAEW  GG P WL     I+ R  +  + E +
Sbjct: 93  TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
           K F+ ++   +   +L  S+GG II+ QVENEYG+  ++  Y       VK A  T V L
Sbjct: 153 KLFMNEVGKQLT--DLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210

Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
                W    + +A D ++ T N   G   D          P  P+M +E +SGWF  +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
                R  EDL   +    +   +F + YM  GGT+FG   G          TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328

Query: 296 IDEYGFIRQPKWGHLREL 313
           I+E G +  PK+  +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345



 Score = 43.9 bits (102), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 85/233 (36%), Gaps = 47/233 (20%)

Query: 464 VMPGQGKEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDIL 522
            +P   +E  L I      A VF++ KKL        +    L   K    EG   LDIL
Sbjct: 410 TLPASKEEQTLTITEAHDWAQVFLDGKKLATLSRLKGEGTVILPPMK----EGAQ-LDIL 464

Query: 523 SMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLAN 582
              +G  N+G        G+   + +   NG         +Y + V       D     N
Sbjct: 465 VEAMGRMNFGKGI-YDWKGITEKVEVQSNNGVITSLKNWKVYNIPV-------DYAFAQN 516

Query: 583 SSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
             F KQ +     K   +Y+ TF   +  G   LN+ +  KG  WVNG +IGRYW     
Sbjct: 517 KKFVKQDNP---QKYPAYYRGTFTL-DKTGDTFLNMTNWSKGMVWVNGYAIGRYWEI--- 569

Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                                     P QTLY +P  W+  GEN ++I +  G
Sbjct: 570 -------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596


>gi|423295816|ref|ZP_17273943.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
           CL03T12C18]
 gi|392671544|gb|EIY65016.1| hypothetical protein HMPREF1070_02608 [Bacteroides ovatus
           CL03T12C18]
          Length = 782

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 113/318 (35%), Positives = 162/318 (50%), Gaps = 19/318 (5%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++GK  V+++  IHYPR   E W   I+  K  G+  I  YVFWN+HEP  G+Y F
Sbjct: 33  KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+  F +  QE G+++ +R GPY CAEW  GG P WL     I+ R  +  + E +
Sbjct: 93  TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
           K F+ ++   +   +L  S+GG II+ QVENEYG+  ++  Y       VK A  T V L
Sbjct: 153 KLFMNEVGKQLA--DLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210

Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
                W    + +A D ++ T N   G   D          P  P+M +E +SGWF  +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
                R  EDL   +    +   +F + YM  GGT+FG   G          TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328

Query: 296 IDEYGFIRQPKWGHLREL 313
           I+E G +  PK+  +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345



 Score = 43.5 bits (101), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 47/177 (26%), Positives = 66/177 (37%), Gaps = 41/177 (23%)

Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
           LDIL   +G  N+G        G+   + +   NG         +Y + V       D  
Sbjct: 461 LDILVEAMGRMNFGKGI-YDWKGITEKVEVQSNNGVITSLKNWKVYNIPV-------DYA 512

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
              N  F KQ +     K   +Y+ TF   +  G   LN+ +  KG  WVNG +IGRYW 
Sbjct: 513 FAQNKKFVKQDNP---QKYPAYYRGTFTL-DKTGDTFLNMTNWSKGMVWVNGYAIGRYWE 568

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                                         P QTLY +P  W+  GEN ++I +  G
Sbjct: 569 I----------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596


>gi|294633111|ref|ZP_06711670.1| beta-galactosidase [Streptomyces sp. e14]
 gi|292830892|gb|EFF89242.1| beta-galactosidase [Streptomyces sp. e14]
          Length = 606

 Score =  169 bits (429), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 111/326 (34%), Positives = 160/326 (49%), Gaps = 35/326 (10%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A +T+    L+  G+   + SGS+HY R  P  W + + +    GL  ++TYV WN+HE
Sbjct: 14  AATLTHAGGTLLRAGRPHRILSGSLHYFRVHPGQWADRLARLAALGLNTVDTYVPWNFHE 73

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              G   F+G  DL RFV+  QE GL + +R GPY CAEW+ GG P WL   PG++ RT+
Sbjct: 74  RTPGDVRFDGWRDLDRFVRLAQETGLDVIVRPGPYICAEWDNGGLPAWLTGTPGMRPRTS 133

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           + PF   + R+  ++I  +    L A +GGP++  Q+ENEYG+    YG  G+ YV+W  
Sbjct: 134 HPPFLAAVARWFDQLIPRIAA--LQAGRGGPVVAVQIENEYGS----YGDDGD-YVRWVR 186

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------GFTPNS---------PSKPI 226
           D       +    +    D P  ++        +      G  P           P +P 
Sbjct: 187 DALTARGVT---ELLYTADGPTELMLDAGAVEGELAAATFGSRPEQAARLLRSRRPEEPF 243

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
              E ++GWF  +G     RP    A  V R    GG+  + YM  GGTNFG  AG    
Sbjct: 244 FCAEFWNGWFDHWGEQHHVRPARSAADDVGRILGAGGSL-SLYMAHGGTNFGLWAGANHD 302

Query: 284 -----PLVATSYDYDAPIDEYGFIRQ 304
                P V TSYD DAP+ E+G + +
Sbjct: 303 GDRLQPTV-TSYDSDAPVAEHGALTE 327


>gi|336417631|ref|ZP_08597952.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
           3_8_47FAA]
 gi|335935372|gb|EGM97326.1| hypothetical protein HMPREF1017_05060 [Bacteroides ovatus
           3_8_47FAA]
          Length = 782

 Score =  169 bits (429), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 112/318 (35%), Positives = 162/318 (50%), Gaps = 19/318 (5%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++GK  V+++  IHYPR   E W   I+  K  G+  I  YVFWN+HEP  G+Y F
Sbjct: 33  KTFLLNGKPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+  F +  QE G+++ +R GPY CAEW  GG P WL     I+ R  +  + E +
Sbjct: 93  TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
           K F+ ++   +   +L  ++GG II+ QVENEYG+  ++  Y       VK A  T V L
Sbjct: 153 KLFMNEVGKQLT--DLQINKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210

Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
                W    + +A D ++ T N   G   D          P  P+M +E +SGWF  +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
                R  EDL   +    +   +F + YM  GGT+FG   G          TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328

Query: 296 IDEYGFIRQPKWGHLREL 313
           I+E G +  PK+  +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345



 Score = 47.8 bits (112), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 87/233 (37%), Gaps = 47/233 (20%)

Query: 464 VMPGQGKEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDIL 522
            +P   +E  L I      A VF++ KKL        +    L   K    EG   LDIL
Sbjct: 410 TLPASKEEQTLTITEAHDWAQVFLDGKKLATLSRLKGEGTVILPPMK----EGAQ-LDIL 464

Query: 523 SMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLAN 582
              +G  N+G        G+   + I   NG         +Y + V       D     N
Sbjct: 465 VEAMGRMNFGKGI-YDWKGITEKVEIQSNNGVITSLKNWKVYNIPV-------DYAFAQN 516

Query: 583 SSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLA 642
             F KQ + L   K   +Y+ TF+  +  G   LN+ +  KG  WVNG +IGRYW     
Sbjct: 517 KEFMKQDNPL---KYPAYYRGTFML-DKTGDTFLNMTNWSKGMVWVNGYAIGRYWEI--- 569

Query: 643 PSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                                     P QTLY +P  W+  GEN ++I +  G
Sbjct: 570 -------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596


>gi|229084352|ref|ZP_04216632.1| Beta-galactosidase [Bacillus cereus Rock3-44]
 gi|228698892|gb|EEL51597.1| Beta-galactosidase [Bacillus cereus Rock3-44]
          Length = 867

 Score =  169 bits (429), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 119/399 (29%), Positives = 188/399 (47%), Gaps = 26/399 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +TYD ++  I  KR  + S +IHY R     W +++ K+K GG   IETY+ WN+HE   
Sbjct: 2   ITYDKKSWKIHNKRIFILSAAIHYFRLPKAEWDDVLEKAKAGGCNTIETYIPWNFHEMKE 61

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ F G  DL  F++     GL++  R GPY CAEW++GGFP WL     IQ+R+    
Sbjct: 62  GEWDFSGDKDLAHFLQLCANKGLYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPS 121

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F   + ++  ++I ++ +  L  ++ G +I+ Q+ENE+     AYG   + Y+++  D  
Sbjct: 122 FLHYVDQYFDQVISIIDEYQL--TKNGSVIMVQIENEFQ----AYGKPDKKYMEYLRDGM 175

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           +     VP+V C    A D  +   N         +        +P    E + GWF  +
Sbjct: 176 IARGIEVPFVTCY--GAVDGAVEFRNFWSGANRAAEILDERFADQPKGVMEFWIGWFEHW 233

Query: 240 -GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV-ATSYDYD 293
            G     +  E L     +    G T  NYYMYFGGTNF    GRT    +   T+YDYD
Sbjct: 234 GGNKANQKTPEQLERECYQLLRNGFTTINYYMYFGGTNFDHWGGRTVSEQVFCTTTYDYD 293

Query: 294 APIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDPTHQ--KLGAKLEAHIYHKSSND 349
             IDEY    QP  K+  L+  H  +K  E    +++  +   KL + L++        +
Sbjct: 294 VAIDEY---LQPTRKYEVLKRYHLFVKWLEPLFTNAEQANSDVKLSSDLKSGRIVSPHGE 350

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNV 388
                 N +    ++V     +      + ++LP  +NV
Sbjct: 351 VLFIENNRNERIQSHVKHGNELVPFTIEANAVLPIVRNV 389


>gi|254443764|ref|ZP_05057240.1| Glycosyl hydrolases family 35 [Verrucomicrobiae bacterium DG1235]
 gi|198258072|gb|EDY82380.1| Glycosyl hydrolases family 35 [Verrucomicrobiae bacterium DG1235]
          Length = 792

 Score =  169 bits (428), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 112/325 (34%), Positives = 165/325 (50%), Gaps = 35/325 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   ++ G +HY R   E W   I   +  G+  +  Y+FWNYHE   G++ +EG
Sbjct: 48  FLLDGEPIQIRCGELHYSRVPREYWKHRIEMIRAMGMNAVCVYLFWNYHEREEGEFTWEG 107

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
           + D+V F +  QEAGL++ LR GPY+CAEW  GG P WL     IQ RTT+  F    + 
Sbjct: 108 QADVVEFCRLAQEAGLWVVLRPGPYSCAEWEMGGLPWWLLKHDDIQLRTTDKRFISAARN 167

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           ++A++   +   NL  S+GGPI++ QVENEYG     YG   E Y+    ++ ++    V
Sbjct: 168 YMAEVGRTLG--NLQVSRGGPILMVQVENEYG----FYGSDPE-YMGAIRESLIDAGFEV 220

Query: 192 PWVMCQQEDAPDPIINTCNGFYCD-------GFTPNS---------PSKPIMWTENYSGW 235
           P   C      +P  +   G+  D       G  P S          + P+M  E Y GW
Sbjct: 221 PLFAC------NPPYHLERGYRDDLFQVVNFGSEPESAFAELRKVQATGPLMCGEFYPGW 274

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----ATSYD 291
           F ++G       +E+   A+ R  E   +F + YM  GGT FG  AG         +SYD
Sbjct: 275 FDTWGNPHHTGKIENYTGALGRMMEMRASF-SIYMAHGGTTFGFWAGADRPFKPDTSSYD 333

Query: 292 YDAPIDEYGFIRQPKWGHLRELHKA 316
           YDAP+ E G+   P++  LREL ++
Sbjct: 334 YDAPVSEAGWT-TPQYFRLRELMQS 357



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 62/263 (23%), Positives = 104/263 (39%), Gaps = 60/263 (22%)

Query: 468 QGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVG 527
           +G  V L   ++     VFV+ + +    G  D  +   +  I   +   TL+IL   +G
Sbjct: 422 KGPAVTLKAAAVNDFGWVFVDGEPM----GTFDRRSRTFSIDIPKRDSPATLEILVYAMG 477

Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
             N+G         +  V L+D K   R L  G   + + ++ +Y+   K   A+     
Sbjct: 478 RINFGPEVHDRKGLIGPVELVDEKGRARQLK-GWKHHSLPMDDDYLASLKYQAASE---- 532

Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
                   KS  ++++ F   E  G   L+L+S GKG  W+NG ++GRYW+         
Sbjct: 533 -------EKSPAFWRSEFELKE-TGDTFLDLSSWGKGAVWINGYALGRYWNI-------- 576

Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
                                P QT+Y +P  W+  G N +V+ + LG +   I+ L K 
Sbjct: 577 --------------------GPTQTMY-VPGPWLKEGRNEIVVLDLLGPESPVIAGLEK- 614

Query: 708 GQHICSFVSEADPPPVDSWKPNL 730
                        P +D+ +P L
Sbjct: 615 -------------PVLDTLRPEL 624


>gi|449493221|ref|XP_002196735.2| PREDICTED: beta-galactosidase [Taeniopygia guttata]
          Length = 636

 Score =  169 bits (428), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 175/662 (26%), Positives = 269/662 (40%), Gaps = 127/662 (19%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  + YD    V DGK     SGSIHY R  P  W + + K K  GL+ I+TYV WNYHE
Sbjct: 8   SFGIDYDSNCFVKDGKPFRYISGSIHYSRVPPYYWKDRLLKMKMAGLDAIQTYVPWNYHE 67

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G Y F G  DL  F++   + GL + LR GPY CAEW+ GG P WL     I  R++
Sbjct: 68  PQMGTYDFFGGKDLQYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSS 127

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG------ 172
           ++ + E ++R++  ++  M+        GGPII+ QVENEYG+    ++ Y         
Sbjct: 128 DSDYLEAVERWMGVLLPKMRP--YLYQNGGPIIMVQVENEYGSYFACDYNYLRFLLKLFR 185

Query: 173 ---GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
              G+  V +  D A   +    ++  +    + AP    N    F       + P  P+
Sbjct: 186 LHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPG--ANVTAAFLAQ--RSSEPKGPL 241

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
           + +E Y+GW   +G+     P + +A  +     +G    N YM+ GGTNF    G  + 
Sbjct: 242 VNSEFYTGWLDHWGHHHSVVPAQTIAKTLNEILASGANV-NLYMFIGGTNFAYWNGANMP 300

Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHI 342
                TSYDYDAP+ E G + + K+  LR++    K   E L  + PT  K        +
Sbjct: 301 YMPQPTSYDYDAPLSEAGDLTE-KYFALRKVIGMYKQLPEGL--TPPTTPKFAY---GKV 354

Query: 343 YHKSSNDCAAFLANYDSSSDANVTF-----NGNVYFLPAWSVSILPDCKNVVFNTAKVIS 397
             + +      L     S     T+         YF      + LP  KN V  T   +S
Sbjct: 355 RLQKAGTVLEVLDGLSRSGPVRSTYPLTFVELKQYFGYVLYRTTLP--KNCVEPTP--LS 410

Query: 398 QRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLW 457
              NG H                             +R++V  D   Q    +D S    
Sbjct: 411 SPLNGVH-----------------------------DRAYVSVDGVPQGVLERDKS---- 437

Query: 458 YTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN 517
               I++    G  + + +E++G           V FG  N+DF   + N  +  +    
Sbjct: 438 --LKINITGQAGASLDILVENMGR----------VNFGRYNNDFKGLVSNLTLAQD---- 481

Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
                 ++VG + Y    D+ GA  + +I + L + KR                   + +
Sbjct: 482 ------VLVGWEIYP--LDIDGAVNYDIIYL-LHHPKRS-----------------AIKE 515

Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
           +S    +F+    ++P              P+      +N     KGQ W+NG ++GRYW
Sbjct: 516 LSYEVPTFYTGTLSIPGG-----------IPDLPQDTYVNFPGWTKGQIWINGFNLGRYW 564

Query: 638 SA 639
            A
Sbjct: 565 PA 566


>gi|255691973|ref|ZP_05415648.1| glycosyl hydrolase [Bacteroides finegoldii DSM 17565]
 gi|260622382|gb|EEX45253.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
          Length = 782

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 112/318 (35%), Positives = 161/318 (50%), Gaps = 19/318 (5%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++G   V+++  IHYPR   E W   I+  K  G+  I  YVFWN+HEP  G+Y F
Sbjct: 33  KTFLLNGNPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGKYDF 92

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+  F +  QE G+++ +R GPY CAEW  GG P WL     I+ R  +  + E +
Sbjct: 93  TGQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERV 152

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--VEWAYGVGGELYVKWAADTAVNL 187
           K F+ ++   +   +L  S+GG II+ QVENEYG+  ++  Y       VK A  T V L
Sbjct: 153 KLFMNEVGKQLT--DLQISKGGNIIMVQVENEYGSFGIDKPYIAEIRDIVKQAGFTGVPL 210

Query: 188 NTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
                W    + +A D ++ T N   G   D          P  P+M +E +SGWF  +G
Sbjct: 211 -FQCDWNSNFENNALDDLLWTINFGTGANIDDQFKRLQELRPDIPLMCSEFWSGWFDHWG 269

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDYDAP 295
                R  EDL   +    +   +F + YM  GGT+FG   G          TSYDYDAP
Sbjct: 270 AKHETRSAEDLVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDYDAP 328

Query: 296 IDEYGFIRQPKWGHLREL 313
           I+E G +  PK+  +R L
Sbjct: 329 INESGKV-TPKYFEVRNL 345



 Score = 43.9 bits (102), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 47/177 (26%), Positives = 66/177 (37%), Gaps = 41/177 (23%)

Query: 519 LDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKI 578
           LDIL   +G  N+G        G+   + +   NG         +Y + V       D  
Sbjct: 461 LDILVEAMGRMNFGKGI-YDWKGITEKVEVQSNNGVITSLKNWKVYNIPV-------DYA 512

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
              N  F KQ +     K   +Y+ TF   +  G   LN+ +  KG  WVNG +IGRYW 
Sbjct: 513 FAQNKKFVKQDNP---QKYPAYYRGTFTL-DKTGDTFLNMTTWSKGMVWVNGYAIGRYWE 568

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                                         P QTLY +P  W+  GEN ++I +  G
Sbjct: 569 I----------------------------GPQQTLY-VPGCWLKKGENEVIILDMAG 596


>gi|29345700|ref|NP_809203.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|383123143|ref|ZP_09943828.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
 gi|29337593|gb|AAO75397.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251841761|gb|EES69841.1| hypothetical protein BSIG_0114 [Bacteroides sp. 1_1_6]
          Length = 779

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 112/321 (34%), Positives = 162/321 (50%), Gaps = 27/321 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++G+  V+++  IHYPR   E W   I+  K  G+  I  YVFWN+HEP  G+Y F 
Sbjct: 34  TFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKMCKALGMNTICLYVFWNFHEPEEGRYDFA 93

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  QE G+++ +R GPY CAEW  GG P WL     I+ R  +  + E +K
Sbjct: 94  GQKDIAAFCRLAQENGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLREQDPYYMERVK 153

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            FL ++   +   +L  S+GG II+ QVENEYG    A+G+  + Y+    D       T
Sbjct: 154 LFLNEVGKQLA--DLQISKGGNIIMVQVENEYG----AFGI-DKPYISEIRDMVKQAGFT 206

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C      + +A D ++ T N   G   D          P  P+M +E +SGWF 
Sbjct: 207 GVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLMCSEFWSGWFD 266

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDY 292
            +G     R  E+L   +    +   +F + YM  GGT+FG   G          TSYDY
Sbjct: 267 HWGAKHETRSAEELVKGMKEMLDRNISF-SLYMTHGGTSFGHWGGANFPNFSPTCTSYDY 325

Query: 293 DAPIDEYGFIRQPKWGHLREL 313
           DAPI+E G +  PK+  +R L
Sbjct: 326 DAPINESGKV-TPKYLEVRNL 345



 Score = 45.8 bits (107), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 78/191 (40%), Gaps = 49/191 (25%)

Query: 512 LNEGINTLDILSMMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVE 569
           L EG + LDIL   +G  N+G   +D  G        ++L++ K      +W +Y + V+
Sbjct: 455 LKEG-DRLDILVEAMGRMNFGKGIYDWKGI----TEKVELQSDKGVELVKDWQVYTIPVD 509

Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
                    S A    +KQ           +Y++TF   E  G   LN+ +  KG  WVN
Sbjct: 510 --------YSFARDKQYKQQEN--AENQPAYYRSTFNLNE-LGDTFLNMMNWSKGMVWVN 558

Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
           G +IGRYW                               P QTLY +P  W+  GEN ++
Sbjct: 559 GHAIGRYWEI----------------------------GPQQTLY-VPGCWLKKGENEII 589

Query: 690 IHEELGGDPSK 700
           I +  G  PSK
Sbjct: 590 ILDMAG--PSK 598


>gi|84494646|ref|ZP_00993765.1| beta-galactosidase [Janibacter sp. HTCC2649]
 gi|84384139|gb|EAQ00019.1| beta-galactosidase [Janibacter sp. HTCC2649]
          Length = 592

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 117/345 (33%), Positives = 169/345 (48%), Gaps = 47/345 (13%)

Query: 19  RVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRF 78
           RVL SG+IHY R  P++W + +R+    GL  +ETYV WN+HE +RG+  F G  DL RF
Sbjct: 25  RVL-SGAIHYFRIHPDLWEDRLRRLAAMGLNTVETYVAWNFHERVRGEIDFTGPRDLARF 83

Query: 79  VKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIID 138
           +    + GL + +R GPY CAEW++GG P WL   PGI  RT++  F   +  +   ++ 
Sbjct: 84  ISLAGDLGLDVIVRPGPYICAEWDFGGLPAWLMTEPGIALRTSDPAFLAAVDDWFDAVVP 143

Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG---ELYVKWAADTAVNLNTSVPWVM 195
           +++   L  + GGP++  QVENEYG    +YG      E   K   D  ++       V+
Sbjct: 144 VIRP--LLTTAGGPVVAVQVENEYG----SYGDDAAYLEHCRKGLLDRGID-------VL 190

Query: 196 CQQEDAPDP----------IINTCN-GFYCD----GFTPNSPSKPIMWTENYSGWFLSFG 240
               D P P          ++ T N G   D          P+ P M  E ++GWF  +G
Sbjct: 191 LFTSDGPGPDWLDNGTIPGVLATVNFGSRTDEAFAELRKVQPAGPDMVMEYWNGWFDHWG 250

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-------VATSYDYD 293
                R V+D A  +      GG+  N+YM  GGTNFG  +G  +         TSYDYD
Sbjct: 251 EPHHVRDVDDAAGVLDDVLRAGGSV-NFYMAHGGTNFGLWSGANVEDGKLQPTVTSYDYD 309

Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKL 338
           A + E G +  PK+   RE      +   Y +++ P    L A+L
Sbjct: 310 AAVGEAGEL-TPKFHAFRE------VISRYAVTALPELPPLPARL 347


>gi|256376699|ref|YP_003100359.1| beta-galactosidase [Actinosynnema mirum DSM 43827]
 gi|255921002|gb|ACU36513.1| Beta-galactosidase [Actinosynnema mirum DSM 43827]
          Length = 579

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 104/308 (33%), Positives = 157/308 (50%), Gaps = 28/308 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   + +G++HY R  P++W + I K++  GL  IETY  WN HEP+ G Y F G
Sbjct: 11  FLLDGRPHRVLAGALHYFRVHPDLWADRIEKARLMGLNTIETYTPWNLHEPVEGAYDFTG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF++ V +AG+   +R GPY CAEW+ GG P WL+  P +  R +   +   +  
Sbjct: 71  MLDLERFLRLVADAGMHAIVRPGPYICAEWDNGGLPAWLYRDPEVGVRRSEPRYLGAVSA 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L ++ D++    L   +GGP++L Q+ENEYG    AYG   + Y++   D       +V
Sbjct: 131 YLRRVYDVVTP--LQIDRGGPVVLVQIENEYG----AYG-SDKFYLRHLVDLTRECGITV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
           P     Q         + +  +  G               + P+ P+M +E ++GWF  +
Sbjct: 184 PLTTVDQPTDEMLSQGSLDCLHRTGSFGSRATERLATLRRHQPTGPLMCSEFWNGWFDHW 243

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDY 292
           G        ED A  +      G +  N YM+ GGTNFG T+G        P + TSYDY
Sbjct: 244 GDRHHTTSAEDSAAELDALLAAGASV-NIYMFHGGTNFGLTSGANDKGVYQPTI-TSYDY 301

Query: 293 DAPIDEYG 300
           DAP+DE G
Sbjct: 302 DAPLDEAG 309


>gi|410456453|ref|ZP_11310314.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
 gi|409928122|gb|EKN65245.1| beta-galactosidase [Bacillus bataviensis LMG 21833]
          Length = 867

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/400 (30%), Positives = 187/400 (46%), Gaps = 28/400 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +TYD ++  I  +R  + S +IHY R     W E++ K+K GG   IETY+ WN+HE   
Sbjct: 2   ITYDKKSWKIHNERVFILSAAIHYFRLPRAEWNEVLDKAKAGGCNTIETYIPWNFHEMNE 61

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ F G  DL  F +   +  L++  R GPY CAEW++GGFP WL     IQ+R+    
Sbjct: 62  GEWDFSGDKDLAHFFQLCADKELYVIARPGPYICAEWDFGGFPWWLSTKKDIQYRSAQPA 121

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F   + ++  ++I ++ +  L  ++ G +I+ QVENE+     AYG   + Y+++  D  
Sbjct: 122 FLHYVDQYFDRVIPIIDEYQL--TKNGTVIMVQVENEFQ----AYGKPDKPYMEYIRDGM 175

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLS 238
                 VP V C    A +  +   N F+              P +P    E + GWF  
Sbjct: 176 KARGIDVPLVTCY--GAVEGAVEFRN-FWSHSKHAAAILDERFPDQPKGVMEFWIGWFEQ 232

Query: 239 F-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAG-GPLVATSYDY 292
           + G     +  E L     +    G T  NYYMYFGGTNF    GRT G   L  T+YDY
Sbjct: 233 WGGNKADQKTPEQLERECYQLLSNGFTAINYYMYFGGTNFDHWGGRTVGEQTLCTTTYDY 292

Query: 293 DAPIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDP--THQKLGAKLEAHIYHKSSN 348
           D  IDEY    QP  K+  L+  H  +K  E     ++   +  KL + L++        
Sbjct: 293 DVAIDEY---LQPTRKYEVLKRYHSFVKWLEPLFTDAEKVASDMKLPSDLKSERIASPYG 349

Query: 349 DCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNV 388
           +      N +    ++V    +       + ++LP  +NV
Sbjct: 350 EVIFIENNRNERIQSHVKHGYDQILFTIEANTVLPIVRNV 389


>gi|328958462|ref|YP_004375848.1| beta-galactosidase [Carnobacterium sp. 17-4]
 gi|328674786|gb|AEB30832.1| beta-galactosidase [Carnobacterium sp. 17-4]
          Length = 589

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 178/661 (26%), Positives = 262/661 (39%), Gaps = 157/661 (23%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG++HY R  PE W   +   K  G   +ETY+ WN HEP  G+Y F G
Sbjct: 10  FLLNGEPFKITSGAVHYFRVLPEDWYHSLYNLKALGFNTVETYIPWNVHEPKEGEYQFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
           ++D+ +FV+  +E GLF+ LR  PY CAEW +GG P WL     +  R+++  F E++ R
Sbjct: 70  QWDIKKFVQLAEELGLFVILRPSPYICAEWEFGGLPAWLLTYKDMLIRSSDPVFIEKVSR 129

Query: 132 FLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
           +     +L+KQ   L    GGP+I+ Q+ENEYG    +YG   E Y++   +  + L  +
Sbjct: 130 YYK---ELLKQITPLQVDHGGPVIMMQLENEYG----SYGEDKE-YLRTLYELMLKLGVT 181

Query: 191 VP-------WVMCQQE-DAPDPIINTCNGF---------YCDGFTPNSPSK-PIMWTENY 232
           +P       W   Q+     D  I T   F             F  +   K P+M  E +
Sbjct: 182 IPIFTSDGAWRATQEAGTMTDLDILTTGNFGSRSKENFKELKEFHESKGKKWPLMCMEYW 241

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPL 285
            GWF  +   +  R   +L   V    E G    N YM+ GGTNFG       R      
Sbjct: 242 DGWFNRWNDPIIKRDALELTQDVKEALEIGSL--NLYMFHGGTNFGFMNGCSARLRKDLP 299

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
             TSYDYDAP++E G                           +PT +    K   ++  +
Sbjct: 300 QVTSYDYDAPLNEQG---------------------------NPTEKYFALK---NMMQE 329

Query: 346 SSNDCAAF--LANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGD 403
           S  D      L   DS S  N+   G V  L                +    I+++    
Sbjct: 330 SFPDIEQHPPLVK-DSMSITNIQVGGKVSLL----------------SIVDRIAKKQESR 372

Query: 404 HPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIH 463
           +P    K + EL           ++ G +  RS+V+ D  E+     D SD L +     
Sbjct: 373 YP----KTMEEL----------GQQYGYTLYRSYVKKDSDEEFYRVIDGSDRLHF----- 413

Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
                    FLN E +       + +K+ A                     G N LD+L 
Sbjct: 414 ---------FLNEEKIATQYQEEIGEKIYASPIS-----------------GSNQLDVLV 447

Query: 524 MMVGLQNYGAWF--DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD---KI 578
             +G  NYG     D    G+   ++ DL                    E   LD    +
Sbjct: 448 ENMGRVNYGHKLLADTQQKGIRRGVMSDL--------------HFITNWEQYSLDFSEPL 493

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
           S+     WK+ S      S   YK T  APE      +N+   GKG   VNG +IGR+W+
Sbjct: 494 SIDFDKEWKENSP-----SFYQYKVTIDAPEDT---FINMELFGKGIVLVNGFNIGRFWN 545

Query: 639 A 639
            
Sbjct: 546 V 546


>gi|357626884|gb|EHJ76789.1| putative carbamoyl-phosphate synthase large chain [Danaus
           plexippus]
          Length = 2861

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 115/334 (34%), Positives = 162/334 (48%), Gaps = 56/334 (16%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N++      ++DGK   + SGS+HY R   E W + +RK +  GL  + TYV W+ HE
Sbjct: 52  ARNISIVGDDFMLDGKPLRIVSGSVHYYRLPAEYWRDRLRKIRAAGLNAVSTYVEWSSHE 111

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRT 120
              G Y FEG  D+ RF+K   E  L++ LR GPY CAE + GG P W L   P I+ RT
Sbjct: 112 EEEGAYSFEGDKDIARFLKIAAEENLYVLLRPGPYICAERDLGGLPYWLLSKYPDIKLRT 171

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
           T+  F  E K+++AK+ + +K        GGPIIL QVENEYG    +YG   E Y+K  
Sbjct: 172 TDGNFIAETKKWMAKLFEEVKP--FLLGNGGPIILVQVENEYG----SYGASKE-YMKQI 224

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNG----FYCDG----------FTPNS----- 221
            D           +    EDA   ++ T +G    ++ DG          F P +     
Sbjct: 225 RDI----------IKSHVEDA--ALLYTTDGPYRSYFIDGSISGTLTTIDFGPTTSVINT 272

Query: 222 --------PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFG 273
                   P  P+M +E Y GW   +   +     + + F +    E      N+Y++FG
Sbjct: 273 FKELRAYMPVGPLMNSEFYPGWLTHWSEHIQQVSTDRVTFTLRDMLENKINL-NFYVFFG 331

Query: 274 GTNFGRTAGG-------PLVATSYDYDAPIDEYG 300
           GTNF  T+G        P + TSYDYDAP+ E G
Sbjct: 332 GTNFEFTSGANYGRFYQPDI-TSYDYDAPLSEAG 364


>gi|255550379|ref|XP_002516240.1| beta-galactosidase, putative [Ricinus communis]
 gi|223544726|gb|EEF46242.1| beta-galactosidase, putative [Ricinus communis]
          Length = 216

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 88/207 (42%), Positives = 114/207 (55%), Gaps = 46/207 (22%)

Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
             G+ Y+ W +D A +L+  VPW++CQQ DAP P+INTC G+YCD FTPN+ + P  WTE
Sbjct: 55  TAGKAYLDWCSDMAESLDIGVPWIICQQRDAPQPMINTCYGWYCDQFTPNTANSPKKWTE 114

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-VATS 289
           N++GWF S+G   P R  E +AFAVARFF+    FQN YMY GGTNFGRTAGGP    TS
Sbjct: 115 NWTGWFKSWGDKDPHRTAEGVAFAVARFFQ----FQNCYMYHGGTNFGRTAGGPYSTTTS 170

Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSND 349
           +DYDAP+DE+  I                                         H +  +
Sbjct: 171 HDYDAPLDEHVTI-----------------------------------------HATEKE 189

Query: 350 CAAFLANYDSSSDANVTFNGNVYFLPA 376
            + F  N + +SDA + F G  Y +PA
Sbjct: 190 SSCFFGNINETSDAVIEFRGAKYKIPA 216


>gi|255652865|ref|NP_001157373.1| beta-galactosidase [Bombyx mori]
 gi|239938036|gb|ACS36117.1| beta-galactosidase [Bombyx mori]
          Length = 606

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 167/346 (48%), Gaps = 26/346 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           N++      +IDGK   + SGS+HY R     W + + K K  GL  + TYV W+YHEP 
Sbjct: 5   NISIVGDKFMIDGKPLHIISGSLHYFRVPAVYWRDRLHKFKAAGLNTVATYVEWSYHEPE 64

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
             QY FEG  DLVRFV+T  E GL + LR+GPY CAE + GG P W L   P I+ RTT+
Sbjct: 65  EKQYNFEGDRDLVRFVQTAAEVGLHVLLRVGPYICAERDLGGLPYWLLGKYPNIKLRTTD 124

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVG-GELYVKW 179
             F  E   +L K+ +  +  +L    GGPIIL QVENEYG  + + AY     +L    
Sbjct: 125 KDFIAESDIWLKKLFE--QVSHLLFGNGGPIILVQVENEYGSYDSDLAYKEKMRDLISAH 182

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF---------YCDGFTPNSPSKPIMWTE 230
             D A+   T  P ++        P ++    F         +   F       P+M +E
Sbjct: 183 VGDKALLYTTDGPSLVGA---GMIPGVHATIDFGVTSQPTEQFDSLFHLRPAPGPLMNSE 239

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            Y GW   +G  +      D+   + R         N+Y++FGG+NF  T+G        
Sbjct: 240 FYPGWLTHWGERMARVGTNDIVLTL-RNMIVNKIHVNFYVFFGGSNFEFTSGANFDGTYQ 298

Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT 330
              TSYDYDAP+ E G    PK+  +RE  K +   +E +    P+
Sbjct: 299 PDITSYDYDAPLSEAG-DPTPKYYAIRETLKQLNFVDEKIEPPQPS 343



 Score = 44.7 bits (104), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 2/46 (4%)

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMG--KGQAWVNGQSIGRYW 637
           V +   +Y+ TF+ PEG+ PL   L + G  KG  WVNG ++GRYW
Sbjct: 502 VTQGPTFYEGTFVLPEGQKPLDTFLDTTGWDKGYVWVNGHNLGRYW 547


>gi|227538632|ref|ZP_03968681.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
 gi|227241551|gb|EEI91566.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33300]
          Length = 638

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 172/348 (49%), Gaps = 44/348 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   + SG +HY R   + W   ++  K  GL  + TYVFWN+HE   G + FEG
Sbjct: 41  FVYDGKTTRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEG 100

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  F+KT  E GL + LR GPYACAEW++GG+P WL  I G++ R  N  F E  K+
Sbjct: 101 DHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDNAKFLEYTKK 160

Query: 132 FLAKIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLN 188
           +    ID + +E  +L  + GGPII+ Q ENE+G+ V     +  E +  + A     L 
Sbjct: 161 Y----IDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLE 216

Query: 189 TS---VPWVMCQ-----QEDAPDPIINTCNG--------FYCDGFTPNSPSKPIMWTENY 232
            +   VP          +  A    + T NG           D +  N+   P M  E Y
Sbjct: 217 EAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQGPYMVAEFY 274

Query: 233 SGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------- 283
            GW     +A PF  V+   +A    ++ +   +F NYYM  GGTNFG T+G        
Sbjct: 275 PGWLDH--WAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGANYNNKSD 331

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
             P + TSYDYDAPI E G+   PK+  +R +   I+   +Y + + P
Sbjct: 332 IQPDI-TSYDYDAPISEAGW-ATPKYDSIRTV---IQKYADYTVPAVP 374



 Score = 47.8 bits (112), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 70/262 (26%), Positives = 110/262 (41%), Gaps = 61/262 (23%)

Query: 444 EQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFAN 503
           EQ+N     + Y+ Y+   +  P  GK   L I+ L   A+V+++   V  G  N  F N
Sbjct: 412 EQLN---QANGYVLYSKQFN-QPINGK---LKIDGLRDFAVVYIDGTKV--GELNRVFKN 462

Query: 504 FLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEW 562
           + ++  I  N   +TL IL   +G  NYG+       G+ S +LI+      D+  +G+W
Sbjct: 463 YEMDIDIPFN---STLQILVENMGRINYGSEMIHNHKGIISPVLIN------DMEITGDW 513

Query: 563 IYQV-------GVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLA 615
             Q         + G+     + +  N+S     +  PV      Y+ TF   E  G   
Sbjct: 514 TMQQLPMDKVPDLAGKQTAAIQNTKTNASKIAALTGQPV-----LYQGTFDLKE-IGDTF 567

Query: 616 LNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYH 675
           +++   GKG  ++NG +IGRYW       TG                      P  TLY 
Sbjct: 568 IDMEKWGKGIVFINGINIGRYW------KTG----------------------PQHTLY- 598

Query: 676 IPRTWVHPGENLLVIHEELGGD 697
           IP  ++  G N +VI E+L  +
Sbjct: 599 IPAPYLKKGSNSIVIFEQLNDE 620


>gi|443697452|gb|ELT97928.1| hypothetical protein CAPTEDRAFT_112460 [Capitella teleta]
          Length = 651

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/300 (35%), Positives = 147/300 (49%), Gaps = 17/300 (5%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +D K   + SG++HY R  PE W + + + K  GL  +ETYV WN HE I G++ F G  
Sbjct: 65  LDNKELRILSGAMHYFRIVPEYWLDRLTRMKAAGLNTVETYVPWNLHEEIHGEFVFTGML 124

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D+ RFV   ++ GL + LR GP+ C+EW +GG P WL   P +  R+T  PF +  + ++
Sbjct: 125 DIRRFVAIAEKVGLLVILRPGPFICSEWEFGGLPSWLLRDPQMDVRSTYRPFMDAARSYM 184

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN-LNTSVP 192
             +I  +  E++    GGPII  Q+ENEYG+         EL         +  L TS  
Sbjct: 185 RSLISEL--EDMQYQYGGPIIAMQIENEYGSYSDDVNYMQELKNIMTDSGVIEILFTSDN 242

Query: 193 WVMCQQEDAPDPIINTC------NGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
               Q    P   + T        G   D      P KP+M  E +SGWF  +       
Sbjct: 243 KHGLQPGRVPGVFMTTNFKNTNEGGRMFDKLHELQPGKPLMVMEFWSGWFDHWEEKHHTM 302

Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAPIDEYG 300
            +E+ A AV    + G +  N YM+ GGTNFG   G       P V TSYDYD+P+ E G
Sbjct: 303 SLEEYASAVEYILQQGSSI-NLYMFHGGTNFGFLNGANTEPYLPTV-TSYDYDSPLSEAG 360


>gi|300775043|ref|ZP_07084906.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
 gi|300506858|gb|EFK37993.1| beta-galactosidase [Chryseobacterium gleum ATCC 35910]
          Length = 621

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 119/334 (35%), Positives = 166/334 (49%), Gaps = 37/334 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++GK   + SG IHYPR     W   +   K  GL  + TYVFWNYHE   G++ F G
Sbjct: 38  FLLNGKPFTIYSGEIHYPRVPSAYWKHRLEMMKAMGLNTVTTYVFWNYHEEAPGKWNFSG 97

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL +F+KT QE GL++ +R GPY CAEW +GG+P WL     ++ R  N  F EE  +
Sbjct: 98  EKDLQKFIKTAQETGLYVIIRPGPYVCAEWEFGGYPWWLQKNKELEIRRDNKAFSEECWK 157

Query: 132 F---LAKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWA---ADTA 184
           +   LAK I  M+  N     GGP+I+ Q ENE+G+ V     +  E + K++    +  
Sbjct: 158 YISQLAKQITPMQITN-----GGPVIMVQAENEFGSYVAQRKDIPLEEHRKYSHKIKEML 212

Query: 185 VNLNTSVPWVMCQ-----QEDAPDPIINTCNGFYCDGFTPNSPSK------PIMWTENYS 233
           +    SVP          +  + +  + T NG         S ++      P M  E Y 
Sbjct: 213 LKSGISVPLFTSDGSSLFKGGSVEGALPTANGESDIDVLKKSINEYNGGKGPYMIAEYYP 272

Query: 234 GWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           GW     +A PF  V  E++      + E G +F NYYM  GGTNFG T+G         
Sbjct: 273 GWLDH--WAEPFVKVSTEEVVKQTNLYIENGVSF-NYYMIHGGTNFGFTSGANYDKDHDI 329

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
               TSYDYDAPI E G+   PK+  LR++ + I
Sbjct: 330 QPDLTSYDYDAPISEAGWA-TPKYNALRKIFQKI 362


>gi|300726558|ref|ZP_07060002.1| beta-galactosidase [Prevotella bryantii B14]
 gi|299776172|gb|EFI72738.1| beta-galactosidase [Prevotella bryantii B14]
          Length = 781

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 108/313 (34%), Positives = 152/313 (48%), Gaps = 32/313 (10%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           H+  +++GK   +++  +HYPR     W   I+  K  G+  I  YVFWN HE   G++ 
Sbjct: 35  HKTFLLNGKPFTVKAAELHYPRIPRPYWEHRIKMCKALGMNAICIYVFWNIHEQKEGEFN 94

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F G  D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R  +  F E 
Sbjct: 95  FTGNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIKLRERDPYFMER 154

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE--------LYVKWA 180
           +K F  K+ + +    L   +GGPII+ QVENEYG    +YG+  +        L   W 
Sbjct: 155 VKIFEDKVAEQLAP--LTIQRGGPIIMVQVENEYG----SYGIDKQYVGEIRDMLRQGWG 208

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYS 233
            D  +       W      +  D +I T N   G   D          P  P+M +E +S
Sbjct: 209 NDVKM---FQCDWSSNFTHNGLDDLIWTMNFGTGANIDNQFKKLKSLRPDAPLMCSEFWS 265

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVA 287
           GWF  +G     RP +D+   +      G +F + YM  GGT+FG  AG       P V 
Sbjct: 266 GWFDKWGARHETRPAQDMVNNIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQPDV- 323

Query: 288 TSYDYDAPIDEYG 300
           TSYDYDAPI+EYG
Sbjct: 324 TSYDYDAPINEYG 336


>gi|300770171|ref|ZP_07080050.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
 gi|300762647|gb|EFK59464.1| beta-galactosidase [Sphingobacterium spiritivorum ATCC 33861]
          Length = 638

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 172/348 (49%), Gaps = 44/348 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   + SG +HY R   + W   ++  K  GL  + TYVFWN+HE   G + FEG
Sbjct: 41  FVYDGKATRILSGEMHYARIPHQYWKHRLQMVKSMGLNTVATYVFWNFHEESPGNWNFEG 100

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  F+KT  E GL + LR GPYACAEW++GG+P WL  I G++ R  N  F E  K+
Sbjct: 101 DHDLAAFIKTAGEVGLHVILRPGPYACAEWDFGGYPWWLQKIDGLEIRRDNAKFLEYTKK 160

Query: 132 FLAKIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLN 188
           +    ID + +E  +L  + GGPII+ Q ENE+G+ V     +  E +  + A     L 
Sbjct: 161 Y----IDRLAKEVGSLQITNGGPIIMVQAENEFGSYVSQRKDIPLEEHKAYNAKIKKQLE 216

Query: 189 TS---VPWVMCQ-----QEDAPDPIINTCNG--------FYCDGFTPNSPSKPIMWTENY 232
            +   VP          +  A    + T NG           D +  N+   P M  E Y
Sbjct: 217 EAGFNVPLFTSDGSWLFEGGAIPGALPTANGENNISNLKKVVDQY--NNNQGPYMVAEFY 274

Query: 233 SGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------- 283
            GW     +A PF  V+   +A    ++ +   +F NYYM  GGTNFG T+G        
Sbjct: 275 PGWLDH--WAEPFAKVDAGRIARQTEKYLQNDISF-NYYMVHGGTNFGFTSGANYNNKSD 331

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
             P + TSYDYDAPI E G+   PK+  +R +   I+   +Y + + P
Sbjct: 332 IQPDI-TSYDYDAPISEAGWTT-PKYDSIRTV---IQKYADYTVPAIP 374



 Score = 48.9 bits (115), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 73/256 (28%), Positives = 110/256 (42%), Gaps = 68/256 (26%)

Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
           Y+ Y+   +  P  GK   L I+ L   A+V+++   V  G  N  F N+ ++  I  N 
Sbjct: 420 YVLYSKQFN-QPINGK---LKIDGLRDFAVVYIDGTKV--GELNRVFKNYEMDIDIPFN- 472

Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLS-SGEWIYQVGVEGEYI 573
             +TL IL   +G  NYG+       G+ S +LI+      D+  +G+W  Q       +
Sbjct: 473 --STLQILVENMGRINYGSEIIHNHKGIISPVLIN------DMEITGDWTMQ------QL 518

Query: 574 GLDKI-SLANSSFWKQGSTL---PVNKSLI--------WYKTTFLAPEGKGPLALNLASM 621
            +DK+  LA     KQ +T+    VN S I         Y+ TF   E  G   +++   
Sbjct: 519 PMDKVPDLAG----KQTATIQNTKVNTSKIATLKGQPVLYQGTFDLKE-IGDTFIDMEKW 573

Query: 622 GKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWV 681
           GKG  ++NG +IGRYW       TG                      P  TLY IP  ++
Sbjct: 574 GKGIVFINGINIGRYW------KTG----------------------PQHTLY-IPGPYL 604

Query: 682 HPGENLLVIHEELGGD 697
             G N +VI E+L  +
Sbjct: 605 KKGSNSIVIFEQLNDE 620


>gi|334134215|ref|ZP_08507725.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
 gi|333608023|gb|EGL19327.1| putative beta-galactosidase [Paenibacillus sp. HGF7]
          Length = 940

 Score =  166 bits (421), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 115/338 (34%), Positives = 164/338 (48%), Gaps = 26/338 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V YD  + +IDG+R  + S ++HY R     W E++ KSKE G   IETYV WN+HE  
Sbjct: 5   RVQYDRNSWIIDGRRVFILSAAVHYFRLPRAEWAEVLDKSKEAGCNCIETYVPWNWHEEE 64

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQ+ F G  DL  F+    E GL++ +R GPY CAEW+ GG P WL   P +Q+R  + 
Sbjct: 65  EGQWDFSGDKDLGAFLDLCAERGLYVIVRPGPYICAEWDMGGLPYWLERKPDMQYRKFHR 124

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            F   +  +  +++ ++    L  S  G +I+ QVENE+     A G   + Y+++  D 
Sbjct: 125 EFLHYVDLYWDRLVPVVLPRLL--SNSGTVIMVQVENEFQ----ALGKPDKAYMEYLRDG 178

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGF-----YCDGFTPNSPSKPIMWTENYSGWFLS 238
            +     VP V C    A D  +   N +     +          +P    E + GWF  
Sbjct: 179 LIERGIDVPLVTCY--GAVDGAVEFRNFWSHAEEHARTLEERFADQPKGVLEFWIGWFEQ 236

Query: 239 FGYAVPFRPVEDLAFAVAR----FFETGGTFQNYYMYFGGTNF----GRTAGG-PLVATS 289
           +G     R  +  A  V R        G T  NYYM+FGGTNF    GRT G    + TS
Sbjct: 237 WGGP---RANQKTASQVERKTYELIREGFTAINYYMFFGGTNFGHWGGRTIGEHTFMTTS 293

Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
           YDYDA +DEY      K+  L+ +H  ++  E  L  +
Sbjct: 294 YDYDAALDEY-LRPTAKYKALKLVHDFVRWMEPLLTET 330


>gi|402813167|ref|ZP_10862762.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
 gi|402509110|gb|EJW19630.1| beta-galactosidase Bga [Paenibacillus alvei DSM 29]
          Length = 580

 Score =  166 bits (420), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 115/349 (32%), Positives = 173/349 (49%), Gaps = 39/349 (11%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           ++Y+ +  +++GK   L SG++HY R  PE W + +RK K  G   +ETY+ WN HEP  
Sbjct: 4   LSYEDQHFMLEGKPIQLISGAVHYFRIVPEYWEDRLRKVKAMGCNCVETYIAWNVHEPRD 63

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQ+ F+G  D+V F++  Q   L + +R  PY CAEW +GG P WL     I+ R ++  
Sbjct: 64  GQFNFDGIADVVEFIRIAQRVDLLVIVRPSPYICAEWEFGGMPAWL-LKEDIRLRCSDPR 122

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F E++  +   +I  +K   L ++ GGPII  Q+ENEYG    +YG   + Y++   +  
Sbjct: 123 FLEKVSAYYDALIPQLKP--LLSTSGGPIIAVQIENEYG----SYG-NDQAYLQALRNML 175

Query: 185 VNLNTSV-------PWVMCQQEDAPDPIINTCNGFYCDGFTPN---------SPSKPIMW 228
           V     V       P     Q    + ++ T N     G  P           P+ P+M 
Sbjct: 176 VERGIDVLLFTSDGPADDMLQGGMTEGVLATVNF----GSRPKEAFGKLEEYQPNAPLMC 231

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
            E ++GWF  +      R  ED A  +      G +  N+YM  GGTNFG ++G      
Sbjct: 232 MEYWNGWFDHWFEEHHTRSAEDAAQVLDEMLSMGASV-NFYMLHGGTNFGFSSGANHGGR 290

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
             P V TSYDYD+ I E G I  PK+   R+ + K + L E+ +  + P
Sbjct: 291 YKPTV-TSYDYDSAISEAGDI-TPKYQLFRKVIGKYVSLSEDDMPQNTP 337


>gi|312378199|gb|EFR24839.1| hypothetical protein AND_10320 [Anopheles darlingi]
          Length = 639

 Score =  166 bits (419), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 113/327 (34%), Positives = 163/327 (49%), Gaps = 37/327 (11%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  + Y+    V+DGK     +GS HY R+ P+ W   +R  + GGL  ++ YV W+ H 
Sbjct: 23  SFTIDYERDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLRTLRAGGLNAVDLYVQWSLHN 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRT 120
           P  G Y +EG  ++   ++   E  L++ LR GPY CAE + GG P WL +  PGIQ RT
Sbjct: 83  PRDGVYSWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIQVRT 142

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYV--- 177
           ++  +  E+K++  +++  M  E      GGPII+ Q+ENEYG    A+G   + Y+   
Sbjct: 143 SDANYLAEVKKWYGELMSRM--EPYMYGNGGPIIMVQIENEYG----AFGKCDKPYLNFL 196

Query: 178 -----KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCDGFTPN--------S 221
                ++  D AV      P+   + C Q D     I T  G   D              
Sbjct: 197 KEETNRYVQDKAVLFTVDRPYDDEIGCGQIDG--VFITTDFGLMTDEEVDTHAAKVRSYQ 254

Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA 281
           P  P++ TE Y+GW   +  +   RP   LA  + +  + G    ++YMYFGGTNFG  A
Sbjct: 255 PKGPLVNTEFYTGWLTHWQESNQRRPAGPLAATLRKMLKDGWNV-DFYMYFGGTNFGFWA 313

Query: 282 G------GPLVA--TSYDYDAPIDEYG 300
           G      G  +A  TSYDYDAP+DE G
Sbjct: 314 GANDWGLGKYMADITSYDYDAPMDEAG 340


>gi|251795198|ref|YP_003009929.1| beta-galactosidase [Paenibacillus sp. JDR-2]
 gi|247542824|gb|ACS99842.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
          Length = 584

 Score =  166 bits (419), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 105/333 (31%), Positives = 166/333 (49%), Gaps = 39/333 (11%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T   + L+++ +   + +G+IHY R  PE W + + K K  G   +ETYV WN+HEP  
Sbjct: 4   LTIQGKQLMLNDRPFRIIAGAIHYFRVVPEYWRDRLLKLKACGFNTVETYVPWNFHEPEE 63

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ FEG  DL +F+    E GL+  +R  PY CAEW +GG P WL   PG++ R +  P
Sbjct: 64  GRFVFEGMADLEKFIALAGELGLYAIVRPSPYICAEWEFGGLPAWLLKDPGMRLRCSYKP 123

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F ++   +  ++I  +      +++GGP+I  Q+ENEYG    +YG   + Y+ +  +  
Sbjct: 124 FLDKADAYYDELIPRLTP--FLSTKGGPLIAMQIENEYG----SYG-NDKTYLNYLKEAL 176

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDG-----------------FTPNSPSKPIM 227
           V        V+    D P+  +    G   +G                      P +P+M
Sbjct: 177 VKRGVD---VLLFTSDGPEDFM--LQGGMVEGVWETVNFGSRSAEAFAKLQEYQPDQPLM 231

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
             E ++GWF  +G     R   D+A  +      G +  N+YM+ GGTNFG  +G     
Sbjct: 232 CMEFWNGWFDHWGETHHTRGAADVALVLDEMLAAGASV-NFYMFHGGTNFGFFSGANYTD 290

Query: 284 ---PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
              P V TSYDYD+P+ E G + + K+  +RE+
Sbjct: 291 RLLPTV-TSYDYDSPLSESGELTE-KYYAVREV 321


>gi|260813304|ref|XP_002601358.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
 gi|229286653|gb|EEN57370.1| hypothetical protein BRAFLDRAFT_114709 [Branchiostoma floridae]
          Length = 638

 Score =  165 bits (418), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 109/324 (33%), Positives = 161/324 (49%), Gaps = 23/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             +DGK   + SG+IHY R   E W + + K K  GL  +ETYV WN HEP +G++ F G
Sbjct: 18  FTLDGKPVQILSGAIHYFRVPREYWRDRMLKLKACGLNTLETYVCWNLHEPEKGKFDFTG 77

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  +++     GL++  R GPY CAEW+YGG P WL   P +Q RTT  P+ E ++R
Sbjct: 78  MLDIAAYLREAANLGLWVIFRPGPYICAEWDYGGLPSWLLRDPNMQVRTTYQPYMEAVER 137

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           F   ++ ++K       +GGPII  QVENEYG+   +  Y    +  ++      + L +
Sbjct: 138 FFDALLPIVKP--FQYKEGGPIIAMQVENEYGSYARDDKYLTAVKQAIQKRGIEELLLTS 195

Query: 190 SVPWVMCQQEDAPDPIINTCNGFY-----CDGFTPNSPSKPIMWTENYSGWFLSFG---Y 241
               +   +      ++ T N  +             P++P M  E +SGWF  +G   +
Sbjct: 196 DGGQIERLERGCIPGVLMTANFNFNPKKQLGALKKLQPNRPQMVMEFWSGWFDHWGRDHH 255

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAP 295
            +     E L   + RF  +     N+YM+ GGTNFG   G   +       TSYDYDAP
Sbjct: 256 KLHVEKFEQLLGDILRFPSS----VNFYMFHGGTNFGFMNGANYINGYKPDVTSYDYDAP 311

Query: 296 IDEYGFIRQPKWGHLRELHKAIKL 319
           + E G    PK+   REL K + +
Sbjct: 312 LSEAG-DPTPKYYKTRELLKTLAM 334


>gi|398787680|ref|ZP_10550020.1| beta-galactosidase [Streptomyces auratus AGR0001]
 gi|396992782|gb|EJJ03876.1| beta-galactosidase [Streptomyces auratus AGR0001]
          Length = 603

 Score =  165 bits (418), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 110/339 (32%), Positives = 164/339 (48%), Gaps = 36/339 (10%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T   +  ++DGK   + SG+ HY R+ P+ W + + + +  GL  +ETYV WN+H+P  
Sbjct: 27  LTIRGKEFLLDGKPFRILSGAFHYFRTHPQDWRDRLMRMRAMGLNTVETYVAWNFHQPDE 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
            +  F G  D+V FV+T  E GL + +R GPY CAEW++GG P WL        R ++  
Sbjct: 87  KEADFTGWRDVVAFVRTADEVGLKVIVRPGPYICAEWDFGGLPAWLLKDKDAPLRRSDPA 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------------NVEWAYGV 171
           F+  +  + A++  L +  +L A++GGPII  QVENEYG             +   A G+
Sbjct: 147 FERAVDAWFAEL--LPRFVDLQATRGGPIIAMQVENEYGSYGDDHAYLEHLRDTMRAQGI 204

Query: 172 GGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTEN 231
            G L+    A        S+P ++       DP      G + +      P KP+  TE 
Sbjct: 205 DGLLFCSNGATQEALKAGSLPDLLSTVNFGGDP-----TGPFAE-LRAFQPDKPLFCTEF 258

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------ 285
           + GWF  +G           A  V +  E G +  N+YM  GGTNFG +AG  L      
Sbjct: 259 WDGWFDHWGERHRTTDPAQTAADVEKMLEAGASI-NFYMAVGGTNFGWSAGANLSGSGYQ 317

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEY 323
              TSYDYD+PI E G + +       + HK   +  +Y
Sbjct: 318 PTVTSYDYDSPISESGELTE-------KFHKVRDVLGKY 349


>gi|260912222|ref|ZP_05918774.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
 gi|260633656|gb|EEX51794.1| beta-galactosidase [Prevotella sp. oral taxon 472 str. F0295]
          Length = 627

 Score =  165 bits (418), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 113/337 (33%), Positives = 160/337 (47%), Gaps = 50/337 (14%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE- 70
            V +GK   L SG +HY R     W   ++  K  GL  + TYVFWNYHE   G++ ++ 
Sbjct: 42  FVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKT 101

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  +L +FVKT  E G+ + LR GPY CAEW++GG+P WL    G+  R  N PF +  +
Sbjct: 102 GNRNLRQFVKTAAEEGMLVILRPGPYCCAEWDFGGYPWWLSKAKGLVIRADNQPFLDSCR 161

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            ++ ++   M+  +L  ++GGPII+ Q ENE+G+           YV    D  +  + +
Sbjct: 162 VYINQLASQMR--DLQITKGGPIIMVQAENEFGS-----------YVAQRKDVPLESHRA 208

Query: 191 VPWVMCQQ--EDAPDPIINTCNGFY------CDGFTP------------------NSPSK 224
               + QQ  +   D  + T +G +       +G  P                  N    
Sbjct: 209 YSAKIKQQLIDAGFDVPLFTSDGSWLFKGGTIEGALPTANGENDIEKLKKVVNEYNGGKG 268

Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
           P M  E Y GW   +    P    E +    A++ E G +F NYYM  GGTNFG T+G  
Sbjct: 269 PYMVAEFYPGWLSHWAEPFPQVSTESIVKQTAKYLENGVSF-NYYMVHGGTNFGFTSGAN 327

Query: 285 LVA--------TSYDYDAPIDEYGFIRQPKWGHLREL 313
                      TSYDYDAPI E G+   PK+  LR L
Sbjct: 328 YTTATNLQSDLTSYDYDAPISEAGW-NTPKYDALRAL 363



 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 60/222 (27%), Positives = 94/222 (42%), Gaps = 55/222 (24%)

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNYG 532
           + I  L   ALV+VN + V    G  D  + +    IE+N   N  LDIL   +G  NYG
Sbjct: 437 MKIAGLADYALVYVNGQKV----GELDRVSDV--DSIEINMPFNGVLDILVENMGRINYG 490

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGST 591
           A    +  G+   ++ID      +  +G W +Y++ +  E   ++ +  AN         
Sbjct: 491 ARIPQSIKGINGPVVID-----GNEITGNWQMYKLPMN-EAPDVNALPTAN--------- 535

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
              NK L    +     +  G   LN+ + GKG  ++NG ++GRYW              
Sbjct: 536 ---NKGLPTLYSGTFNLDTTGDTFLNMETWGKGIVFINGFNLGRYWK------------- 579

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
             RG             P QTLY +P  ++  GEN +V+ E+
Sbjct: 580 --RG-------------PQQTLY-LPGCFLKKGENKIVVFEQ 605


>gi|158301280|ref|XP_550752.3| AGAP002055-PA [Anopheles gambiae str. PEST]
 gi|157012394|gb|EAL38488.3| AGAP002055-PA [Anopheles gambiae str. PEST]
          Length = 657

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/327 (34%), Positives = 163/327 (49%), Gaps = 37/327 (11%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  + Y+    V+DGK     +GS HY R+ PE W   +R  + GGL  ++ YV W+ H 
Sbjct: 42  SFKIDYERDTFVMDGKDFRYVAGSFHYFRALPETWRTKLRTLRAGGLNAVDLYVQWSLHN 101

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRT 120
           P  G Y +EG  ++   ++   E  L++ LR GPY CAE + GG P WL +  PGI  RT
Sbjct: 102 PRDGVYNWEGIANVTDIIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIAVRT 161

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYV--- 177
           ++  + EE++++  +++  M  E      GGPII+ Q+ENEYG    A+G   + Y+   
Sbjct: 162 SDANYLEEVRKWYGELMSRM--EPYMYGNGGPIIMVQIENEYG----AFGKCDKPYLNFL 215

Query: 178 -----KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCDGFTPN--------S 221
                ++  D AV      P+   + C Q D     I T  G   +              
Sbjct: 216 KQQTERYVQDKAVLFTVDRPYDDEIGCGQIDG--VFITTDFGLMTEEEVDTHAAKVRSYQ 273

Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA 281
           P  P++ TE Y+GW   +  +   RP + LA  + +    G    ++YMYFGGTNFG  A
Sbjct: 274 PKGPLVNTEFYTGWLTHWQESNQRRPAQPLAATLRKMLRDGWNV-DFYMYFGGTNFGFWA 332

Query: 282 G------GPLVA--TSYDYDAPIDEYG 300
           G      G  +A  TSYDYDAP+DE G
Sbjct: 333 GANDWGLGKYMADITSYDYDAPMDEAG 359


>gi|254384398|ref|ZP_04999740.1| beta-galactosidase [Streptomyces sp. Mg1]
 gi|194343285|gb|EDX24251.1| beta-galactosidase [Streptomyces sp. Mg1]
          Length = 588

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 105/304 (34%), Positives = 151/304 (49%), Gaps = 26/304 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DG+   + SG +HY R  P +W + + K++  GL  +ETYV WN H+P   ++  +G  
Sbjct: 18  LDGEPFRILSGGLHYFRVHPGLWRDRLHKARLMGLNTVETYVPWNLHQPRPDEFRMDGGL 77

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF+      GL + LR GPY CAEW  GG P WL   P ++ R+ +  F   +  + 
Sbjct: 78  DLPRFLDLAAAEGLHVLLRPGPYICAEWEGGGLPSWLLADPAMRLRSRDPNFLAAVDDYF 137

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            +++  +   +  AS+GGP++  QVENEYG    AYG     Y++  AD+       VP 
Sbjct: 138 RRLLPPL--HDRLASRGGPVLAVQVENEYG----AYG-DDTAYLEHLADSLRRHGVDVPL 190

Query: 194 VMCQQ-----EDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
             C Q       A   ++ T N       +        PS P++ TE + GWF  +G   
Sbjct: 191 FTCDQPADLERGALAGVLATANFGSRPAAHLATLRTARPSAPLLCTEFWIGWFDRWGGNH 250

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDYDAPI 296
             R  E  +  +     TG +  N+YM+ GGTNFG   G        P V TSYDYDAP+
Sbjct: 251 VVRDAEQASQELDELLATGASV-NFYMFHGGTNFGFMNGANDKHTYRPTV-TSYDYDAPL 308

Query: 297 DEYG 300
           DE G
Sbjct: 309 DEAG 312


>gi|196002910|ref|XP_002111322.1| hypothetical protein TRIADDRAFT_1215 [Trichoplax adhaerens]
 gi|190585221|gb|EDV25289.1| hypothetical protein TRIADDRAFT_1215, partial [Trichoplax
           adhaerens]
          Length = 543

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 164/318 (51%), Gaps = 40/318 (12%)

Query: 21  LQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVK 80
           ++SG+IHY R  PE W + + K K  GL  +ETYV WN HEP+ GQ+ + G  ++ +F+ 
Sbjct: 13  IRSGAIHYFRVVPEYWRDRLLKMKAFGLNTVETYVPWNLHEPVPGQFDYTGILNVRKFIL 72

Query: 81  TVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLM 140
             QE G ++ LR GPY CAEW +GG P WL     +Q R+T  PFK+ + RF    I  +
Sbjct: 73  LAQELGFYVILRPGPYICAEWEFGGMPSWLLSDKNMQVRSTYKPFKDAVNRFFDGFIPEI 132

Query: 141 KQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------LNTSVPWV 194
           K  +L AS+GGPII  QVENEYG    +YG   E Y+++  D  +N      L TS    
Sbjct: 133 K--SLQASKGGPIIAVQVENEYG----SYG-SDEEYMQFIRDALINRGIVELLVTSDNSE 185

Query: 195 MCQQEDAPDPIINTCNGFYCDGFTPNSPS-------KPIMWTENYSGWFLSFGYAVPFRP 247
             +   AP  ++ T N     G   +  S        P +  E +SGWF  +G       
Sbjct: 186 GIKHGGAPG-VLKTYN---FQGHAKSHLSILERLQDAPSIVMEFWSGWFDHWGEKN--HQ 239

Query: 248 VEDLAFAVARF---FETGGTFQNYYMYFGGTNFGRTAGGPLV---------ATSYDYDAP 295
           V  +A     F    +   +F N+Y++ GGTNFG   G   +          TSYDYDAP
Sbjct: 240 VHTIAHVTNTFKDILDCDASF-NFYVFHGGTNFGFMNGANFIDFFSYYLPTVTSYDYDAP 298

Query: 296 IDEYGFIRQPKWGHLREL 313
           + E G I + K+  LR++
Sbjct: 299 LSEAGDITE-KYMELRKI 315


>gi|296081427|emb|CBI16778.3| unnamed protein product [Vitis vinifera]
          Length = 242

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 77/124 (62%), Positives = 90/124 (72%), Gaps = 4/124 (3%)

Query: 206 INTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
           INTCN FYCD FTPNSP+KP MWTEN+ GW  +FG   P  P ED+ F+VARFF      
Sbjct: 120 INTCNSFYCDQFTPNSPNKPKMWTENWPGWSKTFGALDPHGPREDIVFSVARFFWK---- 175

Query: 266 QNYYMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
            NYYM  GGTNFGRT+GGP + T+YDY+APIDEYG  R PK GHL+EL +AIK CE  L+
Sbjct: 176 VNYYMDHGGTNFGRTSGGPFITTTYDYNAPIDEYGLARLPKCGHLKELRRAIKSCEHVLL 235

Query: 326 SSDP 329
             +P
Sbjct: 236 YGEP 239


>gi|386725149|ref|YP_006191475.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
 gi|384092274|gb|AFH63710.1| beta-galactosidase [Paenibacillus mucilaginosus K02]
          Length = 591

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 108/305 (35%), Positives = 150/305 (49%), Gaps = 28/305 (9%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DG+   L SG+IHY R  PE W + +RK K  G   +ETYV WN HEP  G++ FEG  D
Sbjct: 14  DGEELRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L RF++     GL + +R  PY CAEW +GG P WL   PG++ R  +  +  ++  +  
Sbjct: 74  LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133

Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW- 193
           ++I  +    L  + GGP+IL QVENEYG    +YG   + Y++   D  V     VP  
Sbjct: 134 ELIPRLVP--LLCTSGGPVILVQVENEYG----SYG-SDKAYLEHLRDGLVRRGIDVPLF 186

Query: 194 -------VMCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGY 241
                   M Q    P  ++ T N      + F       P  P+M  E ++GWF  +  
Sbjct: 187 TSDGPTDAMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDYDAP 295
               R   D A       E G +  N+YM+ GGTNFG   G   +       TSYDYD+P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSP 304

Query: 296 IDEYG 300
           + E+G
Sbjct: 305 LTEWG 309


>gi|429739263|ref|ZP_19273023.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
 gi|429157228|gb|EKX99829.1| glycosyl hydrolase family 35 [Prevotella saccharolytica F0055]
          Length = 786

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 114/324 (35%), Positives = 161/324 (49%), Gaps = 27/324 (8%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++GK  V+++  +HYPR     W   IR  K  G+  I  YVFWN HE   G++ F
Sbjct: 35  KTFLLNGKPFVIKAAELHYPRIPRPYWEHRIRMCKALGMNTICLYVFWNIHEQQEGKFNF 94

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G  D+  F +  Q+ GL++ +R GPY CAEW  GG P WL     I+ R  +  F E +
Sbjct: 95  TGNNDVAAFCRLAQKHGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRERDPYFMERV 154

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           K F  ++ + +    L   +GGPII+ QVENEYG    +YGV  E YV    D   +   
Sbjct: 155 KVFEQQVGNQLAP--LTIDKGGPIIMVQVENEYG----SYGVDKE-YVSQIRDIVRSSGF 207

Query: 190 S------VPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWF 236
                    W    +++  D +I T N   G   D          P  P M +E +SGWF
Sbjct: 208 DKVALFQCDWASNFEKNGLDDLIWTMNFGTGANIDEQFKRLGELRPQSPKMCSEFWSGWF 267

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
             +G     RP +++   +      G +F + YM  GGT+FG  AG   P  A   TSYD
Sbjct: 268 DKWGARHETRPAKNMVAGIDEMLTKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 326

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAPI+EYG +  PK+  LR + +
Sbjct: 327 YDAPINEYG-LATPKYYELRAMMQ 349


>gi|347967091|ref|XP_001689312.2| AGAP002056-PA [Anopheles gambiae str. PEST]
 gi|333469762|gb|EDO63217.2| AGAP002056-PA [Anopheles gambiae str. PEST]
          Length = 629

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 112/334 (33%), Positives = 157/334 (47%), Gaps = 33/334 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           ++ YD+   V+DGK     +GS HY R+ PE WP ++R  +  GL  I TYV W+ H P 
Sbjct: 27  SIDYDNDTFVMDGKPFQYVAGSFHYFRALPESWPSILRSMRAAGLNAITTYVEWSLHNPK 86

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
              Y ++G  D+  F++    AGL++ LR GPY CAE + GGFP W LH  P I  RT +
Sbjct: 87  EDVYNWQGMADIEHFLELADSAGLYVILRPGPYICAERDMGGFPSWLLHKYPDILLRTND 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             +  E++ + A++  L + +     QGGPII+ QVENEYG    ++      Y+ W  D
Sbjct: 147 LRYLREVRTWYAQL--LSRVQRFLVGQGGPIIMVQVENEYG----SFYACDHKYLNWLRD 200

Query: 183 --------TAVNLNTSVPWV--------MCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
                    AV    + P +        +    D      +  NGF+        P  P+
Sbjct: 201 ETERYVMGNAVLFTNNGPGLEGCGAIEHVLSSLDFGPGTEDEINGFWST-LRKTQPKGPL 259

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV 286
           +  E Y GW   +      R           F        N YM+FGGTN+G TAG   +
Sbjct: 260 VNAEYYPGWLTHWQEPHMARTDTKPVVDSLDFMLRNKVNVNIYMFFGGTNYGFTAGANNM 319

Query: 287 A--------TSYDYDAPIDEYGFIRQPKWGHLRE 312
                    TSYDYDAP+DE G    PK+  LR+
Sbjct: 320 GAGGYAADLTSYDYDAPLDESG-DPTPKYFALRD 352


>gi|224027078|ref|ZP_03645444.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
           18228]
 gi|224020314|gb|EEF78312.1| hypothetical protein BACCOPRO_03839 [Bacteroides coprophilus DSM
           18228]
          Length = 783

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 113/324 (34%), Positives = 159/324 (49%), Gaps = 29/324 (8%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           ++  +++GK  ++++  IHY R   E W   I   K  G+  I  Y FWN HE   G++ 
Sbjct: 37  NKEFLLNGKPFLIKAAEIHYTRIPAEYWEHRIEMCKALGMNTICIYAFWNIHEQRPGEFD 96

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           FEG+ D+ RF +  Q+ G+++ LR GPY C+EW  GG P WL     I  RT++  F E 
Sbjct: 97  FEGQNDVARFCRLAQKHGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIALRTSDPYFLER 156

Query: 129 MKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
            K F+    +L KQ  +L A +GG II+ QVENEYG    AY    E Y+    D     
Sbjct: 157 TKIFMN---ELGKQLADLQAPRGGNIIMVQVENEYG----AYAEDKE-YIASIRDIVRGA 208

Query: 188 N-TSVPWVMCQ-----QEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSG 234
             T VP   C      Q +  D ++ T N G   D            P  P+M +E +SG
Sbjct: 209 GFTDVPLFQCDWASTFQRNGLDDLLWTINFGTGADIDQQFKALREARPETPLMCSEYWSG 268

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATS 289
           WF  +G     RP + +   +    +   +F + YM  GGT FG   G        + +S
Sbjct: 269 WFDHWGRKHETRPADVMVKGIKDMMDRNISF-SLYMTHGGTTFGHWGGANSPSYSAMCSS 327

Query: 290 YDYDAPIDEYGFIRQPKWGHLREL 313
           YDYDAPI E G+   PK+  LR+L
Sbjct: 328 YDYDAPISEAGWA-TPKYYQLRDL 350



 Score = 48.1 bits (113), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 30/107 (28%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +Y+TTF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 536 YYRTTFTL-DKTGDTFLDMSTWGKGMVWVNGHAMGRFWKI-------------------- 574

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                    P QTL+ +P  W+  G+N +V+ + LG D +KI  L +
Sbjct: 575 --------GPQQTLF-MPGCWLKKGKNEIVVLDLLGPDETKIEGLKQ 612


>gi|156552637|ref|XP_001603160.1| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
          Length = 629

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 113/330 (34%), Positives = 167/330 (50%), Gaps = 22/330 (6%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  + Y++   ++DGK     SGS HY R+  + W  ++RK + GGL  + TYV W+ HE
Sbjct: 30  SFAIDYENDQFLLDGKPFRYVSGSFHYFRTPRQHWRGILRKMRAGGLNAVSTYVEWSMHE 89

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRT 120
           P   Q+ ++G  D+V F+K  QE  LF+ LR GPY CAE ++GGFP W L  +P I+ RT
Sbjct: 90  PEFDQWVWDGDADIVEFIKIAQEEDLFVILRPGPYICAERDFGGFPYWLLSRVPDIKLRT 149

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV----EWAYGVGGELY 176
            +  +    +RFL +I  L + + L    GGPII+ QVENEYG+     +       E++
Sbjct: 150 KDERYVFYAERFLNEI--LRRTKPLLRGNGGPIIMVQVENEYGSFYACDDQYKSKMYEIF 207

Query: 177 VKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNG----FYCDGFTPNSPSKPIMWT 229
            +   + AV   T   +   + C         I+  NG    F        SP  P++ +
Sbjct: 208 HRHVKNDAVLFTTDGSARSMLKCGSIPGVYATIDFGNGANVPFNYKIMREFSPKGPLVNS 267

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL---- 285
           E Y GW   +G +       ++A  +        +  N YMY+GGTNF  T+G  +    
Sbjct: 268 EYYPGWLTHWGESFQRVNSHNVAKTLDEMLAYNVSV-NIYMYYGGTNFAFTSGANINEHY 326

Query: 286 --VATSYDYDAPIDEYGFIRQPKWGHLREL 313
               TSYDYDAP+ E G    PK+  LR++
Sbjct: 327 WPQLTSYDYDAPLTEAG-DPTPKYFELRDV 355


>gi|300789308|ref|YP_003769599.1| beta-galactosidase [Amycolatopsis mediterranei U32]
 gi|384152800|ref|YP_005535616.1| beta-galactosidase [Amycolatopsis mediterranei S699]
 gi|399541188|ref|YP_006553850.1| beta-galactosidase [Amycolatopsis mediterranei S699]
 gi|299798822|gb|ADJ49197.1| beta-galactosidase [Amycolatopsis mediterranei U32]
 gi|340530954|gb|AEK46159.1| beta-galactosidase [Amycolatopsis mediterranei S699]
 gi|398321958|gb|AFO80905.1| beta-galactosidase [Amycolatopsis mediterranei S699]
          Length = 584

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 106/321 (33%), Positives = 162/321 (50%), Gaps = 29/321 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   + SG++HY R  P++W + I K++  GL  IETYV WN H P  G +   G
Sbjct: 11  FLLDGRPFRILSGALHYFRVHPDLWADRIDKARRMGLNTIETYVAWNAHAPEPGTFDLSG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF++ V +AG++  +R GPY CAEW+ GG P WL   P +  R     + + ++ 
Sbjct: 71  GLDLDRFLRLVADAGMYAIVRPGPYICAEWDNGGLPAWLFRDPSVGVRRYEPKYLDAVRE 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L K+ +++    +   +GGP++L QVENEYG    A+G   + Y+K  A+       +V
Sbjct: 131 YLTKVYEVVVPHQI--DRGGPVLLVQVENEYG----AFG-DDKRYLKALAEHTREAGVTV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSF 239
           P     Q         + +G +                  + P+ P+M +E ++GWF  +
Sbjct: 184 PLTTVDQPTPEMLEAGSLDGLHRTASFGSGAEARLAILRAHQPTGPLMCSEFWNGWFDHW 243

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDY 292
           G         D A  +      G +  N YM+ GGTNFG T G        PL+ TSYDY
Sbjct: 244 GAHHHTTSAADSAAELDALLAAGASV-NLYMFHGGTNFGLTNGANDKGVYQPLI-TSYDY 301

Query: 293 DAPIDEYGFIRQPKWGHLREL 313
           DAP+DE G    PK+   R++
Sbjct: 302 DAPLDEAG-DPTPKYHAFRDV 321


>gi|329960238|ref|ZP_08298680.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
 gi|328532911|gb|EGF59688.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
          Length = 778

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/323 (34%), Positives = 157/323 (48%), Gaps = 27/323 (8%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           ++  ++DGK  ++++  +HY R   E W   I+  K  G+  I  Y FWN HE   G++ 
Sbjct: 35  NQTFLLDGKPFIIKAAEMHYTRIPAEYWEHRIQMCKALGMNTICIYAFWNIHEQRPGEFD 94

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F+G+ D+  F +  Q+ G+++ LR GPY C+EW  GG P WL     IQ RT +  F E 
Sbjct: 95  FKGQNDIAEFCRLAQKNGMYIMLRPGPYVCSEWEMGGLPWWLLKKKDIQLRTNDPYFLER 154

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
            K F+ +I   +   +L A +GG II+ QVENEYG     Y V  E Y+    D      
Sbjct: 155 TKLFMNEIGKQLA--DLQAPRGGNIIMVQVENEYG----GYAVNKE-YIANVRDIVRGAG 207

Query: 189 -TSVPWVMCQ-----QEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGW 235
            T VP   C      Q +  D ++ T N   G   D          P  P+M +E +SGW
Sbjct: 208 FTDVPLFQCDWSSTFQLNGLDDLLWTINFGTGANIDAQFKSLKEARPDAPLMCSEFWSGW 267

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
           F  +G     R  E +   +    +   +F + YM  GGT FG   G        + +SY
Sbjct: 268 FDHWGRKHETRDAETMVSGLKDMLDRNISF-SLYMAHGGTTFGHWGGANCPPYSAMCSSY 326

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAPI E G+   PK+  LRE+
Sbjct: 327 DYDAPISEAGWA-TPKYYKLREM 348



 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/151 (26%), Positives = 63/151 (41%), Gaps = 35/151 (23%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +Y+ +F   E  G + L++ + GKG  WVNG++IGR+W                      
Sbjct: 532 YYRASFNLKE-TGDVFLDMQTWGKGMVWVNGKAIGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
                    P QTLY +P  W+  G+N +V+ + LG D ++I  L    Q I   +   +
Sbjct: 571 --------GPQQTLY-MPGCWLKKGKNEIVVLDLLGPDKAEIKGLK---QPILDMLRSEE 618

Query: 720 PPPVDSWKPNLGVVSSSPQV--RLACERGWH 748
           P        NL + +  P     +A   GW 
Sbjct: 619 PLTHRKEGENLNLKNEKPVAAGEMAAGNGWQ 649


>gi|379722393|ref|YP_005314524.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
 gi|378571065|gb|AFC31375.1| beta-galactosidase [Paenibacillus mucilaginosus 3016]
          Length = 591

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 108/305 (35%), Positives = 150/305 (49%), Gaps = 28/305 (9%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DG+   L SG+IHY R  PE W + +RK K  G   +ETYV WN HEP  G++ FEG  D
Sbjct: 14  DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L RF++     GL + +R  PY CAEW +GG P WL   PG++ R  +  +  ++  +  
Sbjct: 74  LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133

Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWV 194
           ++I  +    L  + GGP+IL QVENEYG    +YG   + Y++   D  V     VP  
Sbjct: 134 ELIPRLVP--LLCTSGGPVILVQVENEYG----SYG-SDKAYLEHLRDGLVRRGIDVPLF 186

Query: 195 --------MCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGY 241
                   M Q    P  ++ T N      + F       P  P+M  E ++GWF  +  
Sbjct: 187 TSDGPTDSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDYDAP 295
               R   D A       E G +  N+YM+ GGTNFG   G   +       TSYDYD+P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFHNGANHIKTYEPTITSYDYDSP 304

Query: 296 IDEYG 300
           + E+G
Sbjct: 305 LTEWG 309


>gi|288928311|ref|ZP_06422158.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
           F0108]
 gi|288331145|gb|EFC69729.1| beta-galactosidase (Lactase) [Prevotella sp. oral taxon 317 str.
           F0108]
          Length = 674

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 113/337 (33%), Positives = 159/337 (47%), Gaps = 50/337 (14%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE- 70
            V +GK   L SG +HY R     W   ++  K  GL  + TYVFWNYHE   G++ ++ 
Sbjct: 89  FVYNGKPMQLHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYVFWNYHETEPGKWDWKT 148

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  +L +FVKT  E G+ + LR GPY CAEW +GG+P WL    G+  R  N PF +  +
Sbjct: 149 GNRNLRQFVKTAAEEGMLVILRPGPYCCAEWEFGGYPWWLSKAKGLVIRADNQPFLDSCR 208

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            ++ ++   M+  +L  ++GGPII+ Q ENE+G+           YV    D  +  + +
Sbjct: 209 VYINQLASQMR--DLQITKGGPIIMVQAENEFGS-----------YVAQRKDIPLETHRA 255

Query: 191 VPWVMCQQ--EDAPDPIINTCNGFY------CDGFTP------------------NSPSK 224
               + QQ  +   D  + T +G +       +G  P                  N    
Sbjct: 256 YSAKIKQQLLDAGFDVPLFTSDGSWLFKGGTIEGALPTANGESDIEKLKKVVNEYNGGKG 315

Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
           P M  E Y GW   +    P    E +    A++ E G +F NYYM  GGTNFG T+G  
Sbjct: 316 PYMVAEFYPGWLSHWAEPFPQVSTESIVKQTAKYLENGISF-NYYMVHGGTNFGFTSGAN 374

Query: 285 LVA--------TSYDYDAPIDEYGFIRQPKWGHLREL 313
                      TSYDYDAPI E G+   PK+  LR L
Sbjct: 375 YTTATNLQPDLTSYDYDAPISEAGW-NTPKYDALRAL 410



 Score = 47.0 bits (110), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 62/229 (27%), Positives = 95/229 (41%), Gaps = 55/229 (24%)

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNY 531
            L +  L   ALV+VN + V    G  D  + +    IE+N   N  LDIL   +G  NY
Sbjct: 483 MLKVAGLADYALVYVNGQKV----GELDRVSDV--DSIEINVPFNGVLDILVENMGRINY 536

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS 590
           GA    +  G+   ++ID      +  +G W +Y++ +  E   ++ +  AN        
Sbjct: 537 GARITQSIKGINGPVVID-----GNEITGNWQMYKLPMN-EVPDVNALPTAN-------- 582

Query: 591 TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKK 650
               NK L    +     +  G   LN+ + GKG  +VNG ++GRYW             
Sbjct: 583 ----NKGLPTLYSGTFNLDTTGDTFLNMETWGKGIVFVNGINLGRYWK------------ 626

Query: 651 CDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPS 699
              RG             P QTLY +P  ++  GEN +V+ E+    P 
Sbjct: 627 ---RG-------------PQQTLY-LPGCFLKKGENKIVVFEQQNDTPQ 658


>gi|156376589|ref|XP_001630442.1| predicted protein [Nematostella vectensis]
 gi|156217463|gb|EDO38379.1| predicted protein [Nematostella vectensis]
          Length = 570

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 105/302 (34%), Positives = 145/302 (48%), Gaps = 25/302 (8%)

Query: 33  PEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLR 92
           PE W + + K K  GL  +ETYV WN HE ++  + F+   D+V+FVK  Q  GL++ +R
Sbjct: 2   PEYWKDRLVKLKAMGLNTVETYVAWNLHEQVQDNFKFKDELDIVKFVKLAQRLGLYVIIR 61

Query: 93  IGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGP 152
            GPY CAEW+ GG P WL   P ++ RT+  PF E + R+  K+  L+    L   QGGP
Sbjct: 62  PGPYICAEWDLGGLPSWLLSDPEMKLRTSYGPFMEAVDRYFQKLFPLLTP--LQYCQGGP 119

Query: 153 IILAQVENEYGNVEWAYGVG-GELYVKWAADTAVNLNTSVPWVMCQQEDAP-DPIINTCN 210
           II  Q+ENEY + +    +   EL  K      V     +   +   +  P + ++ T N
Sbjct: 120 IIAWQIENEYSSFDKKVDMTYMELLQKMMVKNGVTEMLLMSDNLFSMKTHPINLVLKTIN 179

Query: 211 -----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTF 265
                           P KP+M TE + GWF  +G      P E L   +   F  G + 
Sbjct: 180 LQKNVKDALLQLKEIQPDKPLMVTEFWPGWFDVWGAKHHILPTEKLIKEIKDLFSLGASI 239

Query: 266 QNYYMYFGGTNFGRTAGGPLVA--------------TSYDYDAPIDEYGFIRQPKWGHLR 311
            N+YM+ GGTNFG   G                   TSYDYDAP+ E G I  PK+  LR
Sbjct: 240 -NFYMFHGGTNFGFMNGASFTPSGVSVLEGDYQPDITSYDYDAPLSESGDI-TPKYKALR 297

Query: 312 EL 313
           + 
Sbjct: 298 KF 299


>gi|337749468|ref|YP_004643630.1| beta-galactosidase [Paenibacillus mucilaginosus KNP414]
 gi|336300657|gb|AEI43760.1| Beta-galactosidase [Paenibacillus mucilaginosus KNP414]
          Length = 591

 Score =  163 bits (413), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 108/305 (35%), Positives = 150/305 (49%), Gaps = 28/305 (9%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DG+   L SG+IHY R  PE W + +RK K  G   +ETYV WN HEP  G++ FEG  D
Sbjct: 14  DGEEIRLYSGAIHYFRIVPEYWEDRLRKLKACGFNTVETYVPWNLHEPQEGRFVFEGMAD 73

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L RF++     GL + +R  PY CAEW +GG P WL   PG++ R  +  +  ++  +  
Sbjct: 74  LERFIRLAGRLGLHVIVRPSPYICAEWEFGGLPAWLLAEPGMKLRCADPLYLSKVDAYYD 133

Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWV 194
           ++I  +    L  + GGP+IL QVENEYG    +YG   + Y++   D  V     VP  
Sbjct: 134 ELIPRLVP--LLCTSGGPVILVQVENEYG----SYG-SDKAYLEHLRDGLVRRGIDVPLF 186

Query: 195 --------MCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGY 241
                   M Q    P  ++ T N      + F       P  P+M  E ++GWF  +  
Sbjct: 187 TSDGPTDSMLQGGSLPG-VLATVNFGSRTAESFAKLREYQPQGPLMCMEYWNGWFDHWME 245

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDYDAP 295
               R   D A       E G +  N+YM+ GGTNFG   G   +       TSYDYD+P
Sbjct: 246 EHHQRDAADAARVFGEMLEAGASV-NFYMFHGGTNFGFYNGANHIKTYEPTITSYDYDSP 304

Query: 296 IDEYG 300
           + E+G
Sbjct: 305 LTEWG 309


>gi|156375241|ref|XP_001629990.1| predicted protein [Nematostella vectensis]
 gi|156217002|gb|EDO37927.1| predicted protein [Nematostella vectensis]
          Length = 578

 Score =  163 bits (413), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 107/297 (36%), Positives = 152/297 (51%), Gaps = 20/297 (6%)

Query: 33  PEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLR 92
           PE W + ++K K  GL  +ETYV WN HE ++  + F+   D+V+FV   QE GL + +R
Sbjct: 2   PEYWADRLKKLKAMGLNTVETYVAWNLHEQVKENFKFKDEVDIVKFVNLAQELGLHVIIR 61

Query: 93  IGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGP 152
            GPY C+EW+ GG P WL   P ++ R+T  PF E ++++ +K+  L+    L  S+GGP
Sbjct: 62  PGPYICSEWDLGGLPSWLLNDPNMRLRSTYGPFMEAVEKYFSKLFALLTP--LQFSRGGP 119

Query: 153 IILAQVENEYGNVEWAYG-----VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIIN 207
           II  QVENEY +V+         +  +L +K  A   +  +  V +              
Sbjct: 120 IIAWQVENEYASVQEEVDNHYMELLHKLMLKNGATELLFTSDDVGYTKRYPIKLDGGKYM 179

Query: 208 TCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQN 267
           + N ++C  F    P KPIM TE +SGWF  +G        E       +     G   N
Sbjct: 180 SFNKWFC-LFLHFQPDKPIMVTEYWSGWFDHWGEKHHVLNTERKMINEVKDILDMGASIN 238

Query: 268 YYMYFGGTNFG-----RTAGGPL------VATSYDYDAPIDEYGFIRQPKWGHLREL 313
           +YM+ GGTNFG      TAG  +        TSYDYDAP+ E G I  PK+  LR+L
Sbjct: 239 FYMFHGGTNFGFMNGANTAGNRIDDGYQPDVTSYDYDAPLSEAGDI-TPKYKALRKL 294


>gi|157106611|ref|XP_001649403.1| beta-galactosidase [Aedes aegypti]
 gi|108879822|gb|EAT44047.1| AAEL004580-PA [Aedes aegypti]
          Length = 656

 Score =  163 bits (413), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 114/328 (34%), Positives = 162/328 (49%), Gaps = 39/328 (11%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  + YD    V+DGK     +GS HY R+ P+ W   ++  + GGL  ++ YV W+ H 
Sbjct: 42  SFTIDYDRDTFVMDGKDFRYVAGSFHYFRALPQTWRTKLKTLRAGGLNAVDLYVQWSLHN 101

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRT 120
           P   QY ++G  ++   ++   EA L++ LR GPY CAE + GG P WL    PGIQ RT
Sbjct: 102 PKENQYVWDGIANIKDVIEAAIEADLYVILRPGPYICAEIDNGGLPYWLFTKYPGIQVRT 161

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFA-SQGGPIILAQVENEYGNVEWAYGVGGELYV-- 177
           ++  + +E+  +  K   LM Q   +    GGPII+ Q+ENEYG    A+G   + Y+  
Sbjct: 162 SDANYLKEVATWYEK---LMSQLTPYMYGNGGPIIMVQLENEYG----AFGKCDKPYLNF 214

Query: 178 ------KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCD--------GFTPN 220
                 K+    AV      P+   + C Q   P   + T  G   D             
Sbjct: 215 LKEETEKYTQGKAVLFTVDRPYGNEMECGQ--VPGVFVTTDFGLMTDEEVDTHKAKLRSV 272

Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
            P+ P++ TE Y+GW   +  +   RP E LA  + +    G    ++YMYFGGTNFG  
Sbjct: 273 QPNGPLVNTEFYTGWLTHWQESNQRRPAEPLANTLRKMLHDGWNV-DFYMYFGGTNFGFW 331

Query: 281 AG------GPLVA--TSYDYDAPIDEYG 300
           AG      G  +A  TSYDYDAP+DE G
Sbjct: 332 AGANDWGLGKYMADITSYDYDAPMDEAG 359


>gi|325297293|ref|YP_004257210.1| glycoside hydrolase family protein [Bacteroides salanitronis DSM
           18170]
 gi|324316846|gb|ADY34737.1| glycoside hydrolase family 35 [Bacteroides salanitronis DSM 18170]
          Length = 784

 Score =  163 bits (412), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 126/374 (33%), Positives = 178/374 (47%), Gaps = 34/374 (9%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++G+  V+++  +HYPR     W   I++ K  G+  I  YVFWN+HE   G++ F 
Sbjct: 39  TFLLNGEPFVVKAAELHYPRIPRAYWEHRIKQCKALGMNTICLYVFWNFHEEKPGEFDFT 98

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ DL  F +  Q+  +++ LR GPY CAEW  GG P WL     I+ R  +  F E + 
Sbjct: 99  GQKDLAEFCRLCQKNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREDDPYFLERVA 158

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            F  ++ + +    L   +GGPII+ QVENEYG    +YG   E YV    D        
Sbjct: 159 IFEKEVANQVA--GLTIQKGGPIIMVQVENEYG----SYGESKE-YVAKIRDIVRGNFGD 211

Query: 191 VPWVMCQ-----QEDAPDPIINTCN---GFYCD-GFTP---NSPSKPIMWTENYSGWFLS 238
           V    C      Q +A D ++ T N   G   D  F P     P  P+M +E +SGWF  
Sbjct: 212 VTLFQCDWASNFQLNALDDLVWTMNFGTGANIDEQFAPLKKVRPDSPLMCSEFWSGWFDK 271

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYDYD 293
           +G     R  +D+   +      G +F + YM  GGTN+G  AG   P  A   TSYDYD
Sbjct: 272 WGANHETRAADDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYD 330

Query: 294 APIDEYGFIRQPKWGHLREL-------HKAIKLCEEYLISSDPTHQKLG-AKLEAHIYHK 345
           API E G I  PK+  LRE         K  K+ ++    S P  +    A L A++   
Sbjct: 331 APISESGKI-TPKYEKLRETLAKYMDGKKQAKVPDDIPTISVPAFEFTEVAPLFANLPEP 389

Query: 346 SSNDCAAFLANYDS 359
            S+D    +  YD 
Sbjct: 390 KSDDTIRTMEEYDQ 403



 Score = 42.4 bits (98), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 63/235 (26%), Positives = 87/235 (37%), Gaps = 47/235 (20%)

Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
            +P   +   L +      A +F++ K +    G  D  N      I        LDIL 
Sbjct: 413 TLPKIDRSATLTVTEAHDYAQIFIDGKYI----GKLDRRNGEKQLDIPACAEGAQLDILV 468

Query: 524 MMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGE-YIGLDKISL 580
             +G  N+G A  D  G        ++LKNG R      W +Y +    E Y GL K   
Sbjct: 469 EAMGRINFGRAIKDFKGI----TEKVELKNGGRTTELKGWKVYNLEDRYEGYKGL-KFEP 523

Query: 581 ANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAY 640
             S    QG  +P       Y+ TF   E  G   LN  + GKG  +VNG  IGR W   
Sbjct: 524 LKSVKDAQGQRVPGC-----YRATFHV-EKPGDTFLNFETWGKGLVYVNGYGIGRIWEI- 576

Query: 641 LAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                                       P QTLY +P  W+  GEN +++ + +G
Sbjct: 577 ---------------------------GPQQTLY-MPGCWLKEGENEILVFDIVG 603


>gi|170034404|ref|XP_001845064.1| beta-galactosidase [Culex quinquefasciatus]
 gi|167875697|gb|EDS39080.1| beta-galactosidase [Culex quinquefasciatus]
          Length = 650

 Score =  162 bits (410), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 114/325 (35%), Positives = 160/325 (49%), Gaps = 39/325 (12%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + YD    V+DGK     SGS HY R+ P+ W   +R  + GGL  ++ YV W+ H P  
Sbjct: 37  IDYDRDTFVMDGKDFRYVSGSFHYFRALPQTWRSKLRTMRAGGLNAVDLYVQWSLHNPKD 96

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTNN 123
            QY ++G  ++   ++   E  L++ LR GPY CAE + GG P WL +  PGIQ R ++ 
Sbjct: 97  NQYVWDGIANITDVIEAAIEEDLYVILRPGPYICAEIDNGGLPYWLFNKYPGIQVRISDA 156

Query: 124 PFKEEMKRFLAKIIDLMKQENLFA-SQGGPIILAQVENEYGNVEWAYGVGGELYV----- 177
            + +E+K +  K   LM Q   +    GGPII+ Q+ENEYG    A+G   + Y+     
Sbjct: 157 NYIKEVKIWYEK---LMSQLTPYMYGNGGPIIMVQLENEYG----AFGKCDKQYLNVLKE 209

Query: 178 ---KWAADTAVNLNTSVPW---VMCQQEDAPDPIINTCNGFYCDGFTPN--------SPS 223
              K+    AV      P+   ++C Q   P   I T  G   D              P 
Sbjct: 210 ETEKYTQGKAVLFTVDRPYDDELVCGQ--IPGVFITTDFGLMTDDEVDTHAAKVRSIQPK 267

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG- 282
            P++ TE Y+GW   +      RP   LA  + +  + G    ++YMYFGGTNFG  AG 
Sbjct: 268 GPLVNTEFYTGWLTHWQEKNQRRPAGPLAATLRKMLKDGWNV-DFYMYFGGTNFGFWAGA 326

Query: 283 -----GPLVA--TSYDYDAPIDEYG 300
                G  +A  TSYDYDAP+DE G
Sbjct: 327 NDWGLGKYMADITSYDYDAPMDEAG 351


>gi|15228075|ref|NP_178493.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
 gi|20198172|gb|AAM15443.1| predicted protein [Arabidopsis thaliana]
 gi|330250699|gb|AEC05793.1| glycosyl hydrolase family 35 protein [Arabidopsis thaliana]
          Length = 469

 Score =  162 bits (410), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 114/375 (30%), Positives = 166/375 (44%), Gaps = 89/375 (23%)

Query: 270 MYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
           MY G TNF RTAGGP + T+YDYDAP+DE+G + QPK+GHL++LH      E+ L   + 
Sbjct: 23  MYHGHTNFDRTAGGPFITTTYDYDAPLDEFGNLNQPKYGHLKQLHDVFHAMEKTLTYGNI 82

Query: 330 THQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNVV 389
           +    G  +   +Y ++    + F+ N     +A + F G  Y +PAW VSILPDCK   
Sbjct: 83  STADFGNLVMTTVY-QTEEGSSCFIGNV----NAKINFQGTSYDVPAWYVSILPDCKTES 137

Query: 390 FNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
           +NTAK +  R                   S  F                        N +
Sbjct: 138 YNTAKRMKLR------------------TSLRFK-----------------------NVS 156

Query: 450 KDTSDYLWYTASIHVM---PGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFAN--- 503
            D SD+LWY  ++++    P  GK + L I S  H    FVN +      GN+   N   
Sbjct: 157 NDESDFLWYMTTVNLKEQDPAWGKNMSLRINSTAHVLHGFVNGQHT----GNYRVENGKF 212

Query: 504 -FLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW 562
            ++  +  + N G+N + +LS+ V L NYGA+F+   AG+   + I  +NG   +     
Sbjct: 213 HYVFEQDAKFNPGVNVITLLSVTVDLPNYGAFFENVPAGITGPVFIIGRNGDETV----- 267

Query: 563 IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMG 622
              V     + G  K+                        T F AP G  P+ ++L   G
Sbjct: 268 ---VKYLSTHNGATKL------------------------TIFKAPLGSEPVVVDLLGFG 300

Query: 623 KGQAWVNGQSIGRYW 637
           KG+A +N    GRYW
Sbjct: 301 KGKASINENYTGRYW 315


>gi|218260271|ref|ZP_03475643.1| hypothetical protein PRABACTJOHN_01305, partial [Parabacteroides
           johnsonii DSM 18315]
 gi|218224641|gb|EEC97291.1| hypothetical protein PRABACTJOHN_01305 [Parabacteroides johnsonii
           DSM 18315]
          Length = 539

 Score =  162 bits (410), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 111/323 (34%), Positives = 158/323 (48%), Gaps = 27/323 (8%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           ++  ++DGK  V+++  IHY R   E W   I+  K  G+  I  Y FWN HE   G++ 
Sbjct: 36  NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F G+ D+  F +  Q+  +++ LR GPY C+EW  GG P WL     I+ RT +  F E 
Sbjct: 96  FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
            K F+ +I   +   +L  ++GG II+ QVENEYG    +Y    E Y+    D      
Sbjct: 156 TKLFMNEIGKQLA--DLQITKGGNIIMVQVENEYG----SYATDKE-YIANIRDIVKGAG 208

Query: 189 -TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
            T VP   C      Q +A D ++ T N   G   D          P+ P+M +E +SGW
Sbjct: 209 FTDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGW 268

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
           F  +G     R  E +   +    + G +F + YM  GGT FG   G        + +SY
Sbjct: 269 FDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSY 327

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAPI E G+   PK+  LREL
Sbjct: 328 DYDAPISEAGWT-TPKYFKLREL 349


>gi|332030018|gb|EGI69843.1| Beta-galactosidase [Acromyrmex echinatior]
          Length = 594

 Score =  162 bits (410), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 113/332 (34%), Positives = 163/332 (49%), Gaps = 29/332 (8%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V Y++   ++DGK     SGS HY R+  + W + +RK +  GL  I TYV W+ HEP 
Sbjct: 1   DVDYENNQFLLDGKPFQYVSGSFHYFRTPRQYWRDRLRKMRAAGLNAISTYVEWSLHEPE 60

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
            GQ+ + G  DLV F+   QE  LF+ LR GPY CAE + GG P W L  +P I  RT +
Sbjct: 61  PGQFNWTGDADLVNFLNIAQEEDLFVLLRPGPYICAERDMGGLPYWLLREVPNINLRTKD 120

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-----NVEWAYGVGGELYV 177
             F      +L +I  L K   L    GGPII+ Q+ENEYG     ++E+   +  E++V
Sbjct: 121 ADFVRYATLYLNEI--LSKIRPLLRGNGGPIIMVQIENEYGSYYACDIEYM-DMLKEVFV 177

Query: 178 KWAADTAVNLNT---SVPWVMCQQEDAPDPII------NTCNGFYCDGFTPNSPSKPIMW 228
           K   + A+   T   +   + C         +      N  N F         P  P++ 
Sbjct: 178 KKVGNKALLYTTDGAAASLLRCGFISGAYATVDFGTASNVTNSFLSMRLY--QPRGPLVN 235

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA- 287
           +E Y GW   +G        E +  ++      G +  N+YM++GGTNFG T+G    A 
Sbjct: 236 SEFYPGWLTHWGEPFQRTKTEAIVKSLEEMLALGASV-NFYMFYGGTNFGFTSGANGGAG 294

Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLREL 313
                 TSYDYDAP+ E G    PK+  +R++
Sbjct: 295 VYNPQLTSYDYDAPLTEAG-DPTPKYFAIRDV 325


>gi|313231409|emb|CBY08524.1| unnamed protein product [Oikopleura dioica]
          Length = 493

 Score =  162 bits (409), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 107/321 (33%), Positives = 152/321 (47%), Gaps = 30/321 (9%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A  +DG++  L SGSIHY R   E W + + K K  GL  +E YV WN HEP  G++ F 
Sbjct: 62  AFWLDGEKITLVSGSIHYFRVPNEYWLDRLTKLKYAGLNTVELYVSWNLHEPYSGEFNFS 121

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  D+VRF++   E GL +  R GPY CAEW +GG P WL     ++ RTT   + E ++
Sbjct: 122 GDLDVVRFIEMAGELGLHVLFRPGPYICAEWEWGGHPYWLLHDTDMKVRTTYPGYLEAVE 181

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVG--GELYVKWAADTAVN-- 186
           +F +++    +  +L    GGPII  Q+ENEY     A+ +G     ++ W   T  +  
Sbjct: 182 KFYSELFG--RVNHLMYRNGGPIIAVQIENEYAGFADAFEIGPLDPGFLTWLRQTIKDQQ 239

Query: 187 -----LNTSVPWVMCQQEDAPDPI-INTCN----GFYCDGFTPNSPSKPIMWTENYSGWF 236
                  +   W   + E   DP  +N  +     ++ +    N P KP M  E +SGWF
Sbjct: 240 CEELLFTSDGGWDFYKYELEGDPYGLNFDDVLRANYWLNILENNQPGKPKMVMEWWSGWF 299

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------- 285
             +GY       +     +        +  NYYM+ GGTNFG   G              
Sbjct: 300 DFWGYHHQGTTADSFEENLRAILSQNASV-NYYMFHGGTNFGYMNGANFNTNDQTNDLEY 358

Query: 286 --VATSYDYDAPIDEYGFIRQ 304
             V TSYDYD P+ E G I +
Sbjct: 359 QPVVTSYDYDCPLSEEGRITK 379


>gi|336319932|ref|YP_004599900.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
 gi|336103513|gb|AEI11332.1| Beta-galactosidase [[Cellvibrio] gilvus ATCC 13127]
          Length = 586

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 26/307 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   + SG++HY R  P++W + IRK++  GL  IETYV WN H P RG +   G
Sbjct: 11  FLLDGEPLQILSGALHYFRVHPDLWADRIRKARLMGLNTIETYVAWNAHAPERGVFDLTG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+  V   GL   +R GPY CAEW+ GG P WL   PG+  RT    + E +  
Sbjct: 71  NLDLGRFLDLVAAEGLHAIVRPGPYICAEWDNGGLPAWLMATPGVGVRTAEPQYLEAIAG 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +  +I+ ++    +  ++GGP+++ QVENEYG    AYG   + Y++            V
Sbjct: 131 YYDEILAVVAPRQV--TRGGPVLMVQVENEYG----AYGDDAD-YLRALVTMMRERGIEV 183

Query: 192 PWVMCQQEDAPD------PIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFLSF 239
           P   C Q +         P ++    F        +    + P+ P+M  E + GWF S+
Sbjct: 184 PLTTCDQANDEMLGRGGLPELHKTATFGSRSPERLETLRRHQPTGPLMCMEYWDGWFDSW 243

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP------LVATSYDYD 293
           G           A A      + G   N YM+ GGTN G T G         + TSYDYD
Sbjct: 244 GEQH-HTTDAAEAAADLDLLLSQGASANLYMFHGGTNLGFTNGANDKGTYLPITTSYDYD 302

Query: 294 APIDEYG 300
           AP+ E G
Sbjct: 303 APLAEDG 309


>gi|365118603|ref|ZP_09337115.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649320|gb|EHL88436.1| hypothetical protein HMPREF1033_00461 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 823

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 112/322 (34%), Positives = 151/322 (46%), Gaps = 25/322 (7%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  ++++  +HYPR     W + I+  K  G+  I  YVFWN HEP  G++ F 
Sbjct: 74  TFLLNGKPFIIRAAELHYPRIPKPYWEQRIKLCKALGMNTICLYVFWNLHEPRPGEFDFT 133

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ DL  F +  Q+  +++ LR GPY CAEW  GG P WL     I+ R  +  F E + 
Sbjct: 134 GQNDLAAFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLREADPYFIERVN 193

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            F  ++   +    L    GGPII+ QVENEYG    +YG   E YV    D        
Sbjct: 194 IFEQEVARQVG--GLTIQNGGPIIMVQVENEYG----SYGESKE-YVSLIRDIVRTNFGD 246

Query: 191 VPWVMCQ------QEDAPDPI--INTCNGFYCD----GFTPNSPSKPIMWTENYSGWFLS 238
           V    C       +   PD +  IN   G   D    G     P  P+M +E +SGWF  
Sbjct: 247 VTLFQCDWASNFTKNALPDLLWTINFGTGANIDQQFAGLKKLRPDSPLMCSEFWSGWFDK 306

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYDYD 293
           +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A   TSYDYD
Sbjct: 307 WGANHETRPASDMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDVTSYDYD 365

Query: 294 APIDEYGFIRQPKWGHLRELHK 315
           API E G      W   + L K
Sbjct: 366 APISESGQTTPKYWALRKTLGK 387


>gi|423342145|ref|ZP_17319860.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219016|gb|EKN11981.1| hypothetical protein HMPREF1077_01290 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 779

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 111/323 (34%), Positives = 158/323 (48%), Gaps = 27/323 (8%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           ++  ++DGK  V+++  IHY R   E W   I+  K  G+  I  Y FWN HE   G++ 
Sbjct: 36  NKTFLLDGKPFVIKAAEIHYTRIPAEYWEHRIQLCKALGMNTICIYAFWNIHEQKPGEFD 95

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F G+ D+  F +  Q+  +++ LR GPY C+EW  GG P WL     I+ RT +  F E 
Sbjct: 96  FSGQNDIAAFCRLAQKYDMYIMLRPGPYVCSEWEMGGLPWWLLKKDDIKLRTNDPYFLER 155

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
            K F+ +I   +   +L  ++GG II+ QVENEYG    +Y    E Y+    D      
Sbjct: 156 TKLFMNEIGKQLA--DLQITKGGNIIMVQVENEYG----SYATDKE-YIANIRDIVKGAG 208

Query: 189 -TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
            T VP   C      Q +A D ++ T N   G   D          P+ P+M +E +SGW
Sbjct: 209 FTDVPLFQCDWSSNFQNNALDDLVWTINFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGW 268

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
           F  +G     R  E +   +    + G +F + YM  GGT FG   G        + +SY
Sbjct: 269 FDHWGRKHETRDAETMVSGLKDMLDRGISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSY 327

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAPI E G+   PK+  LREL
Sbjct: 328 DYDAPISEAGWT-TPKYFKLREL 349



 Score = 47.0 bits (110), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 49/107 (45%), Gaps = 30/107 (28%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +Y+ TF   E  G + L++ + GKG  WVNG++IGR+W                      
Sbjct: 533 YYRATFNLEEA-GDVFLDMQTWGKGMVWVNGKAIGRFWEI-------------------- 571

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                    P QTL+ +P  W+  GEN +++ + LG + + I  L K
Sbjct: 572 --------GPQQTLF-MPGCWLKKGENEIIVLDLLGPEKATIKGLDK 609


>gi|327282153|ref|XP_003225808.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
          Length = 649

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 115/335 (34%), Positives = 164/335 (48%), Gaps = 23/335 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y H   + DG+     SGSIHY R     W + + K K  GL+ I+TYV WN+HEP R
Sbjct: 32  IDYGHNCFLKDGQPFRYISGSIHYSRIPRYYWKDRLLKMKMAGLDAIQTYVPWNFHEPER 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y F G  DL  F++  QE GL + LR GPY CAEW+ GG P WL     I  R+++  
Sbjct: 92  GVYNFTGDRDLEYFLQLAQEVGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDPD 151

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY-GVGGELYVKWA 180
           +   +  ++   +  MK        GGPII+ QVENEYG+    ++ Y      L+ ++ 
Sbjct: 152 YLTAVGSWMGIFLPKMKPH--LYQNGGPIIMVQVENEYGSYFACDFDYLRYLQNLFRQYL 209

Query: 181 ADTAVNLNT---SVPWVMCQQEDAP------DPIINTCNGFYCDGFTPNSPSKPIMWTEN 231
            D  V   T   S+ ++ C             P  N    F     T   P  P++ +E 
Sbjct: 210 GDEVVLFTTDGASMFYLRCGALQGLYSTVDFGPGRNVTAAFSTQRHT--EPKGPLVNSEF 267

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
           Y+GW   +G+     P   +A +++    +G    N YM+ GGTNFG   G   P +A  
Sbjct: 268 YTGWLDHWGHRHITVPASIVAKSLSEILASGANV-NMYMFIGGTNFGYWNGANMPYMAQP 326

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           TSYDYDAP+ E G + + K+  +RE+    K   E
Sbjct: 327 TSYDYDAPLSEAGDLTE-KYFAIREVIGMFKKLPE 360


>gi|329927236|ref|ZP_08281534.1| beta-galactosidase [Paenibacillus sp. HGF5]
 gi|328938636|gb|EGG35019.1| beta-galactosidase [Paenibacillus sp. HGF5]
          Length = 587

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 104/296 (35%), Positives = 144/296 (48%), Gaps = 26/296 (8%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SG+IHY R  PE W + + K +  GL  +ETY+ WN HEP  GQ+ F+G  DL RFV+  
Sbjct: 23  SGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVRIA 82

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
            + GL + LR  PY CAEW +GG P WL   P IQ R  +  + E++ ++  ++I  +  
Sbjct: 83  GDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRLVP 142

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-------PWVM 195
             L  S+GGP+I  Q+ENEYG    +YG     Y+++  D  +     V       P   
Sbjct: 143 --LLTSKGGPVIAMQIENEYG----SYG-NDTAYLEYLKDGLIKRGVDVLLFTSDGPTDG 195

Query: 196 CQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVED 250
             Q  A   ++ T N         D      P  P+M  E ++GWF  +      R  ED
Sbjct: 196 MLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAED 255

Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYG 300
            A       +   +  N+YM+ GGTNFG   G           TSYDYDAP+ E G
Sbjct: 256 AAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310


>gi|257413247|ref|ZP_04742461.2| beta-galactosidase [Roseburia intestinalis L1-82]
 gi|257204151|gb|EEV02436.1| beta-galactosidase [Roseburia intestinalis L1-82]
          Length = 588

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 118/351 (33%), Positives = 166/351 (47%), Gaps = 47/351 (13%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W + + K K  G   +ETY+ WN HEP +G+++FEG  
Sbjct: 19  LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 78

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF- 132
           D+ RFVKT QE GL++ LR  PY CAEW +GG P WL    G++ R +  PF + ++ + 
Sbjct: 79  DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 138

Query: 133 ---LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
              L KI+          + GGP+IL QVENEYG     Y      Y+    D       
Sbjct: 139 DVLLKKIVPYQ------INYGGPVILMQVENEYG-----YYANDREYLLAMRDKMQKGGV 187

Query: 190 SVPWVMCQQEDAPDPIINTCNGFYCDGFTP--NSPSK---------------PIMWTENY 232
            VP V      +  P     NG + +G  P  N  SK               P+M TE +
Sbjct: 188 VVPLVT-----SDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFW 242

Query: 233 SGWFLSFGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----- 286
            GWF  +G        +E+    + +  E G    N YM+ GGTNFG   G         
Sbjct: 243 VGWFDHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTP 300

Query: 287 -ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
             TSYDYDA + E G I + K+   R++    +   E   +++   +  G+
Sbjct: 301 DVTSYDYDALLTEDGQITE-KYRRYRDVIAKYREIPEVTFTTEIKRKAYGS 350


>gi|157106609|ref|XP_001649402.1| beta-galactosidase [Aedes aegypti]
 gi|108879821|gb|EAT44046.1| AAEL004575-PA [Aedes aegypti]
          Length = 648

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 112/336 (33%), Positives = 171/336 (50%), Gaps = 37/336 (11%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y++   ++DG      +GS HY R+ P+ W  +++  +  GL  + TYV W+ H P +
Sbjct: 36  IDYENNTFLLDGAPFQYIAGSFHYFRALPQAWGPILKSMRAAGLNAVTTYVEWSLHNPKK 95

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
           G Y ++G  D+ RFV+  Q   L + LR GPY CAE + GGFP W L+  PGIQ RT + 
Sbjct: 96  GVYNWDGMADIERFVQLAQNEDLLVILRPGPYICAERDMGGFPYWLLNKYPGIQLRTADV 155

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD- 182
            +  E++ + A++    + E  F   GGPII+ QVENEYG    ++      Y+KW  D 
Sbjct: 156 AYLREVRTWYAELFS--RLEPYFYGNGGPIIMVQVENEYG----SFFACDYKYMKWLRDE 209

Query: 183 -------TAVNLNTSVPWVMCQQEDAPDPIINTCN-----GFYCDGFTPN----SPSKPI 226
                   AV    + P +   Q    D +++T +         DG+  +     P  P+
Sbjct: 210 TERYVRGKAVLFTNNGPGL--TQCGGIDGVLSTLDFGPGTALEIDGYWKDLRKLQPKGPL 267

Query: 227 MWTENYSGWFLSFGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--- 282
           +  E Y GW   +      R P+E +  ++ R+  +     N YM++GGTNFG TAG   
Sbjct: 268 VNAEYYPGWLTHWQEQQMARSPIEPVVTSL-RYMLSSKVNVNIYMFYGGTNFGFTAGANE 326

Query: 283 ---GPLVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
              G  +   TSYDYDAP+DE G    PK+  +R++
Sbjct: 327 QGPGRFIPDITSYDYDAPLDESG-DPTPKYEAIRKV 361


>gi|291535092|emb|CBL08204.1| Beta-galactosidase [Roseburia intestinalis M50/1]
 gi|291539606|emb|CBL12717.1| Beta-galactosidase [Roseburia intestinalis XB6B4]
          Length = 581

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 118/351 (33%), Positives = 166/351 (47%), Gaps = 47/351 (13%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W + + K K  G   +ETY+ WN HEP +G+++FEG  
Sbjct: 12  LDGKPFQIISGAIHYFRIVPEYWQDRLEKLKAMGCNTVETYIPWNMHEPKKGEFHFEGML 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF- 132
           D+ RFVKT QE GL++ LR  PY CAEW +GG P WL    G++ R +  PF + ++ + 
Sbjct: 72  DIERFVKTAQELGLYVILRPSPYICAEWEFGGLPAWLLAEDGMKLRVSYPPFLKHVQDYY 131

Query: 133 ---LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
              L KI+          + GGP+IL QVENEYG     Y      Y+    D       
Sbjct: 132 DVLLKKIVPYQ------INYGGPVILMQVENEYG-----YYANDREYLLAMRDKMQKGGV 180

Query: 190 SVPWVMCQQEDAPDPIINTCNGFYCDGFTP--NSPSK---------------PIMWTENY 232
            VP V      +  P     NG + +G  P  N  SK               P+M TE +
Sbjct: 181 VVPLVT-----SDGPFEENLNGGHLEGALPTGNFGSKTEERFEVLKKYTDGGPLMCTEFW 235

Query: 233 SGWFLSFGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----- 286
            GWF  +G        +E+    + +  E G    N YM+ GGTNFG   G         
Sbjct: 236 VGWFDHWGNGGHMTGNLEESVKDLDKMLELGHV--NIYMFEGGTNFGFMNGSNYYDELTP 293

Query: 287 -ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
             TSYDYDA + E G I + K+   R++    +   E   +++   +  G+
Sbjct: 294 DVTSYDYDALLTEDGQITE-KYRRYRDVIAKYREIPEVTFTTEIKRKAYGS 343


>gi|345003968|ref|YP_004806822.1| glycoside hydrolase family protein [Streptomyces sp. SirexAA-E]
 gi|344319594|gb|AEN14282.1| glycoside hydrolase family 35 [Streptomyces sp. SirexAA-E]
          Length = 602

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 113/347 (32%), Positives = 168/347 (48%), Gaps = 29/347 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A +TY    L+  G+   + +G++HY R  P+ W + + +    GL  ++TY+ WN+HE 
Sbjct: 7   ALLTYSEGTLLRAGRPHQVLAGTLHYFRVHPDQWHDRLERLAAMGLNTVDTYIAWNFHER 66

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G++ F+G  D+ RFV+T Q  GL + +R GPY CAEW+ GG P WL   PG++ R++ 
Sbjct: 67  RTGEHRFDGWRDIERFVRTAQRTGLDVIVRPGPYICAEWDNGGLPAWLTDRPGMRPRSSY 126

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            P+ +E+ R+   +I  +   +L A++GGP++  QVENEYG+    YG     Y++W  D
Sbjct: 127 APYLDEVARWFDVLIPRIA--DLQAARGGPVVAVQVENEYGS----YG-DDHAYMRWVHD 179

Query: 183 TAVNLNTS--------VPWVMCQQEDAPDPIINTCNGFYCDG----FTPNSPSKPIMWTE 230
                  +           +M      P  +     G   D            +P +  E
Sbjct: 180 ALAGRGVTELLYTADGPTELMLDGGSLPGVLATATLGSRADQAAQLLRTRRSGEPFLCAE 239

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-----GPL 285
            ++GWF  +G     R V   A A+      GG+  + Y   GGTNFG  AG     G L
Sbjct: 240 FWNGWFDHWGEKHHTRSVGSAAAALDEILAKGGSV-SLYPAHGGTNFGLWAGANHADGAL 298

Query: 286 --VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
               TSYD DAPI E+G    PK+   R+ L  A    E  L  S P
Sbjct: 299 QPTVTSYDSDAPIAEHG-APTPKFHAFRDRLLAATGAAERELPRSRP 344


>gi|387791561|ref|YP_006256626.1| beta-galactosidase [Solitalea canadensis DSM 3403]
 gi|379654394|gb|AFD07450.1| beta-galactosidase [Solitalea canadensis DSM 3403]
          Length = 619

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 115/339 (33%), Positives = 165/339 (48%), Gaps = 42/339 (12%)

Query: 8   DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
           ++ A V DGK   + SG +H+ R   E W   ++  K  GL  + TYVFWNYHE   G +
Sbjct: 29  ENGAFVYDGKPVQIHSGEMHFARVPQEYWRHRLKMMKAMGLNSVATYVFWNYHETAPGVW 88

Query: 68  YFE-GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFK 126
            F+ G  ++  F+K   E GL + LR GPYACAEW YGG+P +L  + G++ R  N  F 
Sbjct: 89  DFKTGNKNISEFIKIAGEEGLMVILRPGPYACAEWEYGGYPWFLQNVEGLEVRRNNPKFL 148

Query: 127 EEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWAYGVGG 173
              K ++  +   +K + +  ++GGPII+ Q ENE+G+               ++  +  
Sbjct: 149 AACKEYIDHLAKEVKNQQI--TKGGPIIMVQAENEFGSYVAQRKDIPLAEHKAYSSAIKA 206

Query: 174 ELYVKWAADTAVNLNTSV-PWVMCQQEDAPDPIINTCNG--------FYCDGFTPNSPSK 224
           +L    AA   V L TS   W+   +  + +  + T NG           D +  N    
Sbjct: 207 QLL---AAGFDVPLFTSDGSWLF--EGGSIENCLPTANGEDNIENLKKVVDQY--NGGKG 259

Query: 225 PIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
           P M  E Y GW   +    P  P ED+     ++ +   +F NYYM  GGTNFG T+G  
Sbjct: 260 PYMVAEFYPGWLDHWAEPFPKVPTEDVVKQTEKYLQNNVSF-NYYMVHGGTNFGYTSGAN 318

Query: 285 LVA--------TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
                      TSYDYDAPI E G+   PK+  +REL K
Sbjct: 319 YDKNHDIQPDMTSYDYDAPISEAGW-ATPKYIAIRELMK 356



 Score = 46.2 bits (108), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 68/240 (28%), Positives = 97/240 (40%), Gaps = 59/240 (24%)

Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
           Y+ Y+   +  P  GK   L +  L   ALV+VN + VA      +   +  N   E++ 
Sbjct: 413 YVLYSRKFN-QPISGK---LELNGLRDYALVYVNGEKVA------ELNRYYKNYSCEIDV 462

Query: 515 GIN-TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEY 572
             N TLDI    +G  NYGA       G+ S ++I   NG     SG W +Y+       
Sbjct: 463 PFNATLDIFVENMGRINYGAKITENNKGIISPVVI---NGTE--ISGNWKMYK------- 510

Query: 573 IGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQS 632
           + L+K     S   K+  + PV       K TF   E  G   L++ + GKG  +VNG  
Sbjct: 511 MPLEKQEEVASIKAKEVKSQPV-----VLKGTFNLTE-TGDTFLDMEAWGKGIVFVNGYH 564

Query: 633 IGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
           +GRYW+                              P QTLY +P  W+  G N + I E
Sbjct: 565 LGRYWNV----------------------------GPQQTLY-LPGCWLKKGANEITIVE 595


>gi|383128326|gb|AFG44819.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
 gi|383128328|gb|AFG44820.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
 gi|383128336|gb|AFG44824.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
 gi|383128338|gb|AFG44825.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
          Length = 157

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 104/153 (67%), Gaps = 5/153 (3%)

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW +Y+A   GCT  CDYRG+Y +SKC  +CGQP+Q LYH+PR+W+ P  N+
Sbjct: 1   VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
           LV+ EELGGDP++IS + ++   +C+ VSE   PPV SWK +    L V     +++L C
Sbjct: 61  LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120

Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
               H I +I FAS+G P G+CGSF  G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTN 153


>gi|427385726|ref|ZP_18882033.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726765|gb|EKU89628.1| hypothetical protein HMPREF9447_03066 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1106

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 112/326 (34%), Positives = 157/326 (48%), Gaps = 39/326 (11%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           + +++GK  V+++  +HYPR     W + I+  K  G+  +  YVFWN HEP  G Y F 
Sbjct: 356 SFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGTYDFT 415

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
            + DL  F +  Q+  +++ LR GPY CAEW  GG P WL     I+ R ++  F E + 
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVN 475

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   +   +K  +L  + GGPII+ QVENEYG+              V   +G    L+
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGADKGYVSQIRDIVRTHFGNDIALF 533

Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
              WA++  +N    + W M           N   G   D          P+ P+M +E 
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKKLRPNSPLMCSEF 582

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
           +SGWF  +G     RP ED+   +      G +F + YM  GGTN+G  AG   P  A  
Sbjct: 583 WSGWFDKWGANHETRPAEDMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641

Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
            TSYDYDAPI E G    PK+  LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWKLRE 666


>gi|261407762|ref|YP_003244003.1| beta-galactosidase [Paenibacillus sp. Y412MC10]
 gi|261284225|gb|ACX66196.1| Beta-galactosidase [Paenibacillus sp. Y412MC10]
          Length = 587

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 104/296 (35%), Positives = 144/296 (48%), Gaps = 26/296 (8%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SG+IHY R  PE W + + K +  GL  +ETY+ WN HEP  GQ+ F+G  DL RFV+  
Sbjct: 23  SGAIHYFRVVPEYWEDRLMKLRSCGLNTVETYIPWNLHEPKEGQFVFDGIADLERFVRIA 82

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
            + GL + LR  PY CAEW +GG P WL   P IQ R  +  + E++ ++  ++I  +  
Sbjct: 83  GDLGLHVILRPSPYICAEWEFGGLPSWLLQNPDIQLRCMDPVYLEKVDQYYDELIPRLVP 142

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-------PWVM 195
             L  S+GGP+I  Q+ENEYG    +YG     Y+++  D  +     V       P   
Sbjct: 143 --LLTSKGGPVIAMQIENEYG----SYG-NDTAYLEYLKDGLIKRGVDVLLFTSDGPTDG 195

Query: 196 CQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVED 250
             Q  A   ++ T N         D      P  P+M  E ++GWF  +      R  ED
Sbjct: 196 MLQGGAVPGVLATVNFGSRTKEAFDKLREYRPEDPLMCMEYWNGWFDHWLKPHHTRDAED 255

Query: 251 LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYG 300
            A       +   +  N+YM+ GGTNFG   G           TSYDYDAP+ E G
Sbjct: 256 AAAVFKEMLDLNASV-NFYMFHGGTNFGFYNGANFHEKYEPTLTSYDYDAPLSECG 310


>gi|361068121|gb|AEW08372.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
 gi|383128330|gb|AFG44821.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
 gi|383128334|gb|AFG44823.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
          Length = 157

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 76/153 (49%), Positives = 103/153 (67%), Gaps = 5/153 (3%)

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW +Y+A   GCT  CDYRG+Y +SKC  +CGQP+Q LYH+PR+W+ P  N+
Sbjct: 1   VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
           LV+ EELGGDP++IS + ++   +C+ VSE   PPV SWK +    L V     +++L C
Sbjct: 61  LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120

Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
               H I +I FAS+G P G CGSF  G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTN 153


>gi|319934802|ref|ZP_08009247.1| beta-galactosidase [Coprobacillus sp. 29_1]
 gi|319810179|gb|EFW06541.1| beta-galactosidase [Coprobacillus sp. 29_1]
          Length = 589

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/332 (32%), Positives = 166/332 (50%), Gaps = 37/332 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   + SG+IHY R  P+ W + +   K  G   +ETY+ WN HEP  G++ F+G
Sbjct: 10  FIVDGKPIKILSGAIHYFRIVPKHWEDSLYNLKALGFNTVETYIPWNLHEPKEGEFDFQG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V F+K  QE  L + +R  PY CAEW +GG P WL     +  R+    + E++K 
Sbjct: 70  IKDVVSFIKKAQEMELMVIVRPSPYICAEWEFGGLPAWLLTYDNLHLRSDCPRYLEKVKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   ++ ++   +L ++QGGPII+ QVENE+G+         + Y+K      ++L   V
Sbjct: 130 YYEVLLPMLT--SLQSTQGGPIIMMQVENEFGSFS-----NNKTYLKKLKKIMLDLGVEV 182

Query: 192 P-------WVMCQQEDA--PDPIINTCN-GFY-------CDGFTPNSPSK-PIMWTENYS 233
           P       W    +  +   D ++ T N G +        + F  N   K P+M  E + 
Sbjct: 183 PLFTSDGSWQQALESGSLIDDDVLVTANFGSHSHENLDVLEQFMANHQKKWPLMSMEFWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PL 285
           GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG   G         P 
Sbjct: 243 GWFNRWGEEIITRDAQDLANCVKELLTRGSI--NLYMFHGGTNFGFMNGCSARGQKDLPQ 300

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           V TSYDYDA + E G I + K+  ++++ K +
Sbjct: 301 V-TSYDYDALLTEAGDITE-KYQCVKKVMKEL 330


>gi|395803570|ref|ZP_10482814.1| beta-galactosidase [Flavobacterium sp. F52]
 gi|395434124|gb|EJG00074.1| beta-galactosidase [Flavobacterium sp. F52]
          Length = 617

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 113/339 (33%), Positives = 165/339 (48%), Gaps = 56/339 (16%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GRF 73
           DGK   + SG +HY R   E W   ++  K  GL  + TYVFWNYHE   G + F+ G  
Sbjct: 37  DGKIIKIHSGEMHYERIPKEYWRHRLQMLKAMGLNTVATYVFWNYHEIEPGVWDFKTGNR 96

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL  F++  +  GL++ LR GPYAC EW +GG+P WL   P +  RT N  F +  K +L
Sbjct: 97  DLAEFLRIAKSEGLYVILRPGPYACGEWEFGGYPWWLQNNPDLVIRTNNKAFLDACKTYL 156

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP- 192
             +  ++K    FA+QGGPII+ Q ENE+G+           YV    D +   + +   
Sbjct: 157 EHLYAVVKGN--FANQGGPIIMVQAENEFGS-----------YVSQRTDISAEDHKAYKT 203

Query: 193 --WVMCQQEDAPDPIINT-----CNGFYCDGFTPNSPSK------------------PIM 227
             + + ++   P+P   +       G   +G  P +  +                  P M
Sbjct: 204 AIYNILKETGFPEPFFTSDGSWLFEGGMVEGVLPTANGESNIENLKKQVDKYHKGQGPYM 263

Query: 228 WTENYSGWFLSFGYAVPFRPV--EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
             E Y GW     +A PF  +  E++A    ++ + G +F NYYM  GGTNFG T+G   
Sbjct: 264 VAEFYPGWLDH--WAEPFVKIGSEEIASQTKKYLDAGVSF-NYYMAHGGTNFGFTSGANY 320

Query: 284 -------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
                  P + TSYDYDAPI E G+   PK+  +R++ +
Sbjct: 321 NEESDIQPDI-TSYDYDAPISEAGW-ATPKFMAIRDVMQ 357


>gi|340346435|ref|ZP_08669560.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
 gi|339611892|gb|EGQ16709.1| family 35 glycosyl hydrolase [Prevotella dentalis DSM 3688]
          Length = 859

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 112/334 (33%), Positives = 160/334 (47%), Gaps = 44/334 (13%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  +  YVFWN HE   GQ+ F 
Sbjct: 100 TFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFT 159

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R  +  F E ++
Sbjct: 160 GQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVE 219

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            F  K+ + +    L   +GGPII+ QVENEYG    +YG   + YV    D      + 
Sbjct: 220 LFEQKVAEQLAP--LTIRRGGPIIMVQVENEYG----SYGED-KAYVSQIRDVLRRYWSL 272

Query: 191 VPWVMCQQEDAPDPIINTCNGFYCDGFTPN---------------------------SPS 223
            P    + E A  P++  C+  +   FT N                            P 
Sbjct: 273 SPTGEGRGE-AASPLMFQCD--WSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPD 329

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
            P M +E +SGWF  +G     RP  D+   +      G +F + YM  GGT+FG  AG 
Sbjct: 330 APKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGA 388

Query: 284 --PLVA---TSYDYDAPIDEYGFIRQPKWGHLRE 312
             P  A   TSYDYDAPI+EYG    PK+  LR+
Sbjct: 389 NSPGFAPDVTSYDYDAPINEYGQA-TPKFWELRK 421


>gi|433651261|ref|YP_007277640.1| beta-galactosidase [Prevotella dentalis DSM 3688]
 gi|433301794|gb|AGB27610.1| beta-galactosidase [Prevotella dentalis DSM 3688]
          Length = 797

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 112/334 (33%), Positives = 160/334 (47%), Gaps = 44/334 (13%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  +  YVFWN HE   GQ+ F 
Sbjct: 38  TFLLNGKPFVVKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQREGQFDFT 97

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R  +  F E ++
Sbjct: 98  GQNDVAAFCRLAQQNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMERVE 157

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            F  K+ + +    L   +GGPII+ QVENEYG    +YG   + YV    D      + 
Sbjct: 158 LFEQKVAEQLAP--LTIRRGGPIIMVQVENEYG----SYGED-KAYVSQIRDVLRRYWSL 210

Query: 191 VPWVMCQQEDAPDPIINTCNGFYCDGFTPN---------------------------SPS 223
            P    + E A  P++  C+  +   FT N                            P 
Sbjct: 211 SPTGEGRGE-AASPLMFQCD--WSSNFTRNGLDDLVWTMNFGTGANINDQFRRLGELRPD 267

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
            P M +E +SGWF  +G     RP  D+   +      G +F + YM  GGT+FG  AG 
Sbjct: 268 APKMCSEFWSGWFDKWGARHETRPARDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGA 326

Query: 284 --PLVA---TSYDYDAPIDEYGFIRQPKWGHLRE 312
             P  A   TSYDYDAPI+EYG    PK+  LR+
Sbjct: 327 NSPGFAPDVTSYDYDAPINEYGQA-TPKFWELRK 359


>gi|440800373|gb|ELR21412.1| lysosomal betagalactosidase, partial [Acanthamoeba castellanii str.
           Neff]
          Length = 604

 Score =  160 bits (405), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 108/309 (34%), Positives = 150/309 (48%), Gaps = 30/309 (9%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DG+   + SGSIHY RS PE WP  +R  +  GL  + TYV WN HEP  GQY F GR D
Sbjct: 36  DGQEFRIVSGSIHYFRSLPEQWPARLRTLRSCGLNTVTTYVPWNLHEPTPGQYDFSGRLD 95

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           +VRF++  Q+ G  + +R  PY CAE  +GG P WL    G+Q R ++  + + +  FL 
Sbjct: 96  IVRFIEAAQQEGFLVIVRPPPYICAELEFGGLPAWLLNEEGLQLRCSDPKYLKRVDSFLD 155

Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSVP 192
             + ++       S+GGPII  QVENEYG+   +  Y    EL  +     A+  +++  
Sbjct: 156 HFLPMLATYQY--SRGGPIIAMQVENEYGSYGNDHLYLRHLELKFRQHQIDAILFSSNGA 213

Query: 193 WVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFG----Y 241
                   A   ++ T N   G   +G         PS P+  TE + GWF  +G     
Sbjct: 214 GDQMFVGGALPSLLRTVNFGTGADVEGNLKVLRKYQPSGPLFVTEFWDGWFDHWGEEHHT 273

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----------PLVATSYD 291
             P + ++ L   +     +     N YM FGGTNFG T G               TSYD
Sbjct: 274 TTPTQSMKTLEAIL-----SNNASVNLYMAFGGTNFGFTNGANKGYGETDPYQPTTTSYD 328

Query: 292 YDAPIDEYG 300
           YDAP++E G
Sbjct: 329 YDAPVNESG 337


>gi|260804659|ref|XP_002597205.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
 gi|229282468|gb|EEN53217.1| hypothetical protein BRAFLDRAFT_203307 [Branchiostoma floridae]
          Length = 608

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 116/337 (34%), Positives = 167/337 (49%), Gaps = 46/337 (13%)

Query: 8   DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
           D     IDGK   L SG++HY R  PE W + + K K  GL  +ETYV WN HEP +  Y
Sbjct: 26  DGANFTIDGKPVRLLSGAMHYFRVVPEYWRDRMLKMKAAGLNTLETYVPWNLHEPEKYTY 85

Query: 68  YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
            FEG  DL R++    E GL++ LR GPY CAEW +GG P WL ++     RTT   F +
Sbjct: 86  NFEGILDLGRYLDIAHEVGLWVILRPGPYICAEWEFGGIPGWLAYVKE-HVRTTRPMFID 144

Query: 128 EMK----RFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVG------G 173
            ++    R LA+++          + GGPII  Q+ENEYG    + E+   +       G
Sbjct: 145 PVEVWFGRLLAEVVPRQ------YTNGGPIIAVQIENEYGGFSNSTEYMERLKKILESRG 198

Query: 174 ELYVKWAAD-TAVNLNTSVPWVMCQ---QEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
            + + + +D     ++  +P V+     Q +A D +                P +P+M  
Sbjct: 199 IVELLFTSDGKGALISGGIPGVLKTVNFQNNASDKL---------QKLKEIQPDRPMMVM 249

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFF-ETGGTFQNYYMYFGGTNFG---------R 279
           E ++GWF  +G       +E  +F  + F+    G   N+YM+ GGTNFG         +
Sbjct: 250 EYWTGWFDHWGEDHHLYRLESESFVHSVFYILDAGASVNFYMFHGGTNFGFMNGANTRYK 309

Query: 280 TAGGPL-VATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
           + G  L   TSYDYDAPI E G +  PK+  +RE+ K
Sbjct: 310 SGGRTLPTITSYDYDAPISETGDL-TPKYFKIREILK 345


>gi|376338072|gb|AFB33581.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
 gi|376338074|gb|AFB33582.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
          Length = 157

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 103/153 (67%), Gaps = 5/153 (3%)

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW +Y+A  +GCT  CDYRG+Y +SKC  +CGQP+Q LYH+PR+W+    N+
Sbjct: 1   VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
           LV+ EELGGDP++IS + ++   +C+ VSE   PPV SWK +    L V     +++L C
Sbjct: 61  LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSVLKVNKPKAELQLHC 120

Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
               H I +I FAS+G P G CGSF  G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTN 153


>gi|251799202|ref|YP_003013933.1| beta-galactosidase [Paenibacillus sp. JDR-2]
 gi|247546828|gb|ACT03847.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
          Length = 604

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 108/337 (32%), Positives = 163/337 (48%), Gaps = 41/337 (12%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + +T+  +   +DG+   + SG+IHY R  PE W + + K K  G   +ETY+ WN HEP
Sbjct: 2   SRLTWKDQKYRLDGEEFRILSGAIHYFRVVPEYWEDRLLKLKACGFNTVETYIPWNLHEP 61

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G + F+G  D+ RF++T    GL + +R  PY CAEW +GG P WL     +  R  +
Sbjct: 62  REGSFRFDGFADVARFIETAGRLGLHVIVRPSPYICAEWEFGGLPAWL-LKSSMGLRCMD 120

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
           N + E++ R+  ++I  +    L  S+GGPII  QVENEYG    +YG     Y+ +  D
Sbjct: 121 NEYLEKVDRYYDELIPRLLP--LLDSRGGPIIAVQVENEYG----SYG-NDTAYLAYLRD 173

Query: 183 TAVNLNTSVPWVMCQQEDAPDPII--NTCNGFYCD------------GFTPNSPSKPIMW 228
             +     V  ++   +   D ++   T  G +               +      +P+M 
Sbjct: 174 GLIR--RGVDCLLFTSDGPTDEMLLGGTVEGLHATVNFGSRVAESLAKYREYRQDEPLMV 231

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
            E + GWF  +      R   D+A  +    E G +  N YM+ GGTNFG  +G      
Sbjct: 232 MEYWLGWFDHWRKPHHVREAGDVANVLDEMLEQGASV-NLYMFHGGTNFGFYSGANYGEH 290

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
             P + TSYDYDAP+ E        WG + E +KAI+
Sbjct: 291 YEPTI-TSYDYDAPLTE--------WGDITEKYKAIR 318


>gi|340722578|ref|XP_003399681.1| PREDICTED: beta-galactosidase-like [Bombus terrestris]
          Length = 646

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 162/325 (49%), Gaps = 35/325 (10%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            S  V Y++   ++DGK     SGS HY R+  + W + +RK +  GL  + TYV W+ H
Sbjct: 30  FSFEVDYENNQFLLDGKPFRYISGSFHYFRTPRQYWRDRLRKMRAAGLNAVSTYVEWSLH 89

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFR 119
           +P   ++++ G  D++ F+   QE GLF+ LR GPY CAE ++GG P W L  +P I+ R
Sbjct: 90  QPTENEWHWTGDADVIEFINIAQEEGLFVLLRPGPYICAERDFGGLPYWLLARVPDIKLR 149

Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------V 165
           T ++ + + ++ +L +I+D  K +      GGPII+ QVENEYG+              +
Sbjct: 150 TNDSRYMKYVEIYLNEILD--KVQPYLRGNGGPIIMVQVENEYGSYACDREYLSRLRDIM 207

Query: 166 EWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
               G    LY    A+  +     +P V    +  P+   N    F  +      P  P
Sbjct: 208 RQKIGTKALLYSTDGANANMLRCGFIPEVYATVDFGPN--TNVTKNF--EIMRMYQPRGP 263

Query: 226 IMWTENYSGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
           ++ +E Y GW     +  PF+ V+   +   +      G +  N YM++GGTNFG TAG 
Sbjct: 264 LVNSEFYPGWLTH--WREPFQRVQTATVTKTLDEMLSLGASV-NIYMFYGGTNFGYTAGA 320

Query: 284 --------PLVATSYDYDAPIDEYG 300
                   P + TSYDYDAP+ E G
Sbjct: 321 NGGHNAYNPQL-TSYDYDAPLTEAG 344


>gi|376338078|gb|AFB33584.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
          Length = 157

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 103/153 (67%), Gaps = 5/153 (3%)

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW +Y+A  +GCT  CDYRG+Y +SKC  +CGQP+Q LYH+PR+W+    N+
Sbjct: 1   VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
           LV+ EELGGDP++IS + ++   +C+ VSE   PPV SWK +    L V     +++L C
Sbjct: 61  LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120

Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
               H I +I FAS+G P G CGSF  G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHCNTN 153


>gi|220914306|ref|YP_002489615.1| beta-galactosidase [Arthrobacter chlorophenolicus A6]
 gi|219861184|gb|ACL41526.1| Beta-galactosidase [Arthrobacter chlorophenolicus A6]
          Length = 586

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 107/310 (34%), Positives = 154/310 (49%), Gaps = 28/310 (9%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           R  ++DG+   + SG+IHY R  P++W + IRK++  GL  IETYV WN H    G +  
Sbjct: 9   RDFLLDGEPFRILSGAIHYFRVHPDLWADRIRKARLMGLNTIETYVPWNEHSSTPGAFRT 68

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
           +G  DL RF+  V   G+   +R GPY CAEW+ GG P WL   P I  R++   +   +
Sbjct: 69  DGGLDLGRFLDLVAAEGMQGIVRPGPYICAEWDNGGLPAWLFTDPSIGVRSSEPGYLAAV 128

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
             F+ +++ ++ +  +  ++GGP+IL Q+ENEYG    AYG   + Y++   DTA     
Sbjct: 129 DGFMDRLLPIVVERQI--TRGGPVILFQIENEYG----AYG-SDKAYLQHLVDTATRAGV 181

Query: 190 SVPWVMCQQ------EDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFL 237
            VP   C Q      ED   P ++    F               P  P+M  E ++GWF 
Sbjct: 182 EVPLFTCDQPFETMIEDGSLPGLHKTGTFGSRADERLAFLRERQPDGPLMCAEFWNGWFD 241

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSY 290
           ++G           A  +      G +  N YM+ GGTNFG T G        P + TSY
Sbjct: 242 NWGTHHHTTDAAASAAELDALLAAGASV-NIYMFHGGTNFGFTNGANDKGIYEPTI-TSY 299

Query: 291 DYDAPIDEYG 300
           DYDAP+ E G
Sbjct: 300 DYDAPLSEDG 309


>gi|393782614|ref|ZP_10370797.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
           CL02T12C01]
 gi|392672841|gb|EIY66307.1| hypothetical protein HMPREF1071_01665 [Bacteroides salyersiae
           CL02T12C01]
          Length = 605

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 113/325 (34%), Positives = 160/325 (49%), Gaps = 39/325 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GR 72
           +D K   + SG IH  R   E W + I+  K  G   +  Y+ WNYHE   G + F+ G 
Sbjct: 41  LDDKPFQIISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGN 100

Query: 73  FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF 132
            +L +F++TVQ+ G+FL  R GPY C EW++GG P +L  IP I+ R  +  +   ++R+
Sbjct: 101 KNLEKFIQTVQDEGMFLLFRPGPYVCGEWDFGGLPPYLLSIPDIKIRCMDTRYTAAVERY 160

Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
           + KI  ++K+  +  + GGPII+ QVENEYG    +YG    +Y+KW  D   +    VP
Sbjct: 161 VDKIAPIIKKYEI--TNGGPIIMVQVENEYG----SYG-NDRIYMKWMHDLWRDKGIEVP 213

Query: 193 WVMCQQEDAPDPII---NTCNGFYCDGFTPNS------------PSKPIMWTENYSGWFL 237
           +      D   P +    T  G    G  P +            P   +  +E Y GW  
Sbjct: 214 FYTA---DGATPYMLEAGTLPGVAI-GLDPAASKAEFDEALKVHPDASVFCSELYPGWLT 269

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------PLVAT 288
            +        +E +   V    + G +F NYY+  GGTNFG  AG          P V T
Sbjct: 270 HWREEWQHPSIEKITTDVKWLLDNGKSF-NYYVIHGGTNFGFWAGANSPQPGTYQPDV-T 327

Query: 289 SYDYDAPIDEYGFIRQPKWGHLREL 313
           SYDYDAPI+E G    PK+  LREL
Sbjct: 328 SYDYDAPINEMG-QATPKYMALREL 351


>gi|296399387|gb|ADH10509.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
          Length = 571

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 110/342 (32%), Positives = 166/342 (48%), Gaps = 27/342 (7%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  + Y+  + V DGK     SGSIHY R     W + + K K  GL+ I+TYV WNYHE
Sbjct: 6   SFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHE 65

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G Y F G  DL  F++   + GL + LR GPY CAEW+ GG P WL     I  R++
Sbjct: 66  PRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSS 125

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG------ 172
           ++ + E ++R++  ++  M+        GGPII+ QVENEYG+    ++ Y         
Sbjct: 126 DSDYLEAVERWMGVLLPKMRP--YLYQNGGPIIMVQVENEYGSYFACDYDYLRFLLKLFR 183

Query: 173 ---GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
              G+  V +  D A   +    ++  +    + AP    N    F       + P  P+
Sbjct: 184 LHLGDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPMGPL 239

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
           + +E Y+GW   +G+     P E +A  +      G    N YM+ GGTNF    G  + 
Sbjct: 240 VNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMP 298

Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL 324
                TSYDYDAP+ E G + + K+  +R++   + +   +L
Sbjct: 299 YMPQPTSYDYDAPLSEAGDLTE-KYFTIRKVIGMVSVPRTFL 339


>gi|374606374|ref|ZP_09679251.1| beta-galactosidase [Paenibacillus dendritiformis C454]
 gi|374388019|gb|EHQ59464.1| beta-galactosidase [Paenibacillus dendritiformis C454]
          Length = 583

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 115/349 (32%), Positives = 168/349 (48%), Gaps = 35/349 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             ++YD     +  +   L SG+IHY R  P  W + +RK K  G   IETYV WN HEP
Sbjct: 2   TTLSYDQGQFTMGDRPIQLISGAIHYFRVVPAYWEDRLRKIKAMGCNCIETYVAWNLHEP 61

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+++FEG  D+  FV+   E GL++ +R  PY CAEW +GG P WL     ++ R  +
Sbjct: 62  REGEFHFEGMSDVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL-LKDDMRLRCND 120

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             F E++  +   ++  +    L A++GGPII  Q+ENEYG    +YG   + Y++  A 
Sbjct: 121 PRFLEKVAAYYDALLPQLTP--LLATKGGPIIAVQIENEYG----SYG-NDQAYLQ--AQ 171

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYC------------DGFTPNSPSKPIMW 228
            A+ +   V  ++   +   D ++      G               D      P  P+M 
Sbjct: 172 RAMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMC 231

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
            E ++GWF  +      R  ED A  +      G +  N+YM  GGTNFG  +G      
Sbjct: 232 MEYWNGWFDHWFEQHHTRDAEDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDK 290

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
             P V TSYDYDA I E G +  PK+   RE + K + L E  L ++ P
Sbjct: 291 YEPTV-TSYDYDAAISEAGDL-TPKYHAFREVIGKYVSLPEGDLPANTP 337


>gi|257869131|ref|ZP_05648784.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
 gi|257803295|gb|EEV32117.1| 35 glycosylhydrolase [Enterococcus gallinarum EG2]
          Length = 584

 Score =  159 bits (403), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 170/639 (26%), Positives = 260/639 (40%), Gaps = 143/639 (22%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SGSIHY R  P  W + + K +  G   +ETYV WN HEP  G++ F    DL RF++  
Sbjct: 21  SGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQLA 80

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
           QE GL++ LR  PY CAEW +GG P WL   P ++ R    PF E++ R+  ++   +  
Sbjct: 81  QEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQVS- 139

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA------VNLNTSV-PWVM 195
            +L  +Q GPI++ QVENEYG    +YG   + Y++ +A+        V+L TS  PW+ 
Sbjct: 140 -DLQITQEGPILMMQVENEYG----SYG-NDKSYLRKSAELMRHNGIDVSLFTSDGPWLD 193

Query: 196 CQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSFGYAVPF-R 246
             +    +D   P IN C     + F      +   +P+M  E + GWF ++G       
Sbjct: 194 MLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHTT 252

Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAPIDEYG 300
            V D A  +    E G    N YM+ GGTNFG   G           TSYDYDA + E+G
Sbjct: 253 SVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLSPDVTSYDYDALLSEWG 310

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +  PK+   +++   I     + +++  T +  G                         
Sbjct: 311 DV-TPKYEAFQQVIGEITEIPSFPLTTKITKRAYG------------------------- 344

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
                          +W VS     +  +F T   ISQ    ++P        ELL  ++
Sbjct: 345 ---------------SWKVS----QRVSLFETLASISQPVKHNYPLTM-----ELLDQAT 380

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            + +Y  ++G S                 +   DY                  ++ +   
Sbjct: 381 GYVYYRSQIGKS-----------------RVIEDYR----------------LIHCQDRA 407

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           H    F+N +L    Y           K + L E  N L IL   +G  NY    +    
Sbjct: 408 HT---FINNQLQFIQYDQE----IGQKKTLTLTEESNELGILVENMGRVNYSVQMNHQYK 460

Query: 541 GLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           G+   +++   NG       EW IY + ++     LD++    S  W+ G          
Sbjct: 461 GIKDGVIV---NGA---FQSEWEIYSLPMD----NLDQVDF--SGHWQTGQP-------S 501

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
           + K +F   E      + L   GKG   +NG +IGR+W 
Sbjct: 502 FSKVSFQVDECADTF-VELPGWGKGFIVINGHNIGRFWE 539


>gi|383128332|gb|AFG44822.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
          Length = 157

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 5/153 (3%)

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW +Y+A   GCT  CDYRG+Y +SKC  +CGQP+Q LYH+PR+W+ P  N+
Sbjct: 1   VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQPTGNV 60

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
           LV+ EELGGDP++IS + ++   +C+ VSE   PPV SWK +    L V     +++L C
Sbjct: 61  LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120

Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
               H I +I F S+G P G CGSF  G C+ +
Sbjct: 121 PSSGHLIKSIKFVSFGTPTGRCGSFTYGHCNTN 153


>gi|297842039|ref|XP_002888901.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334742|gb|EFH65160.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 686

 Score =  159 bits (402), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 113/351 (32%), Positives = 161/351 (45%), Gaps = 39/351 (11%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DG    +  G +HY R  PE W + + ++K  GL  I+ YV WN HEP  G+  FEG  D
Sbjct: 72  DGNHFQIIGGDLHYFRVLPEYWEDRLLRAKALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 131

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI-PGIQFRTTNNPFKEEMKRFL 133
           LV F+K   +    + LR GPY C EW+ GGFP WL  + P +Q RT++  + + ++R+ 
Sbjct: 132 LVSFLKLCDKLDFMVMLRAGPYICGEWDLGGFPAWLLSVKPRLQLRTSDPAYLKLVERWW 191

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEWAYGVGGELYVKWAAD 182
             +  L K   L  S GGP+I+ Q+ENEYG+           V  A G  G+  + +  D
Sbjct: 192 GVL--LPKIFPLIYSNGGPVIMVQIENEYGSYGNDKAYLRKLVSMARGHLGDDIIVYTTD 249

Query: 183 ---------TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
                      V ++     V     D P PI      F   G      S P + +E Y+
Sbjct: 250 GGTKETLEKGTVPVDDVYSAVDFTTGDDPWPIFELQKKFNAPG------SSPPLSSEFYT 303

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GW   +G  +     E  A ++ +     G+    YM  GGTNFG   G    +      
Sbjct: 304 GWLTHWGEKIAKTDAEFTATSLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDYK 362

Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
              TSYDYDAPI E G I  PK+  L+ + K   +    +I S+   +  G
Sbjct: 363 PDLTSYDYDAPIKESGDIDNPKFRALQRVIKKYNVASHSIIPSNKQRKAYG 413


>gi|383128340|gb|AFG44826.1| Pinus taeda anonymous locus 2_7725_01 genomic sequence
          Length = 157

 Score =  159 bits (401), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 75/153 (49%), Positives = 104/153 (67%), Gaps = 5/153 (3%)

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW +Y+A   GCT  CDYRG+Y +SKC  +CG+P+Q LYH+PR+W+ P  N+
Sbjct: 1   VNGKSIGRYWPSYIASQGGCTDSCDYRGAYSSSKCLTNCGKPSQKLYHVPRSWIQPTGNV 60

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
           LV+ EELGGDP++IS + ++   +C+ VSE   PPV SWK +    L V     +++L C
Sbjct: 61  LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKGELQLHC 120

Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGACHMD 775
               H I +I FAS+G P G+CGSF  G C+ +
Sbjct: 121 PSSGHLIKSIKFASFGTPTGHCGSFTYGHCNTN 153


>gi|375146511|ref|YP_005008952.1| glycoside hydrolase family protein [Niastella koreensis GR20-10]
 gi|361060557|gb|AEV99548.1| glycoside hydrolase family 35 [Niastella koreensis GR20-10]
          Length = 920

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 106/312 (33%), Positives = 149/312 (47%), Gaps = 32/312 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A ++DG+   + SG +HYPR   E W + +RK+K  GL  I TYVFWN HEP +G+Y F 
Sbjct: 346 AFLLDGQPFQIISGEMHYPRVPREAWRDRMRKAKAMGLNTIGTYVFWNLHEPQKGKYDFS 405

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  D+  FVKT QE GL++ LR  PY CAEW +GG+P WL  I G++ R+    + +  K
Sbjct: 406 GNNDIAAFVKTAQEEGLWVILRPSPYVCAEWEFGGYPYWLQNIKGLEVRSKEPQYLQAYK 465

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYG-------VGGELYVKWAADT 183
            ++ ++   +    L  + GG I++ QVENEYG    AYG       +   L+++   D 
Sbjct: 466 NYIMQVGKQLAP--LQVNHGGNILMVQVENEYG----AYGSDREYLDINRRLFIEAGFDG 519

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP------NSPSKPIMWTENYSGWFL 237
              L T  P     + + P  +  + NG              N    P    E Y  WF 
Sbjct: 520 L--LYTCDPEPFLAKGNLPGKLFTSINGLDKPARIKQLIKQNNEGKGPYFVAEWYPAWFD 577

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------PLVAT 288
            +G      P E     +      G +  N YM+ GGT      G          P + +
Sbjct: 578 WWGTQHHKVPAEKYTPGLDSVLSAGMSV-NMYMFHGGTTRDFMNGANYNDQNPYEPQI-S 635

Query: 289 SYDYDAPIDEYG 300
           SYDYDAP+DE G
Sbjct: 636 SYDYDAPLDEAG 647



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 102/241 (42%), Gaps = 57/241 (23%)

Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIEL---NEGINTLDILSMM 525
           G+E  L I+ L    LVF+N K ++           L    I L   +E I  LDIL   
Sbjct: 728 GREGALKIKDLRDYGLVFINGKRISV------LDRRLKQDSIWLKLPDEKIQ-LDILVEN 780

Query: 526 VGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSF 585
           +G  NYG +      G+   +     NGK    +G  ++++     +  L+ ++L NS  
Sbjct: 781 LGRINYGPYLLKNKKGITEGVSF---NGKE--LTGWQMFKL----PFNDLNSVALKNS-- 829

Query: 586 WKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPST 645
            K  S  PV K     K TF + +  G   LNL + GKG  WVNG ++GRYW+       
Sbjct: 830 -KTLSGAPVLK-----KGTF-SLQTVGDTYLNLGNWGKGVVWVNGHNLGRYWNI------ 876

Query: 646 GCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLT 705
                                  P QTLY +P  W+  G N +++ E L  + S++  + 
Sbjct: 877 ----------------------GPQQTLY-VPVEWLKKGGNEIIVLELLKPEQSQLQAVD 913

Query: 706 K 706
           K
Sbjct: 914 K 914


>gi|256840666|ref|ZP_05546174.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
 gi|256737938|gb|EEU51264.1| glycoside hydrolase, family 35 [Parabacteroides sp. D13]
          Length = 768

 Score =  159 bits (401), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++GK   + SG +HYPR   + W   +R  +  GL  + TYVFWN HE   G++ FEG  
Sbjct: 39  VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  +++   E GL + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K ++
Sbjct: 99  NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
            K+ + +   +L  S+GGPII+ Q ENE+G+ V     +  E + ++ A     L  +  
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216

Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
            VP          E    P  + T NG             Y  G        P M  E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270

Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            GW +   +A PF  + D  +A     + +   +F N+YM  GGTNFG T+G        
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327

Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
                TSYDYDAPI E G++  PK+  +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357


>gi|261880887|ref|ZP_06007314.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
 gi|270332394|gb|EFA43180.1| family 35 glycosyl hydrolase [Prevotella bergensis DSM 17361]
          Length = 789

 Score =  159 bits (401), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 105/319 (32%), Positives = 157/319 (49%), Gaps = 20/319 (6%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++ +  V+++  +HYPR     W   I+  K  G+  I  YVFWN HE   G++ F 
Sbjct: 38  TFLLNNRPFVVKAAELHYPRIPRAYWDHRIKMCKALGMNTICLYVFWNIHEQREGEFDFS 97

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R ++  F E ++
Sbjct: 98  GNSDVAAFCRLTQKNGMYIIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVE 157

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVGGELYVKWAADTAVN 186
            F  K+ + +    L    GGPII+ QVENEYG    + ++   +   L   W  +    
Sbjct: 158 IFEQKVAEQLAP--LTIQNGGPIIMVQVENEYGSYGEDKKYVGQIRDVLRKYWYTNGRGP 215

Query: 187 LNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSF 239
                 W    +++  + +I T N   G   D          P  P M +E +SGWF  +
Sbjct: 216 ALFQCDWASNFEKNGLEDLIWTMNFGTGANIDAQFMRLGELRPDAPKMCSEFWSGWFDKW 275

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYDYDA 294
           G     RP +D+   +      G +F + YM  GGT+FG  AG   P  A   TSYDYDA
Sbjct: 276 GARHETRPAKDMVAGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYDYDA 334

Query: 295 PIDEYGFIRQPKWGHLREL 313
           PI+EYG +  PK+  LR++
Sbjct: 335 PINEYGQV-TPKFWELRKM 352


>gi|423331257|ref|ZP_17309041.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230553|gb|EKN23415.1| hypothetical protein HMPREF1075_01054 [Parabacteroides distasonis
           CL03T12C09]
          Length = 768

 Score =  159 bits (401), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++GK   + SG +HYPR   + W   +R  +  GL  + TYVFWN HE   G++ FEG  
Sbjct: 39  VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  +++   E GL + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K ++
Sbjct: 99  NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
            K+ + +   +L  S+GGPII+ Q ENE+G+ V     +  E + ++ A     L  +  
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216

Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
            VP          E    P  + T NG             Y  G        P M  E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270

Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            GW +   +A PF  + D  +A     + +   +F N+YM  GGTNFG T+G        
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327

Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
                TSYDYDAPI E G++  PK+  +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357


>gi|301309736|ref|ZP_07215675.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
 gi|423340209|ref|ZP_17317948.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831310|gb|EFK61941.1| beta-galactosidase (Lactase) [Bacteroides sp. 20_3]
 gi|409227644|gb|EKN20540.1| hypothetical protein HMPREF1059_03873 [Parabacteroides distasonis
           CL09T03C24]
          Length = 765

 Score =  159 bits (401), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++GK   + SG +HYPR   + W   +R  +  GL  + TYVFWN HE   G++ FEG  
Sbjct: 36  VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 95

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  +++   E GL + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K ++
Sbjct: 96  NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 155

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
            K+ + +   +L  S+GGPII+ Q ENE+G+ V     +  E + ++ A     L  +  
Sbjct: 156 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 213

Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
            VP          E    P  + T NG             Y  G        P M  E Y
Sbjct: 214 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 267

Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            GW +   +A PF  + D  +A     + +   +F N+YM  GGTNFG T+G        
Sbjct: 268 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 324

Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
                TSYDYDAPI E G++  PK+  +R +
Sbjct: 325 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 354


>gi|376338076|gb|AFB33583.1| hypothetical protein 2_7725_01, partial [Pinus mugo]
          Length = 157

 Score =  159 bits (401), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 75/150 (50%), Positives = 101/150 (67%), Gaps = 5/150 (3%)

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG+SIGRYW +Y+A  +GCT  CDYRG+Y +SKC  +CGQP+Q LYH+PR+W+    N+
Sbjct: 1   VNGKSIGRYWPSYIASQSGCTDSCDYRGAYSSSKCLTNCGQPSQKLYHVPRSWIQSTGNV 60

Query: 688 LVIHEELGGDPSKISLLTKTGQHICSFVSEADPPPVDSWKPN----LGVVSSSPQVRLAC 743
           LV+ EELGGDP++IS + ++   +C+ VSE   PPV SWK +    L V     +++L C
Sbjct: 61  LVLFEELGGDPTQISFMARSVGTVCARVSETHLPPVGSWKSSATSGLKVNKPKAELQLHC 120

Query: 744 ERGWH-IAAINFASYGIPEGNCGSFRPGAC 772
               H I +I FAS+G P G CGSF  G C
Sbjct: 121 PSSGHLIKSIKFASFGTPTGRCGSFTYGHC 150


>gi|296399420|gb|ADH10537.1| galactosidase, beta 1, 5 prime [Zonotrichia albicollis]
          Length = 571

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 110/342 (32%), Positives = 165/342 (48%), Gaps = 27/342 (7%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  + Y+  + V DGK     SGSIHY R     W + + K K  GL+ I+TYV WNYHE
Sbjct: 6   SFGIDYESNSFVKDGKPFRYISGSIHYSRVPSYYWKDRLLKMKMAGLDAIQTYVPWNYHE 65

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P  G Y F G  DL  F++   + GL + LR GPY CAEW+ GG P WL     I  R++
Sbjct: 66  PRMGTYDFFGGKDLEYFLQLANDTGLLVILRAGPYICAEWDMGGLPAWLLEKKSIVLRSS 125

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG------ 172
           ++ + E ++R++  ++  M+        GGPII+ QVENEYG+    ++ Y         
Sbjct: 126 DSDYLEAVERWMGVLLPKMRP--YLYQNGGPIIMVQVENEYGSYFACDYDYLRFLLKLFR 183

Query: 173 ---GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPI 226
              G   V +  D A   +    ++  +    + AP    N    F       + P  P+
Sbjct: 184 LHLGHEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPMGPL 239

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
           + +E Y+GW   +G+     P E +A  +      G    N YM+ GGTNF    G  + 
Sbjct: 240 VNSEFYTGWLDHWGHRHSVVPAETVAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMP 298

Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL 324
                TSYDYDAP+ E G + + K+  +R++   + +   +L
Sbjct: 299 YMPQPTSYDYDAPLSEAGDLTE-KYFTIRKVIGMVSVPRTFL 339


>gi|423226297|ref|ZP_17212763.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392629725|gb|EIY23731.1| hypothetical protein HMPREF1062_04949 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 1106

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 111/326 (34%), Positives = 156/326 (47%), Gaps = 39/326 (11%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  +  YVFWN HEP  G Y F 
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
            + DL  F +  Q+  +++ LR GPY CAEW  GG P WL     ++ R ++  F E + 
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   +   +K  +L  + GGPII+ QVENEYG+              V   +G G  L+
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNGIALF 533

Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
              WA++  +N    + W M           N   G   D          P+ P+M +E 
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
           +SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A  
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641

Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
            TSYDYDAPI E G    PK+  LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWALRE 666


>gi|150008152|ref|YP_001302895.1| beta-glycosidase [Parabacteroides distasonis ATCC 8503]
 gi|149936576|gb|ABR43273.1| glycoside hydrolase family 35, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
          Length = 768

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++GK   + SG +HYPR   + W   +R  +  GL  + TYVFWN HE   G++ FEG  
Sbjct: 39  VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  +++   E GL + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K ++
Sbjct: 99  NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
            K+ + +   +L  S+GGPII+ Q ENE+G+ V     +  E + ++ A     L  +  
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216

Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
            VP          E    P  + T NG             Y  G        P M  E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270

Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            GW +   +A PF  + D  +A     + +   +F N+YM  GGTNFG T+G        
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327

Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
                TSYDYDAPI E G++  PK+  +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357



 Score = 50.1 bits (118), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 105/256 (41%), Gaps = 52/256 (20%)

Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
           Y+ Y+   +  P +G+   L I  L   A ++V+ + V  G  N  F  + +   I  N 
Sbjct: 416 YVLYSTHFN-QPLKGR---LEIPGLRDYATIYVDGERV--GELNRCFNQYAMEIDIPFNA 469

Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYI 573
              TLDIL   +G  NYG        G+ S + I   NG       +W +Y++ ++    
Sbjct: 470 ---TLDILVENMGRINYGEEIVRNTKGIISSVKI---NGS---EISDWKMYKLPMDR--- 517

Query: 574 GLDKISLANSSFWKQGS--TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQ 631
            +  +       +K GS     +    + Y+ TF   +  G   +++   GKG  ++NG 
Sbjct: 518 -MPALVSGEPYVYKNGSPEVAALGNKPVLYEGTFHLSD-TGDTFIDMEDWGKGIIFINGV 575

Query: 632 SIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIH 691
           +IGRYW A                             P QTLY IP  W++ GEN +VI+
Sbjct: 576 NIGRYWYA----------------------------GPQQTLY-IPGVWLNKGENKIVIY 606

Query: 692 EELGGDPSKISLLTKT 707
           E+L  D        KT
Sbjct: 607 EQLNNDRKSSVRTVKT 622


>gi|357014284|ref|ZP_09079283.1| beta-galactosidase [Paenibacillus elgii B69]
          Length = 591

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 107/313 (34%), Positives = 153/313 (48%), Gaps = 40/313 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DG+   L SG+IHY R  PE W + + K K  G   +ETY+ WN HEP  GQ+ F+G  
Sbjct: 13  LDGESIRLVSGAIHYFRVVPEYWRDRLLKLKACGFNTVETYIPWNLHEPKPGQFRFDGLA 72

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D+VRFV+   E GL + +R  PY CAEW +GG P WL   PG++ R  + P+ + +  + 
Sbjct: 73  DVVRFVEIAGEVGLHVIVRPSPYICAEWEFGGLPAWLLADPGMRVRCMHRPYLDRVDAYY 132

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
              + L   + L  + GGPII  Q+ENEYG    +YG     Y+ +  D  +        
Sbjct: 133 D--VLLPLLKPLLCTNGGPIIAMQIENEYG----SYG-NDRAYLVYLKDAMLQRGMD--- 182

Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
           V+    D P+           ++ T N        F  +      P  PIM  E ++GWF
Sbjct: 183 VLLFTSDGPEHFMLQGGMIPGVLETVNFGSRAEEAF--EMLRKYQPDGPIMCMEYWNGWF 240

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------PLVA 287
             +G     R  +D+A         G +  N+YM+ GGTNFG  +G          P + 
Sbjct: 241 DHWGEQHHTRDAKDVADVFDDMLRLGASV-NFYMFHGGTNFGYMSGANCPQRDHYEPTI- 298

Query: 288 TSYDYDAPIDEYG 300
           TSYDYD P++E G
Sbjct: 299 TSYDYDVPLNESG 311


>gi|255015104|ref|ZP_05287230.1| beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410104527|ref|ZP_11299440.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
 gi|409234336|gb|EKN27166.1| hypothetical protein HMPREF0999_03212 [Parabacteroides sp. D25]
          Length = 768

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++GK   + SG +HYPR   + W   +R  +  GL  + TYVFWN HE   G++ FEG  
Sbjct: 39  VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  +++   E GL + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K ++
Sbjct: 99  NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
            K+ + +   +L  S+GGPII+ Q ENE+G+ V     +  E + ++ A     L  +  
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216

Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
            VP          E    P  + T NG             Y  G        P M  E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270

Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            GW +   +A PF  + D  +A     + +   +F N+YM  GGTNFG T+G        
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327

Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
                TSYDYDAPI E G++  PK+  +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357



 Score = 49.7 bits (117), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 69/259 (26%), Positives = 106/259 (40%), Gaps = 58/259 (22%)

Query: 455 YLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNE 514
           Y+ Y+   +  P +G+   L I  L   A ++V+ + V  G  N  F  + +   I  N 
Sbjct: 416 YVLYSTHFN-QPLKGR---LEIPGLRDYATIYVDGERV--GELNRCFNQYAMEIDIPFNA 469

Query: 515 GINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYI 573
              TLDIL   +G  NYG        G+ S + I   NG       +W +Y+       +
Sbjct: 470 ---TLDILVENMGRINYGEEIVRNTKGIISSVKI---NGS---EISDWKMYK-------L 513

Query: 574 GLDKISLANSS---FWKQGS--TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
            +D++    S     +K GS     +    + Y+ TF   +  G   +++   GKG  ++
Sbjct: 514 PMDRMPALVSDEPYVYKNGSPEVAALGNKPVLYEGTFHLSD-TGDTFIDMEDWGKGIIFI 572

Query: 629 NGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLL 688
           NG +IGRYW A                             P QTLY IP  W++ GEN +
Sbjct: 573 NGVNIGRYWYA----------------------------GPQQTLY-IPGVWLNKGENKI 603

Query: 689 VIHEELGGDPSKISLLTKT 707
           VI+E+L  D        KT
Sbjct: 604 VIYEQLNNDRKSSVRTVKT 622


>gi|298376422|ref|ZP_06986377.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
 gi|298266300|gb|EFI07958.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_19]
          Length = 768

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 162/331 (48%), Gaps = 43/331 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++GK   + SG +HYPR   + W   +R  +  GL  + TYVFWN HE   G++ FEG  
Sbjct: 39  VNGKVTPILSGEMHYPRIPHQYWRHRLRMMRAMGLNTVATYVFWNLHETEPGKWDFEGDK 98

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  +++   E GL + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K ++
Sbjct: 99  NLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNPEFLKRTKLYI 158

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS-- 190
            K+ + +   +L  S+GGPII+ Q ENE+G+ V     +  E + ++ A     L  +  
Sbjct: 159 DKLYEQVG--DLQVSKGGPIIMVQAENEFGSYVAQRKDIPLEEHRRYNAKIKRQLADAGF 216

Query: 191 -VPWVMCQQ----EDAPDP-IINTCNG------------FYCDGFTPNSPSKPIMWTENY 232
            VP          E    P  + T NG             Y  G        P M  E Y
Sbjct: 217 NVPLFTSDGSWLFEGGSTPGALPTANGESNVENLKKVVNEYHGGVG------PYMVAEFY 270

Query: 233 SGWFLSFGYAVPFRPVED--LAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            GW +   +A PF  + D  +A     + +   +F N+YM  GGTNFG T+G        
Sbjct: 271 PGWLMH--WAEPFPDISDSGIARQTETYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKHD 327

Query: 288 -----TSYDYDAPIDEYGFIRQPKWGHLREL 313
                TSYDYDAPI E G++  PK+  +R +
Sbjct: 328 IQPDLTSYDYDAPISEAGWV-TPKFDSIRNV 357


>gi|357050010|ref|ZP_09111224.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
           30_1]
 gi|355382493|gb|EHG29591.1| hypothetical protein HMPREF9478_01207 [Enterococcus saccharolyticus
           30_1]
          Length = 584

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 167/639 (26%), Positives = 257/639 (40%), Gaps = 143/639 (22%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SGSIHY R  P  W + + K +  G   +ETYV WN HEP  G++ F    DL RF++  
Sbjct: 21  SGSIHYFRVVPAYWRDRLEKLRLMGCNTVETYVPWNMHEPQEGKFDFSDNLDLRRFIQLA 80

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
           QE GL++ LR  PY CAEW +GG P WL   P ++ R    PF E++ R+  ++   +  
Sbjct: 81  QEVGLYVILRPAPYICAEWEFGGLPYWLLKDPFMKIRFDYPPFMEKIARYFTQLFSQVS- 139

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-------PWVM 195
            +L  +Q GPI++ QVENEYG    +YG   + Y++ +A+   +    V       PW+ 
Sbjct: 140 -DLQITQEGPILMMQVENEYG----SYG-NDKSYLRKSAELMRHNGIDVPLFTSDGPWLD 193

Query: 196 CQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSFGYAVPF-R 246
             +    +D   P IN C     + F      +   +P+M  E + GWF ++G       
Sbjct: 194 MLENGSIKDIALPTIN-CGSDIQENFRKLQEFHGKKQPLMVMEFWIGWFDAWGDDKHHTT 252

Query: 247 PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAPIDEYG 300
            V D A  +    E G    N YM+ GGTNFG   G           TSYDYDA + E+G
Sbjct: 253 SVTDAANELRDCLEAGSV--NIYMFHGGTNFGFMNGANYYEKLLPDVTSYDYDALLSEWG 310

Query: 301 FIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAFLANYDSS 360
            +  PK+   +++   I     + +++  T +  G                         
Sbjct: 311 DV-TPKYEAFQQVIGEITEIPSFPLTTKITKRAYG------------------------- 344

Query: 361 SDANVTFNGNVYFLPAWSVSILPDCKNVVFNTAKVISQRNNGDHPFAQQKNVNELLLASS 420
                          +W VS     +  +F T   ISQ    ++P        ELL  ++
Sbjct: 345 ---------------SWKVS----QRVSLFETLASISQPVKHNYPLTM-----ELLDQAT 380

Query: 421 AFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLG 480
            + +Y  ++G S                 +   DY                  ++ +   
Sbjct: 381 GYVYYRSQIGKS-----------------RVIEDYR----------------LIHCQDRA 407

Query: 481 HAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGA 540
           H    F+N +L    Y           K + L E  N L IL   +G  NY    +    
Sbjct: 408 HT---FINNQLQFIQYDQE----IGQKKTLTLTEESNELGILVENMGRVNYSVQMNHQYK 460

Query: 541 GLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLI 599
           G+   +++   NG       EW IY + ++     LD++    S  W+ G          
Sbjct: 461 GIKDGVIV---NGA---FQSEWEIYSLPMD----NLDQVDF--SGHWQTGQP-------S 501

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
           + K +F   E      + L   GKG   +NG +IGR+W 
Sbjct: 502 FSKVSFQVDECADTF-VELPGWGKGFIVINGHNIGRFWE 539


>gi|257875465|ref|ZP_05655118.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
 gi|257809631|gb|EEV38451.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC20]
          Length = 585

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/309 (35%), Positives = 153/309 (49%), Gaps = 32/309 (10%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +D K   + SG+IHY R  PE W + + K +  G   +ETYV WN HE   G Y F+G  
Sbjct: 12  LDNKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T QE GL++ LR  PY CAEW +GG P WL   P ++ R    PF E++ R+ 
Sbjct: 72  DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
           A +   ++  +L  +QGGPII+ QVENEYG    +Y    E   K  A    +      +
Sbjct: 132 AHLFPQVR--DLQITQGGPIIMMQVENEYG----SYANDKEYLRKMVAAMRQHGVETPLV 185

Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
            +  PW    +    +D   P IN C     + F      +   +P+M  E + GWF ++
Sbjct: 186 TSDGPWHDMLENGSIKDLALPTIN-CGSNIKENFEKLRKFHGEKRPLMVMEFWIGWFDAW 244

Query: 240 GYAVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVATSYD 291
           G        ++D    +      G    N YM+ GGTNFG   G        P V TSYD
Sbjct: 245 GDDQHHTTSIQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNYYERLAPDV-TSYD 301

Query: 292 YDAPIDEYG 300
           YDA + E+G
Sbjct: 302 YDALLTEWG 310


>gi|239986962|ref|ZP_04707626.1| putative beta-galactosidase [Streptomyces roseosporus NRRL 11379]
          Length = 606

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 126/398 (31%), Positives = 179/398 (44%), Gaps = 55/398 (13%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           + T D      DGK   L SG++HY R   E W   +      GL  +ETYV WN HEP 
Sbjct: 3   DFTVDDDGFRFDGKPVRLLSGALHYFRVHEEQWGHRLAVLAAMGLNCVETYVPWNLHEPR 62

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G+    G   L RF+  V+ AGL+  +R GPY CAEW  GG PVW+    G + RT + 
Sbjct: 63  EGEVRDVG--ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDA 120

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            ++  ++R+  +++  + +  +   +GGP+IL Q ENEYG+    +G    +Y++W A  
Sbjct: 121 EYRAVVERWFRELLPQVVERQVV--RGGPVILVQAENEYGS----FG-SDAVYLEWLAGL 173

Query: 184 AVNLNTSVPWVMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPI 226
                 +VP       D P+           ++ T N       GF       + P  P+
Sbjct: 174 LRECGVTVPLFTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFAV--LRRHQPKGPL 228

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAG 282
           M  E + GWF  +G     R  E+ A A+    E G +  N YM  GGTNF    G   G
Sbjct: 229 MCMEFWCGWFDHWGAEPVLRDAEEAAGALREILECGASV-NIYMAHGGTNFAGWAGANRG 287

Query: 283 GPL-------VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEY----LISSDPTH 331
           GPL         TSYDYDAP+DEYG   +       + H   K+ E Y    L +  P  
Sbjct: 288 GPLQDGEFQPTVTSYDYDAPVDEYGRATE-------KFHLFRKVLEGYAQRPLPALPPEP 340

Query: 332 QKLGAKLEAHIYHKSS-NDCAAFLANYDSSSDANVTFN 368
           Q L   + A +   +   D    L + ++ S    TF 
Sbjct: 341 QGLAGPVRAELTGWAGLGDVLEALGDPETESGVPPTFE 378


>gi|445497922|ref|ZP_21464777.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
 gi|444787917|gb|ELX09465.1| beta-galactosidase BgaC [Janthinobacterium sp. HH01]
          Length = 624

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 157/324 (48%), Gaps = 34/324 (10%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DG+  V++SG +HYPR     W E +R ++  GL  + TY FW+ HEP  GQ+ F G+ 
Sbjct: 42  LDGQPFVIRSGEMHYPRIPRAAWRERLRMARAMGLNTVTTYAFWSQHEPEPGQWSFSGQN 101

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL  F+KT  E GL + LR GPY CAE ++GGFP WL    G++ R+ +  +     R+ 
Sbjct: 102 DLRTFIKTAAEEGLNVVLRPGPYVCAEVDFGGFPAWLMRTQGLRVRSMDARYLAASARYF 161

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            ++    +  +L +S+GGPI++ Q+ENEYG+    YG   + Y++             P 
Sbjct: 162 KRLA--QEVADLQSSRGGPILMLQLENEYGS----YGRDHD-YLRAVRTQMRQAGFDAPL 214

Query: 194 VMCQQ-----------EDAPDPIINTCNG-----FYCDGFTPNSPSKPIMWTENYSGWFL 237
                            D P  ++N   G               P  P M  E ++GWF 
Sbjct: 215 FTSDGGAGRLFEGGTLADVP-AVVNFGGGADDAQASVQELAAWRPHGPRMAGEYWAGWFD 273

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--------TS 289
            +G     +  E+ A  V R    G +F N YM+ GGT+FG  AG             TS
Sbjct: 274 HWGEQHHTQSPEEAARTVERMLSQGVSF-NLYMFHGGTSFGWLAGANYSGSEPYQPDTTS 332

Query: 290 YDYDAPIDEYGFIRQPKWGHLREL 313
           YDYDA +DE G    PK+  LR++
Sbjct: 333 YDYDAALDEAGRP-TPKYFALRDV 355


>gi|402304595|ref|ZP_10823662.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
 gi|400380871|gb|EJP33679.1| glycosyl hydrolase family 35 [Prevotella sp. MSX73]
          Length = 778

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 111/320 (34%), Positives = 157/320 (49%), Gaps = 19/320 (5%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           T   +  +++GK  V+++  +HYPR     W   I+  K  G+  +  YVFWN HE   G
Sbjct: 22  TVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEG 81

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           ++ F G  D+  F +  Q  GL++ +R GPY CAEW  GG P WL     I+ R  +  F
Sbjct: 82  KFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREPDPYF 141

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADT 183
            E +K F  K+ + +   +L    GGPII+ QVENEYG+     AY       V+ +   
Sbjct: 142 MERVKLFERKVGEQLA--SLTIQNGGPIIMVQVENEYGSYGKNKAYVSAIRDIVRRSGFD 199

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSGWF 236
            V L     W    +++  D ++ T N G   D            P+ P M +E +SGWF
Sbjct: 200 KVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWF 258

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
             +G     RP + +   +      G +F + YM  GGT+FG  AG   P  A   TSYD
Sbjct: 259 DKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 317

Query: 292 YDAPIDEYGFIRQPKWGHLR 311
           YDAPI+EYG    PK+  LR
Sbjct: 318 YDAPINEYGQA-TPKYWELR 336


>gi|421514041|ref|ZP_15960756.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
 gi|401672838|gb|EJS79281.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
          Length = 611

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   IETY+ WN HEP+ G Y FEG
Sbjct: 10  FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V FV   QE GL + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 70  MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           + + +  L K   L  + GGP+I+ QVENEYG    +YG+  E Y++            V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
           P  +   + A + +++       D F      S SK                 PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
              TSYDYDA + E G   + K+ H++   +AIK +C E +  ++P  +  G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345


>gi|229545563|ref|ZP_04434288.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
 gi|256619317|ref|ZP_05476163.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
 gi|256853375|ref|ZP_05558745.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
 gi|256964870|ref|ZP_05569041.1| beta-galactosidase [Enterococcus faecalis HIP11704]
 gi|257090147|ref|ZP_05584508.1| beta-galactosidase [Enterococcus faecalis CH188]
 gi|294614275|ref|ZP_06694194.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
 gi|307272958|ref|ZP_07554205.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
 gi|307277803|ref|ZP_07558888.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
 gi|307291733|ref|ZP_07571605.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
 gi|384518848|ref|YP_005706153.1| beta-galactosidase [Enterococcus faecalis 62]
 gi|422685728|ref|ZP_16743941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
 gi|422689100|ref|ZP_16747212.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
 gi|422720655|ref|ZP_16777264.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
 gi|422731066|ref|ZP_16787446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
 gi|422739263|ref|ZP_16794446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
 gi|430849460|ref|ZP_19467237.1| glycosyl hydrolase [Enterococcus faecium E1185]
 gi|229309303|gb|EEN75290.1| possible beta-galactosidase [Enterococcus faecalis TX1322]
 gi|256598844|gb|EEU18020.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
 gi|256711834|gb|EEU26872.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
 gi|256955366|gb|EEU71998.1| beta-galactosidase [Enterococcus faecalis HIP11704]
 gi|256998959|gb|EEU85479.1| beta-galactosidase [Enterococcus faecalis CH188]
 gi|291592934|gb|EFF24524.1| glycosyl hydrolase, family 35 [Enterococcus faecium E1636]
 gi|306497185|gb|EFM66730.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
 gi|306505543|gb|EFM74728.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
 gi|306510572|gb|EFM79595.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
 gi|315029440|gb|EFT41372.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
 gi|315032046|gb|EFT43978.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
 gi|315144925|gb|EFT88941.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
 gi|315162898|gb|EFU06915.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
 gi|315577862|gb|EFU90053.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
 gi|323480981|gb|ADX80420.1| beta-galactosidase [Enterococcus faecalis 62]
 gi|430537598|gb|ELA77922.1| glycosyl hydrolase [Enterococcus faecium E1185]
          Length = 611

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   IETY+ WN HEP+ G Y FEG
Sbjct: 10  FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V FV   QE GL + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 70  MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           + + +  L K   L  + GGP+I+ QVENEYG    +YG+  E Y++            V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
           P  +   + A + +++       D F      S SK                 PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
              TSYDYDA + E G   + K+ H++   +AIK +C E +  ++P  +  G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345


>gi|312903586|ref|ZP_07762766.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
 gi|310633462|gb|EFQ16745.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
          Length = 611

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   IETY+ WN HEP+ G Y FEG
Sbjct: 10  FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V FV   QE GL + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 70  MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           + + +  L K   L  + GGP+I+ QVENEYG    +YG+  E Y++            V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
           P  +   + A + +++       D F      S SK                 PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
              TSYDYDA + E G   + K+ H++   +AIK +C E +  ++P  +  G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345


>gi|154490061|ref|ZP_02030322.1| hypothetical protein PARMER_00290 [Parabacteroides merdae ATCC
           43184]
 gi|423723056|ref|ZP_17697209.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
           CL09T00C40]
 gi|154089210|gb|EDN88254.1| glycosyl hydrolase family 35 [Parabacteroides merdae ATCC 43184]
 gi|409241481|gb|EKN34249.1| hypothetical protein HMPREF1078_01269 [Parabacteroides merdae
           CL09T00C40]
          Length = 780

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/317 (34%), Positives = 154/317 (48%), Gaps = 19/317 (5%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  IHY R   E W   I+  K  G+  I  Y FWN HE   G++ F+
Sbjct: 39  TFLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFK 98

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ LR GPY C+EW  GG P WL     I+ RT +  F E  K
Sbjct: 99  GQNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTK 158

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVGGELYVKWAADTAVNLN 188
            F+ +I   +   +L  ++GG II+ QVENEYG    + AY       VK A  T V L 
Sbjct: 159 LFMNEIGKQLA--DLQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVKAAGFTDVPL- 215

Query: 189 TSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFGY 241
               W    Q +  D ++ T N   G   D          P  P+M +E +SGWF  +G 
Sbjct: 216 FQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGR 275

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDYDAPI 296
               R    +   +    +   +F + YM  GGT FG   G        + +SYDYDAPI
Sbjct: 276 KHETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPI 334

Query: 297 DEYGFIRQPKWGHLREL 313
            E G+   PK+  LREL
Sbjct: 335 SEAGWA-TPKYYKLREL 350



 Score = 43.1 bits (100), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 30/96 (31%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +Y+TTF   E  G + L++ + GKG  WVNG+++GR+W                      
Sbjct: 534 YYRTTFELDE-VGDVFLDMQTWGKGMVWVNGKAMGRFWEI-------------------- 572

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                    P QTL+ +P  W+  G+N ++I + LG
Sbjct: 573 --------GPQQTLF-MPGCWLKKGKNEIIILDLLG 599


>gi|315606512|ref|ZP_07881527.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
 gi|315251918|gb|EFU31892.1| family 35 glycosyl hydrolase [Prevotella buccae ATCC 33574]
          Length = 787

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 111/320 (34%), Positives = 157/320 (49%), Gaps = 19/320 (5%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           T   +  +++GK  V+++  +HYPR     W   I+  K  G+  +  YVFWN HE   G
Sbjct: 31  TVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEG 90

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           ++ F G  D+  F +  Q  GL++ +R GPY CAEW  GG P WL     I+ R  +  F
Sbjct: 91  RFDFTGNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREPDPYF 150

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADT 183
            E +K F  K+ + +   +L    GGPII+ QVENEYG+     AY       V+ +   
Sbjct: 151 MERVKLFERKVGEQLA--SLTIQNGGPIIMVQVENEYGSYGENKAYVSAIRDIVRQSGFD 208

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSGWF 236
            V L     W    +++  D ++ T N G   D            P+ P M +E +SGWF
Sbjct: 209 KVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWF 267

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
             +G     RP + +   +      G +F + YM  GGT+FG  AG   P  A   TSYD
Sbjct: 268 DKWGARHETRPAKAMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 326

Query: 292 YDAPIDEYGFIRQPKWGHLR 311
           YDAPI+EYG    PK+  LR
Sbjct: 327 YDAPINEYGQA-TPKYWELR 345


>gi|445495533|ref|ZP_21462577.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
 gi|444791694|gb|ELX13241.1| beta-galactosidase Bga [Janthinobacterium sp. HH01]
          Length = 586

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 108/306 (35%), Positives = 154/306 (50%), Gaps = 36/306 (11%)

Query: 19  RVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRF 78
           RVL SG++HY R  PE+W + + K K  GL  +ETYV WN HEP  GQ+ +EG  DL  F
Sbjct: 23  RVL-SGALHYFRVLPELWEDRLLKLKAMGLNTVETYVAWNLHEPAAGQFRYEGGLDLAAF 81

Query: 79  VKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIID 138
           ++  +  GL++ +R GP+ CAEW +GG P WL   P ++ R    P+ E ++RF   ++ 
Sbjct: 82  IRLAESLGLYVIVRPGPFICAEWEFGGLPAWLLADPYMEVRCCYQPYLEAVRRFYDDLLP 141

Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQ 198
            +    +   +GGPI+  QVENEYG    +YG   +LY+ W     + L+  V  ++   
Sbjct: 142 RLLPLQI--QRGGPILAMQVENEYG----SYG-SDQLYLTWL--RRLMLDGGVETLLFTS 192

Query: 199 EDAPDPII---NTCNGFYCDGFTPNS-----------PSKPIMWTENYSGWFLSFGYAVP 244
           + A D ++        +    F   +           P  P+M  E ++GWF  +G    
Sbjct: 193 DGATDHMLKHGTLAQVWKSANFGSRAEEEFAKLREYQPDGPLMCMEFWNGWFDHWGEPHH 252

Query: 245 FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----------PLVATSYDYDA 294
            R   D A A+ R    G    N YM+ GGTNFG   G           P V  SYDYDA
Sbjct: 253 TRDAADAADALERIMACGAHV-NVYMFHGGTNFGFMNGANTDLLTRDYQPTV-NSYDYDA 310

Query: 295 PIDEYG 300
           P+DE G
Sbjct: 311 PLDETG 316


>gi|373460889|ref|ZP_09552639.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
 gi|371954714|gb|EHO72523.1| hypothetical protein HMPREF9944_00903 [Prevotella maculosa OT 289]
          Length = 780

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 109/327 (33%), Positives = 159/327 (48%), Gaps = 27/327 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            + T      +++G+  V+++  +HYPR     W + I+  K  G+  +  YVFWN HE 
Sbjct: 26  GDFTVGKNTFLLNGRPFVIKAAELHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQ 85

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             GQ+ F G  D+  F +   + G+++ +R GPY CAEW  GG P WL     ++ R  +
Sbjct: 86  REGQFDFTGNNDVAAFCRLAHKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVRLREDD 145

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY------ 176
             F   +K F A++   +    L    GGPII+ QVENEYG    +YG+  +        
Sbjct: 146 PYFMARVKAFEAEVGRQLAP--LTIQNGGPIIMVQVENEYG----SYGINKKYVSEIRDI 199

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWT 229
           VK +    V L     W    + +  D ++ T N   G   D          P  P+M +
Sbjct: 200 VKASGFDKVTL-FQCDWASNFEHNGLDDLVWTMNFGTGANIDEQFRRLKQLRPEAPLMCS 258

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
           E +SGWF  +G     RP +D+   +      G +F + YM  GGT+FG  AG   P  A
Sbjct: 259 EFWSGWFDKWGARHETRPAKDMVEGIDEMLRKGISF-SLYMTHGGTSFGHWAGANSPGFA 317

Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLR 311
              TSYDYDAPI+EYG +  PK+  LR
Sbjct: 318 PDVTSYDYDAPINEYG-MPTPKFFALR 343



 Score = 40.4 bits (93), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 42/100 (42%), Gaps = 29/100 (29%)

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
           K  I Y   +   +  G   LNL   GKGQ +VNG ++GR+W                  
Sbjct: 535 KQNIGYYRGYFDLKKTGDTFLNLEQWGKGQVYVNGHALGRFW------------------ 576

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                    H G P QTLY +P  W+  G N +++ + +G
Sbjct: 577 ---------HIG-PQQTLY-LPGCWLKKGRNEIIVLDVVG 605


>gi|29376389|ref|NP_815543.1| glycosyl hydrolase [Enterococcus faecalis V583]
 gi|227519038|ref|ZP_03949087.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
 gi|227553661|ref|ZP_03983710.1| possible beta-galactosidase [Enterococcus faecalis HH22]
 gi|256961654|ref|ZP_05565825.1| beta-galactosidase [Enterococcus faecalis Merz96]
 gi|293383358|ref|ZP_06629271.1| beta-galactosidase [Enterococcus faecalis R712]
 gi|293388990|ref|ZP_06633475.1| beta-galactosidase [Enterococcus faecalis S613]
 gi|312907816|ref|ZP_07766806.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
 gi|312910433|ref|ZP_07769280.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
 gi|422714340|ref|ZP_16771066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
 gi|422715597|ref|ZP_16772313.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
 gi|424676484|ref|ZP_18113355.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
 gi|424681702|ref|ZP_18118489.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
 gi|424685588|ref|ZP_18122282.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
 gi|424686206|ref|ZP_18122874.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
 gi|424690524|ref|ZP_18127059.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
 gi|424694932|ref|ZP_18131318.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
 gi|424696643|ref|ZP_18132984.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
 gi|424700339|ref|ZP_18136532.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
 gi|424703758|ref|ZP_18139884.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
 gi|424712611|ref|ZP_18144783.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
 gi|424718249|ref|ZP_18147501.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
 gi|424721894|ref|ZP_18150963.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
 gi|424723972|ref|ZP_18152924.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
 gi|424733572|ref|ZP_18162127.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
 gi|424741709|ref|ZP_18170052.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
 gi|424751990|ref|ZP_18179997.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
 gi|29343852|gb|AAO81613.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
 gi|227073538|gb|EEI11501.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
 gi|227177203|gb|EEI58175.1| possible beta-galactosidase [Enterococcus faecalis HH22]
 gi|256952150|gb|EEU68782.1| beta-galactosidase [Enterococcus faecalis Merz96]
 gi|291079149|gb|EFE16513.1| beta-galactosidase [Enterococcus faecalis R712]
 gi|291081771|gb|EFE18734.1| beta-galactosidase [Enterococcus faecalis S613]
 gi|310626177|gb|EFQ09460.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
 gi|311289706|gb|EFQ68262.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
 gi|315575942|gb|EFU88133.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
 gi|315580774|gb|EFU92965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
 gi|402350621|gb|EJU85522.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
 gi|402356496|gb|EJU91227.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
 gi|402358329|gb|EJU93003.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
 gi|402364102|gb|EJU98549.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
 gi|402367740|gb|EJV02077.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
 gi|402369105|gb|EJV03397.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
 gi|402374029|gb|EJV08075.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
 gi|402377412|gb|EJV11319.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
 gi|402379869|gb|EJV13650.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
 gi|402382152|gb|EJV15835.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
 gi|402384002|gb|EJV17579.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
 gi|402390099|gb|EJV23464.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
 gi|402391584|gb|EJV24885.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
 gi|402396442|gb|EJV29504.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
 gi|402401146|gb|EJV33935.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
 gi|402404973|gb|EJV37581.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
          Length = 611

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   IETY+ WN HEP+ G Y FEG
Sbjct: 10  FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V FV   QE GL + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 70  MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           + + +  L K   L  + GGP+I+ QVENEYG    +YG+  E Y++            V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
           P  +   + A + +++       D F      S SK                 PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
              TSYDYDA + E G   + K+ H++   +AIK +C E +  ++P  +  G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345


>gi|423346501|ref|ZP_17324189.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
           CL03T12C32]
 gi|409219652|gb|EKN12612.1| hypothetical protein HMPREF1060_01861 [Parabacteroides merdae
           CL03T12C32]
          Length = 780

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 110/317 (34%), Positives = 154/317 (48%), Gaps = 19/317 (5%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  IHY R   E W   I+  K  G+  I  Y FWN HE   G++ F+
Sbjct: 39  TFLLDGKPFVIKAAEIHYTRIPAEYWQHRIQMCKALGMNTICIYAFWNIHEQKPGEFDFK 98

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ LR GPY C+EW  GG P WL     I+ RT +  F E  K
Sbjct: 99  GQNDIAAFCRLAQKEGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIKLRTNDPYFLERTK 158

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVGGELYVKWAADTAVNLN 188
            F+ +I   +   +L  ++GG II+ QVENEYG    + AY       VK A  T V L 
Sbjct: 159 LFMNEIGKQLA--DLQVTRGGNIIMVQVENEYGAYATDKAYIANIRDAVKAAGFTDVPL- 215

Query: 189 TSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFLSFGY 241
               W    Q +  D ++ T N   G   D          P  P+M +E +SGWF  +G 
Sbjct: 216 FQCDWSSTFQLNGLDDLVWTINFGTGANIDAQFKKLKEARPDAPLMCSEFWSGWFDHWGR 275

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDYDAPI 296
               R    +   +    +   +F + YM  GGT FG   G        + +SYDYDAPI
Sbjct: 276 KHETRDAGVMVSGIKDMLDRHISF-SLYMAHGGTTFGHWGGANSPAYSAMCSSYDYDAPI 334

Query: 297 DEYGFIRQPKWGHLREL 313
            E G+   PK+  LREL
Sbjct: 335 SEAGWA-TPKYYKLREL 350



 Score = 43.1 bits (100), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 45/96 (46%), Gaps = 30/96 (31%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +Y+TTF   E  G + L++ + GKG  WVNG+++GR+W                      
Sbjct: 534 YYRTTFELDE-VGDVFLDMQTWGKGMVWVNGKAMGRFWEI-------------------- 572

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
                    P QTL+ +P  W+  G+N ++I + LG
Sbjct: 573 --------GPQQTLF-MPGCWLKKGKNEIIILDLLG 599


>gi|395775444|ref|ZP_10455959.1| glycosyl hydrolase family 42 [Streptomyces acidiscabies 84-104]
          Length = 587

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 107/319 (33%), Positives = 154/319 (48%), Gaps = 29/319 (9%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++G+   + SG++HY R  P+ W + +RK++  GL  +ETYV WN H+P  G    +G  
Sbjct: 13  LNGEPFRIISGALHYFRVHPDQWADRLRKARLMGLNTVETYVPWNLHQPEPGTLVLDGLL 72

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++     GL + LR GPY CAEW+ GG P WL     +Q R+++  F   + R+L
Sbjct: 73  DLPRFLRLAHAEGLKVLLRPGPYICAEWDGGGLPHWLMSESDVQLRSSDPKFTAIIDRYL 132

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
             ++  +      A  GGP+I  QVENEYG    AYG   E Y+K+  +   +       
Sbjct: 133 DLLLPPLLPH--MAESGGPVIAVQVENEYG----AYGNDAE-YLKYLVEAFRSRGIEELL 185

Query: 194 VMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSFGY 241
             C Q +       +  G    G               + P  P+M  E + GWF  +G 
Sbjct: 186 FTCDQVNPEHQQAGSIPGVLSTGTFGGKIETALATLRAHQPEGPLMCAEFWIGWFDHWGG 245

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVATSYDYDA 294
               R   D+A  + +    G +  N YM+ GGTNFG T G        P + TSYDYDA
Sbjct: 246 PHHTRDTADVAADLDKLLAAGASV-NIYMFHGGTNFGLTNGANHHHTYAPTI-TSYDYDA 303

Query: 295 PIDEYGFIRQPKWGHLREL 313
           P+ E G    PK+   RE+
Sbjct: 304 PLTENG-DPGPKYHAFREV 321


>gi|414160019|ref|ZP_11416290.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
           ACS-120-V-Sch1]
 gi|410878669|gb|EKS26539.1| hypothetical protein HMPREF9310_00664 [Staphylococcus simulans
           ACS-120-V-Sch1]
          Length = 597

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 106/314 (33%), Positives = 156/314 (49%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN+HE + G++ F G
Sbjct: 10  FMLDGKPLKILSGAIHYFRVLPEDWEHSLYNLKALGFNAVETYVPWNFHETVEGEFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+ RF+ T +  GL++ +R  PY CAEW +GG P WL   P ++ R+ +  F E ++R
Sbjct: 70  TKDIKRFIHTAEAIGLYVIIRPSPYICAEWEFGGLPAWLLTKPNLRVRSRDPQFLEYVER 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +  ++ +++    L     GPI++ QVENEYG    +YG   + Y+   A    +   +V
Sbjct: 130 YYDRLFEILTP--LQIDHHGPILMMQVENEYG----SYG-EDKTYLSALARMMRDRGVTV 182

Query: 192 P-------WVMCQQED--APDPIINTCN-GFYCDGFTPN--------SPSKPIMWTENYS 233
           P       W  C +    A   II T N G        N          + P+M  E + 
Sbjct: 183 PLFTSDGSWQQCLEAGSLAEADIIPTGNFGSKSQKRLDNLHKFHQQFGKTWPLMSMEFWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL---V 286
           GWF  +G  +  R  ++L   +    + G    N YM+ GGTNFG     +A G +    
Sbjct: 243 GWFNRWGDRIITRQSDELIDEIGEVLKRGSI--NLYMFHGGTNFGFWNGCSARGRIDLPQ 300

Query: 287 ATSYDYDAPIDEYG 300
            TSYDYDAP+DE G
Sbjct: 301 VTSYDYDAPLDEAG 314


>gi|257865837|ref|ZP_05645490.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
 gi|257872172|ref|ZP_05651825.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
 gi|257799771|gb|EEV28823.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC30]
 gi|257806336|gb|EEV35158.1| 35 glycosylhydrolase [Enterococcus casseliflavus EC10]
          Length = 585

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/387 (31%), Positives = 180/387 (46%), Gaps = 40/387 (10%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +D K   + SG+IHY R  PE W + + K +  G   +ETYV WN HE   G Y F+G  
Sbjct: 12  LDNKPLKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFDGIL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T QE GL++ LR  PY CAEW +GG P WL   P ++ R    PF E++ R+ 
Sbjct: 72  DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
           A +   ++  +L  +QGGPII+ QVENEYG    +Y    E   K  A    +      +
Sbjct: 132 AHLFPQVR--DLQITQGGPIIMMQVENEYG----SYANDKEYLRKMVAAMRQHGVETPLV 185

Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
            +  PW    +    +D   P IN C     + F      +   +P+M  E + GWF ++
Sbjct: 186 TSDGPWHDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244

Query: 240 GYAVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVATSYD 291
           G         +D    +      G    N YM+ GGTNFG   G        P V TSYD
Sbjct: 245 GDDQHHTTSTQDAVKELQDCLALGSV--NIYMFHGGTNFGFMNGSNYYERLAPDV-TSYD 301

Query: 292 YDAPIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDPTHQKLGA---KLEAHIYHKS 346
           YDA + E+G   +P  K+   +++        E+ +S +   +  G    K    ++   
Sbjct: 302 YDALLTEWG---EPTAKYQAFKKVIADYAEIPEFPLSMEIERKAYGTFSVKERVSLFSTI 358

Query: 347 SNDCAAFLANYDSSSDANVTFNGNVYF 373
                  ++NY  S +A     G +Y+
Sbjct: 359 DTISQPIISNYPLSMEACNQATGYIYY 385


>gi|307275710|ref|ZP_07556850.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
 gi|306507586|gb|EFM76716.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
          Length = 611

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 173/353 (49%), Gaps = 45/353 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   IETY+ WN HEP+ G Y FEG
Sbjct: 10  FLVDGKPTKLISGAIHYFRMTPAQWEDSLYNLKALGANTIETYIPWNLHEPVEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V FV   QE GL + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 70  MKDIVAFVSLAQELGLMVILRPSVYICAEWEFGGLPAWL-LKEHVRLRSTDPRFIAKVRT 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           + + +  L K   L  + GGP+I+ QVENEYG    +YG+  E Y++            V
Sbjct: 129 YFSVL--LPKLVPLQVTHGGPVIMMQVENEYG----SYGMEKE-YLRQTKQVMEEFGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGFTP---NSPSK-----------------PIMWTEN 231
           P  +   + A + +++       D F      S SK                 PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDVGTLIEEDVFVTGNFGSHSKENATVMKAFMAKHDKKWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKDMLALGSL--NLYMFHGGTNFGFYNGCSARGVLDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLGA 336
              TSYDYDA + E G   + K+ H++   +AIK +C E +  ++P  +  G+
Sbjct: 298 PQVTSYDYDALLTEAGEPTE-KYFHVQ---RAIKEVCPE-VWQAEPRRKTFGS 345


>gi|224536014|ref|ZP_03676553.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522370|gb|EEF91475.1| hypothetical protein BACCELL_00878 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 1106

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 111/326 (34%), Positives = 155/326 (47%), Gaps = 39/326 (11%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  +  YVFWN HEP  G Y F 
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
            + DL  F +  Q+  +++ LR GPY CAEW  GG P WL     ++ R ++  F E + 
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   +   +K  NL  + GGPII+ QVENEYG+              V   +G    L+
Sbjct: 476 LFEEAVAKQVK--NLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533

Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
              WA++  +N    + W M           N   G   D          P+ P+M +E 
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
           +SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A  
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641

Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
            TSYDYDAPI E G    PK+  LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWALRE 666


>gi|153806012|ref|ZP_01958680.1| hypothetical protein BACCAC_00257 [Bacteroides caccae ATCC 43185]
 gi|149130689|gb|EDM21895.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
          Length = 774

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 112/329 (34%), Positives = 163/329 (49%), Gaps = 32/329 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            +  D     +DGK   L  G +HY R   E W + +++++  GL  I  YVFWN+HE  
Sbjct: 28  RIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQ 87

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G++ F G+ D+  FV+  QE GL++ LR GPYACAEW++GG+P WL     + +R+ + 
Sbjct: 88  PGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDP 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            F E  +R++  +   +    L  + GG I++ QVENEYG    +Y    E Y+    D 
Sbjct: 148 RFLEYCERYIKALGKQLAP--LTVNNGGNILMVQVENEYG----SYAADKE-YLAALRDM 200

Query: 184 AVNLNTSVPWVMCQ---QEDAP--DPIINTCNGFYCDG----FTPNSPSKPIMWTENYSG 234
             +   +VP   C    Q +A   D  + T NG + +          P  P    E Y  
Sbjct: 201 IKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPA 260

Query: 235 WFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG-P 284
           WF  +G     V + RP E L + + +     G   + YM+ GGTNF       TAGG  
Sbjct: 261 WFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSMYMFHGGTNFWYMNGANTAGGYR 315

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
              TSYDYDAP+ E+G    PK+   RE+
Sbjct: 316 PQPTSYDYDAPLGEWGNC-YPKYYAFREV 343


>gi|423219555|ref|ZP_17206051.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
           CL03T12C61]
 gi|392624760|gb|EIY18838.1| hypothetical protein HMPREF1061_02824 [Bacteroides caccae
           CL03T12C61]
          Length = 774

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 112/329 (34%), Positives = 163/329 (49%), Gaps = 32/329 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            +  D     +DGK   L  G +HY R   E W + +++++  GL  I  YVFWN+HE  
Sbjct: 28  RIKIDGGTFNVDGKDVQLICGEMHYARIPHEYWRDRLKRARAMGLNTISVYVFWNFHERQ 87

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G++ F G+ D+  FV+  QE GL++ LR GPYACAEW++GG+P WL     + +R+ + 
Sbjct: 88  PGEFDFSGQADVAEFVRLAQEEGLYVILRPGPYACAEWDFGGYPSWLLKEKDMVYRSKDP 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            F E  +R++  +   +    L  + GG I++ QVENEYG    +Y    E Y+    D 
Sbjct: 148 RFLEYCERYIKALGKQLAP--LTVNNGGNILMVQVENEYG----SYAADKE-YLAALRDM 200

Query: 184 AVNLNTSVPWVMCQ---QEDAP--DPIINTCNGFYCDG----FTPNSPSKPIMWTENYSG 234
             +   +VP   C    Q +A   D  + T NG + +          P  P    E Y  
Sbjct: 201 IKDAGFNVPLFTCDGGGQVEAGHIDGALPTLNGVFSEDIFKIIDKYHPGGPYFVAEFYPA 260

Query: 235 WFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG-P 284
           WF  +G     V + RP E L + + +     G   + YM+ GGTNF       TAGG  
Sbjct: 261 WFDVWGQRHSTVDYKRPAEQLDWMLGQ-----GVSVSMYMFHGGTNFWYMNGANTAGGYR 315

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
              TSYDYDAP+ E+G    PK+   RE+
Sbjct: 316 PQPTSYDYDAPLGEWGNC-YPKYYAFREV 343


>gi|189096261|pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
           Thetaiotaomicron
          Length = 612

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 110/321 (34%), Positives = 156/321 (48%), Gaps = 27/321 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++G+  V+++  IHYPR   E W   I+  K  G   I  YVFWN+HEP  G+Y F 
Sbjct: 14  TFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPEEGRYDFA 73

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  QE G ++ +R GPY CAEW  GG P WL     I+ R  +  + E +K
Sbjct: 74  GQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDPYYXERVK 133

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            FL ++   +   +L  S+GG II  QVENEYG    A+G+  + Y+    D       T
Sbjct: 134 LFLNEVGKQLA--DLQISKGGNIIXVQVENEYG----AFGI-DKPYISEIRDXVKQAGFT 186

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C      + +A D ++ T N   G   D          P  P+  +E +SGWF 
Sbjct: 187 GVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEFWSGWFD 246

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-----VATSYDY 292
            +G     R  E+L        +   +F  Y  + GGT+FG   G          TSYDY
Sbjct: 247 HWGAKHETRSAEELVKGXKEXLDRNISFSLYXTH-GGTSFGHWGGANFPNFSPTCTSYDY 305

Query: 293 DAPIDEYGFIRQPKWGHLREL 313
           DAPI+E G +  PK+  +R L
Sbjct: 306 DAPINESGKV-TPKYLEVRNL 325



 Score = 43.5 bits (101), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 66/234 (28%), Positives = 90/234 (38%), Gaps = 54/234 (23%)

Query: 470 KEVFLNIESLGHAALVFVN-KKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
           KE  L I      A VF+N KKL              + K   L EG + LDIL    G 
Sbjct: 396 KEQTLLITEAHDWAQVFLNGKKLATLSR----LKGEGVVKLPPLKEG-DRLDILVEAXGR 450

Query: 529 QNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFW 586
            N+G   +D  G        ++L++ K      +W +Y + V+         S A    +
Sbjct: 451 XNFGKGIYDWKGI----TEKVELQSDKGVELVKDWQVYTIPVD--------YSFARDKQY 498

Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
           KQ           +Y++TF   E  G   LN  +  KG  WVNG +IGRYW         
Sbjct: 499 KQQEN--AENQPAYYRSTFNLNE-LGDTFLNXXNWSKGXVWVNGHAIGRYWEI------- 548

Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSK 700
                                 P QTLY +P  W+  GEN ++I +  G  PSK
Sbjct: 549 ---------------------GPQQTLY-VPGCWLKKGENEIIILDXAG--PSK 578


>gi|71896501|ref|NP_001026163.1| beta-galactosidase precursor [Gallus gallus]
 gi|53129216|emb|CAG31369.1| hypothetical protein RCJMB04_5i4 [Gallus gallus]
          Length = 385

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 111/328 (33%), Positives = 160/328 (48%), Gaps = 27/328 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + YD    V DG      SGSIHY R     W + + K K  GL  I+TYV WNYHEP  
Sbjct: 27  IDYDCNCFVKDGHPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLNAIQTYVPWNYHEPQM 86

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y F G  DL  F++   E GL + LR GPY CAEW+ GG P WL     I  R++++ 
Sbjct: 87  GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 146

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG--------- 172
           +   +++++  ++  MK        GGPII+ QVENEYG+    ++ Y            
Sbjct: 147 YLTAVEKWMGVLLPKMKPH--LYHNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFRQHL 204

Query: 173 GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
           G+  V +  D A   +    ++  +    + AP    N    F       + P+ P++ +
Sbjct: 205 GDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPTGPLVNS 260

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
           E Y+GW   +G+     P E +A  +      G    N YM+ GGTNF    G   P ++
Sbjct: 261 EFYTGWLDHWGHRHIVVPSETIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMS 319

Query: 288 --TSYDYDAPIDEYGFIRQPKWGHLREL 313
             TSYDYDAP+ E G + + K+  LRE+
Sbjct: 320 QPTSYDYDAPLSEAGDLTE-KYFALREV 346


>gi|225872977|ref|YP_002754436.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
 gi|225792973|gb|ACO33063.1| beta-galactosidase [Acidobacterium capsulatum ATCC 51196]
          Length = 619

 Score =  157 bits (397), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 107/316 (33%), Positives = 163/316 (51%), Gaps = 40/316 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   + SGSIH+ R     W + +RK++  GL  I  YVFWN  EP RGQ+ F G
Sbjct: 45  FILDGKPVQIISGSIHFARVPRAEWGDRLRKARAMGLNAISVYVFWNVQEPHRGQWDFSG 104

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
           ++D+ RF++  Q+AGL++ LR GPYACAEW+ GG+P WL     ++ R+++  +    + 
Sbjct: 105 QYDVARFIRMAQQAGLYVILRPGPYACAEWSMGGYPAWLWKDGRVKIRSSDPAYLHAAQD 164

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
           ++  +   +K   L  + GGPII  QVENEYG+     AY           G+GG   V 
Sbjct: 165 YMDHLGQQLKP--LLWTHGGPIIAVQVENEYGSFGKSRAYLEEVRRMVAGAGLGG--VVL 220

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           + AD     + S+P +    +  P  + N         + P+S  K +   E Y GWF  
Sbjct: 221 YTADGPGLWSGSLPELPEAIDVGPGGVENGVKQLLA--YRPHS--KLVYVAEYYPGWFDQ 276

Query: 239 FG----YAVPFRP-VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           +G    +  P +  ++DL + ++R +       N YM+ GGT++G   G    A      
Sbjct: 277 WGQPHHHGAPLKEQLKDLRWILSRGYSV-----NLYMFHGGTDWGFMNGANDNAADTDYA 331

Query: 288 ---TSYDYDAPIDEYG 300
              TSYDY AP++E G
Sbjct: 332 PQTTSYDYAAPLNEAG 347


>gi|242004937|ref|XP_002423332.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
 gi|212506351|gb|EEB10594.1| beta-galactosidase precursor, putative [Pediculus humanus corporis]
          Length = 596

 Score =  157 bits (397), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 121/342 (35%), Positives = 162/342 (47%), Gaps = 52/342 (15%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK     SGS HY R   + W + +RK K  GL  + TYV W+ HE + G Y FEG  
Sbjct: 1   MDGKPFQYVSGSAHYFRMPNQYWRDRLRKIKAAGLNAVSTYVEWSQHERVPGVYDFEGDL 60

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI-PGIQFRTTNNPFKEEMKRF 132
           D+ RFV+  QE GLF+ LR GPY CAE + GG P WL    P IQ R+++  +   ++R+
Sbjct: 61  DVKRFVEMAQEEGLFVILRPGPYICAERDMGGLPYWLMTKHPDIQLRSSDFFYTYYVQRW 120

Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVE--------WAYGVGGELYVKWAADTA 184
           + K+  L K  +L+  +GGPIIL QVENEYG+          W   +  E +V + A   
Sbjct: 121 MDKL--LGKFTDLWYGKGGPIILVQVENEYGSYHSCDYNHTYWLRNLF-EKHVDYNAVLF 177

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS-------------PSKPIMWTE 230
                S  ++ C +            G Y    F PNS             PS P++ +E
Sbjct: 178 TTDGASRNFLKCGK----------IPGVYATVDFGPNSNVSKMFEAQREFEPSGPLVNSE 227

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            Y GW   +G     R          R         N+YM++GG+NFG TAG        
Sbjct: 228 YYPGWLTHWGEKKHARQDTKDVVKTLREMLNEKANVNFYMFYGGSNFGFTAGANQFGSIY 287

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYL 324
               TSYDYDAPI E         G L + + AIK + EEY 
Sbjct: 288 QSDITSYDYDAPISE--------AGDLTDKYYAIKNVLEEYF 321


>gi|329962091|ref|ZP_08300102.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
 gi|328530739|gb|EGF57597.1| putative beta-galactosidase [Bacteroides fluxus YIT 12057]
          Length = 632

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 109/338 (32%), Positives = 157/338 (46%), Gaps = 49/338 (14%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G
Sbjct: 34  FVYDGKAIRIISGEMHYARIPHQYWRHRMKMLKAMGLNAVATYVFWNLHEPEPGKWDFSG 93

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             +L  +++   E GL + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K 
Sbjct: 94  DRNLAEYIRIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGMELRRDNEQFLKYTKL 153

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L ++   + +  L  +QGGPII+ Q ENE+G+           YV    D  +  + + 
Sbjct: 154 YLERLYKEVGK--LQITQGGPIIMVQGENEFGS-----------YVSQRKDITLEEHRAY 200

Query: 192 PWVMCQQ--EDAPDPIINTCNG--FYCDGFTP----------------------NSPSKP 225
              + +Q  E   D  + T +G   +  G+ P                      N    P
Sbjct: 201 NAKIIKQLKEVGFDVPMFTSDGSWLFEGGYVPGALPTANGENNIENLKKVVNQYNGGQGP 260

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
            M  E Y GW   +    P      +A    ++   G +F NYYM  GGTNFG T+G   
Sbjct: 261 YMVAEFYPGWLAHWCEPHPQVKASTIARQTEKYLANGVSF-NYYMVHGGTNFGFTSGANY 319

Query: 286 VA--------TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
                     TSYDYDAPI E G++  PK+  +R + K
Sbjct: 320 DKKHDIQPDLTSYDYDAPISEAGWV-TPKFDSIRNVIK 356



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 86/367 (23%), Positives = 130/367 (35%), Gaps = 62/367 (16%)

Query: 335 GAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVF 390
           G     ++ H  +N      ANYD   D         Y  P     W        +NV+ 
Sbjct: 297 GVSFNYYMVHGGTNFGFTSGANYDKKHDIQPDLTSYDYDAPISEAGWVTPKFDSIRNVI- 355

Query: 391 NTAKVISQRNNGDHPFAQQKNVNELL-LASSAFSWYEEKVGISGNRSFVRPDLAEQINTT 449
                   +   D+P  +      L+ + S       + + I+  +  V+ D        
Sbjct: 356 --------KRYVDYPLPEAPKAFPLIEIPSIELQQVADLLAITETQEAVQGDKPLTFEEL 407

Query: 450 KDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKK 509
                Y+ Y    +  P  GK   L IE L   A V+V+ + V  G  N     + ++ +
Sbjct: 408 NQGYGYVLYRRHFN-QPISGK---LTIEGLRDYATVYVDGEFV--GRLNRYNKKYSMDIE 461

Query: 510 IELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVE 569
           I  N     L+IL   +G  NYG+       G+ S + ID      +   GEW       
Sbjct: 462 IPFN---GNLEILVENMGRINYGSEIVHNNKGIISPVKID-----DNFIEGEWEMTKLPM 513

Query: 570 GEYIGLDKI--SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
            E    +K+  +   S      + L    SL  YK TF   E  G   L++   GKG  +
Sbjct: 514 SEVPAFEKMPANTVTSIMGSSANALVGKPSL--YKGTFTLQE-TGDTFLDMKDWGKGIVF 570

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNG +IGRYW                               P QTL+ +P  W+  G N 
Sbjct: 571 VNGINIGRYWQV----------------------------GPQQTLF-VPGVWLKKGINE 601

Query: 688 LVIHEEL 694
           +VI ++L
Sbjct: 602 IVIFDQL 608


>gi|345487997|ref|XP_001602984.2| PREDICTED: beta-galactosidase-like [Nasonia vitripennis]
          Length = 638

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 169/361 (46%), Gaps = 66/361 (18%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + +++   ++DGK     SGS HY R+  + W + +RK +  GL  + TYV W+ H+P  
Sbjct: 32  IDFENNQFLLDGKPFRYVSGSFHYFRTPKQYWRDRLRKMRAAGLNALSTYVEWSLHQPEP 91

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
            ++ ++G  DLV+F++  QE  LF+ LR GPY CAE  +GGFP W L+ +PGI+ RT + 
Sbjct: 92  NKWVWDGDADLVKFLQLAQEEDLFVLLRPGPYICAEREFGGFPYWLLNLVPGIKLRTNDT 151

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            + E  + +L +++  +K   L    GGPII+ QVENEYG+           +     D 
Sbjct: 152 RYLEYAEEYLNQVLTRVKP--LLRGNGGPIIMVQVENEYGS-----------FHACDKDY 198

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPN--- 220
              L       + Q     D ++ T +G Y                        T N   
Sbjct: 199 MTKLKN-----IIQNHVGTDALLYTTDGSYRQALRCGPVSGAYATIDFGTSSNVTQNFNL 253

Query: 221 ----SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFE---TGGTFQNYYMYFG 273
                P  P++ +E Y GW     +  PF  VE   F + +  +   + G   N YM++G
Sbjct: 254 MREFEPKGPLVNSEFYPGWLSH--WEEPFERVE--TFKITKMLDEMLSLGASVNMYMFYG 309

Query: 274 GTNFGRTAGGPLV------ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
           GTNF  ++G  +        TSYDYDAP+ E G +         + H+  K+  +YL   
Sbjct: 310 GTNFAFSSGANIFDNYTPDLTSYDYDAPLSEAGDLTA-------KYHEIKKIISKYLPIP 362

Query: 328 D 328
           D
Sbjct: 363 D 363


>gi|348529664|ref|XP_003452333.1| PREDICTED: beta-galactosidase-like [Oreochromis niloticus]
          Length = 651

 Score =  157 bits (396), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 112/329 (34%), Positives = 160/329 (48%), Gaps = 19/329 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V Y +     DG++    SGSIHY R     W + + K    GL  I+TYV WNYHE
Sbjct: 25  SFTVDYQNDCFRKDGEKFQYISGSIHYNRIPRVYWKDRLLKMYMAGLNAIQTYVPWNYHE 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
            + G Y F G  DL  F+K  Q+ GL + LR GPY CAEW+ GG P WL     I  R+T
Sbjct: 85  EVPGLYNFSGDRDLEHFLKLAQDVGLLVILRPGPYICAEWDMGGLPAWLLKKKDIVLRST 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG-GELYV 177
           +  +   + +++ K++ ++K        GGPII  QVENEYG+    ++ Y     +L+ 
Sbjct: 145 DPDYIAAVDKWMGKLLPMIKP--YLYQNGGPIITVQVENEYGSYFACDYNYMRHLSKLFR 202

Query: 178 KWAADTAVNLNTS---VPWVMCQQ-EDAPDPIINTCNGFYCDGFTPN---SPSKPIMWTE 230
            +  D  V   T    + ++ C   +D    +           F P     P  P++ +E
Sbjct: 203 SYLGDEVVLFTTDGAGLGYLKCGSIQDLYATVDFGPGANVTAAFEPQRQVQPHGPLVNSE 262

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--RTAGGPLVA- 287
            Y+GW   +G          +A A++     G    N YM+ GGTNFG    A  P  A 
Sbjct: 263 FYTGWLDHWGSRHSVVSPTQVAKALSEMLLMGANV-NLYMFIGGTNFGYWNGANTPYAAQ 321

Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
            TSYDYDAP+ E G + + K+  +RE+ K
Sbjct: 322 PTSYDYDAPLTEAGDLTE-KYFAIREVIK 349


>gi|323449959|gb|EGB05843.1| hypothetical protein AURANDRAFT_66064 [Aureococcus anophagefferens]
          Length = 1630

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 122/390 (31%), Positives = 178/390 (45%), Gaps = 49/390 (12%)

Query: 4    NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            ++  D R+L+++G R +L SGSIHYPRSTP +WP+L  +++  GL  IE+Y FWN H   
Sbjct: 1037 SIARDGRSLLVNGSRVLLLSGSIHYPRSTPAMWPKLFAEARANGLNAIESYAFWNKHSAT 1096

Query: 64   R---GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFP------------V 108
            R     Y F G  DL  F+    E  LF+  R GPY CAEW  GG P             
Sbjct: 1097 RYGAYDYGFNGDVDL--FLSLAAEHDLFVLWRFGPYVCAEWPAGGIPARAPRRAVFASNA 1154

Query: 109  WLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWA 168
            W+H +PG++ RT N  +  E  R++     ++  E   +  G      ++ENEYG  +  
Sbjct: 1155 WIHDVPGMKTRTNNTAWLNETGRWMRDHFAVI--EPHLSRNGAS---NRIENEYGGSKSD 1209

Query: 169  YGVGGELYVKWAADTAVNLNTSVPWVMCQ--QEDAPDPIINTCNGFYCDG-------FTP 219
                   YV      A  +   + W+MC      APD  ++T NG   D          P
Sbjct: 1210 AAA--VAYVDALDALADAVAPELVWMMCGFVSLVAPD-ALHTGNGCPHDQGPASAHVVVP 1266

Query: 220  NSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
             +P     W      W+ ++G     RP  D+A+ VA +  TGG   N+YM+ GG ++G 
Sbjct: 1267 PAPGADPAWYTEDELWYDAWGLPSLARPPADVAYGVASYVATGGAMHNFYMWHGGNHYGN 1326

Query: 280  --TA----GG------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
              TA    GG      P     Y   AP+   G   +P + HL  +H  +    E L+ +
Sbjct: 1327 WSTATPDLGGASSPEPPASQVRYANAAPLRSDGSRHEPLFSHLAAVHGTLDAYAEVLLGA 1386

Query: 328  DPTHQKLGAKLEA--HIYH-KSSNDCAAFL 354
             P      + + A  H Y  K +ND A+ +
Sbjct: 1387 TPEALATPSCVAACPHAYFLKFANDTASVV 1416


>gi|355690250|gb|AER99094.1| galactosidase, beta 1 [Mustela putorius furo]
          Length = 648

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/343 (32%), Positives = 162/343 (47%), Gaps = 31/343 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            + Y H   + DG+     SGSIHY R     W + + K K  GL  I+TYV WN+HEP 
Sbjct: 22  KIDYHHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 81

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++ 
Sbjct: 82  PGQYKFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 141

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +   + ++L  ++  MK   L    GGPII  QVENEYG    +Y      Y+++    
Sbjct: 142 DYLAAVDKWLGVLLPRMKP--LLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQKL 195

Query: 184 -AVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYCD-GFTPNS-------------PSKPI 226
              +L   V  ++   + A +P +      G Y    F P +             P  P+
Sbjct: 196 FHYHLGKDV--LLFTTDGALEPFLQCGALQGLYATVDFGPGANITAAFEVQRKSEPKGPL 253

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
           + +E Y+GW   +G        E +A ++      G    N YM+ GGTNF    G  + 
Sbjct: 254 VNSEFYTGWLDHWGQPHSTVKTEVVASSLHDILARGANV-NLYMFIGGTNFAYWNGANMP 312

Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLI 325
                TSYDYDAP+ E G + + K+  LR++ +  +   E +I
Sbjct: 313 YKAQPTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGVI 354


>gi|307188518|gb|EFN73255.1| Beta-galactosidase [Camponotus floridanus]
          Length = 624

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 153/323 (47%), Gaps = 32/323 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V Y++   ++DGK     SGS HY R+  + W + +RK +  GL  + TYV W+ HE
Sbjct: 31  SFGVDYENNQFLLDGKPFRYVSGSFHYFRAPRQYWRDRLRKMRAAGLNAVSTYVEWSLHE 90

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRT 120
           P  GQ+ + G  DL+ F+   QE  LF+ LR GPY CAE + GG P WL    P I+ RT
Sbjct: 91  PEPGQFNWAGDADLIEFLNIAQEEDLFVLLRPGPYICAERDLGGLPYWLLREAPDIKLRT 150

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-----NVEWA------- 168
            +  F +    +L ++++ +K   L    GGPII+ Q+ENEYG     + E+        
Sbjct: 151 KDAAFMKYATAYLNQVLEKVKP--LLRGNGGPIIMVQIENEYGSYNACDTEYTDMLKEII 208

Query: 169 ---YGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
               G    LY    A  ++     VP      +      +N  N F         P  P
Sbjct: 209 VGKVGSKALLYTTDGASASLLRCGFVPGAYATIDFGTS--VNVTNSF--QSMRLYQPRGP 264

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
           ++ +E Y GW   +G        E +   +      G +  N YM++GGTNFG T+G   
Sbjct: 265 LVNSEFYPGWLTHWGETFQRVKTEAVTKTLREMLALGASV-NIYMFYGGTNFGFTSGANG 323

Query: 284 ------PLVATSYDYDAPIDEYG 300
                 P + TSYDYDAP+ E G
Sbjct: 324 GVGAYSPQI-TSYDYDAPLTEAG 345


>gi|336063700|ref|YP_004558559.1| beta-galactosidase [Streptococcus pasteurianus ATCC 43144]
 gi|334281900|dbj|BAK29473.1| beta-galactosidase precursor [Streptococcus pasteurianus ATCC
           43144]
          Length = 595

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/320 (35%), Positives = 152/320 (47%), Gaps = 43/320 (13%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
            +  +DGK   + SGSIHY R  P+ W + +   K  G   +ETYV WN HEP  G++ F
Sbjct: 8   ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G  DL RF+   QE GL+  +R  PY CAEW +GG P WL    G++ R+ +  F + +
Sbjct: 68  TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWL-LEKGVRVRSQDKDFLQVV 126

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           KR+   +I  + +  L   QGG I++ QVENEYG    +YG   ++Y++      + L  
Sbjct: 127 KRYYEALIPRLIKHQL--DQGGNILMFQVENEYG----SYG-EDKVYLRELKQMMLELGL 179

Query: 190 SVPWVMCQQEDAP------------DPIINTCNGFYCDG----------FTPNSPSKPIM 227
             P+      D P            D ++ T N F              F       P+M
Sbjct: 180 EEPFFTS---DGPWHTALRAGSLIEDDVLVTGN-FGSKAKENFASMEMFFQQYGKKWPLM 235

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
             E + GWF  +G  V  R  E+LA AV    E G    N YM+ GGTNFG       R 
Sbjct: 236 CMEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARK 293

Query: 281 AGGPLVATSYDYDAPIDEYG 300
                  TSYDYDA +DE G
Sbjct: 294 QTDLPQVTSYDYDAILDEAG 313


>gi|420262409|ref|ZP_14765050.1| beta-galactosidase [Enterococcus sp. C1]
 gi|394770166|gb|EJF49970.1| beta-galactosidase [Enterococcus sp. C1]
          Length = 585

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 112/313 (35%), Positives = 155/313 (49%), Gaps = 40/313 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +D K   + SG+IHY R  PE W + + K +  G   +ETYV WN HE   G Y FEG  
Sbjct: 12  LDNKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGIL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T QE GL++ LR  PY CAEW +GG P WL   P ++ R    PF E++ R+ 
Sbjct: 72  DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
           A +   ++  +L  +QGGPI++ QVENEYG    +Y    E   K  A           +
Sbjct: 132 AHLFPQVR--DLQITQGGPILMMQVENEYG----SYANDKEYLRKMVAAMRQQGVETPLV 185

Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
            +  PW    +    +D   P IN C     + F      +   +P+M  E + GWF ++
Sbjct: 186 TSDGPWHDMLENGSIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244

Query: 240 G-----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVA 287
           G            V++L   +A      G+  N YM+ GGTNFG   G        P V 
Sbjct: 245 GDDHHHTTSTADAVKELQDCLAE-----GSV-NIYMFHGGTNFGFMNGSNYYERLAPDV- 297

Query: 288 TSYDYDAPIDEYG 300
           TSYDYDA + E+G
Sbjct: 298 TSYDYDALLTEWG 310


>gi|350418578|ref|XP_003491903.1| PREDICTED: beta-galactosidase-like [Bombus impatiens]
          Length = 646

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 45/359 (12%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
            S  V Y++   ++DGK     SGS HY R+  + W + ++K +  GL  + TYV WN H
Sbjct: 30  FSFEVDYENNQFLLDGKPFRYISGSFHYFRTPRQYWRDRLKKMRAAGLNAVSTYVEWNLH 89

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFR 119
           +P   ++++ G  D+V F+   QE GLF+ LR GPY CAE ++GG P W L  +P I  R
Sbjct: 90  QPTENEWHWTGDADVVEFINIAQEEGLFVLLRPGPYICAERDFGGLPYWLLGRVPDINLR 149

Query: 120 TTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW 179
           T +  + + ++ ++ +++D  K +      GGPII+ QVENEYG    +Y    E  ++ 
Sbjct: 150 TNDPRYMKYVEIYINEVLD--KVQPYLRGNGGPIIMVQVENEYG----SYACDTEYLIRL 203

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-----GFTPNS------------- 221
                  + T     +    D  +P +  C GF  +      F  N+             
Sbjct: 204 RDIMRQKIGTK---ALLYSTDGSNPNMLRC-GFVPEVYATVDFGTNTNVTKNFEIMRMYQ 259

Query: 222 PSKPIMWTENYSGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGR 279
           P  P++ +E Y GW     +  PF+ V+   +   +      G +  N YM++GGTNFG 
Sbjct: 260 PRGPLVNSEFYPGWLSH--WREPFQRVQTATVTKTLDEMLSLGASV-NIYMFYGGTNFGY 316

Query: 280 TAGG--------PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
           TAG         P + TSYDYDAP+ E G    PK+  +R  + K + L    L S  P
Sbjct: 317 TAGANGGHNAYNPQL-TSYDYDAPLTEAG-DPTPKYFAIRNVISKYLPLPNVPLPSPSP 373


>gi|189463987|ref|ZP_03012772.1| hypothetical protein BACINT_00322 [Bacteroides intestinalis DSM
           17393]
 gi|189438560|gb|EDV07545.1| glycosyl hydrolase family 35 [Bacteroides intestinalis DSM 17393]
          Length = 1106

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 110/326 (33%), Positives = 155/326 (47%), Gaps = 39/326 (11%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  +  YVFWN HEP  G Y F 
Sbjct: 356 TFLLNGKPFVVKAAELHYPRIPKPYWDQRIKLCKALGMNTVCLYVFWNSHEPQPGVYDFT 415

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
            + DL  F +  Q+  +++ LR GPY CAEW  GG P WL     ++ R ++  F E + 
Sbjct: 416 EQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDVRLRESDPYFIERVA 475

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   +   +K  +L  + GGPII+ QVENEYG+              V   +G    L+
Sbjct: 476 LFEEAVAKQVK--DLTIANGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANFGNDIALF 533

Query: 177 -VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
              WA++  +N    + W M           N   G   D          P+ P+M +E 
Sbjct: 534 QCDWASNFTLNGLDDLIWTM-----------NFGTGANVDQQFAKLKQLRPNSPLMCSEF 582

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
           +SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A  
Sbjct: 583 WSGWFDKWGANHETRPAADMIKGIDDMLSRGISF-SLYMTHGGTNWGHWAGANSPGFAPD 641

Query: 288 -TSYDYDAPIDEYGFIRQPKWGHLRE 312
            TSYDYDAPI E G    PK+  LRE
Sbjct: 642 VTSYDYDAPISESGQT-TPKYWALRE 666


>gi|288926246|ref|ZP_06420171.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
 gi|288336937|gb|EFC75298.1| beta-galactosidase (Lactase) [Prevotella buccae D17]
          Length = 791

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 110/320 (34%), Positives = 156/320 (48%), Gaps = 19/320 (5%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           T   +  +++GK  V+++  +HYPR     W   I+  K  G+  +  YVFWN HE   G
Sbjct: 35  TVGDKTFLLNGKPFVVKAAELHYPRIPRPYWEHRIKMCKALGMNTVCLYVFWNIHEQQEG 94

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           ++ F    D+  F +  Q  GL++ +R GPY CAEW  GG P WL     I+ R  +  F
Sbjct: 95  KFDFTDNNDVAEFCRLAQRNGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREPDPYF 154

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADT 183
            E +K F  K+ + +   +L    GGPII+ QVENEYG+     AY       V+ +   
Sbjct: 155 MERVKLFERKVGEQLA--SLTIQNGGPIIMVQVENEYGSYGENKAYVSAIRDIVRQSGFD 212

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCN-GFYCD------GFTPNSPSKPIMWTENYSGWF 236
            V L     W    +++  D ++ T N G   D            P+ P M +E +SGWF
Sbjct: 213 KVTL-FQCDWASNFEKNGLDDLVWTMNFGTGADIDQQFRRLGELRPNAPQMCSEFWSGWF 271

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA---TSYD 291
             +G     RP + +   +      G +F + YM  GGT+FG  AG   P  A   TSYD
Sbjct: 272 DKWGARHETRPAKTMVEGIDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFAPDVTSYD 330

Query: 292 YDAPIDEYGFIRQPKWGHLR 311
           YDAPI+EYG    PK+  LR
Sbjct: 331 YDAPINEYGQA-TPKYWELR 349


>gi|320106923|ref|YP_004182513.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
 gi|319925444|gb|ADV82519.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
          Length = 633

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 115/332 (34%), Positives = 162/332 (48%), Gaps = 34/332 (10%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V  DH  L  +G+   L SG +HY R   E W   ++ +K  GL  + TY+FWN HEP 
Sbjct: 43  RVAGDHFEL--NGEPVQLLSGEMHYARIPREYWRARLQMAKAMGLNTVATYIFWNVHEPK 100

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIP--GIQFRTT 121
            G Y F G  D+  FVK  QE GL + LR GPYACAEW +GG+P WL   P  G   R+ 
Sbjct: 101 PGVYDFSGNHDVAAFVKMAQEEGLNVILRAGPYACAEWEFGGYPSWLMKDPKMGSALRSN 160

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  +   ++R++ ++   M    L  S GGPI+  QVENEYG+    +G G + Y+    
Sbjct: 161 DEVYMAPVERWIKRLGQEMVP--LLISNGGPIVAVQVENEYGD----FG-GDKKYLAHML 213

Query: 182 DTAVN----------LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMW 228
           +   N          ++ S   V    E  P   +N   G    G T  +   P +P+  
Sbjct: 214 EIFQNAGFKDSFLYTVDPSKALVNGSLEGLPSG-VNFGVGNAERGLTALAHLRPGQPLFA 272

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA- 287
           +E + GWF  +G+    RP+      +A   +   +  N YM+ GGT+FG  +G      
Sbjct: 273 SEYWPGWFDHWGHPHETRPIPPQLKDIAYTLDHKSSI-NIYMFHGGTSFGFMSGASWTGG 331

Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLREL 313
                 TSYDYDAP+DE G    PK+   R+L
Sbjct: 332 EYLPDVTSYDYDAPLDEAGH-PTPKFYAYRDL 362


>gi|325569852|ref|ZP_08145846.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
 gi|325156975|gb|EGC69143.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
          Length = 585

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 112/313 (35%), Positives = 155/313 (49%), Gaps = 40/313 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +D K   + SG+IHY R  PE W + + K +  G   +ETYV WN HE   G Y FEG  
Sbjct: 12  LDKKPFKVISGAIHYFRVVPEYWQDRLEKLRLMGCNTVETYVPWNLHEAQEGVYQFEGIL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T QE GL++ LR  PY CAEW +GG P WL   P ++ R    PF E++ R+ 
Sbjct: 72  DLRRFIQTAQEVGLYVILRPAPYICAEWEFGGLPYWLLQDPMMKLRFDYPPFMEKITRYF 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN------L 187
           A +   ++  +L  +QGGPI++ QVENEYG    +Y    E   K  A           +
Sbjct: 132 AHLFPQVR--DLQITQGGPILMMQVENEYG----SYANDKEYLRKMVAAMRQQGVETPLV 185

Query: 188 NTSVPWVMCQQ----EDAPDPIINTCNGFYCDGFTP----NSPSKPIMWTENYSGWFLSF 239
            +  PW    +    +D   P IN C     + F      +   +P+M  E + GWF ++
Sbjct: 186 TSDGPWHDMLENGTIKDLALPTIN-CGSNIKENFEKLRRFHGEKRPLMVMEFWIGWFDAW 244

Query: 240 G-----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------GPLVA 287
           G            V++L   +A      G+  N YM+ GGTNFG   G        P V 
Sbjct: 245 GDDHHHTTSTADAVKELQDCLAE-----GSV-NIYMFHGGTNFGFMNGSNYYERLAPDV- 297

Query: 288 TSYDYDAPIDEYG 300
           TSYDYDA + E+G
Sbjct: 298 TSYDYDALLTEWG 310


>gi|334348881|ref|XP_001378605.2| PREDICTED: beta-galactosidase-like [Monodelphis domestica]
          Length = 658

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 118/348 (33%), Positives = 164/348 (47%), Gaps = 25/348 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y+    + DGK     SGSIHY R     W + + K K  GL  I+TYV WN+HEP+ 
Sbjct: 50  IDYERDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPLP 109

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y F   +DL  F++   E GL + LR GPY CAEW+ GG P WL     I  R+++  
Sbjct: 110 GVYRFSDDYDLEYFLQLAHEIGLLVILRPGPYICAEWDMGGLPAWLLTKKSIVLRSSDPD 169

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY-GVGGELYVKWA 180
           +  E +++L  ++  MK        GGPII  QVENEYG+    ++ Y     +L+ K  
Sbjct: 170 YLAETEKWLGVLLPKMKP--YLYQNGGPIITVQVENEYGSYFTCDYNYLRFLQQLFHKHL 227

Query: 181 ADTAVNLNT---SVPWVMCQQEDAPDPII------NTCNGFYCDGFTPNSPSKPIMWTEN 231
            +  V   T   S  ++ C         +      N    F     T   P  P++ +E 
Sbjct: 228 GEEVVLFTTDGASEDYLKCGTLQGLYATVDFGTNHNITEAFQSQRKT--EPKGPLVNSEF 285

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA-- 287
           Y+GW   +G A      + +  ++      G    N YM+ GGTNFG   G   P  A  
Sbjct: 286 YTGWLDHWGEAHETVDTKAIISSLNDMLSQGANV-NMYMFIGGTNFGFWNGANIPYAAQP 344

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
           TSYDYDAP+ E G + + K+  LREL    +   E LI   PT  K  
Sbjct: 345 TSYDYDAPLSEAGDLTE-KYFALRELIGKFEKLPEGLIP--PTTPKFA 389


>gi|163790001|ref|ZP_02184436.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
 gi|159874701|gb|EDP68770.1| glycosyl hydrolase, family 35 [Carnobacterium sp. AT7]
          Length = 595

 Score =  156 bits (394), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 104/316 (32%), Positives = 152/316 (48%), Gaps = 34/316 (10%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
              +++G+   + SG+IHY R  PE W   +   K  G   +ETY+ WN HE    +Y F
Sbjct: 8   EEFLLNGEPFKIISGAIHYFRILPEDWYHSLYNLKALGFNTVETYIPWNVHETKEREYDF 67

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+ RFV+T +E GLF+ LR  PY CAEW +GG P WL     ++ R+++  F E++
Sbjct: 68  SGQLDIQRFVQTAKELGLFVILRPSPYICAEWEFGGLPAWLLTYKNMRIRSSDPQFIEKV 127

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
             +  K+ + +    L  + GGP+I+ Q+ENEYG    +YG   E Y+K   +  + L  
Sbjct: 128 SSYYKKLFEQIVP--LQVTSGGPVIMMQLENEYG----SYGEDKE-YLKTLYELMLELGV 180

Query: 190 SVP-------WVMCQQEDAPDPIINTCNGFYCDGFTPN---------SPSK--PIMWTEN 231
           +VP       W   Q+      +     G +      N         S  K  P+M  E 
Sbjct: 181 TVPIFTSDGAWKATQEAGTMTDLDILTTGNFGSQSKENFKNLKEFHESKGKNWPLMCMEY 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +   +  R  +DL   V    + G    N YM+ GGTNFG       R     
Sbjct: 241 WGGWFNRWNDPIIKRDAQDLTNDVKEALKIGSL--NLYMFHGGTNFGFMNGCSARLGKDL 298

Query: 285 LVATSYDYDAPIDEYG 300
              TSYDYDAP++E G
Sbjct: 299 PQLTSYDYDAPLNEQG 314


>gi|449664450|ref|XP_002165261.2| PREDICTED: beta-galactosidase-like [Hydra magnipapillata]
          Length = 589

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 98/317 (30%), Positives = 158/317 (49%), Gaps = 25/317 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y++   + DG      SGSIHY R   + W + + K ++ GL  I+TY+ WN+HEP  
Sbjct: 25  IDYENNKFLKDGTEFRYISGSIHYMRVPEDYWEDRLSKIRKAGLNAIQTYIPWNFHEPTE 84

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPG---IQFRTT 121
           G + F G+ ++ +F+K  Q+  L + LR GPY CAEW +GGFP WL    G   +Q RT+
Sbjct: 85  GNFQFGGQQNVFKFLKLAQKYDLLVILRPGPYICAEWEFGGFPYWLLKKVGNKTMQLRTS 144

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV----EWAYGVGGELYV 177
           +N + ++++ +++ ++  ++        GGPII  QVENEYG+     E+ Y +   ++ 
Sbjct: 145 DNLYLQKVENYMSVLLSGLRP--YLYENGGPIITVQVENEYGSYGCDHEYMYKLES-IFR 201

Query: 178 KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN-------GFYCDGFTPNSPSKPIMWTE 230
           K+  +  +   T        +     P+  T +         Y D      P  P++ +E
Sbjct: 202 KYLGENVILFTTDGAGDSYLKCGTIKPLFATVDFGPTAEPKLYFDIQRKYQPLGPLVNSE 261

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA--- 287
            Y+GW   +G       +ED+   + +      +  N YM+ GGTNFG   G    +   
Sbjct: 262 FYTGWLDHWGGQHAHTSLEDVTDTLDKMLSLNASV-NMYMFEGGTNFGFMNGANQDSNSL 320

Query: 288 ----TSYDYDAPIDEYG 300
               TSYDYDAP+ E G
Sbjct: 321 QPQPTSYDYDAPLSEAG 337


>gi|330997880|ref|ZP_08321714.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
 gi|329569484|gb|EGG51254.1| putative beta-galactosidase [Paraprevotella xylaniphila YIT 11841]
          Length = 786

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/321 (33%), Positives = 162/321 (50%), Gaps = 23/321 (7%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           ++  +++GK  ++++  +HYPR     W + I+  K  G+  +  YVFWN HE   G++ 
Sbjct: 40  NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFD 99

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F G  D+  F++  QE GL++ +R GPY CAEW  GG P WL     I+ R  +  F E 
Sbjct: 100 FTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMER 159

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVN 186
            + F  K+ + +   +L   +GGPII+ QVENEYG+   +  Y  G    ++ +    V 
Sbjct: 160 YRIFAKKLGEQIG--DLTIEKGGPIIMVQVENEYGSYGEDKPYVSGIRDIIRDSGFDKVT 217

Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS--------PSKPIMWTENYSGWFLS 238
           L     W     ++  D ++ T N F       N         P  P M +E +SGWF  
Sbjct: 218 L-FQCDWSSNFTKNGLDDLVWTMN-FGTGANIENEFKKLGELRPESPQMCSEFWSGWFDK 275

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDY 292
           +G     R  +++   +    + G +F + YM  GGT++G  AG       P V TSYDY
Sbjct: 276 WGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFSPDV-TSYDY 333

Query: 293 DAPIDEYGFIRQPKWGHLREL 313
           DAPI+E G +  PK+  LRE+
Sbjct: 334 DAPINEAGQV-TPKYMELREM 353



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 66/253 (26%), Positives = 98/253 (38%), Gaps = 66/253 (26%)

Query: 466 PGQGKEVFLNIESLGHAALVFVNKKLV-AFGYGNHDFANFLINKKIELNEGINTLDILSM 524
           P    +  L I      A VF+N KL+ +    NH+    L   K    EG + LDIL  
Sbjct: 418 PAVPTQSILTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMK----EG-DQLDILVE 472

Query: 525 MVGLQNYG-AWFDVAG----------AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
            +G  N+G A  D  G              S + ++LKN +    S    YQV  + +Y+
Sbjct: 473 AMGRINFGRAIKDFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSDS--YQVQKDMKYV 530

Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
            L    +                    Y+ TF   +  G   LNL + GKGQ +VNG +I
Sbjct: 531 PLKDQKVPGC-----------------YRATFNLKK-TGDTFLNLETWGKGQVYVNGHAI 572

Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
           GR+W                               P QTLY +P  W+  GEN +++ + 
Sbjct: 573 GRFWKI----------------------------GPQQTLY-MPGCWLKKGENEIIVQDI 603

Query: 694 LGGDPSKISLLTK 706
           +G   + +  L+K
Sbjct: 604 VGPQETVVEGLSK 616


>gi|348508362|ref|XP_003441723.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oreochromis
           niloticus]
          Length = 605

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 121/368 (32%), Positives = 167/368 (45%), Gaps = 58/368 (15%)

Query: 8   DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
           D     ++GK   +  GS+HY R     W + + K K  GL  + TYV WN HEP RG +
Sbjct: 10  DSSQFTLEGKPFRILGGSVHYFRVPRAYWEDRLLKMKACGLNTLTTYVPWNLHEPERGTF 69

Query: 68  YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
            F+ + DL  +V    + GL++ LR GPY CAEW+ GG P WL     +Q RTT   F  
Sbjct: 70  NFQDQLDLKAYVSLAAQLGLWVILRPGPYICAEWDLGGLPSWLLQDEEMQLRTTYPGFVN 129

Query: 128 EMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
            +  +  K+I ++K   L    GGPII  QVENEYG+              +A D     
Sbjct: 130 AVNLYFDKLISVIKP--LMFEGGGPIIAVQVENEYGS--------------FAKD----- 168

Query: 188 NTSVPWVM-CQQEDAPDPIINTCN---GFYCDGF-----TPN---------------SPS 223
           +  +P++  C Q      ++ T +   G  C G      T N                P 
Sbjct: 169 DKYMPFIKNCLQSRGIKELLMTSDNWEGLRCGGVEGALKTVNLQRLSFGAIQHLADIQPQ 228

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
           KP+M  E +SGWF  +G        ED+   V+   + G +  N YM+ GGT FG   G 
Sbjct: 229 KPLMVMEYWSGWFDVWGEHHHVFYAEDMLAVVSEILDRGVSI-NLYMFHGGTTFGFMNGA 287

Query: 284 ------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL--ISSDPTHQKLG 335
                     TSYDYDAP+ E G    PK+ HLR L        E+L  + S P  +  G
Sbjct: 288 MDFGTYKSQVTSYDYDAPLSEAGDC-TPKYHHLRNLFSQYH--SEHLPGVPSSPERKAYG 344

Query: 336 -AKLEAHI 342
            A ++ H+
Sbjct: 345 PALIQQHL 352


>gi|299142590|ref|ZP_07035721.1| beta-galactosidase (Lactase) [Prevotella oris C735]
 gi|298576025|gb|EFI47900.1| beta-galactosidase (Lactase) [Prevotella oris C735]
          Length = 823

 Score =  156 bits (394), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 109/328 (33%), Positives = 158/328 (48%), Gaps = 27/328 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            + T      +++G+  V+++  +HYPR     W + I+  K  G+  +  YVFWN HE 
Sbjct: 67  GDFTVGKNTFLLNGQPFVVKAAELHYPRIPRPYWEQRIKMCKSLGMNTVCLYVFWNIHEQ 126

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G++ F G  D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R  +
Sbjct: 127 QEGKFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREDD 186

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY------ 176
             F   +K F A++   +    L    GGPII+ QVENEYG    +YGV  +        
Sbjct: 187 PYFMARVKAFEAEVGRQLAP--LTIQNGGPIIMVQVENEYG----SYGVNKKYVSQIRDI 240

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWT 229
           VK +    V L     W    + +  D ++ T N   G   D          P  P+M +
Sbjct: 241 VKASGFDKVTL-FQCDWASNFENNGLDDLVWTMNFGTGSNIDAQFKRLKQLRPDAPLMCS 299

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
           E +SGWF  +G     RP + +   +        +F + YM  GGT+FG  AG   P  A
Sbjct: 300 EFWSGWFDKWGARHETRPAKAMVEGIDEMLSKNISF-SLYMTHGGTSFGHWAGANSPGFA 358

Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRE 312
              TSYDYDAPI+EYG    PK+  LR+
Sbjct: 359 PDVTSYDYDAPINEYGHA-TPKFWELRK 385


>gi|223982755|ref|ZP_03632983.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
           12042]
 gi|223965255|gb|EEF69539.1| hypothetical protein HOLDEFILI_00257 [Holdemania filiformis DSM
           12042]
          Length = 592

 Score =  155 bits (393), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 155/326 (47%), Gaps = 34/326 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   L SG++HY R  PE W + + K K  G   +ETY+ WNYHEP +GQ+ F G
Sbjct: 10  FMLDGQPVKLISGALHYFRIVPEYWQDRLEKLKNMGCNCVETYIPWNYHEPKKGQFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
           R D+ RFV+  Q  GL++ LR  PY CAEW +GG P WL     ++ R+T  P+ + +  
Sbjct: 70  RKDVARFVRKAQALGLWVILRPTPYICAEWEFGGLPAWLLADDSMRVRSTYQPYLDAVDA 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN-------------VEWAYGVGGELYV- 177
           + A++  +++   LF + GGP+++ Q+ENEYG+             +   +G    ++  
Sbjct: 130 YYAELFKVIRP--LFFTHGGPVLMCQIENEYGSFGNDKQYLKAIKRLMEKHGCDVPMFTS 187

Query: 178 ----KWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
               +   D    LN  V           D  I     F  D    N    P+M  E + 
Sbjct: 188 DGGWREVLDAGTLLNEGV-LPTANFGSRTDEQIGALRQFMND----NDIHGPLMCMEFWI 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTN------FGRTAGGPLVA 287
           GWF ++G  +  R  ++ A  +      G    N YM+ GGTN           G     
Sbjct: 243 GWFNNWGSPLKTRDAKEAADELDAMLRQGSV--NIYMFHGGTNPEFYNGCSYHNGMDPQI 300

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
           TSYDY AP+ E+G     K+   RE+
Sbjct: 301 TSYDYAAPLTEWG-TEAEKYAAFREV 325


>gi|311281324|ref|YP_003943555.1| glycoside hydrolase [Enterobacter cloacae SCF1]
 gi|308750519|gb|ADO50271.1| glycoside hydrolase family 35 [Enterobacter cloacae SCF1]
          Length = 591

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 152/313 (48%), Gaps = 30/313 (9%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
            + L+ DGK   L SG+IHY R  P+ W   +   K  G   +ETY+ WN H+P   ++ 
Sbjct: 7   EKNLLQDGKPVQLISGAIHYFRLVPQYWEHSLNNLKALGANCVETYLPWNIHQPDPERFC 66

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F G  D+ RF+   Q  GLF+ LR  PY CAEW +GG P WL   P ++ R++   F + 
Sbjct: 67  FTGMADVERFIALAQRKGLFVILRPSPYICAEWEFGGLPAWLLRDPSMRVRSSQPAFLQA 126

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
           ++R+ A+++  +        +GGP+++ Q+ENEYG    ++G   + Y++  A       
Sbjct: 127 VERYYAELLPRLAPWQY--DRGGPVVMMQLENEYG----SFG-NDKAYLRTLAAMMRRYG 179

Query: 189 TSVP-------WVMCQQEDA--PDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSG 234
            SVP       W    Q  +   D ++ T N         D      P +P+M  E ++G
Sbjct: 180 VSVPLFTSDGAWQEALQAGSLCEDNVLATANFGSRSAESLDNLAAFQPERPLMCLEFWNG 239

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------A 287
           WF  +G A+  R  +D+   +           N YM+ GGTNFG   G  +         
Sbjct: 240 WFNRYGDAIIRRDADDVGQEIRTLLTRASI--NIYMFQGGTNFGFMNGCSVRGDKDLPQV 297

Query: 288 TSYDYDAPIDEYG 300
           TSYDYDA + E+G
Sbjct: 298 TSYDYDALLSEWG 310



 Score = 39.3 bits (90), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 81/197 (41%), Gaps = 50/197 (25%)

Query: 512 LNEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVE 569
           L E  N LD+L   +G  NYG          GL   ++IDL      L +   I+ + ++
Sbjct: 432 LREADNVLDLLIENMGRVNYGPRLLAPTQRKGLRGGLVIDLH-----LETDWDIFPLPLD 486

Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
                +D +    S+ W+        +   +Y+  F A +      L+  S+GKG A++N
Sbjct: 487 N----IDDVDF--SAGWQP-------QQPAFYEYCF-AIDSPADTFLDTRSLGKGVAFIN 532

Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
           G ++GRYW               YRG             P   LY IP   +  GEN L+
Sbjct: 533 GFNLGRYW---------------YRG-------------PLGYLY-IPAPLLKQGENRLI 563

Query: 690 IHEELGGDPSKISLLTK 706
           I E  G +   ++LL K
Sbjct: 564 IFETEGVEVGALALLNK 580


>gi|297204198|ref|ZP_06921595.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
 gi|197714112|gb|EDY58146.1| beta-galactosidase [Streptomyces sviceus ATCC 29083]
          Length = 588

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 110/328 (33%), Positives = 156/328 (47%), Gaps = 29/328 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T      ++ G+   + SG++HY R  P+ W + +RK++  GL  IETY+ WN HEP  
Sbjct: 7   LTTSSDGFLLHGEPFRIISGAMHYFRIHPDQWTDRLRKARLMGLNTIETYLPWNLHEPEP 66

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G    +G  DL R+++  Q+ GL + LR GP+ CAEW+ GG P WL   P I+ R+++  
Sbjct: 67  GTLVLDGFLDLPRWLRLAQDEGLHVLLRPGPFICAEWDDGGLPAWLLADPDIRLRSSDPR 126

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           F      +L +++  ++     A+ GGP+I  QVENEYG    AYG     Y+K      
Sbjct: 127 FTGAFDGYLDQLLPALRP--FMAAHGGPVIAVQVENEYG----AYG-DDTAYLKHVHQAL 179

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCD------------GFTPNSPSKPIMWTENY 232
            +         C Q  A      T  G                    + P  P+M +E +
Sbjct: 180 RDRGVEELLYTCDQASAEHLAAGTLPGTLATATFGSRVEENLAALRTHQPEGPLMCSEFW 239

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PL 285
            GWF  +G     R   D A  + R    G +  N YM+ GGTNFG T G        P 
Sbjct: 240 VGWFDHWGGPHHVRSAADAAADLDRLLSAGASV-NIYMFHGGTNFGFTNGANHKHAYEPT 298

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLREL 313
           V TSYDYDAP+ E G    PK+   RE+
Sbjct: 299 V-TSYDYDAPLTESG-DPGPKYHAFREV 324


>gi|294672870|ref|YP_003573486.1| beta-galactosidase [Prevotella ruminicola 23]
 gi|294473700|gb|ADE83089.1| putative beta-galactosidase [Prevotella ruminicola 23]
          Length = 787

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 107/337 (31%), Positives = 163/337 (48%), Gaps = 37/337 (10%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + + T  ++  +++G+  V+++  +HYPR     W   I+  K  G+  +  YVFWN HE
Sbjct: 21  AGDFTVGNKTFLLNGEPFVVKAAEVHYPRIPRPYWEHRIKMCKALGMNTLCIYVFWNIHE 80

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F    D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R  
Sbjct: 81  QREGQFDFTDNNDVAEFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLRER 140

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-----------EWAYG 170
           +  F E +K F  K+ + +    L    GGPII+ QVENEYG+            +   G
Sbjct: 141 DPYFLERVKIFEQKVGEQLAP--LTIQNGGPIIMVQVENEYGSYGEDKPYVSEIRDCLRG 198

Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPS 223
           + GE    +  D + N           + +  D ++ T N   G   D          P+
Sbjct: 199 IYGEKLTLFQCDWSSNF----------ERNGLDDLVWTMNFGTGANIDHEFARLKQLRPN 248

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
            P+M +E +SGWF  +G     RP +D+   +        +F + YM  GGT+FG  AG 
Sbjct: 249 APLMCSEFWSGWFDKWGANHETRPAKDMVDGMDEMLSKNISF-SLYMTHGGTSFGHWAGA 307

Query: 284 --PLVA---TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
             P  A   TSYDYDAPI+EYG   + K+  LR++ +
Sbjct: 308 NSPGFAPDVTSYDYDAPINEYGGTTE-KFFQLRKMMQ 343


>gi|299148656|ref|ZP_07041718.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
 gi|298513417|gb|EFI37304.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
          Length = 778

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 158/323 (48%), Gaps = 34/323 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
              I+GK   L  G +HYPR   E W + +++++  GL  +  YVFWN+HE   G++ F 
Sbjct: 38  TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 97

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F++T QE GL++ LR GPY CAEW++GG+P WL     + +R+ +  F    +
Sbjct: 98  GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157

Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           R+   I +L KQ + L  + GG II+ QVENEYG+     G     Y+    D       
Sbjct: 158 RY---IKELGKQLSPLTINNGGNIIMVQVENEYGSYAADKG-----YLAAIRDMIKEAGF 209

Query: 190 SVPWVMCQ-----QEDAPDPIINTCNGFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
           +VP   C      +    +  + T NG + +             P    E Y  WF  +G
Sbjct: 210 NVPLFTCDGGGQVEAGHTEGALPTLNGVFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWG 269

Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
               +V + RP E L + ++      G   + YM+ GGTNF  T G           TSY
Sbjct: 270 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 324

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAP+ E+G    PK+   RE+
Sbjct: 325 DYDAPLGEWGNC-YPKYHAFREV 346



 Score = 48.5 bits (114), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 111/274 (40%), Gaps = 52/274 (18%)

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           ++ F+  E K   S   +F +   +E + + +D      Y      +   GK+  + I+ 
Sbjct: 366 TTTFATVELKESASLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 424

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
           L   A++ ++ K VA     ++  +  +N    +++   TL+IL    G  NYG      
Sbjct: 425 LRDYAVILIDGKQVASLDRRYNQNSMTLN----VSKTPATLEILVENTGRVNYGPDILFN 480

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
             G+ S +L     G   L+        G     + L K  ++   F +    +P     
Sbjct: 481 RKGITSQVLW----GNEKLT--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 524

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            ++K TF   E KG   ++++  GKG  WVNG+S+GR+W+                    
Sbjct: 525 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 563

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                     P QTLY +P  W+  GEN +V+ E
Sbjct: 564 ---------GPQQTLY-LPAPWLKEGENEIVVFE 587


>gi|411007376|ref|ZP_11383705.1| beta-galactosidase [Streptomyces globisporus C-1027]
          Length = 606

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 111/314 (35%), Positives = 152/314 (48%), Gaps = 43/314 (13%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DGK   L SG++HY R   E W   +      GL  +ETYV WN HEP  G+    G   
Sbjct: 14  DGKPVRLLSGALHYFRVHEEQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG--A 71

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L RF+  V+ AGL+  +R GPY CAEW  GG PVW+    G + RT +  ++  ++R+  
Sbjct: 72  LGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAEYRAVVERWFR 131

Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWV 194
           +++  + Q  +   +GGP+IL Q ENEYG+    +G    +Y++W A        +VP  
Sbjct: 132 ELLPQVVQRQVV--RGGPVILVQAENEYGS----FGSDA-VYLEWLAGLLRECGVTVPLF 184

Query: 195 MCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWFL 237
                D P+           ++ T N       GF       + P  P+M  E + GWF 
Sbjct: 185 TS---DGPEDHMLTGGSVPGLLATANFGSGAREGFEV--LRRHQPKGPLMCMEFWCGWFD 239

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPL-------V 286
            +G     R  E+ A A+    E G +  N YM  GGTNF    G   GGPL        
Sbjct: 240 HWGAEPVLRDAEEAAGALREILECGASV-NVYMAHGGTNFAGWAGANRGGPLQDGEFQPT 298

Query: 287 ATSYDYDAPIDEYG 300
            TSYDYDAP+DEYG
Sbjct: 299 VTSYDYDAPVDEYG 312


>gi|297727459|ref|NP_001176093.1| Os10g0340600 [Oryza sativa Japonica Group]
 gi|255679317|dbj|BAH94821.1| Os10g0340600 [Oryza sativa Japonica Group]
          Length = 143

 Score =  155 bits (393), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 67/104 (64%), Positives = 86/104 (82%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V+YD R+L++DG+RR++ SGSIHYPRSTPE+WP+LI+K+KEGGL  IETYVFWN HEP R
Sbjct: 31  VSYDGRSLILDGERRIVISGSIHYPRSTPEMWPDLIKKAKEGGLNAIETYVFWNGHEPRR 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPV 108
            ++ FEG +D+VRF K +Q AG++  LRIGPY C EWNYG  P+
Sbjct: 91  REFNFEGNYDVVRFFKEIQNAGMYAILRIGPYICGEWNYGYMPM 134


>gi|306832839|ref|ZP_07465973.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
 gi|304424978|gb|EFM28110.1| beta-galactosidase [Streptococcus bovis ATCC 700338]
          Length = 595

 Score =  155 bits (392), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 112/320 (35%), Positives = 152/320 (47%), Gaps = 43/320 (13%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
            +  +DGK   + SGSIHY R  P+ W + +   K  G   +ETYV WN HEP  G++ F
Sbjct: 8   ESFFLDGKPFKILSGSIHYFRIHPDDWYQSLYNLKALGFNTVETYVPWNLHEPREGEFDF 67

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G  DL RF+   QE GL+  +R  PY CAEW +GG P WL    G++ R+ +  F + +
Sbjct: 68  TGILDLERFLTIAQELGLYAIVRPSPYICAEWEFGGLPAWL-LEKGVRVRSQDKGFLQVV 126

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           KR+   +I  + +  L   QGG I++ QVENEYG    +YG   ++Y++      + L  
Sbjct: 127 KRYYEVLIPRLIKHQL--DQGGNILMFQVENEYG----SYG-EDKVYLRELKQMMLELGL 179

Query: 190 SVPWVMCQQEDAP------------DPIINTCNGFYCDG----------FTPNSPSKPIM 227
             P+      D P            D ++ T N F              F       P+M
Sbjct: 180 EEPFFTS---DGPWHTALRAGSLIEDDVLVTGN-FGSKAKENFASMEMFFQQYGKKWPLM 235

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
             E + GWF  +G  V  R  E+LA AV    E G    N YM+ GGTNFG       R 
Sbjct: 236 CMEFWDGWFNRWGEPVIKRDPEELADAVMEAIEIGSI--NLYMFHGGTNFGFMNGCSARK 293

Query: 281 AGGPLVATSYDYDAPIDEYG 300
                  TSYDYDA +DE G
Sbjct: 294 QTDLPQVTSYDYDAILDEAG 313


>gi|198433885|ref|XP_002127100.1| PREDICTED: similar to galactosidase, beta 1-like 2 [Ciona
           intestinalis]
          Length = 658

 Score =  155 bits (392), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 112/339 (33%), Positives = 164/339 (48%), Gaps = 33/339 (9%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + +T   +   +DGK   + SG++HY R   E W + + K K  GL  IETYV WN HEP
Sbjct: 56  SGLTAQGKTFKLDGKPMTIISGAVHYFRMPREYWRDRLMKMKACGLNTIETYVPWNLHEP 115

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
           I G+Y F G  DLV F+    +   ++ LR GPY C+EW +GG P WL   P ++ RT  
Sbjct: 116 IPGKYNFTGDLDLVHFILLAHKLEFYVLLRPGPYICSEWEFGGLPSWLLRDPKMKVRTMY 175

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            P+   + ++   ++  +K   L    GGPII  Q++NEYG    +Y    + Y+ +  +
Sbjct: 176 PPYIAAVTKYFNYLLPFVKP--LQYQYGGPIIAFQLDNEYG----SYFKDAD-YLPYLKE 228

Query: 183 TAVN------LNTSVPWVMCQQEDAPDPIINTCNGFYCDG-FTPNS---PSKPIMWTENY 232
              N      L  S      +Q+  P  ++ T N    +  FT  S   P  P+M  E +
Sbjct: 229 FLQNKGIIELLFISDSIEGLRQQTIPG-VLKTVNFKRMENHFTDLSNMQPDAPLMVMEFW 287

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PL 285
           +GWF  +G       V++    +   F  GG+  N+YM+FGGTNFG   G          
Sbjct: 288 TGWFDWWGEKHHILTVQEFGETLNEIFSQGGSV-NFYMFFGGTNFGFMNGAYKDGTGFHA 346

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYL 324
             TSYDYDA I E G + +       +  KA ++ E Y 
Sbjct: 347 DITSYDYDALIAENGDLTE-------KYFKAKQIIEHYF 378


>gi|326922161|ref|XP_003207320.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-like [Meleagris
           gallopavo]
          Length = 643

 Score =  155 bits (392), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 115/350 (32%), Positives = 167/350 (47%), Gaps = 29/350 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + YD    V DG+     SGSIHY R     W + + K K  GL+ I+TYV WNYHE   
Sbjct: 18  IDYDCNCFVKDGRPFRYISGSIHYSRVPRYYWKDRLLKMKMAGLDAIQTYVPWNYHETQM 77

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G Y F G  DL  F++   E GL + LR GPY CAEW+ GG P WL     I  R++++ 
Sbjct: 78  GVYDFSGDRDLEYFLQLASETGLLVILRAGPYICAEWDMGGLPAWLLEKESIVLRSSDSD 137

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG--------- 172
           +   +++++  ++  MK        GGPII+ QVENEYG+    ++ Y            
Sbjct: 138 YLTAVEKWMGVLLPKMKPH--LYQNGGPIIMVQVENEYGSYFACDYDYLRSLLKIFRQHL 195

Query: 173 GELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
           G+  V +  D A   +    ++  +    + AP    N    F       + P+ P++ +
Sbjct: 196 GDEVVLFTTDGASQFHLKCGALQGLYATVDFAPGG--NVTAAFLAQ--RSSEPTGPLVNS 251

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--RTAGGPLVA 287
           E Y+GW   +G+     P + +A  +      G    N YM+ GGTNF     A  P ++
Sbjct: 252 EFYTGWLDHWGHRHAVVPSQTIAKTLNEILARGANV-NLYMFIGGTNFAYWNGANMPYMS 310

Query: 288 --TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLG 335
             TSYDYDAP+ E G + + K+  LRE+        E LI   PT  K  
Sbjct: 311 QPTSYDYDAPLSEAGDLTE-KYFALREVIGMYNQLPEGLIP--PTTSKFA 357


>gi|18410234|ref|NP_565051.1| beta-galactosidase 17 [Arabidopsis thaliana]
 gi|75163694|sp|Q93Z24.1|BGL17_ARATH RecName: Full=Beta-galactosidase 17; Short=Lactase 17; Flags:
           Precursor
 gi|16648842|gb|AAL25611.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
 gi|22655360|gb|AAM98272.1| At1g72990/F3N23_19 [Arabidopsis thaliana]
 gi|332197279|gb|AEE35400.1| beta-galactosidase 17 [Arabidopsis thaliana]
          Length = 697

 Score =  155 bits (392), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 113/332 (34%), Positives = 160/332 (48%), Gaps = 41/332 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DG R  +  G +HY R  PE W + + ++   GL  I+ YV WN HEP  G+  FEG  D
Sbjct: 73  DGNRFQIIGGDLHYFRVLPEYWEDRLLRANALGLNTIQVYVPWNLHEPKPGKMVFEGIGD 132

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI-PGIQFRTTNNPFKEEMKRFL 133
           LV F+K  ++    + LR GPY C EW+ GGFP WL  + P +Q RT++  + + ++R+ 
Sbjct: 133 LVSFLKLCEKLDFLVMLRAGPYICGEWDLGGFPAWLLAVKPRLQLRTSDPVYLKLVERWW 192

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEWAYGVGGELYVKWAAD 182
             +  L K   L  S GGP+I+ Q+ENEYG+           V  A G  G+  + +  D
Sbjct: 193 DVL--LPKVFPLLYSNGGPVIMVQIENEYGSYGNDKAYLRKLVSMARGHLGDDIIVYTTD 250

Query: 183 --TAVNLNT-SVP------WVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-PIMWTENY 232
             T   L+  +VP       V     D P PI      F       N+P + P + +E Y
Sbjct: 251 GGTKETLDKGTVPVADVYSAVDFSTGDDPWPIFKLQKKF-------NAPGRSPPLSSEFY 303

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA----- 287
           +GW   +G  +     E  A ++ +     G+    YM  GGTNFG   G    +     
Sbjct: 304 TGWLTHWGEKITKTDAEFTAASLEKILSRNGS-AVLYMVHGGTNFGFYNGANTGSEESDY 362

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               TSYDYDAPI E G I  PK+  L+ + K
Sbjct: 363 KPDLTSYDYDAPIKESGDIDNPKFQALQRVIK 394


>gi|237721434|ref|ZP_04551915.1| beta-galactosidase [Bacteroides sp. 2_2_4]
 gi|293370839|ref|ZP_06617384.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
 gi|229449230|gb|EEO55021.1| beta-galactosidase [Bacteroides sp. 2_2_4]
 gi|292634055|gb|EFF52599.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
          Length = 777

 Score =  155 bits (392), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 158/323 (48%), Gaps = 34/323 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
              I+GK   L  G +HYPR   E W + +++++  GL  +  YVFWN+HE   G++ F 
Sbjct: 38  TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 97

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F++T QE GL++ LR GPY CAEW++GG+P WL     + +R+ +  F    +
Sbjct: 98  GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157

Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           R+   I +L KQ + L  + GG II+ QVENEYG+     G     Y+    D       
Sbjct: 158 RY---IKELGKQLSPLTINNGGNIIMVQVENEYGSYAADKG-----YLAAIRDMIKEAGF 209

Query: 190 SVPWVMCQ-----QEDAPDPIINTCNGFYCDG----FTPNSPSKPIMWTENYSGWFLSFG 240
           +VP   C      +    +  + T NG + +             P    E Y  WF  +G
Sbjct: 210 NVPLFTCDGGGQVEAGHTEGALPTLNGVFGEDIFKVIDKYQKGGPYFVAEFYPAWFDEWG 269

Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
               +V + RP E L + ++      G   + YM+ GGTNF  T G           TSY
Sbjct: 270 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 324

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAP+ E+G    PK+   RE+
Sbjct: 325 DYDAPLGEWGNC-YPKYHAFREV 346



 Score = 48.5 bits (114), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 111/274 (40%), Gaps = 52/274 (18%)

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           ++ F+  E K   S   +F +   +E + + +D      Y      +   GK+  + I+ 
Sbjct: 366 TTTFATVELKESASLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 424

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
           L   A++ ++ K VA     ++  +  +N    +++   TL+IL    G  NYG      
Sbjct: 425 LRDYAVILIDGKQVASLDRRYNQNSMTLN----VSKTPATLEILVENTGRVNYGPDILFN 480

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
             G+ S +L     G   L+        G     + L K  ++   F +    +P     
Sbjct: 481 RKGITSQVLW----GNEKLT--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 524

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            ++K TF   E KG   ++++  GKG  WVNG+S+GR+W+                    
Sbjct: 525 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 563

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                     P QTLY +P  W+  GEN +V+ E
Sbjct: 564 ---------GPQQTLY-LPAPWLKEGENEIVVFE 587


>gi|393785841|ref|ZP_10373985.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
           CL02T12C05]
 gi|392660955|gb|EIY54552.1| hypothetical protein HMPREF1068_00265 [Bacteroides nordii
           CL02T12C05]
          Length = 605

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 111/324 (34%), Positives = 155/324 (47%), Gaps = 37/324 (11%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GR 72
           +D K   + SG IH  R   E W + I+  K  G   +  Y+ WNYHE   G + F+ G 
Sbjct: 41  LDDKPFQIISGEIHPSRIPAEYWKQRIQMIKAMGCNTVACYIMWNYHESEPGVFDFQTGN 100

Query: 73  FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF 132
            DL +F++TVQE  +FL  R GPY C EW++GG P +L   P I+ R  +  +   ++R+
Sbjct: 101 KDLEKFIRTVQEEDMFLLFRPGPYVCGEWDFGGLPAYLLSTPDIKIRCMDPRYTTAVERY 160

Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
              I  ++K+  +  + GGPII+ QVENEYG    +YG     Y+KW  D   +    VP
Sbjct: 161 ATAIAPIIKKYEV--TNGGPIIMVQVENEYG----SYG-NDRTYMKWIHDLWRDKGIEVP 213

Query: 193 WVMCQQEDAPDPII---NTCNGFYCDGFTPNS------------PSKPIMWTENYSGWFL 237
           +      D   P +    T  G    G  P +            P   +  +E Y GW  
Sbjct: 214 FYTA---DGATPYMLEAGTLPGVAI-GLDPAASKAEFDEALKVHPDASVFCSELYPGWLT 269

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----PLV----ATS 289
            +        +E +   V    + G +F NYY+  GGTNFG  AG     P +     TS
Sbjct: 270 HWRENWQHPSIEKITTDVKWLLDNGKSF-NYYVIHGGTNFGFWAGANSPQPGIYQPDVTS 328

Query: 290 YDYDAPIDEYGFIRQPKWGHLREL 313
           YDYDAPI+E G    PK+  LREL
Sbjct: 329 YDYDAPINEMG-QATPKYMALREL 351


>gi|383114571|ref|ZP_09935333.1| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
 gi|382948460|gb|EFS30558.2| hypothetical protein BSGG_1258 [Bacteroides sp. D2]
          Length = 775

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 110/323 (34%), Positives = 161/323 (49%), Gaps = 34/323 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
              I+GK   L  G +HYPR   E W + +++++  GL  +  YVFWN+HE   G++ F 
Sbjct: 36  TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFS 95

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F++T QE GL++ LR GPY CAEW++GG+P WL     + +R+ +  F    +
Sbjct: 96  GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 155

Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           R+   I +L KQ + L  + GG II+ QVENEYG    +Y    E Y+    D       
Sbjct: 156 RY---IKELGKQLSPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGF 207

Query: 190 SVPWVMCQ---QEDA--PDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFG 240
           +VP   C    Q +A   +  + T NG + +             P    E Y  WF  +G
Sbjct: 208 NVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWG 267

Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
               +V + RP E L + ++      G   + YM+ GGTNF  T G           TSY
Sbjct: 268 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 322

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAP+ E+G    PK+   RE+
Sbjct: 323 DYDAPLGEWGNC-YPKYHAFREV 344



 Score = 45.1 bits (105), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 109/274 (39%), Gaps = 52/274 (18%)

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           ++ F+  E K       +F     +E + + +D      Y      +   GK+  + I+ 
Sbjct: 364 TTTFATVELKESAPLRTAFHPTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 422

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
           L   A++ ++ K VA     ++  +  +N    +++   TL+IL    G  NYG      
Sbjct: 423 LRDYAVILIDGKQVASLDRRYNQNSVTLN----VSKTPATLEILVENTGRVNYGPDILFN 478

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
             G+ S +L     G   L+        G     + L K  ++   F +    +P     
Sbjct: 479 RKGITSQVLW----GNEKLT--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 522

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            ++K TF   E KG   ++++  GKG  WVNG+S+GR+W+                    
Sbjct: 523 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 561

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                     P QTLY +P  W+  GEN +V+ E
Sbjct: 562 ---------GPQQTLY-LPAPWLKEGENEIVVFE 585


>gi|317504905|ref|ZP_07962857.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
 gi|315663982|gb|EFV03697.1| family 35 glycosyl hydrolase [Prevotella salivae DSM 15606]
          Length = 784

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 110/328 (33%), Positives = 158/328 (48%), Gaps = 27/328 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            + T      +++G+  V+++  +HYPR     W + I+  K  G+  I  YVFWN HE 
Sbjct: 28  GDFTAGKNTFLLNGQPFVVKAAELHYPRIPRPYWDQRIKMCKALGMNTICLYVFWNIHEQ 87

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
              +Y F G  D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R  +
Sbjct: 88  QESKYDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREDD 147

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY------ 176
             F   +K F A++   +    L    GGPII+ QVENEYG    +YGV  +        
Sbjct: 148 PYFLARVKAFEAEVGRQLAP--LTIQNGGPIIMVQVENEYG----SYGVNKQYVSQIRDI 201

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWT 229
           VK +    V L     W    +++  D ++ T N   G   D          P  P+M +
Sbjct: 202 VKASGFDKVTL-FQCDWASNFEKNGLDDLLWTMNFGTGSNIDAQFKRLKQLRPETPLMCS 260

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
           E +SGWF  +G     RP + +   +        +F + YM  GGT+FG  AG   P  A
Sbjct: 261 EFWSGWFDKWGARHETRPAKAMVEGINEMLSKNISF-SLYMTHGGTSFGHWAGANSPGFA 319

Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRE 312
              TSYDYDAPI+EYG    PK+  LR+
Sbjct: 320 PDVTSYDYDAPINEYGHA-TPKFWELRK 346



 Score = 42.4 bits (98), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 58/241 (24%), Positives = 91/241 (37%), Gaps = 53/241 (21%)

Query: 465 MPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSM 524
           +P   K   L++      A +F++ KL+    G  D      + K+   +   TL IL  
Sbjct: 413 LPQIEKSSRLSLNEAHDYAQIFIDNKLI----GTIDRTKNEKSIKLPPVKQGATLTILIE 468

Query: 525 MVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSS--GEWI-------YQVGVEGEYIG 574
            +G  N+G A  D  G  +   + ID +    D+S     W+       YQ         
Sbjct: 469 AMGRINFGRAVKDFKG--ITESVTIDTEMNGHDVSYHLKNWVIAPIPDSYQTAQHA---- 522

Query: 575 LDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIG 634
            DK+   N  F    S +  +   I Y   +   +  G   LNL   GKGQ +VNG ++G
Sbjct: 523 FDKLDETNRCF----SPINFSSPSIGYYRGYFNLKKVGDTFLNLEQWGKGQVYVNGHALG 578

Query: 635 RYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
           R+W                               P QTLY +P  W+  G N +++ + +
Sbjct: 579 RFWRI----------------------------GPQQTLY-LPGCWLKKGRNEIIVMDIV 609

Query: 695 G 695
           G
Sbjct: 610 G 610


>gi|182414740|ref|YP_001819806.1| beta-galactosidase [Opitutus terrae PB90-1]
 gi|177841954|gb|ACB76206.1| Beta-galactosidase [Opitutus terrae PB90-1]
          Length = 799

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 112/338 (33%), Positives = 157/338 (46%), Gaps = 27/338 (7%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A ++DG+   ++ G +H PR   E W   ++  K  GL  +  Y+FWN HEP  G++ + 
Sbjct: 53  AFLLDGQPFQIRCGELHAPRVPREYWRHRLQMVKAMGLNTVCAYLFWNMHEPRPGEFDWS 112

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D   F +  Q AGL++ LR GPYACAEW  GG P WL     I+ RT +  F E  +
Sbjct: 113 GQADAAAFCREAQAAGLWVILRPGPYACAEWEMGGLPWWLLKHDEIKLRTRDPRFIEAAR 172

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
           R+L ++   +    L  S+GGPI++ QVENE+G     +      Y+       ++    
Sbjct: 173 RYLQEVGRELGP--LQVSRGGPILMVQVENEHG-----FYADDPAYMGDIRQALLDAGFD 225

Query: 191 VPWVMCQQEDA------PD--PIIN----TCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           VP   C           PD  P++N       GF         P+ P+M  E Y GWF +
Sbjct: 226 VPLFACNPTQQVRRGYRPDLFPVVNFGTDPAGGFRA--LREILPTGPLMCGEFYPGWFDT 283

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV----ATSYDYDA 294
           +G        E     +     TG +F + YM  GGT FG   G         +SYDYDA
Sbjct: 284 WGAPHHTGQTERYLTDLDYMLRTGASF-SIYMAHGGTTFGFWTGADRPFKPDTSSYDYDA 342

Query: 295 PIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQ 332
           PI E G+   PK+   R L     L EE L    P H+
Sbjct: 343 PISEAGWA-TPKFEQSRALLSKYLLPEETLPEPAPRHR 379



 Score = 42.7 bits (99), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 57/230 (24%), Positives = 87/230 (37%), Gaps = 53/230 (23%)

Query: 518 TLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDK 577
           TLDIL   +G  N+G            V L      +R+L  G  I+++ ++   +G   
Sbjct: 474 TLDILVEAMGRVNFGVEVHDRKGIHGPVTLTASGQPRRELR-GWQIFRLPLDQPMLG--- 529

Query: 578 ISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYW 637
            +L      +Q  T P      W  T  +  E  G   L++   GKG  WVNG ++GRYW
Sbjct: 530 -TLRYQPTGEQERTSPA--PAFWRATVKV--EQPGDCFLDMRPWGKGFVWVNGHNLGRYW 584

Query: 638 SAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
           +                              P QT+Y +P  W+  G+N +V+ + +G  
Sbjct: 585 NI----------------------------GPQQTMY-VPAPWLKAGDNEIVVLDLIGPA 615

Query: 698 PSKISLLTKTGQHICSFVSEADPPPVDSWKPNLGVV-SSSPQVRLACERG 746
              I+ L              D P +D  +P L    S   QV L  + G
Sbjct: 616 NPVIAAL--------------DQPILDQLRPKLDFAPSRRRQVTLRADFG 651


>gi|195069729|ref|XP_001997012.1| GH25263 [Drosophila grimshawi]
 gi|193895091|gb|EDV93957.1| GH25263 [Drosophila grimshawi]
          Length = 619

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/320 (33%), Positives = 154/320 (48%), Gaps = 28/320 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y++   + DG+     SGS HY R+ PE W   +R  +  GL  + TYV W+ H P  
Sbjct: 28  VDYENDRFLKDGQPFRFISGSFHYFRAHPETWSRHLRTMRAAGLNAVTTYVEWSLHNPRD 87

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
           G Y + G  DL RF++   +  L + LR GPY CAE + GGFP W L   PGIQ RT + 
Sbjct: 88  GVYVWTGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLKKYPGIQLRTADI 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD- 182
            +  E++ + A++  +++        GGPII+ QVENEYG    +Y      Y  W  D 
Sbjct: 148 NYLSEVRIWYAQL--MVRMSPFLYGNGGPIIMVQVENEYG----SYFACDVNYRNWLRDE 201

Query: 183 TAVNLNTSVPWV-MCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGY 241
           T  ++N       +C   +  D                  P  P++  E Y GW   +  
Sbjct: 202 TQSHVNGCFGHNGLCATSNLKDTWAR---------LRRFEPKGPLVNAEYYPGWLTHWTE 252

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG------GPLVA--TSYDYD 293
            +     + +        E+G +  N+YM++GGTNFG TAG      G  +A  TSYDYD
Sbjct: 253 PMANVSTDSITGTFIDMLESGASV-NFYMFYGGTNFGFTAGANDNNPGKYIADITSYDYD 311

Query: 294 APIDEYGFIRQPKWGHLREL 313
           AP+ E G    PK+  LR +
Sbjct: 312 APMTEAG-DPTPKYMALRRI 330


>gi|332879232|ref|ZP_08446929.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
           F0087]
 gi|357048073|ref|ZP_09109651.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
 gi|332682652|gb|EGJ55552.1| putative beta-galactosidase [Capnocytophaga sp. oral taxon 329 str.
           F0087]
 gi|355529138|gb|EHG98592.1| putative beta-galactosidase [Paraprevotella clara YIT 11840]
          Length = 786

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 105/326 (32%), Positives = 159/326 (48%), Gaps = 33/326 (10%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           ++  +++GK  ++++  +HYPR     W + I+  K  G+  +  YVFWN HE   G++ 
Sbjct: 40  NKTFLLNGKPFIIKAAEVHYPRIPRPYWEQRIKMCKALGMNTLCLYVFWNIHEQEEGKFD 99

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F G  D+  F++  QE GL++ +R GPY CAEW  GG P WL     I+ R  +  F E 
Sbjct: 100 FTGNNDVAEFIRLAQENGLYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYFMER 159

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV---------------EWAYGVGG 173
            + F  K+ + +   +L   +GGPII+ QVENEYG+                +  +    
Sbjct: 160 YRIFAQKLGEQIG--DLTIEKGGPIIMVQVENEYGSYGEDKPYVSAIRDIIRDSGFDKVT 217

Query: 174 ELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
                W+++   N    + W M     A     N  N F   G     P  P M +E +S
Sbjct: 218 LFQCDWSSNFTKNGLDDLVWTMNFGTGA-----NIENEFKKLGEL--RPESPQMCSEFWS 270

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVA 287
           GWF  +G     R  +++   +    + G +F + YM  GGT++G  AG       P V 
Sbjct: 271 GWFDKWGGRHETRGSKEMVGGLKEMLDKGISF-SLYMTHGGTSWGHWAGANSPGFSPDV- 328

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
           TSYDYDAPI+E G +  PK+  LRE+
Sbjct: 329 TSYDYDAPINEAGQV-TPKYMELREM 353



 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 66/253 (26%), Positives = 98/253 (38%), Gaps = 66/253 (26%)

Query: 466 PGQGKEVFLNIESLGHAALVFVNKKLV-AFGYGNHDFANFLINKKIELNEGINTLDILSM 524
           P    +  L I      A VF+N KL+ +    NH+    L   K    EG + LDIL  
Sbjct: 418 PAVPTQSVLTITDAHDFAQVFINGKLIGSIDRRNHEKTMLLPAMK----EG-DQLDILVE 472

Query: 525 MVGLQNYG-AWFDVAG----------AGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYI 573
            +G  N+G A  D  G              S + ++LKN +    S    YQV  + +Y+
Sbjct: 473 AMGRINFGRAIKDFKGITEKVELSYTMNTGSQVTVNLKNWQIYTLSDS--YQVQKDMKYV 530

Query: 574 GLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSI 633
            L    +                    Y+ TF   +  G   LNL + GKGQ +VNG +I
Sbjct: 531 PLKDQKVPGC-----------------YRATFNLKK-TGDTFLNLETWGKGQVYVNGHAI 572

Query: 634 GRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
           GR+W                               P QTLY +P  W+  GEN +++ + 
Sbjct: 573 GRFWKI----------------------------GPQQTLY-MPGCWLKKGENEIIVQDI 603

Query: 694 LGGDPSKISLLTK 706
           +G   + +  L+K
Sbjct: 604 VGPQETVVEGLSK 616


>gi|228918502|ref|ZP_04081945.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
           4CC1]
 gi|228841118|gb|EEM86317.1| Beta-galactosidase [Bacillus thuringiensis serovar pulsiensis BGSC
           4CC1]
          Length = 591

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 153/317 (48%), Gaps = 36/317 (11%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  ++DG+   + SG++HY R  PE W   +   K  G   +ETYV WN HEP  G + F
Sbjct: 8   KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGVFNF 67

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
           EG  DLV++V+  Q+ GL + LR  PY CAEW +GG P WL     I+ R+  N F  ++
Sbjct: 68  EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYRDIRVRSNTNLFLNKV 127

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           + F   ++ L+   +L    GGPII+ QVENEYG    ++G   E YV+       +L  
Sbjct: 128 ENFYKVLLPLVT--SLQVENGGPIIMMQVENEYG----SFGNDKE-YVRSIKKLMRDLGV 180

Query: 190 SVP-------WVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTEN 231
           +VP       W    +  +   D ++ T N                  N    P+M  E 
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNALESFIKENKKEWPLMCMEF 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
           + GWF  +G  +  R   +LA  V    +      N+YM+ GGTNFG   G         
Sbjct: 241 WDGWFNRWGMEIIRRDSSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298

Query: 284 PLVATSYDYDAPIDEYG 300
           P + TSYDYDA + E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314


>gi|333377694|ref|ZP_08469427.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
           22836]
 gi|332883714|gb|EGK03994.1| hypothetical protein HMPREF9456_01022 [Dysgonomonas mossii DSM
           22836]
          Length = 630

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 109/331 (32%), Positives = 159/331 (48%), Gaps = 35/331 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   + SG +HYPR   + W   ++  K  GL  + TYVFWN HEP  G++ F G
Sbjct: 35  FVYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNIHEPEPGKWDFTG 94

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             +L  ++K   E GL + LR GPY CAEW +GG+P WL  + G++ R  N  F +  + 
Sbjct: 95  DKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEGLELRRDNEQFLKYTQL 154

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS 190
           ++ ++   +   NL  ++GGPI++ Q ENE+G+ V     +  E + ++ A     L  +
Sbjct: 155 YINRLYKEVG--NLQITKGGPIVMVQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQLKDA 212

Query: 191 ---VP-------WVMCQQEDAPDPIINTCNG--------FYCDGFTPNSPSKPIMWTENY 232
              VP       W+   +  A    + T NG           D +  N    P M  E Y
Sbjct: 213 GFDVPSFTSDGSWLF--EGGAVPGALPTANGESNIENLKKAVDKY--NGGQGPYMVAEFY 268

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA----- 287
            GW   +    P      +A    ++ +   +  NYYM  GGTNFG T+G          
Sbjct: 269 PGWLAHWLEPHPQISATSIARQTEKYLQNNVSI-NYYMVHGGTNFGFTSGANYDKKHDIQ 327

Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
              TSYDYDAPI E G++  PK+  LR + K
Sbjct: 328 PDLTSYDYDAPISEAGWV-TPKYDSLRNVIK 357



 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 69/225 (30%), Positives = 100/225 (44%), Gaps = 50/225 (22%)

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I  L   A+++ N + V  G  N  F    I+  I  N   +TL+IL   +G  NYG+
Sbjct: 427 LKINGLRDYAIIYANDEKV--GELNRYFNQDSIDVDIPFN---STLEILVENMGRINYGS 481

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTL 592
                  G+ S ++I   NG      G+W +YQ+ ++ E     K+   NS F   G+T 
Sbjct: 482 EIVHNTKGIISPVII---NGME--IEGDWQMYQIPMD-EAPDFSKMQ-KNSVF---GNTE 531

Query: 593 PVNKSLI----WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
              K L+     YK TF   E  G   L++   GKG  ++NG++IGRYW           
Sbjct: 532 SAAKRLLGAPALYKGTFNLTE-TGDTFLDMEDWGKGIVFINGKNIGRYW----------- 579

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEE 693
                           H G P QTLY +P  W+  G+N +VI E+
Sbjct: 580 ----------------HVG-PQQTLY-VPGVWLKKGQNEIVIFEQ 606


>gi|260592848|ref|ZP_05858306.1| beta-galactosidase [Prevotella veroralis F0319]
 gi|260535218|gb|EEX17835.1| beta-galactosidase [Prevotella veroralis F0319]
          Length = 621

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 111/327 (33%), Positives = 158/327 (48%), Gaps = 30/327 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY-YFE 70
            + DGK   + SG +HY R     W   ++  K  GL  + TY+FWN+HE   G + +  
Sbjct: 36  FIYDGKPIQIHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIFWNHHETSPGVWDWTT 95

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  +L +F+KT  E GL + LR GPY CAEW +GG+P WL     +  RT N PF +  +
Sbjct: 96  GTHNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKDLVIRTDNKPFLDSCR 155

Query: 131 RFLAKIIDLMKQE-NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNL- 187
            ++ +   L KQ  +L  +QGGP+I+ Q ENE+G+ V     +  E + ++AA     L 
Sbjct: 156 VYINQ---LAKQVLDLQVTQGGPVIMVQAENEFGSYVAQRKDIPLETHKRYAAQIRQQLL 212

Query: 188 --NTSVPWVMCQ-----QEDAPDPIINTCNGF-YCDGFTP-----NSPSKPIMWTENYSG 234
               +VP          +  A +  + T NG    D         +    P M  E Y G
Sbjct: 213 DAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDIDKLKKVVNEYHGGVGPYMVAEFYPG 272

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------- 286
           W   +    P    E +     ++ + G +F NYYM  GGTNFG +AG            
Sbjct: 273 WLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGGTNFGFSAGANYSNATNIQPD 331

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLREL 313
            TSYDYDAPI E G+   PK+  LR+L
Sbjct: 332 MTSYDYDAPISEAGWA-TPKYNALRDL 357



 Score = 44.3 bits (103), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 58/225 (25%), Positives = 91/225 (40%), Gaps = 57/225 (25%)

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNY 531
            + ++ L   A+V+VN      G    +         +E++   N TLDIL   +G  NY
Sbjct: 430 LMRMKGLADYAVVYVN------GEKKGELNKVFDKDSMEIDIPFNSTLDILVENMGRINY 483

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG-- 589
           GA    +  G+   I ID      +  +GEW        +   L   S+ +++    G  
Sbjct: 484 GARIVQSSKGITRPITID-----DNEITGEW--------QMYPLPMASMPDTNRLPAGYK 530

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           + LPV      Y  +F   +  G   L++A  GKG  +VNG ++GRYW            
Sbjct: 531 AGLPV-----LYSGSFNL-DKVGDTFLDMAQWGKGIVFVNGINLGRYWKV---------- 574

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
                              P QTLY +P  ++  G+N +VI E+L
Sbjct: 575 ------------------GPQQTLY-LPGCYLKKGKNDIVIFEQL 600


>gi|301767332|ref|XP_002919083.1| PREDICTED: beta-galactosidase-like [Ailuropoda melanoleuca]
          Length = 668

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 112/344 (32%), Positives = 161/344 (46%), Gaps = 26/344 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y H   + DG+     SGSIHY R     W + + K K  GL  I++YV WN+HEP  
Sbjct: 35  IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 94

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++  
Sbjct: 95  GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWAYGV 171
           +   + ++L  ++  MK   L    GGPII  QVENEYG+               + Y +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFSCDYDHLRFLQKLFHYHL 212

Query: 172 GGELYVKWAADTAVNLNTSVPWVMCQQEDAP-DPIINTCNGFYCDGFTPNSPSKPIMWTE 230
           G ++ + +  D A  +      +          P  N    F       + P  P++ +E
Sbjct: 213 GNDVLL-FTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQ--RKSEPRGPLVNSE 269

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----V 286
            Y+GW   +G        E +A A+      G    N YM+ GGTNF    G  +     
Sbjct: 270 FYTGWLDHWGQPHSTAKTEVVASALHEILSRGANV-NLYMFIGGTNFAYWNGANMPYQAQ 328

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
            TSYDYDAP+ E G + + K+  LR+ + K  K+ E ++  S P
Sbjct: 329 PTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGFIPPSTP 371


>gi|167856235|ref|ZP_02478970.1| beta-galactosidase [Haemophilus parasuis 29755]
 gi|167852655|gb|EDS23934.1| beta-galactosidase [Haemophilus parasuis 29755]
          Length = 596

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 156/320 (48%), Gaps = 40/320 (12%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
            +  +++GK   + SG++HY R  PE W + +   K  G   +ETYV WN H+P   Q+ 
Sbjct: 7   EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F  R DLV+F++T ++ GL++ LR  PY CAEW +GG P WL  IP I+ R  +  F  E
Sbjct: 67  FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
           + R+  +++  +    +  +QGG I++ Q+ENEYG    ++G   + Y++      +   
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYG----SFG-NDKNYLRAILALMLIHG 179

Query: 189 TSVP-------WVMCQQEDA--PDPIINTCN------------GFYCDGFTPNSPSKPIM 227
            +VP       W    +  A   D I+ T N              Y D    +  S P+M
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
             E + GWF  +   V  R  +DLA       E      N+YM+ GGTNFG       R 
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLADCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294

Query: 281 AGGPLVATSYDYDAPIDEYG 300
                  TSYDYDAP+ E+G
Sbjct: 295 DTDLPQVTSYDYDAPVHEWG 314


>gi|160887166|ref|ZP_02068169.1| hypothetical protein BACOVA_05182 [Bacteroides ovatus ATCC 8483]
 gi|156107577|gb|EDO09322.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
          Length = 777

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 110/323 (34%), Positives = 160/323 (49%), Gaps = 34/323 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
              I+GK   L  G +HYPR   E W + ++++   GL  +  YVFWN+HE   G++ F 
Sbjct: 38  TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFS 97

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F++T QE GL++ LR GPY CAEW++GG+P WL     + +R+ +  F    +
Sbjct: 98  GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 157

Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           R+   I +L KQ + L  + GG II+ QVENEYG    +Y    E Y+    D       
Sbjct: 158 RY---IKELGKQLSPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGF 209

Query: 190 SVPWVMCQ---QEDA--PDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFG 240
           +VP   C    Q +A   +  + T NG + +             P    E Y  WF  +G
Sbjct: 210 NVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWG 269

Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
               +V + RP E L + ++      G   + YM+ GGTNF  T G           TSY
Sbjct: 270 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 324

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAP+ E+G    PK+   RE+
Sbjct: 325 DYDAPLGEWGNCY-PKYHAFREV 346



 Score = 46.2 bits (108), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 52/274 (18%)

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           ++ F+  E K       +F +   +E + + +D      Y      +   GK+  + I+ 
Sbjct: 366 TTTFATVELKESAPLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 424

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
           L   A++ ++ K VA     ++  +  +N    +++   TL+IL    G  NYG      
Sbjct: 425 LRDYAVILIDGKQVASLDRRYNQNSVTLN----VSKTPATLEILVENTGRVNYGPDILFN 480

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
             G+ S +L     G   L+        G     + L K  ++   F +    +P     
Sbjct: 481 RKGITSQVLW----GNEKLA--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 524

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            ++K TF   E KG   ++++  GKG  WVNG+S+GR+W+                    
Sbjct: 525 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 563

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                     P QTLY +P  W+  GEN +V+ E
Sbjct: 564 ---------GPQQTLY-LPAPWLKEGENEIVVFE 587


>gi|195054633|ref|XP_001994229.1| GH23545 [Drosophila grimshawi]
 gi|193896099|gb|EDV94965.1| GH23545 [Drosophila grimshawi]
          Length = 639

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 112/335 (33%), Positives = 158/335 (47%), Gaps = 36/335 (10%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V Y++   + DG+     SGS HY R+ PE W   +R  +  GL  + TYV W+ H P 
Sbjct: 27  TVDYENDRFLKDGQPFRFISGSFHYFRAHPETWSRHLRTMRAAGLNAVTTYVEWSLHNPR 86

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
            G Y + G  DL RF++   +  L + LR GPY CAE + GGFP W L   PGIQ RT +
Sbjct: 87  DGVYVWTGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLKKYPGIQLRTAD 146

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             +  E++ + A++  +++        GGPII+ QVENEYG    +Y      Y  W  D
Sbjct: 147 INYLSEVRIWYAQL--MVRMSPFLYGNGGPIIMVQVENEYG----SYFACDVNYRNWLRD 200

Query: 183 -TAVNLNTSVPWVMCQQEDAPDPI----INTCNGFYCDGFTPN-----------SPSKPI 226
            T  ++N      +    D P  +    I         G T N            P  P+
Sbjct: 201 ETQSHVNGK---AVLFTNDGPSVLRCGKIQGVLATMDFGATSNLKDTWARLRRFEPKGPL 257

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---- 282
           +  E Y GW   +   +     + +        E+G +  N+YM++GGTNFG TAG    
Sbjct: 258 VNAEYYPGWLTHWTEPMANVSTDSITGTFIDMLESGASV-NFYMFYGGTNFGFTAGANDN 316

Query: 283 --GPLVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
             G  +A  TSYDYDAP+ E G    PK+  LR +
Sbjct: 317 NPGKYIADITSYDYDAPMTEAG-DPTPKYMALRRI 350


>gi|281352249|gb|EFB27833.1| hypothetical protein PANDA_007660 [Ailuropoda melanoleuca]
          Length = 626

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 112/344 (32%), Positives = 161/344 (46%), Gaps = 26/344 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y H   + DG+     SGSIHY R     W + + K K  GL  I++YV WN+HEP  
Sbjct: 8   IDYSHNRFLKDGRPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQSYVPWNFHEPQP 67

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++  
Sbjct: 68  GQYQFSGEHDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 127

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWAYGV 171
           +   + ++L  ++  MK   L    GGPII  QVENEYG+               + Y +
Sbjct: 128 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFSCDYDHLRFLQKLFHYHL 185

Query: 172 GGELYVKWAADTAVNLNTSVPWVMCQQEDAP-DPIINTCNGFYCDGFTPNSPSKPIMWTE 230
           G ++ + +  D A  +      +          P  N    F       + P  P++ +E
Sbjct: 186 GNDVLL-FTTDGAHEMFLKCGALQGLYATVDFGPGANITAAFEIQ--RKSEPRGPLVNSE 242

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----V 286
            Y+GW   +G        E +A A+      G    N YM+ GGTNF    G  +     
Sbjct: 243 FYTGWLDHWGQPHSTAKTEVVASALHEILSRGANV-NLYMFIGGTNFAYWNGANMPYQAQ 301

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
            TSYDYDAP+ E G + + K+  LR+ + K  K+ E ++  S P
Sbjct: 302 PTSYDYDAPLSEAGDLTE-KYFALRDVIRKFEKVPEGFIPPSTP 344


>gi|383812458|ref|ZP_09967896.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
           F0472]
 gi|383355018|gb|EID32564.1| glycosyl hydrolase family 35 [Prevotella sp. oral taxon 306 str.
           F0472]
          Length = 608

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 110/327 (33%), Positives = 159/327 (48%), Gaps = 30/327 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY-YFE 70
            + DGK   + SG +HY R     W   ++  K  GL  + TY+FWN+HE   G + +  
Sbjct: 28  FIYDGKPTQIHSGEMHYARVPAPYWRHRMKMMKAMGLNAVATYIFWNHHETSPGVWDWST 87

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  +L +F+KT  E GL + LR GPY CAEW +GG+P WL     +  RT N PF +  +
Sbjct: 88  GTHNLRQFIKTAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKNKDLVIRTDNKPFLDSCR 147

Query: 131 RFLAKIIDLMKQE-NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTA---V 185
            ++ +   L KQ  +L  +QGGP+I+ Q ENE+G+ V     +  E + ++AA      +
Sbjct: 148 VYINQ---LAKQVLDLQVTQGGPVIMVQAENEFGSYVAQRKDIPLETHKRYAAQIRQLLL 204

Query: 186 NLNTSVPWVMCQ-----QEDAPDPIINTCNGF-YCDGFTP-----NSPSKPIMWTENYSG 234
           +   +VP          +  A +  + T NG    D         +    P M  E Y G
Sbjct: 205 DAGFTVPMFTSDGSWLFKGGAIEGALPTANGEGDIDKLKKVVNEYHGGVGPYMVAEFYPG 264

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------- 286
           W   +    P    E +     ++ + G +F NYYM  GGTNFG +AG            
Sbjct: 265 WLSHWAEPFPRVSTESVVKQTKKYLDNGISF-NYYMVHGGTNFGFSAGANYSNATNIQPD 323

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLREL 313
            TSYDYDAPI E G+   PK+  LR+L
Sbjct: 324 MTSYDYDAPISEAGW-ATPKYNALRDL 349



 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 58/225 (25%), Positives = 92/225 (40%), Gaps = 57/225 (25%)

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGIN-TLDILSMMVGLQNY 531
            + ++ L   A+V+VN      G    +         +E++   N TLDIL   +G  NY
Sbjct: 422 LMRMKGLADYAIVYVN------GEKKGELNKVFDKDSMEIDIPFNSTLDILVENMGRINY 475

Query: 532 GAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQG-- 589
           GA    +  G+   I ID      +  +GEW        +   L   S+ +++    G  
Sbjct: 476 GARIVESAKGITRPITID-----DNEITGEW--------QMYPLPMASMPDTNRLPAGYK 522

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           + +PV      Y  +F   E  G   L++A  GKG  +VNG ++GRYW            
Sbjct: 523 AGMPV-----LYSGSFNL-EKVGDTFLDMAHWGKGIVFVNGINLGRYWKV---------- 566

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
                              P QTLY +P  +++ G+N +VI E+L
Sbjct: 567 ------------------GPQQTLY-LPGCYLNKGKNDIVIFEQL 592


>gi|3025876|gb|AAC12775.1| lysosomal beta-galactosidase [Canis lupus familiaris]
          Length = 662

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 115/348 (33%), Positives = 164/348 (47%), Gaps = 32/348 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            + Y H   + DG+     SGSIHY R     W + + K K  GL  I+TYV WN+HEP 
Sbjct: 28  TIDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 87

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++ 
Sbjct: 88  PGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +   + ++L  ++  MK   L    GGPII  QVENEYG    +Y      Y+++    
Sbjct: 148 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL 201

Query: 184 -AVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYCD-GFTPNS-------------PSKPI 226
              +L   V  ++   + A +  +      G Y    F P +             P  P+
Sbjct: 202 FHHHLGNDV--LLFTTDGANEKFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGPL 259

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
           + +E Y+GW   +G        E +A ++      G    N YM+ GGTNF    G  + 
Sbjct: 260 VNSEFYTGWLDHWGQPHSTVRTEVVASSLHDILAHGANV-NLYMFIGGTNFAYWNGANMP 318

Query: 286 ---VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
                TSYDYDAP+ E G + + K+  LRE + K  K+ E ++  S P
Sbjct: 319 YQAQPTSYDYDAPLSEAGDLTE-KYFALREVIRKFEKVPEGFIPPSTP 365


>gi|228950355|ref|ZP_04112522.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
           4AJ1]
 gi|228809313|gb|EEM55767.1| Beta-galactosidase [Bacillus thuringiensis serovar monterrey BGSC
           4AJ1]
          Length = 591

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 156/317 (49%), Gaps = 36/317 (11%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  ++DG+   + SG++HY R  PE W   +   K  G   +ETYV WN HEP  G + F
Sbjct: 8   KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNIHEPKEGVFNF 67

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
           EG  DLV++V+  Q+ GL + LR  PY CAEW +GG P WL     I+ R+  N F +++
Sbjct: 68  EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLDKV 127

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           + F   ++ ++    L    GGPII+ QVENEYG    ++G   E YV+       +L+ 
Sbjct: 128 ENFYKVLLPMVTP--LQVENGGPIIMMQVENEYG----SFGNDKE-YVRSIKKIMRDLDV 180

Query: 190 SVP-------WVMCQQEDA--PDPIINTCN-GFYCDG--------FTPNSPSKPIMWTEN 231
           +VP       W    +  +   D ++ T N G   +            N    P+M  E 
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCMEF 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
           + GWF  +G  +  R   +LA  V    +      N+YM+ GGTNFG   G         
Sbjct: 241 WDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298

Query: 284 PLVATSYDYDAPIDEYG 300
           P + TSYDYDA + E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314


>gi|423295092|ref|ZP_17273219.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
           CL03T12C18]
 gi|392673998|gb|EIY67449.1| hypothetical protein HMPREF1070_01884 [Bacteroides ovatus
           CL03T12C18]
          Length = 775

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 110/323 (34%), Positives = 160/323 (49%), Gaps = 34/323 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
              I+GK   L  G +HYPR   E W + ++++   GL  +  YVFWN+HE   G++ F 
Sbjct: 36  TFTIEGKDIQLICGEMHYPRIPHEYWRDRLKRASAMGLNTVSAYVFWNFHERQPGEFDFS 95

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F++T QE GL++ LR GPY CAEW++GG+P WL     + +R+ +  F    +
Sbjct: 96  GQADIAEFIRTAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDMTYRSKDPRFLSYCE 155

Query: 131 RFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           R+   I +L KQ + L  + GG II+ QVENEYG    +Y    E Y+    D       
Sbjct: 156 RY---IKELGKQLSPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGF 207

Query: 190 SVPWVMCQ---QEDA--PDPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFG 240
           +VP   C    Q +A   +  + T NG + +             P    E Y  WF  +G
Sbjct: 208 NVPLFTCDGGGQVEAGHVEGALPTLNGVFGEDIFKVVDKYQKGGPYFVAEFYPAWFDEWG 267

Query: 241 Y---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSY 290
               +V + RP E L + ++      G   + YM+ GGTNF  T G           TSY
Sbjct: 268 RRHSSVAYERPAEQLDWMLSH-----GVSVSMYMFHGGTNFEYTNGANTGGGYQPQPTSY 322

Query: 291 DYDAPIDEYGFIRQPKWGHLREL 313
           DYDAP+ E+G    PK+   RE+
Sbjct: 323 DYDAPLGEWGNCY-PKYHAFREV 344



 Score = 46.2 bits (108), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 110/274 (40%), Gaps = 52/274 (18%)

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIES 478
           ++ F+  E K       +F +   +E + + +D      Y      +   GK+  + I+ 
Sbjct: 364 TTTFATVELKESAPLRTAFHQTTQSENVLSMEDLGVDFGYIHYQTTLQKAGKQKLV-IQD 422

Query: 479 LGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVA 538
           L   A++ ++ K VA     ++  +  +N    +++   TL+IL    G  NYG      
Sbjct: 423 LRDYAVILIDGKQVASLDRRYNQNSVTLN----VSKTPATLEILVENTGRVNYGPDILFN 478

Query: 539 GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSL 598
             G+ S +L     G   L+        G     + L K  ++   F +    +P     
Sbjct: 479 RKGITSQVLW----GNEKLA--------GWSITPLPLYKEKVSEMEFGETIKGVPA---- 522

Query: 599 IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYD 658
            ++K TF   E KG   ++++  GKG  WVNG+S+GR+W+                    
Sbjct: 523 -FHKGTFTV-EKKGDCFVDMSQWGKGAVWVNGKSLGRFWNI------------------- 561

Query: 659 ASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                     P QTLY +P  W+  GEN +V+ E
Sbjct: 562 ---------GPQQTLY-LPAPWLKEGENEIVVFE 585


>gi|257143787|emb|CAZ44333.1| beta-D-galactosidase [Paenibacillus thiaminolyticus]
          Length = 583

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/349 (32%), Positives = 167/349 (47%), Gaps = 35/349 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             ++YD     +  +   L SG+IHY R  P  W + +RK K  G   IETYV WN HEP
Sbjct: 2   TTLSYDEGQFKMGDRPIQLISGAIHYFRIVPAYWEDRLRKIKAMGCNCIETYVAWNVHEP 61

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+++FE   D+  FV+   E GL++ +R  PY CAEW +GG P WL     ++ R  +
Sbjct: 62  REGEFHFERMADVAEFVRLAGELGLYVIVRPSPYICAEWEFGGLPAWL-LKDDMRLRCND 120

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             F E++  +   ++  +    L A++GGPII  Q+ENEYG    +YG   + Y++  A 
Sbjct: 121 PRFLEKVSAYYDALLPQLTP--LLATKGGPIIAVQIENEYG----SYG-NDQAYLQ--AQ 171

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIIN--TCNGFYC------------DGFTPNSPSKPIMW 228
            A+ +   V  ++   +   D ++      G               D      P  P+M 
Sbjct: 172 RAMLIERGVDVLLFTSDGPQDDMLQGGMAEGVLATVNFGSRPKEAFDKLKEYQPDGPLMC 231

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
            E ++GWF  +      R  +D A  +      G +  N+YM  GGTNFG  +G      
Sbjct: 232 MEYWNGWFDHWFEPHHTRDAKDAARVLDDMLGMGASV-NFYMVHGGTNFGFGSGANHSDK 290

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
             P V TSYDYDA I E G +  PK+   RE + K + L E  L ++ P
Sbjct: 291 YEPTV-TSYDYDAAISEAGDL-TPKYHAFREVIGKYVSLPEGELPANTP 337


>gi|313245457|emb|CBY40184.1| unnamed protein product [Oikopleura dioica]
          Length = 620

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/330 (32%), Positives = 153/330 (46%), Gaps = 52/330 (15%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +++YD +   +  +   L SGS+HY R   + W + + K K  GL  + TYV WN HEP 
Sbjct: 9   SLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGLNGVTTYVPWNLHEPE 68

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G++ F G  D+V F+   +   LF+ LR GPY C+EW +GG P WL     ++ RT  +
Sbjct: 69  PGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPAWLLRDSFMKVRTNYS 128

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +   +KRF  ++I L+K +   +  GGPI+  QVENEYG               +A   
Sbjct: 129 GYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYG--------------MYAGQD 172

Query: 184 AVNLNTSVPWVMCQQEDAPDPII---------NTCNGFYCDG-----FTPNS-------- 221
             +LNT     + + E   +P+          N  N  Y DG     F  N         
Sbjct: 173 GAHLNTLAE--LLKNEGIVEPLFTSDGSSVWDNEKNTIYEDGLKSVNFKSNPEKHLKSLR 230

Query: 222 ---PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG 278
              P +P+   E ++GWF  +G         D    +    +   +  N+YM+ GGTNFG
Sbjct: 231 GHFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDVILDHKASL-NFYMFHGGTNFG 289

Query: 279 RTAGGPLVA--------TSYDYDAPIDEYG 300
            T GG  +A        TSYDYD PI E G
Sbjct: 290 FTNGGLTIARGYYTADVTSYDYDCPISEAG 319


>gi|219870459|ref|YP_002474834.1| beta-galactosidase [Haemophilus parasuis SH0165]
 gi|219690663|gb|ACL31886.1| beta-galactosidase, glucosyl hydrolase family protein [Haemophilus
           parasuis SH0165]
          Length = 596

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 156/320 (48%), Gaps = 40/320 (12%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
            +  +++GK   + SG++HY R  PE W + +   K  G   +ETYV WN H+P   Q+ 
Sbjct: 7   EKDFLLNGKPFKILSGAVHYFRIVPEYWYKTLYNLKAMGCNTVETYVPWNLHQPQPDQFN 66

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F  R DLV+F++T ++ GL++ LR  PY CAEW +GG P WL  IP I+ R  +  F  E
Sbjct: 67  FSKRADLVKFLQTAKDLGLYVILRPTPYICAEWEFGGLPAWLLNIPNIRLRQNDPLFIAE 126

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
           + R+  +++  +    +  +QGG I++ Q+ENEYG    ++G   + Y++      +   
Sbjct: 127 IDRYFQELLPRIAPYQI--TQGGNILMMQIENEYG----SFG-NDKNYLRAIRALMLIHG 179

Query: 189 TSVP-------WVMCQQEDA--PDPIINTCN------------GFYCDGFTPNSPSKPIM 227
            +VP       W    +  A   D I+ T N              Y D    +  S P+M
Sbjct: 180 VNVPLFTSDGAWQNALEAGALIEDDILPTGNFGSRSNENLDELQRYIDK---HGKSYPLM 236

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RT 280
             E + GWF  +   V  R  +DLA       E      N+YM+ GGTNFG       R 
Sbjct: 237 CMEFWDGWFNRWKEPVIRRDAQDLANCTKELLERASI--NFYMFQGGTNFGFWNGCSARL 294

Query: 281 AGGPLVATSYDYDAPIDEYG 300
                  TSYDYDAP+ E+G
Sbjct: 295 DTDLPQVTSYDYDAPVHEWG 314


>gi|313241555|emb|CBY33800.1| unnamed protein product [Oikopleura dioica]
          Length = 571

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/331 (31%), Positives = 153/331 (46%), Gaps = 25/331 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T D     +DGK   + SG+IHY R   + W   ++   + GL  I+ Y+ WN HE  R
Sbjct: 8   LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G + F G  DLV F     E GL +  R GPY C+EW++GG P WL   P +  R+    
Sbjct: 68  GNFDFGGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           ++  +  + +K++ L+    L  S GGPII  QVENEYG+    Y      ++ W AD  
Sbjct: 128 YQAAVSSYFSKLLPLLAP--LQHSNGGPIIAFQVENEYGD----YVDKDNEHLPWLADLM 181

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS-----PSKPIMWTENYSGWFLSF 239
            +      + +          I   N       TP S     P+KP++ TE ++GWF  +
Sbjct: 182 KSHGLFELFFISDGGHT----IRKANMLKLTKSTPISLKSLQPNKPMLVTEFWAGWFDYW 237

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV--------ATSYD 291
           G+       +     +    + G +  N+YM+ GGTNFG   G   +         TSYD
Sbjct: 238 GHGRNLLNNDVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGYYTADVTSYD 296

Query: 292 YDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           YD P+DE G  R  KW  ++      K   E
Sbjct: 297 YDCPVDESG-NRTEKWEIIKRCLDVQKTSSE 326


>gi|395541292|ref|XP_003772579.1| PREDICTED: beta-galactosidase [Sarcophilus harrisii]
          Length = 673

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 114/353 (32%), Positives = 159/353 (45%), Gaps = 44/353 (12%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y+    + DGK     SGSIHY R     W + + K K  GL  IETYV WN+HEP  
Sbjct: 63  IDYEGDQFLKDGKPFRYISGSIHYSRIPRFYWKDRLFKMKMAGLNAIETYVPWNFHEPFP 122

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F G  DL  F++ V E GL + LR GPY CAEW+ GG PVWL     I  R+++  
Sbjct: 123 GQYQFSGEQDLEYFLQLVHEVGLLVILRPGPYICAEWDMGGLPVWLLEKKSIFLRSSDPD 182

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN-------------------- 164
           + + + ++L  ++  MK        GGPII  QVENEYG+                    
Sbjct: 183 YLKAVDKWLEVLLPKMKP--YLYQNGGPIITVQVENEYGSYFACDYNYLRFLLKVFRQHL 240

Query: 165 ----VEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPN 220
               V +     GE Y+K    T  +L  +V +             N    F        
Sbjct: 241 GEEVVLFTTDGAGENYLK--CGTLQDLYATVDFGTSS---------NITQAFMIQRKV-- 287

Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
            P  P++ +E Y+GW   +G +      +++  ++      G    N YM+ GGTNFG  
Sbjct: 288 EPKGPLVNSEFYTGWLDHWGESHQTVSTKNIVASLTDMLSRGANV-NLYMFIGGTNFGFW 346

Query: 281 AGGPL----VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
            G  +      TSYDYDAP+ E G + +  +     + K  KL E  +  S P
Sbjct: 347 NGANMPYLPQPTSYDYDAPLSEAGDLTEKYYAVREAIGKFEKLPEGPIPPSTP 399


>gi|326779952|ref|ZP_08239217.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
 gi|326660285|gb|EGE45131.1| glycoside hydrolase family 35 [Streptomyces griseus XylebKG-1]
          Length = 648

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 124/384 (32%), Positives = 177/384 (46%), Gaps = 47/384 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   L SG++HY R     W   +      GL  +ETYV WN HEP  G+    G  
Sbjct: 13  LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG-- 70

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
            L RF+  V+ AGL+  +R GPY CAEW  GG PVW+    G + RT +  ++  ++R+ 
Sbjct: 71  ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWF 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            +++  + +  +  S+GGP++L Q ENEYG+    YG    +Y++W A        +VP 
Sbjct: 131 RELLPQVVRRQV--SRGGPVVLVQAENEYGS----YG-SDAVYLEWLAGLLRQCGVTVPL 183

Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
                 D P+           ++ T N       GF       + P  P+M  E + GWF
Sbjct: 184 FTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFKV--LRRHQPGGPLMCMEFWCGWF 238

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GP-------L 285
             +G     R  E  A A+    E G +  N YM  GGTNFG  AG    GP        
Sbjct: 239 DHWGAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSGPHQDESFQP 297

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
             TSYDYDAP+DEYG   + K+   RE+ +A    E  L +  P    L   +   +   
Sbjct: 298 TVTSYDYDAPVDEYGRATE-KFRLFREVLEA--YAEGPLPALPPEPVGLAGPVRVELAEW 354

Query: 346 SS-NDCAAFLANYDSSSDANVTFN 368
           +S  D    L + ++ S    TF 
Sbjct: 355 ASLGDVLEVLGDPETESGVPATFE 378


>gi|83415088|ref|NP_001032730.1| beta-galactosidase precursor [Canis lupus familiaris]
 gi|94730362|sp|Q9TRY9.3|BGAL_CANFA RecName: Full=Beta-galactosidase; AltName: Full=Acid
           beta-galactosidase; Short=Lactase; Flags: Precursor
 gi|76470548|gb|ABA43388.1| lysosomal beta-galactosidase [Canis lupus familiaris]
          Length = 668

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 159/346 (45%), Gaps = 28/346 (8%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            + Y H   + DG+     SGSIHY R     W + + K K  GL  I+TYV WN+HEP 
Sbjct: 34  TIDYSHNRFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 93

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++ 
Sbjct: 94  PGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 153

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +   + ++L  ++  MK   L    GGPII  QVENEYG    +Y      Y+++    
Sbjct: 154 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL 207

Query: 184 -AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS-------------PSKPIMW 228
              +L   V        +          G Y    F P +             P  P++ 
Sbjct: 208 FHHHLGNDVLLFTTDGANEKFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGPLVN 267

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL--- 285
           +E Y+GW   +G        E +A ++      G    N YM+ GGTNF    G  +   
Sbjct: 268 SEFYTGWLDHWGQPHSTVRTEVVASSLHDILAHGANV-NLYMFIGGTNFAYWNGANMPYQ 326

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
              TSYDYDAP+ E G + + K+  LRE + K  K+ E ++  S P
Sbjct: 327 AQPTSYDYDAPLSEAGDLTE-KYFALREVIRKFEKVPEGFIPPSTP 371


>gi|431593417|ref|ZP_19521746.1| beta-galactosidase [Enterococcus faecium E1861]
 gi|430591294|gb|ELB29332.1| beta-galactosidase [Enterococcus faecium E1861]
          Length = 595

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 108/314 (34%), Positives = 148/314 (47%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++VRFVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVMCQ---QEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTENYS 233
           P         E     I+   + F    F  +S                  PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGILIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|29376349|ref|NP_815503.1| glycosyl hydrolase [Enterococcus faecalis V583]
 gi|256961697|ref|ZP_05565868.1| beta-galactosidase [Enterococcus faecalis Merz96]
 gi|257419527|ref|ZP_05596521.1| beta-galactosidase [Enterococcus faecalis T11]
 gi|29343812|gb|AAO81573.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
 gi|256952193|gb|EEU68825.1| beta-galactosidase [Enterococcus faecalis Merz96]
 gi|257161355|gb|EEU91315.1| beta-galactosidase [Enterococcus faecalis T11]
          Length = 594

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|329960218|ref|ZP_08298660.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
 gi|328532891|gb|EGF59668.1| beta-galactosidase domain protein [Bacteroides fluxus YIT 12057]
          Length = 1104

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 153/325 (47%), Gaps = 38/325 (11%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  I  YVFWN HEP  G + F 
Sbjct: 355 TFLLNGKPFVVKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHEPQPGVFDFT 414

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ DL  F +  ++  +++ LR GPY CAEW  GG P WL     I+ R ++  F E + 
Sbjct: 415 GQNDLAEFCRLCRQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFIERVG 474

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELY-------------- 176
            F   + + +   ++    GGPII+ QVENEYG+     G   ++               
Sbjct: 475 IFEKAVAEQVA--DMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVTLFQ 532

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
             WA++   N    + W M           N   G   D  F P     P  P+M +E +
Sbjct: 533 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 581

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
           SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A   
Sbjct: 582 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 640

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRE 312
           TSYDYDAPI E G    PK+  LR+
Sbjct: 641 TSYDYDAPISESGQT-TPKYWELRK 664


>gi|227518994|ref|ZP_03949043.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
 gi|227553614|ref|ZP_03983663.1| possible beta-galactosidase [Enterococcus faecalis HH22]
 gi|293383402|ref|ZP_06629315.1| beta-galactosidase [Enterococcus faecalis R712]
 gi|293388945|ref|ZP_06633430.1| beta-galactosidase [Enterococcus faecalis S613]
 gi|312907770|ref|ZP_07766761.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
 gi|312910388|ref|ZP_07769235.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
 gi|422714384|ref|ZP_16771110.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
 gi|422715641|ref|ZP_16772357.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
 gi|424676529|ref|ZP_18113400.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
 gi|424681657|ref|ZP_18118444.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
 gi|424683847|ref|ZP_18120597.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
 gi|424686250|ref|ZP_18122918.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
 gi|424690479|ref|ZP_18127014.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
 gi|424695572|ref|ZP_18131955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
 gi|424696689|ref|ZP_18133030.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
 gi|424699924|ref|ZP_18136135.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
 gi|424703062|ref|ZP_18139196.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
 gi|424707441|ref|ZP_18143425.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
 gi|424716899|ref|ZP_18146197.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
 gi|424720477|ref|ZP_18149578.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
 gi|424724025|ref|ZP_18152974.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
 gi|424733616|ref|ZP_18162171.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
 gi|424744084|ref|ZP_18172389.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
 gi|424750408|ref|ZP_18178472.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
 gi|227073566|gb|EEI11529.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
 gi|227177262|gb|EEI58234.1| possible beta-galactosidase [Enterococcus faecalis HH22]
 gi|291079193|gb|EFE16557.1| beta-galactosidase [Enterococcus faecalis R712]
 gi|291081726|gb|EFE18689.1| beta-galactosidase [Enterococcus faecalis S613]
 gi|310626798|gb|EFQ10081.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
 gi|311289661|gb|EFQ68217.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
 gi|315575986|gb|EFU88177.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
 gi|315580706|gb|EFU92897.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
 gi|402350756|gb|EJU85654.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
 gi|402356541|gb|EJU91272.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
 gi|402364212|gb|EJU98655.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
 gi|402364322|gb|EJU98764.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
 gi|402367784|gb|EJV02121.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
 gi|402368267|gb|EJV02587.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
 gi|402375423|gb|EJV09410.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
 gi|402377018|gb|EJV10929.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
 gi|402385039|gb|EJV18580.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
 gi|402385067|gb|EJV18607.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
 gi|402386247|gb|EJV19753.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
 gi|402391229|gb|EJV24540.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
 gi|402392948|gb|EJV26178.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
 gi|402396006|gb|EJV29081.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
 gi|402399507|gb|EJV32379.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
 gi|402406707|gb|EJV39253.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
          Length = 604

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQSFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|347967093|ref|XP_320991.5| AGAP002058-PA [Anopheles gambiae str. PEST]
 gi|333469761|gb|EAA01064.5| AGAP002058-PA [Anopheles gambiae str. PEST]
          Length = 630

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 159/329 (48%), Gaps = 24/329 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           ++ + +     DG+     SGS HY R+ PE W  ++R  +  GL  + TY+ W+ HEP+
Sbjct: 33  DIDFQNDTFTKDGQPFQFISGSFHYFRALPESWRHILRSMRAAGLNTVMTYIEWSLHEPM 92

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
            GQY +EG  +L  F++  Q   LF+ LR GPY CAE + GGFP WL    P I+ RT +
Sbjct: 93  PGQYQWEGIANLEEFIEIAQSENLFVILRPGPYICAERDMGGFPHWLLTKYPSIKLRTYD 152

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE----LYVK 178
             +  E++ +  +++  + +       GGP+I+  +ENEYG+ +   G   +    L V 
Sbjct: 153 TDYLREVQNWYNQLMPRLVR--YLYGNGGPVIMVSIENEYGSFKACDGQYMQFLKNLTVH 210

Query: 179 WAADTAVNLNTSVPWVM-CQQEDAPDP-----IINTCNGFYCDGFTPNSPSKPIMWTENY 232
           +  D AV      P ++ C       P     I N  N F+        P  P++  E Y
Sbjct: 211 FVQDKAVLFTNDGPELLKCGSIPGILPTLDFGITNNPNAFWQQ-LRKYLPKGPLVNAEYY 269

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA----- 287
            GW L+       R    +     +         N+YM+FGGTNFG TAG   V      
Sbjct: 270 PGW-LTHWMEPTARVDAGMVVNTLKLMLNQKANVNFYMFFGGTNFGFTAGANDVGPGKYS 328

Query: 288 ---TSYDYDAPIDEYGFIRQPKWGHLREL 313
              TSYDYDAP+DE G    PK+  +R++
Sbjct: 329 ADITSYDYDAPLDEAG-DPTPKYFAIRKV 356


>gi|193695178|ref|XP_001948549.1| PREDICTED: beta-galactosidase-like [Acyrthosiphon pisum]
          Length = 640

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 111/340 (32%), Positives = 167/340 (49%), Gaps = 36/340 (10%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y+    + DG+     SG +HY R     W + I+K K  GL  I TYV W+ HEP  
Sbjct: 31  VDYEKNEFLKDGEVFRYVSGDLHYFRVPKSYWKDRIQKIKAAGLNAITTYVEWSLHEPFP 90

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
           G Y FEG  DL  F+K +Q+ G++L LR GPY CAE ++GGFP W L+  P    RT ++
Sbjct: 91  GTYNFEGMADLEYFIKLIQDEGMYLLLRPGPYICAERDFGGFPYWLLNVTPKGSLRTNDS 150

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK----- 178
            +K+ + ++ + ++  M Q +L+ + GG II+ QVENEYG+  +A     +L+++     
Sbjct: 151 SYKKYVSQWFSVLMKKM-QPHLYGN-GGNIIMVQVENEYGSY-YACDSDYKLWLRDLLKG 207

Query: 179 WAADTAVNLNTSVPWVMCQQED---APDPIIN-------TCNGFYCDGFTPN-SPSKPIM 227
           +  D A+     +    C+Q D    P P +        + N   C  F  N     P +
Sbjct: 208 YVEDKALLYTIDI----CRQRDFDCGPIPEVYATVDFGISVNAATCFDFLKNYQKGGPSV 263

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
            +E Y GW   +    P    +D+   +        +F ++YM+ GGTNFG T+G     
Sbjct: 264 NSEFYPGWLAHWQEPHPKVNSDDVVNHMKSMLSLNASF-SFYMFHGGTNFGFTSGANTNE 322

Query: 286 ---------VATSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
                      TSYDYDAPI E G + +  +   + L  A
Sbjct: 323 SDANIGYLPQLTSYDYDAPITEAGDLTEKYFKIKQTLENA 362


>gi|423301385|ref|ZP_17279409.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
           CL09T03C10]
 gi|408471986|gb|EKJ90515.1| hypothetical protein HMPREF1057_02550 [Bacteroides finegoldii
           CL09T03C10]
          Length = 779

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 156/323 (48%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 36  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 95

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 96  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 155

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       T
Sbjct: 156 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-NKPYVSAVRDLVRESGFT 208

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 209 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 268

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 269 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 327

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+  + K+  LR+L K
Sbjct: 328 DAPISEAGWTTE-KYFLLRDLLK 349



 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 37/117 (31%)

Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
           KQ  T+P      +YK TF   +  G   L++++ GKG  WVNG ++GR+W         
Sbjct: 525 KQLPTMPA-----YYKGTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI------- 571

Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
                                 P QTL+ +P  W+  GEN +++ +  G  P+K S+
Sbjct: 572 ---------------------GPQQTLF-MPGCWLKKGENEILVLDLKG--PAKASI 604


>gi|313231869|emb|CBY08981.1| unnamed protein product [Oikopleura dioica]
          Length = 664

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 106/330 (32%), Positives = 153/330 (46%), Gaps = 52/330 (15%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +++YD +   +  +   L SGS+HY R   + W + + K K  GL  + TYV WN HEP 
Sbjct: 53  SLSYDSKNFYLGEEPTQLLSGSVHYFRIPKKYWYDRLAKLKSAGLNGVTTYVPWNLHEPE 112

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G++ F G  D+V F+   +   LF+ LR GPY C+EW +GG P WL     ++ RT  +
Sbjct: 113 PGEFSFSGELDIVHFINIARTLDLFVILRPGPYICSEWEWGGLPPWLLRDSFMKVRTNYS 172

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +   +KRF  ++I L+K +   +  GGPI+  QVENEYG               +A   
Sbjct: 173 GYITAVKRFFGQLIPLIKYQQ--SKYGGPIVAVQVENEYG--------------MYAGQD 216

Query: 184 AVNLNTSVPWVMCQQEDAPDPII---------NTCNGFYCDG-----FTPNS-------- 221
             +LNT     + + E   +P+          N  N  Y DG     F  N         
Sbjct: 217 GAHLNTLAE--LLKNEGIVEPLFTSDGSSVWDNEKNTIYEDGLKSVNFKSNPEKHLKSLR 274

Query: 222 ---PSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG 278
              P +P+   E ++GWF  +G         D    +    +   +  N+YM+ GGTNFG
Sbjct: 275 GHFPEQPLWVMEFWAGWFDWWGEGRNLFDNSDFQKNLDVILDHKASL-NFYMFHGGTNFG 333

Query: 279 RTAGGPLVA--------TSYDYDAPIDEYG 300
            T GG  +A        TSYDYD PI E G
Sbjct: 334 FTNGGLTIARGYYTADVTSYDYDCPISEAG 363


>gi|257899628|ref|ZP_05679281.1| glycosyl hydrolase [Enterococcus faecium Com15]
 gi|257837540|gb|EEV62614.1| glycosyl hydrolase [Enterococcus faecium Com15]
          Length = 595

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V+FVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKDIVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|419799561|ref|ZP_14324899.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
 gi|385697826|gb|EIG28233.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis F0449]
          Length = 595

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 107/307 (34%), Positives = 150/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           + G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +GQ+ F GR 
Sbjct: 12  LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q  GL++ +R  P+ CAEW +GG P WL     ++ R+++  F E + R+ 
Sbjct: 72  DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDMRIRSSDPVFIEAVDRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
             ++ L+ +  +   QGGPI++ QVENEYG+   + AY       +K    T     +  
Sbjct: 131 DHLLGLLTRYQV--DQGGPILMMQVENEYGSYGEDKAYLRAIRDLMKEKGVTCPLFTSDG 188

Query: 192 PWVMCQQED--APDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N         G   + F       P+M  E + GWF  + 
Sbjct: 189 PWRATLRAGNLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             V  R  E+LA AV    E G    N YM+ GGTNFG   G    G L     TSYDY 
Sbjct: 249 EPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYG 306

Query: 294 APIDEYG 300
           A ++E G
Sbjct: 307 ALLNEQG 313


>gi|384513478|ref|YP_005708571.1| beta-galactosidase [Enterococcus faecalis OG1RF]
 gi|430361754|ref|ZP_19426831.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
 gi|327535367|gb|AEA94201.1| beta-galactosidase [Enterococcus faecalis OG1RF]
 gi|429512307|gb|ELA01915.1| putative beta-galactosidase [Enterococcus faecalis OG1X]
          Length = 604

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L    GG I++ Q+ENEYG+   E AY       +     TA+   +
Sbjct: 139 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|300861196|ref|ZP_07107283.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
 gi|428767294|ref|YP_007153405.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
 gi|300850235|gb|EFK77985.1| putative beta-galactosidase [Enterococcus faecalis TUSoD Ef11]
 gi|427185467|emb|CCO72691.1| beta-galactosidase [Enterococcus faecalis str. Symbioflor 1]
          Length = 594

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 304

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328


>gi|410100792|ref|ZP_11295748.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409214073|gb|EKN07084.1| hypothetical protein HMPREF1076_04926 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 779

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 103/322 (31%), Positives = 154/322 (47%), Gaps = 27/322 (8%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++GK  ++++  IHY R   E W   I+  K  G+  I  Y FWN HE   G++ F
Sbjct: 37  KTFLLNGKPFIIKAAEIHYTRIPVEYWEHRIQMCKALGMNTICIYAFWNIHEQKPGEFDF 96

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+  F +  Q+ G+++ LR GPY C+EW  GG P WL     IQ RT +  F E  
Sbjct: 97  SGQNDIAAFCRLAQKNGMYIMLRPGPYVCSEWEMGGLPWWLLKKEDIQLRTNDPYFIERT 156

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN- 188
           + ++ +I   +    +  ++GG II+ QVENEYG+         + Y+    D   +   
Sbjct: 157 RIYMNEIGKQLADRQI--TRGGNIIMVQVENEYGSY-----ATDKSYIAKNRDILRDAGF 209

Query: 189 TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWF 236
           T VP   C        +A D ++ T N   G   D          P+ P+M +E +SGWF
Sbjct: 210 TDVPLFQCDWSSNFLNNALDDLVWTVNFGTGANIDEQFKKLKEVRPNTPLMCSEFWSGWF 269

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYD 291
             +G     R  E +   +    +   +F + YM  GGT FG   G        + +SYD
Sbjct: 270 DHWGRKHETRDAETMIAGLRDMLDRNISF-SLYMTHGGTTFGHWGGANSPAYSAMCSSYD 328

Query: 292 YDAPIDEYGFIRQPKWGHLREL 313
           YDAPI E G+   PK+  LRE 
Sbjct: 329 YDAPISEAGWA-TPKYHKLREF 349



 Score = 42.4 bits (98), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 59/245 (24%), Positives = 99/245 (40%), Gaps = 59/245 (24%)

Query: 464 VMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILS 523
            +P       L I+ +   A VF++ KL+  G  +     F I  K+        LDIL 
Sbjct: 414 TLPAVKAGTTLLIDEVHDWAQVFIDGKLI--GRLDRRRGEFTI--KLPATAAGARLDILI 469

Query: 524 MMVGLQNYGAWFDVA---GAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISL 580
             +G  N    FD A     G+ + +++  ++   +L   + +Y + V+           
Sbjct: 470 EAMGRVN----FDKAIHDRKGITNKVVLITESSSDELKDWQ-VYNLPVD----------- 513

Query: 581 ANSSFWKQGSTLPVNK--SLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
              SF K     P  K  +  +Y+ TF   E  G + L++ + GKG  WVNG+++GR+W 
Sbjct: 514 --YSFVKDKKYTPGKKIEAPAYYRATF-NLETPGDVFLDMQTWGKGMVWVNGKAMGRFWE 570

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
                                         P QTL+ +P  W+  GEN +++ +  G  P
Sbjct: 571 I----------------------------GPQQTLF-MPGCWLKKGENEIIVLDLKG--P 599

Query: 699 SKISL 703
            K S+
Sbjct: 600 EKASV 604


>gi|256959208|ref|ZP_05563379.1| beta-galactosidase [Enterococcus faecalis DS5]
 gi|256949704|gb|EEU66336.1| beta-galactosidase [Enterococcus faecalis DS5]
          Length = 594

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|336428330|ref|ZP_08608312.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336005980|gb|EGN36021.1| hypothetical protein HMPREF0994_04318 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 583

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 107/309 (34%), Positives = 147/309 (47%), Gaps = 34/309 (11%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG++HY R  PE W + + K K  G   +ETYV WN HEP +G++ FEG  
Sbjct: 14  LDGKPFKIISGAVHYFRIVPEYWRDRLEKLKAMGANTVETYVPWNMHEPQKGKFVFEGML 73

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D+ RF+   QE GL++ +R  PY CAEW +GG P WL    G++ R    PF E ++ + 
Sbjct: 74  DISRFILLAQELGLYVIVRPSPYICAEWEFGGLPAWLLKEDGMRLRGCYEPFLEAVREYY 133

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
           + +  ++    L    GGP+IL QVENEYG     Y      Y++      ++    VP 
Sbjct: 134 SVLFPILVP--LQIHHGGPVILMQVENEYG-----YYGDDTRYMETMKQLMLDNGAEVPL 186

Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTENYSGWFLS 238
           V     D P     +C        T N  SK               P+M TE + GWF  
Sbjct: 187 VTS---DGPMDESLSCGRLPGVLPTGNFGSKTEERFEVLKKYTEGGPLMCTEFWVGWFDH 243

Query: 239 FGYAVPFR-PVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYD 291
           +G     R  +E+    + +  E G    N YM+ GGTNFG   G           TSYD
Sbjct: 244 WGNGGHMRGNLEESTKDLDKMLEMGHV--NIYMFEGGTNFGFMNGSNYYDELTPDVTSYD 301

Query: 292 YDAPIDEYG 300
           YDA + E G
Sbjct: 302 YDAVLTEAG 310


>gi|430368510|ref|ZP_19428251.1| beta-galactosidase [Enterococcus faecalis M7]
 gi|429516266|gb|ELA05760.1| beta-galactosidase [Enterococcus faecalis M7]
          Length = 594

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L    GG I++ Q+ENEYG+   E AY       +     TA+   +
Sbjct: 129 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTALFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 304

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328


>gi|255975619|ref|ZP_05426205.1| beta-galactosidase [Enterococcus faecalis T2]
 gi|256619294|ref|ZP_05476140.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
 gi|256853354|ref|ZP_05558724.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
 gi|421514060|ref|ZP_15960775.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
 gi|255968491|gb|EET99113.1| beta-galactosidase [Enterococcus faecalis T2]
 gi|256598821|gb|EEU17997.1| beta-galactosidase [Enterococcus faecalis ATCC 4200]
 gi|256711813|gb|EEU26851.1| glycosyl hydrolase, family 35 [Enterococcus faecalis T8]
 gi|401672857|gb|EJS79300.1| Beta-galactosidase 3 [Enterococcus faecalis ATCC 29212]
          Length = 594

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|255692586|ref|ZP_05416261.1| beta-galactosidase [Bacteroides finegoldii DSM 17565]
 gi|260621643|gb|EEX44514.1| glycosyl hydrolase family 35 [Bacteroides finegoldii DSM 17565]
          Length = 779

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 156/323 (48%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 36  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 95

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 96  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKRDIALRTLDPYYMERVG 155

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       T
Sbjct: 156 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-NKPYVSAVRDLVRESGFT 208

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 209 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 268

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 269 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 327

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+  + K+  LR+L K
Sbjct: 328 DAPISEAGWTTE-KYFLLRDLLK 349



 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 52/117 (44%), Gaps = 37/117 (31%)

Query: 587 KQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTG 646
           KQ  T+P      +YK TF   +  G   L++++ GKG  WVNG ++GR+W         
Sbjct: 525 KQLPTMPA-----YYKGTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI------- 571

Query: 647 CTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
                                 P QTL+ +P  W+  GEN +++ +  G  P+K S+
Sbjct: 572 ---------------------GPQQTLF-MPGCWLKKGENEILVLDLKG--PAKASI 604


>gi|404372285|ref|ZP_10977584.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
 gi|226911573|gb|EEH96774.1| hypothetical protein CSBG_00400 [Clostridium sp. 7_2_43FAA]
          Length = 593

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 105/314 (33%), Positives = 155/314 (49%), Gaps = 38/314 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ID  +  + SG++HY R  P  W + +   K  G   +ETY+ WN HEP  G++ FEG  
Sbjct: 12  IDDNKFKILSGAVHYFRIHPSQWGDTLFNLKALGFNTVETYIPWNIHEPYEGKFDFEGIK 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D+ +F+K  ++ GL++ LR  PY CAEW +GG P WL     I+ R++++ F E+++ + 
Sbjct: 72  DIEKFIKISEKLGLYVILRPTPYICAEWEFGGLPAWLLKDKEIKLRSSDDNFIEKLRNYY 131

Query: 134 AKII-DLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
             ++  L+K +    ++GGP+++ QVENEYG    +YG   E Y++  A         VP
Sbjct: 132 NDLLPRLVKYQ---VTKGGPVLMMQVENEYG----SYGNEKE-YLRIVASIMKENGVDVP 183

Query: 193 -------WVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSG 234
                  W+   +  +   D I  + N             D    N    PIM  E + G
Sbjct: 184 LFTSDGTWIEALECGSLIEDDIFVSGNFGSKSKENCDMLKDFILKNGKEWPIMCMEYWDG 243

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLV 286
           WF  +G  +  R   DLA  V    + G    N YM+ GGTNFG   G         P V
Sbjct: 244 WFNRWGEDIIRRDSIDLAEDVKEMLKIGSI--NLYMFRGGTNFGFMNGCSARGNNDLPQV 301

Query: 287 ATSYDYDAPIDEYG 300
            TSYDYDA + E+G
Sbjct: 302 -TSYDYDAILTEWG 314


>gi|422708708|ref|ZP_16766236.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
 gi|315036693|gb|EFT48625.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
          Length = 604

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|298481696|ref|ZP_06999887.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
 gi|298272237|gb|EFI13807.1| beta-galactosidase (Lactase) [Bacteroides sp. D22]
          Length = 778

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F K  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+  + K+  LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348



 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 32/104 (30%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTF-KLDKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
                    P QTL+ +P  W+  GEN +++ +  G  P+K S+
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASI 603


>gi|255972505|ref|ZP_05423091.1| beta-galactosidase [Enterococcus faecalis T1]
 gi|257422333|ref|ZP_05599323.1| glycosyl hydrolase [Enterococcus faecalis X98]
 gi|255963523|gb|EET95999.1| beta-galactosidase [Enterococcus faecalis T1]
 gi|257164157|gb|EEU94117.1| glycosyl hydrolase [Enterococcus faecalis X98]
          Length = 594

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|62955063|ref|NP_001017547.1| beta-galactosidase precursor [Danio rerio]
 gi|62089564|gb|AAH92166.1| Galactosidase, beta 1 [Danio rerio]
 gi|182890870|gb|AAI65636.1| Glb1 protein [Danio rerio]
          Length = 651

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 109/337 (32%), Positives = 161/337 (47%), Gaps = 25/337 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           +V Y     + DG+     SGSIHY R     W + + K    GL  I+TYV WN+HE +
Sbjct: 27  SVDYHRNCFLKDGEPFRYISGSIHYSRIPRVYWKDRLLKMYMAGLNAIQTYVPWNFHEAV 86

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F G  DL +F++  Q+ GL + +R GPY CAEW+ GG P WL     I  R+++ 
Sbjct: 87  PGQYDFSGDRDLEQFLQLCQDIGLLVIMRPGPYICAEWDMGGLPAWLLKKKDIVLRSSDP 146

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG-GELYVKW 179
            +   + +++ K++ ++K+       GGPII  QVENEYG+    ++ Y     +L+  +
Sbjct: 147 DYLAAVDKWMGKLLPIIKR--YLYQNGGPIITVQVENEYGSYFACDFNYMRHLSQLFRFY 204

Query: 180 AADTAVNLNTS---VPWVMCQQEDAP------DPIINTCNGFYCDGFTPNSPSKPIMWTE 230
             + AV   T    + ++ C             P  N    F         P  P++ +E
Sbjct: 205 LGEEAVLFTTDGAGLGYLKCGSLQGLYATVDFGPGANVTAAFEAQRHV--EPRGPLVNSE 262

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-----RTAGGPL 285
            Y GW   +G      P   +   +    E G    N YM+ GGTNFG      T  GP 
Sbjct: 263 FYPGWLDHWGEKHSVVPTSAVVKTLNEILEIGANV-NLYMFIGGTNFGYWNGANTPYGPQ 321

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
             TSYDYD+P+ E G + + K+  +RE+ K  K   E
Sbjct: 322 -PTSYDYDSPLTEAGDLTE-KYFAIREVIKMYKDVPE 356


>gi|251798103|ref|YP_003012834.1| beta-galactosidase [Paenibacillus sp. JDR-2]
 gi|247545729|gb|ACT02748.1| Beta-galactosidase [Paenibacillus sp. JDR-2]
          Length = 919

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 166/337 (49%), Gaps = 23/337 (6%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y+  +  I+G++  L S +IHY R   E W E++ K+K  G+  ++TY  WN HEP  
Sbjct: 18  VQYNAFSYNINGEQVFLNSAAIHYFRMPKEEWREVLVKAKLAGMNCVDTYFAWNVHEPEE 77

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G++ FEG  D   F+    E GL++  R GP+ CAEW++GGFP WL+    ++FR  +  
Sbjct: 78  GEWNFEGDNDCGAFLDLCHELGLWVIARPGPFICAEWDFGGFPYWLNTKKDMKFRAFDMQ 137

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTA 184
           +   + R++ +II +++   + A  GG +IL QVENEYG +  A       Y+    D  
Sbjct: 138 YLTYVDRYMDRIIPIIRDREINA--GGSVILVQVENEYGYL--ASDEVARDYMLHLRDVM 193

Query: 185 VNLNTSVPWVMCQQEDAPDPIINTCN-----GFYCDGFTPNSPSKPIMWTENYSGWFLSF 239
           ++    VP + C      +  +   N       + +      P  P + TE ++GWF  +
Sbjct: 194 LDRGVMVPLITCV--GGAEGTVEGANFWSGADHHYNNLVQKQPDTPKIVTEFWTGWFEHW 251

Query: 240 GYAVPFRPVEDLAFAVARFFET---GGTFQNYYM----YFGGTNFGRTAGGP--LVATSY 290
           G   P    +  A    R  E+   G T  ++YM       G   GRT G     + TSY
Sbjct: 252 G--APAATQKTAALYEKRMLESLRAGFTGVSHYMFFGGTNFGGYGGRTVGASDIFMVTSY 309

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISS 327
           DYDAP+ EYG +   K+   + +   ++  E  L+++
Sbjct: 310 DYDAPLSEYGRVTD-KYNTAKRMSYFVQATESVLLNA 345


>gi|229549776|ref|ZP_04438501.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
 gi|312950913|ref|ZP_07769823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
 gi|422692785|ref|ZP_16750800.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
 gi|422706430|ref|ZP_16764128.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
 gi|422727290|ref|ZP_16783733.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
 gi|229305045|gb|EEN71041.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
 gi|310631062|gb|EFQ14345.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
 gi|315152244|gb|EFT96260.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
 gi|315156045|gb|EFU00062.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
 gi|315157806|gb|EFU01823.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0312]
          Length = 604

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|425056292|ref|ZP_18459750.1| putative beta-galactosidase [Enterococcus faecium 505]
 gi|403032128|gb|EJY43702.1| putative beta-galactosidase [Enterococcus faecium 505]
          Length = 595

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+V+FVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKDVVQFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|57619080|ref|NP_001009860.1| beta-galactosidase precursor [Felis catus]
 gi|5915775|sp|O19015.1|BGAL_FELCA RecName: Full=Beta-galactosidase; AltName: Full=Acid
           beta-galactosidase; Short=Lactase; Flags: Precursor
 gi|2547317|gb|AAB81350.1| lysosomal beta-galactosidase [Felis catus]
          Length = 669

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 163/345 (47%), Gaps = 28/345 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y H   + DG+     SGSIHY R     W + + K K  GL  I+TYV WN+HEP  
Sbjct: 35  IDYGHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 94

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++  
Sbjct: 95  GQYQFSGEHDVEYFLKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY----------GV 171
           +   + ++L  ++  MK   L    GGPII  QVENEYG+    ++ Y           +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQRRFRDHL 212

Query: 172 GGE--LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
           GG+  L+    A        ++  +    +  PD   N    F       + P  P++ +
Sbjct: 213 GGDVLLFTTDGAHEKFLQCGALQGIYATVDFGPD--ANITAAFQIQ--RKSEPRGPLVNS 268

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL---- 285
           E Y+GW   +G        E +A ++      G    N YM+ GGTNF    G  +    
Sbjct: 269 EFYTGWLDHWGQPHSRVRTEVVASSLHDVLAHGANV-NLYMFIGGTNFAYWNGANIPYQP 327

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
             TSYDYDAP+ E G +   K+  LR+ + K  K+ E ++  S P
Sbjct: 328 QPTSYDYDAPLSEAGDLTD-KYFALRDVIRKFEKVPEGFIPPSTP 371


>gi|227552575|ref|ZP_03982624.1| possible beta-galactosidase [Enterococcus faecium TX1330]
 gi|257896912|ref|ZP_05676565.1| glycosyl hydrolase [Enterococcus faecium Com12]
 gi|293379016|ref|ZP_06625170.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
 gi|431750982|ref|ZP_19539676.1| beta-galactosidase [Enterococcus faecium E2620]
 gi|227178324|gb|EEI59296.1| possible beta-galactosidase [Enterococcus faecium TX1330]
 gi|257833477|gb|EEV59898.1| glycosyl hydrolase [Enterococcus faecium Com12]
 gi|292642358|gb|EFF60514.1| glycosyl hydrolase family 35 [Enterococcus faecium PC4.1]
 gi|430616240|gb|ELB53164.1| beta-galactosidase [Enterococcus faecium E2620]
          Length = 595

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++VRFVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|307275736|ref|ZP_07556876.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
 gi|307277830|ref|ZP_07558914.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
 gi|307291757|ref|ZP_07571629.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
 gi|422685752|ref|ZP_16743965.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
 gi|422720681|ref|ZP_16777290.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
 gi|422739238|ref|ZP_16794421.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
 gi|306497209|gb|EFM66754.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0411]
 gi|306505227|gb|EFM74413.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
 gi|306507612|gb|EFM76742.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2134]
 gi|315029464|gb|EFT41396.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4000]
 gi|315032072|gb|EFT44004.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
 gi|315144900|gb|EFT88916.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2141]
          Length = 604

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|431741495|ref|ZP_19530400.1| beta-galactosidase [Enterococcus faecium E2039]
 gi|430601673|gb|ELB39267.1| beta-galactosidase [Enterococcus faecium E2039]
          Length = 595

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++VRFVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|2623150|gb|AAB86405.1| mutant lysosomal beta-galactosidase [Felis catus]
          Length = 669

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 163/345 (47%), Gaps = 28/345 (8%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y H   + DG+     SGSIHY R     W + + K K  GL  I+TYV WN+HEP  
Sbjct: 35  IDYGHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQP 94

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++  
Sbjct: 95  GQYQFSGEHDVEYFLKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDPD 154

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY----------GV 171
           +   + ++L  ++  MK   L    GGPII  QVENEYG+    ++ Y           +
Sbjct: 155 YLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQRRFRDHL 212

Query: 172 GGE--LYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
           GG+  L+    A        ++  +    +  PD   N    F       + P  P++ +
Sbjct: 213 GGDVLLFTTDGAHEKFLQCGALQGIYATVDFGPD--ANITAAFQIQ--RKSEPRGPLVNS 268

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL---- 285
           E Y+GW   +G        E +A ++      G    N YM+ GGTNF    G  +    
Sbjct: 269 EFYTGWLDHWGQPHSRVRTEVVASSLHDVLAHGANV-NLYMFIGGTNFAYWNGANIPYQP 327

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
             TSYDYDAP+ E G +   K+  LR+ + K  K+ E ++  S P
Sbjct: 328 QPTSYDYDAPLSEAGDLTD-KYFALRDVIRKFEKVPEGFIPPSTP 371


>gi|257888197|ref|ZP_05667850.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
 gi|431040248|ref|ZP_19492755.1| beta-galactosidase [Enterococcus faecium E1590]
 gi|431763679|ref|ZP_19552228.1| beta-galactosidase [Enterococcus faecium E3548]
 gi|257824251|gb|EEV51183.1| glycosyl hydrolase [Enterococcus faecium 1,141,733]
 gi|430562100|gb|ELB01353.1| beta-galactosidase [Enterococcus faecium E1590]
 gi|430622052|gb|ELB58793.1| beta-galactosidase [Enterococcus faecium E3548]
          Length = 595

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++VRFVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|424764212|ref|ZP_18191655.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
 gi|402420907|gb|EJV53177.1| putative beta-galactosidase [Enterococcus faecium TX1337RF]
          Length = 595

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++VRFVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|293570811|ref|ZP_06681858.1| beta-galactosidase [Enterococcus faecium E980]
 gi|430840422|ref|ZP_19458347.1| beta-galactosidase [Enterococcus faecium E1007]
 gi|431064256|ref|ZP_19493603.1| beta-galactosidase [Enterococcus faecium E1604]
 gi|431124630|ref|ZP_19498626.1| beta-galactosidase [Enterococcus faecium E1613]
 gi|431738579|ref|ZP_19527522.1| beta-galactosidase [Enterococcus faecium E1972]
 gi|291609079|gb|EFF38354.1| beta-galactosidase [Enterococcus faecium E980]
 gi|430495187|gb|ELA71394.1| beta-galactosidase [Enterococcus faecium E1007]
 gi|430566915|gb|ELB06003.1| beta-galactosidase [Enterococcus faecium E1613]
 gi|430568897|gb|ELB07927.1| beta-galactosidase [Enterococcus faecium E1604]
 gi|430597307|gb|ELB35110.1| beta-galactosidase [Enterococcus faecium E1972]
          Length = 595

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++VRFVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPDIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  V
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDV 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|257082326|ref|ZP_05576687.1| beta-galactosidase [Enterococcus faecalis E1Sol]
 gi|256990356|gb|EEU77658.1| beta-galactosidase [Enterococcus faecalis E1Sol]
          Length = 594

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|327283884|ref|XP_003226670.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Anolis
           carolinensis]
          Length = 584

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 115/330 (34%), Positives = 156/330 (47%), Gaps = 33/330 (10%)

Query: 8   DHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY 67
           D + L+ +   R+L  GS+HY R   E W + + K K  GL  + TYV WN HE IRG++
Sbjct: 17  DTQFLLEERPFRIL-GGSLHYFRIPREYWKDRLMKMKACGLNTVTTYVPWNLHEAIRGKF 75

Query: 68  YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
            F G  DL  F+K  +E GL++ LR GPY C+EW+ GG P WL   P +Q RTT   F E
Sbjct: 76  DFSGNLDLQVFIKMAEEVGLWVILRPGPYICSEWDLGGLPSWLLQDPEMQLRTTYRGFTE 135

Query: 128 EMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
            +  +  ++I   +   L    GGPII  QVENEYG+  +A       Y+K A      L
Sbjct: 136 AVDNYFDRLIP--QVVPLQYKYGGPIIAVQVENEYGS--YAQDPSYMTYIKMA------L 185

Query: 188 NTSVPWVMCQQEDAPDPIIN--------TCNGFYCDGF------TPNSPSKPIMWTENYS 233
            +     M    D  D +++        T N    D        T      P M  E ++
Sbjct: 186 TSRKIVEMLMTSDNHDGLVSGTVDGALATINFQKLDTAIMVFLSTDQRNKMPKMVMEYWT 245

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVA 287
           GWF S+G        +D+   V +  + G +  N YM+ GGTNFG   G           
Sbjct: 246 GWFDSWGGLHHVFDADDMVQTVGKVIKLGASI-NLYMFHGGTNFGFLNGAQHSNEYKSTI 304

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           TSYDYDA + E G     K+  LR+L   I
Sbjct: 305 TSYDYDAVLTESGDYTS-KFFKLRQLFTDI 333


>gi|322437493|ref|YP_004219583.1| glycoside hydrolase family protein [Granulicella tundricola
           MP5ACTX9]
 gi|321165386|gb|ADW71089.1| glycoside hydrolase family 35 [Granulicella tundricola MP5ACTX9]
          Length = 607

 Score =  153 bits (386), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 102/333 (30%), Positives = 158/333 (47%), Gaps = 27/333 (8%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +  +T D +  ++DG+   L SG +HYPR     W + +RK++  GL  +  Y FWN+HE
Sbjct: 23  THRLTTDPQHFLLDGQPFQLISGEMHYPRIPRAAWRDRLRKARAMGLNAVTVYAFWNFHE 82

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              G + F G+ D+  FV+  Q+ GLF+ LR GPY CAEW+ GG+P WL   P +  R+ 
Sbjct: 83  EEEGHFDFTGQRDIAEFVRIAQQEGLFVILRPGPYVCAEWDLGGYPSWLLKSPAVNLRSL 142

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           ++ +     +++  +   +    L A++GGPI+  QVENEYG+   +     + Y+    
Sbjct: 143 DSRYIAAADKWMKALGQQLAP--LQAAKGGPILAVQVENEYGSFPDSAQPNAQAYLDRVH 200

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIIN------TCNGFYCDGFTPNS--------PSKPIM 227
                L+      +    D  D +        T    Y  G +  S        P+  I 
Sbjct: 201 QMV--LDAGFKDSLLYTGDGADVLARGTFADLTAGIDYGTGDSARSIALYKKFRPNTNIY 258

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
             E + GWF  +G              V     +GG+  + YM  GGT+FG   G  +  
Sbjct: 259 TAEYWDGWFDHWGAKHEVVDASIHLKEVHDVLTSGGSI-SLYMLHGGTSFGWMNGANIDH 317

Query: 286 -----VATSYDYDAPIDEYGFIRQPKWGHLREL 313
                  TSYDYDAPIDE G +R P++  +R++
Sbjct: 318 NHYEPDVTSYDYDAPIDEAGQLR-PEYFAMRKV 349


>gi|270295887|ref|ZP_06202087.1| beta-galactosidase [Bacteroides sp. D20]
 gi|270273291|gb|EFA19153.1| beta-galactosidase [Bacteroides sp. D20]
          Length = 1106

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  I  YVFWN HE   G + F 
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ DL  F +  Q+  +++ LR GPY CAEW  GG P WL     I+ R ++  F E + 
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   + + +    +    GGPII+ QVENEYG+              V   Y       
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
             WA++   N    + W M           N   G   D  F P     P  P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
           SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A   
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           TSYDYDAPI E G      W    EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668


>gi|317479674|ref|ZP_07938798.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
 gi|316904175|gb|EFV26005.1| glycosyl hydrolase family 35 [Bacteroides sp. 4_1_36]
          Length = 1106

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  I  YVFWN HE   G + F 
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ DL  F +  Q+  +++ LR GPY CAEW  GG P WL     I+ R ++  F E + 
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   + + +    +    GGPII+ QVENEYG+              V   Y       
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
             WA++   N    + W M           N   G   D  F P     P  P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
           SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A   
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           TSYDYDAPI E G      W    EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668


>gi|297194215|ref|ZP_06911613.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
 gi|197722531|gb|EDY66439.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
          Length = 590

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 110/330 (33%), Positives = 147/330 (44%), Gaps = 40/330 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DG+   L SG++HY R  PE WP  +R  +  GL+ +ETYV WN HEP  G+Y F+G  
Sbjct: 11  LDGRPLRLLSGALHYFRVLPEQWPHRLRMLRAMGLDTVETYVPWNLHEPRPGEYDFDGIA 70

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGI-QFRTTNNPFKEEMKRF 132
           DL RF+   +EAGL   +R  PY CAEW  GG P WL   P +   R  +  +   + R+
Sbjct: 71  DLDRFLHATREAGLHAIVRPSPYICAEWENGGLPWWLLADPEVGALRCQDPAYLAHVDRW 130

Query: 133 LAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP 192
             ++I ++    +  S+GG +++ QVENEYG+     G     Y++  A         VP
Sbjct: 131 FDRLIPVVAAHQV--SRGGNVLMVQVENEYGSYGTDTG-----YLEHLAAGLRARGIDVP 183

Query: 193 WVMCQQEDAPDPIINTCNGFYCDGFTPN---------------SPSKPIMWTENYSGWFL 237
                  D PD    T         T N                P  P M  E + GWF 
Sbjct: 184 LFTS---DGPDDFFLTGGALPGHLATVNFGSRPKEALADLARLRPDDPAMCMEFWCGWFD 240

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------------PL 285
            +G     R   D A  +      G +  N YM  GGTNF   AG             P 
Sbjct: 241 HWGTDHVVRDPADAAGVLEELLAAGASV-NVYMAHGGTNFSTWAGANTEDPAAGTGYRPT 299

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
           V TSYDYDAP+DE G   +  W     L +
Sbjct: 300 V-TSYDYDAPVDERGAATEKFWAFREVLER 328


>gi|408677368|ref|YP_006877195.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
 gi|328881697|emb|CCA54936.1| Beta-galactosidase, partial [Streptomyces venezuelae ATCC 10712]
          Length = 611

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 114/333 (34%), Positives = 159/333 (47%), Gaps = 44/333 (13%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   L SG++HY R   E W   +   +  GL  +ETYV WN HEP  G+Y    
Sbjct: 11  FLLDGRPVRLLSGALHYFRVREEQWEHRLGMLRAMGLNCVETYVPWNLHEPEPGRYADVA 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
              L RF+  V  AG++  +R GPY CAEW  GG P WL    G + R+ +  F   ++ 
Sbjct: 71  --ALGRFLDAVARAGMWAIVRPGPYICAEWENGGLPHWLTGPLGRRVRSFDPEFLAPVEA 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +  +++  + +  +   +GGP++L QVENEYG+    YG     Y++W A+       +V
Sbjct: 129 WFRRLLPQVVERQI--DRGGPVVLVQVENEYGS----YG-SDRAYLEWLAELLRGCGVAV 181

Query: 192 PWVMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSG 234
           P       D P+           ++ T N       GF       + PS P+M  E + G
Sbjct: 182 PLFTS---DGPEDHMLTGGSVPGVLATANFGSGAREGFAT--LRRHQPSGPLMCMEFWCG 236

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPL 285
           WF  +G     R   D A A+    E G +  N YM  GGTNFG  AG         GPL
Sbjct: 237 WFDHWGTEHAVRDAADAAEALREILECGASV-NVYMAHGGTNFGGFAGANRAGELHDGPL 295

Query: 286 VA--TSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
            A  TSYDYDAP+DE G   +  W   RE+  A
Sbjct: 296 RATVTSYDYDAPVDEAGRPTEKFW-RFREVLAA 327


>gi|295086466|emb|CBK67989.1| Beta-galactosidase [Bacteroides xylanisolvens XB1A]
          Length = 778

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 154/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F K  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L   +GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVDKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+  + K+  LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348



 Score = 40.8 bits (94), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 51/121 (42%), Gaps = 33/121 (27%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKTGQHICSFVSEAD 719
                    P QTL+ +P  W+  GEN +++ +  G   + I  L K    I   + E  
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPAKASIKGLKKP---ILDMLREKA 618

Query: 720 P 720
           P
Sbjct: 619 P 619


>gi|336404675|ref|ZP_08585368.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
 gi|335941579|gb|EGN03432.1| hypothetical protein HMPREF0127_02681 [Bacteroides sp. 1_1_30]
          Length = 778

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 154/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F K  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L   +GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVDKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+  + K+  LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348



 Score = 40.8 bits (94), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 32/104 (30%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTFKL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
                    P QTL+ +P  W+  GEN +++ +  G  P+K S+
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASI 603


>gi|257416321|ref|ZP_05593315.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
 gi|257158149|gb|EEU88109.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
          Length = 594

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|257087085|ref|ZP_05581446.1| beta-galactosidase [Enterococcus faecalis D6]
 gi|256995115|gb|EEU82417.1| beta-galactosidase [Enterococcus faecalis D6]
          Length = 594

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|384518826|ref|YP_005706131.1| beta-galactosidase [Enterococcus faecalis 62]
 gi|323480959|gb|ADX80398.1| beta-galactosidase [Enterococcus faecalis 62]
          Length = 594

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|423303842|ref|ZP_17281841.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
           CL03T00C23]
 gi|423307438|ref|ZP_17285428.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
           CL03T12C37]
 gi|392687173|gb|EIY80470.1| hypothetical protein HMPREF1072_00781 [Bacteroides uniformis
           CL03T00C23]
 gi|392690047|gb|EIY83318.1| hypothetical protein HMPREF1073_00178 [Bacteroides uniformis
           CL03T12C37]
          Length = 1106

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  I  YVFWN HE   G + F 
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ DL  F +  Q+  +++ LR GPY CAEW  GG P WL     I+ R ++  F E + 
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   + + +    +    GGPII+ QVENEYG+              V   Y       
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
             WA++   N    + W M           N   G   D  F P     P  P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
           SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A   
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           TSYDYDAPI E G      W    EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668


>gi|422722062|ref|ZP_16778639.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
 gi|424672983|ref|ZP_18109926.1| putative beta-galactosidase [Enterococcus faecalis 599]
 gi|315027959|gb|EFT39891.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX2137]
 gi|402352793|gb|EJU87629.1| putative beta-galactosidase [Enterococcus faecalis 599]
          Length = 604

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|307289344|ref|ZP_07569299.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
 gi|422704713|ref|ZP_16762523.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
 gi|306499711|gb|EFM69073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
 gi|315163744|gb|EFU07761.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
          Length = 604

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|257079244|ref|ZP_05573605.1| beta-galactosidase [Enterococcus faecalis JH1]
 gi|294780244|ref|ZP_06745615.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
 gi|397700110|ref|YP_006537898.1| beta-galactosidase [Enterococcus faecalis D32]
 gi|256987274|gb|EEU74576.1| beta-galactosidase [Enterococcus faecalis JH1]
 gi|294452672|gb|EFG21103.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
 gi|397336749|gb|AFO44421.1| beta-galactosidase [Enterococcus faecalis D32]
          Length = 594

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|257084951|ref|ZP_05579312.1| beta-galactosidase [Enterococcus faecalis Fly1]
 gi|256992981|gb|EEU80283.1| beta-galactosidase [Enterococcus faecalis Fly1]
          Length = 594

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|256762786|ref|ZP_05503366.1| beta-galactosidase [Enterococcus faecalis T3]
 gi|256684037|gb|EEU23732.1| beta-galactosidase [Enterococcus faecalis T3]
          Length = 594

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|422695218|ref|ZP_16753206.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
 gi|315147501|gb|EFT91517.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
          Length = 604

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|365860016|ref|ZP_09399844.1| putative beta-galactosidase [Streptomyces sp. W007]
 gi|364010544|gb|EHM31456.1| putative beta-galactosidase [Streptomyces sp. W007]
          Length = 645

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 125/384 (32%), Positives = 176/384 (45%), Gaps = 47/384 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   L SG++HY R     W   +      GL  +ETYV WN HEP  G+    G  
Sbjct: 13  LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG-- 70

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
            L RF+  V+ AGL+  +R GPY CAEW  GG PVW+    G + RT +  ++  ++R+ 
Sbjct: 71  ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWF 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            +++  + Q  +  S+GGP+IL Q ENEYG+    YG    +Y++W A        +VP 
Sbjct: 131 RELLPQVVQRQV--SRGGPVILVQAENEYGS----YGSDA-VYLEWLAGLLRQCGVTVPL 183

Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
                 D P+           ++ T N       GF       + P  P+M  E + GWF
Sbjct: 184 FTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFEV--LLRHQPRGPLMCMEFWCGWF 238

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GP-------L 285
             +G     R  E  A A+    E G +  N YM  GGTNFG  AG    GP        
Sbjct: 239 DHWGAEPVRRDPEQAAGALREVLECGASV-NIYMAHGGTNFGGWAGANRSGPHQDESFQP 297

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
             TSYDYDAP+DEYG   + K+   RE+ +A    E  L +  P    L   +   +   
Sbjct: 298 TVTSYDYDAPVDEYGRATE-KFRLFREVLEA--YAEGPLPALPPEPVGLAGPVRVELAEW 354

Query: 346 SS-NDCAAFLANYDSSSDANVTFN 368
           +   D    L + ++ S    TF 
Sbjct: 355 AGLGDVLEALGDPETESGVPPTFE 378


>gi|182439300|ref|YP_001827019.1| beta-galactosidase [Streptomyces griseus subsp. griseus NBRC 13350]
 gi|178467816|dbj|BAG22336.1| putative beta-galactosidase [Streptomyces griseus subsp. griseus
           NBRC 13350]
          Length = 630

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 123/384 (32%), Positives = 176/384 (45%), Gaps = 47/384 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   L SG++HY R     W   +      GL  +ETYV WN HEP  G+    G  
Sbjct: 13  LDGKPVRLLSGALHYFRVHEAQWEHRLAMLAAMGLNCVETYVPWNLHEPREGEVRDVG-- 70

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
            L RF+  V+ AGL+  +R GPY CAEW  GG PVW+    G + RT +  ++  ++R+ 
Sbjct: 71  ALGRFLDAVERAGLWAIVRPGPYICAEWENGGLPVWVTGRFGRRVRTRDAAYRAVVERWF 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            +++  + +  +  S+GGP++L Q ENEYG+    YG    +Y++W A        +VP 
Sbjct: 131 RELLPQVVRRQV--SRGGPVVLVQAENEYGS----YG-SDAVYLEWLAGLLRQCGVTVPL 183

Query: 194 VMCQQEDAPDP----------IINTCN-------GFYCDGFTPNSPSKPIMWTENYSGWF 236
                 D P+           ++ T N       GF       + P  P+M  E + GWF
Sbjct: 184 FTS---DGPEDHMLTGGSVPGLLATANFGSGAREGFAV--LRRHQPGGPLMCMEFWCGWF 238

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GP-------L 285
             +G     R  E  A A+    E G +  N YM  GGTNFG  AG    GP        
Sbjct: 239 DHWGAEPVRRDPEQAAGALREILECGASV-NVYMAHGGTNFGGWAGANRSGPHQDESFQP 297

Query: 286 VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHK 345
             TSYDYDAP+DEYG   + K+   RE+ +A    E  L +  P    L   +   +   
Sbjct: 298 TVTSYDYDAPVDEYGRATE-KFRLFREVLEA--YAEGPLPALPPEPVGLAGPVRVELAEW 354

Query: 346 SS-NDCAAFLANYDSSSDANVTFN 368
           +   D    L + ++ S    TF 
Sbjct: 355 APLGDVLEVLGDPETESGVPATFE 378


>gi|160890905|ref|ZP_02071908.1| hypothetical protein BACUNI_03350 [Bacteroides uniformis ATCC 8492]
 gi|156859904|gb|EDO53335.1| glycosyl hydrolase family 35 [Bacteroides uniformis ATCC 8492]
          Length = 1106

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 110/330 (33%), Positives = 150/330 (45%), Gaps = 41/330 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             +++GK  V+++  +HYPR     W + I+  K  G+  I  YVFWN HE   G + F 
Sbjct: 357 TFLLNGKPFVIKAAELHYPRIPKAYWDQRIKLCKALGMNTICLYVFWNSHESQPGVFDFT 416

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ DL  F +  Q+  +++ LR GPY CAEW  GG P WL     I+ R ++  F E + 
Sbjct: 417 GQNDLAEFCRLCQQNDMYVILRPGPYVCAEWEMGGLPWWLLKKKDIRLRESDPYFMERVG 476

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------------VEWAYGVGGELY 176
            F   + + +    +    GGPII+ QVENEYG+              V   Y       
Sbjct: 477 IFEKAVAEQVA--GMTIQNGGPIIMVQVENEYGSYGEDKGYVSQIRDIVRANYPGVALFQ 534

Query: 177 VKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS---PSKPIMWTENY 232
             WA++   N    + W M           N   G   D  F P     P  P+M +E +
Sbjct: 535 CDWASNFTKNGLHDLVWTM-----------NFGTGANIDQQFAPLKKLRPDSPLMCSEFW 583

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--- 287
           SGWF  +G     RP  D+   +      G +F + YM  GGTN+G  AG   P  A   
Sbjct: 584 SGWFDKWGANHETRPAADMIAGIDEMLSKGISF-SLYMTHGGTNWGHWAGANSPGFAPDV 642

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           TSYDYDAPI E G      W    EL KA+
Sbjct: 643 TSYDYDAPISESGQTTPKYW----ELRKAL 668


>gi|423215069|ref|ZP_17201597.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692332|gb|EIY85570.1| hypothetical protein HMPREF1074_03129 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 778

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 154/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWSHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFA 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F K  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCKLAQQHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L   +GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVDKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+  + K+  LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348



 Score = 40.8 bits (94), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 32/104 (30%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTF-KLDKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISL 703
                    P QTL+ +P  W+  GEN +++ +  G  P+K S+
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKG--PAKASI 603


>gi|380693434|ref|ZP_09858293.1| beta-galactosidase [Bacteroides faecis MAJ27]
          Length = 778

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 27/324 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHKA 316
           DAPI E G+  + K+  LR+L K 
Sbjct: 327 DAPISEAGWTTE-KYFLLRDLLKT 349


>gi|322390566|ref|ZP_08064082.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
 gi|321142719|gb|EFX38181.1| beta-galactosidase [Streptococcus parasanguinis ATCC 903]
          Length = 595

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 108/318 (33%), Positives = 152/318 (47%), Gaps = 41/318 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A  + G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +GQ+ F 
Sbjct: 9   AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           GR DL RF++T Q  GL++ +R  P+ CAEW +GG P WL     ++ R+++  F E + 
Sbjct: 69  GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPAFIEAVD 127

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
           R+  +++ L+    +   QGGPI++ QVENEYG    +YG   + Y++   D       +
Sbjct: 128 RYYDRLLGLLTPYQV--DQGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVT 180

Query: 191 VPWVMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWT 229
            P       D P            + +  T N         G   + F       P+M  
Sbjct: 181 CPLFTS---DGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWPLMCM 237

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
           E + GWF  +   V  R  E+LA AV    E G    N YM+ GGTNFG   G    G L
Sbjct: 238 EFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTL 295

Query: 286 ---VATSYDYDAPIDEYG 300
                TSYDY A ++E G
Sbjct: 296 DLPQVTSYDYGALLNEQG 313


>gi|422866702|ref|ZP_16913314.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
 gi|329578150|gb|EGG59560.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
          Length = 604

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|307269354|ref|ZP_07550702.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
 gi|306514322|gb|EFM82889.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
          Length = 604

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|312901788|ref|ZP_07761056.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
 gi|311291123|gb|EFQ69679.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
          Length = 604

 Score =  152 bits (384), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|431758215|ref|ZP_19546843.1| beta-galactosidase [Enterococcus faecium E3083]
 gi|430617878|gb|ELB54742.1| beta-galactosidase [Enterococcus faecium E3083]
          Length = 595

 Score =  152 bits (384), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 108/314 (34%), Positives = 147/314 (46%), Gaps = 34/314 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG    + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP  G + F G
Sbjct: 10  FLVDGIPTKIISGAIHYFRIPPSQWEHSLYNLKALGANTVETYIPWNLHEPQEGSFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++VRFVK  QE  L + LR   Y CAEW +GG P WL   P I+ R+T+  F E++K 
Sbjct: 70  FKNVVRFVKIAQELDLMVILRPCAYICAEWEFGGLPAWLLKEPNIRVRSTDPRFMEKLKN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ Q+ENEYG    +YG+    Y++   +  +  +  +
Sbjct: 130 YYQVL--LPKLAPLQITQGGPVIMMQLENEYG----SYGMEKS-YLRQTKELMLAHSIDI 182

Query: 192 PWVM-----CQQEDAPDPIINTC------------NGFYCDGFTPNSPSK-PIMWTENYS 233
           P         +  DA   I                N      F  N     PIM  E + 
Sbjct: 183 PLFTSDGAWLEVLDAGTLIDEDIFVTGNFGSHSKENAQVLKEFMQNHQKNWPIMCMEYWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------ 287
           GWF  +G  +  R  E+LA  V    E G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGEPIITRDPEELATEVKEMLEIGSL--NLYMFHGGTNFGFYNGCSARGNTDLPQ 300

Query: 288 -TSYDYDAPIDEYG 300
            TSYDYDA ++E G
Sbjct: 301 ITSYDYDALLNEAG 314


>gi|195342884|ref|XP_002038028.1| GM17976 [Drosophila sechellia]
 gi|194132878|gb|EDW54446.1| GM17976 [Drosophila sechellia]
          Length = 672

 Score =  152 bits (384), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 107/325 (32%), Positives = 158/325 (48%), Gaps = 39/325 (12%)

Query: 6   TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           T DH A   ++DG+     SGS HY R+ PE W   +R  +  GL  ++TYV W+ H P 
Sbjct: 47  TIDHEANTFLLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 106

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
            G+Y +EG  DLV+F++  QE   ++ LR GPY CAE + GG P WL    P I+ RT +
Sbjct: 107 DGEYNWEGIADLVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTND 166

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             +  E+ ++ A+++   + ++LF   GG II+ QVENEYG+    +      Y+ W  D
Sbjct: 167 PNYISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 219

Query: 183 --------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKP 225
                    A+     +P   + C +        D     IN  +  +        P+ P
Sbjct: 220 ETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGP 278

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
           ++ +E Y GW   +      R  +++A A+        +  N YM+FGGTNFG TAG   
Sbjct: 279 LVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANY 337

Query: 284 --------PLVATSYDYDAPIDEYG 300
                       TSYDYDA +DE G
Sbjct: 338 NLDGGIGYAADITSYDYDAVMDEAG 362


>gi|339640120|ref|ZP_08661564.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
           F0418]
 gi|339453389|gb|EGP66004.1| glycosyl hydrolase family 35 [Streptococcus sp. oral taxon 056 str.
           F0418]
          Length = 595

 Score =  152 bits (384), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 115/349 (32%), Positives = 163/349 (46%), Gaps = 40/349 (11%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+I Y R  P+ W E +   K  G   +ETY+ W+ HEP  GQ+  +G  
Sbjct: 12  LDGKPFKILSGAIQYFRLHPDQWRETLHNLKALGYNTVETYIPWSLHEPQEGQFVTDGLL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D   +   VQE GL L +R  PY CAE+++GG P WL   PG++FR  +  F E++ RF 
Sbjct: 72  DFEAYFDLVQEMGLHLIVRPTPYICAEFDFGGMPPWLLNYPGMRFRVNDALFLEKVSRFY 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP- 192
             +   +       ++GGPI++ QVENEYG    +Y    E Y++  A    +   SVP 
Sbjct: 132 DWLFPKLLPYQF--TEGGPILMMQVENEYG----SYAEDKE-YMRNIAKMMRDRGVSVPL 184

Query: 193 ------WVMCQQEDA--PDPIINTCN-GFYCDGFTPN--------SPSKPIMWTENYSGW 235
                 W+   +      D I  T N G      T N            P+M TE + GW
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGNFGSQAKENTDNLRAFMERHGKKWPLMCTEFWDGW 244

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVA 287
           F  +G  +  R  EDLA  V      G    N ++  GGTNFG        +T   P + 
Sbjct: 245 FSRWGEEIVRRDAEDLAQDVKEMMRIGSM--NLFLLRGGTNFGFISGCSARKTRDLPQI- 301

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
           TSYD+DAP+ E+G   +  +   R  H+     E+     DP  +K  A
Sbjct: 302 TSYDFDAPVTEWGVPTEKYYAVQRVTHELFPELEQ----MDPIIRKARA 346


>gi|256964894|ref|ZP_05569065.1| beta-galactosidase [Enterococcus faecalis HIP11704]
 gi|256955390|gb|EEU72022.1| beta-galactosidase [Enterococcus faecalis HIP11704]
          Length = 594

 Score =  152 bits (384), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 157/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L    GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G    G +     TSYD
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 304

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 305 YDAPLDEQGNPTEKYFALQKMLHE 328


>gi|313237463|emb|CBY12650.1| unnamed protein product [Oikopleura dioica]
          Length = 583

 Score =  152 bits (384), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 105/340 (30%), Positives = 153/340 (45%), Gaps = 31/340 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           +T D     +DGK   + SG+IHY R   + W   ++   + GL  I+ Y+ WN HE  R
Sbjct: 8   LTADGDTFKLDGKDFRILSGAIHYFRIPKQSWKHRLQSVVDCGLNTIDVYIPWNLHEKER 67

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           G + F G  DLV F     E GL +  R GPY C+EW++GG P WL   P +  R+    
Sbjct: 68  GNFDFAGELDLVEFFTIAAEMGLKVLCRPGPYICSEWDWGGLPSWLLKDPKMHIRSNYCG 127

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN--------VEW------AYG 170
           ++  +  + +K++ L+    L  S GGPII  QVENEYG+        + W      ++G
Sbjct: 128 YQAAVSSYFSKLLPLLAP--LQHSNGGPIIAFQVENEYGDYVDKDNEHLPWLADLMKSHG 185

Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
           +    ++     T    N        Q       ++     F      PN   KP++ TE
Sbjct: 186 LFELFFISDGGHTIRKANMLKVRSTAQLNSGSFQLL--AKAFSLKSLQPN---KPMLVTE 240

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV---- 286
            ++GWF  +G+       E     +    + G +  N+YM+ GGTNFG   G   +    
Sbjct: 241 FWAGWFDYWGHGRNLLNNEVFEKTLKEILKRGASV-NFYMFHGGTNFGFMNGAIELEKGY 299

Query: 287 ----ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
                TSYDYD P+DE G  R  KW  +R      K   E
Sbjct: 300 YTADVTSYDYDCPVDESG-NRTEKWEIIRRCLNVQKTSSE 338


>gi|320109257|ref|YP_004184847.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
 gi|319927778|gb|ADV84853.1| glycoside hydrolase family 35 [Terriglobus saanensis SP1PR4]
          Length = 640

 Score =  152 bits (384), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 117/342 (34%), Positives = 165/342 (48%), Gaps = 54/342 (15%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + +G +HY R     W + ++K+K  GL  I TYVFWN HEP  G Y F G+ 
Sbjct: 35  LDGKPFRILTGEMHYARIPRARWDDAMQKAKALGLNAITTYVFWNVHEPRPGVYDFTGQN 94

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL  ++   Q AGL + LR GPYACAEW +GG+P WL   P +  R+++  F + + ++ 
Sbjct: 95  DLGEYLAAAQRAGLKVILRPGPYACAEWEFGGYPAWLIKDPTVVVRSSDPKFMKPVAKWF 154

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVKWA 180
            ++    + +   A+ GGPII  QVENEYG+   + AY           G+GG+   K  
Sbjct: 155 HRLG--QEVQPYLAANGGPIIAVQVENEYGSFGNDHAYMEQMKDLVISSGIGGKNPKKAV 212

Query: 181 --------ADTAVNLNTSVPWVMCQQEDAPD--PIINTCNG------FYCDGFTPNSPSK 224
                    DT   L T+   V       P+   ++N   G         + F PN P  
Sbjct: 213 DEDGKNVPQDTGTMLYTADGGVQLPNGTLPELPAVVNFGGGQAKSELARYEAFRPNGPR- 271

Query: 225 PIMWTENYSGWFLSFG----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
             M  E ++GWF  +G           V +  + + R +       + YM +GGT+FG  
Sbjct: 272 --MVGEYWAGWFDHWGNNHQKTNAAEQVAEYEYMLKRGYSV-----SLYMLYGGTSFGWM 324

Query: 281 AGG---------PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
           AG          P V TSYDYDAPIDE G    PK+  LRE+
Sbjct: 325 AGANSGDKAPYEPDV-TSYDYDAPIDERGN-PTPKYFALREV 364


>gi|346320352|gb|EGX89953.1| beta-calactosidase, putative [Cordyceps militaris CM01]
          Length = 633

 Score =  152 bits (384), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 106/320 (33%), Positives = 159/320 (49%), Gaps = 31/320 (9%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           +Y+    +++G+   +  G +   R  PE W   ++ ++  GL  I +Y++WN HEP  G
Sbjct: 30  SYNRTDFLLNGQPFQIIGGQMDPQRILPEYWTHRLKMARAMGLNTIFSYLYWNLHEPRPG 89

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
            + F GR D+ RF +  Q+ GL + LR GPY C E ++GGFP WL  +PG+  R  N PF
Sbjct: 90  AWDFSGRNDVARFFRLAQQEGLRVVLRPGPYICGERDWGGFPAWLSQVPGMAVRQNNRPF 149

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
            +  K ++ ++   + Q  L  +QGGPI++AQ+ENEYG    ++G         AA    
Sbjct: 150 LDAAKSYIDRLGKELGQ--LQITQGGPILMAQLENEYG----SFGTDKTYLAALAAMLRE 203

Query: 186 NLNTSV--------PWVMCQQEDAPDPII--NTCNGFYCDGFTPNSPSK--PIMWTENYS 233
           N +  +         ++   Q      +I  ++ +GF         P+   P +  E Y 
Sbjct: 204 NFDVFLYTNDGGGQSYLEGGQLHGVLAVIDGDSQSGFAARDKYVTDPTSLGPQLNGEYYI 263

Query: 234 GWFLSFGYAVPFRPV----EDLAFAVARFFET--GGTFQNYYMYFGGTNFGRTAG----- 282
            W   +G   P + +     D+A AVA    T  GG   + YM+ GGTNFG   G     
Sbjct: 264 SWIDQWGSDYPHQQIAGSQADVAKAVADLDWTLAGGYSFSIYMFHGGTNFGFENGGIRDD 323

Query: 283 GPLVA--TSYDYDAPIDEYG 300
           GPL A  TSYDY AP+DE G
Sbjct: 324 GPLAAMTTSYDYGAPLDESG 343


>gi|298386767|ref|ZP_06996322.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
 gi|298260441|gb|EFI03310.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
          Length = 778

 Score =  152 bits (384), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 27/324 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHKA 316
           DAPI E G+  + K+  LR+L K 
Sbjct: 327 DAPISEAGWTTE-KYYLLRDLLKT 349


>gi|29349062|ref|NP_812565.1| beta-galactosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|383124327|ref|ZP_09944991.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
 gi|29340969|gb|AAO78759.1| beta-galactosidase precursor [Bacteroides thetaiotaomicron
           VPI-5482]
 gi|251839176|gb|EES67260.1| hypothetical protein BSIG_3645 [Bacteroides sp. 1_1_6]
          Length = 778

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 27/324 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWDHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFT 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTRNALDDLIWTINFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKEMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHKA 316
           DAPI E G+  + K+  LR+L K 
Sbjct: 327 DAPISEAGWTTE-KYYLLRDLLKT 349


>gi|307272985|ref|ZP_07554232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
 gi|306510599|gb|EFM79622.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0855]
          Length = 604

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 155/324 (47%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L    GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQLV--NGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|325261840|ref|ZP_08128578.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
 gi|324033294|gb|EGB94571.1| glycosyl hydrolase, family 35 [Clostridium sp. D5]
          Length = 581

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 108/326 (33%), Positives = 156/326 (47%), Gaps = 43/326 (13%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ID ++  + SG +HY R   E W + + K K  G   +ETY+ WN HE  +G++ FEG  
Sbjct: 12  IDNQKVKIISGGVHYFRIMAEYWKDCLLKLKAFGCNTVETYIPWNLHEKEKGEFCFEGNL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D+ +FV   ++ GL++ LR  PY CAEW +GG P WL    G++ R +  PF + ++ + 
Sbjct: 72  DITKFVHIAKDLGLYVILRPSPYICAEWEFGGLPYWLLKEDGMRLRCSYKPFLKHVEEYY 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            ++ +++    L  ++GGP+I+ QVENEYG     Y     LY+K   D  V+    VP 
Sbjct: 132 HRLFEVIAP--LQYTKGGPVIMMQVENEYG-----YYGNDTLYLKTLQDFMVSYGCEVPL 184

Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPNSPS---------------KPIMWTENYSGWFLS 238
           V     D P      C        T N  S               KP+M  E + GWF S
Sbjct: 185 VTS---DGPWGDAFDCGKLEGVLQTGNFGSKSRQQLQIMRDKIGNKPLMCMEFWVGWFDS 241

Query: 239 FGYAV-----PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------A 287
           +G        P +  E+L        E+G    N YM+ GGTNFG   G           
Sbjct: 242 WGQTEHKQEDPNKNAENL----DEILESGHV--NIYMFMGGTNFGFMNGSNYYDVLTPDV 295

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
           TSYDYDA + E G +  PK+  L+ +
Sbjct: 296 TSYDYDALLTEAGDL-TPKYELLKNV 320


>gi|67078211|ref|YP_245831.1| beta-galactosidase [Bacillus cereus E33L]
 gi|66970517|gb|AAY60493.1| beta-galactosidase [Bacillus cereus E33L]
          Length = 598

 Score =  152 bits (383), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 154/317 (48%), Gaps = 36/317 (11%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  ++DG+   + SG++HY R  PE W   +   K  G   +ETYV WN HEP  G + F
Sbjct: 8   KDFMLDGEPIKIISGALHYFRIVPEYWDHSLYNLKALGCNTVETYVPWNMHEPKEGIFNF 67

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
           EG  DLV++V+  Q+ GL + LR  PY CAEW +GG P WL     I+ R+  N F  ++
Sbjct: 68  EGIADLVKYVQLAQKYGLMVILRPTPYICAEWEFGGLPAWLLKYKDIRVRSNTNLFLNKV 127

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
           + F   ++ ++    L    GGPII+ QVENEYG    ++G   E YV+       +L  
Sbjct: 128 ENFYKVLLPMVTP--LQVENGGPIIMMQVENEYG----SFGNDKE-YVRNIKKLMRDLGV 180

Query: 190 SVP-------WVMCQQEDA--PDPIINTCN-GFYCDG--------FTPNSPSKPIMWTEN 231
           +VP       W    +  +   D ++ T N G   +            N    P+M  E 
Sbjct: 181 TVPLFTSDGAWQEALESGSLIDDDVLVTGNFGSRSNENLNELESFIKENKKEWPLMCMEF 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
           + GWF  +G  +  R   +LA  V    +      N+YM+ GGTNFG   G         
Sbjct: 241 WDGWFNRWGMEIIRRDGSELAEEVKELLKRASI--NFYMFQGGTNFGFMNGCSSRENVDL 298

Query: 284 PLVATSYDYDAPIDEYG 300
           P + TSYDYDA + E+G
Sbjct: 299 PQI-TSYDYDALLTEWG 314


>gi|424759896|ref|ZP_18187551.1| putative beta-galactosidase [Enterococcus faecalis R508]
 gi|402403967|gb|EJV36601.1| putative beta-galactosidase [Enterococcus faecalis R508]
          Length = 604

 Score =  152 bits (383), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 109/324 (33%), Positives = 156/324 (48%), Gaps = 26/324 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
           L+ D   ++L SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 21  LLNDQPFKIL-SGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|222526932|ref|YP_002571403.1| beta-galactosidase [Chloroflexus sp. Y-400-fl]
 gi|222450811|gb|ACM55077.1| Beta-galactosidase [Chloroflexus sp. Y-400-fl]
          Length = 917

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 151/313 (48%), Gaps = 17/313 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +V      + +DGK   L SG +HY R     W  L+ +++  GL  I+T + WN H
Sbjct: 21  MQHSVRVHRNGIELDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRH 80

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G++ F    DL  F+    E GL   +R GPY CAEW  GG P WL     ++ R+
Sbjct: 81  EPQPGEFDFSEEADLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWLTASGDMRLRS 140

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGV-GGELYVKW 179
            +  F++ + R+   ++ ++         GGPIIL Q+ENE+    WA GV G + + + 
Sbjct: 141 DDPAFRDAVLRWFDTLMPILVPRQY--PHGGPIILCQIENEH----WASGVYGADTHQQT 194

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMWTENYSGWF 236
            A  A+     VP   C       P          +         P  P++ +E +SGWF
Sbjct: 195 LAQAALERGIVVPQYTCVGAMPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWF 254

Query: 237 LSF-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV--ATS 289
            ++ G+    +    L   + +    G    +++M+ GGTNF    GRT GG L+   TS
Sbjct: 255 DNWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTS 314

Query: 290 YDYDAPIDEYGFI 302
           YDYDAP+DEYG +
Sbjct: 315 YDYDAPVDEYGRL 327


>gi|237719727|ref|ZP_04550208.1| beta-galactosidase [Bacteroides sp. 2_2_4]
 gi|229450996|gb|EEO56787.1| beta-galactosidase [Bacteroides sp. 2_2_4]
          Length = 778

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 95  GQNDIATFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+    K+  LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348



 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                    P QTL+ +P  W+  GEN +++ +  G   + I  L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608


>gi|422701998|ref|ZP_16759838.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
 gi|315169479|gb|EFU13496.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
          Length = 604

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 108/325 (33%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQVFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 313

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 314 DYDAPLDEQGNPTEKYFALQKMLHE 338


>gi|86142033|ref|ZP_01060557.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
           MED217]
 gi|85831596|gb|EAQ50052.1| putative exported beta-galactosidase [Leeuwenhoekiella blandensis
           MED217]
          Length = 620

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 119/344 (34%), Positives = 167/344 (48%), Gaps = 40/344 (11%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
            A+   ++ + V +GK   + SG +HY R   E W   I+  K  GL  I TYVFWNYH 
Sbjct: 26  DASFKIENGSFVYNGKPTPIYSGEMHYERIPKEYWRHRIQMMKAMGLNTIATYVFWNYHN 85

Query: 62  PIRGQYYFE-GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           P  G + FE G  ++  F+K  +E  +F+ LR GPYAC EW +GG+P +L  IPG++ R 
Sbjct: 86  PAPGVWDFESGNRNVAEFIKIAKEEEMFVILRPGPYACGEWEFGGYPWFLQNIPGLKVRE 145

Query: 121 TNNPFKEEMKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV-----------EWA 168
            N  F    K +   I +L KQ   L  + GG II+ QVENE+G+              A
Sbjct: 146 NNAQFLAACKEY---INELAKQVAPLQVNNGGNIIMTQVENEFGSYVAQREDIAPEDHKA 202

Query: 169 YGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGF-YCDGFTP-----NSP 222
           Y       +K A   A    +   W+   +  + + ++ T NG    D         N+ 
Sbjct: 203 YKEAIFKMLKDAGFQAPFFTSDGAWLF--EGGSLEGVLPTANGEGNIDNLKKVVNKFNNN 260

Query: 223 SKPIMWTENYSGWFLSFGYAVPFRPVE--DLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
             P M  E Y GW     +A PF  +   D+A     + + G  F N+YM  GGTNFG T
Sbjct: 261 EGPYMVAEFYPGWLDH--WAEPFVKISASDIAKQTEVYLKNGVNF-NFYMAHGGTNFGFT 317

Query: 281 AGG---------PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
           +G          P + TSYDYDAPI E G++  PK+  +R L +
Sbjct: 318 SGANYNDEHDIQPDI-TSYDYDAPISEAGWVT-PKYDSIRALMQ 359



 Score = 44.7 bits (104), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 57/221 (25%), Positives = 89/221 (40%), Gaps = 52/221 (23%)

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L +  L   A V+VN K V  G  N  F ++ +  KI  N    +L+IL   +G  NYGA
Sbjct: 431 LKVPGLRDFATVYVNGKKV--GELNRVFNSYEMPIKIPFN---GSLEILVENMGRINYGA 485

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
                  G+ + + I+      +++ G  +Y+          +   + NS+  K G  + 
Sbjct: 486 EIVNNLKGITAPVSIN----DYEITGGWEMYKAPF------AEVPEVINSTEVKTGRPVV 535

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
            + S    K        +G   LN++ MGKG  +VNG ++GRYW                
Sbjct: 536 YSGSFDLKK--------QGDTFLNMSEMGKGIVFVNGHNLGRYWKV-------------- 573

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEEL 694
                          P QTLY +P  W+    N + I E+L
Sbjct: 574 --------------GPQQTLY-VPGCWLKKKGNTITIFEQL 599


>gi|423294349|ref|ZP_17272476.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
           CL03T12C18]
 gi|392675540|gb|EIY68981.1| hypothetical protein HMPREF1070_01141 [Bacteroides ovatus
           CL03T12C18]
          Length = 778

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+    K+  LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348



 Score = 41.2 bits (95), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                    P QTL+ +P  W+  GEN +++ +  G   + I  L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608


>gi|319893645|ref|YP_004150520.1| beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
 gi|386318129|ref|YP_006014292.1| glycosyl hydrolase [Staphylococcus pseudintermedius ED99]
 gi|317163341|gb|ADV06884.1| Beta-galactosidase 3 [Staphylococcus pseudintermedius HKU10-03]
 gi|323463300|gb|ADX75453.1| glycosyl hydrolase, family 35 [Staphylococcus pseudintermedius
           ED99]
          Length = 590

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 107/315 (33%), Positives = 153/315 (48%), Gaps = 34/315 (10%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++D K   + SG+IHY R   + W + +   K  G   +ETYV WN+HE I  +Y F+
Sbjct: 9   TFLLDDKPIKILSGAIHYFRIPKDDWEDSLYNLKALGFNTVETYVPWNFHETIENEYDFK 68

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  DL  F++   + GL++ +R  PY CAEW +GGFP WL     ++ R+ +  + E++K
Sbjct: 69  GHKDLKHFIELAAKLGLYVIVRPSPYICAEWEFGGFPAWLLNDRTMRIRSRDEKYLEKVK 128

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
           ++  ++  ++    L   QGGPII+ QVENEYG    ++G   + Y++  A        +
Sbjct: 129 KYYHELFKILTP--LQIDQGGPIIMMQVENEYG----SFGQDHD-YLRSLAHMMREEGVT 181

Query: 191 VP-------WVMCQQ-----EDAPDPIIN----TCNGFYCDGFTPNSPSK--PIMWTENY 232
           VP       W  C +     ED   P  N    T   F          SK  P+M  E +
Sbjct: 182 VPFFTSDGAWDQCLRAGSLIEDDILPTGNFGSRTVQNFENLKTFQQEFSKKWPLMCMEFW 241

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPL 285
            GWF  +G  V  R  +DLA  V    + G    N YM+ GGTNFG       R      
Sbjct: 242 DGWFNRWGEPVIKRDSDDLAEEVRDAVKLGSL--NLYMFHGGTNFGFWNGCSARGTKDLP 299

Query: 286 VATSYDYDAPIDEYG 300
             TSYDY AP+DE G
Sbjct: 300 QVTSYDYHAPLDEAG 314


>gi|384248639|gb|EIE22122.1| hypothetical protein COCSUDRAFT_1093, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 632

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 111/328 (33%), Positives = 157/328 (47%), Gaps = 37/328 (11%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SGS+HY R  P  W + + ++K  GL  +  YV WN HEP  GQY ++G  
Sbjct: 28  MDGKPFRIISGSLHYHRIHPAQWKDRMLRTKALGLNTLSVYVPWNLHEPFPGQYNWDGFA 87

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPG---------IQFRTTNNP 124
           DL  ++   QE GL++ LR GPY CAEW++GGFP WL              +  R+ +  
Sbjct: 88  DLEAYLALAQEQGLYVLLRPGPYICAEWDFGGFPWWLASSKAGLCSTSSHSVTLRSDDPA 147

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVGGELYVKWA 180
           + E + R+   +  L K      S+GG I++ QVENE+G    N ++   + G +     
Sbjct: 148 YLELVDRWWKVL--LPKIGRFLYSRGGNILMVQVENEFGFVGPNEKYMRHLVGTVRAS-L 204

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTC---------NGFYCDGFTPNSPSK-PIMWTE 230
            D A+   T  P  + +     D +++           N  +      N+P K P M +E
Sbjct: 205 GDDALIYTTDPPPNIAKGTLPGDEVLSVVDFGAGWFDLNWAFSQQRAMNAPGKSPPMCSE 264

Query: 231 NYSGWFLSFGYAVPFRPVE---DLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
            Y+GW   +G  +    V+   D    V  F    G+  N YM  GGTNFG TAGG +  
Sbjct: 265 FYTGWLTRWGEKMANTSVDQFLDTLHGVLGFANNTGSV-NLYMVHGGTNFGFTAGGSIDN 323

Query: 286 -----VATSYDYDAPIDEYGFIRQPKWG 308
                  TSYDYDAPI E G   QP  G
Sbjct: 324 GVYWACITSYDYDAPISEAGDTGQPGIG 351


>gi|163848976|ref|YP_001637020.1| beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
 gi|163670265|gb|ABY36631.1| Beta-galactosidase [Chloroflexus aurantiacus J-10-fl]
          Length = 897

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 151/313 (48%), Gaps = 17/313 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           +  +V      + +DGK   L SG +HY R     W  L+ +++  GL  I+T + WN H
Sbjct: 1   MQHSVRVHRNGIELDGKPFYLLSGCVHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRH 60

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G++ F    DL  F+    E GL   +R GPY CAEW  GG P WL     ++ R+
Sbjct: 61  EPQPGEFDFSEEADLGAFLDLCHELGLKAIVRPGPYICAEWENGGLPAWLTASGDMRLRS 120

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGV-GGELYVKW 179
            +  F++ + R+   ++ ++         GGPIIL Q+ENE+    WA GV G + + + 
Sbjct: 121 DDPAFRDAVLRWFDTLMPILVPRQY--PHGGPIILCQIENEH----WASGVYGADTHQQT 174

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMWTENYSGWF 236
            A  A+     VP   C       P          +         P  P++ +E +SGWF
Sbjct: 175 LAQAALERGIVVPQYTCVGAMPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWF 234

Query: 237 LSF-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV--ATS 289
            ++ G+    +    L   + +    G    +++M+ GGTNF    GRT GG L+   TS
Sbjct: 235 DNWGGHRQTRKTAAKLDMTLHQLTAVGCAGFSHWMWAGGTNFGFWGGRTVGGDLIHMTTS 294

Query: 290 YDYDAPIDEYGFI 302
           YDYDAP+DEYG +
Sbjct: 295 YDYDAPVDEYGRL 307


>gi|393780989|ref|ZP_10369190.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
           CL02T12C01]
 gi|392677324|gb|EIY70741.1| hypothetical protein HMPREF1071_00058 [Bacteroides salyersiae
           CL02T12C01]
          Length = 776

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++G+  ++++  +HY R     W   I+  K  G+  I  YVFWN HE   GQ+ F
Sbjct: 32  KTFLLNGEPFIVKAAELHYTRIPQPYWEHRIKMCKALGMNTICLYVFWNIHEQEEGQFDF 91

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E +
Sbjct: 92  TGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERV 151

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN- 188
             F+ K+ + +    L  ++GG II+ QVENEYG    +YG   + YV    D       
Sbjct: 152 GIFMKKVGEQLVP--LQITRGGNIIMVQVENEYG----SYGT-DKPYVSAIRDMVRGAGF 204

Query: 189 TSVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWF 236
           T VP   C        +A D ++ T N   G   D          P  P+M +E +SGWF
Sbjct: 205 TEVPLFQCDWSSNFTNNALDDLLWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWF 264

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYD 291
             +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYD
Sbjct: 265 DHWGRKHETRPAKDMVQGLKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYD 323

Query: 292 YDAPIDEYGFIRQPKWGHLRELHKA 316
           YDAPI E G+  + K+  LR+L K 
Sbjct: 324 YDAPISEAGWTTE-KYFLLRDLLKG 347


>gi|164519029|ref|NP_001019529.2| beta-galactosidase-1-like protein 3 precursor [Rattus norvegicus]
          Length = 644

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 155/320 (48%), Gaps = 29/320 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G + ++  GSIHY R   E W + + K +  G   + TY+ WN HE  RG++ F  
Sbjct: 69  FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 128

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  +V   +  GL++ LR GPY CAE + GG P WL   PG   RTTN  F E + +
Sbjct: 129 ILDLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDK 188

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   K   L   +GGP+I  QVENEYG+         + Y+++       LN  +
Sbjct: 189 YFDHLIP--KILPLQYRRGGPVIAVQVENEYGSFR-----NDKNYMEYIKKAL--LNRGI 239

Query: 192 PWVMCQQEDAPDPIINTCNG---------FYCDGFTP---NSPSKPIMWTENYSGWFLSF 239
             ++   ++     I +  G         F  D F         KPIM  E ++GW+ S+
Sbjct: 240 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 299

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
           G     +   ++   + RFF  G +F N YM+ GGTNFG   GG        V TSYDYD
Sbjct: 300 GSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYD 358

Query: 294 APIDEYGFIRQPKWGHLREL 313
           A + E G   + K+  LR+L
Sbjct: 359 AVLSEAGDYTE-KYFKLRKL 377


>gi|307707961|ref|ZP_07644436.1| beta-galactosidase [Streptococcus mitis NCTC 12261]
 gi|307616026|gb|EFN95224.1| beta-galactosidase [Streptococcus mitis NCTC 12261]
          Length = 595

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 151/307 (49%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRLRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L   +GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLSRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|383110805|ref|ZP_09931623.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
 gi|313694380|gb|EFS31215.1| hypothetical protein BSGG_1915 [Bacteroides sp. D2]
          Length = 778

 Score =  152 bits (383), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+    K+  LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348



 Score = 41.2 bits (95), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                    P QTL+ +P  W+  GEN +++ +  G   + I  L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608


>gi|337283005|ref|YP_004622476.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
 gi|335370598|gb|AEH56548.1| beta-galactosidase [Streptococcus parasanguinis ATCC 15912]
          Length = 595

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/307 (34%), Positives = 149/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           + G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +GQ+ F GR 
Sbjct: 12  LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q  GL++ +R  P+ CAEW +GG P WL     ++ R+++  F E + R+ 
Sbjct: 72  DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDMRIRSSDPAFIEAVDRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
             ++ L+    +   QGGPI++ QVENEYG+   + AY       +K    T     +  
Sbjct: 131 DHLLGLLTPYQV--DQGGPILMMQVENEYGSYGEDKAYLRAIRDLMKKKGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      + +  T N         G   + F       P+M  E + GWF  + 
Sbjct: 189 PWRAALRAGTLIEEDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFWDGWFTRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             V  R  E+LA AV    E G    N YM+ GGTNFG   G    G L     TSYDY 
Sbjct: 249 EPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYG 306

Query: 294 APIDEYG 300
           A ++E G
Sbjct: 307 ALLNEQG 313


>gi|164519028|ref|NP_001106794.1| beta-galactosidase-1-like protein 3 precursor [Mus musculus]
          Length = 662

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 109/320 (34%), Positives = 155/320 (48%), Gaps = 29/320 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G + ++  GSIHY R   E W + + K +  G   + TY+ WN HE  RG++ F  
Sbjct: 69  FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 128

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  +V   +  GL++ LR GPY CAE + GG P WL   P    RTTN  F E + +
Sbjct: 129 ILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDK 188

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   K   L    GGP+I  QVENEYG+ +         Y+K A      L   +
Sbjct: 189 YFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGI 239

Query: 192 PWVMCQQEDAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSF 239
             ++   +D     I + NG       + FT +S          KPIM  E ++GW+ S+
Sbjct: 240 VELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSW 299

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
           G     +  E++   V +F   G +F N YM+ GGTNFG   GG        V TSYDYD
Sbjct: 300 GSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYD 358

Query: 294 APIDEYGFIRQPKWGHLREL 313
           A + E G   + K+  LR+L
Sbjct: 359 AVLSEAGDYTE-KYFKLRKL 377


>gi|256393561|ref|YP_003115125.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
 gi|256359787|gb|ACU73284.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
          Length = 584

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/303 (33%), Positives = 146/303 (48%), Gaps = 24/303 (7%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DG+   + SG +HY R  P  W + +RK++  GL  I+TY+ WN HE   G + F G  
Sbjct: 13  LDGQPFRIVSGGLHYFRVHPAQWSDRLRKARLMGLNTIDTYIPWNLHERRPGTFDFGGIL 72

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL  F+      GL + LR GPY C EW  GG P WL   P +  R+T+  F + ++ +L
Sbjct: 73  DLAAFLDAAAAEGLHVLLRPGPYICGEWEGGGLPSWLLADPDLALRSTDPAFLQAVEAYL 132

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
             I+ ++       ++GGP+I  QVENEYG    AYG     Y++   +   +    VP+
Sbjct: 133 DAIMPIVLPR--LGTRGGPVIAVQVENEYG----AYG-SDTAYMERLYEALTSRGIDVPF 185

Query: 194 VMCQQ----EDAPDPIINTCNGF------YCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
               Q     D   P +     F               P+ P+M  E ++GWF  +G   
Sbjct: 186 FTSDQPNDLADGALPGVLATANFGGKVTASLAALRAQQPTGPLMCAEFWNGWFDYWGGTH 245

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAPID 297
             R  ED   A+    + G +  N+YM+ GGTNFG T G           TSYDYD+P+D
Sbjct: 246 AQRSAEDAGAALEEMLQAGASV-NFYMFHGGTNFGFTNGANDKGTYRATVTSYDYDSPLD 304

Query: 298 EYG 300
           E G
Sbjct: 305 EAG 307


>gi|195030628|ref|XP_001988170.1| GH10713 [Drosophila grimshawi]
 gi|193904170|gb|EDW03037.1| GH10713 [Drosophila grimshawi]
          Length = 680

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 156/324 (48%), Gaps = 36/324 (11%)

Query: 8   DHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           DH A   +++GK     SGS HY R+ P+ W   +R  +  GL  ++TYV W+ H P  G
Sbjct: 59  DHVANTFLMNGKPFRYVSGSFHYFRALPDAWRSRLRTMRASGLNALDTYVEWSLHNPHDG 118

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTNNP 124
           +Y +EG  D+VRF++  QE   ++ LR GPY CAE + GG P WL    P I+ RT +  
Sbjct: 119 EYDWEGIADIVRFLEIAQEEDFYIVLRPGPYICAERDNGGLPHWLFTKYPDIKVRTNDPN 178

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD-- 182
           +  E+ ++ A+++  +K  +L    GG II+ QVENEYG    AY      Y+ W  D  
Sbjct: 179 YIAEVGKWYAQLMPRLK--HLLFGNGGKIIMVQVENEYG----AYHACDHDYLNWLRDET 232

Query: 183 ------TAVNLNTSVPWVMCQQEDAPDPIINTCNG----FYCDG----FTPNSPSKPIMW 228
                  A+     +P          +    T  G    F  D          P+ P++ 
Sbjct: 233 DKYVENKALLFTVDIPNERMHCGKIDNVFATTDFGIDRIFEIDKIWELLRGIQPTGPLVN 292

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG----- 283
           +E Y GW   +      R  +++A A+ +    G +  N YM+FGGTNFG TAG      
Sbjct: 293 SEFYPGWLTHWQEMNQRRDGKEVADALKKILSYGASV-NLYMFFGGTNFGFTAGANYDLD 351

Query: 284 -----PLVATSYDYDAPIDEYGFI 302
                    TSYDYDA +DE G +
Sbjct: 352 GGIGYAADITSYDYDAVMDEAGGV 375


>gi|307710114|ref|ZP_07646558.1| beta-galactosidase [Streptococcus mitis SK564]
 gi|307619094|gb|EFN98226.1| beta-galactosidase [Streptococcus mitis SK564]
          Length = 595

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRLRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLSRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELAEAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|348573621|ref|XP_003472589.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase-1-like protein
           3-like [Cavia porcellus]
          Length = 679

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 109/325 (33%), Positives = 151/325 (46%), Gaps = 19/325 (5%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A+ T       ++G + ++  GSIHY R   E W + + K K  G   + TY+ WN HE
Sbjct: 92  TASTTKGRAHFTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYIPWNLHE 151

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P RG++ F G  DL  FV    E GL++ LR GPY CAE + GG P WL   P  Q RTT
Sbjct: 152 PQRGKFVFSGNLDLEAFVLLAAEIGLWVILRPGPYICAEIDLGGLPSWLLQNPKTQLRTT 211

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
              F + +  +   ++  M    L    GGP+I  QVENEYG+          L      
Sbjct: 212 ERTFVDAVDAYFDHLMRRMVP--LQYHHGGPVIAVQVENEYGSFNRDGQYMAYLKEALLK 269

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTC-------NGFYCDGFTPNSPSKPIMWTENYSG 234
              V L  +  +       +   ++ T        N FY          KPI+  E + G
Sbjct: 270 RGIVELLFTCDYYKDVVNGSLKGVLATVNLGSLGKNSFY--QLLQVQSHKPILIMEYWVG 327

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR------TAGGPLVAT 288
           W+ S+G     +   ++A  V+ F + G +F N YM+ GGTNFG         G   V T
Sbjct: 328 WYDSWGLPHANKSAAEVAHTVSTFIKNGISF-NVYMFHGGTNFGFINAAGIVEGRRSVTT 386

Query: 289 SYDYDAPIDEYGFIRQPKWGHLREL 313
           SYDYDA + E G   + K+  LREL
Sbjct: 387 SYDYDAVLSEAGDYTE-KYFKLREL 410


>gi|81889875|sp|Q5XIL5.1|GLBL3_RAT RecName: Full=Beta-galactosidase-1-like protein 3
 gi|53734228|gb|AAH83665.1| Galactosidase, beta 1-like 3 [Rattus norvegicus]
          Length = 631

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 155/320 (48%), Gaps = 29/320 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G + ++  GSIHY R   E W + + K +  G   + TY+ WN HE  RG++ F  
Sbjct: 56  FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 115

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  +V   +  GL++ LR GPY CAE + GG P WL   PG   RTTN  F E + +
Sbjct: 116 ILDLEAYVLLAKTLGLWVILRPGPYICAEVDLGGLPSWLLRNPGSNLRTTNKDFIEAVDK 175

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   K   L   +GGP+I  QVENEYG+         + Y+++       LN  +
Sbjct: 176 YFDHLIP--KILPLQYRRGGPVIAVQVENEYGSFR-----NDKNYMEYIKKAL--LNRGI 226

Query: 192 PWVMCQQEDAPDPIINTCNG---------FYCDGFTP---NSPSKPIMWTENYSGWFLSF 239
             ++   ++     I +  G         F  D F         KPIM  E ++GW+ S+
Sbjct: 227 VELLLTSDNESGIRIGSVKGALATINVNSFIKDSFVKLHRMQNDKPIMIMEYWTGWYDSW 286

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
           G     +   ++   + RFF  G +F N YM+ GGTNFG   GG        V TSYDYD
Sbjct: 287 GSKHTEKSANEIRRTIYRFFSYGLSF-NVYMFHGGTNFGFINGGYHENGHTNVVTSYDYD 345

Query: 294 APIDEYGFIRQPKWGHLREL 313
           A + E G   + K+  LR+L
Sbjct: 346 AVLSEAGDYTE-KYFKLRKL 364


>gi|387878583|ref|YP_006308886.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
 gi|386792040|gb|AFJ25075.1| Beta-galactosidase 3 [Streptococcus parasanguinis FW213]
          Length = 595

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 107/315 (33%), Positives = 151/315 (47%), Gaps = 41/315 (13%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           + G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +GQ+ F GR 
Sbjct: 12  LKGQPFKILSGAIHYFRIDPADWYHSLFNLKALGFNTVETYVPWNVHEPRKGQFDFSGRL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++  Q  GL++ +R  P+ CAEW +GG P WL     ++ R+++  F E + R+ 
Sbjct: 72  DLERFIQIAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDMRIRSSDPAFIEAVDRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
             ++ L+ +  +   QGGPI++ QVENEYG    +YG   ++Y++   D       + P 
Sbjct: 131 DHLLGLLTRYQV--DQGGPILMMQVENEYG----SYG-EDKVYLRAIRDLMKKKGVTCPL 183

Query: 194 VMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWTENY 232
                 D P            D +  T N         G   + F       P+M  E +
Sbjct: 184 FTS---DGPWRATLRAGTLIEDDLFVTGNFGSKAAYNFGQMQEFFDEYGKKWPLMCMEFW 240

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL--- 285
            GWF  +   V  R  E+LA AV    E G    N YM+ GGTNFG   G    G L   
Sbjct: 241 DGWFTRWKEPVIQREPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLP 298

Query: 286 VATSYDYDAPIDEYG 300
             TSYDY A ++E G
Sbjct: 299 QVTSYDYGALLNEQG 313


>gi|148693363|gb|EDL25310.1| mCG125130, isoform CRA_b [Mus musculus]
          Length = 688

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 109/320 (34%), Positives = 155/320 (48%), Gaps = 29/320 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G + ++  GSIHY R   E W + + K +  G   + TY+ WN HE  RG++ F  
Sbjct: 95  FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 154

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  +V   +  GL++ LR GPY CAE + GG P WL   P    RTTN  F E + +
Sbjct: 155 ILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDK 214

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   K   L    GGP+I  QVENEYG+ +         Y+K A      L   +
Sbjct: 215 YFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGI 265

Query: 192 PWVMCQQEDAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSF 239
             ++   +D     I + NG       + FT +S          KPIM  E ++GW+ S+
Sbjct: 266 VELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSW 325

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
           G     +  E++   V +F   G +F N YM+ GGTNFG   GG        V TSYDYD
Sbjct: 326 GSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYD 384

Query: 294 APIDEYGFIRQPKWGHLREL 313
           A + E G   + K+  LR+L
Sbjct: 385 AVLSEAGDYTE-KYFKLRKL 403


>gi|256424388|ref|YP_003125041.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
 gi|256039296|gb|ACU62840.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
          Length = 586

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 114/329 (34%), Positives = 166/329 (50%), Gaps = 40/329 (12%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  ++D K   + SG +H  R   E W   I+ +K  G   I  YVFWNYHE   G++ F
Sbjct: 17  KDFLLDSKPYQIISGEMHPARIPKEYWRHRIQMAKAMGCNTIAAYVFWNYHEQEEGKFDF 76

Query: 70  --EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKE 127
             E R D+V F+K VQE G+++ LR GPY CAEW +GG P +L  IP I+ R  +  +  
Sbjct: 77  TSENR-DIVAFIKMVQEEGMWVMLRPGPYVCAEWEFGGLPPYLLRIPDIKVRCMDPRYIA 135

Query: 128 EMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL 187
             +R++  + + +K   L  + GGPI++ QVENEYG    ++G   E  +K   D  V  
Sbjct: 136 ATERYIKALSEEVKP--LQITNGGPIVMVQVENEYG----SFGNDREYMLK-VKDMWVQN 188

Query: 188 NTSVPW--------VMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWF 236
             +VP+         + +    P   I   +G     F      +P  P   +E+Y GW 
Sbjct: 189 GINVPFYTADGPVSALLEAGSVPGAAIGLDSGSSEGDFAAAEKQNPDVPSFSSESYPGWL 248

Query: 237 LSFG--YAVPFRP--VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------P 284
             +G  +A P +   V+++ F      +T  +F N Y+  GGTNFG TAG         P
Sbjct: 249 THWGEKWARPDKAGIVKEVKF----LMDTKRSF-NLYVIHGGTNFGFTAGANSGGKGYEP 303

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
            + TSYDYDAPI+E G     K+  LR+L
Sbjct: 304 DL-TSYDYDAPINEQGDTTA-KYNALRDL 330


>gi|422729668|ref|ZP_16786066.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
 gi|315149788|gb|EFT93804.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
          Length = 604

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 110/324 (33%), Positives = 159/324 (49%), Gaps = 26/324 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
           L+ D   ++L SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 21  LLNDQPFKIL-SGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPL---VATSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNF    G +A G +     TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFEFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|143955283|sp|A2RSQ1.1|GLBL3_MOUSE RecName: Full=Beta-galactosidase-1-like protein 3
 gi|124297651|gb|AAI32201.1| Glb1l3 protein [Mus musculus]
 gi|124297899|gb|AAI32203.1| Glb1l3 protein [Mus musculus]
          Length = 649

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 109/320 (34%), Positives = 155/320 (48%), Gaps = 29/320 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G + ++  GSIHY R   E W + + K +  G   + TY+ WN HE  RG++ F  
Sbjct: 56  FTLEGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSE 115

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  +V   +  GL++ LR GPY CAE + GG P WL   P    RTTN  F E + +
Sbjct: 116 ILDLEAYVLLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDK 175

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   K   L    GGP+I  QVENEYG+ +         Y+K A      L   +
Sbjct: 176 YFDHLIP--KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGI 226

Query: 192 PWVMCQQEDAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSF 239
             ++   +D     I + NG       + FT +S          KPIM  E ++GW+ S+
Sbjct: 227 VELLLTSDDKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSW 286

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
           G     +  E++   V +F   G +F N YM+ GGTNFG   GG        V TSYDYD
Sbjct: 287 GSKHIEKSAEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYD 345

Query: 294 APIDEYGFIRQPKWGHLREL 313
           A + E G   + K+  LR+L
Sbjct: 346 AVLSEAGDYTE-KYFKLRKL 364


>gi|167524869|ref|XP_001746770.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775040|gb|EDQ88666.1| predicted protein [Monosiga brevicollis MX1]
          Length = 600

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 104/304 (34%), Positives = 152/304 (50%), Gaps = 32/304 (10%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR-FDLVRFVKT 81
           SGS+HY R   E W + +  +K  GL  I TYV WN+HE   G + FE    DL RF+  
Sbjct: 70  SGSLHYFRIPAEYWLDRLEMAKHMGLNTISTYVPWNFHEVGPGSFDFETHAHDLARFLNL 129

Query: 82  VQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMK 141
             E GL + +R  PY CAEW++GG P  L   P ++ R++N+ F +E++R+   ++ +++
Sbjct: 130 AHEVGLRVLIRPSPYICAEWDFGGLPARLMANPDLELRSSNDAFLDEVERYYDALMPILR 189

Query: 142 QENLFASQGGPIILAQVENEYGNVEWAYGVG--------------GELYVKWAADTAVNL 187
              L AS GGPII   VENEYG    +YG                G +   +  D A  L
Sbjct: 190 P--LQASNGGPIIAFYVENEYG----SYGADRDYLQALVAMMRDRGIVEQMFTCDNAQGL 243

Query: 188 NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRP 247
           +        Q  +  D +       + D      P +P+M +E ++GWF   G       
Sbjct: 244 SRGALPGALQTINFQDNVER-----HLDQLAHFQPDQPLMVSEYWTGWFDHDGEEHHTFD 298

Query: 248 VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA--TSYDYDAPIDEYGFIR 303
            EDL   + +  + G +F N Y++ GGT+FG  AG   P     TSYDYDAP+ E+G + 
Sbjct: 299 SEDLVEGLQKILDRGASF-NLYVFHGGTSFGWNAGANSPYAPDITSYDYDAPLSEHGQV- 356

Query: 304 QPKW 307
            PK+
Sbjct: 357 TPKY 360


>gi|445493871|ref|ZP_21460915.1| beta-galactosidase [Janthinobacterium sp. HH01]
 gi|444790032|gb|ELX11579.1| beta-galactosidase [Janthinobacterium sp. HH01]
          Length = 783

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 107/309 (34%), Positives = 159/309 (51%), Gaps = 29/309 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   ++ G +H+ R   E WP  ++  K  GL  +  Y+FWNYHE   G++ + G
Sbjct: 41  FLLDGKPLQIRCGEMHFARVPREYWPHRLKAIKAMGLNTVCAYLFWNYHEWREGKFDWSG 100

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNNP-FKEEM 129
           + D V F +  ++ GL++ LR GPYACAEW  GG P W L+  PG  F  T +P F +  
Sbjct: 101 QRDAVEFCRLARQEGLWVILRPGPYACAEWEMGGLPWWLLNKHPGDAFLRTRDPAFVDPA 160

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL-YVKWAADTAVNLN 188
           +R+L ++  ++    +  +QGGPI++ QVENEYG        G +L Y++      ++  
Sbjct: 161 RRYLREVGRVLAPMQV--TQGGPILMVQVENEYGF------FGEDLEYMRTMRQALLDAR 212

Query: 189 TSVPWVMCQQEDAPD----PIINTCNGFYCD---GFTPNSPSK--PIMWTENYSGWFLSF 239
             VP   C   +A      P + T   F  D   GF   +  +  P+M  E YSGWF ++
Sbjct: 213 FDVPLFQCNPTNAVAKTHLPGMLTVANFGSDPAGGFKALAAVQQAPLMCGEYYSGWFDTW 272

Query: 240 GYAVPFRPVEDLAFAV--ARFFETGGTFQNYYMYFGGTNFGRTAG--GPLV--ATSYDYD 293
           G   P R  ++ +  V      +  G+F + YM  GGT F    G   P     TSYDYD
Sbjct: 273 GN--PHRRGDNTSAVVDIQAMLKANGSF-SLYMAHGGTTFSLWGGCDRPFRPDTTSYDYD 329

Query: 294 APIDEYGFI 302
           API E G++
Sbjct: 330 APISEAGWV 338


>gi|384108880|ref|ZP_10009768.1| Beta-galactosidase [Treponema sp. JC4]
 gi|383869584|gb|EID85195.1| Beta-galactosidase [Treponema sp. JC4]
          Length = 592

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 170/352 (48%), Gaps = 32/352 (9%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK   + SGSIHY R  PE W + + K K  G   +ETY+ WN  EP +G++ F+
Sbjct: 9   TFLLDGKPFQIISGSIHYFRVVPEYWQDRLEKLKNMGCNTVETYIPWNITEPRKGEFCFD 68

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  D  +F+   Q+ GL+  +R  PY CAEW  GG P W+  +PG++ R  N P+ + ++
Sbjct: 69  GLCDFEKFLDLAQKLGLYAIVRPSPYICAEWELGGLPSWIFTVPGLEPRCKNEPYYQNVR 128

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG--NVEWAYGVGGELYVKWAADTAVNLN 188
            +   ++  +    +   +GG IIL Q+ENEYG    + +Y    E  ++    T   + 
Sbjct: 129 DYYKVLLPRLVNHQI--DKGGNIILMQIENEYGYYGKDMSYMHFLEGLMREGGITVPFVT 186

Query: 189 TSVPWVMCQQEDAPDPIINTCN-GFYCDGFTPNSPSK--------PIMWTENYSGWFLSF 239
           +  PW         D  + T N G +      N            P+M  E + GWF ++
Sbjct: 187 SDGPWGKMFIHGQCDGALPTGNFGSHARPLFANMKRMMKKTGNRGPLMCMEFWIGWFDAW 246

Query: 240 G-----YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------AT 288
           G      +   R ++DL + + +    G    N+YM+ GGTNFG   G           T
Sbjct: 247 GNKEHKTSKLKRNIKDLNYMLKK----GNV--NFYMFHGGTNFGFMNGSNYFTKLTPDTT 300

Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEA 340
           SYDYDAP+ E G I + K+   + + K  +  EE  +S+    QK   K++A
Sbjct: 301 SYDYDAPLSEDGKITE-KYRTFQSIIKKYRDFEEMPLSTK-IEQKAYGKVKA 350


>gi|160885481|ref|ZP_02066484.1| hypothetical protein BACOVA_03481 [Bacteroides ovatus ATCC 8483]
 gi|423290348|ref|ZP_17269197.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
           CL02T12C04]
 gi|156109103|gb|EDO10848.1| glycosyl hydrolase family 35 [Bacteroides ovatus ATCC 8483]
 gi|392665735|gb|EIY59258.1| hypothetical protein HMPREF1069_04240 [Bacteroides ovatus
           CL02T12C04]
          Length = 778

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/323 (32%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     +  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDVALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN-T 189
            F+ ++   +    L  ++GG II+ QVENEYG    +YG   + YV    D       T
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGT-DKPYVSAVRDLVRESGFT 207

Query: 190 SVPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+  + K+  LR+L K
Sbjct: 327 DAPISEAGWTTE-KFFLLRDLLK 348



 Score = 41.2 bits (95), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 81/197 (41%), Gaps = 47/197 (23%)

Query: 512 LNEGINTLDILSMMVGLQNYG-AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVE 569
           L +GI  LDIL   +G  N+  +  D  G        ++L +G +      W +Y   V+
Sbjct: 457 LKKGIQ-LDILVEAMGRVNFDKSIHDRKGI----TEKVELISGNQTKELKNWTVYNFPVD 511

Query: 570 GEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVN 629
             +I   K S       K   T+P      +YK+TF   +  G   L++++ GKG  WVN
Sbjct: 512 YSFIKDKKYSDT-----KILPTMPA-----YYKSTFTL-DKVGDTFLDMSTWGKGMVWVN 560

Query: 630 GQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLV 689
           G ++GR+W                               P QTL+ +P  W+  GEN ++
Sbjct: 561 GHAMGRFWEI----------------------------GPQQTLF-MPGCWLKEGENEIL 591

Query: 690 IHEELGGDPSKISLLTK 706
           + +  G   + I  L K
Sbjct: 592 VLDLKGPTRASIKGLKK 608


>gi|153807689|ref|ZP_01960357.1| hypothetical protein BACCAC_01971 [Bacteroides caccae ATCC 43185]
 gi|149130051|gb|EDM21263.1| glycosyl hydrolase family 35 [Bacteroides caccae ATCC 43185]
          Length = 775

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 113/330 (34%), Positives = 161/330 (48%), Gaps = 34/330 (10%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V  ++    I+GK   L  G +HYPR   E W + + +++  GL  +  YVFWN+HE  
Sbjct: 29  QVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRARAMGLNTVSAYVFWNFHERQ 88

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G + F G+ D+  FV+  QE GL++ LR GPY CAEW++GG+P WL     + +R+ + 
Sbjct: 89  PGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDLTYRSKDP 148

Query: 124 PFKEEMKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            F    +R+   I +L KQ   L  + GG II+ QVENEYG    +Y    E Y+    D
Sbjct: 149 RFMSYCERY---IKELGKQLAPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRD 200

Query: 183 TAVNLNTSVPWVMCQ---QEDAPD--PIINTCNGFYCDGF----TPNSPSKPIMWTENYS 233
                  +VP   C    Q +A      + T NG + +          P  P    E Y 
Sbjct: 201 MLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFKIVDKYHPGGPYFVAEFYP 260

Query: 234 GWFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG- 283
            WF  +G    +V + RP E L + +       G   + YM+ GGTNF       T+GG 
Sbjct: 261 AWFDEWGKRHSSVAYERPAEQLDWMLGH-----GVSVSMYMFHGGTNFWYMNGANTSGGF 315

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
               TSYDYDAP+ E+G    PK+   RE+
Sbjct: 316 RPQPTSYDYDAPLGEWGNCY-PKYHAFREI 344



 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 68/277 (24%), Positives = 118/277 (42%), Gaps = 58/277 (20%)

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTS---DYLWYTASIHVMPGQGKEVFLN 475
           ++ F+  E K       +F +   +E + + +D      Y+ Y  +I   PG+ K   L 
Sbjct: 364 TTTFATVELKESAPLTTAFHQTIQSEDVLSMEDVGADFGYIHYQTTIKT-PGKQK---LI 419

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I+ L   A++ V+ K VA    + D      +  +++++   TL+IL    G  NYG   
Sbjct: 420 IQDLRDYAVILVDGKQVA----SLDRRYNQNSTTLDIHKVPATLEILVENTGRVNYGPDI 475

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
                G+ S +L     G   L+        G     + L K  +++ SF ++   +P  
Sbjct: 476 LFNRKGITSQVLW----GNEKLT--------GWSITPLPLYKEEVSSLSFGQEIKGVPA- 522

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
               +++ TF+  E +G   ++++  GKG  WVNG+S+GR+W+                 
Sbjct: 523 ----FHRGTFII-EQQGDCFVDMSQWGKGAVWVNGKSLGRFWNI---------------- 561

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                        P QTLY IP  W+  GEN +V+ E
Sbjct: 562 ------------GPQQTLY-IPAPWLKKGENEIVVFE 585


>gi|414880685|tpg|DAA57816.1| TPA: putative RAN GTPase activating family protein [Zea mays]
          Length = 598

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/292 (34%), Positives = 146/292 (50%), Gaps = 54/292 (18%)

Query: 269 YMYFGGTNFGRTAGGPLVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSD 328
           + Y GGTNFGRT+GGP + TSYDYDAP+DEYG IRQPK+GHL++LH  I+  E+ L+   
Sbjct: 308 FKYHGGTNFGRTSGGPYITTSYDYDAPLDEYGNIRQPKYGHLKDLHDLIRSMEKILV--- 364

Query: 329 PTHQKLGAKLEAHIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLPAWSVSILPDCKNV 388
             H K         Y+ +S    A   +     D  VT +G  + +PAWSVSILPDCK V
Sbjct: 365 --HGK---------YNDTSYGKNAIFVD----RDVKVTLSGGTHLVPAWSVSILPDCKTV 409

Query: 389 VFNTAKVISQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVG--ISGNR-SFVRPDLAEQ 445
            +NTAK+ +Q +       ++ N  E    +  +SW  E +   ++ +R SF    L EQ
Sbjct: 410 AYNTAKIKTQTS----VMVKKANSVEKEPEALRWSWMPENLKPFMTDHRDSFRHSQLLEQ 465

Query: 446 INTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVN----------------K 489
           I T+ D SDYLWY  S+    G+G    L + + GH     +                 +
Sbjct: 466 ITTSTDQSDYLWYRTSLE-HKGEGSYT-LYVNTSGHEMAKLLGRWSVRLPAPVSGEAPLR 523

Query: 490 KLVAFGYGNHDFAN-----------FLINKKIELNEGINTLDILSMMVGLQN 530
           K + F    H               F +   ++L+ G N + +LS  VGL++
Sbjct: 524 KELRFSPQRHSRTQGQNYSADGAFVFQLQSPVKLHSGKNYVSLLSGTVGLKS 575


>gi|312903555|ref|ZP_07762735.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
 gi|422689128|ref|ZP_16747240.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
 gi|422731840|ref|ZP_16788189.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
 gi|310633431|gb|EFQ16714.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0635]
 gi|315162138|gb|EFU06155.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0645]
 gi|315577890|gb|EFU90081.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0630]
          Length = 604

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 156/324 (48%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV W+ HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|442626280|ref|NP_001260120.1| beta galactosidase, isoform B [Drosophila melanogaster]
 gi|440213416|gb|AGB92656.1| beta galactosidase, isoform B [Drosophila melanogaster]
          Length = 670

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 39/325 (12%)

Query: 6   TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           T DH A   ++DG+     SGS HY R+ PE W   +R  +  GL  ++TYV W+ H P 
Sbjct: 45  TIDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 104

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
            G+Y +EG  D+V+F++  QE   ++ LR GPY CAE + GG P WL    P I+ RT +
Sbjct: 105 DGEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTND 164

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             +  E+ ++ A+++   + ++LF   GG II+ QVENEYG+    +      Y+ W  D
Sbjct: 165 PNYISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 217

Query: 183 --------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKP 225
                    A+     +P   + C +        D     IN  +  +        P+ P
Sbjct: 218 ETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGP 276

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
           ++ +E Y GW   +      R  +++A A+        +  N YM+FGGTNFG TAG   
Sbjct: 277 LVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANY 335

Query: 284 --------PLVATSYDYDAPIDEYG 300
                       TSYDYDA +DE G
Sbjct: 336 NLDGGIGYAADITSYDYDAVMDEAG 360


>gi|24582088|ref|NP_608978.2| beta galactosidase, isoform A [Drosophila melanogaster]
 gi|21430516|gb|AAM50936.1| LP09580p [Drosophila melanogaster]
 gi|22945722|gb|AAF52321.2| beta galactosidase, isoform A [Drosophila melanogaster]
          Length = 672

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/325 (32%), Positives = 158/325 (48%), Gaps = 39/325 (12%)

Query: 6   TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           T DH A   ++DG+     SGS HY R+ PE W   +R  +  GL  ++TYV W+ H P 
Sbjct: 47  TIDHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 106

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
            G+Y +EG  D+V+F++  QE   ++ LR GPY CAE + GG P WL    P I+ RT +
Sbjct: 107 DGEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFTKYPSIKMRTND 166

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             +  E+ ++ A+++   + ++LF   GG II+ QVENEYG+    +      Y+ W  D
Sbjct: 167 PNYISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 219

Query: 183 --------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKP 225
                    A+     +P   + C +        D     IN  +  +        P+ P
Sbjct: 220 ETEKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGP 278

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-- 283
           ++ +E Y GW   +      R  +++A A+        +  N YM+FGGTNFG TAG   
Sbjct: 279 LVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANY 337

Query: 284 --------PLVATSYDYDAPIDEYG 300
                       TSYDYDA +DE G
Sbjct: 338 NLDGGIGYAADITSYDYDAVMDEAG 362


>gi|257090118|ref|ZP_05584479.1| beta-galactosidase [Enterococcus faecalis CH188]
 gi|256998930|gb|EEU85450.1| beta-galactosidase [Enterococcus faecalis CH188]
          Length = 594

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/325 (32%), Positives = 158/325 (48%), Gaps = 27/325 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV W+ HEP +G ++FEG
Sbjct: 10  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWDLHEPQKGTFHFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 70  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 129 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 186

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 187 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 246

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVATSY 290
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G         P + TSY
Sbjct: 247 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQI-TSY 303

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHK 315
           DYDAP+DE G   +  +   + LH+
Sbjct: 304 DYDAPLDEQGNPTEKYFALQKMLHE 328


>gi|313149603|ref|ZP_07811796.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
 gi|313138370|gb|EFR55730.1| glycoside hydrolase family 35 [Bacteroides fragilis 3_1_12]
          Length = 628

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 162/346 (46%), Gaps = 46/346 (13%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  NL  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGNLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
               TSYDYDAPI E G++  PK+  +R +   I+   +Y +   P
Sbjct: 326 QPDLTSYDYDAPISEAGWV-TPKYDSIRNV---IRKYVKYTVPEAP 367



 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWVTPKYDSIRNVIRKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 VPEAPAPNPVIEIPSI-KLTKVADVLAFAEKQKPVSADT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGL 575
            TL IL   +G  NYG+       G+ S + I  K       +GEW +YQ+ +  E   L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVKIAGKE-----ITGEWDMYQLPM-SEMPDL 519

Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
            K+  A++          +    + Y+ TF   +  G   +++ + GKG  +VNG +IGR
Sbjct: 520 AKLK-ADAHANVPAEAAKLKGCPVLYEGTFTL-DNVGDTFIDMENWGKGIIFVNGVNIGR 577

Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
           YW                               P QTLY IP  W+  G N +VI E+L 
Sbjct: 578 YWKV----------------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLN 608

Query: 696 GDPSKISLLTKT 707
             P       KT
Sbjct: 609 EVPQAEVKTVKT 620


>gi|257876100|ref|ZP_05655753.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
 gi|257810266|gb|EEV39086.1| glycosyl hydrolase [Enterococcus casseliflavus EC20]
          Length = 591

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 169/352 (48%), Gaps = 43/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FVK  Q  GL + LR   Y CAEW +GG P WL   P ++ R+T+  F  +++ 
Sbjct: 70  MKDICAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  + GGP+I+ QVENEYG    +YG+  + Y++   +        V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEEYGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
           P  +   + A + +++       D F T N  S+                   PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRAGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
              +SYDYDA + E G   +P      ++ KAIK     +  ++P  ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPT-DKYYQVQKAIKEACPEVWQANPRTKQLAA 345


>gi|1911627|gb|AAB50770.1| beta-galactosidase [dogs, spleen, Peptide Partial, 667 aa]
          Length = 667

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 114/349 (32%), Positives = 161/349 (46%), Gaps = 34/349 (9%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            + Y H   + DG+     SGSIHY       W + + K K  GL  I+TYV WN+HEP 
Sbjct: 33  TIDYSHNRFLKDGQPFRYISGSIHYSHVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 92

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F G  D+  F+K   E GL + LR GPY CAEW+ GG P WL     I  R+++ 
Sbjct: 93  PGQYQFSGEQDVEYFIKLAHELGLLVILRPGPYICAEWDMGGLPAWLLLKESIILRSSDP 152

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            +   + ++L  ++  MK   L    GGPII  QVENEYG    +Y      Y+++    
Sbjct: 153 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITMQVENEYG----SYFTCDYDYLRFLQKL 206

Query: 184 -AVNLNTSVPWVMCQQEDAPDPIINTC---NGFYCD-GFTP-------------NSPSKP 225
              +L      V+    D  + +   C    G Y    F P             + P  P
Sbjct: 207 FHHHLGND---VLLFTTDGANELFLQCGALQGLYATVDFGPGANITAAFQIQRKSEPKGP 263

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
           ++ +E Y+GW   +G        E +A ++      G    N YM+ GGTNF    G  +
Sbjct: 264 LVNSEFYTGWLDHWGQPHSTVRTEVVASSLHDILAHGANV-NLYMFIGGTNFAYWNGANM 322

Query: 286 ----VATSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIKLCEEYLISSDP 329
                 TSYDYDAP+ E   + + K+  LRE + K  K+ E ++  S P
Sbjct: 323 PYQAQPTSYDYDAPLSEAADLTE-KYFALREVIRKFEKVPEGFIPPSTP 370


>gi|423248537|ref|ZP_17229553.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
           CL03T00C08]
 gi|423253485|ref|ZP_17234416.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
           CL03T12C07]
 gi|392657385|gb|EIY51022.1| hypothetical protein HMPREF1067_01060 [Bacteroides fragilis
           CL03T12C07]
 gi|392659750|gb|EIY53368.1| hypothetical protein HMPREF1066_00563 [Bacteroides fragilis
           CL03T00C08]
          Length = 773

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/332 (31%), Positives = 159/332 (47%), Gaps = 29/332 (8%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           R  +++G   V+++  +HY R     W   I   K  G+  I  Y+FWNYHE   G++ F
Sbjct: 31  RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G  ++ +F K  Q+ G+++ LR GPYACAEW  GG P WL     ++ R+ N  F E  
Sbjct: 91  SGEKNVAKFCKLAQKHGMYIILRPGPYACAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG-------ELYVKWAAD 182
           + F+ ++   +    L  + GG II+ QVENE+G     YGV         ++  +   D
Sbjct: 151 EIFMKELGKQLAPLQL--ANGGNIIMVQVENEFG----GYGVDKPYMTAIRDIVCRAGFD 204

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
            +V       W    + +A D ++ T N   G   D      +   P  P+M +E +SGW
Sbjct: 205 KSVLFQCD--WDSTFELNALDDLLWTLNFGTGANIDKEFKKLSTVRPDTPLMCSEFWSGW 262

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
           F  +G     RP E +   +    +   +F + YM  GGT FG   G        + +SY
Sbjct: 263 FDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSSY 321

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           DYDAPI E G+   PK+  L+EL    +  EE
Sbjct: 322 DYDAPISEAGWT-TPKYYLLQELLGKYRSPEE 352



 Score = 41.2 bits (95), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 56/254 (22%), Positives = 99/254 (38%), Gaps = 50/254 (19%)

Query: 440 PDL--AEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYG 497
           PD   +EQ+   +D +           +P       L I      A ++ + KL+ +   
Sbjct: 382 PDFVQSEQVKPMEDFNQGWGSILYRTTLPATEANTLLRITEAHDWAQIYADGKLLGYLDR 441

Query: 498 NHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDL 557
             D    ++    +L EG   LDI    +G  N+G+           V LI  K  K+ +
Sbjct: 442 RKDDNQVILP---QLPEGTQ-LDIWVEAMGRVNFGSTVHDRKGITEKVELI--KPDKQAV 495

Query: 558 SSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLAL 616
           +   W +Y + V+ ++    K S +NS            +   +YK TF   +  G   +
Sbjct: 496 TLKNWKVYSIPVDYKFAARKKYS-SNSR----------PEGPAYYKATFNLTK-TGDTFI 543

Query: 617 NLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHI 676
           ++++ GKG  WVNG ++GR+W                               P QTL+ +
Sbjct: 544 DMSTWGKGMVWVNGHALGRFWEI----------------------------GPQQTLF-L 574

Query: 677 PRTWVHPGENLLVI 690
           P  W+  G+N +++
Sbjct: 575 PGCWLKKGKNEIIV 588


>gi|414879450|tpg|DAA56581.1| TPA: hypothetical protein ZEAMMB73_811947 [Zea mays]
          Length = 154

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 63/102 (61%), Positives = 85/102 (83%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
             VTYD RAL++DG RR+L SG +HYPRSTPE+WP+LI K+K+GGL+VI+TYVFWN HEP
Sbjct: 36  GEVTYDGRALILDGARRMLFSGDMHYPRSTPEMWPDLIAKAKKGGLDVIQTYVFWNAHEP 95

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYG 104
           ++GQ+ FEGR+DLV+F++ +   GL++ LRIGP+  +EW YG
Sbjct: 96  VQGQFNFEGRYDLVKFIREIHAQGLYVSLRIGPFVESEWKYG 137


>gi|332672111|ref|YP_004455119.1| beta-galactosidase [Cellulomonas fimi ATCC 484]
 gi|332341149|gb|AEE47732.1| Beta-galactosidase [Cellulomonas fimi ATCC 484]
          Length = 583

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/307 (32%), Positives = 149/307 (48%), Gaps = 33/307 (10%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DG    + SG++HY R  P+ W + + +++E GL  IETY+ WN H P RG++  +G  D
Sbjct: 14  DGTPVRILSGALHYFRHHPDQWRDRLTRARELGLNTIETYIPWNAHSPARGEFRTDGILD 73

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L RF+  V   G++  +R GPY CAEW  GG P WL F  G   R     +   ++ +  
Sbjct: 74  LGRFLDEVAAQGMWAIVRPGPYICAEWTGGGLPGWL-FTAGAAVRRHEPTYLAAIQDYYE 132

Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE---LYVKWAADTAV-----N 186
            +  ++    +   +GGP++L QVENEYG    AYG   +     VK   ++ +      
Sbjct: 133 AVAGIVAPRQV--DRGGPVVLVQVENEYG----AYGDDKDYLRALVKLLRESGITTPLTT 186

Query: 187 LNTSVPWVMCQQEDAPDPIINTCNGFYCDG------FTPNSPSKPIMWTENYSGWFLSFG 240
           ++   PW++   E+   P ++    F             + P+ P+M  E + GWF S+G
Sbjct: 187 IDQPEPWML---ENGSLPELHKTGSFGSRAAERLATLREHQPTGPLMCAEFWDGWFDSWG 243

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDYD 293
                      A  +      G +  N YM  GGTNFG T G        P+V TSYDYD
Sbjct: 244 LHHHTTDAAASAHELDTLLAAGASV-NLYMVCGGTNFGFTNGANDKGTYVPIV-TSYDYD 301

Query: 294 APIDEYG 300
           AP+DE G
Sbjct: 302 APLDEAG 308


>gi|350588684|ref|XP_003130139.3| PREDICTED: galactosidase, beta 1-like 3 [Sus scrofa]
          Length = 656

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 111/318 (34%), Positives = 152/318 (47%), Gaps = 17/318 (5%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G   ++  GSIHY R   E W + + K K  G   + TYV WN HEP RG++ F G
Sbjct: 82  FTLEGHEFLILGGSIHYFRVPRESWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSG 141

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  F+    E GL++ LR GPY C+E + GG P  L   P  Q RTTN+ F E +  
Sbjct: 142 NLDMEAFILLAAEVGLWVILRPGPYICSEIDLGGLPSRLLQDPTSQLRTTNHSFIEAVDE 201

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L  +I   +   L   +GGPII  QVENEYG+          L+        V L  + 
Sbjct: 202 YLDHLI--ARVVPLQYRKGGPIIAVQVENEYGSFHKDEAYMPYLHKALLKRGIVELLLTS 259

Query: 192 PWVMCQQEDAPDPIINTCN------GFYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF 245
                  +     ++ T N      G + D +   S +KPI+  E + GWF ++G     
Sbjct: 260 DNTNEVLKGHIKGVLATVNMKSFKEGEFKDLYQVQS-NKPILIMEFWVGWFDTWGNKHAV 318

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEY 299
           R   D+   +  F     +F N YM+ GGTNFG   G         V TSYDYDA + E 
Sbjct: 319 RDAIDVENTIFDFIRLEISF-NVYMFHGGTNFGFMNGATYFEQHRGVVTSYDYDAVLTEA 377

Query: 300 GFIRQPKWGHLRELHKAI 317
           G    PK+  LREL K+I
Sbjct: 378 G-DYTPKFFKLRELFKSI 394


>gi|422698394|ref|ZP_16756303.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
 gi|315173078|gb|EFU17095.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1346]
          Length = 604

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWHHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N               F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFDMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVAT-------SYD 291
           +   +  R  ++LA +V      G    N YM+ GGTNFG   G     T       SYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGTNFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|422735885|ref|ZP_16792151.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
 gi|315167420|gb|EFU11437.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
          Length = 604

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/324 (32%), Positives = 155/324 (47%), Gaps = 25/324 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +G ++FEG
Sbjct: 20  FLLNGQPFKILSGAIHYFRVDPSDWYHSLYNLKALGFNTVETYVPWNLHEPQKGTFHFEG 79

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+K  QE GL+  +R  PY CAEW +GGFP WL   PG + R+ N  + + +  
Sbjct: 80  ILDLERFLKLAQELGLYAIVRPSPYICAEWEFGGFPAWLLNEPG-RMRSNNPTYLKHVAE 138

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNT 189
           +   +++ +    L  + GG I++ Q+ENEYG+   E AY       +     TA    +
Sbjct: 139 YYDVLMEKIVPHQL--ANGGNILMIQIENEYGSFGEEKAYLRAIRDLMIARGVTAPFFTS 196

Query: 190 SVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLS 238
             PW    +  +   D I+ T N         G     F  +    P+M  E + GWF  
Sbjct: 197 DGPWRATLRAGSMIEDDILVTGNFGSKAKENFGMMQAFFEEHGKKWPLMCMEFWDGWFNR 256

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-------TSYD 291
           +   +  R  ++LA +V      G    N YM+ GG NFG   G            TSYD
Sbjct: 257 WKEPIIKRDPQELAESVREALALGSI--NLYMFHGGINFGFMNGCSARGTIDLPQITSYD 314

Query: 292 YDAPIDEYGFIRQPKWGHLRELHK 315
           YDAP+DE G   +  +   + LH+
Sbjct: 315 YDAPLDEQGNPTEKYFALQKMLHE 338


>gi|297194972|ref|ZP_06912370.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
 gi|297152570|gb|EFH31854.1| beta-galactosidase [Streptomyces pristinaespiralis ATCC 25486]
          Length = 599

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/335 (32%), Positives = 160/335 (47%), Gaps = 36/335 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A+ T      ++DG+   L SG++HY R     W   +   +  GL  +ETYV WN HEP
Sbjct: 9   ADFTVGDTDFLLDGRPVRLLSGALHYFRVHEGQWGHRLAMLRAMGLNCVETYVPWNLHEP 68

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+Y  +G   L RF+  V  AG++  +R GPY CAEW  GG P WL    G + RT +
Sbjct: 69  EPGRYADDG--ALGRFLDAVHAAGMWAIVRPGPYICAEWENGGLPFWLTGRVGRRVRTED 126

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEW--AY 169
             +   ++R+  +++  + +  +  ++GGP+++ QVENEYG+           VE   + 
Sbjct: 127 PEYLGHVERWFTRLLPQVVEREI--TRGGPVVMVQVENEYGSYGSDGGYLRQLVELLRSC 184

Query: 170 GVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWT 229
           GVG  L+     +  +    SVP V+            +  G        + P+ P+M  
Sbjct: 185 GVGVPLFTSDGPEDHMLSGGSVPGVLATVN------FGSGAGEAFAALRRHRPTGPLMCM 238

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
           E + GWF  +G     R  ED A A+    E G +  N YM  GGT+FG  AG    G L
Sbjct: 239 EFWCGWFEHWGAEPARRDAEDAARALREILEAGASV-NVYMAHGGTSFGGWAGANRSGEL 297

Query: 286 -------VATSYDYDAPIDEYGFIRQPKWGHLREL 313
                    TSYDYDAP+DE G   +  W   RE+
Sbjct: 298 HDGVLEPTVTSYDYDAPVDEAGRPTEKFW-RFREV 331


>gi|256423546|ref|YP_003124199.1| beta-galactosidase [Chitinophaga pinensis DSM 2588]
 gi|256038454|gb|ACU61998.1| Beta-galactosidase [Chitinophaga pinensis DSM 2588]
          Length = 610

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 152/316 (48%), Gaps = 31/316 (9%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           T    A ++DGK   + SG IHYPR   E W + ++ +K  GL  I TYVFWN HEP +G
Sbjct: 27  TLGDTAFLLDGKPLQMISGEIHYPRVPRECWRDRMKMAKAMGLNTIGTYVFWNVHEPEKG 86

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           QY F G  D+  FVK  +E  L++ LR  PY CAEW +GG+P WL  I G++ R+    +
Sbjct: 87  QYDFSGNNDIAAFVKMAKEEDLWVVLRPSPYVCAEWEFGGYPYWLQEIKGLKVRSKEPQY 146

Query: 126 KEEMKRFLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNV---EWAYGVGGELYVKWAA 181
            E  + +   I+ + KQ + L  + GG I++ Q+ENEYG+    +    +  +++V+   
Sbjct: 147 LEAYRNY---IMAVGKQLSPLLVTHGGNILMVQIENEYGSYSDDKDYLDINRKMFVEAGF 203

Query: 182 D--------TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYS 233
           D         A   N  +P ++       DP+         +  +   P     W   + 
Sbjct: 204 DGLLYTCDPKAAIKNGHLPGLLPAINGVDDPL--QVKQLINENHSGKGPYYIAEWYPAWF 261

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---------P 284
            W+ +  + VP+R       +V       G   N YM+ GGT  G   G          P
Sbjct: 262 DWWGTKHHTVPYRQYLGKLDSVL----AAGISINMYMFHGGTTRGFMNGANANDADPYEP 317

Query: 285 LVATSYDYDAPIDEYG 300
            + +SYDYDAP+DE G
Sbjct: 318 QI-SSYDYDAPLDEAG 332



 Score = 48.5 bits (114), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 51/224 (22%), Positives = 85/224 (37%), Gaps = 51/224 (22%)

Query: 469 GKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGL 528
           G++  L ++ L    +V VN K      G  D  +   +  ++L  G   LD+L   +G 
Sbjct: 413 GRKGLLQLKELRDYCVVMVNGKRA----GVLDRRSKRDSIALDLPAGKVKLDLLVENLGR 468

Query: 529 QNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQ 588
            N+G +      G+   +L D +  K            G +   +  DK+    +   K 
Sbjct: 469 INFGPYLLSNRKGITEKVLFDRQELK------------GWQQYGLPFDKLPAVAAKGIKA 516

Query: 589 GSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
           G+ +P       Y+      +  G   L++++ GKG  W+NG  +GRYW           
Sbjct: 517 GANVPT------YRQGTFTLDKTGDTWLDMSNWGKGAVWINGHHLGRYWQV--------- 561

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                               P QT+Y +P  W+  G N +VI E
Sbjct: 562 -------------------GPQQTIY-VPAEWLKKGMNDIVIME 585


>gi|293370654|ref|ZP_06617206.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
 gi|292634388|gb|EFF52925.1| glycosyl hydrolase family 35 [Bacteroides ovatus SD CMC 3f]
          Length = 778

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 95  GQNDIATFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       S
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFS 207

Query: 191 -VPWVMCQ-----QEDAPDPIINTCN---GFYCDG----FTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKRLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+    K+  LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348



 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                    P QTL+ +P  W+  GEN +++ +  G   + I  L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608


>gi|290956543|ref|YP_003487725.1| glycosyl hydrolase family 42 [Streptomyces scabiei 87.22]
 gi|260646069|emb|CBG69162.1| putative glycosyl hydrolase (family 42) [Streptomyces scabiei
           87.22]
          Length = 591

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 108/330 (32%), Positives = 156/330 (47%), Gaps = 31/330 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP-I 63
           +T      +++G+   + SG++HY R  P++W + +RK++  GL  +ETYV WN H+P  
Sbjct: 6   LTTSSDGFLLNGEPFRIVSGAMHYFRIHPDLWADRLRKARLMGLNTVETYVPWNLHQPDP 65

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
                 +G  DL R++   +  GL + LR GPY CAEW+ GG P WL   PGI+ R+++ 
Sbjct: 66  DSPLVLDGLLDLPRYLSLARAEGLHVLLRPGPYICAEWDGGGLPSWLTSDPGIRLRSSDP 125

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT 183
            F + +  +L   I L       A+ GGP+I  QVENEYG    AYG     Y+K     
Sbjct: 126 RFTDALDGYLD--ILLPPLLPYMAANGGPVIAVQVENEYG----AYG-DDTAYLKHVHQA 178

Query: 184 AVNLNTSVPWVMCQQEDA---------PDPIINTCNGFYCD----GFTPNSPSKPIMWTE 230
                       C Q  +         P  +     G   +        + P  P+M +E
Sbjct: 179 LRARGVEELLFTCDQAGSGHHLAAGSLPGVLSTATFGGKIEESLAALRAHMPEGPLMCSE 238

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG-------G 283
            + GWF  +G     R  E  A  + +    G +  N YM+ GGTNFG T G        
Sbjct: 239 FWIGWFDHWGEEHHVRDAESAAADLDKLLAAGASV-NIYMFHGGTNFGFTNGANHDQCYA 297

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
           P+V TSYDYDA + E G    PK+   RE+
Sbjct: 298 PIV-TSYDYDAALTESG-DPGPKYHAFREV 325


>gi|336412039|ref|ZP_08592497.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
 gi|423261296|ref|ZP_17242197.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
           CL07T00C01]
 gi|423267821|ref|ZP_17246801.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
           CL07T12C05]
 gi|423272270|ref|ZP_17251238.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
           CL05T00C42]
 gi|423276726|ref|ZP_17255658.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
           CL05T12C13]
 gi|423283105|ref|ZP_17261990.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
           615]
 gi|335939211|gb|EGN01088.1| hypothetical protein HMPREF1018_04515 [Bacteroides sp. 2_1_56FAA]
 gi|387774329|gb|EIK36442.1| hypothetical protein HMPREF1055_04474 [Bacteroides fragilis
           CL07T00C01]
 gi|392695462|gb|EIY88674.1| hypothetical protein HMPREF1079_04320 [Bacteroides fragilis
           CL05T00C42]
 gi|392695591|gb|EIY88799.1| hypothetical protein HMPREF1056_04488 [Bacteroides fragilis
           CL07T12C05]
 gi|392696055|gb|EIY89256.1| hypothetical protein HMPREF1080_04311 [Bacteroides fragilis
           CL05T12C13]
 gi|404581379|gb|EKA86078.1| hypothetical protein HMPREF1204_01528 [Bacteroides fragilis HMW
           615]
          Length = 628

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
               TSYDYDAPI E G++  PK+  +R +   IK   +Y I   P    +  ++ +   
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380

Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
           +K ++  A        SSD  +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404



 Score = 56.2 bits (134), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 88/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
            TL IL   +G  NYG+       G+ S + I      +++  G  +YQ+ ++ E   L 
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520

Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
           K+  A++          +    + Y+ TF   +  G   +++ S GKG  +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W                               P QTLY IP  W+  GEN +VI E+L  
Sbjct: 579 WKV----------------------------GPQQTLY-IPGVWLKKGENKIVIFEQLNE 609

Query: 697 DPSKISLLTKT 707
            P       KT
Sbjct: 610 TPQTEVKTVKT 620


>gi|265767790|ref|ZP_06095322.1| beta-galactosidase [Bacteroides sp. 2_1_16]
 gi|263252462|gb|EEZ23990.1| beta-galactosidase [Bacteroides sp. 2_1_16]
          Length = 628

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
               TSYDYDAPI E G++  PK+  +R +   IK   +Y I   P    +  ++ +   
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380

Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
           +K ++  A        SSD  +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404



 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSVEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
            TL IL   +G  NYG+       G+ S + I      +++  G  +YQ+ ++ E   L 
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520

Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
           K+  A++          +    + Y+ TF   +  G   +++ S GKG  +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W                               P QTLY +P  W+  GEN +VI E+L  
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609

Query: 697 DPSKISLLTKT 707
            P       KT
Sbjct: 610 TPQTEVKTVKT 620


>gi|321461557|gb|EFX72588.1| hypothetical protein DAPPUDRAFT_58801 [Daphnia pulex]
          Length = 648

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 159/328 (48%), Gaps = 47/328 (14%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF- 69
             +++GK   + SG++HY R  P  W + +RK +  G+ V+ETYV WN HEP +  + F 
Sbjct: 35  GFLLNGKPFRIFSGAVHYFRVHPAYWRDRLRKLRAAGITVVETYVAWNLHEPQKNVFDFG 94

Query: 70  EGR------FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
           +G        DL  F++T  E  LF+ LR GPY C+EW++GG P WL   P +  RT+  
Sbjct: 95  KGNNDMSIFLDLKLFIQTAYEEDLFVILRPGPYICSEWDFGGLPSWLLRDPTMHVRTSYG 154

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQG-GPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
           P+ + + ++L K+ +L+      +S G GPII  QVENEYG+  +      + Y++  +D
Sbjct: 155 PYVDRVDKYLEKLSNLVNHMQFTSSYGKGPIIAFQVENEYGSFGYQDHPRDKAYLQHLSD 214

Query: 183 TAVNLNTSVPWVMCQQEDAP---------DPIINTCNGFYCDGFTPN-------SPSKPI 226
              +L       +    D+P           ++ T N  +  G T          P+ P+
Sbjct: 215 KMKSLGLK---ELFFTSDSPAGYLDWGSIPGVLQTAN--FQSGATQEFKMLQELQPNMPL 269

Query: 227 MWTENYSGWF----LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG 282
           M TE +SGWF      F   +  +  E     +  F  +     ++YM+ GGTNFG   G
Sbjct: 270 MVTEFWSGWFDHWTQDFRKGLKLKDFETSLMEILSFDAS----VSFYMFHGGTNFGFMNG 325

Query: 283 GPLVA----------TSYDYDAPIDEYG 300
             +            TSYDYDAP+ E G
Sbjct: 326 ANVRKEYPGGYLPDITSYDYDAPLSEAG 353


>gi|375360076|ref|YP_005112848.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
 gi|383119863|ref|ZP_09940600.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
 gi|251944025|gb|EES84544.1| hypothetical protein BSHG_4164 [Bacteroides sp. 3_2_5]
 gi|301164757|emb|CBW24316.1| putative exported beta-galactosidase [Bacteroides fragilis 638R]
          Length = 628

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
               TSYDYDAPI E G++  PK+  +R +   IK   +Y I   P    +  ++ +   
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380

Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
           +K ++  A        SSD  +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
            TL IL   +G  NYG+       G+ S + I      +++  G  +YQ+ ++ E   L 
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520

Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
           K+  A++          +    + Y+ TF   +  G   +++ S GKG  +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W                               P QTLY +P  W+  GEN +VI E+L  
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609

Query: 697 DPSKISLLTKT 707
            P       KT
Sbjct: 610 TPQTEVKTVKT 620


>gi|324507659|gb|ADY43243.1| Beta-galactosidase [Ascaris suum]
          Length = 655

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 104/340 (30%), Positives = 164/340 (48%), Gaps = 32/340 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S ++   +   ++DG+     SGSIHY R  P+ W + + + +  GL  I+ Y+ WN+HE
Sbjct: 31  SFSIDPQNNVFLLDGRSFRYISGSIHYFRVHPDQWNDRLSRMRAAGLNAIQFYIPWNFHE 90

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              G++ F+G  ++  F++   +  L+  +RIGPY CAEW  GG P WL     I+ RT+
Sbjct: 91  IYEGKHRFDGSRNITHFLQLAMQNELYALVRIGPYICAEWENGGAPWWLLKYKDIKMRTS 150

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F + +KR+   ++ ++K        GGPI++ Q+ENEYG+ +        ++++  A
Sbjct: 151 DKRFLDAVKRWFDVLLPILKPN--LRKNGGPILMLQLENEYGSFDGGCDRNYTIFLRDLA 208

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD-GFTPNS---------------P 222
                 +     V+    D  D     C    G Y    F P S               P
Sbjct: 209 RRHFGDD-----VVLYTTDGGDDFYLKCGTIPGVYATVDFGPASSEAIDHCFASQRQYEP 263

Query: 223 SKPIMWTENYSGWFLSFGYAVP-FRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA 281
             P++ +E Y GWFL++       +PV ++       FE G  F NYYM+ GGTNF    
Sbjct: 264 HGPLVNSEFYPGWFLTWSQKERGDQPVHNVINGSKYMFEKGANF-NYYMFHGGTNFAFWN 322

Query: 282 GGPL---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
           GG     + TSYDY AP+ E   I   K+  +R+  K I+
Sbjct: 323 GGATKTAITTSYDYFAPLSEAADITD-KYLAIRDWIKTIE 361


>gi|60683238|ref|YP_213382.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
 gi|60494672|emb|CAH09473.1| putative exported beta-galactosidase [Bacteroides fragilis NCTC
           9343]
          Length = 628

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
               TSYDYDAPI E G++  PK+  +R +   IK   +Y I   P    +  ++ +   
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380

Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
           +K ++  A        SSD  +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
            TL IL   +G  NYG+       G+ S + I      +++  G  +YQ+ ++ E   L 
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520

Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
           K+  A++          +    + Y+ TF   +  G   +++ S GKG  +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W                               P QTLY +P  W+  GEN +VI E+L  
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609

Query: 697 DPSKISLLTKT 707
            P       KT
Sbjct: 610 TPQTEVKTVKT 620


>gi|333384209|ref|ZP_08475850.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826788|gb|EGJ99602.1| hypothetical protein HMPREF9455_04016 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 632

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 110/351 (31%), Positives = 161/351 (45%), Gaps = 48/351 (13%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   + SG +HYPR   + W   ++  K  GL  + TYVFWN HEP  G++ F  
Sbjct: 37  FVYDGKPVRIISGEMHYPRIPHQYWRHRMQMLKAMGLNAVATYVFWNAHEPEPGKWDFTE 96

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             +L  ++K   E GL + LR GPY CAEW +GG+P WL  +  ++ R  N    E+  +
Sbjct: 97  DKNLAEYIKIAGEEGLMVILRPGPYVCAEWEFGGYPWWLQNVEEMELRRDN----EQFLK 152

Query: 132 FLAKIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLN 188
           +    I+ + QE  NL  ++GGPII+ Q ENE+G+ V     +  E + ++ A     L 
Sbjct: 153 YTQLYINRLYQEVGNLQITKGGPIIMVQAENEFGSYVSQRKDIPLEEHRRYNAKIVQQLK 212

Query: 189 T-------------------SVPWVM--CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIM 227
           T                   +VP  +     E   D +    N +       N    P M
Sbjct: 213 TAGFDIPSFTSDGSWLFEGGAVPGALPTANGESNIDNLKKVVNRY-------NGGQGPYM 265

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
             E Y GW   +    P      +A    ++ +   +  NYYM  GGTNFG T+G     
Sbjct: 266 VAEFYPGWLAHWVEPHPQVSATSVARQTEKYLQNDVSI-NYYMVHGGTNFGFTSGANYDK 324

Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT 330
                   TSYDYDAP+ E G++  PK+  LR +   I+   +Y +   P+
Sbjct: 325 KHDIQPDLTSYDYDAPVSEAGWV-TPKFDSLRNV---IQKYVDYTLPEAPS 371



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 105/243 (43%), Gaps = 54/243 (22%)

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           L I+ L   A V+ N +    G  N  F  + ++  +  N   +TL+IL   +G  NYG+
Sbjct: 429 LEIKGLRDYATVYTNDEKA--GELNRYFNKYTMDIDVPFN---STLEILVENMGRINYGS 483

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEG--EYIGLDKISL--ANSSFWKQG 589
                  G+ S + I+      ++  G  +  + ++   ++  +D+ S+   N S  K  
Sbjct: 484 EIIHNTKGIISPVRIN----DMEIEGGWQMISIPMDKAPDFSKMDQASVYDNNESAIKSL 539

Query: 590 STLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
           +  PV      YK TF   E  G   +N+   GKG  ++NG++IGRYW            
Sbjct: 540 AGKPV-----LYKGTFNLTE-TGDTFINMEDWGKGIIFINGKNIGRYW------------ 581

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP------SKISL 703
              Y G             P QTLY IP  W+  GEN ++I E+L   P      +K+ +
Sbjct: 582 ---YVG-------------PQQTLY-IPGVWLKKGENKIIIFEQLNDKPHTEVRTTKVPV 624

Query: 704 LTK 706
           L K
Sbjct: 625 LAK 627


>gi|299147339|ref|ZP_07040404.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
 gi|298514617|gb|EFI38501.1| beta-galactosidase (Lactase) [Bacteroides sp. 3_1_23]
          Length = 778

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       S
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFS 207

Query: 191 -VPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     RP +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRPAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+    K+  LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348



 Score = 41.6 bits (96), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 53/128 (41%), Gaps = 32/128 (25%)

Query: 579 SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
           S  N   +     LP   +  +YK+TF   +  G   L++++ GKG  WVNG ++GR+W 
Sbjct: 513 SFINDKKYSDTKILPTMPA--YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWE 569

Query: 639 AYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDP 698
                                         P QTL+ +P  W+  GEN +++ +  G   
Sbjct: 570 I----------------------------GPQQTLF-MPGCWLKEGENEILVLDLKGPTR 600

Query: 699 SKISLLTK 706
           + I  L K
Sbjct: 601 ASIKGLKK 608


>gi|427399434|ref|ZP_18890672.1| hypothetical protein HMPREF9710_00268 [Massilia timonae CCUG 45783]
 gi|425721626|gb|EKU84536.1| hypothetical protein HMPREF9710_00268 [Massilia timonae CCUG 45783]
          Length = 786

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 105/308 (34%), Positives = 154/308 (50%), Gaps = 28/308 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   ++ G +H+ R   E W   ++  K  GL  +  Y+FWNYHE   G++ + G
Sbjct: 42  FLLDGKPLQIRCGEMHFSRVPREYWTHRLKTIKAMGLNSVCAYLFWNYHEWREGRFDWAG 101

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF-RTTNNPFKEEMK 130
           + D   F +  Q+ GL++ LR GPYACAEW  GG P WL   PG  F R+T+  F    +
Sbjct: 102 QRDAAEFCRLAQQEGLWVILRPGPYACAEWEMGGLPWWLLKQPGDAFLRSTSEAFLAPSR 161

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
           R+L ++  ++  + +  ++GGPI++ QVENEYG     YG   + Y++      ++    
Sbjct: 162 RWLREVGRVLGPQQV--TRGGPILMVQVENEYG----FYGEDLD-YMRALRQAVLDAGFD 214

Query: 191 VPWVMCQQEDAPD----PIINTCNGFYCDGFTPNSPSK--------PIMWTENYSGWFLS 238
           VP   C   +A      P + +   F   G  P +  K        P+M  E YSGWF +
Sbjct: 215 VPLFQCNPTNAVAKTHIPELYSVANF---GSNPEAGFKALAEVQQGPLMCGEYYSGWFDT 271

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--GPLV--ATSYDYDA 294
           +G       VE+    +       G+F + YM  GGT FG   G   P     TSYDYDA
Sbjct: 272 WGAPHRRGGVENAVADIRTMLAANGSF-SLYMAHGGTTFGLWGGCDRPFRPDTTSYDYDA 330

Query: 295 PIDEYGFI 302
           PI E G+I
Sbjct: 331 PISEAGWI 338


>gi|312866933|ref|ZP_07727144.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
 gi|311097415|gb|EFQ55648.1| putative beta-galactosidase [Streptococcus parasanguinis F0405]
          Length = 595

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 106/318 (33%), Positives = 152/318 (47%), Gaps = 41/318 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A  + G+   + SG+IHY R  P  W   +   K  G   +ETY+ WN HEP +GQ+ F 
Sbjct: 9   AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYIPWNAHEPRKGQFDFS 68

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           GR DL RF++T Q  GL++ +R  P+ CAEW +GG P WL     ++ R+++  F E + 
Sbjct: 69  GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPAFIEAVD 127

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
           R+  +++ L+    +   +GGPI++ QVENEYG    +YG   + Y++   D       +
Sbjct: 128 RYYDRLLGLLTPYQV--DRGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVT 180

Query: 191 VPWVMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWT 229
            P       D P            + +  T N         G   + F       P+M  
Sbjct: 181 CPLFTS---DGPWRATLRAGTLIEEDLFVTGNFGSKAAYNFGQMKEFFDEYGKRWPLMCM 237

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
           E + GWF  +   V  R  E+LA AV    E G    N YM+ GGTNFG   G    G L
Sbjct: 238 EFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTL 295

Query: 286 ---VATSYDYDAPIDEYG 300
                TSYDY A ++E G
Sbjct: 296 DLPQVTSYDYGALLNEQG 313


>gi|384512509|ref|YP_005707602.1| beta-galactosidase [Enterococcus faecalis OG1RF]
 gi|430358961|ref|ZP_19425649.1| beta-galactosidase [Enterococcus faecalis OG1X]
 gi|327534398|gb|AEA93232.1| beta-galactosidase [Enterococcus faecalis OG1RF]
 gi|429513519|gb|ELA03099.1| beta-galactosidase [Enterococcus faecalis OG1X]
          Length = 592

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 70  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKKKGVRLRSTDPIFMTKVRN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345


>gi|53715303|ref|YP_101295.1| beta-galactosidase [Bacteroides fragilis YCH46]
 gi|52218168|dbj|BAD50761.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
          Length = 628

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/384 (31%), Positives = 177/384 (46%), Gaps = 47/384 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNT-- 189
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L    
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADVG 212

Query: 190 -SVPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
            +VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
               TSYDYDAPI E G++  PK+  +R +   IK   +Y I   P    +  ++ +   
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380

Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
           +K ++  A        SSD  +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
            TL IL   +G  NYG+       G+ S + I      +++  G  +YQ+ ++ E   L 
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520

Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
           K+  A++          +    + Y+ TF   +  G   +++ S GKG  +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W                               P QTLY +P  W+  GEN +VI E+L  
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609

Query: 697 DPSKISLLTKT 707
            P       KT
Sbjct: 610 TPQTEVKTVKT 620


>gi|414156558|ref|ZP_11412859.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
 gi|410869551|gb|EKS17511.1| hypothetical protein HMPREF9186_01279 [Streptococcus sp. F0442]
          Length = 595

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 109/315 (34%), Positives = 152/315 (48%), Gaps = 41/315 (13%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           + G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +GQ+ F GR 
Sbjct: 12  LKGQPFKILSGAIHYFRIDPTDWYHSLYNLKALGFNTVETYVPWNAHEPKKGQFDFSGRL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q  GL++ +R  P+ CAEW +GG P WL     ++ R+++  F E + R+ 
Sbjct: 72  DLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPAFIEAIDRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            +++ L+    +   +GGPI++ QVENEYG    +YG   + Y++   D       + P 
Sbjct: 131 DRLLGLLTPYQV--DRGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVTCPL 183

Query: 194 VMCQQEDAP-DPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTENY 232
                 D P    + T      D F T N  SK                   P+M  E +
Sbjct: 184 FTS---DGPWRATLRTGTLIEEDLFVTGNFGSKAAYNFGQMKEFFNEYGKKWPLMCMEFW 240

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL--- 285
            GWF  +   V  R  E+LA AV    E G    N YM+ GGTNFG   G    G L   
Sbjct: 241 DGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTLDLP 298

Query: 286 VATSYDYDAPIDEYG 300
             TSYDY A ++E G
Sbjct: 299 QVTSYDYGALLNEQG 313


>gi|392950288|ref|ZP_10315845.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
 gi|392434570|gb|EIW12537.1| Beta-galactosidase 3 [Lactobacillus pentosus KCA1]
          Length = 588

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 98/318 (30%), Positives = 148/318 (46%), Gaps = 37/318 (11%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           +  +++G+   + SG++HY R  P  W + + K K  GL  +ETY+ WN HEP  GQ+ F
Sbjct: 10  KEFLLNGQPFKIYSGAVHYFRIAPSEWRDTLEKLKAAGLNTVETYIPWNVHEPQEGQFVF 69

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
           E R+D+ +FVK  Q  GL++ LR  PY CAEW +GG P WL   P +  R+    F E++
Sbjct: 70  EDRYDIGKFVKLAQSIGLYVILRPSPYICAEWEFGGLPAWLLRYPDMVVRSNTPRFMEKV 129

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNT 189
             +   +  ++    L  + GGP+++ QVENEYG    ++G   + Y++           
Sbjct: 130 ANYYEALFKVLVP--LQITHGGPVLMMQVENEYG----SFG-NDKAYLRHVKSLMETNGV 182

Query: 190 SVPWVMC----QQEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTE 230
            VP        QQ      +I   + F    F   S                  P+M  E
Sbjct: 183 DVPLFTADGSWQQALKAGSLIED-DVFVTANFGSKSRENLAELRQFMLMHHKNWPLMCME 241

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAG 282
            + GWF  +   +  R  +     +A   +   +F N YM+ GGTNFG        +   
Sbjct: 242 FWDGWFNRWQEEIVTRSADSFQTDLAELVKEQASF-NLYMFRGGTNFGFFNGCSSRQNVD 300

Query: 283 GPLVATSYDYDAPIDEYG 300
            P + TSYDYDA + E G
Sbjct: 301 YPQI-TSYDYDAVLHEDG 317


>gi|322703307|gb|EFY94918.1| beta-calactosidase, putative [Metarhizium anisopliae ARSEF 23]
          Length = 645

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/323 (30%), Positives = 150/323 (46%), Gaps = 31/323 (9%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
            N TYD    ++DG    L  G +   R  P  W + ++ +K  GL  I +YVFWN  EP
Sbjct: 32  GNFTYDRHNFLLDGVPIQLIGGQMDPQRIPPAYWTQRLQMAKAMGLNTIFSYVFWNNIEP 91

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G + F+GR D+ RF++  Q+ GL++ LR GPY C E  +GGFP WL  IPG+  R  N
Sbjct: 92  TEGSWDFDGRNDIARFLRLAQQEGLYVVLRPGPYICGEHEWGGFPSWLAQIPGMAVRQNN 151

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY----------G 170
            PF +  + +L ++   +   ++  SQGGP+++ Q+ENEYG+   + AY           
Sbjct: 152 KPFLDASRNYLEQLGKHLAATHI--SQGGPVLMTQLENEYGSFGKDKAYLRAMADMLKAN 209

Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
             G LY       +     S+  ++ + +  P       + +  D     +   P +  E
Sbjct: 210 FDGFLYTNDGGGKSYLDGGSLHGILAETDGDPKTGFAARDQYVTD----PTMLGPQLDGE 265

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFE------TGGTFQNYYMYFGGTNFGRTAGG- 283
            Y  W   +    P++       A  R  +       G    + YM+ GGTN+G   GG 
Sbjct: 266 YYVTWIDDWSSNSPYQYTSGRPDATKRVLDDLDWILAGNNSFSIYMFHGGTNWGFENGGI 325

Query: 284 ------PLVATSYDYDAPIDEYG 300
                   V TSYDY AP+DE G
Sbjct: 326 WVDNRLNAVTTSYDYGAPLDESG 348


>gi|334330512|ref|XP_001374407.2| PREDICTED: beta-galactosidase-1-like protein 2 [Monodelphis
           domestica]
          Length = 673

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 109/320 (34%), Positives = 158/320 (49%), Gaps = 31/320 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G R  +  GSIHY R   E W + + K K  GL  + TY+ WN HEP RG++ F G
Sbjct: 90  FLLEGSRFRIFGGSIHYFRVPREYWKDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSG 149

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+   + GL++ LR GPY C+EW+ GG P WL     ++ RTT   F + +  
Sbjct: 150 NLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYVGFIKAVDL 209

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGN----------VEWAYGVGGELYVKWAA 181
           +  ++I   +   L  +QGGPII  QVENEYG+          ++ A    G + +   +
Sbjct: 210 YFNQLIP--RVVPLQYTQGGPIIAVQVENEYGSYDKDPNYMPYIKMALLKRGIVELLMTS 267

Query: 182 DTAVNLNTS-VPWVMC--QQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFLS 238
           D    L+   V  V+     ++    I N     Y   F  N   KP M TE ++GWF +
Sbjct: 268 DNKDGLSGGYVEGVLATINLKNVDSIIFN-----YLQSFQDN---KPTMVTEFWTGWFDT 319

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA------TSYDY 292
           +G        +D+  +V+   + G +  N YM+ GGTNFG   G           TSYDY
Sbjct: 320 WGGPHHIVDADDVMVSVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFTDYQADVTSYDY 378

Query: 293 DAPIDEYGFIRQPKWGHLRE 312
           DA + E G    PK+  LRE
Sbjct: 379 DAILTEAG-DYTPKFFKLRE 397


>gi|335430223|ref|ZP_08557118.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
 gi|334888639|gb|EGM26936.1| beta-galactosidase Bga35A [Haloplasma contractile SSD-17B]
          Length = 587

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 101/299 (33%), Positives = 139/299 (46%), Gaps = 30/299 (10%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           +G +HY R+  + W + + K K  G   +ETYV WN HE  +G Y F G  D+  F++  
Sbjct: 22  AGGMHYFRTMKDSWKDRLIKLKAMGCNTVETYVPWNMHEAKKGVYAFNGNLDIKAFIELA 81

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
           Q   LF+ +R  PY CAEW +GG P WL   PG++ RT   PF + +K +   +  ++  
Sbjct: 82  QSLELFVIVRPSPYICAEWEFGGLPAWLLKDPGMKVRTVYKPFMKHVKEYFEVLFKILAP 141

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQ----- 197
             L   Q GPIIL Q+ENEYG     YG   E Y+        +  T+VP V        
Sbjct: 142 --LQIDQDGPIILMQIENEYG----YYGNDKE-YLSTLLKIMRDFGTTVPVVTSDGPWGE 194

Query: 198 -------QEDAPDPIINTCNGF--YCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPF-RP 247
                    D   P +N   G   + + F     +KP+M  E + GWF ++G      R 
Sbjct: 195 ALDAGSLLADVSLPTMNFGTGAKEHIENFKEKYVNKPVMCMEFWVGWFDAWGDDRHHTRD 254

Query: 248 VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------ATSYDYDAPIDEYG 300
             D A  +      G    N YM+ GGTNFG   G   +       TSYDYDA + E G
Sbjct: 255 ASDAANELRDILNEGSV--NIYMFHGGTNFGFMNGANDLEELKPDVTSYDYDAILTECG 311


>gi|325567414|ref|ZP_08144081.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
 gi|325158847|gb|EGC70993.1| beta-galactosidase [Enterococcus casseliflavus ATCC 12755]
          Length = 591

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 120/354 (33%), Positives = 171/354 (48%), Gaps = 47/354 (13%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLDGKPIKLISGAIHYFRMTPAQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FVK  Q  GL + LR   Y CAEW +GG P WL   P ++ R+T+  F  +++ 
Sbjct: 70  MKDICAFVKQAQTLGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  + GGP+I+ QVENEYG    +YG+  + Y++   +        V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEEYGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
           P  +   + A + +++       D F T N  S+                   PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQP--KWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
              +SYDYDA + E G   +P  K+ H++   KAIK     +  + P  ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPTDKYYHVQ---KAIKEACPEVWQAKPRTKQLAA 345


>gi|417918764|ref|ZP_12562312.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
 gi|342827747|gb|EGU62128.1| glycosyl hydrolase family 35 [Streptococcus parasanguinis SK236]
          Length = 595

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 107/318 (33%), Positives = 152/318 (47%), Gaps = 41/318 (12%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A  + G+   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP +GQ+ F 
Sbjct: 9   AFYLKGQPFKILSGAIHYFRIDPADWYHSLYNLKALGFNTVETYVPWNAHEPRKGQFDFS 68

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           GR DL RF++T Q  GL++ +R  P+ CAEW +GG P WL     ++ R+++  F E + 
Sbjct: 69  GRLDLERFIQTAQSLGLYMIVRPSPFICAEWEFGGLPAWL-LEEDLRIRSSDPVFIEAVD 127

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
           R+  +++ L+    +   +GGPI++ QVENEYG    +YG   + Y++   D       +
Sbjct: 128 RYYDRLLGLLTPYQV--DRGGPILMMQVENEYG----SYGEDKD-YLRAIRDLMKEKGVT 180

Query: 191 VPWVMCQQEDAP------------DPIINTCN---------GFYCDGFTPNSPSKPIMWT 229
            P       D P            + +  T N         G   + F       P+M  
Sbjct: 181 CPLFTS---DGPWRATLRAGTLIEEDLFVTGNFGSKATYNFGQMKEFFDEYGKRWPLMCM 237

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL 285
           E + GWF  +   V  R  E+LA AV    E G    N YM+ GGTNFG   G    G L
Sbjct: 238 EFWDGWFTRWKEPVIQRDPEELAEAVHEVLELGSI--NLYMFHGGTNFGFMNGCSARGTL 295

Query: 286 ---VATSYDYDAPIDEYG 300
                TSYDY A ++E G
Sbjct: 296 DLPQVTSYDYGALLNEQG 313


>gi|195388836|ref|XP_002053084.1| GJ23531 [Drosophila virilis]
 gi|194151170|gb|EDW66604.1| GJ23531 [Drosophila virilis]
          Length = 640

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 112/331 (33%), Positives = 154/331 (46%), Gaps = 30/331 (9%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           V Y++   + DG      SGS HY R+ P+ W   +R  +  GL  + TYV W+ H P  
Sbjct: 28  VDYENDRFLKDGLPFRFISGSFHYFRAHPDTWSRHLRTMRAAGLNAVTTYVEWSLHNPRD 87

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNN 123
           G Y + G  DL RF++   +  L + LR GPY CAE + GGFP W L+  PGIQ RT + 
Sbjct: 88  GVYVWNGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLNKFPGIQLRTADI 147

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW---- 179
            +  E++ + A++  + +        GGPII+ QVENEYG    +Y      Y  W    
Sbjct: 148 NYLSEVRIWYAQL--MTRIVPYLYGNGGPIIMVQVENEYG----SYFACDVNYRNWLRDE 201

Query: 180 ----AADTAVNLNTSVPWVM----CQQEDAPDPIINTCN-GFYCDGFTPNSPSKPIMWTE 230
                 D AV      P V+     Q   A      T N            P  P++  E
Sbjct: 202 TQSHVKDNAVLFTNDGPTVLRCGKIQNVLATMDFGATTNLKDIWSKLRRYEPKGPLVNAE 261

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG------GP 284
            Y GW   +   +     E +        E+G +  N+YM++GGTNFG TAG      G 
Sbjct: 262 YYPGWLTHWTEPMANVSTEAITGTFIDMLESGASV-NFYMFYGGTNFGFTAGANDNGPGN 320

Query: 285 LVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
            +A  TSYDYDAP+ E G    PK+  LR +
Sbjct: 321 YIADITSYDYDAPMTEAG-DPTPKYMALRHI 350


>gi|354581347|ref|ZP_09000251.1| Beta-galactosidase [Paenibacillus lactis 154]
 gi|353201675|gb|EHB67128.1| Beta-galactosidase [Paenibacillus lactis 154]
          Length = 587

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 106/335 (31%), Positives = 165/335 (49%), Gaps = 37/335 (11%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SG+IHY R  PE W + + K K  GL  +ETY+ WN+HEP  G++ F G  D+  F+   
Sbjct: 22  SGAIHYFRVVPEYWEDRLLKLKACGLNTVETYIPWNWHEPDEGRFNFSGMADIEAFITLA 81

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
            + GL + +R  PY CAEW +GG P WL   P +Q R  +  F +++  +  ++I  +  
Sbjct: 82  GKLGLHVIVRPSPYICAEWEFGGLPAWLLQDPHMQLRCLDPKFLKKVDAYYDELIPRLVP 141

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAP 202
             L ++ GGPII  Q+ENEYG    +YG     Y+++  +  +     V  ++   +   
Sbjct: 142 --LLSTNGGPIIAVQIENEYG----SYG-NDTAYLQYLQEALIARGVDV--LLFTSDGPT 192

Query: 203 DPIIN--TCNGFYCDGFTPNSPSK------------PIMWTENYSGWFLSFGYAVPFRPV 248
           D ++   T  G        + PS+            P+M  E ++GWF  +      R  
Sbjct: 193 DGMLQGGTVPGVTATVNFGSRPSEAFAKLREYRSEDPLMCMEYWNGWFDHWMKPHHTRDS 252

Query: 249 EDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------PLVATSYDYDAPIDEYGF 301
           ED A   A     G +  N+YM+ GGTNFG   G        P + TSYDYDAP+ E G 
Sbjct: 253 EDAASVFAEMLALGASV-NFYMFHGGTNFGFYNGANYHDKYEPTI-TSYDYDAPLSECGD 310

Query: 302 IRQPKWGHLREL---HKAIKLCEEYLISSDPTHQK 333
           +   K+  +R++   H+ ++L +   +  DP  +K
Sbjct: 311 VTT-KYEAVRQVIAKHQGVELGDLPAL-PDPVRKK 343


>gi|195116355|ref|XP_002002721.1| GI11295 [Drosophila mojavensis]
 gi|193913296|gb|EDW12163.1| GI11295 [Drosophila mojavensis]
          Length = 678

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 106/336 (31%), Positives = 160/336 (47%), Gaps = 44/336 (13%)

Query: 2   SANVTYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNY 59
           + + + DH+A   +++GK     +GS HY R+ PE W   +R  +  GL  ++TYV W+ 
Sbjct: 49  TTSFSIDHQANTFLLNGKPFRYVAGSFHYFRALPEAWRNRLRTMRAAGLNALDTYVEWSL 108

Query: 60  HEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQF 118
           H P  G+Y +EG  DLV+F++  QE   ++ LR GPY CAE + GG P WL    P I+ 
Sbjct: 109 HNPHDGEYNWEGIADLVKFLEIAQEEDFYIVLRPGPYICAERDNGGLPHWLFTKYPDIKV 168

Query: 119 RTTNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK 178
           RT +  +  E+ ++ A+++  +K  +L    GG II+ QVENEY     AY      Y+ 
Sbjct: 169 RTNDPRYIAEVSKWYAELMPRLK--HLLIGNGGKIIMVQVENEYA----AYYACDHDYLN 222

Query: 179 WAAD--------TAVNLNTSVP--WVMCQQEDAPDPIINTCNGFYCDG----------FT 218
           W  D         A+     +P   + C + D     +     F  D             
Sbjct: 223 WLRDETDKYVENKALLFTVDIPNERMHCGKIDN----VFATTDFGIDRIHEIDQIWKYLR 278

Query: 219 PNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG 278
              P+ P++ +E Y GW   +      R  +++A A+        +  N YM+FGGTNFG
Sbjct: 279 SVQPTGPLVNSEFYPGWLTHWQEMNQRRDPQEVASALKTILSYNASV-NLYMFFGGTNFG 337

Query: 279 RTAGG----------PLVATSYDYDAPIDEYGFIRQ 304
            TAG               TSYDYDA +DE G + +
Sbjct: 338 FTAGANYDLDGSIGYTADITSYDYDAVMDEAGGVTK 373



 Score = 40.0 bits (92), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 54/181 (29%), Positives = 79/181 (43%), Gaps = 36/181 (19%)

Query: 472 VFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI-NTLDILSMMVGLQN 530
             L +E L   A VFV+++LV  G  + +   +     + L++G  +TL +L    G  N
Sbjct: 459 TLLQVEDLRDRAHVFVDQQLV--GTLSREARIY----ALPLSKGWGSTLQLLVENQGRIN 512

Query: 531 YGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGS 590
           Y    D  G  +F  + + L NG   L   +W            L+ I++ N   W+Q  
Sbjct: 513 YDRANDTKG--IFGKVTLQLHNGGA-LPLEDWTTTA------YPLEAITIEN---WRQ-- 558

Query: 591 TLPVNKSL--------------IWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
            LP N +L              I Y  +F   E  G   LN A  GKG A+VNG ++GRY
Sbjct: 559 KLPENAALDSSIAKQRLLRSGPILYTGSFQVSE-VGDTYLNPAGWGKGVAYVNGFNLGRY 617

Query: 637 W 637
           W
Sbjct: 618 W 618


>gi|423217397|ref|ZP_17203893.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
           CL03T12C61]
 gi|392628556|gb|EIY22582.1| hypothetical protein HMPREF1061_00666 [Bacteroides caccae
           CL03T12C61]
          Length = 775

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/330 (34%), Positives = 160/330 (48%), Gaps = 34/330 (10%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            V  ++    I+GK   L  G +HYPR   E W + + ++   GL  +  YVFWN+HE  
Sbjct: 29  QVKIENGTFNINGKDVQLICGEMHYPRIPHEYWRDRLHRAHAMGLNTVSAYVFWNFHERQ 88

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G + F G+ D+  FV+  QE GL++ LR GPY CAEW++GG+P WL     + +R+ + 
Sbjct: 89  PGVFDFSGQADIAEFVRIAQEEGLYVILRPGPYVCAEWDFGGYPSWLLKEKDLTYRSKDP 148

Query: 124 PFKEEMKRFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
            F    +R+   I +L KQ   L  + GG II+ QVENEYG    +Y    E Y+    D
Sbjct: 149 RFMSYCERY---IKELGKQLAPLTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRD 200

Query: 183 TAVNLNTSVPWVMCQ---QEDAPD--PIINTCNGFYCDGF----TPNSPSKPIMWTENYS 233
                  +VP   C    Q +A      + T NG + +          P  P    E Y 
Sbjct: 201 MLQEAGFNVPLFTCDGGGQVEAGHIAGALPTLNGVFGEDIFKIVDKYHPGGPYFVAEFYP 260

Query: 234 GWFLSFGY---AVPF-RPVEDLAFAVARFFETGGTFQNYYMYFGGTNF-----GRTAGG- 283
            WF  +G    +V + RP E L + +       G   + YM+ GGTNF       T+GG 
Sbjct: 261 AWFDEWGKRHSSVAYERPAEQLDWMLGH-----GVSVSMYMFHGGTNFWYMNGANTSGGF 315

Query: 284 PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
               TSYDYDAP+ E+G    PK+   RE+
Sbjct: 316 RPQPTSYDYDAPLGEWGNCY-PKYHAFREI 344



 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 68/277 (24%), Positives = 118/277 (42%), Gaps = 58/277 (20%)

Query: 419 SSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSD---YLWYTASIHVMPGQGKEVFLN 475
           ++ F+  E K       +F +   +E + + +D      Y+ Y  +I   PG+ K   L 
Sbjct: 364 TTTFATVELKESAPLTTAFHQTIQSEDVLSMEDVGTDFGYIHYQTTIKT-PGKQK---LI 419

Query: 476 IESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWF 535
           I+ L   A++ V+ K VA    + D      +  +++++   TL+IL    G  NYG   
Sbjct: 420 IQDLRDYAVILVDGKQVA----SLDRRYNQNSTTLDIHKVPATLEILVENTGRVNYGPDI 475

Query: 536 DVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLPVN 595
                G+ S +L     G   L+        G     + L K  +++ SF ++   +P  
Sbjct: 476 LFNRKGITSQVLW----GNEKLT--------GWSITPLPLYKEEVSSLSFGQEIKGVPA- 522

Query: 596 KSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRG 655
               +++ TF+  E +G   ++++  GKG  WVNG+S+GR+W+                 
Sbjct: 523 ----FHRGTFII-EQQGDCFVDMSQWGKGAVWVNGKSLGRFWNI---------------- 561

Query: 656 SYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHE 692
                        P QTLY IP  W+  GEN +V+ E
Sbjct: 562 ------------GPQQTLY-IPAPWLKKGENEIVVFE 585


>gi|424760912|ref|ZP_18188500.1| putative beta-galactosidase [Enterococcus faecalis R508]
 gi|402402633|gb|EJV35336.1| putative beta-galactosidase [Enterococcus faecalis R508]
          Length = 593

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|354466872|ref|XP_003495895.1| PREDICTED: beta-galactosidase-1-like protein 3-like [Cricetulus
           griseus]
          Length = 761

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 154/317 (48%), Gaps = 23/317 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             +DG + ++  GSIHY R   E W + + K +  G   + TY+ WN HE  RG + F  
Sbjct: 186 FTLDGHKFMIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQNRGTFDFSE 245

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  +V      GL++ LR GPY CAE + GG P WL   P +Q RTT   F + + +
Sbjct: 246 ILDLEAYVSLAATLGLWVILRPGPYICAEVDLGGLPSWLLGYPELQLRTTQQEFLDAVDK 305

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL--YVKWAADTA--VNL 187
           +   +I  +        +GGP+I  Q+ENEYG    ++   G+   Y+K A      V L
Sbjct: 306 YFDHLIPRILPLQYL--RGGPVIAVQIENEYG----SFSKDGDYMEYIKEALQKRGIVEL 359

Query: 188 NTSVPWVMCQQEDAPDPIINTCN--GFYCDGFTP---NSPSKPIMWTENYSGWFLSFGYA 242
             +       Q  +    + T N   F  D F         KPIM  E ++GWF ++G  
Sbjct: 360 LLTSDNHKGIQTGSVKGALTTINMASFEKDSFIKLLQMQNDKPIMVMEYWTGWFDTWGRE 419

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAPI 296
              +  E++ + V+RF + G +F N YM+ GGTNFG   G         V TSYDYDA +
Sbjct: 420 HNVKSAEEIRYTVSRFIKYGISF-NMYMFHGGTNFGFINGAFHYDKHSSVVTSYDYDAVL 478

Query: 297 DEYGFIRQPKWGHLREL 313
            E G   + K+  LR+L
Sbjct: 479 TEAGDYTE-KYFKLRKL 494


>gi|257870316|ref|ZP_05649969.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
 gi|257804480|gb|EEV33302.1| glycosyl hydrolase [Enterococcus gallinarum EG2]
          Length = 593

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 110/314 (35%), Positives = 149/314 (47%), Gaps = 35/314 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G    L SG+IHY R  P+ W   +   K  G   +ETYV WN HEP +G + FEG
Sbjct: 10  FLMNGSPFKLLSGAIHYFRVHPDDWEHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL RF+   QE GL++ LR  PY CAEW +GG P WL    G + R  +  +   +  
Sbjct: 70  ILDLERFLSLAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG-RLRACDPSYLAHVAE 128

Query: 132 F----LAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVG-GELYVKWAADTA 184
           +    L KII          S GG I++ QVENEYG+   E AY     E+ +    D  
Sbjct: 129 YYDVLLPKIIPYQ------LSHGGNILMIQVENEYGSYGEEKAYLRAIKEMLINRGIDMP 182

Query: 185 VNLNTSVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYS 233
           +   +  PW    +  +   D ++ T N             D F  ++   P+M  E + 
Sbjct: 183 L-FTSDGPWQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWD 241

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
           GWF  +   +  R  +DLA +V    E G    N YM+ GGTNFG       R A     
Sbjct: 242 GWFNRWNEPIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQ 299

Query: 287 ATSYDYDAPIDEYG 300
            TSYDYDAP+DE G
Sbjct: 300 VTSYDYDAPLDEQG 313


>gi|418977089|ref|ZP_13524926.1| glycosyl hydrolase family 35 [Streptococcus mitis SK575]
 gi|383350422|gb|EID28291.1| glycosyl hydrolase family 35 [Streptococcus mitis SK575]
          Length = 601

 Score =  150 bits (378), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G++ FEG  
Sbjct: 18  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFRFEGAL 77

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 78  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 136

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 137 DQLLSRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 194

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 195 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 254

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 255 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 312

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 313 ALLDEEG 319


>gi|424687003|ref|ZP_18123658.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
 gi|402366194|gb|EJV00591.1| putative beta-galactosidase [Enterococcus faecalis ERV25]
          Length = 593

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|229548754|ref|ZP_04437479.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
 gi|257421063|ref|ZP_05598053.1| glycosyl hydrolase [Enterococcus faecalis X98]
 gi|312951816|ref|ZP_07770707.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
 gi|422691033|ref|ZP_16749073.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
 gi|422707894|ref|ZP_16765431.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
 gi|229306094|gb|EEN72090.1| possible beta-galactosidase [Enterococcus faecalis ATCC 29200]
 gi|257162887|gb|EEU92847.1| glycosyl hydrolase [Enterococcus faecalis X98]
 gi|310630219|gb|EFQ13502.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0102]
 gi|315154243|gb|EFT98259.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0031]
 gi|315154885|gb|EFT98901.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0043]
          Length = 593

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|257418414|ref|ZP_05595408.1| beta-galactosidase [Enterococcus faecalis T11]
 gi|257160242|gb|EEU90202.1| beta-galactosidase [Enterococcus faecalis T11]
          Length = 592

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 70  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345


>gi|257415380|ref|ZP_05592374.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
 gi|257157208|gb|EEU87168.1| beta-galactosidase [Enterococcus faecalis ARO1/DG]
          Length = 593

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|443718372|gb|ELU09030.1| hypothetical protein CAPTEDRAFT_226658 [Capitella teleta]
          Length = 347

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 71/154 (46%), Positives = 102/154 (66%), Gaps = 2/154 (1%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
           A  ++GK+ +L SG++HY R  PE W + + K K  GL  +ETYV WN HE +RG + F 
Sbjct: 10  AFFLNGKKTLLLSGAVHYFRVVPEYWRDRLLKVKAAGLNCVETYVAWNAHEAVRGTFDFS 69

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G  DL RF++  Q+ GL++ LR GPY C+EW++GG P WL   P ++ RT+  P+ E + 
Sbjct: 70  GILDLRRFIQIAQDVGLYVLLRPGPYICSEWDFGGLPSWLLHDPEMKVRTSYPPYLEAVD 129

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGN 164
            +LAKI+ L+   +L  S+GGPII  Q+ENEYG+
Sbjct: 130 AYLAKILPLVN--DLQMSKGGPIIAVQLENEYGS 161


>gi|227517783|ref|ZP_03947832.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
 gi|424678087|ref|ZP_18114931.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
 gi|424681129|ref|ZP_18117923.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
 gi|424685648|ref|ZP_18122340.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
 gi|424689662|ref|ZP_18126226.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
 gi|424693525|ref|ZP_18129955.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
 gi|424698239|ref|ZP_18134537.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
 gi|424701365|ref|ZP_18137539.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
 gi|424702750|ref|ZP_18138894.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
 gi|424711867|ref|ZP_18144074.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
 gi|424717978|ref|ZP_18147248.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
 gi|424722429|ref|ZP_18151489.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
 gi|424723619|ref|ZP_18152577.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
 gi|424733091|ref|ZP_18161660.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
 gi|424746203|ref|ZP_18174452.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
 gi|424755204|ref|ZP_18183090.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
 gi|227074744|gb|EEI12707.1| possible beta-galactosidase [Enterococcus faecalis TX0104]
 gi|402351976|gb|EJU86842.1| putative beta-galactosidase [Enterococcus faecalis ERV116]
 gi|402352513|gb|EJU87362.1| putative beta-galactosidase [Enterococcus faecalis ERV103]
 gi|402358223|gb|EJU92905.1| putative beta-galactosidase [Enterococcus faecalis ERV129]
 gi|402367111|gb|EJV01460.1| putative beta-galactosidase [Enterococcus faecalis ERV31]
 gi|402371797|gb|EJV05943.1| putative beta-galactosidase [Enterococcus faecalis ERV62]
 gi|402373001|gb|EJV07093.1| putative beta-galactosidase [Enterococcus faecalis ERV41]
 gi|402373959|gb|EJV08006.1| putative beta-galactosidase [Enterococcus faecalis ERV37]
 gi|402382684|gb|EJV16335.1| putative beta-galactosidase [Enterococcus faecalis ERV65]
 gi|402383232|gb|EJV16843.1| putative beta-galactosidase [Enterococcus faecalis ERV68]
 gi|402386182|gb|EJV19689.1| putative beta-galactosidase [Enterococcus faecalis ERV63]
 gi|402388743|gb|EJV22170.1| putative beta-galactosidase [Enterococcus faecalis ERV72]
 gi|402392403|gb|EJV25665.1| putative beta-galactosidase [Enterococcus faecalis ERV81]
 gi|402397550|gb|EJV30559.1| putative beta-galactosidase [Enterococcus faecalis ERV73]
 gi|402397571|gb|EJV30579.1| putative beta-galactosidase [Enterococcus faecalis ERV85]
 gi|402401167|gb|EJV33955.1| putative beta-galactosidase [Enterococcus faecalis ERV93]
          Length = 593

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|194857009|ref|XP_001968877.1| GG24263 [Drosophila erecta]
 gi|190660744|gb|EDV57936.1| GG24263 [Drosophila erecta]
          Length = 672

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 106/327 (32%), Positives = 158/327 (48%), Gaps = 43/327 (13%)

Query: 6   TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           T DH A   ++DG+     SGS HY R+ PE W   +R  +  GL  ++TYV W+ H P 
Sbjct: 47  TIDHAANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPH 106

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTN 122
            G+Y +EG  D+V+F++  Q+   ++ LR GPY CAE + GG P WL    P I+ RT +
Sbjct: 107 DGEYNWEGIADVVKFLEIAQQEDFYIILRPGPYICAERDNGGLPHWLFAKYPSIKMRTND 166

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             +  E+ ++ A+++   + ++LF   GG II+ QVENEYG+    +      Y+ W  D
Sbjct: 167 PNYIAEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 219

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTC----NGFYCDGFTPN---------------SPS 223
                 T    +     D P+  + +C    N F    F  +                P+
Sbjct: 220 ETEKYVTGKALLFTV--DIPNEKM-SCGKIENVFATTDFGIDRINEIDQIWAMLRTLQPT 276

Query: 224 KPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG 283
            P++ +E Y GW   +      R  +++A A+        +  N YM+FGGTNFG TAG 
Sbjct: 277 GPLVNSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGA 335

Query: 284 ----------PLVATSYDYDAPIDEYG 300
                         TSYDYDA +DE G
Sbjct: 336 NYNLDGGIGYAADITSYDYDAVMDEAG 362


>gi|417846883|ref|ZP_12492867.1| beta-galactosidase [Streptococcus mitis SK1073]
 gi|339458003|gb|EGP70556.1| beta-galactosidase [Streptococcus mitis SK1073]
          Length = 595

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + + + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRLRSSDPAYIDAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLSRLVPHLL--DNGGNILIMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|422727867|ref|ZP_16784288.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
 gi|315151617|gb|EFT95633.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0012]
          Length = 593

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|224542300|ref|ZP_03682839.1| hypothetical protein CATMIT_01478 [Catenibacterium mitsuokai DSM
           15897]
 gi|224524842|gb|EEF93947.1| glycosyl hydrolase family 35 [Catenibacterium mitsuokai DSM 15897]
          Length = 577

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 159/327 (48%), Gaps = 35/327 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +IDG++  + SG++HY R  PE W + +   K+ G   +ETY+ WN HEP +G++ F+G
Sbjct: 10  FIIDGQKTKIISGAVHYFRIVPEYWEDTLLDLKDMGCNAVETYIPWNLHEPYKGKFDFDG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
           + D+  F++  ++ GL++ +R  PY C+EW  GG P WL     I+ RT ++ + + ++ 
Sbjct: 70  QKDVCAFLELAKKLGLYVIIRPSPYICSEWELGGLPAWLLKDSDIRLRTNDSVYMKHLEE 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           + A ++ ++ +  +  ++ G IILAQ+ENEYG    +Y    + Y+K            V
Sbjct: 130 YYAVLLPMIAKYQI--NREGTIILAQLENEYG----SYNQDKD-YLKALLKMMREYGIEV 182

Query: 192 PWVMCQ---QEDAPDPIINTCNGFYCDGFTPNSPSK---------------PIMWTENYS 233
           P        +E      +   + F    F  N+                  PIM  E + 
Sbjct: 183 PIFTADGTWEEALEAGSLFEEDVFPTGNFGSNAKENIAVLKEFMKKHQIVAPIMCMEFWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
           GWF  +   +  R  E+L  +     + G    N+YM+ GGTNFG       R       
Sbjct: 243 GWFNRWNMEIVKRDPEELVQSAKEMIDLGSI--NFYMFHGGTNFGWMNGCSARKEHDLPQ 300

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLREL 313
            TSYDYDA + EYG  +  K+  LR++
Sbjct: 301 ITSYDYDAILTEYG-AKTEKYHLLRKM 326


>gi|423280524|ref|ZP_17259436.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
           610]
 gi|404583731|gb|EKA88404.1| hypothetical protein HMPREF1203_03653 [Bacteroides fragilis HMW
           610]
          Length = 628

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 111/346 (32%), Positives = 162/346 (46%), Gaps = 46/346 (13%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
               TSYDYDAPI E G++  PK+  +R +   I+   +Y +   P
Sbjct: 326 QPDLTSYDYDAPISEAGWV-TPKYDSIRNV---IRKYVKYTVPEAP 367



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWVTPKYDSIRNVIRKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 VPEAPAPNPVIEIPSI-KLTKVADVLAFAEKQKPVSADT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGL 575
            TL IL   +G  NYG+       G+ S + I  K       +GEW +YQ+ +  E   L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVKIAGKE-----ITGEWDMYQLPM-SEMPDL 519

Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
            K+  A++          +    + Y+ TF   +  G   +++ + GKG  +VNG +IGR
Sbjct: 520 AKLK-ADAHANVPAEAAKLKGCPVLYEGTFTL-DNVGDTFIDMENWGKGIIFVNGVNIGR 577

Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
           YW                               P QTLY IP  W+  G N +VI E+L 
Sbjct: 578 YWKV----------------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLN 608

Query: 696 GDPSKISLLTKT 707
             P       KT
Sbjct: 609 EVPQAEVKTVKT 620


>gi|257083732|ref|ZP_05578093.1| beta-galactosidase [Enterococcus faecalis Fly1]
 gi|256991762|gb|EEU79064.1| beta-galactosidase [Enterococcus faecalis Fly1]
          Length = 593

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLSPLQITQGGPVIMMQVENEYG----SYGM-EKAYLQQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|194221516|ref|XP_001490197.2| PREDICTED: beta-galactosidase-like [Equus caballus]
          Length = 641

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 162/342 (47%), Gaps = 20/342 (5%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            + Y H   + DG+     SGSIHY R     W + + K K  GL  I+TYV WN+HEP 
Sbjct: 12  KIDYSHNRFLKDGQPFRYISGSIHYFRIPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 71

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            GQY F    D+  F++   E GL + LR GPY CAEW+ GG P WL     I  R+++ 
Sbjct: 72  PGQYQFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKQSIVLRSSDP 131

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAY-GVGGELYVKW 179
            +   + ++L  ++  MK   L    GGPII  QVENEYG+    ++ Y     +L+ + 
Sbjct: 132 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIITVQVENEYGSYFTCDYDYLRFLQKLFHQH 189

Query: 180 AADTAVNLNTS---VPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMWTENY 232
             D  +   T      ++ C         ++  +G            + P  P++ +E Y
Sbjct: 190 LGDDVLLFTTDGIFQKFLKCGALQGLYATVDFGSGINVTAAFQIQRKSEPRGPLINSEFY 249

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----VAT 288
           +GW   +G        + +A  +     +G    N YM+ GGTNF    G  L      T
Sbjct: 250 TGWLDHWGQRHSKAKTDVVASTLYDILASGANV-NMYMFIGGTNFAYWNGANLPYQPQPT 308

Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAI-KLCEEYLISSDP 329
           SYDYDAP+ E G + + K+  LR++ K   K+ E ++  S P
Sbjct: 309 SYDYDAPLSEAGDLTE-KYFALRDVIKKFEKVPEGFIPPSTP 349


>gi|29375402|ref|NP_814556.1| glycosyl hydrolase [Enterococcus faecalis V583]
 gi|29342862|gb|AAO80626.1| glycosyl hydrolase, family 35 [Enterococcus faecalis V583]
          Length = 592

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 70  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345


>gi|424665121|ref|ZP_18102157.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
           616]
 gi|404574985|gb|EKA79730.1| hypothetical protein HMPREF1205_00996 [Bacteroides fragilis HMW
           616]
          Length = 628

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 111/346 (32%), Positives = 162/346 (46%), Gaps = 46/346 (13%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKVTPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+KT  E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGDLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVNQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQVGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
               TSYDYDAPI E G++  PK+  +R +   I+   +Y +   P
Sbjct: 326 QPDLTSYDYDAPISEAGWV-TPKYDSIRNV---IRKYVKYTVPEAP 367



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDLTSYDYDAPISEAGWVTPKYDSIRNVIRKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +  F      EQ+N       Y+
Sbjct: 363 VPEAPAPNPVIEIPSI-KLTKVADVLAFAEKQKPVSADTPFT----FEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGL 575
            TL IL   +G  NYG+       G+ S + I  K       +GEW +YQ+ +  E   L
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVKIAGKE-----ITGEWDMYQLPM-SEMPDL 519

Query: 576 DKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGR 635
            K+  A++          +    + Y+ TF   +  G   +++ + GKG  +VNG +IGR
Sbjct: 520 AKLK-ADAHANVPAEAAKLKGCPVLYEGTFTL-DNVGDTFIDMENWGKGIIFVNGVNIGR 577

Query: 636 YWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELG 695
           YW                               P QTLY IP  W+  G N +VI E+L 
Sbjct: 578 YWKV----------------------------GPQQTLY-IPGVWLKKGTNKIVIFEQLN 608

Query: 696 GDPSKISLLTKT 707
             P       KT
Sbjct: 609 EVPQAEVKTVKT 620


>gi|195473731|ref|XP_002089146.1| GE18961 [Drosophila yakuba]
 gi|194175247|gb|EDW88858.1| GE18961 [Drosophila yakuba]
          Length = 672

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 105/323 (32%), Positives = 157/323 (48%), Gaps = 39/323 (12%)

Query: 8   DHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           DH A   ++DG+     SGS HY R+ PE W   +R  +  GL  ++TYV W+ H P  G
Sbjct: 49  DHEANTFMLDGQPFRYVSGSFHYFRAVPESWRSRLRTMRASGLNALDTYVEWSLHNPHDG 108

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHF-IPGIQFRTTNNP 124
           +Y +EG  D+V+F++  QE   ++ LR GPY CAE + GG P WL    P I+ RT +  
Sbjct: 109 EYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFAKYPSIKMRTNDPN 168

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD-- 182
           +  E+ ++ A+++   + ++LF   GG II+ QVENEYG+    +      Y+ W  D  
Sbjct: 169 YISEVGKWYAELMP--RLQHLFVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRDET 221

Query: 183 ------TAVNLNTSVP--WVMCQQ-------EDAPDPIINTCNGFYCDGFTPNSPSKPIM 227
                  A+     +P   + C +        D     IN  +  +        P+ P++
Sbjct: 222 EKYVSGKALLFTVDIPNEKMSCGKIENVFATTDFGIDRINEIDKIWA-MLRALQPTGPLV 280

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG---- 283
            +E Y GW   +      R  +++A A+        +  N YM+FGGTNFG TAG     
Sbjct: 281 NSEFYPGWLTHWQEQNQRRDGQEVANALRTILSYNASV-NLYMFFGGTNFGFTAGANYNL 339

Query: 284 ------PLVATSYDYDAPIDEYG 300
                     TSYDYDA +DE G
Sbjct: 340 DGGIGYAADITSYDYDAVMDEAG 362


>gi|1352080|sp|P48982.1|BGAL_XANMN RecName: Full=Beta-galactosidase; Short=Lactase; Flags: Precursor
 gi|1045034|gb|AAC41485.1| beta-galactosidase [Xanthomonas axonopodis pv. manihotis]
          Length = 598

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 116/320 (36%), Positives = 152/320 (47%), Gaps = 25/320 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 37  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 96

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FVK     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 97  NNDVAAFVKEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 156

Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNL 187
           +L     L KQ + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L
Sbjct: 157 YLDA---LAKQVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-L 212

Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G  
Sbjct: 213 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 272

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
                    A         G +  N YM+ GGT+FG   G               TSYDY
Sbjct: 273 HAATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDY 331

Query: 293 DAPIDEYGFIRQPKWGHLRE 312
           DA +DE G    PK+  +R+
Sbjct: 332 DAILDEAGHP-TPKFALMRD 350


>gi|334138027|ref|ZP_08511451.1| beta-galactosidase [Paenibacillus sp. HGF7]
 gi|333604560|gb|EGL15950.1| beta-galactosidase [Paenibacillus sp. HGF7]
          Length = 601

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 111/325 (34%), Positives = 156/325 (48%), Gaps = 38/325 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
           L+ D   R++ SG++HY R  PE W + + K K  G   +ETYV WN HEP  G++ F G
Sbjct: 12  LLNDKPLRII-SGALHYFRVVPEYWRDRLLKMKACGCNTVETYVAWNVHEPEEGKFDFGG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D++ FV+   E GL + +R  PY CAEW +GG P WL     +Q R ++        +
Sbjct: 71  IADVIAFVELAGELGLHVIVRPSPYICAEWEFGGLPAWLLKDSEMQLRCSD-------PK 123

Query: 132 FLAKI-----IDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVN 186
           FLAK+     + L K   L  + GGPII  QVENEYG    +YG   + Y+ +  D  + 
Sbjct: 124 FLAKVDAYYDVLLPKFVPLLCTNGGPIIAMQVENEYG----SYG-NDKAYLGYLRDGMIA 178

Query: 187 LNTSVPWV--------MCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTENYSG 234
               V           M Q    PD +     G   +     F    P +P+M  E ++G
Sbjct: 179 RGIDVLLFTSDGPTDEMLQGGTLPDVLATVNFGSRPEESFAKFREYRPDEPLMCMEFWNG 238

Query: 235 WFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------AT 288
           WF  +      R  ED A  +      G +  N+YM+ GGTNFG  +G   +       T
Sbjct: 239 WFDHWMEEHHTRDGEDAARVLDDMLGAGASV-NFYMFHGGTNFGFYSGANHIKTYEPTVT 297

Query: 289 SYDYDAPIDEYGFIRQPKWGHLREL 313
           SYDYDAP+ E G +   K+   RE+
Sbjct: 298 SYDYDAPLTERGDL-TAKYEAFREV 321


>gi|327260596|ref|XP_003215120.1| PREDICTED: beta-galactosidase-like [Anolis carolinensis]
          Length = 679

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 102/320 (31%), Positives = 146/320 (45%), Gaps = 27/320 (8%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S ++ Y  +  + DG +    SGSIHY R     W + + K    GL  ++ Y+ WNYHE
Sbjct: 70  SFSIDYTDKCFLKDGVKFRYISGSIHYFRIPRAYWKDRLLKMYMSGLNAVQIYIPWNYHE 129

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P+ G Y F+G  DL  F+       L + LR GPY CAEW  GG P WL   P I  RT+
Sbjct: 130 PLSGVYNFDGDRDLEGFLDLAANFDLLVILRPGPYICAEWEMGGIPSWLLAKPNIILRTS 189

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F + + ++ + ++  +K        GG II  QVENEYG+    Y    +      A
Sbjct: 190 DPDFLQAVDKWFSVLLPKIKPH--LYINGGNIISVQVENEYGSY---YACDYDYLRHLEA 244

Query: 182 DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTP-------------NSPSKPIM 227
                L   V           + +  T +G Y    F P             + P+ P++
Sbjct: 245 VFRSYLGKKVVLFTTDGTKESELLCGTLHGLYTTVDFGPEENVTEAFEKQRIHEPNGPLV 304

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL-- 285
            +E Y+GW   +G     +  ED+A  + +  E G    N YM+ GGTNFG  +G     
Sbjct: 305 NSEYYTGWLDYWGEPHSTKSAEDVARGLEKMLELGANV-NMYMFQGGTNFGYWSGADYNN 363

Query: 286 -----VATSYDYDAPIDEYG 300
                + TSYDYDAP+ E G
Sbjct: 364 GIYNPITTSYDYDAPLSEAG 383


>gi|255973889|ref|ZP_05424475.1| beta-galactosidase [Enterococcus faecalis T2]
 gi|307284354|ref|ZP_07564519.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
 gi|255966761|gb|EET97383.1| beta-galactosidase [Enterococcus faecalis T2]
 gi|306503294|gb|EFM72546.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0860]
          Length = 593

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTRQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|420261585|ref|ZP_14764229.1| glycosyl hydrolase [Enterococcus sp. C1]
 gi|394771519|gb|EJF51280.1| glycosyl hydrolase [Enterococcus sp. C1]
          Length = 591

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 168/352 (47%), Gaps = 43/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLDGKPIKLISGAIHYFRMTPVQWTDSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FVK  Q  GL + LR   Y CAEW +GG P WL   P ++ R+T+  F  +++ 
Sbjct: 70  MKDICAFVKQAQTIGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  + GGP+I+ QVENEYG    +YG+  + Y++   +        V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEEYGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
           P  +   + A + +++       D F T N  S+                   PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDIFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGR----TAGGPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG     +A G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFYNGCSARGALDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
              +SYDYDA + E G   +P      ++ KAIK     +  + P  ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPT-DKYYQVQKAIKEACPEVWQAKPRTKQLAA 345


>gi|297483826|ref|XP_002693891.1| PREDICTED: galactosidase, beta 1-like 3 [Bos taurus]
 gi|296479482|tpg|DAA21597.1| TPA: galactosidase, beta 1-like [Bos taurus]
          Length = 899

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/328 (34%), Positives = 153/328 (46%), Gaps = 37/328 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G   ++  GS+HY R     W + + K +  G   + TYV WN HEP RG + F G
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  F+   +E GL++ LR GPY C+E + GG P WL   P  Q RTTN  F   + +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNK 440

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
           +   +I  +        QGGPII  QVENEYG    + AY           G+GG L   
Sbjct: 441 YFDHLIPRVALLQYL--QGGPIIAVQVENEYGFFYKDEAYMPYLLQALQQRGIGGLLLT- 497

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGW 235
            A  T   +   +  V+          IN   GF  D F         KPI+  E + GW
Sbjct: 498 -ADSTEEVMRGHIKGVLAS--------IN-MKGFKVDSFKHLYKLQRHKPILIMEFWVGW 547

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATS 289
           F ++G       V ++  +V+ F   G +F N YM+ GGTNFG   G         V TS
Sbjct: 548 FDTWGIDHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTS 606

Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           YDYDA + E G     K+  LR L ++I
Sbjct: 607 YDYDAVLTEAGDYTA-KYFMLRSLFESI 633


>gi|397699203|ref|YP_006536991.1| beta-galactosidase [Enterococcus faecalis D32]
 gi|397335842|gb|AFO43514.1| beta-galactosidase [Enterococcus faecalis D32]
          Length = 593

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|312901648|ref|ZP_07760918.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
 gi|311291259|gb|EFQ69815.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0470]
          Length = 593

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|256761574|ref|ZP_05502154.1| beta-galactosidase [Enterococcus faecalis T3]
 gi|422736227|ref|ZP_16792491.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
 gi|256682825|gb|EEU22520.1| beta-galactosidase [Enterococcus faecalis T3]
 gi|315166978|gb|EFU10995.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1341]
          Length = 593

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|255971270|ref|ZP_05421856.1| beta-galactosidase [Enterococcus faecalis T1]
 gi|255962288|gb|EET94764.1| beta-galactosidase [Enterococcus faecalis T1]
          Length = 593

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTRQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLTVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|227554928|ref|ZP_03984975.1| possible beta-galactosidase [Enterococcus faecalis HH22]
 gi|422713751|ref|ZP_16770500.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
 gi|422716430|ref|ZP_16773136.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
 gi|227175936|gb|EEI56908.1| possible beta-galactosidase [Enterococcus faecalis HH22]
 gi|315575268|gb|EFU87459.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309B]
 gi|315581351|gb|EFU93542.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0309A]
          Length = 593

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|21243811|ref|NP_643393.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
 gi|390989312|ref|ZP_10259611.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
           859]
 gi|21109406|gb|AAM37929.1| beta-galactosidase [Xanthomonas axonopodis pv. citri str. 306]
 gi|372556070|emb|CCF66586.1| beta-galactosidase [Xanthomonas axonopodis pv. punicae str. LMG
           859]
          Length = 613

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 112/319 (35%), Positives = 152/319 (47%), Gaps = 23/319 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  HNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  + + +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPH 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
                   A         G +  N YM+ GGT+FG   G               TSYDYD
Sbjct: 276 AATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYD 334

Query: 294 APIDEYGFIRQPKWGHLRE 312
           A +DE G    PK+  +R+
Sbjct: 335 AILDEAGHP-TPKFALMRD 352


>gi|294627330|ref|ZP_06705916.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
           ICPB 11122]
 gi|292598412|gb|EFF42563.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
           ICPB 11122]
          Length = 613

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 116/322 (36%), Positives = 156/322 (48%), Gaps = 29/322 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  + + +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK-- 273

Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
           P    +  A   A  FE     G   N YM+ GGT+FG   G               TSY
Sbjct: 274 PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331

Query: 291 DYDAPIDEYGFIRQPKWGHLRE 312
           DYDA +DE G    PK+  +R+
Sbjct: 332 DYDAILDEAGHP-TPKFALMRD 352


>gi|358415935|ref|XP_600640.6| PREDICTED: uncharacterized protein LOC522360 [Bos taurus]
          Length = 1360

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 114/328 (34%), Positives = 154/328 (46%), Gaps = 37/328 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G   ++  GS+HY R     W + + K +  G   + TYV WN HEP RG + F G
Sbjct: 321 FTLEGHEFLILGGSVHYFRVPRASWRDRLLKLRACGFNTVTTYVPWNLHEPERGTFDFSG 380

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  F+   +E GL++ LR GPY C+E + GG P WL   P  Q RTTN  F   + +
Sbjct: 381 NLDLEAFILLAEEVGLWVILRPGPYICSEMDLGGLPSWLLQDPTSQLRTTNRSFVNAVNK 440

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
           +   +I   +   L   QGGPII  QVENEYG    + AY           G+GG L   
Sbjct: 441 YFDHLIP--RVALLQYLQGGPIIAVQVENEYGFFYKDEAYMPYLLQALQQRGIGGLLLT- 497

Query: 179 WAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGW 235
            A  T   +   +  V+          IN   GF  D F         KPI+  E + GW
Sbjct: 498 -ADSTEEVMRGHIKGVLAS--------IN-MKGFKVDSFKHLYKLQRHKPILIMEFWVGW 547

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATS 289
           F ++G       V ++  +V+ F   G +F N YM+ GGTNFG   G         V TS
Sbjct: 548 FDTWGIDHRVMGVNEVEKSVSEFIRYGISF-NVYMFHGGTNFGFMNGATSFEKHRGVTTS 606

Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAI 317
           YDYDA + E G     K+  LR L ++I
Sbjct: 607 YDYDAVLTEAGDYTA-KYFMLRSLFESI 633


>gi|148273884|ref|YP_001223445.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
           michiganensis NCPPB 382]
 gi|147831814|emb|CAN02784.1| putative beta-galactosidase [Clavibacter michiganensis subsp.
           michiganensis NCPPB 382]
          Length = 599

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 100/305 (32%), Positives = 144/305 (47%), Gaps = 26/305 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DG+   + +G++HY R  P+ W + IRK++  GL+ IETYV WN H P RG +      
Sbjct: 20  LDGRPHRVIAGALHYFRVHPDQWADRIRKARLMGLDTIETYVAWNAHSPERGAFDTSAGL 79

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF+  V   G+   +R GPY CAEW+ GG P WL   P +  R +   +   +  FL
Sbjct: 80  DLGRFLDLVHAEGMHAIVRPGPYICAEWDGGGLPGWLFEDPAVGVRRSEPLYLAAVDEFL 139

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            ++ +++    +    GGP+IL Q+ENEYG    AYG   + Y++   D        VP 
Sbjct: 140 RRVYEIVAPRQI--DMGGPVILVQIENEYG----AYGDDAD-YLRHLVDLTRESGIIVPL 192

Query: 194 VMCQQEDAPDPIINTCNGFYCDG------------FTPNSPSKPIMWTENYSGWFLSFGY 241
               Q         + +  +  G               + P+ P+M +E + GWF  +G 
Sbjct: 193 TTVDQPTDEMLSRGSLDELHRTGSFGSRATERLATLRRHQPTGPLMCSEFWDGWFDHWGE 252

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYDAP 295
                   D A  +      G +  N YM+ GGTNFG T G           TSYDYDAP
Sbjct: 253 HHHTTSAADAAAELDALLAAGASV-NIYMFHGGTNFGFTNGANHKGTYQSHVTSYDYDAP 311

Query: 296 IDEYG 300
           +DE G
Sbjct: 312 LDETG 316


>gi|313241117|emb|CBY33414.1| unnamed protein product [Oikopleura dioica]
          Length = 608

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 109/347 (31%), Positives = 159/347 (45%), Gaps = 38/347 (10%)

Query: 13  VIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR 72
           +   K R+L SGS+HY R   E W + + K K  GL  ++TY+ WN HEP  G + FE  
Sbjct: 12  LFKSKTRIL-SGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDE 70

Query: 73  FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN-PFKEEMKR 131
            D+  F+K  ++ GL++ +R GPY CAEW +GGFP WL     +  R T +  +   ++ 
Sbjct: 71  LDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +   ++      S+GGPII  QVENEY     +Y    E Y+ W  +   ++    
Sbjct: 131 WFTVLFSQLRDHQW--SRGGPIISIQVENEYA----SYNKDSE-YLPWVKNLLTDVGKCF 183

Query: 192 PWVMCQQED--------APDPII-----NTCNGF-YCDGFTPNSPSKPIMWTENYSGWFL 237
              +  + +         PD  +     +  N F   D   PN   +P M TE ++GWF 
Sbjct: 184 LLKIINETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQPN---RPKMVTEFWAGWFD 240

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPLVAT 288
            +G                R     G+  N YM+ GGT+FG  AG         G    T
Sbjct: 241 HWGQQGHSTLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTT 300

Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHKAI--KLCEEYLISSDPTHQK 333
           SYDYDAP+ E G + + KW   RE+ K    K   +  +   P  QK
Sbjct: 301 SYDYDAPLSESGDLTE-KWNVTREIIKEFFPKYINDSYVFRRPEIQK 346


>gi|256959941|ref|ZP_05564112.1| beta-galactosidase [Enterococcus faecalis Merz96]
 gi|293384307|ref|ZP_06630193.1| beta-galactosidase [Enterococcus faecalis R712]
 gi|293388457|ref|ZP_06632963.1| beta-galactosidase [Enterococcus faecalis S613]
 gi|312907112|ref|ZP_07766105.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
 gi|312979309|ref|ZP_07791007.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
 gi|256950437|gb|EEU67069.1| beta-galactosidase [Enterococcus faecalis Merz96]
 gi|291078380|gb|EFE15744.1| beta-galactosidase [Enterococcus faecalis R712]
 gi|291082147|gb|EFE19110.1| beta-galactosidase [Enterococcus faecalis S613]
 gi|310626889|gb|EFQ10172.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 512]
 gi|311287903|gb|EFQ66459.1| glycosyl hydrolase family 35 [Enterococcus faecalis DAPTO 516]
          Length = 593

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLQQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLRKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|156408171|ref|XP_001641730.1| predicted protein [Nematostella vectensis]
 gi|156228870|gb|EDO49667.1| predicted protein [Nematostella vectensis]
          Length = 647

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 107/333 (32%), Positives = 163/333 (48%), Gaps = 25/333 (7%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           LS ++ YD+   + DGK     SG +HY R     W + + K K  G+  ++TYV WN H
Sbjct: 18  LSFSIDYDNNCFMKDGKPFRYISGGMHYFRVPQYYWKDRLLKLKASGMNTVQTYVPWNLH 77

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EPI  QY F G  +L  F++  Q   L + LR GPY CAEW++GG P WL   P I  R+
Sbjct: 78  EPIPKQYNFAGNANLTSFLEIAQSLDLLVILRPGPYICAEWDFGGLPGWLLKDPSIVIRS 137

Query: 121 TN-NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGN---VEWAYGVG-GEL 175
           +    + E +  +++ ++ L+K        GGP+I+ QVENEYG+    +  Y +   +L
Sbjct: 138 SQGKAYMEAVDAWMSVLLPLVKP--FLYENGGPVIMVQVENEYGDYIHCDHQYMLHLQQL 195

Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSP---------SKPI 226
           +     D  +   T     +   E    P + T   F  +   P+ P           P+
Sbjct: 196 FRYHLTDDIILFTTDDGSNLTAIECGTLPSLYTTVDFGANT-DPSIPFANQRKLQQKGPL 254

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL- 285
           + +E Y+GW   +G     R  + +A A+ +      +  N YM+ GGTNFG  +G    
Sbjct: 255 VNSEFYTGWLDYWGTPHQTRTSKVVADALDKILALNASV-NLYMFEGGTNFGFWSGADFH 313

Query: 286 -----VATSYDYDAPIDEYGFIRQPKWGHLREL 313
                V TSYDYDAP+ E G + + K+  +RE+
Sbjct: 314 GQYQPVPTSYDYDAPLTEAGDLTE-KYHAIREV 345


>gi|383939096|ref|ZP_09992284.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae SK674]
 gi|418972932|ref|ZP_13520979.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae ATCC
           BAA-960]
 gi|383350776|gb|EID28631.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae ATCC
           BAA-960]
 gi|383714006|gb|EID70024.1| glycosyl hydrolase family 35 [Streptococcus pseudopneumoniae SK674]
          Length = 595

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKPFKILSGAIHYFRIPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|418518035|ref|ZP_13084189.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
           GSPB1386]
 gi|410705285|gb|EKQ63761.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
           GSPB1386]
          Length = 613

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 116/322 (36%), Positives = 156/322 (48%), Gaps = 29/322 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  HNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  + + +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK-- 273

Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
           P    +  A   A  FE     G   N YM+ GGT+FG   G               TSY
Sbjct: 274 PHAATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331

Query: 291 DYDAPIDEYGFIRQPKWGHLRE 312
           DYDA +DE G    PK+  +R+
Sbjct: 332 DYDAILDEAGHP-TPKFALMRD 352


>gi|423259078|ref|ZP_17240001.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
           CL07T00C01]
 gi|423263951|ref|ZP_17242954.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
           CL07T12C05]
 gi|387776658|gb|EIK38758.1| hypothetical protein HMPREF1055_02278 [Bacteroides fragilis
           CL07T00C01]
 gi|392706217|gb|EIY99340.1| hypothetical protein HMPREF1056_00641 [Bacteroides fragilis
           CL07T12C05]
          Length = 773

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 105/332 (31%), Positives = 158/332 (47%), Gaps = 29/332 (8%)

Query: 10  RALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYF 69
           R  +++G   V+++  +HY R     W   I   K  G+  I  Y+FWNYHE   G++ F
Sbjct: 31  RTFLLNGNPFVVKAAELHYARIPEPYWEHRILMCKALGMNTICLYMFWNYHEQQEGKFDF 90

Query: 70  EGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEM 129
            G  ++ +F K  Q+ G+++ LR GPY CAEW  GG P WL     ++ R+ N  F E  
Sbjct: 91  SGEKNVAKFCKLAQKHGMYIILRPGPYVCAEWEMGGLPWWLLKEKDMKVRSLNPYFMERT 150

Query: 130 KRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG-------ELYVKWAAD 182
           + F+ ++   +    L  + GG II+ QVENE+G     YGV         ++  +   D
Sbjct: 151 EIFMKELGKQLAPLQL--ANGGNIIMVQVENEFG----GYGVDKPYMTAIRDIVCRAGFD 204

Query: 183 TAVNLNTSVPWVMCQQEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGW 235
            +V       W    + +A D ++ T N   G   D      +   P  P+M +E +SGW
Sbjct: 205 KSVLFQCD--WDSTFELNALDDLLWTLNFGTGANIDKEFKKLSTVRPDTPLMCSEFWSGW 262

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
           F  +G     RP E +   +    +   +F + YM  GGT FG   G        + +SY
Sbjct: 263 FDHWGRKHETRPAEKMVEGIKDMLDRNISF-SLYMTHGGTTFGHWGGANSPTYSAMCSSY 321

Query: 291 DYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           DYDAPI E G+   PK+  L+EL    +  EE
Sbjct: 322 DYDAPISEAGWT-TPKYYLLQELLGKYRSPEE 352



 Score = 41.2 bits (95), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 56/254 (22%), Positives = 99/254 (38%), Gaps = 50/254 (19%)

Query: 440 PDL--AEQINTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYG 497
           PD   +EQ+   +D +           +P       L I      A ++ + KL+ +   
Sbjct: 382 PDFVQSEQVKPMEDFNQGWGSILYRTTLPATEANTLLRITEAHDWAQIYADGKLLGYLDR 441

Query: 498 NHDFANFLINKKIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDL 557
             D    ++    +L EG   LDI    +G  N+G+           V LI  K  K+ +
Sbjct: 442 RKDDNQVILP---QLPEGTQ-LDIWVEAMGRVNFGSTVHDRKGITEKVELI--KPDKQAV 495

Query: 558 SSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLAL 616
           +   W +Y + V+ ++    K S +NS            +   +YK TF   +  G   +
Sbjct: 496 TLKNWKVYSIPVDYKFAARKKYS-SNSR----------PEGPAYYKATFNLTK-TGDTFI 543

Query: 617 NLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHI 676
           ++++ GKG  WVNG ++GR+W                               P QTL+ +
Sbjct: 544 DMSTWGKGMVWVNGHALGRFWEI----------------------------GPQQTLF-L 574

Query: 677 PRTWVHPGENLLVI 690
           P  W+  G+N +++
Sbjct: 575 PGCWLKKGKNEIIV 588


>gi|418519416|ref|ZP_13085468.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
           GSPB2388]
 gi|410704860|gb|EKQ63339.1| beta-galactosidase [Xanthomonas axonopodis pv. malvacearum str.
           GSPB2388]
          Length = 613

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 112/319 (35%), Positives = 152/319 (47%), Gaps = 23/319 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  HNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  + + +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPH 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
                   A         G +  N YM+ GGT+FG   G               TSYDYD
Sbjct: 276 AATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYD 334

Query: 294 APIDEYGFIRQPKWGHLRE 312
           A +DE G    PK+  +R+
Sbjct: 335 AILDEAGHP-TPKFALMRD 352


>gi|342162833|ref|YP_004767472.1| beta-galactosidase [Streptococcus pseudopneumoniae IS7493]
 gi|341932715|gb|AEL09612.1| beta-galactosidase (Lactase) [Streptococcus pseudopneumoniae
           IS7493]
          Length = 595

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKPFKILSGAIHYFRIPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313



 Score = 42.0 bits (97), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 83/205 (40%), Gaps = 57/205 (27%)

Query: 513 NEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLK---NGKRDLSSGEWIYQVG 567
            +G++ LDIL   +G  NYG  F  D    G+ + +  DL    N K         Y + 
Sbjct: 437 KKGLSRLDILIENMGRVNYGHKFLADTQRKGIRTGVCKDLHFLLNWKH--------YPLP 488

Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
           ++      +KI  +    W QG          +Y   F   E K    L+L+  GKG A+
Sbjct: 489 LDNP----EKIDFSKG--WTQGQP-------AFYAYDFTVEEPKDTY-LDLSEFGKGVAF 534

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNGQ++GR+W+                              P  +LY IP +++  G N 
Sbjct: 535 VNGQNLGRFWNV----------------------------GPTLSLY-IPHSYLKEGANR 565

Query: 688 LVIHEELGGDPSKISLLTK-TGQHI 711
           ++I E  G    +I L  K T +HI
Sbjct: 566 IIIFETEGQYKEEIHLTRKPTLKHI 590


>gi|417848939|ref|ZP_12494871.1| beta-galactosidase [Streptococcus mitis SK1080]
 gi|339457687|gb|EGP70254.1| beta-galactosidase [Streptococcus mitis SK1080]
          Length = 595

 Score =  149 bits (376), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            ++   +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLFPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|298384202|ref|ZP_06993762.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
 gi|383123627|ref|ZP_09944306.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
 gi|251839745|gb|EES67828.1| hypothetical protein BSIG_3219 [Bacteroides sp. 1_1_6]
 gi|298262481|gb|EFI05345.1| beta-galactosidase (Lactase) [Bacteroides sp. 1_1_14]
          Length = 624

 Score =  149 bits (376), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 106/338 (31%), Positives = 160/338 (47%), Gaps = 50/338 (14%)

Query: 16  GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
           G+   + SG +HY R   + W   ++  K  GL  + TYVFWN HE   G++ F G  +L
Sbjct: 35  GEEIPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94

Query: 76  VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
             +++   E G+ + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K+++ +
Sbjct: 95  AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKYIDR 154

Query: 136 IIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-PWV 194
           + + +   +L  ++GGPII+ Q ENE+G+           YV    D  +  + S    +
Sbjct: 155 LYEEVG--DLQCTKGGPIIMVQCENEFGS-----------YVSQRKDIPLEEHRSYNAKI 201

Query: 195 MCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIMWT 229
             Q  DA    P+  +   +  +G        T N  S                 P M  
Sbjct: 202 KGQLADAGFTIPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGDKGPYMVA 261

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA-- 287
           E YSGW   +G   P     ++A     + +   +F N+YM  GGTNFG T+G       
Sbjct: 262 EFYSGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKR 320

Query: 288 ------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
                 TSYDYDAPI E G++  PK+  +R  + K +K
Sbjct: 321 DIQPDLTSYDYDAPISEAGWL-TPKYDSIRSVIQKYVK 357


>gi|170034400|ref|XP_001845062.1| beta-galactosidase [Culex quinquefasciatus]
 gi|167875695|gb|EDS39078.1| beta-galactosidase [Culex quinquefasciatus]
          Length = 611

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 115/394 (29%), Positives = 182/394 (46%), Gaps = 46/394 (11%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           ++ Y+    ++DG+     SGS HY R+ P  W  ++R  +  GL  + TY+ W+ HEP 
Sbjct: 10  SIDYERDTFLLDGEPFRFISGSFHYFRALPGSWRHILRAMRAAGLNAVMTYIEWSTHEPT 69

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTN 122
            G Y +    DL +F++  +E  L++ LR GPY CAE + GGFP W L   P I+ RT +
Sbjct: 70  EGDYRWNEIADLEQFIRIAEEENLYVILRPGPYICAERDMGGFPYWLLTKFPNIKLRTQD 129

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA- 181
           + +  E++++ + ++  +++      +GGP+I+  +ENEYG    ++    + Y+K+   
Sbjct: 130 SDYMREVQKWYSVLMPRIQK--YLYGRGGPVIMVSIENEYG----SFSACDKTYLKFLKN 183

Query: 182 --------DTAVNLNTSVPWVMCQQEDAPDPIINTCN-------GFYCDGFTPNSPSKPI 226
                   D  +  N     + C +      I+ T +         Y        P  P+
Sbjct: 184 MTESYIQYDAVLFTNDGPEQLNCGRIPG---ILATLDFGSTGSPERYWQKLRKVQPKGPL 240

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTA----G 282
           +  E Y GW   +   +  R          R     G   N+YM+FGGTNF  TA    G
Sbjct: 241 VNAEFYPGWLTHWMEPMA-RTATGPVVDTLRLMLNQGANVNFYMFFGGTNFAFTAGANDG 299

Query: 283 GP----LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKL-GAK 337
           GP       TSYDYDAP+DE G    PK+  LR++     + E       P  QKL   K
Sbjct: 300 GPGKFNTDITSYDYDAPLDEAG-DPTPKYFALRDV-----ILEYMPDPGVPVPQKLPKMK 353

Query: 338 LEAHIYHK----SSNDCAAFLANYDSSSDANVTF 367
           L      +    +SN+    LA Y  ++D  ++F
Sbjct: 354 LPPVTLTQYGFLTSNEARQALAKYIFTNDRTLSF 387


>gi|387790696|ref|YP_006255761.1| beta-galactosidase [Solitalea canadensis DSM 3403]
 gi|379653529|gb|AFD06585.1| beta-galactosidase [Solitalea canadensis DSM 3403]
          Length = 790

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 104/309 (33%), Positives = 150/309 (48%), Gaps = 28/309 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++GK  ++++G IH+PR   E W   I+  K  G+  I  Y+FWN+HE    Q+ F G
Sbjct: 45  FLLNGKPFLIRAGEIHFPRIPREYWDHRIKLCKAMGMNTICIYLFWNFHEQKPDQFDFTG 104

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
           + D+  FVK VQ  G++  +R GPYACAEW+ GG P WL   P ++ RT  + +   M+R
Sbjct: 105 QKDVAAFVKLVQANGMYCIVRPGPYACAEWDMGGLPWWLLKKPDLKVRTLEDRY--FMER 162

Query: 132 FLAKIIDLMKQENLFASQ-GGPIILAQVENEYGNVEWAYGVGGELY------VKWAADTA 184
               + ++ KQ  L   Q GG II+ QVENEY     A+G   E        +K A    
Sbjct: 163 SAKYLKEVGKQLALLQIQNGGNIIMVQVENEYA----AFGNSAEYMDANRKNLKDAGFNK 218

Query: 185 VNLNTSVPWVMCQQEDAPDP----IINTCNGFYCD----GFTPNSPSKPIMWTENYSGWF 236
           V L     W         DP     +N   G   D    GF    P+ P+M +E ++GWF
Sbjct: 219 VQL-MRCDWSSTFNSYITDPEVAITLNFGAGSDVDKQFKGFQEKHPTAPLMCSEYWTGWF 277

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYD 291
             +G     R +     ++    +   +F + YM  GGT FG+  G        +  SYD
Sbjct: 278 DHWGRPHETRSINSFIGSLKDMMDRKISF-SLYMAHGGTTFGQWGGANSPPYSAMVASYD 336

Query: 292 YDAPIDEYG 300
           Y+API E G
Sbjct: 337 YNAPIGEQG 345



 Score = 43.9 bits (102), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 48/167 (28%), Positives = 70/167 (41%), Gaps = 23/167 (13%)

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG- 532
           L I  +   A VF+N KLV    G  D        +I   +    LDIL    G  N+G 
Sbjct: 432 LIITEVHDWAQVFINGKLV----GKLDRRRADSTIEIPATKAGAVLDILVEATGRVNFGE 487

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGST 591
           A  D  G        +++ +G        W +Y   V+ ++        AN+ F KQ   
Sbjct: 488 AVIDRKGI----TEKVEISDGSTVQELKNWTVYNFPVDYQF-------QANAKFVKQKVN 536

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWS 638
            P      WY+  F   +  G   ++L++ GKG  WVNG +IGR+W 
Sbjct: 537 GPA-----WYRAKFNLNQ-TGDTYIDLSTWGKGMIWVNGYNIGRFWK 577


>gi|313238883|emb|CBY13879.1| unnamed protein product [Oikopleura dioica]
          Length = 601

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 105/327 (32%), Positives = 153/327 (46%), Gaps = 36/327 (11%)

Query: 13  VIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGR 72
           +   K R+L SGS+HY R   E W + + K K  GL  ++TY+ WN HEP  G + FE  
Sbjct: 12  LFKSKTRIL-SGSLHYFRVPKEYWRDRLEKLKGAGLNTVQTYIGWNLHEPREGDFIFEDE 70

Query: 73  FDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN-PFKEEMKR 131
            D+  F+K  ++ GL++ +R GPY CAEW +GGFP WL     +  R T +  +   ++ 
Sbjct: 71  LDVSEFLKIAKDVGLYVIMRPGPYICAEWEWGGFPAWLLTKENMIVRQTKSEAYLAAVQN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +   ++      S+GGPII  QVENEY     +Y    E Y+ W  +   ++    
Sbjct: 131 WFTVLFSQLRDHQW--SRGGPIISIQVENEYA----SYNKDSE-YLPWVKNLLTDVGKCF 183

Query: 192 PWVMCQQED--------APDPII-----NTCNGF-YCDGFTPNSPSKPIMWTENYSGWFL 237
              +  + +         PD  +     +  N F   D   PN   +P M TE ++GWF 
Sbjct: 184 LLKIINETNFFLKGAHLLPDTFLTANFQSVGNAFEVLDKLQPN---RPKMVTEFWAGWFD 240

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPLVAT 288
            +G                R     G+  N YM+ GGT+FG  AG         G    T
Sbjct: 241 HWGQQGHSLLSPTTFNKTMREILNAGSSVNQYMFHGGTSFGWMAGSNWLSKKQRGTSDTT 300

Query: 289 SYDYDAPIDEYGFIRQPKWGHLRELHK 315
           SYDYDAP+ E G + + KW   RE+ K
Sbjct: 301 SYDYDAPLSESGDLTE-KWNVTREIIK 326


>gi|256396208|ref|YP_003117772.1| beta-galactosidase [Catenulispora acidiphila DSM 44928]
 gi|256362434|gb|ACU75931.1| Beta-galactosidase [Catenulispora acidiphila DSM 44928]
          Length = 625

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 102/335 (30%), Positives = 160/335 (47%), Gaps = 40/335 (11%)

Query: 10  RALVIDGKRRV-------LQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           R L IDG R +       + S +IHY R  P++W + +++ +  G   +E Y+ WN+H+P
Sbjct: 5   RVLTIDGGRFLRGGREHRIVSAAIHYFRIHPDLWRDRLQRLRAMGCNTVECYIAWNFHQP 64

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
                 F+G  D+  FV+   E G  +  R GPY CAEW++GG P WL     ++ RTT+
Sbjct: 65  TPAAPRFDGWRDVAGFVRLAGELGFDVIARPGPYICAEWDFGGLPAWLLADENVRLRTTD 124

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGEL------- 175
             +   +  +  ++I ++ +  L A++GGP++  Q+ENEYG+    +G   +        
Sbjct: 125 PVYLAAVDAWFDELIPVLAE--LQATRGGPVVAVQIENEYGS----FGADPDYLDHLRKG 178

Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD----GFTPNSPSKPIMWTEN 231
            ++   DT +  +     +M      PD +     G   D          P  P +  E 
Sbjct: 179 LIERGVDTLLFTSDGPQELMLAGGTVPDVLATVNFGSRADEAFATLRRVRPDDPPVCMEF 238

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
           ++GWF  FG     R  +D A ++      GG+  N+YM  GGTNFG  AG         
Sbjct: 239 WNGWFDHFGEPHHTRSAQDAARSLDEILAAGGSV-NFYMGHGGTNFGFWAGANHSGVGTG 297

Query: 284 -----PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
                P + TSYDYDAP+ E G +  PK+   RE+
Sbjct: 298 DPGYQPTI-TSYDYDAPVGEAGEL-TPKFHLFREV 330


>gi|319900291|ref|YP_004160019.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
 gi|319415322|gb|ADV42433.1| Beta-galactosidase [Bacteroides helcogenes P 36-108]
          Length = 629

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 101/336 (30%), Positives = 159/336 (47%), Gaps = 49/336 (14%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           ++GK+  + SG +HY R   + W   ++  K  GL  + TYVFWN+HE   G++ F G  
Sbjct: 38  LNGKQTPILSGEMHYARIPHQYWRHRLQMMKGMGLNAVATYVFWNHHETEPGKWDFTGDK 97

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  ++KT  E G+ + LR GPY CAEW +GG+P WL  +PG++ R  N  F +  + ++
Sbjct: 98  NLAEYIKTAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVPGMEIRRDNPQFLKHTEAYI 157

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
            ++   +   +L  ++GGPI++ Q ENE+G+           YV    D  +  + +   
Sbjct: 158 QRLYKEVG--HLQCTKGGPIVMVQCENEFGS-----------YVAQRKDITLQEHRAYNA 204

Query: 194 VMCQQ--EDAPDPIINTCNGFY------CDGFTPNSPSK------------------PIM 227
            + QQ  +   D  + T +G +       +G  P +  +                  P M
Sbjct: 205 KIKQQLADAGFDVPLFTSDGSWLFEGGSTEGALPTANGETDIANLKKVVNQYHGGQGPYM 264

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
             E Y GW   +    P      +A     + +   +F N YM  GGTNFG T+G     
Sbjct: 265 VAEFYPGWLSHWAEPFPQVSASSVARTTESYLKNDVSF-NVYMVHGGTNFGFTSGANYDK 323

Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRELHK 315
                   TSYDYDAPI E G++  PK+  +R + K
Sbjct: 324 KRDIQPDLTSYDYDAPISEAGWV-TPKYDSIRAVIK 358


>gi|315500613|ref|YP_004089415.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
 gi|315418625|gb|ADU15264.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
          Length = 785

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 116/367 (31%), Positives = 169/367 (46%), Gaps = 37/367 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DG+   ++ G +H+PR   E WP  ++  K  GL  +  Y+FWNYHE   GQ+ +EG
Sbjct: 42  FLLDGRPIQIRCGEMHFPRVPREYWPHRLKMIKAMGLNAVCAYLFWNYHEWNEGQFDWEG 101

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQF-RTTNNPFKEEMK 130
           + D   F +  Q+ GL++ LR GPYACAEW  GG P WL    G  F RT    F     
Sbjct: 102 QRDAAAFCRMAQKEGLWVILRPGPYACAEWEMGGLPWWLLKAEGDAFLRTRAEAFTGPAH 161

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGV-------GGELYVKW 179
           R++ ++   +    L  ++GGPI++ QVENEYG    ++E+  G+       G ++ +  
Sbjct: 162 RWIEEVGRHLGP--LQVTKGGPILMVQVENEYGFFGNDLEYLQGMRKAVEQAGFDVPLFQ 219

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPI--INTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
              T V   T +P ++       DP    NT                P+M  E YSGWF 
Sbjct: 220 CNPTHVVAKTHIPELLSVANFGNDPETGFNTLRAVQ---------RAPLMCGEYYSGWFD 270

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--GPLV--ATSYDYD 293
            +G       V+     +    +  G+F + YM  GGT+FG   G   P     TSYDYD
Sbjct: 271 VWGAGHRTGGVQSSVADIKWMLQQNGSF-SLYMAHGGTSFGLWGGCDRPFQPDTTSYDYD 329

Query: 294 APIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIYHKSSNDCAAF 353
           API E G I + K+   R   +      E L +  P    +       +   S  +CA  
Sbjct: 330 APISEAGRIGE-KFEAYRSAMRPFLKAGERLPAPPPQKDTMA------LAPFSLEECAPV 382

Query: 354 LANYDSS 360
            A Y S+
Sbjct: 383 SAGYTSN 389


>gi|410972395|ref|XP_003992645.1| PREDICTED: beta-galactosidase-1-like protein 3 [Felis catus]
          Length = 664

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 111/324 (34%), Positives = 160/324 (49%), Gaps = 29/324 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             + G + ++  GSIHY R   E W + + K K  G   + TYV WN HEP RG++ F G
Sbjct: 91  FTLGGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTLTTYVPWNLHEPQRGKFDFSG 150

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  FV    E GL++ LR GPY C+E + GG P WL   P +  RTT   F E + +
Sbjct: 151 NLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPKMILRTTYKGFVEAVNK 210

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   +   L   + GPII  QVENEYG+  +A       Y++ A      L   +
Sbjct: 211 YFDHLIS--RVVPLQYRKRGPIIAVQVENEYGS--FAEDKDYMPYIQKAL-----LERGI 261

Query: 192 PWVMCQQEDAP-------DPIINTCN--GFYCDGFTPNSP---SKPIMWTENYSGWFLSF 239
             ++   +DA        + ++ T N   F  + F   S    +KPIM  E + GWF ++
Sbjct: 262 VELLMTSDDAKHMLKGYIEGVLATINMNTFQINDFKQLSQVQRNKPIMVMEFWVGWFDTW 321

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYD 293
           G     +  ED+   V++F  +  +F N YM+ GGTNFG   G         V TSYDYD
Sbjct: 322 GGKHMIKNAEDVEDTVSKFITSEISF-NVYMFHGGTNFGFMNGATYFGKHRGVVTSYDYD 380

Query: 294 APIDEYGFIRQPKWGHLRELHKAI 317
           A + E G   + K+  LR+L  ++
Sbjct: 381 AVLTEAGDYTE-KYFKLRKLFGSV 403


>gi|422694237|ref|ZP_16752232.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
 gi|315148319|gb|EFT92335.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4244]
          Length = 593

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R     
Sbjct: 242 WDGWFNRWGEPVIHREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGEKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|60683116|ref|YP_213260.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60494550|emb|CAH09349.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 769

 Score =  149 bits (375), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)

Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
           ++L  GK+      W +Y   V+         S      +K G+  T+P      +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F   +  G   L++++ GKG  WVNG +IGR+W                           
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|375359947|ref|YP_005112719.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
 gi|301164628|emb|CBW24187.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
          Length = 769

 Score =  149 bits (375), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)

Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
           ++L  GK+      W +Y   V+         S      +K G+  T+P      +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F   +  G   L++++ GKG  WVNG +IGR+W                           
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|265767009|ref|ZP_06094838.1| beta-galactosidase [Bacteroides sp. 2_1_16]
 gi|263253386|gb|EEZ24862.1| beta-galactosidase [Bacteroides sp. 2_1_16]
          Length = 769

 Score =  149 bits (375), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)

Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
           ++L  GK+      W +Y   V+         S      +K G+  T+P      +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F   +  G   L++++ GKG  WVNG +IGR+W                           
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|257866484|ref|ZP_05646137.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
 gi|257873001|ref|ZP_05652654.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
 gi|257800442|gb|EEV29470.1| glycosyl hydrolase [Enterococcus casseliflavus EC30]
 gi|257807165|gb|EEV35987.1| glycosyl hydrolase [Enterococcus casseliflavus EC10]
          Length = 591

 Score =  149 bits (375), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 117/352 (33%), Positives = 168/352 (47%), Gaps = 43/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   L SG+IHY R T   W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLDGKPIKLISGAIHYFRMTSAQWADSLYNLKALGANTVETYIPWNLHEPREGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FVK  Q  GL + LR   Y CAEW +GG P WL   P ++ R+T+  F  +++ 
Sbjct: 70  MKDIFAFVKQAQALGLMVILRPSVYICAEWEFGGLPAWLLNEP-MRLRSTDPRFMAKVRN 128

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  + GGP+I+ QVENEYG    +YG+  + Y++   +        V
Sbjct: 129 YFQVL--LPKLVPLQITHGGPVIMMQVENEYG----SYGM-EKAYLRQTKELMEECGIDV 181

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF-TPNSPSK-------------------PIMWTEN 231
           P  +   + A + +++       D F T N  S+                   PIM  E 
Sbjct: 182 P--LFTSDGAWEEVLDAGTLIEDDVFVTGNFGSRSKENAAVMKEFMAKHGKNWPIMCMEY 239

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL-- 285
           + GWF  +G  +  R  +DLA  V      G    N YM+ GGTNFG + G    G L  
Sbjct: 240 WDGWFNRWGEPIIKRDGQDLANEVKEMLAVGSL--NLYMFHGGTNFGFSNGCSARGALDL 297

Query: 286 -VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGA 336
              +SYDYDA + E G   +P      ++ KAIK     +  ++P  ++L A
Sbjct: 298 PQVSSYDYDALLTEAG---EPT-DKYYQVQKAIKEACPEVWQANPRTKQLAA 345


>gi|225872227|ref|YP_002753682.1| glycosyl hydrolase [Acidobacterium capsulatum ATCC 51196]
 gi|225791474|gb|ACO31564.1| glycosyl hydrolase, family 35 [Acidobacterium capsulatum ATCC
           51196]
          Length = 664

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 106/305 (34%), Positives = 149/305 (48%), Gaps = 19/305 (6%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V+DG+   + SG +HY R     W   ++ +K  GL  I TYVFWN HEP  G++ F G
Sbjct: 37  FVLDGQPFQIISGEMHYERIPRAYWKARLQMAKAMGLNTIATYVFWNLHEPEPGKFDFSG 96

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFR-TTNNPFKEEMK 130
             DL +F++  Q+ GL + LR GPY+CAEW +GGFP WL   P +Q    +N+P  E MK
Sbjct: 97  NADLAQFIRDAQQTGLKVLLRAGPYSCAEWEFGGFPAWLMKNPKMQTALRSNDP--EFMK 154

Query: 131 RFLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNL 187
                I+ L ++   L    GGPII  Q+ENEYG+   + AY    +     A  T   L
Sbjct: 155 PAEQWILRLGREVAPLQVGYGGPIIGVQIENEYGDFGGDAAYLEHLKKIFLKAGFTQSLL 214

Query: 188 NTSVPWVMCQQEDAPD--PIINTCNGFYC---DGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            T+ P     +   P     +N   G      D        +P++ +E ++GWF  +G  
Sbjct: 215 YTANPSRALVRGSIPGVYSAVNFAPGHAAQALDSLAQLRAGQPLLSSEYWTGWFDHWGEP 274

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV-------ATSYDYDAP 295
              +P+  L      +    G   N YM+ GGT+FG  +G            TSYDY AP
Sbjct: 275 HQSKPLS-LQVKDFNYILRHGAGVNLYMFHGGTSFGMMSGSSWTKHQFLPDVTSYDYGAP 333

Query: 296 IDEYG 300
           +DE G
Sbjct: 334 LDEAG 338


>gi|219847209|ref|YP_002461642.1| beta-galactosidase [Chloroflexus aggregans DSM 9485]
 gi|219541468|gb|ACL23206.1| Beta-galactosidase [Chloroflexus aggregans DSM 9485]
          Length = 898

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 151/315 (47%), Gaps = 17/315 (5%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           ++A V    + + +D +   L SG IHY R     W  L+ +++  GL  I+T + WN H
Sbjct: 1   MNATVRVGRQGIELDSRPFYLLSGCIHYFRWPRAEWRPLLEQARWAGLNTIDTVIPWNRH 60

Query: 61  EPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           EP  G + F    DL  F+    + GL + +R GPY CAEW  GG P WL     ++ RT
Sbjct: 61  EPQPGVFDFADEADLGAFLDLCHDLGLKVIVRPGPYICAEWENGGLPAWLTANGDLRLRT 120

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGV-GGELYVKW 179
            +  F   + R+   ++ ++       ++GGPIIL Q+ENE+    WA GV G + + + 
Sbjct: 121 NDPVFLSAVLRWFDTLMPILVPRQ--HTRGGPIILCQIENEH----WASGVYGADEHQQT 174

Query: 180 AADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNS---PSKPIMWTENYSGWF 236
            A  A      VP   C       P          +         P  P++ +E +SGWF
Sbjct: 175 LARAAFERGIEVPQYTCMGATPGYPEFRNGWSGIAEKLVQTRQLWPDNPLIVSELWSGWF 234

Query: 237 LSF-GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV--ATS 289
            ++ G+    +    L   + +    G    +++M+ GGTNF    GRT GG L+   T 
Sbjct: 235 DNWGGHRQTRKSAAKLDMILHQLTAVGCAGFSHWMWAGGTNFGYWGGRTVGGDLIHMTTG 294

Query: 290 YDYDAPIDEYGFIRQ 304
           YDYDAPIDEYG + +
Sbjct: 295 YDYDAPIDEYGRLTE 309


>gi|53715181|ref|YP_101173.1| beta-galactosidase [Bacteroides fragilis YCH46]
 gi|52218046|dbj|BAD50639.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
          Length = 769

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)

Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
           ++L  GK+      W +Y   V+         S      +K G+  T+P      +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F   +  G   L++++ GKG  WVNG +IGR+W                           
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|423285593|ref|ZP_17264475.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
           615]
 gi|404579108|gb|EKA83826.1| hypothetical protein HMPREF1204_04013 [Bacteroides fragilis HMW
           615]
          Length = 769

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 58/237 (24%), Positives = 96/237 (40%), Gaps = 55/237 (23%)

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG- 532
           + I  +   A VF + KL+A    +     F +   + L +G   +DIL   +G  N+  
Sbjct: 414 MKITEVHDWAQVFADGKLLA--RLDRRRGEFALQLPV-LKKGTR-IDILVEAMGRVNFDE 469

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS- 590
           +  D  G        ++L  GK+      W +Y   V+         S      +K G+ 
Sbjct: 470 SIHDRKGI----TEKVELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTA 517

Query: 591 -TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
            T+P      +Y+TTF   +  G   L++++ GKG  WVNG +IGR+W            
Sbjct: 518 QTMPA-----YYRTTFRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI---------- 561

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                              P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|423260608|ref|ZP_17241530.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
           CL07T00C01]
 gi|423266742|ref|ZP_17245744.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
           CL07T12C05]
 gi|387775162|gb|EIK37271.1| hypothetical protein HMPREF1055_03807 [Bacteroides fragilis
           CL07T00C01]
 gi|392699974|gb|EIY93143.1| hypothetical protein HMPREF1056_03431 [Bacteroides fragilis
           CL07T12C05]
          Length = 769

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 58/237 (24%), Positives = 96/237 (40%), Gaps = 55/237 (23%)

Query: 474 LNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG- 532
           + I  +   A VF + KL+A    +     F +   + L +G   +DIL   +G  N+  
Sbjct: 414 MKITEVHDWAQVFADGKLLA--RLDRRRGEFALQLPV-LKKGTR-IDILVEAMGRVNFDE 469

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS- 590
           +  D  G        ++L  GK+      W +Y   V+         S      +K G+ 
Sbjct: 470 SIHDRKGI----TEKVELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTA 517

Query: 591 -TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTK 649
            T+P      +Y+TTF   +  G   L++++ GKG  WVNG +IGR+W            
Sbjct: 518 QTMPA-----YYRTTFRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI---------- 561

Query: 650 KCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                              P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|431919435|gb|ELK17954.1| Beta-galactosidase [Pteropus alecto]
          Length = 675

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 111/344 (32%), Positives = 154/344 (44%), Gaps = 26/344 (7%)

Query: 5   VTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIR 64
           + Y+H   + DG+     SGSIHY R     W + + K K  GL  I+ YV WN+HEP  
Sbjct: 54  IDYNHNRFLKDGQPFRYISGSIHYFRVPRFYWKDRLLKMKMAGLNAIQVYVPWNFHEPQP 113

Query: 65  GQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNP 124
           GQY F    D+  F++   E  L + LR GPY CAEW  GG P WL    GI  R+++  
Sbjct: 114 GQYQFSEDHDVEHFIQLAHELTLLVILRPGPYICAEWEMGGLPAWLLQKEGIILRSSDPD 173

Query: 125 FKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADT- 183
           + E + ++L  I+  MK        GGPII  QVENEYG    +Y      Y+++   + 
Sbjct: 174 YLEAVDKWLGVILPKMKP--FLYQNGGPIITVQVENEYG----SYFTCDYDYLRFLQKSF 227

Query: 184 AVNLNTSVPWVMCQQEDAPDPIINTCNGFYCD-GFTPNS-------------PSKPIMWT 229
             +L   V            P   T  G Y    F P +             P  P++ +
Sbjct: 228 RYHLGNDVILFTTDGVYKDLPHCGTLQGLYSTVDFGPGANITDAFLLQRKYEPKGPLINS 287

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLVA 287
           E Y+GW   +G        E +  ++      G    N YM+ GGTNF    G   P  A
Sbjct: 288 EFYTGWLDHWGQPHSTVTTEAVVSSLHDILAHGANV-NLYMFIGGTNFAYWNGANIPYQA 346

Query: 288 --TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
             TSYDYDAP+ E G + +  +     + K  K+ E  +  S P
Sbjct: 347 QPTSYDYDAPLSEAGDLTKKYFAVRDVIQKFQKVPEGPIPPSTP 390


>gi|198475912|ref|XP_002132214.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
 gi|198137462|gb|EDY69616.1| GA25341 [Drosophila pseudoobscura pseudoobscura]
          Length = 672

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 105/328 (32%), Positives = 158/328 (48%), Gaps = 45/328 (13%)

Query: 6   TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           T DH +   V++G+     +GS HY R+ PE W   +R  +  GL  ++TYV W+ H P 
Sbjct: 48  TIDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNPH 107

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTN 122
            G Y +EG  D+V+F++  QE   ++ LR GPY CAE + GG P WL    P I+ RT++
Sbjct: 108 DGVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTSD 167

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
           + +  E+ ++ A+++   + ++L    GG II+ QVENEYG+ E       + Y+ W  D
Sbjct: 168 SNYMAEVGKWYAELMP--RLQHLLIGNGGKIIMVQVENEYGDYEC-----DKDYLNWLRD 220

Query: 183 --------TAVNLNTSVP--WVMCQQEDAPDPIINTCNGFYCDG----------FTPNSP 222
                    A+   T +P   + C + D     +     F  D                P
Sbjct: 221 ETEKYVNGNALLFTTDIPNERMSCGKIDN----VFATTDFGIDRIHEIDDIWAMLRKLQP 276

Query: 223 SKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG 282
           + P++ +E Y GW   +      R  + +A A+        +  N YM+FGGTNFG TAG
Sbjct: 277 TGPLVNSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAG 335

Query: 283 G----------PLVATSYDYDAPIDEYG 300
                          TSYDYDA +DE G
Sbjct: 336 ANYNLDGGVGYAADITSYDYDAVMDEAG 363


>gi|78048770|ref|YP_364945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
 gi|78037200|emb|CAJ24945.1| beta-galactosidase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
          Length = 650

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 115/320 (35%), Positives = 152/320 (47%), Gaps = 25/320 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 76  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 135

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 136 NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQS 195

Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNL 187
           +L     L KQ + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L
Sbjct: 196 YLDA---LAKQVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-L 251

Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G  
Sbjct: 252 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 311

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
                    A         G +  N YM+ GGT+FG   G               TSYDY
Sbjct: 312 HAATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDY 370

Query: 293 DAPIDEYGFIRQPKWGHLRE 312
           DA +DE G    PK+  +R+
Sbjct: 371 DAILDEAGHP-TPKFALMRD 389


>gi|195146534|ref|XP_002014239.1| GL19091 [Drosophila persimilis]
 gi|194106192|gb|EDW28235.1| GL19091 [Drosophila persimilis]
          Length = 672

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 107/328 (32%), Positives = 159/328 (48%), Gaps = 45/328 (13%)

Query: 6   TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           T DH +   V++G+     +GS HY R+ PE W   +R  +  GL  ++TYV W+ H P 
Sbjct: 48  TIDHESNSFVLNGEPFRYVAGSFHYFRAVPEAWRSRLRTMRASGLNAVDTYVEWSLHNPH 107

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTN 122
            G Y +EG  D+V+F++  QE   ++ LR GPY CAE + GG P WL    P I+ RT++
Sbjct: 108 DGVYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTSD 167

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
           + +  E+ ++ A+++   + ++L    GG II+ QVENEYG+ E       + Y+ W  D
Sbjct: 168 SNYMAEVGKWYAELMP--RLQHLLIGNGGKIIMVQVENEYGDYEC-----DKDYLNWLRD 220

Query: 183 TA---VNLN-----TSVP--WVMCQQEDAPDPIINTCNGFYCDG----------FTPNSP 222
                VN N     T +P   + C + D     +     F  D                P
Sbjct: 221 ETEKYVNRNALLFTTDIPNERMSCGKIDN----VFATTDFGIDRIHEIDDIWTMLRKLQP 276

Query: 223 SKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG 282
           + P++ +E Y GW   +      R  + +A A+        +  N YM+FGGTNFG TAG
Sbjct: 277 TGPLVNSEFYPGWLTHWQEMNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAG 335

Query: 283 G----------PLVATSYDYDAPIDEYG 300
                          TSYDYDA +DE G
Sbjct: 336 ANYNLDGGIGYAADITSYDYDAVMDEAG 363


>gi|16611713|gb|AAL27306.1|AF376481_1 BgaC [Carnobacterium maltaromaticum]
          Length = 586

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 109/306 (35%), Positives = 147/306 (48%), Gaps = 44/306 (14%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SG+IHY R  PE W   ++  K  G   +ETYV WN HEP +GQY F    DL RF++  
Sbjct: 21  SGAIHYFRVVPEYWEHRLKLLKNMGCNTVETYVAWNQHEPKKGQYVFSDALDLRRFIQLA 80

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRF----LAKIID 138
              GL + LR  PY CAE+ +GG P WL     ++ R+T  PF E ++ +      ++ID
Sbjct: 81  DSLGLKVILRPSPYICAEFEFGGLPAWLLKDRHMRVRSTYPPFMERVRLYYRELFKEVID 140

Query: 139 LMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE-LYVKWAADTAVNLNTSVPWVMCQ 197
           L        + GGPIIL QVENEYG      G G E  Y++           +VP V   
Sbjct: 141 LQ------ITSGGPIILMQVENEYG------GYGSEKKYLQELVTMMKENGVTVPLVTSD 188

Query: 198 ------------QEDAPDPIINTCNGFYCDGFTPNSPSK----PIMWTENYSGWFLSFGY 241
                       QE A  P +N C     + F   +  K    P+M  E + GWF ++  
Sbjct: 189 GPWGDMLENGSLQESAL-PTVN-CGSAIPEHFDRLAAFKQKKGPLMVMEYWIGWFDAWQD 246

Query: 242 AVPFRP-VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPLV--ATSYDYDA 294
                  V+    ++    + G    N+YM+ GGTNFG   G    G L+   TSYDYDA
Sbjct: 247 KKHHTTDVKSSVESLEEILKRGSV--NFYMFHGGTNFGFMNGANYYGKLLPDTTSYDYDA 304

Query: 295 PIDEYG 300
           P++EYG
Sbjct: 305 PLNEYG 310


>gi|134096920|ref|YP_001102581.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
 gi|291006638|ref|ZP_06564611.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
 gi|133909543|emb|CAL99655.1| beta-galactosidase [Saccharopolyspora erythraea NRRL 2338]
          Length = 594

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 111/333 (33%), Positives = 164/333 (49%), Gaps = 34/333 (10%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           A +T      ++DG+   + +G +HY R+ P+ W   + + +  GL  ++TYV WN+HEP
Sbjct: 15  AGLTVRGNEFLLDGEPFRIIAGEMHYFRTHPDQWRNRLDRMRALGLNSVDTYVAWNFHEP 74

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
            RG+  F G  D+VRFV+T  EAGL + +R GPY CAEW++GG P WL        R ++
Sbjct: 75  RRGEVDFTGWRDVVRFVETAAEAGLKVIIRPGPYICAEWDFGGLPAWLLESGNPPLRCSD 134

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA- 181
             + E   R+  ++  L +   L A++GGP++  QVENEYG    +YG       +  A 
Sbjct: 135 PAYTELTLRWFDEL--LPRLAPLQATRGGPVLAFQVENEYG----SYGNDQTHLEQLRAG 188

Query: 182 ------DTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTP------NSPSKPIMWT 229
                 D+ +  +      M +  + PD  + T N F  D   P        P  P+  T
Sbjct: 189 MLERGIDSLLFCSNGPSDYMLRGGNLPD-TLATVN-FAGDPTAPFEALREYQPEGPLWCT 246

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------ 283
           E + GWF  +G         + A  V R    G +  + YM  GGTNFG  AG       
Sbjct: 247 EFWDGWFDHWGEEHHTTDPVETAGHVDRMLAAGASV-SLYMAVGGTNFGWWAGANYDTSK 305

Query: 284 ----PLVATSYDYDAPIDEYGFIRQPKWGHLRE 312
               P + TSYDYD+PI E G + + K+  +RE
Sbjct: 306 DQYQPTI-TSYDYDSPIGEAGELTE-KFQRIRE 336


>gi|423252157|ref|ZP_17233159.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
           CL03T00C08]
 gi|423252477|ref|ZP_17233408.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
           CL03T12C07]
 gi|392647903|gb|EIY41596.1| hypothetical protein HMPREF1066_04169 [Bacteroides fragilis
           CL03T00C08]
 gi|392660553|gb|EIY54162.1| hypothetical protein HMPREF1067_00052 [Bacteroides fragilis
           CL03T12C07]
          Length = 628

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 119/384 (30%), Positives = 176/384 (45%), Gaps = 47/384 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           +GK   + SG +HY R   + W   ++  K  GL  + TYVFWN HEP  G++ F G  +
Sbjct: 37  NGKITPVLSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKN 96

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           L  F+K   E G+ + LR GPY CAEW +GG+P WL  + G++ R  N  F +  K +  
Sbjct: 97  LAEFIKIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAY-- 154

Query: 135 KIIDLMKQE--NLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTS- 190
             ID + +E  +L  ++GGPI++ Q ENE+G+ V     +  E +  + A     L  + 
Sbjct: 155 --IDRLYKEVGSLQCTKGGPIVMVQCENEFGSYVAQRKDIPLEEHRAYNAKIKQQLADAG 212

Query: 191 --VPWVMCQ-----QEDAPDPIINTCNG------------FYCDGFTPNSPSKPIMWTEN 231
             VP          +  A    + T NG             Y DG        P M  E 
Sbjct: 213 FNVPLFTSDGSWLFEGGATPGALPTANGESDIENLKKVVDQYHDG------KGPYMVAEF 266

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA---- 287
           Y GW   +    P      +A    ++ +   +F N+YM  GGTNFG T+G         
Sbjct: 267 YPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF-NFYMVHGGTNFGFTSGANYDKKRDI 325

Query: 288 ----TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPTHQKLGAKLEAHIY 343
               TSYDYDAPI E G++  PK+  +R +   IK   +Y I   P    +  ++ +   
Sbjct: 326 QPDMTSYDYDAPISEAGWV-TPKYDSIRNV---IKKYVKYTIPEAPAPNPV-IEIPSIQL 380

Query: 344 HKSSNDCAAFLANYDSSSDANVTF 367
           +K ++  A        SSD  +TF
Sbjct: 381 NKVADVLAFAEKQKPVSSDTPLTF 404



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 143/371 (38%), Gaps = 57/371 (15%)

Query: 341 HIYHKSSNDCAAFLANYDSSSDANVTFNGNVYFLP----AWSVSILPDCKNVVFNTAKVI 396
           ++ H  +N      ANYD   D         Y  P     W        +NV+    K  
Sbjct: 303 YMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVKYT 362

Query: 397 SQRNNGDHPFAQQKNVNELLLASSAFSWYEEKVGISGNRSFVRPDLAEQINTTKDTSDYL 456
                  +P  +  ++ +L   +   ++ E++  +S +     P   EQ+N       Y+
Sbjct: 363 IPEAPAPNPVIEIPSI-QLNKVADVLAFAEKQKPVSSDT----PLTFEQLN---QGYGYV 414

Query: 457 WYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGI 516
            YT   +  P  G    L I  L   A+V+V+ + V  G  N +   + +  ++  N   
Sbjct: 415 LYTRHFN-QPISGT---LEIPGLRDYAVVYVDGEQV--GVLNRNTKTYSMEIEVPFNA-- 466

Query: 517 NTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLD 576
            TL IL   +G  NYG+       G+ S + I      +++  G  +YQ+ ++ E   L 
Sbjct: 467 -TLQILVENMGRINYGSEIVHNTKGIISPVQI----AGKEIVGGWDMYQLPMD-EMPDLT 520

Query: 577 KISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
           K+  A++          +    + Y+ TF   +  G   +++ S GKG  +VNG +IGRY
Sbjct: 521 KLK-ADTHKNVPSEVAKLKGCPVLYEGTFTL-DKVGDTFMDMESWGKGIVFVNGVNIGRY 578

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W                               P QTLY +P  W+  GEN +VI E+L  
Sbjct: 579 WKV----------------------------GPQQTLY-VPGVWLKKGENKIVIFEQLNE 609

Query: 697 DPSKISLLTKT 707
            P       KT
Sbjct: 610 TPQTEVKTVKT 620


>gi|325845662|ref|ZP_08168945.1| putative beta-galactosidase [Turicibacter sp. HGF1]
 gi|325488263|gb|EGC90689.1| putative beta-galactosidase [Turicibacter sp. HGF1]
          Length = 589

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/316 (32%), Positives = 149/316 (47%), Gaps = 38/316 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   + SG+IHY R  P+ W   +   K  G   +ETYV WN HE   GQ+ F G
Sbjct: 10  FLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DLV FVK  +E GL + LR GPY CAEW  GG P WL     ++ R  +  F E+++ 
Sbjct: 70  GKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVEN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   ++ L+    L  ++GGP+I+ QVENEYG+         +LY++       +    V
Sbjct: 130 YFKVLLPLIVP--LQVTKGGPVIMVQVENEYGSFS-----NDKLYLRALKKMIEDAGIDV 182

Query: 192 P-------W--VMCQQEDAPDPIINTCNGFYCDG---------FTPNSPSK-PIMWTENY 232
           P       W   +       + ++ T N F   G         F      K P+M  E +
Sbjct: 183 PLFTSDGAWEQALMSGTLIEEEVLVTAN-FGSRGNENFDVLQSFMEKHDKKWPLMCMEFW 241

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------P 284
            GWF  +   +  R  +++   +    + G    N YM+ GGTNFG   G         P
Sbjct: 242 CGWFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLP 299

Query: 285 LVATSYDYDAPIDEYG 300
            V TSYDYDA + E+G
Sbjct: 300 QV-TSYDYDAFLTEWG 314


>gi|383116237|ref|ZP_09936989.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
 gi|251945420|gb|EES85858.1| hypothetical protein BSHG_3290 [Bacteroides sp. 3_2_5]
          Length = 769

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 106/334 (31%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + Y+    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYISAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)

Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
           ++L  GK+      W +Y   V+         S      +K G+  T+P      +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F   +  G   L++++ GKG  WVNG +IGR+W                           
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|422700666|ref|ZP_16758509.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
 gi|315170851|gb|EFU14868.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1342]
          Length = 593

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 166/352 (47%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL    G++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKGVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   +  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPMQITQGGPVIMMQVENEYG----SYGM-EKAYLQQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|320162379|ref|YP_004175604.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
 gi|319996233|dbj|BAJ65004.1| beta-galactosidase [Anaerolinea thermophila UNI-1]
          Length = 583

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 105/331 (31%), Positives = 160/331 (48%), Gaps = 29/331 (8%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           + +T +     +DG+   + +G++HY R  P  W + + K K  GL  +ETYV WN HEP
Sbjct: 2   STLTIEGDHFELDGEPFRILAGAMHYFRVHPAYWKDRLLKLKAMGLNTVETYVAWNLHEP 61

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G+++F    ++ R+++   E GL++ +R GPY CAEW  GG P WL   P ++ R   
Sbjct: 62  HEGEFHFGDWLNIERYIELAGELGLYVIVRPGPYICAEWEMGGLPAWLLKDPQMKLRCMY 121

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGG-------EL 175
            P+ + +  + ++++  +    L +++GGPII  QVENEYG    +YG          EL
Sbjct: 122 QPYLDAVGEYFSQLMHRLVP--LQSTRGGPIIAMQVENEYG----SYGNDTRYLKYLEEL 175

Query: 176 YVKWAADTAVNLNTSVPWVMCQQEDAPD--PIINTCN--GFYCDGFTPNSPSKPIMWTEN 231
             +   D  +     V   M Q    P     +N  N  G   +         P++  E 
Sbjct: 176 LRQCGVDVLLFTADGVADEMMQYGSLPHLFKAVNFGNRPGDAFEKLREYQTGGPLLVAEF 235

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-------- 283
           + GWF  +G     R   ++A  +      G +  N YM+ GGTNFG   G         
Sbjct: 236 WDGWFDHWGERHHTRSAGEVARVLDDLLSEGASV-NLYMFHGGTNFGFMNGANAFPSPHY 294

Query: 284 -PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
            P V TSYDYDAP+ E G I  PK+  +RE+
Sbjct: 295 TPTV-TSYDYDAPLSECGNI-TPKYEAMREV 323


>gi|289166983|ref|YP_003445250.1| beta-galactosidase 3 [Streptococcus mitis B6]
 gi|288906548|emb|CBJ21380.1| beta-galactosidase 3 [Streptococcus mitis B6]
          Length = 595

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKPFKILSGAIHYFRIPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R++   + E + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRIRSSAPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLSRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEDRGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|12852936|dbj|BAB29584.1| unnamed protein product [Mus musculus]
          Length = 586

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 108/312 (34%), Positives = 151/312 (48%), Gaps = 29/312 (9%)

Query: 20  VLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFV 79
           ++  GSIHY R   E W + + K +  G   + TY+ WN HE  RG++ F    DL  +V
Sbjct: 1   MIVGGSIHYFRVPREYWKDRLLKLQACGFNTVTTYIPWNLHEQERGKFDFSEILDLEAYV 60

Query: 80  KTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDL 139
              +  GL++ LR GPY CAE + GG P WL   P    RTTN  F E + ++   +I  
Sbjct: 61  LLAKTIGLWVILRPGPYICAEVDLGGLPSWLLRNPVTDLRTTNKGFIEAVDKYFDHLIP- 119

Query: 140 MKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQE 199
            K   L    GGP+I  QVENEYG+ +         Y+K A      L   +  ++   +
Sbjct: 120 -KILPLQYRHGGPVIAVQVENEYGSFQKDRNYMN--YLKKAL-----LKRGIVELLLTSD 171

Query: 200 DAPDPIINTCNG----FYCDGFTPNS--------PSKPIMWTENYSGWFLSFGYAVPFRP 247
           D     I + NG       + FT +S          KPIM  E ++GW+ S+G     + 
Sbjct: 172 DKDGIQIGSVNGALTTINMNSFTKDSFIKLHKMQSDKPIMIMEYWTGWYDSWGSKHIEKS 231

Query: 248 VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYGF 301
            E++   V +F   G +F N YM+ GGTNFG   GG        V TSYDYDA + E G 
Sbjct: 232 AEEIRHTVYKFISYGLSF-NMYMFHGGTNFGFINGGRYENHHISVVTSYDYDAVLSEAGD 290

Query: 302 IRQPKWGHLREL 313
             + K+  LR+L
Sbjct: 291 YTE-KYFKLRKL 301


>gi|403528012|ref|YP_006662899.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
 gi|403230439|gb|AFR29861.1| beta-galactosidase GLB [Arthrobacter sp. Rue61a]
          Length = 598

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 163/350 (46%), Gaps = 58/350 (16%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           +A ++Y    L   G+   + +G+IHY R  P++W + +R+ K  G   ++TYV WN+H+
Sbjct: 3   NALLSYHDAVLYRSGEPYRILAGAIHYFRVHPDLWQDRLRRLKAMGANTVDTYVAWNFHQ 62

Query: 62  PIRGQY-YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRT 120
           P R +   F G  DL RF+    E GL + +R GPY CAEW+ GGFP WL  IPGI  R 
Sbjct: 63  PKRDEAPDFSGWQDLGRFMDLAAEEGLDVIVRPGPYICAEWDNGGFPSWLTGIPGIGLRC 122

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWA 180
            +  F   ++ +   ++ ++       S GGP++  Q+ENEYG    +YG   E Y++W 
Sbjct: 123 MDPVFTAAIEEWFDHLLPIVASRQ--TSAGGPVVAVQIENEYG----SYGDDHE-YIRWN 175

Query: 181 ADTAVNLNTSVPWVMCQQEDAPDPIINTCNG---FYCDG--------------------- 216
                            +E     ++ T +G   ++ DG                     
Sbjct: 176 R-------------RALEERGITELLFTADGGTDYFLDGGAVEGTWATATLGSRGDEAVA 222

Query: 217 -FTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGT 275
            +    P +P    E + GWF  +G     R  ED A    +  + GG+    YM  GGT
Sbjct: 223 TWQRRRPGEPFFNVEFWGGWFDHWGEHHHGRDAEDAALEARKMLDLGGSL-CAYMAHGGT 281

Query: 276 NFGRTAGG--------PLVATSYDYDAPIDEYGFIRQPKWGHLR-ELHKA 316
           NFG  +G         P V TSYD DAPI E G +  PK+   R E ++A
Sbjct: 282 NFGLRSGSNHDGTMLQPTV-TSYDSDAPIAENGAL-TPKFHAFRKEFYRA 329


>gi|449458169|ref|XP_004146820.1| PREDICTED: beta-galactosidase 17-like [Cucumis sativus]
          Length = 719

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 166/353 (47%), Gaps = 45/353 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DGK   +  G +HY R+ PE W + + ++K  GL  I+TY+ WN HEP  G + F G  +
Sbjct: 79  DGKPFQIIGGDLHYFRTLPEYWEDRLLRAKALGLNTIQTYIPWNLHEPKPGNFTFNGIAN 138

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRTTNNPFKEEMKRFL 133
           +V F++  Q+    + LR GPY CAEW+ GGFP W L  +P  + R+++  + + ++R+ 
Sbjct: 139 IVSFIQLCQKLDFLVLLRPGPYICAEWDLGGFPAWLLSKMPASRLRSSDPGYLQWVERWW 198

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-----------VEWAYGVGGELYVKWAAD 182
             I  L K   L  + GGPII+ Q+ENE+G+           V  A G  G+  + +  D
Sbjct: 199 GII--LPKVAPLLYNNGGPIIMVQIENEFGSYGDDQAYLHHLVALARGYLGDEIILYTTD 256

Query: 183 ---------TAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-PIMWTENY 232
                      +  N     V     + P PI N    F       N P K P +  E Y
Sbjct: 257 GGTRETLEKGTIRGNAVFSAVDFSTGERPWPIFNLQKEF-------NPPGKSPPLTAEFY 309

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF----GRTAGGPLV-- 286
           +GW   +G  +        A A+       G+    YM  GGTNF    G   G  ++  
Sbjct: 310 TGWLTHWGENIATTDANSTAAALNEILAGKGS-AVLYMAHGGTNFGFYNGANTGNDVLDY 368

Query: 287 ---ATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDPT-HQKLG 335
               TSYDYDAPI E G +   K+  +R   + I+     LI S P+ ++K+G
Sbjct: 369 KPDLTSYDYDAPIKESGDVDNAKYEAIR---RVIQHYSGALIPSVPSNNEKIG 418


>gi|293376766|ref|ZP_06622988.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
 gi|292644632|gb|EFF62720.1| glycosyl hydrolase family 35 [Turicibacter sanguinis PC909]
          Length = 589

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/316 (32%), Positives = 149/316 (47%), Gaps = 38/316 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   + SG+IHY R  P+ W   +   K  G   +ETYV WN HE   GQ+ F G
Sbjct: 10  FLVDGKPTRIMSGAIHYFRIMPDHWEHSLYNLKALGFNTVETYVPWNLHEMREGQFDFTG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DLV FVK  +E GL + LR GPY CAEW  GG P WL     ++ R  +  F E+++ 
Sbjct: 70  GKDLVSFVKKAEEIGLMVILRPGPYICAEWENGGLPAWLLNYHDMKIRCDDELFLEKVEN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   ++ L+    L  ++GGP+I+ QVENEYG+         +LY++       +    V
Sbjct: 130 YFKVLLPLIVP--LQVTKGGPVIMVQVENEYGSFS-----NDKLYLRALKKMIEDAGIDV 182

Query: 192 P-------W--VMCQQEDAPDPIINTCNGFYCDG---------FTPNSPSK-PIMWTENY 232
           P       W   +       + ++ T N F   G         F      K P+M  E +
Sbjct: 183 PLFTSDGAWEQALMSGTLIEEEVLVTAN-FGSRGNENFDVLQSFMEKHDKKWPLMCMEFW 241

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------P 284
            GWF  +   +  R  +++   +    + G    N YM+ GGTNFG   G         P
Sbjct: 242 CGWFNRWNEDIILRDADEVMTCMKELLQRGSL--NLYMFHGGTNFGFMNGSCAGKIGNLP 299

Query: 285 LVATSYDYDAPIDEYG 300
            V TSYDYDA + E+G
Sbjct: 300 QV-TSYDYDAFLTEWG 314


>gi|58581392|ref|YP_200408.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
 gi|58425986|gb|AAW75023.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae KACC 10331]
          Length = 651

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/310 (36%), Positives = 149/310 (48%), Gaps = 28/310 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 77  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 136

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 137 NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 196

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  +   +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 197 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 253

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 254 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK-- 311

Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
           P    +  A   A  FE     G   N YM+ GGT+FG   G               TSY
Sbjct: 312 PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 369

Query: 291 DYDAPIDEYG 300
           DYDA +DE G
Sbjct: 370 DYDAIVDEAG 379


>gi|424665378|ref|ZP_18102414.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
           616]
 gi|404574622|gb|EKA79370.1| hypothetical protein HMPREF1205_01253 [Bacteroides fragilis HMW
           616]
          Length = 624

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 110/340 (32%), Positives = 158/340 (46%), Gaps = 54/340 (15%)

Query: 16  GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
           G+   + SG +HY R   + W   ++  K  GL  + TYVFWN HE   G++ F G  +L
Sbjct: 35  GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94

Query: 76  VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
             +++   E G+ + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K++   
Sbjct: 95  AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151

Query: 136 IIDLMKQE--NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-P 192
            ID + QE   L  ++GGPII+ Q ENE+G+           YV    D +   + S   
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGS-----------YVSQRKDISFEEHRSYNA 199

Query: 193 WVMCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIM 227
            +  Q  DA    P+  +   +  +G        T N  S                 P M
Sbjct: 200 KIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYM 259

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
             E Y GW   +G   P     ++A     + +   +F N+YM  GGTNFG T+G     
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNNVSF-NFYMVHGGTNFGFTSGANYDK 318

Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
                   TSYDYDAPI E G+I  PK+  +R  + K +K
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357


>gi|332375542|gb|AEE62912.1| unknown [Dendroctonus ponderosae]
          Length = 454

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 168/354 (47%), Gaps = 52/354 (14%)

Query: 1   LSANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYH 60
           L+AN ++      ++ K   + SG++HY R     W + +RK +  GL  +ETYV WN H
Sbjct: 27  LNANQSF----FTLNDKLIKIYSGAMHYFRVPRPYWRDRLRKIRAAGLNTVETYVPWNLH 82

Query: 61  EPIRGQY-------YFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFI 113
           EP  G++        FE    L  F+   +E  LF+ LR GPY C+E+N GGFP WL   
Sbjct: 83  EPENGKFDFGEGGSEFEDFLHLEEFLNAAKEEDLFVILRTGPYICSEYNSGGFPSWLLRE 142

Query: 114 PGIQFRTTNNPFKEEMKRFLAKIIDLMKQENLFASQ-GGPIILAQVENEYGNVEWAYGVG 172
             + FRT+   + + + RF   ++ L+     F  Q GGP+I  QVENEYGN+E      
Sbjct: 143 KPMGFRTSEENYMKFVTRFFNVVLTLLAA---FQFQLGGPVIAFQVENEYGNLENGAAFQ 199

Query: 173 ---------GELYVK-------WAADTAVNLNTS--VPWVMCQQEDAPDPIINTCNGFYC 214
                     +L++K        +AD+ +   TS  +P  + Q  +  D  +N  N    
Sbjct: 200 PDKVYMEELRQLFLKNGIVELLTSADSPLWKGTSGTLPGELFQTANFGDNAVNQLNK--L 257

Query: 215 DGFTPNSPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGG 274
           + F    P +P+M  E + GWF + G     +  ED    +   F    +F N YM+ GG
Sbjct: 258 EEF---QPGRPLMVMEYWIGWFDNVGGEHSVKSDEDSRRVLEDIFSKNASF-NAYMFHGG 313

Query: 275 TNFGRTAGGPL------------VATSYDYDAPIDEYGFIRQPKWGHLRELHKA 316
           TNF    G  L            + TSYDYDAPI E G  R  K+  ++EL  A
Sbjct: 314 TNFWFNNGANLDNDLMDNSGYTAITTSYDYDAPISESGGYRN-KYFIVKELVAA 366


>gi|281422858|ref|ZP_06253857.1| beta-galactosidase [Prevotella copri DSM 18205]
 gi|281403124|gb|EFB33804.1| beta-galactosidase [Prevotella copri DSM 18205]
          Length = 788

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 103/329 (31%), Positives = 156/329 (47%), Gaps = 33/329 (10%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           T   +  +++GK  V+++  +HYPR     W   I+  K  G+  +  YVFWN HE   G
Sbjct: 33  TTGDKTFLLNGKPFVVKAAELHYPRIPRAYWEHRIKMCKALGMNTVCLYVFWNIHEQEEG 92

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           ++ F G  D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I+ R  +  F
Sbjct: 93  KFDFTGNNDVAAFCRLAQKNGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIRLREQDPYF 152

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV---------------EWAYG 170
            + ++ F  ++   +    L    GGPII+ QVENEYG+                +  + 
Sbjct: 153 MQRVEIFEKEVGKQLAP--LTIQNGGPIIMVQVENEYGSYGKDKPYVSAIRDIVRKSGFD 210

Query: 171 VGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTE 230
                   W+++   N    + W M     A     N    F   G     P+ P M +E
Sbjct: 211 KVSLFQCDWSSNFLNNGLDDLTWTMNFGTGA-----NIDQQFKRLGEV--RPNAPKMCSE 263

Query: 231 NYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------P 284
            +SGWF  +G     RP +D+   +      G +F + YM  GGT+FG  AG       P
Sbjct: 264 FWSGWFDKWGARHETRPAKDMVEGMDEMLSKGISF-SLYMTHGGTSFGHWAGANSPGFQP 322

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLREL 313
            V TSYDYDAPI+E+G +  PK+  L+++
Sbjct: 323 DV-TSYDYDAPINEWG-LATPKFYELQKM 349


>gi|194761012|ref|XP_001962726.1| GF14288 [Drosophila ananassae]
 gi|190616423|gb|EDV31947.1| GF14288 [Drosophila ananassae]
          Length = 661

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 105/324 (32%), Positives = 155/324 (47%), Gaps = 37/324 (11%)

Query: 6   TYDHRA--LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
           T DH A   ++DG+     SGS HY R+ PE W   +R  +  GL  ++TYV W+ H P 
Sbjct: 36  TIDHEANSFMLDGEPFRYVSGSFHYFRAVPEAWRSRLRTMRASGLNALDTYVEWSLHNPH 95

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWL-HFIPGIQFRTTN 122
             +Y +EG  D+V+F++  QE   ++ LR GPY CAE + GG P WL    P I+ RT +
Sbjct: 96  EDEYNWEGIADVVKFLEIAQEEDFYIILRPGPYICAERDNGGLPHWLFKKYPSIKMRTND 155

Query: 123 NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAAD 182
             +  E+ ++ A++  + + ++L    GG II+ QVENEYG+    +      Y+ W  D
Sbjct: 156 PDYIAEVGKWYAQL--MPRLQHLLVGNGGKIIMVQVENEYGDYACDHD-----YLNWLRD 208

Query: 183 --------TAVNLNTSVP--WVMCQQED----APDPIINTCNGF--YCDGFTPNSPSKPI 226
                    A+     +P   + C + D      D  I+  N             P+ P+
Sbjct: 209 ETEKYVSGKALLFTVDIPNEKMSCGKIDNVFATTDFGIDRINEIDEIWKMLRVQQPTGPL 268

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           + +E Y GW   +      R  + +A A+        +  N YM+FGGTNFG TAG    
Sbjct: 269 VNSEFYPGWLTHWQEQNQRRDGQVVADALKTILSYNASV-NLYMFFGGTNFGFTAGANYD 327

Query: 284 -------PLVATSYDYDAPIDEYG 300
                      TSYDYDA +DE G
Sbjct: 328 LDGGIGYAADITSYDYDAVMDEAG 351


>gi|381169756|ref|ZP_09878919.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
           941]
 gi|380689774|emb|CCG35406.1| beta-galactosidase [Xanthomonas citri pv. mangiferaeindicae LMG
           941]
          Length = 613

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 115/319 (36%), Positives = 155/319 (48%), Gaps = 29/319 (9%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFD 74
           DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G  D
Sbjct: 42  DGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSGHND 101

Query: 75  LVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLA 134
           +  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + +L 
Sbjct: 102 VAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLD 161

Query: 135 KIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLNTSV 191
            + + +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L TS 
Sbjct: 162 ALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LFTSD 218

Query: 192 PWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAVPFR 246
              M      PD   ++N   G      D      P +P M  E ++GWF  +G   P  
Sbjct: 219 GADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK--PHA 276

Query: 247 PVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
             +  A   A  FE     G   N YM+ GGT+FG   G               TSYDYD
Sbjct: 277 ATD--ARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDYD 334

Query: 294 APIDEYGFIRQPKWGHLRE 312
           A +DE G    PK+  +R+
Sbjct: 335 AILDEAGHP-TPKFALMRD 352


>gi|432894411|ref|XP_004075980.1| PREDICTED: beta-galactosidase-1-like protein 2-like [Oryzias
           latipes]
          Length = 640

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 108/318 (33%), Positives = 148/318 (46%), Gaps = 35/318 (11%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S+N T + +  +I G       GSIHY R     W + + K K  GL  + TYV WN HE
Sbjct: 52  SSNFTLERKPFLILG-------GSIHYFRVPKAYWEDRLLKLKACGLNTLTTYVPWNLHE 104

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
           P RG + FEG  DL  ++      G+++ LR GPY CAEW+ GG P WL     ++ RTT
Sbjct: 105 PERGVFDFEGELDLEAYLGLAASLGIWVILRPGPYICAEWDLGGLPSWLLRDQNMRLRTT 164

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
              F   +  +   +I   K      S+GGPII  QVENEYG    +Y +  E Y+ +  
Sbjct: 165 YPGFTAAVDSYFDHLIK--KVAPYQYSRGGPIIAVQVENEYG----SYAMDEE-YMPFIK 217

Query: 182 DTAVNLNTSVPWVMCQQED-----APDPIINTCNGFYCDG-----FTPNSPSKPIMWTEN 231
           +  ++   +   V    +D          + T N    D           P KP M  E 
Sbjct: 218 EALLSRGITELLVTSDNKDGLKLGGVKGALETINFQKLDPEEIKYLEKIQPQKPKMVMEY 277

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNF---------GRTAG 282
           +SGWF  +G      P E++   V    +   +  N YM+ GGTNF         GR + 
Sbjct: 278 WSGWFDLWGGLHHVFPAEEMMAVVTEILKLDMSI-NLYMFHGGTNFGFMSGAFAVGRPSP 336

Query: 283 GPLVATSYDYDAPIDEYG 300
            P+V TSYDYDAP+ E G
Sbjct: 337 APMV-TSYDYDAPLSEAG 353



 Score = 49.7 bits (117), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 55/224 (24%), Positives = 90/224 (40%), Gaps = 48/224 (21%)

Query: 475 NIESLGHAALVFVNKKLV-AFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYGA 533
           ++ ++   ALVFV K+ V    Y   + +       I   +G  TL +L    G  NYG 
Sbjct: 446 SLNNIRDRALVFVEKQFVGVLDYKEQELS-------IPDGKGKRTLGLLVENCGRVNYGK 498

Query: 534 WFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWKQGSTLP 593
             D    GL   I ++  N  RD      I+ + ++ +++      L +S+ WK     P
Sbjct: 499 TLDEQRKGLVGDIQLN-ANILRDFM----IHSLDMKPDFVS----RLQSSAQWKSMREKP 549

Query: 594 VNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDY 653
              +  +++T            L L    KG  +VNG+++GRYWS               
Sbjct: 550 SFPA--FFQTKLYLSSSPKDTFLKLPGWSKGVVFVNGKNLGRYWSV-------------- 593

Query: 654 RGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
                          P QTLY +P  W++  +N +++ EEL  D
Sbjct: 594 --------------GPQQTLY-VPGAWLNRWDNEIIVFEELETD 622


>gi|423251759|ref|ZP_17232772.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
           CL03T00C08]
 gi|423255080|ref|ZP_17236010.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
           CL03T12C07]
 gi|392649184|gb|EIY42863.1| hypothetical protein HMPREF1066_03782 [Bacteroides fragilis
           CL03T00C08]
 gi|392652521|gb|EIY46180.1| hypothetical protein HMPREF1067_02654 [Bacteroides fragilis
           CL03T12C07]
          Length = 769

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 106/334 (31%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L +
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLR 340



 Score = 43.9 bits (102), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)

Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
           ++L  GK+      W +Y   V+         S      +K G+  T+P      +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F   +  G   L++++ GKG  WVNG +IGR+W                           
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|53715536|ref|YP_101528.1| beta-galactosidase [Bacteroides fragilis YCH46]
 gi|60683489|ref|YP_213633.1| beta-galactosidase [Bacteroides fragilis NCTC 9343]
 gi|375360299|ref|YP_005113071.1| putative beta-galactosidase [Bacteroides fragilis 638R]
 gi|423280737|ref|ZP_17259649.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
           610]
 gi|52218401|dbj|BAD50994.1| beta-galactosidase precursor [Bacteroides fragilis YCH46]
 gi|60494923|emb|CAH09735.1| putative beta-galactosidase [Bacteroides fragilis NCTC 9343]
 gi|301164980|emb|CBW24544.1| putative beta-galactosidase [Bacteroides fragilis 638R]
 gi|404583944|gb|EKA88617.1| hypothetical protein HMPREF1203_03866 [Bacteroides fragilis HMW
           610]
          Length = 624

 Score =  148 bits (373), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/340 (32%), Positives = 158/340 (46%), Gaps = 54/340 (15%)

Query: 16  GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
           G+   + SG +HY R   + W   ++  K  GL  + TYVFWN HE   G++ F G  +L
Sbjct: 35  GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94

Query: 76  VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
             +++   E G+ + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K++   
Sbjct: 95  AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151

Query: 136 IIDLMKQE--NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-P 192
            ID + QE   L  ++GGPII+ Q ENE+G+           YV    D +   + S   
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGS-----------YVSQRKDISFEEHRSYNA 199

Query: 193 WVMCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIM 227
            +  Q  DA    P+  +   +  +G        T N  S                 P M
Sbjct: 200 KIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYM 259

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
             E Y GW   +G   P     ++A     + +   +F N+YM  GGTNFG T+G     
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDK 318

Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
                   TSYDYDAPI E G+I  PK+  +R  + K +K
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357


>gi|325925751|ref|ZP_08187124.1| beta-galactosidase [Xanthomonas perforans 91-118]
 gi|325543808|gb|EGD15218.1| beta-galactosidase [Xanthomonas perforans 91-118]
          Length = 611

 Score =  148 bits (373), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 114/320 (35%), Positives = 152/320 (47%), Gaps = 25/320 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   + SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 37  FVRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 96

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 97  NNDVAAFVREAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQS 156

Query: 132 FLAKIIDLMKQ-ENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNL 187
           +L     L KQ + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L
Sbjct: 157 YLDA---LAKQVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-L 212

Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G  
Sbjct: 213 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 272

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
                    A         G +  N YM+ GGT+FG   G               TSYDY
Sbjct: 273 HAATDARQQAEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSYDY 331

Query: 293 DAPIDEYGFIRQPKWGHLRE 312
           DA +DE G    PK+  +R+
Sbjct: 332 DAILDEAGHP-TPKFALMRD 350


>gi|84623327|ref|YP_450699.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
 gi|188577369|ref|YP_001914298.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|84367267|dbj|BAE68425.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae MAFF 311018]
 gi|188521821|gb|ACD59766.1| beta-galactosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 613

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/310 (36%), Positives = 149/310 (48%), Gaps = 28/310 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  +   +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 159 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK-- 273

Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
           P    +  A   A  FE     G   N YM+ GGT+FG   G               TSY
Sbjct: 274 PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331

Query: 291 DYDAPIDEYG 300
           DYDA +DE G
Sbjct: 332 DYDAIVDEAG 341


>gi|307705099|ref|ZP_07641979.1| beta-galactosidase [Streptococcus mitis SK597]
 gi|307621359|gb|EFO00416.1| beta-galactosidase [Streptococcus mitis SK597]
          Length = 595

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++T Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQTAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            ++   +    L    GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLFPRLVPHLL--DNGGNILMMQVENEYGSYGEDKAYLRVIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|195108029|ref|XP_001998595.1| GI23552 [Drosophila mojavensis]
 gi|193915189|gb|EDW14056.1| GI23552 [Drosophila mojavensis]
          Length = 641

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/341 (32%), Positives = 162/341 (47%), Gaps = 44/341 (12%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S  V Y++   + DG+     +GS HY R+ P+ W   +R  +  GL  + TYV W+ H 
Sbjct: 25  SFTVDYENDRFLKDGRPFHFIAGSFHYFRAHPDTWSRHLRTMRAAGLNAVTTYVEWSLHN 84

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVW-LHFIPGIQFRT 120
           P  G Y + G  DL RF++   +  L + LR GPY CAE + GGFP W L+  PGIQ RT
Sbjct: 85  PRDGVYVWTGIADLERFIRLAVDEDLLVILRPGPYICAERDMGGFPYWLLNKFPGIQLRT 144

Query: 121 TNNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKW- 179
            +  +  E++ + +++  + +        GGPII+ QVENEYG    +Y      Y  W 
Sbjct: 145 ADINYLSEVRIWYSQL--MARIGPYLYGNGGPIIMVQVENEYG----SYFACDANYRNWL 198

Query: 180 -------AADTAVNLNTSVPWVM-CQQEDAPDPIINTCNGFYCDGFTPN----------- 220
                    D+AV      P V+ C +      ++ T +     G T N           
Sbjct: 199 RDETQNHVKDSAVLFTNDGPGVLRCGKIQG---VLATMDF----GATSNLKDVWAKLRQY 251

Query: 221 SPSKPIMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRT 280
            P  P++  E Y GW   +   +       +        ++G +  N+YM++GGTNFG T
Sbjct: 252 QPKGPLVNAEYYPGWLTHWTEPMANVSTSAITGTFIDMLDSGASV-NFYMFYGGTNFGFT 310

Query: 281 AG------GPLVA--TSYDYDAPIDEYGFIRQPKWGHLREL 313
           AG      G  +A  TSYDYDAP+ E G    PK+  LR++
Sbjct: 311 AGANDNGPGNYIADITSYDYDAPMTEAG-DPTPKYMALRQI 350


>gi|384420175|ref|YP_005629535.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
 gi|353463088|gb|AEQ97367.1| beta-galactosidase [Xanthomonas oryzae pv. oryzicola BLS256]
          Length = 613

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/310 (36%), Positives = 149/310 (48%), Gaps = 28/310 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  NNDVAAFVQEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGQGNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  +   +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 159 YLDAVAKQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 216 TSDGAEMLANGTLPDTLAVVNFAPGEAKSAFDKLIAFRPDQPRMVGEYWAGWFDHWGK-- 273

Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
           P    +  A   A  FE     G   N YM+ GGT+FG   G               TSY
Sbjct: 274 PHAATD--ATQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331

Query: 291 DYDAPIDEYG 300
           DYDA +DE G
Sbjct: 332 DYDAIVDEAG 341


>gi|395816938|ref|XP_003781939.1| PREDICTED: LOW QUALITY PROTEIN: beta-galactosidase [Otolemur
           garnettii]
          Length = 669

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 108/338 (31%), Positives = 156/338 (46%), Gaps = 27/338 (7%)

Query: 4   NVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPI 63
            + Y     + DG+     SGSIHY R     W + + K K  GL  I+TYV WN+HEP 
Sbjct: 33  KIDYSRDRFLKDGQPFRYISGSIHYSRLPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPQ 92

Query: 64  RGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNN 123
            G+Y F    D+  F++   E GL + LR GPY CAEW+ GG P WL     +  R+++ 
Sbjct: 93  PGKYQFSEDHDVEYFIQLAHELGLLVILRPGPYICAEWDMGGLPAWLLEKESMILRSSDP 152

Query: 124 PFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVK----- 178
            +   + ++L  ++  MK   L    GGPII  QVENEYG    +Y      Y++     
Sbjct: 153 DYLAAVDKWLGVLLPKMKP--LLYQNGGPIISVQVENEYG----SYFTCDHDYMRFLLKR 206

Query: 179 ---WAADTAVNLNTS---VPWVMCQQEDAPDPIINTCNGFYCDGF----TPNSPSKPIMW 228
              +  D  V   T      ++ C         ++   G            + P  P++ 
Sbjct: 207 FRYYLGDDVVLFTTDGIFEKYLNCGALQGLYATVDFGTGVNITAAFKLQRKSEPKGPLIN 266

Query: 229 TENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--PLV 286
           +E Y+GW   +G        ED+AF++      G +  N YM+ GGTNF    G   P  
Sbjct: 267 SEFYTGWLDHWGQPHSTVKTEDVAFSLFDILARGASV-NLYMFTGGTNFAYWNGANIPYS 325

Query: 287 A--TSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           A  TSYDYDAP+ E G + + K+  LR + +  K   E
Sbjct: 326 AQPTSYDYDAPLSEAGDLTE-KYFALRSVIQKFKETPE 362


>gi|423260402|ref|ZP_17241324.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
           CL07T00C01]
 gi|423266536|ref|ZP_17245538.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
           CL07T12C05]
 gi|387774956|gb|EIK37065.1| hypothetical protein HMPREF1055_03601 [Bacteroides fragilis
           CL07T00C01]
 gi|392699768|gb|EIY92937.1| hypothetical protein HMPREF1056_03225 [Bacteroides fragilis
           CL07T12C05]
          Length = 624

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/340 (32%), Positives = 158/340 (46%), Gaps = 54/340 (15%)

Query: 16  GKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDL 75
           G+   + SG +HY R   + W   ++  K  GL  + TYVFWN HE   G++ F G  +L
Sbjct: 35  GETTPILSGEMHYARIPHQYWRHRLQMMKGMGLNTVATYVFWNLHEVEPGKWDFSGDKNL 94

Query: 76  VRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAK 135
             +++   E G+ + LR GPY CAEW +GG+P WL  IPG++ R  N  F +  K++   
Sbjct: 95  AEYIRIAGEEGMMVILRPGPYVCAEWEFGGYPWWLQNIPGMEIRRDNTEFLKYTKKY--- 151

Query: 136 IIDLMKQE--NLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV-P 192
            ID + QE   L  ++GGPII+ Q ENE+G+           YV    D +   + S   
Sbjct: 152 -IDRLYQEVGPLQCTKGGPIIMVQCENEFGS-----------YVSQRKDISFEEHRSYNA 199

Query: 193 WVMCQQEDA--PDPIINTCNGFYCDGF-------TPNSPSK----------------PIM 227
            +  Q  DA    P+  +   +  +G        T N  S                 P M
Sbjct: 200 KIKGQLADAGFTVPLFTSDGSWLFEGGCVAGALPTANGESDIANLKKVVNQYHGGKGPYM 259

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLVA 287
             E Y GW   +G   P     ++A     + +   +F N+YM  GGTNFG T+G     
Sbjct: 260 VAEFYPGWLSHWGEPFPQVSASEIARQTEAYLQNDVSF-NFYMVHGGTNFGFTSGANYDK 318

Query: 288 --------TSYDYDAPIDEYGFIRQPKWGHLRE-LHKAIK 318
                   TSYDYDAPI E G+I  PK+  +R  + K +K
Sbjct: 319 KRDIQPDLTSYDYDAPISEAGWI-TPKYDSIRSVIQKYVK 357


>gi|336410484|ref|ZP_08590961.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
 gi|335944314|gb|EGN06136.1| hypothetical protein HMPREF1018_02978 [Bacteroides sp. 2_1_56FAA]
          Length = 769

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 107/334 (32%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L K
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFLLRDLLK 340



 Score = 43.9 bits (102), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 67/162 (41%), Gaps = 46/162 (28%)

Query: 548 IDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS--TLPVNKSLIWYKTT 604
           ++L  GK+      W +Y   V+         S      +K G+  T+P      +Y+TT
Sbjct: 481 VELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGTAQTMPA-----YYRTT 527

Query: 605 FLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQK 664
           F   +  G   L++++ GKG  WVNG +IGR+W                           
Sbjct: 528 FRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI------------------------- 561

Query: 665 HCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 ---GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|423270210|ref|ZP_17249181.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
           CL05T00C42]
 gi|423276168|ref|ZP_17255110.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
           CL05T12C13]
 gi|392698134|gb|EIY91316.1| hypothetical protein HMPREF1079_02263 [Bacteroides fragilis
           CL05T00C42]
 gi|392699308|gb|EIY92489.1| hypothetical protein HMPREF1080_03763 [Bacteroides fragilis
           CL05T12C13]
          Length = 769

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 106/334 (31%), Positives = 154/334 (46%), Gaps = 31/334 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              GQ+ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLREARPETPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               + +SYDYDAPI E G+    K+  LR+L +
Sbjct: 308 SYSAMCSSYDYDAPISEPGWTTD-KYFQLRDLLR 340



 Score = 45.1 bits (105), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 97/238 (40%), Gaps = 57/238 (23%)

Query: 474 LNIESLGHAALVFVNKKLVA-FGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
           + I  +   A VFV+ KL+A       +FA  L      L +G   +DIL   +G  N+ 
Sbjct: 414 MKITEVHDWAQVFVDGKLLARLDRRRGEFALQLP----ALKKGTR-IDILVEAMGRVNFD 468

Query: 533 -AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGS 590
            +  D  G        ++L  GK+      W +Y   V+         S      +K G+
Sbjct: 469 ESIHDRKGI----TEKVELVRGKQSAELKNWTVYSFPVD--------YSFVQDKRYKNGT 516

Query: 591 --TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCT 648
             T+P      +Y+TTF   +  G   L++++ GKG  WVNG +IGR+W           
Sbjct: 517 ARTMPA-----YYRTTFRL-DKVGDTFLDMSTWGKGMVWVNGLAIGRFWEI--------- 561

Query: 649 KKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                               P QTL+ +P  W+  GEN +++ +  G + + I  L K
Sbjct: 562 -------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGPEKASIRGLKK 599


>gi|307289489|ref|ZP_07569436.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
 gi|422703871|ref|ZP_16761687.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
 gi|306499556|gb|EFM68926.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0109]
 gi|315164595|gb|EFU08612.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX1302]
          Length = 593

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKSVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|139439964|ref|ZP_01773301.1| Hypothetical protein COLAER_02339 [Collinsella aerofaciens ATCC
           25986]
 gi|133774730|gb|EBA38550.1| glycosyl hydrolase family 35 [Collinsella aerofaciens ATCC 25986]
          Length = 598

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 102/329 (31%), Positives = 147/329 (44%), Gaps = 33/329 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++D +   + SG+IHY R  P  W   +   K  G   +ETYV WN HEP  G + F G
Sbjct: 10  FLLDDEPFTILSGAIHYMRVHPSDWHHSLYNLKALGFNTVETYVPWNLHEPKPGVFDFSG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  F+      GL+  +R  P+ CAEW +GG P WL     ++ R+++  F   + +
Sbjct: 70  SIDLAAFLDEAASLGLYAIVRPSPFICAEWEFGGMPAWLLREHDMRPRSSDPKFLAHVAQ 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   ++ ++    +   +GG II+ QVENEYG+         + Y++      V    SV
Sbjct: 130 YYDHLMPILVSRQI--DKGGNIIMMQVENEYGSY-----CEDKDYLRAIRRLMVERGVSV 182

Query: 192 -------PWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSK-----------PIMWTENYS 233
                  PW  C +          C G +      N  +            P+M  E + 
Sbjct: 183 PLCTSDGPWRGCLRAGTLIDDDVLCTGNFGSHAKENFEALSAFHKEHGKQWPLMCMELWD 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
           GWF  +G  V  R  EDLA  V    E GG+  N YM+ GGTNFG       R       
Sbjct: 243 GWFNRYGENVIRRDPEDLASCVREVLELGGSL-NLYMFHGGTNFGFMNGCSARHTHDLHQ 301

Query: 287 ATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
            TSYDYDAP+DE G   +  +   R +H+
Sbjct: 302 VTSYDYDAPLDEQGNPTEKYFAIQRTVHE 330



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 82/335 (24%), Positives = 126/335 (37%), Gaps = 78/335 (23%)

Query: 393 AKVISQRNNGDHPFAQQKNVNELL--------LASSAFSWYE----EKVGISGNRSFVRP 440
           A +  Q N  +  FA Q+ V+EL         L   AFS  +    E+V +      +  
Sbjct: 309 APLDEQGNPTEKYFAIQRTVHELYPDIAQSKPLTKKAFSMPDISVSERVSLFNVLDILSE 368

Query: 441 DLAEQ----INTTKDTSDYLWYTASIHVMPGQGKEVFLNIESLGHAALVFVNKKLVAFGY 496
            +  Q    +     +  Y  YT ++     +  E  + +      A +FVN   VA  Y
Sbjct: 369 PIEAQYPMPMEEMGQSYGYTLYTTTVE--RDRADEERIRVIDARDRAQMFVNGDKVATQY 426

Query: 497 GNHDFANFLINKKIE--LNEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLKN 552
             H      I + I   L    N LD+L+  +G  NYG     D    G+ + + +DL  
Sbjct: 427 QEH------IGEDIHCVLPCEHNRLDVLTEDMGRVNYGHKLLADTQHKGIRTGVCVDLH- 479

Query: 553 GKRDLSSGEWIYQVGVEGEYIGLDKI-SLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGK 611
                      +  G E   + LD I +L  S+ W +G          +Y+  F   E  
Sbjct: 480 -----------FVTGWEMRCLPLDNIDNLDYSAGWVEGQP-------SFYRAKFDISEPA 521

Query: 612 GPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQ 671
               ++    GKG A+VNG ++GR+W                               P  
Sbjct: 522 DTF-IDTTGFGKGVAFVNGTNVGRFWDK----------------------------GPIM 552

Query: 672 TLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
           TLY +P   +HPG N LV+ E  G   +KISL ++
Sbjct: 553 TLY-VPHGLLHPGTNELVMFETEGVYDAKISLRSE 586


>gi|423278914|ref|ZP_17257828.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
           610]
 gi|404585906|gb|EKA90510.1| hypothetical protein HMPREF1203_02045 [Bacteroides fragilis HMW
           610]
          Length = 769

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 105/332 (31%), Positives = 153/332 (46%), Gaps = 31/332 (9%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           + N T      +++GK   +++  +HY R     W   I   K  G+  I  YVFWN HE
Sbjct: 18  AQNFTIGKNTFLLNGKSFTVKAAELHYTRIPAPYWEHRIEMCKALGMNTICLYVFWNIHE 77

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              G++ F G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT 
Sbjct: 78  QTEGKFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIVLRTL 137

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAA 181
           +  F E    F+ ++   +    L  ++GG II+ QVENEYG    AY V  + YV    
Sbjct: 138 DPYFMERTAIFMKEVGKQLAP--LQITRGGNIIMVQVENEYG----AYAV-DKPYVSAIR 190

Query: 182 DTAVNLN-TSVPWVMCQ-----QEDAPDPIINTCNGFYCDG---------FTPNSPSKPI 226
           D   +   T VP   C        +  D ++ T N  +  G              P  P+
Sbjct: 191 DIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTIN--FGTGANIEQQFKRLKEARPDTPL 248

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--- 283
           M +E +SGWF  +G     RP + +   +    +   +F + YM  GGT FG   G    
Sbjct: 249 MCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISF-SLYMAHGGTTFGHWGGANNP 307

Query: 284 --PLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
               + +SYDYDAPI E G+    K+  LR+L
Sbjct: 308 AYSAMCSSYDYDAPISEPGWATD-KYFQLRDL 338



 Score = 42.0 bits (97), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/130 (26%), Positives = 55/130 (42%), Gaps = 37/130 (28%)

Query: 579 SLANSSFWKQGS--TLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRY 636
           S      +K G+  T+P      +YK TF   +  G   L++++ GKG  WVNG +IGR+
Sbjct: 505 SFVQDKKYKSGTAQTMPA-----YYKATFHLDKA-GDTFLDMSTWGKGMVWVNGIAIGRF 558

Query: 637 WSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGG 696
           W                               P QTL+ +P  W+  GEN +++ +  G 
Sbjct: 559 WEI----------------------------GPQQTLF-MPGCWLKEGENEIIVLDLKGP 589

Query: 697 DPSKISLLTK 706
           + + +  L K
Sbjct: 590 EKASVRGLKK 599


>gi|328956117|ref|YP_004373450.1| beta-galactosidase [Coriobacterium glomerans PW2]
 gi|328456441|gb|AEB07635.1| Beta-galactosidase [Coriobacterium glomerans PW2]
          Length = 597

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 101/314 (32%), Positives = 156/314 (49%), Gaps = 38/314 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DG+   ++SG+IHY R  P+ W   +   K  G   +ETY+ WN HEP + ++      
Sbjct: 12  MDGRPFQIRSGAIHYFRLHPDDWEHSLYNLKAMGFNTVETYIPWNMHEPHKDEFRITAET 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D  RF+    + GL+  +R  P+ CAEW +GG P WL    G++ R+ +  F E +  + 
Sbjct: 72  DFERFLGLASDLGLWAIVRPSPFICAEWEFGGLPAWLLAERGMRIRSNDPRFLERLALYY 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYG----NVEWAYGVGGELYVKWAADTAVNLNT 189
             ++  + +  +  ++G  II+ Q+ENEYG    + ++   V  +L V+   D  V L T
Sbjct: 132 DMLMPHLAKHQI--TRGANIIMMQIENEYGSYCEDSDYMRSV-RDLMVERGID--VKLCT 186

Query: 190 SV-PWVMCQQEDA--PDPIINTCN-------------GFYCDGFTPNSPSKPIMWTENYS 233
           S  PW  CQ+  +   D ++ T N             GF+ +    +  + P+M  E ++
Sbjct: 187 SDGPWRACQRAGSLIEDNVLATGNFGSHATENFAALKGFHKE----HGKTWPLMCMEFWA 242

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV------- 286
           GWF  +G +V  R  E+LA +V      G    N YM+ GGTNFG   G           
Sbjct: 243 GWFNRWGESVVRRDPEELARSVREALREGSI--NLYMFHGGTNFGFMNGCSARHDHDLHQ 300

Query: 287 ATSYDYDAPIDEYG 300
            TSYDYDAP+DE G
Sbjct: 301 ITSYDYDAPLDEAG 314


>gi|319945941|ref|ZP_08020191.1| beta-galactosidase [Streptococcus australis ATCC 700641]
 gi|417919516|ref|ZP_12563047.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
 gi|319748006|gb|EFW00250.1| beta-galactosidase [Streptococcus australis ATCC 700641]
 gi|342832897|gb|EGU67186.1| glycosyl hydrolase family 35 [Streptococcus australis ATCC 700641]
          Length = 595

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 105/323 (32%), Positives = 153/323 (47%), Gaps = 44/323 (13%)

Query: 23  SGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTV 82
           SG+IHY R   E W   +   K  G   +ETYV WN HEP RG ++FEG  DL  F++  
Sbjct: 21  SGAIHYFRIDREDWYHSLYNLKALGFNTVETYVPWNAHEPQRGHFHFEGNLDLEHFIQVA 80

Query: 83  QEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
           QE  L++ LR  P+ C+EW +GG P WL     ++ R+++  F EE+ R+  +++  + +
Sbjct: 81  QELDLYVILRPSPFICSEWEFGGLPAWL-IEKDLRIRSSDPAFLEEVARYYDELLPRVAK 139

Query: 143 ENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAP 202
             L   +GG I++ QVENEYG    +YG   + Y++   D  +  + + P       D P
Sbjct: 140 YQL--DRGGNILMMQVENEYG----SYG-EDKAYLRAIRDLMIERDITCPLFTS---DGP 189

Query: 203 DPIINTCNGFYCDG---------------------FTPNSPSKPIMWTENYSGWFLSFGY 241
                       DG                     F  +    P+M  E + GWF  +  
Sbjct: 190 WRATLRAGTLIEDGLFVTGNFGSRANYNFSQMKEFFAEHDRKWPLMCMEFWDGWFNRWKE 249

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVATSYDYD 293
            +  R  E+LA AV    + G    N YM+ GGTNFG         T   P V TSYDYD
Sbjct: 250 PIIKRDPEELAEAVHEVLQEGSI--NLYMFHGGTNFGFMNGCSARGTVDLPQV-TSYDYD 306

Query: 294 APIDEYGFIRQPKWGHLRELHKA 316
           A +DE G    PK+  ++++ K 
Sbjct: 307 ALLDEQG-NPTPKYDAVKKMMKT 328


>gi|294665218|ref|ZP_06730516.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
           ICPB 10535]
 gi|292605006|gb|EFF48359.1| beta-galactosidase [Xanthomonas fuscans subsp. aurantifolii str.
           ICPB 10535]
          Length = 613

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 115/322 (35%), Positives = 156/322 (48%), Gaps = 29/322 (9%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFSG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  NNDVAAFVREAAAQGLNIILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGE-LYVKWAADTAVNLN 188
           +L  + + +  + L    GGPII  QVENEYG+   + AY      +YVK   D A+ L 
Sbjct: 159 YLDALANQV--QPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G   
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGK-- 273

Query: 244 PFRPVEDLAFAVARFFE---TGGTFQNYYMYFGGTNFGRTAGGPL----------VATSY 290
           P    +  A   A  FE     G   + YM+ GGT+FG   G               TSY
Sbjct: 274 PHAATD--ARQQAEEFEWILRQGHSASLYMFIGGTSFGFMNGANFQNNPSDHYAPQTTSY 331

Query: 291 DYDAPIDEYGFIRQPKWGHLRE 312
           DYDA +DE G    PK+  +R+
Sbjct: 332 DYDAILDEAGHP-TPKFALMRD 352


>gi|357050580|ref|ZP_09111778.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
           30_1]
 gi|355381233|gb|EHG28360.1| hypothetical protein HMPREF9478_01761 [Enterococcus saccharolyticus
           30_1]
          Length = 593

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 148/314 (47%), Gaps = 35/314 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G    L SG+IHY R  P+ W   +   K  G   +ETYV WN HEP +G + FEG
Sbjct: 10  FLMNGSPFKLLSGAIHYFRVHPDDWRHSLYNLKALGFNTVETYVPWNLHEPHKGLFQFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  F+   QE GL++ LR  PY CAEW +GG P WL    G + R  +  +   +  
Sbjct: 70  ILDLEHFLSLAQELGLYVILRPSPYICAEWEFGGLPAWLLKESG-RLRACDPSYLAHVAE 128

Query: 132 F----LAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVG-GELYVKWAADTA 184
           +    L KII          S GG I++ QVENEYG+   E AY     E+ +    D  
Sbjct: 129 YYDVLLPKIIPYQ------LSHGGNILMIQVENEYGSYGEEKAYLRAIKEMLINRGIDMP 182

Query: 185 VNLNTSVPWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYS 233
           +   +  PW    +  +   D ++ T N             D F  ++   P+M  E + 
Sbjct: 183 L-FTSDGPWQAALRAGSLIEDDVLVTGNFGSRAKENFAAMQDFFDQHNKKWPLMCMEFWD 241

Query: 234 GWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGPLV 286
           GWF  +   +  R  +DLA +V    E G    N YM+ GGTNFG       R A     
Sbjct: 242 GWFNRWNEPIIRRDPDDLAESVKEALEIGSV--NLYMFHGGTNFGFMNGCSARGAVDLPQ 299

Query: 287 ATSYDYDAPIDEYG 300
            TSYDYDAP+DE G
Sbjct: 300 VTSYDYDAPLDEQG 313


>gi|417923406|ref|ZP_12566873.1| glycosyl hydrolase family 35 [Streptococcus mitis SK569]
 gi|342837055|gb|EGU71256.1| glycosyl hydrolase family 35 [Streptococcus mitis SK569]
          Length = 595

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 149/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G++ FEG  
Sbjct: 12  LDGKPFKILSGAIHYFRIPPEDWSHSLYNLKALGFNTVETYVAWNLHEPREGEFNFEGAL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++  Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L   +GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLSRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRHLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|453049630|gb|EME97211.1| beta-galactosidase [Streptomyces mobaraensis NBRC 13819 = DSM
           40847]
          Length = 584

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/313 (34%), Positives = 147/313 (46%), Gaps = 39/313 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           IDG+   L SG++HY R     WP  +   +  GL  +ETYV WN HEP+ G+ +  G  
Sbjct: 13  IDGREVRLLSGALHYFRVHEGHWPHRLAMLRAMGLNCVETYVPWNRHEPVEGRLHDVG-- 70

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L RF+     AGL+  +R GPY CAEW  GG P WL    G + RT++  F   +  +L
Sbjct: 71  ELGRFLDAAGAAGLYAIVRPGPYVCAEWENGGLPHWLTGRLGRRVRTSDPEFLRAVDGWL 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPW 193
             +   +        +GGP++L QVENEYG+    YG   + Y++       +    VP 
Sbjct: 131 EAVGAELTGRQF--GRGGPVVLVQVENEYGS----YG-SDQPYLEHLVGRLRDSGVVVPL 183

Query: 194 VMCQQEDAPDPIINTCNGFYCDGFTPN---------------SPSKPIMWTENYSGWFLS 238
           V     D P+  + T         T N                P+ P+M  E + GWF  
Sbjct: 184 VTS---DGPEDHMLTGGTVPGATATVNFGSGAREAFRVLRRHRPAGPLMCMEFWCGWFAH 240

Query: 239 FGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG---------GPL--VA 287
           +G A   R   + A A+    E G +  N YM  GGTNFG  AG         G L    
Sbjct: 241 WGGAPAARDAGEAAEALREVLECGASV-NVYMAHGGTNFGGWAGANRAGAEHRGALRPTT 299

Query: 288 TSYDYDAPIDEYG 300
           TSYDYDAP+DEYG
Sbjct: 300 TSYDYDAPVDEYG 312


>gi|422877900|ref|ZP_16924370.1| beta-galactosidase [Streptococcus sanguinis SK1056]
 gi|332358593|gb|EGJ36417.1| beta-galactosidase [Streptococcus sanguinis SK1056]
          Length = 592

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 107/333 (32%), Positives = 155/333 (46%), Gaps = 32/333 (9%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+I Y R  P+ W + +   K  G   +ETY+ W  HEP  GQ+  EG  
Sbjct: 12  LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D   + K V+E GL+L +R  PY CAE+++GG P WL   P ++ R  +  F E++  F 
Sbjct: 72  DFEAYFKLVKEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
             +   +      + QGGPI++ QVENEYG+   + AY       +K    T     +  
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSYAEDKAYMRSIAQMMKVRGVTVPLFTSDG 189

Query: 192 PWVMCQQ-----ED--------APDPIINTCNGFYCDGFTPNSPSK-PIMWTENYSGWFL 237
            W+   +     ED           P  NT N      F      K P+M TE + GWF 
Sbjct: 190 TWIEALESGTLIEDDIFVTGNFGSQPKENTDN---LRAFMERYGKKWPLMCTEFWDGWFS 246

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVATS 289
            +   + +R  EDLA  V    + G    N ++  GGTNFG        +T   P + TS
Sbjct: 247 RWSEEIVWREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI-TS 303

Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           YD+DAPI E+G   +  +   R  H+     E+
Sbjct: 304 YDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336


>gi|188990653|ref|YP_001902663.1| beta-galactosidase [Xanthomonas campestris pv. campestris str.
           B100]
 gi|167732413|emb|CAP50607.1| exported beta-galactosidase [Xanthomonas campestris pv. campestris]
          Length = 680

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/320 (34%), Positives = 152/320 (47%), Gaps = 25/320 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   + SG+IH+ R     W + ++K++  GL  +ETYVFWN  EP +GQ+ F  
Sbjct: 106 FVRDGKPYQVLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFNA 165

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPYACAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 166 NNDVAAFVREAAAQGLNVILRPGPYACAEWETGGYPAWLFGKDNIRVRSRDPRFLAASQA 225

Query: 132 FLAKIIDLMKQEN-LFASQGGPIILAQVENEYGNVEWAYGVGGE---LYVKWAADTAVNL 187
           +L  +    KQ + L    GGPII  QVENEYG+ +  +    +   +YVK   D A+ L
Sbjct: 226 YLDAV---SKQVHPLLNHNGGPIIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDAL-L 281

Query: 188 NTSVPWVMCQQEDAPD--PIINTCNG---FYCDGFTPNSPSKPIMWTENYSGWFLSFGYA 242
            TS    M      PD   ++N   G      D      P +P M  E ++GWF  +G  
Sbjct: 282 FTSDGADMLANGTLPDTLAVVNFAPGEAKSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKP 341

Query: 243 VPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDY 292
                 +     +      G +  N YM+ GGT+FG   G               TSYDY
Sbjct: 342 HASTDAKQQTEELEWILRQGHS-ANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDY 400

Query: 293 DAPIDEYGFIRQPKWGHLRE 312
           DA +DE G    PK+  +R+
Sbjct: 401 DAILDEAGRA-TPKFALMRD 419


>gi|15837442|ref|NP_298130.1| beta-galactosidase [Xylella fastidiosa 9a5c]
 gi|9105744|gb|AAF83650.1|AE003923_8 beta-galactosidase [Xylella fastidiosa 9a5c]
          Length = 612

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 107/313 (34%), Positives = 149/313 (47%), Gaps = 34/313 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            + DG+   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  E   GQ+ F G
Sbjct: 35  FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPY CAEW  GGFP WL   P ++ R+ +  F +  +R
Sbjct: 95  NNDISAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
           +L  +   ++   L    GGPII  QVENEYG+   +  Y           G+GG L   
Sbjct: 155 YLEALGTQVRP--LLNGNGGPIIAVQVENEYGSYGDDHGYLQAVRALFIKAGLGGALL-- 210

Query: 179 WAADTAVNL-NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
           + AD A  L N ++P V+     AP            D      P +P +  E ++GWF 
Sbjct: 211 FTADGAQMLGNGTLPDVLAAVNVAPGEAKQA-----LDKLATFHPGQPQLVGEYWAGWFD 265

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-----RTAGGPL-----VA 287
            +G        +  A  +      G +  N YM+ GGT+FG        GGP        
Sbjct: 266 QWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPSDHYSPQT 324

Query: 288 TSYDYDAPIDEYG 300
           TSYDYDA +DE G
Sbjct: 325 TSYDYDAALDEAG 337


>gi|345880280|ref|ZP_08831835.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
 gi|343923634|gb|EGV34320.1| hypothetical protein HMPREF9431_00499 [Prevotella oulorum F0390]
          Length = 621

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/334 (31%), Positives = 159/334 (47%), Gaps = 50/334 (14%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE-GRF 73
           DGK   + SG +HY R     W   ++  K  GL  + +YVFWN+HE   G + ++ G  
Sbjct: 39  DGKPTQIHSGELHYARVPAPYWRHRLQMMKAMGLNAVTSYVFWNHHETSPGVWDWQTGNH 98

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           ++  F+K   E GL + LR GPY CAEW +GG+P WL    G+  RT N PF +  + ++
Sbjct: 99  NIRNFIKIAGEEGLMVILRPGPYCCAEWEFGGYPWWLPKAKGLVIRTDNKPFLDSCRVYI 158

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAADTAVNLNTSVP 192
            ++ + ++  +L  ++GGP+++ Q ENE+G+ V     +  E++ K+AA           
Sbjct: 159 NQLANQVR--DLQITKGGPVVMVQAENEFGSYVAQRKDIPLEVHKKYAAQ---------- 206

Query: 193 WVMCQQEDAP-DPIINTCNGFY------CDGFTPNSPSK------------------PIM 227
            +  Q  DA  D  + T +G +       +G  P +  +                  P M
Sbjct: 207 -IRQQLLDAGFDIPMFTSDGSWLFKGGSIEGALPTANGEGNIEKLKQVVNEYHGGVGPYM 265

Query: 228 WTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPLV- 286
             E Y GW   +    P    E +     ++ + G +F NYYM  GGTNFG T G     
Sbjct: 266 VAEFYPGWLSHWAEPFPRVSTESVVKQTKKYLDNGVSF-NYYMVHGGTNFGFTTGANYSN 324

Query: 287 -------ATSYDYDAPIDEYGFIRQPKWGHLREL 313
                   TSYDYDAPI E G+  + K+  +R L
Sbjct: 325 ATNLQPDMTSYDYDAPISEAGWATE-KYNAIRAL 357


>gi|302526862|ref|ZP_07279204.1| beta-galactosidase [Streptomyces sp. AA4]
 gi|302435757|gb|EFL07573.1| beta-galactosidase [Streptomyces sp. AA4]
          Length = 609

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 100/314 (31%), Positives = 148/314 (47%), Gaps = 35/314 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   + SG+IHY R  P+ W + + + K  GL  +ETYV WN+H+P  G+  F G
Sbjct: 40  FLLDGKPFQIVSGAIHYFRLRPDQWHDRLSRLKALGLNTVETYVAWNFHQPTPGRADFRG 99

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  F++T  E G  + +R  PY CAEW +GG P WL     ++ R  +  + + +  
Sbjct: 100 DRDLPAFIRTAGELGFQVIVRPSPYICAEWEFGGLPAWLLADRNMELRCADPAYLKAVDA 159

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
           +  ++I  +    L A  GGPI+  Q+ENEYG+   + +Y           G+   L+V 
Sbjct: 160 WYDQLIPQLTP--LEAQHGGPIVAVQIENEYGSYGNDTSYLAHLRDSLRSRGITSLLFVA 217

Query: 179 WAADTAVNLNTSVPWVM--CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWF 236
             A         +P  +     +  P P I     F         P  P+M  E + GWF
Sbjct: 218 DGASEFFMRFGELPGTLEAGTGDGDPAPSIAALKAF--------RPGAPVMMAEYWDGWF 269

Query: 237 LSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG--------PLVAT 288
             +G        +  A  + +   TG +  N YM  GGTN+G TAG         P V T
Sbjct: 270 DHWGEPHHTTDPQQTAAHIDQLLATGASV-NLYMACGGTNYGFTAGANTSGLQYQPTV-T 327

Query: 289 SYDYDAPIDEYGFI 302
           SYDYD+P+ E G +
Sbjct: 328 SYDYDSPVGEAGDV 341


>gi|315499712|ref|YP_004088515.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
 gi|315417724|gb|ADU14364.1| glycoside hydrolase family 35 [Asticcacaulis excentricus CB 48]
          Length = 613

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/352 (31%), Positives = 162/352 (46%), Gaps = 39/352 (11%)

Query: 3   ANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEP 62
           +  T      ++DG+   L +G +HYPR   E+W + +RK K  GL  + TY FW+ HE 
Sbjct: 30  SRFTIKDDQFLLDGQPLHLMAGEMHYPRIPRELWRDRLRKLKALGLNTLSTYTFWSAHEK 89

Query: 63  IRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTN 122
             G Y F G  D+  +VK  QE GL + LR GPYACAEW+ GG+P W    P I+ R+ +
Sbjct: 90  KPGVYDFSGNLDVAAWVKMAQEEGLHVLLRPGPYACAEWDNGGYPAWFLNDPDIRPRSLD 149

Query: 123 ----NPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYG-------------NV 165
                P  + +KR   ++       +L   +GGP+++ Q+ENEYG             + 
Sbjct: 150 PRYMGPSGQWLKRLGQEVA------HLEIDKGGPVLMTQIENEYGSYGNDLNYMRAVRDQ 203

Query: 166 EWAYGVGGELYVKWAADTAVNLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
             A G  G+LY    A  AV  N ++P +            +   G +   +       P
Sbjct: 204 VRAAGFSGQLYTVDGA--AVIENGALPELFNGINFG---TYDKAEGEFAR-YAKFKTKGP 257

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL 285
            M TE + GWF  FG       +  L  ++    +   +F ++YM  GGT+F   AG   
Sbjct: 258 RMCTELWGGWFDHFGEVHSNMEISPLMESLKWMLDNRISF-SFYMLHGGTSFAFDAGANF 316

Query: 286 VAT--------SYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEEYLISSDP 329
             T        SYDYDA +DE G +  PK+   REL +     E +    +P
Sbjct: 317 HKTHGYQPDISSYDYDAMLDEAGRV-TPKYEAARELFRRYLPPERFTALPEP 367



 Score = 42.7 bits (99), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 43/130 (33%), Positives = 61/130 (46%), Gaps = 21/130 (16%)

Query: 509 KIELNEGINTLDILSMMVGLQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGV 568
           ++ L  G +TLD+L   +G  NYG        GL   + +   NGK  L+   W +Q   
Sbjct: 455 EVSLKAG-DTLDLLIDAMGHVNYGDQIGKDQKGLIGPVTL---NGK-PLTG--WTHQG-- 505

Query: 569 EGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWV 628
               + LD +S+    F +Q    P      +Y+ TF   E  G   L+L   GKG  WV
Sbjct: 506 ----VPLDDLSVLR--FKRQRVNGPA-----FYRGTFETSEA-GFTFLDLRGWGKGYVWV 553

Query: 629 NGQSIGRYWS 638
           NG ++GRYWS
Sbjct: 554 NGHNLGRYWS 563


>gi|237734327|ref|ZP_04564808.1| beta-galactosidase [Mollicutes bacterium D7]
 gi|365831197|ref|ZP_09372750.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
 gi|374624872|ref|ZP_09697289.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
           8_2_54BFAA]
 gi|229382557|gb|EEO32648.1| beta-galactosidase [Coprobacillus sp. D7]
 gi|365262188|gb|EHM92085.1| hypothetical protein HMPREF1021_01514 [Coprobacillus sp. 3_3_56FAA]
 gi|373916155|gb|EHQ47903.1| hypothetical protein HMPREF0978_00609 [Coprobacillus sp.
           8_2_54BFAA]
          Length = 584

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 102/334 (30%), Positives = 154/334 (46%), Gaps = 43/334 (12%)

Query: 9   HRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYY 68
           ++   I+G +  + SG++HY R  PE W + +   K  G   +ETYV WN HEP +G+Y 
Sbjct: 7   NKEFFINGNKVKIISGAVHYFRIVPEYWRDTLLDLKAMGCNTVETYVPWNLHEPYQGKYD 66

Query: 69  FEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEE 128
           F G  D+  F+K  +E  LF+ LR  PY CAEW  GG P WL   P I+ RT +  + + 
Sbjct: 67  FSGIKDIETFLKLAEELELFVILRASPYICAEWEMGGLPAWLLKYPRIRLRTNDKQYLKC 126

Query: 129 MKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLN 188
           + ++ + ++  + +  +  +Q GPIILAQ+ENEYG    +YG   E Y+           
Sbjct: 127 LDQYFSILLPKLSKYQI--TQNGPIILAQLENEYG----SYGEDKE-YLLAVYQMMRKYG 179

Query: 189 TSVPWV--------------MCQQEDAP--------DPIINTCNGFYCDGFTPNSPSKPI 226
             VP                + +++  P           I     F       +  + P+
Sbjct: 180 IEVPLFTADGTWHEALNAGSLLEKKVFPTGNFGSQAKENITVLKKF----MESHQITAPL 235

Query: 227 MWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------R 279
           M  E + GWF  +   +  R  ++   +       G    N+YM+ GGTNFG       R
Sbjct: 236 MCMEFWDGWFNRWNQEIIKRDPQEFVNSAQEMLSLGSV--NFYMFQGGTNFGWMNGCSAR 293

Query: 280 TAGGPLVATSYDYDAPIDEYGFIRQPKWGHLREL 313
                   TSYDYDA + EYG  +  K+  LRE+
Sbjct: 294 KEHDLPQITSYDYDAILTEYG-AKTEKYHLLREV 326


>gi|395520729|ref|XP_003764476.1| PREDICTED: beta-galactosidase-1-like protein 2 [Sarcophilus
           harrisii]
          Length = 704

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 107/322 (33%), Positives = 158/322 (49%), Gaps = 25/322 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G    +  GSIHY R   E W + + K K  GL  + TY+ WN HEP RG++ F G
Sbjct: 122 FLLEGSHFQIFGGSIHYFRVPREYWRDRLLKLKACGLNTLTTYIPWNLHEPERGKFNFSG 181

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+   + GL++ LR GPY C+EW+ GG P WL     ++ RTT   F + + R
Sbjct: 182 NLDVEAFVQMAADIGLWVILRPGPYICSEWDLGGLPSWLLQDSSMELRTTYAGFLKAVDR 241

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   +   L   QGGPII  QVENEYG+ +         Y+ +     ++   + 
Sbjct: 242 YFNHLIP--RVVPLQYKQGGPIIAVQVENEYGSYD-----KDSNYMPYIKKALMSRGINE 294

Query: 192 PWVMCQQEDA-----PDPIINTCNGFYCDGFTPN-----SPSKPIMWTENYSGWFLSFGY 241
             +    +D       + ++ T N  + D    N       +KP M TE ++GWF ++G 
Sbjct: 295 LLMTSDNKDGLSGGYLEGVLATVNLKHVDSMIFNYLHSFQENKPTMVTEYWTGWFDTWGG 354

Query: 242 AVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPLVA--TSYDYDAP 295
                  +D+   V+   + G +  N YM+ GGTNFG   G    G  +A  TSYDYDA 
Sbjct: 355 PHNIVDADDVVVTVSSIIQMGASL-NLYMFHGGTNFGFMNGAQHFGEYLADVTSYDYDAI 413

Query: 296 IDEYGFIRQPKWGHLRELHKAI 317
           + E G    PK+  LRE    I
Sbjct: 414 LTEAG-DYTPKFFKLREFFSTI 434


>gi|294779195|ref|ZP_06744602.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
 gi|294453706|gb|EFG22101.1| glycosyl hydrolase family 35 [Enterococcus faecalis PC1.1]
          Length = 592

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 70  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345


>gi|322378066|ref|ZP_08052553.1| glycosyl hydrolase, family 35 [Streptococcus sp. M334]
 gi|321281048|gb|EFX58061.1| glycosyl hydrolase, family 35 [Streptococcus sp. M334]
          Length = 595

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 150/307 (48%), Gaps = 25/307 (8%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGAQ 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL RF++  Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLERFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKDMRIRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
            +++  +    L   +GG I++ QVENEYG+   + AY       ++    T     +  
Sbjct: 131 DQLLPRLVPHLL--DKGGNILMMQVENEYGSYGEDKAYLRAIRQLMEERGVTCPLFTSDG 188

Query: 192 PWVMCQQEDA--PDPIINTCN---------GFYCDGFTPNSPSKPIMWTENYSGWFLSFG 240
           PW    +      D +  T N             + F  +    P+M  E + GWF  + 
Sbjct: 189 PWRATLKAGTLIEDDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 241 YAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG----GPL---VATSYDYD 293
             +  R  ++LA AV    E G    N YM+ GGTNFG   G    G L     TSYDYD
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQVTSYDYD 306

Query: 294 APIDEYG 300
           A +DE G
Sbjct: 307 ALLDEEG 313


>gi|194213011|ref|XP_001503026.2| PREDICTED: beta-galactosidase-1-like protein 3-like [Equus
           caballus]
          Length = 880

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 155/324 (47%), Gaps = 29/324 (8%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G + ++  GSIHY R   E W + + K K  G   + TYV WN HEP RG++ F G
Sbjct: 248 FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGRFDFSG 307

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  FV T  E GL++ LR GPY C+E + GG P  L   P +  RTT+  F E + +
Sbjct: 308 NLDLEAFVLTAAEIGLWVILRPGPYICSEIDLGGLPSRLLQDPQVNLRTTDKGFVEAVDK 367

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +I   +  +L   +GGPII  QVENEYG+         + Y+ +       L   +
Sbjct: 368 YFDHLIS--RVVHLQYRKGGPIIAVQVENEYGSF-----YKDKDYMPYLQQAL--LKRGI 418

Query: 192 PWVMCQQEDAPDPIINTCNG---------FYCDGFT---PNSPSKPIMWTENYSGWFLSF 239
             ++   ++  D +     G         F  D F         KPIM  E + GWF ++
Sbjct: 419 VELLLTSDNVDDVLKGYIKGVLATINMKKFRKDAFQHLYKVQRDKPIMIMEYWVGWFDTW 478

Query: 240 GYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------PLVATSYDYD 293
           G     +   D+   V+ F +   +F N YM+ GGTNFG   G         V TSYDYD
Sbjct: 479 GSKHEVKDAGDVKNTVSEFIKFEISF-NVYMFHGGTNFGFINGAINFVKHAGVVTSYDYD 537

Query: 294 APIDEYGFIRQPKWGHLRELHKAI 317
           A + E G   + K+  LR+L  +I
Sbjct: 538 AVLTEAGDYTK-KYFKLRKLFGSI 560


>gi|256957323|ref|ZP_05561494.1| beta-galactosidase [Enterococcus faecalis DS5]
 gi|257077681|ref|ZP_05572042.1| beta-galactosidase [Enterococcus faecalis JH1]
 gi|307270129|ref|ZP_07551446.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
 gi|422710565|ref|ZP_16767610.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
 gi|422721468|ref|ZP_16778057.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
 gi|422867159|ref|ZP_16913760.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
 gi|256947819|gb|EEU64451.1| beta-galactosidase [Enterococcus faecalis DS5]
 gi|256985711|gb|EEU73013.1| beta-galactosidase [Enterococcus faecalis JH1]
 gi|306513498|gb|EFM82113.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX4248]
 gi|315031294|gb|EFT43226.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0017]
 gi|315035298|gb|EFT47230.1| glycosyl hydrolase family 35 [Enterococcus faecalis TX0027]
 gi|329577710|gb|EGG59137.1| putative beta-galactosidase [Enterococcus faecalis TX1467]
          Length = 593

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 11  FLLNGQPIKIISGAIHYFRMTPSQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 70

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 71  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 130

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 131 YFQVL--LPKLAPLQITQGGPVIMMQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 183

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 184 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 241

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 242 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 300 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 346


>gi|423212381|ref|ZP_17198910.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694827|gb|EIY88053.1| hypothetical protein HMPREF1074_00442 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 725

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 106/308 (34%), Positives = 155/308 (50%), Gaps = 34/308 (11%)

Query: 26  IHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEA 85
           +HYPR   E W + +++++  GL  +  YVFWN+HE   G++ F G+ D+  FV+T QE 
Sbjct: 1   MHYPRIPHEYWRDRLKRARAMGLNTVSAYVFWNFHERQPGEFDFTGQADIAEFVRTAQEE 60

Query: 86  GLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ-EN 144
           GL++ LR GPY CAEW++GG+P WL     + +R+ +  F    +R+   I +L KQ  +
Sbjct: 61  GLYVILRPGPYVCAEWDFGGYPSWLLKEKDMIYRSKDPRFLSYCERY---IKELGKQLSS 117

Query: 145 LFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVPWVMCQ---QEDA 201
           L  + GG II+ QVENEYG    +Y    E Y+    D       +VP   C    Q +A
Sbjct: 118 LTINNGGNIIMVQVENEYG----SYAADKE-YLAAIRDMIKEAGFNVPLFTCDGGGQVEA 172

Query: 202 P--DPIINTCNGFYCDGF----TPNSPSKPIMWTENYSGWFLSFGY---AVPF-RPVEDL 251
              +  + T NG + +             P    E Y  WF  +G    +V + RP E L
Sbjct: 173 GHIEGALPTLNGVFGEDIFKVVDNYHKGGPYFVAEFYPAWFDEWGKRHSSVAYERPAEQL 232

Query: 252 AFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEYGFIRQP 305
            + ++      G   + YM+ GGTNF  T G           TSYDYDAP+ E+G    P
Sbjct: 233 DWMLSH-----GVSVSMYMFHGGTNFWYTNGANTGGGYQPQPTSYDYDAPLGEWGNC-YP 286

Query: 306 KWGHLREL 313
           K+   RE+
Sbjct: 287 KYHAFREV 294


>gi|414888317|tpg|DAA64331.1| TPA: hypothetical protein ZEAMMB73_578897 [Zea mays]
          Length = 284

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 134/298 (44%), Gaps = 32/298 (10%)

Query: 528 LQNYGAWFDVAGAGLFSVILIDLKNGKRDLSSGEWIYQVGVEGEYIGLDKISLANSSFWK 587
           LQ+ G       +G+   ++  L  G  DL    W ++  +EGE   +          WK
Sbjct: 6   LQDSGGELAEVKSGIQECLIQGLNTGTLDLQVNGWGHKAALEGEDKEIYSEKGVGKVQWK 65

Query: 588 QGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGC 647
                   ++  WYK  F  P+G  P+ L+++SM KG  +VNG+ +GRYW +Y       
Sbjct: 66  PAEN---GRAATWYKRYFDEPDGDDPVVLDMSSMDKGMIFVNGEGVGRYWVSY------- 115

Query: 648 TKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTKT 707
                          +   G P+Q LYHIPR ++   +NLLV+ EE  G P  I + T T
Sbjct: 116 ---------------RTLAGTPSQALYHIPRPFLKSKDNLLVVFEEEMGKPDGILVQTVT 160

Query: 708 GQHICSFVSEADPPPVDSW-----KPNLGVVSSSPQVRLACERGWHIAAINFASYGIPEG 762
              IC F+SE +P  + +W     K  L     S +  L C     I  + FAS+G PEG
Sbjct: 161 RDDICLFISEHNPGQIKTWDTDGDKIKLIAEDHSRRGTLMCPPEKTIQEVVFASFGNPEG 220

Query: 763 NCGSFRPGACHM-DVLPIVQKACVGQIECSIPVSSAYLGVSAGACPGLLKALAVEAHC 819
            CG+F  G CH  +   IV+K C+G+  C +PV     G     C      L V+  C
Sbjct: 221 MCGNFTVGTCHTPNAKQIVEKECLGKPSCMLPVDHTVYGADIN-CQSTTATLGVQVRC 277


>gi|187736173|ref|YP_001878285.1| beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
 gi|187426225|gb|ACD05504.1| Beta-galactosidase [Akkermansia muciniphila ATCC BAA-835]
          Length = 780

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 156/320 (48%), Gaps = 36/320 (11%)

Query: 6   TYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRG 65
           + +    ++DGK   + SG +HYPR   + W +  ++ K  G+  + TY+FWN HEP  G
Sbjct: 35  STNQENFLMDGKPVKIISGEMHYPRVPRQHWKDRFQRIKAMGMNTVCTYLFWNVHEPEPG 94

Query: 66  QYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPF 125
           ++ F G  D V F+K  Q+AGL++ +R GPY CAEW +GGFP WL     ++ R+ +  F
Sbjct: 95  KWDFSGNLDFVEFIKEAQKAGLWVIVRPGPYVCAEWEFGGFPGWLLKDEDLKVRSQDPRF 154

Query: 126 KEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAV 185
            E    +L K+  ++  E L  ++GGPII+AQVENEYG    +YG   + YVK   D   
Sbjct: 155 LEPAMAYLKKVCSML--EPLQITKGGPIIMAQVENEYG----SYGSDKD-YVKKHLDV-- 205

Query: 186 NLNTSVPWVMCQQEDAPD-------------PIIN---TCNGFYCDGFTPNSPSKPIMWT 229
            +   +P V+    D P+             P +N      G + +    +    P +  
Sbjct: 206 -IRKELPGVVPFTSDGPNDWMIKNGTLPGVVPAMNFGGGAKGAFAN-LEKHKGKTPRING 263

Query: 230 ENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG------ 283
           E + GWF  +G        E     +    E   +  N +M  GGT+FG   G       
Sbjct: 264 EFWVGWFDHWGKPKNGGSTEGFNRDLKWMLENNVS-PNLFMAHGGTSFGFMNGANWEGAY 322

Query: 284 -PLVATSYDYDAPIDEYGFI 302
            P V T+YDY API E G +
Sbjct: 323 TPDV-TNYDYGAPISENGTL 341


>gi|419456662|ref|ZP_13996611.1| beta-galactosidase family protein [Streptococcus pneumoniae
           GA02254]
 gi|379533348|gb|EHY98561.1| beta-galactosidase family protein [Streptococcus pneumoniae
           GA02254]
          Length = 595

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 105/322 (32%), Positives = 151/322 (46%), Gaps = 55/322 (17%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+IHY R  PE W   +   K  G   +ETYV WN HEP  G+++FEG  
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPREGEFHFEGDL 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           DL +F++  Q+ GL+  +R  P+ CAEW +GG P WL     ++ R+++  + E + R+ 
Sbjct: 72  DLEKFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRIRSSDPAYIEAVGRYY 130

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK-- 178
            +++  +    L    GG I++ QVENEYG+   + AY           GV   L+    
Sbjct: 131 DQLLPRLVSRLL--DNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEECGVTCPLFTSDG 188

Query: 179 -WAADTAV------------NLNTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
            W A   V            N  +  P+   Q ++                F  +    P
Sbjct: 189 PWRATLKVGTLIEEDLFVTGNFGSKAPYNFSQMQEF---------------FDEHGKKWP 233

Query: 226 IMWTENYSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAG--- 282
           +M  E + GWF  +   +  R  ++LA AV    E G    N YM+ GGTNFG   G   
Sbjct: 234 LMCMEFWDGWFNRWKEPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSA 291

Query: 283 -GPL---VATSYDYDAPIDEYG 300
            G L     TSYDYDA +DE G
Sbjct: 292 RGTLDLPQVTSYDYDALLDEEG 313



 Score = 40.8 bits (94), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 82/205 (40%), Gaps = 57/205 (27%)

Query: 513 NEGINTLDILSMMVGLQNYGAWF--DVAGAGLFSVILIDLK---NGKRDLSSGEWIYQVG 567
            +G++ LDIL   +G  NYG  F  D    G+ + +  DL    N K         Y + 
Sbjct: 437 KKGLSRLDILIENMGRVNYGHKFLADTQRKGIRTGVCKDLHFLLNWKH--------YPLP 488

Query: 568 VEGEYIGLDKISLANSSFWKQGSTLPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAW 627
           ++      +KI  +    W QG          +Y   F   E K    L+L+  GKG A+
Sbjct: 489 LDNP----EKIDFSKG--WTQGQP-------AFYAYDFTVEEPKDTY-LDLSEFGKGVAF 534

Query: 628 VNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENL 687
           VNGQ++GR+W+                              P  +LY IP  ++  G N 
Sbjct: 535 VNGQNLGRFWNV----------------------------GPTLSLY-IPHCYLKEGANR 565

Query: 688 LVIHEELGGDPSKISLLTK-TGQHI 711
           ++I E  G    +I L  K T +HI
Sbjct: 566 IIIFETEGQYKEEIHLTRKPTLKHI 590


>gi|325922356|ref|ZP_08184130.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
 gi|325547138|gb|EGD18218.1| beta-galactosidase [Xanthomonas gardneri ATCC 19865]
          Length = 613

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 109/319 (34%), Positives = 150/319 (47%), Gaps = 23/319 (7%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            V DGK   L SG+IH+ R   E W + ++K++  GL  +ETYVFWN  EP +GQ+ F G
Sbjct: 39  FVRDGKPYQLLSGAIHFQRIPREYWKDRLQKARALGLNTVETYVFWNLVEPQQGQFDFAG 98

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPY CAEW  GG+P WL     I+ R+ +  F    + 
Sbjct: 99  NNDVAAFVREAAAQGLNVILRPGPYTCAEWEAGGYPAWLFGKDNIRVRSRDPRFLAASQA 158

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGE---LYVKWAADTAVNLN 188
           +L  +   +    L    GGPII  QVENEYG+ +  +    +   +YVK   D A+ L 
Sbjct: 159 YLDAVSKQV--HPLLNHNGGPIIAVQVENEYGSYDDDHAYMADNRAMYVKAGFDDAL-LF 215

Query: 189 TSVPWVMCQQEDAPD--PIINTCNGFYCDGF---TPNSPSKPIMWTENYSGWFLSFGYAV 243
           TS    M      PD   ++N   G     F       P +P M  E ++GWF  +G   
Sbjct: 216 TSDGADMLANGTLPDTLAVVNFAPGEAKTAFEKLIKFRPEQPRMVGEYWAGWFDHWGKPH 275

Query: 244 PFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL----------VATSYDYD 293
                +            G +  N YM+ GGT+FG   G               TSYDYD
Sbjct: 276 ASTDAKQQTEEFEWILRQGHS-ANLYMFIGGTSFGFMNGANFQGNPSDHYAPQTTSYDYD 334

Query: 294 APIDEYGFIRQPKWGHLRE 312
           A +DE G    PK+  +R+
Sbjct: 335 AILDEAGRP-TPKFALMRD 352


>gi|295113973|emb|CBL32610.1| Beta-galactosidase [Enterococcus sp. 7L76]
          Length = 592

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 165/352 (46%), Gaps = 44/352 (12%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            +++G+   + SG+IHY R TP  W + +   K  G   +ETY+ WN HEP  G Y FEG
Sbjct: 10  FLLNGQPIKIISGAIHYFRMTPRQWEDSLYNLKALGANTVETYIPWNIHEPEEGVYDFEG 69

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             ++  FV+  ++  L + LR   Y CAEW +GG P WL     ++ R+T+  F  +++ 
Sbjct: 70  MKNIEAFVRLAEKLNLLVILRPSAYICAEWEFGGLPAWLLKEKDVRLRSTDPIFMTKVRN 129

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +   +  L K   L  +QGGP+I+ QVENEYG    +YG+  + Y++        L   V
Sbjct: 130 YFQVL--LPKLAPLQITQGGPVIMIQVENEYG----SYGM-EKAYLRQTKQIMEELGIEV 182

Query: 192 PWVMCQQEDAPDPIINTCNGFYCDGF--------------------TPNSPSKPIMWTEN 231
           P  +   + A + +++       D F                    T +    P+M  E 
Sbjct: 183 P--LFTSDGAWEEVLDAGTLIEEDVFVTGNFGSHSKENAAVLKKFMTRHGKKWPLMCMEY 240

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-------RTAGGP 284
           + GWF  +G  V  R   DLA  V      G    N YM+ GGTNFG       R A   
Sbjct: 241 WDGWFNRWGEPVIQREGTDLAKEVKDMLAVGSL--NLYMFHGGTNFGFYNGCSARGAKDL 298

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK-LCEEYLISSDPTHQKLG 335
              TSYDYDA + E G   +  +     + KAIK +C E +  + P  +KLG
Sbjct: 299 PQVTSYDYDALLTEAGEPTEKYYA----VQKAIKEVCPE-VWQAQPRTKKLG 345


>gi|324509196|gb|ADY43870.1| Beta-galactosidase [Ascaris suum]
          Length = 639

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 102/337 (30%), Positives = 164/337 (48%), Gaps = 26/337 (7%)

Query: 2   SANVTYDHRALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHE 61
           S ++ Y ++  ++DG+     SGSIHY R  P+ W + + + +  GL  I+ Y+ WN+HE
Sbjct: 26  SFSIDYVNKRFLLDGQPFRYISGSIHYFRVHPDQWNDRLSRMRAAGLNAIQFYIPWNFHE 85

Query: 62  PIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTT 121
              G   F+G  ++ RF+    +  L+  +RIGPY C EW  GG P WL     I+ RT+
Sbjct: 86  IYEGVIGFDGGRNITRFLSLAAQNELYALVRIGPYICGEWENGGLPWWLLKYDDIKMRTS 145

Query: 122 NNPFKEEMKRFLAKIIDLMKQENLFASQGGPIILAQVENEYGNV-------------EWA 168
           +  F   ++R+   ++ ++K        GGPI++ QVENEYG+              +  
Sbjct: 146 DKRFIRAVERWFGVLLPILKPS--LRKNGGPILMIQVENEYGSFTEGCDRKYTTFLRDLT 203

Query: 169 YGVGGELYVKWAADTAVNLNT---SVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKP 225
               G+  V +  D A N +    S+P V    +  P+        F         P+ P
Sbjct: 204 IKHLGDDVVLYTTDGANNQSLKCGSIPGVFATVDFGPNSEEQIDKNFATQ--RSYEPNGP 261

Query: 226 IMWTENYSGWFLSFGYAVPFRP-VEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGP 284
           ++ +E Y GW +++       P V+++       F+ G +F NYYM++GGTNF    G  
Sbjct: 262 LVNSEFYPGWIVTWSQKGRIDPSVDEIINGSKYMFKLGASF-NYYMFYGGTNFAFWNGAE 320

Query: 285 L---VATSYDYDAPIDEYGFIRQPKWGHLRELHKAIK 318
               V TSYDY AP+ E   I + K+  +R   K+I+
Sbjct: 321 TTSAVITSYDYFAPLTEAADINE-KFVAIRNWIKSIE 356


>gi|397689967|ref|YP_006527221.1| Beta-galactosidase [Melioribacter roseus P3M]
 gi|395811459|gb|AFN74208.1| Beta-galactosidase [Melioribacter roseus P3M]
          Length = 772

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 102/314 (32%), Positives = 149/314 (47%), Gaps = 36/314 (11%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            ++DGK   ++ G +H+ R   E W   I+  K  G+  I  Y+FWN+HE   G + ++G
Sbjct: 31  FLLDGKPFQIRCGELHFARIPKEYWRHRIKMMKAMGMNTICAYLFWNFHERTPGNFKWDG 90

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+ +F K  QE GL++ LR GPY CAEW  GG P WL     I+ RT +  F    + 
Sbjct: 91  EADVAQFCKIAQEEGLWVILRPGPYVCAEWEMGGLPWWLLKNENIKLRTKDPLFINASRN 150

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSV 191
           +L ++  ++    L  + GGPIIL QVENE+G     +      Y+    D  +    +V
Sbjct: 151 YLMEVGRVLAP--LQITNGGPIILVQVENEHG-----FYADDPEYMGIIKDAILEAGFNV 203

Query: 192 PWVMCQQEDAPDPIINTCNGFYCD-------GFTPNS---------PSKPIMWTENYSGW 235
           P   C      +P  +   G+  D       G  P           P  P+M  E YSGW
Sbjct: 204 PLFAC------NPTYHLEKGYRKDIFPVVNFGSNPEEAFRALRKILPEGPLMCGEFYSGW 257

Query: 236 FLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSY 290
           F ++G    F  ++     +    +TG +F + YM  GGT FG  AG      P V +SY
Sbjct: 258 FDTWGNPHTFGEIDRYLKDMEYMLKTGASF-SIYMAHGGTTFGFWAGADRPFKPDV-SSY 315

Query: 291 DYDAPIDEYGFIRQ 304
           DY AP+ E G+  +
Sbjct: 316 DYGAPVTEAGWTSE 329


>gi|422861007|ref|ZP_16907651.1| beta-galactosidase [Streptococcus sanguinis SK330]
 gi|327468658|gb|EGF14137.1| beta-galactosidase [Streptococcus sanguinis SK330]
          Length = 592

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 108/338 (31%), Positives = 155/338 (45%), Gaps = 42/338 (12%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+I Y R  P+ W + +   K  G   +ETY+ W  HEP  GQ+  EG  
Sbjct: 12  LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFQAEGML 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D   + K V+E GL+L +R  PY CAE+++GG P WL   P ++ R  +  F E++  F 
Sbjct: 72  DFEAYFKLVEEMGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTSVP- 192
             +   +      + QGGPI++ QVENEYG+         + Y++  A        SVP 
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSY-----AEDKAYMRSIAQMMKVRGVSVPL 184

Query: 193 ------WVMCQQ-----ED--------APDPIINTCNGFYCDGFTPNSPSK-PIMWTENY 232
                 W+   +     ED           P  NT N      F      K P+M TE +
Sbjct: 185 FTSDGTWIEALESGTLIEDDIFVTGNFGSQPKENTDN---LRAFMERYGKKWPLMCTEFW 241

Query: 233 SGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGP 284
            GWF  +   +  R  EDLA  V    + G    N ++  GGTNFG        +T   P
Sbjct: 242 DGWFSRWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLP 299

Query: 285 LVATSYDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
            + TSYD+DAPI E+G   +  +   R  H+     E+
Sbjct: 300 QI-TSYDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336


>gi|282859441|ref|ZP_06268546.1| glycosyl hydrolase family 35 [Prevotella bivia JCVIHMP010]
 gi|424900868|ref|ZP_18324410.1| beta-galactosidase [Prevotella bivia DSM 20514]
 gi|282587669|gb|EFB92869.1| glycosyl hydrolase family 35 [Prevotella bivia JCVIHMP010]
 gi|388593068|gb|EIM33307.1| beta-galactosidase [Prevotella bivia DSM 20514]
          Length = 622

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 106/332 (31%), Positives = 154/332 (46%), Gaps = 42/332 (12%)

Query: 15  DGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQY-YFEGRF 73
           DGK   + SG +HY R     W   ++  K  GL V+ +YVFWN+HE   G + +  G  
Sbjct: 40  DGKPLQIYSGELHYARVPAPYWRHRLQMMKAMGLNVVTSYVFWNHHEVAPGVWDWSTGNH 99

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           +L  FVKT  E G+ + LR GPY CAEW +GG+P WL    G+  RT N PF +  + ++
Sbjct: 100 NLREFVKTAAEEGMKVILRPGPYCCAEWEFGGYPWWLPKTKGLVVRTDNQPFLDSCRVYI 159

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGN-VEWAYGVGGELYVKWAA-------DTAV 185
            ++   ++  +L  ++GGPII+ Q ENE+G+ V     +  E +  ++A       D   
Sbjct: 160 NQLASQVR--DLQVTKGGPIIMVQAENEFGSYVAQRPDIPLETHKAYSAKIRQQLLDAGF 217

Query: 186 NL---NTSVPWVM-----------CQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTEN 231
           N+    +   W+               ED  D +    N ++           P M  E 
Sbjct: 218 NIPMFTSDGSWLFKGGVIEGVLPTANGEDNIDNLKKVVNEYHGG-------QGPYMVAEF 270

Query: 232 YSGWFLSFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------ 285
           Y GW   +    P      +     ++ +   +F NYYM  GGTNFG  AG         
Sbjct: 271 YPGWLSHWAEKFPQVSTTSVVTQTKKYLDNKVSF-NYYMVHGGTNFGFMAGANCDNIHKL 329

Query: 286 --VATSYDYDAPIDEYGFIRQPKWGHLRELHK 315
               TSYDYDAPI E G++   K+  LR L K
Sbjct: 330 QPDMTSYDYDAPISEAGWVTD-KYTALRNLMK 360



 Score = 41.2 bits (95), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 93/226 (41%), Gaps = 53/226 (23%)

Query: 473 FLNIESLGHAALVFVNKKLVAFGYGNHDFANFLINKKIELNEGINTLDILSMMVGLQNYG 532
            + I  L   A ++VN + V  G  N  F    +   I  N    TLDIL    G  NYG
Sbjct: 431 MMKIPGLADYATIYVNGERV--GELNRVFGKHEMEIDIPFNA---TLDILVENWGRINYG 485

Query: 533 AWFDVAGAGLFSVILIDLKNGKRDLSSGEW-IYQVGVEGEYIGLDKISLANSSFWKQGST 591
            +   +  G+   I I+      ++ +G W +Y++ ++ +    D   ++NS      S 
Sbjct: 486 KFIVNSTKGITLPITIN-----DNVITGSWQMYKLPMDKQ---PDLTDISNS----YNSG 533

Query: 592 LPVNKSLIWYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKC 651
           LPV      Y  +F + +  G   L++   GKG  +VNG ++GRYW              
Sbjct: 534 LPV-----LYSGSF-SVDKVGDTFLDMEKWGKGIVFVNGVNLGRYWRI------------ 575

Query: 652 DYRGSYDASKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGD 697
                            P  TLY +P  ++  GEN +V+ E+L  +
Sbjct: 576 ----------------GPQHTLY-LPGCFLKQGENKIVVFEQLNDE 604


>gi|242077941|ref|XP_002443739.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
 gi|241940089|gb|EES13234.1| hypothetical protein SORBIDRAFT_07g001163 [Sorghum bicolor]
          Length = 111

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 61/109 (55%), Positives = 86/109 (78%)

Query: 34  EVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRFDLVRFVKTVQEAGLFLHLRI 93
           ++WP+LI K+KEGGL+VI+TYVFWN HEP++GQY FEGR+D VRF+K +Q  GL+++LRI
Sbjct: 1   QMWPKLIAKAKEGGLDVIQTYVFWNVHEPVQGQYNFEGRYDFVRFIKEIQGQGLYVNLRI 60

Query: 94  GPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFLAKIIDLMKQ 142
           GP+  +EW YGGFP WLH +P I FR+ N PFK  ++  L +++ L++ 
Sbjct: 61  GPFIESEWKYGGFPFWLHDVPNITFRSDNEPFKPSVRNMLGELVSLLEH 109


>gi|336415312|ref|ZP_08595652.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940908|gb|EGN02770.1| hypothetical protein HMPREF1017_02760 [Bacteroides ovatus
           3_8_47FAA]
          Length = 778

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 106/323 (32%), Positives = 154/323 (47%), Gaps = 27/323 (8%)

Query: 11  ALVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFE 70
             ++DGK  V+++  +HY R     W   I   K  G+  I  Y+FWN HE   G++ F 
Sbjct: 35  TFLLDGKPFVVKAAELHYTRIPQAYWEHRIEMCKALGMNTICIYIFWNIHEQEEGKFDFS 94

Query: 71  GRFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMK 130
           G+ D+  F +  Q+ G+++ +R GPY CAEW  GG P WL     I  RT +  + E + 
Sbjct: 95  GQNDIAAFCRAAQKHGMYVIVRPGPYVCAEWEMGGLPWWLLKKKDIALRTLDPYYMERVG 154

Query: 131 RFLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNLNTS 190
            F+ ++   +    L  ++GG II+ QVENEYG    +YG+  + YV    D       S
Sbjct: 155 IFMKEVGKQLAP--LQVNKGGNIIMVQVENEYG----SYGI-DKPYVSAVRDLVRESGFS 207

Query: 191 -VPWVMCQ-----QEDAPDPIINTCN---GFYCD----GFTPNSPSKPIMWTENYSGWFL 237
            VP   C        +A D +I T N   G   D          P  P+M +E +SGWF 
Sbjct: 208 DVPLFQCDWSSNFTNNALDDLIWTVNFGTGANIDQQFKKLKELRPETPLMCSEFWSGWFD 267

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGG-----PLVATSYDY 292
            +G     R  +D+   +    +   +F + YM  GGT FG   G        + +SYDY
Sbjct: 268 HWGRKHETRLAKDMVQGIKDMLDRNISF-SLYMTHGGTTFGHWGGANNPAYSAMCSSYDY 326

Query: 293 DAPIDEYGFIRQPKWGHLRELHK 315
           DAPI E G+    K+  LR+L K
Sbjct: 327 DAPISEPGWTTD-KFFLLRDLLK 348



 Score = 41.2 bits (95), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 47/107 (43%), Gaps = 30/107 (28%)

Query: 600 WYKTTFLAPEGKGPLALNLASMGKGQAWVNGQSIGRYWSAYLAPSTGCTKKCDYRGSYDA 659
           +YK+TF   +  G   L++++ GKG  WVNG ++GR+W                      
Sbjct: 532 YYKSTFTL-DKVGDTFLDMSTWGKGMVWVNGHAMGRFWEI-------------------- 570

Query: 660 SKCQKHCGQPAQTLYHIPRTWVHPGENLLVIHEELGGDPSKISLLTK 706
                    P QTL+ +P  W+  GEN +++ +  G   + I  L K
Sbjct: 571 --------GPQQTLF-MPGCWLKEGENEILVLDLKGPTRASIKGLKK 608


>gi|397498227|ref|XP_003819886.1| PREDICTED: beta-galactosidase-1-like protein 3 [Pan paniscus]
          Length = 653

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 105/318 (33%), Positives = 153/318 (48%), Gaps = 17/318 (5%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
             ++G + ++  GSIHY R   E W + + K K  G   + TYV WN HEP RG++ F G
Sbjct: 80  FTLEGHKFLIFGGSIHYFRVPREYWRDRLLKLKACGFNTVTTYVPWNLHEPERGKFDFSG 139

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             DL  FV    E GL++ LR GPY C+E + GG P WL   P +  RTTN  F E +++
Sbjct: 140 NLDLEAFVLMAAEIGLWVILRPGPYICSEMDLGGLPSWLLQDPRLLLRTTNKSFIEAVEK 199

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNVEWAYGVGGELYVKWAADTAVNL---N 188
           +   +I   +   L   QGGP+I  QVENEYG+          L+        V L   +
Sbjct: 200 YFDHLIP--RVIPLQYRQGGPVIAVQVENEYGSFNKDKTYMPYLHKALLRRGIVELLLTS 257

Query: 189 TSVPWVMCQQEDAPDPIINTCNGFYCDGFT---PNSPSKPIMWTENYSGWFLSFGYAVPF 245
                V+          IN     + D F         KP++  E + GWF  +G     
Sbjct: 258 DGEKHVLSGHTKGVLAAIN-LQKLHQDTFNQLHKIQRDKPLLIMEYWVGWFDRWGDKHHV 316

Query: 246 RPVEDLAFAVARFFETGGTFQNYYMYFGGTNFGRTAGGPL------VATSYDYDAPIDEY 299
           +  +++  AV+ F +   +F N YM+ GGTNFG   G         + TSYDYDA + E 
Sbjct: 317 KDAKEVEHAVSEFIKYEISF-NVYMFHGGTNFGFMNGATYFGKHSGIVTSYDYDAVLTEA 375

Query: 300 GFIRQPKWGHLRELHKAI 317
           G   + K+  L++L +++
Sbjct: 376 GDYTE-KYLKLQKLFQSV 392


>gi|71275091|ref|ZP_00651378.1| Beta-galactosidase [Xylella fastidiosa Dixon]
 gi|170731075|ref|YP_001776508.1| beta-galactosidase [Xylella fastidiosa M12]
 gi|71163900|gb|EAO13615.1| Beta-galactosidase [Xylella fastidiosa Dixon]
 gi|71730559|gb|EAO32637.1| Beta-galactosidase [Xylella fastidiosa Ann-1]
 gi|167965868|gb|ACA12878.1| Beta-galactosidase [Xylella fastidiosa M12]
          Length = 612

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 110/326 (33%), Positives = 156/326 (47%), Gaps = 35/326 (10%)

Query: 12  LVIDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEG 71
            + DG+   L SG+IH+ R     W + ++K++  GL  +ETYVFWN  E   GQ+ F G
Sbjct: 35  FIRDGRPYQLISGAIHFQRIPRAYWKDRLQKARAMGLNTVETYVFWNLVELREGQFDFTG 94

Query: 72  RFDLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKR 131
             D+  FV+     GL + LR GPY CAEW  GGFP WL   P ++ R+ +  F +  +R
Sbjct: 95  NNDIGAFVREAASQGLNVILRPGPYVCAEWEAGGFPAWLFADPTLRVRSQDPRFLDASQR 154

Query: 132 FLAKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAY-----------GVGGELYVK 178
           +L  +   ++   L  S GGPII  QVENEYG+   +  Y           G+GG L   
Sbjct: 155 YLEALGTQVRP--LLNSNGGPIIAMQVENEYGSYGDDHGYLQAVRALFIKAGLGGALL-- 210

Query: 179 WAADTAVNL-NTSVPWVMCQQEDAPDPIINTCNGFYCDGFTPNSPSKPIMWTENYSGWFL 237
           + +D A  L N ++P V+     AP            D      P +P +  E ++GWF 
Sbjct: 211 FTSDGAQMLGNGTLPDVLAAVNVAPGEAKQA-----LDKLATFHPGQPQLVGEYWAGWFD 265

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG-----RTAGGP-----LVA 287
            +G        +  A  +      G +  N YM+ GGT+FG        GGP        
Sbjct: 266 QWGKPHAQTDAKQQADEIEWMLRQGHSI-NLYMFVGGTSFGFMNGANFQGGPGDHYSPQT 324

Query: 288 TSYDYDAPIDEYGFIRQPKWGHLREL 313
           TSYDYDA +DE G    PK+   R++
Sbjct: 325 TSYDYDAALDEAGRP-MPKFALFRDV 349


>gi|422824944|ref|ZP_16873129.1| beta-galactosidase [Streptococcus sanguinis SK405]
 gi|422827211|ref|ZP_16875390.1| beta-galactosidase [Streptococcus sanguinis SK678]
 gi|422857055|ref|ZP_16903709.1| beta-galactosidase [Streptococcus sanguinis SK1]
 gi|324992224|gb|EGC24146.1| beta-galactosidase [Streptococcus sanguinis SK405]
 gi|324994315|gb|EGC26229.1| beta-galactosidase [Streptococcus sanguinis SK678]
 gi|327459541|gb|EGF05887.1| beta-galactosidase [Streptococcus sanguinis SK1]
          Length = 592

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 107/333 (32%), Positives = 154/333 (46%), Gaps = 32/333 (9%)

Query: 14  IDGKRRVLQSGSIHYPRSTPEVWPELIRKSKEGGLEVIETYVFWNYHEPIRGQYYFEGRF 73
           +DGK   + SG+I Y R  P+ W + +   K  G   +ETY+ W  HEP  GQ+  EG  
Sbjct: 12  LDGKPFKILSGAIQYFRLHPDQWRDTLYNLKALGFNTVETYIPWALHEPQEGQFKAEGML 71

Query: 74  DLVRFVKTVQEAGLFLHLRIGPYACAEWNYGGFPVWLHFIPGIQFRTTNNPFKEEMKRFL 133
           D   + K V+E GL+L +R  PY CAE+++GG P WL   P ++ R  +  F E++  F 
Sbjct: 72  DFEAYFKLVKETGLYLIVRPTPYICAEFDFGGLPAWLLRYPSMRLRVNHPLFLEKVSHFY 131

Query: 134 AKIIDLMKQENLFASQGGPIILAQVENEYGNV--EWAYGVGGELYVKWAADTAVNLNTSV 191
             +   +      + QGGPI++ QVENEYG+   + AY       +K    T     +  
Sbjct: 132 DWLFPKLLPYQ--SDQGGPILMMQVENEYGSYAEDKAYMRSIAQMMKVRGVTVPLFTSDG 189

Query: 192 PWVMCQQ-----ED--------APDPIINTCNGFYCDGFTPNSPSK-PIMWTENYSGWFL 237
            W+   +     ED           P  NT N      F      K P+M TE + GWF 
Sbjct: 190 TWIEALESGTLIEDDIFVTGNFGSQPKENTDN---LRAFMERYGKKWPLMCTEFWDGWFS 246

Query: 238 SFGYAVPFRPVEDLAFAVARFFETGGTFQNYYMYFGGTNFG--------RTAGGPLVATS 289
            +   +  R  EDLA  V    + G    N ++  GGTNFG        +T   P + TS
Sbjct: 247 RWSEEIVRREAEDLAQDVKEMLQLGSM--NLFLLRGGTNFGFISGCSARKTKDLPQI-TS 303

Query: 290 YDYDAPIDEYGFIRQPKWGHLRELHKAIKLCEE 322
           YD+DAPI E+G   +  +   R  H+     E+
Sbjct: 304 YDFDAPITEWGQPTEKYYAVQRVTHEVFPELEQ 336


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.137    0.439 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,516,308,602
Number of Sequences: 23463169
Number of extensions: 675811089
Number of successful extensions: 1256876
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2107
Number of HSP's successfully gapped in prelim test: 195
Number of HSP's that attempted gapping in prelim test: 1243565
Number of HSP's gapped (non-prelim): 5395
length of query: 821
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 670
effective length of database: 8,816,256,848
effective search space: 5906892088160
effective search space used: 5906892088160
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)