BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 039348
         (651 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
           Thetaiotaomicron
          Length = 612

 Score =  175 bits (443), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 109/327 (33%), Positives = 160/327 (48%), Gaps = 29/327 (8%)

Query: 26  EYDANALIIDGQRKVIISGSIHYPRSTPEMWPDLIQKAKDGGLDAIETYIFWNVHEPRRR 85
           E   N  +++G+  V+ +  IHYPR   E W   I+  K  G + I  Y+FWN HEP   
Sbjct: 9   EVGKNTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPEEG 68

Query: 86  EYDFSGNLDFVKFFKLVQDAGLYAILRIGPYVCAEWNYGGFPMWLHNTPGIQLRTDNDIF 145
            YDF+G  D   F +L Q+ G Y I+R GPYVCAEW  GG P WL     I+LR  +  +
Sbjct: 69  RYDFAGQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDPYY 128

Query: 146 KNEMQVFTTKIVNMCKEANLFASQGGPIILAQIENEYG--NIMEKYGDAGKQYIKWCANM 203
              +++F  ++      A+L  S+GG II  Q+ENEYG   I + Y    +  +K     
Sbjct: 129 XERVKLFLNEVGKQL--ADLQISKGGNIIXVQVENEYGAFGIDKPYISEIRDXVKQAGFT 186

Query: 204 AVAQNISEPWIMCQ-----QSDAPEPMINTCN---GFYCDQ----FKPNNPKSPKMWTEN 251
            V      P   C      +++A + ++ T N   G   D+     K   P +P   +E 
Sbjct: 187 GV------PLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEF 240

Query: 252 WTGWFKLWGGRDPQRTAEDLAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGGPY-----I 306
           W+GWF  WG +   R+AE+L            +  + Y  HGGT+FG   G  +      
Sbjct: 241 WSGWFDHWGAKHETRSAEELVKGXKEXLDR-NISFSLYXTHGGTSFGHWGGANFPNFSPT 299

Query: 307 ATSYDYNAPLDEYGNLNQPKWGHLKQL 333
            TSYDY+AP++E G +  PK+  ++ L
Sbjct: 300 CTSYDYDAPINESGKVT-PKYLEVRNL 325


>pdb|4E8C|A Chain A, Crystal Structure Of Streptococcal Beta-Galactosidase In
           Complex With Galactose
 pdb|4E8C|B Chain B, Crystal Structure Of Streptococcal Beta-Galactosidase In
           Complex With Galactose
 pdb|4E8D|A Chain A, Crystal Structure Of Streptococcal Beta-Galactosidase
 pdb|4E8D|B Chain B, Crystal Structure Of Streptococcal Beta-Galactosidase
          Length = 595

 Score =  164 bits (416), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 108/309 (34%), Positives = 155/309 (50%), Gaps = 27/309 (8%)

Query: 34  IDGQRKVIISGSIHYPRSTPEMWPDLIQKAKDGGLDAIETYIFWNVHEPRRREYDFSGNL 93
           +DG+   I+SG+IHY R  PE W   +   K  G + +ETY+ WN+HEP   E+ F G+L
Sbjct: 12  LDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPCEGEFHFEGDL 71

Query: 94  DFVKFFKLVQDAGLYAILRIGPYVCAEWNYGGFPMWLHNTPGIQLRTDNDIFKNEMQVFT 153
           D  KF ++ QD GLYAI+R  P++CAEW +GG P WL  T  +++R+ +  +   +  + 
Sbjct: 72  DLEKFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWLL-TKNMRIRSSDPAYIEAVGRYY 130

Query: 154 TKIVNMCKEANLFASQGGPIILAQIENEYGNIMEK--YGDAGKQYIKWCANMAVAQNISE 211
            +++            GG I++ Q+ENEYG+  E   Y  A +Q ++ C           
Sbjct: 131 DQLLPRLVPRL--LDNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEECGVTCPLFTSDG 188

Query: 212 PWIMCQQSDA--PEPMINTCN-------GFYCDQ--FKPNNPKSPKMWTENWTGWFKLWG 260
           PW    ++     E +  T N        F   Q  F  +  K P M  E W GWF  W 
Sbjct: 189 PWRATLKAGTLIEEDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLMCMEFWDGWFNRWK 248

Query: 261 GRDPQRTAEDLAFSVARFFQNGGVLNNYYMYHGGTNFGRTAGG--------PYIATSYDY 312
                R  ++LA +V    + G +  N YM+HGGTNFG   G         P + TSYDY
Sbjct: 249 EPIITRDPKELADAVREVLEQGSI--NLYMFHGGTNFGFMNGCSARGTLDLPQV-TSYDY 305

Query: 313 NAPLDEYGN 321
           +A LDE GN
Sbjct: 306 DALLDEEGN 314


>pdb|3THC|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THD|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
          Length = 654

 Score =  163 bits (412), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 154/322 (47%), Gaps = 28/322 (8%)

Query: 23  IKVEYDANALIIDGQRKVIISGSIHYPRSTPEMWPDLIQKAKDGGLDAIETYIFWNVHEP 82
            +++Y  ++ + DGQ    ISGSIHY R     W D + K K  GL+AI+TY+ WN HEP
Sbjct: 9   FEIDYSRDSFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEP 68

Query: 83  RRREYDFSGNLDFVKFFKLVQDAGLYAILRIGPYVCAEWNYGGFPMWLHNTPGIQLRTDN 142
              +Y FS + D   F +L  + GL  ILR GPY+CAEW  GG P WL     I LR+ +
Sbjct: 69  WPGQYQFSEDHDVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSD 128

Query: 143 DIFKNEMQVFTTKIVNMCKEANLFASQGGPIILAQIENEYGNIMEKYGDAGKQYIKWCAN 202
             +   +  +    V + K   L    GGP+I  Q+ENEYG+    Y      Y+++   
Sbjct: 129 PDYLAAVDKWLG--VLLPKMKPLLYQNGGPVITVQVENEYGS----YFACDFDYLRFLQK 182

Query: 203 MAVAQNISEPWIMCQQSDAPEPMIN--TCNGFY-----------CDQF---KPNNPKSPK 246
                ++ +  ++     A +  +      G Y            D F   +   PK P 
Sbjct: 183 R-FRHHLGDDVVLFTTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGPL 241

Query: 247 MWTENWTGWFKLWGGRDPQRTAEDLAFSVARFFQNGGVLNNYYMYHGGTNFG--RTAGGP 304
           + +E +TGW   WG        E +A S+      G  + N YM+ GGTNF     A  P
Sbjct: 242 INSEFYTGWLDHWGQPHSTIKTEAVASSLYDILARGASV-NLYMFIGGTNFAYWNGANSP 300

Query: 305 YIA--TSYDYNAPLDEYGNLNQ 324
           Y A  TSYDY+APL E G+L +
Sbjct: 301 YAAQPTSYDYDAPLSEAGDLTE 322


>pdb|1TG7|A Chain A, Native Structure Of Beta-Galactosidase From Penicillium
           Sp.
 pdb|1XC6|A Chain A, Native Structure Of Beta-Galactosidase From Penicillium
           Sp. In Complex With Galactose
          Length = 971

 Score =  139 bits (350), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 118/414 (28%), Positives = 180/414 (43%), Gaps = 49/414 (11%)

Query: 25  VEYDANALIIDGQRKVIISGSIH-YPRSTPEMWPDLIQKAKDGGLDAIETYIFWNVHEPR 83
           V +D +++ ++G+R +I SG +H Y      ++ D+ +K K  G + +  Y+ W + E  
Sbjct: 6   VTWDEHSIFVNGERLMIFSGEVHPYRLPVASLYIDIFEKVKALGFNCVSFYVDWALLEGN 65

Query: 84  RREYDFSGNLDFVKFFKLVQDAGLYAILRIGPYVCAEWNYGGFPMWLHNTPGIQLRTDND 143
              Y   G  D   FF   ++AG+Y + R GPY+ AE + GGFP WL    GI LRT ++
Sbjct: 66  PGHYSAEGIFDLQPFFDAAKEAGIYLLARPGPYINAEVSGGGFPGWLQRVDGI-LRTSDE 124

Query: 144 IFKNEMQVFTTKIVNMCKEANLFASQGGPIILAQIENEYGNIMEKY-GDAGKQYIKWCAN 202
            +      + + I     +A +  + GGPIIL Q ENEY      Y G     Y+++  +
Sbjct: 125 AYLKATDNYASNIAATIAKAQI--TNGGPIILYQPENEYSGACCGYNGFPDGSYMQYIED 182

Query: 203 MAVAQNISEPWIM----CQQSDAPEPMINTCN---------GFYC--------------- 234
            A    I  P+I         +AP       +         GF C               
Sbjct: 183 HARDAGIVVPFISNDAWAAGHNAPGTGAGAVDIYGHDSYPLGFDCANPSTWPSGNLPTYF 242

Query: 235 -DQFKPNNPKSPKMWTENWTGWFKLWGGRDPQRTAEDLAFSVAR-FFQN----GGVLNNY 288
               +  +P +P    E   G F  WGG    + A  L     R F++N    G    N 
Sbjct: 243 HTSHEQQSPSTPYSLVEFQGGAFDPWGGVGFAKCAALLNHEFERVFYKNDFSFGVAFLNL 302

Query: 289 YMYHGGTNFGRTAGGPYIATSYDYNAPLDEYGNLNQPKWGHLKQLHASIKQAEKFLTN-- 346
           YM  GGTN+G   G P   TSYDY + + E  N+ + K+  LK L    K +  +L    
Sbjct: 303 YMIFGGTNWGNL-GHPGGYTSYDYGSAISESRNITREKYSELKLLGNFAKVSPGYLVANP 361

Query: 347 GTVETKSVMNFVDLTKFTL---NATGERFCMLSNSDNTGDYTADLGEDGKFFVP 397
           G + T +  N  DLT   L   N++   F ++ +S    DY++    + K  VP
Sbjct: 362 GDLSTSTYTNTADLTVTPLLGSNSSASSFFVIRHS----DYSSQASVEYKLTVP 411


>pdb|3OG2|A Chain A, Native Crystal Structure Of Trichoderma Reesei
           Beta-Galactosidase
 pdb|3OGR|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Galactose
 pdb|3OGS|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Iptg
 pdb|3OGV|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Petg
          Length = 1003

 Score =  121 bits (303), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 112/406 (27%), Positives = 177/406 (43%), Gaps = 46/406 (11%)

Query: 25  VEYDANALIIDGQRKVIISGSIHYPR-STPEMWPDLIQKAKDGGLDAIETYIFWNVHEPR 83
           V +D ++L + G+R VI SG +H  R   P ++ D+  K K  G + +  Y+ W + E +
Sbjct: 26  VTWDEHSLFVHGERVVIFSGEVHPFRLPVPSLYLDVFHKIKALGFNTVSFYVDWALLEGK 85

Query: 84  RREYDFSGNLDFVKFFKLVQDAGLYAILRIGPYVCAEWNYGGFPMWLHNTPGIQLRTDND 143
              +   G      FF+    AG+Y + R GPY+ AE + GGFP WL    G +LRTD  
Sbjct: 86  PGRFRADGIFSLEPFFEAATKAGIYLLARPGPYINAEVSGGGFPGWLQRVKG-KLRTDAP 144

Query: 144 IFKNEMQVFTTKIVNMCKEANLFASQGGPIILAQIENEYGNIMEKYGDAGKQYIKWCANM 203
            + +    +   I ++  +A +  + GGP+IL Q ENEY    E      K Y+++  + 
Sbjct: 145 DYLHATDNYVAHIASIIAKAQI--TNGGPVILYQPENEYSGAAEGVLFPNKPYMQYVIDQ 202

Query: 204 AVAQNISEPWI----MCQQSDAPEPMINTCN---------GFYCDQFK--PNN------- 241
           A    I  P I        + AP   + + +         GF C      P+N       
Sbjct: 203 ARNAGIIVPLINNDAFPGGTGAPGTGLGSVDIYGHDGYPLGFDCAHPSAWPDNGLPTTWR 262

Query: 242 -------PKSPKMWTENWTGWFKLWGGRDPQRTAEDLAFSVAR-FFQN----GGVLNNYY 289
                  P +P    E   G F  +GG   ++ +  +     R F++N    G  + N Y
Sbjct: 263 QDHLNISPSTPFSLVEFQGGAFDPFGGWGFEQCSALVNHEFERVFYKNNMAAGVTIFNIY 322

Query: 290 MYHGGTNFGRTAGGPYIATSYDYNAPLDEYGNLNQPKWGHLKQLHASIKQAEKFLT---- 345
           M  GGTN+G   G P   TSYDY A + E   +++ K+  LK     +K +  ++T    
Sbjct: 323 MTFGGTNWGNL-GHPGGYTSYDYGASIREDRRIDREKYSELKLQGQFLKVSPGYITATPE 381

Query: 346 NGTVETKSVMNFVDLTKFTLNATGERFCMLS---NSDNTGDYTADL 388
           N T    S    + +T      +G+ F +     +S +T  YT  L
Sbjct: 382 NATQGVYSDSQNIVITPLLAKESGDFFVVRHANYSSTDTASYTVKL 427


>pdb|3OAA|D Chain D, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|E Chain E, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|F Chain F, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|L Chain L, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|M Chain M, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|N Chain N, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|T Chain T, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|U Chain U, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|V Chain V, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|BB Chain b, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|CC Chain c, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
 pdb|3OAA|DD Chain d, Structure Of The E.Coli F1-Atp Synthase Inhibited By
           Subunit Epsilon
          Length = 459

 Score = 31.2 bits (69), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 6/74 (8%)

Query: 319 YGNLNQPKWGHLKQLHASIKQAEKFLTNGTVETKSVMNFVD-LTKFTLNATGERFCMLSN 377
           YG +N+P    L+     +  AEKF   G    + V+ FVD + ++TL  T E   +L  
Sbjct: 206 YGQMNEPPGNRLRVALTGLTMAEKFRDEG----RDVLLFVDNIYRYTLAGT-EVSALLGR 260

Query: 378 SDNTGDYTADLGED 391
             +   Y   L E+
Sbjct: 261 MPSAVGYQPTLAEE 274


>pdb|1FA2|A Chain A, Crystal Structure Of Beta-Amylase From Sweet Potato
          Length = 498

 Score = 30.0 bits (66), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 15/51 (29%), Positives = 30/51 (58%), Gaps = 4/51 (7%)

Query: 58  DLIQKAKDGGLDAIETYIFWNVHEPR-RREYDFSGNLDFVKFFKLVQDAGL 107
           D +++ K GG D +   ++W + E +  ++YD+S    + + F+LV+  GL
Sbjct: 38  DELKQVKAGGCDGVMVDVWWGIIEAKGPKQYDWSA---YRELFQLVKKCGL 85


>pdb|3U7V|A Chain A, The Structure Of A Putative Beta-Galactosidase From
           Caulobacter Crescentus Cb15
          Length = 552

 Score = 29.3 bits (64), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 58/299 (19%), Positives = 110/299 (36%), Gaps = 65/299 (21%)

Query: 2   ASSRTEFVLVAGLAMLCSCLAIK-------------VEYDA-NALIIDGQRKVIISGSIH 47
           A   + F    GLA+   C  +              V  D  +AL +DG   + ++  ++
Sbjct: 7   AXGXSRFATAVGLALALVCGPLASGAHAADAAXPQLVTKDGRHALXVDGAPFLXLAAQVN 66

Query: 48  ----YPRSTPEMWPDLIQKAKDGGLDAIETYIFWNVHEPRRREYDFSGNLDFVKFFKLVQ 103
               +P    ++WP  I+K    G + ++  I W   EP   ++DFS  LD +   +  +
Sbjct: 67  NSSAWPSQXAKVWP-AIEKV---GANTVQVPIAWEQIEPVEGQFDFS-YLDLL--LEQAR 119

Query: 104 DAGLYAIL-------RIGPYVCAEW---NYGGFPMWLHNTPGIQLRTDNDIFKNEMQVFT 153
           +  +  +L          P    EW   +   FP  + +  G +  + + + K+ +    
Sbjct: 120 ERKVRLVLLWFGTWKNSSPSYAPEWVKLDDKRFPRLIKDD-GERSYSXSPLAKSTLDADR 178

Query: 154 TKIVNMCKEANLFASQGGPIILAQIENE---YGNIMEKYGDAGKQYIKWCANMAVAQNI- 209
              V +        +    +I  Q+ENE   YG++ + +G A ++     A   + + + 
Sbjct: 179 KAFVALXTHLKAKDAAQKTVIXVQVENETGTYGSVRD-FGPAAQKVFNGPAPATLVKAVG 237

Query: 210 SEP--WIMCQQSDAPE----------------------PMINTCNGFYCDQFKPNNPKS 244
           ++P  W      DA E                      P+    N    D  KP +PK+
Sbjct: 238 AKPGTWSQAFGKDADEFFHAWHIGRFVDQVAAGGKAVYPLPXYVNAALRDPIKPGDPKT 296


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.317    0.134    0.419 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 21,072,439
Number of Sequences: 62578
Number of extensions: 946728
Number of successful extensions: 2037
Number of sequences better than 100.0: 14
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 2010
Number of HSP's gapped (non-prelim): 19
length of query: 651
length of database: 14,973,337
effective HSP length: 105
effective length of query: 546
effective length of database: 8,402,647
effective search space: 4587845262
effective search space used: 4587845262
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 55 (25.8 bits)