BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 006635
         (637 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
           Thetaiotaomicron
          Length = 612

 Score =  167 bits (424), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 111/327 (33%), Positives = 153/327 (46%), Gaps = 25/327 (7%)

Query: 28  TVTYDRKAILINGQRRILISGSIHYPRSTPEMWEDLIRKAKDGGLDVIDTYVFWNGHEPS 87
           T    +   L+NG+  ++ +  IHYPR   E WE  I+  K  G + I  YVFWN HEP 
Sbjct: 7   TFEVGKNTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPE 66

Query: 88  PGHYNFEGSYDLVRFIKTVQRVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 147
            G Y+F G  D+  F +  Q  G Y  +R GPYVCAEW  GG P WL     I  R  + 
Sbjct: 67  EGRYDFAGQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDP 126

Query: 148 PFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKM 207
            +   ++ F  ++ + + +  L  S+GG II  Q+ENEYG    + G            +
Sbjct: 127 YYXERVKLFLNEVGKQLAD--LQISKGGNIIXVQVENEYG----AFGIDKPYISEIRDXV 180

Query: 208 AVGLDTGVPWVMCK-----EDDAPDPV---INSCNGFYCDA----FSPNKPYKPTLWTEA 255
                TGVP   C      E++A D +   IN   G   D         +P  P   +E 
Sbjct: 181 KQAGFTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEF 240

Query: 256 WSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGGPF-----I 310
           WSGWF  +G     R  ++L       + +  S F+ Y  HGGT+FG   G  F      
Sbjct: 241 WSGWFDHWGAKHETRSAEELVKGXKEXLDRNIS-FSLYXTHGGTSFGHWGGANFPNFSPT 299

Query: 311 TTSYDYDAPLDEYGLMRQPKYGHLKQL 337
            TSYDYDAP++E G +  PKY  ++ L
Sbjct: 300 CTSYDYDAPINESGKV-TPKYLEVRNL 325


>pdb|3THC|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THD|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
          Length = 654

 Score =  159 bits (401), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 114/348 (32%), Positives = 162/348 (46%), Gaps = 34/348 (9%)

Query: 29  VTYDRKAILINGQRRILISGSIHYPRSTPEMWEDLIRKAKDGGLDVIDTYVFWNGHEPSP 88
           + Y R + L +GQ    ISGSIHY R     W+D + K K  GL+ I TYV WN HEP P
Sbjct: 11  IDYSRDSFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPWP 70

Query: 89  GHYNFEGSYDLVRFIKTVQRVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGP 148
           G Y F   +D+  F++    +GL   LR GPY+CAEW  GG P WL     I  R+ +  
Sbjct: 71  GQYQFSEDHDVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSDPD 130

Query: 149 FKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKM- 207
           +  A+  +   ++  MK   L    GGP+I  Q+ENEYG    S  A    Y+ +  K  
Sbjct: 131 YLAAVDKWLGVLLPKMK--PLLYQNGGPVITVQVENEYG----SYFACDFDYLRFLQKRF 184

Query: 208 --AVGLDTGVPWVMCKEDDAPDPVIN--SCNGFY-----------CDAFSPNK---PYKP 249
              +G D     V+   D A    +   +  G Y            DAF   +   P  P
Sbjct: 185 RHHLGDDV----VLFTTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGP 240

Query: 250 TLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFG--RTAGG 307
            + +E ++GW   +G        + +A ++   + +G S  N YM+ GGTNF     A  
Sbjct: 241 LINSEFYTGWLDHWGQPHSTIKTEAVASSLYDILARGAS-VNLYMFIGGTNFAYWNGANS 299

Query: 308 PFIT--TSYDYDAPLDEYGLMRQPKYGHLKQLHEAIKLCEYALVSSDP 353
           P+    TSYDYDAPL E G + +  +     + +  K+ E  +  S P
Sbjct: 300 PYAAQPTSYDYDAPLSEAGDLTEKYFALRNIIQKFEKVPEGPIPPSTP 347


>pdb|4E8C|A Chain A, Crystal Structure Of Streptococcal Beta-Galactosidase In
           Complex With Galactose
 pdb|4E8C|B Chain B, Crystal Structure Of Streptococcal Beta-Galactosidase In
           Complex With Galactose
 pdb|4E8D|A Chain A, Crystal Structure Of Streptococcal Beta-Galactosidase
 pdb|4E8D|B Chain B, Crystal Structure Of Streptococcal Beta-Galactosidase
          Length = 595

 Score =  150 bits (380), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 149/324 (45%), Gaps = 49/324 (15%)

Query: 33  RKAILINGQRRILISGSIHYPRSTPEMWEDLIRKAKDGGLDVIDTYVFWNGHEPSPGHYN 92
           R    ++G+   ++SG+IHY R  PE W   +   K  G + ++TYV WN HEP  G ++
Sbjct: 7   RDDFYLDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPCEGEFH 66

Query: 93  FEGSYDLVRFIKTVQRVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNGPFKVA 152
           FEG  DL +F++  Q +GLYA +R  P++CAEW FGG P WL     +  R+ +  +  A
Sbjct: 67  FEGDLDLEKFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRIRSSDPAYIEA 125

Query: 153 MQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGP--ESKSLGAAGHAYVNWAAKMAVG 210
           +  +  +++  +         GG I++ Q+ENEYG   E K+   A    +         
Sbjct: 126 VGRYYDQLLPRLVPRL--LDNGGNILMMQVENEYGSYGEDKAYLRAIRQLMEECGVTCPL 183

Query: 211 LDTGVPW-------VMCKEDDAPDPVINSCNGFYCDAFSPNKPYK--------------- 248
             +  PW        + +ED            F    F    PY                
Sbjct: 184 FTSDGPWRATLKAGTLIEED-----------LFVTGNFGSKAPYNFSQMQEFFDEHGKKW 232

Query: 249 PTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQKGGSFFNYYMYHGGTNFGRTAGG- 307
           P +  E W GWF  +   +  R  ++LA AV   +++G    N YM+HGGTNFG   G  
Sbjct: 233 PLMCMEFWDGWFNRWKEPIITRDPKELADAVREVLEQGS--INLYMFHGGTNFGFMNGCS 290

Query: 308 -------PFITTSYDYDAPLDEYG 324
                  P + TSYDYDA LDE G
Sbjct: 291 ARGTLDLPQV-TSYDYDALLDEEG 313


>pdb|1TG7|A Chain A, Native Structure Of Beta-Galactosidase From Penicillium
           Sp.
 pdb|1XC6|A Chain A, Native Structure Of Beta-Galactosidase From Penicillium
           Sp. In Complex With Galactose
          Length = 971

 Score =  130 bits (327), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 161/372 (43%), Gaps = 47/372 (12%)

Query: 29  VTYDRKAILINGQRRILISGSIH-YPRSTPEMWEDLIRKAKDGGLDVIDTYVFWNGHEPS 87
           VT+D  +I +NG+R ++ SG +H Y      ++ D+  K K  G + +  YV W   E +
Sbjct: 6   VTWDEHSIFVNGERLMIFSGEVHPYRLPVASLYIDIFEKVKALGFNCVSFYVDWALLEGN 65

Query: 88  PGHYNFEGSYDLVRFIKTVQRVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 147
           PGHY+ EG +DL  F    +  G+Y   R GPY+ AE + GGFP WL+ V GI  RT + 
Sbjct: 66  PGHYSAEGIFDLQPFFDAAKEAGIYLLARPGPYINAEVSGGGFPGWLQRVDGI-LRTSDE 124

Query: 148 PFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEY-GPESKSLGAAGHAYVNWAAK 206
            +  A   +   I   +   ++  + GGPIIL Q ENEY G      G    +Y+ +   
Sbjct: 125 AYLKATDNYASNIAATIAKAQI--TNGGPIILYQPENEYSGACCGYNGFPDGSYMQYIED 182

Query: 207 MAVGLDTGVPWVM----CKEDDAPDPVINSCN---------GFYC--------------- 238
            A      VP++         +AP     + +         GF C               
Sbjct: 183 HARDAGIVVPFISNDAWAAGHNAPGTGAGAVDIYGHDSYPLGFDCANPSTWPSGNLPTYF 242

Query: 239 ----DAFSPNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQK-----GGSF 289
               +  SP+ PY      E   G F  +GG    +    L     R   K     G +F
Sbjct: 243 HTSHEQQSPSTPYS---LVEFQGGAFDPWGGVGFAKCAALLNHEFERVFYKNDFSFGVAF 299

Query: 290 FNYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLMRQPKYGHLKQLHEAIKLCEYALV 349
            N YM  GGTN+G   G P   TSYDY + + E   + + KY  LK L    K+    LV
Sbjct: 300 LNLYMIFGGTNWGNL-GHPGGYTSYDYGSAISESRNITREKYSELKLLGNFAKVSPGYLV 358

Query: 350 SSDPTVTSLGTY 361
            ++P   S  TY
Sbjct: 359 -ANPGDLSTSTY 369


>pdb|3OG2|A Chain A, Native Crystal Structure Of Trichoderma Reesei
           Beta-Galactosidase
 pdb|3OGR|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Galactose
 pdb|3OGS|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Iptg
 pdb|3OGV|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Petg
          Length = 1003

 Score =  116 bits (290), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 156/375 (41%), Gaps = 46/375 (12%)

Query: 29  VTYDRKAILINGQRRILISGSIHYPR-STPEMWEDLIRKAKDGGLDVIDTYVFWNGHEPS 87
           VT+D  ++ ++G+R ++ SG +H  R   P ++ D+  K K  G + +  YV W   E  
Sbjct: 26  VTWDEHSLFVHGERVVIFSGEVHPFRLPVPSLYLDVFHKIKALGFNTVSFYVDWALLEGK 85

Query: 88  PGHYNFEGSYDLVRFIKTVQRVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNG 147
           PG +  +G + L  F +   + G+Y   R GPY+ AE + GGFP WL+ V G   RTD  
Sbjct: 86  PGRFRADGIFSLEPFFEAATKAGIYLLARPGPYINAEVSGGGFPGWLQRVKG-KLRTDAP 144

Query: 148 PFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSLGAAGHAYVNWAAKM 207
            +  A   +   I  ++   ++  + GGP+IL Q ENEY   ++ +      Y+ +    
Sbjct: 145 DYLHATDNYVAHIASIIAKAQI--TNGGPVILYQPENEYSGAAEGVLFPNKPYMQYVIDQ 202

Query: 208 AVGLDTGVPWV--------------------MCKED------DAPDPVINSCNGFYCD-- 239
           A      VP +                    +   D      D   P     NG      
Sbjct: 203 ARNAGIIVPLINNDAFPGGTGAPGTGLGSVDIYGHDGYPLGFDCAHPSAWPDNGLPTTWR 262

Query: 240 ----AFSPNKPYKPTLWTEAWSGWFTEFGGAVHRRPVQDLAFAVARFIQK-----GGSFF 290
                 SP+ P+      E   G F  FGG    +    +     R   K     G + F
Sbjct: 263 QDHLNISPSTPFS---LVEFQGGAFDPFGGWGFEQCSALVNHEFERVFYKNNMAAGVTIF 319

Query: 291 NYYMYHGGTNFGRTAGGPFITTSYDYDAPLDEYGLMRQPKYGHLKQLHEAIKLCEYALVS 350
           N YM  GGTN+G   G P   TSYDY A + E   + + KY  LK   + +K+     ++
Sbjct: 320 NIYMTFGGTNWGNL-GHPGGYTSYDYGASIREDRRIDREKYSELKLQGQFLKVSP-GYIT 377

Query: 351 SDPTVTSLGTYQQAH 365
           + P   + G Y  + 
Sbjct: 378 ATPENATQGVYSDSQ 392


>pdb|2OSY|A Chain A, Endo-Glycoceramidase Ii From Rhodococcus Sp.: Lactosyl-
           Enzyme Intermediate
 pdb|2OSY|B Chain B, Endo-Glycoceramidase Ii From Rhodococcus Sp.: Lactosyl-
           Enzyme Intermediate
          Length = 481

 Score = 29.6 bits (65), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 27/63 (42%), Gaps = 7/63 (11%)

Query: 53  PRSTPEMWE-DLIRKAKDGGLDVIDTYVFWNGHEPSPGHYNFEGSYDLVRFIKTVQRVGL 111
           P   P+  E DL R+  D G + +   + W   EP+P      G YD     +   RVG 
Sbjct: 60  PDGMPQFTEADLAREYADMGTNFVRFLISWRSVEPAP------GVYDQQYLDRVEDRVGW 113

Query: 112 YAH 114
           YA 
Sbjct: 114 YAE 116


>pdb|2OSW|A Chain A, Endo-Glycoceramidase Ii From Rhodococcus Sp.
 pdb|2OSW|B Chain B, Endo-Glycoceramidase Ii From Rhodococcus Sp.
 pdb|2OYK|A Chain A, Endo-Glycoceramidase Ii From Rhodococcus Sp.:
           Cellobiose-Like Isofagomine Complex
 pdb|2OYK|B Chain B, Endo-Glycoceramidase Ii From Rhodococcus Sp.:
           Cellobiose-Like Isofagomine Complex
 pdb|2OYL|A Chain A, Endo-Glycoceramidase Ii From Rhodococcus Sp.:
           Cellobiose-Like Imidazole Complex
 pdb|2OYL|B Chain B, Endo-Glycoceramidase Ii From Rhodococcus Sp.:
           Cellobiose-Like Imidazole Complex
 pdb|2OYM|A Chain A, Endo-Glycoceramidase Ii From Rhodococcus Sp.:
           Five-Membered Iminocyclitol Complex
 pdb|2OYM|B Chain B, Endo-Glycoceramidase Ii From Rhodococcus Sp.:
           Five-Membered Iminocyclitol Complex
          Length = 481

 Score = 29.6 bits (65), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 27/63 (42%), Gaps = 7/63 (11%)

Query: 53  PRSTPEMWE-DLIRKAKDGGLDVIDTYVFWNGHEPSPGHYNFEGSYDLVRFIKTVQRVGL 111
           P   P+  E DL R+  D G + +   + W   EP+P      G YD     +   RVG 
Sbjct: 60  PDGMPQFTEADLAREYADMGTNFVRFLISWRSVEPAP------GVYDQQYLDRVEDRVGW 113

Query: 112 YAH 114
           YA 
Sbjct: 114 YAE 116


>pdb|2OSX|A Chain A, Endo-glycoceramidase Ii From Rhodococcus Sp.: Ganglioside
           Gm3 Complex
          Length = 481

 Score = 29.6 bits (65), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 27/63 (42%), Gaps = 7/63 (11%)

Query: 53  PRSTPEMWE-DLIRKAKDGGLDVIDTYVFWNGHEPSPGHYNFEGSYDLVRFIKTVQRVGL 111
           P   P+  E DL R+  D G + +   + W   EP+P      G YD     +   RVG 
Sbjct: 60  PDGMPQFTEADLAREYADMGTNFVRFLISWRSVEPAP------GVYDQQYLDRVEDRVGW 113

Query: 112 YAH 114
           YA 
Sbjct: 114 YAE 116


>pdb|1B30|A Chain A, Xylanase From Penicillium Simplicissimum, Complex With
           1,2- (4-Deoxy-Beta-L-Threo-Hex-4-Enopyranosyluronic
           Acid)-Beta- 1,4-Xylotriose)
 pdb|1B31|A Chain A, Xylanase From Penicillium Simplicissimum, Native With
           Peg200 As Cryoprotectant
 pdb|1B3V|A Chain A, Xylanase From Penicillium Simplicissimum, Complex With
           Xylose
 pdb|1B3W|A Chain A, Xylanase From Penicillium Simplicissimum, Complex With
           Xylobiose
 pdb|1B3X|A Chain A, Xylanase From Penicillium Simplicissimum, Complex With
           Xylotriose
 pdb|1B3Y|A Chain A, Xylanase From Penicillium Simplicissimum, Complex With
           Xylotetraose
 pdb|1B3Z|A Chain A, Xylanase From Penicillium Simplicissimum, Complex With
           Xylopentaose
 pdb|1BG4|A Chain A, Xylanase From Penicillium Simplicissimum
          Length = 302

 Score = 29.3 bits (64), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 56/135 (41%), Gaps = 24/135 (17%)

Query: 81  WNGHEPSPGHYNFEGSYDLVRFIKTVQRVGLYAHLRIGPYVCAEWNFGGFPVWLKYVPG- 139
           W+  EP+ G + F GS  LV F ++  ++     +R           G   VW   +PG 
Sbjct: 52  WDATEPNRGQFTFSGSDYLVNFAQSNGKL-----IR-----------GHTLVWHSQLPGW 95

Query: 140 ISFRTDNGPFKVAMQGFTQKIVQMMKNEKLFASQGGPIILSQIENEYGPESKSL--GAAG 197
           +S  TD       ++     ++   K  K++A      +L++I NE G    S+     G
Sbjct: 96  VSSITDKNTLISVLKNHITTVMTRYKG-KIYAWD----VLNEIFNEDGSLRNSVFYNVIG 150

Query: 198 HAYVNWAAKMAVGLD 212
             YV  A + A  +D
Sbjct: 151 EDYVRIAFETARSVD 165


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.318    0.135    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 20,723,816
Number of Sequences: 62578
Number of extensions: 939269
Number of successful extensions: 2250
Number of sequences better than 100.0: 10
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 2225
Number of HSP's gapped (non-prelim): 15
length of query: 637
length of database: 14,973,337
effective HSP length: 105
effective length of query: 532
effective length of database: 8,402,647
effective search space: 4470208204
effective search space used: 4470208204
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 55 (25.8 bits)