BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 006904
         (626 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|3D3A|A Chain A, Crystal Structure Of A Beta-Galactosidase From Bacteroides
           Thetaiotaomicron
          Length = 612

 Score =  169 bits (427), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 155/323 (47%), Gaps = 27/323 (8%)

Query: 32  RKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPSPGNYN 91
           +   L+NG+  ++ +  IHYPR   + WE  I+  K  G + I  YVFWN HEP  G Y+
Sbjct: 12  KNTFLLNGEPFVVKAAEIHYPRIPKEYWEHRIKXCKALGXNTICLYVFWNFHEPEEGRYD 71

Query: 92  FEGRYDLVRFIKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRA 151
           F G+ D+  F +  Q+ G Y  +R GPYVCAEW  GG P WL     I  R  +  +   
Sbjct: 72  FAGQKDIAAFCRLAQENGXYVIVRPGPYVCAEWEXGGLPWWLLKKKDIKLREQDPYYXER 131

Query: 152 MQGFTEKIVNLMKSENLFESQGGPIILSQIENEYGAQSKLLGAAGHNYMTWAAKMAVEMG 211
           ++ F  ++   +   +L  S+GG II  Q+ENEYGA           Y++       + G
Sbjct: 132 VKLFLNEVGKQLA--DLQISKGGNIIXVQVENEYGA-----FGIDKPYISEIRDXVKQAG 184

Query: 212 -TGVPWVMCK-----EEDAPDPV---INSCNGFYCDA----FTPNQPYKPTIWTEAWSGW 258
            TGVP   C      E +A D +   IN   G   D         +P  P   +E WSGW
Sbjct: 185 FTGVPLFQCDWNSNFENNALDDLLWTINFGTGANIDEQFKRLKELRPDTPLXCSEFWSGW 244

Query: 259 FTEFGGPIHQRPVQDLAFAAARFIQKGGSFINYYMYHGGTNFGRSAGGPF-----ITTSY 313
           F  +G     R  ++L       + +  SF + Y  HGGT+FG   G  F       TSY
Sbjct: 245 FDHWGAKHETRSAEELVKGXKEXLDRNISF-SLYXTHGGTSFGHWGGANFPNFSPTCTSY 303

Query: 314 DYDAPIDEYGLIRQPKYGHLKEL 336
           DYDAPI+E G +  PKY  ++ L
Sbjct: 304 DYDAPINESGKV-TPKYLEVRNL 325


>pdb|3THC|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THC|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
           With Galactose
 pdb|3THD|A Chain A, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|B Chain B, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|C Chain C, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
 pdb|3THD|D Chain D, Crystal Structure Of Human Beta-Galactosidase In Complex
           With 1- Deoxygalactonojirimycin
          Length = 654

 Score =  160 bits (405), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 127/410 (30%), Positives = 184/410 (44%), Gaps = 42/410 (10%)

Query: 28  VTYDRKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPSP 87
           + Y R + L +GQ     SGSIHY R     W+D + K K  GL+ I+TYV WN HEP P
Sbjct: 11  IDYSRDSFLKDGQPFRYISGSIHYSRVPRFYWKDRLLKMKMAGLNAIQTYVPWNFHEPWP 70

Query: 88  GNYNFEGRYDLVRFIKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEP 147
           G Y F   +D+  F++   + GL   LR GPY+CAEW  GG P WL     I  R+ +  
Sbjct: 71  GQYQFSEDHDVEYFLRLAHELGLLVILRPGPYICAEWEMGGLPAWLLEKESILLRSSDPD 130

Query: 148 FKRAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYGAQSKLLGAAGHNYMTWAAK-M 206
           +  A+  +   ++  MK   L    GGP+I  Q+ENEYG+      A   +Y+ +  K  
Sbjct: 131 YLAAVDKWLGVLLPKMKP--LLYQNGGPVITVQVENEYGSYF----ACDFDYLRFLQKRF 184

Query: 207 AVEMGTGVPWVMCKEEDAPDPVIN--SCNGFY-----------CDAFTPN---QPYKPTI 250
              +G  V  V+   + A    +   +  G Y            DAF      +P  P I
Sbjct: 185 RHHLGDDV--VLFTTDGAHKTFLKCGALQGLYTTVDFGTGSNITDAFLSQRKCEPKGPLI 242

Query: 251 WTEAWSGWFTEFGGPIHQRPVQDLAFAAARFIQKGGSFINYYMYHGGTNFG--RSAGGPF 308
            +E ++GW   +G P      + +A +    + +G S +N YM+ GGTNF     A  P+
Sbjct: 243 NSEFYTGWLDHWGQPHSTIKTEAVASSLYDILARGAS-VNLYMFIGGTNFAYWNGANSPY 301

Query: 309 IT--TSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCERALVSADPIVTSLGGFQQAHVY 366
               TSYDYDAP+ E G + + KY  L+ +   I+  E+  V   PI  S   F    V 
Sbjct: 302 AAQPTSYDYDAPLSEAGDLTE-KYFALRNI---IQKFEK--VPEGPIPPSTPKFAYGKVT 355

Query: 367 SSESGDCAAFLSNYDTKSAARVLFN------NMHYNLPPWSISVLPDCRN 410
             +     A L         + L+         HY    +  ++  DC N
Sbjct: 356 LEKLKTVGAALDILCPSGPIKSLYPLTFIQVKQHYGFVLYRTTLPQDCSN 405


>pdb|4E8C|A Chain A, Crystal Structure Of Streptococcal Beta-Galactosidase In
           Complex With Galactose
 pdb|4E8C|B Chain B, Crystal Structure Of Streptococcal Beta-Galactosidase In
           Complex With Galactose
 pdb|4E8D|A Chain A, Crystal Structure Of Streptococcal Beta-Galactosidase
 pdb|4E8D|B Chain B, Crystal Structure Of Streptococcal Beta-Galactosidase
          Length = 595

 Score =  155 bits (393), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 172/373 (46%), Gaps = 49/373 (13%)

Query: 32  RKALLINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPSPGNYN 91
           R    ++G+   + SG+IHY R  P+ W   +   K  G + +ETYV WN+HEP  G ++
Sbjct: 7   RDDFYLDGKSFKILSGAIHYFRVPPEDWYHSLYNLKALGFNTVETYVAWNLHEPCEGEFH 66

Query: 92  FEGRYDLVRFIKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNEPFKRA 151
           FEG  DL +F++  Q  GLYA +R  P++CAEW FGG P WL     +  R+ +  +  A
Sbjct: 67  FEGDLDLEKFLQIAQDLGLYAIVRPSPFICAEWEFGGLPAWL-LTKNMRIRSSDPAYIEA 125

Query: 152 MQGFTEKIVNLMKSENLFESQGGPIILSQIENEYGAQSKLLGAAGHNYMTWAAKMAVEMG 211
           +  + ++++  +    L    GG I++ Q+ENEYG+  +        Y+    ++  E G
Sbjct: 126 VGRYYDQLLPRLVPRLL--DNGGNILMMQVENEYGSYGE-----DKAYLRAIRQLMEECG 178

Query: 212 TGVPWVMCKEEDAP------DPVINSCNGFYCDAFTPNQPYK---------------PTI 250
              P       D P         +   + F    F    PY                P +
Sbjct: 179 VTCPLFTS---DGPWRATLKAGTLIEEDLFVTGNFGSKAPYNFSQMQEFFDEHGKKWPLM 235

Query: 251 WTEAWSGWFTEFGGPIHQRPVQDLAFAAARFIQKGGSFINYYMYHGGTNF----GRSAGG 306
             E W GWF  +  PI  R  ++LA A    +++G   IN YM+HGGTNF    G SA G
Sbjct: 236 CMEFWDGWFNRWKEPIITRDPKELADAVREVLEQGS--INLYMFHGGTNFGFMNGCSARG 293

Query: 307 PFI---TTSYDYDAPIDEYG-------LIRQPKYGHLKELHRAIKMCERAL-VSADPIVT 355
                  TSYDYDA +DE G        +++    H  E  +   + + ++ + A P+V 
Sbjct: 294 TLDLPQVTSYDYDALLDEEGNPTAKYLAVKKMMATHFSEYPQLEPLYKESMELDAIPLVE 353

Query: 356 SLGGFQQAHVYSS 368
            +  F+     SS
Sbjct: 354 KVSLFETLDSLSS 366


>pdb|1TG7|A Chain A, Native Structure Of Beta-Galactosidase From Penicillium
           Sp.
 pdb|1XC6|A Chain A, Native Structure Of Beta-Galactosidase From Penicillium
           Sp. In Complex With Galactose
          Length = 971

 Score =  143 bits (361), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 164/358 (45%), Gaps = 40/358 (11%)

Query: 28  VTYDRKALLINGQRRILFSGSIH-YPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPS 86
           VT+D  ++ +NG+R ++FSG +H Y      ++ D+ +K K  G + +  YV W + E +
Sbjct: 6   VTWDEHSIFVNGERLMIFSGEVHPYRLPVASLYIDIFEKVKALGFNCVSFYVDWALLEGN 65

Query: 87  PGNYNFEGRYDLVRFIKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 146
           PG+Y+ EG +DL  F    ++AG+Y   R GPY+ AE + GGFP WL+ V GI  RT +E
Sbjct: 66  PGHYSAEGIFDLQPFFDAAKEAGIYLLARPGPYINAEVSGGGFPGWLQRVDGI-LRTSDE 124

Query: 147 PFKRAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEY-GAQSKLLGAAGHNYMTWAAK 205
            + +A   +   I   +    +  + GGPIIL Q ENEY GA     G    +YM +   
Sbjct: 125 AYLKATDNYASNIAATIAKAQI--TNGGPIILYQPENEYSGACCGYNGFPDGSYMQYIED 182

Query: 206 MAVEMGTGVPWVM----CKEEDAPDPVINSCN---------GFYC--------------- 237
            A + G  VP++         +AP     + +         GF C               
Sbjct: 183 HARDAGIVVPFISNDAWAAGHNAPGTGAGAVDIYGHDSYPLGFDCANPSTWPSGNLPTYF 242

Query: 238 -DAFTPNQPYKPTIWTEAWSGWFTEFGGPIHQRPVQDLAFAAARFIQK-----GGSFINY 291
             +     P  P    E   G F  +GG    +    L     R   K     G +F+N 
Sbjct: 243 HTSHEQQSPSTPYSLVEFQGGAFDPWGGVGFAKCAALLNHEFERVFYKNDFSFGVAFLNL 302

Query: 292 YMYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCERALVS 349
           YM  GGTN+G + G P   TSYDY + I E   I + KY  LK L    K+    LV+
Sbjct: 303 YMIFGGTNWG-NLGHPGGYTSYDYGSAISESRNITREKYSELKLLGNFAKVSPGYLVA 359


>pdb|3OG2|A Chain A, Native Crystal Structure Of Trichoderma Reesei
           Beta-Galactosidase
 pdb|3OGR|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Galactose
 pdb|3OGS|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Iptg
 pdb|3OGV|A Chain A, Complex Structure Of Beta-Galactosidase From Trichoderma
           Reesei With Petg
          Length = 1003

 Score =  130 bits (327), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 129/461 (27%), Positives = 197/461 (42%), Gaps = 53/461 (11%)

Query: 28  VTYDRKALLINGQRRILFSGSIHYPR-STPDMWEDLIQKAKDGGLDVIETYVFWNVHEPS 86
           VT+D  +L ++G+R ++FSG +H  R   P ++ D+  K K  G + +  YV W + E  
Sbjct: 26  VTWDEHSLFVHGERVVIFSGEVHPFRLPVPSLYLDVFHKIKALGFNTVSFYVDWALLEGK 85

Query: 87  PGNYNFEGRYDLVRFIKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPGISFRTDNE 146
           PG +  +G + L  F +   KAG+Y   R GPY+ AE + GGFP WL+ V G   RTD  
Sbjct: 86  PGRFRADGIFSLEPFFEAATKAGIYLLARPGPYINAEVSGGGFPGWLQRVKG-KLRTDAP 144

Query: 147 PFKRAMQGFTEKIVNLMKSENLFESQGGPIILSQIENEYGAQSKLLGAAGHNYMTWAAKM 206
            +  A   +   I +++    +  + GGP+IL Q ENEY   ++ +      YM +    
Sbjct: 145 DYLHATDNYVAHIASIIAKAQI--TNGGPVILYQPENEYSGAAEGVLFPNKPYMQYVIDQ 202

Query: 207 AVEMGTGVPWV----MCKEEDAPDPVINSCN---------GFYC-------DAFTPN--- 243
           A   G  VP +          AP   + S +         GF C       D   P    
Sbjct: 203 ARNAGIIVPLINNDAFPGGTGAPGTGLGSVDIYGHDGYPLGFDCAHPSAWPDNGLPTTWR 262

Query: 244 ------QPYKPTIWTEAWSGWFTEFGGPIHQRPVQDLAFAAARFIQK-----GGSFINYY 292
                  P  P    E   G F  FGG   ++    +     R   K     G +  N Y
Sbjct: 263 QDHLNISPSTPFSLVEFQGGAFDPFGGWGFEQCSALVNHEFERVFYKNNMAAGVTIFNIY 322

Query: 293 MYHGGTNFGRSAGGPFITTSYDYDAPIDEYGLIRQPKYGHLKELHRAIKMCERALVSADP 352
           M  GGTN+G + G P   TSYDY A I E   I + KY  LK   + +K+     ++A P
Sbjct: 323 MTFGGTNWG-NLGHPGGYTSYDYGASIREDRRIDREKYSELKLQGQFLKVSP-GYITATP 380

Query: 353 IVTSLGGFQQAH------VYSSESGDC----AAFLSNYDTKS-AARVLFNNMHYNLPPWS 401
              + G +  +       + + ESGD      A  S+ DT S   ++  +     +P   
Sbjct: 381 ENATQGVYSDSQNIVITPLLAKESGDFFVVRHANYSSTDTASYTVKLPTSAGDLTIPQLG 440

Query: 402 ISVLPDCRNVVFNTAKVGVQTSQMEMLPANAEMFSWESYFE 442
            S+    R+   +     V   +  +L + AE+F+W  + E
Sbjct: 441 GSLTLTGRDSKIHVTDYPV--GKFTLLYSTAEIFTWNEFAE 479


>pdb|3NYD|A Chain A, Crystal Structure Of Kemp Eliminase Hg-2 Complexed With
           Transition State Analog 5-Nitro Benzotriazole
 pdb|3NYD|B Chain B, Crystal Structure Of Kemp Eliminase Hg-2 Complexed With
           Transition State Analog 5-Nitro Benzotriazole
          Length = 316

 Score = 31.6 bits (70), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 51/132 (38%), Gaps = 19/132 (14%)

Query: 36  LINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPSPGNYNFEGR 95
           LI  + ++ F  +    R T      +IQ   D G+   E  + W+  EPS GN+NF G 
Sbjct: 9   LIKARGKVYFGVATDQNRLTTGKNAAIIQA--DFGMVWPENSMKWDATEPSQGNFNFAGA 66

Query: 96  YDLVRFIKTIQKAGLYAHLRIGPYVCAEWNFGGFPVWLKYVPG-ISFRTDNEPFKRAMQG 154
             LV + +   K        IG         GG  VW   +P  +S  TD       M+ 
Sbjct: 67  DYLVNWAQQNGKL-------IG---------GGMLVWHSQLPSWVSSITDKNTLTNVMKN 110

Query: 155 FTEKIVNLMKSE 166
               ++   K +
Sbjct: 111 HITTLMTRYKGK 122


>pdb|1TUX|A Chain A, High Resolution Crystal Structure Of A Thermostable
           Xylanase From Thermoascus Aurantiacus
          Length = 301

 Score = 30.4 bits (67), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 32/68 (47%), Gaps = 2/68 (2%)

Query: 36  LINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPSPGNYNFEGR 95
           LI+ + ++ F  +    R T      +IQ   D G    E  + W+  EPS GN+NF G 
Sbjct: 9   LIDARGKVYFGVATDQNRLTTGKNAAIIQA--DFGQVTPENSMKWDATEPSQGNFNFAGA 66

Query: 96  YDLVRFIK 103
             LV + +
Sbjct: 67  DYLVNWAQ 74


>pdb|1K6A|A Chain A, Structural Studies On The Mobility In The Active Site Of
           The Thermoascus Aurantiacus Xylanase I
          Length = 303

 Score = 30.4 bits (67), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 31/68 (45%), Gaps = 2/68 (2%)

Query: 36  LINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPSPGNYNFEGR 95
           LI  + ++ F  +    R T      +IQ   D G    E  + W+  EPS GN+NF G 
Sbjct: 9   LIKARGKVYFGVATDQNRLTTGKNAAIIQA--DFGQVTPENSMKWDATEPSQGNFNFAGA 66

Query: 96  YDLVRFIK 103
             LV + +
Sbjct: 67  DYLVNWAQ 74


>pdb|1GOK|A Chain A, Thermostable Xylanase I From Thermoascus Aurantiacus-
           Crystal Form Ii
 pdb|1GOO|A Chain A, Thermostable Xylanase I From Thermoascus Aurantiacus -
           Cryocooled Glycerol Complex
 pdb|1GOM|A Chain A, Thermostable Xylanase I From Thermoascus Aurantiacus -
           Crystal Form I
 pdb|1GOR|A Chain A, Thermostable Xylanase I From Thermoascus Aurantiacus-
           Xylobiose Complex At 100 K
 pdb|1GOQ|A Chain A, Thermostable Xylanase I From Thermoascus Aurantiacus-Room
           Temperature Xylobiose Complex
          Length = 303

 Score = 30.4 bits (67), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 31/68 (45%), Gaps = 2/68 (2%)

Query: 36  LINGQRRILFSGSIHYPRSTPDMWEDLIQKAKDGGLDVIETYVFWNVHEPSPGNYNFEGR 95
           LI  + ++ F  +    R T      +IQ   D G    E  + W+  EPS GN+NF G 
Sbjct: 9   LIKARGKVYFGVATDQNRLTTGKNAAIIQA--DFGQVTPENSMKWDATEPSQGNFNFAGA 66

Query: 96  YDLVRFIK 103
             LV + +
Sbjct: 67  DYLVNWAQ 74


>pdb|1CYG|A Chain A, Cyclodextrin Glucanotransferase (E.C.2.4.1.19) (Cgtase)
          Length = 680

 Score = 29.3 bits (64), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 377 LSNYDTKSAARVLFNNMHYNLPPWSISV-LPDCRNVVFNTAKVGVQ 421
           L N+DT  A   +FN + Y+ P W I V +P+ + + F   K   Q
Sbjct: 606 LGNWDTSKAIGPMFNQVVYSYPTWYIDVSVPEGKTIEFKFIKKDSQ 651


>pdb|3O2L|A Chain A, Crystal Structure Of An Inactive Kemp Elimination Design
           Hg-1
          Length = 317

 Score = 28.9 bits (63), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 80  WNVHEPSPGNYNFEGRYDLVRFIKTIQKAGLYAHLRIGPYVCAEW 124
           W+  EPS GN+NF G   LV + +   K  +  H  +G +    W
Sbjct: 52  WDATEPSQGNFNFAGADYLVNWAQQNGKL-IRGHTLVGHFYLPSW 95


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.320    0.136    0.434 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 20,365,365
Number of Sequences: 62578
Number of extensions: 925960
Number of successful extensions: 2045
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 2018
Number of HSP's gapped (non-prelim): 21
length of query: 626
length of database: 14,973,337
effective HSP length: 105
effective length of query: 521
effective length of database: 8,402,647
effective search space: 4377779087
effective search space used: 4377779087
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 54 (25.4 bits)