BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 043973
         (124 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticum
 pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticium In Complex With Cellobiose
          Length = 441

 Score = 77.4 bits (189), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 45/109 (41%), Positives = 63/109 (57%), Gaps = 16/109 (14%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
           +VDYILG NP  MSY++G+G N+    HHR ++  + AN   +       +P  H     
Sbjct: 336 QVDYILGDNPANMSYIIGYGSNWCIHPHHRAANGYTYANGDNA-------KPAKH----- 383

Query: 76  ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA----YFAG 120
           +L GA+VGGP+QND F DD + Y ++E A   NA +VG LA    +F G
Sbjct: 384 LLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLAGAIKFFGG 432


>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 2.5.
 pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 5.6.
 pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 6.5
          Length = 433

 Score = 71.2 bits (173), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 46/103 (44%), Positives = 57/103 (55%), Gaps = 12/103 (11%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
           ++DY LG      S++ GFG N   R HHR SS       P    CD      F+S +PN
Sbjct: 334 QIDYALGDG--GRSFVCGFGSNPPTRPHHRSSSC-----PPAPATCDWNT---FNSPDPN 383

Query: 76  --ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
             +L GA+VGGP+QND + DDRSDY H+E AT  NA     LA
Sbjct: 384 YHVLSGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALA 426


>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
          Length = 611

 Score = 62.0 bits (149), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 53/101 (52%), Gaps = 14/101 (13%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
           ++DY+LG NPL   Y+VG+  N  +  HHR +S L  AN                S +  
Sbjct: 424 QMDYLLGKNPLNRCYVVGYSSNSVKYPHHRAASGLKDANDS--------------SPHKY 469

Query: 76  ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
           +L GA+VGGP+ +D   D  +DY ++E A   NAA VG  A
Sbjct: 470 VLYGALVGGPDASDQHVDRTNDYIYNEVAIDYNAAFVGACA 510


>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
 pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
          Length = 605

 Score = 60.8 bits (146), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 42/105 (40%), Positives = 54/105 (51%), Gaps = 19/105 (18%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHR---GSSLLSLANHPQSIRCDGGFEPFFHSS 72
           +++Y LG NP   SY+VGFG N  R  HHR   GS   S+A+  +               
Sbjct: 349 QINYALGDNPRNSSYVVGFGNNPPRNPHHRTAHGSWTDSIASPAE--------------- 393

Query: 73  NPNILVGAIVGGPNQ-NDGFPDDRSDYSHSEPATYINAAMVGPLA 116
           N ++L GA+VGGP   ND + DDR DY  +E AT  NA     LA
Sbjct: 394 NRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALA 438


>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
           Endoglucanase From Separately Expressed Gh9 And Cbm3c
           Modules
          Length = 466

 Score = 56.6 bits (135), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 38/101 (37%), Positives = 51/101 (50%), Gaps = 14/101 (13%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
           + DY LG      S++VGFG N  +R HHR +              D   EP  H    +
Sbjct: 371 QADYALGST--GRSFVVGFGENPPKRPHHRTA---------HGSWADSQMEPPEHR---H 416

Query: 76  ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
           +L GA+VGGP+  D + DD S+Y+ +E A   NA  VG LA
Sbjct: 417 VLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLA 457


>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
 pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
          Length = 475

 Score = 55.8 bits (133), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 52/101 (51%), Gaps = 14/101 (13%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
           ++DY LG      S++VGFG N  +R HHR +          S   D    P +H    +
Sbjct: 371 QIDYALG--SAGRSFVVGFGTNPPKRPHHRTA---------HSSWADSQSIPSYHR---H 416

Query: 76  ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
            L GA+VGGP  +D + DD S+Y ++E A   NA  VG LA
Sbjct: 417 TLYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALA 457


>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
 pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
          Length = 614

 Score = 47.0 bits (110), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 14/101 (13%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
           ++DY LG      S++VG+G N  +  HHR +              D    P +H    +
Sbjct: 348 QIDYALGST--GRSFVVGYGVNPPQHPHHRTA---------HGSWTDQMTSPTYHR---H 393

Query: 76  ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
            + GA+VGGP+  DG+ D+ ++Y ++E A   NA   G LA
Sbjct: 394 TIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434


>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
 pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
          Length = 614

 Score = 47.0 bits (110), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 14/101 (13%)

Query: 16  KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
           ++DY LG      S++VG+G N  +  HHR +              D    P +H    +
Sbjct: 348 QIDYALGST--GRSFVVGYGVNPPQHPHHRTA---------HGSWTDQMTSPTYHR---H 393

Query: 76  ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
            + GA+VGGP+  DG+ D+ ++Y ++E A   NA   G LA
Sbjct: 394 TIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434


>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
           Thermoacidophilic Alicyclobacillus Acidocaldarius
 pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
 pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellobiose
 pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellotetraose
 pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
           Isofagomine
 pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
           Isofagomine
 pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
           Isofagomine
          Length = 537

 Score = 42.0 bits (97), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 32/95 (33%), Positives = 43/95 (45%), Gaps = 5/95 (5%)

Query: 18  DYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPNIL 77
           DY+ G NPL   Y+ GFG    R  HHR  S+    +HP      GG  P  H  +  I 
Sbjct: 436 DYLFGANPLGQCYVTGFGQRPVRHPHHR-PSVADDVDHPVPGMVVGG--PNRHLQD-EIA 491

Query: 78  VGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMV 112
              + G P   + + D +  YS +E A Y N+  V
Sbjct: 492 RAQLAGRPAM-EAYIDHQDSYSTNEVAVYWNSPAV 525


>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
           Angstroms Resolution
          Length = 639

 Score = 29.3 bits (64), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 41/102 (40%), Gaps = 20/102 (19%)

Query: 17  VDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPNI 76
           + ++ G N    SY+ G G N     H R S              DG +EP+     P  
Sbjct: 480 ISHVFGRNYYNRSYVTGLGINPPMNPHDRRSG------------ADGIWEPW-----PGY 522

Query: 77  LVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLAYF 118
           LVG    GP     + D +  Y  +E A   NAA++  LA F
Sbjct: 523 LVGGGWPGPKD---WVDIQDSYQTNEIAINWNAALIYALAGF 561


>pdb|3KWD|A Chain A, Inactive Truncation Of The Beta-Carboxysomal
           Gamma-Carbonic Anhydrase, Ccmm, Form 1
 pdb|3KWE|A Chain A, Inactive Truncation Of The Beta-Carboxysomal
           Gamma-Carbonic Anhydrase, Ccmm, Form 2
          Length = 213

 Score = 26.9 bits (58), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 58  SIRCDGGFEPFFHSSNPNILVGAIVGGPNQNDGFPDDRSDYS 99
           SIR D G  PF   S  NI  G ++ G  Q     DD  +YS
Sbjct: 71  SIRADEG-TPFHIGSRTNIQDGVVIHGLQQGRVIGDDGQEYS 111


>pdb|3KWC|A Chain A, Oxidized, Active Structure Of The Beta-carboxysomal
           Gamma-carbonic Anhydrase, Ccmm
 pdb|3KWC|B Chain B, Oxidized, Active Structure Of The Beta-carboxysomal
           Gamma-carbonic Anhydrase, Ccmm
 pdb|3KWC|C Chain C, Oxidized, Active Structure Of The Beta-carboxysomal
           Gamma-carbonic Anhydrase, Ccmm
 pdb|3KWC|D Chain D, Oxidized, Active Structure Of The Beta-carboxysomal
           Gamma-carbonic Anhydrase, Ccmm
 pdb|3KWC|E Chain E, Oxidized, Active Structure Of The Beta-carboxysomal
           Gamma-carbonic Anhydrase, Ccmm
 pdb|3KWC|F Chain F, Oxidized, Active Structure Of The Beta-carboxysomal
           Gamma-carbonic Anhydrase, Ccmm
          Length = 229

 Score = 26.6 bits (57), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 20/42 (47%), Gaps = 1/42 (2%)

Query: 58  SIRCDGGFEPFFHSSNPNILVGAIVGGPNQNDGFPDDRSDYS 99
           SIR D G  PF   S  NI  G ++ G  Q     DD  +YS
Sbjct: 71  SIRADEG-TPFHIGSRTNIQDGVVIHGLQQGRVIGDDGQEYS 111


>pdb|2WD6|A Chain A, Crystal Structure Of The Variable Domain Of The
           Streptococcus Gordonii Surface Protein Sspb
 pdb|2WD6|B Chain B, Crystal Structure Of The Variable Domain Of The
           Streptococcus Gordonii Surface Protein Sspb
          Length = 349

 Score = 26.6 bits (57), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 15/51 (29%), Positives = 25/51 (49%)

Query: 39  SRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPNILVGAIVGGPNQND 89
           S +IH   S+    +N  +S+        +  +S+PN   GAIVG   Q++
Sbjct: 232 SIKIHENNSAYADSSNAEKSLGARWNTSEWDTTSSPNNWYGAIVGEITQSE 282


>pdb|3BO7|A Chain A, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
           Cis-Trans Isomerase, 541.M00136
 pdb|3BO7|B Chain B, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
           Cis-Trans Isomerase, 541.M00136
 pdb|3BO7|C Chain C, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
           Cis-Trans Isomerase, 541.M00136
 pdb|3BO7|D Chain D, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
           Cis-Trans Isomerase, 541.M00136
          Length = 201

 Score = 26.6 bits (57), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 3/54 (5%)

Query: 34  FGPNFSRRIHHRGSSLLSLAN---HPQSIRCDGGFEPFFHSSNPNILVGAIVGG 84
           F   F  R+ H+G  +LS+AN   H         F+   H +N + + G +VGG
Sbjct: 89  FEDEFDNRLVHQGIGVLSMANDGKHSNLSEFFITFKSCEHLNNKHTIFGRVVGG 142


>pdb|2DRS|A Chain A, Crystal Structure Of Reducing-End-Xylose Releasing
           Exo-Oligoxylanase D263s Mutant
          Length = 396

 Score = 25.4 bits (54), Expect = 8.9,   Method: Composition-based stats.
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 89  DGFPDDRSDYSHSEPATYINAAMVGPLAYFAGG 121
           DG P+D   Y H   ++Y  AA +G  A + GG
Sbjct: 248 DGTPNDEKGYGHFFSSSYRVAANIGLDAEWFGG 280


>pdb|1HLG|A Chain A, Crystal Structure Of Human Gastric Lipase
 pdb|1HLG|B Chain B, Crystal Structure Of Human Gastric Lipase
          Length = 371

 Score = 25.4 bits (54), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 19/40 (47%), Gaps = 6/40 (15%)

Query: 83  GGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLAYFAGGK 122
           G P QN      R  Y  S+P  Y   AM  P+A + GGK
Sbjct: 282 GSPVQN------RMHYDQSQPPYYNVTAMNVPIAVWNGGK 315


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.320    0.140    0.442 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,469,495
Number of Sequences: 62578
Number of extensions: 186364
Number of successful extensions: 261
Number of sequences better than 100.0: 21
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 234
Number of HSP's gapped (non-prelim): 23
length of query: 124
length of database: 14,973,337
effective HSP length: 85
effective length of query: 39
effective length of database: 9,654,207
effective search space: 376514073
effective search space used: 376514073
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 45 (21.9 bits)