BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 043973
(124 letters)
Database: pdbaa
62,578 sequences; 14,973,337 total letters
Searching..................................................done
>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticum
pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticium In Complex With Cellobiose
Length = 441
Score = 77.4 bits (189), Expect = 2e-15, Method: Composition-based stats.
Identities = 45/109 (41%), Positives = 63/109 (57%), Gaps = 16/109 (14%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
+VDYILG NP MSY++G+G N+ HHR ++ + AN + +P H
Sbjct: 336 QVDYILGDNPANMSYIIGYGSNWCIHPHHRAANGYTYANGDNA-------KPAKH----- 383
Query: 76 ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA----YFAG 120
+L GA+VGGP+QND F DD + Y ++E A NA +VG LA +F G
Sbjct: 384 LLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLAGAIKFFGG 432
>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 2.5.
pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 5.6.
pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 6.5
Length = 433
Score = 71.2 bits (173), Expect = 1e-13, Method: Composition-based stats.
Identities = 46/103 (44%), Positives = 57/103 (55%), Gaps = 12/103 (11%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
++DY LG S++ GFG N R HHR SS P CD F+S +PN
Sbjct: 334 QIDYALGDG--GRSFVCGFGSNPPTRPHHRSSSC-----PPAPATCDWNT---FNSPDPN 383
Query: 76 --ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
+L GA+VGGP+QND + DDRSDY H+E AT NA LA
Sbjct: 384 YHVLSGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALA 426
>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
Length = 611
Score = 62.0 bits (149), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 53/101 (52%), Gaps = 14/101 (13%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
++DY+LG NPL Y+VG+ N + HHR +S L AN S +
Sbjct: 424 QMDYLLGKNPLNRCYVVGYSSNSVKYPHHRAASGLKDANDS--------------SPHKY 469
Query: 76 ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
+L GA+VGGP+ +D D +DY ++E A NAA VG A
Sbjct: 470 VLYGALVGGPDASDQHVDRTNDYIYNEVAIDYNAAFVGACA 510
>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
Length = 605
Score = 60.8 bits (146), Expect = 2e-10, Method: Composition-based stats.
Identities = 42/105 (40%), Positives = 54/105 (51%), Gaps = 19/105 (18%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHR---GSSLLSLANHPQSIRCDGGFEPFFHSS 72
+++Y LG NP SY+VGFG N R HHR GS S+A+ +
Sbjct: 349 QINYALGDNPRNSSYVVGFGNNPPRNPHHRTAHGSWTDSIASPAE--------------- 393
Query: 73 NPNILVGAIVGGPNQ-NDGFPDDRSDYSHSEPATYINAAMVGPLA 116
N ++L GA+VGGP ND + DDR DY +E AT NA LA
Sbjct: 394 NRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALA 438
>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
Endoglucanase From Separately Expressed Gh9 And Cbm3c
Modules
Length = 466
Score = 56.6 bits (135), Expect = 4e-09, Method: Composition-based stats.
Identities = 38/101 (37%), Positives = 51/101 (50%), Gaps = 14/101 (13%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
+ DY LG S++VGFG N +R HHR + D EP H +
Sbjct: 371 QADYALGST--GRSFVVGFGENPPKRPHHRTA---------HGSWADSQMEPPEHR---H 416
Query: 76 ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
+L GA+VGGP+ D + DD S+Y+ +E A NA VG LA
Sbjct: 417 VLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLA 457
>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
Length = 475
Score = 55.8 bits (133), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 52/101 (51%), Gaps = 14/101 (13%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
++DY LG S++VGFG N +R HHR + S D P +H +
Sbjct: 371 QIDYALG--SAGRSFVVGFGTNPPKRPHHRTA---------HSSWADSQSIPSYHR---H 416
Query: 76 ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
L GA+VGGP +D + DD S+Y ++E A NA VG LA
Sbjct: 417 TLYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALA 457
>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
Length = 614
Score = 47.0 bits (110), Expect = 3e-06, Method: Composition-based stats.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 14/101 (13%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
++DY LG S++VG+G N + HHR + D P +H +
Sbjct: 348 QIDYALGST--GRSFVVGYGVNPPQHPHHRTA---------HGSWTDQMTSPTYHR---H 393
Query: 76 ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
+ GA+VGGP+ DG+ D+ ++Y ++E A NA G LA
Sbjct: 394 TIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434
>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
Length = 614
Score = 47.0 bits (110), Expect = 3e-06, Method: Composition-based stats.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 14/101 (13%)
Query: 16 KVDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPN 75
++DY LG S++VG+G N + HHR + D P +H +
Sbjct: 348 QIDYALGST--GRSFVVGYGVNPPQHPHHRTA---------HGSWTDQMTSPTYHR---H 393
Query: 76 ILVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLA 116
+ GA+VGGP+ DG+ D+ ++Y ++E A NA G LA
Sbjct: 394 TIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434
>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
Thermoacidophilic Alicyclobacillus Acidocaldarius
pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellobiose
pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellotetraose
pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
Isofagomine
pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
Isofagomine
pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
Isofagomine
Length = 537
Score = 42.0 bits (97), Expect = 8e-05, Method: Composition-based stats.
Identities = 32/95 (33%), Positives = 43/95 (45%), Gaps = 5/95 (5%)
Query: 18 DYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPNIL 77
DY+ G NPL Y+ GFG R HHR S+ +HP GG P H + I
Sbjct: 436 DYLFGANPLGQCYVTGFGQRPVRHPHHR-PSVADDVDHPVPGMVVGG--PNRHLQD-EIA 491
Query: 78 VGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMV 112
+ G P + + D + YS +E A Y N+ V
Sbjct: 492 RAQLAGRPAM-EAYIDHQDSYSTNEVAVYWNSPAV 525
>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
Angstroms Resolution
Length = 639
Score = 29.3 bits (64), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 41/102 (40%), Gaps = 20/102 (19%)
Query: 17 VDYILGVNPLKMSYMVGFGPNFSRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPNI 76
+ ++ G N SY+ G G N H R S DG +EP+ P
Sbjct: 480 ISHVFGRNYYNRSYVTGLGINPPMNPHDRRSG------------ADGIWEPW-----PGY 522
Query: 77 LVGAIVGGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLAYF 118
LVG GP + D + Y +E A NAA++ LA F
Sbjct: 523 LVGGGWPGPKD---WVDIQDSYQTNEIAINWNAALIYALAGF 561
>pdb|3KWD|A Chain A, Inactive Truncation Of The Beta-Carboxysomal
Gamma-Carbonic Anhydrase, Ccmm, Form 1
pdb|3KWE|A Chain A, Inactive Truncation Of The Beta-Carboxysomal
Gamma-Carbonic Anhydrase, Ccmm, Form 2
Length = 213
Score = 26.9 bits (58), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 58 SIRCDGGFEPFFHSSNPNILVGAIVGGPNQNDGFPDDRSDYS 99
SIR D G PF S NI G ++ G Q DD +YS
Sbjct: 71 SIRADEG-TPFHIGSRTNIQDGVVIHGLQQGRVIGDDGQEYS 111
>pdb|3KWC|A Chain A, Oxidized, Active Structure Of The Beta-carboxysomal
Gamma-carbonic Anhydrase, Ccmm
pdb|3KWC|B Chain B, Oxidized, Active Structure Of The Beta-carboxysomal
Gamma-carbonic Anhydrase, Ccmm
pdb|3KWC|C Chain C, Oxidized, Active Structure Of The Beta-carboxysomal
Gamma-carbonic Anhydrase, Ccmm
pdb|3KWC|D Chain D, Oxidized, Active Structure Of The Beta-carboxysomal
Gamma-carbonic Anhydrase, Ccmm
pdb|3KWC|E Chain E, Oxidized, Active Structure Of The Beta-carboxysomal
Gamma-carbonic Anhydrase, Ccmm
pdb|3KWC|F Chain F, Oxidized, Active Structure Of The Beta-carboxysomal
Gamma-carbonic Anhydrase, Ccmm
Length = 229
Score = 26.6 bits (57), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 20/42 (47%), Gaps = 1/42 (2%)
Query: 58 SIRCDGGFEPFFHSSNPNILVGAIVGGPNQNDGFPDDRSDYS 99
SIR D G PF S NI G ++ G Q DD +YS
Sbjct: 71 SIRADEG-TPFHIGSRTNIQDGVVIHGLQQGRVIGDDGQEYS 111
>pdb|2WD6|A Chain A, Crystal Structure Of The Variable Domain Of The
Streptococcus Gordonii Surface Protein Sspb
pdb|2WD6|B Chain B, Crystal Structure Of The Variable Domain Of The
Streptococcus Gordonii Surface Protein Sspb
Length = 349
Score = 26.6 bits (57), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 15/51 (29%), Positives = 25/51 (49%)
Query: 39 SRRIHHRGSSLLSLANHPQSIRCDGGFEPFFHSSNPNILVGAIVGGPNQND 89
S +IH S+ +N +S+ + +S+PN GAIVG Q++
Sbjct: 232 SIKIHENNSAYADSSNAEKSLGARWNTSEWDTTSSPNNWYGAIVGEITQSE 282
>pdb|3BO7|A Chain A, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
Cis-Trans Isomerase, 541.M00136
pdb|3BO7|B Chain B, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
Cis-Trans Isomerase, 541.M00136
pdb|3BO7|C Chain C, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
Cis-Trans Isomerase, 541.M00136
pdb|3BO7|D Chain D, Crystal Structure Of Toxoplasma Gondii Peptidyl-Prolyl
Cis-Trans Isomerase, 541.M00136
Length = 201
Score = 26.6 bits (57), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 3/54 (5%)
Query: 34 FGPNFSRRIHHRGSSLLSLAN---HPQSIRCDGGFEPFFHSSNPNILVGAIVGG 84
F F R+ H+G +LS+AN H F+ H +N + + G +VGG
Sbjct: 89 FEDEFDNRLVHQGIGVLSMANDGKHSNLSEFFITFKSCEHLNNKHTIFGRVVGG 142
>pdb|2DRS|A Chain A, Crystal Structure Of Reducing-End-Xylose Releasing
Exo-Oligoxylanase D263s Mutant
Length = 396
Score = 25.4 bits (54), Expect = 8.9, Method: Composition-based stats.
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 89 DGFPDDRSDYSHSEPATYINAAMVGPLAYFAGG 121
DG P+D Y H ++Y AA +G A + GG
Sbjct: 248 DGTPNDEKGYGHFFSSSYRVAANIGLDAEWFGG 280
>pdb|1HLG|A Chain A, Crystal Structure Of Human Gastric Lipase
pdb|1HLG|B Chain B, Crystal Structure Of Human Gastric Lipase
Length = 371
Score = 25.4 bits (54), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 19/40 (47%), Gaps = 6/40 (15%)
Query: 83 GGPNQNDGFPDDRSDYSHSEPATYINAAMVGPLAYFAGGK 122
G P QN R Y S+P Y AM P+A + GGK
Sbjct: 282 GSPVQN------RMHYDQSQPPYYNVTAMNVPIAVWNGGK 315
Database: pdbaa
Posted date: Mar 3, 2013 10:34 PM
Number of letters in database: 14,973,337
Number of sequences in database: 62,578
Lambda K H
0.320 0.140 0.442
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,469,495
Number of Sequences: 62578
Number of extensions: 186364
Number of successful extensions: 261
Number of sequences better than 100.0: 21
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 234
Number of HSP's gapped (non-prelim): 23
length of query: 124
length of database: 14,973,337
effective HSP length: 85
effective length of query: 39
effective length of database: 9,654,207
effective search space: 376514073
effective search space used: 376514073
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 45 (21.9 bits)