BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 042201
         (511 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 2.5.
 pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 5.6.
 pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 6.5
          Length = 433

 Score =  309 bits (791), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 181/463 (39%), Positives = 253/463 (54%), Gaps = 42/463 (9%)

Query: 39  FAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKFG 98
           + Y+  L  ++LF+E QRSG+LP  Q+V WR +SAL+D   +  +L GGY+DAGD VKFG
Sbjct: 3   YDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFG 62

Query: 99  WPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGNA 158
           +PMAY+ ++L+W  ++++   SS   L   R A++W TD+ ++AHTS    Y QVG G+A
Sbjct: 63  FPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGDA 122

Query: 159 DHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHSK 218
           DH  W RPEDM   R  Y+I +  PG               IVF+ VD  YS+ LL H++
Sbjct: 123 DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHAR 182

Query: 219 SLFEFADKHRGSYQASCP----FYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLS---N 271
            LF+FA+ +RG Y  S      FY S + Y+DEL+WAAAWLY+A+ DN YL+   S    
Sbjct: 183 QLFDFANNYRGKYSDSITDARNFYAS-ADYRDELVWAAAWLYRATNDNTYLNTAESLYDE 241

Query: 272 QGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSVRIETT 331
            G        +WD+K +G Q+LLAK     NKQ   +K  V+S+V  L+        + T
Sbjct: 242 FGLQNWGGGLNWDSKVSGVQVLLAKL---TNKQ--AYKDTVQSYVNYLINNQ-----QKT 291

Query: 332 PGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKSQVDY 391
           P GLLYI     L++  +A  ++              L+      SAS    FA++Q+DY
Sbjct: 292 PKGLLYIDMWGTLRHAANAAFIM--------------LEAAELGLSASSYRQFAQTQIDY 337

Query: 392 ILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXXXXXVH 451
            LG      S++ GFGS  P + HHR +S P     PA  +C+               V 
Sbjct: 338 ALGDG--GRSFVCGFGSNPPTRPHHRSSSCP-----PAPATCD---WNTFNSPDPNYHVL 387

Query: 452 VGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLA 494
            GA+VGGPD ND + D R+DY H E  T  NA F  ++A L+A
Sbjct: 388 SGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALAALVA 430


>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
 pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
          Length = 605

 Score =  281 bits (719), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 172/473 (36%), Positives = 248/473 (52%), Gaps = 49/473 (10%)

Query: 39  FAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKFG 98
           F Y +AL K++ F+E QRSGKLPE+ RV WRG+S L+DG    ++L GG+YDAGD+VKFG
Sbjct: 4   FNYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFG 63

Query: 99  WPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGNA 158
           +PMA++ ++L+W A+E         Q+ YL+  +RW  D+ ++AH SP  LY QVGDG+A
Sbjct: 64  FPMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDA 123

Query: 159 DHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHSK 218
           DH+ W   E M   R  +++    PG               IVF   D  Y++ L+ H+K
Sbjct: 124 DHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAK 183

Query: 219 SLFEFADKHRGSYQASCP---FYCSYSGYQDELLWAAAWLYKASEDNKYL---DY---VL 269
            L+ FAD +RG Y    P   FY S+SGYQDEL+W A WLYKA+ D+ YL   +Y    L
Sbjct: 184 QLYTFADTYRGVYSDCVPAGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDFL 243

Query: 270 SNQGWSQVAS---EFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSV 326
           S +  + + S     +WD+K  G  +LLAKE  G  K +      ++ +   +    +  
Sbjct: 244 STEQQTDLRSYRWTIAWDDKSYGTYVLLAKE-TGKQKYIDDANRWLDYWTVGV----NGQ 298

Query: 327 RIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAK 386
           R+  +PGG+  +     L+Y  +   +  +Y+K ++                 +   FA 
Sbjct: 299 RVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDP-----------VRKQRYHDFAV 347

Query: 387 SQVDYILGKNPMKMSYMAGFGSKFPLQIHHR---GASIPSIGAHPAKVSCNDGXXXXXXX 443
            Q++Y LG NP   SY+ GFG+  P   HHR   G+   SI A PA+             
Sbjct: 348 RQINYALGDNPRNSSYVVGFGNNPPRNPHHRTAHGSWTDSI-ASPAE------------- 393

Query: 444 XXXXXXVHVGAIVGGPDS-NDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLAQ 495
                 V  GA+VGGP S ND + D R DY   E  T  NA F  ++A L+ +
Sbjct: 394 ---NRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEE 443


>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
           Endoglucanase From Separately Expressed Gh9 And Cbm3c
           Modules
          Length = 466

 Score =  260 bits (665), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 170/477 (35%), Positives = 229/477 (48%), Gaps = 56/477 (11%)

Query: 35  STTTFAYRDALGKAILFFEGQRSGKLPESQ-RVKWRGNSALSDGKPENVNLIGGYYDAGD 93
           +T  F Y +AL KAI F+E QRSGKL  S  R+ WRG+S L DGK   ++L GG+YDAGD
Sbjct: 20  ATGAFNYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGD 79

Query: 94  NVKFGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQV 153
           +VKF  PM+YS ++L WA  EY+       Q  ++   I+W  D+ ++ H      Y QV
Sbjct: 80  HVKFNLPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQV 139

Query: 154 GDGNADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRL 213
           GDG+ADH  W   E M   R  Y++   SPG               I+FKKVD  YS   
Sbjct: 140 GDGHADHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKEC 199

Query: 214 LNHSKSLFEFAD--KHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYV--- 268
           L H+K LFEFAD  K    Y A+  FY S+SG+ DEL WAA WLY A+ D+ YLD     
Sbjct: 200 LKHAKELFEFADTTKSDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESY 259

Query: 269 -----------LSNQGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVC 317
                      +    W+Q      WD+   G  +LLA+      K     + H++ +  
Sbjct: 260 SDKWGYEPQTNIPKYKWAQC-----WDDVTYGTYLLLARIKNDNGKYKEAIERHLDWWTT 314

Query: 318 ALMPESSSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFS 377
               E    RI  TP GL ++    +L+Y T+   L  +YS   N         G    +
Sbjct: 315 GYNGE----RITYTPKGLAWLDQWGSLRYATTTAFLACVYSDWEN---------GDKEKA 361

Query: 378 ASQISAFAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGA--SIPSIGAHPAKVSCND 435
            + +  FA+SQ DY LG      S++ GFG   P + HHR A  S       P +     
Sbjct: 362 KTYLE-FARSQADYALGS--TGRSFVVGFGENPPKRPHHRTAHGSWADSQMEPPE----- 413

Query: 436 GXXXXXXXXXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPL 492
                         V  GA+VGGPDS D + D  ++Y+  E     NA FVG +A +
Sbjct: 414 -----------HRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLAKM 459


>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
 pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
          Length = 475

 Score =  250 bits (638), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 162/477 (33%), Positives = 236/477 (49%), Gaps = 58/477 (12%)

Query: 35  STTTFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDN 94
           ++ +F Y +AL KAI+F+E Q SGKLP   R  WRG+SAL DG+   ++L GG++DAGD+
Sbjct: 22  ASGSFNYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDH 81

Query: 95  VKFGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVG 154
           VKF  PM+Y+ ++LSWA  EY+       QL ++   I W  D+ ++ H S    Y QVG
Sbjct: 82  VKFNLPMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVG 141

Query: 155 DGNADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLL 214
           DG+ DH  W   E M   R  +++T  SPG               IV K  +   ++  L
Sbjct: 142 DGSKDHAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYL 201

Query: 215 NHSKSLFEFAD--KHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLSN- 271
            H+K L+EFA+  K    Y A+  +Y S+SG+ DEL WAA WLY A+ D+ YL    S  
Sbjct: 202 QHAKELYEFAEVTKSDAGYTAANGYYNSWSGFYDELSWAAVWLYLATNDSTYLTKAESYV 261

Query: 272 QGWSQVASEFS--------WDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPES 323
           Q W +++   +        WD+   GA +LLAK   G +    + + H++ +      E 
Sbjct: 262 QNWPKISGSNTIDYKWAHCWDDVHNGAALLLAK-ITGKDIYKQIIESHLDYWTTGYNGE- 319

Query: 324 SSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISA 383
              RI+ TP GL ++    +L+Y T+   L F+YS  +          G           
Sbjct: 320 ---RIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWV----------GCPSTKKEIYRK 366

Query: 384 FAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGA--------SIPSIGAHPAKVSCND 435
           F +SQ+DY LG      S++ GFG+  P + HHR A        SIPS   H        
Sbjct: 367 FGESQIDYALGS--AGRSFVVGFGTNPPKRPHHRTAHSSWADSQSIPSYHRHTL------ 418

Query: 436 GXXXXXXXXXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPL 492
                            GA+VGGP S+D + D  ++Y + E     NA FVG++A +
Sbjct: 419 ----------------YGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALAKM 459


>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
 pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
          Length = 614

 Score =  221 bits (562), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 152/479 (31%), Positives = 232/479 (48%), Gaps = 43/479 (8%)

Query: 38  TFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKF 97
           T+ Y +AL K+I+F+E QRSG LP  +R  WR +S + DG    V+L GG+YDAGD+VKF
Sbjct: 3   TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62

Query: 98  GWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGN 157
             PM+Y+ ++L+W+  E +       Q  Y+   I+W  D+ ++ + +P   Y QVGDG 
Sbjct: 63  NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122

Query: 158 ADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHS 217
            DH  W   E M   R  +++ +  PG               +VFK  D  Y+ + ++H+
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182

Query: 218 KSLFEFADKHR--GSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLS---NQ 272
           K+LF+ ADK +    Y A+   Y S S + D+L WAA WLY A+ D+ YLD   S   N 
Sbjct: 183 KNLFDMADKAKSDAGYTAAS-GYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNW 241

Query: 273 GWSQVASEFS------WDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSV 326
           G  Q     +      WD+   GA++LLAK     NKQL  +K  +E  +       +  
Sbjct: 242 GKEQQTDIIAYKWGQXWDDVHYGAELLLAKLT---NKQL--YKDSIEMNLDFWTTGVNGT 296

Query: 327 RIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAK 386
           R+  TP GL ++    +L++ T+   L  +Y++            G      S    F K
Sbjct: 297 RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAE----------WEGCTPSKVSVYKDFLK 346

Query: 387 SQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXX 446
           SQ+DY LG      S++ G+G   P   HHR A     G+   +++              
Sbjct: 347 SQIDYALGS--TGRSFVVGYGVNPPQHPHHRTAH----GSWTDQMTS----------PTY 390

Query: 447 XXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLAQSEQEYLHFFQ 505
                 GA+VGGPD+ D + D   +Y + E     NA F G++A +   S  + +  F+
Sbjct: 391 HRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFK 449


>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
 pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
          Length = 614

 Score =  221 bits (562), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 152/479 (31%), Positives = 232/479 (48%), Gaps = 43/479 (8%)

Query: 38  TFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKF 97
           T+ Y +AL K+I+F+E QRSG LP  +R  WR +S + DG    V+L GG+YDAGD+VKF
Sbjct: 3   TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62

Query: 98  GWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGN 157
             PM+Y+ ++L+W+  E +       Q  Y+   I+W  D+ ++ + +P   Y QVGDG 
Sbjct: 63  NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122

Query: 158 ADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHS 217
            DH  W   E M   R  +++ +  PG               +VFK  D  Y+ + ++H+
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182

Query: 218 KSLFEFADKHR--GSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLS---NQ 272
           K+LF+ ADK +    Y A+   Y S S + D+L WAA WLY A+ D+ YLD   S   N 
Sbjct: 183 KNLFDMADKAKSDAGYTAAS-GYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNW 241

Query: 273 GWSQVASEFS------WDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSV 326
           G  Q     +      WD+   GA++LLAK     NKQL  +K  +E  +       +  
Sbjct: 242 GKEQQTDIIAYKWGQCWDDVHYGAELLLAKLT---NKQL--YKDSIEMNLDFWTTGVNGT 296

Query: 327 RIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAK 386
           R+  TP GL ++    +L++ T+   L  +Y++            G      S    F K
Sbjct: 297 RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAE----------WEGCTPSKVSVYKDFLK 346

Query: 387 SQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXX 446
           SQ+DY LG      S++ G+G   P   HHR A     G+   +++              
Sbjct: 347 SQIDYALGS--TGRSFVVGYGVNPPQHPHHRTAH----GSWTDQMTS----------PTY 390

Query: 447 XXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLAQSEQEYLHFFQ 505
                 GA+VGGPD+ D + D   +Y + E     NA F G++A +   S  + +  F+
Sbjct: 391 HRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFK 449


>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticum
 pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticium In Complex With Cellobiose
          Length = 441

 Score =  214 bits (545), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 141/467 (30%), Positives = 218/467 (46%), Gaps = 59/467 (12%)

Query: 38  TFAYRDALGKAILFFEGQRSG-KLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVK 96
           T  Y  AL  +I+FF+  + G +  E+    WRG    +DG    V+L GGY+DAGD+VK
Sbjct: 3   THDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVK 62

Query: 97  FGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDG 156
           FG P  YS ++L W+  E++    +      +   +++ TD+ L++H + TT Y QVG+G
Sbjct: 63  FGLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEG 122

Query: 157 NADHQCWERPEDMDTPR-TLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLN 215
           NADH  W  PE+    R +LY+    SP                + +K +DS Y+++ LN
Sbjct: 123 NADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLN 182

Query: 216 HSKSLFEFADKHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYV------- 268
            +K L+     ++G       FY + S + D+L WAA WLY A+ D+ Y+          
Sbjct: 183 AAKELYAMGKANQGVGNGQS-FYQATS-FGDDLAWAATWLYTATNDSTYITDAEQFITLG 240

Query: 269 -LSNQGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSVR 327
              N+   Q      WD+ +  A + LA        Q++  +I+ ++            +
Sbjct: 241 NTMNENKMQDKWTMCWDDMYVPAALRLA--------QITGKQIYKDAIEFNF--NYWKTQ 290

Query: 328 IETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKS 387
           + TTPGGL ++ +   L+Y  + ++++ +Y K                     +   AK 
Sbjct: 291 VTTTPGGLKWLSNWGVLRYAAAESMVMLVYCK---------------QNPDQSLLDLAKK 335

Query: 388 QVDYILGKNPMKMSYMAGFGSKFPLQIHHRGAS----IPSIGAHPAKVSCNDGXXXXXXX 443
           QVDYILG NP  MSY+ G+GS + +  HHR A+         A PAK             
Sbjct: 336 QVDYILGDNPANMSYIIGYGSNWCIHPHHRAANGYTYANGDNAKPAK------------- 382

Query: 444 XXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVA 490
                 +  GA+VGGPD ND+F D    Y + E     NA  VG +A
Sbjct: 383 -----HLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLA 424


>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
          Length = 611

 Score =  179 bits (454), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 143/509 (28%), Positives = 229/509 (44%), Gaps = 89/509 (17%)

Query: 39  FAYRDALGKAILFFEGQRSGK-LPESQRVKWRG--------------NSALSDG------ 77
           + Y  AL  ++ F++    G  + E+  + WRG              N+ +SDG      
Sbjct: 38  YNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSNR 97

Query: 78  ---KPE---NVNLIGGYYDAGDNVKFGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGA 131
               P+    V++ GG++DAGD+VKFG P AY+ S + W   E++ +  +  Q  +    
Sbjct: 98  SVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEVI 157

Query: 132 IRWGTDFILR-----AHTSPTTLYTQVGDGNADHQCWERPEDMDTPRTLYRITSDSPGXX 186
           +R+  D+ +R     A  +      QVGDG+ DH  W  PE+    R  + IT + PG  
Sbjct: 158 LRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGTD 217

Query: 187 XXXXXXXXXXXXXIVFKKVDSIYSSRLLNHSKSLFEFADKH-RGSYQAS--CPFYCSYSG 243
                        + FK  D  Y+++ L+++K+LF+FA+K+ +G  Q       Y   S 
Sbjct: 218 IISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQGEDGPKGYYGSSK 277

Query: 244 YQDELLWAAAWLYKASEDNKYLDYVL------SNQGWSQVASEFSWDNKFAGAQMLLAK- 296
           +QD+  WAAAWLY A+++  YLD         +  GW        W++ ++G   +LA+ 
Sbjct: 278 WQDDYCWAAAWLYLATQNEHYLDEAFKYYDYYAPPGWIHC-----WNDVWSGTACILAEI 332

Query: 297 -------------EFFGGNKQLSLFKIHVESFVCALMPESSSVRIETTPGGLLYIRDSSN 343
                         +   + +    +I     +  L+ + S   I  TPGG +++    +
Sbjct: 333 NDLYDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGITVTPGGYVFLNQWGS 392

Query: 344 LQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKSQVDYILGKNPMKMSYM 403
            +Y T+A L+  +Y K                 + S+ + +A+SQ+DY+LGKNP+   Y+
Sbjct: 393 ARYNTAAQLIALVYDKHHGD-------------TPSKYANWARSQMDYLLGKNPLNRCYV 439

Query: 404 AGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXXXXXVHVGAIVGGPDSND 463
            G+ S      HHR AS    G   A  S                 V  GA+VGGPD++D
Sbjct: 440 VGYSSNSVKYPHHRAAS----GLKDANDS------------SPHKYVLYGALVGGPDASD 483

Query: 464 QFKDLRTDYSHAEPTTYMNAAFVGSVAPL 492
           Q  D   DY + E     NAAFVG+ A L
Sbjct: 484 QHVDRTNDYIYNEVAIDYNAAFVGACAGL 512


>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
           Thermoacidophilic Alicyclobacillus Acidocaldarius
 pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
 pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellobiose
 pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellotetraose
 pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
           Isofagomine
 pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
           Isofagomine
 pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
           Isofagomine
          Length = 537

 Score = 63.2 bits (152), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 116/487 (23%), Positives = 186/487 (38%), Gaps = 74/487 (15%)

Query: 40  AYRDALGKAILFFEGQRSGK-LPESQRVKW-RGNSALSDGK----PENVNLIGGYYDAGD 93
           AYRD L   + FF+ Q  G  LPE +   W  G    SD K       +   GG++DAGD
Sbjct: 87  AYRDVLEAMLRFFDYQLCGVVLPEDEAGPWAHGACHTSDAKVFGTERALACPGGWHDAGD 146

Query: 94  NVKFGWPMAYSVSLLSW------AAVEYQREISSVNQLGYLRGAIRWGTDFI--LRAHTS 145
             K+  P A +V+ L        AA+ + R + SV++  +L  A+    + I  L     
Sbjct: 147 YGKYTVPAAKAVADLLLAHEYFPAALAHVRPMRSVHRAPHLPPALEVAREEIAWLLTMQD 206

Query: 146 PTT--LYTQVGDGNADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFK 203
           P T  +Y +V   +       RPED D P  L  I+                    +V++
Sbjct: 207 PATGGVYHKVTTPSFP-PLDTRPEDDDAPLVLSPIS-----YAATATFCAAMAHAALVYR 260

Query: 204 KVDSIYSSRLLNHSKSLFEFADKHRGSYQASCPFYCS---------YSGYQDELLWAAAW 254
             D   SS   + ++  + +   H        PF+            +  +DELLWA+  
Sbjct: 261 PFDPALSSCCADAARRAYAWLGAHEMQ-----PFHNPDGILTGEYGDAELRDELLWASCA 315

Query: 255 LYKASEDNKY-------LDYVLS-NQGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLS 306
           L + + D+ +       LD  L    GW+ VA  +   +     +  ++ +    NK  S
Sbjct: 316 LLRMTGDSAWARVCEPLLDLDLPWELGWADVAL-YGVMDYLRTPRAAVSDDV--RNKVKS 372

Query: 307 LFKIHVESFVCALMPESSSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHI 366
                +++   A M ES    I       ++    SN+  +  A  + FL ++ +   H 
Sbjct: 373 RLLRELDAL--AAMAESHPFGIPMRDDDFIW---GSNMVLLNRA--MAFLLAEGVGVLH- 424

Query: 367 NGLQCGSAHFSASQISAFAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGA 426
                 +AH         A+   DY+ G NP+   Y+ GFG +     HHR  S+     
Sbjct: 425 -----PAAH-------TVAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHR-PSVADDVD 471

Query: 427 HPAKVSCNDGXXXXXXXXXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFV 486
           HP       G             +    + G P + + + D +  YS  E   Y N+  V
Sbjct: 472 HPVP-----GMVVGGPNRHLQDEIARAQLAGRP-AMEAYIDHQDSYSTNEVAVYWNSPAV 525

Query: 487 GSVAPLL 493
             +A LL
Sbjct: 526 FVIAALL 532


>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
           Angstroms Resolution
          Length = 639

 Score = 42.0 bits (97), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 52/225 (23%), Positives = 89/225 (39%), Gaps = 26/225 (11%)

Query: 201 VFKKVDSIYSSRLLNHSKSLFEFADKHRGSYQA-----SCPFYCSYSGYQDELLWAAAWL 255
           +F+  D  Y+ + +N +K  +EF   +  +  A     S   Y + S   D+ LWAAA +
Sbjct: 302 IFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFSTGEYATVSD-ADDRLWAAAEM 360

Query: 256 YKASEDNKYL-DYV-LSNQGWSQVASEFSWDN-KFAGAQMLLAKEFFGGNKQLSLFKIHV 312
           ++   D +YL D+   + Q   ++ ++F WDN    G    L  E  G N  L      V
Sbjct: 361 WETLGDEEYLRDFENRAAQFSKKIEADFDWDNVANLGMFTYLLSERPGKNPAL------V 414

Query: 313 ESFVCALMPESSSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCG 372
           +S   +L+  + S+           +R S N  Y  +     +             LQ  
Sbjct: 415 QSIKDSLLSTADSI-----------VRTSQNHGYGRTLGTTYYWGCNGTVVRQTMILQVA 463

Query: 373 SAHFSASQISAFAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHR 417
           +     +     A   + ++ G+N    SY+ G G   P+  H R
Sbjct: 464 NKISPNNDYVNAALDAISHVFGRNYYNRSYVTGLGINPPMNPHDR 508


>pdb|1GXM|A Chain A, Family 10 Polysaccharide Lyase From Cellvibrio Cellulosa
 pdb|1GXM|B Chain B, Family 10 Polysaccharide Lyase From Cellvibrio Cellulosa
 pdb|1GXN|A Chain A, Family 10 Polysaccharide Lyase From Cellvibrio Cellulosa
          Length = 332

 Score = 32.3 bits (72), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 47/107 (43%), Gaps = 23/107 (21%)

Query: 41  YRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGK-PENVNLIGGYYDAGDNVKFGW 99
           YRDA+ KA  F                   NS  S G  P+   L GGY D       G 
Sbjct: 95  YRDAVRKAANFLV-----------------NSQYSTGALPQFYPLKGGYSDHATFNDNG- 136

Query: 100 PMAYSVSLLSWAA---VEYQREISSVNQLGYLRGAIRWGTDFILRAH 143
            MAY++++L +AA     +  ++ S N     + A+  GTD+IL+A 
Sbjct: 137 -MAYALTVLDFAANKRAPFDTDVFSDNDRTRFKTAVTKGTDYILKAQ 182


>pdb|1GXO|A Chain A, Mutant D189a Of Family 10 Polysaccharide Lyase From
           Cellvibrio Cellulosa In Complex With Trigalaturonic Acid
          Length = 332

 Score = 32.3 bits (72), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 47/107 (43%), Gaps = 23/107 (21%)

Query: 41  YRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGK-PENVNLIGGYYDAGDNVKFGW 99
           YRDA+ KA  F                   NS  S G  P+   L GGY D       G 
Sbjct: 95  YRDAVRKAANFLV-----------------NSQYSTGALPQFYPLKGGYSDHATFNDNG- 136

Query: 100 PMAYSVSLLSWAA---VEYQREISSVNQLGYLRGAIRWGTDFILRAH 143
            MAY++++L +AA     +  ++ S N     + A+  GTD+IL+A 
Sbjct: 137 -MAYALTVLDFAANKRAPFDTDVFSDNDRTRFKTAVTKGTDYILKAQ 182


>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 609

 Score = 30.8 bits (68), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 389 VDYILGKNPMKMSYMAGFGSKFPLQIHH 416
           + Y+LG+N M  SY+ G+G + PLQ  H
Sbjct: 504 ISYLLGRNAMDQSYVTGYGER-PLQNPH 530


>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 610

 Score = 30.8 bits (68), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 389 VDYILGKNPMKMSYMAGFGSKFPLQIHH 416
           + Y+LG+N M  SY+ G+G + PLQ  H
Sbjct: 504 ISYLLGRNAMDQSYVTGYGER-PLQNPH 530


>pdb|3H7L|A Chain A, Crystal Structure Of Endoglucanase-Related Protein From
           Vibrio Parahaemolyticus
 pdb|3H7L|B Chain B, Crystal Structure Of Endoglucanase-Related Protein From
           Vibrio Parahaemolyticus
 pdb|3H7L|C Chain C, Crystal Structure Of Endoglucanase-Related Protein From
           Vibrio Parahaemolyticus
          Length = 586

 Score = 30.4 bits (67), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 380 QISAFAKSQVDYILGKNPMKMSYMAGFGSKFP 411
           Q+S FA+  +++I+G NP  M  + G G   P
Sbjct: 469 QLSVFAQDALNWIVGLNPYDMCMLDGHGRNNP 500


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.319    0.134    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,289,812
Number of Sequences: 62578
Number of extensions: 627897
Number of successful extensions: 1136
Number of sequences better than 100.0: 15
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 1070
Number of HSP's gapped (non-prelim): 23
length of query: 511
length of database: 14,973,337
effective HSP length: 103
effective length of query: 408
effective length of database: 8,527,803
effective search space: 3479343624
effective search space used: 3479343624
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 54 (25.4 bits)