BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 047277
         (498 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 2.5.
 pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 5.6.
 pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 6.5
          Length = 433

 Score =  283 bits (723), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 178/477 (37%), Positives = 244/477 (51%), Gaps = 60/477 (12%)

Query: 21  SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
           ++DY   L  S+LF+E QRSG+LP +Q++TWRKDSAL D    G DLTGGY+DAGD VKF
Sbjct: 2   AYDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKF 61

Query: 81  NFPMAFSTTMLAWSVLEF--GKSMESDLQHALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
            FPMA++ T+LAW +++F  G S    L     A++WATDYF+KA +     Y QVG+  
Sbjct: 62  GFPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGD 121

Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
            DH  W RPEDM  AR  Y +    PG                 FR+ D TY+  LL  A
Sbjct: 122 ADHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHA 181

Query: 199 KTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENI 258
           + +F FA+ YRG Y+D +      FY   + Y DELVW AAWLYRAT    Y N   E++
Sbjct: 182 RQLFDFANNYRGKYSDSITD-ARNFYAS-ADYRDELVWAAAWLYRATNDNTYLN-TAESL 238

Query: 259 Y------NLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPK 312
           Y      N G G                  WD+K +G+ VL+++  N         +   
Sbjct: 239 YDEFGLQNWGGG----------------LNWDSKVSGVQVLLAKLTN------KQAYKDT 276

Query: 313 ADTFVCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCG 372
             ++V  L+          +P GLL+      L+HA   +F++             +   
Sbjct: 277 VQSYVNYLINNQQK-----TPKGLLYIDMWGTLRHAANAAFIM-------------LEAA 318

Query: 373 NVVVNRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIG 432
            + ++ +   Q A+ Q+DY LG      S++ G+G N P R HHRSSS P          
Sbjct: 319 ELGLSASSYRQFAQTQIDYALGDG--GRSFVCGFGSNPPTRPHHRSSSCPPA-----PAT 371

Query: 433 CQDGTKYYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
           C   T  ++S + N + L+GA+VGGPD ND+Y D RSD+VH+E  T  NA     LA
Sbjct: 372 CDWNT--FNSPDPNYHVLSGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALA 426


>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
 pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
          Length = 605

 Score =  268 bits (686), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 171/470 (36%), Positives = 236/470 (50%), Gaps = 39/470 (8%)

Query: 23  DYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKFNF 82
           +Y +AL KS+ F+E QRSGKLP N R++WR DS L DG  +G+DLTGG+YDAGD+VKF F
Sbjct: 5   NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGF 64

Query: 83  PMAFSTTMLAWSVLEF--GKSMESDLQHALDAIRWATDYFLKATSIPGIVYAQVGEPYGD 140
           PMAF+ TMLAW  +E   G      + +  D +RW  DYF+KA   P ++Y QVG+   D
Sbjct: 65  PMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDAD 124

Query: 141 HDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKT 200
           H  W   E M   R  + V    PG                 F   D  YA  L+  AK 
Sbjct: 125 HKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAKQ 184

Query: 201 VFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYN 260
           ++ FAD YRG Y+D +      FY  +SGY+DELVWGA WLY+AT   +Y     E  Y+
Sbjct: 185 LYTFADTYRGVYSDCVP--AGAFYNSWSGYQDELVWGAYWLYKATGDDSYLA-KAEYEYD 241

Query: 261 LGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTFVCSL 320
             S  + T+             WD+K+ G  VL+++            +I  A+ ++   
Sbjct: 242 FLSTEQQTDLRSYRWT----IAWDDKSYGTYVLLAK------ETGKQKYIDDANRWLDYW 291

Query: 321 LPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNRAR 380
                 + V YSPGG+        L++A   +F+ +VY++ +             V + R
Sbjct: 292 TVGVNGQRVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDP----------VRKQR 341

Query: 381 LVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKYY 440
               A  Q++Y LG NP N SY+VG+G N P+  HHR++        T  I         
Sbjct: 342 YHDFAVRQINYALGDNPRNSSYVVGFGNNPPRNPHHRTAH----GSWTDSIA-------- 389

Query: 441 DSKNSNPNELTGAVVGGPDA-NDSYADSRSDFVHSEPTTYINAPLVGLLA 489
            S   N + L GA+VGGP + ND+Y D R D+V +E  T  NA     LA
Sbjct: 390 -SPAENRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALA 438


>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
           Endoglucanase From Separately Expressed Gh9 And Cbm3c
           Modules
          Length = 466

 Score =  234 bits (597), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 164/473 (34%), Positives = 231/473 (48%), Gaps = 46/473 (9%)

Query: 23  DYGDALAKSILFFEGQRSGKLPPNQ-RMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKFN 81
           +YG+AL K+I F+E QRSGKL  +  R+ WR DS L+DG   G+DLTGG+YDAGD+VKFN
Sbjct: 25  NYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFN 84

Query: 82  FPMAFSTTMLAWSVLEFGKSMESDLQ--HALDAIRWATDYFLKATSIPGIVYAQVGEPYG 139
            PM++S  ML W+V E+  + +   Q  H L+ I+WA DYF+K      + Y QVG+ + 
Sbjct: 85  LPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQVGDGHA 144

Query: 140 DHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAK 199
           DH  W   E M   R  Y V +  PG                 F+  D  Y+ + L  AK
Sbjct: 145 DHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKECLKHAK 204

Query: 200 TVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIY 259
            +F FAD  + S + Y       FY  +SG+ DEL W A WLY AT   +Y +   E+  
Sbjct: 205 ELFEFADTTK-SDDGYTA--ANGFYNSWSGFYDELSWAAVWLYLATNDSSYLD-KAESYS 260

Query: 260 NLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTFVCS 319
           +       TN             WD+   G  +L++R  N  N           D +   
Sbjct: 261 DKWGYEPQTNIPKYKWAQC----WDDVTYGTYLLLARIKND-NGKYKEAIERHLDWWTTG 315

Query: 320 LLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNRA 379
              E     ++Y+P GL +     +L++ATT +FL  VYS        D   G+    + 
Sbjct: 316 YNGE----RITYTPKGLAWLDQWGSLRYATTTAFLACVYS--------DWENGDKEKAKT 363

Query: 380 RLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKY 439
            L + A+ Q DY LGS     S++VG+G+N P+R HHR++                   +
Sbjct: 364 YL-EFARSQADYALGST--GRSFVVGFGENPPKRPHHRTAH----------------GSW 404

Query: 440 YDSKNSNPNE---LTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
            DS+   P     L GA+VGGPD+ D+Y D  S++  +E     NA  VGLLA
Sbjct: 405 ADSQMEPPEHRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLA 457


>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
 pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
          Length = 614

 Score =  232 bits (591), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 162/473 (34%), Positives = 235/473 (49%), Gaps = 45/473 (9%)

Query: 21  SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
           +++YG+AL KSI+F+E QRSG LP ++R  WR DS ++DG  +GVDLTGG+YDAGD+VKF
Sbjct: 3   TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62

Query: 81  NFPMAFSTTMLAWSVLEFGKSMESDLQ--HALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
           N PM++++ MLAWS+ E   + +   Q  + +D I+WA DYF+K    PG+ Y QVG+  
Sbjct: 63  NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122

Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
            DH  W   E M   R  + V    PG                 F+S+D TYA K ++ A
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182

Query: 199 KTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENI 258
           K +F  AD+ +       G      Y   S + D+L W A WLY AT    Y +     +
Sbjct: 183 KNLFDMADKAKSD----AGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYV 238

Query: 259 YNLGSGRKITNXXXXXXXXXTEFG--WDNKNAGINVLVSRFINTINSNSSNPFIPKADTF 316
            N G  ++             ++G  WD+ + G  +L+++  N      S   I     F
Sbjct: 239 PNWGKEQQTD-------IIAYKWGQCWDDVHYGAELLLAKLTNKQLYKDS---IEMNLDF 288

Query: 317 VCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVV 376
             + +  +    VSY+P GL +     +L+HATT +FL  VY+ +            V V
Sbjct: 289 WTTGVNGT---RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEW-----EGCTPSKVSV 340

Query: 377 NRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDG 436
            +  L    K Q+DY LGS     S++VGYG N PQ  HHR++      Q T        
Sbjct: 341 YKDFL----KSQIDYALGST--GRSFVVGYGVNPPQHPHHRTAHGSWTDQMT-------- 386

Query: 437 TKYYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
                S   + + + GA+VGGPD  D Y D  +++V++E     NA   G LA
Sbjct: 387 -----SPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434


>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
 pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
          Length = 614

 Score =  232 bits (591), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 162/471 (34%), Positives = 233/471 (49%), Gaps = 41/471 (8%)

Query: 21  SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
           +++YG+AL KSI+F+E QRSG LP ++R  WR DS ++DG  +GVDLTGG+YDAGD+VKF
Sbjct: 3   TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62

Query: 81  NFPMAFSTTMLAWSVLEFGKSMESDLQ--HALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
           N PM++++ MLAWS+ E   + +   Q  + +D I+WA DYF+K    PG+ Y QVG+  
Sbjct: 63  NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122

Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
            DH  W   E M   R  + V    PG                 F+S+D TYA K ++ A
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182

Query: 199 KTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENI 258
           K +F  AD+ +       G      Y   S + D+L W A WLY AT    Y +     +
Sbjct: 183 KNLFDMADKAKSD----AGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYV 238

Query: 259 YNLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTFVC 318
            N G  ++                WD+ + G  +L+++  N      S   I     F  
Sbjct: 239 PNWGKEQQTDIIAYKWGQX-----WDDVHYGAELLLAKLTNKQLYKDS---IEMNLDFWT 290

Query: 319 SLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNR 378
           + +  +    VSY+P GL +     +L+HATT +FL  VY+ +            V V +
Sbjct: 291 TGVNGT---RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEW-----EGCTPSKVSVYK 342

Query: 379 ARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTK 438
             L    K Q+DY LGS     S++VGYG N PQ  HHR++      Q T          
Sbjct: 343 DFL----KSQIDYALGST--GRSFVVGYGVNPPQHPHHRTAHGSWTDQMT---------- 386

Query: 439 YYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
              S   + + + GA+VGGPD  D Y D  +++V++E     NA   G LA
Sbjct: 387 ---SPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434


>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
 pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
          Length = 475

 Score =  217 bits (553), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 154/481 (32%), Positives = 228/481 (47%), Gaps = 60/481 (12%)

Query: 21  SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
           S +YG+AL K+I+F+E Q SGKLP   R  WR DSAL+DG   G+DLTGG++DAGD+VKF
Sbjct: 25  SFNYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDHVKF 84

Query: 81  NFPMAFSTTMLAWSVLEFGKSM--ESDLQHALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
           N PM+++ TML+W+V E+  +      L+H L+ I W  DYF+K      + Y QVG+  
Sbjct: 85  NLPMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVGDGS 144

Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
            DH  W   E M   R  + V++  PG                  +  + T A   L  A
Sbjct: 145 KDHAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYLQHA 204

Query: 199 KTVFVFAD--RYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVE 256
           K ++ FA+  +    Y    G     +Y  +SG+ DEL W A WLY AT    Y      
Sbjct: 205 KELYEFAEVTKSDAGYTAANG-----YYNSWSGFYDELSWAAVWLYLATNDSTYLTKAES 259

Query: 257 NIYNLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTF 316
            + N     KI+              WD+ + G  +L+++          + +    ++ 
Sbjct: 260 YVQNW---PKISGSNTIDYKWA--HCWDDVHNGAALLLAKI------TGKDIYKQIIESH 308

Query: 317 VCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVV 376
           +         + + Y+P GL +     +L++ATT +FL  VYS ++         G    
Sbjct: 309 LDYWTTGYNGERIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWV---------GCPST 359

Query: 377 NRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHR--------SSSLPSVAQHT 428
            +    +  + Q+DY LGS     S++VG+G N P+R HHR        S S+PS  +HT
Sbjct: 360 KKEIYRKFGESQIDYALGS--AGRSFVVGFGTNPPKRPHHRTAHSSWADSQSIPSYHRHT 417

Query: 429 GKIGCQDGTKYYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLL 488
                                L GA+VGGP ++DSY D  S++V++E     NA  VG L
Sbjct: 418 ---------------------LYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGAL 456

Query: 489 A 489
           A
Sbjct: 457 A 457


>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticum
 pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticium In Complex With Cellobiose
          Length = 441

 Score =  197 bits (501), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 147/475 (30%), Positives = 212/475 (44%), Gaps = 59/475 (12%)

Query: 21  SHDYGDALAKSILFFEGQRSG-KLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVK 79
           +HDY  AL  SI+FF+  + G +   N    WR      DG  +GVDLTGGY+DAGD+VK
Sbjct: 3   THDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVK 62

Query: 80  FNFPMAFSTTMLAWSVLEFGKSMES--DLQHALDAIRWATDYFLKATSIPGIVYAQVGEP 137
           F  P  +S  +L WS+ EF +S ++  +    L  +++ TDYFLK+       Y QVGE 
Sbjct: 63  FGLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEG 122

Query: 138 YGDHDCWERPEDMDTAR-TPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLN 196
             DH  W  PE+    R + Y      P                  +++ DS YA K LN
Sbjct: 123 NADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLN 182

Query: 197 RAKTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVE 256
            AK ++      +G  N         FY   + + D+L W A WLY AT    Y     E
Sbjct: 183 AAKELYAMGKANQGVGNGQ------SFY-QATSFGDDLAWAATWLYTATNDSTYIT-DAE 234

Query: 257 NIYNLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTF 316
               LG+                   WD+      + +++        +       A  F
Sbjct: 235 QFITLGN-----TMNENKMQDKWTMCWDDMYVPAALRLAQI-------TGKQIYKDAIEF 282

Query: 317 VCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVV 376
             +         V+ +PGGL +      L++A   S +++VY +     N D        
Sbjct: 283 NFNYWK----TQVTTTPGGLKWLSNWGVLRYAAAESMVMLVYCK----QNPD-------- 326

Query: 377 NRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDG 436
               L+ +AK QVDYILG NP NMSY++GYG N+    HHR++               +G
Sbjct: 327 --QSLLDLAKKQVDYILGDNPANMSYIIGYGSNWCIHPHHRAA---------------NG 369

Query: 437 TKYYDSKNSNPNE--LTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
             Y +  N+ P +  LTGA+VGGPD ND + D  + + ++E     NA LVG+LA
Sbjct: 370 YTYANGDNAKPAKHLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLA 424


>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
          Length = 611

 Score =  154 bits (388), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 140/520 (26%), Positives = 224/520 (43%), Gaps = 97/520 (18%)

Query: 21  SHDYGDALAKSILFFEGQRSGK-LPPNQRMTWRKD--------------SALEDGF---- 61
            ++Y  AL  S+ F++    G  +  N  ++WR D              + + DGF    
Sbjct: 37  EYNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSN 96

Query: 62  --------QIGVDLTGGYYDAGDNVKFNFPMAFSTTMLAWSVLEFGKSMESDLQ--HALD 111
                      VD++GG++DAGD+VKF  P A++ + + W   EF     +  Q  HA  
Sbjct: 97  RSVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEV 156

Query: 112 AIRWATDYFLKAT--SIPGIVYA---QVGEPYGDHDCWERPEDMDTARTPYAVSKQFPGX 166
            +R+  DYF++ T     G V A   QVG+   DH  W  PE+    R  + ++K+ PG 
Sbjct: 157 ILRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGT 216

Query: 167 XXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKTVFVFADRY-RGSYNDYLGKWVCPFYC 225
                           F+ TD  YA K L+ AK +F FA++  +G      G      Y 
Sbjct: 217 DIISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQGEDGP---KGYY 273

Query: 226 DFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYNLGSGRKITNXXXXXXXXXTEFGWDN 285
             S ++D+  W AAWLY AT+  +Y          L    K  +             W++
Sbjct: 274 GSSKWQDDYCWAAAWLYLATQNEHY----------LDEAFKYYDYYAPPGWIHC---WND 320

Query: 286 KNAGINVLVSRFINTINSNSSNPFIPKADTF--------------VCSLLPESPTKSVSY 331
             +G   +++  IN +    S  F  +                  +  LL +     ++ 
Sbjct: 321 VWSGTACILAE-INDLYDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGITV 379

Query: 332 SPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNRARLVQVAKGQVDY 391
           +PGG +F     + ++ T    + +VY         D H G+     ++    A+ Q+DY
Sbjct: 380 TPGGYVFLNQWGSARYNTAAQLIALVY---------DKHHGDT---PSKYANWARSQMDY 427

Query: 392 ILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKYYDSKNSNPNE-- 449
           +LG NPLN  Y+VGY  N  +  HHR++S               G K  D+ +S+P++  
Sbjct: 428 LLGKNPLNRCYVVGYSSNSVKYPHHRAAS---------------GLK--DANDSSPHKYV 470

Query: 450 LTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
           L GA+VGGPDA+D + D  +D++++E     NA  VG  A
Sbjct: 471 LYGALVGGPDASDQHVDRTNDYIYNEVAIDYNAAFVGACA 510


>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
           Thermoacidophilic Alicyclobacillus Acidocaldarius
 pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
 pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellobiose
 pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellotetraose
 pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
           Isofagomine
 pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
           Isofagomine
 pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
           Isofagomine
          Length = 537

 Score = 45.8 bits (107), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 49/106 (46%), Gaps = 6/106 (5%)

Query: 384 VAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKYYDSK 443
           VA+   DY+ G+NPL   Y+ G+G+   +  HHR S    V      +      ++    
Sbjct: 430 VAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHRPSVADDVDHPVPGMVVGGPNRHLQ-- 487

Query: 444 NSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
               +E+  A + G  A ++Y D +  +  +E   Y N+P V ++A
Sbjct: 488 ----DEIARAQLAGRPAMEAYIDHQDSYSTNEVAVYWNSPAVFVIA 529


>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 609

 Score = 38.9 bits (89), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 100/470 (21%), Positives = 161/470 (34%), Gaps = 78/470 (16%)

Query: 66  DLTGGYYDAGDNVKF--NFPMAFSTTMLA-----------WSVLEFG----KSMESDLQH 108
           D+TGG+YDAGD+ K+  N  +A  T M             W     G        +    
Sbjct: 169 DVTGGWYDAGDHGKYVVNGGIAVWTLMNMYERAKIRGLDNWGPYRDGGMNIPEQNNGYPD 228

Query: 109 ALDAIRWATDYFLKAT-------SIPGIVYAQVGEPYGDHDCWERPEDMDTARTPYAVSK 161
            LD  RW  ++F K         SI G+V+ ++      HD       M     P     
Sbjct: 229 ILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKI------HDFRWTALGMLPHEDPQPRYL 282

Query: 162 QFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKTVFVFADRYRGSYNDYLGKWVC 221
           +                    ++  D T+A   L +A+  +  A ++   Y +Y      
Sbjct: 283 RPVSTAATLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQAALKHPDIYAEYTPGSGG 342

Query: 222 P----FYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYNLGSGRKITNXXXXXXXX 277
           P    +  D+ G  DE  W A  LY  T    Y NY++ + + L    K+          
Sbjct: 343 PGGGPYNDDYVG--DEFYWAACELYVTTGKDEYKNYLMNSPHYLEMPAKMGENGGANGED 400

Query: 278 XTEFG---WDNKNAGINVLVSRFINTINS----NSSNPFIPKADTFVCSLLPES---PTK 327
              +G   W        + ++   N + +     + N     AD ++ ++  +    P K
Sbjct: 401 NGLWGCFTWGTTQGLGTITLALVENGLPATDIQKARNNIAKAADRWLENIEEQGYRLPIK 460

Query: 328 SVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNV--VVNRARLVQVA 385
                 GG    P GSN       SF+L          N+ I  G        ++ +   
Sbjct: 461 QAEDERGGY---PWGSN-------SFIL----------NQMIVMGYAYDFTGDSKYLDGM 500

Query: 386 KGQVDYILGSNPLNMSYMVGYGKNFPQRIHHR----SSSLPSVAQHTGKIGCQDGTKYYD 441
              + Y+LG N ++ SY+ GYG+   Q  H R     +S    A   G I     +++ D
Sbjct: 501 FDGISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPAPPPGIISGGPNSRFED 560

Query: 442 SKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLAYF 491
                   +  AV         + D    +  +E T   NAP   + AY 
Sbjct: 561 PT------INAAVKKDTPPQKCFIDHTDSWSTNEITVNWNAPFAWVTAYL 604


>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 610

 Score = 37.7 bits (86), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 85/392 (21%), Positives = 137/392 (34%), Gaps = 68/392 (17%)

Query: 66  DLTGGYYDAGDNVKF--NFPMAFSTTMLA-----------WSVLEFG----KSMESDLQH 108
           D+TGG+YDAGD+ K+  N  +A  T M             W     G        +    
Sbjct: 169 DVTGGWYDAGDHGKYVVNGGIAVWTLMNMYERAKIRGLDNWGPYRDGGMNIPEQNNGYPD 228

Query: 109 ALDAIRWATDYFLKAT-------SIPGIVYAQVGEPYGDHDCWERPEDMDTARTPYAVSK 161
            LD  RW  ++F K         SI G+V+ ++      HD       M     P     
Sbjct: 229 ILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKI------HDFRWTALGMLPHEDPQPRYL 282

Query: 162 QFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKTVFVFADRYRGSYNDYLGKWVC 221
           +                    ++  D T+A   L +A+  +  A ++   Y +Y      
Sbjct: 283 RPVSTAATLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQAALKHPDIYAEYTPGSGG 342

Query: 222 P----FYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYNLGSGRKITNXXXXXXXX 277
           P    +  D+ G  DE  W A  LY  T    Y NY++ + + L    K+          
Sbjct: 343 PGGGPYNDDYVG--DEFYWAACELYVTTGKDEYKNYLMNSPHYLEMPAKMGENGGANGED 400

Query: 278 XTEFG---WDNKNAGINVLVSRFINTINS----NSSNPFIPKADTFVCSLLPES---PTK 327
              +G   W        + ++   N + +     + N     AD ++ ++  +    P K
Sbjct: 401 NGLWGCFTWGTTQGLGTITLALVENGLPATDIQKARNNIAKAADRWLENIEEQGYRLPIK 460

Query: 328 SVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNV--VVNRARLVQVA 385
                 GG    P GSN       SF+L          N+ I  G        ++ +   
Sbjct: 461 QAEDERGGY---PWGSN-------SFIL----------NQMIVMGYAYDFTGDSKYLDGM 500

Query: 386 KGQVDYILGSNPLNMSYMVGYGKNFPQRIHHR 417
              + Y+LG N ++ SY+ GYG+   Q  H R
Sbjct: 501 FDGISYLLGRNAMDQSYVTGYGERPLQNPHDR 532


>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
           Angstroms Resolution
          Length = 639

 Score = 30.0 bits (66), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 25/62 (40%), Gaps = 13/62 (20%)

Query: 371 CGNVVVNRARLVQVAK-------------GQVDYILGSNPLNMSYMVGYGKNFPQRIHHR 417
           C   VV +  ++QVA                + ++ G N  N SY+ G G N P   H R
Sbjct: 449 CNGTVVRQTMILQVANKISPNNDYVNAALDAISHVFGRNYYNRSYVTGLGINPPMNPHDR 508

Query: 418 SS 419
            S
Sbjct: 509 RS 510


>pdb|4F7U|Q Chain Q, Macromolecular Machine 6
 pdb|4F7U|P Chain P, Macromolecular Machine 6
          Length = 129

 Score = 29.6 bits (65), Expect = 4.0,   Method: Composition-based stats.
 Identities = 13/30 (43%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 130 VYAQVGEPYGD-HDCWERPEDMDTARTPYA 158
           VY  V E +G+  +CW  PED+ T  T Y+
Sbjct: 86  VYGDVDEQFGEVTECWLMPEDIATVDTMYS 115


>pdb|3VPS|A Chain A, Structure Of A Novel Nad Dependent-Ndp-Hexosamine
           5,6-Dehydratase, Tuna, Involved In Tunicamycin
           Biosynthesis
 pdb|3VPS|B Chain B, Structure Of A Novel Nad Dependent-Ndp-Hexosamine
           5,6-Dehydratase, Tuna, Involved In Tunicamycin
           Biosynthesis
          Length = 321

 Score = 29.3 bits (64), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 1/40 (2%)

Query: 123 ATSIPGIVYAQVGEPYGDHDCWERPEDMD-TARTPYAVSK 161
           +  +P +V     E YG  D    PED   + R+PYA SK
Sbjct: 108 SVGVPKVVVGSTCEVYGQADTLPTPEDSPLSPRSPYAASK 147


>pdb|3HV1|A Chain A, Crystal Structure Of A Polar Amino Acid Abc Uptake
           Transporter Substrate Binding Protein From Streptococcus
           Thermophilus
 pdb|3HV1|B Chain B, Crystal Structure Of A Polar Amino Acid Abc Uptake
           Transporter Substrate Binding Protein From Streptococcus
           Thermophilus
          Length = 268

 Score = 28.5 bits (62), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 414 IHHRSSSLPSVAQHTGK-IGCQDGTKYYDSKNSNPNELTGAV 454
           +  +SS + SVA   GK +G Q G+  YD+ N++P  L   V
Sbjct: 112 VTKKSSGIDSVAGXAGKTLGAQAGSSGYDAFNASPKILKDVV 153


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.318    0.135    0.417 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,383,346
Number of Sequences: 62578
Number of extensions: 668137
Number of successful extensions: 1217
Number of sequences better than 100.0: 15
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 1150
Number of HSP's gapped (non-prelim): 24
length of query: 498
length of database: 14,973,337
effective HSP length: 103
effective length of query: 395
effective length of database: 8,527,803
effective search space: 3368482185
effective search space used: 3368482185
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 53 (25.0 bits)