BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 010522
         (508 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 2.5.
 pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 5.6.
 pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 6.5
          Length = 433

 Score =  328 bits (841), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 187/461 (40%), Positives = 263/461 (57%), Gaps = 37/461 (8%)

Query: 43  HNYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
           ++Y+  L  S++F+E QRSG+LP +Q++TWR+DS L D      DL GGY+DAGD VKFG
Sbjct: 3   YDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFG 62

Query: 103 FPMAFTTTMLSWSVIEF--GGLMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
           FPMA+T T+L+W +I+F  G    G L + R+A++WATDY +KA    +  Y QVG  + 
Sbjct: 63  FPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGDA 122

Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
           DHA W RPEDM   R   KID + PGSD+AGET       S+VFR  D TYS  L+  A 
Sbjct: 123 DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHAR 182

Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
           ++F FA+ YRG YS+ +      FY S + Y+DEL+W AAWL++AT + TYLN  +    
Sbjct: 183 QLFDFANNYRGKYSDSITD-ARNFYAS-ADYRDELVWAAAWLYRATNDNTYLNTAESLYD 240

Query: 281 ILGAAEYDNTFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGASFSSAQ 340
             G   +     WD+K  G ++LL+      KL +   YKD   +++  LI     ++ Q
Sbjct: 241 EFGLQNWGGGLNWDSKVSGVQVLLA------KLTNKQAYKDTVQSYVNYLI-----NNQQ 289

Query: 341 YTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIAKRQVD 400
            TP GLL+      +++  + +F++L  A+               ++A   R  A+ Q+D
Sbjct: 290 KTPKGLLYIDMWGTLRHAANAAFIMLEAAE-------------LGLSASSYRQFAQTQID 336

Query: 401 YILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVL 460
           Y LGD     S++ G+G   P R HHR SS P     PA   C   ++   S  PN +VL
Sbjct: 337 YALGDG--GRSFVCGFGSNPPTRPHHRSSSCP-----PAPATCD--WNTFNSPDPNYHVL 387

Query: 461 VGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL 501
            GA+VGGPDQ+D + D+R DY  +E +T  NA    ALA L
Sbjct: 388 SGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALAAL 428


>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
 pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
          Length = 605

 Score =  321 bits (823), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 193/472 (40%), Positives = 262/472 (55%), Gaps = 40/472 (8%)

Query: 44  NYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFGF 103
           NY +AL KS+ F+E QRSGKLP N R++WR DSGL DGA + +DL GG+YDAGD+VKFGF
Sbjct: 5   NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGF 64

Query: 104 PMAFTTTMLSWSVIEF--GGLMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANKD 161
           PMAFT TML+W  IE   G +  G++   ++ +RW  DY +KA   P+ +YVQVGD + D
Sbjct: 65  PMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDAD 124

Query: 162 HACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAIR 221
           H  W   E M   R   K+D + PGSDVA ET       S+VF   DP Y+  LV+ A +
Sbjct: 125 HKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAKQ 184

Query: 222 VFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQI 281
           ++ FAD YRG YS+ +      FY S+SGYQDEL+WGA WL+KAT + +YL   +     
Sbjct: 185 LYTFADTYRGVYSDCVP--AGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDF 242

Query: 282 LGAAE------YDNTFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGAS 335
           L   +      Y  T  WD+K  G  +LL+K    QK      Y D A+ ++     G +
Sbjct: 243 LSTEQQTDLRSYRWTIAWDDKSYGTYVLLAKETGKQK------YIDDANRWLDYWTVGVN 296

Query: 336 FSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIA 395
                Y+PGG+    +   ++Y  +T+F+ L YAK +             V  +R    A
Sbjct: 297 GQRVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDP----------VRKQRYHDFA 346

Query: 396 KRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSP 455
            RQ++Y LGDNP   SY+VG+G   P+  HHR        AH       S    + S + 
Sbjct: 347 VRQINYALGDNPRNSSYVVGFGNNPPRNPHHR-------TAH------GSWTDSIASPAE 393

Query: 456 NPNVLVGAIVGGPDQ-HDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFG 506
           N +VL GA+VGGP   +D + D+R DY  +E +T  NA    ALA L   +G
Sbjct: 394 NRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEEYG 445


>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
           Endoglucanase From Separately Expressed Gh9 And Cbm3c
           Modules
          Length = 466

 Score =  251 bits (640), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 164/477 (34%), Positives = 236/477 (49%), Gaps = 47/477 (9%)

Query: 44  NYRDALTKSIIFFEGQRSGKLPPNQ-RMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
           NY +AL K+I F+E QRSGKL  +  R+ WR DSGL DG    +DL GG+YDAGD+VKF 
Sbjct: 25  NYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFN 84

Query: 103 FPMAFTTTMLSWSVIEFGGLMK--GELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
            PM+++  ML W+V E+    K  G+  +    I+WA DY +K     D  Y QVGD + 
Sbjct: 85  LPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQVGDGHA 144

Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
           DHA W   E M   R   K+DR+ PGS V  ET       S++F++ D  YSK  ++ A 
Sbjct: 145 DHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKECLKHAK 204

Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
            +F+FAD  +   S+        FY S+SG+ DEL W A WL+ AT + +YL+  +    
Sbjct: 205 ELFEFADTTK---SDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESYSD 261

Query: 281 ILG------AAEYDNTFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGA 334
             G        +Y     WD+   G  +LL++           +YK+  +  +     G 
Sbjct: 262 KWGYEPQTNIPKYKWAQCWDDVTYGTYLLLARI-----KNDNGKYKEAIERHLDWWTTGY 316

Query: 335 SFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTI 394
           +     YTP GL +     +++Y T+T+FL   Y+ +           G    AK     
Sbjct: 317 NGERITYTPKGLAWLDQWGSLRYATTTAFLACVYSDWEN---------GDKEKAKTYLEF 367

Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
           A+ Q DY LG      S++VG+G   P+R HHR                ++  S   SQ 
Sbjct: 368 ARSQADYALGST--GRSFVVGFGENPPKRPHHR----------------TAHGSWADSQM 409

Query: 455 PNP---NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFGQL 508
             P   +VL GA+VGGPD  D + D+  +Y  +E +   NA  VG LA +   +G+L
Sbjct: 410 EPPEHRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLAKMYKLYGEL 466


>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
 pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
          Length = 614

 Score =  243 bits (621), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 165/473 (34%), Positives = 237/473 (50%), Gaps = 43/473 (9%)

Query: 43  HNYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
           +NY +AL KSI+F+E QRSG LP ++R  WR DSG+ DG+ + VDL GG+YDAGD+VKF 
Sbjct: 4   YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63

Query: 103 FPMAFTTTMLSWSVIEFGGL--MKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
            PM++T+ ML+WS+ E        G+ +   + I+WA DY +K    P   Y QVGD  K
Sbjct: 64  LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123

Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
           DH+ W   E M   R   K+D + PGS V   T       ++VF+ SDPTY++  +  A 
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183

Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
            +F  ADK +    +         Y S S + D+L W A WL+ AT + TYL+  +    
Sbjct: 184 NLFDMADKAK----SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVP 239

Query: 281 ILGAAEYDNTFG------WDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGA 334
             G  +  +         WD+ H GA +LL+      KL +   YKD  +  +     G 
Sbjct: 240 NWGKEQQTDIIAYKWGQCWDDVHYGAELLLA------KLTNKQLYKDSIEMNLDFWTTGV 293

Query: 335 SFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTI 394
           + +   YTP GL +     ++++ T+ +FL   YA++          G T       +  
Sbjct: 294 NGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWE---------GCTPSKVSVYKDF 344

Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
            K Q+DY LG      S++VGYG   PQ  HHR        AH       S    M S +
Sbjct: 345 LKSQIDYALGST--GRSFVVGYGVNPPQHPHHR-------TAH------GSWTDQMTSPT 389

Query: 455 PNPNVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL-AHSFG 506
            + + + GA+VGGPD  D + DE  +Y  +E +   NA   GALA +  HS G
Sbjct: 390 YHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGG 442


>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
 pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
          Length = 614

 Score =  243 bits (621), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 165/473 (34%), Positives = 237/473 (50%), Gaps = 43/473 (9%)

Query: 43  HNYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
           +NY +AL KSI+F+E QRSG LP ++R  WR DSG+ DG+ + VDL GG+YDAGD+VKF 
Sbjct: 4   YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63

Query: 103 FPMAFTTTMLSWSVIEFGGL--MKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
            PM++T+ ML+WS+ E        G+ +   + I+WA DY +K    P   Y QVGD  K
Sbjct: 64  LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123

Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
           DH+ W   E M   R   K+D + PGS V   T       ++VF+ SDPTY++  +  A 
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183

Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
            +F  ADK +    +         Y S S + D+L W A WL+ AT + TYL+  +    
Sbjct: 184 NLFDMADKAK----SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVP 239

Query: 281 ILGAAEYDNTFG------WDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGA 334
             G  +  +         WD+ H GA +LL+      KL +   YKD  +  +     G 
Sbjct: 240 NWGKEQQTDIIAYKWGQXWDDVHYGAELLLA------KLTNKQLYKDSIEMNLDFWTTGV 293

Query: 335 SFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTI 394
           + +   YTP GL +     ++++ T+ +FL   YA++          G T       +  
Sbjct: 294 NGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWE---------GCTPSKVSVYKDF 344

Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
            K Q+DY LG      S++VGYG   PQ  HHR        AH       S    M S +
Sbjct: 345 LKSQIDYALGST--GRSFVVGYGVNPPQHPHHR-------TAH------GSWTDQMTSPT 389

Query: 455 PNPNVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL-AHSFG 506
            + + + GA+VGGPD  D + DE  +Y  +E +   NA   GALA +  HS G
Sbjct: 390 YHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGG 442


>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
 pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
          Length = 475

 Score =  240 bits (613), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 162/472 (34%), Positives = 240/472 (50%), Gaps = 43/472 (9%)

Query: 44  NYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFGF 103
           NY +AL K+I+F+E Q SGKLP   R  WR DS L DG    +DL GG++DAGD+VKF  
Sbjct: 27  NYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDHVKFNL 86

Query: 104 PMAFTTTMLSWSVIEFGG--LMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANKD 161
           PM++T TMLSW+V E+    +  G+L +    I W  DY +K        Y QVGD +KD
Sbjct: 87  PMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVGDGSKD 146

Query: 162 HACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAIR 221
           HA W   E M   R   K+ ++ PGS V  ET       S+V +  +PT +   ++ A  
Sbjct: 147 HAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYLQHAKE 206

Query: 222 VFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYL----NYIQV 277
           +++FA+  +   S+        +Y S+SG+ DEL W A WL+ AT + TYL    +Y+Q 
Sbjct: 207 LYEFAEVTK---SDAGYTAANGYYNSWSGFYDELSWAAVWLYLATNDSTYLTKAESYVQN 263

Query: 278 NGQILGAAEYDNTFG--WDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGAS 335
             +I G+   D  +   WD+ H GA +LL+      K+     YK   ++ +     G +
Sbjct: 264 WPKISGSNTIDYKWAHCWDDVHNGAALLLA------KITGKDIYKQIIESHLDYWTTGYN 317

Query: 336 FSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRL-RTI 394
               +YTP GL +     +++Y T+T+FL   Y+ ++          G   T K + R  
Sbjct: 318 GERIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWV----------GCPSTKKEIYRKF 367

Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
            + Q+DY LG      S++VG+G   P+R HHR        AH +     S  S  +   
Sbjct: 368 GESQIDYALGS--AGRSFVVGFGTNPPKRPHHR-------TAHSSWADSQSIPSYHR--- 415

Query: 455 PNPNVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFG 506
              + L GA+VGGP   D + D+  +Y  +E +   NA  VGALA +   +G
Sbjct: 416 ---HTLYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALAKMYQLYG 464


>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticum
 pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticium In Complex With Cellobiose
          Length = 441

 Score =  213 bits (542), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 152/478 (31%), Positives = 228/478 (47%), Gaps = 64/478 (13%)

Query: 43  HNYRDALTKSIIFFEGQRSG-KLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKF 101
           H+Y  AL  SIIFF+  + G +   N    WR     TDG+ + VDL GGY+DAGD+VKF
Sbjct: 4   HDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVKF 63

Query: 102 GFPMAFTTTMLSWSVIEFGGLMK--GELRNAREAIRWATDYLLKATVHPDTIYVQVGDAN 159
           G P  ++  +L WS+ EF       G      + +++ TDY LK+  +  T Y QVG+ N
Sbjct: 64  GLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEGN 123

Query: 160 KDHACWERPEDMDTPR-TVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRR 218
            DH  W  PE+    R ++ K D + P SD+  ET        L ++  D  Y+   +  
Sbjct: 124 ADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLNA 183

Query: 219 AIRVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYL----NY 274
           A  ++      +G   NG       FY + S + D+L W A WL+ AT + TY+     +
Sbjct: 184 AKELYAMGKANQGV-GNGQS-----FYQATS-FGDDLAWAATWLYTATNDSTYITDAEQF 236

Query: 275 IQVNGQILGAAEYDN-TFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHAD-NFICSLIP 332
           I +   +      D  T  WD+ +V A + L+      ++     YKD  + NF      
Sbjct: 237 ITLGNTMNENKMQDKWTMCWDDMYVPAALRLA------QITGKQIYKDAIEFNF------ 284

Query: 333 GASFSSAQY--TPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKR 390
             ++   Q   TPGGL +  +   ++Y  + S ++L Y K                  + 
Sbjct: 285 --NYWKTQVTTTPGGLKWLSNWGVLRYAAAESMVMLVYCK--------------QNPDQS 328

Query: 391 LRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIM 450
           L  +AK+QVDYILGDNP  MSY++GYG  +    HHR                ++G++  
Sbjct: 329 LLDLAKKQVDYILGDNPANMSYIIGYGSNWCIHPHHR---------------AANGYTYA 373

Query: 451 KSQSPNP--NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFG 506
              +  P  ++L GA+VGGPDQ+D+F D+   Y+ +E +   NA LVG LA     FG
Sbjct: 374 NGDNAKPAKHLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLAGAIKFFG 431


>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
          Length = 611

 Score =  167 bits (424), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 144/521 (27%), Positives = 216/521 (41%), Gaps = 94/521 (18%)

Query: 42  KHNYRDALTKSIIFFEGQRSGK-LPPNQRMTWRRD--------------SGLTDG----- 81
           ++NY  AL  S+ F++    G  +  N  ++WR D              + ++DG     
Sbjct: 37  EYNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSN 96

Query: 82  -------AAMHVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGLMK--GELRNARE 132
                      VD+ GG++DAGD+VKFG P A+  + + W   EF    +  G+  +A  
Sbjct: 97  RSVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEV 156

Query: 133 AIRWATDYLLKATVHPDTIYV-----QVGDANKDHACWERPEDMDTPRTVIKIDRNYPGS 187
            +R+  DY ++ T    +  V     QVGD + DHA W  PE+    R    I +  PG+
Sbjct: 157 ILRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGT 216

Query: 188 DVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKKFVCPFYCS 247
           D+   T        + F+ +DP Y+   +  A  +F FA+K       G +     +Y S
Sbjct: 217 DIISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQG-EDGPKGYYGS 275

Query: 248 YSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQILGAAEYDNTFGWDNKHVGARILLSKA 307
            S +QD+  W AAWL+ AT+N  YL+      +     +Y    GW   H    +    A
Sbjct: 276 -SKWQDDYCWAAAWLYLATQNEHYLD------EAFKYYDYYAPPGW--IHCWNDVWSGTA 326

Query: 308 FLVQKLQSLHEYKDHADNF---------------------ICSLIPGASFSSAQYTPGGL 346
            ++ ++  L  Y   + NF                     I  L+   S      TPGG 
Sbjct: 327 CILAEINDL--YDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGITVTPGGY 384

Query: 347 LFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIAKRQVDYILGDN 406
           +F     + +Y T+   + L Y K+               T  +    A+ Q+DY+LG N
Sbjct: 385 VFLNQWGSARYNTAAQLIALVYDKHHGD------------TPSKYANWARSQMDYLLGKN 432

Query: 407 PLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVLVGAIVG 466
           PL   Y+VGY     +  HHR                +SG       SP+  VL GA+VG
Sbjct: 433 PLNRCYVVGYSSNSVKYPHHR---------------AASGLKDANDSSPHKYVLYGALVG 477

Query: 467 GPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFGQ 507
           GPD  D+  D   DY  +E +   NA  VGA A L   FG 
Sbjct: 478 GPDASDQHVDRTNDYIYNEVAIDYNAAFVGACAGLYRFFGD 518


>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
           Thermoacidophilic Alicyclobacillus Acidocaldarius
 pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
 pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellobiose
 pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellotetraose
 pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
           Isofagomine
 pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
           Isofagomine
 pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
           Isofagomine
          Length = 537

 Score = 61.6 bits (148), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 117/511 (22%), Positives = 184/511 (36%), Gaps = 103/511 (20%)

Query: 42  KHNYRDALTKSIIFFEGQRSGK-LPPNQRMTWRRDSGLTDGAAMH-----VDLVGGYYDA 95
           +  YRD L   + FF+ Q  G  LP ++   W   +  T  A +      +   GG++DA
Sbjct: 85  RRAYRDVLEAMLRFFDYQLCGVVLPEDEAGPWAHGACHTSDAKVFGTERALACPGGWHDA 144

Query: 96  GDNVKFGFPMAFTTTMLSWSVIEFGGLMK---------------GELRNAREAIRWATDY 140
           GD  K+  P A     L  +   F   +                  L  ARE I W    
Sbjct: 145 GDYGKYTVPAAKAVADLLLAHEYFPAALAHVRPMRSVHRAPHLPPALEVAREEIAWLLTM 204

Query: 141 LLKATVHPDTIYVQVGDANKDHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXX 200
              AT     +Y +V   +       RPED D P  +  I      +  A          
Sbjct: 205 QDPAT---GGVYHKVTTPSFP-PLDTRPEDDDAPLVLSPISYAATATFCAA-----MAHA 255

Query: 201 SLVFRRSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKKFVCPFYCSYSGY-----QDEL 255
           +LV+R  DP  S      A R +     + GA+   ++ F  P       Y     +DEL
Sbjct: 256 ALVYRPFDPALSSCCADAARRAY----AWLGAHE--MQPFHNPDGILTGEYGDAELRDEL 309

Query: 256 LWGAAWLHKATRNPTYLNYIQVNGQILGAAEYDNTFGWDNKHVGARILLSKAFLVQKLQS 315
           LW +  L + T +  +    +V   +L     D    W+       +     +L     +
Sbjct: 310 LWASCALLRMTGDSAW---ARVCEPLL-----DLDLPWELGWADVALYGVMDYLRTPRAA 361

Query: 316 LHEYKDHADNFICSLIP---GASFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYL 372
           +    D   N + S +     A  + A+  P G+  +  D    ++  ++ +LL  A   
Sbjct: 362 V---SDDVRNKVKSRLLRELDALAAMAESHPFGIPMRDDD----FIWGSNMVLLNRAMAF 414

Query: 373 TSAHRVVNCGGTTVTAKRLRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLP 432
             A       G  V      T+A+R  DY+ G NPL   Y+ G+G R  +  HHR S   
Sbjct: 415 LLAE------GVGVLHPAAHTVAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHRPSVAD 468

Query: 433 SVGAHPARIQCSSGFSIMKSQSPNPNVLVGAIVGGPDQH----------------DRFPD 476
            V  HP                     + G +VGGP++H                + + D
Sbjct: 469 DVD-HP---------------------VPGMVVGGPNRHLQDEIARAQLAGRPAMEAYID 506

Query: 477 ERPDYEQSEPSTYMNAPLVGALAYLAHSFGQ 507
            +  Y  +E + Y N+P V  +A L  + G+
Sbjct: 507 HQDSYSTNEVAVYWNSPAVFVIAALLEARGR 537


>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 609

 Score = 32.3 bits (72), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 31/104 (29%), Positives = 42/104 (40%), Gaps = 4/104 (3%)

Query: 399 VDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQ-CSSGFSIMKSQSPNP 457
           + Y+LG N +  SY+ GYG R  Q  H R  +  +    PA      SG    + + P  
Sbjct: 504 ISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPAPPPGIISGGPNSRFEDPTI 563

Query: 458 NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL 501
           N  V      P Q   F D    +  +E +   NAP     AYL
Sbjct: 564 NAAVKKDT--PPQK-CFIDHTDSWSTNEITVNWNAPFAWVTAYL 604


>pdb|2XR7|A Chain A, Crystal Structure Of Nicotiana Tabacum Malonyltransferase
           (Ntmat1) Complexed With Malonyl-Coa
          Length = 453

 Score = 32.3 bits (72), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 30/122 (24%), Positives = 47/122 (38%), Gaps = 7/122 (5%)

Query: 346 LLFKMSDSNMQYV-----TSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIAKRQVD 400
           L +K+  S   +V     T    L LT   YL  A  V  C         LR +    V 
Sbjct: 41  LFYKLPISRPDFVQTIIPTLKDSLSLTLKYYLPLAGNVA-CPQDWSGYPELRYVTGNSVS 99

Query: 401 YILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVL 460
            I  ++    +Y++GY PR  +  +H    L      P  +Q +   +I  +  PN  + 
Sbjct: 100 VIFSESDXDFNYLIGYHPRNTKDFYHFVPQLAEPKDAPG-VQLAPVLAIQVTLFPNHGIS 158

Query: 461 VG 462
           +G
Sbjct: 159 IG 160


>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
           Angstroms Resolution
          Length = 639

 Score = 31.6 bits (70), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 93/445 (20%), Positives = 154/445 (34%), Gaps = 84/445 (18%)

Query: 80  DGAAMHVDLVGGYYDAGDNVKFGFPMAFT--TTMLSWS----VIEFGGLMKGELRNA--- 130
           +G     D   G++DAGD  K+      T  +  L+W      +E   L   E  N+   
Sbjct: 174 NGQHTKKDSTKGWHDAGDYNKYVVNAGITVGSMFLAWEHFKDQLEPVALEIPEKNNSIPD 233

Query: 131 -REAIRWATDYLLKATVHPDTIYVQVGDANKDHACWERPEDMDTPRTVIKIDRNYPGSDV 189
             + +++  D++L       +  V    + ++   +  PE+    R  +      P S  
Sbjct: 234 FLDELKYEIDWILTMQYPDGSGRVAHKVSTRNFGGFIMPENEHDERFFV------PWSSA 287

Query: 190 A-GETXXXXXXXSLVFRRSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKKFVCPFYCSY 248
           A  +        + +FR  DP Y++  +  A   ++F              F    Y + 
Sbjct: 288 ATADFVAMTAMAARIFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFSTGEYATV 347

Query: 249 SGYQDELLWGAAWLHKATRNPTYL-NYIQVNGQILGAAEYDNTFGWDNKHVGARILLSKA 307
           S   D+ LW AA + +   +  YL ++     Q     E D  F WDN            
Sbjct: 348 SD-ADDRLWAAAEMWETLGDEEYLRDFENRAAQFSKKIEAD--FDWDN------------ 392

Query: 308 FLVQKLQSLHEYKDHADNFICSLIPGASFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLT 367
             V  L            ++ S  PG + +  Q     LL   +DS ++  TS +     
Sbjct: 393 --VANLGMF--------TYLLSERPGKNPALVQSIKDSLL-STADSIVR--TSQNH---G 436

Query: 368 YAKYLTSAHRVVNCGGTTVTAKRLRTIAKR-------------QVDYILGDNPLKMSYMV 414
           Y + L + +    C GT V    +  +A +              + ++ G N    SY+ 
Sbjct: 437 YGRTLGTTY-YWGCNGTVVRQTMILQVANKISPNNDYVNAALDAISHVFGRNYYNRSYVT 495

Query: 415 GYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVLVGAIVGGPDQHDRF 474
           G G   P   H R S    +                    P P  LVG    GP     +
Sbjct: 496 GLGINPPMNPHDRRSGADGI------------------WEPWPGYLVGGGWPGPKD---W 534

Query: 475 PDERPDYEQSEPSTYMNAPLVGALA 499
            D +  Y+ +E +   NA L+ ALA
Sbjct: 535 VDIQDSYQTNEIAINWNAALIYALA 559


>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 610

 Score = 31.2 bits (69), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 43/108 (39%), Gaps = 4/108 (3%)

Query: 399 VDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQ-CSSGFSIMKSQSPNP 457
           + Y+LG N +  SY+ GYG R  Q  H R  +  +    PA      SG    + + P  
Sbjct: 504 ISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPAPPPGIISGGPNSRFEDPTI 563

Query: 458 NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSF 505
           N  V      P Q   F D    +  ++ +   NAP     AYL   +
Sbjct: 564 NAAVKKDT--PPQK-CFIDHTDSWGTNQITVNWNAPFAWVTAYLDEQY 608


>pdb|2AYX|A Chain A, Solution Structure Of The E.Coli Rcsc C-Terminus (Residues
           700-949) Containing Linker Region And Phosphoreceiver
           Domain
          Length = 254

 Score = 30.8 bits (68), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 4/46 (8%)

Query: 391 LRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGA 436
           L  ++K  +D +L D  + M  M GY  R  QRI   G +LP +G 
Sbjct: 166 LNVLSKNHIDIVLSD--VNMPNMDGY--RLTQRIRQLGLTLPVIGV 207


>pdb|3OLM|A Chain A, Structure And Function Of A Ubiquitin Binding Site Within
           The Catalytic Domain Of A Hect Ubiquitin Ligase
          Length = 429

 Score = 30.4 bits (67), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 243 PFYC--SYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQILGAAEYDNTFGWDNKHVGA 300
           PFYC   YS Y +   +       +  NP +LNY +  G+++G   +   F  D   VGA
Sbjct: 134 PFYCLFEYSAYDN---YTIQINPNSGINPEHLNYFKFIGRVVGLGVFHRRF-LDAFFVGA 189

Query: 301 --RILLSKAFLVQKLQSL 316
             +++L K  ++Q ++ +
Sbjct: 190 LYKMMLRKKVVLQDMEGV 207


>pdb|2AYZ|A Chain A, Solution Structure Of The E.Coli Rcsc C-Terminus (Residues
           817-949) Containing Phosphoreceiver Domain
          Length = 133

 Score = 30.4 bits (67), Expect = 2.4,   Method: Composition-based stats.
 Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 4/45 (8%)

Query: 391 LRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVG 435
           L  ++K  +D +L D  + M  M GY  R  QRI   G +LP +G
Sbjct: 45  LNVLSKNHIDIVLSD--VNMPNMDGY--RLTQRIRQLGLTLPVIG 85


>pdb|3GFX|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 4.5 CalciumCY-Digmp Complex
 pdb|3GFX|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 4.5 CalciumCY-Digmp Complex
 pdb|3GFY|A Chain A, Klebsiella Pneumoniae Blrp1 With Fmn And Cyclic Digmp, No
           Metal Ions
 pdb|3GFY|B Chain B, Klebsiella Pneumoniae Blrp1 With Fmn And Cyclic Digmp, No
           Metal Ions
 pdb|3GFZ|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 6 Manganese/cy-digmp
           Complex
 pdb|3GFZ|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 6 Manganese/cy-digmp
           Complex
 pdb|3GG0|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 9.0 ManganeseCY-Digmp
           Complex
 pdb|3GG0|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 9.0 ManganeseCY-Digmp
           Complex
 pdb|3GG1|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 8.0 CalciumCY-Digmp Complex
 pdb|3GG1|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 8.0 CalciumCY-Digmp Complex
          Length = 413

 Score = 29.3 bits (64), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 122 LMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDAN 159
           L+ G L +  +A+ W  D LL A + PD + ++V +  
Sbjct: 248 LLPGSLYHHPDAVGWLMDSLLAAGLRPDQVLIEVTETE 285


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.321    0.136    0.425 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,681,231
Number of Sequences: 62578
Number of extensions: 680172
Number of successful extensions: 1285
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 1220
Number of HSP's gapped (non-prelim): 23
length of query: 508
length of database: 14,973,337
effective HSP length: 103
effective length of query: 405
effective length of database: 8,527,803
effective search space: 3453760215
effective search space used: 3453760215
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (25.4 bits)