BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037588
         (493 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 2.5.
 pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 5.6.
 pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
           Takasagoensis, At Ph 6.5
          Length = 433

 Score =  268 bits (684), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 170/468 (36%), Positives = 234/468 (50%), Gaps = 54/468 (11%)

Query: 35  FDYKDALGKSIIFLEAQRSGRLPPRHRPAWRGDSGLEDGKQANVDLVGGYYDAGDNVKYG 94
           +DYK  L  S++F EAQRSGRLP   +  WR DS L D      DL GGY+DAGD VK+G
Sbjct: 3   YDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFG 62

Query: 95  LPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQVGDPVQ 154
            PMA+T + L+   + +     SAG L +   A++W TDYF+K  + +N+ Y QVG    
Sbjct: 63  FPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGDA 122

Query: 155 DHQCWARPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKAK 214
           DH  W RPE+M   R    ID+  PG                VFR VD  YS  LL  A+
Sbjct: 123 DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHAR 182

Query: 215 LLFHFANSHKGTYDGECP----FYCSYSGYNDELLWAATWLYMATRNPTYLEYIK----E 266
            LF FAN+++G Y         FY S + Y DEL+WAA WLY AT + TYL   +    E
Sbjct: 183 QLFDFANNYRGKYSDSITDARNFYAS-ADYRDELVWAAAWLYRATNDNTYLNTAESLYDE 241

Query: 267 DSISAIVAEFSWDLKYAGAQILLSEMFFESGDQALANFKNQADSYVCSVLPESPYHQVYM 326
             +       +WD K +G Q+LL+++   +  QA   +K+   SYV  ++          
Sbjct: 242 FGLQNWGGGLNWDSKVSGVQVLLAKL---TNKQA---YKDTVQSYVNYLINNQQ-----K 290

Query: 327 TPGGLIH------YRDGANAQYVTGTALLFSIYADVLAKHNQKITCGDQQFGSSQLMAFA 380
           TP GL++       R  ANA ++                    +   +    +S    FA
Sbjct: 291 TPKGLLYIDMWGTLRHAANAAFIM-------------------LEAAELGLSASSYRQFA 331

Query: 381 KKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMSFVNFFLKNMP 440
           + Q+DY LG    GRS++ GFG +PPTR HHR +S P     +  +C     N F    P
Sbjct: 332 QTQIDYALGDG--GRSFVCGFGSNPPTRPHHRSSSCPP----APATCDW---NTFNSPDP 382

Query: 441 NPNELTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKLAS 488
           N + L+GA+VGGPD++DN+ D+R D    E  T  N+     LA L +
Sbjct: 383 NYHVLSGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALAALVA 430


>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
 pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
 pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
 pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
 pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
          Length = 605

 Score =  251 bits (641), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 165/470 (35%), Positives = 232/470 (49%), Gaps = 45/470 (9%)

Query: 35  FDYKDALGKSIIFLEAQRSGRLPPRHRPAWRGDSGLEDGKQANVDLVGGYYDAGDNVKYG 94
           F+Y +AL KS+ F EAQRSG+LP  +R +WRGDSGL DG    +DL GG+YDAGD+VK+G
Sbjct: 4   FNYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFG 63

Query: 95  LPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQVGDPVQ 154
            PMAFT + L+  A+   +    +G++  + D +RW  DYF+K     N LYVQVGD   
Sbjct: 64  FPMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDA 123

Query: 155 DHQCWARPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKAK 214
           DH+ W   E M   R    +D   PG                VF   D  Y+  L+  AK
Sbjct: 124 DHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAK 183

Query: 215 LLFHFANSHKGTYDGECP---FYCSYSGYNDELLWAATWLYMATRNPTYL-----EYI-- 264
            L+ FA++++G Y    P   FY S+SGY DEL+W A WLY AT + +YL     EY   
Sbjct: 184 QLYTFADTYRGVYSDCVPAGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDFL 243

Query: 265 ---KEDSISAIVAEFSWDLKYAGAQILLSEMFFESGDQALANFKNQ-ADSYVCSVLPESP 320
              ++  + +     +WD K  G  +LL++   E+G Q   +  N+  D +   V  +  
Sbjct: 244 STEQQTDLRSYRWTIAWDDKSYGTYVLLAK---ETGKQKYIDDANRWLDYWTVGVNGQ-- 298

Query: 321 YHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQKITCGDQQFGSSQLMAFA 380
             +V  +PGG+         +Y   TA +  +YA V+          D      +   FA
Sbjct: 299 --RVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVI----------DDPVRKQRYHDFA 346

Query: 381 KKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMSFVNFFLKNMP 440
            +Q++Y LG NPR  SY+VGFG +PP   HHR A               S+ +       
Sbjct: 347 VRQINYALGDNPRNSSYVVGFGNNPPRNPHHRTAH-------------GSWTDSIASPAE 393

Query: 441 NPNELTGAIVGGP-DKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKLASH 489
           N + L GA+VGGP   +D + D+R D    E  T  N+     LA L   
Sbjct: 394 NRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEE 443


>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
           Endoglucanase From Separately Expressed Gh9 And Cbm3c
           Modules
          Length = 466

 Score =  251 bits (640), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 161/469 (34%), Positives = 235/469 (50%), Gaps = 42/469 (8%)

Query: 31  ANADFDYKDALGKSIIFLEAQRSGRLPPRH-RPAWRGDSGLEDGKQANVDLVGGYYDAGD 89
           A   F+Y +AL K+I F E QRSG+L     R  WRGDSGL+DGK A +DL GG+YDAGD
Sbjct: 20  ATGAFNYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGD 79

Query: 90  NVKYGLPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQV 149
           +VK+ LPM+++ + L  +   Y    K +G+  ++++ I+W  DYF+K    K+  Y QV
Sbjct: 80  HVKFNLPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQV 139

Query: 150 GDPVQDHQCWARPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRL 209
           GD   DH  W   E M   R    +D  +PG                +F+ VD  YS+  
Sbjct: 140 GDGHADHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKEC 199

Query: 210 LNKAKLLFHFANSHKGT--YDGECPFYCSYSGYNDELLWAATWLYMATRNPTYLE----- 262
           L  AK LF FA++ K    Y     FY S+SG+ DEL WAA WLY+AT + +YL+     
Sbjct: 200 LKHAKELFEFADTTKSDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESY 259

Query: 263 -----YIKEDSISAIVAEFSWDLKYAGAQILLSEMFFESGDQALANFKNQADSYVCSVLP 317
                Y  + +I        WD    G  +LL+ +  ++G    A  +   D +      
Sbjct: 260 SDKWGYEPQTNIPKYKWAQCWDDVTYGTYLLLARIKNDNGKYKEA-IERHLDWWTTGYNG 318

Query: 318 ESPYHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQKITCGDQQFGSSQLM 377
           E    ++  TP GL       + +Y T TA L  +Y+D           GD++   + L 
Sbjct: 319 E----RITYTPKGLAWLDQWGSLRYATTTAFLACVYSD--------WENGDKEKAKTYL- 365

Query: 378 AFAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMSFVNFFLK 437
            FA+ Q DY LG+   GRS++VGFG++PP R HHR A               S+ +  ++
Sbjct: 366 EFARSQADYALGST--GRSFVVGFGENPPKRPHHRTAH-------------GSWADSQME 410

Query: 438 NMPNPNELTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKL 486
              + + L GA+VGGPD  DN+ D+  +    E     N+  +G+LAK+
Sbjct: 411 PPEHRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLAKM 459


>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
 pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
          Length = 475

 Score =  217 bits (553), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 149/468 (31%), Positives = 225/468 (48%), Gaps = 42/468 (8%)

Query: 31  ANADFDYKDALGKSIIFLEAQRSGRLPPRHRPAWRGDSGLEDGKQANVDLVGGYYDAGDN 90
           A+  F+Y +AL K+I+F E Q SG+LP   R  WRGDS L+DG+   +DL GG++DAGD+
Sbjct: 22  ASGSFNYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDH 81

Query: 91  VKYGLPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQVG 150
           VK+ LPM++T + LS +   Y      +G+L+++++ I W  DYF+K    K   Y QVG
Sbjct: 82  VKFNLPMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVG 141

Query: 151 DPVQDHQCWARPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLL 210
           D  +DH  W   E M+  R    +   +PG                V +  +   +   L
Sbjct: 142 DGSKDHAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYL 201

Query: 211 NKAKLLFHFANSHKGT--YDGECPFYCSYSGYNDELLWAATWLYMATRNPTYL------- 261
             AK L+ FA   K    Y     +Y S+SG+ DEL WAA WLY+AT + TYL       
Sbjct: 202 QHAKELYEFAEVTKSDAGYTAANGYYNSWSGFYDELSWAAVWLYLATNDSTYLTKAESYV 261

Query: 262 -EYIKEDSISAIVAEFS--WDLKYAGAQILLSEMFFESGDQALANFKNQADSYVCSVLPE 318
             + K    + I  +++  WD  + GA +LL+++     D      ++  D +      E
Sbjct: 262 QNWPKISGSNTIDYKWAHCWDDVHNGAALLLAKI--TGKDIYKQIIESHLDYWTTGYNGE 319

Query: 319 SPYHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQKITCGDQQFGSSQLMA 378
               ++  TP GL       + +Y T TA L  +Y+D +   + K               
Sbjct: 320 ----RIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWVGCPSTKKEI---------YRK 366

Query: 379 FAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMSFVNFFLKN 438
           F + Q+DY LG+   GRS++VGFG +PP R HHR A     +  S  S            
Sbjct: 367 FGESQIDYALGS--AGRSFVVGFGTNPPKRPHHRTAHSSWADSQSIPSY----------- 413

Query: 439 MPNPNELTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKL 486
             + + L GA+VGGP   D++ D+  +    E     N+  +G LAK+
Sbjct: 414 --HRHTLYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALAKM 459


>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum Complexed With Cellobiose
 pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
           Cellulolyticum
 pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
 pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
           Cellulolyticum Complexed With A Thio-Oligosaccharide
           Inhibitor
          Length = 614

 Score =  210 bits (535), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 150/478 (31%), Positives = 221/478 (46%), Gaps = 65/478 (13%)

Query: 35  FDYKDALGKSIIFLEAQRSGRLPPRHRPAWRGDSGLEDGKQANVDLVGGYYDAGDNVKYG 94
           ++Y +AL KSI+F E QRSG LP   R  WR DSG++DG    VDL GG+YDAGD+VK+ 
Sbjct: 4   YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63

Query: 95  LPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQVGDPVQ 154
           LPM++T + L+ S          +G+ + +MD I+W  DYF+K +      Y QVGD  +
Sbjct: 64  LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123

Query: 155 DHQCWARPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKAK 214
           DH  W   E M+  R    +D+  PG                VF+  D  Y+ + ++ AK
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183

Query: 215 LLFHFANSHKGTYDGECPFYCSYSGYN------------DELLWAATWLYMATRNPTYLE 262
            LF  A+  K           S +GY             D+L WAA WLY+AT + TYL+
Sbjct: 184 NLFDMADKAK-----------SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLD 232

Query: 263 --------YIKEDSISAIVAEFS--WDLKYAGAQILLSEMFFESGDQALAN-FKNQADSY 311
                   + KE     I  ++   WD  + GA++LL+++   +  Q   +  +   D +
Sbjct: 233 KAESYVPNWGKEQQTDIIAYKWGQCWDDVHYGAELLLAKL---TNKQLYKDSIEMNLDFW 289

Query: 312 VCSVLPESPYHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQKITCGDQQF 371
              V       +V  TP GL       + ++ T  A L  +YA+       K+       
Sbjct: 290 TTGVNGT----RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKV------- 338

Query: 372 GSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMSF 431
             S    F K Q+DY LG+   GRS++VG+G +PP   HHR A               S+
Sbjct: 339 --SVYKDFLKSQIDYALGST--GRSFVVGYGVNPPQHPHHRTAH-------------GSW 381

Query: 432 VNFFLKNMPNPNELTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKLASH 489
            +       + + + GA+VGGPD  D + DE  +    E     N+   G LAK+  H
Sbjct: 382 TDQMTSPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKH 439


>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
 pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
           Cellotriose
          Length = 614

 Score =  210 bits (534), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 150/478 (31%), Positives = 221/478 (46%), Gaps = 65/478 (13%)

Query: 35  FDYKDALGKSIIFLEAQRSGRLPPRHRPAWRGDSGLEDGKQANVDLVGGYYDAGDNVKYG 94
           ++Y +AL KSI+F E QRSG LP   R  WR DSG++DG    VDL GG+YDAGD+VK+ 
Sbjct: 4   YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63

Query: 95  LPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQVGDPVQ 154
           LPM++T + L+ S          +G+ + +MD I+W  DYF+K +      Y QVGD  +
Sbjct: 64  LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123

Query: 155 DHQCWARPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKAK 214
           DH  W   E M+  R    +D+  PG                VF+  D  Y+ + ++ AK
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183

Query: 215 LLFHFANSHKGTYDGECPFYCSYSGYN------------DELLWAATWLYMATRNPTYLE 262
            LF  A+  K           S +GY             D+L WAA WLY+AT + TYL+
Sbjct: 184 NLFDMADKAK-----------SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLD 232

Query: 263 --------YIKEDSISAIVAEFS--WDLKYAGAQILLSEMFFESGDQALAN-FKNQADSY 311
                   + KE     I  ++   WD  + GA++LL+++   +  Q   +  +   D +
Sbjct: 233 KAESYVPNWGKEQQTDIIAYKWGQXWDDVHYGAELLLAKL---TNKQLYKDSIEMNLDFW 289

Query: 312 VCSVLPESPYHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQKITCGDQQF 371
              V       +V  TP GL       + ++ T  A L  +YA+       K+       
Sbjct: 290 TTGVNGT----RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKV------- 338

Query: 372 GSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMSF 431
             S    F K Q+DY LG+   GRS++VG+G +PP   HHR A               S+
Sbjct: 339 --SVYKDFLKSQIDYALGST--GRSFVVGYGVNPPQHPHHRTAH-------------GSW 381

Query: 432 VNFFLKNMPNPNELTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKLASH 489
            +       + + + GA+VGGPD  D + DE  +    E     N+   G LAK+  H
Sbjct: 382 TDQMTSPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKH 439


>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticum
 pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
           Cellulolyticium In Complex With Cellobiose
          Length = 441

 Score =  188 bits (478), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 136/460 (29%), Positives = 211/460 (45%), Gaps = 51/460 (11%)

Query: 36  DYKDALGKSIIFLEAQRSG-RLPPRHRPAWRGDSGLEDGKQANVDLVGGYYDAGDNVKYG 94
           DY  AL  SIIF +A + G +    +   WRG     DG    VDL GGY+DAGD+VK+G
Sbjct: 5   DYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVKFG 64

Query: 95  LPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQVGDPVQ 154
           LP  ++ + L  S   + +   + G    ++  +++ TDYFLK+       Y QVG+   
Sbjct: 65  LPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEGNA 124

Query: 155 DHQCWARPEEMKTPR-TVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKA 213
           DH  W  PEE    R ++   D  +P                  ++ +D  Y+ + LN A
Sbjct: 125 DHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLNAA 184

Query: 214 KLLFHFANSHKGTYDGECPFYCSYSGYNDELLWAATWLYMATRNPTYLEYIKE-DSISAI 272
           K L+    +++G  +G+  FY + S + D+L WAATWLY AT + TY+   ++  ++   
Sbjct: 185 KELYAMGKANQGVGNGQS-FYQATS-FGDDLAWAATWLYTATNDSTYITDAEQFITLGNT 242

Query: 273 VAE--------FSWDLKYAGAQILLSEMFFESGDQALANFKNQADSYVCSVLPESPYHQV 324
           + E          WD  Y  A + L+++   +G Q   +      +Y  +        QV
Sbjct: 243 MNENKMQDKWTMCWDDMYVPAALRLAQI---TGKQIYKDAIEFNFNYWKT--------QV 291

Query: 325 YMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQKITCGDQQFGSSQLMAFAKKQM 384
             TPGGL    +    +Y    +++  +Y               +Q     L+  AKKQ+
Sbjct: 292 TTTPGGLKWLSNWGVLRYAAAESMVMLVYC--------------KQNPDQSLLDLAKKQV 337

Query: 385 DYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMSFVNFFLKNMPNPNE 444
           DY+LG NP   SY++G+G +     HHR A+      N   + P   +            
Sbjct: 338 DYILGDNPANMSYIIGYGSNWCIHPHHRAAN-GYTYANGDNAKPAKHL------------ 384

Query: 445 LTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLA 484
           LTGA+VGGPD++D F D+      TE     N+  +GVLA
Sbjct: 385 LTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLA 424


>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
          Length = 611

 Score =  160 bits (405), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 149/539 (27%), Positives = 226/539 (41%), Gaps = 80/539 (14%)

Query: 1   MAMGTFAKASMAM---IALTWPAFFQGLLFTVYANADFDYKDALGKSIIFLEAQRSGR-L 56
           M    FAK ++A    I +T          +  A  +++Y  AL  S+ F +A   G  +
Sbjct: 1   MRKNLFAKRAVAFLLGIVITAAGIVSFNTVSTSAAGEYNYAKALQYSMFFYDANMCGTGV 60

Query: 57  PPRHRPAWRGDSGLEDGK--------------------------QANVDLVGGYYDAGDN 90
                 +WRGD  + D +                             VD+ GG++DAGD+
Sbjct: 61  DENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSNRSVLDPDGDGKVDVSGGFHDAGDH 120

Query: 91  VKYGLPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYV--- 147
           VK+GLP A+  ST+      +  + ++ G+  +    +R+  DYF++ + R     V   
Sbjct: 121 VKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEVILRYFNDYFMRCTFRDASGNVVAF 180

Query: 148 --QVGDPVQDHQCWARPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPY 205
             QVGD   DH  W  PE     R    I  + PG                 F+  D  Y
Sbjct: 181 CHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGTDIISATAASLAINYMNFKDTDPQY 240

Query: 206 SRRLLNKAKLLFHFANSH-KGTYDGEC--PFYCSYSGYNDELLWAATWLYMATRNPTYLE 262
           + + L+ AK LF FA  + KG   GE     Y   S + D+  WAA WLY+AT+N  YL+
Sbjct: 241 AAKSLDYAKALFDFAEKNPKGVVQGEDGPKGYYGSSKWQDDYCWAAAWLYLATQNEHYLD 300

Query: 263 --YIKEDSISAIVAEFSWDLKYAGAQILLSEM--FFESGDQAL------ANFKNQADSY- 311
             +   D  +       W+  ++G   +L+E+   ++   Q        A+ KNQ +   
Sbjct: 301 EAFKYYDYYAPPGWIHCWNDVWSGTACILAEINDLYDKDSQNFEDRYKRASNKNQWEQID 360

Query: 312 ----VCSVLPESPYHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQKITCG 367
               +  +L +     + +TPGG +      +A+Y T   L+  +Y     KH+     G
Sbjct: 361 FWKPIQDLLDKWSGGGITVTPGGYVFLNQWGSARYNTAAQLIALVY----DKHH-----G 411

Query: 368 DQQFGSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSC 427
           D     S+   +A+ QMDYLLG NP  R Y+VG+  +     HHR AS  L++ N   S 
Sbjct: 412 DT---PSKYANWARSQMDYLLGKNPLNRCYVVGYSSNSVKYPHHRAAS-GLKDAND--SS 465

Query: 428 PMSFVNFFLKNMPNPNELTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKL 486
           P  +V            L GA+VGGPD  D   D   D    E     N+  +G  A L
Sbjct: 466 PHKYV------------LYGALVGGPDASDQHVDRTNDYIYNEVAIDYNAAFVGACAGL 512


>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
           Thermoacidophilic Alicyclobacillus Acidocaldarius
 pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
 pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellobiose
 pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
           With Cellotetraose
 pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
           Isofagomine
 pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
           Isofagomine
 pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
           Isofagomine
          Length = 537

 Score = 56.6 bits (135), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 102/476 (21%), Positives = 174/476 (36%), Gaps = 58/476 (12%)

Query: 37  YKDALGKSIIFLEAQRSGRLPPRHR--PAWRGDSGLED----GKQANVDLVGGYYDAGDN 90
           Y+D L   + F + Q  G + P     P   G     D    G +  +   GG++DAGD 
Sbjct: 88  YRDVLEAMLRFFDYQLCGVVLPEDEAGPWAHGACHTSDAKVFGTERALACPGGWHDAGDY 147

Query: 91  VKYGLPMAFTISTLSLSALYYHQELKSAGELQNVMDAIRWGTDYFLKTSSRKNKLYVQVG 150
            KY +P A  ++ L L+  Y+   L     +++V  A        L+ +  +    + + 
Sbjct: 148 GKYTVPAAKAVADLLLAHEYFPAALAHVRPMRSVHRAPHLPPA--LEVAREEIAWLLTMQ 205

Query: 151 DPVQD---HQCWA--------RPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFR 199
           DP      H+           RPE+   P  +  I                      V+R
Sbjct: 206 DPATGGVYHKVTTPSFPPLDTRPEDDDAPLVLSPISYAA-----TATFCAAMAHAALVYR 260

Query: 200 GVDRPYSRRLLNKAKLLFHFANSHK----GTYDGECPFYCSYSGYNDELLWAATWLYMAT 255
             D   S    + A+  + +  +H+       DG        +   DELLWA+  L   T
Sbjct: 261 PFDPALSSCCADAARRAYAWLGAHEMQPFHNPDGILTGEYGDAELRDELLWASCALLRMT 320

Query: 256 RNPTYLEYIKEDSISAIVAEFSWDLKYAGAQILLSEMFFESGDQALAN-FKNQADSYVCS 314
            +  +    +      +  +  W+L +A   +     +  +   A+++  +N+  S +  
Sbjct: 321 GDSAWARVCE----PLLDLDLPWELGWADVALYGVMDYLRTPRAAVSDDVRNKVKSRLLR 376

Query: 315 VLPESPYHQVYMTPGGLIHYRD----GANAQYVTGTALLFSIYADVLAKHNQKITCGDQQ 370
            L ++        P G+    D    G+N   +   A+ F +   V   H    T     
Sbjct: 377 EL-DALAAMAESHPFGIPMRDDDFIWGSN-MVLLNRAMAFLLAEGVGVLHPAAHT----- 429

Query: 371 FGSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHRGASVPLENVNSGVSCPMS 430
                    A++  DYL GANP G+ Y+ GFG+ P    HHR       +V   V  P+ 
Sbjct: 430 --------VAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHR------PSVADDVDHPVP 475

Query: 431 FVNFFLKNMPNPNELTGAIVGGPDKHDNFNDERWDGPMTEPCTYVNSLAIGVLAKL 486
            +     N    +E+  A + G    + + D +      E   Y NS A+ V+A L
Sbjct: 476 GMVVGGPNRHLQDEIARAQLAGRPAMEAYIDHQDSYSTNEVAVYWNSPAVFVIAAL 531


>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
           Angstroms Resolution
          Length = 639

 Score = 52.8 bits (125), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 89/426 (20%), Positives = 159/426 (37%), Gaps = 57/426 (13%)

Query: 16  LTWPAFFQGLLFTVYANADFDYKDALGKSIIFLEAQRSGR--------LPPRHRPAWRGD 67
           L  P   + + F +  N    Y+DA   +++ +   R G         +   H P    D
Sbjct: 111 LAVPGVGKSVNFKIAMNV---YEDAFKTAMLGMYLLRCGTSVSATYNGIHYSHGPCHTND 167

Query: 68  SGLE--DGKQANVDLVGGYYDAGDNVKYGLPMAFTISTLSLSALYYHQELKSAG------ 119
           + L+  +G+    D   G++DAGD  KY +    T+ ++ L+  ++  +L+         
Sbjct: 168 AYLDYINGQHTKKDSTKGWHDAGDYNKYVVNAGITVGSMFLAWEHFKDQLEPVALEIPEK 227

Query: 120 --ELQNVMDAIRWGTDYFLKTSSRKNKLYVQVGDPVQDHQCWARPEEMKTPRTVLMIDSK 177
              + + +D +++  D+ L          V      ++   +  PE     R  +   S 
Sbjct: 228 NNSIPDFLDELKYEIDWILTMQYPDGSGRVAHKVSTRNFGGFIMPENEHDERFFVPWSSA 287

Query: 178 TPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKAKLLFHF-ANSHKGTYDGECPF--- 233
                              +FR  D  Y+ + +N AK+ + F  N+    +  +  F   
Sbjct: 288 ATADFVAMTAMAAR-----IFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFSTG 342

Query: 234 -YCSYSGYNDELLWAATWLYMATRNPTYLEYIKEDSIS---AIVAEFSWD--LKYAGAQI 287
            Y + S  +D L WAA  ++    +  YL   +  +      I A+F WD          
Sbjct: 343 EYATVSDADDRL-WAAAEMWETLGDEEYLRDFENRAAQFSKKIEADFDWDNVANLGMFTY 401

Query: 288 LLSEMFFESGDQALA-NFKNQADSYVCSVLPESPYHQVYMTPGGLIHYRDGANAQYVTGT 346
           LLSE      + AL  + K+   S   S++  S  H    T G   ++  G N   V  T
Sbjct: 402 LLSER--PGKNPALVQSIKDSLLSTADSIVRTSQNHGYGRTLGTTYYW--GCNGTVVRQT 457

Query: 347 ALLFSIYADVLAKHNQKITCGDQQFGSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPP 406
            +L    A+ ++ +N  +               A   + ++ G N   RSY+ G G +PP
Sbjct: 458 MIL--QVANKISPNNDYVNA-------------ALDAISHVFGRNYYNRSYVTGLGINPP 502

Query: 407 TRAHHR 412
              H R
Sbjct: 503 MNPHDR 508


>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 610

 Score = 40.0 bits (92), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 89/410 (21%), Positives = 141/410 (34%), Gaps = 86/410 (20%)

Query: 62  PAWRGDSGLEDGKQANV--DLVGGYYDAGDNVKYGLPMAFTISTL-------SLSALYYH 112
           P W  D       Q N   D+ GG+YDAGD+ KY +     + TL        +  L   
Sbjct: 150 PTWPQDDEYAGIPQKNYTKDVTGGWYDAGDHGKYVVNGGIAVWTLMNMYERAKIRGLDNW 209

Query: 113 QELKSAG----ELQN----VMDAIRWGTDYFLK---TSSRKNKLYVQVGDPVQDHQCWA- 160
              +  G    E  N    ++D  RW  ++F K   T      +   V   + D +  A 
Sbjct: 210 GPYRDGGMNIPEQNNGYPDILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKIHDFRWTAL 269

Query: 161 --RPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKAKLLFH 218
              P E   PR +  + +                    +++  D  ++   L KA++ + 
Sbjct: 270 GMLPHEDPQPRYLRPVSTAA-----TLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQ 324

Query: 219 FANSHKGTY---------DGECPFYCSYSGYNDELLWAATWLYMATRNPTYLEYIKED-- 267
            A  H   Y          G  P+   Y G  DE  WAA  LY+ T    Y  Y+     
Sbjct: 325 AALKHPDIYAEYTPGSGGPGGGPYNDDYVG--DEFYWAACELYVTTGKDEYKNYLMNSPH 382

Query: 268 --SISAIVAE--------------FSWDLKYAGAQILLSEMFFESG------DQALANFK 305
              + A + E              F+W        I L+    E+G       +A  N  
Sbjct: 383 YLEMPAKMGENGGANGEDNGLWGCFTWGTTQGLGTITLA--LVENGLPATDIQKARNNIA 440

Query: 306 NQADSYVCSVLPES---PYHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQ 362
             AD ++ ++  +    P  Q     GG   Y  G+N+ ++    ++     D       
Sbjct: 441 KAADRWLENIEEQGYRLPIKQAEDERGG---YPWGSNS-FILNQMIVMGYAYDFT----- 491

Query: 363 KITCGDQQFGSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHR 412
                    G S+ +      + YLLG N   +SY+ G+G+ P    H R
Sbjct: 492 ---------GDSKYLDGMFDGISYLLGRNAMDQSYVTGYGERPLQNPHDR 532


>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
           Cellobiohydrolase Cbha From C. Thermocellum
          Length = 609

 Score = 40.0 bits (92), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 89/410 (21%), Positives = 141/410 (34%), Gaps = 86/410 (20%)

Query: 62  PAWRGDSGLEDGKQANV--DLVGGYYDAGDNVKYGLPMAFTISTL-------SLSALYYH 112
           P W  D       Q N   D+ GG+YDAGD+ KY +     + TL        +  L   
Sbjct: 150 PTWPQDDEYAGIPQKNYTKDVTGGWYDAGDHGKYVVNGGIAVWTLMNMYERAKIRGLDNW 209

Query: 113 QELKSAG----ELQN----VMDAIRWGTDYFLK---TSSRKNKLYVQVGDPVQDHQCWA- 160
              +  G    E  N    ++D  RW  ++F K   T      +   V   + D +  A 
Sbjct: 210 GPYRDGGMNIPEQNNGYPDILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKIHDFRWTAL 269

Query: 161 --RPEEMKTPRTVLMIDSKTPGXXXXXXXXXXXXXXXXVFRGVDRPYSRRLLNKAKLLFH 218
              P E   PR +  + +                    +++  D  ++   L KA++ + 
Sbjct: 270 GMLPHEDPQPRYLRPVSTAA-----TLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQ 324

Query: 219 FANSHKGTY---------DGECPFYCSYSGYNDELLWAATWLYMATRNPTYLEYIKED-- 267
            A  H   Y          G  P+   Y G  DE  WAA  LY+ T    Y  Y+     
Sbjct: 325 AALKHPDIYAEYTPGSGGPGGGPYNDDYVG--DEFYWAACELYVTTGKDEYKNYLMNSPH 382

Query: 268 --SISAIVAE--------------FSWDLKYAGAQILLSEMFFESG------DQALANFK 305
              + A + E              F+W        I L+    E+G       +A  N  
Sbjct: 383 YLEMPAKMGENGGANGEDNGLWGCFTWGTTQGLGTITLA--LVENGLPATDIQKARNNIA 440

Query: 306 NQADSYVCSVLPES---PYHQVYMTPGGLIHYRDGANAQYVTGTALLFSIYADVLAKHNQ 362
             AD ++ ++  +    P  Q     GG   Y  G+N+ ++    ++     D       
Sbjct: 441 KAADRWLENIEEQGYRLPIKQAEDERGG---YPWGSNS-FILNQMIVMGYAYDFT----- 491

Query: 363 KITCGDQQFGSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPPTRAHHR 412
                    G S+ +      + YLLG N   +SY+ G+G+ P    H R
Sbjct: 492 ---------GDSKYLDGMFDGISYLLGRNAMDQSYVTGYGERPLQNPHDR 532


>pdb|2O3A|A Chain A, Crystal Structure Of A Protein Af_0751 From Archaeoglobus
           Fulgidus
 pdb|2O3A|B Chain B, Crystal Structure Of A Protein Af_0751 From Archaeoglobus
           Fulgidus
          Length = 178

 Score = 30.0 bits (66), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 3/50 (6%)

Query: 97  MAFTISTLSLSALYYHQELKSAGELQNVMDAI-RWGTDYFLKTSSRKNKL 145
           +A T        +Y+  E KS    ++V D + RWG D+F+K  S K  L
Sbjct: 24  VALTARAFGAKGIYFDTEDKSV--FESVRDVVERWGGDFFIKAVSWKKLL 71


>pdb|1H0H|A Chain A, Tungsten Containing Formate Dehydrogenase From
           Desulfovibrio Gigas
 pdb|1H0H|K Chain K, Tungsten Containing Formate Dehydrogenase From
           Desulfovibrio Gigas
          Length = 977

 Score = 29.6 bits (65), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 2/36 (5%)

Query: 201 VDRPYSRRLLNKAKLLFHFANSHKGTYDGECPFYCS 236
           ++ P+S+ L N   L  HFA   K   D   PF CS
Sbjct: 820 IEHPFSKTLHNPTAL--HFATEEKAVCDPRYPFICS 853


>pdb|1UTJ|A Chain A, Trypsin Specificity As Elucidated By Lie Calculations,
           X-Ray Structures And Association Constant Measurements
 pdb|1UTK|A Chain A, Trypsin Specificity As Elucidated By Lie Calculations,
           X-Ray Structures And Association Constant Measurements
 pdb|1UTL|M Chain M, Trypsin Specificity As Elucidated By Lie Calculations,
           X-Ray Structures And Association Constant Measurements
 pdb|1UTM|A Chain A, Trypsin Specificity As Elucidated By Lie Calculations,
           X-Ray Structures And Association Constant Measurements
          Length = 242

 Score = 29.6 bits (65), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 5/62 (8%)

Query: 317 PESPYHQVYMTPGGLIHYRDGA--NAQYVTGTALLFSIYADV-LAKHNQKITCGDQQFGS 373
           P S  HQV +  G   H+  G+  N  +V   A  +    +V L +HN K+T G +QF S
Sbjct: 29  PYSQPHQVSLNSG--YHFCGGSLVNENWVVSAAHCYKSRVEVRLGEHNIKVTEGSEQFIS 86

Query: 374 SQ 375
           S 
Sbjct: 87  SS 88


>pdb|2ANL|A Chain A, X-Ray Crystal Structure Of The Aspartic Protease
           Plasmepsin 4 From The Malarial Parasite Plasmodium
           Malariae Bound To An Allophenylnorstatine Based
           Inhibitor
 pdb|2ANL|B Chain B, X-Ray Crystal Structure Of The Aspartic Protease
           Plasmepsin 4 From The Malarial Parasite Plasmodium
           Malariae Bound To An Allophenylnorstatine Based
           Inhibitor
          Length = 327

 Score = 29.3 bits (64), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 25/93 (26%), Positives = 43/93 (46%), Gaps = 13/93 (13%)

Query: 349 LFSIYADVLAKHNQKITCG--DQQFGSSQLMAFAKKQMDYLLGANPRGRSYMVGFGKDPP 406
           LF+ Y  V  KH+  +T G  +++F   +L  + K   D     +       V FGK   
Sbjct: 153 LFTFYLPVHDKHSGYLTIGGIEEKFYEGEL-TYEKLNHDLFWQVDLD-----VNFGK--- 203

Query: 407 TRAHHRGASVPLENVNSGVSCPMSFVNFFLKNM 439
                  A+V +++  S ++ P SF+N F K++
Sbjct: 204 --TSMEKANVIVDSGTSTITAPTSFINKFFKDL 234


>pdb|1BZX|E Chain E, The Crystal Structure Of Anionic Salmon Trypsin In Complex
           With Bovine Pancreatic Trypsin Inhibitor
 pdb|2STA|E Chain E, Anionic Salmon Trypsin In Complex With Squash Seed
           Inhibitor (Cucurbita Maxima Trypsin Inhibitor I)
 pdb|2STB|E Chain E, Anionic Salmon Trypsin In Complex With Squash Seed
           Inhibitor (cucurbita Pepo Trypsin Inhibitor Ii)
          Length = 222

 Score = 29.3 bits (64), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 5/62 (8%)

Query: 317 PESPYHQVYMTPGGLIHYRDGA--NAQYVTGTALLFSIYADV-LAKHNQKITCGDQQFGS 373
           P S  HQV +  G   H+  G+  N  +V   A  +    +V L +HN K+T G +QF S
Sbjct: 9   PYSQPHQVSLNSG--YHFCGGSLVNENWVVSAAHCYKSRVEVRLGEHNIKVTEGSEQFIS 66

Query: 374 SQ 375
           S 
Sbjct: 67  SS 68


>pdb|2ZPS|A Chain A, Crystal Structure Of Anionic Trypsin Isoform 3 From Chum
           Salmon
          Length = 222

 Score = 28.9 bits (63), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 29/56 (51%), Gaps = 5/56 (8%)

Query: 322 HQVYMTPGGLIHYRDGA--NAQYVTGTALLFSIYADV-LAKHNQKITCGDQQFGSS 374
           HQV +  G   H+  G+  N  +V   A  +    +V L +HN K+T G +QF SS
Sbjct: 14  HQVSLNSG--YHFCGGSLVNENWVVSAAHCYKTRVEVRLGEHNIKVTEGSEQFISS 67


>pdb|1BIT|A Chain A, The Crystal Structure Of Anionic Salmon Trypsin In A
           Second Crystal Form
          Length = 237

 Score = 28.5 bits (62), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 14/92 (15%)

Query: 287 ILLSEMFFESGDQALANFKNQADSYVCSVLPESPYHQVYMTPGGLIHYRDGA--NAQYVT 344
           +L+   F    D+ +  ++ +A S           HQV +  G   H+  G+  N  +V 
Sbjct: 3   LLIGAAFATEDDKIVGGYECKAYSQA---------HQVSLNSG--YHFCGGSLVNENWVV 51

Query: 345 GTALLFSIYADV-LAKHNQKITCGDQQFGSSQ 375
             A  +    +V L +HN K+T G +QF SS 
Sbjct: 52  SAAHCYKSRVEVRLGEHNIKVTEGSEQFISSS 83


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.320    0.135    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,529,393
Number of Sequences: 62578
Number of extensions: 664120
Number of successful extensions: 1204
Number of sequences better than 100.0: 24
Number of HSP's better than 100.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 1146
Number of HSP's gapped (non-prelim): 28
length of query: 493
length of database: 14,973,337
effective HSP length: 103
effective length of query: 390
effective length of database: 8,527,803
effective search space: 3325843170
effective search space used: 3325843170
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (25.0 bits)