BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 042201
(511 letters)
Database: pdbaa
62,578 sequences; 14,973,337 total letters
Searching..................................................done
>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 2.5.
pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 5.6.
pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 6.5
Length = 433
Score = 309 bits (791), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 181/463 (39%), Positives = 253/463 (54%), Gaps = 42/463 (9%)
Query: 39 FAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKFG 98
+ Y+ L ++LF+E QRSG+LP Q+V WR +SAL+D + +L GGY+DAGD VKFG
Sbjct: 3 YDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFG 62
Query: 99 WPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGNA 158
+PMAY+ ++L+W ++++ SS L R A++W TD+ ++AHTS Y QVG G+A
Sbjct: 63 FPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGDA 122
Query: 159 DHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHSK 218
DH W RPEDM R Y+I + PG IVF+ VD YS+ LL H++
Sbjct: 123 DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHAR 182
Query: 219 SLFEFADKHRGSYQASCP----FYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLS---N 271
LF+FA+ +RG Y S FY S + Y+DEL+WAAAWLY+A+ DN YL+ S
Sbjct: 183 QLFDFANNYRGKYSDSITDARNFYAS-ADYRDELVWAAAWLYRATNDNTYLNTAESLYDE 241
Query: 272 QGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSVRIETT 331
G +WD+K +G Q+LLAK NKQ +K V+S+V L+ + T
Sbjct: 242 FGLQNWGGGLNWDSKVSGVQVLLAKL---TNKQ--AYKDTVQSYVNYLINNQ-----QKT 291
Query: 332 PGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKSQVDY 391
P GLLYI L++ +A ++ L+ SAS FA++Q+DY
Sbjct: 292 PKGLLYIDMWGTLRHAANAAFIM--------------LEAAELGLSASSYRQFAQTQIDY 337
Query: 392 ILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXXXXXVH 451
LG S++ GFGS P + HHR +S P PA +C+ V
Sbjct: 338 ALGDG--GRSFVCGFGSNPPTRPHHRSSSCP-----PAPATCD---WNTFNSPDPNYHVL 387
Query: 452 VGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLA 494
GA+VGGPD ND + D R+DY H E T NA F ++A L+A
Sbjct: 388 SGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALAALVA 430
>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
Length = 605
Score = 281 bits (719), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 172/473 (36%), Positives = 248/473 (52%), Gaps = 49/473 (10%)
Query: 39 FAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKFG 98
F Y +AL K++ F+E QRSGKLPE+ RV WRG+S L+DG ++L GG+YDAGD+VKFG
Sbjct: 4 FNYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFG 63
Query: 99 WPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGNA 158
+PMA++ ++L+W A+E Q+ YL+ +RW D+ ++AH SP LY QVGDG+A
Sbjct: 64 FPMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDA 123
Query: 159 DHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHSK 218
DH+ W E M R +++ PG IVF D Y++ L+ H+K
Sbjct: 124 DHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAK 183
Query: 219 SLFEFADKHRGSYQASCP---FYCSYSGYQDELLWAAAWLYKASEDNKYL---DY---VL 269
L+ FAD +RG Y P FY S+SGYQDEL+W A WLYKA+ D+ YL +Y L
Sbjct: 184 QLYTFADTYRGVYSDCVPAGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDFL 243
Query: 270 SNQGWSQVAS---EFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSV 326
S + + + S +WD+K G +LLAKE G K + ++ + + +
Sbjct: 244 STEQQTDLRSYRWTIAWDDKSYGTYVLLAKE-TGKQKYIDDANRWLDYWTVGV----NGQ 298
Query: 327 RIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAK 386
R+ +PGG+ + L+Y + + +Y+K ++ + FA
Sbjct: 299 RVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDP-----------VRKQRYHDFAV 347
Query: 387 SQVDYILGKNPMKMSYMAGFGSKFPLQIHHR---GASIPSIGAHPAKVSCNDGXXXXXXX 443
Q++Y LG NP SY+ GFG+ P HHR G+ SI A PA+
Sbjct: 348 RQINYALGDNPRNSSYVVGFGNNPPRNPHHRTAHGSWTDSI-ASPAE------------- 393
Query: 444 XXXXXXVHVGAIVGGPDS-NDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLAQ 495
V GA+VGGP S ND + D R DY E T NA F ++A L+ +
Sbjct: 394 ---NRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEE 443
>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
Endoglucanase From Separately Expressed Gh9 And Cbm3c
Modules
Length = 466
Score = 260 bits (665), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 170/477 (35%), Positives = 229/477 (48%), Gaps = 56/477 (11%)
Query: 35 STTTFAYRDALGKAILFFEGQRSGKLPESQ-RVKWRGNSALSDGKPENVNLIGGYYDAGD 93
+T F Y +AL KAI F+E QRSGKL S R+ WRG+S L DGK ++L GG+YDAGD
Sbjct: 20 ATGAFNYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGD 79
Query: 94 NVKFGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQV 153
+VKF PM+YS ++L WA EY+ Q ++ I+W D+ ++ H Y QV
Sbjct: 80 HVKFNLPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQV 139
Query: 154 GDGNADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRL 213
GDG+ADH W E M R Y++ SPG I+FKKVD YS
Sbjct: 140 GDGHADHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKEC 199
Query: 214 LNHSKSLFEFAD--KHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYV--- 268
L H+K LFEFAD K Y A+ FY S+SG+ DEL WAA WLY A+ D+ YLD
Sbjct: 200 LKHAKELFEFADTTKSDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESY 259
Query: 269 -----------LSNQGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVC 317
+ W+Q WD+ G +LLA+ K + H++ +
Sbjct: 260 SDKWGYEPQTNIPKYKWAQC-----WDDVTYGTYLLLARIKNDNGKYKEAIERHLDWWTT 314
Query: 318 ALMPESSSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFS 377
E RI TP GL ++ +L+Y T+ L +YS N G +
Sbjct: 315 GYNGE----RITYTPKGLAWLDQWGSLRYATTTAFLACVYSDWEN---------GDKEKA 361
Query: 378 ASQISAFAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGA--SIPSIGAHPAKVSCND 435
+ + FA+SQ DY LG S++ GFG P + HHR A S P +
Sbjct: 362 KTYLE-FARSQADYALGS--TGRSFVVGFGENPPKRPHHRTAHGSWADSQMEPPE----- 413
Query: 436 GXXXXXXXXXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPL 492
V GA+VGGPDS D + D ++Y+ E NA FVG +A +
Sbjct: 414 -----------HRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLAKM 459
>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
Length = 475
Score = 250 bits (638), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 162/477 (33%), Positives = 236/477 (49%), Gaps = 58/477 (12%)
Query: 35 STTTFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDN 94
++ +F Y +AL KAI+F+E Q SGKLP R WRG+SAL DG+ ++L GG++DAGD+
Sbjct: 22 ASGSFNYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDH 81
Query: 95 VKFGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVG 154
VKF PM+Y+ ++LSWA EY+ QL ++ I W D+ ++ H S Y QVG
Sbjct: 82 VKFNLPMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVG 141
Query: 155 DGNADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLL 214
DG+ DH W E M R +++T SPG IV K + ++ L
Sbjct: 142 DGSKDHAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYL 201
Query: 215 NHSKSLFEFAD--KHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLSN- 271
H+K L+EFA+ K Y A+ +Y S+SG+ DEL WAA WLY A+ D+ YL S
Sbjct: 202 QHAKELYEFAEVTKSDAGYTAANGYYNSWSGFYDELSWAAVWLYLATNDSTYLTKAESYV 261
Query: 272 QGWSQVASEFS--------WDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPES 323
Q W +++ + WD+ GA +LLAK G + + + H++ + E
Sbjct: 262 QNWPKISGSNTIDYKWAHCWDDVHNGAALLLAK-ITGKDIYKQIIESHLDYWTTGYNGE- 319
Query: 324 SSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISA 383
RI+ TP GL ++ +L+Y T+ L F+YS + G
Sbjct: 320 ---RIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWV----------GCPSTKKEIYRK 366
Query: 384 FAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGA--------SIPSIGAHPAKVSCND 435
F +SQ+DY LG S++ GFG+ P + HHR A SIPS H
Sbjct: 367 FGESQIDYALGS--AGRSFVVGFGTNPPKRPHHRTAHSSWADSQSIPSYHRHTL------ 418
Query: 436 GXXXXXXXXXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPL 492
GA+VGGP S+D + D ++Y + E NA FVG++A +
Sbjct: 419 ----------------YGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALAKM 459
>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
Length = 614
Score = 221 bits (562), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 152/479 (31%), Positives = 232/479 (48%), Gaps = 43/479 (8%)
Query: 38 TFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKF 97
T+ Y +AL K+I+F+E QRSG LP +R WR +S + DG V+L GG+YDAGD+VKF
Sbjct: 3 TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62
Query: 98 GWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGN 157
PM+Y+ ++L+W+ E + Q Y+ I+W D+ ++ + +P Y QVGDG
Sbjct: 63 NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122
Query: 158 ADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHS 217
DH W E M R +++ + PG +VFK D Y+ + ++H+
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182
Query: 218 KSLFEFADKHR--GSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLS---NQ 272
K+LF+ ADK + Y A+ Y S S + D+L WAA WLY A+ D+ YLD S N
Sbjct: 183 KNLFDMADKAKSDAGYTAAS-GYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNW 241
Query: 273 GWSQVASEFS------WDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSV 326
G Q + WD+ GA++LLAK NKQL +K +E + +
Sbjct: 242 GKEQQTDIIAYKWGQXWDDVHYGAELLLAKLT---NKQL--YKDSIEMNLDFWTTGVNGT 296
Query: 327 RIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAK 386
R+ TP GL ++ +L++ T+ L +Y++ G S F K
Sbjct: 297 RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAE----------WEGCTPSKVSVYKDFLK 346
Query: 387 SQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXX 446
SQ+DY LG S++ G+G P HHR A G+ +++
Sbjct: 347 SQIDYALGS--TGRSFVVGYGVNPPQHPHHRTAH----GSWTDQMTS----------PTY 390
Query: 447 XXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLAQSEQEYLHFFQ 505
GA+VGGPD+ D + D +Y + E NA F G++A + S + + F+
Sbjct: 391 HRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFK 449
>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
Length = 614
Score = 221 bits (562), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 152/479 (31%), Positives = 232/479 (48%), Gaps = 43/479 (8%)
Query: 38 TFAYRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVKF 97
T+ Y +AL K+I+F+E QRSG LP +R WR +S + DG V+L GG+YDAGD+VKF
Sbjct: 3 TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62
Query: 98 GWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDGN 157
PM+Y+ ++L+W+ E + Q Y+ I+W D+ ++ + +P Y QVGDG
Sbjct: 63 NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122
Query: 158 ADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLNHS 217
DH W E M R +++ + PG +VFK D Y+ + ++H+
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182
Query: 218 KSLFEFADKHR--GSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYVLS---NQ 272
K+LF+ ADK + Y A+ Y S S + D+L WAA WLY A+ D+ YLD S N
Sbjct: 183 KNLFDMADKAKSDAGYTAAS-GYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNW 241
Query: 273 GWSQVASEFS------WDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSV 326
G Q + WD+ GA++LLAK NKQL +K +E + +
Sbjct: 242 GKEQQTDIIAYKWGQCWDDVHYGAELLLAKLT---NKQL--YKDSIEMNLDFWTTGVNGT 296
Query: 327 RIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAK 386
R+ TP GL ++ +L++ T+ L +Y++ G S F K
Sbjct: 297 RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAE----------WEGCTPSKVSVYKDFLK 346
Query: 387 SQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXX 446
SQ+DY LG S++ G+G P HHR A G+ +++
Sbjct: 347 SQIDYALGS--TGRSFVVGYGVNPPQHPHHRTAH----GSWTDQMTS----------PTY 390
Query: 447 XXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVAPLLAQSEQEYLHFFQ 505
GA+VGGPD+ D + D +Y + E NA F G++A + S + + F+
Sbjct: 391 HRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFK 449
>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticum
pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticium In Complex With Cellobiose
Length = 441
Score = 214 bits (545), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 141/467 (30%), Positives = 218/467 (46%), Gaps = 59/467 (12%)
Query: 38 TFAYRDALGKAILFFEGQRSG-KLPESQRVKWRGNSALSDGKPENVNLIGGYYDAGDNVK 96
T Y AL +I+FF+ + G + E+ WRG +DG V+L GGY+DAGD+VK
Sbjct: 3 THDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVK 62
Query: 97 FGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGAIRWGTDFILRAHTSPTTLYTQVGDG 156
FG P YS ++L W+ E++ + + +++ TD+ L++H + TT Y QVG+G
Sbjct: 63 FGLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEG 122
Query: 157 NADHQCWERPEDMDTPR-TLYRITSDSPGXXXXXXXXXXXXXXXIVFKKVDSIYSSRLLN 215
NADH W PE+ R +LY+ SP + +K +DS Y+++ LN
Sbjct: 123 NADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLN 182
Query: 216 HSKSLFEFADKHRGSYQASCPFYCSYSGYQDELLWAAAWLYKASEDNKYLDYV------- 268
+K L+ ++G FY + S + D+L WAA WLY A+ D+ Y+
Sbjct: 183 AAKELYAMGKANQGVGNGQS-FYQATS-FGDDLAWAATWLYTATNDSTYITDAEQFITLG 240
Query: 269 -LSNQGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLSLFKIHVESFVCALMPESSSVR 327
N+ Q WD+ + A + LA Q++ +I+ ++ +
Sbjct: 241 NTMNENKMQDKWTMCWDDMYVPAALRLA--------QITGKQIYKDAIEFNF--NYWKTQ 290
Query: 328 IETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKS 387
+ TTPGGL ++ + L+Y + ++++ +Y K + AK
Sbjct: 291 VTTTPGGLKWLSNWGVLRYAAAESMVMLVYCK---------------QNPDQSLLDLAKK 335
Query: 388 QVDYILGKNPMKMSYMAGFGSKFPLQIHHRGAS----IPSIGAHPAKVSCNDGXXXXXXX 443
QVDYILG NP MSY+ G+GS + + HHR A+ A PAK
Sbjct: 336 QVDYILGDNPANMSYIIGYGSNWCIHPHHRAANGYTYANGDNAKPAK------------- 382
Query: 444 XXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFVGSVA 490
+ GA+VGGPD ND+F D Y + E NA VG +A
Sbjct: 383 -----HLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLA 424
>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
Length = 611
Score = 179 bits (454), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 143/509 (28%), Positives = 229/509 (44%), Gaps = 89/509 (17%)
Query: 39 FAYRDALGKAILFFEGQRSGK-LPESQRVKWRG--------------NSALSDG------ 77
+ Y AL ++ F++ G + E+ + WRG N+ +SDG
Sbjct: 38 YNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSNR 97
Query: 78 ---KPE---NVNLIGGYYDAGDNVKFGWPMAYSVSLLSWAAVEYQREISSVNQLGYLRGA 131
P+ V++ GG++DAGD+VKFG P AY+ S + W E++ + + Q +
Sbjct: 98 SVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEVI 157
Query: 132 IRWGTDFILR-----AHTSPTTLYTQVGDGNADHQCWERPEDMDTPRTLYRITSDSPGXX 186
+R+ D+ +R A + QVGDG+ DH W PE+ R + IT + PG
Sbjct: 158 LRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGTD 217
Query: 187 XXXXXXXXXXXXXIVFKKVDSIYSSRLLNHSKSLFEFADKH-RGSYQAS--CPFYCSYSG 243
+ FK D Y+++ L+++K+LF+FA+K+ +G Q Y S
Sbjct: 218 IISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQGEDGPKGYYGSSK 277
Query: 244 YQDELLWAAAWLYKASEDNKYLDYVL------SNQGWSQVASEFSWDNKFAGAQMLLAK- 296
+QD+ WAAAWLY A+++ YLD + GW W++ ++G +LA+
Sbjct: 278 WQDDYCWAAAWLYLATQNEHYLDEAFKYYDYYAPPGWIHC-----WNDVWSGTACILAEI 332
Query: 297 -------------EFFGGNKQLSLFKIHVESFVCALMPESSSVRIETTPGGLLYIRDSSN 343
+ + + +I + L+ + S I TPGG +++ +
Sbjct: 333 NDLYDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGITVTPGGYVFLNQWGS 392
Query: 344 LQYVTSATLLLFLYSKTLNTAHINGLQCGSAHFSASQISAFAKSQVDYILGKNPMKMSYM 403
+Y T+A L+ +Y K + S+ + +A+SQ+DY+LGKNP+ Y+
Sbjct: 393 ARYNTAAQLIALVYDKHHGD-------------TPSKYANWARSQMDYLLGKNPLNRCYV 439
Query: 404 AGFGSKFPLQIHHRGASIPSIGAHPAKVSCNDGXXXXXXXXXXXXXVHVGAIVGGPDSND 463
G+ S HHR AS G A S V GA+VGGPD++D
Sbjct: 440 VGYSSNSVKYPHHRAAS----GLKDANDS------------SPHKYVLYGALVGGPDASD 483
Query: 464 QFKDLRTDYSHAEPTTYMNAAFVGSVAPL 492
Q D DY + E NAAFVG+ A L
Sbjct: 484 QHVDRTNDYIYNEVAIDYNAAFVGACAGL 512
>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
Thermoacidophilic Alicyclobacillus Acidocaldarius
pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellobiose
pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellotetraose
pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
Isofagomine
pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
Isofagomine
pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
Isofagomine
Length = 537
Score = 63.2 bits (152), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 116/487 (23%), Positives = 186/487 (38%), Gaps = 74/487 (15%)
Query: 40 AYRDALGKAILFFEGQRSGK-LPESQRVKW-RGNSALSDGK----PENVNLIGGYYDAGD 93
AYRD L + FF+ Q G LPE + W G SD K + GG++DAGD
Sbjct: 87 AYRDVLEAMLRFFDYQLCGVVLPEDEAGPWAHGACHTSDAKVFGTERALACPGGWHDAGD 146
Query: 94 NVKFGWPMAYSVSLLSW------AAVEYQREISSVNQLGYLRGAIRWGTDFI--LRAHTS 145
K+ P A +V+ L AA+ + R + SV++ +L A+ + I L
Sbjct: 147 YGKYTVPAAKAVADLLLAHEYFPAALAHVRPMRSVHRAPHLPPALEVAREEIAWLLTMQD 206
Query: 146 PTT--LYTQVGDGNADHQCWERPEDMDTPRTLYRITSDSPGXXXXXXXXXXXXXXXIVFK 203
P T +Y +V + RPED D P L I+ +V++
Sbjct: 207 PATGGVYHKVTTPSFP-PLDTRPEDDDAPLVLSPIS-----YAATATFCAAMAHAALVYR 260
Query: 204 KVDSIYSSRLLNHSKSLFEFADKHRGSYQASCPFYCS---------YSGYQDELLWAAAW 254
D SS + ++ + + H PF+ + +DELLWA+
Sbjct: 261 PFDPALSSCCADAARRAYAWLGAHEMQ-----PFHNPDGILTGEYGDAELRDELLWASCA 315
Query: 255 LYKASEDNKY-------LDYVLS-NQGWSQVASEFSWDNKFAGAQMLLAKEFFGGNKQLS 306
L + + D+ + LD L GW+ VA + + + ++ + NK S
Sbjct: 316 LLRMTGDSAWARVCEPLLDLDLPWELGWADVAL-YGVMDYLRTPRAAVSDDV--RNKVKS 372
Query: 307 LFKIHVESFVCALMPESSSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHI 366
+++ A M ES I ++ SN+ + A + FL ++ + H
Sbjct: 373 RLLRELDAL--AAMAESHPFGIPMRDDDFIW---GSNMVLLNRA--MAFLLAEGVGVLH- 424
Query: 367 NGLQCGSAHFSASQISAFAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHRGASIPSIGA 426
+AH A+ DY+ G NP+ Y+ GFG + HHR S+
Sbjct: 425 -----PAAH-------TVAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHR-PSVADDVD 471
Query: 427 HPAKVSCNDGXXXXXXXXXXXXXVHVGAIVGGPDSNDQFKDLRTDYSHAEPTTYMNAAFV 486
HP G + + G P + + + D + YS E Y N+ V
Sbjct: 472 HPVP-----GMVVGGPNRHLQDEIARAQLAGRP-AMEAYIDHQDSYSTNEVAVYWNSPAV 525
Query: 487 GSVAPLL 493
+A LL
Sbjct: 526 FVIAALL 532
>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
Angstroms Resolution
Length = 639
Score = 42.0 bits (97), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 52/225 (23%), Positives = 89/225 (39%), Gaps = 26/225 (11%)
Query: 201 VFKKVDSIYSSRLLNHSKSLFEFADKHRGSYQA-----SCPFYCSYSGYQDELLWAAAWL 255
+F+ D Y+ + +N +K +EF + + A S Y + S D+ LWAAA +
Sbjct: 302 IFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFSTGEYATVSD-ADDRLWAAAEM 360
Query: 256 YKASEDNKYL-DYV-LSNQGWSQVASEFSWDN-KFAGAQMLLAKEFFGGNKQLSLFKIHV 312
++ D +YL D+ + Q ++ ++F WDN G L E G N L V
Sbjct: 361 WETLGDEEYLRDFENRAAQFSKKIEADFDWDNVANLGMFTYLLSERPGKNPAL------V 414
Query: 313 ESFVCALMPESSSVRIETTPGGLLYIRDSSNLQYVTSATLLLFLYSKTLNTAHINGLQCG 372
+S +L+ + S+ +R S N Y + + LQ
Sbjct: 415 QSIKDSLLSTADSI-----------VRTSQNHGYGRTLGTTYYWGCNGTVVRQTMILQVA 463
Query: 373 SAHFSASQISAFAKSQVDYILGKNPMKMSYMAGFGSKFPLQIHHR 417
+ + A + ++ G+N SY+ G G P+ H R
Sbjct: 464 NKISPNNDYVNAALDAISHVFGRNYYNRSYVTGLGINPPMNPHDR 508
>pdb|1GXM|A Chain A, Family 10 Polysaccharide Lyase From Cellvibrio Cellulosa
pdb|1GXM|B Chain B, Family 10 Polysaccharide Lyase From Cellvibrio Cellulosa
pdb|1GXN|A Chain A, Family 10 Polysaccharide Lyase From Cellvibrio Cellulosa
Length = 332
Score = 32.3 bits (72), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 47/107 (43%), Gaps = 23/107 (21%)
Query: 41 YRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGK-PENVNLIGGYYDAGDNVKFGW 99
YRDA+ KA F NS S G P+ L GGY D G
Sbjct: 95 YRDAVRKAANFLV-----------------NSQYSTGALPQFYPLKGGYSDHATFNDNG- 136
Query: 100 PMAYSVSLLSWAA---VEYQREISSVNQLGYLRGAIRWGTDFILRAH 143
MAY++++L +AA + ++ S N + A+ GTD+IL+A
Sbjct: 137 -MAYALTVLDFAANKRAPFDTDVFSDNDRTRFKTAVTKGTDYILKAQ 182
>pdb|1GXO|A Chain A, Mutant D189a Of Family 10 Polysaccharide Lyase From
Cellvibrio Cellulosa In Complex With Trigalaturonic Acid
Length = 332
Score = 32.3 bits (72), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 47/107 (43%), Gaps = 23/107 (21%)
Query: 41 YRDALGKAILFFEGQRSGKLPESQRVKWRGNSALSDGK-PENVNLIGGYYDAGDNVKFGW 99
YRDA+ KA F NS S G P+ L GGY D G
Sbjct: 95 YRDAVRKAANFLV-----------------NSQYSTGALPQFYPLKGGYSDHATFNDNG- 136
Query: 100 PMAYSVSLLSWAA---VEYQREISSVNQLGYLRGAIRWGTDFILRAH 143
MAY++++L +AA + ++ S N + A+ GTD+IL+A
Sbjct: 137 -MAYALTVLDFAANKRAPFDTDVFSDNDRTRFKTAVTKGTDYILKAQ 182
>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 609
Score = 30.8 bits (68), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 389 VDYILGKNPMKMSYMAGFGSKFPLQIHH 416
+ Y+LG+N M SY+ G+G + PLQ H
Sbjct: 504 ISYLLGRNAMDQSYVTGYGER-PLQNPH 530
>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 610
Score = 30.8 bits (68), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 389 VDYILGKNPMKMSYMAGFGSKFPLQIHH 416
+ Y+LG+N M SY+ G+G + PLQ H
Sbjct: 504 ISYLLGRNAMDQSYVTGYGER-PLQNPH 530
>pdb|3H7L|A Chain A, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
pdb|3H7L|B Chain B, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
pdb|3H7L|C Chain C, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
Length = 586
Score = 30.4 bits (67), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 380 QISAFAKSQVDYILGKNPMKMSYMAGFGSKFP 411
Q+S FA+ +++I+G NP M + G G P
Sbjct: 469 QLSVFAQDALNWIVGLNPYDMCMLDGHGRNNP 500
Database: pdbaa
Posted date: Mar 3, 2013 10:34 PM
Number of letters in database: 14,973,337
Number of sequences in database: 62,578
Lambda K H
0.319 0.134 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,289,812
Number of Sequences: 62578
Number of extensions: 627897
Number of successful extensions: 1136
Number of sequences better than 100.0: 15
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 1070
Number of HSP's gapped (non-prelim): 23
length of query: 511
length of database: 14,973,337
effective HSP length: 103
effective length of query: 408
effective length of database: 8,527,803
effective search space: 3479343624
effective search space used: 3479343624
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 54 (25.4 bits)