BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047277
(498 letters)
Database: pdbaa
62,578 sequences; 14,973,337 total letters
Searching..................................................done
>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 2.5.
pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 5.6.
pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 6.5
Length = 433
Score = 283 bits (723), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 178/477 (37%), Positives = 244/477 (51%), Gaps = 60/477 (12%)
Query: 21 SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
++DY L S+LF+E QRSG+LP +Q++TWRKDSAL D G DLTGGY+DAGD VKF
Sbjct: 2 AYDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKF 61
Query: 81 NFPMAFSTTMLAWSVLEF--GKSMESDLQHALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
FPMA++ T+LAW +++F G S L A++WATDYF+KA + Y QVG+
Sbjct: 62 GFPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGD 121
Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
DH W RPEDM AR Y + PG FR+ D TY+ LL A
Sbjct: 122 ADHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHA 181
Query: 199 KTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENI 258
+ +F FA+ YRG Y+D + FY + Y DELVW AAWLYRAT Y N E++
Sbjct: 182 RQLFDFANNYRGKYSDSITD-ARNFYAS-ADYRDELVWAAAWLYRATNDNTYLN-TAESL 238
Query: 259 Y------NLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPK 312
Y N G G WD+K +G+ VL+++ N +
Sbjct: 239 YDEFGLQNWGGG----------------LNWDSKVSGVQVLLAKLTN------KQAYKDT 276
Query: 313 ADTFVCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCG 372
++V L+ +P GLL+ L+HA +F++ +
Sbjct: 277 VQSYVNYLINNQQK-----TPKGLLYIDMWGTLRHAANAAFIM-------------LEAA 318
Query: 373 NVVVNRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIG 432
+ ++ + Q A+ Q+DY LG S++ G+G N P R HHRSSS P
Sbjct: 319 ELGLSASSYRQFAQTQIDYALGDG--GRSFVCGFGSNPPTRPHHRSSSCPPA-----PAT 371
Query: 433 CQDGTKYYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
C T ++S + N + L+GA+VGGPD ND+Y D RSD+VH+E T NA LA
Sbjct: 372 CDWNT--FNSPDPNYHVLSGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALA 426
>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
Length = 605
Score = 268 bits (686), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 171/470 (36%), Positives = 236/470 (50%), Gaps = 39/470 (8%)
Query: 23 DYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKFNF 82
+Y +AL KS+ F+E QRSGKLP N R++WR DS L DG +G+DLTGG+YDAGD+VKF F
Sbjct: 5 NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGF 64
Query: 83 PMAFSTTMLAWSVLEF--GKSMESDLQHALDAIRWATDYFLKATSIPGIVYAQVGEPYGD 140
PMAF+ TMLAW +E G + + D +RW DYF+KA P ++Y QVG+ D
Sbjct: 65 PMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDAD 124
Query: 141 HDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKT 200
H W E M R + V PG F D YA L+ AK
Sbjct: 125 HKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAKQ 184
Query: 201 VFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYN 260
++ FAD YRG Y+D + FY +SGY+DELVWGA WLY+AT +Y E Y+
Sbjct: 185 LYTFADTYRGVYSDCVP--AGAFYNSWSGYQDELVWGAYWLYKATGDDSYLA-KAEYEYD 241
Query: 261 LGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTFVCSL 320
S + T+ WD+K+ G VL+++ +I A+ ++
Sbjct: 242 FLSTEQQTDLRSYRWT----IAWDDKSYGTYVLLAK------ETGKQKYIDDANRWLDYW 291
Query: 321 LPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNRAR 380
+ V YSPGG+ L++A +F+ +VY++ + V + R
Sbjct: 292 TVGVNGQRVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDP----------VRKQR 341
Query: 381 LVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKYY 440
A Q++Y LG NP N SY+VG+G N P+ HHR++ T I
Sbjct: 342 YHDFAVRQINYALGDNPRNSSYVVGFGNNPPRNPHHRTAH----GSWTDSIA-------- 389
Query: 441 DSKNSNPNELTGAVVGGPDA-NDSYADSRSDFVHSEPTTYINAPLVGLLA 489
S N + L GA+VGGP + ND+Y D R D+V +E T NA LA
Sbjct: 390 -SPAENRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALA 438
>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
Endoglucanase From Separately Expressed Gh9 And Cbm3c
Modules
Length = 466
Score = 234 bits (597), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 164/473 (34%), Positives = 231/473 (48%), Gaps = 46/473 (9%)
Query: 23 DYGDALAKSILFFEGQRSGKLPPNQ-RMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKFN 81
+YG+AL K+I F+E QRSGKL + R+ WR DS L+DG G+DLTGG+YDAGD+VKFN
Sbjct: 25 NYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFN 84
Query: 82 FPMAFSTTMLAWSVLEFGKSMESDLQ--HALDAIRWATDYFLKATSIPGIVYAQVGEPYG 139
PM++S ML W+V E+ + + Q H L+ I+WA DYF+K + Y QVG+ +
Sbjct: 85 LPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQVGDGHA 144
Query: 140 DHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAK 199
DH W E M R Y V + PG F+ D Y+ + L AK
Sbjct: 145 DHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKECLKHAK 204
Query: 200 TVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIY 259
+F FAD + S + Y FY +SG+ DEL W A WLY AT +Y + E+
Sbjct: 205 ELFEFADTTK-SDDGYTA--ANGFYNSWSGFYDELSWAAVWLYLATNDSSYLD-KAESYS 260
Query: 260 NLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTFVCS 319
+ TN WD+ G +L++R N N D +
Sbjct: 261 DKWGYEPQTNIPKYKWAQC----WDDVTYGTYLLLARIKND-NGKYKEAIERHLDWWTTG 315
Query: 320 LLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNRA 379
E ++Y+P GL + +L++ATT +FL VYS D G+ +
Sbjct: 316 YNGE----RITYTPKGLAWLDQWGSLRYATTTAFLACVYS--------DWENGDKEKAKT 363
Query: 380 RLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKY 439
L + A+ Q DY LGS S++VG+G+N P+R HHR++ +
Sbjct: 364 YL-EFARSQADYALGST--GRSFVVGFGENPPKRPHHRTAH----------------GSW 404
Query: 440 YDSKNSNPNE---LTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
DS+ P L GA+VGGPD+ D+Y D S++ +E NA VGLLA
Sbjct: 405 ADSQMEPPEHRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLA 457
>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
Length = 614
Score = 232 bits (591), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 162/473 (34%), Positives = 235/473 (49%), Gaps = 45/473 (9%)
Query: 21 SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
+++YG+AL KSI+F+E QRSG LP ++R WR DS ++DG +GVDLTGG+YDAGD+VKF
Sbjct: 3 TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62
Query: 81 NFPMAFSTTMLAWSVLEFGKSMESDLQ--HALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
N PM++++ MLAWS+ E + + Q + +D I+WA DYF+K PG+ Y QVG+
Sbjct: 63 NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122
Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
DH W E M R + V PG F+S+D TYA K ++ A
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182
Query: 199 KTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENI 258
K +F AD+ + G Y S + D+L W A WLY AT Y + +
Sbjct: 183 KNLFDMADKAKSD----AGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYV 238
Query: 259 YNLGSGRKITNXXXXXXXXXTEFG--WDNKNAGINVLVSRFINTINSNSSNPFIPKADTF 316
N G ++ ++G WD+ + G +L+++ N S I F
Sbjct: 239 PNWGKEQQTD-------IIAYKWGQCWDDVHYGAELLLAKLTNKQLYKDS---IEMNLDF 288
Query: 317 VCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVV 376
+ + + VSY+P GL + +L+HATT +FL VY+ + V V
Sbjct: 289 WTTGVNGT---RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEW-----EGCTPSKVSV 340
Query: 377 NRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDG 436
+ L K Q+DY LGS S++VGYG N PQ HHR++ Q T
Sbjct: 341 YKDFL----KSQIDYALGST--GRSFVVGYGVNPPQHPHHRTAHGSWTDQMT-------- 386
Query: 437 TKYYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
S + + + GA+VGGPD D Y D +++V++E NA G LA
Sbjct: 387 -----SPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434
>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
Length = 614
Score = 232 bits (591), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 162/471 (34%), Positives = 233/471 (49%), Gaps = 41/471 (8%)
Query: 21 SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
+++YG+AL KSI+F+E QRSG LP ++R WR DS ++DG +GVDLTGG+YDAGD+VKF
Sbjct: 3 TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKF 62
Query: 81 NFPMAFSTTMLAWSVLEFGKSMESDLQ--HALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
N PM++++ MLAWS+ E + + Q + +D I+WA DYF+K PG+ Y QVG+
Sbjct: 63 NLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGG 122
Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
DH W E M R + V PG F+S+D TYA K ++ A
Sbjct: 123 KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHA 182
Query: 199 KTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENI 258
K +F AD+ + G Y S + D+L W A WLY AT Y + +
Sbjct: 183 KNLFDMADKAKSD----AGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYV 238
Query: 259 YNLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTFVC 318
N G ++ WD+ + G +L+++ N S I F
Sbjct: 239 PNWGKEQQTDIIAYKWGQX-----WDDVHYGAELLLAKLTNKQLYKDS---IEMNLDFWT 290
Query: 319 SLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNR 378
+ + + VSY+P GL + +L+HATT +FL VY+ + V V +
Sbjct: 291 TGVNGT---RVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEW-----EGCTPSKVSVYK 342
Query: 379 ARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTK 438
L K Q+DY LGS S++VGYG N PQ HHR++ Q T
Sbjct: 343 DFL----KSQIDYALGST--GRSFVVGYGVNPPQHPHHRTAHGSWTDQMT---------- 386
Query: 439 YYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
S + + + GA+VGGPD D Y D +++V++E NA G LA
Sbjct: 387 ---SPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALA 434
>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
Length = 475
Score = 217 bits (553), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 154/481 (32%), Positives = 228/481 (47%), Gaps = 60/481 (12%)
Query: 21 SHDYGDALAKSILFFEGQRSGKLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVKF 80
S +YG+AL K+I+F+E Q SGKLP R WR DSAL+DG G+DLTGG++DAGD+VKF
Sbjct: 25 SFNYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDHVKF 84
Query: 81 NFPMAFSTTMLAWSVLEFGKSM--ESDLQHALDAIRWATDYFLKATSIPGIVYAQVGEPY 138
N PM+++ TML+W+V E+ + L+H L+ I W DYF+K + Y QVG+
Sbjct: 85 NLPMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVGDGS 144
Query: 139 GDHDCWERPEDMDTARTPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRA 198
DH W E M R + V++ PG + + T A L A
Sbjct: 145 KDHAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYLQHA 204
Query: 199 KTVFVFAD--RYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVE 256
K ++ FA+ + Y G +Y +SG+ DEL W A WLY AT Y
Sbjct: 205 KELYEFAEVTKSDAGYTAANG-----YYNSWSGFYDELSWAAVWLYLATNDSTYLTKAES 259
Query: 257 NIYNLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTF 316
+ N KI+ WD+ + G +L+++ + + ++
Sbjct: 260 YVQNW---PKISGSNTIDYKWA--HCWDDVHNGAALLLAKI------TGKDIYKQIIESH 308
Query: 317 VCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVV 376
+ + + Y+P GL + +L++ATT +FL VYS ++ G
Sbjct: 309 LDYWTTGYNGERIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWV---------GCPST 359
Query: 377 NRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHR--------SSSLPSVAQHT 428
+ + + Q+DY LGS S++VG+G N P+R HHR S S+PS +HT
Sbjct: 360 KKEIYRKFGESQIDYALGS--AGRSFVVGFGTNPPKRPHHRTAHSSWADSQSIPSYHRHT 417
Query: 429 GKIGCQDGTKYYDSKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLL 488
L GA+VGGP ++DSY D S++V++E NA VG L
Sbjct: 418 ---------------------LYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGAL 456
Query: 489 A 489
A
Sbjct: 457 A 457
>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticum
pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticium In Complex With Cellobiose
Length = 441
Score = 197 bits (501), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 147/475 (30%), Positives = 212/475 (44%), Gaps = 59/475 (12%)
Query: 21 SHDYGDALAKSILFFEGQRSG-KLPPNQRMTWRKDSALEDGFQIGVDLTGGYYDAGDNVK 79
+HDY AL SI+FF+ + G + N WR DG +GVDLTGGY+DAGD+VK
Sbjct: 3 THDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVK 62
Query: 80 FNFPMAFSTTMLAWSVLEFGKSMES--DLQHALDAIRWATDYFLKATSIPGIVYAQVGEP 137
F P +S +L WS+ EF +S ++ + L +++ TDYFLK+ Y QVGE
Sbjct: 63 FGLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEG 122
Query: 138 YGDHDCWERPEDMDTAR-TPYAVSKQFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLN 196
DH W PE+ R + Y P +++ DS YA K LN
Sbjct: 123 NADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLN 182
Query: 197 RAKTVFVFADRYRGSYNDYLGKWVCPFYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVE 256
AK ++ +G N FY + + D+L W A WLY AT Y E
Sbjct: 183 AAKELYAMGKANQGVGNGQ------SFY-QATSFGDDLAWAATWLYTATNDSTYIT-DAE 234
Query: 257 NIYNLGSGRKITNXXXXXXXXXTEFGWDNKNAGINVLVSRFINTINSNSSNPFIPKADTF 316
LG+ WD+ + +++ + A F
Sbjct: 235 QFITLGN-----TMNENKMQDKWTMCWDDMYVPAALRLAQI-------TGKQIYKDAIEF 282
Query: 317 VCSLLPESPTKSVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVV 376
+ V+ +PGGL + L++A S +++VY + N D
Sbjct: 283 NFNYWK----TQVTTTPGGLKWLSNWGVLRYAAAESMVMLVYCK----QNPD-------- 326
Query: 377 NRARLVQVAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDG 436
L+ +AK QVDYILG NP NMSY++GYG N+ HHR++ +G
Sbjct: 327 --QSLLDLAKKQVDYILGDNPANMSYIIGYGSNWCIHPHHRAA---------------NG 369
Query: 437 TKYYDSKNSNPNE--LTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
Y + N+ P + LTGA+VGGPD ND + D + + ++E NA LVG+LA
Sbjct: 370 YTYANGDNAKPAKHLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLA 424
>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
Length = 611
Score = 154 bits (388), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 140/520 (26%), Positives = 224/520 (43%), Gaps = 97/520 (18%)
Query: 21 SHDYGDALAKSILFFEGQRSGK-LPPNQRMTWRKD--------------SALEDGF---- 61
++Y AL S+ F++ G + N ++WR D + + DGF
Sbjct: 37 EYNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSN 96
Query: 62 --------QIGVDLTGGYYDAGDNVKFNFPMAFSTTMLAWSVLEFGKSMESDLQ--HALD 111
VD++GG++DAGD+VKF P A++ + + W EF + Q HA
Sbjct: 97 RSVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEV 156
Query: 112 AIRWATDYFLKAT--SIPGIVYA---QVGEPYGDHDCWERPEDMDTARTPYAVSKQFPGX 166
+R+ DYF++ T G V A QVG+ DH W PE+ R + ++K+ PG
Sbjct: 157 ILRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGT 216
Query: 167 XXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKTVFVFADRY-RGSYNDYLGKWVCPFYC 225
F+ TD YA K L+ AK +F FA++ +G G Y
Sbjct: 217 DIISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQGEDGP---KGYY 273
Query: 226 DFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYNLGSGRKITNXXXXXXXXXTEFGWDN 285
S ++D+ W AAWLY AT+ +Y L K + W++
Sbjct: 274 GSSKWQDDYCWAAAWLYLATQNEHY----------LDEAFKYYDYYAPPGWIHC---WND 320
Query: 286 KNAGINVLVSRFINTINSNSSNPFIPKADTF--------------VCSLLPESPTKSVSY 331
+G +++ IN + S F + + LL + ++
Sbjct: 321 VWSGTACILAE-INDLYDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGITV 379
Query: 332 SPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNVVVNRARLVQVAKGQVDY 391
+PGG +F + ++ T + +VY D H G+ ++ A+ Q+DY
Sbjct: 380 TPGGYVFLNQWGSARYNTAAQLIALVY---------DKHHGDT---PSKYANWARSQMDY 427
Query: 392 ILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKYYDSKNSNPNE-- 449
+LG NPLN Y+VGY N + HHR++S G K D+ +S+P++
Sbjct: 428 LLGKNPLNRCYVVGYSSNSVKYPHHRAAS---------------GLK--DANDSSPHKYV 470
Query: 450 LTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
L GA+VGGPDA+D + D +D++++E NA VG A
Sbjct: 471 LYGALVGGPDASDQHVDRTNDYIYNEVAIDYNAAFVGACA 510
>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
Thermoacidophilic Alicyclobacillus Acidocaldarius
pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellobiose
pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellotetraose
pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
Isofagomine
pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
Isofagomine
pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
Isofagomine
Length = 537
Score = 45.8 bits (107), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 49/106 (46%), Gaps = 6/106 (5%)
Query: 384 VAKGQVDYILGSNPLNMSYMVGYGKNFPQRIHHRSSSLPSVAQHTGKIGCQDGTKYYDSK 443
VA+ DY+ G+NPL Y+ G+G+ + HHR S V + ++
Sbjct: 430 VAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHRPSVADDVDHPVPGMVVGGPNRHLQ-- 487
Query: 444 NSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLA 489
+E+ A + G A ++Y D + + +E Y N+P V ++A
Sbjct: 488 ----DEIARAQLAGRPAMEAYIDHQDSYSTNEVAVYWNSPAVFVIA 529
>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 609
Score = 38.9 bits (89), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 100/470 (21%), Positives = 161/470 (34%), Gaps = 78/470 (16%)
Query: 66 DLTGGYYDAGDNVKF--NFPMAFSTTMLA-----------WSVLEFG----KSMESDLQH 108
D+TGG+YDAGD+ K+ N +A T M W G +
Sbjct: 169 DVTGGWYDAGDHGKYVVNGGIAVWTLMNMYERAKIRGLDNWGPYRDGGMNIPEQNNGYPD 228
Query: 109 ALDAIRWATDYFLKAT-------SIPGIVYAQVGEPYGDHDCWERPEDMDTARTPYAVSK 161
LD RW ++F K SI G+V+ ++ HD M P
Sbjct: 229 ILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKI------HDFRWTALGMLPHEDPQPRYL 282
Query: 162 QFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKTVFVFADRYRGSYNDYLGKWVC 221
+ ++ D T+A L +A+ + A ++ Y +Y
Sbjct: 283 RPVSTAATLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQAALKHPDIYAEYTPGSGG 342
Query: 222 P----FYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYNLGSGRKITNXXXXXXXX 277
P + D+ G DE W A LY T Y NY++ + + L K+
Sbjct: 343 PGGGPYNDDYVG--DEFYWAACELYVTTGKDEYKNYLMNSPHYLEMPAKMGENGGANGED 400
Query: 278 XTEFG---WDNKNAGINVLVSRFINTINS----NSSNPFIPKADTFVCSLLPES---PTK 327
+G W + ++ N + + + N AD ++ ++ + P K
Sbjct: 401 NGLWGCFTWGTTQGLGTITLALVENGLPATDIQKARNNIAKAADRWLENIEEQGYRLPIK 460
Query: 328 SVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNV--VVNRARLVQVA 385
GG P GSN SF+L N+ I G ++ +
Sbjct: 461 QAEDERGGY---PWGSN-------SFIL----------NQMIVMGYAYDFTGDSKYLDGM 500
Query: 386 KGQVDYILGSNPLNMSYMVGYGKNFPQRIHHR----SSSLPSVAQHTGKIGCQDGTKYYD 441
+ Y+LG N ++ SY+ GYG+ Q H R +S A G I +++ D
Sbjct: 501 FDGISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPAPPPGIISGGPNSRFED 560
Query: 442 SKNSNPNELTGAVVGGPDANDSYADSRSDFVHSEPTTYINAPLVGLLAYF 491
+ AV + D + +E T NAP + AY
Sbjct: 561 PT------INAAVKKDTPPQKCFIDHTDSWSTNEITVNWNAPFAWVTAYL 604
>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 610
Score = 37.7 bits (86), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 85/392 (21%), Positives = 137/392 (34%), Gaps = 68/392 (17%)
Query: 66 DLTGGYYDAGDNVKF--NFPMAFSTTMLA-----------WSVLEFG----KSMESDLQH 108
D+TGG+YDAGD+ K+ N +A T M W G +
Sbjct: 169 DVTGGWYDAGDHGKYVVNGGIAVWTLMNMYERAKIRGLDNWGPYRDGGMNIPEQNNGYPD 228
Query: 109 ALDAIRWATDYFLKAT-------SIPGIVYAQVGEPYGDHDCWERPEDMDTARTPYAVSK 161
LD RW ++F K SI G+V+ ++ HD M P
Sbjct: 229 ILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKI------HDFRWTALGMLPHEDPQPRYL 282
Query: 162 QFPGXXXXXXXXXXXXXXXXXFRSTDSTYAVKLLNRAKTVFVFADRYRGSYNDYLGKWVC 221
+ ++ D T+A L +A+ + A ++ Y +Y
Sbjct: 283 RPVSTAATLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQAALKHPDIYAEYTPGSGG 342
Query: 222 P----FYCDFSGYEDELVWGAAWLYRATKAPNYWNYVVENIYNLGSGRKITNXXXXXXXX 277
P + D+ G DE W A LY T Y NY++ + + L K+
Sbjct: 343 PGGGPYNDDYVG--DEFYWAACELYVTTGKDEYKNYLMNSPHYLEMPAKMGENGGANGED 400
Query: 278 XTEFG---WDNKNAGINVLVSRFINTINS----NSSNPFIPKADTFVCSLLPES---PTK 327
+G W + ++ N + + + N AD ++ ++ + P K
Sbjct: 401 NGLWGCFTWGTTQGLGTITLALVENGLPATDIQKARNNIAKAADRWLENIEEQGYRLPIK 460
Query: 328 SVSYSPGGLLFKPGGSNLQHATTLSFLLVVYSRYLKQANRDIHCGNV--VVNRARLVQVA 385
GG P GSN SF+L N+ I G ++ +
Sbjct: 461 QAEDERGGY---PWGSN-------SFIL----------NQMIVMGYAYDFTGDSKYLDGM 500
Query: 386 KGQVDYILGSNPLNMSYMVGYGKNFPQRIHHR 417
+ Y+LG N ++ SY+ GYG+ Q H R
Sbjct: 501 FDGISYLLGRNAMDQSYVTGYGERPLQNPHDR 532
>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
Angstroms Resolution
Length = 639
Score = 30.0 bits (66), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 18/62 (29%), Positives = 25/62 (40%), Gaps = 13/62 (20%)
Query: 371 CGNVVVNRARLVQVAK-------------GQVDYILGSNPLNMSYMVGYGKNFPQRIHHR 417
C VV + ++QVA + ++ G N N SY+ G G N P H R
Sbjct: 449 CNGTVVRQTMILQVANKISPNNDYVNAALDAISHVFGRNYYNRSYVTGLGINPPMNPHDR 508
Query: 418 SS 419
S
Sbjct: 509 RS 510
>pdb|4F7U|Q Chain Q, Macromolecular Machine 6
pdb|4F7U|P Chain P, Macromolecular Machine 6
Length = 129
Score = 29.6 bits (65), Expect = 4.0, Method: Composition-based stats.
Identities = 13/30 (43%), Positives = 18/30 (60%), Gaps = 1/30 (3%)
Query: 130 VYAQVGEPYGD-HDCWERPEDMDTARTPYA 158
VY V E +G+ +CW PED+ T T Y+
Sbjct: 86 VYGDVDEQFGEVTECWLMPEDIATVDTMYS 115
>pdb|3VPS|A Chain A, Structure Of A Novel Nad Dependent-Ndp-Hexosamine
5,6-Dehydratase, Tuna, Involved In Tunicamycin
Biosynthesis
pdb|3VPS|B Chain B, Structure Of A Novel Nad Dependent-Ndp-Hexosamine
5,6-Dehydratase, Tuna, Involved In Tunicamycin
Biosynthesis
Length = 321
Score = 29.3 bits (64), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
Query: 123 ATSIPGIVYAQVGEPYGDHDCWERPEDMD-TARTPYAVSK 161
+ +P +V E YG D PED + R+PYA SK
Sbjct: 108 SVGVPKVVVGSTCEVYGQADTLPTPEDSPLSPRSPYAASK 147
>pdb|3HV1|A Chain A, Crystal Structure Of A Polar Amino Acid Abc Uptake
Transporter Substrate Binding Protein From Streptococcus
Thermophilus
pdb|3HV1|B Chain B, Crystal Structure Of A Polar Amino Acid Abc Uptake
Transporter Substrate Binding Protein From Streptococcus
Thermophilus
Length = 268
Score = 28.5 bits (62), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 16/42 (38%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 414 IHHRSSSLPSVAQHTGK-IGCQDGTKYYDSKNSNPNELTGAV 454
+ +SS + SVA GK +G Q G+ YD+ N++P L V
Sbjct: 112 VTKKSSGIDSVAGXAGKTLGAQAGSSGYDAFNASPKILKDVV 153
Database: pdbaa
Posted date: Mar 3, 2013 10:34 PM
Number of letters in database: 14,973,337
Number of sequences in database: 62,578
Lambda K H
0.318 0.135 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,383,346
Number of Sequences: 62578
Number of extensions: 668137
Number of successful extensions: 1217
Number of sequences better than 100.0: 15
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 1150
Number of HSP's gapped (non-prelim): 24
length of query: 498
length of database: 14,973,337
effective HSP length: 103
effective length of query: 395
effective length of database: 8,527,803
effective search space: 3368482185
effective search space used: 3368482185
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 53 (25.0 bits)