BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 012581
(460 letters)
Database: pdbaa
62,578 sequences; 14,973,337 total letters
Searching..................................................done
>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 2.5.
pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 5.6.
pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 6.5
Length = 433
Score = 281 bits (718), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 169/405 (41%), Positives = 230/405 (56%), Gaps = 41/405 (10%)
Query: 36 DYHRALHMSILFFEGQRSGRLPPDQRVKWRRDSALHDGATSGVDLTGGYYDAGDNVKFGF 95
DY + L S+LF+E QRSGRLP DQ+V WR+DSAL+D G DLTGGY+DAGD VKFGF
Sbjct: 4 DYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFGF 63
Query: 96 PMAFTTTMLSWSIIDF--GKNMGPELKYAIKAVKWGTDYLLKATAKPGVVYVQVGDAFSD 153
PMA+T T+L+W +IDF G + L KAVKW TDY +KA Y QVG +D
Sbjct: 64 PMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGDAD 123
Query: 154 HNCWERPEDMDTLRTAYKIDVSHPGSDVAGETXXXXXXXSIVFRSRDPRYSRLLLSRAIE 213
H W RPEDM R AYKID S PGSD+AGET SIVFR+ D YS LL+ A +
Sbjct: 124 HAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHARQ 183
Query: 214 VFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKASRRREYREYI--LRNE 271
+F+FA+ +RG YS S+ FY + Y+DEL+W AAWL++A+ Y L +E
Sbjct: 184 LFDFANNYRGKYSDSITDARN-FYASAD-YRDELVWAAAWLYRATNDNTYLNTAESLYDE 241
Query: 272 VVLHA-GDTINEFGWDNKHAGINVLISKEVLMGKADYFESFKQNADGFICSILPGISHPQ 330
L G +N WD+K +G+ VL++K L K +++K ++ ++
Sbjct: 242 FGLQNWGGGLN---WDSKVSGVQVLLAK--LTNK----QAYKDTVQSYVNYLI-----NN 287
Query: 331 VQYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSASPNLLRQLAKHQ 390
Q +P GL++ ++H + +F++L + G ++S RQ A+ Q
Sbjct: 288 QQKTPKGLLYIDMWGTLRHAANAAFIML----------EAAELGLSASS---YRQFAQTQ 334
Query: 391 VDYILGDNPVGMSYMVGYGARYPQRIHHRGSSLPSVQAHPAHIGC 435
+DY LGD G S++ G+G+ P R HHR SS P PA C
Sbjct: 335 IDYALGDG--GRSFVCGFGSNPPTRPHHRSSSCP-----PAPATC 372
>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
Length = 605
Score = 260 bits (664), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 157/392 (40%), Positives = 213/392 (54%), Gaps = 26/392 (6%)
Query: 36 DYHRALHMSILFFEGQRSGRLPPDQRVKWRRDSALHDGATSGVDLTGGYYDAGDNVKFGF 95
+Y AL S+ F+E QRSG+LP + RV WR DS L+DGA G+DLTGG+YDAGD+VKFGF
Sbjct: 5 NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGF 64
Query: 96 PMAFTTTMLSWSIIDF--GKNMGPELKYAIKAVKWGTDYLLKATAKPGVVYVQVGDAFSD 153
PMAFT TML+W I+ G ++ Y ++W DY +KA P V+YVQVGD +D
Sbjct: 65 PMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDAD 124
Query: 154 HNCWERPEDMDTLRTAYKIDVSHPGSDVAGETXXXXXXXSIVFRSRDPRYSRLLLSRAIE 213
H W E M R ++K+D S PGSDVA ET SIVF DP Y+ L+ A +
Sbjct: 125 HKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAKQ 184
Query: 214 VFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKAS------RRREYREYI 267
++ FAD +RG YS + FY +GYQDEL+WGA WL+KA+ + EY
Sbjct: 185 LYTFADTYRGVYSDCVP--AGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDF 242
Query: 268 LRNEVVLHAGDTINEFGWDNKHAGINVLISKEVLMGKADYFESFKQNADGFICSILPGIS 327
L E WD+K G VL++KE GK Y + + D + G++
Sbjct: 243 LSTEQQTDLRSYRWTIAWDDKSYGTYVLLAKET--GKQKYIDDANRWLDYWTV----GVN 296
Query: 328 HPQVQYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSASPNLLRQLA 387
+V YSPGG+ +++ + +F+ L Y+ KV+ R A
Sbjct: 297 GQRVPYSPGGMAVLDTWGALRYAANTAFVALVYA-------KVIDDPVRK---QRYHDFA 346
Query: 388 KHQVDYILGDNPVGMSYMVGYGARYPQRIHHR 419
Q++Y LGDNP SY+VG+G P+ HHR
Sbjct: 347 VRQINYALGDNPRNSSYVVGFGNNPPRNPHHR 378
>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
Length = 614
Score = 227 bits (578), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 140/396 (35%), Positives = 207/396 (52%), Gaps = 29/396 (7%)
Query: 32 ASQNDYHRALHMSILFFEGQRSGRLPPDQRVKWRRDSALHDGATSGVDLTGGYYDAGDNV 91
A +Y AL SI+F+E QRSG LP D+R WR DS + DG+ GVDLTGG+YDAGD+V
Sbjct: 1 AGTYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHV 60
Query: 92 KFGFPMAFTTTMLSWSIID--FGKNMGPELKYAIKAVKWGTDYLLKATAKPGVVYVQVGD 149
KF PM++T+ ML+WS+ + + + KY + +KW DY +K PGV Y QVGD
Sbjct: 61 KFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGD 120
Query: 150 AFSDHNCWERPEDMDTLRTAYKIDVSHPGSDVAGETXXXXXXXSIVFRSRDPRYSRLLLS 209
DH+ W E M R ++K+D S PGS V T ++VF+S DP Y+ +S
Sbjct: 121 GGKDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCIS 180
Query: 210 RAIEVFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKASRRREY---REY 266
A +F+ AD+ + S + Y + + D+L W A WL+ A+ Y E
Sbjct: 181 HAKNLFDMADKAK----SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAES 236
Query: 267 ILRNEVVLHAGDTIN-EFG--WDNKHAGINVLISKEVLMGKADYFESFKQNADGFICSIL 323
+ N D I ++G WD+ H G +L++K L K Y +S + N D +
Sbjct: 237 YVPNWGKEQQTDIIAYKWGQCWDDVHYGAELLLAK--LTNKQLYKDSIEMNLDFWTT--- 291
Query: 324 PGISHPQVQYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSASPNLL 383
G++ +V Y+P GL + +++H T+ +FL Y+ + G + ++
Sbjct: 292 -GVNGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWE---------GCTPSKVSVY 341
Query: 384 RQLAKHQVDYILGDNPVGMSYMVGYGARYPQRIHHR 419
+ K Q+DY LG G S++VGYG PQ HHR
Sbjct: 342 KDFLKSQIDYALGS--TGRSFVVGYGVNPPQHPHHR 375
>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
Length = 614
Score = 227 bits (578), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 140/396 (35%), Positives = 207/396 (52%), Gaps = 29/396 (7%)
Query: 32 ASQNDYHRALHMSILFFEGQRSGRLPPDQRVKWRRDSALHDGATSGVDLTGGYYDAGDNV 91
A +Y AL SI+F+E QRSG LP D+R WR DS + DG+ GVDLTGG+YDAGD+V
Sbjct: 1 AGTYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHV 60
Query: 92 KFGFPMAFTTTMLSWSIID--FGKNMGPELKYAIKAVKWGTDYLLKATAKPGVVYVQVGD 149
KF PM++T+ ML+WS+ + + + KY + +KW DY +K PGV Y QVGD
Sbjct: 61 KFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGD 120
Query: 150 AFSDHNCWERPEDMDTLRTAYKIDVSHPGSDVAGETXXXXXXXSIVFRSRDPRYSRLLLS 209
DH+ W E M R ++K+D S PGS V T ++VF+S DP Y+ +S
Sbjct: 121 GGKDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCIS 180
Query: 210 RAIEVFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKASRRREY---REY 266
A +F+ AD+ + S + Y + + D+L W A WL+ A+ Y E
Sbjct: 181 HAKNLFDMADKAK----SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAES 236
Query: 267 ILRNEVVLHAGDTIN-EFG--WDNKHAGINVLISKEVLMGKADYFESFKQNADGFICSIL 323
+ N D I ++G WD+ H G +L++K L K Y +S + N D +
Sbjct: 237 YVPNWGKEQQTDIIAYKWGQXWDDVHYGAELLLAK--LTNKQLYKDSIEMNLDFWTT--- 291
Query: 324 PGISHPQVQYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSASPNLL 383
G++ +V Y+P GL + +++H T+ +FL Y+ + G + ++
Sbjct: 292 -GVNGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWE---------GCTPSKVSVY 341
Query: 384 RQLAKHQVDYILGDNPVGMSYMVGYGARYPQRIHHR 419
+ K Q+DY LG G S++VGYG PQ HHR
Sbjct: 342 KDFLKSQIDYALGS--TGRSFVVGYGVNPPQHPHHR 375
>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
Length = 475
Score = 217 bits (552), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 136/407 (33%), Positives = 215/407 (52%), Gaps = 30/407 (7%)
Query: 36 DYHRALHMSILFFEGQRSGRLPPDQRVKWRRDSALHDGATSGVDLTGGYYDAGDNVKFGF 95
+Y AL +I+F+E Q SG+LP R WR DSAL DG +G+DLTGG++DAGD+VKF
Sbjct: 27 NYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDHVKFNL 86
Query: 96 PMAFTTTMLSWSIIDFGKNM--GPELKYAIKAVKWGTDYLLKATAKPGVVYVQVGDAFSD 153
PM++T TMLSW++ ++ +L++ + ++W DY +K V Y QVGD D
Sbjct: 87 PMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVGDGSKD 146
Query: 154 HNCWERPEDMDTLRTAYKIDVSHPGSDVAGETXXXXXXXSIVFRSRDPRYSRLLLSRAIE 213
H W E M R ++K+ S PGS V ET SIV + R+P + L A E
Sbjct: 147 HAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYLQHAKE 206
Query: 214 VFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKASRRREY---REYILRN 270
++EFA+ + S + + +Y +G+ DEL W A WL+ A+ Y E ++N
Sbjct: 207 LYEFAEVTK---SDAGYTAANGYYNSWSGFYDELSWAAVWLYLATNDSTYLTKAESYVQN 263
Query: 271 EVVLHAGDTIN---EFGWDNKHAGINVLISKEVLMGKADYFESFKQNADGFICSILPGIS 327
+ +TI+ WD+ H G +L++K + GK + +KQ + + G +
Sbjct: 264 WPKISGSNTIDYKWAHCWDDVHNGAALLLAK--ITGK----DIYKQIIESHLDYWTTGYN 317
Query: 328 HPQVQYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSASPNLLRQLA 387
+++Y+P GL + ++++ T+ +FL YS++V G S + R+
Sbjct: 318 GERIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWV---------GCPSTKKEIYRKFG 368
Query: 388 KHQVDYILGDNPVGMSYMVGYGARYPQRIHHRG--SSLPSVQAHPAH 432
+ Q+DY LG G S++VG+G P+R HHR SS Q+ P++
Sbjct: 369 ESQIDYALGS--AGRSFVVGFGTNPPKRPHHRTAHSSWADSQSIPSY 413
>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
Endoglucanase From Separately Expressed Gh9 And Cbm3c
Modules
Length = 466
Score = 211 bits (536), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 134/409 (32%), Positives = 206/409 (50%), Gaps = 31/409 (7%)
Query: 36 DYHRALHMSILFFEGQRSGRLPPDQ-RVKWRRDSALHDGATSGVDLTGGYYDAGDNVKFG 94
+Y AL +I F+E QRSG+L R+ WR DS L DG +G+DLTGG+YDAGD+VKF
Sbjct: 25 NYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFN 84
Query: 95 FPMAFTTTMLSWSIIDFGKNMGPELKY--AIKAVKWGTDYLLKATAKPGVVYVQVGDAFS 152
PM+++ ML W++ ++ +Y + +KW DY +K + V Y QVGD +
Sbjct: 85 LPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQVGDGHA 144
Query: 153 DHNCWERPEDMDTLRTAYKIDVSHPGSDVAGETXXXXXXXSIVFRSRDPRYSRLLLSRAI 212
DH W E M R +YK+D S PGS V ET SI+F+ D YS+ L A
Sbjct: 145 DHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKECLKHAK 204
Query: 213 EVFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKASRRREY--REYILRN 270
E+FEFAD + S + FY +G+ DEL W A WL+ A+ Y + +
Sbjct: 205 ELFEFADTTK---SDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESYSD 261
Query: 271 EVVLHAGDTINEFG----WDNKHAGINVLISKEVLMGKADYFESFKQNADGFICSILPGI 326
+ I ++ WD+ G +L+++ + Y E+ +++ D + G
Sbjct: 262 KWGYEPQTNIPKYKWAQCWDDVTYGTYLLLAR-IKNDNGKYKEAIERHLDWWTT----GY 316
Query: 327 SHPQVQYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSASPNLLRQL 386
+ ++ Y+P GL + ++++ T+ +FL YS++ + G + + L +
Sbjct: 317 NGERITYTPKGLAWLDQWGSLRYATTTAFLACVYSDWEN--------GDKEKAKTYL-EF 367
Query: 387 AKHQVDYILGDNPVGMSYMVGYGARYPQRIHHR---GSSLPSVQAHPAH 432
A+ Q DY LG G S++VG+G P+R HHR GS S P H
Sbjct: 368 ARSQADYALG--STGRSFVVGFGENPPKRPHHRTAHGSWADSQMEPPEH 414
>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticum
pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticium In Complex With Cellobiose
Length = 441
Score = 179 bits (455), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 195/404 (48%), Gaps = 49/404 (12%)
Query: 32 ASQNDYHRALHMSILFFEGQRSGRLPPDQRV-KWRRDSALHDGATSGVDLTGGYYDAGDN 90
A +DY AL SI+FF+ + G + V WR DG+ GVDLTGGY+DAGD+
Sbjct: 1 AGTHDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDH 60
Query: 91 VKFGFPMAFTTTMLSWSIIDFGKNM---GPELKYAIKAVKWGTDYLLKATAKPGVVYVQV 147
VKFG P ++ +L WS+ +F ++ G K ++ +K+ TDY LK+ Y QV
Sbjct: 61 VKFGLPQGYSAAILGWSLYEFKESFDATGNTTK-MLQQLKYFTDYFLKSHPNSTTFYYQV 119
Query: 148 GDAFSDHNCWERPEDMDTLR-TAYKIDVSHPGSDVAGETXXXXXXXSIVFRSRDPRYSRL 206
G+ +DH W PE+ R + YK D S P SD+ ET + +++ D Y+
Sbjct: 120 GEGNADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATK 179
Query: 207 LLSRAIEVFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKASRRREYREY 266
L+ A E++ ++G + + F D+L W A WL+ A+ Y
Sbjct: 180 CLNAAKELYAMGKANQGVGNGQSFYQATSF-------GDDLAWAATWLYTATND---STY 229
Query: 267 ILRNEVVLHAGDTINE--------FGWDNKHAGINVLISKEVLMGKADYFESFKQNADGF 318
I E + G+T+NE WD+ + + +++ + GK Y ++ + N + +
Sbjct: 230 ITDAEQFITLGNTMNENKMQDKWTMCWDDMYVPAALRLAQ--ITGKQIYKDAIEFNFNYW 287
Query: 319 ICSILPGISHPQVQYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSA 378
QV +PGGL + + +++ + S ++L Y K P S
Sbjct: 288 ---------KTQVTTTPGGLKWLSNWGVLRYAAAESMVMLVYC-------KQNPDQS--- 328
Query: 379 SPNLLRQLAKHQVDYILGDNPVGMSYMVGYGARYPQRIHHRGSS 422
L LAK QVDYILGDNP MSY++GYG+ + HHR ++
Sbjct: 329 ----LLDLAKKQVDYILGDNPANMSYIIGYGSNWCIHPHHRAAN 368
>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
Length = 611
Score = 120 bits (301), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 111/451 (24%), Positives = 190/451 (42%), Gaps = 87/451 (19%)
Query: 31 SASQNDYHRALHMSILFFEGQRSGR-LPPDQRVKWRRDSALHDG---------------- 73
+A + +Y +AL S+ F++ G + + + WR D ++D
Sbjct: 34 AAGEYNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFI 93
Query: 74 ----------ATSGVDLTGGYYDAGDNVKFGFPMAFTTTMLSWSIIDFGKNMGPELKYAI 123
VD++GG++DAGD+VKFG P A+ + + W +F K+ A+
Sbjct: 94 SSNRSVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEF-KDQFRATGQAV 152
Query: 124 KA---VKWGTDYLLKAT---AKPGVVYV--QVGDAFSDHNCWERPEDMDTLRTAYKIDVS 175
A +++ DY ++ T A VV QVGD DH W PE+ R + I
Sbjct: 153 HAEVILRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKE 212
Query: 176 HPGSDVAGETXXXXXXXSIVFRSRDPRYSRLLLSRAIEVFEFADRH-----------RGA 224
PG+D+ T + F+ DP+Y+ L A +F+FA+++ +G
Sbjct: 213 KPGTDIISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQGEDGPKGY 272
Query: 225 YSSSLHHVVCPFYCDVNGYQDELLWGAAWLHKASRRREYREYILRNEVVLHAGDTINEFG 284
Y SS +QD+ W AAWL+ A++ Y + + I+
Sbjct: 273 YGSS-------------KWQDDYCWAAAWLYLATQNEHYLDEAFKYYDYYAPPGWIH--C 317
Query: 285 WDNKHAG-------INVLISKEVLMGKADYFESFKQNA----DGF--ICSILPGISHPQV 331
W++ +G IN L K+ + Y + +N D + I +L S +
Sbjct: 318 WNDVWSGTACILAEINDLYDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGI 377
Query: 332 QYSPGGLIFKAGGSNMQHVTSLSFLLLAYSNYVSHANKVVPCGSRSASPNLLRQLAKHQV 391
+PGG +F + ++ T+ + L Y + H + +P+ A+ Q+
Sbjct: 378 TVTPGGYVFLNQWGSARYNTAAQLIALVYDKH--HGD----------TPSKYANWARSQM 425
Query: 392 DYILGDNPVGMSYMVGYGARYPQRIHHRGSS 422
DY+LG NP+ Y+VGY + + HHR +S
Sbjct: 426 DYLLGKNPLNRCYVVGYSSNSVKYPHHRAAS 456
>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
Thermoacidophilic Alicyclobacillus Acidocaldarius
pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellobiose
pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellotetraose
pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
Isofagomine
pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
Isofagomine
pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
Isofagomine
Length = 537
Score = 46.6 bits (109), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 95/426 (22%), Positives = 145/426 (34%), Gaps = 72/426 (16%)
Query: 37 YHRALHMSILFFEGQRSGR-LPPDQRVKWRRDSALHD------GATSGVDLTGGYYDAGD 89
Y L + FF+ Q G LP D+ W A H G + GG++DAGD
Sbjct: 88 YRDVLEAMLRFFDYQLCGVVLPEDEAGPWAH-GACHTSDAKVFGTERALACPGGWHDAGD 146
Query: 90 NVKFGFPMAFTTTMLSWSIIDFGKNMG-----------PELKYAIKAVKWGTDYLL--KA 136
K+ P A L + F + P L A++ + +LL +
Sbjct: 147 YGKYTVPAAKAVADLLLAHEYFPAALAHVRPMRSVHRAPHLPPALEVAREEIAWLLTMQD 206
Query: 137 TAKPGVVYVQVGDAFSDHNCWERPEDMDTLRTAYKIDVSHPGSDVAGETXXXXXXXSIVF 196
A GV + +F + RPED D I + + A ++V+
Sbjct: 207 PATGGVYHKVTTPSFPPLDT--RPEDDDAPLVLSPISYAATATFCAA-----MAHAALVY 259
Query: 197 RSRDPRYSRLLLSRAIEVFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWLHK 256
R DP S A + + H + ++ Y D +DELLW + L +
Sbjct: 260 RPFDPALSSCCADAARRAYAWLGAHEMQPFHNPDGILTGEYGDAE-LRDELLWASCALLR 318
Query: 257 ASRRREYREYILRNEVVLHAGDTINEFGWDNKHAGINVLISKEVLMGKADYFES------ 310
+ + E +L D E GW + L G DY +
Sbjct: 319 MTGDSAWARVC---EPLLDL-DLPWELGW-----------ADVALYGVMDYLRTPRAAVS 363
Query: 311 --FKQNADGFICSILPGISHPQVQYSPGGLIFKAG----GSNMQHVTSLSFLLLAYSNYV 364
+ + L ++ + P G+ + GSNM + LLA V
Sbjct: 364 DDVRNKVKSRLLRELDALAA-MAESHPFGIPMRDDDFIWGSNMVLLNRAMAFLLAEGVGV 422
Query: 365 SHANKVVPCGSRSASPNLLRQLAKHQVDYILGDNPVGMSYMVGYGARYPQRIHHRGSSLP 424
H +A+ DY+ G NP+G Y+ G+G R + HHR S
Sbjct: 423 LHPAA--------------HTVAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHRPSVAD 468
Query: 425 SVQAHP 430
V HP
Sbjct: 469 DVD-HP 473
>pdb|3H7L|A Chain A, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
pdb|3H7L|B Chain B, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
pdb|3H7L|C Chain C, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
Length = 586
Score = 31.2 bits (69), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 63/131 (48%), Gaps = 22/131 (16%)
Query: 38 HRALHMSILFFEGQRSGRLPPDQRVKWRRDSALHDGATSGVDLTGGYYDA-GDNVKFGFP 96
++ L I +F+ QR G + Q R+ L+ T+ D+ GG+YDA GD K+
Sbjct: 98 NQTLSDVIHYFKSQRCGGVFDQQD---RQVPVLNANQTA--DVHGGWYDASGDVSKYLSH 152
Query: 97 MAFT-------TTMLSWSIIDFGKNM--GPELKYA------IKAVKWGTDYLLKATAKPG 141
+++ T M+ W+I+ G ++ G E A I+ +G D+L++ + G
Sbjct: 153 LSYANYLNPQQTPMVVWNILK-GLSLLEGSEDIAAFTRTRLIEEALFGADFLVRMQNEKG 211
Query: 142 VVYVQVGDAFS 152
Y+ V D +S
Sbjct: 212 FFYMTVFDKWS 222
>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 609
Score = 30.0 bits (66), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 391 VDYILGDNPVGMSYMVGYGARYPQRIHHRGSSLPSVQAHPA 431
+ Y+LG N + SY+ GYG R Q H R + + + PA
Sbjct: 504 ISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPA 544
>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 610
Score = 30.0 bits (66), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 22/41 (53%)
Query: 391 VDYILGDNPVGMSYMVGYGARYPQRIHHRGSSLPSVQAHPA 431
+ Y+LG N + SY+ GYG R Q H R + + + PA
Sbjct: 504 ISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPA 544
>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
Angstroms Resolution
Length = 639
Score = 28.9 bits (63), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 23/94 (24%), Positives = 40/94 (42%), Gaps = 4/94 (4%)
Query: 195 VFRSRDPRYSRLLLSRAIEVFEFADRHRGAYSSSLHHVVCPFYCDVNGYQDELLWGAAWL 254
+FR DP+Y+ ++ A +EF + ++ Y V+ D+ LW AA +
Sbjct: 302 IFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFSTGEYATVSD-ADDRLWAAAEM 360
Query: 255 HKASRRREY-REYILRNEVVLHAGDTINEFGWDN 287
+ EY R++ N + +F WDN
Sbjct: 361 WETLGDEEYLRDF--ENRAAQFSKKIEADFDWDN 392
Database: pdbaa
Posted date: Mar 3, 2013 10:34 PM
Number of letters in database: 14,973,337
Number of sequences in database: 62,578
Lambda K H
0.320 0.136 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,484,863
Number of Sequences: 62578
Number of extensions: 628976
Number of successful extensions: 1408
Number of sequences better than 100.0: 14
Number of HSP's better than 100.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 1356
Number of HSP's gapped (non-prelim): 22
length of query: 460
length of database: 14,973,337
effective HSP length: 102
effective length of query: 358
effective length of database: 8,590,381
effective search space: 3075356398
effective search space used: 3075356398
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (25.0 bits)