BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 006835
(629 letters)
Database: pdbaa
62,578 sequences; 14,973,337 total letters
Searching..................................................done
>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 2.5.
pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 5.6.
pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 6.5
Length = 433
Score = 357 bits (915), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 205/469 (43%), Positives = 271/469 (57%), Gaps = 43/469 (9%)
Query: 21 HDYGQALSKSILFFEAQRSGYLPHNQRVTWRANSGLNDGKASGVDLVGGYYDAGDNVKFG 80
+DY Q L S+LF+EAQRSG LP +Q+VTWR +S LND G DL GGY+DAGD VKFG
Sbjct: 3 YDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFG 62
Query: 81 LPMAFTVTMMSWSIIEYGKQMAASGELGHAMNAIKWGTDYFIKAHPQPNVLYGEVGDGNT 140
PMA+T T+++W +I++ +++G L A+KW TDYFIKAH N YG+VG G+
Sbjct: 63 FPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGDA 122
Query: 141 DHNCWQRPEDMTTDRRAYKIDPSNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRHAY 200
DH W RPEDMT R AYKID S PGSDLAGET SIVFR+ D YS+ LL HA
Sbjct: 123 DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHAR 182
Query: 201 QLFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQASGNQHYLDYLGKNGDS 260
QLFDFA+ YRGKY SIT A+ +Y S + Y DEL+WAAAWLY+A+ + YL+ D
Sbjct: 183 QLFDFANNYRGKYSDSITDARNFYAS-ADYRDELVWAAAWLYRATNDNTYLNTAESLYDE 241
Query: 261 MGGTGWGMTEFGWDVKYPGVQTLVAKFLMQGKAGHYAPVFERYQQKAEYFMCSCLGKGSR 320
G WG WD K GVQ L+AK + + Y+ + ++ +
Sbjct: 242 FGLQNWG-GGLNWDSKVSGVQVLLAKLTNK----------QAYKDTVQSYVNYLI----N 286
Query: 321 NVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAPAELLGFAKS 380
N QKTP GL++ W ++ +A+F+ L+ + ++ + FA++
Sbjct: 287 NQQKTPKGLLYIDMWGTLRHAANAAFIM-------------LEAAELGLSASSYRQFAQT 333
Query: 381 QVDYILGDNPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYATWFSSKGSD 440
Q+DY LGD R S++ G+G+N P R HHR+SS P+ +C W + D
Sbjct: 334 QIDYALGDGGR--SFVCGFGSNPPTRPHHRSSSCP-----PAPATC-----DWNTFNSPD 381
Query: 441 PN--LLVGAVVGGPDAYDNFGDRRDNYEQTEPATYNNAPILGILARLNA 487
PN +L GA+VGGPD DN+ D R +Y E AT NA LA L A
Sbjct: 382 PNYHVLSGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALAALVA 430
>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
Length = 605
Score = 320 bits (819), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 193/480 (40%), Positives = 267/480 (55%), Gaps = 48/480 (10%)
Query: 22 DYGQALSKSILFFEAQRSGYLPHNQRVTWRANSGLNDGKASGVDLVGGYYDAGDNVKFGL 81
+Y +AL KS+ F+EAQRSG LP N RV+WR +SGLNDG G+DL GG+YDAGD+VKFG
Sbjct: 5 NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGF 64
Query: 82 PMAFTVTMMSWSIIEYGKQMAASGELGHAMNAIKWGTDYFIKAHPQPNVLYGEVGDGNTD 141
PMAFT TM++W IE + SG++ + + ++W DYFIKAHP PNVLY +VGDG+ D
Sbjct: 65 PMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDAD 124
Query: 142 HNCWQRPEDMTTDRRAYKIDPSNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRHAYQ 201
H W E M +R ++K+DPS PGSD+A ET SIVF DPAY++ L++HA Q
Sbjct: 125 HKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAKQ 184
Query: 202 LFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQASGNQHYL-------DYL 254
L+ FAD YRG Y + A +Y S SGY DEL+W A WLY+A+G+ YL D+L
Sbjct: 185 LYTFADTYRGVYSDCVP-AGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDFL 243
Query: 255 GKNGDS-MGGTGWGMTEFGWDVKYPGVQTLVAKFLMQGKAGHYAPVFERYQQKAEYFMCS 313
+ + W + WD K G L+AK GK ++Y A ++
Sbjct: 244 STEQQTDLRSYRWTI---AWDDKSYGTYVLLAK--ETGK--------QKYIDDANRWLDY 290
Query: 314 -CLGKGSRNVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAPA 372
+G + V +PGG+ W +++ + +F+A VY+ + R +
Sbjct: 291 WTVGVNGQRVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDPVRKQRYH------- 343
Query: 373 ELLGFAKSQVDYILGDNPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYAT 432
FA Q++Y LGDNPR +SY+VG+GNN P+ HHR + G +
Sbjct: 344 ---DFAVRQINYALGDNPRNSSYVVGFGNNPPRNPHHR--------------TAHGSWTD 386
Query: 433 WFSSKGSDPNLLVGAVVGGP-DAYDNFGDRRDNYEQTEPATYNNAPILGILARLNAGHGG 491
+S + ++L GA+VGGP D + D R +Y E AT NA LA L +GG
Sbjct: 387 SIASPAENRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEEYGG 446
>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
Endoglucanase From Separately Expressed Gh9 And Cbm3c
Modules
Length = 466
Score = 310 bits (795), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 184/474 (38%), Positives = 258/474 (54%), Gaps = 49/474 (10%)
Query: 22 DYGQALSKSILFFEAQRSGYLPHNQ-RVTWRANSGLNDGKASGVDLVGGYYDAGDNVKFG 80
+YG+AL K+I F+E QRSG L + R+ WR +SGL+DGK +G+DL GG+YDAGD+VKF
Sbjct: 25 NYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFN 84
Query: 81 LPMAFTVTMMSWSIIEYGKQMAASGELGHAMNAIKWGTDYFIKAHPQPNVLYGEVGDGNT 140
LPM+++ M+ W++ EY SG+ H +N IKW DYFIK HP+ +V Y +VGDG+
Sbjct: 85 LPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQVGDGHA 144
Query: 141 DHNCWQRPEDMTTDRRAYKIDPSNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRHAY 200
DH W E M +R +YK+D S+PGS + ET SI+F+ D YS E L+HA
Sbjct: 145 DHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKECLKHAK 204
Query: 201 QLFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQASGNQHYLDYLGKNGDS 260
+LF+FAD K D T A +Y S SG+ DEL WAA WLY A+ + YLD D
Sbjct: 205 ELFEFADT--TKSDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESYSDK 262
Query: 261 MG-GTGWGMTEFGW-----DVKYPGVQTLVAKFLMQGKAGHYAPVFERYQQKAEYFMCSC 314
G + ++ W DV Y G L+A+ ++ G Y ER +
Sbjct: 263 WGYEPQTNIPKYKWAQCWDDVTY-GTYLLLAR--IKNDNGKYKEAIER------HLDWWT 313
Query: 315 LGKGSRNVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAPAEL 374
G + TP GL + +W ++++ T+ +FLA VYSD+ K
Sbjct: 314 TGYNGERITYTPKGLAWLDQWGSLRYATTTAFLACVYSDWENGDKEKAKT---------Y 364
Query: 375 LGFAKSQVDYILGDNPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYATWF 434
L FA+SQ DY LG R S++VG+G N P+R HHR + + +W
Sbjct: 365 LEFARSQADYALGSTGR--SFVVGFGENPPKRPHHRTA-----------------HGSWA 405
Query: 435 SSKGSDP---NLLVGAVVGGPDAYDNFGDRRDNYEQTEPATYNNAPILGILARL 485
S+ P ++L GA+VGGPD+ DN+ D NY E A NA +G+LA++
Sbjct: 406 DSQMEPPEHRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLAKM 459
>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
Length = 475
Score = 273 bits (699), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 175/481 (36%), Positives = 257/481 (53%), Gaps = 53/481 (11%)
Query: 22 DYGQALSKSILFFEAQRSGYLPHNQRVTWRANSGLNDGKASGVDLVGGYYDAGDNVKFGL 81
+YG+AL K+I+F+E Q SG LP+ R WR +S L DG+ +G+DL GG++DAGD+VKF L
Sbjct: 27 NYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDHVKFNL 86
Query: 82 PMAFTVTMMSWSIIEYGKQMAASGELGHAMNAIKWGTDYFIKAHPQPNVLYGEVGDGNTD 141
PM++T TM+SW++ EY SG+L H +N I+W DYF+K HP V Y +VGDG+ D
Sbjct: 87 PMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVGDGSKD 146
Query: 142 HNCWQRPEDMTTDRRAYKIDPSNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRHAYQ 201
H W E M +R ++K+ S+PGS + ET SIV + +P ++ L+HA +
Sbjct: 147 HAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYLQHAKE 206
Query: 202 LFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQASGNQHYL---DYLGKNG 258
L++FA+ K D T A YY S SG+ DEL WAA WLY A+ + YL + +N
Sbjct: 207 LYEFAE--VTKSDAGYTAANGYYNSWSGFYDELSWAAVWLYLATNDSTYLTKAESYVQNW 264
Query: 259 DSMGGTG-----WGMTEFGWDVKYPGVQTLVAKFLMQGKAGHYAPVFERYQQKAE-YFMC 312
+ G+ W WD + G L+AK + GK + Y+Q E +
Sbjct: 265 PKISGSNTIDYKWAHC---WDDVHNGAALLLAK--ITGK--------DIYKQIIESHLDY 311
Query: 313 SCLGKGSRNVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAPA 372
G ++ TP GL + +W ++++ T+ +FLA VYSD++ +
Sbjct: 312 WTTGYNGERIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWVGCPSTKKEI-------- 363
Query: 373 ELLGFAKSQVDYILGDNPRATSYMVGYGNNYPQRVHHRA--SSIVSIKVNPSFVSCRGGY 430
F +SQ+DY LG R S++VG+G N P+R HHR SS + PS+
Sbjct: 364 -YRKFGESQIDYALGSAGR--SFVVGFGTNPPKRPHHRTAHSSWADSQSIPSY------- 413
Query: 431 ATWFSSKGSDPNLLVGAVVGGPDAYDNFGDRRDNYEQTEPATYNNAPILGILARLNAGHG 490
+ L GA+VGGP + D++ D NY E A NA +G LA++ +G
Sbjct: 414 ---------HRHTLYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALAKMYQLYG 464
Query: 491 G 491
G
Sbjct: 465 G 465
>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
Length = 614
Score = 271 bits (692), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 178/480 (37%), Positives = 250/480 (52%), Gaps = 50/480 (10%)
Query: 21 HDYGQALSKSILFFEAQRSGYLPHNQRVTWRANSGLNDGKASGVDLVGGYYDAGDNVKFG 80
++YG+AL KSI+F+E QRSG LP ++R WR +SG+ DG GVDL GG+YDAGD+VKF
Sbjct: 4 YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63
Query: 81 LPMAFTVTMMSWSIIEYGKQMAASGELGHAMNAIKWGTDYFIKAHPQPNVLYGEVGDGNT 140
LPM++T M++WS+ E SG+ + M+ IKW DYFIK +P P V Y +VGDG
Sbjct: 64 LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123
Query: 141 DHNCWQRPEDMTTDRRAYKIDPSNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRHAY 200
DH+ W E M +R ++K+D S PGS + T ++VF+ SDP Y+ + + HA
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183
Query: 201 QLFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQASGNQHYLD----YLGK 256
LFD ADK K D T A Y S S + D+L WAA WLY A+ + YLD Y+
Sbjct: 184 NLFDMADK--AKSDAGYTAASG-YYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPN 240
Query: 257 NGDS----MGGTGWGMTEFGWDVKYPGVQTLVAKFLMQGKAGHYAPVFERYQQKAEYFM- 311
G + WG WD + G + L+AK + + Y+ E +
Sbjct: 241 WGKEQQTDIIAYKWGQC---WDDVHYGAELLLAKLTNK----------QLYKDSIEMNLD 287
Query: 312 CSCLGKGSRNVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAP 371
G V TP GL + +W +++ T+ +FLA VY+++ C+ V+
Sbjct: 288 FWTTGVNGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWEG-------CTPSKVSV 340
Query: 372 AELLGFAKSQVDYILGDNPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYA 431
+ F KSQ+DY LG R S++VGYG N PQ HHR + G +
Sbjct: 341 YK--DFLKSQIDYALGSTGR--SFVVGYGVNPPQHPHHR--------------TAHGSWT 382
Query: 432 TWFSSKGSDPNLLVGAVVGGPDAYDNFGDRRDNYEQTEPATYNNAPILGILARLNAGHGG 491
+S + + GA+VGGPD D + D +NY E A NA G LA++ GG
Sbjct: 383 DQMTSPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGG 442
>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
Length = 614
Score = 271 bits (692), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 178/480 (37%), Positives = 250/480 (52%), Gaps = 50/480 (10%)
Query: 21 HDYGQALSKSILFFEAQRSGYLPHNQRVTWRANSGLNDGKASGVDLVGGYYDAGDNVKFG 80
++YG+AL KSI+F+E QRSG LP ++R WR +SG+ DG GVDL GG+YDAGD+VKF
Sbjct: 4 YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63
Query: 81 LPMAFTVTMMSWSIIEYGKQMAASGELGHAMNAIKWGTDYFIKAHPQPNVLYGEVGDGNT 140
LPM++T M++WS+ E SG+ + M+ IKW DYFIK +P P V Y +VGDG
Sbjct: 64 LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123
Query: 141 DHNCWQRPEDMTTDRRAYKIDPSNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRHAY 200
DH+ W E M +R ++K+D S PGS + T ++VF+ SDP Y+ + + HA
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183
Query: 201 QLFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQASGNQHYLD----YLGK 256
LFD ADK K D T A Y S S + D+L WAA WLY A+ + YLD Y+
Sbjct: 184 NLFDMADK--AKSDAGYTAASG-YYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPN 240
Query: 257 NGDS----MGGTGWGMTEFGWDVKYPGVQTLVAKFLMQGKAGHYAPVFERYQQKAEYFM- 311
G + WG WD + G + L+AK + + Y+ E +
Sbjct: 241 WGKEQQTDIIAYKWGQX---WDDVHYGAELLLAKLTNK----------QLYKDSIEMNLD 287
Query: 312 CSCLGKGSRNVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAP 371
G V TP GL + +W +++ T+ +FLA VY+++ C+ V+
Sbjct: 288 FWTTGVNGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWEG-------CTPSKVSV 340
Query: 372 AELLGFAKSQVDYILGDNPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYA 431
+ F KSQ+DY LG R S++VGYG N PQ HHR + G +
Sbjct: 341 YK--DFLKSQIDYALGSTGR--SFVVGYGVNPPQHPHHR--------------TAHGSWT 382
Query: 432 TWFSSKGSDPNLLVGAVVGGPDAYDNFGDRRDNYEQTEPATYNNAPILGILARLNAGHGG 491
+S + + GA+VGGPD D + D +NY E A NA G LA++ GG
Sbjct: 383 DQMTSPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGG 442
>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticum
pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticium In Complex With Cellobiose
Length = 441
Score = 239 bits (611), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 161/474 (33%), Positives = 235/474 (49%), Gaps = 64/474 (13%)
Query: 21 HDYGQALSKSILFFEAQRSG-YLPHNQRVTWRANSGLNDGKASGVDLVGGYYDAGDNVKF 79
HDY AL SI+FF+A + G N WR DG GVDL GGY+DAGD+VKF
Sbjct: 4 HDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVKF 63
Query: 80 GLPMAFTVTMMSWSIIEYGKQMAASGELGHAMNAIKWGTDYFIKAHPQPNVLYGEVGDGN 139
GLP ++ ++ WS+ E+ + A+G + +K+ TDYF+K+HP Y +VG+GN
Sbjct: 64 GLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEGN 123
Query: 140 TDHNCWQRPEDMTTDRRA-YKIDPSNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRH 198
DH W PE+ T R + YK DPS+P SD+ ET + +++ D AY+++ L
Sbjct: 124 ADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLNA 183
Query: 199 AYQLFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQASGNQHYL---DYLG 255
A +L+ GK + + Q +Y++ S + D+L WAA WLY A+ + Y+ +
Sbjct: 184 AKELYAM-----GKANQGVGNGQSFYQATS-FGDDLAWAATWLYTATNDSTYITDAEQFI 237
Query: 256 KNGDSMGGT----GWGMTEFGWDVKYPGVQTLVAKFLMQGKAGHYAPVFERYQQKAEYFM 311
G++M W M WD Y +A+ + GK + Y+ E+
Sbjct: 238 TLGNTMNENKMQDKWTMC---WDDMYVPAALRLAQ--ITGK--------QIYKDAIEF-- 282
Query: 312 CSCLGKGSRNVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAP 371
V TPGGL + W +++ + S + VY C
Sbjct: 283 --NFNYWKTQVTTTPGGLKWLSNWGVLRYAAAESMVMLVY------------CKQN--PD 326
Query: 372 AELLGFAKSQVDYILGDNPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYA 431
LL AK QVDYILGDNP SY++GYG+N+ HHRA++ GY
Sbjct: 327 QSLLDLAKKQVDYILGDNPANMSYIIGYGSNWCIHPHHRAAN---------------GY- 370
Query: 432 TWFSSKGSDP--NLLVGAVVGGPDAYDNFGDRRDNYEQTEPATYNNAPILGILA 483
T+ + + P +LL GA+VGGPD D F D + Y+ TE A NA ++G+LA
Sbjct: 371 TYANGDNAKPAKHLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLA 424
>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
Length = 611
Score = 205 bits (521), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 155/510 (30%), Positives = 241/510 (47%), Gaps = 80/510 (15%)
Query: 21 HDYGQALSKSILFFEAQRSGY-LPHNQRVTWRA--------------NSGLNDGKASG-- 63
++Y +AL S+ F++A G + N ++WR N+ ++DG S
Sbjct: 38 YNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSNR 97
Query: 64 ----------VDLVGGYYDAGDNVKFGLPMAFTVTMMSWSIIEYGKQMAASGELGHAMNA 113
VD+ GG++DAGD+VKFGLP A+ + + W E+ Q A+G+ HA
Sbjct: 98 SVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEVI 157
Query: 114 IKWGTDYFIKA---HPQPNVL--YGEVGDGNTDHNCWQRPEDMTTDRRAYKIDPSNPGSD 168
+++ DYF++ NV+ +VGDG+ DH W PE+ T RR + I PG+D
Sbjct: 158 LRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGTD 217
Query: 169 LAGETXXXXXXXSIVFRHSDPAYSSELLRHAYQLFDFADKY-RGKYDGSITVAQKYYRSI 227
+ T + F+ +DP Y+++ L +A LFDFA+K +G G K Y
Sbjct: 218 IISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQGED--GPKGYYGS 275
Query: 228 SGYNDELLWAAAWLYQASGNQHYLDYLGKNGDSMGGTGWGMTEFGWDVKYPGVQTLVAKF 287
S + D+ WAAAWLY A+ N+HYLD K D GW W+ + G ++A+
Sbjct: 276 SKWQDDYCWAAAWLYLATQNEHYLDEAFKYYDYYAPPGWIHC---WNDVWSGTACILAEI 332
Query: 288 --LMQGKAGHYAPVFERYQQKAEYFMC-------SCLGKGS-RNVQKTPGGLIFRQRWNN 337
L + ++ ++R K ++ L K S + TPGG +F +W +
Sbjct: 333 NDLYDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGITVTPGGYVFLNQWGS 392
Query: 338 MQFVTSASFLATVYSDYLASAGRDLKCSAGNVAPAELLGFAKSQVDYILGDNPRATSYMV 397
++ T+A +A VY + P++ +A+SQ+DY+LG NP Y+V
Sbjct: 393 ARYNTAAQLIALVYDKHHGD------------TPSKYANWARSQMDYLLGKNPLNRCYVV 440
Query: 398 GYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYATWFSSKGSDPN--LLVGAVVGGPDAY 455
GY +N + HHRA+S + + S P+ +L GA+VGGPDA
Sbjct: 441 GYSSNSVKYPHHRAASGLK------------------DANDSSPHKYVLYGALVGGPDAS 482
Query: 456 DNFGDRRDNYEQTEPATYNNAPILGILARL 485
D DR ++Y E A NA +G A L
Sbjct: 483 DQHVDRTNDYIYNEVAIDYNAAFVGACAGL 512
>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
Thermoacidophilic Alicyclobacillus Acidocaldarius
pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellobiose
pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellotetraose
pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
Isofagomine
pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
Isofagomine
pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
Isofagomine
Length = 537
Score = 50.1 bits (118), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 107/488 (21%), Positives = 182/488 (37%), Gaps = 59/488 (12%)
Query: 23 YGQALSKSILFFEAQRSGY-LPHNQRVTW-RANSGLNDGKASGVD----LVGGYYDAGDN 76
Y L + FF+ Q G LP ++ W +D K G + GG++DAGD
Sbjct: 88 YRDVLEAMLRFFDYQLCGVVLPEDEAGPWAHGACHTSDAKVFGTERALACPGGWHDAGDY 147
Query: 77 VKFGLPMAFTVTMMSWS------IIEYGKQMAASGELGHAMNAIKWGTD----YFIKAHP 126
K+ +P A V + + + + + M + H A++ + P
Sbjct: 148 GKYTVPAAKAVADLLLAHEYFPAALAHVRPMRSVHRAPHLPPALEVAREEIAWLLTMQDP 207
Query: 127 QPNVLYGEVGDGNTDHNCWQRPEDMTTDRRAYKIDPSNPGSDLAGETXXXXXXXSIVFRH 186
+Y +V + RPED D + P + + ++V+R
Sbjct: 208 ATGGVYHKVTTPSFP-PLDTRPED---DDAPLVLSPISYAA--TATFCAAMAHAALVYRP 261
Query: 187 SDPAYSS---ELLRHAYQLFDFADKYRGKYDGSITVAQKYYRSISGYNDELLWAAAWLYQ 243
DPA SS + R AY A + + ++ + +Y + DELLWA+ L +
Sbjct: 262 FDPALSSCCADAARRAYAWLG-AHEMQPFHNPDGILTGEY--GDAELRDELLWASCALLR 318
Query: 244 ASGNQHYLDYLGKNGDSMGGTGWGMTEFGW-DVKYPGVQTLVAKFLMQGKAGHYAPVFER 302
+G+ + D W E GW DV GV +L +A V +
Sbjct: 319 MTGDSAWARVCEPLLDL--DLPW---ELGWADVALYGVM----DYLRTPRAA----VSDD 365
Query: 303 YQQKAEYFMCSCLGKGSRNVQKTPGGLIFRQRWNNMQFVTSASFLATVYSDYLASAGRDL 362
+ K + + L + + P G+ R ++ + ++ L + LA L
Sbjct: 366 VRNKVKSRLLRELDALAAMAESHPFGIPMRD--DDFIWGSNMVLLNRAMAFLLAEGVGVL 423
Query: 363 KCSAGNVAPAELLGFAKSQVDYILGDNPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPS 422
+A VA + DY+ G NP Y+ G+G + HHR S + +P
Sbjct: 424 HPAAHTVA--------QRAADYLFGANPLGQCYVTGFGQRPVRHPHHRPSVADDVD-HPV 474
Query: 423 FVSCRGGYATWFSSKGSDPNLLVGAVVGGPDAYDNFGDRRDNYEQTEPATYNNAPILGIL 482
GG + + A + G A + + D +D+Y E A Y N+P + ++
Sbjct: 475 PGMVVGGPNRHLQDE------IARAQLAGRPAMEAYIDHQDSYSTNEVAVYWNSPAVFVI 528
Query: 483 ARLNAGHG 490
A L G
Sbjct: 529 AALLEARG 536
>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 609
Score = 32.7 bits (73), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 61/278 (21%), Positives = 96/278 (34%), Gaps = 50/278 (17%)
Query: 232 DELLWAAAWLYQASGNQ----------HYLDYLGKNGDSMGGTG-----WGMTEFGWDVK 276
DE WAA LY +G HYL+ K G++ G G WG F W
Sbjct: 355 DEFYWAACELYVTTGKDEYKNYLMNSPHYLEMPAKMGENGGANGEDNGLWGC--FTWGTT 412
Query: 277 YPGVQTLVAKFLMQGKAGHYAPVFERYQQKAEYFMCSCLGKGSRNVQKTPGGLIFRQ--- 333
G+ T+ + G P + QKA + + N+++ L +Q
Sbjct: 413 Q-GLGTITLALVENG-----LPATDI--QKARNNIAKAADRWLENIEEQGYRLPIKQAED 464
Query: 334 -----RWNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAPAELLGFAKSQVDYILGD 388
W + F+ + + D+ + + L + Y+LG
Sbjct: 465 ERGGYPWGSNSFILNQMIVMGYAYDFTGDS--------------KYLDGMFDGISYLLGR 510
Query: 389 NPRATSYMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYATWFSSKGSDPNLLVGAV 448
N SY+ GYG Q H R + + K P+ G + +S+ DP +
Sbjct: 511 NAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPA--PPPGIISGGPNSRFEDPTINAAVK 568
Query: 449 VGGPDAYDNFGDRRDNYEQTEPATYNNAPILGILARLN 486
P F D D++ E NAP + A L+
Sbjct: 569 KDTP-PQKCFIDHTDSWSTNEITVNWNAPFAWVTAYLD 605
>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
Angstroms Resolution
Length = 639
Score = 32.0 bits (71), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 47/210 (22%), Positives = 83/210 (39%), Gaps = 29/210 (13%)
Query: 58 DGKASGVDLVGGYYDAGDNVKFGLPMAFTVTMMSWSIIEYGKQMAASG-ELGHAMNAI-- 114
+G+ + D G++DAGD K+ + TV M + + Q+ E+ N+I
Sbjct: 174 NGQHTKKDSTKGWHDAGDYNKYVVNAGITVGSMFLAWEHFKDQLEPVALEIPEKNNSIPD 233
Query: 115 -----KWGTDYFIKAHPQPNVLYGEVGDGNTDHNCWQR-------PEDMTTDRRAYKIDP 162
K+ D+ + P+ G G H R PE+ D R +
Sbjct: 234 FLDELKYEIDWILTMQ-YPD------GSGRVAHKVSTRNFGGFIMPEN-EHDERFFVPWS 285
Query: 163 SNPGSDLAGETXXXXXXXSIVFRHSDPAYSSELLRHAYQLFDF-ADKYRGKYDGSITVAQ 221
S +D T +FR DP Y+ + + A ++F + + +
Sbjct: 286 SAATADFVAMTAMAAR----IFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFST 341
Query: 222 KYYRSISGYNDELLWAAAWLYQASGNQHYL 251
Y ++S +D L WAAA +++ G++ YL
Sbjct: 342 GEYATVSDADDRL-WAAAEMWETLGDEEYL 370
>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 610
Score = 30.0 bits (66), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 69/332 (20%), Positives = 115/332 (34%), Gaps = 57/332 (17%)
Query: 183 VFRHSDPAYSSELLRHAYQLFDFADKYRGKYDGSITVAQKYYRSISGYND-----ELLWA 237
+++ DP ++++ L A + A K+ Y + YND E WA
Sbjct: 303 LWKDYDPTFAADCLEKAEIAWQAALKHPDIY--AEYTPGSGGPGGGPYNDDYVGDEFYWA 360
Query: 238 AAWLYQASGNQ----------HYLDYLGKNGDSMGGTG-----WGMTEFGWDVKYPGVQT 282
A LY +G HYL+ K G++ G G WG F W G+ T
Sbjct: 361 ACELYVTTGKDEYKNYLMNSPHYLEMPAKMGENGGANGEDNGLWGC--FTWGTTQ-GLGT 417
Query: 283 LVAKFLMQGKAGHYAPVFERYQQKAEYFMCSCLGKGSRNVQKTPGGLIFRQ--------R 334
+ + G P + QKA + + N+++ L +Q
Sbjct: 418 ITLALVENG-----LPATDI--QKARNNIAKAADRWLENIEEQGYRLPIKQAEDERGGYP 470
Query: 335 WNNMQFVTSASFLATVYSDYLASAGRDLKCSAGNVAPAELLGFAKSQVDYILGDNPRATS 394
W + F+ + + D+ ++ L + Y+LG N S
Sbjct: 471 WGSNSFILNQMIVMGYAYDFTGD--------------SKYLDGMFDGISYLLGRNAMDQS 516
Query: 395 YMVGYGNNYPQRVHHRASSIVSIKVNPSFVSCRGGYATWFSSKGSDPNLLVGAVVGGPDA 454
Y+ GYG Q H R + + K P+ G + +S+ DP + P
Sbjct: 517 YVTGYGERPLQNPHDRFWTPQTSKRFPA--PPPGIISGGPNSRFEDPTINAAVKKDTP-P 573
Query: 455 YDNFGDRRDNYEQTEPATYNNAPILGILARLN 486
F D D++ + NAP + A L+
Sbjct: 574 QKCFIDHTDSWGTNQITVNWNAPFAWVTAYLD 605
>pdb|3M62|A Chain A, Crystal Structure Of Ufd2 In Complex With The
Ubiquitin-Like (Ubl) Domain Of Rad23
pdb|3M63|A Chain A, Crystal Structure Of Ufd2 In Complex With The
Ubiquitin-Like (Ubl) Domain Of Dsk2
Length = 968
Score = 30.0 bits (66), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 205 FADKYRGKYDGSITVAQKYYRSISGYNDELLWAA 238
F DK+ +Y SI + + YY+ I Y ++L+W +
Sbjct: 641 FYDKFNSRYSISIILEELYYK-IPSYKNQLIWQS 673
>pdb|2QIZ|A Chain A, Structure Of The Yeast U-Box-Containing Ubiquitin Ligase
Ufd2p
Length = 982
Score = 29.6 bits (65), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 205 FADKYRGKYDGSITVAQKYYRSISGYNDELLWAA 238
F DK+ +Y SI + + YY+ I Y ++L+W +
Sbjct: 655 FYDKFNSRYSISIILEELYYK-IPSYKNQLIWQS 687
>pdb|2QJ0|A Chain A, Structure Of The Yeast U-Box-Containing Ubiquitin Ligase
Ufd2p
Length = 982
Score = 29.6 bits (65), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 205 FADKYRGKYDGSITVAQKYYRSISGYNDELLWAA 238
F DK+ +Y SI + + YY+ I Y ++L+W +
Sbjct: 655 FYDKFNSRYSISIILEELYYK-IPSYKNQLIWQS 687
Database: pdbaa
Posted date: Mar 3, 2013 10:34 PM
Number of letters in database: 14,973,337
Number of sequences in database: 62,578
Lambda K H
0.317 0.134 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 19,486,942
Number of Sequences: 62578
Number of extensions: 863390
Number of successful extensions: 1725
Number of sequences better than 100.0: 18
Number of HSP's better than 100.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 1661
Number of HSP's gapped (non-prelim): 30
length of query: 629
length of database: 14,973,337
effective HSP length: 105
effective length of query: 524
effective length of database: 8,402,647
effective search space: 4402987028
effective search space used: 4402987028
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 54 (25.4 bits)