BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007041
(620 letters)
Database: pdbaa
62,578 sequences; 14,973,337 total letters
Searching..................................................done
>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 2.5.
pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 5.6.
pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 6.5
Length = 433
Score = 225 bits (574), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 155/466 (33%), Positives = 230/466 (49%), Gaps = 62/466 (13%)
Query: 110 NYTLALHKALMFFNAQRSGKLPKHNNVSWRGNSCLKDGKSASSSISKDLVGGFYDAGDAI 169
+Y L +L+F+ AQRSG+LP V+WR +S L D +DL GG++DAGD +
Sbjct: 4 DYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALND----QGDQGQDLTGGYFDAGDFV 59
Query: 170 KFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLKTFNHSADSISRMVA 229
KF FP ++T T+L+W +I++ A Y +AG L+ ++ +KW TDYF+K S +
Sbjct: 60 KFGFPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHT----SQNEFYG 115
Query: 230 QVGAGDTSGGSTTPNDHYCWMRPEDIDYDRPVTECSSC---SDXXXXXXXXXXXXXIVFK 286
QVG GD DH W RPED+ RP + + SD IVF+
Sbjct: 116 QVGQGDA--------DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFR 167
Query: 287 D-NKAYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNSTSYWDEFIWGGAWLYYATG 345
+ + YS L+ A+ LF F+ R +YS +A FY S Y DE +W AWLY AT
Sbjct: 168 NVDGTYSNNLLTHARQLFDFANNYRGKYSDSITDARNFYASADYRDELVWAAAWLYRATN 227
Query: 346 NSSYLQLATTPGLGKHAGVF-WGILDNSVLSWDNKXXXXXXXXXXXXXXXXPGYPYEEML 404
+++YL A + L G+ WG L+WD+K Y++ +
Sbjct: 228 DNTYLNTAES--LYDEFGLQNWG----GGLNWDSKVSGVQVLLAKLTNKQ----AYKDTV 277
Query: 405 RTFHNQTSITMCSYLPTFTTWNRTRGLIQLNHGNPQPLQYVVNAAFLATLFSDYLDAADT 464
+++ N I P +GL+ ++ L++ NAAF+ L+AA+
Sbjct: 278 QSYVNYL-INNQQKTP--------KGLLYIDMWG--TLRHAANAAFIM------LEAAEL 320
Query: 465 PGWYCGPNFFSTDTLREFAKTQIDYILGKNPRKMSYIVGFGNHYPKHVHHRGASIPKNKI 524
S + R+FA+TQIDY LG R S++ GFG++ P HHR +S P
Sbjct: 321 G--------LSASSYRQFAQTQIDYALGDGGR--SFVCGFGSNPPTRPHHRSSSCPPAPA 370
Query: 525 RYNCKGGWKLRDSSKPNPNTLVGALVAGPDKHDGFRDVRTNYNYTE 570
C W +S PN + L GALV GPD++D + D R++Y + E
Sbjct: 371 --TCD--WNTFNSPDPNYHVLSGALVGGPDQNDNYVDDRSDYVHNE 412
>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
Length = 605
Score = 204 bits (520), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 160/479 (33%), Positives = 217/479 (45%), Gaps = 77/479 (16%)
Query: 110 NYTLALHKALMFFNAQRSGKLPKHNNVSWRGNSCLKDGKSASSSISKDLVGGFYDAGDAI 169
NY AL K++ F+ AQRSGKLP++N VSWRG+S L DG + + DL GG+YDAGD +
Sbjct: 5 NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDG----ADVGLDLTGGWYDAGDHV 60
Query: 170 KFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLKTFNHSADSISRMVA 229
KF FP +FT TML+W IE Y +G++ ++K+ ++W DYF+K H + ++ +
Sbjct: 61 KFGFPMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKA--HPSPNV--LYV 116
Query: 230 QVGAGDTSGGSTTPNDHYCWMRPEDIDYDRPVTECS-SC--SDXXXXXXXXXXXXXIVFK 286
QVG GD DH W E + +RP + SC SD IVF
Sbjct: 117 QVGDGDA--------DHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFA 168
Query: 287 DNK-AYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNS-TSYWDEFIWGGAWLYYAT 344
D+ AY+ LV AK L+ F+ R YS A FYNS + Y DE +WG WLY AT
Sbjct: 169 DDDPAYAATLVQHAKQLYTFADTYRGVYS-DCVPAGAFYNSWSGYQDELVWGAYWLYKAT 227
Query: 345 GNSSYLQLA-------TTPGLGKHAGVFWGILDNSVLSWDNKXXXXXXXXXXXXXXXXPG 397
G+ SYL A +T W I +WD+K
Sbjct: 228 GDDSYLAKAEYEYDFLSTEQQTDLRSYRWTI------AWDDKSYGTYVLLAKETGKQKYI 281
Query: 398 YPYEEMLRTFHNQTSITMCSYLP----TFTTWNRTRGLIQLNHGNPQPLQYVVNAAFLAT 453
L + + Y P TW R Y N AF+A
Sbjct: 282 DDANRWLDYWTVGVNGQRVPYSPGGMAVLDTWGALR--------------YAANTAFVAL 327
Query: 454 LFSDYLDAADTPGWYCGPNFFSTDTLREFAKTQIDYILGKNPRKMSYIVGFGNHYPKHVH 513
+++ +D + +FA QI+Y LG NPR SY+VGFGN+ P++ H
Sbjct: 328 VYAKVID-----------DPVRKQRYHDFAVRQINYALGDNPRNSSYVVGFGNNPPRNPH 376
Query: 514 HRGASIPKNKIRYNCKGGWKLRDSS-KPNPNTLVGALVAGP-DKHDGFRDVRTNYNYTE 570
HR A G W +S N + L GALV GP +D + D R +Y E
Sbjct: 377 HRTA-----------HGSWTDSIASPAENRHVLYGALVGGPGSPNDAYTDDRQDYVANE 424
>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
Length = 475
Score = 182 bits (461), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 154/478 (32%), Positives = 209/478 (43%), Gaps = 62/478 (12%)
Query: 98 VPRHHHSHARPDNYTLALHKALMFFNAQRSGKLPKHNNVSWRGNSCLKDGKSASSSISKD 157
VPR H + NY AL KA+MF+ Q SGKLP +WRG+S LKDG+ D
Sbjct: 15 VPRGSHMASGSFNYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNG----LD 70
Query: 158 LVGGFYDAGDAIKFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLKTF 217
L GG++DAGD +KFN P S+T TMLSW+V EY + +G+L H+ I+W DYF+K
Sbjct: 71 LTGGWFDAGDHVKFNLPMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCH 130
Query: 218 NHSADSISRMVAQVGAGDTSGGSTTPNDHYCWMRPEDIDYDRP---VTECSSCSDXXXXX 274
S+ V GD S DH W E + +RP VT+ S S
Sbjct: 131 P------SKYVYYYQVGDGS------KDHAWWGPAEVMQMERPSFKVTQSSPGSTVVAET 178
Query: 275 XXXXXXXXIVFKD-NKAYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNSTS-YWDE 332
IV KD N + + AK L++F+ ++ AG A +YNS S ++DE
Sbjct: 179 AASLAAASIVLKDRNPTKAATYLQHAKELYEFAEVTKS--DAGYTAANGYYNSWSGFYDE 236
Query: 333 FIWGGAWLYYATGNSSYLQLATT--PGLGKHAGVFWGILDNS-VLSWDNKXXXXXXXXXX 389
W WLY AT +S+YL A + K +G +D WD+
Sbjct: 237 LSWAAVWLYLATNDSTYLTKAESYVQNWPKISGS--NTIDYKWAHCWDDVHNGAALLLAK 294
Query: 390 XXXXXXPGYPYEEMLRTFHNQTSITMCSYLPTFTTWNRTRGLIQLNHGNPQPLQYVVNAA 449
E L + + Y P W G L+Y A
Sbjct: 295 ITGKDIYKQIIESHLDYWTTGYNGERIKYTPKGLAWLDQWG----------SLRYATTTA 344
Query: 450 FLATLFSDYLDAADTPGWYCGPNFFSTDTLREFAKTQIDYILGKNPRKMSYIVGFGNHYP 509
FLA ++SD++ T + R+F ++QIDY LG R S++VGFG + P
Sbjct: 345 FLAFVYSDWVGCPST----------KKEIYRKFGESQIDYALGSAGR--SFVVGFGTNPP 392
Query: 510 KHVHHRGASIPKNKIRYNCKGGWKLRDSSKP-NPNTLVGALVAGPDKHDGFRDVRTNY 566
K HHR A W S + +TL GALV GP D + D +NY
Sbjct: 393 KRPHHRTA-----------HSSWADSQSIPSYHRHTLYGALVGGPGSDDSYTDDISNY 439
>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
Endoglucanase From Separately Expressed Gh9 And Cbm3c
Modules
Length = 466
Score = 181 bits (458), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 152/475 (32%), Positives = 208/475 (43%), Gaps = 70/475 (14%)
Query: 110 NYTLALHKALMFFNAQRSGKLPKHN-NVSWRGNSCLKDGKSASSSISKDLVGGFYDAGDA 168
NY AL KA+ F+ QRSGKL ++WRG+S L DGK A DL GG+YDAGD
Sbjct: 25 NYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGI----DLTGGWYDAGDH 80
Query: 169 IKFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLKTFNHSADSISRMV 228
+KFN P S++ ML W+V EY ++ +G+ NH+ IKW DYF+K H +
Sbjct: 81 VKFNLPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKC--HPEKDV--YY 136
Query: 229 AQVGAGDTSGGSTTPNDHYCWMRPEDIDYDRP---VTECSSCSDXXXXXXXXXXXXXIVF 285
QVG G DH W E + +RP V S S I+F
Sbjct: 137 YQVGDGHA--------DHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIF 188
Query: 286 KD-NKAYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNSTS-YWDEFIWGGAWLYYA 343
K + YS++ + AK LF+F+ ++ G A FYNS S ++DE W WLY A
Sbjct: 189 KKVDGEYSKECLKHAKELFEFADTTKS--DDGYTAANGFYNSWSGFYDELSWAAVWLYLA 246
Query: 344 TGNSSYLQLATTPGLGKHAGVFWGILDNSVL-------SWDNKXXXXXXXXXXXXXXXXP 396
T +SSYL A + WG + + WD+
Sbjct: 247 TNDSSYLDKAES------YSDKWGYEPQTNIPKYKWAQCWDDVTYGTYLLLARIKNDNG- 299
Query: 397 GYPYEEMLRTFHNQTSITMCSYLPTFTTWNRTRGLIQLNHGNPQPLQYVVNAAFLATLFS 456
Y+E + + + T+T +GL L+ L+Y AFLA ++S
Sbjct: 300 --KYKEAIERHLDWWTTGYNGERITYT----PKGLAWLDQWG--SLRYATTTAFLACVYS 351
Query: 457 DYLDAADTPGWYCGPNFFSTDTLREFAKTQIDYILGKNPRKMSYIVGFGNHYPKHVHHRG 516
D+ + T EFA++Q DY LG R S++VGFG + PK HHR
Sbjct: 352 DWENGDKE----------KAKTYLEFARSQADYALGSTGR--SFVVGFGENPPKRPHHRT 399
Query: 517 ASIPKNKIRYNCKGGWKLRDSSKP-NPNTLVGALVAGPDKHDGFRDVRTNYNYTE 570
A G W P + + L GALV GPD D + D +NY E
Sbjct: 400 A-----------HGSWADSQMEPPEHRHVLYGALVGGPDSTDNYTDDISNYTCNE 443
>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
Length = 614
Score = 173 bits (438), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 147/469 (31%), Positives = 212/469 (45%), Gaps = 69/469 (14%)
Query: 110 NYTLALHKALMFFNAQRSGKLPKHNNVSWRGNSCLKDGKSASSSISKDLVGGFYDAGDAI 169
NY AL K++MF+ QRSG LP +WR +S +KDG S + DL GG+YDAGD +
Sbjct: 5 NYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDG----SDVGVDLTGGWYDAGDHV 60
Query: 170 KFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLKTFNHSADSISRMVA 229
KFN P S+T ML+WS+ E Y+ +G+ ++ + IKW DYF+K +
Sbjct: 61 KFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKC----NPTPGVYYY 116
Query: 230 QVGAGDTSGGSTTPNDHYCWMRPEDIDYDRP---VTECSSCSDXXXXXXXXXXXXXIVFK 286
QVG G DH W E + +RP V S +VFK
Sbjct: 117 QVGDGG--------KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFK 168
Query: 287 -DNKAYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNSTSYWDEFIWGGAWLYYATG 345
+ Y++K + AK LF + +A+ AG A+ +Y+S+S++D+ W WLY AT
Sbjct: 169 SSDPTYAEKCISHAKNLFDMA--DKAKSDAGYTAASGYYSSSSFYDDLSWAAVWLYLATN 226
Query: 346 NSSYLQLATT--PGLGKHA-----GVFWGILDNSVLSWDNKXXXXXXXXXXXXXXXXPGY 398
+S+YL A + P GK WG WD+
Sbjct: 227 DSTYLDKAESYVPNWGKEQQTDIIAYKWG------QCWDDVHYGAELLLAKLTNKQLYKD 280
Query: 399 PYEEMLRTFHNQTSITMCSYLPTFTTWNRTRGLIQLNHGNPQPLQYVVNAAFLATLFSDY 458
E L + + T SY P W L Q L++ AFLA +++++
Sbjct: 281 SIEMNLDFWTTGVNGTRVSYTPKGLAW-----LFQWGS-----LRHATTQAFLAGVYAEW 330
Query: 459 LDAADTPGWYCGPNFFSTDTLREFAKTQIDYILGKNPRKMSYIVGFGNHYPKHVHHRGAS 518
C P+ S ++F K+QIDY LG R S++VG+G + P+H HHR A
Sbjct: 331 EG--------CTPSKVS--VYKDFLKSQIDYALGSTGR--SFVVGYGVNPPQHPHHRTA- 377
Query: 519 IPKNKIRYNCKGGWKLRDSSKP-NPNTLVGALVAGPDKHDGFRDVRTNY 566
G W + +S + +T+ GALV GPD DG+ D NY
Sbjct: 378 ----------HGSWTDQMTSPTYHRHTIYGALVGGPDNADGYTDEINNY 416
>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
Length = 614
Score = 173 bits (438), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 147/469 (31%), Positives = 212/469 (45%), Gaps = 69/469 (14%)
Query: 110 NYTLALHKALMFFNAQRSGKLPKHNNVSWRGNSCLKDGKSASSSISKDLVGGFYDAGDAI 169
NY AL K++MF+ QRSG LP +WR +S +KDG S + DL GG+YDAGD +
Sbjct: 5 NYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDG----SDVGVDLTGGWYDAGDHV 60
Query: 170 KFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLKTFNHSADSISRMVA 229
KFN P S+T ML+WS+ E Y+ +G+ ++ + IKW DYF+K +
Sbjct: 61 KFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKC----NPTPGVYYY 116
Query: 230 QVGAGDTSGGSTTPNDHYCWMRPEDIDYDRP---VTECSSCSDXXXXXXXXXXXXXIVFK 286
QVG G DH W E + +RP V S +VFK
Sbjct: 117 QVGDGG--------KDHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFK 168
Query: 287 -DNKAYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNSTSYWDEFIWGGAWLYYATG 345
+ Y++K + AK LF + +A+ AG A+ +Y+S+S++D+ W WLY AT
Sbjct: 169 SSDPTYAEKCISHAKNLFDMA--DKAKSDAGYTAASGYYSSSSFYDDLSWAAVWLYLATN 226
Query: 346 NSSYLQLATT--PGLGKHA-----GVFWGILDNSVLSWDNKXXXXXXXXXXXXXXXXPGY 398
+S+YL A + P GK WG WD+
Sbjct: 227 DSTYLDKAESYVPNWGKEQQTDIIAYKWG------QXWDDVHYGAELLLAKLTNKQLYKD 280
Query: 399 PYEEMLRTFHNQTSITMCSYLPTFTTWNRTRGLIQLNHGNPQPLQYVVNAAFLATLFSDY 458
E L + + T SY P W L Q L++ AFLA +++++
Sbjct: 281 SIEMNLDFWTTGVNGTRVSYTPKGLAW-----LFQWGS-----LRHATTQAFLAGVYAEW 330
Query: 459 LDAADTPGWYCGPNFFSTDTLREFAKTQIDYILGKNPRKMSYIVGFGNHYPKHVHHRGAS 518
C P+ S ++F K+QIDY LG R S++VG+G + P+H HHR A
Sbjct: 331 EG--------CTPSKVS--VYKDFLKSQIDYALGSTGR--SFVVGYGVNPPQHPHHRTA- 377
Query: 519 IPKNKIRYNCKGGWKLRDSSKP-NPNTLVGALVAGPDKHDGFRDVRTNY 566
G W + +S + +T+ GALV GPD DG+ D NY
Sbjct: 378 ----------HGSWTDQMTSPTYHRHTIYGALVGGPDNADGYTDEINNY 416
>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticum
pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticium In Complex With Cellobiose
Length = 441
Score = 139 bits (349), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 127/479 (26%), Positives = 195/479 (40%), Gaps = 91/479 (18%)
Query: 110 NYTLALHKALMFFNAQRSGKLPKHNNV-SWRGNSCLKDGKSASSSISKDLVGGFYDAGDA 168
+Y+ AL +++FF+A + G NNV WRG DG S + DL GG++DAGD
Sbjct: 5 DYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDG----SDVGVDLTGGYHDAGDH 60
Query: 169 IKFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLKTFNHSADSISRMV 228
+KF PQ ++ +L WS+ E+ ++A G + + +K+ TDYFLK+ +S +
Sbjct: 61 VKFGLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNS----TTFY 116
Query: 229 AQVGAGDTSGGSTTPNDHYCWMRPEDIDYDRP----VTECSSCSDXXXXXXXXXXXXXIV 284
QVG G+ DH W PE+ RP S SD +
Sbjct: 117 YQVGEGNA--------DHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLN 168
Query: 285 FKD-NKAYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNSTSYWDEFIWGGAWLYYA 343
+K+ + AY+ K ++ AK L+ + + G FY +TS+ D+ W WLY A
Sbjct: 169 YKNIDSAYATKCLNAAKELYAMGKANQ-----GVGNGQSFYQATSFGDDLAWAATWLYTA 223
Query: 344 TGNSSY-------LQLATTPGLGKHAGVFWGILDNSVLSWDNKXXXXXXXXXXXXXXXXP 396
T +S+Y + L T K + D + WD+
Sbjct: 224 TNDSTYITDAEQFITLGNTMNENK-------MQDKWTMCWDDMYVPAALRLAQITGKQIY 276
Query: 397 GYPYEEMLRTFHNQTSITMCSYLPTFTTWNRTR-----GLIQLNHGNPQPLQYVVNAAFL 451
E + Q + T L + W R ++ L + P Q +++ L
Sbjct: 277 KDAIEFNFNYWKTQVTTTPGG-LKWLSNWGVLRYAAAESMVMLVYCKQNPDQSLLD---L 332
Query: 452 ATLFSDYLDAADTPGWYCGPNFFSTDTLREFAKTQIDYILGKNPRKMSYIVGFGNHYPKH 511
A DY+ D P + YI+G +G+++ H
Sbjct: 333 AKKQVDYI-LGDNPA-------------------NMSYIIG-----------YGSNWCIH 361
Query: 512 VHHRGASIPKNKIRYNCKGGWKLRDSSKPNPNTLVGALVAGPDKHDGFRDVRTNYNYTE 570
HHR A+ Y G D++KP + L GALV GPD++D F D Y YTE
Sbjct: 362 PHHRAAN------GYTYANG----DNAKPAKHLLTGALVGGPDQNDKFLDDANQYQYTE 410
>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
Length = 611
Score = 111 bits (278), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 142/314 (45%), Gaps = 44/314 (14%)
Query: 68 IVSRKIFVWTVGTLLVSALLAGFITLIVKTVPRHHHSHARPDNYTLALHKALMFFNAQRS 127
+ +++ + +G ++ +A + F T+ + NY AL ++ F++A
Sbjct: 5 LFAKRAVAFLLGIVITAAGIVSFNTVSTSAAGEY--------NYAKALQYSMFFYDANMC 56
Query: 128 GKLPKHNNV-SWRGNSCLKDGK----SASSSISK------------------DLVGGFYD 164
G N++ SWRG+ + D + S ++++S D+ GGF+D
Sbjct: 57 GTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSNRSVLDPDGDGKVDVSGGFHD 116
Query: 165 AGDAIKFNFPQSFTITMLSWSVIEYSAKYEAAGELNHVKEIIKWGTDYFLK-TFNHSADS 223
AGD +KF P+++ + + W E+ ++ A G+ H + I+++ DYF++ TF ++ +
Sbjct: 117 AGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEVILRYFNDYFMRCTFRDASGN 176
Query: 224 ISRMVAQVGAGDTSGGSTTPNDHYCWMRPE-DIDYDRP--VTECSSCSDXXXXXXXXXXX 280
+ QVG GD DH W PE D + R +T+ +D
Sbjct: 177 VVAFCHQVGDGDI--------DHAFWGAPENDTMFRRGWFITKEKPGTDIISATAASLAI 228
Query: 281 XXIVFKD-NKAYSQKLVHGAKTLFQFSRQQRARYSAGSAEAAKFYNSTSYWDEFIWGGAW 339
+ FKD + Y+ K + AK LF F+ + G +Y S+ + D++ W AW
Sbjct: 229 NYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQGEDGPKGYYGSSKWQDDYCWAAAW 288
Query: 340 LYYATGNSSYLQLA 353
LY AT N YL A
Sbjct: 289 LYLATQNEHYLDEA 302
Score = 62.8 bits (151), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 48/89 (53%), Gaps = 12/89 (13%)
Query: 482 FAKTQIDYILGKNPRKMSYIVGFGNHYPKHVHHRGASIPKNKIRYNCKGGWKLRDSSKPN 541
+A++Q+DY+LGKNP Y+VG+ ++ K+ HHR AS G K + S P+
Sbjct: 420 WARSQMDYLLGKNPLNRCYVVGYSSNSVKYPHHRAAS------------GLKDANDSSPH 467
Query: 542 PNTLVGALVAGPDKHDGFRDVRTNYNYTE 570
L GALV GPD D D +Y Y E
Sbjct: 468 KYVLYGALVGGPDASDQHVDRTNDYIYNE 496
>pdb|2QV5|A Chain A, Crystal Structure Of Uncharacterized Protein Atu2773 From
Agrobacterium Tumefaciens C58
pdb|2QV5|B Chain B, Crystal Structure Of Uncharacterized Protein Atu2773 From
Agrobacterium Tumefaciens C58
Length = 261
Score = 38.1 bits (87), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 40/93 (43%), Gaps = 10/93 (10%)
Query: 98 VPRHHHSHARPDNYTLALHKALMFFNAQRSGKLPKHNNVSWRGNSCLKDGKSASSSISKD 157
+ R H S A+ NYT ++ F A++S P ++ RG L DG SA S
Sbjct: 113 IDRLHRSMAKITNYTGVMNYLGGRFLAEQSALEPVMRDIGKRGLLFLDDGSSAQS----- 167
Query: 158 LVGGFYDAGDAIKFNFPQSFTITMLSWSVIEYS 190
L GG A + PQ F +L V E S
Sbjct: 168 LSGGIAKA-----ISAPQGFADVLLDGEVTEAS 195
>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
Thermoacidophilic Alicyclobacillus Acidocaldarius
pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellobiose
pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellotetraose
pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
Isofagomine
pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
Isofagomine
pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
Isofagomine
Length = 537
Score = 37.7 bits (86), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 30/74 (40%), Gaps = 19/74 (25%)
Query: 483 AKTQIDYILGKNPRKMSYIVGFGNHYPKHVHHRGASIPKNKIRYNCKGGWKLRDSSKPNP 542
A+ DY+ G NP Y+ GFG +H HHR + D P P
Sbjct: 431 AQRAADYLFGANPLGQCYVTGFGQRPVRHPHHRPSVA---------------DDVDHPVP 475
Query: 543 NTLVGALVAGPDKH 556
+VG GP++H
Sbjct: 476 GMVVG----GPNRH 485
Score = 34.7 bits (78), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 45/109 (41%), Gaps = 5/109 (4%)
Query: 76 WTVGTLLVSALLAGFITLIVKTVPRHHHSHARPDNYTLALHKALMFFNAQRSGK-LPKHN 134
W VG + + G TL V T+ H R Y L L FF+ Q G LP+
Sbjct: 56 WYVGDF-TALRVPGTYTLTVGTLEARVVIHRR--AYRDVLEAMLRFFDYQLCGVVLPEDE 112
Query: 135 NVSWRGNSC-LKDGKSASSSISKDLVGGFYDAGDAIKFNFPQSFTITML 182
W +C D K + + GG++DAGD K+ P + + L
Sbjct: 113 AGPWAHGACHTSDAKVFGTERALACPGGWHDAGDYGKYTVPAAKAVADL 161
>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 610
Score = 31.6 bits (70), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 22/92 (23%)
Query: 102 HHSHARPDNYTLALHKALMFFNAQRSG---KLPKHNNVSW------------RGNSCL-- 144
H R D YT + AL FF +RSG ++P W +G++ +
Sbjct: 92 HPFDIRKDIYTQMKYDALAFFYHKRSGIPIEMPYAGGEQWTRPAGHIGIEPNKGDTNVPT 151
Query: 145 --KDGKSA---SSSISKDLVGGFYDAGDAIKF 171
+D + A + +KD+ GG+YDAGD K+
Sbjct: 152 WPQDDEYAGIPQKNYTKDVTGGWYDAGDHGKY 183
>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 609
Score = 31.6 bits (70), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 22/92 (23%)
Query: 102 HHSHARPDNYTLALHKALMFFNAQRSG---KLPKHNNVSW------------RGNSCL-- 144
H R D YT + AL FF +RSG ++P W +G++ +
Sbjct: 92 HPFDIRKDIYTQMKYDALAFFYHKRSGIPIEMPYAGGEQWTRPAGHIGIEPNKGDTNVPT 151
Query: 145 --KDGKSA---SSSISKDLVGGFYDAGDAIKF 171
+D + A + +KD+ GG+YDAGD K+
Sbjct: 152 WPQDDEYAGIPQKNYTKDVTGGWYDAGDHGKY 183
>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
Angstroms Resolution
Length = 639
Score = 30.4 bits (67), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 43/198 (21%), Positives = 79/198 (39%), Gaps = 8/198 (4%)
Query: 156 KDLVGGFYDAGDAIKFNFPQSFTITMLSWSVIEYSAKYE-AAGELNHVKEIIKWGTDYFL 214
KD G++DAGD K+ T+ + + + + E A E+ I FL
Sbjct: 180 KDSTKGWHDAGDYNKYVVNAGITVGSMFLAWEHFKDQLEPVALEIPEKNNSIP----DFL 235
Query: 215 KTFNHSADSISRMVAQVGAGDTSGGSTTPNDHYCWMRPEDIDYDRPVTECSSCSDXXXXX 274
+ D I M G+G + +T N ++ PE+ +R SS +
Sbjct: 236 DELKYEIDWILTMQYPDGSGRVAHKVSTRN-FGGFIMPENEHDERFFVPWSSAA-TADFV 293
Query: 275 XXXXXXXXIVFKDNKAYSQKLVHGAKTLFQFSRQQRAR-YSAGSAEAAKFYNSTSYWDEF 333
I + Y++K ++ AK ++F + A ++ S + Y + S D+
Sbjct: 294 AMTAMAARIFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFSTGEYATVSDADDR 353
Query: 334 IWGGAWLYYATGNSSYLQ 351
+W A ++ G+ YL+
Sbjct: 354 LWAAAEMWETLGDEEYLR 371
>pdb|3H7L|A Chain A, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
pdb|3H7L|B Chain B, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
pdb|3H7L|C Chain C, Crystal Structure Of Endoglucanase-Related Protein From
Vibrio Parahaemolyticus
Length = 586
Score = 30.4 bits (67), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/64 (26%), Positives = 30/64 (46%)
Query: 479 LREFAKTQIDYILGKNPRKMSYIVGFGNHYPKHVHHRGASIPKNKIRYNCKGGWKLRDSS 538
L FA+ +++I+G NP M + G G + P ++ G K + GG++ +
Sbjct: 470 LSVFAQDALNWIVGLNPYDMCMLDGHGRNNPDYLPQYGFFNAKGGVCNGITGGFEDEEDI 529
Query: 539 KPNP 542
NP
Sbjct: 530 AFNP 533
>pdb|1EB6|A Chain A, Deuterolysin From Aspergillus Oryzae
Length = 177
Score = 29.3 bits (64), Expect = 7.4, Method: Composition-based stats.
Identities = 21/62 (33%), Positives = 31/62 (50%), Gaps = 5/62 (8%)
Query: 211 DYFLKTFNHSADSISRMVAQVG--AGDTSGGSTT--PNDHYCWMRPEDIDYDRPV-TECS 265
+YF T + +++ + V AG TSGGSTT ND Y + P + Y P E +
Sbjct: 42 EYFKTTDQQTRTTVAERLRAVAKEAGSTSGGSTTYHCNDPYGYCEPNVLAYTLPSKNEIA 101
Query: 266 SC 267
+C
Sbjct: 102 NC 103
Database: pdbaa
Posted date: Mar 3, 2013 10:34 PM
Number of letters in database: 14,973,337
Number of sequences in database: 62,578
Lambda K H
0.319 0.134 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,493,112
Number of Sequences: 62578
Number of extensions: 772368
Number of successful extensions: 1685
Number of sequences better than 100.0: 15
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 1620
Number of HSP's gapped (non-prelim): 31
length of query: 620
length of database: 14,973,337
effective HSP length: 105
effective length of query: 515
effective length of database: 8,402,647
effective search space: 4327363205
effective search space used: 4327363205
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 54 (25.4 bits)