BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 010522
(508 letters)
Database: pdbaa
62,578 sequences; 14,973,337 total letters
Searching..................................................done
>pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 2.5.
pdb|1KSC|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 5.6.
pdb|1KSD|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes
Takasagoensis, At Ph 6.5
Length = 433
Score = 328 bits (841), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 187/461 (40%), Positives = 263/461 (57%), Gaps = 37/461 (8%)
Query: 43 HNYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
++Y+ L S++F+E QRSG+LP +Q++TWR+DS L D DL GGY+DAGD VKFG
Sbjct: 3 YDYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFG 62
Query: 103 FPMAFTTTMLSWSVIEF--GGLMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
FPMA+T T+L+W +I+F G G L + R+A++WATDY +KA + Y QVG +
Sbjct: 63 FPMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTSQNEFYGQVGQGDA 122
Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
DHA W RPEDM R KID + PGSD+AGET S+VFR D TYS L+ A
Sbjct: 123 DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHAR 182
Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
++F FA+ YRG YS+ + FY S + Y+DEL+W AAWL++AT + TYLN +
Sbjct: 183 QLFDFANNYRGKYSDSITD-ARNFYAS-ADYRDELVWAAAWLYRATNDNTYLNTAESLYD 240
Query: 281 ILGAAEYDNTFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGASFSSAQ 340
G + WD+K G ++LL+ KL + YKD +++ LI ++ Q
Sbjct: 241 EFGLQNWGGGLNWDSKVSGVQVLLA------KLTNKQAYKDTVQSYVNYLI-----NNQQ 289
Query: 341 YTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIAKRQVD 400
TP GLL+ +++ + +F++L A+ ++A R A+ Q+D
Sbjct: 290 KTPKGLLYIDMWGTLRHAANAAFIMLEAAE-------------LGLSASSYRQFAQTQID 336
Query: 401 YILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVL 460
Y LGD S++ G+G P R HHR SS P PA C ++ S PN +VL
Sbjct: 337 YALGDG--GRSFVCGFGSNPPTRPHHRSSSCP-----PAPATCD--WNTFNSPDPNYHVL 387
Query: 461 VGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL 501
GA+VGGPDQ+D + D+R DY +E +T NA ALA L
Sbjct: 388 SGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALAAL 428
>pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|1JS4|B Chain B, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA
pdb|4TF4|A Chain A, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|4TF4|B Chain B, EndoEXOCELLULASE:CELLOPENTAOSE FROM THERMOMONOSPORA
pdb|3TF4|A Chain A, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|3TF4|B Chain B, EndoEXOCELLULASE:CELLOTRIOSE FROM THERMOMONOSPORA
pdb|1TF4|A Chain A, EndoEXOCELLULASE FROM THERMOMONOSPORA
pdb|1TF4|B Chain B, EndoEXOCELLULASE FROM THERMOMONOSPORA
Length = 605
Score = 321 bits (823), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 193/472 (40%), Positives = 262/472 (55%), Gaps = 40/472 (8%)
Query: 44 NYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFGF 103
NY +AL KS+ F+E QRSGKLP N R++WR DSGL DGA + +DL GG+YDAGD+VKFGF
Sbjct: 5 NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGF 64
Query: 104 PMAFTTTMLSWSVIEF--GGLMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANKD 161
PMAFT TML+W IE G + G++ ++ +RW DY +KA P+ +YVQVGD + D
Sbjct: 65 PMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVLYVQVGDGDAD 124
Query: 162 HACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAIR 221
H W E M R K+D + PGSDVA ET S+VF DP Y+ LV+ A +
Sbjct: 125 HKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAKQ 184
Query: 222 VFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQI 281
++ FAD YRG YS+ + FY S+SGYQDEL+WGA WL+KAT + +YL +
Sbjct: 185 LYTFADTYRGVYSDCVP--AGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDF 242
Query: 282 LGAAE------YDNTFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGAS 335
L + Y T WD+K G +LL+K QK Y D A+ ++ G +
Sbjct: 243 LSTEQQTDLRSYRWTIAWDDKSYGTYVLLAKETGKQK------YIDDANRWLDYWTVGVN 296
Query: 336 FSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIA 395
Y+PGG+ + ++Y +T+F+ L YAK + V +R A
Sbjct: 297 GQRVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDP----------VRKQRYHDFA 346
Query: 396 KRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSP 455
RQ++Y LGDNP SY+VG+G P+ HHR AH S + S +
Sbjct: 347 VRQINYALGDNPRNSSYVVGFGNNPPRNPHHR-------TAH------GSWTDSIASPAE 393
Query: 456 NPNVLVGAIVGGPDQ-HDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFG 506
N +VL GA+VGGP +D + D+R DY +E +T NA ALA L +G
Sbjct: 394 NRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEEYG 445
>pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive
Endoglucanase From Separately Expressed Gh9 And Cbm3c
Modules
Length = 466
Score = 251 bits (640), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 164/477 (34%), Positives = 236/477 (49%), Gaps = 47/477 (9%)
Query: 44 NYRDALTKSIIFFEGQRSGKLPPNQ-RMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
NY +AL K+I F+E QRSGKL + R+ WR DSGL DG +DL GG+YDAGD+VKF
Sbjct: 25 NYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFN 84
Query: 103 FPMAFTTTMLSWSVIEFGGLMK--GELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
PM+++ ML W+V E+ K G+ + I+WA DY +K D Y QVGD +
Sbjct: 85 LPMSYSAAMLGWAVYEYEDAFKQSGQYNHILNNIKWACDYFIKCHPEKDVYYYQVGDGHA 144
Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
DHA W E M R K+DR+ PGS V ET S++F++ D YSK ++ A
Sbjct: 145 DHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIFKKVDGEYSKECLKHAK 204
Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
+F+FAD + S+ FY S+SG+ DEL W A WL+ AT + +YL+ +
Sbjct: 205 ELFEFADTTK---SDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESYSD 261
Query: 281 ILG------AAEYDNTFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGA 334
G +Y WD+ G +LL++ +YK+ + + G
Sbjct: 262 KWGYEPQTNIPKYKWAQCWDDVTYGTYLLLARI-----KNDNGKYKEAIERHLDWWTTGY 316
Query: 335 SFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTI 394
+ YTP GL + +++Y T+T+FL Y+ + G AK
Sbjct: 317 NGERITYTPKGLAWLDQWGSLRYATTTAFLACVYSDWEN---------GDKEKAKTYLEF 367
Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
A+ Q DY LG S++VG+G P+R HHR ++ S SQ
Sbjct: 368 ARSQADYALGST--GRSFVVGFGENPPKRPHHR----------------TAHGSWADSQM 409
Query: 455 PNP---NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFGQL 508
P +VL GA+VGGPD D + D+ +Y +E + NA VG LA + +G+L
Sbjct: 410 EPPEHRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLAKMYKLYGEL 466
>pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1GA2|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum Complexed With Cellobiose
pdb|1G87|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1G87|B Chain B, The Crystal Structure Of Endoglucanase 9g From Clostridium
Cellulolyticum
pdb|1KFG|A Chain A, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
pdb|1KFG|B Chain B, The X-Ray Crystal Structure Of Cel9g From Clostridium
Cellulolyticum Complexed With A Thio-Oligosaccharide
Inhibitor
Length = 614
Score = 243 bits (621), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 165/473 (34%), Positives = 237/473 (50%), Gaps = 43/473 (9%)
Query: 43 HNYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
+NY +AL KSI+F+E QRSG LP ++R WR DSG+ DG+ + VDL GG+YDAGD+VKF
Sbjct: 4 YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63
Query: 103 FPMAFTTTMLSWSVIEFGGL--MKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
PM++T+ ML+WS+ E G+ + + I+WA DY +K P Y QVGD K
Sbjct: 64 LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123
Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
DH+ W E M R K+D + PGS V T ++VF+ SDPTY++ + A
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183
Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
+F ADK + + Y S S + D+L W A WL+ AT + TYL+ +
Sbjct: 184 NLFDMADKAK----SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVP 239
Query: 281 ILGAAEYDNTFG------WDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGA 334
G + + WD+ H GA +LL+ KL + YKD + + G
Sbjct: 240 NWGKEQQTDIIAYKWGQCWDDVHYGAELLLA------KLTNKQLYKDSIEMNLDFWTTGV 293
Query: 335 SFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTI 394
+ + YTP GL + ++++ T+ +FL YA++ G T +
Sbjct: 294 NGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWE---------GCTPSKVSVYKDF 344
Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
K Q+DY LG S++VGYG PQ HHR AH S M S +
Sbjct: 345 LKSQIDYALGST--GRSFVVGYGVNPPQHPHHR-------TAH------GSWTDQMTSPT 389
Query: 455 PNPNVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL-AHSFG 506
+ + + GA+VGGPD D + DE +Y +E + NA GALA + HS G
Sbjct: 390 YHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGG 442
>pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
pdb|1K72|B Chain B, The X-ray Crystal Structure Of Cel9g Complexed With
Cellotriose
Length = 614
Score = 243 bits (621), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 165/473 (34%), Positives = 237/473 (50%), Gaps = 43/473 (9%)
Query: 43 HNYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFG 102
+NY +AL KSI+F+E QRSG LP ++R WR DSG+ DG+ + VDL GG+YDAGD+VKF
Sbjct: 4 YNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFN 63
Query: 103 FPMAFTTTMLSWSVIEFGGL--MKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANK 160
PM++T+ ML+WS+ E G+ + + I+WA DY +K P Y QVGD K
Sbjct: 64 LPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGK 123
Query: 161 DHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAI 220
DH+ W E M R K+D + PGS V T ++VF+ SDPTY++ + A
Sbjct: 124 DHSWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAK 183
Query: 221 RVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQ 280
+F ADK + + Y S S + D+L W A WL+ AT + TYL+ +
Sbjct: 184 NLFDMADKAK----SDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVP 239
Query: 281 ILGAAEYDNTFG------WDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGA 334
G + + WD+ H GA +LL+ KL + YKD + + G
Sbjct: 240 NWGKEQQTDIIAYKWGQXWDDVHYGAELLLA------KLTNKQLYKDSIEMNLDFWTTGV 293
Query: 335 SFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTI 394
+ + YTP GL + ++++ T+ +FL YA++ G T +
Sbjct: 294 NGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWE---------GCTPSKVSVYKDF 344
Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
K Q+DY LG S++VGYG PQ HHR AH S M S +
Sbjct: 345 LKSQIDYALGST--GRSFVVGYGVNPPQHPHHR-------TAH------GSWTDQMTSPT 389
Query: 455 PNPNVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL-AHSFG 506
+ + + GA+VGGPD D + DE +Y +E + NA GALA + HS G
Sbjct: 390 YHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGG 442
>pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module
pdb|4DOE|A Chain A, The Liganded Structure Of Cbescii Cela Gh9 Module
Length = 475
Score = 240 bits (613), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 162/472 (34%), Positives = 240/472 (50%), Gaps = 43/472 (9%)
Query: 44 NYRDALTKSIIFFEGQRSGKLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKFGF 103
NY +AL K+I+F+E Q SGKLP R WR DS L DG +DL GG++DAGD+VKF
Sbjct: 27 NYGEALQKAIMFYEFQMSGKLPNWVRNNWRGDSALKDGQDNGLDLTGGWFDAGDHVKFNL 86
Query: 104 PMAFTTTMLSWSVIEFGG--LMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDANKD 161
PM++T TMLSW+V E+ + G+L + I W DY +K Y QVGD +KD
Sbjct: 87 PMSYTGTMLSWAVYEYKDAFVKSGQLEHILNQIEWVNDYFVKCHPSKYVYYYQVGDGSKD 146
Query: 162 HACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAIR 221
HA W E M R K+ ++ PGS V ET S+V + +PT + ++ A
Sbjct: 147 HAWWGPAEVMQMERPSFKVTQSSPGSTVVAETAASLAAASIVLKDRNPTKAATYLQHAKE 206
Query: 222 VFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYL----NYIQV 277
+++FA+ + S+ +Y S+SG+ DEL W A WL+ AT + TYL +Y+Q
Sbjct: 207 LYEFAEVTK---SDAGYTAANGYYNSWSGFYDELSWAAVWLYLATNDSTYLTKAESYVQN 263
Query: 278 NGQILGAAEYDNTFG--WDNKHVGARILLSKAFLVQKLQSLHEYKDHADNFICSLIPGAS 335
+I G+ D + WD+ H GA +LL+ K+ YK ++ + G +
Sbjct: 264 WPKISGSNTIDYKWAHCWDDVHNGAALLLA------KITGKDIYKQIIESHLDYWTTGYN 317
Query: 336 FSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRL-RTI 394
+YTP GL + +++Y T+T+FL Y+ ++ G T K + R
Sbjct: 318 GERIKYTPKGLAWLDQWGSLRYATTTAFLAFVYSDWV----------GCPSTKKEIYRKF 367
Query: 395 AKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQS 454
+ Q+DY LG S++VG+G P+R HHR AH + S S +
Sbjct: 368 GESQIDYALGS--AGRSFVVGFGTNPPKRPHHR-------TAHSSWADSQSIPSYHR--- 415
Query: 455 PNPNVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFG 506
+ L GA+VGGP D + D+ +Y +E + NA VGALA + +G
Sbjct: 416 ---HTLYGALVGGPGSDDSYTDDISNYVNNEVACDYNAGFVGALAKMYQLYG 464
>pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticum
pdb|1IA7|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C.
Cellulolyticium In Complex With Cellobiose
Length = 441
Score = 213 bits (542), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 152/478 (31%), Positives = 228/478 (47%), Gaps = 64/478 (13%)
Query: 43 HNYRDALTKSIIFFEGQRSG-KLPPNQRMTWRRDSGLTDGAAMHVDLVGGYYDAGDNVKF 101
H+Y AL SIIFF+ + G + N WR TDG+ + VDL GGY+DAGD+VKF
Sbjct: 4 HDYSTALKDSIIFFDANKCGPQAGENNVFDWRGACHTTDGSDVGVDLTGGYHDAGDHVKF 63
Query: 102 GFPMAFTTTMLSWSVIEFGGLMK--GELRNAREAIRWATDYLLKATVHPDTIYVQVGDAN 159
G P ++ +L WS+ EF G + +++ TDY LK+ + T Y QVG+ N
Sbjct: 64 GLPQGYSAAILGWSLYEFKESFDATGNTTKMLQQLKYFTDYFLKSHPNSTTFYYQVGEGN 123
Query: 160 KDHACWERPEDMDTPR-TVIKIDRNYPGSDVAGETXXXXXXXSLVFRRSDPTYSKLLVRR 218
DH W PE+ R ++ K D + P SD+ ET L ++ D Y+ +
Sbjct: 124 ADHTYWGAPEEQTGQRPSLYKADPSSPASDILSETSAALTLMYLNYKNIDSAYATKCLNA 183
Query: 219 AIRVFQFADKYRGAYSNGLKKFVCPFYCSYSGYQDELLWGAAWLHKATRNPTYL----NY 274
A ++ +G NG FY + S + D+L W A WL+ AT + TY+ +
Sbjct: 184 AKELYAMGKANQGV-GNGQS-----FYQATS-FGDDLAWAATWLYTATNDSTYITDAEQF 236
Query: 275 IQVNGQILGAAEYDN-TFGWDNKHVGARILLSKAFLVQKLQSLHEYKDHAD-NFICSLIP 332
I + + D T WD+ +V A + L+ ++ YKD + NF
Sbjct: 237 ITLGNTMNENKMQDKWTMCWDDMYVPAALRLA------QITGKQIYKDAIEFNF------ 284
Query: 333 GASFSSAQY--TPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKR 390
++ Q TPGGL + + ++Y + S ++L Y K +
Sbjct: 285 --NYWKTQVTTTPGGLKWLSNWGVLRYAAAESMVMLVYCK--------------QNPDQS 328
Query: 391 LRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIM 450
L +AK+QVDYILGDNP MSY++GYG + HHR ++G++
Sbjct: 329 LLDLAKKQVDYILGDNPANMSYIIGYGSNWCIHPHHR---------------AANGYTYA 373
Query: 451 KSQSPNP--NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFG 506
+ P ++L GA+VGGPDQ+D+F D+ Y+ +E + NA LVG LA FG
Sbjct: 374 NGDNAKPAKHLLTGALVGGPDQNDKFLDDANQYQYTEVALDYNAGLVGVLAGAIKFFG 431
>pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt
Length = 611
Score = 167 bits (424), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 144/521 (27%), Positives = 216/521 (41%), Gaps = 94/521 (18%)
Query: 42 KHNYRDALTKSIIFFEGQRSGK-LPPNQRMTWRRD--------------SGLTDG----- 81
++NY AL S+ F++ G + N ++WR D + ++DG
Sbjct: 37 EYNYAKALQYSMFFYDANMCGTGVDENSLLSWRGDCHVYDARLPLDSQNTNMSDGFISSN 96
Query: 82 -------AAMHVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGLMK--GELRNARE 132
VD+ GG++DAGD+VKFG P A+ + + W EF + G+ +A
Sbjct: 97 RSVLDPDGDGKVDVSGGFHDAGDHVKFGLPEAYAASTVGWGYYEFKDQFRATGQAVHAEV 156
Query: 133 AIRWATDYLLKATVHPDTIYV-----QVGDANKDHACWERPEDMDTPRTVIKIDRNYPGS 187
+R+ DY ++ T + V QVGD + DHA W PE+ R I + PG+
Sbjct: 157 ILRYFNDYFMRCTFRDASGNVVAFCHQVGDGDIDHAFWGAPENDTMFRRGWFITKEKPGT 216
Query: 188 DVAGETXXXXXXXSLVFRRSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKKFVCPFYCS 247
D+ T + F+ +DP Y+ + A +F FA+K G + +Y S
Sbjct: 217 DIISATAASLAINYMNFKDTDPQYAAKSLDYAKALFDFAEKNPKGVVQG-EDGPKGYYGS 275
Query: 248 YSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQILGAAEYDNTFGWDNKHVGARILLSKA 307
S +QD+ W AAWL+ AT+N YL+ + +Y GW H + A
Sbjct: 276 -SKWQDDYCWAAAWLYLATQNEHYLD------EAFKYYDYYAPPGW--IHCWNDVWSGTA 326
Query: 308 FLVQKLQSLHEYKDHADNF---------------------ICSLIPGASFSSAQYTPGGL 346
++ ++ L Y + NF I L+ S TPGG
Sbjct: 327 CILAEINDL--YDKDSQNFEDRYKRASNKNQWEQIDFWKPIQDLLDKWSGGGITVTPGGY 384
Query: 347 LFKMSDSNMQYVTSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIAKRQVDYILGDN 406
+F + +Y T+ + L Y K+ T + A+ Q+DY+LG N
Sbjct: 385 VFLNQWGSARYNTAAQLIALVYDKHHGD------------TPSKYANWARSQMDYLLGKN 432
Query: 407 PLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVLVGAIVG 466
PL Y+VGY + HHR +SG SP+ VL GA+VG
Sbjct: 433 PLNRCYVVGYSSNSVKYPHHR---------------AASGLKDANDSSPHKYVLYGALVG 477
Query: 467 GPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSFGQ 507
GPD D+ D DY +E + NA VGA A L FG
Sbjct: 478 GPDASDQHVDRTNDYIYNEVAIDYNAAFVGACAGLYRFFGD 518
>pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The
Thermoacidophilic Alicyclobacillus Acidocaldarius
pdb|3GZK|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela
pdb|3H2W|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellobiose
pdb|3H3K|A Chain A, Structure Of A. Acidocaldarius Cellulase Cela In Complex
With Cellotetraose
pdb|3RX5|A Chain A, Structure Of Aacel9a In Complex With Cellotriose-Like
Isofagomine
pdb|3RX7|A Chain A, Structure Of Aacel9a In Complex With Cellotetraose-Like
Isofagomine
pdb|3RX8|A Chain A, Structure Of Aacel9a In Complex With Cellobiose-Like
Isofagomine
Length = 537
Score = 61.6 bits (148), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 117/511 (22%), Positives = 184/511 (36%), Gaps = 103/511 (20%)
Query: 42 KHNYRDALTKSIIFFEGQRSGK-LPPNQRMTWRRDSGLTDGAAMH-----VDLVGGYYDA 95
+ YRD L + FF+ Q G LP ++ W + T A + + GG++DA
Sbjct: 85 RRAYRDVLEAMLRFFDYQLCGVVLPEDEAGPWAHGACHTSDAKVFGTERALACPGGWHDA 144
Query: 96 GDNVKFGFPMAFTTTMLSWSVIEFGGLMK---------------GELRNAREAIRWATDY 140
GD K+ P A L + F + L ARE I W
Sbjct: 145 GDYGKYTVPAAKAVADLLLAHEYFPAALAHVRPMRSVHRAPHLPPALEVAREEIAWLLTM 204
Query: 141 LLKATVHPDTIYVQVGDANKDHACWERPEDMDTPRTVIKIDRNYPGSDVAGETXXXXXXX 200
AT +Y +V + RPED D P + I + A
Sbjct: 205 QDPAT---GGVYHKVTTPSFP-PLDTRPEDDDAPLVLSPISYAATATFCAA-----MAHA 255
Query: 201 SLVFRRSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKKFVCPFYCSYSGY-----QDEL 255
+LV+R DP S A R + + GA+ ++ F P Y +DEL
Sbjct: 256 ALVYRPFDPALSSCCADAARRAY----AWLGAHE--MQPFHNPDGILTGEYGDAELRDEL 309
Query: 256 LWGAAWLHKATRNPTYLNYIQVNGQILGAAEYDNTFGWDNKHVGARILLSKAFLVQKLQS 315
LW + L + T + + +V +L D W+ + +L +
Sbjct: 310 LWASCALLRMTGDSAW---ARVCEPLL-----DLDLPWELGWADVALYGVMDYLRTPRAA 361
Query: 316 LHEYKDHADNFICSLIP---GASFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYL 372
+ D N + S + A + A+ P G+ + D ++ ++ +LL A
Sbjct: 362 V---SDDVRNKVKSRLLRELDALAAMAESHPFGIPMRDDD----FIWGSNMVLLNRAMAF 414
Query: 373 TSAHRVVNCGGTTVTAKRLRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLP 432
A G V T+A+R DY+ G NPL Y+ G+G R + HHR S
Sbjct: 415 LLAE------GVGVLHPAAHTVAQRAADYLFGANPLGQCYVTGFGQRPVRHPHHRPSVAD 468
Query: 433 SVGAHPARIQCSSGFSIMKSQSPNPNVLVGAIVGGPDQH----------------DRFPD 476
V HP + G +VGGP++H + + D
Sbjct: 469 DVD-HP---------------------VPGMVVGGPNRHLQDEIARAQLAGRPAMEAYID 506
Query: 477 ERPDYEQSEPSTYMNAPLVGALAYLAHSFGQ 507
+ Y +E + Y N+P V +A L + G+
Sbjct: 507 HQDSYSTNEVAVYWNSPAVFVIAALLEARGR 537
>pdb|1UT9|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 609
Score = 32.3 bits (72), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 42/104 (40%), Gaps = 4/104 (3%)
Query: 399 VDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQ-CSSGFSIMKSQSPNP 457
+ Y+LG N + SY+ GYG R Q H R + + PA SG + + P
Sbjct: 504 ISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPAPPPGIISGGPNSRFEDPTI 563
Query: 458 NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYL 501
N V P Q F D + +E + NAP AYL
Sbjct: 564 NAAVKKDT--PPQK-CFIDHTDSWSTNEITVNWNAPFAWVTAYL 604
>pdb|2XR7|A Chain A, Crystal Structure Of Nicotiana Tabacum Malonyltransferase
(Ntmat1) Complexed With Malonyl-Coa
Length = 453
Score = 32.3 bits (72), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 47/122 (38%), Gaps = 7/122 (5%)
Query: 346 LLFKMSDSNMQYV-----TSTSFLLLTYAKYLTSAHRVVNCGGTTVTAKRLRTIAKRQVD 400
L +K+ S +V T L LT YL A V C LR + V
Sbjct: 41 LFYKLPISRPDFVQTIIPTLKDSLSLTLKYYLPLAGNVA-CPQDWSGYPELRYVTGNSVS 99
Query: 401 YILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVL 460
I ++ +Y++GY PR + +H L P +Q + +I + PN +
Sbjct: 100 VIFSESDXDFNYLIGYHPRNTKDFYHFVPQLAEPKDAPG-VQLAPVLAIQVTLFPNHGIS 158
Query: 461 VG 462
+G
Sbjct: 159 IG 160
>pdb|1CLC|A Chain A, Three-Dimensional Structure Of Endoglucanase D At 1.9
Angstroms Resolution
Length = 639
Score = 31.6 bits (70), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 93/445 (20%), Positives = 154/445 (34%), Gaps = 84/445 (18%)
Query: 80 DGAAMHVDLVGGYYDAGDNVKFGFPMAFT--TTMLSWS----VIEFGGLMKGELRNA--- 130
+G D G++DAGD K+ T + L+W +E L E N+
Sbjct: 174 NGQHTKKDSTKGWHDAGDYNKYVVNAGITVGSMFLAWEHFKDQLEPVALEIPEKNNSIPD 233
Query: 131 -REAIRWATDYLLKATVHPDTIYVQVGDANKDHACWERPEDMDTPRTVIKIDRNYPGSDV 189
+ +++ D++L + V + ++ + PE+ R + P S
Sbjct: 234 FLDELKYEIDWILTMQYPDGSGRVAHKVSTRNFGGFIMPENEHDERFFV------PWSSA 287
Query: 190 A-GETXXXXXXXSLVFRRSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKKFVCPFYCSY 248
A + + +FR DP Y++ + A ++F F Y +
Sbjct: 288 ATADFVAMTAMAARIFRPYDPQYAEKCINAAKVSYEFLKNNPANVFANQSGFSTGEYATV 347
Query: 249 SGYQDELLWGAAWLHKATRNPTYL-NYIQVNGQILGAAEYDNTFGWDNKHVGARILLSKA 307
S D+ LW AA + + + YL ++ Q E D F WDN
Sbjct: 348 SD-ADDRLWAAAEMWETLGDEEYLRDFENRAAQFSKKIEAD--FDWDN------------ 392
Query: 308 FLVQKLQSLHEYKDHADNFICSLIPGASFSSAQYTPGGLLFKMSDSNMQYVTSTSFLLLT 367
V L ++ S PG + + Q LL +DS ++ TS +
Sbjct: 393 --VANLGMF--------TYLLSERPGKNPALVQSIKDSLL-STADSIVR--TSQNH---G 436
Query: 368 YAKYLTSAHRVVNCGGTTVTAKRLRTIAKR-------------QVDYILGDNPLKMSYMV 414
Y + L + + C GT V + +A + + ++ G N SY+
Sbjct: 437 YGRTLGTTY-YWGCNGTVVRQTMILQVANKISPNNDYVNAALDAISHVFGRNYYNRSYVT 495
Query: 415 GYGPRYPQRIHHRGSSLPSVGAHPARIQCSSGFSIMKSQSPNPNVLVGAIVGGPDQHDRF 474
G G P H R S + P P LVG GP +
Sbjct: 496 GLGINPPMNPHDRRSGADGI------------------WEPWPGYLVGGGWPGPKD---W 534
Query: 475 PDERPDYEQSEPSTYMNAPLVGALA 499
D + Y+ +E + NA L+ ALA
Sbjct: 535 VDIQDSYQTNEIAINWNAALIYALA 559
>pdb|1RQ5|A Chain A, Structural Basis For The Exocellulase Activity Of The
Cellobiohydrolase Cbha From C. Thermocellum
Length = 610
Score = 31.2 bits (69), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 43/108 (39%), Gaps = 4/108 (3%)
Query: 399 VDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGAHPARIQ-CSSGFSIMKSQSPNP 457
+ Y+LG N + SY+ GYG R Q H R + + PA SG + + P
Sbjct: 504 ISYLLGRNAMDQSYVTGYGERPLQNPHDRFWTPQTSKRFPAPPPGIISGGPNSRFEDPTI 563
Query: 458 NVLVGAIVGGPDQHDRFPDERPDYEQSEPSTYMNAPLVGALAYLAHSF 505
N V P Q F D + ++ + NAP AYL +
Sbjct: 564 NAAVKKDT--PPQK-CFIDHTDSWGTNQITVNWNAPFAWVTAYLDEQY 608
>pdb|2AYX|A Chain A, Solution Structure Of The E.Coli Rcsc C-Terminus (Residues
700-949) Containing Linker Region And Phosphoreceiver
Domain
Length = 254
Score = 30.8 bits (68), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 4/46 (8%)
Query: 391 LRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVGA 436
L ++K +D +L D + M M GY R QRI G +LP +G
Sbjct: 166 LNVLSKNHIDIVLSD--VNMPNMDGY--RLTQRIRQLGLTLPVIGV 207
>pdb|3OLM|A Chain A, Structure And Function Of A Ubiquitin Binding Site Within
The Catalytic Domain Of A Hect Ubiquitin Ligase
Length = 429
Score = 30.4 bits (67), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 243 PFYC--SYSGYQDELLWGAAWLHKATRNPTYLNYIQVNGQILGAAEYDNTFGWDNKHVGA 300
PFYC YS Y + + + NP +LNY + G+++G + F D VGA
Sbjct: 134 PFYCLFEYSAYDN---YTIQINPNSGINPEHLNYFKFIGRVVGLGVFHRRF-LDAFFVGA 189
Query: 301 --RILLSKAFLVQKLQSL 316
+++L K ++Q ++ +
Sbjct: 190 LYKMMLRKKVVLQDMEGV 207
>pdb|2AYZ|A Chain A, Solution Structure Of The E.Coli Rcsc C-Terminus (Residues
817-949) Containing Phosphoreceiver Domain
Length = 133
Score = 30.4 bits (67), Expect = 2.4, Method: Composition-based stats.
Identities = 17/45 (37%), Positives = 24/45 (53%), Gaps = 4/45 (8%)
Query: 391 LRTIAKRQVDYILGDNPLKMSYMVGYGPRYPQRIHHRGSSLPSVG 435
L ++K +D +L D + M M GY R QRI G +LP +G
Sbjct: 45 LNVLSKNHIDIVLSD--VNMPNMDGY--RLTQRIRQLGLTLPVIG 85
>pdb|3GFX|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 4.5 CalciumCY-Digmp Complex
pdb|3GFX|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 4.5 CalciumCY-Digmp Complex
pdb|3GFY|A Chain A, Klebsiella Pneumoniae Blrp1 With Fmn And Cyclic Digmp, No
Metal Ions
pdb|3GFY|B Chain B, Klebsiella Pneumoniae Blrp1 With Fmn And Cyclic Digmp, No
Metal Ions
pdb|3GFZ|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 6 Manganese/cy-digmp
Complex
pdb|3GFZ|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 6 Manganese/cy-digmp
Complex
pdb|3GG0|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 9.0 ManganeseCY-Digmp
Complex
pdb|3GG0|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 9.0 ManganeseCY-Digmp
Complex
pdb|3GG1|A Chain A, Klebsiella Pneumoniae Blrp1 Ph 8.0 CalciumCY-Digmp Complex
pdb|3GG1|B Chain B, Klebsiella Pneumoniae Blrp1 Ph 8.0 CalciumCY-Digmp Complex
Length = 413
Score = 29.3 bits (64), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 122 LMKGELRNAREAIRWATDYLLKATVHPDTIYVQVGDAN 159
L+ G L + +A+ W D LL A + PD + ++V +
Sbjct: 248 LLPGSLYHHPDAVGWLMDSLLAAGLRPDQVLIEVTETE 285
Database: pdbaa
Posted date: Mar 3, 2013 10:34 PM
Number of letters in database: 14,973,337
Number of sequences in database: 62,578
Lambda K H
0.321 0.136 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,681,231
Number of Sequences: 62578
Number of extensions: 680172
Number of successful extensions: 1285
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 1220
Number of HSP's gapped (non-prelim): 23
length of query: 508
length of database: 14,973,337
effective HSP length: 103
effective length of query: 405
effective length of database: 8,527,803
effective search space: 3453760215
effective search space used: 3453760215
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (25.4 bits)