Citrus Sinensis ID: 010632
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 505 | ||||||
| 2290681 | 505 | acidic cellulase [Citrus sinensis] | 1.0 | 1.0 | 0.998 | 0.0 | |
| 224057986 | 494 | predicted protein [Populus trichocarpa] | 0.976 | 0.997 | 0.847 | 0.0 | |
| 255555765 | 494 | endo-1,4-beta-glucanase, putative [Ricin | 0.978 | 1.0 | 0.838 | 0.0 | |
| 429326590 | 494 | korrigan [Populus tomentosa] | 0.976 | 0.997 | 0.839 | 0.0 | |
| 13383303 | 494 | endo-1,4-beta glucanase [Populus alba] | 0.976 | 0.997 | 0.837 | 0.0 | |
| 347466579 | 494 | endo-1,4-beta-glucanase [Populus trichoc | 0.976 | 0.997 | 0.833 | 0.0 | |
| 429326592 | 494 | korrigan [Populus tomentosa] | 0.976 | 0.997 | 0.831 | 0.0 | |
| 4691450 | 494 | endo-1,4-beta glucanase [Populus alba] g | 0.976 | 0.997 | 0.829 | 0.0 | |
| 1125032 | 494 | cellulase precursor [Populus alba] | 0.976 | 0.997 | 0.833 | 0.0 | |
| 359474153 | 466 | PREDICTED: endoglucanase 1 isoform 1, pa | 0.912 | 0.989 | 0.874 | 0.0 |
| >gi|2290681|gb|AAB65155.1| acidic cellulase [Citrus sinensis] | Back alignment and taxonomy information |
|---|
Score = 1047 bits (2708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 504/505 (99%), Positives = 505/505 (100%)
Query: 1 MVLSILRKMDPVTKFSLTLQILGLTLCALSLLCSAFTFQDYSDALGKSILFFEGQRSGRL 60
MVLSILRKMDPVTKFSLTLQILGLTLCALSLLCSAFTFQDYSDALGKSILFFEGQRSGRL
Sbjct: 1 MVLSILRKMDPVTKFSLTLQILGLTLCALSLLCSAFTFQDYSDALGKSILFFEGQRSGRL 60
Query: 61 PPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLSWSVIEFGSSMQ 120
PPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLSWSVIEFGSSMQ
Sbjct: 61 PPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLSWSVIEFGSSMQ 120
Query: 121 NHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDHHCWERPEDMDTPRNVYKVSTQ 180
NHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDHHCWERPEDMDTPRNVYKVSTQ
Sbjct: 121 NHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDHHCWERPEDMDTPRNVYKVSTQ 180
Query: 181 NPGSDVAAETAAALAAASVVFKDSDPSYSTKLLKTAMKVFDFADKYRGSYSDSLNSVVCP 240
NPGSDVAAETAAALAAASVVFKDSDPSYSTKLLKTAMKVFDFADKYRGSYSDSLNSVVCP
Sbjct: 181 NPGSDVAAETAAALAAASVVFKDSDPSYSTKLLKTAMKVFDFADKYRGSYSDSLNSVVCP 240
Query: 241 FYCSYSGYLDELLWGASWLHRASQNSSYLAYIQSNGHILGADDDDYSFSWDDKRAGTKVL 300
+YCSYSGYLDELLWGASWLHRASQNSSYLAYIQSNGHILGADDDDYSFSWDDKRAGTKVL
Sbjct: 241 YYCSYSGYLDELLWGASWLHRASQNSSYLAYIQSNGHILGADDDDYSFSWDDKRAGTKVL 300
Query: 301 LSKGFLEKNTQEFQLYKAHSDNYICSLIPGSSSFQAQYTAGGLFYKASESNLQYVTTTAF 360
LSKGFLEKNTQEFQLYKAHSDNYICSLIPGSSSFQAQYTAGGLFYKASESNLQYVTTTAF
Sbjct: 301 LSKGFLEKNTQEFQLYKAHSDNYICSLIPGSSSFQAQYTAGGLFYKASESNLQYVTTTAF 360
Query: 361 LLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYILGDNPAKMSYMVGFGERYPQH 420
LLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYILGDNPAKMSYMVGFGERYPQH
Sbjct: 361 LLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYILGDNPAKMSYMVGFGERYPQH 420
Query: 421 VHHRGSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVLTGAILGGPDNRDNFADDRNNYQQ 480
VHHRGSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVLTGAILGGPDNRDNFADDRNNYQQ
Sbjct: 421 VHHRGSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVLTGAILGGPDNRDNFADDRNNYQQ 480
Query: 481 SEPATYINAPFVGAVAFFSSKTTTN 505
SEPATYINAPFVGAVAFFSSKTTTN
Sbjct: 481 SEPATYINAPFVGAVAFFSSKTTTN 505
|
Source: Citrus sinensis Species: Citrus sinensis Genus: Citrus Family: Rutaceae Order: Sapindales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224057986|ref|XP_002299423.1| predicted protein [Populus trichocarpa] gi|222846681|gb|EEE84228.1| predicted protein [Populus trichocarpa] gi|347466577|gb|AEO97201.1| endo-1,4-beta-glucanase [Populus trichocarpa] gi|347466631|gb|AEO97228.1| endo-1,4-beta-glucanase [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|255555765|ref|XP_002518918.1| endo-1,4-beta-glucanase, putative [Ricinus communis] gi|223541905|gb|EEF43451.1| endo-1,4-beta-glucanase, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|429326590|gb|AFZ78635.1| korrigan [Populus tomentosa] | Back alignment and taxonomy information |
|---|
| >gi|13383303|dbj|BAB39482.1| endo-1,4-beta glucanase [Populus alba] | Back alignment and taxonomy information |
|---|
| >gi|347466579|gb|AEO97202.1| endo-1,4-beta-glucanase [Populus trichocarpa] gi|347466633|gb|AEO97229.1| endo-1,4-beta-glucanase [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|429326592|gb|AFZ78636.1| korrigan [Populus tomentosa] | Back alignment and taxonomy information |
|---|
| >gi|4691450|dbj|BAA77239.1| endo-1,4-beta glucanase [Populus alba] gi|13383305|dbj|BAB39483.1| endo-1,4-beta-glucanase [Populus alba] | Back alignment and taxonomy information |
|---|
| >gi|1125032|dbj|BAA06877.1| cellulase precursor [Populus alba] | Back alignment and taxonomy information |
|---|
| >gi|359474153|ref|XP_002279780.2| PREDICTED: endoglucanase 1 isoform 1, partial [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 505 | ||||||
| TAIR|locus:2024670 | 501 | CEL2 "cellulase 2" [Arabidopsi | 0.904 | 0.912 | 0.613 | 8e-156 | |
| TAIR|locus:2137824 | 516 | GH9B13 "glycosyl hydrolase 9B1 | 0.908 | 0.889 | 0.595 | 4.5e-153 | |
| TAIR|locus:2017704 | 484 | CEL5 "cellulase 5" [Arabidopsi | 0.904 | 0.944 | 0.568 | 3.7e-142 | |
| TAIR|locus:2825314 | 484 | CEL3 "cellulase 3" [Arabidopsi | 0.912 | 0.952 | 0.565 | 7.8e-142 | |
| TAIR|locus:2033600 | 492 | GH9B1 "glycosyl hydrolase 9B1" | 0.916 | 0.941 | 0.547 | 3e-140 | |
| TAIR|locus:2028015 | 489 | GH9B6 "glycosyl hydrolase 9B6" | 0.910 | 0.940 | 0.557 | 7.2e-139 | |
| TAIR|locus:2120242 | 497 | GH9B18 "glycosyl hydrolase 9B1 | 0.912 | 0.927 | 0.543 | 6.5e-138 | |
| TAIR|locus:2118519 | 478 | GH9B14 "glycosyl hydrolase 9B1 | 0.897 | 0.947 | 0.508 | 7.2e-123 | |
| TAIR|locus:2120232 | 493 | GH9B17 "glycosyl hydrolase 9B1 | 0.912 | 0.935 | 0.476 | 2.4e-122 | |
| TAIR|locus:2128394 | 479 | GH9B15 "glycosyl hydrolase 9B1 | 0.897 | 0.945 | 0.497 | 2.3e-119 |
| TAIR|locus:2024670 CEL2 "cellulase 2" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 1519 (539.8 bits), Expect = 8.0e-156, P = 8.0e-156
Identities = 282/460 (61%), Positives = 342/460 (74%)
Query: 40 DYSDALGKSILFFEGQRSGRLPPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGL 99
+Y DAL KSILFFEGQRSG+LPPNQ++TWR NSGLSDGS+ +VDLVGGYYDAGDN+KFG
Sbjct: 43 NYKDALSKSILFFEGQRSGKLPPNQRMTWRSNSGLSDGSALNVDLVGGYYDAGDNMKFGF 102
Query: 100 PMAFTTTLLSWSVIEFGSSMQNHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDH 159
PMAFTTT+LSWS+IEFG M++ L NAK AIRW TD+LLKA T+ P +YVQVGDPNMDH
Sbjct: 103 PMAFTTTMLSWSLIEFGGLMKSELPNAKDAIRWATDFLLKA-TSHPDTIYVQVGDPNMDH 161
Query: 160 HCWERPEDMDTPRNVYKVSTQNPGSDVXXXXXXXXXXXSVVFKDSDPSYSTKLLKTAMKV 219
CWERPEDMDTPR+V+KV NPGSD+ S+VF+ DPSYS LL+ A+ V
Sbjct: 162 ACWERPEDMDTPRSVFKVDKNNPGSDIAGEIAAALAAASIVFRKCDPSYSNHLLQRAITV 221
Query: 220 FDFADKYRGSYSDSLNSVVCPFYCSYSGYLDELLWGASWLHRASQNSSYLAYIQSNGHIL 279
F FADKYRG YS L VCPFYCSYSGY DELLWGA+WL +A+ N +YL YI++NG IL
Sbjct: 222 FTFADKYRGPYSAGLAPEVCPFYCSYSGYQDELLWGAAWLQKATNNPTYLNYIKANGQIL 281
Query: 280 GAXXXXXXXXXXXKRAGTKVLLSKGFLEKNTQEFQLYKAHSDNYICSLIPGSSSFQAQYT 339
GA K G ++LLSK FL + + + YK H+D++ICS++PG+SS +QYT
Sbjct: 282 GADEFDNMFSWDNKHVGARILLSKEFLIQKVKSLEEYKEHADSFICSVLPGASS--SQYT 339
Query: 340 AGGLFYKASESNLQYVTTTAFLLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYI 399
GGL +K ESN+QYVT+T+FLLLTYAKYL+S VA CG S V L ++AKKQVDY+
Sbjct: 340 PGGLLFKMGESNMQYVTSTSFLLLTYAKYLTSARTVAYCGGSVVTPARLRSIAKKQVDYL 399
Query: 400 LGDNPAKMSYMVGFGERYPQHVHHRGSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVLTG 459
LG NP KMSYMVG+G +YP+ +HHRGSSLPS+ HP I C+DGF S+SPNPN L G
Sbjct: 400 LGGNPLKMSYMVGYGLKYPRRIHHRGSSLPSVAVHPTRIQCHDGFSLFTSQSPNPNDLVG 459
Query: 460 AILGGPXXXXXXXXXXXXYQQSEPATYINAPFVGAVAFFS 499
A++GGP Y +SEPATYINAP VGA+A+ +
Sbjct: 460 AVVGGPDQNDQFPDERSDYGRSEPATYINAPLVGALAYLA 499
|
|
| TAIR|locus:2137824 GH9B13 "glycosyl hydrolase 9B13" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2017704 CEL5 "cellulase 5" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2825314 CEL3 "cellulase 3" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2033600 GH9B1 "glycosyl hydrolase 9B1" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2028015 GH9B6 "glycosyl hydrolase 9B6" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2120242 GH9B18 "glycosyl hydrolase 9B18" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2118519 GH9B14 "glycosyl hydrolase 9B14" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2120232 GH9B17 "glycosyl hydrolase 9B17" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2128394 GH9B15 "glycosyl hydrolase 9B15" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 505 | |||
| PLN02266 | 510 | PLN02266, PLN02266, endoglucanase | 0.0 | |
| PLN02308 | 492 | PLN02308, PLN02308, endoglucanase | 0.0 | |
| pfam00759 | 437 | pfam00759, Glyco_hydro_9, Glycosyl hydrolase famil | 0.0 | |
| PLN03009 | 495 | PLN03009, PLN03009, cellulase | 0.0 | |
| PLN02175 | 484 | PLN02175, PLN02175, endoglucanase | 0.0 | |
| PLN02613 | 498 | PLN02613, PLN02613, endoglucanase | 0.0 | |
| PLN02171 | 629 | PLN02171, PLN02171, endoglucanase | 0.0 | |
| PLN00119 | 489 | PLN00119, PLN00119, endoglucanase | 0.0 | |
| PLN02340 | 614 | PLN02340, PLN02340, endoglucanase | 1e-179 | |
| PLN02420 | 525 | PLN02420, PLN02420, endoglucanase | 1e-175 | |
| PLN02909 | 486 | PLN02909, PLN02909, Endoglucanase | 1e-174 | |
| PLN02345 | 469 | PLN02345, PLN02345, endoglucanase | 1e-171 |
| >gnl|CDD|215150 PLN02266, PLN02266, endoglucanase | Back alignment and domain information |
|---|
Score = 739 bits (1908), Expect = 0.0
Identities = 316/501 (63%), Positives = 383/501 (76%), Gaps = 17/501 (3%)
Query: 15 FSLTLQILGLTLCALSLLCSAFTF----------------QDYSDALGKSILFFEGQRSG 58
S TL L + L LLC+ F+ +Y DAL KSILFFEGQRSG
Sbjct: 5 LSSTLLRLFIFLAFSLLLCNGFSSSSNNPFHHRHHPRLASHNYRDALTKSILFFEGQRSG 64
Query: 59 RLPPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLSWSVIEFGSS 118
+LP +Q++TWR +SGLSDGS+ HVDLVGGYYDAGDNVKFG PMAFTTT+LSWSVIEFG
Sbjct: 65 KLPSSQRMTWRRDSGLSDGSAMHVDLVGGYYDAGDNVKFGFPMAFTTTMLSWSVIEFGGL 124
Query: 119 MQNHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDHHCWERPEDMDTPRNVYKVS 178
M++ L+NAK AIRW TDYLLKA TA P +YVQVGD N DH CWERPEDMDTPR+V+KV
Sbjct: 125 MKSELQNAKDAIRWATDYLLKA-TAHPDTIYVQVGDANKDHACWERPEDMDTPRSVFKVD 183
Query: 179 TQNPGSDVAAETAAALAAASVVFKDSDPSYSTKLLKTAMKVFDFADKYRGSYSDSLNSVV 238
PGSDVAAETAAALAAAS+VF+ SDP+YS L++ A++VF FADKYRG+YS+ L V
Sbjct: 184 KNTPGSDVAAETAAALAAASLVFRKSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKPDV 243
Query: 239 CPFYCSYSGYLDELLWGASWLHRASQNSSYLAYIQSNGHILGADDDDYSFSWDDKRAGTK 298
CPFYCSYSGY DELLWGA+WLH+A++N +YL YIQ NG ILGAD+ D +F WD+K G +
Sbjct: 244 CPFYCSYSGYQDELLWGAAWLHKATKNPTYLNYIQVNGQILGADEFDNTFGWDNKHVGAR 303
Query: 299 VLLSKGFLEKNTQEFQLYKAHSDNYICSLIPGSSSFQAQYTAGGLFYKASESNLQYVTTT 358
+LLSK FL + Q YK H+DN+ICS+IPG+ QYT GGL +K S+SN+QYVT+T
Sbjct: 304 ILLSKAFLVQKVQSLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMSDSNMQYVTST 363
Query: 359 AFLLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYILGDNPAKMSYMVGFGERYP 418
+FLLLTYAKYL+S V CG + V L ++AKKQVDY+LGDNP KMSYMVG+G RYP
Sbjct: 364 SFLLLTYAKYLTSAKTVVNCGGTVVTPARLRSIAKKQVDYLLGDNPLKMSYMVGYGPRYP 423
Query: 419 QHVHHRGSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVLTGAILGGPDNRDNFADDRNNY 478
+ +HHRGSSLPS+ AHP I C+ GF + S+SPNPNVL GA++GGPD D F D+R++Y
Sbjct: 424 RRIHHRGSSLPSVAAHPAKIQCSQGFSIMNSQSPNPNVLVGAVVGGPDQHDRFPDERSDY 483
Query: 479 QQSEPATYINAPFVGAVAFFS 499
+QSEPATYINAP VGA+A+ +
Sbjct: 484 EQSEPATYINAPLVGALAYLA 504
|
Length = 510 |
| >gnl|CDD|177943 PLN02308, PLN02308, endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|216104 pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9 | Back alignment and domain information |
|---|
| >gnl|CDD|166650 PLN03009, PLN03009, cellulase | Back alignment and domain information |
|---|
| >gnl|CDD|177832 PLN02175, PLN02175, endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|215331 PLN02613, PLN02613, endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|215115 PLN02171, PLN02171, endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|177732 PLN00119, PLN00119, endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|215194 PLN02340, PLN02340, endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|178040 PLN02420, PLN02420, endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|178497 PLN02909, PLN02909, Endoglucanase | Back alignment and domain information |
|---|
| >gnl|CDD|177979 PLN02345, PLN02345, endoglucanase | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 505 | |||
| PLN02266 | 510 | endoglucanase | 100.0 | |
| PLN02308 | 492 | endoglucanase | 100.0 | |
| PLN02175 | 484 | endoglucanase | 100.0 | |
| PLN02340 | 614 | endoglucanase | 100.0 | |
| PLN00119 | 489 | endoglucanase | 100.0 | |
| PLN02420 | 525 | endoglucanase | 100.0 | |
| PLN02909 | 486 | Endoglucanase | 100.0 | |
| PLN02613 | 498 | endoglucanase | 100.0 | |
| PLN03009 | 495 | cellulase | 100.0 | |
| PLN02171 | 629 | endoglucanase | 100.0 | |
| PLN02345 | 469 | endoglucanase | 100.0 | |
| PF00759 | 444 | Glyco_hydro_9: Glycosyl hydrolase family 9; InterP | 100.0 | |
| cd00249 | 384 | AGE AGE domain; N-acyl-D-glucosamine 2-epimerase d | 97.16 | |
| cd00249 | 384 | AGE AGE domain; N-acyl-D-glucosamine 2-epimerase d | 95.83 | |
| KOG2787 | 403 | consensus Lanthionine synthetase C-like protein 1 | 94.83 | |
| COG1331 | 667 | Highly conserved protein containing a thioredoxin | 94.81 | |
| PF03663 | 370 | Glyco_hydro_76: Glycosyl hydrolase family 76 ; Int | 94.5 | |
| PF07221 | 346 | GlcNAc_2-epim: N-acylglucosamine 2-epimerase (GlcN | 91.17 | |
| PF07944 | 520 | DUF1680: Putative glycosyl hydrolase of unknown fu | 91.04 | |
| COG3533 | 589 | Uncharacterized protein conserved in bacteria [Fun | 85.21 |
| >PLN02266 endoglucanase | Back alignment and domain information |
|---|
Probab=100.00 E-value=1e-126 Score=1010.31 Aligned_cols=485 Identities=64% Similarity=1.132 Sum_probs=445.7
Q ss_pred HHHHHHHHHhh-hhh-c--------ccCCChhhHHHHHHHHHHhhHHhcCCCCCCCCCCCCCCCCCCCCCCCcccccccc
Q 010632 18 TLQILGLTLCA-LSL-L--------CSAFTFQDYSDALGKSILFFEGQRSGRLPPNQQLTWRGNSGLSDGSSYHVDLVGG 87 (505)
Q Consensus 18 ~~~~~~~~~~~-~~~-~--------~~~~~~~~Y~~~l~~sl~ff~~QR~G~l~~~~~~~w~~~s~l~Dg~~~~~dlsGG 87 (505)
+|+..-+++|+ .++ + ...+++++|.++|++||+||++||||++|++++++||+++|+.||.+.++||+||
T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Y~~aL~~sl~fy~~QRsG~lp~~~~~~Wr~ds~~~Dg~~~~~DlsGG 93 (510)
T PLN02266 14 IFLAFSLLLCNGFSSSSNNPFHHRHHPRLASHNYRDALTKSILFFEGQRSGKLPSSQRMTWRRDSGLSDGSAMHVDLVGG 93 (510)
T ss_pred HHHHHHHHhcCCCccccCCcccccCCCCCCcchHHHHHHHHHHHHHHhcCcCCCcccCCCCcccCCCCCCCcCcccCCCc
Confidence 34445567787 222 2 3567889999999999999999999999999999999999999999999999999
Q ss_pred eeccCCCcccccchHHHHHHHHHHHHHhhhcchhhHHHHHHHHHHHHHHHHHcccCCCCceEEEcCCCCCCCCCCCCCCC
Q 010632 88 YYDAGDNVKFGLPMAFTTTLLSWSVIEFGSSMQNHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDHHCWERPED 167 (505)
Q Consensus 88 W~DAGD~~Ky~~~~a~s~~~L~~a~~ef~~~~~~~~pdiLde~kwg~D~llkmq~~~~g~~y~~vg~~~~dh~~W~~Pe~ 167 (505)
|||||||+||++|+|+++++|+|+++||++.|.+++||||||||||+|||||| |+.+|+||+|||++..||.+|++||+
T Consensus 94 wyDAGD~~Kf~~p~a~s~t~L~w~~~ef~~~~~~~~pd~Ldelkw~~D~llk~-~~~~~~vy~qVg~~~~Dh~~W~~Pe~ 172 (510)
T PLN02266 94 YYDAGDNVKFGFPMAFTTTMLSWSVIEFGGLMKSELQNAKDAIRWATDYLLKA-TAHPDTIYVQVGDANKDHACWERPED 172 (510)
T ss_pred ceeCCCCceecchHHHHHHHHHHHHHhhhhccccccHHHHHHHHHHHHHHHHh-ccCCCeEEEEeCCCCCCcccCCChhh
Confidence 99999999999999999999999999999999999999999999999999999 88999999999999999999999999
Q ss_pred CCCCcceeecCCCCCchHHHHHHHHHHHHHhhhcccCChHHHHHHHHHHHHHHHHHHHccCCCCCCCCCcccccccCCCC
Q 010632 168 MDTPRNVYKVSTQNPGSDVAAETAAALAAASVVFKDSDPSYSTKLLKTAMKVFDFADKYRGSYSDSLNSVVCPFYCSYSG 247 (505)
Q Consensus 168 ~~~~R~~~~~~~~~p~s~~a~~~aAalA~as~vf~~~D~~~A~~~L~~A~~ay~~a~~~~~~y~~~~~~~~~g~Y~s~~~ 247 (505)
++.+|++|.|+.++|||++++++|||||+||||||++||+||++||++||++|+||+++|+.|.+.+....+++|.+.++
T Consensus 173 ~~~~R~~y~i~~~~pgsd~a~e~AAALAaas~vfk~~D~~yA~~~L~~Ak~ly~fa~~~~g~y~~~~~~~~~~~y~s~s~ 252 (510)
T PLN02266 173 MDTPRSVFKVDKNTPGSDVAAETAAALAAASLVFRKSDPTYSKLLVRRAIRVFQFADKYRGAYSNGLKPDVCPFYCSYSG 252 (510)
T ss_pred cCCCCeeEEeCCCCCchHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHhCCCCccCCCCcccCCCcccCCc
Confidence 99999999999999999999999999999999999999999999999999999999999999877543334678887578
Q ss_pred chhHHHHHHHHHHHHcCChhHHHHHHHcccccCCCCCCCccCccchhhHHHHhhcccccccchHHHHHHHHHHHHHHHHh
Q 010632 248 YLDELLWGASWLHRASQNSSYLAYIQSNGHILGADDDDYSFSWDDKRAGTKVLLSKGFLEKNTQEFQLYKAHSDNYICSL 327 (505)
Q Consensus 248 ~~De~~wAA~eLy~aTgd~~Yl~~~~~~~~~~~~~~~~~~~~Wd~~~~~~~~lla~~~~~~~~~~~~~~~~~ad~~~~~~ 327 (505)
+.||++|||+|||++|||++|+++++.....++.....+.++||++..|+++||++.+..+..+.++.|++.++.++|.+
T Consensus 253 ~~DEl~WAAawLy~ATGd~~Yl~~~~~~~~~~g~~~~~~~~~WD~k~~ga~vLLa~~~~~~~~~~~~~yk~~~d~~~~~~ 332 (510)
T PLN02266 253 YQDELLWGAAWLHKATKNPTYLNYIQVNGQILGADEFDNTFGWDNKHVGARILLSKAFLVQKVQSLHEYKGHADNFICSV 332 (510)
T ss_pred chHHHHHHHHHHHHHhCCHHHHHHHHHHHhhccccccCCccCcchhHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHhc
Confidence 99999999999999999999999998875545433345689999999999999988544444567788999999999999
Q ss_pred CCCCCCCcccccCCccccccCCChHHHHHHHHHHHHHHHhHhhhCCCccccCCCccchHHHHHHHHhcccccccCCCCCc
Q 010632 328 IPGSSSFQAQYTAGGLFYKASESNLQYVTTTAFLLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYILGDNPAKM 407 (505)
Q Consensus 328 ~~~~~~~~~~~t~~g~~w~~~w~n~~~~~n~a~l~~~y~~~l~~~~~~~~~~~~~~~~~~y~~~a~~qldYiLG~Np~~~ 407 (505)
.+++++..+++||||+.|..+|||+||++|++||+++|++++..++.++.||+......+|+++|++|||||||+||+++
T Consensus 333 ~~~~~~~~~~~TpGGL~~~~~wg~lrYa~~~afla~vya~~l~~~~~~~~cg~~~~~~~~~~~fA~~QidYiLG~NP~~~ 412 (510)
T PLN02266 333 IPGAPFSSTQYTPGGLLFKMSDSNMQYVTSTSFLLLTYAKYLTSAKTVVNCGGTVVTPARLRSIAKKQVDYLLGDNPLKM 412 (510)
T ss_pred cCCCCCCccccCCCeeEEeCCCCcHHHHHHHHHHHHHHHHHHHhcCCceecCCCccCHHHHHHHHHhhhceeccCCCCCC
Confidence 88887767899999999999999999999999999999999988777888998777889999999999999999999999
Q ss_pred ceEeeeCCCCCCCccccCCCCCCCCCCCCccccCCCcccccCCCCCCCCcccccccCCCCCCCcccCcCccccCcccccC
Q 010632 408 SYMVGFGERYPQHVHHRGSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVLTGAILGGPDNRDNFADDRNNYQQSEPATYI 487 (505)
Q Consensus 408 SyVtG~G~~~p~~pHHR~s~~~~~~~~p~~~~c~~g~~~~~~~~pn~~~l~GalvGGPn~~d~y~D~~~~y~~nEvaId~ 487 (505)
|||||||.|+|++||||+||||++..+|.++.|.+||.|++++.||||+|.||||||||..|.|.|+|.+|++|||||||
T Consensus 413 SyvVG~G~~~P~~pHHR~as~p~~~~~~~~~~C~~g~~~~~~~~~n~~vL~GAlVGGPd~~D~y~D~r~~y~~nEva~dy 492 (510)
T PLN02266 413 SYMVGYGPRYPRRIHHRGSSLPSVAAHPAKIQCSQGFSIMNSQSPNPNVLVGAVVGGPDQHDRFPDERSDYEQSEPATYI 492 (510)
T ss_pred ceEEecCCCCCccccccCCCCCCcccCccccCCCCCccccCCCCCCcceecceeecCCCCCCCCCccccccccCcceeec
Confidence 99999999999999999999999988899999999999999999999999999999999999999999999999999999
Q ss_pred chHHHHHHHHhhcccC
Q 010632 488 NAPFVGAVAFFSSKTT 503 (505)
Q Consensus 488 NA~lv~~la~l~~~~~ 503 (505)
||+||++||+|.+.-+
T Consensus 493 NA~~vgalA~l~~~yg 508 (510)
T PLN02266 493 NAPLVGALAYLAHSYG 508 (510)
T ss_pred chHHHHHHHHHHHHhc
Confidence 9999999999987644
|
|
| >PLN02308 endoglucanase | Back alignment and domain information |
|---|
| >PLN02175 endoglucanase | Back alignment and domain information |
|---|
| >PLN02340 endoglucanase | Back alignment and domain information |
|---|
| >PLN00119 endoglucanase | Back alignment and domain information |
|---|
| >PLN02420 endoglucanase | Back alignment and domain information |
|---|
| >PLN02909 Endoglucanase | Back alignment and domain information |
|---|
| >PLN02613 endoglucanase | Back alignment and domain information |
|---|
| >PLN03009 cellulase | Back alignment and domain information |
|---|
| >PLN02171 endoglucanase | Back alignment and domain information |
|---|
| >PLN02345 endoglucanase | Back alignment and domain information |
|---|
| >PF00759 Glyco_hydro_9: Glycosyl hydrolase family 9; InterPro: IPR001701 O-Glycosyl hydrolases 3 | Back alignment and domain information |
|---|
| >cd00249 AGE AGE domain; N-acyl-D-glucosamine 2-epimerase domain; Responsible for intermediate epimerization during biosynthesis of N-acetylneuraminic acid | Back alignment and domain information |
|---|
| >cd00249 AGE AGE domain; N-acyl-D-glucosamine 2-epimerase domain; Responsible for intermediate epimerization during biosynthesis of N-acetylneuraminic acid | Back alignment and domain information |
|---|
| >KOG2787 consensus Lanthionine synthetase C-like protein 1 [Defense mechanisms] | Back alignment and domain information |
|---|
| >COG1331 Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones] | Back alignment and domain information |
|---|
| >PF03663 Glyco_hydro_76: Glycosyl hydrolase family 76 ; InterPro: IPR005198 O-Glycosyl hydrolases 3 | Back alignment and domain information |
|---|
| >PF07221 GlcNAc_2-epim: N-acylglucosamine 2-epimerase (GlcNAc 2-epimerase); InterPro: IPR010819 N-acylglucosamine 2-epimerase (AGE, 5 | Back alignment and domain information |
|---|
| >PF07944 DUF1680: Putative glycosyl hydrolase of unknown function (DUF1680); InterPro: IPR012878 The members of this family are sequences derived from hypothetical bacterial and eukaryotic proteins of unknown function | Back alignment and domain information |
|---|
| >COG3533 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 505 | ||||
| 1ks8_A | 433 | The Structure Of Endoglucanase From Termite, Nasuti | 3e-74 | ||
| 1js4_A | 605 | EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA Le | 1e-73 | ||
| 2xfg_A | 466 | Reassembly And Co-Crystallization Of A Family 9 Pro | 5e-59 | ||
| 4dod_A | 475 | The Structure Of Cbescii Cela Gh9 Module Length = 4 | 6e-57 | ||
| 1ga2_A | 614 | The Crystal Structure Of Endoglucanase 9g From Clos | 2e-54 | ||
| 1k72_A | 614 | The X-ray Crystal Structure Of Cel9g Complexed With | 2e-54 | ||
| 1ia6_A | 441 | Crystal Structure Of The Cellulase Cel9m Of C. Cell | 2e-48 | ||
| 2yik_A | 611 | Catalytic Domain Of Clostridium Thermocellum Celt L | 4e-37 | ||
| 3ez8_A | 537 | Crystal Structure Of Endoglucanase Cel9a From The T | 4e-08 |
| >pdb|1KS8|A Chain A, The Structure Of Endoglucanase From Termite, Nasutitermes Takasagoensis, At Ph 2.5. Length = 433 | Back alignment and structure |
|
| >pdb|1JS4|A Chain A, EndoEXOCELLULASE:CELLOBIOSE FROM THERMOMONOSPORA Length = 605 | Back alignment and structure |
| >pdb|2XFG|A Chain A, Reassembly And Co-Crystallization Of A Family 9 Processive Endoglucanase From Separately Expressed Gh9 And Cbm3c Modules Length = 466 | Back alignment and structure |
| >pdb|4DOD|A Chain A, The Structure Of Cbescii Cela Gh9 Module Length = 475 | Back alignment and structure |
| >pdb|1GA2|A Chain A, The Crystal Structure Of Endoglucanase 9g From Clostridium Cellulolyticum Complexed With Cellobiose Length = 614 | Back alignment and structure |
| >pdb|1K72|A Chain A, The X-ray Crystal Structure Of Cel9g Complexed With Cellotriose Length = 614 | Back alignment and structure |
| >pdb|1IA6|A Chain A, Crystal Structure Of The Cellulase Cel9m Of C. Cellulolyticum Length = 441 | Back alignment and structure |
| >pdb|2YIK|A Chain A, Catalytic Domain Of Clostridium Thermocellum Celt Length = 611 | Back alignment and structure |
| >pdb|3EZ8|A Chain A, Crystal Structure Of Endoglucanase Cel9a From The Thermoacidophilic Alicyclobacillus Acidocaldarius Length = 537 | Back alignment and structure |
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 505 | |||
| 1ks8_A | 433 | Endo-B-1,4-glucanase; cellulase, endoglucanase, te | 1e-178 | |
| 2xfg_A | 466 | Endoglucanase 1; hydrolase-sugar binding protein c | 1e-178 | |
| 1tf4_A | 605 | T. fusca endo/EXO-cellulase E4 catalytic domain an | 1e-178 | |
| 1g87_A | 614 | Endocellulase 9G; endoglucanase, cellulose binding | 1e-174 | |
| 1ia6_A | 441 | Cellulase CEL9M; cellullase, alpha barrel, hydrola | 1e-167 | |
| 2yik_A | 611 | Endoglucanase; hydrolase; 2.10A {Clostridium therm | 1e-166 | |
| 3gzk_A | 537 | Cellulase; fold from GH9 from CAZY database, glyco | 1e-165 | |
| 1clc_A | 639 | Endoglucanase CELD; EC: 3.2.1.4; cellulase, glycos | 1e-153 | |
| 1ut9_A | 609 | Cellulose 1,4-beta-cellobiosidase; hydrolase, glyc | 1e-137 | |
| 3h7l_A | 586 | Endoglucanase; dehydrogenase, PSI-2, NYSGXRC, stru | 6e-86 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 2e-10 |
| >1ks8_A Endo-B-1,4-glucanase; cellulase, endoglucanase, termite, glycosyl hydrolase, family 9, (alpha/alpha)6; 1.40A {Nasutitermes takasagoensis} SCOP: a.102.1.2 PDB: 1ksc_A 1ksd_A Length = 433 | Back alignment and structure |
|---|
Score = 507 bits (1307), Expect = e-178
Identities = 188/463 (40%), Positives = 258/463 (55%), Gaps = 38/463 (8%)
Query: 40 DYSDALGKSILFFEGQRSGRLPPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGL 99
DY L S+LF+E QRSGRLP +Q++TWR +S L+D DL GGY+DAGD VKFG
Sbjct: 4 DYKQVLRDSLLFYEAQRSGRLPADQKVTWRKDSALNDQGDQGQDLTGGYFDAGDFVKFGF 63
Query: 100 PMAFTTTLLSWSVIEFGSSMQ--NHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNM 157
PMA+T T+L+W +I+F + L++ + A++W TDY +KA T+ Y QVG +
Sbjct: 64 PMAYTATVLAWGLIDFEAGYSSAGALDDGRKAVKWATDYFIKAHTS-QNEFYGQVGQGDA 122
Query: 158 DHHCWERPEDMDTPRNVYKVSTQNPGSDVAAETAAALAAASVVFKDSDPSYSTKLLKTAM 217
DH W RPEDM R YK+ T PGSD+A ETAAALAAAS+VF++ D +YS LL A
Sbjct: 123 DHAFWGRPEDMTMARPAYKIDTSRPGSDLAGETAAALAAASIVFRNVDGTYSNNLLTHAR 182
Query: 218 KVFDFADKYRGSYSDSLNSVVCPFYCSYSGYLDELLWGASWLHRASQNSSYLAYIQSNGH 277
++FDFA+ YRG YSDS + FY S + Y DEL+W A+WL+RA+ +++YL +S
Sbjct: 183 QLFDFANNYRGKYSDS-ITDARNFYAS-ADYRDELVWAAAWLYRATNDNTYLNTAESLYD 240
Query: 278 ILGADDDDYSFSWDDKRAGTKVLLSKGFLEKNTQEFQLYKAHSDNYICSLIPGSSSFQAQ 337
G + +WD K +G +VLL+K Q YK +Y+ LI Q
Sbjct: 241 EFGLQNWGGGLNWDSKVSGVQVLLAK------LTNKQAYKDTVQSYVNYLINN-----QQ 289
Query: 338 YTAGGLFYKASESNLQYVTTTAFLLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVD 397
T GL Y L++ AF++L A+ + A + A+ Q+D
Sbjct: 290 KTPKGLLYIDMWGTLRHAANAAFIMLEAAELG-------------LSASSYRQFAQTQID 336
Query: 398 YILGDNPAKMSYMVGFGERYPQHVHHRGSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVL 457
Y LGD S++ GFG P HHR SS P A + S PN +VL
Sbjct: 337 YALGDGG--RSFVCGFGSNPPTRPHHRSSSCPPAPATC-------DWNTFNSPDPNYHVL 387
Query: 458 TGAILGGPDNRDNFADDRNNYQQSEPATYINAPFVGAVAFFSS 500
+GA++GGPD DN+ DDR++Y +E AT NA F A+A +
Sbjct: 388 SGALVGGPDQNDNYVDDRSDYVHNEVATDYNAGFQSALAALVA 430
|
| >2xfg_A Endoglucanase 1; hydrolase-sugar binding protein complex, family-9 glycoside hydrolase, hydrolase, sugar binding protein; 1.68A {Clostridium thermocellum} Length = 466 | Back alignment and structure |
|---|
| >1tf4_A T. fusca endo/EXO-cellulase E4 catalytic domain and cellulose-binding domain; glycosyl hydrolase, cellulose degradation; 1.90A {Thermobifida fusca} SCOP: a.102.1.2 b.2.2.2 PDB: 1js4_A 3tf4_A* 4tf4_A* Length = 605 | Back alignment and structure |
|---|
| >1g87_A Endocellulase 9G; endoglucanase, cellulose binding domain, (ALPH 6-helix barrel, beta barrel, hydrolase; 1.60A {Clostridium cellulolyticum} SCOP: a.102.1.2 b.2.2.2 PDB: 1ga2_A* 1k72_A* 1kfg_A* Length = 614 | Back alignment and structure |
|---|
| >1ia6_A Cellulase CEL9M; cellullase, alpha barrel, hydrolase; 1.80A {Clostridium cellulolyticum} SCOP: a.102.1.2 PDB: 1ia7_A* Length = 441 | Back alignment and structure |
|---|
| >2yik_A Endoglucanase; hydrolase; 2.10A {Clostridium thermocellum} Length = 611 | Back alignment and structure |
|---|
| >3gzk_A Cellulase; fold from GH9 from CAZY database, glycosidase, hydrolase; 1.80A {Alicyclobacillus acidocaldarius subsp} PDB: 3ez8_A 3h2w_A* 3h3k_A* 3rx5_A* 3rx7_A* 3rx8_A* Length = 537 | Back alignment and structure |
|---|
| >1clc_A Endoglucanase CELD; EC: 3.2.1.4; cellulase, glycosyl hydrolase; 1.90A {Clostridium thermocellum} SCOP: a.102.1.2 b.1.18.2 Length = 639 | Back alignment and structure |
|---|
| >1ut9_A Cellulose 1,4-beta-cellobiosidase; hydrolase, glycoside hydrolase, family 9, cellobiohydrolase; 2.1A {Clostridium thermocellum} SCOP: a.102.1.2 b.1.18.2 PDB: 1rq5_A* Length = 609 | Back alignment and structure |
|---|
| >3h7l_A Endoglucanase; dehydrogenase, PSI-2, NYSGXRC, structural GEN protein structure initiative; 2.30A {Vibrio parahaemolyticus} Length = 586 | Back alignment and structure |
|---|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 505 | |||
| 2xfg_A | 466 | Endoglucanase 1; hydrolase-sugar binding protein c | 100.0 | |
| 1ks8_A | 433 | Endo-B-1,4-glucanase; cellulase, endoglucanase, te | 100.0 | |
| 1ia6_A | 441 | Cellulase CEL9M; cellullase, alpha barrel, hydrola | 100.0 | |
| 2yik_A | 611 | Endoglucanase; hydrolase; 2.10A {Clostridium therm | 100.0 | |
| 1tf4_A | 605 | T. fusca endo/EXO-cellulase E4 catalytic domain an | 100.0 | |
| 1g87_A | 614 | Endocellulase 9G; endoglucanase, cellulose binding | 100.0 | |
| 3gzk_A | 537 | Cellulase; fold from GH9 from CAZY database, glyco | 100.0 | |
| 1clc_A | 639 | Endoglucanase CELD; EC: 3.2.1.4; cellulase, glycos | 100.0 | |
| 1ut9_A | 609 | Cellulose 1,4-beta-cellobiosidase; hydrolase, glyc | 100.0 | |
| 3h7l_A | 586 | Endoglucanase; dehydrogenase, PSI-2, NYSGXRC, stru | 100.0 | |
| 2gz6_A | 388 | N-acetyl-D-glucosamine 2-epimerase; anabaena SP. C | 96.69 | |
| 2gz6_A | 388 | N-acetyl-D-glucosamine 2-epimerase; anabaena SP. C | 96.3 | |
| 3k7x_A | 349 | LIN0763 protein; Q92DQ0, LKR23, NESG, structural g | 96.02 | |
| 3gt5_A | 402 | N-acetylglucosamine 2-epimerase; structural genomi | 95.89 | |
| 1fp3_A | 402 | N-acyl-D-glucosamine 2-epimerase; alpha/alpha-barr | 94.53 | |
| 3gt5_A | 402 | N-acetylglucosamine 2-epimerase; structural genomi | 94.37 | |
| 3k7x_A | 349 | LIN0763 protein; Q92DQ0, LKR23, NESG, structural g | 94.04 | |
| 2ahf_A | 377 | Unsaturated glucuronyl hydrolase; alpha6/alpha6 ba | 92.64 | |
| 2zzr_A | 397 | Unsaturated glucuronyl hydrolase; alpha barrel; 1. | 92.36 | |
| 1nc5_A | 373 | Hypothetical protein YTER; structural genomics, he | 90.98 | |
| 1nc5_A | 373 | Hypothetical protein YTER; structural genomics, he | 90.44 | |
| 3k11_A | 445 | Putative glycosyl hydrolase; structural genomics, | 89.0 | |
| 3pmm_A | 382 | Putative cytoplasmic protein; structural genomics, | 87.94 | |
| 2zbl_A | 421 | Putative isomerase; N-acyl-D-glucosamine 2-epimera | 86.75 | |
| 3k11_A | 445 | Putative glycosyl hydrolase; structural genomics, | 86.66 | |
| 1fp3_A | 402 | N-acyl-D-glucosamine 2-epimerase; alpha/alpha-barr | 86.59 | |
| 2ahf_A | 377 | Unsaturated glucuronyl hydrolase; alpha6/alpha6 ba | 83.08 | |
| 3pmm_A | 382 | Putative cytoplasmic protein; structural genomics, | 80.26 |
| >2xfg_A Endoglucanase 1; hydrolase-sugar binding protein complex, family-9 glycoside hydrolase, hydrolase, sugar binding protein; 1.68A {Clostridium thermocellum} | Back alignment and structure |
|---|
Probab=100.00 E-value=1.7e-112 Score=905.26 Aligned_cols=435 Identities=38% Similarity=0.647 Sum_probs=386.8
Q ss_pred CCChhhHHHHHHHHHHhhHHhcCCCC-CCCCCCCCCCCCCCCCCCCcccccccceeccCCCcccccchHHHHHHHHHHHH
Q 010632 35 AFTFQDYSDALGKSILFFEGQRSGRL-PPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLSWSVI 113 (505)
Q Consensus 35 ~~~~~~Y~~~l~~sl~ff~~QR~G~l-~~~~~~~w~~~s~l~Dg~~~~~dlsGGW~DAGD~~Ky~~~~a~s~~~L~~a~~ 113 (505)
+.++++|.++|+++|+||++||||++ |+.++++||++||++||++.++||+|||||||||+||++|+++++++|+|+++
T Consensus 20 ~~~~~~Y~~~l~~sl~ff~~QRsG~~vp~~~~~~Wr~~shl~Dg~~~~~DlsGGwyDAGD~~Ky~~~~a~t~~~L~w~~~ 99 (466)
T 2xfg_A 20 ATGAFNYGEALQKAIFFYECQRSGKLDSSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFNLPMSYSAAMLGWAVY 99 (466)
T ss_dssp -CCSSCHHHHHHHHHHHHHHTBCBCCCTTTCSBSSCCCBCTTTTGGGTSCCCCSBCCSSSCCEEHHHHHHHHHHHHHHHH
T ss_pred ccchhHHHHHHHHHHHHHHHhcCCCCCcccccCCCCCCCcccccccccccCCCCceeCCCCCeeccccHHHHHHHHHHHH
Confidence 34567999999999999999999997 99999999999999999999999999999999999999999999999999999
Q ss_pred Hhhhcc--hhhHHHHHHHHHHHHHHHHHcccCCCCceEEEcCCCCCCCCCCCCCCCCCCCcceeecCCCCCchHHHHHHH
Q 010632 114 EFGSSM--QNHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDHHCWERPEDMDTPRNVYKVSTQNPGSDVAAETA 191 (505)
Q Consensus 114 ef~~~~--~~~~pdiLde~kwg~D~llkmq~~~~g~~y~~vg~~~~dh~~W~~Pe~~~~~R~~~~~~~~~p~s~~a~~~a 191 (505)
||++.| .+++||||||||||+|||||| |+++|+||+|||++..||.+|++||+++.+|++|.++.++|+|++++++|
T Consensus 100 e~~~~~~~~~~~~d~ldeikwg~D~llk~-~~~~~~~y~qVgd~~~Dh~~W~~pe~~~~~R~~y~i~~~~pgsd~a~e~A 178 (466)
T 2xfg_A 100 EYEDAFKQSGQYNHILNNIKWACDYFIKC-HPEKDVYYYQVGDGHADHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETS 178 (466)
T ss_dssp HCHHHHHHTTCHHHHHHHHHHHHHHHHHT-CSBTTEEEEEESCHHHHHTCCSCGGGCCSCCCEEEEESSSCCHHHHHHHH
T ss_pred HhHHHHhcCCCchHHHHHHHHHHHHHHHh-ccCCCcEEEEeCCCCccccccCCcccCCCCCceeEecCCCCccHHHHHHH
Confidence 999999 479999999999999999999 99999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHhhhcccCChHHHHHHHHHHHHHHHHHHHccCCCCCCCCCcccccccCCCCchhHHHHHHHHHHHHcCChhHHHH
Q 010632 192 AALAAASVVFKDSDPSYSTKLLKTAMKVFDFADKYRGSYSDSLNSVVCPFYCSYSGYLDELLWGASWLHRASQNSSYLAY 271 (505)
Q Consensus 192 AalA~as~vf~~~D~~~A~~~L~~A~~ay~~a~~~~~~y~~~~~~~~~g~Y~s~~~~~De~~wAA~eLy~aTgd~~Yl~~ 271 (505)
||||+||||||++||+||++||++||++|+||++||+.|... ..+++|+|++++.||++|||+|||++|||++|+++
T Consensus 179 AAlAaAS~vfk~~D~~yA~~~L~~Ak~l~~fA~~~~~~~~~~---~~~~~Y~s~s~~~DEl~WAAawLy~ATgd~~Yl~~ 255 (466)
T 2xfg_A 179 AALAIASIIFKKVDGEYSKECLKHAKELFEFADTTKSDDGYT---AANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDK 255 (466)
T ss_dssp HHHHHHHHHTTTTCHHHHHHHHHHHHHHHHHHHHHCCCTTCC---TTTTTSCCSSCSHHHHHHHHHHHHHHHCCHHHHHH
T ss_pred HHHHHHHHhccccCHHHHHHHHHHHHHHHHHHHhcCCcCCCC---ccccccCCCCCCchHHHHHHHHHHHHhCCHHHHHH
Confidence 999999999999999999999999999999999999988432 25689999889999999999999999999999999
Q ss_pred HHHcccccCCC------CCCCccCccchhhHHHHhhcccccccch-HHHHHHHHHHHHHHHHhCCCCCCCcccccCCccc
Q 010632 272 IQSNGHILGAD------DDDYSFSWDDKRAGTKVLLSKGFLEKNT-QEFQLYKAHSDNYICSLIPGSSSFQAQYTAGGLF 344 (505)
Q Consensus 272 ~~~~~~~~~~~------~~~~~~~Wd~~~~~~~~lla~~~~~~~~-~~~~~~~~~ad~~~~~~~~~~~~~~~~~t~~g~~ 344 (505)
++++...++.. ...+.|+||++..+++++|++. .+.. +.++.++..++.+++ +....++++||+|+.
T Consensus 256 a~~~~~~~~~~~~~~~~~~~~~~~Wd~~~~g~~~lla~~--~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~tp~Gl~ 329 (466)
T 2xfg_A 256 AESYSDKWGYEPQTNIPKYKWAQCWDDVTYGTYLLLARI--KNDNGKYKEAIERHLDWWTT----GYNGERITYTPKGLA 329 (466)
T ss_dssp HHHTTTTSCBSTTSSSBCCSSCCBTTBCHHHHHHHHHHH--HCSSCHHHHHHHHHHHHHTT----CBTTBCCCBCTTSCB
T ss_pred HHHHHHHhccccccccccccCcCCcchhHHHHHHHHHHh--cCCcHHHHHHHHHHHHHHHh----ccCCCccccCCcccc
Confidence 99976544321 2346799999999999988763 2222 445556666655543 333347899999999
Q ss_pred cccCCChHHHHHHHHHHHHHHHhHhhhCCCccccCCCccchHHHHHHHHhcccccccCCCCCcceEeeeCCCCCCCcccc
Q 010632 345 YKASESNLQYVTTTAFLLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYILGDNPAKMSYMVGFGERYPQHVHHR 424 (505)
Q Consensus 345 w~~~w~n~~~~~n~a~l~~~y~~~l~~~~~~~~~~~~~~~~~~y~~~a~~qldYiLG~Np~~~SyVtG~G~~~p~~pHHR 424 (505)
|...||+++|++|.+||+++|++++. |+ ..+.++|+++|++|||||||+| ++|||||||+|+|++||||
T Consensus 330 ~~~~Wgs~~~~~n~afl~~~y~~~~~-------~~--~~~~~~y~~~A~~qidYiLG~N--~~SyvvG~G~~~p~~pHHR 398 (466)
T 2xfg_A 330 WLDQWGSLRYATTTAFLACVYSDWEN-------GD--KEKAKTYLEFARSQADYALGST--GRSFVVGFGENPPKRPHHR 398 (466)
T ss_dssp CCCSSSHHHHHHHHHHHHHHHHTCTT-------SC--HHHHHHHHHHHHHHHHHHHTTT--SSCCBTTSSSSCCSCCSCH
T ss_pred ccCCCCchHHHHHHHHHHHHHHHhcC-------CC--cccHHHHHHHHHHhHHhhcCCC--CcceEeccCCCCCCCCcCc
Confidence 99999999999999999999987542 32 2346899999999999999999 9999999999999999999
Q ss_pred CCCCCCCCCCCCccccCCCcccccCCCCCCCCcccccccCCCCCCCcccCcCccccCcccccCchHHHHHHHHhhcccC
Q 010632 425 GSSLPSIHAHPDHIACNDGFQYLYSRSPNPNVLTGAILGGPDNRDNFADDRNNYQQSEPATYINAPFVGAVAFFSSKTT 503 (505)
Q Consensus 425 ~s~~~~~~~~p~~~~c~~g~~~~~~~~pn~~~l~GalvGGPn~~d~y~D~~~~y~~nEvaId~NA~lv~~la~l~~~~~ 503 (505)
+|+||... +.+.+.|||++|+|+||||||..|+|+|+|.+|++|||||||||+||++||+|...-+
T Consensus 399 ~ss~~~~~-------------~~~~p~p~~~vl~GaLVGGP~~~D~y~D~~~~y~~nEvaid~NA~~v~~la~l~~~~g 464 (466)
T 2xfg_A 399 TAHGSWAD-------------SQMEPPEHRHVLYGALVGGPDSTDNYTDDISNYTCNEVACDYNAGFVGLLAKMYKLYG 464 (466)
T ss_dssp HHHTCSSS-------------CTTSSSSCSSCCTTCBCCCCCTTSCCCCCTTCHHHHCCCHHHHHHHHHHHHHHHHHHT
T ss_pred hhccCccc-------------cccCCCCCCCCCccceeeCCCCccCcCcccccccccccchhhhHHHHHHHHHHHHHcC
Confidence 99986431 2356789999999999999999999999999999999999999999999999987643
|
| >1ks8_A Endo-B-1,4-glucanase; cellulase, endoglucanase, termite, glycosyl hydrolase, family 9, (alpha/alpha)6; 1.40A {Nasutitermes takasagoensis} SCOP: a.102.1.2 PDB: 1ksc_A 1ksd_A | Back alignment and structure |
|---|
| >1ia6_A Cellulase CEL9M; cellullase, alpha barrel, hydrolase; 1.80A {Clostridium cellulolyticum} SCOP: a.102.1.2 PDB: 1ia7_A* | Back alignment and structure |
|---|
| >2yik_A Endoglucanase; hydrolase; 2.10A {Clostridium thermocellum} | Back alignment and structure |
|---|
| >1tf4_A T. fusca endo/EXO-cellulase E4 catalytic domain and cellulose-binding domain; glycosyl hydrolase, cellulose degradation; 1.90A {Thermobifida fusca} SCOP: a.102.1.2 b.2.2.2 PDB: 1js4_A 3tf4_A* 4tf4_A* | Back alignment and structure |
|---|
| >1g87_A Endocellulase 9G; endoglucanase, cellulose binding domain, (ALPH 6-helix barrel, beta barrel, hydrolase; 1.60A {Clostridium cellulolyticum} SCOP: a.102.1.2 b.2.2.2 PDB: 1ga2_A* 1k72_A* 1kfg_A* | Back alignment and structure |
|---|
| >3gzk_A Cellulase; fold from GH9 from CAZY database, glycosidase, hydrolase; 1.80A {Alicyclobacillus acidocaldarius subsp} PDB: 3ez8_A 3h2w_A* 3h3k_A* 3rx5_A* 3rx7_A* 3rx8_A* | Back alignment and structure |
|---|
| >1clc_A Endoglucanase CELD; EC: 3.2.1.4; cellulase, glycosyl hydrolase; 1.90A {Clostridium thermocellum} SCOP: a.102.1.2 b.1.18.2 | Back alignment and structure |
|---|
| >1ut9_A Cellulose 1,4-beta-cellobiosidase; hydrolase, glycoside hydrolase, family 9, cellobiohydrolase; 2.1A {Clostridium thermocellum} SCOP: a.102.1.2 b.1.18.2 PDB: 1rq5_A* | Back alignment and structure |
|---|
| >3h7l_A Endoglucanase; dehydrogenase, PSI-2, NYSGXRC, structural GEN protein structure initiative; 2.30A {Vibrio parahaemolyticus} | Back alignment and structure |
|---|
| >2gz6_A N-acetyl-D-glucosamine 2-epimerase; anabaena SP. CH1, isomer; 2.00A {Anabaena SP} | Back alignment and structure |
|---|
| >2gz6_A N-acetyl-D-glucosamine 2-epimerase; anabaena SP. CH1, isomer; 2.00A {Anabaena SP} | Back alignment and structure |
|---|
| >3k7x_A LIN0763 protein; Q92DQ0, LKR23, NESG, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 1.89A {Listeria innocua} | Back alignment and structure |
|---|
| >3gt5_A N-acetylglucosamine 2-epimerase; structural genomics, isomerase, PSI-2; 1.70A {Xylella fastidiosa} | Back alignment and structure |
|---|
| >1fp3_A N-acyl-D-glucosamine 2-epimerase; alpha/alpha-barrel, isomer; 2.00A {Sus scrofa} SCOP: a.102.1.3 | Back alignment and structure |
|---|
| >3gt5_A N-acetylglucosamine 2-epimerase; structural genomics, isomerase, PSI-2; 1.70A {Xylella fastidiosa} | Back alignment and structure |
|---|
| >3k7x_A LIN0763 protein; Q92DQ0, LKR23, NESG, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 1.89A {Listeria innocua} | Back alignment and structure |
|---|
| >2ahf_A Unsaturated glucuronyl hydrolase; alpha6/alpha6 barrel, glycoside hydrolase family 88; 1.52A {Bacillus SP} PDB: 2ahg_A* 2fv0_A* 2fv1_A* 2d5j_A 1vd5_A 2fuz_A | Back alignment and structure |
|---|
| >2zzr_A Unsaturated glucuronyl hydrolase; alpha barrel; 1.75A {Streptococcus agalactiae} PDB: 3anj_A 3ank_A* 3ani_A* | Back alignment and structure |
|---|
| >1nc5_A Hypothetical protein YTER; structural genomics, helix barrel, PSI, protein structure initiative, midwest center for structural genomics, MCSG; 1.60A {Bacillus subtilis} SCOP: a.102.1.6 PDB: 2d8l_A* 2gh4_A* | Back alignment and structure |
|---|
| >1nc5_A Hypothetical protein YTER; structural genomics, helix barrel, PSI, protein structure initiative, midwest center for structural genomics, MCSG; 1.60A {Bacillus subtilis} SCOP: a.102.1.6 PDB: 2d8l_A* 2gh4_A* | Back alignment and structure |
|---|
| >3k11_A Putative glycosyl hydrolase; structural genomics, joint CENT structural genomics, JCSG, protein structure initiative; HET: MSE MES; 1.80A {Bacteroides thetaiotaomicron vpi-5482} | Back alignment and structure |
|---|
| >3pmm_A Putative cytoplasmic protein; structural genomics, PSI-biology, protein structure initiati midwest center for structural genomics; 1.90A {Klebsiella pneumoniae subsp} PDB: 3qwt_A | Back alignment and structure |
|---|
| >2zbl_A Putative isomerase; N-acyl-D-glucosamine 2-epimerase protein family; HET: BMA; 1.60A {Salmonella typhimurium} PDB: 2afa_A 2rgk_A* | Back alignment and structure |
|---|
| >3k11_A Putative glycosyl hydrolase; structural genomics, joint CENT structural genomics, JCSG, protein structure initiative; HET: MSE MES; 1.80A {Bacteroides thetaiotaomicron vpi-5482} | Back alignment and structure |
|---|
| >1fp3_A N-acyl-D-glucosamine 2-epimerase; alpha/alpha-barrel, isomer; 2.00A {Sus scrofa} SCOP: a.102.1.3 | Back alignment and structure |
|---|
| >2ahf_A Unsaturated glucuronyl hydrolase; alpha6/alpha6 barrel, glycoside hydrolase family 88; 1.52A {Bacillus SP} PDB: 2ahg_A* 2fv0_A* 2fv1_A* 2d5j_A 1vd5_A 2fuz_A | Back alignment and structure |
|---|
| >3pmm_A Putative cytoplasmic protein; structural genomics, PSI-biology, protein structure initiati midwest center for structural genomics; 1.90A {Klebsiella pneumoniae subsp} PDB: 3qwt_A | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 505 | ||||
| d1tf4a1 | 460 | a.102.1.2 (A:1-460) Endo/exocellulase:cellobiose E | 1e-174 | |
| d1g87a1 | 454 | a.102.1.2 (A:3-456) Endo/exocellulase:cellobiose E | 1e-164 | |
| d1ks8a_ | 433 | a.102.1.2 (A:) Endo-b-1,4-glucanase {Termite (Nasu | 1e-164 | |
| d1ia6a_ | 431 | a.102.1.2 (A:) Nonprocessive cellulase Cel9M {Clos | 1e-151 | |
| d1ut9a1 | 511 | a.102.1.2 (A:306-816) Cellulose 1,4-beta-cellobios | 1e-122 | |
| d1clca1 | 441 | a.102.1.2 (A:135-575) CelD cellulase, C-terminal d | 1e-121 |
| >d1tf4a1 a.102.1.2 (A:1-460) Endo/exocellulase:cellobiose E-4, N-terminal domain {Thermomonospora fusca [TaxId: 2021]} Length = 460 | Back information, alignment and structure |
|---|
class: All alpha proteins fold: alpha/alpha toroid superfamily: Six-hairpin glycosidases family: Cellulases catalytic domain domain: Endo/exocellulase:cellobiose E-4, N-terminal domain species: Thermomonospora fusca [TaxId: 2021]
Score = 496 bits (1279), Expect = e-174
Identities = 186/471 (39%), Positives = 265/471 (56%), Gaps = 41/471 (8%)
Query: 40 DYSDALGKSILFFEGQRSGRLPPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGL 99
+Y++AL KS+ F+E QRSG+LP N +++WRG+SGL+DG+ +DL GG+YDAGD+VKFG
Sbjct: 5 NYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGF 64
Query: 100 PMAFTTTLLSWSVIEFGSSMQ--NHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNM 157
PMAFT T+L+W IE + K +RW DY +KA +P LYVQVGD +
Sbjct: 65 PMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKA-HPSPNVLYVQVGDGDA 123
Query: 158 DHHCWERPEDMDTPRNVYKVSTQNPGSDVAAETAAALAAASVVFKDSDPSYSTKLLKTAM 217
DH W E M R +KV PGSDVAAETAAA+AA+S+VF D DP+Y+ L++ A
Sbjct: 124 DHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAK 183
Query: 218 KVFDFADKYRGSYSDSLNSVVCPFYCSYSGYLDELLWGASWLHRASQNSSYLAYIQSNGH 277
+++ FAD YRG YSD + + FY S+SGY DEL+WGA WL++A+ + SYLA +
Sbjct: 184 QLYTFADTYRGVYSDCVPA--GAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYD 241
Query: 278 ILGADDDD------YSFSWDDKRAGTKVLLSKGFLEKNTQEFQLYKAHSDNYICSLIPGS 331
L + ++ +WDDK GT VLL+K Q Y ++ ++ G
Sbjct: 242 FLSTEQQTDLRSYRWTIAWDDKSYGTYVLLAK------ETGKQKYIDDANRWLDYWTVGV 295
Query: 332 SSFQAQYTAGGLFYKASESNLQYVTTTAFLLLTYAKYLSSNGGVATCGSSTVKAENLIAL 391
+ + Y+ GG+ + L+Y TAF+ L YAK + V+ +
Sbjct: 296 NGQRVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVI----------DDPVRKQRYHDF 345
Query: 392 AKKQVDYILGDNPAKMSYMVGFGERYPQHVHHRGSSLPSIHAHPDHIACNDGFQYLYSRS 451
A +Q++Y LGDNP SY+VGFG P++ HHR + + S +
Sbjct: 346 AVRQINYALGDNPRNSSYVVGFGNNPPRNPHHRTAH-------------GSWTDSIASPA 392
Query: 452 PNPNVLTGAILGGPDNR-DNFADDRNNYQQSEPATYINAPFVGAVAFFSSK 501
N +VL GA++GGP + D + DDR +Y +E AT NA F A+A +
Sbjct: 393 ENRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEE 443
|
| >d1g87a1 a.102.1.2 (A:3-456) Endo/exocellulase:cellobiose E-4, N-terminal domain {Clostridium cellulolyticum, atcc 35319 [TaxId: 1521]} Length = 454 | Back information, alignment and structure |
|---|
| >d1ks8a_ a.102.1.2 (A:) Endo-b-1,4-glucanase {Termite (Nasutitermes takasagoensis) [TaxId: 62960]} Length = 433 | Back information, alignment and structure |
|---|
| >d1ia6a_ a.102.1.2 (A:) Nonprocessive cellulase Cel9M {Clostridium cellulolyticum [TaxId: 1521]} Length = 431 | Back information, alignment and structure |
|---|
| >d1ut9a1 a.102.1.2 (A:306-816) Cellulose 1,4-beta-cellobiosidase CbhA {Clostridium thermocellum [TaxId: 1515]} Length = 511 | Back information, alignment and structure |
|---|
| >d1clca1 a.102.1.2 (A:135-575) CelD cellulase, C-terminal domain {Clostridium thermocellum [TaxId: 1515]} Length = 441 | Back information, alignment and structure |
|---|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 505 | |||
| d1tf4a1 | 460 | Endo/exocellulase:cellobiose E-4, N-terminal domai | 100.0 | |
| d1ks8a_ | 433 | Endo-b-1,4-glucanase {Termite (Nasutitermes takasa | 100.0 | |
| d1g87a1 | 454 | Endo/exocellulase:cellobiose E-4, N-terminal domai | 100.0 | |
| d1ia6a_ | 431 | Nonprocessive cellulase Cel9M {Clostridium cellulo | 100.0 | |
| d1clca1 | 441 | CelD cellulase, C-terminal domain {Clostridium the | 100.0 | |
| d1ut9a1 | 511 | Cellulose 1,4-beta-cellobiosidase CbhA {Clostridiu | 100.0 | |
| d1fp3a_ | 402 | N-acyl-D-glucosamine 2-epimerase {Pig (Sus scrofa) | 92.54 | |
| d2afaa1 | 411 | Putative NAG isomerase YihS {Salmonella typhimuriu | 92.52 | |
| d1fp3a_ | 402 | N-acyl-D-glucosamine 2-epimerase {Pig (Sus scrofa) | 92.13 | |
| d2afaa1 | 411 | Putative NAG isomerase YihS {Salmonella typhimuriu | 91.1 | |
| d1w6ka1 | 448 | Lanosterol synthase {Human (Homo sapiens) [TaxId: | 84.92 | |
| d1nc5a_ | 363 | Hypothetical protein YteR {Bacillus subtilis [TaxI | 81.78 |
| >d1tf4a1 a.102.1.2 (A:1-460) Endo/exocellulase:cellobiose E-4, N-terminal domain {Thermomonospora fusca [TaxId: 2021]} | Back information, alignment and structure |
|---|
class: All alpha proteins fold: alpha/alpha toroid superfamily: Six-hairpin glycosidases family: Cellulases catalytic domain domain: Endo/exocellulase:cellobiose E-4, N-terminal domain species: Thermomonospora fusca [TaxId: 2021]
Probab=100.00 E-value=1.5e-105 Score=853.79 Aligned_cols=434 Identities=42% Similarity=0.738 Sum_probs=378.3
Q ss_pred hhhHHHHHHHHHHhhHHhcCCCCCCCCCCCCCCCCCCCCCCCcccccccceeccCCCcccccchHHHHHHHHHHHHHhhh
Q 010632 38 FQDYSDALGKSILFFEGQRSGRLPPNQQLTWRGNSGLSDGSSYHVDLVGGYYDAGDNVKFGLPMAFTTTLLSWSVIEFGS 117 (505)
Q Consensus 38 ~~~Y~~~l~~sl~ff~~QR~G~l~~~~~~~w~~~s~l~Dg~~~~~dlsGGW~DAGD~~Ky~~~~a~s~~~L~~a~~ef~~ 117 (505)
+.+|.++|+++|+||++||||.+|+.++++||++||+.||++.++||+|||||||||+||++|+++++++|+|++++||+
T Consensus 3 ~~~Y~~al~~sl~ff~~QRsG~~p~~~~~~wr~~s~~~Dg~~~~~DvsGGWyDAGD~~KY~~~~a~~~~~L~~a~~~~~~ 82 (460)
T d1tf4a1 3 AFNYAEALQKSMFFYEAQRSGKLPENNRVSWRGDSGLNDGADVGLDLTGGWYDAGDHVKFGFPMAFTATMLAWGAIESPE 82 (460)
T ss_dssp SSCHHHHHHHHHHHHHHTBCBSCCTTCCCTTCCCBCTTTTGGGTSCCCCSBCCSSSCCEEHHHHHHHHHHHHHHHHHCHH
T ss_pred CCCHHHHHHHHHHHHHHccCcCCCcccCCCCCccccCCCcccCcCCCCCCceeCCCcceecccHHHHHHHHHHHHHHChH
Confidence 34899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cc--hhhHHHHHHHHHHHHHHHHHcccCCCCceEEEcCCCCCCCCCCCCCCCCCCCcceeecCCCCCchHHHHHHHHHHH
Q 010632 118 SM--QNHLENAKAAIRWGTDYLLKASTATPGALYVQVGDPNMDHHCWERPEDMDTPRNVYKVSTQNPGSDVAAETAAALA 195 (505)
Q Consensus 118 ~~--~~~~pdiLde~kwg~D~llkmq~~~~g~~y~~vg~~~~dh~~W~~Pe~~~~~R~~~~~~~~~p~s~~a~~~aAalA 195 (505)
.| .|++||||||||||+|||+|| |+++|.||+|||++..||..|++|+.++.+|+.+.+...+|+|++++++|||||
T Consensus 83 ~~~~~~~~pdlLdE~~wglD~llkm-q~~~g~~y~qvgdg~~dh~~w~~~~~~~~~~~~~~~~~~~~~t~~~~~~aAalA 161 (460)
T d1tf4a1 83 GYIRSGQMPYLKDNLRWVNDYFIKA-HPSPNVLYVQVGDGDADHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMA 161 (460)
T ss_dssp HHHHTTCHHHHHHHHHHHHHHHHHT-CSBTTBEEEEESCHHHHHHCCSCGGGCCSCCCEEEEBTTBCCHHHHHHHHHHHH
T ss_pred hhhcCCCcHHHHHHHHHHHHHHHhc-ccCCCeEEEEecCCCccccccCCCcCCCCCCCcceecCCCccHHHHHHHHHHHH
Confidence 98 489999999999999999999 889999999999999999999999999889999988888899999999999999
Q ss_pred HHhhhcccCChHHHHHHHHHHHHHHHHHHHccCCCCCCCCCcccccccCCCCchhHHHHHHHHHHHHcCChhHHHHHHHc
Q 010632 196 AASVVFKDSDPSYSTKLLKTAMKVFDFADKYRGSYSDSLNSVVCPFYCSYSGYLDELLWGASWLHRASQNSSYLAYIQSN 275 (505)
Q Consensus 196 ~as~vf~~~D~~~A~~~L~~A~~ay~~a~~~~~~y~~~~~~~~~g~Y~s~~~~~De~~wAA~eLy~aTgd~~Yl~~~~~~ 275 (505)
+|||||+++||+||++||++||++|+||++||+.|.+..+. ...+|.+ +++.||++|||+|||++|||++|+++++..
T Consensus 162 ~as~v~~~~d~~~A~~~l~aA~~a~~~a~~~~~~~~~~~~~-~~~~~~~-~~~~De~~wAAaeLy~aTG~~~Y~~~a~~~ 239 (460)
T d1tf4a1 162 ASSIVFADDDPAYAATLVQHAKQLYTFADTYRGVYSDCVPA-GAFYNSW-SGYQDELVWGAYWLYKATGDDSYLAKAEYE 239 (460)
T ss_dssp HHHHHHTTTCHHHHHHHHHHHHHHHHHHHHSCCCGGGTSTT-HHHHCCS-SCSHHHHHHHHHHHHHHHCCHHHHHHHHHH
T ss_pred HHHHhccccCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC-CCcCCCc-cccchHHHHHHHHHHHHhCCHHHHHHHHHH
Confidence 99999999999999999999999999999999998776432 3345666 889999999999999999999999999987
Q ss_pred ccccCC------CCCCCccCccchhhHHHHhhcccccccchHHHHHHHHHHHHHHHHhCCCCCCCcccccCCccccccCC
Q 010632 276 GHILGA------DDDDYSFSWDDKRAGTKVLLSKGFLEKNTQEFQLYKAHSDNYICSLIPGSSSFQAQYTAGGLFYKASE 349 (505)
Q Consensus 276 ~~~~~~------~~~~~~~~Wd~~~~~~~~lla~~~~~~~~~~~~~~~~~ad~~~~~~~~~~~~~~~~~t~~g~~w~~~w 349 (505)
...+.. ....+.++|+++..+..+++++.. . +..+++.++.+++.+........++++++++.|...|
T Consensus 240 ~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~la~~~--~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W 313 (460)
T d1tf4a1 240 YDFLSTEQQTDLRSYRWTIAWDDKSYGTYVLLAKET--G----KQKYIDDANRWLDYWTVGVNGQRVPYSPGGMAVLDTW 313 (460)
T ss_dssp GGGSCBCTTSSCBCCSSCCSSSCCHHHHHHHHHHHH--C----CHHHHHHHHHHHHHTTTCBTTBCCCBCTTSCBCCCSS
T ss_pred hhhcccccccccccccccccccchhHHHHHHHHHHh--h----hhHHHHHHHHHHHHHhhhcccccCCcCCCcceecCCC
Confidence 543321 123456899999888877775421 1 2344555666665554443334667789999888899
Q ss_pred ChHHHHHHHHHHHHHHHhHhhhCCCccccCCCccchHHHHHHHHhcccccccCCCCCcceEeeeCCCCCCCccccCCCCC
Q 010632 350 SNLQYVTTTAFLLLTYAKYLSSNGGVATCGSSTVKAENLIALAKKQVDYILGDNPAKMSYMVGFGERYPQHVHHRGSSLP 429 (505)
Q Consensus 350 ~n~~~~~n~a~l~~~y~~~l~~~~~~~~~~~~~~~~~~y~~~a~~qldYiLG~Np~~~SyVtG~G~~~p~~pHHR~s~~~ 429 (505)
|++++++|.+++++++.+.+.+ ..++++|+++|++|||||||+||+++|||||+|+|+|++||||.++|+
T Consensus 314 Gsn~~~~n~a~~~~~a~~~~~~----------~~~~~~y~~~A~~qldylLG~Np~~~SyVtG~G~~~p~~pHHR~s~~~ 383 (460)
T d1tf4a1 314 GALRYAANTAFVALVYAKVIDD----------PVRKQRYHDFAVRQINYALGDNPRNSSYVVGFGNNPPRNPHHRTAHGS 383 (460)
T ss_dssp SHHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHHHHTCSTTCCCSBTTSSSSCCSCCSCHHHHTC
T ss_pred chHHHHHHHHHHHHHHHHHhcc----------cCCHHHHHHHHHHhhheecccCCCCcCCCccCCCCCCCCCcCccccCC
Confidence 9999999999999888765421 234578999999999999999999999999999999999999998875
Q ss_pred CCCCCCCccccCCCcccccCCCCCCCCcccccccCCC-CCCCcccCcCccccCcccccCchHHHHHHHHhhcccC
Q 010632 430 SIHAHPDHIACNDGFQYLYSRSPNPNVLTGAILGGPD-NRDNFADDRNNYQQSEPATYINAPFVGAVAFFSSKTT 503 (505)
Q Consensus 430 ~~~~~p~~~~c~~g~~~~~~~~pn~~~l~GalvGGPn-~~d~y~D~~~~y~~nEvaId~NA~lv~~la~l~~~~~ 503 (505)
.. ++...+.|||++|+|+|||||| ..+||+|++++|++|||||||||+||++||+|+++.+
T Consensus 384 ~~-------------~~~~~~~p~~~~l~G~lvGGPn~~~~~y~D~~~~y~~nEvai~~NA~lv~~la~l~~~~~ 445 (460)
T d1tf4a1 384 WT-------------DSIASPAENRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFSSALAMLVEEYG 445 (460)
T ss_dssp SS-------------SCTTSSSSCSSCCTTCBCCCCSSTTCCCCCCTTCTTTTCCCGGGGHHHHHHHHHHHHHHC
T ss_pred Cc-------------cccCCCCCCCCCcceeEecCCCCCCCCcCcCcchhhhhhhHHHHHHHHHHHHHHHHHhcC
Confidence 42 1345678999999999999999 5799999999999999999999999999999988744
|
| >d1ks8a_ a.102.1.2 (A:) Endo-b-1,4-glucanase {Termite (Nasutitermes takasagoensis) [TaxId: 62960]} | Back information, alignment and structure |
|---|
| >d1g87a1 a.102.1.2 (A:3-456) Endo/exocellulase:cellobiose E-4, N-terminal domain {Clostridium cellulolyticum, atcc 35319 [TaxId: 1521]} | Back information, alignment and structure |
|---|
| >d1ia6a_ a.102.1.2 (A:) Nonprocessive cellulase Cel9M {Clostridium cellulolyticum [TaxId: 1521]} | Back information, alignment and structure |
|---|
| >d1clca1 a.102.1.2 (A:135-575) CelD cellulase, C-terminal domain {Clostridium thermocellum [TaxId: 1515]} | Back information, alignment and structure |
|---|
| >d1ut9a1 a.102.1.2 (A:306-816) Cellulose 1,4-beta-cellobiosidase CbhA {Clostridium thermocellum [TaxId: 1515]} | Back information, alignment and structure |
|---|
| >d1fp3a_ a.102.1.3 (A:) N-acyl-D-glucosamine 2-epimerase {Pig (Sus scrofa) [TaxId: 9823]} | Back information, alignment and structure |
|---|
| >d2afaa1 a.102.1.3 (A:4-415) Putative NAG isomerase YihS {Salmonella typhimurium [TaxId: 90371]} | Back information, alignment and structure |
|---|
| >d1fp3a_ a.102.1.3 (A:) N-acyl-D-glucosamine 2-epimerase {Pig (Sus scrofa) [TaxId: 9823]} | Back information, alignment and structure |
|---|
| >d2afaa1 a.102.1.3 (A:4-415) Putative NAG isomerase YihS {Salmonella typhimurium [TaxId: 90371]} | Back information, alignment and structure |
|---|
| >d1w6ka1 a.102.4.2 (A:6-99,A:379-732) Lanosterol synthase {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
| >d1nc5a_ a.102.1.6 (A:) Hypothetical protein YteR {Bacillus subtilis [TaxId: 1423]} | Back information, alignment and structure |
|---|