RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy10824
(185 letters)
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 139 bits (352), Expect = 2e-41
Identities = 43/117 (36%), Positives = 63/117 (53%), Gaps = 4/117 (3%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGS WA A+SDR+CI T + +
Sbjct: 1 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEV 60
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCNC 185
S++ LLTCC + GD C GG P AW + G+ +GG Y S C+ + C
Sbjct: 61 SAEDLLTCCGSM-CGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEA 116
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 138 bits (350), Expect = 1e-40
Identities = 44/117 (37%), Positives = 64/117 (54%), Gaps = 4/117 (3%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + +LP FD R+Q+P C I ++ Q +CGSCWA A+SDR+CI T + +
Sbjct: 58 FTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEV 117
Query: 132 SSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS---CQRFDRGNCNC 185
S++ LLTCC + GD C GG P AW + G+ +GG Y S C+ + C
Sbjct: 118 SAEDLLTCCGSM-CGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH 173
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 133 bits (336), Expect = 4e-39
Identities = 42/98 (42%), Positives = 59/98 (60%), Gaps = 2/98 (2%)
Query: 77 ELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHL 136
E+P FD RK++P C +I ++ QS CGSCWA A+SDR CI + G+ + LS+ L
Sbjct: 2 EIPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDL 61
Query: 137 LTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDYGS 174
L+CC +C G CEGG AW Y ++ G+ TG +
Sbjct: 62 LSCCESCGLG--CEGGILGPAWDYWVKEGIVTGSSKEN 97
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 134 bits (338), Expect = 8e-39
Identities = 43/117 (36%), Positives = 60/117 (51%), Gaps = 3/117 (2%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAAL 115
++ N L F + ++ LP FD + +PNC I + QS CGSCWA+A +A+
Sbjct: 50 KKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAM 109
Query: 116 SDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY 172
SDR C G D +S+ LL CC+ C G C GG+P RAW Y G+ +
Sbjct: 110 SDRFCTMG-GVQDVHISAGDLLACCSDCGDG--CNGGDPDRAWAYFSSTGLVSDYCQ 163
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 110 bits (277), Expect = 3e-30
Identities = 37/125 (29%), Positives = 58/125 (46%), Gaps = 14/125 (11%)
Query: 71 DYQSNTELPEEFDLRKQ--YPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRL- 127
+Y S +LP+ +D R + + + CGSCWA A+T+A++DR+ I +G
Sbjct: 29 EYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWP 88
Query: 128 DHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY------GSCQRF-DR 180
LS +++ C G CEGGN + W Y ++G+P C +F
Sbjct: 89 STLLSVQNVIDCG--NAGS--CEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQC 144
Query: 181 GNCNC 185
G CN
Sbjct: 145 GTCNE 149
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 102 bits (256), Expect = 4e-26
Identities = 29/128 (22%), Positives = 47/128 (36%), Gaps = 11/128 (8%)
Query: 56 QEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAAL 115
P + Q LP +D R + + V+ Q++CGSC++ A+ L
Sbjct: 185 SRKIPRPKPAPLTAEIQQKILFLPTSWDWRNVHG-INFVSPVRNQASCGSCYSFASMGML 243
Query: 116 SDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPM-RAWYYMLENGVPTGGDY-- 172
R+ I T LS +++C CEGG P A Y + G+ +
Sbjct: 244 EARIRILTNNSQTPILSPQEVVSC---SQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPY 300
Query: 173 ----GSCQ 176
C+
Sbjct: 301 TGTDSPCK 308
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 81.8 bits (202), Expect = 3e-19
Identities = 20/103 (19%), Positives = 32/103 (31%), Gaps = 9/103 (8%)
Query: 71 DYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+ LP + DL + V Q GSC A A AA+ Q +
Sbjct: 50 EKSVIAALPPKVDLTPPFQ-------VYDQGRIGSCTANALAAAIQFERIHDKQS-PEFI 101
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMR-AWYYMLENGVPTGGDY 172
S + G + G +R + + GV ++
Sbjct: 102 PSRLFIYYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEW 144
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 65.4 bits (160), Expect = 4e-13
Identities = 32/131 (24%), Positives = 52/131 (39%), Gaps = 15/131 (11%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSD 117
+ S L D R + V+ Q CGS W+ +TT A+
Sbjct: 96 AQKPKHPENLRMPYVSSKKPLAASVDWRSNA-----VSEVKDQGQCGSSWSFSTTGAVEG 150
Query: 118 RMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY----- 172
++ + GRL +LS +L+ C ++ G C+GG A+ Y+ + G+ + Y
Sbjct: 151 QLALQR-GRL-TSLSEQNLIDCSSSY-GNAGCDGGWMDSAFSYIHDYGIMSESAYPYEAQ 207
Query: 173 -GSCQRFDRGN 182
C RFD
Sbjct: 208 GDYC-RFDSSQ 217
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 64.2 bits (157), Expect = 9e-13
Identities = 36/144 (25%), Positives = 58/144 (40%), Gaps = 21/144 (14%)
Query: 49 LSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGS 105
L + + + + ++ P FD R Q G V+ Q +CGS
Sbjct: 87 HGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQ-------GMVSPVKNQGSCGS 139
Query: 106 CWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN- 164
WA ++T A+ +M IA D ++S L+ C G C GG A+ Y+ +N
Sbjct: 140 SWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVPNALG---CSGGWMNDAFTYVAQNG 196
Query: 165 GVPTGGDY------GSCQRFDRGN 182
G+ + G Y G+C +D
Sbjct: 197 GIDSEGAYPYEMADGNC-HYDPNQ 219
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 63.4 bits (155), Expect = 2e-12
Identities = 33/135 (24%), Positives = 54/135 (40%), Gaps = 23/135 (17%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAA 114
++ + P+ D RK+ G V+ Q CGSCWA ++ A
Sbjct: 80 VPLSHSRSNDTLYIPEWEGRAPDSVDYRKK-------GYVTPVKNQGQCGSCWAFSSVGA 132
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY- 172
L ++ T G+L LS +L+ C + G C GG A+ Y+ +N G+ + Y
Sbjct: 133 LEGQLKKKT-GKL-LNLSPQNLVDCVSENDG---CGGGYMTNAFQYVQKNRGIDSEDAYP 187
Query: 173 -----GSCQRFDRGN 182
SC ++
Sbjct: 188 YVGQEESC-MYNPTG 201
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 63.4 bits (155), Expect = 2e-12
Identities = 36/134 (26%), Positives = 54/134 (40%), Gaps = 22/134 (16%)
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAA 114
+ ++ + + LPE D RK+ G V+ Q +CGSCWA + A
Sbjct: 87 IDATIEQSYDEEFINEDIVNLPENVDWRKK-------GAVTPVRHQGSCGSCWAFSAVAT 139
Query: 115 LSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY-- 172
+ I T G+L LS L+ C C+GG P A Y+ +NG+ Y
Sbjct: 140 VEGINKIRT-GKLV-ELSEQELVDC---ERRSHGCKGGYPPYALEYVAKNGIHLRSKYPY 194
Query: 173 ----GSCQRFDRGN 182
G+C R +
Sbjct: 195 KAKQGTC-RAKQVG 207
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 63.4 bits (155), Expect = 2e-12
Identities = 32/121 (26%), Positives = 55/121 (45%), Gaps = 20/121 (16%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
N LP+ D R++ G V+ Q +CG+ WA + AL ++ + T G+L
Sbjct: 93 SNPNRILPDSVDWREK-------GCVTEVKYQGSCGAAWAFSAVGALEAQLKLKT-GKL- 143
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRG 181
+LS+ +L+ C G C GG A+ Y+++N G+ + Y C ++D
Sbjct: 144 VSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSK 202
Query: 182 N 182
Sbjct: 203 Y 203
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 61.4 bits (150), Expect = 4e-12
Identities = 28/113 (24%), Positives = 38/113 (33%), Gaps = 22/113 (19%)
Query: 74 SNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
N P E DLR+ +++Q CGS WA + AA +
Sbjct: 6 INGNAPAEIDLRQM-------RTVTPIRMQGGCGSAWAFSGVAATESAYLAYR-QQS-LD 56
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY------GSCQR 177
L+ L+ C C G R Y+ NGV Y SC+R
Sbjct: 57 LAEQELVD----CASQHGCHGDTIPRGIEYIQHNGVVQESYYRYVAREQSCRR 105
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 62.3 bits (152), Expect = 4e-12
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 21/121 (17%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
E P D R++ G V+ Q CGSCWA + T AL +M T GRL
Sbjct: 91 EPLFYEAPRSVDWREK-------GYVTPVKNQGQCGSCWAFSATGALEGQMFRKT-GRL- 141
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRG 181
+LS +L+ C G + C GG A+ Y+ +N G+ + Y SC +++
Sbjct: 142 ISLSEQNLVDCSGPQ-GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPK 199
Query: 182 N 182
Sbjct: 200 Y 200
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 60.7 bits (148), Expect = 1e-11
Identities = 31/138 (22%), Positives = 43/138 (31%), Gaps = 23/138 (16%)
Query: 49 LSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGS 105
L + L +E N P E DLR+ +++Q CGS
Sbjct: 62 LMSAEAFEHLKTQFDLNAETN-ACSINGNAPAEIDLRQM-------RTVTPIRMQGGCGS 113
Query: 106 CWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENG 165
WA + AA + L+ L+ C C G R Y+ NG
Sbjct: 114 AWAFSGVAATESAYLAYR-DQS-LDLAEQELVD----CASQHGCHGDTIPRGIEYIQHNG 167
Query: 166 VPTGGDY------GSCQR 177
V Y SC+R
Sbjct: 168 VVQESYYRYVAREQSCRR 185
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 59.2 bits (144), Expect = 4e-11
Identities = 30/120 (25%), Positives = 49/120 (40%), Gaps = 20/120 (16%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
+N +P++ D R+ G V+ Q NCGS WA +TT + +
Sbjct: 86 EANNRAVPDKIDWRES-------GYVTEVKDQGNCGSGWAFSTTGTMEGQYMKNE-RTS- 136
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY------GSCQRFDRGN 182
+ S L+ C G + C GG A+ Y+ + G+ T Y G C R+++
Sbjct: 137 ISFSEQQLVDCSRPW-GNNGCGGGLMENAYQYLKQFGLETESSYPYTAVEGQC-RYNKQL 194
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 58.1 bits (141), Expect = 1e-10
Identities = 20/103 (19%), Positives = 37/103 (35%), Gaps = 9/103 (8%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTL 131
+ + D C + V+ Q NC + W A+ L C+ +
Sbjct: 4 FCNKEYCNRLKDENN----CISNLQVEDQGNCDTSWIFASKYHLETIRCMKG-YEP-TKI 57
Query: 132 SSDHLLTCCAACTGGDVCEGG-NPMRAWYYMLENG-VPTGGDY 172
S+ ++ C D C+ G +PM + + G +P +Y
Sbjct: 58 SALYVANCYKGE-HKDRCDEGSSPMEFLQIIEDYGFLPAESNY 99
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 56.4 bits (137), Expect = 3e-10
Identities = 32/115 (27%), Positives = 55/115 (47%), Gaps = 20/115 (17%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
LP+ D R++ G V+ Q +CG+CWA + AL ++ + T G+L +LS+
Sbjct: 2 LPDSVDWREK-------GCVTEVKYQGSCGACWAFSAVGALEAQLKLKT-GKLV-SLSAQ 52
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
+L+ C G C GG A+ Y+++N G+ + Y C ++D
Sbjct: 53 NLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSKY 106
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 55.9 bits (136), Expect = 4e-10
Identities = 37/114 (32%), Positives = 50/114 (43%), Gaps = 22/114 (19%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
LPE D RK+ G V+ Q +CGSCWA + A + I T G+L LS
Sbjct: 1 LPENVDWRKK-------GAVTPVRHQGSCGSCWAFSAVATVEGINKIRT-GKL-VELSEQ 51
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY------GSCQRFDRGN 182
L+ C G C+GG P A Y+ +NG+ Y G+C R +
Sbjct: 52 ELVDCERRSHG---CKGGYPPYALEYVAKNGIHLRSKYPYKAKQGTC-RAKQVG 101
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 56.0 bits (136), Expect = 4e-10
Identities = 29/89 (32%), Positives = 37/89 (41%), Gaps = 11/89 (12%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q CGSCWA +T A + I T G L LS L+ C G C+GG +
Sbjct: 19 QGACGSCWAFSTIATVEGINKIVT-GNL-LELSEQELVDCDKHSYG---CKGGYQTTSLQ 73
Query: 160 YMLENGVPTGGDY------GSCQRFDRGN 182
Y+ NGV T Y C+ D+
Sbjct: 74 YVANNGVHTSKVYPYQAKQYKCRATDKPG 102
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 55.2 bits (134), Expect = 6e-10
Identities = 31/89 (34%), Positives = 44/89 (49%), Gaps = 11/89 (12%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q+ CGSCWA +T A + I T G+L +LS LL C G C+GG +
Sbjct: 19 QNPCGSCWAFSTVATIEGINKIIT-GQL-ISLSEQELLDCERRSHG---CDGGYQTTSLQ 73
Query: 160 YMLENGVPTGGDY------GSCQRFDRGN 182
Y+++NGV T +Y G C+ D+
Sbjct: 74 YVVDNGVHTEREYPYEKKQGRCRAKDKKG 102
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 54.8 bits (133), Expect = 8e-10
Identities = 30/114 (26%), Positives = 45/114 (39%), Gaps = 21/114 (18%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+PE D R++ G V+ Q +CGSCWA + + + I T G L + S
Sbjct: 1 IPEYVDWRQK-------GAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRT-GNL-NQYSEQ 51
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY------GSCQRFDRGN 182
LL C C GG P A + + G+ Y C+ ++G
Sbjct: 52 ELLDC---DRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGP 102
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 54.8 bits (133), Expect = 8e-10
Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 13/90 (14%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q CGSCWA ++ AL ++ T G+L LS +L+ C + G C GG A+
Sbjct: 19 QGQCGSCWAFSSVGALEGQLKKKT-GKLL-NLSPQNLVDCVSENDG---CGGGYMTNAFQ 73
Query: 160 YMLEN-GVPTGGDY------GSCQRFDRGN 182
Y+ +N G+ + Y SC ++
Sbjct: 74 YVQKNRGIDSEDAYPYVGQEESC-MYNPTG 102
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 54.0 bits (131), Expect = 2e-09
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 16/112 (14%)
Query: 79 PEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLT 138
P D RK+ + V+ Q +CGSCW +TT AL + IAT G++ +L+ L+
Sbjct: 2 PPSMDWRKKGNFVS---PVKNQGSCGSCWTFSTTGALESAVAIAT-GKM-LSLAEQQLVD 56
Query: 139 CCAACTGGDV-CEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
C A + C+GG P +A+ Y+ N G+ Y C +F
Sbjct: 57 C--AQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDHC-KFQPDK 105
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 53.8 bits (130), Expect = 2e-09
Identities = 28/122 (22%), Positives = 43/122 (35%), Gaps = 24/122 (19%)
Query: 72 YQSNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLD 128
+D R G V+ Q+ CGSCWA ++ ++ + I L
Sbjct: 14 PADAKLDRIAYDWRLH-------GGVTPVKDQALCGSCWAFSSVGSVESQYAIRK-KALF 65
Query: 129 HTLSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY-------GSCQRFDR 180
S L+ C G C GG A+ M++ G+ + DY +C R
Sbjct: 66 -LFSEQELVDCSVKNNG---CYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPETC-NLKR 120
Query: 181 GN 182
N
Sbjct: 121 CN 122
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 53.7 bits (130), Expect = 2e-09
Identities = 31/119 (26%), Positives = 44/119 (36%), Gaps = 22/119 (18%)
Query: 74 SNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+D R V+ Q NCGSCWA ++ ++ + I +L T
Sbjct: 14 EENFDHAAYDWRLH-------SGVTPVKDQKNCGSCWAFSSIGSVESQYAIRK-NKLI-T 64
Query: 131 LSSDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
LS L+ C G C GG A+ M+E G+ GDY + DR
Sbjct: 65 LSEQELVDCSFKNYG---CNGGLINNAFEDMIELGGICPDGDYPYVSDAPNLCNIDRCT 120
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 53.7 bits (130), Expect = 3e-09
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 23/120 (19%)
Query: 74 SNTELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHT 130
+ELP D R + G V+ Q +CGSCWA +TT AL C T G+L +
Sbjct: 3 LPSELPAGVDWRSR-------GCVTPVKDQRDCGSCWAFSTTGALEGAHCAKT-GKL-VS 53
Query: 131 LSSDHLLTCCAACTGGDV-CEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
LS L+ C + G+ C GG A+ Y+L++ G+ + Y C R
Sbjct: 54 LSEQELMDC--SRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDEEC-RAQSCE 110
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 53.3 bits (129), Expect = 4e-09
Identities = 35/116 (30%), Positives = 49/116 (42%), Gaps = 23/116 (19%)
Query: 77 ELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+LP+ D R+ G V+ Q CGSCWA +T AA+ I T G L +LS
Sbjct: 2 DLPDSIDWREN-------GAVVPVKNQGGCGSCWAFSTVAAVEGINQIVT-GDL-ISLSE 52
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
L+ C A G C GG A+ +++ N G+ + Y G C
Sbjct: 53 QQLVDCTTANHG---CRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC-NSTVNA 104
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 52.8 bits (128), Expect = 5e-09
Identities = 33/114 (28%), Positives = 47/114 (41%), Gaps = 22/114 (19%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+P D R++ G V+ Q CGSCW ++ AA+ I T G+L +LS
Sbjct: 1 IPTSIDWRQK-------GAVTPVRNQGGCGSCWTFSSVAAVEGINKIVT-GQL-LSLSEQ 51
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLENGVPTGGDY------GSCQRFDRGN 182
LL C G C GG P+ A Y+ +G+ Y C R +
Sbjct: 52 ELLDCERRSYG---CRGGFPLYALQYVANSGIHLRQYYPYEGVQRQC-RASQAK 101
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 52.7 bits (127), Expect = 6e-09
Identities = 37/120 (30%), Positives = 52/120 (43%), Gaps = 22/120 (18%)
Query: 76 TELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
++LP D R++ G V+ Q CGSCWA +T ++ I T G L +LS
Sbjct: 2 SDLPPSVDWRQK-------GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRT-GSL-VSLS 52
Query: 133 SDHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGNCNC 185
L+ C A G C+GG A+ Y+ N G+ T Y G+C R N
Sbjct: 53 EQELIDCDTADNDG--CQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTC-NVARAAQNS 109
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 52.1 bits (126), Expect = 8e-09
Identities = 30/90 (33%), Positives = 44/90 (48%), Gaps = 11/90 (12%)
Query: 100 QSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWY 159
Q CGSCWA + T AL +M T GRL +LS +L+ C G + C GG A+
Sbjct: 19 QGQCGSCWAFSATGALEGQMFRKT-GRLI-SLSEQNLVDCSGPQ-GNEGCNGGLMDYAFQ 75
Query: 160 YMLEN-GVPTGGDY------GSCQRFDRGN 182
Y+ +N G+ + Y SC +++
Sbjct: 76 YVQDNGGLDSEESYPYEATEESC-KYNPKY 104
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 51.9 bits (125), Expect = 9e-09
Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 22/109 (20%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
LPE+ D RK+ G V+ Q +CGSCWA +T + + I T G L +LS
Sbjct: 1 LPEQIDWRKK-------GAVTPVKNQGSCGSCWAFSTVSTVESINQIRT-GNL-ISLSEQ 51
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQ 176
L+ C G C GG + A+ Y++ N G+ T +Y G CQ
Sbjct: 52 ELVDCDKKNHG---CLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQ 97
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 52.2 bits (126), Expect = 1e-08
Identities = 33/115 (28%), Positives = 49/115 (42%), Gaps = 22/115 (19%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
+P D RK+ G V+ Q CGSCWA +T A+ I T +L +LS
Sbjct: 2 VPASVDWRKK-------GAVTSVKDQGQCGSCWAFSTIVAVEGINQIKT-NKL-VSLSEQ 52
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
L+ C G C GG A+ ++ + G+ T +Y G+C + N
Sbjct: 53 ELVDCDTDQNQG--CNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTC-DVSKEN 104
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 51.7 bits (125), Expect = 1e-08
Identities = 37/115 (32%), Positives = 51/115 (44%), Gaps = 23/115 (20%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
LP D R + G ++ Q CGSCWA + AA+ I T G+L +LS
Sbjct: 1 LPSFVDWRSK-------GAVNSIKNQKQCGSCWAFSAVAAVESINKIRT-GQL-ISLSEQ 51
Query: 135 HLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
L+ C A G C GG A+ Y++ N G+ T +Y GSC + R
Sbjct: 52 ELVDCDTASHG---CNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSC-KPYRLR 102
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 51.7 bits (125), Expect = 1e-08
Identities = 33/114 (28%), Positives = 45/114 (39%), Gaps = 23/114 (20%)
Query: 79 PEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
P E+D R + G V+ Q CGSCWA + T + + + G L +LS
Sbjct: 2 PPEWDWRSK-------GAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQ-GTL-LSLSEQE 52
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
LL C C GG P A+ + G+ T DY SC +F
Sbjct: 53 LLDCDKMDKA---CMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSC-QFSAEK 102
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 51.4 bits (124), Expect = 2e-08
Identities = 31/116 (26%), Positives = 51/116 (43%), Gaps = 23/116 (19%)
Query: 77 ELPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSS 133
+ PE +D K+ G V+ Q CGS WA + T A+ IAT G L +LS
Sbjct: 1 DAPESWDWSKK-------GVITKVKFQGQCGSGWAFSATGAIEAAHAIAT-GNLV-SLSE 51
Query: 134 DHLLTCCAACTGGDVCEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
L+ C G C G +++ +++++ G+ + DY G C + +
Sbjct: 52 QELIDCVDESEG---CYNGWHYQSFEWVVKHGGIASEADYPYKARDGKC-KANEIQ 103
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 49.0 bits (118), Expect = 1e-07
Identities = 34/116 (29%), Positives = 48/116 (41%), Gaps = 23/116 (19%)
Query: 78 LPEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSD 134
LP+ D R G ++ Q CGS WA +T AA+ IAT G L +LS
Sbjct: 1 LPDYVDWRSS-------GAVVDIKDQGQCGSAWAFSTIAAVEGINKIAT-GDL-ISLSEQ 51
Query: 135 HLLTCCAACTGGDV-CEGGNPMRAWYYMLEN-GVPTGGDY------GSCQRFDRGN 182
L+ C T C+GG + +++ N G+ T +Y G C D
Sbjct: 52 ELVDC--GRTQNTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQC-NLDLQQ 104
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 45.2 bits (108), Expect = 2e-06
Identities = 27/118 (22%), Positives = 41/118 (34%), Gaps = 26/118 (22%)
Query: 79 PEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
P D R + G V+ Q CGSCWA + + + +A L LS
Sbjct: 2 PAAVDWRAR-------GAVTAVKDQGQCGSCWAFSAIGNVECQWFLAG-HPLT-NLSEQM 52
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLEN---GVPTGGDY------GSCQ--RFDRGN 182
L++C +G C GG A+ ++++ V T Y G
Sbjct: 53 LVSCDKTDSG---CSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHT 107
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 43.3 bits (102), Expect = 1e-05
Identities = 31/108 (28%), Positives = 44/108 (40%), Gaps = 22/108 (20%)
Query: 79 PEEFDLRKQYPNCTNIG---HVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLSSDH 135
P D RK+ G V+ Q CG CWA T A+ I T GRL ++S
Sbjct: 2 PASIDWRKK-------GAVTSVKDQGACGMCWAFGATGAIEGIDAITT-GRLI-SVSEQQ 52
Query: 136 LLTCCAACTGGDVCEGGNPMRAWYYMLENG-------VPTGGDYGSCQ 176
++ C T GG+ A+ +++ NG P G G+C
Sbjct: 53 IVDC---DTXXXXXXGGDADDAFRWVITNGGIASDANYPYTGVDGTCD 97
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 37.1 bits (85), Expect = 0.003
Identities = 28/142 (19%), Positives = 48/142 (33%), Gaps = 31/142 (21%)
Query: 45 LKFGLSLTPQSQ-EP--------NPDLQLGSEHFGDYQSNTELPEEFDLR----KQYPNC 91
L +P++ E +P+ S+H + + + R K Y NC
Sbjct: 187 LNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENC 246
Query: 92 TNI-GHVQ----LQSNCGSCWAIATT--AALSDRMCIATQGR--LDH---TLSSDHLLTC 139
+ +VQ + SC + TT ++D + AT LDH TL+ D + +
Sbjct: 247 LLVLLNVQNAKAWNAFNLSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSL 306
Query: 140 CAACTGGDVCE------GGNPM 155
+ NP
Sbjct: 307 LLKYLDCRPQDLPREVLTTNPR 328
Score = 29.1 bits (64), Expect = 0.90
Identities = 20/94 (21%), Positives = 27/94 (28%), Gaps = 14/94 (14%)
Query: 9 VNHSHHLLLRHVTRDSNPGLWADPDI---LKSSP--------SFLSSLKFGLSLTPQSQE 57
VN H L V + + P I LK S + +
Sbjct: 408 VNKLHKYSL--VEKQPKESTISIPSIYLELKVKLENEYALHRSIVDHYNIPKTFDSDDLI 465
Query: 58 PNPDLQLGSEHFGDYQSNTELPEEFDL-RKQYPN 90
P Q H G + N E PE L R + +
Sbjct: 466 PPYLDQYFYSHIGHHLKNIEHPERMTLFRMVFLD 499
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 33.7 bits (76), Expect = 0.026
Identities = 22/123 (17%), Positives = 39/123 (31%), Gaps = 21/123 (17%)
Query: 73 QSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSCWAIATTAALSDRMCIATQGRLDHTLS 132
+ +E F K+ P I V+ Q+ G+CW ++ + L + G+ ++ LS
Sbjct: 5 KKVSEEGFVFTTVKENP----ITSVKNQNRAGTCWCYSSYSFL--ESELLRMGKGEYDLS 58
Query: 133 SDHLLTCC---------AACTGGDVCEGGNPMRAWYYMLENGV------PTGGDYGSCQR 177
+ +GG+ A Y M G+ G Y
Sbjct: 59 EMFTVYNTYLDRADAAVRTHGDVSFSQGGSFYDALYGMETFGLVPEEEMRPGMMYADTLS 118
Query: 178 FDR 180
Sbjct: 119 NHT 121
>1ig0_A Thiamin pyrophosphokinase; protein-substrate complex, compound
active site, alpha-beta- alpha, beta sandwich,
transferase; HET: VIB; 1.80A {Saccharomyces cerevisiae}
SCOP: b.82.6.1 c.100.1.1
Length = 319
Score = 28.7 bits (63), Expect = 0.98
Identities = 11/96 (11%), Positives = 29/96 (30%), Gaps = 4/96 (4%)
Query: 46 KFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGS 105
K +++ Q+ + + D + ++ E + + I + +
Sbjct: 111 KNKVTIIKQTTQYSTDFTKCVNLISLHFNSPEFRSLISNKDNLQSNHGIELEKGIHTLYN 170
Query: 106 CWAIATTAALSDRMCI----ATQGRLDHTLSSDHLL 137
+ + + + GR D T+ S L
Sbjct: 171 TMTESLVFSKVTPISLLALGGIGGRFDQTVHSITQL 206
>3ihk_A Thiamin pyrophosphokinase; structural genomics, PSI-2, protein
structure initiative, northeast structural genomics
consortium, NESG, SMR83; HET: TPP; 3.00A {Streptococcus
mutans}
Length = 218
Score = 27.4 bits (60), Expect = 2.4
Identities = 15/75 (20%), Positives = 24/75 (32%), Gaps = 5/75 (6%)
Query: 70 GDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGSC---WAIATTAALSDRMCIATQGR 126
GD+ S + EEF K + + + + A GR
Sbjct: 44 GDFDSVS--AEEFKQIKAKAKKLVMAPAEKNDTDTELALKTIFDCFGRVEIIVFGAFGGR 101
Query: 127 LDHTLSSDHLLTCCA 141
+DH LS+ L +
Sbjct: 102 IDHMLSNIFLPSDPD 116
>2g9z_A Thiamine pyrophosphokinase; thiamin-PNP, TPK, thiamin
pyrophosphokinase, structural GENO profun, bacterial
targets at IGS-CNRS, france, BIGS; HET: VNP; 1.96A
{Candida albicans} PDB: 2hh9_A*
Length = 348
Score = 26.8 bits (58), Expect = 5.1
Identities = 14/96 (14%), Positives = 27/96 (28%), Gaps = 4/96 (4%)
Query: 46 KFGLSLTPQSQEPNPDLQLGSEHFGDYQSNTELPEEFDLRKQYPNCTNIGHVQLQSNCGS 105
G + QS + D + E + G +L + +
Sbjct: 140 SHGSKIIRQSSQYYNDFTKSIHCIQLHYQLNHTKENWFESI----DEVDGLAKLWNGLNN 195
Query: 106 CWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCA 141
+ ++ + A GR D T+ S + L
Sbjct: 196 SSDVVVDIDITIYVLNAIGGRFDQTVQSINQLYIMN 231
>3ijm_A Uncharacterized restriction endonuclease-like FOL superfamily
protein; DUF820, cyanobacteria, PD(D/E)XK superfamily,
structural GEN PSI-2; 1.70A {Spirosoma linguale}
Length = 151
Score = 25.9 bits (56), Expect = 6.7
Identities = 16/60 (26%), Positives = 29/60 (48%), Gaps = 8/60 (13%)
Query: 9 VNHSHHLLLRHVTRDSNPGLWADPDILKSSPSFLSSLKFGLSLTPQSQ----EPNPDLQL 64
+N+SH + L+ + ++ + G+ A S ++ L GL QS+ EP P+ L
Sbjct: 4 MNYSHPISLKTLVQEDDIGVNAPII----HQSVIARLTAGLYPLYQSKKIPFEPLPETML 59
>3s4y_A Thiamin pyrophosphokinase 1; structural genomics, structural
genomics consortium, transferase; HET: TPP; 1.80A {Homo
sapiens} PDB: 1ig3_A* 2f17_A*
Length = 247
Score = 26.3 bits (57), Expect = 7.0
Identities = 6/58 (10%), Positives = 11/58 (18%)
Query: 103 CGSCWAIATTAALSDRMCIATQGRLDHTLSSDHLLTCCAACTGGDVCEGGNPMRAWYY 160
GR D ++S + L T + +
Sbjct: 112 LQKKIEEKDLKVDVIVTLGGLAGRFDQIMASVNTLFQATHITPFPIIIIQEESLIYLL 169
>1wlo_A SUFE protein; structural genomics, riken structural
genomics/proteomics in RSGI, unknown function; NMR
{Thermus thermophilus}
Length = 136
Score = 25.6 bits (56), Expect = 7.5
Identities = 5/22 (22%), Positives = 7/22 (31%)
Query: 33 DILKSSPSFLSSLKFGLSLTPQ 54
+L+ P F TP
Sbjct: 93 AVLEVPPGFYRGYGLEEFFTPL 114
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.319 0.135 0.445
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 2,929,767
Number of extensions: 158752
Number of successful extensions: 492
Number of sequences better than 10.0: 1
Number of HSP's gapped: 442
Number of HSP's successfully gapped: 61
Length of query: 185
Length of database: 6,701,793
Length adjustment: 88
Effective length of query: 97
Effective length of database: 4,244,745
Effective search space: 411740265
Effective search space used: 411740265
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 54 (24.3 bits)