Query psy282
Match_columns 233
No_of_seqs 301 out of 2087
Neff 8.0
Searched_HMMs 46136
Date Fri Aug 16 17:45:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy282.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/282hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1542|consensus 100.0 9E-34 1.9E-38 243.1 10.8 130 1-133 235-366 (372)
2 KOG1542|consensus 100.0 8.7E-32 1.9E-36 230.9 3.1 126 101-228 229-366 (372)
3 PTZ00203 cathepsin L protease; 100.0 9.5E-30 2.1E-34 225.4 13.8 127 2-133 207-335 (348)
4 cd02621 Peptidase_C1A_Cathepsi 100.0 2.6E-29 5.7E-34 213.3 13.5 130 1-134 88-238 (243)
5 cd02248 Peptidase_C1A Peptidas 100.0 1E-28 2.2E-33 204.5 14.0 126 1-132 78-206 (210)
6 cd02620 Peptidase_C1A_Cathepsi 100.0 8.4E-29 1.8E-33 209.3 13.3 129 1-133 84-233 (236)
7 cd02698 Peptidase_C1A_Cathepsi 100.0 1.5E-28 3.3E-33 208.1 13.6 125 1-132 86-232 (239)
8 KOG1543|consensus 100.0 1.9E-28 4.2E-33 215.5 13.2 126 1-132 189-317 (325)
9 PTZ00021 falcipain-2; Provisio 99.9 2.4E-27 5.1E-32 216.7 12.8 123 2-133 345-483 (489)
10 KOG1543|consensus 99.9 1.5E-28 3.3E-33 216.2 4.7 119 102-225 185-315 (325)
11 PTZ00049 cathepsin C-like prot 99.9 1.8E-26 3.8E-31 215.9 13.2 130 1-134 472-672 (693)
12 PTZ00200 cysteine proteinase; 99.9 1.8E-26 4E-31 209.8 12.9 123 1-133 312-440 (448)
13 PTZ00203 cathepsin L protease; 99.9 5.4E-27 1.2E-31 207.8 6.8 118 102-223 199-330 (348)
14 cd02621 Peptidase_C1A_Cathepsi 99.9 1.2E-26 2.6E-31 196.9 8.2 89 135-223 131-232 (243)
15 PTZ00364 dipeptidyl-peptidase 99.9 1E-25 2.3E-30 208.1 14.3 134 1-134 294-455 (548)
16 PF00112 Peptidase_C1: Papain 99.9 4.2E-26 9E-31 189.2 9.8 127 1-133 82-215 (219)
17 cd02698 Peptidase_C1A_Cathepsi 99.9 2.5E-26 5.3E-31 194.6 6.6 81 136-219 137-219 (239)
18 cd02620 Peptidase_C1A_Cathepsi 99.9 2.5E-26 5.5E-31 194.2 6.3 84 136-222 143-227 (236)
19 PTZ00021 falcipain-2; Provisio 99.9 4.3E-26 9.3E-31 208.4 7.9 122 102-231 339-480 (489)
20 cd02248 Peptidase_C1A Peptidas 99.9 7.5E-25 1.6E-29 181.3 7.7 84 137-222 117-201 (210)
21 PTZ00200 cysteine proteinase; 99.9 4E-24 8.8E-29 194.5 7.5 103 111-219 314-428 (448)
22 PTZ00462 Serine-repeat antigen 99.9 2E-23 4.4E-28 200.6 12.4 126 2-133 614-776 (1004)
23 cd02619 Peptidase_C1 C1 Peptid 99.9 6E-23 1.3E-27 170.5 12.4 119 1-123 82-212 (223)
24 PTZ00364 dipeptidyl-peptidase 99.9 6.4E-24 1.4E-28 196.3 5.9 89 136-224 342-450 (548)
25 PTZ00049 cathepsin C-like prot 99.9 2E-23 4.3E-28 195.5 6.8 89 136-224 556-667 (693)
26 PF00112 Peptidase_C1: Papain 99.9 2.8E-23 6.1E-28 172.2 5.2 83 136-220 122-206 (219)
27 PTZ00462 Serine-repeat antigen 99.9 1.4E-22 3E-27 194.9 6.4 80 138-219 681-766 (1004)
28 smart00645 Pept_C1 Papain fami 99.8 3.8E-21 8.2E-26 155.5 8.7 70 151-222 93-164 (174)
29 cd02619 Peptidase_C1 C1 Peptid 99.8 1.4E-21 3E-26 162.3 6.1 86 136-221 124-215 (223)
30 smart00645 Pept_C1 Papain fami 99.8 3.4E-20 7.3E-25 149.9 9.3 74 56-131 93-168 (174)
31 KOG1544|consensus 99.8 4.1E-20 8.8E-25 157.7 8.8 134 1-135 291-457 (470)
32 KOG1544|consensus 99.8 7.8E-20 1.7E-24 156.0 3.6 89 136-224 352-451 (470)
33 cd00585 Peptidase_C1B Peptidas 99.6 1.5E-16 3.2E-21 144.5 5.9 77 142-218 303-399 (437)
34 COG4870 Cysteine protease [Pos 99.5 2.8E-14 6E-19 124.4 5.2 78 140-219 227-314 (372)
35 cd00585 Peptidase_C1B Peptidas 99.4 8.8E-13 1.9E-17 119.9 9.3 90 32-123 286-399 (437)
36 COG4870 Cysteine protease [Pos 99.1 2.2E-10 4.7E-15 100.3 7.9 80 41-124 223-314 (372)
37 PF03051 Peptidase_C1_2: Pepti 98.8 4.8E-09 1E-13 95.7 4.3 81 138-218 300-400 (438)
38 PF03051 Peptidase_C1_2: Pepti 98.0 2.2E-05 4.8E-10 71.9 8.7 87 36-122 290-399 (438)
39 COG3579 PepC Aminopeptidase C 97.2 9.4E-05 2E-09 64.7 -0.0 74 144-217 308-401 (444)
40 COG3579 PepC Aminopeptidase C 93.8 0.05 1.1E-06 48.0 2.9 80 41-120 296-399 (444)
41 PF13529 Peptidase_C39_2: Pept 89.7 0.26 5.6E-06 37.0 2.3 56 137-203 89-144 (144)
42 PF13529 Peptidase_C39_2: Pept 88.2 1.9 4.1E-05 32.1 6.3 55 41-108 88-144 (144)
43 PF09778 Guanylate_cyc_2: Guan 78.1 8.8 0.00019 32.0 6.5 64 41-106 112-180 (212)
44 KOG4128|consensus 72.9 0.35 7.6E-06 42.7 -3.1 40 178-217 370-413 (457)
45 PF14399 Transpep_BrtH: NlpC/p 71.7 12 0.00025 32.5 6.1 57 41-106 77-133 (317)
46 PF14399 Transpep_BrtH: NlpC/p 66.9 4.7 0.0001 35.0 2.6 56 137-201 78-133 (317)
47 PF09778 Guanylate_cyc_2: Guan 62.4 13 0.00027 31.1 4.1 64 136-201 112-180 (212)
48 PF05543 Peptidase_C47: Stapho 61.3 25 0.00055 28.3 5.5 54 41-108 90-144 (175)
49 PF12385 Peptidase_C70: Papain 59.8 27 0.00058 27.8 5.3 39 41-96 97-135 (166)
50 cd00044 CysPc Calpains, domain 49.9 26 0.00057 30.7 4.3 27 179-205 235-263 (315)
51 PF05543 Peptidase_C47: Stapho 48.3 27 0.00058 28.2 3.7 36 178-218 118-154 (175)
52 COG4990 Uncharacterized protei 40.9 19 0.00041 29.3 1.8 47 136-204 122-168 (195)
53 COG4990 Uncharacterized protei 29.2 1.7E+02 0.0036 23.9 5.4 37 41-95 122-158 (195)
54 KOG4128|consensus 20.2 16 0.00034 32.6 -2.0 79 41-119 305-410 (457)
No 1
>KOG1542|consensus
Probab=100.00 E-value=9e-34 Score=243.08 Aligned_cols=130 Identities=42% Similarity=0.790 Sum_probs=122.5
Q ss_pred CCCCCCCCCCCCCCCCCccccccCCCceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCC
Q psy282 1 MGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETC 79 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c 79 (233)
.||+.|++|||++..++ .|..++....+.|++ |..++ ||++|.+.|.++|||+|+|++..+|+|.+||+.+....|
T Consensus 235 gGL~~E~dYPY~g~~~~--~C~~~~~~~~v~I~~-f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P~~~~C 311 (372)
T KOG1542|consen 235 GGLEKEKDYPYTGKKGN--QCHFDKSKIVVSIKD-FSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFYRGGVSCPSKYIC 311 (372)
T ss_pred CCccccccCCccccCCC--ccccchhhceEEEec-cEecCCCHHHHHHHHHhcCCeEEEEchHHHHHhcccccCCCcccC
Confidence 48999999999998875 899999999999999 99999 999999999999999999998899999999999854479
Q ss_pred CCCCCCcEEEEEeeeccC-CccEEEEecCCCCCCCCCcceeeeccccccCccccc
Q psy282 80 SPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH 133 (233)
Q Consensus 80 ~~~~~~H~v~ivG~g~~~-~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~ 133 (233)
++..+||+|+|||||... ..||||||||||+.|||+||+|+.|+.|.||++..+
T Consensus 312 s~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mv 366 (372)
T KOG1542|consen 312 SPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMV 366 (372)
T ss_pred CccccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccccccccch
Confidence 998899999999999998 999999999999999999999999999999998764
No 2
>KOG1542|consensus
Probab=99.97 E-value=8.7e-32 Score=230.87 Aligned_cols=126 Identities=32% Similarity=0.620 Sum_probs=108.9
Q ss_pred EEEEecCCCCCCCCCcceeeeccccccCccccc-----------ccCccceeeeeecccceEEEEeccccccccCcceec
Q psy282 101 YWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH-----------FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRK 169 (233)
Q Consensus 101 ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~-----------~~~e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~ 169 (233)
-|+++. |..-.|+.|||.....+.|.+.... ..+|++|.+.|.++|||+|+|++..+|+|.+||+.+
T Consensus 229 ~~~~~~--gGL~~E~dYPY~g~~~~~C~~~~~~~~v~I~~f~~l~~nE~~ia~wLv~~GPi~vgiNa~~mQ~YrgGV~~P 306 (372)
T KOG1542|consen 229 KYIKKA--GGLEKEKDYPYTGKKGNQCHFDKSKIVVSIKDFSMLSNNEDQIAAWLVTFGPLSVGINAKPMQFYRGGVSCP 306 (372)
T ss_pred HHHHHh--CCccccccCCccccCCCccccchhhceEEEeccEecCCCHHHHHHHHHhcCCeEEEEchHHHHHhcccccCC
Confidence 355665 5667799999988876789875421 247899999999999999999999999999999987
Q ss_pred CCCCCCCCCCCEEEEEEEeeecC-CcCEEEEEcCCCCCCCCCcEEEEEccCCCCccCCCC
Q psy282 170 NDETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFKIEHTLRSHLTHDIP 228 (233)
Q Consensus 170 ~~~~~~~~~~~Hav~iVGy~~~~-~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~~~~~ 228 (233)
....|++..++|||||||||..+ .+||||||||||++|||+||||+.||.|.|.++.++
T Consensus 307 ~~~~Cs~~~~~HaVLlvGyG~~g~~~PYWIVKNSWG~~WGE~GY~~l~RG~N~CGi~~mv 366 (372)
T KOG1542|consen 307 SKYICSPKLLNHAVLLVGYGSSGYEKPYWIVKNSWGTSWGEKGYYKLCRGSNACGIADMV 366 (372)
T ss_pred CcccCCccccCceEEEEeecCCCCCCceEEEECCccccccccceEEEeccccccccccch
Confidence 67789987899999999999988 899999999999999999999999999988665554
No 3
>PTZ00203 cathepsin L protease; Provisional
Probab=99.96 E-value=9.5e-30 Score=225.42 Aligned_cols=127 Identities=28% Similarity=0.588 Sum_probs=109.7
Q ss_pred CCCCCCCCCCCCCCCCccccccCCC-ceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCC
Q psy282 2 GLESEKDYPYKNANGEKFKCAYDKS-KVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETC 79 (233)
Q Consensus 2 Gi~~e~~yPY~~~~~~~~~C~~~~~-~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c 79 (233)
|+++|++|||++.++..+.|+.... ...+++++ |..++ ++++|+++|+++|||+|+|++++|++|++|||.. |
T Consensus 207 gi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~-~~~i~~~e~~~~~~l~~~GPv~v~i~a~~f~~Y~~GIy~~----c 281 (348)
T PTZ00203 207 TVFTEKSYPYVSGNGDVPECSNSSELAPGARIDG-YVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----C 281 (348)
T ss_pred CCCccccCCCccCCCCCCcCCCCcccccceEecc-eeecCcCHHHHHHHHHhCCCEEEEEEhhhhcCccCceeec----c
Confidence 4789999999987764346874333 24567888 88887 8999999999999999999998999999999963 7
Q ss_pred CCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCccccc
Q psy282 80 SPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH 133 (233)
Q Consensus 80 ~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~ 133 (233)
+...+||+|+|||||.+++.+|||||||||+.|||+||+|+.++.+.|++..++
T Consensus 282 ~~~~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~Cgi~~~~ 335 (348)
T PTZ00203 282 IGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGYP 335 (348)
T ss_pred CCCCCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCcccccceE
Confidence 666689999999999998999999999999999999999999988999987654
No 4
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.96 E-value=2.6e-29 Score=213.27 Aligned_cols=130 Identities=29% Similarity=0.585 Sum_probs=107.1
Q ss_pred CCCCCCCCCCCCC-CCCCccccccCC-CceeEEeCCceEEe------CCHHHHHHHHHhcCCEEEEEecC-CCCCcCCce
Q psy282 1 MGLESEKDYPYKN-ANGEKFKCAYDK-SKVKLFTGKDFLHF------NGSETMKKILYKYGPLSVLLNSD-LIHDYNGTP 71 (233)
Q Consensus 1 ~Gi~~e~~yPY~~-~~~~~~~C~~~~-~~~~~~i~~~~~~v------~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi 71 (233)
+||++|++|||+. ... .|+... ...++++.+ |..+ +++++||++|+++|||+++|++. +|++|++||
T Consensus 88 ~Gi~~e~~yPY~~~~~~---~C~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GI 163 (243)
T cd02621 88 FGIVTEDYFPYTADDDR---PCKASPSECRRYYFSD-YNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGV 163 (243)
T ss_pred cCcCCCceeCCCCCCCC---CCCCCccccccccccc-eeEcccccccCCHHHHHHHHHHcCCEEEEEEecccccccCCeE
Confidence 4899999999998 566 898655 334455555 4443 48999999999999999999995 999999999
Q ss_pred eeCCC--CCCCC--------CCCCcEEEEEeeeccC--CccEEEEecCCCCCCCCCcceeeeccccccCcccccc
Q psy282 72 IRKND--ETCSP--------YDLGHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLHF 134 (233)
Q Consensus 72 ~~~~~--~~c~~--------~~~~H~v~ivG~g~~~--~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~~ 134 (233)
|.... ..|+. ..++|+|+|||||.+. +.+|||||||||+.|||+||+|++++.+.|++.+.+.
T Consensus 164 y~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirNSWG~~WGe~Gy~~i~~~~~~cgi~~~~~ 238 (243)
T cd02621 164 YHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAV 238 (243)
T ss_pred ECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEcCCCCCCCcCCeEEEecCCcccCcccceE
Confidence 98752 01543 2579999999999886 8999999999999999999999999889999987643
No 5
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.96 E-value=1e-28 Score=204.49 Aligned_cols=126 Identities=40% Similarity=0.780 Sum_probs=114.0
Q ss_pred CCCCCCCCCCCCCCCCCccccccCCCceeEEeCCceEEeC--CHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCC
Q psy282 1 MGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDE 77 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~--~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~ 77 (233)
+|+++|++|||..... .|+.......+++.+ |..++ ++++||++|+++|||+++|.+. +|+.|++|||..+
T Consensus 78 ~Gi~~e~~yPY~~~~~---~C~~~~~~~~~~i~~-~~~i~~~~~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~~-- 151 (210)
T cd02248 78 GGLASESDYPYTGKDG---TCKYNSSKVGAKITG-YSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSGP-- 151 (210)
T ss_pred CCcCccccCCccCCCC---CccCCCCcccEEEee-EEEcCCCcHHHHHHHHhhcCCEEEEEecCcccccCCCCceeCC--
Confidence 5899999999998777 898777678899999 99887 5899999999999999999995 9999999999987
Q ss_pred CCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCcccc
Q psy282 78 TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFL 132 (233)
Q Consensus 78 ~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~ 132 (233)
.|....++|+|+|||||.+.+.+|||+|||||+.||++||+|+.++++.|++...
T Consensus 152 ~~~~~~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i~~~~~~cgi~~~ 206 (210)
T cd02248 152 CCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASY 206 (210)
T ss_pred CCCCCcCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEEEcCCCccCceee
Confidence 4556678999999999998899999999999999999999999998889999754
No 6
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.96 E-value=8.4e-29 Score=209.34 Aligned_cols=129 Identities=25% Similarity=0.435 Sum_probs=106.2
Q ss_pred CCCCCCCCCCCCCCCCCc---------------cccccCCC----ceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEe
Q psy282 1 MGLESEKDYPYKNANGEK---------------FKCAYDKS----KVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLN 60 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~~~---------------~~C~~~~~----~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~ 60 (233)
+|+++|++|||.+.+... +.|..... ...+++.. +..+. ++++||++|+++|||+++|+
T Consensus 84 ~G~~~e~~yPY~~~~~~~~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~~~~~-~~~~~~~~~~ik~~l~~~GPv~v~i~ 162 (236)
T cd02620 84 TGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKHKGKS-AYSVPSDETDIMKEIMTNGPVQAAFT 162 (236)
T ss_pred cCCCcCCEecCcCCCCccCCCCCCCCCCCCCCCCCCCcCCccccceeeeeecc-eeeeCCHHHHHHHHHHHCCCeEEEEE
Confidence 589999999998765410 12543221 22345666 66665 78999999999999999999
Q ss_pred c-CCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCccccc
Q psy282 61 S-DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLH 133 (233)
Q Consensus 61 ~-~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~ 133 (233)
+ ++|+.|++|||... |+...++|||+|||||.+++.+|||||||||+.|||+||+|+.++.+.|++.+..
T Consensus 163 ~~~~f~~Y~~Giy~~~---~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~ri~~~~~~cgi~~~~ 233 (236)
T cd02620 163 VYEDFLYYKSGVYQHT---SGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEV 233 (236)
T ss_pred echhhhhcCCcEEeec---CCCCcCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEEEEccCcccccccce
Confidence 9 59999999999765 6666789999999999999999999999999999999999999988999998754
No 7
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.96 E-value=1.5e-28 Score=208.12 Aligned_cols=125 Identities=24% Similarity=0.505 Sum_probs=105.5
Q ss_pred CCCCCCCCCCCCCCCCCcccccc---------------CCCceeEEeCCceEEeCCHHHHHHHHHhcCCEEEEEecC-CC
Q psy282 1 MGLESEKDYPYKNANGEKFKCAY---------------DKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSD-LI 64 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~~~~~C~~---------------~~~~~~~~i~~~~~~v~~~~~i~~~l~~~GPv~v~i~~~-~f 64 (233)
+|+++|++|||++.+. .|+. .+....+++++ |..++++++||++|.++|||+|+|++. +|
T Consensus 86 ~Gl~~e~~yPY~~~~~---~C~~~~~~~~c~~~~~c~~~~~~~~~~i~~-~~~~~~~~~i~~~l~~~GPV~v~i~~~~~f 161 (239)
T cd02698 86 HGIPDETCNPYQAKDG---ECNPFNRCGTCNPFGECFAIKNYTLYFVSD-YGSVSGRDKMMAEIYARGPISCGIMATEAL 161 (239)
T ss_pred cCcCCCCeeCCcCCCC---CCcCCCCCCCcccCcccccccccceEEeee-ceecCCHHHHHHHHHHcCCEEEEEEecccc
Confidence 4899999999998766 5542 11123567888 888888999999999999999999995 89
Q ss_pred CCcCCceeeCCCCCCCCCCCCcEEEEEeeeccC-CccEEEEecCCCCCCCCCcceeeeccc-----cccCcccc
Q psy282 65 HDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFFKIERGN-----NACGKDFL 132 (233)
Q Consensus 65 ~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~-~~~ywivknswg~~wge~gy~~~~~~~-----~~Cgi~~~ 132 (233)
+.|++|||+.. .| ...++|+|+|||||+++ +.+|||||||||+.|||+||+|+.++. +.|+|++.
T Consensus 162 ~~Y~~GIy~~~--~~-~~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~~i~rg~~~~~~~~~~i~~~ 232 (239)
T cd02698 162 ENYTGGVYKEY--VQ-DPLINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTSSYKGARYNLAIEED 232 (239)
T ss_pred cccCCeEEccC--CC-CCcCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceEEEEccCCcccccccccccc
Confidence 99999999876 34 45689999999999886 899999999999999999999999987 77887643
No 8
>KOG1543|consensus
Probab=99.96 E-value=1.9e-28 Score=215.47 Aligned_cols=126 Identities=32% Similarity=0.609 Sum_probs=112.8
Q ss_pred CCCCC-CCCCCCCCCCCCccccccCCCceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCC
Q psy282 1 MGLES-EKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN-GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDE 77 (233)
Q Consensus 1 ~Gi~~-e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~ 77 (233)
+|+++ |++|||.+..+ .|+.+.....+.+.+ +..++ +|++|+++|+.+|||+|+|++. +|+.|++|||.++
T Consensus 189 ~G~~t~~~~Ypy~~~~~---~C~~~~~~~~~~~~~-~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~~~-- 262 (325)
T KOG1543|consen 189 NGGVTECENYPYIGKDG---TCKSNKKDKTVTIKG-FYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYAEE-- 262 (325)
T ss_pred hCCCCCCcCCCCcCCCC---CccCCCccceeEeee-eeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEeCC--
Confidence 46677 99999999999 999888777788888 88888 9999999999999999999995 9999999999998
Q ss_pred CCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeeccccccCcccc
Q psy282 78 TCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFL 132 (233)
Q Consensus 78 ~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~ 132 (233)
.|....++|+|+|||||++++.+|||++||||+.|||+||+|+.|+.+.|+++..
T Consensus 263 ~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~~~ 317 (325)
T KOG1543|consen 263 KGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKCGIASE 317 (325)
T ss_pred CCCCCCCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCchhhhcc
Confidence 3433368999999999996678999999999999999999999999999986553
No 9
>PTZ00021 falcipain-2; Provisional
Probab=99.95 E-value=2.4e-27 Score=216.67 Aligned_cols=123 Identities=31% Similarity=0.592 Sum_probs=106.0
Q ss_pred CCCCCCCCCCCCCC-CCccccccCCCceeEEeCCceEEeCCHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCCCC
Q psy282 2 GLESEKDYPYKNAN-GEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC 79 (233)
Q Consensus 2 Gi~~e~~yPY~~~~-~~~~~C~~~~~~~~~~i~~~~~~v~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~~c 79 (233)
||++|++|||++.. + .|+.......++|.+ |..++ +++|+++|+.+|||+|+|++. +|++|++|||... |
T Consensus 345 Gl~tE~~YPY~~~~~~---~C~~~~~~~~~~i~~-y~~i~-~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~~~---C 416 (489)
T PTZ00021 345 GLCSEDDYPYVSDTPE---LCNIDRCKEKYKIKS-YVSIP-EDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFDGE---C 416 (489)
T ss_pred ccCcccccCccCCCCC---ccccccccccceeee-EEEec-HHHHHHHHHhcCCeEEEEEeecccccCCCCcCCCC---C
Confidence 89999999999874 5 798665556788999 98886 678999999999999999995 9999999999764 8
Q ss_pred CCCCCCcEEEEEeeeccCC----------ccEEEEecCCCCCCCCCcceeeeccc----cccCccccc
Q psy282 80 SPYDLGHAVLLVGYGKQDD----------IPYWLVRNSWGPIGPDEGFFKIERGN----NACGKDFLH 133 (233)
Q Consensus 80 ~~~~~~H~v~ivG~g~~~~----------~~ywivknswg~~wge~gy~~~~~~~----~~Cgi~~~~ 133 (233)
+. .+||||+|||||.+++ .+|||||||||+.|||+||+|+.++. +.|++...+
T Consensus 417 ~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~~g~~n~CGI~t~a 483 (489)
T PTZ00021 417 GE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETDENGLMKTCSLGTEA 483 (489)
T ss_pred CC-ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcCCCCCCCCCCCcccc
Confidence 65 4799999999997642 47999999999999999999999875 589997643
No 10
>KOG1543|consensus
Probab=99.95 E-value=1.5e-28 Score=216.17 Aligned_cols=119 Identities=28% Similarity=0.554 Sum_probs=95.3
Q ss_pred EEEecCCCCCCCCCcceeeeccccccCcccc-----------cccCccceeeeeecccceEEEEec-cccccccCcceec
Q psy282 102 WLVRNSWGPIGPDEGFFKIERGNNACGKDFL-----------HFNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRK 169 (233)
Q Consensus 102 wivknswg~~wge~gy~~~~~~~~~Cgi~~~-----------~~~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~ 169 (233)
|+.+| |..-+++.|++... ++.|..... ...++.+|++++..+|||+++|++ ..|++|++|||.
T Consensus 185 yi~~~--G~~t~~~~Ypy~~~-~~~C~~~~~~~~~~~~~~~~~~~~e~~i~~~v~~~GPv~v~~~a~~~F~~Y~~GVy~- 260 (325)
T KOG1543|consen 185 YIKKN--GGVTECENYPYIGK-DGTCKSNKKDKTVTIKGFYNVPANEEAIAEAVAKNGPVSVAIDAYEDFSLYKGGVYA- 260 (325)
T ss_pred HHHHh--CCCCCCcCCCCcCC-CCCccCCCccceeEeeeeeecCcCHHHHHHHHHhcCCeEEEEeehhhhhhccCceEe-
Confidence 55666 44445888987765 356765441 123588999999999999999999 589999999997
Q ss_pred CCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEEEEccCCCCccC
Q psy282 170 NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIEHTLRSHLTH 225 (233)
Q Consensus 170 ~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~~ 225 (233)
...|....++|||+|||||+.++.+|||||||||+.|||+|||||.|+.+.+.++
T Consensus 261 -~~~~~~~~~~Hav~iVGyG~~~~~~YWivkNSWG~~WGe~Gy~ri~r~~~~~~I~ 315 (325)
T KOG1543|consen 261 -EEKGDDKEGDHAVLIVGYGTGDGVDYWIVKNSWGTDWGEKGYFRIARGVNKCGIA 315 (325)
T ss_pred -CCCCCCCCCCceEEEEEEcCCCCceeEEEEcCCCCCcccCceEEEecCCCchhhh
Confidence 3344432589999999999966789999999999999999999999999888543
No 11
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.94 E-value=1.8e-26 Score=215.86 Aligned_cols=130 Identities=29% Similarity=0.635 Sum_probs=103.5
Q ss_pred CCCCCCCCCCCCCCCCCccccccCCC---------------------------------------ceeEEeCCceEEe--
Q psy282 1 MGLESEKDYPYKNANGEKFKCAYDKS---------------------------------------KVKLFTGKDFLHF-- 39 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~---------------------------------------~~~~~i~~~~~~v-- 39 (233)
+||++|+||||++..+ .|+.... ..++++++ |..+
T Consensus 472 ~GI~tEscYPY~a~~g---~C~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~y~k~-y~yI~g 547 (693)
T PTZ00049 472 QGIPLDKVFPYTATEQ---TCPYQVDQSANSMNGSANLRQINAVFFSSETQSDMHADFEAPISSEPARWYAKD-YNYIGG 547 (693)
T ss_pred CCCCcCCccCCcCCCC---CCCCCCCCccccccccccccccccccccccccccccccccccccccccceeeee-eEEecc
Confidence 4899999999998877 6754211 11234555 5544
Q ss_pred -------CCHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCC----CCCCCC--------------CCCcEEEEEee
Q psy282 40 -------NGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKND----ETCSPY--------------DLGHAVLLVGY 93 (233)
Q Consensus 40 -------~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~----~~c~~~--------------~~~H~v~ivG~ 93 (233)
.++++|+++|+++|||+|+|++. +|++|++|||..+. ..|+.. .+||||+||||
T Consensus 548 ~y~~~~~~~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGw 627 (693)
T PTZ00049 548 CYGCNQCNGEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGW 627 (693)
T ss_pred cccccCCCCHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEe
Confidence 26899999999999999999995 99999999998531 036422 46999999999
Q ss_pred ecc--CC--ccEEEEecCCCCCCCCCcceeeeccccccCcccccc
Q psy282 94 GKQ--DD--IPYWLVRNSWGPIGPDEGFFKIERGNNACGKDFLHF 134 (233)
Q Consensus 94 g~~--~~--~~ywivknswg~~wge~gy~~~~~~~~~Cgi~~~~~ 134 (233)
|.+ ++ .+|||+|||||+.||++||+|+.|+.+.|+|++.+.
T Consensus 628 G~d~enG~~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGIEs~a~ 672 (693)
T PTZ00049 628 GEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSL 672 (693)
T ss_pred ccccCCCcccCEEEEECCCCCCcccCceEEEEcCCCccCCcccee
Confidence 985 45 389999999999999999999999999999987654
No 12
>PTZ00200 cysteine proteinase; Provisional
Probab=99.94 E-value=1.8e-26 Score=209.84 Aligned_cols=123 Identities=31% Similarity=0.631 Sum_probs=102.9
Q ss_pred CCCCCCCCCCCCCCCCCccccccCCCceeEEeCCceEEeCCHHHHHHHHHhcCCEEEEEecC-CCCCcCCceeeCCCCCC
Q psy282 1 MGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIRKNDETC 79 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~~~~~C~~~~~~~~~~i~~~~~~v~~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~~~~~~c 79 (233)
+||++|++|||++..+ .|+.... ..++|.+ |..+++.+.|+++|. +|||+|+|++. +|+.|++|||..+ |
T Consensus 312 ~Gi~~e~~YPY~~~~~---~C~~~~~-~~~~i~~-y~~~~~~~~l~~~l~-~GPV~v~i~~~~~f~~Yk~GIy~~~---C 382 (448)
T PTZ00200 312 KGLSSSSDVPYLAKDG---KCVVSST-KKVYIDS-YLVAKGKDVLNKSLV-ISPTVVYIAVSRELLKYKSGVYNGE---C 382 (448)
T ss_pred cCccccccCCCCCCCC---CCcCCCC-CeeEecc-eEecCHHHHHHHHHh-cCCEEEEeecccccccCCCCccccc---c
Confidence 5999999999999988 8986543 4567888 887766566666664 69999999995 9999999999764 8
Q ss_pred CCCCCCcEEEEEeeec--cCCccEEEEecCCCCCCCCCcceeeecc---ccccCccccc
Q psy282 80 SPYDLGHAVLLVGYGK--QDDIPYWLVRNSWGPIGPDEGFFKIERG---NNACGKDFLH 133 (233)
Q Consensus 80 ~~~~~~H~v~ivG~g~--~~~~~ywivknswg~~wge~gy~~~~~~---~~~Cgi~~~~ 133 (233)
+.. +||||+|||||. +++.+|||||||||+.|||+||+|+.++ .+.|++.+.+
T Consensus 383 ~~~-~nHaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r~~~g~n~CGI~~~~ 440 (448)
T PTZ00200 383 GKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGILTVG 440 (448)
T ss_pred CCC-CcEEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEeCCCCCCcCCccccc
Confidence 654 899999999996 4678999999999999999999999985 4789997653
No 13
>PTZ00203 cathepsin L protease; Provisional
Probab=99.93 E-value=5.4e-27 Score=207.84 Aligned_cols=118 Identities=25% Similarity=0.590 Sum_probs=90.6
Q ss_pred EEEecCCCCCCCCCcceeeeccc--cccCccc----------c-c-ccCccceeeeeecccceEEEEeccccccccCcce
Q psy282 102 WLVRNSWGPIGPDEGFFKIERGN--NACGKDF----------L-H-FNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPI 167 (233)
Q Consensus 102 wivknswg~~wge~gy~~~~~~~--~~Cgi~~----------~-~-~~~e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~ 167 (233)
|++++.-|..-.|+.|||..... ..|.... + . ..+++.|+++|.+.|||+++|++..|++|++|||
T Consensus 199 yi~~~~~ggi~~e~~YPY~~~~~~~~~C~~~~~~~~~~~i~~~~~i~~~e~~~~~~l~~~GPv~v~i~a~~f~~Y~~GIy 278 (348)
T PTZ00203 199 WVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVL 278 (348)
T ss_pred HHHHhcCCCCCccccCCCccCCCCCCcCCCCcccccceEecceeecCcCHHHHHHHHHhCCCEEEEEEhhhhcCccCcee
Confidence 55544334444577888865431 1464211 0 0 1245678888988999999999988999999998
Q ss_pred ecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEEEEccCCCCc
Q psy282 168 RKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIEHTLRSHL 223 (233)
Q Consensus 168 ~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~ 223 (233)
. .|....+||||+|||||++++++|||||||||++|||+|||||+|+.+.|.
T Consensus 279 ~----~c~~~~~nHaVliVGYG~~~g~~YWiikNSWG~~WGe~GY~ri~rg~n~Cg 330 (348)
T PTZ00203 279 T----SCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACL 330 (348)
T ss_pred e----ccCCCCCCeEEEEEEEecCCCceEEEEEcCCCCCcCcCceEEEEcCCCccc
Confidence 5 386656899999999999889999999999999999999999999987653
No 14
>cd02621 Peptidase_C1A_CathepsinC Cathepsin C; also known as Dipeptidyl Peptidase I (DPPI), an atypical papain-like cysteine peptidase with chloride dependency and dipeptidyl aminopeptidase activity, resulting from its tetrameric structure which limits substrate access. Each subunit of the tetramer is composed of three peptides: the heavy and light chains, which together adopts the papain fold and forms the catalytic domain; and the residual propeptide region, which forms a beta barrel and points towards the substrate's N-terminus. The subunit composition is the result of the unique characteristic of procathepsin C maturation involving the cleavage of the catalytic domain and the non-autocatalytic excision of an activation peptide within its propeptide region. By removing N-terminal dipeptide extensions, cathepsin C activates granule serine peptidases (granzymes) involved in cell-mediated apoptosis, inflammation and tissue remodelling. Loss-of-function mutations in cathepsin C are assoc
Probab=99.93 E-value=1.2e-26 Score=196.91 Aligned_cols=89 Identities=33% Similarity=0.617 Sum_probs=74.9
Q ss_pred cCccceeeeeecccceEEEEec-cccccccCcceecCC--CCCCC--------CCCCEEEEEEEeeecC--CcCEEEEEc
Q psy282 135 NGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKND--ETCSP--------YDLGHAVLLVGYGKQD--DIPYWLVRN 201 (233)
Q Consensus 135 ~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~--~~~~~--------~~~~Hav~iVGy~~~~--~~~ywivkN 201 (233)
.++++|+++|.++|||+++|++ +.|++|++|||.... ..|+. ..++|||+|||||++. +++||||||
T Consensus 131 ~~~~~ik~~i~~~GPv~v~~~~~~~F~~Y~~GIy~~~~~~~~C~~~~~~~~~~~~~~HaV~iVGyg~~~~~g~~YWiirN 210 (243)
T cd02621 131 TNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKN 210 (243)
T ss_pred CCHHHHHHHHHHcCCEEEEEEecccccccCCeEECcCCcccccccccccccCcccCCeEEEEEEeeccCCCCCcEEEEEc
Confidence 3567899999999999999998 589999999986321 12643 2479999999999876 899999999
Q ss_pred CCCCCCCCCcEEEEEccCCCCc
Q psy282 202 SWGPIGPDEGFFKIEHTLRSHL 223 (233)
Q Consensus 202 SWg~~WG~~Gy~~i~~~~~~~~ 223 (233)
|||++|||+|||||+|+.+.+-
T Consensus 211 SWG~~WGe~Gy~~i~~~~~~cg 232 (243)
T cd02621 211 SWGSSWGEKGYFKIRRGTNECG 232 (243)
T ss_pred CCCCCCCcCCeEEEecCCcccC
Confidence 9999999999999999876553
No 15
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.93 E-value=1e-25 Score=208.13 Aligned_cols=134 Identities=22% Similarity=0.443 Sum_probs=105.0
Q ss_pred CCCCCCCCC--CCCCCCCCccccccCCCceeEEeCCc-----eEEeC-CHHHHHHHHHhcCCEEEEEecC-CCCCcCCce
Q psy282 1 MGLESEKDY--PYKNANGEKFKCAYDKSKVKLFTGKD-----FLHFN-GSETMKKILYKYGPLSVLLNSD-LIHDYNGTP 71 (233)
Q Consensus 1 ~Gi~~e~~y--PY~~~~~~~~~C~~~~~~~~~~i~~~-----~~~v~-~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi 71 (233)
+||++|++| ||++.++....|+......+++++++ |..+. ++++|+++|+++|||+|+|++. +|+.|++||
T Consensus 294 ~GI~tE~dY~~PY~~~dg~~~~Ck~~~~~~~y~~~~~~~I~gyy~~~~~e~~I~~eI~~~GPVsVaIda~~df~~YksGi 373 (548)
T PTZ00364 294 FGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENS 373 (548)
T ss_pred CCcccccccCCCCCCCCCCCCCCCCCcccceeeeeeeEEecceeecCCcHHHHHHHHHHcCCeEEEEEechHHHhcCCCC
Confidence 589999999 99877663335876554444444430 44344 7899999999999999999995 899999999
Q ss_pred eeCC-----C-CCC---C-------CCCCCcEEEEEeeec-cCCccEEEEecCCCC--CCCCCcceeeeccccccCcccc
Q psy282 72 IRKN-----D-ETC---S-------PYDLGHAVLLVGYGK-QDDIPYWLVRNSWGP--IGPDEGFFKIERGNNACGKDFL 132 (233)
Q Consensus 72 ~~~~-----~-~~c---~-------~~~~~H~v~ivG~g~-~~~~~ywivknswg~--~wge~gy~~~~~~~~~Cgi~~~ 132 (233)
|... . +.| + ...+||||+|||||. +++.+|||+|||||+ .|||+||+|+.|+.+.|++++.
T Consensus 374 y~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YWIVKNSWGt~~~WGE~GYfRI~RG~N~CGIes~ 453 (548)
T PTZ00364 374 TEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESE 453 (548)
T ss_pred ccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEEEEECCCCCCCCcccCCeEEEEcCCCcccccce
Confidence 8631 0 011 1 235799999999997 478899999999999 9999999999999999999876
Q ss_pred cc
Q psy282 133 HF 134 (233)
Q Consensus 133 ~~ 134 (233)
+.
T Consensus 454 ~v 455 (548)
T PTZ00364 454 VV 455 (548)
T ss_pred ee
Confidence 53
No 16
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.93 E-value=4.2e-26 Score=189.19 Aligned_cols=127 Identities=33% Similarity=0.630 Sum_probs=110.1
Q ss_pred CCCCCCCCCCCCCCC-CCccccccCCCce-eEEeCCceEEeC--CHHHHHHHHHhcCCEEEEEecC--CCCCcCCceeeC
Q psy282 1 MGLESEKDYPYKNAN-GEKFKCAYDKSKV-KLFTGKDFLHFN--GSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRK 74 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~-~~~~~C~~~~~~~-~~~i~~~~~~v~--~~~~i~~~l~~~GPv~v~i~~~--~f~~y~~Gi~~~ 74 (233)
+|+++|++|||.... . .|....... ..++.. |..+. ++++||++|.++|||++.|.+. +|+.|++|||..
T Consensus 82 ~Gi~~e~~~pY~~~~~~---~c~~~~~~~~~~~i~~-~~~~~~~~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~~ 157 (219)
T PF00112_consen 82 NGIVTEEDYPYNGNENP---TCKSKKSNSYYVKIKG-YGKVKDNDIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYDP 157 (219)
T ss_dssp TSBEBTTTS--SSSSSC---SSCHSGGGEEEBEESE-EEEEESTCHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEECS
T ss_pred Ccccccccccccccccc---cccccccccccccccc-cccccccchhHHHHHHhhCceeeeeeeccccccccccceeeec
Confidence 599999999999876 5 798765544 478888 88887 5999999999999999999994 599999999998
Q ss_pred CCCCCCCCCCCcEEEEEeeeccCCccEEEEecCCCCCCCCCcceeeecccc-ccCccccc
Q psy282 75 NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNN-ACGKDFLH 133 (233)
Q Consensus 75 ~~~~c~~~~~~H~v~ivG~g~~~~~~ywivknswg~~wge~gy~~~~~~~~-~Cgi~~~~ 133 (233)
+ .|....++|+|+|||||.+.+..|||+|||||+.||++||+|+.++.+ .|++++.+
T Consensus 158 ~--~~~~~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~~i~~~~~~~c~i~~~~ 215 (219)
T PF00112_consen 158 P--DCSNESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYFRISYDYNNECGIESQA 215 (219)
T ss_dssp T--SSSSSSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEEEEESSSSSGGGTTSSE
T ss_pred c--ccccccccccccccccccccceeeEeeehhhCCccCCCeEEEEeeCCCCcCccCcee
Confidence 7 587788999999999999999999999999999999999999999876 99998654
No 17
>cd02698 Peptidase_C1A_CathepsinX Cathepsin X; the only papain-like lysosomal cysteine peptidase exhibiting carboxymonopeptidase activity. It can also act as a carboxydipeptidase, like cathepsin B, but has been shown to preferentially cleave substrates through a monopeptidyl carboxypeptidase pathway. The propeptide region of cathepsin X, the shortest among papain-like peptidases, is covalently attached to the active site cysteine in the inactive form of the enzyme. Little is known about the biological function of cathepsin X. Some studies point to a role in early tumorigenesis. A more recent study indicates that cathepsin X expression is restricted to immune cells suggesting a role in phagocytosis and the regulation of the immune response.
Probab=99.93 E-value=2.5e-26 Score=194.63 Aligned_cols=81 Identities=28% Similarity=0.649 Sum_probs=71.8
Q ss_pred CccceeeeeecccceEEEEec-cccccccCcceecCCCCCCCCCCCEEEEEEEeeecC-CcCEEEEEcCCCCCCCCCcEE
Q psy282 136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD-DIPYWLVRNSWGPIGPDEGFF 213 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~-~~~ywivkNSWg~~WG~~Gy~ 213 (233)
+++.|+++|.++|||+++|++ +.|++|++|||. ...|. ..++|||+|||||+++ +++|||||||||++|||+|||
T Consensus 137 ~~~~i~~~l~~~GPV~v~i~~~~~f~~Y~~GIy~--~~~~~-~~~~HaV~IVGyG~~~~g~~YWiikNSWG~~WGe~Gy~ 213 (239)
T cd02698 137 GRDKMMAEIYARGPISCGIMATEALENYTGGVYK--EYVQD-PLINHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWF 213 (239)
T ss_pred CHHHHHHHHHHcCCEEEEEEecccccccCCeEEc--cCCCC-CcCCeEEEEEEEEecCCCCEEEEEEcCCCcccCcCceE
Confidence 467799999899999999999 479999999985 33443 4679999999999876 899999999999999999999
Q ss_pred EEEccC
Q psy282 214 KIEHTL 219 (233)
Q Consensus 214 ~i~~~~ 219 (233)
||+|+.
T Consensus 214 ~i~rg~ 219 (239)
T cd02698 214 RIVTSS 219 (239)
T ss_pred EEEccC
Confidence 999998
No 18
>cd02620 Peptidase_C1A_CathepsinB Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane
Probab=99.93 E-value=2.5e-26 Score=194.22 Aligned_cols=84 Identities=26% Similarity=0.556 Sum_probs=74.1
Q ss_pred CccceeeeeecccceEEEEec-cccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEE
Q psy282 136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFK 214 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~ 214 (233)
++++|+.+|.++|||+++|++ +.|++|++|||. ..|....++|||+|||||++++++|||||||||+.|||+||||
T Consensus 143 ~~~~ik~~l~~~GPv~v~i~~~~~f~~Y~~Giy~---~~~~~~~~~HaV~iVGyg~~~g~~YWivrNSWG~~WGe~Gy~r 219 (236)
T cd02620 143 DETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQ---HTSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFR 219 (236)
T ss_pred HHHHHHHHHHHCCCeEEEEEechhhhhcCCcEEe---ecCCCCcCCeEEEEEEEeccCCeeEEEEEeCCCCCCCCCcEEE
Confidence 467899999999999999999 589999999985 2355556799999999999889999999999999999999999
Q ss_pred EEccCCCC
Q psy282 215 IEHTLRSH 222 (233)
Q Consensus 215 i~~~~~~~ 222 (233)
|+|+.+.+
T Consensus 220 i~~~~~~c 227 (236)
T cd02620 220 ILRGSNEC 227 (236)
T ss_pred EEccCccc
Confidence 99987544
No 19
>PTZ00021 falcipain-2; Provisional
Probab=99.93 E-value=4.3e-26 Score=208.42 Aligned_cols=122 Identities=24% Similarity=0.466 Sum_probs=90.6
Q ss_pred EEEecCCCCCCCCCcceeeeccccccCccc---------ccccCccceeeeeecccceEEEEec-cccccccCcceecCC
Q psy282 102 WLVRNSWGPIGPDEGFFKIERGNNACGKDF---------LHFNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKND 171 (233)
Q Consensus 102 wivknswg~~wge~gy~~~~~~~~~Cgi~~---------~~~~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~ 171 (233)
|++++ +....|+.|||.......|.... +....++.++++|...|||+++|++ +.|++|++|||.
T Consensus 339 yi~~~--gGl~tE~~YPY~~~~~~~C~~~~~~~~~~i~~y~~i~~~~lk~al~~~GPVsv~i~a~~~f~~YkgGIy~--- 413 (489)
T PTZ00021 339 DMIEL--GGLCSEDDYPYVSDTPELCNIDRCKEKYKIKSYVSIPEDKFKEAIRFLGPISVSIAVSDDFAFYKGGIFD--- 413 (489)
T ss_pred hhhhc--cccCcccccCccCCCCCccccccccccceeeeEEEecHHHHHHHHHhcCCeEEEEEeecccccCCCCcCC---
Confidence 55554 34556888998765335664321 1111246788899889999999999 589999999984
Q ss_pred CCCCCCCCCEEEEEEEeeecC----------CcCEEEEEcCCCCCCCCCcEEEEEccCCCCccCCCCCCC
Q psy282 172 ETCSPYDLGHAVLLVGYGKQD----------DIPYWLVRNSWGPIGPDEGFFKIEHTLRSHLTHDIPGVP 231 (233)
Q Consensus 172 ~~~~~~~~~Hav~iVGy~~~~----------~~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~~~~~~~~ 231 (233)
..|+. .++|||+|||||+++ +.+|||||||||++|||+|||||+|+.+.. .++|||.
T Consensus 414 ~~C~~-~~nHAVlIVGYG~e~~~~~~~~~~~~~~YWIVKNSWGt~WGE~GY~rI~r~~~g~--~n~CGI~ 480 (489)
T PTZ00021 414 GECGE-EPNHAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRIETDENGL--MKTCSLG 480 (489)
T ss_pred CCCCC-ccceEEEEEEecCcCCcccccccCCCCCEEEEECCCCCCcccCeEEEEEcCCCCC--CCCCCCc
Confidence 46875 579999999999763 257999999999999999999999986421 3455553
No 20
>cd02248 Peptidase_C1A Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to h
Probab=99.91 E-value=7.5e-25 Score=181.32 Aligned_cols=84 Identities=38% Similarity=0.791 Sum_probs=74.4
Q ss_pred ccceeeeeecccceEEEEecc-ccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEEEE
Q psy282 137 SETMKKILYKYGPLSVGLNSH-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKI 215 (233)
Q Consensus 137 e~~i~~~l~~~GPv~v~i~~~-~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~~i 215 (233)
.++||++|.++|||+++|.+. .|++|++|||.. ..|....++|||+|||||++.+.+|||||||||+.||++|||||
T Consensus 117 ~~~ik~~l~~~gPV~~~~~~~~~f~~y~~Giy~~--~~~~~~~~~Hav~iVGy~~~~~~~ywiv~NSWG~~WG~~Gy~~i 194 (210)
T cd02248 117 EEALKAALANYGPVSVAIDASSSFQFYKGGIYSG--PCCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRI 194 (210)
T ss_pred HHHHHHHHhhcCCEEEEEecCcccccCCCCceeC--CCCCCCcCCEEEEEEEEeecCCceEEEEEcCCCCccccCcEEEE
Confidence 578999999999999999984 899999999863 34545568999999999998889999999999999999999999
Q ss_pred EccCCCC
Q psy282 216 EHTLRSH 222 (233)
Q Consensus 216 ~~~~~~~ 222 (233)
+++.+.|
T Consensus 195 ~~~~~~c 201 (210)
T cd02248 195 ARGSNLC 201 (210)
T ss_pred EcCCCcc
Confidence 9988654
No 21
>PTZ00200 cysteine proteinase; Provisional
Probab=99.90 E-value=4e-24 Score=194.55 Aligned_cols=103 Identities=25% Similarity=0.489 Sum_probs=77.7
Q ss_pred CCCCCcceeeeccccccCccc--------cc-ccCccceeeeeecccceEEEEec-cccccccCcceecCCCCCCCCCCC
Q psy282 111 IGPDEGFFKIERGNNACGKDF--------LH-FNGSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSPYDLG 180 (233)
Q Consensus 111 ~wge~gy~~~~~~~~~Cgi~~--------~~-~~~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~~~~~ 180 (233)
.-.|+.|||... +..|.... +. ..+.+.+++++ ..|||+++|++ ..|++|++|||. +.|+. .++
T Consensus 314 i~~e~~YPY~~~-~~~C~~~~~~~~~i~~y~~~~~~~~l~~~l-~~GPV~v~i~~~~~f~~Yk~GIy~---~~C~~-~~n 387 (448)
T PTZ00200 314 LSSSSDVPYLAK-DGKCVVSSTKKVYIDSYLVAKGKDVLNKSL-VISPTVVYIAVSRELLKYKSGVYN---GECGK-SLN 387 (448)
T ss_pred ccccccCCCCCC-CCCCcCCCCCeeEecceEecCHHHHHHHHH-hcCCEEEEeecccccccCCCCccc---cccCC-CCc
Confidence 345777887654 35564321 11 11234455555 57999999999 489999999985 35876 489
Q ss_pred EEEEEEEeee--cCCcCEEEEEcCCCCCCCCCcEEEEEccC
Q psy282 181 HAVLLVGYGK--QDDIPYWLVRNSWGPIGPDEGFFKIEHTL 219 (233)
Q Consensus 181 Hav~iVGy~~--~~~~~ywivkNSWg~~WG~~Gy~~i~~~~ 219 (233)
|||+|||||. +++.+|||||||||++|||+|||||+|+.
T Consensus 388 HaV~lVGyG~d~~~g~~YWIIkNSWG~~WGe~GY~ri~r~~ 428 (448)
T PTZ00200 388 HAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTN 428 (448)
T ss_pred EEEEEEEecccCCCCCceEEEEcCCCCCcccCeeEEEEeCC
Confidence 9999999984 46789999999999999999999999974
No 22
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.90 E-value=2e-23 Score=200.65 Aligned_cols=126 Identities=27% Similarity=0.591 Sum_probs=99.6
Q ss_pred CCCCCCCCCCCC--CCCCccccccCCC------------------ceeEEeCCceEEeCC----------HHHHHHHHHh
Q psy282 2 GLESEKDYPYKN--ANGEKFKCAYDKS------------------KVKLFTGKDFLHFNG----------SETMKKILYK 51 (233)
Q Consensus 2 Gi~~e~~yPY~~--~~~~~~~C~~~~~------------------~~~~~i~~~~~~v~~----------~~~i~~~l~~ 51 (233)
||++|++|||.+ ..+ .|+.... ...+.+++ |..+.+ +++||++|++
T Consensus 614 gLptESdYPYt~k~~~g---~Cp~~~~~w~n~~~~~kll~~~~~~~~~i~~kg-Y~~~~s~~~~~n~d~~i~~IK~eI~~ 689 (1004)
T PTZ00462 614 FLPADSNYLYNYTKVGE---DCPDEEDHWMNLLDHGKILNHNKKEPNSLDGKA-YRAYESEHFHDKMDAFIKIIKDEIMN 689 (1004)
T ss_pred CCcccccCCCccCCCCC---CCCCCcccccccccccccccccccccceeeccc-eEEecccccccchhhHHHHHHHHHHh
Confidence 588999999985 444 7874321 01234456 665541 4689999999
Q ss_pred cCCEEEEEecCCCCCcC-CceeeCCCCCCCCCCCCcEEEEEeeecc-----CCccEEEEecCCCCCCCCCcceeeec-cc
Q psy282 52 YGPLSVLLNSDLIHDYN-GTPIRKNDETCSPYDLGHAVLLVGYGKQ-----DDIPYWLVRNSWGPIGPDEGFFKIER-GN 124 (233)
Q Consensus 52 ~GPv~v~i~~~~f~~y~-~Gi~~~~~~~c~~~~~~H~v~ivG~g~~-----~~~~ywivknswg~~wge~gy~~~~~-~~ 124 (233)
+|||+|+|++.+|+.|. +|||... .|+....+|||+|||||.+ .+.+|||||||||+.||++||+|+.| +.
T Consensus 690 kGPVaV~IdAsdf~~Y~~sGIyv~~--~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnGYFKI~r~g~ 767 (1004)
T PTZ00462 690 KGSVIAYIKAENVLGYEFNGKKVQN--LCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVDMYGP 767 (1004)
T ss_pred cCCEEEEEEeehHHhhhcCCccccC--CCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCeEEEEEeCCC
Confidence 99999999998788884 8998766 5877678999999999974 25799999999999999999999987 57
Q ss_pred cccCccccc
Q psy282 125 NACGKDFLH 133 (233)
Q Consensus 125 ~~Cgi~~~~ 133 (233)
+.|++....
T Consensus 768 n~CGin~i~ 776 (1004)
T PTZ00462 768 SHCEDNFIH 776 (1004)
T ss_pred CCCccchhe
Confidence 999987654
No 23
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.89 E-value=6e-23 Score=170.50 Aligned_cols=119 Identities=24% Similarity=0.282 Sum_probs=100.2
Q ss_pred CCCCCCCCCCCCCCCCCcccccc----CCCceeEEeCCceEEeC--CHHHHHHHHHhcCCEEEEEecC-CCCCcCCceee
Q psy282 1 MGLESEKDYPYKNANGEKFKCAY----DKSKVKLFTGKDFLHFN--GSETMKKILYKYGPLSVLLNSD-LIHDYNGTPIR 73 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~~~~~C~~----~~~~~~~~i~~~~~~v~--~~~~i~~~l~~~GPv~v~i~~~-~f~~y~~Gi~~ 73 (233)
+|+++|++|||..... .|.. ......+++.+ |..+. ++++||++|.++|||+++|.+. .|..|++|++.
T Consensus 82 ~Gi~~e~~~Py~~~~~---~~~~~~~~~~~~~~~~~~~-y~~~~~~~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~ 157 (223)
T cd02619 82 KGIPPEEDYPYGAESD---GEEPKSEAALNAAKVKLKD-YRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIY 157 (223)
T ss_pred cCCCccccCCCCCCCC---CCCCCCccchhhcceeecc-eeEeCchhHHHHHHHHHHCCCEEEEEEcccchhcccCcccc
Confidence 5899999999998877 5542 23346688889 88887 6899999999999999999995 99999999973
Q ss_pred ---CCCCCCCCCCCCcEEEEEeeeccC--CccEEEEecCCCCCCCCCcceeeecc
Q psy282 74 ---KNDETCSPYDLGHAVLLVGYGKQD--DIPYWLVRNSWGPIGPDEGFFKIERG 123 (233)
Q Consensus 74 ---~~~~~c~~~~~~H~v~ivG~g~~~--~~~ywivknswg~~wge~gy~~~~~~ 123 (233)
.....|....++|||+|||||.+. +.+|||+|||||+.||++||.|+.++
T Consensus 158 ~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~~Gy~~i~~~ 212 (223)
T cd02619 158 EEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRISYE 212 (223)
T ss_pred ccccccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCccccCCEEEEehh
Confidence 111145667789999999999886 89999999999999999999999885
No 24
>PTZ00364 dipeptidyl-peptidase I precursor; Provisional
Probab=99.89 E-value=6.4e-24 Score=196.30 Aligned_cols=89 Identities=21% Similarity=0.408 Sum_probs=72.2
Q ss_pred CccceeeeeecccceEEEEec-cccccccCccee-----cCC-CCCC----------CCCCCEEEEEEEeee-cCCcCEE
Q psy282 136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIR-----KND-ETCS----------PYDLGHAVLLVGYGK-QDDIPYW 197 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~-----~~~-~~~~----------~~~~~Hav~iVGy~~-~~~~~yw 197 (233)
++++|+.+|.++|||+++|++ +.|++|++|++. +.+ ..|. ...+||||+|||||+ +++++||
T Consensus 342 ~e~~I~~eI~~~GPVsVaIda~~df~~YksGiy~gi~~~~~~~~~~~~~~~~~~~~~~~~~nHAVlIVGYG~de~G~~YW 421 (548)
T PTZ00364 342 DPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYW 421 (548)
T ss_pred cHHHHHHHHHHcCCeEEEEEechHHHhcCCCCccCeeccccccccccccCCcccccccccCCeEEEEEEecccCCCceEE
Confidence 456799999999999999999 478899998863 111 1121 135799999999997 5688999
Q ss_pred EEEcCCCC--CCCCCcEEEEEccCCCCcc
Q psy282 198 LVRNSWGP--IGPDEGFFKIEHTLRSHLT 224 (233)
Q Consensus 198 ivkNSWg~--~WG~~Gy~~i~~~~~~~~~ 224 (233)
|||||||+ +|||+|||||+|+.+++-+
T Consensus 422 IVKNSWGt~~~WGE~GYfRI~RG~N~CGI 450 (548)
T PTZ00364 422 LVLDPWGSRRSWCDGGTRKIARGVNAYNI 450 (548)
T ss_pred EEECCCCCCCCcccCCeEEEEcCCCcccc
Confidence 99999999 9999999999999876644
No 25
>PTZ00049 cathepsin C-like protein; Provisional
Probab=99.88 E-value=2e-23 Score=195.46 Aligned_cols=89 Identities=29% Similarity=0.567 Sum_probs=73.3
Q ss_pred CccceeeeeecccceEEEEecc-ccccccCcceecCC----CCCCCC--------------CCCEEEEEEEeeec--CC-
Q psy282 136 GSETMKKILYKYGPLSVGLNSH-LIHFYNGTPIRKND----ETCSPY--------------DLGHAVLLVGYGKQ--DD- 193 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~~-~~~~y~~gi~~~~~----~~~~~~--------------~~~Hav~iVGy~~~--~~- 193 (233)
++..|+.+|..+|||+++|++. .|++|++|||...+ ..|... .++|||+|||||++ ++
T Consensus 556 ~E~~Im~eI~~~GPVsVsIda~~dF~~YksGVY~~~~~~h~~~C~~d~~~~~~~~~~~G~e~~NHAVlIVGwG~d~enG~ 635 (693)
T PTZ00049 556 GEKIMMNEIYRNGPIVASFEASPDFYDYADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGK 635 (693)
T ss_pred CHHHHHHHHHhcCCEEEEEEechhhhcCCCccccCcccccccccCCccccccccccccccccCceEEEEEEeccccCCCc
Confidence 5678999999999999999994 79999999996321 147432 36999999999975 45
Q ss_pred -cCEEEEEcCCCCCCCCCcEEEEEccCCCCcc
Q psy282 194 -IPYWLVRNSWGPIGPDEGFFKIEHTLRSHLT 224 (233)
Q Consensus 194 -~~ywivkNSWg~~WG~~Gy~~i~~~~~~~~~ 224 (233)
.+|||||||||+.|||+|||||+|+.+.+.+
T Consensus 636 ~~~YWIVRNSWGt~WGenGYfKI~RG~N~CGI 667 (693)
T PTZ00049 636 LYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGI 667 (693)
T ss_pred ccCEEEEECCCCCCcccCceEEEEcCCCccCC
Confidence 3799999999999999999999999876533
No 26
>PF00112 Peptidase_C1: Papain family cysteine protease This is family C1 in the peptidase classification. ; InterPro: IPR000668 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to the peptidase family C1, sub-family C1A (papain family, clan CA). It includes proteins classed as non-peptidase homologs. These are have either been shown experimentally to lack peptidase activity or lack one or more of the active site residues. The papain family has a wide variety of activities, including broad-range (papain) and narrow-range endo-peptidases, aminopeptidases, dipeptidyl peptidases and enzymes with both exo- and endo-peptidase activity []. Members of the papain family are widespread, found in baculovirus [], eubacteria, yeast, and practically all protozoa, plants and mammals []. The proteins are typically lysosomal or secreted, and proteolytic cleavage of the propeptide is required for enzyme activation, although bleomycin hydrolase is cytosolic in fungi and mammals []. Papain-like cysteine proteinases are essentially synthesised as inactive proenzymes (zymogens) with N-terminal propeptide regions. The activation process of these enzymes includes the removal of propeptide regions. The propeptide regions serve a variety of functions in vivo and in vitro. The pro-region is required for the proper folding of the newly synthesised enzyme, the inactivation of the peptidase domain and stabilisation of the enzyme against denaturing at neutral to alkaline pH conditions. Amino acid residues within the pro-region mediate their membrane association, and play a role in the transport of the proenzyme to lysosomes. Among the most notable features of propeptides is their ability to inhibit the activity of their cognate enzymes and that certain propeptides exhibit high selectivity for inhibition of the peptidases from which they originate []. The catalytic residues of papain are Cys-25 and His-159, other important residues being Gln-19, which helps form the 'oxyanion hole', and Asn-175, which orientates the imidazole ring of His-159. ; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 3MOR_B 3HHI_B 1S4V_A 3F75_A 1MEG_A 1PCI_C 1PPO_A 3HD3_B 1F29_A 1EWL_A ....
Probab=99.88 E-value=2.8e-23 Score=172.18 Aligned_cols=83 Identities=36% Similarity=0.683 Sum_probs=74.2
Q ss_pred CccceeeeeecccceEEEEec-c-ccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCCCCCCCCcEE
Q psy282 136 GSETMKKILYKYGPLSVGLNS-H-LIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFF 213 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~-~-~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg~~WG~~Gy~ 213 (233)
+.++|+++|.++|||+++|.+ + .|+.|++||+. ...|.....+|||+|||||++.+++|||||||||+.||++|||
T Consensus 122 ~~~~ik~~L~~~gpV~~~~~~~~~~f~~~~~gi~~--~~~~~~~~~~Hav~iVGy~~~~~~~~wiv~NSWG~~WG~~Gy~ 199 (219)
T PF00112_consen 122 DIEDIKKALMKYGPVVASIDVSSEDFQNYKSGIYD--PPDCSNESGGHAVLIVGYDDENGKGYWIVKNSWGTDWGDNGYF 199 (219)
T ss_dssp CHHHHHHHHHHHSSEEEEEEEESHHHHTEESSEEC--STSSSSSSEEEEEEEEEEEEETTEEEEEEE-SBTTTSTBTTEE
T ss_pred chhHHHHHHhhCceeeeeeeccccccccccceeee--ccccccccccccccccccccccceeeEeeehhhCCccCCCeEE
Confidence 467899999999999999998 4 49999999986 3367766789999999999999999999999999999999999
Q ss_pred EEEccCC
Q psy282 214 KIEHTLR 220 (233)
Q Consensus 214 ~i~~~~~ 220 (233)
||+|+.+
T Consensus 200 ~i~~~~~ 206 (219)
T PF00112_consen 200 RISYDYN 206 (219)
T ss_dssp EEESSSS
T ss_pred EEeeCCC
Confidence 9999987
No 27
>PTZ00462 Serine-repeat antigen protein; Provisional
Probab=99.86 E-value=1.4e-22 Score=194.93 Aligned_cols=80 Identities=30% Similarity=0.549 Sum_probs=67.9
Q ss_pred cceeeeeecccceEEEEeccccccc-cCcceecCCCCCCCCCCCEEEEEEEeeec-----CCcCEEEEEcCCCCCCCCCc
Q psy282 138 ETMKKILYKYGPLSVGLNSHLIHFY-NGTPIRKNDETCSPYDLGHAVLLVGYGKQ-----DDIPYWLVRNSWGPIGPDEG 211 (233)
Q Consensus 138 ~~i~~~l~~~GPv~v~i~~~~~~~y-~~gi~~~~~~~~~~~~~~Hav~iVGy~~~-----~~~~ywivkNSWg~~WG~~G 211 (233)
..|+++|...|||+|+|++..|++| .+|++. ...|+...++|||+|||||++ ++++|||||||||+.|||+|
T Consensus 681 ~~IK~eI~~kGPVaV~IdAsdf~~Y~~sGIyv--~~~Cgs~~~nHAVlIVGYGt~in~eg~gk~YWIVRNSWGt~WGEnG 758 (1004)
T PTZ00462 681 KIIKDEIMNKGSVIAYIKAENVLGYEFNGKKV--QNLCGDDTADHAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEG 758 (1004)
T ss_pred HHHHHHHHhcCCEEEEEEeehHHhhhcCCccc--cCCCCCCcCCceEEEEEecccccccCCCCceEEEEcCCCCCcCCCe
Confidence 4688889899999999999667777 488865 346986678999999999974 25789999999999999999
Q ss_pred EEEEEccC
Q psy282 212 FFKIEHTL 219 (233)
Q Consensus 212 y~~i~~~~ 219 (233)
||||+|..
T Consensus 759 YFKI~r~g 766 (1004)
T PTZ00462 759 YFKVDMYG 766 (1004)
T ss_pred EEEEEeCC
Confidence 99999953
No 28
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.84 E-value=3.8e-21 Score=155.46 Aligned_cols=70 Identities=34% Similarity=0.719 Sum_probs=58.9
Q ss_pred EEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeec-CCcCEEEEEcCCCCCCCCCcEEEEEccC-CCC
Q psy282 151 SVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DDIPYWLVRNSWGPIGPDEGFFKIEHTL-RSH 222 (233)
Q Consensus 151 ~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~-~~~~ywivkNSWg~~WG~~Gy~~i~~~~-~~~ 222 (233)
++.+.+..|++|++|||. ...|.....+|+|+|||||.+ ++++|||||||||+.|||+|||||.|+. +.+
T Consensus 93 ~~~~~~~~f~~Y~~Gi~~--~~~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c 164 (174)
T smart00645 93 SVAIDASDFQFYKSGIYD--HPGCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNEC 164 (174)
T ss_pred EEEEEcccccCCcCeEEC--CCCCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCcc
Confidence 555666669999999885 235765557999999999987 8899999999999999999999999987 443
No 29
>cd02619 Peptidase_C1 C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some exceptions like cathepsins B, C, H and X, which are exopeptidases. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds while mammalian CPs are primarily lysosomal enzymes responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. Bleomycin hydrolase (BH) is a CP that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. It forms a hexameric ring barrel str
Probab=99.84 E-value=1.4e-21 Score=162.31 Aligned_cols=86 Identities=27% Similarity=0.279 Sum_probs=72.7
Q ss_pred CccceeeeeecccceEEEEec-cccccccCccee---cCCCCCCCCCCCEEEEEEEeeecC--CcCEEEEEcCCCCCCCC
Q psy282 136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIR---KNDETCSPYDLGHAVLLVGYGKQD--DIPYWLVRNSWGPIGPD 209 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~---~~~~~~~~~~~~Hav~iVGy~~~~--~~~ywivkNSWg~~WG~ 209 (233)
+++.|+++|.+.|||+++|.+ +.|.+|+++++. .....|.....+|||+|||||++. +++|||||||||+.||+
T Consensus 124 ~~~~ik~aL~~~gPv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Hav~ivGy~~~~~~~~~~~i~~NSwG~~wg~ 203 (223)
T cd02619 124 NIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGD 203 (223)
T ss_pred hHHHHHHHHHHCCCEEEEEEcccchhcccCccccccccccccCCCccCCeEEEEEeecCCCCCCCCEEEEEeCCCCcccc
Confidence 357899999999999999998 578889988762 123345556789999999999876 88999999999999999
Q ss_pred CcEEEEEccCCC
Q psy282 210 EGFFKIEHTLRS 221 (233)
Q Consensus 210 ~Gy~~i~~~~~~ 221 (233)
+|||||.++...
T Consensus 204 ~Gy~~i~~~~~~ 215 (223)
T cd02619 204 NGYGRISYEDVY 215 (223)
T ss_pred CCEEEEehhhhh
Confidence 999999998643
No 30
>smart00645 Pept_C1 Papain family cysteine protease.
Probab=99.82 E-value=3.4e-20 Score=149.93 Aligned_cols=74 Identities=38% Similarity=0.784 Sum_probs=64.8
Q ss_pred EEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeecc-CCccEEEEecCCCCCCCCCcceeeeccc-cccCccc
Q psy282 56 SVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ-DDIPYWLVRNSWGPIGPDEGFFKIERGN-NACGKDF 131 (233)
Q Consensus 56 ~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~-~~~~ywivknswg~~wge~gy~~~~~~~-~~Cgi~~ 131 (233)
++.+.+.+|+.|++|||+.+ .|+....+|+|+|||||.+ ++.+|||++||||+.||++||+|+.++. +.|+++.
T Consensus 93 ~~~~~~~~f~~Y~~Gi~~~~--~~~~~~~~Hav~ivGyg~~~~g~~yWii~NSwG~~WG~~G~~~i~~~~~~~c~i~~ 168 (174)
T smart00645 93 SVAIDASDFQFYKSGIYDHP--GCGSGTLDHAVLIVGYGTEENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEA 168 (174)
T ss_pred EEEEEcccccCCcCeEECCC--CCCCCcccEEEEEEEEeecCCCeeEEEEECCCCCCcccCeEEEEEcCCCCccCcee
Confidence 56666668999999999886 4766668999999999987 8899999999999999999999999986 8899854
No 31
>KOG1544|consensus
Probab=99.82 E-value=4.1e-20 Score=157.73 Aligned_cols=134 Identities=22% Similarity=0.441 Sum_probs=103.0
Q ss_pred CCCCCCCCCCCCCCCC-Ccccc------------------ccCCC--ceeEEeCCceEEeC-CHHHHHHHHHhcCCEEEE
Q psy282 1 MGLESEKDYPYKNANG-EKFKC------------------AYDKS--KVKLFTGKDFLHFN-GSETMKKILYKYGPLSVL 58 (233)
Q Consensus 1 ~Gi~~e~~yPY~~~~~-~~~~C------------------~~~~~--~~~~~i~~~~~~v~-~~~~i~~~l~~~GPv~v~ 58 (233)
.|++...||||.+.+. ..+.| ..... +..+++.. =+.|+ +|++|+++|+++|||.+.
T Consensus 291 rGvVsdhCYP~~~dQ~~~~~~C~m~sR~~grgkRqat~~CPn~~~~Sn~iyq~tP-PYrVSSnE~eImkElM~NGPVQA~ 369 (470)
T KOG1544|consen 291 RGVVSDHCYPFSGDQAGPAPPCMMHSRAMGRGKRQATAHCPNSYVNSNDIYQVTP-PYRVSSNEKEIMKELMENGPVQAL 369 (470)
T ss_pred ccccccccccccCCCCCCCCCceeeccccCcccccccCcCCCcccccCceeeecC-CeeccCCHHHHHHHHHhCCChhhh
Confidence 4899999999985433 22234 32211 13344444 33344 999999999999999999
Q ss_pred Eec-CCCCCcCCceeeCCCCCC-----CCCCCCcEEEEEeeeccC-----CccEEEEecCCCCCCCCCcceeeecccccc
Q psy282 59 LNS-DLIHDYNGTPIRKNDETC-----SPYDLGHAVLLVGYGKQD-----DIPYWLVRNSWGPIGPDEGFFKIERGNNAC 127 (233)
Q Consensus 59 i~~-~~f~~y~~Gi~~~~~~~c-----~~~~~~H~v~ivG~g~~~-----~~~ywivknswg~~wge~gy~~~~~~~~~C 127 (233)
|.+ ++|..|++|||.+.+..- ....+.|+|.|.|||++. ..+||+.-||||+.|||+||+++.++.|.|
T Consensus 370 m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG~~WGE~GYFriLRGvNec 449 (470)
T KOG1544|consen 370 MEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGYFRILRGVNEC 449 (470)
T ss_pred hhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCCCeeEEEEeecccccccccCceEEEeccccch
Confidence 999 599999999998864211 123578999999999874 367999999999999999999999999999
Q ss_pred Cccccccc
Q psy282 128 GKDFLHFN 135 (233)
Q Consensus 128 gi~~~~~~ 135 (233)
.|++..+.
T Consensus 450 dIEsfvIg 457 (470)
T KOG1544|consen 450 DIESFVIG 457 (470)
T ss_pred hhhHhhhh
Confidence 99887543
No 32
>KOG1544|consensus
Probab=99.78 E-value=7.8e-20 Score=156.01 Aligned_cols=89 Identities=22% Similarity=0.472 Sum_probs=75.4
Q ss_pred CccceeeeeecccceEEEEec-cccccccCcceecCCCCCCC-----CCCCEEEEEEEeeecC-----CcCEEEEEcCCC
Q psy282 136 GSETMKKILYKYGPLSVGLNS-HLIHFYNGTPIRKNDETCSP-----YDLGHAVLLVGYGKQD-----DIPYWLVRNSWG 204 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~-~~~~~y~~gi~~~~~~~~~~-----~~~~Hav~iVGy~~~~-----~~~ywivkNSWg 204 (233)
+|++|++.|+++|||.+.|.+ .+|++|++|||.+.+.+... ..+.|+|.|.|||++. -++||+..||||
T Consensus 352 nE~eImkElM~NGPVQA~m~VHEDFF~YkgGiY~H~~~~~~~~e~yr~~gtHsVk~tGWG~~~~~~G~~~KyW~aANSWG 431 (470)
T KOG1544|consen 352 NEKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWG 431 (470)
T ss_pred CHHHHHHHHHhCCChhhhhhhhhhhhhhccceeeccccccCCchhhhhcccceEEEeecccccCCCCCeeEEEEeecccc
Confidence 577899999999999999999 58999999999865444332 2579999999999863 267999999999
Q ss_pred CCCCCCcEEEEEccCCCCcc
Q psy282 205 PIGPDEGFFKIEHTLRSHLT 224 (233)
Q Consensus 205 ~~WG~~Gy~~i~~~~~~~~~ 224 (233)
+.|||+|||||.|+.|++-+
T Consensus 432 ~~WGE~GYFriLRGvNecdI 451 (470)
T KOG1544|consen 432 PAWGERGYFRILRGVNECDI 451 (470)
T ss_pred cccccCceEEEeccccchhh
Confidence 99999999999999987733
No 33
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.64 E-value=1.5e-16 Score=144.45 Aligned_cols=77 Identities=19% Similarity=0.272 Sum_probs=63.5
Q ss_pred eeeecccceEEEEeccccccccCcceecCCC------------------CCCCCCCCEEEEEEEeeecC-Cc-CEEEEEc
Q psy282 142 KILYKYGPLSVGLNSHLIHFYNGTPIRKNDE------------------TCSPYDLGHAVLLVGYGKQD-DI-PYWLVRN 201 (233)
Q Consensus 142 ~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~------------------~~~~~~~~Hav~iVGy~~~~-~~-~ywivkN 201 (233)
++|..++||.++.++..++.|++||+..... .|.++..+|||+||||+.++ |. .||+|||
T Consensus 303 ~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ivGv~~D~~g~p~yw~VkN 382 (437)
T cd00585 303 AQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMVLTGVDLDEDGKPVKWKVEN 382 (437)
T ss_pred HHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEEEEEEEecCCCCcceEEEEc
Confidence 6777889999999998788999999842110 14445679999999999754 66 5999999
Q ss_pred CCCCCCCCCcEEEEEcc
Q psy282 202 SWGPIGPDEGFFKIEHT 218 (233)
Q Consensus 202 SWg~~WG~~Gy~~i~~~ 218 (233)
|||+.||++|||+|+++
T Consensus 383 SWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 383 SWGEKVGKKGYFVMSDD 399 (437)
T ss_pred ccCCCCCCCcceehhHH
Confidence 99999999999999985
No 34
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.48 E-value=2.8e-14 Score=124.40 Aligned_cols=78 Identities=26% Similarity=0.359 Sum_probs=50.2
Q ss_pred eeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeec----------CCcCEEEEEcCCCCCCCC
Q psy282 140 MKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ----------DDIPYWLVRNSWGPIGPD 209 (233)
Q Consensus 140 i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~----------~~~~ywivkNSWg~~WG~ 209 (233)
+++++...|-++..|..+...++.-.+-.+ ..-+....+|||+||||++. .+...||||||||+.||+
T Consensus 227 i~~~~~~yg~~s~~~~id~~~~~~~~~~~~--~~~s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSWGt~wG~ 304 (372)
T COG4870 227 IKAMFGFYGAVSSSMYIDATNSLGICIPYP--YVDSGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGE 304 (372)
T ss_pred hHHHHhhhccccceeEEecccccccccCCC--CCCccccccceEEEEeccccccccccccCCCCCceEEEECcccccccc
Confidence 555555666665444442222222111111 11111467999999999974 235599999999999999
Q ss_pred CcEEEEEccC
Q psy282 210 EGFFKIEHTL 219 (233)
Q Consensus 210 ~Gy~~i~~~~ 219 (233)
+|||||...-
T Consensus 305 ~GYfwisY~y 314 (372)
T COG4870 305 NGYFWISYYY 314 (372)
T ss_pred CceEEEEeee
Confidence 9999999854
No 35
>cd00585 Peptidase_C1B Peptidase C1B subfamily (MEROPS database nomenclature); composed of eukaryotic bleomycin hydrolases (BH) and bacterial aminopeptidases C (pepC). The proteins of this subfamily contain a large insert relative to the C1A peptidase (papain) subfamily. BH is a cysteine peptidase that detoxifies bleomycin by hydrolysis of an amide group. It acts as a carboxypeptidase on its C-terminus to convert itself into an aminopeptidase and peptide ligase. BH is found in all tissues in mammals as well as in many other eukaryotes. Bleomycin, a glycopeptide derived from the fungus Streptomyces verticullus, is an effective anticancer drug due to its ability to induce DNA strand breaks. Human BH is the major cause of tumor cell resistance to bleomycin chemotherapy, and is also genetically linked to Alzheimer's disease. In addition to its peptidase activity, the yeast BH (Gal6) binds DNA and acts as a repressor in the Gal4 regulatory system. BH forms a hexameric ring barrel structure w
Probab=99.40 E-value=8.8e-13 Score=119.92 Aligned_cols=90 Identities=19% Similarity=0.268 Sum_probs=70.4
Q ss_pred eCCceEEeCCHHHHH----HHHHhcCCEEEEEecCCCCCcCCceeeCCCC------------------CCCCCCCCcEEE
Q psy282 32 TGKDFLHFNGSETMK----KILYKYGPLSVLLNSDLIHDYNGTPIRKNDE------------------TCSPYDLGHAVL 89 (233)
Q Consensus 32 i~~~~~~v~~~~~i~----~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~------------------~c~~~~~~H~v~ 89 (233)
... |..++ .+.|+ ++|...+||.++.++..|+.|++||+..... .|.....+|||+
T Consensus 286 ~~~-y~Nvp-~d~l~~~~~~~L~~g~pV~~g~Dv~~~~~~k~GI~d~~~~~~~~~f~~~~~~~KaeRl~~~es~~tHAM~ 363 (437)
T cd00585 286 PIL-YLNVP-MDVLKKAAIAQLKDGEPVWFGCDVGKFSDRKSGILDTDLFDYELLFGIDFGLNKAERLDYGESLMTHAMV 363 (437)
T ss_pred cce-EEecC-HHHHHHHHHHHHhcCCCEEEEEEcChhhccCCccccCcccchhhhcCccccCCHHHHHhhcCCcCCeEEE
Confidence 345 66665 44444 6778888999999998888999999954310 144456799999
Q ss_pred EEeeeccC-Cc-cEEEEecCCCCCCCCCcceeeecc
Q psy282 90 LVGYGKQD-DI-PYWLVRNSWGPIGPDEGFFKIERG 123 (233)
Q Consensus 90 ivG~g~~~-~~-~ywivknswg~~wge~gy~~~~~~ 123 (233)
|||++.+. +. .||+|+||||..||++||+++..+
T Consensus 364 ivGv~~D~~g~p~yw~VkNSWG~~~G~~Gy~~ms~~ 399 (437)
T cd00585 364 LTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVMSDD 399 (437)
T ss_pred EEEEEecCCCCcceEEEEcccCCCCCCCcceehhHH
Confidence 99999854 76 599999999999999999988653
No 36
>COG4870 Cysteine protease [Posttranslational modification, protein turnover, chaperones]
Probab=99.11 E-value=2.2e-10 Score=100.28 Aligned_cols=80 Identities=25% Similarity=0.316 Sum_probs=57.7
Q ss_pred CHHHHHHHHHhcCCEEEEEec--CCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccC----------CccEEEEecCC
Q psy282 41 GSETMKKILYKYGPLSVLLNS--DLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQD----------DIPYWLVRNSW 108 (233)
Q Consensus 41 ~~~~i~~~l~~~GPv~v~i~~--~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~----------~~~ywivknsw 108 (233)
+...|++++...|-++..|.. ..+....-+.+... +....+|||+||||+..- +...|||||||
T Consensus 223 dnG~i~~~~~~yg~~s~~~~id~~~~~~~~~~~~~~~----s~~~~gHAv~iVGyDDs~~~n~~~~~~~g~GAfiikNSW 298 (372)
T COG4870 223 DNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVD----SGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSW 298 (372)
T ss_pred cccchHHHHhhhccccceeEEecccccccccCCCCCC----ccccccceEEEEeccccccccccccCCCCCceEEEECcc
Confidence 555688888888877755554 34444344444443 236789999999998642 23479999999
Q ss_pred CCCCCCCcceeeeccc
Q psy282 109 GPIGPDEGFFKIERGN 124 (233)
Q Consensus 109 g~~wge~gy~~~~~~~ 124 (233)
|+.||++||+++...+
T Consensus 299 Gt~wG~~GYfwisY~y 314 (372)
T COG4870 299 GTNWGENGYFWISYYY 314 (372)
T ss_pred ccccccCceEEEEeee
Confidence 9999999999987653
No 37
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.78 E-value=4.8e-09 Score=95.68 Aligned_cols=81 Identities=21% Similarity=0.315 Sum_probs=55.8
Q ss_pred cceeeeeecccceEEEEeccccccccCcceecCCC----------C--------CCCCCCCEEEEEEEeee-cCCc-CEE
Q psy282 138 ETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDE----------T--------CSPYDLGHAVLLVGYGK-QDDI-PYW 197 (233)
Q Consensus 138 ~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~----------~--------~~~~~~~Hav~iVGy~~-~~~~-~yw 197 (233)
..+.++|..+-||-.|-++..+...+.|+....-. . ...+..+|||+|||.+. ++|. .+|
T Consensus 300 ~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~~D~~g~p~~w 379 (438)
T PF03051_consen 300 DAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVDLDEDGKPVRW 379 (438)
T ss_dssp HHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEEE-TTSSEEEE
T ss_pred HHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEEeccCCCeeEE
Confidence 35666777788999999997555666776521100 0 01235699999999997 5666 499
Q ss_pred EEEcCCCCCCCCCcEEEEEcc
Q psy282 198 LVRNSWGPIGPDEGFFKIEHT 218 (233)
Q Consensus 198 ivkNSWg~~WG~~Gy~~i~~~ 218 (233)
+|+||||+..|.+|||.|+..
T Consensus 380 kVeNSWG~~~g~kGy~~msd~ 400 (438)
T PF03051_consen 380 KVENSWGTDNGDKGYFYMSDD 400 (438)
T ss_dssp EEE-SBTTTSTBTTEEEEEHH
T ss_pred EEEcCCCCCCCCCcEEEECHH
Confidence 999999999999999999863
No 38
>PF03051 Peptidase_C1_2: Peptidase C1-like family This family is a subfamily of the Prosite entry; InterPro: IPR004134 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of proteins belong to MEROPS peptidase family C1, sub-family C1B (bleomycin hydrolase, clan CA). This family contains prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases.; GO: 0004197 cysteine-type endopeptidase activity, 0006508 proteolysis; PDB: 3PW3_F 2CB5_A 1CB5_C 2DZZ_A 2E02_A 2E01_A 2E03_A 1A6R_A 1GCB_A 3GCB_A ....
Probab=98.03 E-value=2.2e-05 Score=71.89 Aligned_cols=87 Identities=22% Similarity=0.361 Sum_probs=58.6
Q ss_pred eEEeC-C--HHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCC------------------CCCCCCCCcEEEEEeee
Q psy282 36 FLHFN-G--SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDE------------------TCSPYDLGHAVLLVGYG 94 (233)
Q Consensus 36 ~~~v~-~--~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~------------------~c~~~~~~H~v~ivG~g 94 (233)
|.+++ + .+.+.++|...-||..+-++..+...+.||.....- .-.....+||++|+|..
T Consensus 290 ylNvpid~lk~~~i~~Lk~G~~VwfgcDV~k~~~~k~Gi~D~~~~d~~~~fg~~~~~~K~~Rl~~~eS~~tHAM~itGv~ 369 (438)
T PF03051_consen 290 YLNVPIDELKDAAIKSLKAGYPVWFGCDVGKFFDRKNGIMDTDLYDYDSLFGVDFNMSKAERLDYGESTMTHAMVITGVD 369 (438)
T ss_dssp EEE--HHHHHHHHHHHHHTT--EEEEEETTTTEETTTTEE-TTSB-HHHHHT--S-S-HHHHHHTTSS--EEEEEEEEEE
T ss_pred EeccCHHHHHHHHHHHHHcCCcEEEeccCCccccccchhhccchhhhhhhhccccccCHHHHHHhCCCCCceeEEEEEEE
Confidence 56666 1 245566777767999999997666778888854310 00113569999999999
Q ss_pred c-cCCcc-EEEEecCCCCCCCCCcceeeec
Q psy282 95 K-QDDIP-YWLVRNSWGPIGPDEGFFKIER 122 (233)
Q Consensus 95 ~-~~~~~-ywivknswg~~wge~gy~~~~~ 122 (233)
. +++.| +|.|+||||...|.+||+.+..
T Consensus 370 ~D~~g~p~~wkVeNSWG~~~g~kGy~~msd 399 (438)
T PF03051_consen 370 LDEDGKPVRWKVENSWGTDNGDKGYFYMSD 399 (438)
T ss_dssp E-TTSSEEEEEEE-SBTTTSTBTTEEEEEH
T ss_pred eccCCCeeEEEEEcCCCCCCCCCcEEEECH
Confidence 7 56765 8999999999999999987653
No 39
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=97.16 E-value=9.4e-05 Score=64.66 Aligned_cols=74 Identities=18% Similarity=0.225 Sum_probs=49.5
Q ss_pred eecccceEEEEeccccccccCcceecCCC------------------CCCCCCCCEEEEEEEeee-cCCc-CEEEEEcCC
Q psy282 144 LYKYGPLSVGLNSHLIHFYNGTPIRKNDE------------------TCSPYDLGHAVLLVGYGK-QDDI-PYWLVRNSW 203 (233)
Q Consensus 144 l~~~GPv~v~i~~~~~~~y~~gi~~~~~~------------------~~~~~~~~Hav~iVGy~~-~~~~-~ywivkNSW 203 (233)
+...-+|=+|-++..++-.+.||+.-... ..+.+.+.|||+|.|.+. ++|. --|.|.|||
T Consensus 308 ~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~g~p~rwkVENSW 387 (444)
T COG3579 308 MQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDETGNPLRWKVENSW 387 (444)
T ss_pred HhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccCCCceeeEeeccc
Confidence 33455676776665555555555421100 011234689999999985 4443 389999999
Q ss_pred CCCCCCCcEEEEEc
Q psy282 204 GPIGPDEGFFKIEH 217 (233)
Q Consensus 204 g~~WG~~Gy~~i~~ 217 (233)
|..=|.+|||-++.
T Consensus 388 G~d~G~~GyfvaSd 401 (444)
T COG3579 388 GKDVGKKGYFVASD 401 (444)
T ss_pred ccccCCCceEeehH
Confidence 99999999998865
No 40
>COG3579 PepC Aminopeptidase C [Amino acid transport and metabolism]
Probab=93.84 E-value=0.05 Score=48.01 Aligned_cols=80 Identities=21% Similarity=0.323 Sum_probs=54.7
Q ss_pred CHHHHHHH----HHhcCCEEEEEecCCCCCcCCceeeCCC-------------C-----CCCCCCCCcEEEEEeeecc-C
Q psy282 41 GSETMKKI----LYKYGPLSVLLNSDLIHDYNGTPIRKND-------------E-----TCSPYDLGHAVLLVGYGKQ-D 97 (233)
Q Consensus 41 ~~~~i~~~----l~~~GPv~v~i~~~~f~~y~~Gi~~~~~-------------~-----~c~~~~~~H~v~ivG~g~~-~ 97 (233)
+.+.+|++ +...-||-.+-++..|+.-+.||..... + +.+.....||++|.|...+ +
T Consensus 296 ~me~lkkl~~~q~qagetVwFG~dvgq~s~rk~Gimdtd~~~~~s~~g~~~~q~KA~RldY~eSLmTHAMvlTGvd~d~~ 375 (444)
T COG3579 296 DMERLKKLAIKQMQAGETVWFGCDVGQLSDRKTGIMDTDIYDYESSLGINLTQDKAGRLDYGESLMTHAMVLTGVDLDET 375 (444)
T ss_pred cHHHHHHHHHHHHhcCCcEEeecCchhhcccccceeeehhccchhhhCCCcccchhhccccchHHHHHHHHhhccccccC
Confidence 44445443 3444588888777788888888764321 0 1122345799999999864 4
Q ss_pred Ccc-EEEEecCCCCCCCCCcceee
Q psy282 98 DIP-YWLVRNSWGPIGPDEGFFKI 120 (233)
Q Consensus 98 ~~~-ywivknswg~~wge~gy~~~ 120 (233)
+.| -|-|+||||..-|.+||+-.
T Consensus 376 g~p~rwkVENSWG~d~G~~Gyfva 399 (444)
T COG3579 376 GNPLRWKVENSWGKDVGKKGYFVA 399 (444)
T ss_pred CCceeeEeecccccccCCCceEee
Confidence 444 59999999999999999754
No 41
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=89.70 E-value=0.26 Score=36.98 Aligned_cols=56 Identities=27% Similarity=0.417 Sum_probs=32.1
Q ss_pred ccceeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCC
Q psy282 137 SETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW 203 (233)
Q Consensus 137 e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSW 203 (233)
...+++.|.+..||.+.+....-.. .+..+. .....|.|+|+||+++. +++|..+|
T Consensus 89 ~~~i~~~i~~G~Pvi~~~~~~~~~~-~~~~~~-------~~~~~H~vvi~Gy~~~~---~~~v~DP~ 144 (144)
T PF13529_consen 89 FDDIKQEIDAGRPVIVSVNSGWRPP-NGDGYD-------GTYGGHYVVIIGYDEDG---YVYVNDPW 144 (144)
T ss_dssp HHHHHHHHHTT--EEEEEETTSS---TTEEEE-------E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred HHHHHHHHHCCCcEEEEEEcccccC-CCCCcC-------CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence 4678888988889999987421111 111111 12468999999999643 78887776
No 42
>PF13529 Peptidase_C39_2: Peptidase_C39 like family; PDB: 3ERV_A.
Probab=88.23 E-value=1.9 Score=32.10 Aligned_cols=55 Identities=27% Similarity=0.439 Sum_probs=32.1
Q ss_pred CHHHHHHHHHhcCCEEEEEecC--CCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccCCccEEEEecCC
Q psy282 41 GSETMKKILYKYGPLSVLLNSD--LIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSW 108 (233)
Q Consensus 41 ~~~~i~~~l~~~GPv~v~i~~~--~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~~~~ywivknsw 108 (233)
+.+.|+++|....||.+.+... .. ....+. .....|.|+|+||..+. ++++...|
T Consensus 88 ~~~~i~~~i~~G~Pvi~~~~~~~~~~---~~~~~~-------~~~~~H~vvi~Gy~~~~---~~~v~DP~ 144 (144)
T PF13529_consen 88 SFDDIKQEIDAGRPVIVSVNSGWRPP---NGDGYD-------GTYGGHYVVIIGYDEDG---YVYVNDPW 144 (144)
T ss_dssp -HHHHHHHHHTT--EEEEEETTSS-----TTEEEE-------E-TTEEEEEEEEE-SSE----EEEE-TT
T ss_pred cHHHHHHHHHCCCcEEEEEEcccccC---CCCCcC-------CCcCCEEEEEEEEeCCC---EEEEeCCC
Confidence 7899999999988999999742 12 111122 23468999999997543 55554443
No 43
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=78.11 E-value=8.8 Score=31.96 Aligned_cols=64 Identities=25% Similarity=0.331 Sum_probs=39.4
Q ss_pred CHHHHHHHHHhcCCEEEEEecCCCC--CcCCceee--CCCCCC-CCCCCCcEEEEEeeeccCCccEEEEec
Q psy282 41 GSETMKKILYKYGPLSVLLNSDLIH--DYNGTPIR--KNDETC-SPYDLGHAVLLVGYGKQDDIPYWLVRN 106 (233)
Q Consensus 41 ~~~~i~~~l~~~GPv~v~i~~~~f~--~y~~Gi~~--~~~~~c-~~~~~~H~v~ivG~g~~~~~~ywivkn 106 (233)
+.++|...|...||+.+-+++..+. .-+.-... .+.+-| +.....|-|+|+||....+. .+.+|
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~~~--~~yrd 180 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAATKE--FEYRD 180 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCCCCe--EEEeC
Confidence 6899999999989888888875333 11222221 111111 23567999999999754433 44444
No 44
>KOG4128|consensus
Probab=72.89 E-value=0.35 Score=42.74 Aligned_cols=40 Identities=30% Similarity=0.386 Sum_probs=32.4
Q ss_pred CCCEEEEEEEeee----cCCcCEEEEEcCCCCCCCCCcEEEEEc
Q psy282 178 DLGHAVLLVGYGK----QDDIPYWLVRNSWGPIGPDEGFFKIEH 217 (233)
Q Consensus 178 ~~~Hav~iVGy~~----~~~~~ywivkNSWg~~WG~~Gy~~i~~ 217 (233)
...||+++.+-+. +++-.-|-|.||||+.-|.+|+..|..
T Consensus 370 lmthAml~T~v~~kd~~~g~~~~~rVenswgkd~gkkg~~~mt~ 413 (457)
T KOG4128|consen 370 LMTHAMLLTSVGLKDPATGGLNEHRVENSWGKDLGKKGVNKMTA 413 (457)
T ss_pred HHHHHHHhhhccccCcccCCchhhhhhchhhhhccccchhhhhH
Confidence 4689999999883 233457999999999999999977754
No 45
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=71.66 E-value=12 Score=32.53 Aligned_cols=57 Identities=16% Similarity=0.280 Sum_probs=35.9
Q ss_pred CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeeccCCccEEEEec
Q psy282 41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRN 106 (233)
Q Consensus 41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~~~~~ywivkn 106 (233)
-.+.|+++|....||.+.++.-.+ .|....+ ......|.++|+||..++ ..+.++..
T Consensus 77 ~~~~l~~~l~~g~pv~~~~D~~~l-py~~~~~-------~~~~~~H~i~v~G~d~~~-~~~~v~D~ 133 (317)
T PF14399_consen 77 AWEELKEALDAGRPVIVWVDMYYL-PYRPNYY-------KKHHADHYIVVYGYDEEE-DVFYVSDP 133 (317)
T ss_pred HHHHHHHHHhCCCceEEEeccccC-CCCcccc-------ccccCCcEEEEEEEeCCC-CEEEEEcC
Confidence 346888888886699999776322 2222212 123458999999997653 34555554
No 46
>PF14399 Transpep_BrtH: NlpC/p60-like transpeptidase
Probab=66.92 E-value=4.7 Score=35.01 Aligned_cols=56 Identities=14% Similarity=0.254 Sum_probs=35.2
Q ss_pred ccceeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEc
Q psy282 137 SETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRN 201 (233)
Q Consensus 137 e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkN 201 (233)
.+.+++.|.++.||.+.++.-.+.++.. + .......|.|+|+||++++ ..+.++-.
T Consensus 78 ~~~l~~~l~~g~pv~~~~D~~~lpy~~~--~------~~~~~~~H~i~v~G~d~~~-~~~~v~D~ 133 (317)
T PF14399_consen 78 WEELKEALDAGRPVIVWVDMYYLPYRPN--Y------YKKHHADHYIVVYGYDEEE-DVFYVSDP 133 (317)
T ss_pred HHHHHHHHhCCCceEEEeccccCCCCcc--c------cccccCCcEEEEEEEeCCC-CEEEEEcC
Confidence 3568888877779999977633322221 1 1222458999999999652 34556544
No 47
>PF09778 Guanylate_cyc_2: Guanylylate cyclase; InterPro: IPR018616 Members of this family of proteins catalyse the conversion of guanosine triphosphate (GTP) to 3',5'-cyclic guanosine monophosphate (cGMP) and pyrophosphate.
Probab=62.40 E-value=13 Score=31.07 Aligned_cols=64 Identities=23% Similarity=0.376 Sum_probs=37.5
Q ss_pred CccceeeeeecccceEEEEeccccc--cccCcceecCCCCC--C-CCCCCEEEEEEEeeecCCcCEEEEEc
Q psy282 136 GSETMKKILYKYGPLSVGLNSHLIH--FYNGTPIRKNDETC--S-PYDLGHAVLLVGYGKQDDIPYWLVRN 201 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~~~~~--~y~~gi~~~~~~~~--~-~~~~~Hav~iVGy~~~~~~~ywivkN 201 (233)
+.+++.+.|..++|+.+-+++..+. .=+..........| . +.-.+|-|+|+||+... +-+++||
T Consensus 112 s~~ei~~hl~~g~~aIvLVd~~~L~C~~Ck~~~~~~~~~~~~~~~~~Y~GHYVVlcGyd~~~--~~~~yrd 180 (212)
T PF09778_consen 112 SIQEIIEHLSSGGPAIVLVDASLLHCDLCKSNCFDPIGSKCFGRSPDYQGHYVVLCGYDAAT--KEFEYRD 180 (212)
T ss_pred cHHHHHHHHhCCCcEEEEEccccccChhhcccccccccccccCCCCCccEEEEEEEeecCCC--CeEEEeC
Confidence 4567888888889988888875433 11221111111122 1 22358999999999653 2355554
No 48
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=61.30 E-value=25 Score=28.32 Aligned_cols=54 Identities=22% Similarity=0.482 Sum_probs=34.4
Q ss_pred CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeec-cCCccEEEEecCC
Q psy282 41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK-QDDIPYWLVRNSW 108 (233)
Q Consensus 41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~-~~~~~ywivknsw 108 (233)
+-+++++.+.++.|+.+..+. . .. -+.....||++||||-. .++.+++++=|-|
T Consensus 90 s~~eV~~~~~~nk~i~i~~~~----------v-~~---~~~~~~gHAlavvGya~~~~g~~~y~~WNPW 144 (175)
T PF05543_consen 90 SFDEVKKLIDNNKGIAILADR----------V-EQ---TNGPHAGHALAVVGYAKPNNGQKTYYFWNPW 144 (175)
T ss_dssp -HHHHHHHHHTT-EEEEEEEE----------T-TS---CTTB--EEEEEEEEEEEETTSEEEEEEE-TT
T ss_pred CHHHHHHHHHcCCCeEEEecc----------c-cc---CCCCccceeEEEEeeeecCCCCeEEEEeCCc
Confidence 788889988888888776643 1 11 12334689999999976 4557777776755
No 49
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=59.81 E-value=27 Score=27.79 Aligned_cols=39 Identities=26% Similarity=0.428 Sum_probs=29.1
Q ss_pred CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeecc
Q psy282 41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ 96 (233)
Q Consensus 41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~~ 96 (233)
+.+.+++.|.++||+.++.++ + +.....|+++|.|-..+
T Consensus 97 t~e~~~~LL~~yGPLwv~~~~-------------P----~~~~~~H~~ViTGI~~d 135 (166)
T PF12385_consen 97 TAEGLANLLREYGPLWVAWEA-------------P----GDSWVAHASVITGIDGD 135 (166)
T ss_pred CHHHHHHHHHHcCCeEEEecC-------------C----CCcceeeEEEEEeecCC
Confidence 678999999999999998643 2 23445688888876543
No 50
>cd00044 CysPc Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Probab=49.93 E-value=26 Score=30.69 Aligned_cols=27 Identities=26% Similarity=0.503 Sum_probs=23.6
Q ss_pred CCEEEEEEEeeecC--CcCEEEEEcCCCC
Q psy282 179 LGHAVLLVGYGKQD--DIPYWLVRNSWGP 205 (233)
Q Consensus 179 ~~Hav~iVGy~~~~--~~~ywivkNSWg~ 205 (233)
.+||=.|++.-+.+ +...-.+||-||.
T Consensus 235 ~~HaY~Vl~~~~~~~~~~~lv~lrNPWg~ 263 (315)
T cd00044 235 KGHAYSVLDVREVQEEGLRLLRLRNPWGV 263 (315)
T ss_pred cCcceEEeEEEEEccCceEEEEecCCccC
Confidence 48999999998766 7889999999995
No 51
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=48.28 E-value=27 Score=28.19 Aligned_cols=36 Identities=25% Similarity=0.471 Sum_probs=24.0
Q ss_pred CCCEEEEEEEeee-cCCcCEEEEEcCCCCCCCCCcEEEEEcc
Q psy282 178 DLGHAVLLVGYGK-QDDIPYWLVRNSWGPIGPDEGFFKIEHT 218 (233)
Q Consensus 178 ~~~Hav~iVGy~~-~~~~~ywivkNSWg~~WG~~Gy~~i~~~ 218 (233)
..+||+++|||-. .++.++.++=|=| +++++-+...
T Consensus 118 ~~gHAlavvGya~~~~g~~~y~~WNPW-----~~~~~~~sa~ 154 (175)
T PF05543_consen 118 HAGHALAVVGYAKPNNGQKTYYFWNPW-----WNDVMIQSAK 154 (175)
T ss_dssp --EEEEEEEEEEEETTSEEEEEEE-TT------SS-EEEETT
T ss_pred ccceeEEEEeeeecCCCCeEEEEeCCc-----cCCcEEEecC
Confidence 5689999999987 5568899998877 4455555544
No 52
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.93 E-value=19 Score=29.27 Aligned_cols=47 Identities=26% Similarity=0.366 Sum_probs=32.3
Q ss_pred CccceeeeeecccceEEEEeccccccccCcceecCCCCCCCCCCCEEEEEEEeeecCCcCEEEEEcCCC
Q psy282 136 GSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWG 204 (233)
Q Consensus 136 ~e~~i~~~l~~~GPv~v~i~~~~~~~y~~gi~~~~~~~~~~~~~~Hav~iVGy~~~~~~~ywivkNSWg 204 (233)
+..+|+..|.+..||.+=... |+- .--|+|+|+||++. +...-+.||
T Consensus 122 sl~~ik~ql~kg~PV~iw~T~--~~~----------------~s~H~v~itgyDk~----n~yynDpyG 168 (195)
T COG4990 122 SLSDIKGQLLKGRPVVIWVTN--FHS----------------YSIHSVLITGYDKY----NIYYNDPYG 168 (195)
T ss_pred cHHHHHHHHhcCCcEEEEEec--ccc----------------cceeeeEeeccccc----ceEeccccc
Confidence 456788889888898754332 221 12599999999976 566666664
No 53
>COG4990 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.21 E-value=1.7e+02 Score=23.94 Aligned_cols=37 Identities=30% Similarity=0.456 Sum_probs=28.1
Q ss_pred CHHHHHHHHHhcCCEEEEEecCCCCCcCCceeeCCCCCCCCCCCCcEEEEEeeec
Q psy282 41 GSETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGK 95 (233)
Q Consensus 41 ~~~~i~~~l~~~GPv~v~i~~~~f~~y~~Gi~~~~~~~c~~~~~~H~v~ivG~g~ 95 (233)
+..+|+..|.+..||.+-... |.. ..-|+++|.||+.
T Consensus 122 sl~~ik~ql~kg~PV~iw~T~--~~~----------------~s~H~v~itgyDk 158 (195)
T COG4990 122 SLSDIKGQLLKGRPVVIWVTN--FHS----------------YSIHSVLITGYDK 158 (195)
T ss_pred cHHHHHHHHhcCCcEEEEEec--ccc----------------cceeeeEeecccc
Confidence 889999999998898875543 322 2369999999953
No 54
>KOG4128|consensus
Probab=20.25 E-value=16 Score=32.64 Aligned_cols=79 Identities=19% Similarity=0.181 Sum_probs=49.0
Q ss_pred CHHHHHHHHHh----cCCEEEEEecCCCCCcCCceeeCCC--------------C-----CCCCCCCCcEEEEEeeecc-
Q psy282 41 GSETMKKILYK----YGPLSVLLNSDLIHDYNGTPIRKND--------------E-----TCSPYDLGHAVLLVGYGKQ- 96 (233)
Q Consensus 41 ~~~~i~~~l~~----~GPv~v~i~~~~f~~y~~Gi~~~~~--------------~-----~c~~~~~~H~v~ivG~g~~- 96 (233)
+.+.|++.+.+ ..||-.+-+...|..-++|+.+-.- . +.......||+++.+-|..
T Consensus 305 ~~d~l~k~vv~sl~~~kaVwfgcd~~k~~~~K~G~~dl~l~~~~l~fG~~l~~~~KAeRl~y~eSlmthAml~T~v~~kd 384 (457)
T KOG4128|consen 305 SMDILMKIVVTSLEGDKAVWFGCDIRKAISLKSGPLDLRLHQFDLLFGFKLGESTKAERLDYRESLMTHAMLLTSVGLKD 384 (457)
T ss_pred CHHHHHHHHHHHhcCCcceEEecccHhhhhcccCccchhhccCceeeeeeccccchhhhhhHHHHHHHHHHHhhhccccC
Confidence 56677766654 3466666555455555666542210 0 0111245799998888742
Q ss_pred ---CCccEEEEecCCCCCCCCCccee
Q psy282 97 ---DDIPYWLVRNSWGPIGPDEGFFK 119 (233)
Q Consensus 97 ---~~~~ywivknswg~~wge~gy~~ 119 (233)
.+-.-|-|+||||..-|.+|+..
T Consensus 385 ~~~g~~~~~rVenswgkd~gkkg~~~ 410 (457)
T KOG4128|consen 385 PATGGLNEHRVENSWGKDLGKKGVNK 410 (457)
T ss_pred cccCCchhhhhhchhhhhccccchhh
Confidence 23446999999999999999843
Done!